The present application claims priority from Japanese application JP 2007-065993 filed on Mar. 15, 2007, the content of which is hereby incorporated by reference into this application.
1. Field of the Invention
The present invention relates to a method of diagnosing depression. More particularly, the present invention relates to a method of diagnosing depression for determining whether or not a subject suffers from depression based on expression levels of specified 18 genes in the peripheral blood isolated from the subject.
2. Background Art
Depression is a disease with high lifetime morbidity of approximately 10%, and this rate is predicted to further increase in the future due to stress in contemporary society. This disease seriously afflicts patients mentally and physically and imposes enormous damage upon their social lives. In addition, it is a serious disease that often leads to suicide. It is deduced that many of the people who commit suicide (as many as 30,000 or more per year in Japan) are afflicted with depression. This disease is also deeply associated with societal problems such as truancy, unemployment, and social withdrawal or medical problems such as alcohol-related disorders. Establishment of methods of precisely diagnosing and promptly treating this disease is indispensable for improving the quality of life, and thus is an urgent need of society as a whole.
Diagnosis of depression is, however, far from simple. Cardinal symptoms of depression are, for example, depressive mood, hypobulia, loss of interest and pleasure, disrupted concentration and attention, lowered self-esteem and self-confidence, feelings of guilt and worthlessness, pessimism about the future, thoughts of suicide, sleep disorders, and loss of appetite. These symptoms have features peculiar to depression, which differ from depressed feelings experienced by anyone, and also differ from the lowered mental activity and sense of exhaustion experienced by people afflicted with physical diseases. The symptoms of depression are mainly comprehended by taking a precise medical history, questioning when and how the symptoms in terms of mental activity were developed and what types of damages have been imposed upon their social and domestic lives, and confirming various symptoms based on a patient's attitude or the contents of conversations during consultation. For example, family medical history, anamnesis, physical conditions, early developmental history, life history, personality inclination, premorbid social adaptation, and the occurrence of any episode(s) that had triggered the disease can be important references. In order to accurately comprehend these factors, an interview needs to be conducted by a highly skilled specialist in psychiatric medicine for approximately 1 hour. Further, it should be confirmed that a patient does not have any major abnormalities in terms of general physical or neurological conditions. If necessary, the possibility of the existence of organic brain disorders is to be eliminated by electroencephalography or brain imaging tests. The patient is then subjected to diagnosis. The findings are compared with the diagnostic standards issued by the World Health Organization (WHO) or the American Psychiatric Association, and the diagnosis can be generally confirmed.
As a major drawback, conventional diagnostic methods require skilled techniques. Needless to say, thorough knowledge and practice concerning depression are required. However, there are numerous psychological, mental, and physical states that result in the exhibition of depressive conditions even though they are not forms of depression. Differential diagnosis also becomes essential. Accordingly, diagnosis must be conducted by a thoroughly trained specialist in psychiatric medicine. Depression, which is a common disease with lifetime morbidity of approximately 10%, however, is often the subject of consultation with primary care doctors. Diagnosis of depression without objective medical findings is not always easy for general doctors who may not be acquainted with psychiatric consultation. Depression is a medical disease that requires treatment of the body (brain), including medication. Accordingly, it is difficult for specialists in clinical psychology, such as clinical psychotherapists, or mental health workers, such as public health nurses, to independently diagnose depression.
Technical skill is required for diagnosis mainly because of a lack of simple and objective methods of diagnosis regarding symptoms. Although there is a screening method utilizing a self-administered questionnaire at present, people tend to fill in the questionnaire based on their subjective viewpoints. Thus, genuine depression cannot be distinguished from depressed feelings caused by personality-based factors, environmental factors, or poor physical conditions. Symptom rating scales employed by doctors are often used in determination of severity, although adequate questioning is required to evaluate each item. Thus, such methods cannot be alternatives to diagnosis.
Many testing methods have been heretofore attempted, with the aim of utilizing them as objective indicators. Depression causes functional alteration in brain monoamine systems. This alteration is known to have a considerable influence upon the neuroendocrine system, the neuroimmune system, and the autonomic nervous system via psychosomatic correlation. In particular, the application of the results of a dexamethasone suppression test that allows accurate comprehension of neuroendocrine abnormalities, i.e., a minor level of adrenal cortical hormone hypersecretion, to diagnosis of depression has been extensively examined from the 1980s onwards. Clinical application thereof was, however, not realized due to the necessity for complicated procedures such as the administration of test drugs and limitations in terms of sensitivity or specificity. At the study phase, other abnormalities in the neuroendocrine system, the neuroimmune system, the autonomic nervous system, circadian rhythms, sleep architecture, and the like had been reported. Recently, changes regarding conditions of brain blood flow or brain monoamine receptors are also pointed out as objective indicators, although they are still disadvantageous in terms of sensitivity and reproducibility. Given the aforementioned factors, diagnosis of a complicated psychiatric disease, i.e., depression, is difficult by a method of testing limited factors. Enormous amounts of time and labor are required to perform conventional testing methods and to diagnose the disease. From the viewpoint of simplicity, conventional techniques cannot be applied to routine medical care at present.
The present inventors analyzed the expression patterns peculiar to patients afflicted with depression via peripheral-blood-targeted gene expression analysis. They developed a method for diagnosing depression using such feature as an indicator and reported such method (JP Patent Publication No. 2004-208547A (U.S. Patent Publication No. 2004-185474) and JP Patent Publication No. 2005-312435A (U.S. Patent Publication No. 2005-239110)). The method disclosed therein, however, involves the use of microarrays having about 1,500 genes mounted thereon to search for disease-associated genes, and such number of genes is very small compared with the types of gene transcripts that are expressed in peripheral blood cells (i.e., about 10,000 to 20,000 types). Accordingly, such a search of markers may not be sufficient, and marker genes that exhibit expression behaviors more peculiar to patients afflicted with depression may be missed.
An object of the present invention is to provide a simple and accurate method for diagnosing depression via assay of a large number of factors. More particularly, microarrays that are capable of assay of expression levels of as many as 41,000 types of gene transcripts, which represents the entire number of human genes, at one time are used to select marker genes that exhibit expression levels peculiar to patients afflicted with depression.
In the past, the catecholamine hypothesis and the indoleamine hypothesis were proposed as causes of depression. In addition, the GABA hypothesis, the glutamine hypothesis, the dopamine hypothesis, the neurogenesis hypothesis, and the like have been proposed as causes of depression in recent years. Many discrepancies in these hypotheses have been pointed out, and they have not yet resulted in conclusions. Linkage studies and association studies based on molecular genetic engineering and the search for sensitive chromosome domains by linkage analysis have been carried out. In the case of a disease such as depression, the diathesis (biological feature) thereof is generated through interactions among multiple genes and environmental factors such as stress, and therefore pathogenic gene analysis is extremely difficult. Based on past gene analysis, genes such as those related to serotonin transporter, serotonin 1A/2C receptor, dopamine D2/D3 receptor, dopamine transporter, tyrosine hydroxylase, tryptophan hydroxylase, and monoamine oxidase have been reported as candidate functional genes associated with depression. Some researchers are, however, skeptical about the aforementioned reports, and additional tests have been conducted thereon.
The present inventors have focused on peripheral leukocytes that can be easily obtained as specimens and express many receptors of stress-associated factors in order to objectively diagnose the conditions of depression, which is often caused by stress. They extracted RNAs directly from the whole blood because isolation of leukocytes from the whole blood would impose serious damage to such leukocytes. They extensively analyzed the expression patterns of genes exhibiting high expression levels in leukocytes and then patterned the expression levels. Through this analysis, microarrays that are capable of extensive measurement of as many as 41,000 types of gene transcripts, which represents the entire number of human genes, are used to search for and select marker genes that exhibit expression levels peculiar to patients afflicted with depression.
Thus, the present inventors determined 18 novel genes that can serve as marker genes of depression and completed development of a method for diagnosing the morbidity of a subject with depression with high accuracy based on the expression levels of such 18 genes or mean expression level thereof.
The present invention provides a method for diagnosing depression comprising the steps of: measuring expression levels of 18 genes selected from the group consisting of FASLG, CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, AHNAK, EDG8, and DUSP5, in peripheral blood isolated from a subject; and determining whether or not the subject suffers from depression based on the expression levels of the 18 genes.
According to an embodiment, the expression levels of the 18 genes of the subject are compared with the expression levels of the same genes of healthy individual to determine whether or not the subject suffers from depression.
According to another embodiment, the expression ratio of the expression levels of the 18 genes of the subject to the expression levels of the same genes of healthy individual is obtained, and the expression ratio is compared with predetermined date concerning expression levels of the 18 genes of depressed patient and expression levels of the 18 genes of healthy individual to determine whether or not the subject suffers from depression. This comparative analysis can be carried out using, for example, a support vector machine.
According to another embodiment, a mean value of expression levels of the 18 genes of the subject is obtained and the mean value is compared with a mean value of expression levels of the 18 genes of healthy individual to determine whether or not the subject suffers from depression. When the mean value of the 18 genes of the subject is significantly lower than that of healthy individual, specifically, the subject can be determined as having a high possibility of suffering from depression.
Gene expression levels can be measured in a simple manner with the use of nucleic acid-immobilized solid substrates, such as DNA chips or arrays.
Expression levels of some of the aforementioned 18 genes may be used to determine whether or not the subject suffers from depression. Another aspect of the present invention provides a method for diagnosing depression, wherein the expression level of at least one of the aforementioned 18 genes is measured to determine whether or not the subject suffers from depression based on the expression level.
In this method, the expression level of at least one of the aforementioned 18 genes of the subject is compared with a predetermined reference value to determine whether or not the subject suffers from depression based on the comparison results.
Alternatively, a ratio of the expression level of at least one of the aforementioned 18 genes to the predetermined reference value is obtained, and the ratio is compared with a predetermined threshold value to determine whether or not the subject suffers from depression based on the comparison results.
Also, the mean value of expression levels of at least two of the aforementioned 18 genes of the subject is obtained, and the mean value is compared with a predetermined threshold value to determine whether or not the subject suffers from depression based on the comparison results. When the mean value of expression levels of at least two of the aforementioned 18 genes of the subject is lower than the aforementioned threshold value, for example, the subject can be determined as having a high possibility of suffering from depression.
The present invention also provides a program for performing the method for diagnosing depression of the present invention. The program of the present invention comprises:
1) a means for inputting the expression levels of 18 genes selected from the group consisting of FASLG, CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, AHNAK, EDG8, and DUSP5, in the peripheral blood isolated from a subject;
2) a means for storing data concerning expression levels of the aforementioned 18 genes of depressed patients and of healthy individual that have been inputted in advance;
3) a means for comparing the expression levels of the 18 genes of the subject with expression levels of the 18 genes of healthy individual;
4) a means for determining whether or not the subject suffers from depression based on the results of comparison; and
5) a means for outputting the results of determination.
According to one embodiment, in 3) above, an expression ratio of the expression levels of the 18 genes of the subject to those of healthy individual is obtained in order to compare said expression ratio with an expression ratio of expression levels of the 18 genes of depressed patient to expression levels of the 18 genes of healthy individual that have been stored in advance. This analysis can be carried out using, for example, a support vector machine.
According to another embodiment, in 3) above, the mean value of expression levels of the 18 genes of the subject is obtained, and this mean value is compared with the mean value of expression levels of the 18 genes of healthy individual.
The program of the present invention preferably comprises a means for storing the expression levels of the 18 genes of the subject and updating the data of the depressed patient and that of healthy individual according to need, in addition to the aforementioned means.
Techniques for examining the gene expression levels employed in the present invention are not limited to nucleic acid-immobilized solid substrates such as a DNA chip or a microarray. For example, the availability of techniques such as quantitative PCR or Northern blotting is apparent for those skilled in the art.
According to the present invention, preferably, the expression data of the patients afflicted with depression and of healthy individuals are stored in the database in combination with the clinical information, and the expression data of the subject is analyzed with reference to the database to examine the condition of depression of the subject. It is apparent that a means for data analysis is not limited to a support vector machine, and the algorithm of other learning machines can also be used. Preferably, expression data for patients afflicted with depression and those for healthy individuals are previously stored in the computer, and the computer is allowed to determine which of the expression patterns for patients or healthy individuals are more similar to the subject's expression patterns, thereby diagnosing the conditions of depression in the subject.
The publications (JP Patent Publication No. 2004-208547A (U.S. Patent Publication No. 2004-185474) and JP Patent Publication No. 2005-312435A (U.S. Patent Publication No. 2005-239110)) each disclose a method for diagnosing whether or not a subject suffers from a disease by a method of hierarchical cluster analysis, wherein the expression levels of marker genes are used as indicators. According to cluster analysis, a dendrogram for classification of the subjects' expression patterns into the patient group or healthy individual group must be selected by a human in the last stage. Thus, it is difficult to maintain objectivity. According to a classification method using a support vector machine, however, selection is made by a computer, and objectivity is thus maintained (Proceedings of National Academy of Sciences of the United States of America, Vol. 97, Issue 1, 262-267, (2000), “Knowledge-based analysis of microarray gene expression data using support vector machines”). Further, distance from a borderline (i.e., the borderline between patients afflicted with depression and healthy individuals) can also be determined, and the distance of the subject from the borderline can be objectively determined. Accordingly, such technique is suitable for a method of the present invention, wherein the expression levels of a plurality of genes are comprehensively measured to evaluate the affliction.
The present invention provides a method of diagnosing depression by collectively quantifying the RNA expression levels in peripheral leukocyte blood to find evidence that is peculiar to depression. This would innovatively improve medical care for depression.
The method of the present invention can conduct the analysis with the use of 2 to 5 ml of blood obtained by conventional blood sampling without special cooperation provided by a patient. This diagnostic method can be carried out in a non-invasive, simple, and routine manner. This method of multidimensionally comprehending biological functions based on numerous RNA expression levels is more adequate as a method of diagnosing complicated psychiatric diseases involving both mental and physical conditions such as depression in terms of its principle compared with the conventional method that assays only limited factors.
The results attained by the method of the present invention can be simply and clearly evaluated, they can be easily employed by primary care doctors as objective indicators for depression, and they are extremely useful for the establishment of diagnosis and introduction of therapy. A high-risk group can be selected from among the groups of people in a simple, accurate, and cost-effective manner through medical checkups or complete physical examinations provided by workplaces, schools, and communities. This enables early detection of depression. Accordingly, the method of the present invention significantly contributes to the improvement of peoples' mental health from the viewpoint of preventive care.
The usefulness of the method according to the present invention is not limited to primary care and medical checkups. Specialists in psychiatric medicine can apply this technique to the search for psychological, social, and environmental factors associated with the development of depression, evaluation of clinical conditions, diagnosis, evaluation of treatment, and determination of prognosis. Thus, this technique can be a revolutionary test technique in the field of psychiatric medicine.
Hereafter, the embodiments of the present invention are described in detail with reference to concrete examples.
The methods for searching for and selecting the marker genes according to the present invention are described.
The present inventors collected blood from patients and healthy individuals as described below. RNA was extracted from the whole blood, and gene expression of patients was then analyzed using DNA chips, along with that of healthy individuals. A DNA chip comprises DNA fragments having nucleotide sequences corresponding to numerous genes immobilized on a substrate such as a glass substrate, and it is used for detecting DNA or RNA in a sample by hybridization.
Target patients were as follows. Target patients were those who had agreed with the written description for participating in the research for developing the present diagnostic method selected from among untreated patients afflicted with depression who had visited the Department of Psychiatry and Neurology of the Tokushima University Hospital between November 2002 and December 2006. This research was approved by the ethics committee of Tokushima University Hospital. Diagnosis was made in accordance with depressive episode specified in the International Classification of Diseases, 10th revision (ICD-10). Patients with serious physical complications or those taking therapeutic agents for physical diseases were excluded. Blood was collected by a doctor or nurse between 10:00 am and 1:00 pm from the patients under fasting conditions through cubitus veins under resting conditions.
Detailed information of subjects is summarized in Table 1. Forty six patients whose samples before treatment had been obtained were 17 males and 29 females aged 17 to 76 (42 years old on average), and their Hamilton scores were 19.8 on average (standard deviation: 6.8).
A hundred and twenty two healthy individuals whose samples had been obtained were 49 males and 73 females aged 21 to 88 (45 years old on average) (i.e., control subjects. The male-female ratio and the age of patients before treatment were substantially the same as those of healthy individuals. Blood was collected from healthy individuals under fasting conditions between 10:00 am and 1:00 pm.
Samples after antidepressant treatment were obtained from 40 patients, and those samples were obtained one month after the initiation of treatment (32 patients), two months after the initiation of treatment (7 patients), and three months after the initiation of treatment (1 patient). The Hamilton scores after treatment were 19.8 points, 7.8 points, and 4 points on average, respectively. A total of 208 specimens; i.e., 46 specimens obtained from patients afflicted with depression before treatment, 40 specimens obtained from patients afflicted with depression after treatment, and 122 specimens obtained from healthy individuals, were subjected to analysis.
Blood (5 ml; 2.5 ml×2) was collected from all subjects using a PAXgene Blood RNA System (Qiagen), and total RNA was extracted. The yield of total RNA was 5 μg to 15 μg. Subsequently, quality of total RNA extracted from the subjects was inspected using the Bioanalyzer 2100 (Agilent) to confirm that total RNA had not been decomposed. Total RNA (0.2 μg) was then subjected to the in vitro transcription reaction to synthesize cRNA into which aminoallyl-CTP had been introduced in the presence of aminoallyl-CTP using the Agilent reagent that amplifies and synthesizes cRNA (Low RNA Input Linear Amp Kit PLUS, One-Color). Subsequently, an amino group in the synthesized cRNA was subjected to coupling to a succinimide-containing fluorescent dye (Cy3, Amersham) to synthesize fluorescent-labeled RNA. Subsequently, the fluorescent-labeled RNA was subjected to hybridization to the Agilent microarrays (Whole Human Genome Microarray 4 Pack) at 65° C. for 17 hours. The microarrays were washed in accordance with the Agilent's given protocols, and the fluorescent image was read using the Agilent scanner (Agilent). The image data was converted into the numerical data using a special software, Feature Extraction (Agilent). Subsequently, normalization was carried out so that a sum of signal intensities of genes exhibiting signal intensities of 25% to 0.75% would be the same among 208 specimens. The values representing the same sums were 11, 234, and 345. Subsequently, genes exhibiting 100 or higher signal intensities in 104 or more specimens, i.e., a half of the entire dataset comprising 208 specimens, were extracted, these genes were designated as genes expressed in the peripheral blood, and this group of target genes was subjected to the following analysis. As target genes, 21,895 genes were extracted.
After normalization and gene selection in accordance with signal intensities, the data of 122 specimens obtained from healthy individuals was subjected to determination of the mean expression intensity of the genes to prepare a dataset of mean values as the reference dataset. Subsequently, the data of expression intensity of 208 specimens each was divided by the reference dataset to determine the expression intensity ratios among the 208 specimens. The data of expression ratios among the 208 specimens was subjected to the following data analysis.
In order to extract genes that exhibit significantly different expression levels between the group of 46 specimens obtained from patients before treatment and the group of 122 specimens obtained from healthy individuals, a t-test was carried out at the 0.05 significance level with Bonferroni correction for multiple comparison without the homoscedasticity assumption. As a result, 631 genes were extracted. According to the expression levels, three-fourth or more of the genes exhibited the decrease in the expression levels in patients than in healthy individuals, and the degrees of decrease were substantially uniform.
Thus, these 631 genes were found to exhibit sufficient expression levels in the peripheral blood samples. Also, these genes were found to be useful as marker genes exhibiting significant differences in expression levels between healthy individuals and patients.
Erythrocytes do not contain nuclei. Thus, most RNAs in the whole blood samples are considered to originate from leukocytes. Leukocytes consist of monocytes, granulocytes, and lymphocytes, and, for example, T cells, B cells, and NK cells are also lymphocytes. Accordingly, the whole blood samples used in this test contain RNAs derived from a numerous different kinds of cells. Thus, the gene expression levels examined in this test are the result of multiplying the gene expression levels in each type of cells by the number of the cell. If the numbers of specific types of cells in the blood are different between patients and healthy individuals, accordingly, a significant difference would be observed in the expression levels of genes exhibiting specifically high expression levels in the cells. The differences in expression levels of such genes result from differences in the number of the cells; in other words, the differences in gene expression levels would be substantially uniform.
The cells from which the 631 genes exhibiting significantly different levels of expression between the group of healthy individuals and the group of patients described in Example 1 mainly originate were inspected in the following manner.
About 30 ml of blood was collected from two healthy individuals, T cells, neutrophil leucocytes, and monocytes were fractionated from 25 ml thereof using surface antigen-recognizing micro-magnetic beads (Miltenyi Biotec K. K.), RNAs were extracted therefrom, and RNA was extracted from 5 ml of the remaining whole blood. Gene expression was compared and analyzed between the whole blood and each fraction. The expression level determined as the mean between two subjects was used to extract genes that are expressed specifically for each of T cells, neutrophil leucocytes, and monocytes in terms of higher expression levels by two times or more in comparison with the other two fractions and differences in expression levels by two times or less between two subjects. As a result, 141 genes, 120 genes, and 204 genes that are expressed specifically for T cells, neutrophil leucocytes, and monocytes, respectively, were extracted. In order to inspect the correlation between these genes and the 631 genes extracted by the intergroup test, the data of the 208 specimens was subjected to cluster analysis for grouping the target 21,895 genes, and the dataset of the 631 genes and that of the genes expressed specifically for each fraction were mapped thereon. As a result, the target 21,895 genes were roughly divided into four groups, and 281 of 631 depression-associated genes were found to be collectively present in the group of genes collectively comprising 107 of the 141 genes expressed specifically for T-cells. Genes expressed specifically for other fractions were collectively present in the other group of genes. This indicates that 631 depression-associated genes exhibiting significantly different expression levels between the group of healthy individuals and the group of patients originate from T cells. Affliction with depression causes changes in the number of T cells or the abundance of a variety of T cells, such as killer T cells, helper T cells, and suppressor T cells.
The above results demonstrate that depression can be diagnosed by analyzing expression of genes obtained from the peripheral whole blood of subjects, measuring the expression levels of genes exhibiting the expression levels at least two times higher in T cells than in other blood cells, and determining whether or not a subject suffers from depression based on the results of the expression levels of the genes.
Among the 631 depression-associated genes described in Example 1, the group of genes that should be focused in order to effectively diagnose depression was examined. Genes were selected from among the 631 genes in ascending order of p values, the mean expression levels thereof were calculated, and the determined mean was subjected to a significant difference test between the group of 122 healthy individuals and the group of 46 patients to inspect the resulting p values. Specifically, a significant difference test was carried out using the mean Log value among genes for the ratio of the expression levels of patients' genes to the expression levels of healthy individuals' genes.
Homo sapiens Fas ligand (TNF superfamily, member 6) (FASLG), mRNA
Homo sapiens chemokine (C-X3-C motif) receptor 1 (CX3CR1), mRNA
Homo sapiens T-box 21 (TBX21), mRNA [NM_013351]
Homo sapiens inhibitor of DNA binding 2, dominant negative helix-loop-helix
Homo sapiens SLAM family member 7 (SLAMF7), mRNA [NM_021181]
Homo sapiens protease, serine, 23 (PRSS23), mRNA [NM_007173]
Homo sapiens tyrosine 3-monooxygenase/tryptophan 5-monooxygenase
Homo sapiens TAR DNA binding protein (TARDBP), mRNA [NM_007375]
Homo sapiens adrenergic, beta-2-, receptor, surface (ADRB2), mRNA
Homo sapiens protein phosphatase 1, regulatory (inhibitor) subunit 8 (PPP1R8),
Homo sapiens methylmalonic aciduria (cobalamin deficiency) cblA type (MMAA),
Homo sapiens squalene epoxidase (SQLE), mRNA [NM_003129]
Homo sapiens pyruvate dehydrogenase (lipoamide) alpha 1 (PDHA1), mRNA
Homo sapiens hepatitis A virus cellular receptor 2 (HAVCR2), mRNA [NM_032782]
Homo sapiens Rac GTPase activating protein 1 (RACGAP1), mRNA [NM_013277]
Homo sapiens AHNAK nucleoprotein (desmoyokin) (AHNAK), transcript variant 1,
Homo sapiens endothelial differentiation, sphingolipid G-protein-coupled receptor,
Homo sapiens dual specificity phosphatase 5 (DUSP5), mRNA [NM_004419]
Tables 3 to 6 each show the data of expression levels (fluorescent intensities) of 18 genes of patients, and Tables 7 to 17 each show the data of expression levels (fluorescent intensities) of 18 genes of healthy individuals. A comparison of the data of patients with the data of healthy individuals shows significantly lowered expression levels of the 18 genes of patients afflicted with depression compared with those of healthy individuals.
The analysis demonstrated that a healthy individual could be distinguished from a patient by analyzing the expression levels of FASLG, CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, AHNAK, EDG8, and DUSP5 in mRNA extracted from the peripheral whole blood of the subject, determining the mean expression level thereof, and using the mean as an indicator.
Also, all p values of genes shown in Table 2 were 0.000004 or smaller. Thus, the expression level of a single gene sufficiently enables distinguishing of a healthy individual from a patient. This indicates that depression can be diagnosed by measuring the expression level of at least one gene selected from among FASLG; CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, AHNAK, EDG8, and DUSP5 in mRNA extracted from the peripheral whole blood of the subject, and determining whether or not a subject suffers from depression based on the expression level.
The expression levels of 18 genes described in Example 3 were used as indicators to perform diagnosis using a support vector machine. The sensitivity and the specificity in such a case were evaluated by the leave-one-out method. The results are shown in Table 18.
The results demonstrate that a depressed patient can be satisfactorily distinguished from a healthy individual with sensitivity of about 83%, specificity of about 92%, and accuracy of about 89%. This indicates that depression can be diagnosed by determining the expression levels of FASLG, CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, AHNAK, EDG8, and DUSP5 in mRNA extracted from the peripheral whole blood of the subjects, and determining whether or not a subject suffers from depression based on the results obtained with the use of a support vector machine.
The results of diagnosis of depression with the elapse of treatment time were then examined. In addition to the patients before treatment, the blood samples obtained from 32 patients one month after the initiation of drug treatment and from seven patients two months after the initiation of drug treatment were subjected to gene expression analysis. The mean expression level of the 18 genes of the patients and that of the healthy individuals (i.e., the reference dataset) were determined, the mean Log value for the former mean to the latter mean were determined and plotted in
Further, the mean expression levels of the 18 genes (i.e., the mean Log ratio) of patients afflicted with depression before treatment (D1) and of healthy individuals (N1) were plotted in
The results indicate that the mean expression levels of the 18 genes can function not only as the indicators for distinguishing a healthy individual from a patient but also as the effective indicators for evaluating the effects of treatment. This demonstrates that gene expression analysis that is performed in the method of diagnosing depression enables inspection of therapeutic effects on depression.
Blood (5 ml each) was collected from healthy individuals (6 in total; 1 male and 1 female each in their 30's, 40's, and 50's) using a PAXgene Blood RNA System (Qiagen), and total RNA was extracted. The yield of total RNA was 5 μg to 15 μg. The quality of total RNA extracted from the subjects was inspected using the Bioanalyzer 2100 (Agilent) to confirm that total RNA had not been decomposed. Total RNA (0.2 mg) was then subjected to the in vitro transcription reaction to synthesize cRNA into which aminoallyl-CTP had been introduced in the presence of aminoallyl-CTP using the Agilent reagent that amplifies and synthesizes cRNA (Low RNA Input Linear Amp Kit PLUS, One-Color). Subsequently, an amino group in the synthesized cRNA was subjected to coupling to a succinimide-containing fluorescent dye (Cy3, Amersham) to synthesize fluorescent-labeled RNA. Subsequently, the fluorescent-labeled RNA was subjected to hybridization to the Agilent microarrays (Whole Human Genome Microarray 4 Pack) at 65° C. for 17 hours. The microarrays were washed in accordance with the Agilent's given protocols, and the fluorescent image was read using the Agilent scanner (Agilent). The image data was converted into the numerical data using a special software, Feature Extraction (Agilent). Subsequently, normalization was carried out so that a sum of signal intensities of genes exhibiting signal intensities of 25% to 75% would be 11, 234, and 345.
Target patients were as follows. Diagnosis was made in accordance with depressive episode specified in the International Classification of Diseases, 10th revision (ICD-10). Patients with serious physical complications or those taking therapeutic agents for physical diseases were excluded. Six patients whose samples before treatment had been obtained were 3 males and 3 females aged 38 to 55 (44 years old on average), and their Hamilton scores were from 17 to 31 (mean: 25.2). Blood (5 ml each) was collected from patients using a PAXgene Blood RNA System (Qiagen), and total RNA was extracted. The yield of total RNA was 5 μg to 15 μg. The quality of total RNA extracted from the patients was inspected using the Bioanalyzer 2100 (Agilent) to confirm that total RNA had not been decomposed. Total RNA (0.2 mg) was then subjected to the in vitro transcription reaction to synthesize cRNA into which aminoallyl-CTP had been introduced in the presence of aminoallyl-CTP using the Agilent reagent that amplifies and synthesizes cRNA (Low RNA Input Linear Amp Kit PLUS, One-Color). Subsequently, an amino group in the synthesized cRNA was subjected to coupling to a succinimide-containing fluorescent dye (Cy3, Amersham) to synthesize fluorescent-labeled RNA. Subsequently, the fluorescent-labeled RNA was subjected to unicolor hybridization to the Agilent microarrays (Whole Human Genome Microarray 4 Pack) at 65° C. for 17 hours. The microarrays were washed in accordance with the Agilent's given protocols, and the fluorescent image was read using the Agilent scanner (Agilent). The image data was converted into the numerical data using a special software, Feature Extraction (Agilent). Subsequently, normalization was carried out so that a sum of signal intensities of genes exhibiting signal intensities of 25% to 75% would be 11, 234, and 345.
As the indicators of diagnosis, FASLG, CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, AHNAK, EDG8, and DUSP5 genes were used, and expression levels thereof were determined. As the reference dataset, the mean dataset of 122 healthy individuals used in Examples 1 was used, and the data of the subjects were each divided by the reference data to determine the expression ratio. The databases of the group of patients and of the group of healthy individuals and the support vector machine-based diagnosis program prepared by the method described in Example 4 were used, and the data regarding the expression levels of six healthy individuals and six patients were applied as the query to the support vector machine-based software to diagnosis the subjects. The results are shown in Table 19.
The results shown in Table 19 demonstrate that a depressed patient can be satisfactorily distinguished from a healthy individual with sensitivity of about 83.3%, specificity of 100%, and accuracy of about 91.7%.
Thus, diagnosis of depression via analysis of expression of a given group of genes was satisfactorily consistent with the results attained by clinical observation. This indicates that effects of the present invention are very high.
The expression levels of 9 genes of the 18 genes described in Example 3 (i.e., FASLG, CX3CR1, TBX21, ID2, SLAMF7, PRSS23, YWHAQ, TARDBP, and ADRB2 genes) were used as indicators to perform diagnosis using a support vector machine. The sensitivity and the specificity in such a case were evaluated by the leave-one-out method. The results are shown below.
The expression levels of 7 genes of the 18 genes described in Example 3 (i.e., FASLG, CX3CR1, ID2, YWHAQ, TARDBP, EDG8, and DUSP5 genes) were used as indicators to perform diagnosis using a support vector machine. The sensitivity and the specificity in such a case were evaluated by the leave-one-out method. The results are shown below.
The expression levels of 5 genes of the 18 genes described in Example 3 (i.e., FASLG, CX3CR1, ID2, YWHAQ, and TARDBP genes) were used as indicators to perform diagnosis using a support vector machine. The sensitivity and the specificity in such a case were evaluated by the leave-one-out method. The results are shown below.
The expression levels of 11 genes of the 18 genes described in Example 3 (i.e., TBX21, SLAMF7, PRSS23, ADRB2, PPP1R8, MMAA, SQLE, PDHA1, HAVCR2, RACGAP1, and AHNAK genes) were used as indicators to perform diagnosis using a support vector machine. The sensitivity and the specificity in such a case were evaluated by the leave-one-out method. The results are shown below.
The results obtained in Examples 7 to 10 demonstrate that accuracy of 70% or higher can be realized even when only some of the 18 genes are used. The accuracy for diagnosis of depression by a physician who is not specialized in depression is approximately 20% to 40%. Thus, the method for diagnosing depression according to the embodiments of the present invention was found to be useful as a tool for supporting diagnosis of depression. The method of the present invention can provide very useful information to a specialized physician as reference information for diagnosing a patient.
In the above embodiments, depression was diagnosed with the use of the reference dataset determined by the expression levels of healthy individuals, although a method for diagnosing depression is not limited thereto. For example, the expression levels of genes of many healthy individuals and/or the expression levels of the genes of many patients may be used to determine the cut-off values for the expression levels of the genes used for diagnosis, selected from among the 18 genes, and the determined values may be compared with the cut-off values of the expression levels of genes of the subjects. Thus, depression can be diagnosed.
Alternatively, the expression levels of genes of many healthy individuals and/or the expression levels of the genes of many patients may be used to determine the reference values for the expression levels of the genes used for diagnosis, selected from among the 18 genes, the ratio of the expression levels of the genes of target subjects to the reference levels of the genes is determined, and the determined ratio may be compared with the preset cut-off values. Thus, depression can be diagnosed.
Also, the expression levels of genes of many healthy individuals and/or the expression levels of the genes of many patients may be used to determine the cut-off values for the mean expression levels of the genes used for diagnosis among the 18 genes, and the expression levels of the genes of the target subjects may be compared with the cut-off values. Thus, depression can be diagnosed.
The present invention has been completed based on the results of experiment that has analyzed gene expression in the peripheral whole blood sample of a depressed patient. With the use of the diagnostic method of the present invention, depression can be diagnosed in a simple manner with high accuracy.
Any patents or publications mentioned in this specification are indicative of the levels of those skilled in the art to which the invention pertains. Further, these patents and publications are incorporated by reference herein to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference.
Sequence Listing
Number | Date | Country | Kind |
---|---|---|---|
2007-065993 | Mar 2007 | JP | national |