The present invention relates to a biomarker for detecting the effectiveness of allergen immunotherapy upon performing the therapy on a patient with an immediate (Type I) allergy such as a pollen allergy.
For treating allergy disorders, typically pollen allergies, symptomatic based therapies have been performed in which an anti-allergic drug or the like is administered. Under such circumstances, immunotherapies have been employed for treating the underlying cause (Non-patent document 1). Currently, in Japan, allergen immunotherapy is performed as insured medical care by subcutaneously injecting a causative antigen extract, whereas, in Europe, sublingual allergen immunotherapy has been performed for some years through immunization by putting drops of an antigen extract in a mouth under the tongue (Non-patent documents 1 and 3).
However, the existing allergen immunoterapies require a long-term treatment period of two to three years regardless of the subcutaneously injection or the sublingual method, which, depending on the results of the treatment, may place enormous temporal and financial burdens on the patient.
In order to reduce the burden including medical expenses placed on the patients, a biomarker that can predict the therapeutic efficacy of allergen immunotherapy needs to be searched. Therefore, the present invention has an objective of providing a biomarker for detecting whether or not allergen immunotherapy is effective when it is performed on a patient suffering from an immediate (Type I) allergy such as a pollen allergy.
In order to solve the above-described problems, the present inventors have gone through intensive investigation, as a result of which, found that the copy number variations of certain genes are useful for detecting effectiveness of allergen immunotherapy, thereby accomplishing the present invention.
Thus, the present invention is as follows.
A method for detecting effectiveness of allergen immunotherapy for an immediate allergy in a subject, the method comprising the steps of: detecting a copy number variation of at least one gene selected from the following gene group in a specimen to be examined collected from the subject; comparing the obtained result from detecting the copy number variation with data of the copy number variation of the same gene from the parent population to correlate with effectiveness of the allergen immunotherapy:
<Gene Group>
AGXT2L2, AZGP1, CAV3, CEP72, CUL4A, CXCR7, C12orf60, C16orf48, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, HTT, ITCH, MARK2, METT10D, NAV3, NOB1, PAFAH2, PLEKHG4B, PPFIA1, SCARNA11, SDF2L1, SMPD3, SNORA44, SRP14, ST6GALNAC1, SULF2, TMED6, TYRP1, WFDC13, ZNFX1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, DCUN1D1, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MCCC1, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1, TDRD3, YEATS2, CCDC127, C14orf180, SIVA1, TNFRSFI4, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL. According to the present invention, the result from detecting the copy number variation can be correlated with the effectiveness of the allergen immunotherapy based on any of the following judgment criteria (a) to (d) or a combination thereof:
<Judgment Criteria>
(a) the allergen immunotherapy is effective when the copy number of at least one gene selected from the group consisting of NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, DCUN1D1, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MCCC1, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1, TDRD3 and YEATS2 is smaller than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 1;
(b) the allergen immunotherapy is effective when the copy number of at least one gene selected from the group consisting of AGXT2L2, AZGP1, CAV3, CEP72, CUL4A, CXCR7, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, ITCH, HTT, MARK2, METT10D, PAFAH2, PLEKHG4B, PPFIA1, SCARNA11, SDF2L1, SNORA44, ST6GALNAC1, SULF2, WFDC13, ZNFX1, CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 is larger than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 3;
(c) the allergen immunotherapy is ineffective when the copy number of at least one gene selected from the group consisting of AGXT2L2, CAV3, CEP72, CUL4A, CXCR7, C16orf48, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, HTT, ITCH, MARK2, METT10D, NOB1, PAFAH2, PLEKHG4B, SCARNA11, SDF2L1, SMPD3, SNORA44, SRP14, ST6GALNAC1, SULF2, TMED6, WFDC13, ZNFX1, CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 is smaller than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 1; and
(d) the allergen immunotherapy is ineffective when the copy number of at least one gene selected from the group consisting of C12orf60, NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1 and TDRD3 is larger than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 3.
A method for detecting effectiveness of allergen immunotherapy for an immediate allergy in a subject, the method comprising the steps of: detecting a copy number variation of at least one gene selected from the following gene group in a specimen to be examined collected from the subject; correlating the obtained result from detecting the copy number variation with effectiveness of the allergen immunotherapy based on any of the following judgment criteria (e) to (h) or a combination thereof:
(e) the allergen immunotherapy is effective when the copy number of at least one gene selected from the group consisting of NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, DCUN1D1, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MCCC1, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1, TDRD3 and YEATS2 is 1;
(f) the allergen immunotherapy is effective when the copy number of at least one gene selected from the group consisting of AGXT2L2, AZGP1, CAV3, CEP72, CUL4A, CXCR7, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, ITCH, HTT, MARK2, METT10D, PAFAH2, PLEKHG4B, PPFIA1, SCARNA11, SDF2L1, SNORA44, ST6GALNAC1, SULF2, WFDC13, ZNFX1, CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 is 3;
(g) the allergen immunotherapy is ineffective when the copy number of at least one gene selected from the group consisting of AGXT2L2, CAV3, CEP72, CUL4A, CXCR7, C16orf48, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, HTT, ITCH, MARK2, METT10D, NOB1, PAFAH2, PLEKHG4B, SCARNA11, SDF2L1, SMPD3, SNORA44, SRP14, ST6GALNAC1, SULF2, TMED6, WFDC13, ZNFX1, CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 is 1; and
(h) the allergen immunotherapy is ineffective when the copy number of at least one gene selected from the group consisting of C12orf60, NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1 and TDRD3 is 3.
According to the present invention, an example of allergen immunotherapy includes a sublingual allergen immunotherapy. Examples of an immediate allergy include at least one selected from the group consisting of pollen allergy, urticaria, food allergy, mite allergy, allergic rhinitis, bronchial asthma and atopic dermatitis.
The present invention provides a biomarker that is capable of detecting the efficacy of allergen immunotherapy, when the therapy is performed on a patient currently suffering from an immediate allergy or a subject who may not be, currently suffering from an immediate allergy but may in the future. By measuring the copy number variation of the gene collected from the subject, whether or not the allergen immunotherapy is effective at present or in the future can be examined. As a result, a treatment method that is effective for an allergy patient can be provided.
Hereinafter, the present invention will be described in detail.
1. General Outline
The present invention relates to a method for detecting the effectiveness of allergen immunotherapy in an immediate allergy patient by using a copy number variation of a gene in a specimen to be examined from a subject (immediate allergy patient, healthy subject, etc.) as an index (correlating with effectiveness of the allergen immunotherapy).
The present inventors have examined the therapeutic efficacy of sublingual allergen immunotherapy in patients with a cedar pollen allergy, and grouped them into a patient group with significant therapeutic efficacy against allergic symptoms (therapeutically effective group) and a patient group without efficacy (therapeutically ineffective group).
For each patient group, genes that correlate with the therapeutic efficacy were detected in a parent population including the therapeutically effective group and the therapeutically ineffective group using a technique called integrated analysis. “Integrated analysis” is a method that collectively puts clinical condition data, results from serological test, mRNA analysis and copy number variation analysis together and comprehensively processes a combination of all or some of these terms by statistical analysis. By doing so, genes that can be employed for detecting effectiveness of allergen immunotherapy, that are, certain genes (listed below) having copy number variations that allow detection of whether the allergen immunotherapy is effective or ineffective.
A copy number variation (CNV) refers to a genomic region whose copy number per cell differs among individual people in a certain population. A human cell generally inherits two (2 copies of) genes from both paternal and maternal sides. However, the number of a certain gene per cell may vary from individual to individual, which may be only one (1 copy) or three (3 copies) or more. Such individual variation in a gene is referred to as CNV. For example, the most frequently occurring number of CNV in the above-described parent population is represented as “N (Normal)” (CNV=N), the number of CNV smaller than N is represented as “L (Loss)” (CNV=L), and the number of CNV larger than N is represented as “G (Gain)” (CNV=G).
CNV of a gene contained in a sample of a subject is detected so that whether allergen immunotherapy is effective or ineffective in the subject can be judged prior to the treatment depending on which of the above-described L, N and G the CNV falls into.
Hence, according to the present invention, the copy number variation of a predetermined gene is analyzed so as to correlate the analysis result with efficacy of the allergen immunotherapy to examine whether or not the allergen immunotherapy is effective in the immediate allergy patient.
2. Allergen Immunotherapy
Allergen immunotherapy is the only curative treatment among the currently taken allergy treatments, and refers to an immune therapy which disables antigen-specific immune response by gradual exposure to allergens inside the body. Currently, sublingual allergen immunotherapy, subcutaneous allergen immunotherapy and the like are known.
Sublingual allergen immunotherapy is characterized by allowing antigen sensitization to be carried out at home without visiting the specialist and being painless without the need of entering a needle into the body (burden of the patient is small). Sublingual allergen immunotherapy is carried out by putting drops of an antigen extract under the tongue for antigen sensitization, where the antigen sensitization is performed daily starting from lower to higher concentrations in a stepwise manner for the first month of about two years of treatment duration, which is then followed by antigen sensitization twice a week.
On the other hand, subcutaneous allergen immunotherapy is characterized by being painful to some extent for subcutaneously injecting an antigen with a needle, and requires to visit the specialist at least 50 times or more during two years of treatment duration. Similar to sublingual allergen immunotherapy, subcutaneous allergen immunotherapy is carried out by performing antigen sensitization for about two years of treatment duration, where the antigen sensitization is performed daily starting from lower to higher concentrations in a stepwise manner for the first month, which is then followed by antigen sensitization twice a week.
According to the present invention, whether or not the therapy is effective can be detected for either allergen immunotherapy, but it is preferably sublingual allergen immunotherapy which is a allergen immunotherapy recommended by WHO.
3. Subject
A subject as a target of a method of the present invention, namely, a patient suffering from an immediate allergy as a target of allergen immunotherapy for an immediate allergy is not particularly limited as long as the patient presents an immediate (Type I) allergy.
For example, Type I allergies can be classified as follows.
Classification based on antigens: pollen allergy, food allergy and mite allergy.
Classification based on pathological conditions: urticaria, allergic rhinitis, bronchial asthma and atopic dermatitis.
Among them, pollen allergies (cedar pollen allergy, rice pollen allergy, ragweed pollen allergy, Japanese cypress pollen allergy, etc.) are preferable.
Even a healthy subject may have the potential of suffering from an immediate allergy in the future. Therefore, according to the present invention, a healthy subject may also be the subject other than the immediate allergy patients.
4. Analysis of Copy Number Variation
(1) Specimen to be Examined
The specimen to be examined may be, for example, a biological specimen from a healthy subject or an immediate allergy patient.
The specimen to be examined used for an analysis of a copy number variation (CNV) may be, for example, blood, mucous membrane of the nose, nasal discharge, sputum or the like collected from the above-described subject, and preferably blood or a component thereof (serum, plasma, etc.). Methods for collecting these biological specimens or the like are known by those skilled in the art.
When the biological specimen to be used for CNV analysis is blood, for example, cells are preferably extracted from a blood specimen to prepare a lysate thereof or to extract RNA (e.g., mRNA) or DNA therefrom. Preparation of a lysate and extraction of DNA may be carried out by using a known method or a commercially available kit. Alternatively, cells extracted from a blood specimen may further be sorted with a Fluorescence Activated Cell Sorter (FACS), MACS (registered trademark) or the like. An example of the extracted cell includes an immune cell. Examples of the immune cell include lymphocytes, macrophages, neutrophils, eosinophils, basophils, monocytes, dendritic cells and plasmacytoid dendritic cells (PDC), where CD4T cells, dendritic cells and basophils are preferable which are suggested to have an association with allergy. Furthermore, the specimen may be messenger RNA (mRNA) extracted from the above-described cell.
When the biological specimen is in a liquid form, it is preferably diluted, for example, in a buffer or the like to be used for the measurement of DNA.
Measurement of CNV is preferably conducted by a microarray method in terms of simple operation.
(2) Genes
According to the present invention, genes used for detecting effectiveness of allergen immunotherapy are as follows.
The above-listed genes were extracted by performing an integrated analysig (see Examples).
According to the present invention, at least one gene among the above-mentioned genes can be used.
Moreover, nucleotide sequence information of the genes are easily accessible from Accession numbers indicated above.
(3) Technique of Analyzing Gene Copy Variation
Examples of a method for analyzing a CNV variation of a gene according to the present invention include known methods such as a microarray method and a sequencing method, and, for example, may also use a commercially available kit (for example, “Agilent SurePrint G3 Human CNV microarray kit 2×400K” (Agilent Technologies)).
A DNA microarray has one ends of nucleotide probes fixed on a support in an array, and includes a DNA chip, a Gene chip, a microchip, a bead array and the like. An example of a DNA microarray assay such as a DNA chip includes GeneChip assay (Affymetrix). GeneChip technique utilizes a small-scale high-density microarray of oligonucleotide probes attached to a chip.
A sequencing method is a method of analyzing the presence or the absence of a copy number variation by amplifying a region including the copy number variation by PCR and sequencing the DNA sequence with a Dye Terminator or the like (Sambrook, Fritsch and Maniatis, “Molecular Cloning: A Laboratory Manual” 2nd Edition (1989), Cold Spring Harbor Laboratory Press).
According to a detection method like the one exemplified above, an oligonucleotide prepared to include a gene of interest is used as a probe or a primer. Accordingly, the present invention also provides an oligonucleotide prepared to include each of the genes of interest.
The oligonucleotide primer or oligonucleotide probe designed as described above may be chemically synthesized by a known technique or method but generally they are synthesized using a commercially available chemical synthesizer.
Moreover, the operation can be automated by adding a fluorescent label (e.g., FITC, FAM, VIC, Redmond Dye or the like) and a quencher for the fluorescent label to the probe.
(4) Judgment Criteria
The detection results may be correlated with effectiveness of the allergen immunotherapy based on, any of the following judgment criteria (a) to (d) or a combination thereof:
(a) the allergen immunotherapy is effective when the copy number of at least one gene selected from the group consisting of NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, DCUN1D1, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MCCC1, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1, TDRD3 and YEATS2 is smaller than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 1;
(b) the allergen immunotherapy is effective when the copy number of at least one gene selected from the group consisting of AGXT2L2, AZGP1, CAV3, CEP72, CUL4A, CXCR7, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, ITCH, HTT, MARK2, METT10D, PAFAH2, PLEKHG4B, PPFIA1, SCARNA11, SDF2L1, SNORA44, ST6GALNAC1, SULF2, WFDC13, ZNFX1, CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 is larger than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 3;
(c) the allergen immunotherapy is ineffective when the copy number of at least one gene selected from the group consisting of AGXT2L2, CAV3, CEP72, CUL4A, CXCR7, C16orf48, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, HTT, ITCH, MARK2, METT10D, NOB1, PAFAH2, PLEKHG4B, SCARNA11, SDF2L1, SMPD3, SNORA44, SRP14, ST6GALNAC1, SULF2, TMED6, WFDC13, ZNFX1, CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 is smaller than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 1; and
(d) the allergen immunotherapy is ineffective when the copy number of at least one gene selected from the group consisting of C12orf60, NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1 and TDRD3 is larger than the most frequent copy number (for example, 2) of the same gene from the parent population, preferably 3.
Herein, the terms “remarkably effective” and “effective” are synonymous.
Those skilled in the art can appropriately determine which of the judgment criteria (a) to (d) should a target gene used with the present invention be sorted into upon correlating the copy number of the gene with effectiveness of the allergen immunotherapy. For example, the target genes from the parent population of the patients undergoing hyposensitization treatment or a subpopulation extracted from the parent populatidn are classified into four groups based on their copy numbers (whether CNV is L or G) and the therapeutic efficacy (remarkably effective or ineffective). Then, for the groups with CNV of L or G, difference between the number of people belonging to the remarkably effective group and the number of people belonging to the ineffective group is calculated. A large difference means that judgment can be made as to which CNV gives remarkable effectiveness or ineffectiveness in the therapeutic efficacy.
For example, in Example 1, looking at CNV of GEMIN4 gene (Table 9) from 25 patients extracted from the parent population of the treated patients, 8 patients whose therapeutic efficacy was ineffective had CNV of 1 while the number of patients whose therapeutic efficacy was remarkably effective with CNV of 1 was zero (0). On the other hand, 12 patients whose therapeutic efficacy was remarkably effective had CNV of 3 while the number of patients whose therapeutic efficacy was ineffective with CNV of 3 was only one. The difference between the number of remarkably effective patients with CNV of 1 and the number of ineffective patients with CNV of 1 is 0−8=−8. In addition, the difference between the number of remarkably effective patient with CNV of 3 and the number of ineffective patient with CNV of 1 is 12−1=11. When the absolute value of this difference makes up at least 8% or more, preferably 12% or more and more preferably 25% or more of the patients subjected to the calculation, the gene can be included in the target genes that can be used for judging remarkable effectiveness or ineffectiveness. In the later-described example, since 24 people from the remarkably effective group and 25 people from the ineffective group were analyzed, a gene can be used as a target gene for judging remarkable effectiveness or ineffectiveness when the difference in the numbers of the patients is 8% or more of the total number of the patients, that is, two or more people (24×0.08≈2 or 25×0.08=2). The judgment can be “remarkably effective” when the result obtained by subtracting the number of ineffective patients from the number of remarkably effective patients is positive, and “ineffective” when negative. In the case where the number of the remarkably effective patients is subtracted from the number of the ineffective patients, the judgment results based on positive or negative value would be the other way around.
The present invention is also capable of calculating the difference between the number of the effective patients and the number of the ineffective patients in a more strict way. Although the calculation of the difference was conducted without considering the patients with CNV of N in the above-described example, the calculation can also take place in consideration of the patients with CNV of N. However, since the number of the patients with CNV of N does not serve as a criterion for sorting genes into the judgment criteria, it is considered as a background value and thus, preferably, a predetermined proportion, for example, 50% or more, 60% or more, 70% or more, 80% or more, 90% or more or whole (100%) number of patients in the parent population or the subpopulation is subtracted from the parent population. In this case, when the number of patients resulting from subtraction from the patients belonging to CNV=N is negative, the CNV=N patient is calculated as 0 (zero). This value is used as a correction value (“CNV=N correction value”), for example, in the following calculation.
The rates of remarkable effectiveness are calculated by the following calculation formula where CNV of L and G are represented as “CNV=L” and “CNV=G”, respectively.
(1) Rate of remarkable effectiveness with CNV=L
[{Number of remarkably effective cases with CNV=L/(Number of people with CNV=L+CNV=N Correction value+Number of people with CNV=G, in the remarkably effective groups)}−{Number of ineffective cases with CNV=L/(Number of people with CNV=L+CNV=N Correction value+Number of people with CNV=G, in the ineffective groups)}]×100
(2) Rate of remarkable efficacy with CNV=G
[{Number of remarkably effective cases with CNV=G/(Number of people with CNV=L+CNV=N Correction value+Number of people with CNV=G, in the remarkably effective groups)}−{Number of ineffective cases with CNV=G/(Number of people with CNV=L+CNV=N Correction value+Number of people with CNV=G, in the ineffective groups)}]×100
(3) When the value obtained in (1) or (2) above is equal to or higher than the criterion value (positive number), the gene targeted by the calculation is selected as a marker gene for remarkable effectiveness, whereas when the value is equal to or lower than the criterion value (negative number), the gene targeted by the calculation is selected as a marker gene for ineffectiveness. According to the above-described calculation formula, the target gene can be judged as a gene that indicates remarkable effectiveness when the resulting value is positive whereas the target gene can be judged as a gene that indicates ineffectiveness when the resulting value is negative.
For example, when the rates of remarkable effeCtiveness are calculated with respect to the above-mentioned GEMIN4 gene under the above-described conditions, the rates of remarkable effectiveness will be as follows:
(0/12−8/9)×100=−88.9 Formula (1):
(12/12−1/9)×100=88.9 Formula (2):
Here, CNV=2 (CNV=N) is considered as background, where a portion of the number of people corresponding to 80% of the parent population (for example, 24 (CNV=1)×0.8=19 people) was subtracted. Since subtraction of 19 people gave a negative number, it was considered as 0.
Then, if the absolute value of the criterion targeted for selection is set to 50, the gene is judged to indicate remarkable effectiveness when the rate is +50 or higher and the gene is judged to indicate ineffectiveness when the rate is −50. In the case of the above-described example, the selection criterion of the rate of remarkable effectiveness was set to 50 (absolute value) but it may be set to 10, 20, 30, 40, 50, 60, 70, 80 or 90 according to the present invention depending on the purpose of the analysis.
Accordingly, since the value obtained in Formula (1) is a negative value, GEMIN4 gene with CNV of 1 is selected as a gene that indicates that the allergen immunotherapy is ineffective, whereas GEMIN4 gene with CNV of 3 is selected as a gene that indicates that the allergen immunotherapy is remarkably effective.
Furthermore, according to the present invention, in order to select a gene for judging between remarkable effectiveness and effectiveness, various statistical analyzing techniques such as multivariate analysis may be used. Examples of a multivariate analysis method include a multiple regression analysis, a discriminant analysis, a principal component analysis, a factor analysis and the like. See Example 2 described below for details.
On the other hand, according to the method of the present invention, the results from the CNV measurement of the above-described gene obtained from a predetermined number of patients (primary parent population) can be used as reference data so that the effectiveness of allergen immunotherapy can be judged for a subject outside the primary parent population by comparing the CNV of each gene of the subject with the reference data.
The data of the subject outside the primary parent population may also be integrated into the values of the primary parent population, and the expression level thereof may again be subjected to data processing (averaging, etc.) so as to increase the number of the target subjects (the parent population). By increasing the number of cases, accuracy of the CNV of each gene and thus accuracy of detection or diagnosis can be improved.
In an actual clinical setting, the method of the present invention can be applied to samples individually collected from patients, subjects who had medical checkup or the like so as to judge whether the allergen immunotherapy is remarkably effective or ineffective to the patients or the subjects before starting the actual treatment. Specifically, genomic DNA is collected from the sample to detect the copy number of the gene associated with the remarkable effectiveness or ineffectiveness of the allergen immunotherapy. When the copy number is L or G, the patient or the subject can be judged to show remarkable effectiveness or ineffectiveness to allergen immunotherapy.
For example, in the case where CNV of NAV3 gene present in genomic DNA collected from a certain patient is L, this patient can be expected to show remarkable effectiveness to allergen immunotherapy.
Moreover, copy numbers were examined for M number of genes among NAV3, TYRP1, NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, DCUN1D1, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MCCC1, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1, TDRD3 and YEATS2 genes in a genome collected from a certain patient. If the copy number is L for some of the genes examined but N or G for the rest of the genes, the copy numbers of L, N and G are scored as +1, 0 and −1, respectively, and the scores are summed up for each gene. From the resulting sum, judgment of remarkable effectiveness or ineffectiveness can comprehensively be made. In this case, when the sum is 50% or more, preferably 60% or more, 70% or more, 80% or more, 90% or more or 100% with respect to M, the patient can be judged to show remarkable effectiveness to allergen immunotherapy.
Alternatively, the sum of the scores assigned to the copy numbers of a certain gene or a plurality of genes can be presented to the patient while explaining the correlation between the copy number or the sum with remarkable effectiveness or ineffectiveness of the allergen immunotherapy so that it can be used as a risk factor for the patient himself/herself to judge whether to actually perform or continue the allergen immunotherapy with the copy number or the sum in mind.
Furthermore, the detection result may be used, for example, as the main information or subsidiary information for making a confirmation diagnosis of the therapeutic efficacy against an immediate allergy. In order to make a diagnosis for confirming that the immediate allergy has been treated, judgment can be made comprehensively by combining the above-described detection results with at least one selected from the results of physical remarks, the results from a serological examination and the like after the allergen immunotherapy.
5. Microarray and Kit
Furthermore, one ends of the oligonucleotides of the present invention can be fixed to a support such as glass, silicon or gel to prepare a microarray. The oligonucleotide array can be produced, for example, by light irradiation chemical synthesis method (Affymetrix) in which a solid-phase chemical synthesis method is combined with a photolithography production technique employed in the semiconductor industry. In order to clearly define the border between the chemical reaction sites on the chip, a particular chemical synthesis process is carried out using a photolithography mask, thereby constructing a high-density array having oligonucleotide probes attached to predetermined positions of the array.
Moreover, according to another aspect of the present invention, a kit for detecting CNV variations of a gene, which includes oligonucleotides of the present invention and/or a microarray prepared with the oligonucleotides is provided. Such a kit may also include, other than the oligonucleotides of the present invention and/or the microarray prepared with the oligonucleotides, a solution for detection reaction, an oligonucleotide as a control, a vessel used for detection reaction, instructions and the like.
Hereinafter, the present invention will be described more specifically by means of examples, although these examples are provided for the purpose of illustrating the present invention and do not limit the present invention.
<Method>
1. Sorting Between Therapeutically Effective Group and Ineffective Group
First, therapeutic efficacy of sublingual allergen immunotherapy for cedar pollen allergy was examined during two years of administration of a pollen allergy therapeutic drug (a allergen immunotherapy drug). The allergen immunotherapy drug (standardized cedar pollen extract (Torii Pharmaceutical Co., Ltd.)) was given under the administration schedule shown in Table 2.
Patients were classified into a group with remarkable therapeutic efficacy (therapeutically effective group) and a group with no therapeutic efficacy (group in which conditions worsened or did not change with the treatment: therapeutically ineffective group) with respect to the allergic symptoms. In doing so, conditions prior to treatment in August, Heisei 18 (2006), conditions in the middle of the treatment in April, Heisei 19 (2007) and conditions in the later period of the treatment in April Heisei 20 (2008) were sorted into the effective group and the ineffective group of the pollen allergy treatment using, as indicators, clinical conditions obtained by comprehensively scoring the conditions of the mucous membrane of the nose in the allergy diary.
2. Collection of Serum
During the period of the sublingual allergen immunotherapy, bloods were drawn 8 times under the schedule shown in
3. Collection of DNA and RNA
RNA samples before and after the sublingual allergen immunotherapy were collected, separately from the above-described serum, upon blood drawing 1 (prior to treatment) and blood drawing 7 (following treatment) shown in
Sublingual allergen immunotherapy was carried out by administering actual drugs to 202 patients for two year from August, Heisei 18 (2006). Therapy was carried out in eight hospitals in Japan. Bloods were sampled under the schedule of blood drawings 1 to 8 shown in
The amount of blood drawn per single blood drawing was 8 ml per person, which was subjected to cell isolation operation immediately after the blood drawing with a flow cytometry (Becton, FACSaria) to isolate into CD4T cell, a dendritic cell and the like. The isolated cells were each suspended in a cell freeze preservation medium and then stored at −80° C. until use.
Along with the therapeutic efficacy judgment in June, Heisei 20 (2008), 25 patients were sorted out from each of the therapeutically effective and ineffective samples that had been classified, and their RNAs were purified according to an ordinary technique. The purified messenger RNA (mRNA) was immediately subjected to a gene analyzer (DNA chip from Affymetrix).
Meanwhile, DNA was collected once during the examination period as described above, for which the amount of blood drawn was 8 ml per person. Centrifugal operation was performed immediately after the blood drawing to isolate blood cells. DNA was collected from the resulting blood cells according to an ordinary technique. The purified and isolated DNA was stored at −80° C. until gene analysis.
4. Serological Test
For the sera obtained above, the cedar pollen-specific IgE level (RISA-cedar) and the total IgE level (RAST) were measured by ELISA according to an ordinary technique. Fifty types of immune-related humoral factors in the blood (Table 3) were measured according to a fluorescent bead measurement method (Bio-plex from Bio-Rad).
5. Integrated Analysis
Clinical study was conducted by actual drug administration in a scale of about 200 patients. The treatment duration was about two years, and data such as clinical conditions were obtained for about 150 patients at the end. The conditions (severity levels) were scored to classify the therapeutic efficacy in detail. As a result, classification gave'five groups, among which the group with the highest therapeutic efficacy (remarkably effective treatment group) and the worst group (therapeutically ineffective group) were extracted. Biomaterials of the top 24 patients and the bottom 25 patients from these groups were subjected to the following research analysis.
Hereinafter, the detail will be described.
Sublingual allergen immunotherapy was conducted from the beginning of the treatment for two years and 154 patients who were interviewed about their clinical conditions in June, 2008 (Heisei 20) were subjected to analysis. In addition to the previous assessment of the conditions, data of QOL and the like were added for further research and obtained the results shown in
In
The conditions (severe conditions) with the highest severity level of 5 are shown in red while the lowest conditions (mild conditions) of 1 are shown in green. Conditions that lie between the highest and the lowest severity levels are shown in a gradient shading from red to green. Patients extracted by therapeutic efficacy are vertically arranged in order starting from the highest efficacy for the remarkably effective group and from the lowest efficacy for the therapeutically ineffective group. The top 24 and 25 patients from the remarkably effective group and the therapeutically ineffective group, respectively, were subjected to the subsequent analysis.
<Analysis Terms and Integrated Analysis Method>
(1) Analysis of Clinical Conditions
The severity levels at the beginning of the allergen immunotherapy (as of 2006) were organized into 5 stages from 1 (mild) to 5 (severe) to be used as indicators of severity. As indicators of alleviation upon the efficacy judgment after two years of allergen immunotherapy (as of 2008), elimination of the symptom or improvement in the severity level by two stages was defined as grade 1 (remarkably effective treatment group), improvement in the severity level by one stage as grade 2, and no change or worsening as grade 3 (therapeutically ineffective group).
(2) Serological Test
In addition to the above-described 50 immunological humoral factors (Table 3), the antigen-specific IgE level and the total IgE level were added.
(3) mRNA Analysis
mRNA derived from CD4+T cells fractionated by FACS from the blood samples drawn at blood drawing points 1 and 7, i.e., upon assessment of severity or alleviation was used for detection with GeneChip (R) Human Gene1.9 ST array from Affymetrix. As a pre-process, normalization of mRNA expression level was carried out using RMA algorithm (1) with aroma.affymetrix package [R program] to obtain corrected gene mRNA value.
(4) CNV Analysis
Agilent Sure Print 3G CNV microarray kit was used to detect total of 49 patients from the patient groups (24 from the remarkably effective treatment group and 25 from the therapeutically ineffective group) to detect the CNV copy numbers according to the following method.
For genomic DNA extracted from peripheral blood-derived blood samples, Log2 value resulting from competitive hybridization with HapMap Japanese sample NA19000 reference DNA as a pre-process was normalized with Normtools (2) based on the population data. The corrected Log2 values were used as Copy Number Aberration (CNA) for segment judgment by CBS algorithm [R program], thereby obtaining CNV data. Based on this CNV data, CNV copy number was judged by gene unit using [median population +/−2SD population*median individual] of the CNV data value as a threshold (Copy number=2).
(5) Analysis Models and Integrated Analysis
1) Severity of pollen allergy in 2006, CNV: mRNA (blood drawing point 1 prior to treatment)
2) Efficacy assessment (alleviation) in 2008, CNV: mRNA (blood drawing point 7 after treatment)
Analysis model 1) was analyzed for relationship with severity reflecting the conditions of pollen allergy in an individual prior to the commencement of the clinical study in 2006, while analysis model 2) was analyzed for relationship with alleviation reflecting the judgment of efficacy of allergen immunotherapy by the clinical study in 2008.
For 18,945 genes intertwined with gene symbols based on mRNA data and CNV data, a linear model regression analysis was conducted using three parameters, i.e., serological test terms, mRNA values and CNV values of the above-mentioned analysis models 1) and 2) as explanatory variables and severity level of pollen allergy in the spring of 2006 and the value of efficacy judgment of the cedar pollen extract trial in the spring of 2008 as dependent variables.
P values and contribution levels (corrected R2 values) were calculated for about 3.63 million analyses as analysis models 1) and 2). Sorting, was conducted for analysis models 1) and 2) by filtering using contribution levels of the analysis models to the regression line as criteria. Thereafter, relationship with P value was assessed as Bonferroni corrected P value. The resulting analysis models and each explanatory variable were examined in detail in terms of the association between the clinical serum marker value, the mRNA value or the CNV value and the clinical phenotype, thereby obtaining final candidate models.
Specifically, with respect to each grade of analysis model 1) (stages 1 to 5) and analysis model 2) (stages 1 to 3), R-2.9 program was used to calculate R2 value as the contribution level and P value, for a combination in which any one term is modified in proportion to the measure of other term in a non-additive association, as an interaction term of three parameter terms, i.e., (a) mRNA gene Log2 strength, (b) relative Log2 ratio of the CNV segment on the gene and (c) serological test terms determined before and after the mRNA collection/blood drawing points. Here, as to the serological test term, values from blood drawings 1 and 2 and blood drawings 5 and 6 were made to correspond to analysis models 1) and 2), respectively. After the calculation of the statistic values, first-stage filtering was performed on each analysis model to extract combinations as candidate combinations of R2>0.2 and P value <0.05. As a result, 79 and 396 combinations were obtained for analysis models 1) and 2), respectively. Among these combinations, combinations were obtained by second-stage filtering in which the results of copy number judgment obtained in (4) CNV analysis above were once again assessed with respect to the CNV copy number of the gene.
As a result, 20 and 227 combinations were obtained in analysis models 1) and 2), respectively while 12 and 86 genes were obtained in analysis models 1) and 2), respectively. Among these genes, general linear regression model analysis was performed using analysis model 2) and serological test term as the third-stage filtering so as to extract significant serological test term (P value <0.05) from this model analysis. As a result, 22 genes with 37 combinations were obtained.
<CNV Analysis and Results from Integrated Analysis>
From the above-described results, 12 genes (Table 4) were obtained for analysis model 1) up to the second-stage filtering, whereas 22 genes were obtained for analysis model 2) up to the third-stage filtering (Table 5), resulting in total of 34 genes from analysis models 1) and 2) (Table 6).
Among analysis models 1), for example, CNV copy number of GJC3 gene was found to show strong correlation with the clinical condition. In particular, all of the cases with the copy number (CN) of 3 (“G” bar in
In addition, among analysis model 2), for example, CNV copy numbers of METT10D and CXCR7 genes were found to show strong correlation with the clinical conditions. In particular, all of the cases with the copy number of 3 (“G” bars in
Furthermore, for each gene, looking at the CNV copy numbers of patients whose treatments were remarkably effective (24 cases) or ineffective (25 cases), the number of patients with CNV copy number of 1 for the gene was compared with the number of patients with CNV copy number of 3. Then, the CNV copy number for the gene that appears more often in the remarkably effective group was scored +1 while the CNV copy number that appears more often in the ineffective group was scored −1. Furthermore, when the difference between the number of patients with CNV copy number of 1 and the number of patients with CNV copy number of 3 for the gene is equal to or less than 1, it was scored 0.
The results obtained by applying this scoring to each patient are shown in Table 7. The scores for each patient were summed, and distribution and the summed score for each patient are shown in Table 8.
Target genes to be selected were defined as genes having difference in the numbers of people between the remarkably effective group and the ineffective group of 2 or more for CNV of 1 or 3 according to the following criteria.
(a) Remarkably effective when CNV is 1
(b) Remarkably effective when CNV is 3
(c) Ineffective when CNV is 1.
(d) Ineffective when CNV is 3
Genes that fall in the above-described criteria are shown in Table 9. In Table 9, circles are marked on the right side for the genes that fall in the above-described criteria.
As a result, the allergen immunotherapy was found to have a tendency to be effective for NAV3 and TYRP1 when the copy number was 1 while the allergen immunotherapy was found to have a tendency to be effective for AGXT2L2, AZGP1, CAV3, CEP72, CUL4A, CXCR7, C19orf34, DNAJB8, FTH1, GEMIN4, GIGYF2, GJC3, ITCH, HTT, MARK2, METT10D, PAFAH2, PLEKHG4B, PPF1A1, SCARNA11, SDF2L1, SNORA44, ST6GALNAC1, SULF2, WFDC13 and ZNFX1 when the copy number was 3.
On the other hand, the allergen immunotherapy was found to have a tendency to be ineffective for AGXT2L2, CAV3, CEP72, CUL4A, CXCR7, C16orf48, C19orf34, DNAJB8, FTH1, GEMJN4, GIGYF2, GJC3, HTT, ITCH, MARK2, METT10D, NOB1, PAFAH2, PLEKHG4B, SCARNA11, SDF2L1, SMPD3, SNORA44, SRP14, ST6GALNAC1, SULF2, TMED6, WFDC13 and ZNFX1 when the copy number was 1 while the allergen immunotherapy was found to have a tendency to be ineffective for C12orf60, NAV3 and TYRP1 when the copy number was 3.
Search for Genes that Contribute to Assessment of Efficacy with Higher Probability
According to the present example, multiple regression analysis was performed for the purpose of searching for genes necessary for carrying out the judgment of the efficacy with higher probability.
A relational expression between a certain variable y (referred to as a criterion variable or a dependent variable) and variables x1, x2, . . . xp that are considered to influence variable y (referred to as explanatory variables or independent variables) was obtained, based on which y value can be predicted from values x1, x2, . . . xp or the contribution level of each x upon such prediction can be assessed. Such analysis is referred to as regression analysis. In particular, the analysis is called a multiple regression analysis when there are two or more explanatory variables.
According to the present example, for the above-described 34 genes, a multiple regression analysis was conducted based on a general linear model having binomial dependent variable y representing remarkable effectiveness and ineffectiveness and trinomial explanatory variable representing the copy number of each gene where 2 copies (CNV=2) being the baseline, increase therefrom (CNV=3) as 1 and decrease therefrom (CNV=1) as −1, thereby selecting an AIC model using a backward elimination algorithm (step set value: 10,000). Here, an AIC model (“Akaike's Information Criterion” model) refers to a statistical indicator that is frequently used worldwide and that was developed for the purpose of model selection, where a model with the minimum AIC is selected given a plurality of models for an identical data set.
The multiple regression analysis was conducted using R-2.9 program software (R Development Core Team).
The results are shown in Table 10.
As a result, after the process of 18-stage stepwise selection of AIC optimal model, 17 genes with AIC=97.94 were found. Hence, use of the above-mentioned 17 genes for the judgment of the remarkably effective/ineffective groups was found to be an optimal analysis model. The regression coefficient β shows the relationship between the copy number and efficacy. Positive (+)β value in the analysis model means that the increase in the copy number (CN=3) upon analysis of the remarkably effective and ineffective groups is likely to be related to the remarkably effective group whereas negative (−)β means that the decrease in the copy number is likely to be related to the ineffective group.
mRNA analysis was performed using mRNA derived from the basophil fractionated from the blood sample drawn at blood drawing point 7 by FACS. In addition, CNV analysis was carried out for each patient group (22 patients from the remarkably effective treatment group and 22 patients from the therapeutically ineffective group (total of 44 patients)). The analysis model was “efficacy assessment (alleviation) in 2008, CNV: mRNA (blood drawing point 7 after the treatment)”. Other than the above-described points, the analysis was carried out in the same manner as Example 1.
Specifically, for 13,792 genes intertwined with gene symbols based on mRNA data and CNV data, a linear model regression analysis was conducted using three parameters, i.e., serological test term, mRNA value and CNV value of the above-mentioned analysis model as explanatory variables and the value of efficacy judgment of the cedar pollen extract trial in the spring of 2008 as dependent variables.
P values and contribution levels (corrected R2 values) were calculated for about 1.49 million analyses as the analysis model. Sorting was conducted by filtering using contribution level of the analysis model to the regression line as criteria. Thereafter, relationship with P value was assessed as Bonferroni corrected P value.
The resulting analysis model and each explanatory variable were examined in detail in terms of the association between the clinical serum marker value, the mRNA value or the CNV value and the clinical phenotype, thereby obtaining a final candidate model.
Specifically, with respect to each grade of the analysis model (stages 1 to 3), R-2.9 program was used to calculate R2 value as the contribution level and P value, for a combination in which any one term is modified in proportion to the measure of other term in a non-additive association, as an interaction term of three parameter terms, i.e., (a) mRNA gene Log2 strength, (b) relative Log2 ratio of the CNV segment on the gene and (c) serological test terms determined before and after the mRNA collection/blood drawing point. Here, the value from blood drawing 6 was made to correspond to the analysis model.
After the calculation of the statistic values, first-stage filtering was performed to extract combinations as candidate combinations of R2>0.25 and P value <0.00000363. As a result, 254 combinations, were obtained. Among these combinations, combinations were obtained by second-stage filtering in which the results of copy number judgment obtained in the CNV analysis were once again assessed with respect to the CNV copy number of the gene.
Thus, the present analysis model and the serological test terms were used for a general linear regression model analysis to extract significant serological test term (P value <0.05) in this model analysis. As a result, 21 genes were obtained with 254 combinations (see Tables 11 and 12).
Correlations between the CNV copy number of each gene and the clinical condition are shown in Tables 6 to 8.
Targets of analysis
Remarkably effective: 22 patients
Ineffective: 22 patients
Other than the above-described genes, the following genes were also suggested of their significance. Therefore, in the present example, 39 genes consisting of the above-mentioned 21 genes found to have significance as well as the following 13 genes suggested to have significance (Table 13) were analyzed.
Similar to Example 1, focusing on the CNV copy numbers of the patients with remarkable treatment effectiveness (22 cases) and patients with ineffectiveness (22 cases) for each gene, the number of patients having CNV copy number of 1 was compared with the number of patients having CNV copy number of 3 for the gene so as to carry out scoring of the CNV copy numbers. The results obtained by applying the scores to the patients are shown in Table 14. In addition, the scores for each patient are summed and the results showing distribution of the total score for each patient are shown in Table 15.
Selection of target genes were carried out according to the following criteria for each of the variations, i.e., CNV of 1 or 3, where the difference in the numbers of people between the remarkably effective group and the ineffective group was equal to or more than 2.
(a) Remarkably effective when CNV is 1
(b) Remarkably effective when CNV is 3
(c) Ineffective when CNV is 1
(d) Ineffective when CNV is 3
Genes that fall in the above-described criteria are shown in Table 16. In Table 16, circles are marked on the right side for the genes that fall in the above-described criteria.
As a result, the allergen immunotherapy was found to have a tendency to be effective for NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, DCUN1D1, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MCCC1, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1, TDRD3 and YEATS2 when the copy number was 1, while the allergen immunotherapy was found to have a tendency to be effective for CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPR132, LOC389257, BRF1 and ADSSL1 when the copy number was 3.
On the other hand, the allergen immunotherapy was found to have a tendency to be ineffective for CCDC127, C14orf180, SIVA1, TNFRSF14, AHNAK2, C14orf79, LOC25845, PLD4, GPRI32, LOC389257, BRF1 and ADSSL1 when the copy number was 1, while the allergen immunotherapy was found to have a tendency to be ineffective for NCAM2, PCDH17, CHODL, BTG3, DIAPH3, GBA3, IFRD1, KCNT2, THOC7, TMEM168, BST1, C7orf53, CD38, FGFBP1, FOXP2, GLRB, GTF2B, HSP90AB2P, MDFIC, ODF2L, PDGFC, SEP15, SH3GLB1 and TDRD3 when the copy number was 3.
The present invention can detect whether or not allergen immunotherapy is effective in an immediate allergy patient. A method of the present invention can be used for selecting a treatment or a therapeutic drug for an immediate allergy.
Number | Date | Country | Kind |
---|---|---|---|
2009-244965 | Oct 2009 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2010/068869 | 10/25/2010 | WO | 00 | 6/18/2012 |