The present invention relates to the field of molecular diagnostics. More specifically the present invention relates to means and methods for predicting a risk of a subject for Type 1 diabetes (T1D).
Type 1 diabetes (T1D) is a progressively developing multifactorial disease resulting from immune-mediated destruction of insulin-producing β cells in the pancreatic islets. Subsequently, T1D patients are dependent on exogenous insulin and blood glucose monitoring, and currently there is no prevention or cure for the disease. The worldwide T1D incidence is increasing at an alarming rate of 4% annually, especially in children under 5 years of age. Accordingly, T1D is one of the most common chronic childhood diseases, with estimated 86 000 children developing T1D each year.
Currently, the appearance of T1D-associated autoantibodies is the first, and only, measurable parameter used to predict progression toward T1D in genetically susceptible individuals. Although the disease progression rate varies greatly, the children with genetic HLA risk expressing at least two T1D autoantibodies will very likely progress to clinical T1D. On the other hand, autoantibodies are poor prognostic markers in predicting the timing of clinical onset of T1D, and cannot be used as endpoints in clinical intervention studies. In addition, appearance of autoantibodies is indicative of an active autoimmune reaction, where immune tolerance has already been broken. Thus, there is a clear need for new markers predicting the onset of autoimmune reaction preceding T1D, or reflecting the beta cell function, in order to allow a window for complete disease prevention.
WO 2008/112772 suggest interleukin-1β (IL1B), early growth response gene 3 (EGR3) and prostaglandin-endoperoxide synthase 2 (PTGS2) as diagnostic markers of T1D. The suggestion is based on studies, wherein patients with newly diagnosed T1D and healthy controls were employed as study subjects. No predictive markers of T1D are disclosed, and no conclusion can be drawn regarding T1D progressors, i.e. subjects who will eventually develop T1D.
Orban et al. (J. Autoimmun. 28 (2007) 177-187) discloses differences in gene expressions levels between patients with new onset of T1D, patients with long term Type 2 diabetes, and healthy controls. All patients employed in the study were adults. Interleukin 32 (IL-32) is disclosed as a gene whose expression is lower in CD4+ T-cells of patients with T1D than in those of the controls. No predictive markers are disclosed.
WO 2014/207312 discloses predictive markers of T1D identified on the basis of microarray measurements of whole blood RNA samples. Notably, IL32 is not among the predictive markers disclosed, and improved predictive markers are still needed.
An object of the present invention is to provide improved methods and means for determining T1D in an individual, particularly for determining a preclinical T1D status in an individual.
This object is achieved by a method and an arrangement, which are characterized by what is stated in the independent claims. Some specific embodiments of the invention are disclosed in the dependent claims.
The present invention is based, at least partly, on mRNA-sequencing based analysis of 306 cell samples longitudinally collected at 3, 6, 12, 18, 24 and 36 months of age from children developing T1D-associated autoantibodies and/or clinical T1D, paired with gender, age and HLA risk-matched children who did not show signs of T1D-releted autoimmune reaction during the course of the study, collected in the international DIABIMMUNE study following at-risk neonates. For analysis, PBMC samples were sub-fractionated into CD4+ T cells and CD8+ T cells, and also the negative (CD4−CD8−) fraction was analysed together with an aliquot of the original PBMC population as a control.
The results indicate that fractionation of the cells, and especially analysis of the enriched CD8+ population, allowed specific signature identification and revealed novel beta-cell autoimmune-related genes. Notably, interleukin 32 (IL-32) and co-regulated gene signature were identified to be upregulated when children were progressing towards T1D. These first longitudinal unbiased RNA sequencing data from high-risk children highlight the involvement of novel genes and pathways in T1D pathogenesis, and indicate that these genes can be utilized in early prediction of the disease activity.
The present invention thus provides a method of determining Type 1 Diabetes (T1D) in an individual, wherein the method comprises assessing the expression level of interleukin 32 (IL-32) in a sample obtained from said individual. Also provided is use of IL-32 for determining T1D in an individual.
In a further aspect, the invention provides a kit and use thereof in the present method, the kit comprising one or more testing agents capable of detecting the expression level of IL-32 in a biological sample obtained from an individual whose T1D is to be determined.
Further aspects, specific embodiments, objects, details, and advantages of the invention are set forth in the following drawings, detailed description, and examples.
In the following the invention will be described in greater detail by means of preferred embodiments with reference to the attached drawings, in which
Interleukin-32 (IL-32) is a pro-inflammatory cytokine that in humans is encoded by the IL32 gene on chromosome 16 p13.3. The gene has eight exons and at least nine splice variants (i.e. isoforms), namely, IL-32α, IL-32β, IL-32γ, IL32δ, IL-32ε, IL-32ζ, IL-32η, IL-32θ, and IL-32s are known in the art. As used herein, the term “IL-32” refers to any splice variant of IL-32 or a combination thereof, unless otherwise indicated. Some embodiments of the invention may relate to any particular splice variant of IL-32.
The present invention relates to different aspects of IL-32 for use as a marker of increased risk of or progression towards Type 1 diabetes (T1D). Thus, in some non-limiting implementations, IL-32 may be used for determining, predicting or monitoring an individual's risk of or progression towards T1D. Further implementation are disclosed below.
Accordingly, herein is provided an in vitro method of determining Type 1 Diabetes (T1D) status, especially preclinical T1D status, in an individual on the basis of the expression level of IL-32 in a sample obtained from said individual. Increased expression of IL-32 as compared with a relevant control is indicative of an increased risk of T1D or progression towards T1D. Accordingly, non-increased or normal expression level of IL-32 is indicative of non-increased risk of T1D or progression towards T1D.
As used herein, the term “T1D status” refers to any distinguishable manifestation of a disease, including non-disease. For example, the term includes, without limitation, information regarding the presence or absence of the disease, the presence or absence of a preclinical phase of the disease, the risk of the disease, the stage of the disease, and progression of the disease.
As used herein, the term “preclinical T1D” refers to impaired glucose tolerance prior to onset of clinical T1D. Subjects with preclinical T1D are autoantibody positive.
As used herein, the term “clinical T1D” refers to a situation, wherein the subject fulfills one of the diagnostic criteria for diabetes. In the presence of symptoms of diabetes (increased thirst, increased urination, and unexplained weight loss), the criterion is a single randomly measured plasma glucose level of ≥11.1 mmol/l (or with a single randomly measured venous blood glucose level of >10.0 mmol/l). In the absence of symptoms of diabetes, the criterion is either 1) a raised random plasma glucose reading ≥11.1 mmol/l (venous blood glucose ≥10.0 ml/l) on two occasions, 2) a raised fasting plasma glucose reading ≥7.0 mmol/l (venous blood glucose ≥6.1 ml/l) on two occasions, or 3) a diabetic oral glucose tolerance test (OGTT) by the WHO criteria, i.e. fasting venous plasma glucose ≥7.0 mmol/l (fasting venous blood glucose ≥6.1 mmol/l=110 mg/dl) on two occasions, or 2 hour venous plasma glucose ≥11.1 mmol/l (2 hour venous blood glucose ≥10.0 mmol/l] on two occasions. Accordingly a second OGTT should be performed, if the first one is diabetic. There should be an interval of at least one week between these two OGTTs.
In some embodiments, the present method may optionally comprise determining changes in the expression level of IL-32 in an individual at different time points in order to monitor, preferably prior to seroconversion, any changes in the development of the risk of or progression towards T1D. For monitoring purposes, said determination is repeated at least twice at different time points but it may be repeated as many times and as often as desired. In some embodiments, it is envisaged that the greater the increase in the IL-32 expression level, the higher the risk of or faster the progression towards T1D. Accordingly, low increase in the expression level of IL-32 may be indicative of a low risk of or slow progression towards T1D.
In some implementations, the present method of determining, predicting or monitoring an individual's risk for T1D may further include therapeutic intervention. Once an individual is identified to have an increased risk for T1D, he/she may be subjected to, for instance, dietary or other changes in the individual's lifestyle to prevent, inhibit or reduce the risk of or progression towards T1D.
The present method of determining T1D in an individual may be used not only for determining, predicting or monitoring an individual's risk of or progression towards T1D but also for screening new therapeutics or preventive drugs for T1D. In other words, the IL-32 may be used for assessing whether or not a candidate drug or intervention therapy is able to decrease the expression level of IL-32 of an at-risk individual towards that of a negative control or towards that of an individual who is not at risk of T1D. For example, individuals identified to have an increased risk for T1D on the basis of their IL-32 expression levels could be employed as targets in preventive vaccination trials or in other trials aimed for identifying preventive drugs or agents, such as probiotics, or other intervention therapies for T1D. Thus, the present method may also be used for stratifying individuals for clinical trials.
The present method of determining T1D in an individual may also be formulated as a method of identifying an individual at risk of T1D. Accordingly, anything disclosed herein with respect to the method of determining T1D in an individual, or e.g. details, embodiments or uses thereof, apply also to the method of identifying an individual at risk of T1D.
In some important embodiments, the present method of determining T1D in an individual is carried out prior any signs of seroconversion or prior to any clinical signs of T1D. As shown in the experimental part, increased expression of IL-32 may be detected, at least in some cases, at least as early as 12 month prior to seroconversion. Thus, IL-32 may be used for determining an individual's stage of progression towards T1D. In some embodiments, said stage may be denoted as a pre-seroconversion stage.
As used herein, the term “seroconversion” refers to the first detection of one or several T1D-associated autoantibodies against beta cell-specific antigens in serum. These include islet cell specific autoantibodies (ICA), insulin auto-antibodies (IAA), glutamic acid decarboxylase 65 autoantibodies (GADA), islet antigen-2 autoantibodies (IA-2A), and zinc transporter 8 autoantibodies (ZnT8A). In some embodiments, the following cut-off values may be used for determining the presence or absence of the autoantibodies: ICA≥4 JDFU (Juvenile Diabetes Foundation units), IAA≥3.48 RU (relative units), GADA≥5.36 RU, IA-2A≥0.43 RU, and ZnT8A≥0.61 RU. Seroconversion may occur years, e.g. 1 to 2 years, before clinical diagnosis.
Typically, the individual whose risk for T1D is to be determined is a human subject, preferably a child or an adolescent. In some more preferred embodiments, said subject does not show any signs of seroconversion. As used herein, the terms “subject” and “individual” are interchangeable.
More generally, the term “subject” as used herein includes, but is not limited to, mammals such as humans and domestic animals such as livestock, pets and sporting animals. Examples of such animals include without limitation carnivores such as cats and dogs and ungulates such as horses.
The present invention is particularly applicable to individuals having a Human Leukocyte Antigen (HLA)-conferred risk for T1D. As used herein, the term “HLA-conferred risk for T1D” refers to a predisposition to T1D as determined on the basis of the individual's HLA genotype. In some embodiments, HLA-conferred susceptibility is assigned if the individual carries HLA-DQB1 alleles *02/*0302 or *0302. In the experiments conducted, T1D diagnosed individuals whose risk was HLA-conferred were compared with control subjects with the same susceptibility. Accordingly, HLA-conferred susceptibility may be taken into account when choosing a relevant control to be used in the present method.
As used herein, the term “increased expression of IL-32” refers to an up-regulated expression of IL-32 in a sample obtained from an individual whose T1D risk is to be determined as compared to a relevant control. Said expression can be determined at any desired molecular level including, but not limited to protein level and polynucleotide level, including RNA level, such as mRNA level. Accordingly, in some embodiments, the term refers to increased transcription of IL-32 RNA; while in other embodiments, the term refers to increased amount of IL-32 protein, for example. The increase can be determined qualitatively and/or quantitatively according to standard methods known in the art. The expression is increased if the expression level of the gene in the sample is, for instance, at least about 1.5 times, 1.75 times, 2 times, 3 times, 4 times, 5 times, 6 times, 8 times, 9 times, time times, 10 times, 20 times or 30 times the expression level of the same gene in the control sample.
Suitable biological samples for use in accordance with the present invention include, but are not limited to, tissue samples (e.g. pancreatic samples and lymph node samples) and blood samples (e.g. whole blood, serum, plasma, fractionated or non-fractionated peripheral blood mononuclear cells (PBMCs) or any purified blood cell type). In essence, any biological sample which contains RNA, preferably mRNA or any other RNA species which represents IL-32 is a suitable sample for determining the expression of IL-32 at RNA level. In some embodiments, the sample to be analyzed is extracted total whole-blood RNA or, if desired, the sample may consist of isolated mRNA or any other RNA species representing IL-32. On the other hand, if the expression of IL-32 is to be determined at protein level, in essence any biological protein-containing sample is a suitable sample for the present purposes.
Accordingly, as used herein, the term “sample” also includes samples that have been manipulated or treated in any appropriate way after their procurement, including but not limited to centrifugation, filtration, precipitation, dialysis, chromatography, treatment with reagents, washing, or enriching for a certain component of the sample such as a cell population.
To determine whether the expression level of IL-32 differs from normal, the normal expression level of IL-32 present in a biological sample obtained from a relevant control has to be determined. Once the normal expression level is known, the determined IL-32 level can be compared therewith and the significance of the difference can be assessed using standard statistical methods. When there is a statistically significant increase in the determined IL-32 expression level as compared with the normal IL-32 expression level, there is an increased risk that the tested individual will develop T1D.
In some further embodiments, the expression level of IL-32 may be compared with one or more predetermined threshold values, including a positive control value indicative of the risk of developing T1D and/or a negative control value indicative of non-increased risk of developing T1D. Statistical methods for determining appropriate threshold or control values will be readily apparent to those of ordinary skill in the art. The negative threshold or control value may originate from a relevant control which may be a single individual not affected by T1D or be a value pooled from more than one such individual. Likewise, the positive threshold or control value may originate from a relevant control which may be a single individual affected by T1D or be a value pooled from more than one such individual. In some embodiments, age-dependent control values may be employed.
In some preferred embodiments, the control sample or the control value is case matched with the individual whose risk for T1D is to be predicted. Case-matching may be made, for instance, on the basis of one of more of the following criteria: age, date of birth, place of birth, gender, predisposition for T1D, HLA status and any relevant demographic parameter. In some embodiments, said control sample or value consists of a pool of, preferably case-matched, relevant control samples or values. In some embodiments, said control sample or control value has been predetermined prior to predicting a risk of T1D in an individual in accordance with the present disclosure. In some other embodiments, analyzing said control sample or determining said control value may be comprised as a method step in the present method.
Optionally, before to be compared with the control sample or the control value, the expression level of IL-32 is normalized using standard methods. For example, the expression level of an endogenous control gene having a stable expression in the sample type to be employed may be used for normalization. Those skilled in the art know which house-keeping genes to use for which sample types. In some embodiments, the house-keeping gene to be employed is GAPDH.
The expression level of IL-32 may be determined by a variety of techniques. In particular, the expression at nucleic acid level may be determined by measuring the quantity of RNA, preferably mRNA or any other RNA species representing IL-32, using methods well known in the art. Non-limiting examples of suitable methods include digital PCR and real time (RT) quantitative or semiquantitative PCR. Primers suitable for these methods may be easily designed by a skilled person.
Further suitable techniques for determining the expression level of IL32 at nucleic acid level include, but are not limited to, fluorescence-activated cell sorting (FACS) and in situ hybridization.
Other non-limiting ways of measuring the quantity of RNA, preferably mRNA or any other RNA species representing IL-32, include transcriptome approaches, in particular DNA microarrays. Generally, when it is the quantity of mRNA that is to be determined, test and control mRNA samples are reverse transcribed and labelled to generate cDNA probes. The probes are then hybridized to an array of complementary nucleic acids immobilized on a solid support. The array is configured such that the sequence and position of each member of the array is known. Hybridization of a labelled probe with a particular array member indicates that the sample from which the probe was derived expresses that gene. Non-limiting examples of commercially available microarray systems include Affymetrix GeneChip™ and Illumina BeadChip.
Furthermore, single cell RNA sequencing or cDNA sequencing, e.g. by Next Generation Sequencing (NGS) methods, may also be used for determining the expression level of IL-32.
If desired, the quantity of RNA, preferably mRNA any other RNA species representing IL-32, may also be determined or measured by conventional hybridization-based assays such as Northern blot analysis, as well as by mass cytometry.
Changes in the regulation of activity of the IL32 gene can be determined through epigenetic analysis, such as histone modification analysis, for example by chromatin immunoprecipitation followed by sequencing or quantitative PCR, or quantitation of DNA methylation levels, for example by bisulfite sequencing or capture based methods, at the intergenic regulatory sites or IL-32 gene region.
As is readily apparent to a skilled person, a variety of techniques may be employed for determining the expression level of IL-32 at protein level. Non-limiting examples of suitable methods include mass spectrometry-based quantitative proteomics techniques, such as isobaric Tags for Relative and Absolute Quantification reagents (iTRAQ) and label free analysis, as well as selected reaction monitoring (SRM) mass spectrometry and any other techniques of targeted proteomics. Also, the level or amount of a protein marker may be determined by e.g. an immunoassay (such as ELISA or LUMINEX®), Western blotting, spectrophotometry, an enzymatic assay, an ultraviolet assay, a kinetic assay, an electrochemical assay, a colorimetric assay, a turbidimetric assay, an atomic absorption assay, flow cytometry, mass cytometry, or any combination thereof. Further suitable analytical techniques include, but are not limited to, liquid chromatography such as high performance/pressure liquid chromatography (HPLC), gas chromatography, nuclear magnetic resonance spectrometry, related techniques and combinations and hybrids thereof, for example, a tandem liquid chromatography-mass spectrometry (LC-MS).
In contrast to earlier findings disclosed in WO 2008/112772, no differences in the expression levels of interleukin-1β (IL1B), early growth response gene 3 (EGR3) or prostaglandin-endoperoxide synthase 2 (PTGS2) between T1D progressors and non-progressors were detected, while MYC was clearly a weaker marker of T1D progression than the herein identified marker IL-32.
On the other hand, the present results showed that IL-32 is often coregulated with other genes. Accordingly, in some embodiments, the present method may further comprise determining expression levels of one or more genes co-regulated with IL-32, especially those disclosed in Table 2 below. An advantage associated with such embodiments is that combined analysis of IL-32 and one or more of its co-regulated genes increases the predictive power of the assay. Such combined analysis may also define a cell-subtype specific signature better than IL-32 alone. Moreover, some of the IL-32 co-regulated genes are cell surface receptors (e.g. CD52, TRBV4-1, BTN3A2, BTN3A1, AMICA1) which may facilitate easier identification of IL-32 expressing cells using methods such as FACS.
Non-limiting examples of combinations of IL-32 with its co-expressed genes for use in the present invention include the following:
Any of the embodiments or implementations described herein may involve concomitant, simultaneous or separate determination of the expression levels of said one or more co-regulated genes. Increased expression of said one or more co-regulated genes as compared with a relevant control are indicative of increased risk of or progression towards T1D. Said expression levels may be determined using any suitable technique available in the art, including those mentioned above for determining the expression level of IL-32. Furthermore, those skilled in the art know how to apply definitions such as “a relevant control” and “increased expression” disclosed in connection with IL-32 to said one or more co-regulated genes in an appropriate manner.
The present disclosure also relates to an in vitro kit for determining, predicting or monitoring an individual's risk of or progression towards T1D. The kit may be used in any implementation of the present method or its embodiments. At minimum, the kit comprises one or more testing agents or reagents which are capable of specifically detecting IL-32.
In some embodiments, the kit may comprise a pair of primers and/or a probe specific to IL-32. A skilled person can easily design suitable primers and/or probes taking into account specific requirements of a technique to be applied. The kit may further comprise means for detecting the hybridization of the probes with nucleotide molecules, such as mRNA or cDNA, representing IL-32 in a test sample and/or means for amplifying and/or detecting the nucleotide molecules representing IL-32 in the test sample by using the pairs of primers.
In some embodiments, the kit may also comprise one or more testing agents or reagents for specifically detecting one or more genes co-regulated with IL-32 in accordance with the disclosure above.
Other optional components in the kit include a compartmentalized carrier means, one or more buffers (e.g. block buffer, wash buffer, substrate buffer, etc.), other reagents, positive or negative control samples, etc.
The kit may also comprise a computer readable medium comprising computer-executable instructions for performing any method of the present disclosure.
It will be obvious to a person skilled in the art that, as technology advances, the inventive concept can be implemented in various ways. The invention and its embodiments are not limited to the examples described below but may vary within the scope of the claims.
The samples were collected as part of the DIABIMMUNE study from Finnish (n=10) and Estonian (n=4) participants (Table 1).
The HLA-DR-DQ genotypes related to type 1 diabetes risk were analyzed from a cord blood sample with a lanthanide-labeled oligonucleotide hybridization method, as previously described (Peet et al. 2014, Diabetes Res Rev. 28(5):455-461), and at-risk children were monitored and sampled at 3, 6, 12, 18, 24 and 36 months of age. The study protocols were approved by the ethical committees of the participating hospitals and the parents gave their written informed consent. Autoantibodies against insulin (IAA), glutamic acid decarboxylase (GADA), islet antigen-2 (IA-2A), and zinc transporter 8 (ZnT8A) were measured from serum with specific radiobinding assay (Knip et al. 2010 N Eng J Med. 363(20):1900-8). Islet cell antibodies (ICA) were analyzed with immunofluorescence in autoantibody-positive subjects. The cut-off values were based on the 99th percentile in non-diabetic children and were 2.80 relative units (RU) for IAA, 5.36 RU for GADA, 0.78 RU for IA-2A and 0.61 RU for ZnT8A. The detection limit in the ICA assay was 2.5 Juvenile Diabetes Foundation units (JDFU). The time when any of the above mentioned auto-antibodies was first determined positive (above cut-off, excluding cord-blood samples) was considered as the time of seroconversion.
8 ml of blood was drawn in sodium-heparine tubes (Vacutainer, 368480, BD) at each study visit. Plasma was first separated by centrifugation, and consecutively, PBMCs were isolated by Ficoll-Paque isogradient centrifugation (17-1440-03 GE Healthcare Life Sciences). After washes the PBMCs were suspended in RPMI 1640 medium (42401-018, Gibco, Life Technologies) supplemented with 10% DMSO (0231-500 ml, Thremo Scientific), 5% human AB serum (IPLA-SERAB-OTC, Innovative Research), 2 mM L-glutamine (G7513, Sigma-Aldrich), and 25 mM gentamicin (G-1397 Sigma-Aldrich). After overnight incubation in freezing container (BioCision) at −80° C., sample vials were stored in liquid nitrogen (−180° C.). For fractionation, PBMC samples were thawed, quantitated for number and viability, and magnetic antibody-coupled beads (11331D and 11333D Invitrogen) were used for sequential positive enrichment of CD4+ and CD8+ cells. A fraction of PBMCs and the CD4−CD8− flow through were also collected for downstream analysis. RNA was isolated from the samples with AllPrep kit (80224 Qiagen), and quality and quantity was determined using Qubit RNA assay (Q32852, Invitrogen) and Bioanalyzer 2100 (Agilent). At least 80 ng of total RNA was processed for transcriptome analysis with TruSeq Stranded mRNA Library Prep-kit (RS-122-2101, Illumina) according to manufacturer's instructions. The Next-Generation Sequencing was carried out with Illumina HiSeq2500 instrument using TruSeq v3 chemistry and paired-end 2×100 bp read length.
The raw RNA-Seq data were subjected to basic quality control checks using FastQC (version 0.10.0), after which they were aligned to the human reference transcriptome, Human GRCh37 assembly version 75, using Tophat (version 2.0.10). On an average, approximately 93% of the reads from each sample in each cell type were successfully mapped to the human transcriptome. Aligned reads with a mapping quality >10 were counted at a gene level with HTSeq package (htseq-count version 0.6.1), where each gene is considered as the union of all its exons and only those reads are retained that uniquely and completely aligns to a single gene. The read counts of genes were adjusted for the varying sequencing depths and were normalized using the trimmed means of M-values (TMM) method, implemented in the R software package edgeR. Subsequently, all the genes were divided into two categories: coding and non-coding genes. This was done using the biotype information for each gene retrieved from the Ensemble database and the description of biotypes was taken from Gencode [gencode—http://www.gencodegenes.org/gencode_biotypes.html, retrieved September 2015]. Each category of genes were filtered using different RPKM thresholds (RPKM>3 and RPKM>0.5 for coding and non-coding genes, respectively) to discard lowly expressed genes.
The differential expression analysis between Cases and Controls were conducted separately on coding and non-coding filtered genes, using edgeR. As post-filtering steps, only those genes were considered differentially expressed that had a median log FC>0.5, FDR<0.05, and had more than 65% samples across all individuals regulated in the same direction (i.e. up- or down-regulated). The above-mentioned stringent filtering steps were added to the pipeline of this study to ensure significant findings and discard false positives that may arise due to the heterogeneity of the samples (normal variation non-related to T1D). The differential analysis, along with the pre- and post-filtering steps, was performed by taking all samples over all above mentioned timepoints, and separately also using only those samples that were collected within 12 months before seroconversion.
In order to find the genes and autoantibodies (together referred to as ‘features’ in the remaining text) co-regulating/co-clustering with IL-32 in each cell-fraction, k-means clustering followed with Euclidean distance based co-clustering selection criteria, was performed on the expression levels of coding and non-coding differentially expressed genes (DEGs) as well as the autoantibodies. Due to the heterogeneity of the data and the disease, the clustering was done individually on each case and its matched control. Before clustering, the RPKM expression values of each gene and expression level of each autoantibody was log 2 transformed to ensure that values are approximatively normally distributed; and gene-wise standardized to make the features comparable. For each case-control pair, to find the optimum number of clusters, a silhouette score was calculated for each possible number of clusters from 2 to (total number of features−1). The silhouette score depicts how well each object lies within its cluster. For each possible number of clusters, the features were clustered using an unsupervised learning algorithm, called k-means clustering. Subsequently, using the resulting classification of features into clusters along with the Euclidean distance measures between the features, a silhouette score was calculated. Thus, the optimum number of clusters was chosen to be the one with the largest silhouette score. The features were then clustered into the “optimum number of clusters” using k-means clustering with 20 random sets of initialization values and sufficient iterations for convergence. Once clustered, the cluster containing the IL-32 was considered the IL-32-cluster with its co-regulated features. To summarize over the IL32-clusters from the 7 case-control pairs, a feature was considered to co-cluster with IL32 if its median Euclidean distance across all pairs was below 2.5. All features in this step clustered with IL-32 in at least one case-control pair.
50 ng of total RNA was treated with DNaseI Amplification Grade (Invitrogen) and subsequently cDNA was synthesized with Transcriptor First Strand cDNA Synthesis Kit (Roche). qPCR reactions were run using a custom TaqMan Gene Expression Assay reagent targeting IL-32 exon 6 (# AJ5IQA9, Thermo Scientific) in KAPA qPCR Master Mix with low ROX (Kapa biosystems) in duplicate and in two separate runs. The amplification was monitored with with QuantStudio 12K Flex Real-Time PCR System (95□ 10 minute enzyme activation, followed by 40 cycles of 95□ 0:15 minutes and 60□ 1 minute) and analyzed with QuantStudio Software on Thermo Cloud (Thermo Scientific). ΔCt values were calculated based on the expression of a housekeeping gene GAPDH in the sample, detected with GAPDH-specific probe dual-labelled with fluorophore 6-carboxyfluorescein (acronym FAM) and quencher tetramethylrhodamine (acronym TAMRA), as well as GAPDH-specific primers (5′-FAM-ACCAGGCGCCCAATACGACCAA-TAMRA-3′ (SEQ ID NO:1); primer1 3′-CCGGCTTTCTTCGCAGTAG-5′ (SEQ ID NO:2), primer2 5′-CACGGACGCCTGGAAGA-3′ (SEQ ID NO:3)).
When comparing Cases against their matched Controls in each cell fraction across the whole timeframe from 3 to 36 months, by using the FDR Z 0.05 and log FC>10.51 cut-off in at least 65% of the Case-Control comparisons, 51, 69, 143 and 85 genes were found to be differentially expressed in CD4+, CD8+, CD4− CD8− and PBMC fractions, respectively. Interestingly, upregulation of cytokine IL32 was observed throughout the analyzed cell fractions (
In the gene co-clustering analysis, Euclidian distance cutoff 2.5 was used to define the IL-32 co-regulated genes in the dataset. These genes are listed in Table 2.
TMEM14C
BTN3A2
TRBV4-1
LARS
UROS
AMICA1
WASH7P
BTN3A2
LARS
RSU1
BTN3A2
UROS
AMICA1
WASH7P
RSU1
BTN3A3
CARD8
CCDC167
LINC01184
TMEM14C
BTN3A2
TRBV4-1
LARS
AMICA1
RSU1
BTN3A3
CARD8
CCDC167
LINC01184
Number | Date | Country | Kind |
---|---|---|---|
20175864 | Sep 2017 | FI | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/FI2018/050696 | 9/27/2018 | WO | 00 |