Sudden Unexpected Death in Epilepsy (SUDEP) is said to occur when a person with epilepsy dies unexpectedly and was previously in a usual state of health. The death is not known to be related to an accident or seizure emergency such as status epilepticus. When an autopsy is done, no structural or toxicological cause of death can be found. Each year, more than 1 out of 1,000 people with epilepsy die from SUDEP. However, it occurs more frequently in people with epilepsy whose seizures are poorly controlled. One out of 150 people with poorly controlled epilepsy may die from SUDEP each year. SUDEP takes more lives annually in the United States than sudden infant death syndrome (SIDS). Most importantly, SUDEP is the leading cause of death in young people with certain types of uncontrolled epilepsy. The causes of SUDEP are not known. SUDEP occurs most often at night or during sleep and the death is not witnessed, leaving many questions unanswered. Currently no laboratory tests that could help identify patients at risk of SUDEP.
This disclosure is based on identification that patients who died of SUDEP had unique genetic signature compared to epilepsy patients who did not die of SUDEP. Specific mutations involved GABA/Glutamate receptor signaling pathway and cardiac arrhythmia genes were identified. Further, expression of several genes was found to be enhanced or reduced in the brains of patients who died of SUDEP as compared to epilepsy patients who did not die of SUDEP or normal individuals. Based on these observations, the present disclosure provides methods for predicting likelihood of epilepsy patients progressing to SUDEP. The method comprises identifying the presence of one or more specific mutations described herein, or determining if the expression of one or more genes disclosed herein is increased or decreased as compared to controls. An increase in the expression of certain genes, or the decrease in the expression of certain genes is predictive of a likelihood that the individual will progress to SUDEP. Based on such identification, the individual can be monitored and treated.
This disclosure provides identification of a unique genetic pattern in patients who died of SUDEP. Mutations involved GABA/Glutamate receptor signaling pathway or cardiac arrhythmia genes. These mutations were not present in age/sex matched controls of patients with epilepsy who are alive. Nor were they present in other public genomic databases such as 1) dbSNP, 2) 1000 genomes, 3) ESP6500 exome database or epilepsy SPECIFIC CarpeDB database. Based on the data provided herein, it is considered that these mutations are strongly associated with the SUDEP phenotype. Such mutations are termed herein as SUDEP specific mutations. The mutation spectrum provided in this disclosure is relatively specific for the SUDEP population and therefore, provides relevant biomarkers.
In one aspect, this disclosure provides an in vitro method for identifying, or aiding in identifying, predicting, or aiding in predicting, a human individual as being at risk of developing SUDEP comprising detecting in a test sample derived from the individual one or more SUDEP specific mutations in one or more marker genes. For example, the method comprises identifying, or aiding in identifying, predicting, or aiding in predicting an individual (such as an individual who is suffering from, or has been diagnosed with epilepsy) as being at risk of developing SUDEP comprising detecting in a test sample obtained from the individual one or more SUDEP specific mutations in one or more marker genes selected from the group consisting of: ITPR1, GABRR2, JUP, SSTR5, F2, KCNMB1, CNTNAP2, GRM8, GNAI2, TUBA3D, GRIK1, GRIK5 and DPP6. For example, mutations may be detected in the group of genes involved in GABA/Glutamate receptor signaling pathway and/or cardiac arrhythmia genes. For example, mutations may be present in one or more of ITPR1, GABRR2, SSTR5, CNTNAP2, GRM8, GNAI2, GRIK1, and GRIK5 (GABA/Glutamate receptor signaling pathway genes), and/or they may be present in JUP, F2, KCNMB1, TUBA3D, and DPP6 (cardiac arrhythmia genes). Mutations may be present in two or more of the GABA/Glutamate receptor signaling pathway genes, or two or more of the cardiac arrhythmia genes. In certain SUDEP patients, mutations were observed in SSTR5 and GRIK1, GRM8 and GNAI2, and TUBA3D, F2 and JUP. Some or all of the mutations shown in Tables 1 and 2 can be detected. For example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or 13 gene mutations shown in Tables 1 and 2 can be detected. The GABA/Glutamate receptor signaling pathway gene mutations can be tested separately from the gene mutations in the cardiac arrhythmia genes or they can all be tested together.
The test sample for testing can be obtained from an individual. Typically, the individual will have been diagnosed with epilepsy. The test samples may include body tissues (e.g., biopsies or resections) and fluids, such as blood, sputum, cerebrospinal fluid, and urine. The test samples may contain a single cell, a cell population (i.e. two or more cells) or a cell extract derived from a body tissue. The test samples are generally collected in a clinically acceptable manner, in a way that nucleic acids and/or proteins are preserved so that they can be detected. The test samples may be used in unpurified form or subjected to enrichment or purification step(s) prior to use, for example in order to isolate the DNA, RNA or the protein fraction in a given sample. Such techniques are known to those skilled in the art. (See, e.g., Sambrook, J., and Russel, D. W. (2001), Molecular cloning: A laboratory manual (3rd Ed.) Cold Spring Harbor, N.Y., Cold Spring Harbor Laboratory Press; Ausubel, F. M. et al. (2001) Current Protocols in Molecular Biology, Wiley & Sons, Hoboken, N.J., USA).
Suitable techniques for determining the presence or absence of the mutations include but are not limited to sequencing methodologies, hybridization of probes or primers directed to genomic DNA or cDNA, and/or by using various chip technologies, polynucleotide or oligonucleotide arrays, and combinations thereof. Thus, probes directed to polynucleotides comprising the mutations can be arranged and/or fixed on a solid support. For amplification or sequencing reactions, primers can be designed which hybridize to a segment of a polynucleotide comprising or proximal to the mutations and used to obtain nucleic acid amplification products (i.e., amplicons). Those skilled in the art will recognize how to design suitable primers and perform amplification and/or hybridization and/or sequencing reactions in order to carry out various embodiments of the method. The primers/probes can comprise modifications, such as being conjugated to one or more detectable labels.
In one embodiment, the method comprises determining in a test sample obtained from an individual, such as an individual who has been diagnosed with epilepsy, one or more of the mutations in Table 1.
The different GenBank accession numbers for a gene refer to splice variants. The single nucleotide polymporphisms (SNP) for a gene has the same location on the chromosome, but may manifest itself as different location on the mRNA due to splice variants as indicated by reference to the cDNAs in Table 1.
After variants chromosome and coordinates are obtained, the annotation for the gene, cDNA and amino acid change were obtained by running the coordinate file through the program called ANNOVAR. ANNOVAR is an efficient software tool to utilize up-to-date information to functionally annotate genetic variants detected from diverse genomes. Given a list of variants with chromosome, start position, end position, reference nucleotide and observed nucleotide, ANNOVAR can perform gene based annotation that can identify whether a variant cause protein coding changes in the genome through the amino acids that are affected.
A 200 nucleotide sequence containing each of the mutations is provided in the following SEQ IDs shown in Table 2 for each SUDEP gene mutation.
The mutation in each sequence ID in the following cDNA sequences is shown as bolded and underlined.
The specific mutations can be: for ITPR1 gene, a change of G to A at position corresponding to position no. 100 of SEQ ID NO: 1; for GABRR2 gene, a change of G to T at position corresponding to position no. 100 of SEQ ID NO: 2; for JUP, a change of A to G at position corresponding to position no. 100 of SEQ ID NO: 3; for SSTR5, a change of G to T at position corresponding to position no. 100 of SEQ ID NO: 4; for F2, a change of C to T at position corresponding to position no. 100 of SEQ ID NO: 5; for KCNMB1, a change of T to C at position corresponding to position no. 100 of SEQ ID NO: 6; for CNTNAP2, a change of G to A at position corresponding to position no. 100 of SEQ ID NO: 7; for GRM8, a change of A to G at position corresponding to position no. 100 of SEQ ID NO: 8; for GNA12, a change of C to T at position corresponding to position no. 100 of SEQ ID NO: 9; for TUBA3D, a change of A to G at position no. 100 of SEQ ID NO: 10; for GRIK1, a change of A to G at position corresponding to position no. 100 of SEQ ID NO: 11; for GRIK5, a change of T to A at position corresponding to position no. 100 of SEQ ID NO: 12; and/or for DPP6, a change of C to G at position corresponding to position no. 100 of SEQ ID NO: 13.
The mutation also includes a mutation in the complementary nucleotide in the opposite strand. Based on the mutations and the chromosomal locations, one skilled in the art can design appropriate primers for identifying their presence. The sequences are provided here for convenience, however, sequence information can be obtained by one skilled in the art from the chromosomal locations and other information provided herein.
In one aspect, this disclosure provides a method for identifying SUDEP specific mutations. A SUDEP specific mutation is defined as a mutation (such as an SNP) which is identified as present in chromosomal DNA of individuals who died from SUDEP, but is absent in the chromosomal DNA of age/sex matched individuals who have epilepsy, but who, without intervention, did not die from SUDEP. To identify SUDEP specific mutations, test samples may be obtained from individuals who died from SUDEP and compared to their matched controls to identify SUDEP specific mutations as further described in the example below.
For example, while certain SUDEP specific mutations are described in Table 1 and in SEQ IDs 1-13 above, other SUDEP mutations in these genes or in other genes may be identified.
In one aspect, this disclosure provides a method for predicting an individual to be at risk of developing SUDEP comprising contacting a DNA or RNA from a test sample from the individual with a gene chip, wherein the gene chip comprises one or more probes that can detect one or more mutations in the genes specified in Table 1. For example, the one or more probes may detect one or more SNPs listed in Table 1, or as shown in SEQ IDs 1-13. The DNA may be cDNA or may be RNA, or amplified from chromosomal DNA, or whole genome sequencing, or transcriptome sequencing.
In one aspect, this disclosure provides a panel of probes, said panel comprising probes which can detect one or more mutations provided in Table 1. The panel may be in the form of a chip. Further, DNA microarrays can be used comprising polynucleotide probes, wherein the probes are designed to discriminate mutations, such as SNPs that are associated with SUDEP as described herein. For any single or any combination of the markers set forth in this disclosure, a DNA array or any chip or bead format for testing a plurality of polynucleotides can be provided. Various reagents, devices and procedures which comprise polynucleotide arrays and are used for analyzing nucleic acid samples are known in the art, are commercially available and can be adapted for use with the present disclosure. For instance, devices and services sold under the trade names ILLUMINA and AFFYMETRIX can be adapted to test biological samples obtained or derived from individuals for any one or any combination of the markers discussed herein, given the benefit of this disclosure. The disclosure includes determining heterozygous and homozygous mutations.
Any of the DNA sequences presented herein and any combination of them can be used in a DNA array on a chip. “DNA array” and “chip” are not intended to be limited to any particular configuration, and include all devices, articles of manufacture and processes that are used for concurrent testing of a plurality of distinct nucleic acids to determine multiple distinct SNPs present in the distinct polynucleotides.
Genomics analysis can be carried out such as chromosomal analysis whole genome sequencing, partial genome sequencing, transcriptome analysis, copy number variation analysis, and single nucleotide polymorphism (SNP) analyses. Genomics analysis may be carried out with an assay such as, for example, fluorescent in situ hybridization (FISH), comparative genome hybridization (CGH), polymerase chain reaction (PCR), semi-quantitative real-time PCR, multiplex PCR, oligonucleotide or nucleotide arrays, antibody arrays, and chromatin immunoprecipitation. Genomic analyses and assays may be applied to both genomic DNA and genomic RNA.
In one embodiment, instead of detecting mutated polynucleotides, the method comprises detecting mutated proteins encoded by any of the genes described in Tables 1 and 2. The detection can be carried out by using any suitable technique or reagent, and will generally entail separating the protein from a biological sample and reacting the separated protein with at least one specific binding partner. Such binding partners can include but are not necessarily limited to antibodies, whether polyclonal or monoclonal, and antibody fragments that can specifically bind to the protein, such as Fab fragments, Fab′ fragments, F(ab′)2 fragments, Fd fragments, Fv fragments, and scFv fragments. Other specific binding partners can include aptamers, diabodies, or any other reagent that can specifically recognize the mutant protein. Detecting a complex of a specific binding partner and mutant protein can be performed using any suitable technique, including Western blotting, and other immunodetection methods, such as enzyme linked immunosorbant assay (ELISA), a lateral flow test, etc.
In one embodiment, any mutations in DNA that result in the amino acid changes disclosed here may be identified and such mutations can be used as being predictive of the risk of developing SUDEP.
Once individuals are identified as being at risk of developing SUDEP, they can be provided a focused approach to prevent SUDEP. Since some of genes identified in the present disclosure are associated with cardiac arrhythmia, individuals that are identified as containing one or more mutations in genes associated with cardiac arrhythmia, these patients can be followed up with cardiac evaluation to rule out cardiac arrhythmia and/or treat it if present. Furthermore, these patients could take additional precautions such as making sure that their seizures are under control (uncontrolled seizures are one of the risk factors of SUDEP), take medications regularly, visit healthcare team regularly especially if seizures are not controlled well, strongly avoid potential seizure triggers such as alcohol, recreational drugs. Patients would take extra effort to make sure family and coworkers know what to do for seizure first-aid, take extra precautions around water, including swimming and bathing.
Once specific mutations are identified in individuals, therapeutic approach could be tailored to fit best the patient's mutation status to prevent death from SUDEP. For example, a patient with a mutation in the GABA/Glutamate pathway may benefit from a GABA targeting therapy including: 1) GABA Receptor Agonists such as benzodiazepines, barbiturates, and other substances such as picrotoxins, bicuculline, and neurosteroids; 2) GABA reuptake inhibitors such as Nipecotic acid and tiagabine; 3) GABA Transaminase inhibitiors such as Vigabatrin, or Glutamate targeting therapy such as Glutamate blockers including felbamate, Topiramate, Perampanel. In patients with mutation in the KCNMB1 gene, which is a potassium calcium-activated channel subfamily M regulatory beta subunit 1 gene, an anti-epilepsy medication targeting potassium channels such as Ezogabine (Potiga), known as retigabine, may be administered for seizure control.
The mutations we have identified as predictive of a risk of developing SUDEP are specific. This is supported by our findings that SUDEP and Control patients (i.e., epilepsy patients who did not die of SUDEP) shared a large number (37) of somatic mutations, which were not unique to SUDEP phenotype (
The present disclosure provides a method for predicting an individual to be at risk of developing sudden unexpected death in epilepsy (SUDEP) comprising: a) obtaining a sample from the individual, said sample comprising cells (such as a blood sample); and b) sequencing nucleic acids from the sample to detect the presence or absence of one or more SUDEP specific mutations in one or more marker genes selected from the group consisting of: ITPR1, GABRR2, JUP, SSTR5, F2, KCNMB1, CNTNAP2, GRM8, GNAI2, TUBA3D, GRIK1, GRIK5 and DPP6. The SUDEP specific mutations are identified by their presence in the DNA from a population of individuals who had SUDEP, but absent in the DNA of from matched controls. The method of claim 1, wherein the mutation is detected at the DNA level. The specific mutation in the genes can be: a) for ITPR1, corresponding to nucleotide G at position 100 in SEQ ID NO:1 (such as change of G to A); b) for GABRR2, corresponding to nucleotide G at position 100 in SEQ ID NO: 2 (such as change of G to T); c) for JUP, corresponding to nucleotide A at position 100 in SEQ ID NO: 3 (such as change of A to G); d) for SSTR5, corresponding to nucleotide G at position 100 in SEQ ID NO: 4 (such as change of G to T); e) for F2, corresponding to nucleotide C at position 100 in SEQ ID NO: 5 (such as change of C to T); f) for KCNMB1, corresponding to nucleotide T at position 100 in SEQ ID NO: 6 (such as change of T to C); g) for CNTNAP2, corresponding to nucleotide G at position 100 in SEQ ID NO: 7 (such as change of G to A); h) for GRM8, corresponding to nucleotide A at position 100 in SEQ ID NO: 8 (such as change of A to G; i) for GNAI2, corresponding to nucleotide C at position 100 in SEQ ID NO: 9 (such as change of C to T); j) for TUBA3D, corresponding to nucleotide A at position 100 in SEQ ID NO: 10 (such as change of A to G); k) for GRIK1, corresponding to nucleotide A at position 100 in SEQ ID NO: 11 (such as change of A to G); l) for GRIK5, corresponding to nucleotide T at position 100 in SEQ ID NO: 12 (such as change of T to A); and m) for DPP6, corresponding to nucleotide C at position 100 in SEQ ID NO: 13 (such as change of C to G). The method may not comprise detecting mutations in the genes that were found to not be predictive, such as for example, the genes shown in
Our findings indicate that SUDEP patients can be divided into two groups, patients with mutations in Cardiac pathway genes (SUDEP Cardio) and GABA/Glutamate signaling (SUDEP Gaba/Glu) (
To further support the notion that mutations identified herein (
Based on the findings disclosed herein, this disclosure provides a method to obtain a signature predictive of the likelihood of progression to SUDEP. The signature can comprise two or more markers that are disclosed herein to be associated with SUDEP. The two or more markers may be from Group 1, Group 2, Group 3 or Group 4 or may be a combination of genes from these groups. The expression of any number of genes from each group, separately or simultaneously, may be determined. For example, the expression of from 1 to 47 genes in Group 1, from 1 to 45 in Group 2, from 1 to 40 in Group 3, and/or 1 to 41 in Group 4 can be determined.
These markers can be used to assess risk of an individual to develop SUDEP. The risk assessment can be done on a continuum. For example, if all markers from Group 1 or Group 3 are found to be highly expressed (such as, for example, assigned a score of +2), the individual may be considered at higher risk than an individual who shows enhanced expression of fewer than all the markers, or in whom the expression is not so highly enhanced. Similarly, if the expression of all the markers from Group 2 or Group 4 is reduced (such as, for example, assigned a score of −2), then the individual is considered at higher risk than an individual who shows reduced expression of fewer than all the genes or shows less decrease in expression.
The sample in which the determination is carried out can be any tissue in which these genes are expressed or any fluid where brain cell RNA is excreted into. For example, a convenient tissue is brain tissue or brain cells obtained floating from the cerebrospinal fluid during a routine spinal tap procedure. A biopsy of the brain tissue can be obtained during any surgical procedure carried out or can be obtained during a procedure intended to collect a biopsy specimen. Furthermore, RNA from brain tissue can also be extracted from exosomes circulating in the blood or in urine using established protocols (See e.g., Li et al., Philos Trans R Soc Lond B Biol Sci. 2014 Sep. 26; 369(1652): 20130502. doi: 10.1098/rstb.2013.0502 PMCID: PMC4142023). The expression level of at least one marker is determined in the sample. For example, the expression level of at least one marker from Group 1, Group 2, Group 3, or Group 4 can be determined.
Based on the findings provided in Example 2, the expression levels of one or more genes set forth in Group 1, 2, 3 and/or 4 can be determined and compared to a reference (also referred to herein as control). The reference levels may be the levels from epilepsy patients who were not afflicted with SUDEP. The expression of the genes can be used to generate a reference pattern, which can then be used to estimate the likelihood of progression to SUDEP.
The expression of more than one marker from a Group or from each Group can be determined. For example, the expression of at least two markers from a group or at least two groups can be determined. For example, a finding of an enhanced expression of at least one gene from Group 1, reduced expression from at least one gene in Group 2, enhanced expression of at least one gene in Group 3, and/or reduced expression of at least one gene in Group 4 can be predictive of likelihood of progression to SUDEP. The expression of more than one gene up to all the genes from each or all groups can be determined.
The markers provided in this disclosure show a sufficient difference in expression from SUDEP groups to controls to use them as classifiers for the likelihood of progression. Thus, comparison of an expression pattern from a signature to another expression pattern from another signature may indicate and inform a change in the expression of genes in the brain, and likelihood of progression to SUDEP. Additionally, or alternatively, changes in intensity of expression may be scored, either as increases or decreases. Any significant change can be used. Typical changes which are more than 1-fold or 2-fold are suitable for use.
Some methods provided in this disclosure relate to diagnostic or prognostic uses of information about expression levels. For example, expression patterns from signatures can be obtained. For example, the disclosure provides a method of determining an expression pattern, comprising collecting a suitable biological sample comprising cells (such as brain cells), determining the expression level of more than one marker in the sample, said marker being selected from gene in Tables 3 to 6, and obtaining an expression pattern for the signature. The expression pattern, as a whole, or for individual genes or groups of genes, can be compared to similar expression patterns generated from controls.
The groups of genes that can be used in the present methods are those whose expression is specifically associated with SUDEP. For example, genes involved with Glutamate/GABA signaling, and or genes involved with regulation of blood pressure and heart development can be used. Based on the disclosure provided herein, other genes may be identified whose expression is predictive of progression to SUDEP.
Expression of genes can be detected by techniques well known in the art. For example, mRNA can be detected from the cells and/or expression products such as peptides and proteins can be detected, or whole transcriptome analysis (RNA sequencing) can be carried out. Detection of mRNA can involve sample extraction, PCR amplification, nucleic acid fragmentation and labeling, extension reactions, and transcription reactions. Methods of isolating total RNA are well known to those of skill in the art. For example, total nucleic acid is isolated from a given sample using, for example, an acid guanidinium-phenol-chloroform extraction method and polyA selection for mRNA using oligo dT column chromatography or by using beads or magnetic beads with (dT)n groups attached (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd ed.), Vols. 1-3, Cold Spring Harbor Laboratory, (1989), or Current Protocols in Molecular Biology, F. Ausubel et al., ed. Greene Publishing and Wiley-Interscience, New York (1987)).
Microarray technology can be used to evaluate expression status of a plurality of genes. Sequence based techniques, like serial analysis of gene expression (SAGE, SuperSAGE) are also used for gene expression profiling. In an mRNA or gene expression profiling microarray, the expression levels of multiple genes can be simultaneously evaluated. For example, microarray-based gene expression profiling can be used to obtain gene signatures of individuals suspected of being as risk of SUDEP.
This disclosure also provides a SUDEP tool or kit, which can be used for determining the likelihood of individuals to progress to SUDEP. The tool can comprise one or more of reagents for performance of transcriptome analysis, charts providing patterns of expression of markers as identified here and instructions and/or guidance for interpretation of results. For example, the charts may be similar to
The invention is further described in the examples provided below, which intended to illustrate the invention and not intended to be restrictive.
Methods
Whole Exome Sequencing
DNA was isolated from 8 SUDEP and non-SUDEP (i.e. Control) patients' formalin fixed paraffin embedded brain tissue which was previously resected during the brain surgery for epilepsy management. 250 ng of DNA from each sample was sheared to an average of 150 bp in a Covaris instrument for 360 seconds (Duty cycle—10%; intensity—5; cycles/Burst—200). Barcoded libraries were prepared using the Kapa Low-Throughput Library Preparation Kit (Kapa Biosystems), amplified using the KAPA HiFi Library Amplification kit (Kapa Biosystems) (8 cycles) and quantified using Qubit Fluorimetric Quantitation (Invitrogen) and Agilent Bioanalyzer. An equimolar pool of the 4-barcoded libraries (300 ng each) was used as an input to capture the exome using one reaction tube of the Nimblegen SeqCap EZ Human Exome Library v3.0 (Roche, cat #06465684001), according to the manufacturer's protocol. The pooled capture library was quantified by Qubit (Invitrogen) and Bioanalyzer (Agilent) and sequenced on an Illumina HiSeq 2500 using a paired end, 100 nucleotides in length run mode, to achieve an average of 100× coverage.
Exome Bioinformatics (Variant Analysis)
Demultiplexed fastq reads were aligned to the hg19 genome build (GRCh37) using the Burrows-Wheeler Aligner (BWA) (Li et al., Bioinformatics 25, 1754-1760 (2009). Further indel realignment, base-quality score recalibration and duplicate-read removal were performed using the Genome Analysis Toolkit (GATK) v2.4-92. GATK Haplotype Caller (McKenna, A. et al. Genome Res. 20, 1297-1303 (2010) was used to generate single-nucleotide variation (SNV) and indel calls using standard, default parameters. SNV's found in the living epilepsy controls as well as germline variants found in the 1000 Genomes Project (1000 Genomes Project Consortium. Nature 467, 1061-1073 (2010), ESP5400 (National Heart, Lung, and Blood Institute (NHLBI) GO Exome Sequencing Project) and dbSNP132 (Sherry et al., Nucleic Acids Res. 29, 308-311 (2001) were excluded. Resulting putative mutations were annotated based on RefSeq (Release 55) using Annovar (Wang et al., Nucleic Acids Res. 38, e164 (2010)) and only the missense and nonsense mutations were retained. The mutated genes were queried for pathways using Ingenuity Pathway Analysis (IPA) tool, which identified 5 genes that were associated with Cardiac Arrhythmia (see below). Further, the mutations were examined for functional consequence using Ingenuity Variant Analysis (IVA) software. This analysis revealed 8 genes in GABA/Glutamate receptor signaling pathways (details below). All 13-candidate mutations were manually inspected via Integrative Genomics Viewer (IGV) v2.1 (Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24-26 (2011)).
The analysis is shown in
Genes/Mutations Associated with Cardiac Arrhythmia:
Genes/Mutations Associated with GABA/Glutamate Pathway:
Since there are currently no models that would allow us to reliably test functional cumulative effect of mutations we identified in our cohorts, we set to perform a whole transcriptome analysis to obtain an independent confirmation that brains of SUDEP patients are distinctly different than non-SUDEP epilepsy patients (Controls). We performed whole transcriptome analysis (RNA sequencing) of the same brain tissue samples on which we performed whole exome DNA sequencing to identify mutations.
We carried out experiments to determine whether mutations in SUDEP specific genes are associated with distinct changes in expression of the mutant gene and/or signaling family (Cardiac vs GABA/Glutamate signaling), whether SUDEP patients have unique gene expression signature that distinguishes them from Control patients; and if specific enrichment of gene groups that would be associated with SUDEP phenotype by performing Gene Set Enrichment Analysis (GSEA).
As shown in
We first separated SUDEP GABA/Glutamate (S Gl/Ga,
To identify functional effect of the SUDEP genotype, we performed GSEA. In SUDEP GABA/Glut we identified enrichment of genes associated with sugar metabolism, sugar binding and oxygen binding. Sugar is a critical brain metabolite and abnormal sugar metabolism, inability to bind could be detrimental during seizures when the need of sugar increases in brain cells. Similarly, oxygen is critical for brain metabolism and abnormalities in oxygen metabolism can be fatal during the seizures. We further identified enrichment of genes associated with regulation of blood pressure and heart development further strengthening the association between the epilepsy and heart function for risk of SUDEP. Lastly, we identified enrichment of genes associated with drug metabolism. One of the well-known risks of SUDEP is inability to control seizures by medication. Patients with hypermetabolism of antiepileptic drugs would likely have shorter lifespan of medication in their system and therefore higher risk of developing sudden, potentially fatal seizure event. The SUDEP Cardiac cohort was smaller (2 patients). However even in this cohort, we were able to identify the enrichment for genes associated with higher risk of diabetes, particularly type 1.
Methods:
Nucleic Acids (DNA, RNA) Extraction
DNA and RNA were extracted from the formalin fixed paraffin embedded surgical pathology brain tissue using automated Maxwell Promega system per manufacturer's protocols.
DNA Sequencing
Whole exome DNA sequencing was performed using SeqCap capture (NimbleGen) and 50 base-pair paired-end sequencing. Exome sequencing. 250 ng of DNA from each sample were sheared on a Covaris instrument for 360 seconds (Duty cycle—10%; intensity—5; cycles/Burst—200). Barcoded libraries were prepared using the Kapa Low-Throughput Library Preparation Kit Standard (Kapa Biosystems), amplified using the KAPA HiFi Library Amplification kit (Kapa Biosystems) (8 cycles) and quantified using Qubit Fluorimetric Quantitation (Invitrogen) and Agilent Bioanalyzer. An equimolar pool of the 4 barcoded libraries (300 ng each) were used as input to capture the exome using one reaction tube of the Nimblegen SeqCap EZ Human Exome Library v3.0 (Roche, cat #06465684001), according to the manufacturer's protocol. The pooled capture library was quantified by Qubit (Invitrogen) and Bioanalyzer (Agilent) and sequenced on an Illumina Illumina HiSeq 2500 using a paired end, 100 nucleotides in length run mode, to achieve an average of 100× coverage.
DNA Sequencing Analysis
Realigned exomes were queried for SNPs using HaplotypeCaller (GATK). High frequency SNPs found in 1000 g, ESP6500 and dbSNP141 were filtered out. Resulting filtered putative mutations were annotated using ANNOVAR RefSeq hg19. Synonymous mutations were excluded. Mutations were grouped by genes and analyzed using MSigDB, IPA, Reactome and CarpeDB databases. Ingenuity™ pathway (IPA) and variant analysis (IVA; ingenuity.com) was performed to identify candidate mutations involved in cardiac and central nervous system function.
RNA Sequencing
Whole transcriptome analysis was performed. RNASeq libraries were prepared using the Clontech SMARTer Stranded Total RNA-Seq Kit library prep, with Ribozero Gold to remove rRNA, the recommended input ranging from 250 pg to 10 ng of total mammalian RNA, following the manufacturer's protocol. The libraries were pooled equimolarly, and loaded on high output Illumina HiSeq 2500 flow cells, using v4 reagents, as paired 50 nucleotide reads. Libraries were pooled and distributed uniformly across 3 lanes in order to generate 60-80 million reads per sample. Following this approach, we are able to prepare high quality libraries and perform sequencing. The alignment statistics were optimal with high concordant pair alignment rates and low multiple alignment rates.
RNA-Seq Data Analysis
Raw sequencing data were received in FASTQ format. Read mapping was performed using Tophat 2.0.9 against the hg19 human reference genome. The resulting BAM alignment files were processed using the HTSeq 0.6.1 python framework and respective hg19 GTF gene annotation, obtained from the UCSC database. Subsequently, the Bioconductor package DESeq2(3.2) was used to identify differentially expressed genes (DEG). This package provides statistics for determination of DEG using a model based on the negative binomial distribution. The resulting values were then adjusted using the Benjamini and Hochberg's method for controlling the false discovery rate (FDR). Genes with an adjusted p-value <0.05 were determined to be differentially expressed. Gene Set enrichment analysis was performed utilizing GSEA v.2.2.2.
This application claims priority to U.S. Provisional application No. 62/263,078, filed on Dec. 4, 2015, the disclosure of which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2016/064970 | 12/5/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62263078 | Dec 2015 | US |