The invention relates to a probabilistic method for inferring epileptogenicity of a brain region that is not observed as recruited or not observed as not recruited in a seizure activity of an epileptic patient brain.
Model inversion i.e., finding a set of model parameters that yields the best possible fit to the observed data is a challenging task in statistical inference. Bayesian frameworks offer powerful and principled methods for parameter inference and model prediction from experimental data with a broad range of applications. Within neuroimaging context, the Bayesian approaches have been widely used for inference of neuronal population’s intrinsic parameters and/or interactions between neuronal populations in a pre-specified neuronal network from neurophysiological data.
It is well-known that gradient-free sampling algorithms such as Metropolis-Hastings, Gibbs sampling and slice-sampling generally fail to explore the parameter space efficiently when applied to large-scale inverse problems, as often encountered in the application of whole-brain imaging for clinical diagnoses. In particular, the traditional Markov Chain Monte Carlo (MCMC) mix poorly in high-dimensional parameter spaces involving correlated variables. In contrast, gradient-based algorithms such as
Hamiltonian Monte Carlo (HMC), although computationally expensive, are far superior to gradient-free sampling algorithms in terms of the number of independent samples produced per unit computational time. This class of sampling algorithms provides efficient convergence and exploration of parameter space even in very high-dimensional spaces that may exhibit strong correlations. Nevertheless, the efficiency of gradient-based sampling methods such as HMC is highly sensitive to the user-specified algorithm parameters. More advanced MCMC sampling algorithms such as No-U-Turn Sampler (NUTS) a self-tuning variant of HMC solve these issues by adaptively tuning the algorithm parameters. It has been shown that these algorithms efficiently sample from high-dimensional target distributions that allow us to solve complex inverse problems conditioned on massive data set as the observation.
MCMC has the advantage of being non-parametric and asymptotically exact in the limit of long/infinite runs. Of the other alternatives, Variational Inference (VI) turns the Bayesian inference into an optimization problem, which typically results in much faster computation than MCMC methods. However, the classical derivation of VI requires a major model-specific work on defining a variational family appropriate to the probabilistic model, computing the corresponding objective function, computing gradients, and running a gradient-based optimization algorithm. Automatic Differentiation Variational Inference (ADVI) solves these problems automatically.
Probabilistic programming languages (PPLs) provide efficient implementation for automatic Bayesian inference on user-defined probabilistic models by featuring the next generation of MCMC sampling and VI algorithms such as NUTS and ADVI, respectively. With the help of PPLs, these algorithms take the advantage of automatic differentiation methods for the computation of derivatives in computer programs to avoid the random walk behaviour and sensitivity to correlated parameters. In particular, Stan and PyMC3 are high-level statistical modeling tools for Bayesian inference and probabilistic machine learning, which provide the advanced inference algorithms such as NUTS and ADVI, enriched with extensive and reliable diagnostics. Although PPLs allow for automatic inference, the performance of these algorithms can be sensitive to the form of parameterization. An appropriate form of reparameterization in the probabilistic models to improve the inference efficiency of system dynamics (governed by a set of nonlinear stochastic differential equations) remains a challenging problem.
On the other hand, due to the potential to improve medical treatment strategies, the personalized large-scale brain network modeling has gained popularity over the recent years. In the individualized whole-brain modeling approach, the patient-specific information such as anatomical connectivity obtained from non-invasive imaging techniques is combined with the mean-field models of local neuronal activity to simulate the individual’s spatiotemporal brain activity at the macroscopic scale. The Virtual Brain (TVB) is an open-access computational framework written in Python to reproduce and evaluate the personalized configurations of the brain by using individual subject data. This neuroinformatics platform integrates brain computational modeling and multimodal neuroimaging data to systematically simulate the individual’s spatiotemporal brain activity. However, there is currently no specific workflow for automatic model inversion and data fitting validation in preparation for TVB.
More recently, it was proposed a novel approach namely Virtual Epileptic Patient (VEP) to brain interventions based on personalized brain network models derived from non-invasive structural data of individual patients. The VEP model is a large-scale computational model of an individual brain that incorporates personal data such as the locations of seizure initiation, subject-specific brain connectivity, and MRI lesions to inform patient-specific clinical monitoring and improve surgical outcomes. It has been previously shown that the VEP model is able to realistically mimic the evolution of epileptic seizures in a patient with bitemporal epilepsy. However, the inverse problem of such large-scale brain network models is a challenging task due to the intrinsic non-linear dynamics of each brain network node as well as the related large number of model parameters and the observation as commonly encountered in brain-imaging setting.
Accordingly, a need exists for establishing a useful link between the most popular probabilistic programming tools (e.g., Stan/PyMC3) and the personalized brain network modeling (e.g., the VEP model), in order to systematically predict the location of seizure initiation in a virtual epileptic patient. The present invention allows to build, in particular, a Bayesian Virtual Epileptic Patient (BVEP) as a probabilistic framework designed to infer the hidden/unobserved dynamics of personalized large-scale brain model of epilepsy spread generated by TVB.
In accordance with a first aspect, the invention concerns a method for inferring the epileptogenicity of a brain region that is not observed as recruited or is not observed as not recruited, in a seizure activity of an epileptic patient brain, comprising the steps of:
Preferentially, – the probabilistic programming language is a Bayesian programming language, the probabilistic virtual epileptic patient brain model is Bayesian virtual epileptic patient (BVEP) brain model, and the epileptogenicity of the brain region that is not observed as recruited or not observed as not recruited, is inferred using Bayesian inference; – the structural data of the epileptic patient brain comprise non-invasive T1-weighted imaging data and/or diffusion MRI images data; – the model able to reproduce the epileptic seizure dynamic in the primate brain is a model which reproduces the dynamics of an onset, a progression and an offset seizure events, that comprises state variables coupling two oscillatory dynamical systems on three different timescales, a fastest timescale wherein state variables account for fast discharges during an ictal seizure state, an intermediate timescale wherein state variables represent slow spike-and-wave oscillation, and a slowest timescale wherein a state variable is responsible for the transition between interictal and ictal states, and wherein a degree of epileptogenicity of a region of the brain is represented through a value of an excitability parameter; – for obtaining the probabilistic virtual epileptic patient brain model, a spatial map of epileptogenicity of the patient brain is provided, said spatial map of epileptogenicity classifying brain regions of the patient brain into epileptogenic zones (EZ) which can trigger epileptic seizures autonomously, propagation zones (PZ) which do not trigger seizures autonomously but can be recruited during a seizure evolution, and healthy zones (HZ) that do not trigger seizures autonomously; – the probabilistic virtual epileptic patient brain model is generated according to a generative model based on the state-space representation of the virtual epileptic patient; – the state-space representation of the virtual epileptic patient is of the form
where x(t) ∈ ℝn is a n-dimensional vector of system’s states evolving over time, xt0 is an initial state vector at time t = 0, θ ∈ ℝp contains all the unknown parameters of the virtual epileptic patient model, u(t) stands for the external input, y(t) ∈ ℝm denotes the measured data subject to the measurement error v(t), f is a vector function that describes dynamical properties of the system and h represents a measurement function; – for obtaining of the probabilistic virtual epileptic patient (BVEP) model, the state-space representation of the virtual epileptic patient (VEP) model is incorporated in the probabilistic virtual epileptic patient (BVEP) model as state transition probabilities; – the state transition probabilities are such as:
where T denotes the transition probability from a state x(t) to x(t + dt); – the generative model is defined in terms of likelihood and prior on model parameters, whose product yields a joint density:
where prior distribution P(ϑ) includes our prior beliefs about the hidden variables and potential parameter values, while the conditional likelihood term P(γ|ϑ) represents the probability of obtaining an observation, with a given set of parameter values; – in order to infer the epileptogenicity of the brain region that is not observed as recruited or not observed as not recruited, in the seizure activity of the patient brain, sampling algorithms are implemented; – the sampling algorithm is a Markov chain Monte Carlo or a variational inference algorithm; and – the method is computer implemented.
Other features and aspects of the present invention will be apparent from the following description and the accompanying drawings, in which:
The invention relates to a method for inferring an epileptogenicity of a brain region that is not observed as recruited or not observed as not recruited in a seizure activity of an epileptic patient brain. This is a computerized probabilistic method for inferring the spatial map of epileptogenicity across different brain regions in a personalized epileptic brain patient that the seizures initiate in hypothetical areas and may propagate to candidate brain regions.
The method according to the invention comprises various steps that are computer implemented. A computer readable medium is encoded with computer readable instructions for performing the steps of the method according to the invention.
It comprises a step of providing a computerized model modelling various regions of a primate brain and connectivity between said regions. This brain is a virtual brain. It is a neuro-informatics platform for full brain network simulations using biologically realistic connectivity. This simulation environment enables the model-based inference of neurophysiological mechanisms across different brain scales that underlie the generation of macroscopic neuroimaging signals including functional Magnetic Resonance Imaging (fMRI), EEG and Magnetoencephalography (MEG). It allows the reproduction and evaluation of personalized configurations of the brain by using individual subject data.
It further comprises a step of providing said computerized model with a model able to reproduce an epileptic seizure dynamic in the primate brain, said model being a function of a parameter that is the epileptogenicity of a region of the brain.
Preferentially, the model able to reproduce the epileptic seizure dynamic in the primate brain is a model which reproduces the dynamics of an onset, a progression and an offset seizure events, that comprises state variables coupling two oscillatory dynamical systems on three different timescales, a fastest timescale wherein state variables account for fast discharges during an ictal seizure state, an intermediate timescale wherein state variables represent slow spike-and-wave oscillation, and a slowest timescale wherein a state variable is responsible for the transition between interictal and ictal states, and wherein a degree of epileptogenicity of a region of the brain is represented through a value of an excitability parameter.
Also, according to the invention comprises a step of providing structural data of the epileptic patient brain and personalizing the computerized model using said structural data in order to obtain a virtual epileptic patient (VEP) brain model. The structural data are, for example, images data of the patient brain acquired using magnetic resonance imaging (MRI), diffusion-weighted magnetic resonance imaging (DW-MRI), nuclear magnetic resonance imaging (NMRI), or magnetic resonance tomography (MRT). Preferentially, the structural data of the epileptic patient brain comprise non-invasive T1-weighted imaging data and/or diffusion MRI images data.
The method according to the invention further comprises a step of translating a state-space representation of the virtual epileptic patient (VEP) brain model into a probabilistic programming language (PPL) using probabilistic state transitions in order to obtain a probabilistic virtual epileptic patient brain model (BVEP).
Preferentially, the probabilistic programming language is a Bayesian programming language, the probabilistic virtual epileptic patient brain model is Bayesian virtual epileptic patient (BVEP) brain model, and the epileptogenicity of the brain region that is not observed, neither as recruited nor as not recruited, is inferred using Bayesian inference.
Preferentially, for obtaining the probabilistic virtual epileptic patient brain model, a spatial map of epileptogenicity of the patient brain is provided, said spatial map of epileptogenicity classifying brain regions of the patient brain into epileptogenic zones (EZ) which can trigger epileptic seizures autonomously, propagation zones (PZ) which do not trigger seizures autonomously but can be recruited during a seizure evolution, and healthy zones (HZ) that do not trigger seizures autonomously.
Preferentially, the probabilistic virtual epileptic patient brain model is generated according to a generative model based on the state-space representation of the virtual epileptic patient.
More preferentially, the state-space representation of the virtual epileptic patient is of the form
where x(t) ∈ ℝn is a n-dimensional vector of system’s states evolving over time, xt0 is an initial state vector at time t = 0, θ ∈ ℝp contains all the unknown parameters of the virtual epileptic patient model, u(t) stands for the external input, y(t) ∈ ℝm denotes the measured data subject to the measurement error v(t), f is a vector function that describes dynamical properties of the system and h represents a measurement function.
Preferentially, for obtaining of the probabilistic virtual epileptic patient (BVEP) model the state-space representation of the virtual epileptic patient (VEP) model is incorporated in the probabilistic virtual epileptic patient (BVEP) model as state transition probabilities.
More preferentially, the state transition probabilities are such as:
where T denotes the transition probability from a state x(t) to x(t + dt) .
More preferentially, the generative model is defined in terms of likelihood and prior on model parameters, whose product yields a joint density:
where prior distribution P(ϑ) includes our prior beliefs about the hidden variables and potential parameter values, while the conditional likelihood term P(γ|ϑ) represents the probability of obtaining an observation, with a given set of parameter values.
The method according to the invention further comprises the step of acquiring electro- or magneto-encephalographic data of the patient brain and fitting the probabilistic virtual epileptic patient brain model against said data in order to infer the excitability of said brain region that is not observed, neither as recruited nor as not recruited, in the seizure activity of the patient brain.
Preferentially, in order to infer the excitability of the brain region that is not observed as recruited or not observed as not recruited, in the seizure activity of the patient brain, sampling algorithms are implemented.
More preferentially, the sampling algorithm is a Markov chain Monte Carlo or a variational inference algorithm.
In an example, the method according to the invention is based on the personalized brain network modeling and Bayesian inference as schematically illustrated in
In what follows, it is shown step by step how to build the BVEP model for a particular patient in order to fit the constructed brain model against in-silico data and validate our inference. The accuracy and the reliability of the estimations are validated by several convergence diagnostics and posterior behaviour analysis.
For this study, two patients were selected: a 23 year-old female with drug-resistant occipital lobe epilepsy (patient 1), and a 24 year-old female with drug-resistant temporo-frontal lobe epilepsy (patient 2). The patients underwent standard clinical evaluation, details of which were described in a previous study (Proix et al., Individual brain structure and modelling predict seizure propagation. Brain 140, 641-654, 2017). The evaluation included non-invasive T1-weighted imaging (MPRAGE sequence, repetition time = 1900 ms, echo time = 2.19 ms, 1.0 × 1.0 × 1.0 mm, 208 slices) and diffusion MRI images (DTI-MR sequence, angular gradient set of 64 directions, repetition time = 10.7 s, echo time = 95 ms, 2.0 × 2.0 × 2.0 mm, 70 slices, b-weighting of 1000 s mm-2). The images were acquired on a Siemens Magnetom Verio™ 3T MR-scanner.
The structural connectome was built with a reconstruction pipeline using generally available neuroimaging software. The current version of the pipeline evolved from a previously described version (Proix et al., 2017). First, the command recon-all from Freesurfer™ package in version v6.0.0 was used to reconstruct and parcellate the brain anatomy from T1-weighted images. Then, the T1-weighted images were coregistered with the diffusion weighted images by the linear registration tool flirt™ from FSL package in version 6.0 using the correlation ratio cost function with 12 degrees of freedom.
The MRtrix™ package in version 0.3.15 was then used for the tractography. The fibre orientation distributions were estimated from DWI using spherical deconvolution by the dwi2fod tool with the response function estimated by the dwi2response tool using the Tournier algorithm. Next, the tckgen tool was used, employing the probabilistic tractography algorithm iFOD2 to generate 15 millions fiber tracts. Finally, the connectome matrix was built by the tck2connectome tool using the Desikan-Killiany parcellation generated by FreeSurfer in the previous step. The connectome was normalized so that the maximum value is equal to one.
Typically, to build a personalized brain network model, the brain regions are defined using a parcellation scheme and a set of mathematical equations is used to model the regional brain activity. Taking such a data-driven approach to incorporate the subject-specific brain’s anatomical information, the network edges are then represented by structural connectivity of the brain, which are obtained from non-invasive imaging data of individual patients. In the VEP model, the dynamics of brain network nodes are governed by the Epileptor equations (Jirsa et al., On the nature of seizure dynamics. Brain 137, 2210-2230, 2014) that are coupled through the structural connectivity matrix derived from diffusion-weighted MRI (dMRI) techniques (Jirsa et al., The virtual epileptic patient: Individualized whole-brain models of epilepsy spread, 2017) .
The Epileptor is a dynamical model of seizure evolution and is able to realistically reproduce the dynamics of onset, progression and onset of seizure-like events. The Epileptor comprises five state variables coupling two oscillatory dynamical systems on three different timescales: on the fastest timescale, variables x1 and y1 account for fast discharges during the ictal seizure states. On the intermediate timescale, variables x2 and y2 represent the slow spike-and-wave oscillations. On the slowest timescale, the permittivity state variable z is responsible for the transition between interictal and ictal states. In addition, the interictal and preictal spikes are generated via the term g(x1).
Following Jirsa et al. (2014), the dynamics of full Epileptor model is described by:
where
with τ0 = 2857, τ1 = 1, τ2 = 10, I1 = 3.1, I2 = 0.45, and γ = 0:01. The degree of epileptogenicity is represented through the value of excitability parameter η. If η > ηc, where ηc is the critical value of epileptogenicity, Epileptor shows seizure activity autonomously and is referred to as epileptogenic; otherwise Epileptor is in its (healthy) equilibrium state and does not trigger seizures autonomously.
Following Jirsa et al. (2017), the full VEP brain model equations (N-coupled Epileptors) read as follows:
where the network nodes are coupled by a linear approximation of permittivity coupling through
which includes a global scaling factor K, and the patient’s connectome Cij.
Under the assumption of timescale separation, Proix et al. (2014) have shown that the effect of second neuronal ensemble of Epileptor (i.e., the variables x2 and y2) is negligible by averaging on the coupled Epileptor equations, which yields the 2D reduction of VEP model as follows:
Depending on the value of excitability parameter η, the 2D Epileptor exhibits different stability regimes. For η < ηc, a trajectory in the phase plane is attracted to the single stable fixed point of the system on the left branch of the cubic x-nullcline. In this regime, the Epileptor is said to be healthy, meaning not triggering epileptic seizure without external input. As the value of η increases, the z-nullcline moves down and a saddle-node bifurcation occurs at η = ηc corresponding to a seizure onset. For η > ηc, the system exhibits an unstable fixed point allowing a seizure to happen (the Epileptor is said to be epileptogenic). In this example, we use the 2D reduction of VEP model is used for Bayesian inference of spatial map of epileptogenicity to reduce the computational cost associated with the model parameter estimation. The 2D reduction of Epileptor allows for faster inversion while enabling us to predict the envelope of fast discharges during the ictal seizure states (i.e., onset, propagation and offset of seizure patterns) (Proix et al., 2014; Jirsa et al., 2017).
In addition to the patient’s connectome which structurally constraints the individual variability, the dynamics of brain network model can be further constrained by hypothesis formulation about functional brain network component to produce more specific patterns of brain activity across individuals. In the case of epilepsy, clinical hypothesis on the location of epileptogenic zone or lesion allows refining the network pathology to better predict seizure initialization and propagation in individual patients.
In the BVEP brain model, each network node can trigger seizures depending on its connectivity and the excitability value. The parameter η controls the tissue excitability, and its spatial distribution is thus the target of parameter fitting. In this study, depending on the excitability value, the different brain regions are classified into three main types:
Based on the above dynamical properties, the spatial map of epileptogenicity across different brain regions comprises the excitability values of EZ (high value of excitability), PZ (smaller excitability values) and all other regions categorized as HZ (not epileptogenic). Note however, that an intermediate excitability value does not guarantee that the seizure recruits this area as part of the propagation zone, because the propagation is also determined by various other factors including connectivity and brain state dependence. In the BVEP brain model, the clinical hypotheses can be formulated as the prior knowledge on the spatial distribution of excitability parameters. In this study, assuming no clinical hypothesis on a particular brain area, the same prior distribution were assigned on the excitability parameter across all brain regions included in the analysis.
The key component in constructing a probabilistic brain network model within a Bayesian framework is the generative model. Given a set of observations, the generative model is a probabilistic description of the mechanisms by which observed data are generated through some hidden states and unknown parameters. Here, the generative model will therefore have a mathematical formulation guided by the dynamical model that describes the evolution of model’s state variables, given parameters, over time. This specification is necessary to construct the likelihood function. The full generative model is then completed by specifying prior beliefs about the possible values of the unknown parameters.
The BVEP brain model presented in this study is built upon two main steps. First, the VEP model equation that provides the basic form of the data generative process describing how the epileptic seizures are generated. Second, the hypothesis formulation on the spatial map of epileptogenicity in the brain as our prior knowledge. The later component informs the model using hypotheses about the spatial distribution of excitability parameter across different brain regions.
The generative model in the BVEP is formulated based on a system of nonlinear stochastic differential equations of the form (so-called state-space representation):
where x(t) ∈ ℝn is a n-dimensional vector of system’s states evolving over time, xt0 is an initial state vector at time t = 0, θ ∈ ℝp contains all the unknown parameters of the virtual epileptic patient model, u(t) stands for the external input, y(t) ∈ ℝm denotes the measured data subject to the measurement error v(t). The process (dynamical) noise and the measurement noise denoted by w(t) ~ N(0; σ2) and v(t) ~ N(0; σ′2), respectively, are assumed to follow a Gaussian distribution with mean zero and variance σ2 and σ′2, respectively. The coloured and non-Gaussian dynamical noise can be captured in the term w(t), whereas in the presence of multiplicative noise (i.e., the noise whose intensity depends upon the system’s state) or multiplicative feedback (the system’s state further influences the driving noise intensity), an additional term appears which can lead to qualitatively different solutions. Moreover, f(.) is a vector function that describes the dynamical properties of the system and h(.) represents a measurement function. In source localization problem, h(.) is known as the lead-field matrix. It is noted that the current work focuses on the potential brain sources of observed activity to avoid the inevitable inconsistency associated with mapping from source dipoles to the measurements at electrode contacts (i.e., h(.) is a linear function here).
Considering the 2D reduction of VEP model (cf. Eq. (3)), then x(t) = (x1,1, z1, x1,2, z2, ..., x1,N, zN) ∈ ℝn, with n = 2N, where N is equal to the number of brain regions. Accordingly, θ = (xt0,1, xt0,2, ...., xt0,N, η1, η2, ..., ηN, K, σ, σ′) € ℝp, where p = 3N + 3. Using the reconstruction pipeline to virtualize a patient as described in section 2.2, here N = 84.
The state-space representation (cf. Eq. (4)) defining the dynamics of hidden states x(t) is incorporated in the BVEP model as state transition probabilities:
where T denotes the transition probability from a state x(t) to x(t + dt). However, the above parameterization referred to as centered parameterization may exhibit a pathological geometry yielding biased estimations.
It has been previously shown that a careful choice of reparameterization increases the effective sample size and decreases the divergences, in particular for the regions of extreme curvature. To avoid pathological samples, and therefore, the biased estimations due to strong correlation between parameters in the centered form of parameterization, the advantage of location-scale transformation is taken to invert the nonlinear state-space equations, which allows to decorrelate the parameters representing state variables at successive time steps.
A non-centered reparameterization of the above distribution reads as follows:
In Example 3, it is shown that using the non-centered form of parameterization to infer the system dynamics dramatically improves the performance of sampling by avoiding biased estimations due to the strong correlation between parameter.
A generative model is characterized by the joint probability distribution of the model parameters and the observation P(y|ϑ) where Y denotes the observed variables, and ϑ includes the system’s hidden variables and the model parameters. Bayesian techniques infer the distribution of unknown parameters of the underlying data generating process, given only observed responses and prior beliefs about the underlying generative process. By product rule, the generative model can be defined in terms of likelihood and prior on the model parameters, whose product yields the joint density:
where prior distribution P(ϑ) includes our prior beliefs about the hidden variables and potential parameter values, while the conditional likelihood term P(y|ϑ) represents the probability of obtaining the observation, with a given set of parameter values. In Bayesian inference, we seek the posterior density P(ϑ|y), which is the conditional distribution of model parameters given the observation. Bayes’s Theorem expresses this posterior density in terms of likelihood and prior as follows:
where the denominator P(y) represents the probability of the data and it is known as evidence or marginal likelihood (in practice amounts to simply a normalization term).
To sample from posterior density P(ϑ|y), the performance of HMC is highly sensitive to the step size and the number of steps in leapfrog integrator for updating the position and momentum variables in Hamiltonian dynamic simulation. If the number of steps in the leapfrog integrator is chosen too small, then HMC exhibits an undesirable random walk behaviour similar to Metropolis-Hastings algorithm, and thus algorithm poorly explores the parameter space. If the number of leapfrog steps is chosen too large, the associated Hamiltonian trajectories may loop back to a neighbourhood of the initial state, and the algorithm wastes computation efforts. NUTS extends HMC with adaptive tuning of both the step size and the number of steps in leapfrog integration to sample efficiently from posterior distributions. In an alternative approach, ADVI posits a family of densities, automatically computes the gradients, and then finds the closest member (measured by Kullback-Leibler divergence) to the target distribution. In this study, we use NUTS, a self-tuning variant of HMC, as well as ADVI to approximate the posterior distribution of the model parameters (cf. Eq. (3)).
The prior on excitability parameter for all brain regions included in the analysis was assumed as a normal distribution with a mean of –2.5 and a standard deviation of 1.0, i.e., N(–2.5, 1.0). Moreover, we placed a weakly informative prior on the system initial conditions and the global coupling parameter K, as a normal distribution centered at the ground-truth with standard deviation of 1.0. The prior on the hyperparameters was considered as a generic weakly informative prior N(0, 1.0).
After fitting a Bayesian model, it is often necessary to measure the predictive accuracy of the inferred model. The information criteria and leave-one-out cross-validation(LOO) are two rigorous approaches to assess the model’s ability in prediction of new data. Taking the existing simulation draws from log-likelihood evaluated at the posterior of the parameter values, widely applicable information criterion (WAIC) and Pareto-smoothed importance sampling (PSIS) LOO allow for efficiently estimating predictive accuracy of a fitted Bayesian model within a negligible computational time relative to the cost of model fitting.
After running a MCMC sampling algorithm, it is necessary to carry out some statistical analysis in order to evaluate the convergence of MCMC samples. One simple way to assess the performance of MCMC algorithms based on posterior samples is to visualize how well the chain is mixing (i.e., MCMC sampler explores all the modes in the parameter space efficiently). This can be monitored in different ways including traceplot (evolution of parameter estimates from MCMC draws over the iterations), pair plots (to identify collinearity between variables), and autocorrelation plot (to measure the degree of correlation between draws of MCMC samples). A more quantitative way to assess the MCMC convergence to the stationary distribution is to estimate the potential scale reduction factor R̂, and effective sample size Neff based on the samples of posterior model probabilities. The R̂ diagnostic provides estimate of how much variance could be reduced by running chains longer. Each MCMC estimation has R̂ statistic associated with it, which is essentially the ratio of between-chain variance to within-chain variance. If R̂ is approximately less than 1.1, the MCMC convergence has been achieved (approaching to 1.0 in the case of infinite samples); otherwise, the chains need to be run longer. Moreover, the Neff statistic gives the number of independent samples represented in the chain. The larger the effective sample size, higher the precision of MCMC estimates. Note that these are necessary but not sufficient conditions for convergence of MCMC samples.
In addition to the general MCMC diagnostics mentioned above, the NUTS-specific diagnostics can be used to monitor the convergence of samples; the number of divergent leapfrog transitions (due to highly varying posterior curvature), the step size used by NUTS in its Hamiltonian simulation (if the step size is too small, the sampler becomes inefficient, whereas if the step size is too large, the Hamiltonian simulation diverges), and the depth of tree used by NUTS, which is related to the number of leapfrog steps taken during the Hamiltonian simulation.
Using synthetic data for fitting allows us to validate the inference as the ground-truth of the parameters being inferred are known. Therefore, standard error metrics can be used to measure the similarity between the inferred parameters and those used for data generation. The metrics used to validate our inference are confusion matrix, posterior shrinkage, and posterior z-score.
Confusion matrix is a metric to evaluate the accuracy of a classification. The element qi,j is equal to the number of observations known to be in class i but predicted to be in class j, with i, j ∈ {1, 2, ...Q}, where Q is the total number of classes. In the BVEP model, we defined three groups namely HZ, PZ, and EZ to classify brain regions, thus Q = 3.
Moreover, in order to quantify the accuracy of the inference, the posterior z-scores (denoted by z) were plotted against the posterior shrinkage (denoted by s), which are defined as:
where
indicate the variance (uncertainty) of the prior and the posterior, respectively. The posterior z-score quantifies how much the posterior distribution encompasses the ground-truth, while the posterior shrinkage quantifies how much the posterior distribution contracts from the initial prior distribution.
In order to validate the inference using BVEP, advantage is taken of simulation capabilities of The Virtual Brain (TVB) for generating synthetic data sets. TVB is an open-source neuroinformatics tool written in Python to simulate large-scale brain network models based on individual subject data. This platform has been extensively used to simulate common neuroimaging signals including functional MRI (fMRI), EEG, SEEG and MEG with a wide range of clinical applications from Alzheimer disease, chronic stroke to human focal epilepsy (Jirsa et al., 2017).
In this study, TVB is used to reconstruct the personalized brain network model. In order to validate the inference on spatial epileptogenicity, epileptic seizures were simulated for two patients: one simulation with the seizure spread to all brain nodes specified as PZ (patient 1), and another with the seizure spread to some of the brain nodes specified as PZ (patient 2). These data sets were generated using two different structural connectivity matrices and distinct spatial map of epileptogenicity.
The seizure activity of patient 1 was simulated by setting two regions as EZ, and three regions as PZ, where EZidx ∈ {7,35}, and PZidx ∈ {6,12,28}, with the excitability values ηez = –1.6, and ηpz = –2.4, respectively. All the other brain nodes were fixed as not epileptogenic i.e., HZ with ηhz = –3.6.
To simulate the seizure activity of patient 2, two brain regions were selected as EZ, and five regions as PZ, at the nodes EZidx ∈ {7, 24}, and PZidx ∈ {10,23, 27,28,35}, respectively. For the regions selected as EZ, the excitability value was set to ηez = –1.5. The excitability of PZ was set as ηpz = –2,4, and all the other regions were defined as HZ with ηhz = –3.4.
In both synthetic data sets, to simulate the VEP model as a system of stochastic differential equations, an Euler-Maruyama integration scheme was used with an integration step of 0.04. The additive white Gaussian noise was introduced in the state variable x(t) = (x1,i(t), y1,i(t), zi(t), x2,i(t), y2,i(t) with zero mean and variance (0.01, 0.01, 0.0, 0.0015, 0.0015). The initial conditions were selected in the interval (-2.0, 5.0) for each state variable.
Finally, to invert the BVEP for the simulated data sets, two popular open-source PPL tools were used for flexible probabilistic inference: Stan, and PyMC3. Stan language can be run in different interfaces, whereas PyMC3 provides several MCMC algorithms for model specification directly in native Python code. By specifying the model density functions in these tools, the gradients of functions are computed through automatic differentiation, a powerful technique for algorithmic computation of derivatives, to efficiently approximate the log-posterior density by NUTS and ADVI. The computation of independent MCMC chains can also be performed in parallel on separate processors. In this example, the Stan command line interface was used, whereas all the codes for simulations and posterior-based analysis were implemented in Python. The model simulation and parameter estimation were performed on a Linux machine with 3.0 GHz Intel Xeon processor and 32 GB of memory.
The result of workflow in the BVEP model to estimate the spatial map of epileptogenicity across different brain regions for patient 1 is illustrated in
The accuracy of estimated spatial map of epileptogenicity across different brain regions for patient 1 by BVEP implementation in Stan is presented in
In order to investigate whether the BVEP is a platform-independent framework, we also used PyMC3 to estimate the spatial map of epileptogenicity across different brain regions. For both patients analyzed, the same accuracy was obtained by inversion of Eq. 4 in Stan and PyMC3. These results indicate that the BVEP inversion in Stan and PyMC3 leads to similar estimation of spatial map of epileptogenicity across brain regions in both analyzed patients.
Furthermore, the NUTS-specific diagnostics were monitored to check whether the Markov chain has converged. The diagnostics plot shows that there are no divergent transitions in HMC indicating that the posterior density was explored efficiently. Also, none of the NUTS iterations reached maximum tree-depth (its value to run NUTS was specified 10.0 here) indicating that the optimal number of leapfrog steps needed for the Hamiltonian simulation was sufficiently lower than the maximum. Together, these diagnostics validate that the samples by NUTS has converged to the target distribution.
To illustrate the mechanisms underlying seizure initiation and propagation within the BVEP model, the phase-plane topology of the simulation (top row) versus the prediction (bottom row) characterizing the dynamics of the different brain node types in the BVEP model is presented in
To compare the BVEP inversion by NUTS and ADVI schemes,
Once the model parameters have been estimated, it is necessary to assess the convergence of MCMC samples. To verify the reliability of the inferred estimates, we monitored the potential scale reduction factor R̂ as it is the most reliable quantitative metric for MCMC convergence. In addition, the posterior samples from the joint posterior probability distribution were plotted to show the efficiency of the transformed non-centered parameterization in comparison to the centered form of parameterization.
Moreover, scatterplot of samples drawn from joint posterior probability distribution between the hyper–parameters σ and σ′ estimated by the mean-field ADVI is illustrated in
Finally, this invention presents a probabilistic framework namely the Bayesian Virtual Epileptic Patient (BVEP) to infer the spatial map of epileptogenicity for developing a personalized large-scale brain model of epilepsy spread (cf.
To demonstrate the potential functionality of the BVEP in prediction of seizure initiation and propagation, simple and complex seizure spread were simulated using different spatial maps of epileptogenicity (cf.
Understanding brain dynamics in epilepsy is critical for developing therapeutic approaches towards brain interventions to improve the surgical outcome. Using theory of nonlinear dynamic systems, the complete taxonomy of epileptic seizures with a thorough description of bifurcations that give rise to onset, offset and seizure evolution characteristics has been extensively investigated elsewhere (Jirsa et al., 2014). In parameter space description of Epileptor, the seizure onset and offset are described by saddle-node and homoclinic bifurcations. The emergent dynamic effects in the BVEP model crucially depend on the interplay between network node model (Epileptor), patient specific structural connectivity (from dMRI), and spatial maps of epileptogenicity (EZ, PZ, HZ). According to the dynamical properties of Epileptor model, the brain regions were classified into three main types: EZ (exhibiting unstable fixed point corresponding to the brain area responsible for the seizure initiation), PZ (close to saddle-node bifurcation corresponding to the candidate brain area responsible for the seizure propagation), and HZ (exhibiting stable fixed point corresponding to healthy brain area). This approach allows us to define the spatial map of epileptogenicity based on the excitability parameter value, which is the target of fitting.
It is important to note that an excitability value close to the critical value of epileptogenicity does not guarantee that the seizure originating from pathological brain areas (i.e., responsible for the seizure onset specified as EZ) propagates to such brain regions defined as PZ. By a detailed patient evaluation, it has been reported that the individual structural connectivity is essential for predicting seizure spatial propagation. However, it has been recently shown that purely structural information is not sufficient to predict the propagation and eventual stopping of the seizures. Rather, the abnormal activity in the recruited regions is a complex network effect which depends on the interplay between multiple factors including the brain region’s epileptogenicity (node dynamics), the individual structural connectivity (network structure) (Jirsa et al., 2017), and brain state dependence (network dynamics). Furthermore, there are nonlinearities and multiple propagation patterns that can be observed for the same excitability parameter sets due to the coupled nonlinear system dynamics (cf.,
In this study, the analysis of phase-plane trajectories of the observed system versus the prediction was carried out across different brain regions in order to gain a better understanding of the mechanisms underlying seizure initiation and propagation within the proposed approach (cf.
Both NUTS and ADVI schemes were used to infer the spatial map of epileptogenicity in a personalized whole-brain model of epilepsy spread. The results from both inference schemes led to similar estimation of spatial map of epileptogenicity across brain regions, except that the ADVI slightly underestimate the variances compared to the estimations by NUTS algorithm (cf.
Lastly, the efficiency of transformed non-centered parameterization was investigated. In agreement with previous studies showing that NUTS is sensitive to the parameterization, our results indicated that the non-centered form of parameterization to invert the nonlinear state-space equations yields an efficient parameter-space exploration, whereas the centered form of sampling demonstrates an inefficient exploration due to the high collinearity between model parameters (see
A novel approach to build personalized in-silico brain network models based on Bayesian inference within PPL tools such as Stan and PyMC3. Although several PPL libraries have been developed for Bayesian inference, only a few of them are built around efficient sampling algorithms such as NUTS that avoids the random walk behavior and sensitivity to correlated parameters. Both Stan and PyMC3 provide NUTS and ADVI with automatic differentiation to efficiently compute gradients without requiring user intervention. Stan is a generic and flexible software package that has interfaces for common data science languages, also providing extensive diagnostics for MCMC convergence. PyMC3 provides several MCMC algorithms by model specification directly in native Python code. Our implementation in both Stan and PyMC3 result in similar estimation of spatial map of epileptogenicity across brain regions indicating that BVEP is a platform-independent approach. However, a larger number of warm-up iterations were required in PyMC3 to arrive at the same posterior convergence achieved by our implementation in Stan. This is due to the differences in implementation of NUTS in Stan and PyMC3. Comparison of the implementations in Stan, PyMC3 and other alternative PPL packages is beyond the scope of this note.
This invention is the first personalized large-scale brain network modeling approach for inferring the spatial map of epileptogenicity (properties of nodes) based on patient-specific whole-brain anatomical information (i.e., network structure derived from dMRI). Dynamic Causal Modelling (DCM) is a well-established framework for analyzing neuroimaging modalities (such as fMRI, MEG, and EEG) by neural mass models where inferences can be made about the coupling among brain regions (effective connectivity) to infer how the changes in neuronal activity of brain regions are caused by activity in the other regions through the modulation in the latent coupling. Using DCM, focal seizure activity in electrocorticography (ECoG) data was recently studied to estimate the key synaptic parameters or coupling connections using observed signals in a human subject. In another study, Bayesian belief updating scheme for DCM has been used to estimate the synaptic drivers of cortical dynamics during a seizure from EEG/ECoG recordings with a little computational expense. Although DCM can be used to model and track the changes in excitatory{inhibitory balance at seizure onset/offset, these studies are based on single neural mass model (i.e., small number of cortical sources are modelled), and the non-linear ordinary differential equation representing the neural mass model is approximated by its linearization, with which only the seizure onset or offset can be modelled but not both. In this note, the Bayesian Virtual Epileptic Patient (BVEP) model can characterize whole-brain spatiotemporal nonlinear dynamics of seizure propagation. This approach allows describing the onset and offset of ictal states as well as the alternation between normal and ictal periods. The BVEP approach relies on the patient-specific structural data rather formulating the inverse problem purely in terms of unknown model parameters used in DCM. It is also worth mentioning that the dynamics of system was inferred with coupled fast and slow time-scales (cf., Eq. (3)), therefore, the variations in slow variable depend on the hidden states of fast activity while it is assumed that only the activity of fast variable is observed. In this study, the time-scale separation in Epileptor model enabled to capture reliably full evolutions of complex dynamics, ranging from pre-ictal to onset, ictal evolution and offset rather using time-varying parameters. Future extensions to the current work could examine explicitly the non-stationary dynamics of networks in order to investigate conditions for mechanism of seizure initiation whether the seizure onset is more likely to occur through a deterministic parameter changes as in a bifurcation or it is a jump phenomenon due to the noise-driven transition between bistable attractors. The Bayesian inversion in the current work is based on the auto-tuning algorithms such as NUTS and ADVI accomplished with fast automatic differentiation for the calculation of gradients. This allows to efficiently sample from complex and high-dimensional posterior distributions with correlated parameters compared to the traditional sampling algorithms. The inferences in the presented framework is also enriched with several MCMC convergence diagnostics to assess the reliability of the estimations.
Various noninvasive and invasive methods have been used to improve pre-surgical evaluation in identification of the EZ, and consequently to increases surgery success rates. Employing the BVEP model in clinical therapies and brain interventions will require quantification of the model outcomes in fitting empirical secondary functional signals of patients such as EEG, MEG, SEEG, and fMRI signals. In this framework, it is straightforward to incorporate further knowledge such as MRI lesions and clinical hypothesis on EZ from pre-surgical evaluation. Since the BVEP model can be considered as a generic approach towards large-scale brain modeling, it offers promising avenue for inference from clinically used non-invasive imaging signals (EEG, MEG, fMRI), and invasive measurements such as SEEG signals. The results indicate that the proposed approach according to this invention is able to successfully fit against the patient’s empirical SEEG data (not shown). Note that in the case of empirical SEEG recording, the source localization is an ill-posed problem due to the sparsity of lead-field matrix, which can affect the accuracy of the estimates. In principle, it is possible that the surgical strategies can be systematically tested using the BVEP model, however, the real clinical application remains to be investigated and validated in future work.
In conclusion, the invention establishes a link between the probabilistic modeling and personalized brain network modeling in order to systematically predict the location of seizure initiation in a virtual epileptic patient. It is demonstrated step by step, how the proposed framework allows one to infer the spatial map of epileptogenicity based on large-scale brain network models that are derived from noninvasive structural data of individual patients. The invention rests on advanced efficient sampling algorithms that provide accurate and reliable estimates validated by the posterior behavior analysis and convergence diagnostics. In summary, with the help of PPLs, the use of personalized brain network models offer a proper guidance for development of comprehensive clinical hypothesis testing and novel surgical intervention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/059511 | 4/3/2020 | WO |