Estimation of effective connectivity via data-driven neural modeling

Freestone, Dean R.; Karoly, Philippa J.; Nešić, Dragan; Aram, Parham; Cook, Mark J.; Grayden, David B.

doi:10.3389/fnins.2014.00383

ORIGINAL RESEARCH article

Front. Neurosci., 28 November 2014

Sec. Brain Imaging Methods

Volume 8 - 2014 | https://doi.org/10.3389/fnins.2014.00383

This article is part of the Research TopicFunctional brain mapping of epilepsy networks: methods and applicationsView all 24 articles

Estimation of effective connectivity via data-driven neural modeling

Dean R. Freestone^1,2^*^†

Philippa J. Karoly^1,2^†

Dragan Nešić²

Parham Aram³

Mark J. Cook¹

David B. Grayden^2,4

¹Department of Medicine, St. Vincent's Hospital Melbourne, The University of Melbourne, Fitzroy, VIC, Australia
²NeuroEngineering Laboratory, Department of Electrical and Electronic Engineering, The University of Melbourne, Parkville, VIC, Australia
³Department of Automatic Control and Systems Engineering, University of Sheffield, Sheffield, UK
⁴Centre for Neural Engineering, The University of Melbourne, Parkville, VIC, Australia

This research introduces a new method for functional brain imaging via a process of model inversion. By estimating parameters of a computational model, we are able to track effective connectivity and mean membrane potential dynamics that cannot be directly measured using electrophysiological measurements alone. The ability to track the hidden aspects of neurophysiology will have a profound impact on the way we understand and treat epilepsy. For example, under the assumption the model captures the key features of the cortical circuits of interest, the framework will provide insights into seizure initiation and termination on a patient-specific basis. It will enable investigation into the effect a particular drug has on specific neural populations and connectivity structures using minimally invasive measurements. The method is based on approximating brain networks using an interconnected neural population model. The neural population model is based on a neural mass model that describes the functional activity of the brain, capturing the mesoscopic biophysics and anatomical structure. The model is made subject-specific by estimating the strength of intra-cortical connections within a region and inter-cortical connections between regions using a novel Kalman filtering method. We demonstrate through simulation how the framework can be used to track the mechanisms involved in seizure initiation and termination.

1. Introduction

This paper presents a model-based framework for imaging neural dynamics from electrophysiological data. This paper builds on a rich history of research in computational neuroscience that has been increasingly focused on the development of generative models to understand the link between neural activity and neuroimaging data (David et al., 2004; Coombes and Terry, 2012; Moran et al., 2013), with emphasis on two main areas. The first area of focus is forward modeling, or the mapping of relevant neuronal variables to recorded data that facilitates the development of theoretical predictions. The second area of focus is inverse modeling, which is the prediction of states, parameters and neuronal outputs given measured data (David, 2007). The new research presented in this manuscript provides a framework that contributes to solving the inversion problem. A key contribution of this paper is the development of an estimation scheme that is applicable to many alternate neural architectures that can be described by a core set of equations, which encapsulates our knowledge of the biophysics of large-scale neural systems.

Large-scale neural models can combine information from multiple neuroimaging modalities (fMRI, EEG, MEG, etc.), allowing a systems approach for data analysis. The behavior of such models is described by system states, whose dynamics are set by parameters, which are static variables. The systems approach of conducting analyses allows one to study all interactions as a whole. This has advantages over correlation-based science, where correlations do not necessarily reveal causation in large-scale systems. A systems approach provides a unified picture of both local properties and remote interactions, and is considered critical to form an understanding of many of the brain's activities (Freeman, 1975; Deco et al., 2008) including seizure generation (Wendling et al., 2000; Breakspear et al., 2006), which is the focus of this study. In the context of this study, the local properties are described by the connectivity strengths between neural subtypes within the circuitry of a functional processing unit (cortical area or cortical column) and the remote interactions are the functional changes that occur between cortical areas.

The definition of cortical connectivity is multi-faceted and is informed by structural, functional and, more recently, model-based experimentation and analysis (Friston, 1994; David et al., 2004). Despite being multi-faceted, it has been hypothesized that the key characteristics of connectivity within functional processing units in the neocortex can be represented at a high level by canonical neural circuits that are repeated throughout the neocortex (Douglas et al., 1989; Douglas and Martin, 2004; Haeusler et al., 2009). These canonical cortical circuits are able to adapt to the specific functional requirements of the brain through temporal and spatial fluctuations in their interrelationships (da Costa and Martin, 2010). The neural mass model (Jansen and Rit, 1995) that is used for inferring connectivity in this current study can be considered a simplified form of a canonical cortical circuit.

For biological systems, structure is usually a good starting point to study functional interactions (Crick and Koch, 2005). For the brain, this process usually starts with building a map of the anatomic pathways (Sporns, 2013; Van Essen et al., 2013). Often quite separately from the anatomical data, functional relationships are also analyzed through temporal correlations in neuroimaging data, which is recorded from spatially distinct regions of the brain. For example, PET, fMRI, and EEG data have all been used to infer connectivity within and between regions of cortex using a variety of quantitative measures (Biswal et al., 1995; Horwitz et al., 1995; Bokde et al., 2001; Horwitz, 2003). A major challenge lies in consolidating the anatomical data and the functional data to form a unified causative model. This challenge is addressed by the framework presented in this paper.

This paper is concerned with the investigation of effective connectivity through causal modeling. In the context of this paper, effective connectivity is defined as the influence one neural area has on another (Friston, 1994). It is anticipated that the use of causal models, which encapsulate our knowledge of the anatomical connectivity and biophysics of neural populations in conjunction with experimental measurements, will provide a more complete picture of how neural connectivity mediates function. The generation of patient-specific models will also be beneficial in a clinical context, providing greater insight into the cause and progression of neurological disorders, such as epilepsy, and enabling treatment regimes to be investigated through computer simulations.

Analysis of mesoscopic neural dynamics through the use of mean-field models has been validated through several alternative approaches. For example, the so-called neural mass model (Wilson and Cowan, 1972; Da Silva et al., 1974; Freeman, 1987) has been able to describe a large range of neural dynamics such as alpha rhythms (Jansen and Rit, 1995), MEG/EEG oscillations (David and Friston, 2003) and epileptic activity (Wendling et al., 2002). Neural mass models can also be easily extended to define additional population types and larger cortical regions (Babajani-Feremi and Soltanian-Zadeh, 2010; Cui et al., 2011; Goodfellow et al., 2011). The aforementioned results motivate the use of the neural mass model as the basis of a canonical cortical circuit. Furthermore, neural mass models offer a reasonable trade-off between biological realism and parsimony, allowing for practical implementation and subsequent inversion. Inversion is the key to using recorded data to estimate the neural states (membrane dynamics of various neural population subtypes) and parameters (defining connectivity strengths). Estimation of system variables provides new information about underlying population dynamics and physiological properties that cannot be directly measured using other neuroimaging methods (without destroying the tissue). For instance, the connectivity strength between neural population subtypes (i.e., pyramidal, spiny stellate and inhibitory interneurons) have been implicated in seizure generation and have also been found to be patient-specific (Wendling et al., 2000; Breakspear et al., 2006; Blenkinsop et al., 2012).

It has previously been demonstrated that a model-based neurophysiological framework can be used to image parameters associated with seizure onset, evolution and termination in an individual epilepsy patient using ECoG data (Freestone et al., 2013). The framework presented in this manuscript builds on this with improvements to the estimation algorithm and an expansion to include multiple brain regions. Numerous other formulations exist for fitting spatially extended mesoscopic neural models to data. For instance, dynamic causal modeling (DCM) is a technique that is often applied to investigate connectivity of neural areas using generative models (Friston et al., 2003; Kiebel et al., 2006). DCM applies Bayesian inference to determine the most probable configuration of model parameters (i.e., neural coupling coefficients) given a window of recorded data. Therefore, the resulting model is contextualized by the experimentally applied stimuli or conditions under which data was generated (Daunizeau et al., 2011). Another approach has been to apply genetic algorithms to search the parameter space of the model for a structure that is optimal for generating the observed data (Wendling et al., 2005; Nevado-Holgado et al., 2012). In relation to the current work, the aforementioned methods of model optimization can be used to initialize the inversion technique outlined in this paper.

The inversion method outlined in this paper is based on the Kalman filter (Kalman, 1960). The model dynamics are assumed to adhere to a Markov process and estimation quantities (states and parameters) are approximated as random variables with Gaussian distributions. For every electrocorticography (ECoG) measurement, the multivariate state and parameter distribution is propagated through the neural population model; then Bayes rule is used to determine the posterior probability distribution of parameters given measured data. In the case of a linear model, this method is known as the augmented Kalman filter, which provides the optimal (minimizing the variance of the estimation errors) unbiased estimate for states and parameters. Various versions of the Kalman filter equations for nonlinear models have been previously applied for model inversion (Voss et al., 2004; Schiff and Sauer, 2008; Deng et al., 2009; Freestone et al., 2011; Aram et al., 2013; Liu and Gao, 2013). However, these studies were based on either simplified field equations or a single region population model. A key advantage of the Kalman filter-based estimation algorithm outlined in this paper over other expectation maximization or genetic algorithm type schemes is the ability to track states and parameters in real time. Tracking in real time provides a greater level of temporal accuracy in the detection of transitions that underly specific neural activity (such as seizure generation). Furthermore, this paper demonstrates a flexible predictive framework that can be readily adapted to alternative forms of the neural population model (that are based on the same fundamental building blocks) in order to reflect our most current understanding of the architecture of the brain.

The organization of this paper is as follows. The first section outlines the formulation of the computational model of multiple cortical regions and the algorithm for tailoring the model to subject-specific data. Next, example simulations and results are provided that validate the framework for both single and multiple cortical areas. We then provide an example specific to studying epilepsy, where we show how the framework can be used to identify a seizure onset site and the mechanism for seizure initiation and termination. The final section discusses the benefits of this approach in a wider context of understanding seizures and developing much needed new therapies as well as the current limitations of the proposed framework and directions for further work.

2. Materials and Methods

This section discusses the core biophysics of the mass action of the cortical regions that are incorporated into our mathematical model along with the algorithm for tailoring the model to subject-specific data. Together, the mathematical model and the estimation algorithm form a lens that focuses on the parameters that govern connectivity and function of neural networks.

2.1. Neural Population Model

The neural population model that is used for the framework is based on the neural mass model. This type of neural model describes the dynamics of the mean membrane potential of a population of a specific neuron subtype given firing rate inputs. Populations of this type with varied parameters can be connected together to form local networks to describe the dynamics of specific cortical regions, such as a cortical column. Multiple cortical regions can then be interconnected to form a large-scale network model. Within this section, the building blocks of all neural populations of our large-scale network model are presented that describe the action of the synaptic connections (mean firing rate to mean membrane potential) and the action of the somas (mean membrane to firing rate). The notation used to derive the neural population model in the following section is summarized in Table 1.

TABLE 1

Table 1. Notation for the neural population model.

2.1.1. Single population model

To derive a population model, we begin by defining the mean membrane potential of a neural population, v_n, as the sum of contributing mean post-synaptic potentials, v_mn, where the post-synaptic and pre-synaptic neural populations are indexed by n and m, respectively,

\begin{matrix} v_{n} = \sum_{m = 1}^{M} v_{m n} . & (1) \end{matrix}

Each post-synaptic potential arises from the convolution of the input firing rate, ϕ_m(t), with the post-synaptic response kernel

\begin{matrix} v_{m n} (t) = α_{m n} \int_{- \infty}^{t} h_{m n} (t - t^{'}) ϕ_{m} (t^{'}) d t^{'}, & (2) \end{matrix}

where α_mn is a lumped connectivity parameter that incorporates the average synaptic gain, the number of connections and the average maximum firing rate of the presynaptic populations. All lumped connectivity parameters are assumed to be unknown, so must be inferred from data. The post-synaptic response kernels denoted by h_mn(t) describe the profile of the post-synaptic membrane potential of population n that is induced by an infinitesimally short pulse from the inputs (like an action potential). The post-synaptic response kernels are parameterized by the time constant τ_mn and are given by

\begin{matrix} h_{m n} (t) = η (t) \frac{t}{τ_{m n}} \exp (- \frac{t}{τ_{m n}}), & (3) \end{matrix}

where η(t) is the Heaviside step function. Typically, α_mn and τ_mn are assumed to be constants (particularly for current-based synapses) that define the presynaptic population type. For example, GABAergic inhibitory interneurons typically induce a higher amplitude post-synaptic potential with a longer time constant than glutamatergic excitatory cells. For the model that we are considering, the index n (post-synaptic) may represent either pyramidal (p), excitatory interneuron (spiny stellate) (e) or inhibitory interneuron (i) populations.

The inputs to the population, ϕ_mn, may come from external regions, u, or from other populations within the model, g_mn(v_m), where

\begin{matrix} ϕ_{m} = {\begin{array}{l} u_{m} & if m indexes external inputs \\ g (v_{m}) & if m indexes internal inputs \end{array} . & (4) \end{matrix}

The various populations within the model are linked via the activation function, g(·), that describes a mean firing rate as a function of the pre-synaptic population's mean membrane potential. The activation function exploits a sigmoidal relationship (limited firing rate due to refractory period of the neurons) between the mean membrane potential and firing rate of each of the populations. This sigmoidal nonlinearity may take different forms, but for this study the error function form is used where

\begin{matrix} g (v_{m}) = \frac{1}{\sqrt{2 π} ς} \int_{- \infty}^{v_{m}} \exp (- \frac{{(z - v_{0})}^{2}}{2 ς^{2}}) d z & (5) \end{matrix}

\begin{matrix} = \frac{1}{2} (erf (\frac{v_{m} - v_{0}}{\sqrt{2} ς}) + 1) . & (6) \end{matrix}

The quantity ς describes the slope of the sigmoid or, equivalently, the variance of firing thresholds of the presynaptic population (assuming a Gaussian distribution of firing thresholds). The mean firing threshold relative to the mean resting membrane potential is denoted by v₀(v₀ = v_thresh + v_rest). The resting membrane potential is not usually explicitly defined for forward models of this type. However, for inverse models, it is important to understand how the resting membrane potential is included in the equations. The parameters of the sigmoidal activation functions, ς and v₀, are usually assumed to be constants.

The convolution in Equation 2 can conveniently be written as two coupled, first-order ordinary differential equations, which is a second-order state-space model. This gives the system

\begin{matrix} \begin{array}{l} \frac{d v_{m n}}{d t} = z_{m n} \\ \frac{d z_{m n}}{d t} = \frac{α_{m n}}{τ_{m n}} ϕ_{m n} - \frac{2}{τ_{m n}} z_{m n} - \frac{1}{τ_{m n}^{2}} v_{m n} . \end{array} & (7) \end{matrix}

In summary, this single neural population model maps from a mean pre-synaptic firing rate to a post-synaptic potential. The terms that are usually considered parameters of the model are the synaptic time constants, τ, the connectivity constants, α, the mean firing thresholds, v₀, and firing threshold variances, ς. These parameters can be set to describe connections between specific neural populations, such as pyramidal neurons, spiny stellate cells and fast and slow inhibitory interneurons.

2.1.2. Multiple populations for a cortical region

Multiple populations in the form of Equation 7 can be configured and interconnected to represent the circuitry of a cortical region, such as a cortical column. Each synaptic connection in the model is described by the set of coupled first-order ODEs of Equation 7; however, the parameters are connection-specific. Models exist in the literature describing from two to five different neural types with two to thirteen synaptic connections (4th to 26th order) (Da Silva et al., 1974; Wang and Knösche, 2013). Contributions in this regard have been made by David and Friston (2003); Wendling et al. (2002); Jansen and Rit (1995) and others. An illustration of the model of a cortical region used in this study is shown in Figure 1.

FIGURE 1

Figure 1. Population model of a cortical region. The left hand side shows a cross section of the cortical laminar, highlighting the stratification and different population the various layers. A graphical representation of the population model is presented on the right hand side, showing three interconnected neural populations, which are inhibitory interneurons (supragranular layers), excitatory spiny stellate cells (granular layer), and pyramidal neurons (infraganualar layers). The specific subtype of neural population is defined by the parameters that describe the post-synaptic response kernels. The intra-region connectivity are denoted by α_mn, where the subscript denotes a connection from population m to n. An example of the post-synaptic potentials that are generated at each connection are also shown.

The parameters of the neural populations not only define the population type, but also the behavior the model of the cortical region exhibits. For example, for a certain parameter combination, we obtain a model of a cortical region that will generate alpha-wave type activity; for another set of parameters, we obtain a different model that will exhibit epileptic behavior. The parameters used in this study have been determined previously for similar models (Jansen and Rit, 1995) and are shown in Table 2. The parameters to be estimated are the synaptic gain terms, α_mn.

TABLE 2

Table 2. Fixed parameter values for the neural population model that are not estimated.

2.1.3. Multiple region model

Coupling of cortical region j to region k is achieved by connecting the output firing rate of the pyramidal population in region j to the input of the pyramidal population in region k via a delay kernel. The delay kernel is of the same form as the post-synaptic response kernel of Equation 3, but maps a firing rate to a delayed firing rate. The inputs from the delayed firing rates are modeled for every pyramidal population using the same form of second-order model defined in Equation 7. All interconnections between regions were assumed to have the same delay kernel, which was parameterized by a time constant, τ_d (Wendling et al., 2000) (see Table 2). The delayed firing rates form standard inputs to the pyramidal cells in the adjoining cortical region and induce post-synaptic potentials via a convolution kernel as described by Equation 2. However, the connectivity parameter α_jk describes the interconnection gain between regions rather than between populations. In this study, we consider four interconnected cortical regions as shown in Figure 2. The values of the interconnection gains for forward simulations were tuned to achieve the desired behavior in the ECoG outputs, while avoiding saturation of neural populations. Different interconnection gains were used to either simulate data consistent with alpha rhythms or to achieve transition to seizure. Further details about the simulations and parameters used are given in Section 2.3.

FIGURE 2

Figure 2. Graphical representation of the four region population model with differential ECoG measurements. Each region is interconnected to its immediate neighbor. The inter-region connectivity strength is governed by the parameter α_jk, where j and k ∈ {1, 2, 3, 4} and j ≠ k. The differential montage provides a more realistic measurement model then what is typically used for model inversion.

2.1.4. Augmented discrete time state-space model

For notational convenience, the subscripts for the synaptic gains, denoted α_mn and α_jk, and the post-synaptic potentials, denoted by v_mn in the previous section, will now be numbered sequentially from 1 to N + K. N is the number of intra-regional connections and K is the number of inter-regional connections in the multi-area model.

The state vector is a concatenation of discrete time values of the post-synaptic membrane potentials, the derivatives of the potentials, the delayed firing rates (inter-region) and their derivatives by

x ≜ {[\begin{matrix} v_{1} & z_{1} & \dots & v_{N} & z_{N} & v_{ϕ, 1} & z_{ϕ, 1} & \dots & v_{ϕ, K} & z_{ϕ, K} \end{matrix}]}^{⊤},

where the large-scale model has N intra-region connections and K inter-region connections. The subscript ϕ indicates that the post-synaptic potential/derivative is associated with the delayed firing rate from a pyramidal population of a neighboring region.

The parameters to be estimated can also be concatenated into a vector by

θ ≜ {[\begin{matrix} α_{l, 1} & \dots & α_{l, N} & α_{d, 1} & \dots & α_{d, K} \end{matrix}]}^{⊤},

where l denotes local connections within regions (including from inputs, u), d denotes distant connections between regions. For a four-region model, assuming the number of connections within each region is equal, then the number of connections within each region is equal to N ÷ 4. In this formulation of the model the parameter vector is written in differential form, with trivial dynamics as

\begin{matrix} \dot{θ} = 0 . & (8) \end{matrix}

The differential form of the parameter vector facilitates augmenting the parameters to the state vector for estimation purposes.

The augmented state space vector is created by

\begin{matrix} ξ ≜ {[\begin{matrix} x & θ \end{matrix}]}^{⊤}, & (9) \end{matrix}

which has dimensionality ξ ∈ ℝ^n_ξ where n_ξ = 3(N + K). The augmented large-scale state space model is given by

\begin{matrix} \dot{ξ} = A ξ + B ξ ◦ g (C ξ) + D (u) ξ, & (10) \end{matrix}

where ◦ denotes element-wise multiplication. The matrices A, B, C, and D(u) are defined in Appendix 5.2. The large-scale model can be written in a compact form that is useful for deriving the estimation algorithm by

\begin{matrix} \dot{ξ} = F (ξ, u) . & (11) \end{matrix}

It is necessary to discretize the model for estimation purposes. The Euler method was used for discretizing the model and is presented in Appendix 5.1. For the Bayesian inference scheme, it is also necessary to model uncertainty in our model by an additive noise term. With the inclusion of the additive noise term, w_t, the discrete time augmented state space model is denoted by

\begin{matrix} ξ_{t + 1} = A_{δ} ξ_{t} + B_{δ} ξ_{t} ◦ g (C ξ_{t}) + D_{δ} (u_{t}) ξ_{t} + w_{t} & (12) \end{matrix}

and can be written in compact form by

\begin{matrix} ξ_{t + 1} = F_{δ} (ξ_{t}, u_{t}) + w_{t} . & (13) \end{matrix}

The model uncertainty is defined by a zero mean, temporally white Gaussian with known covariance matrix Q. In forward models, w_t is used as a driving term to simulate unknown input to the system from afferent connections or from other cortical regions. However, for model inversion purposes, this additional term also facilitates estimation and tracking of parameters via Kalman filtering or other Bayesian inference schemes. For the Kalman filter, the covariance of w_t quantifies the error in the predictions through the model. If we believed our model is accurate, then we would set all of the elements of Q to a small value. On the other hand, a high degree of model-to-brain mismatch can be quantified by setting the elements of Q to larger values.

2.1.5. Model of ECoG measurements

It is well accepted that the field potentials that are measured with ECoG are predominately generated by synaptic currents arising from inputs to the pyramidal neurons (Nunez and Srinivasan, 2006). In our model, these currents are linearly proportional to the mean membrane potential of the pyramidal population. Therefore, the ECoG signal is modeled as the mean membrane potential of the pyramidal population, which is the sum of the incoming post-synaptic membrane potentials.

For the multi-region neural population the ECoG measurement is taken to be the difference between neighboring regions. This provides a differential montage that is compatible with experimental data. Typically, the generators of ECoG signals are modeled by the individual mean membrane potentials of the pyramidal populations, effectively ignoring the differential nature of actual ECoG recordings. In this paper, we demonstrate that parameters can be accurately estimating when using the more realistic measurement model.

The measurement model that relates the ECoG measurements to the augmented state vector, ξ_t, is given by

\begin{matrix} y_{t} = H ξ_{t} + v_{t}, & (14) \end{matrix}

where v_t ~ yes (0, R) is a zero mean, spatially and temporally white Gaussian noise process with a standard deviation of 1 mV, that simulates measurement errors. For model inversion purposes, the variance of v_t quantifies the confidence we have in the measurements. The matrix H defines a summation of the membrane potentials (corresponding to pyramidal populations) that contribute to each ECoG channel along with the differential montaging scheme. The number of channels used in this case was equal to the number of regions (four), as seen in Figure 2.

2.2. A Kalman Filter for the Population Model

The aim of the Kalman filter is to estimate the most likely sequences of states, $\hat{ξ}$ ⁺_t, and the associated error covariances, $\hat{P}$ ⁺_t, given (uncertain) knowledge of the biophysics and anatomy of the brain regions of interest combined with the noisy ECoG measurements, y_t. The optimal state estimates can be formally stated using the expectations

\begin{matrix} {\hat{ξ}}_{t}^{+} = 𝔼 [ξ_{t} | y_{1}, y_{2}, \dots, y_{t}] & (15) \end{matrix}

\begin{matrix} {\hat{P}}_{t}^{+} = 𝔼 [(ξ_{t} - {\hat{ξ}}_{t}^{+}) {(ξ_{t} - {\hat{ξ}}_{t}^{+})}^{⊤}], & (16) \end{matrix}

which are known as the a posteriori state estimate and state estimate covariance, respectively. The a posteriori state estimate is computed by correcting the a priori state estimate, which is a prediction though our model and defined as

\begin{matrix} {\hat{ξ}}_{t}^{-} = 𝔼 [ξ_{t} | y_{1}, y_{2}, \dots, y_{t - 1}], & (17) \end{matrix}

using a weighted difference between a prediction of the observations and the actual noisy measurements. The a posteriori state estimate is calculated by updating the prediction using measured data by

The weighting to correct the a priori augmented state estimate, yes _t, is known as the Kalman gain (Kalman, 1960). The Kalman gain is calculated using the available information regarding the confidence in a prediction of the augmented states through the model and the observation model that includes noise by

where

\begin{matrix} {\hat{P}}_{t}^{-} = 𝔼 [(ξ_{t} - {\hat{ξ}}_{t}^{-}) {(ξ_{t} - {\hat{ξ}}_{t}^{-})}^{⊤}] & (20) \end{matrix}

is the a priori state estimate error covariance, R is the observation noise covariance, and H is the observation matrix. For a linear observation function, the a posteriori covariance is then updated by using the Kalman gain to provide the correction

Practically, the actual state is not known so the Kalman filter must be initialized with the best guess for $\hat{ξ}$ ⁺₀ and $\hat{P}$ ⁺₀, which provides the a posteriori state estimate and state estimate covariance for time t = 0. The a priori state estimate for time t = 1 can then be computed by propagating the initial guess through the model and taking the expectation,

\begin{matrix} {\hat{ξ}}_{t}^{-} = 𝔼 [F_{δ} ({\hat{ξ}}_{t - 1}^{+}, u_{t - 1})] & (22) \end{matrix}

\begin{matrix} = 𝔼 [A_{δ} {\hat{ξ}}_{t - 1}^{+} + B_{δ} {\hat{ξ}}_{t - 1}^{+} ◦ g (C {\hat{ξ}}_{t - 1}^{+}) + D_{δ} (u_{t - 1}) {\hat{ξ}}_{t - 1}^{+}] & (23) \end{matrix}

\begin{matrix} = A_{δ} {\hat{ξ}}_{t - 1}^{+} + 𝔼 [B_{δ} {\hat{ξ}}_{t - 1}^{+} ◦ g (C {\hat{ξ}}_{t - 1}^{+})] + D_{δ} (u_{t - 1}) {\hat{ξ}}_{t - 1}^{+} & (24) \end{matrix}

Generally, for nonlinear systems, the solution to this expectation is not known. Therefore, approximations are often used, such as the extended and unscented Kalman filters, respectively.

We approximate the expectation by

\begin{matrix} 𝔼 [B_{δ} {\hat{ξ}}_{t - 1}^{+} ◦ g (C {\hat{ξ}}_{t - 1}^{+})] \approx B_{δ} {\hat{ξ}}_{t - 1}^{+} ◦ 𝔼 [g (C {\hat{ξ}}_{t - 1}^{+})], & (25) \end{matrix}

where the accuracy of the approximation depends on the width of the distributions for the parameters, Bξ⁺_{t − 1}. Since we are assuming the parameters are unknown with the possibility of slow changes, a small amount of uncertainty is added. For known parameters, Equation 25 would be exact. Therefore, the accuracy of the approximation improves as parameter estimates converge toward their actual values.

In an effort to improve state and parameter estimation accuracy, a new innovation in this study is an analytic solution to the expectation of the mean membrane potential, which is modeled as a Gaussian, transformed by the sigmoid. To show the solution, we first point out that

\begin{matrix} γ_{j} {\hat{ξ}}_{t - 1}^{+} = {\hat{v}}_{t, j} & (26) \end{matrix}

corresponds to the total pre-synaptic mean membrane potential of the jth neural population, where γ_j is a row vector from the adjacency matrix, C, which is described in detail in Appendix 5.2. Also, the variance of the pre-synaptic mean membrane potential is

\begin{matrix} γ_{j} {\hat{P}}_{t - 1}^{+} γ_{j}^{⊤} = {\hat{σ}}_{t, j}^{2} . & (27) \end{matrix}

The analytic solution for the expectation of a Gaussian distributed random variable (total membrane potential of the respective population) transformed by the sigmoid error function, g(·), is given by

\begin{matrix} 𝔼 [g (γ_{j} {\hat{ξ}}_{t - 1}^{+})] = \frac{1}{2} (erf (\frac{γ_{j} {\hat{ξ}}_{t - 1}^{+} - v_{0}}{\sqrt{2 (ς^{2} + γ_{j} {\hat{P}}_{t - 1}^{+} γ_{j}^{⊤})}}) + 1) ​ ​ . & (28) \end{matrix}

The derivation of this new result is shown in Appendix 5.3.

The a-priori covariance is approximated using the unscented transform, which approximates the statistics of a multivariate Gaussian that undergoes a nonlinear transformation (Julier and Uhlmann, 1997). The approximation is given by

where yes ⁱ_{t − 1} is a matrix of sigma vectors, which are carefully chosen samples from the distribution of $\hat{x}$ ⁺_{t − 1}, and W_i are vectors of weights associated with the transform. For completeness, the method of computing the sigma vectors and the weights is provided in Appendix 5.4.

It is likely that the parameters and states described by a cortical circuit will be subject to identifiable physiological constraints that should be included in an inversion problem in order to exploit all available information. There are various ways to constrain the parameter space by truncating the distribution of the prior (Simon, 2006). In this study, a computationally simple method known as “clipping” (Kandepu et al., 2008) was used to constrain the synaptic gains. Upper and lower bounds on synaptic gain estimates were enforced during the calculation of the posterior distribution by imposing limits on the analytic calculation of the mean and on the sample space of the unscented transform (used to approximate the covariance). The bounds were set larger than proposed ranges for the intra-regional parameters of a multi-area neural mass model, determined by Babajani-Feremi and Soltanian-Zadeh (2010). The bounds for the constraints are shown in Table 3.

TABLE 3

Table 3. Parameter constraints used in the clipping method of the estimation algorithm.

2.3. Simulations for Validation

In order to test the performance capabilities of the model-based framework, it is necessary to use data where the actual parameter values are known. While it is impossible to accurately measure parameter values in an experiment, it is possible to know the actual values when using data that is generated in a forward simulation. Therefore, artificial data was used to test the estimation performance. This type of test does not guarantee that the method will work with clinical recordings, but provides a proof of principal based on the assumption that our neural population model provides a reasonable representation of cortical dynamics. Considering the wide range of phenomena that the population model has been able to describe and the wide acceptance in the literature, this assumption is a reasonable starting point.

In order to test the robustness of the estimation algorithm, a Monte Carlo simulation was performed by testing the estimation algorithm with 50 realizations of synthetic data, each with a different unknown input. For each of the realizations, the parameters were set such that the model generated activity with a dominate spectral peak at around 10 Hz (alpha activity). The parameter values are shown in Table 4. The accuracy of parameters estimates (connectivity gains) are measured in terms of percentage bias and were taken as the absolute difference between the estimated and true values at the end of each simulation. Simulations were run for 60 s for the single-region model and 100 s for the four-region model, as the parameter estimates were observed to converge well within this time. For state tracking, only the results of the post-synaptic potentials are shown, although the derivatives of the post-synaptic potentials were also tracked. State accuracy was measured by the root mean squared (RMS) error over 1 s of data, since the states (and their estimates) are dynamic. The RMS error was measured from the final second of the simulation, when parameter estimates were assumed to be constant. Results are also presented for a single realization for both the single and four region models (normal and epileptiform) in order to illustrate the convergence properties over time of the parameter estimates. The parameters used to simulate the epileptic-type behavior seen in the simulated seizure transition are given in Table 5. The bounds that were used to constrain the parameter estimates are shown in Table 3.

TABLE 4

Table 4. Connectivity parameters to simulate an alpha rhythm in the multi-region population model.

TABLE 5

Table 5. Connectivity parameters used to simulate epileptic behavior in the multi-region population model.

3. Results

3.1. Comparison of Analytic Mean and Unscented Transform

The performance of the modified Kalman filter and the unscented Kalman filter were compared in order to quantify the increase in estimation performance from using the analytic mean. Both methods approximated the covariance of the joint distribution using the unscented transform. Since the mean and covariance cannot be considered separately when the distribution is propagated through the neural population model, the Kalman filter that uses the analytic mean is really an approximation of a Gaussian distribution. However, the difference between the standard UKF and this novel application of the Kalman filter, which is tailored to the neural population model, is that the new approach based on the analytic mean has the potential to improve state and parameter estimation for this particular application.

Tables 6, 7 show the mean estimation bias for intra-connectivity gains and post-synaptic potentials (PSPs) of a single cortical region. Table 6 demonstrates that the analytic mean approach is approximately twice as accurate as the UKF for state tracking of v_up, v_pi and v_ip and has equal accuracy with the UKF for v_ep and v_pe. This is consistent with the parameter estimates in Table 7, which shows that the analytic mean method gave two to three times improved accuracy over the UKF for α_up, α_pi and α_ip (and has the same accuracy for α_ep and α_pe). Figure 3 shows the results for the entire Monte Carlo simulation and again demonstrates that the Kalman filter using an analytic mean outperforms the UKF for the single region model. Figures 3A,B show that the intra-connectivity gain estimation is within 60% for all parameters for the UKF and less than 25% for the analytic mean method. Figures 3C,D show that the bias for tracking of PSPs is consistently less than 1.4 mV for the UKF and less than 0.7 mV for the analytic mean approach. On the whole, these results demonstrate the value of the novel application of the modified Kalman filter for the neural population model.

TABLE 6

Table 6. Mean bias (over 50 simulations) of the post-synaptic potential estimates for a single region model of alpha rhythms, with comparison between the UKF and the new modified Kalman filter.

TABLE 7

Table 7. Mean bias (over 50 simulations) of the connectivity gain estimates for a single region model of alpha-type rhythms, with comparison between the UKF and the new modified Kalman filter.

FIGURE 3

Figure 3. Comparison of the estimation results from the modified Kalman filter with the unscented Kalman filter from the Monte-Carlo simulation (50 realizations). (A) The bias for parameter estimation as a percentage of the true value for the connectivity gain using the UKF. (B) The bias for parameter estimation as a percentage of the true value for the connectivity gain using the analytic mean. (C) RMS error for state tracking of the post synaptic potentials using the UKF. (D) RMS error for state tracking of the post synaptic potentials using the analytic mean. The center line of the box plots shows the median error and the box covers are the 25th to 75th percentiles. The whiskers cover the entire range of errors that are not considered outliers, which are shown by the dots. The outliers are determined to be outside q₁ − 1.5(q₃ − q₁) to q₃ + 1.5(q₃ − q₁) where q₁ and q₃ denote the 25th and 75th percentiles.

3.2. Single Region Model

Figure 4 shows an example of state tracking and parameter estimation for a single cortical region. The plots show that the algorithm was able to reliably track all postsynaptic potentials and estimate all connectivity gains in the region. This remarkable result was achieved using only the noisy ECoG signal and knowledge of the structure of the cortical circuit. Figure 4 also shows that the standard deviation of the estimated parameters also converged, which demonstrates the filter was performing as expected. The standard deviation of the estimate for α_ip remained larger than the estimates for the other connectivity gains, as it had the largest bounds representing greater uncertainty.

FIGURE 4

Figure 4. Estimation results showing convergence of parameters in the single region model. 30 s of ECoG data simulating an alpha rhythm from a single region model was used. Each panel shows the PSP (upper) and connectivity gain (lower) estimates. The actual states are shown in red and the estimated values are shown in black. The gray shaded regions show the estimated standard deviation estimates of the connectivity gains. The scale in the lower left of each subpanel is distinct for the PSP (LHS) and connectivity gain (RHS) (A) PSP and connectivity gain for spiny stellate to pyramidal connection. (B) PSP and connectivity gain for pyramidal to inhibitory interneuron connection. (C) PSP and connectivity gain for pyramidal to spiny stellate connection. (D) PSP and connectivity gain for inhibitory interneuron to pyramidal connection. (E) PSP and connectivity gain for external input to pyramidal connection.

Figures 3B,D show the results for parameter estimation and state tracking using the Kalman filter with the analytic mean for a Monte Carlo simulation with 50 realizations. Both figures demonstrate good accordance for estimation results to the actual states and parameters, with the possible exception of the inhibitory-to-pyramidal connectivity gain estimate (α_ip) when using the standard unscented Kalman filter.

From Figure 3D and Table 6 it can be seen that the bias of the state (PSP) tracking was consistently less than 0.7 mV and the mean RMS bias was less than 0.4 mV for all the potentials when using the modified filter. The amplitude of the PSPs was on the order of 10–30 mV, thus an average bias of less than 0.4 mV represents satisfactory performance. The tracking of post-synaptic potential induced from the input, v_up, was the worst performer. This is to be expected since it is linked to the connection from the stochastic input, u(t), and the pyramidal population. Figure 3B and Table 7 show that the mean estimation bias for all of the connectivity coefficients (slow states) was less than 22% with a mean of less than than 8%. It is anticipated that this level of accuracy in state estimation will provide a strong basis for a classification algorithm that distinguishes between healthy and abnormal oscillations (such as observed during seizures).

3.3. Four Region Model

Figure 5 shows an example estimation result for the four region model. The four region model has four times as many measurements that are inputs to the filter, as there are additional ECoG voltage signals (one per region). However, the dimensionality of the system is more than four times larger than the single column, as each new column introduces an equal number of intra-regional connections as well as two inter-regional connections with its neighbors. In Figure 5, only the inter-regional connections are shown, although all of the PSPs and connectivity gains were estimated. The results that are presented in Figure 5 demonstrate that the estimation method was capable of scaling up from a single region model to a larger model of coupled regions, while maintaining the ability to simultaneously estimate all the connectivity gains and track the PSPs associated with every synapse. The ability to scale up to a larger area is crucial in order to apply estimation to patient-specific models of epilepsy.

FIGURE 5

Figure 5. Post-synaptic potential and connectivity gain estimation results for the four region model showing parameter convergence. ECoG data was obtained over a 50 s simulation using the four region model to generate alpha-type rhythms. The filter output for PSP tracking is over a short time segment and the connectivity gain estimation is for the entire simulation. The actual states are shown in red and the filter output is shown in black. The gray bar around the plot of the connectivity gain estimates shows the standard deviation of the estimate. (A) PSP and interconnectivity gains from region one to two (upper) and four (lower). (B) PSP and interconnectivity gains from region two to one (upper) and three (lower). (C) PSP and interconnectivity gains from region four to one (upper) and four to three (lower). (D) PSP and interconnectivity gains from region three to four (upper) and three to two (lower).

Figures 6, 7 show the estimation bias over 50 simulations for the connectivity gains and PSP tracking, respectively. Each simulation was run for 100 s (as in Figure 5) with a different randomly generated sequence for u(t) as external input. Tables 8, 9 summarize the mean (over the 50 simulations) values of the estimation biases for both fast and slow states. Figure 6 and Table 8 show that the RMS bias for PSP tracking was consistently less than 1.5 mV and the mean RMS bias was less than 1 mV for all connections. The amplitude of the PSP signals was on the order of 10–30 mV and the variance of noise added to the ECoG voltages was 1 mV. Therefore, the bias for PSP tracking represents a high level of accuracy. As was seen for the single region model, the tracking performance was less accurate for v_up due to the stochastic input that generates this PSP.

FIGURE 6

Figure 6. Post-synaptic potential estimation results in the four region model from a Monte-Carlo simulation. Each subplot shows the RMS bias for state tracking of a PSP associated with a specific synapse over 50 simulations. (A) RMS bias for v_up. (B) RMS bias for v_pi. (C) RMS bias for v_pe. (D) RMS bias for v_jk. (E) RMS bias for v_ep. (F) RMS bias for v_ip. (G) RMS bias for v_kj. ECoG data was obtained using the four-region model generating alpha-type rhythms, with different stochastic input for every simulation. For every subplot, the centerline of the boxplots are the median and the edges are the 25th and 75th percentiles. Outliers are determined to be outside q₁ − 1.5(q₃ − q₁) to q₃ + 1.5(q₃ − q₁) where q₁ and q₃ denote the 25th and 75th percentiles.

FIGURE 7

Figure 7. Connectivity estimation results in the four region model from a Monte-Carlo simulation. Each subplot shows the estimation bias as a percentage of the true value for the connectivity gain for every synapse over 50 simulations. (A) Bias for α_up. (B) Bias for α_pi. (C) Bias for α_pe. (D) Bias for α_jk. (E) Bias for α_ep. (F) Bias for α_ip. (G) Bias for α_kj. ECoG data was obtained using the four-region model generating alpha-type rhythms, with different stochastic input for every simulation. For every subplot, the centerline of the boxplots are the median and the edges are the 25th and 75th percentile. Outliers are determined to be outside q₁ − 1.5(q₃ − q₁) to q₃ + 1.5(q₃ − q₁) where q₁ and q₃ denote the 25th and 75th percentiles.

TABLE 8

Table 8. Mean RMS estimation bias (over 50 realizations in mV) for post-synaptic potential tracking in the multi-region model.

TABLE 9

Table 9. Mean bias (over 50 realizations in %) for connectivity parameter estimates in the multi-region model.

Figure 7 and Table 9 show that the estimation bias for the connectivity gains was less than 40% and the mean bias was less than 10%, except for α_ip and α_jk which were less than 15%. The parameter estimation accuracy for the coupled model compared with the single region model was comparable in terms of the mean value for all connectivity gains. Over the entire Monte Carlo simulation, the estimation performance for α_ep, α_pi and α_pe were similar to the single region model. The decrease in performance is most evident for α_ip (from within 20% to within 40%). This is consistent with the results from the single region model where α_ip was the least accurate of the estimated gains. The estimation performance for α_jk and α_kj cannot be compared to the single region model. However, the estimation accuracy of the interconnectivity gains was worse than the intra-region gains (apart from α_ip). It is difficult to pinpoint sources of error for this parameter, as all of the estimated states are highly interactive with each other. A potential source of the decreased accuracy for α_jk and α_kj (as well as α_up) is that their values are an order of magnitude smaller than the other estimated connectivity gains, which can lead to numerical problems for the Kalman filter equations. On the whole, the consequences of scaling up the model from a single region to four coupled regions has not resulted in major loss of estimation accuracy.

3.4. Simulation of an Epileptic Seizure

Figure 8 shows a simulated ECoG time series with transitions from a background rhythm to seizure-like oscillations and back. The transitions were achieved in the forward simulation by ramping the amplitude of the excitatory gains of one cortical region (region 1 in Figure 8) and then decreasing them back to their usual values. The values used to generate the seizure-type behavior are shown in Table 5. In order to ensure that the seizure-like oscillations would spread from one region to the neighboring regions, the interconnectivity between the first area (where the seizure was initiated) to its neighbors was increased from the previous example over the entire time course of the simulation, while the interconnectivity gains from all other regions back to the first region were decreased (as shown in Table 5).

FIGURE 8

Figure 8. Simulation of an epileptiform transition. ECoG signals were obtained using a 100 s forward simulation and adjusting the connectivity gains from alpha to seizure rhythms and vice versa (see Tables 4, 5). The simulation output shows the epileptiform activity rapidly spreading from Region 1 (where the pathology was simulated), to the rest of the network. The figure also shows a graphical representation of the model of the differential measurement function. The blue and red sub-panels show example alpha and seizure-type rhythms, respectively.

Figure 9 shows the estimation results of the connectivity gains for each cortical area during the simulated seizure. In order to track parameter changes (compared with the previous estimation when parameters were assumed to be static), additional uncertainty was added to the estimate error covariance in the Kalman filter (see Appendix 5.4.). The additional uncertainty was required to inflate the estimation error covariance to capture unmodeled transitions in parameter values. It is clear that the method has successfully identified the transitions in the cortical region that led to the seizure generation, as the filter tracked the increase in these gains for region 1, while accurately estimating the corresponding connectivity gains for the other cortical regions that remained constant.

FIGURE 9

Figure 9. Results from tracking pathological changes in the connectivity gains that lead to epileptiform activity. In each subplot, the red line shows the actual values. (A–G) Show the estimation results from Region 1, where the internal excitatory connectivity gains were transiently increased to induce the epileptiform discharge. The mean is shown by the black line and the gray shaded area shows the standard deviation of the estimate. (H–N) Show the estimates from the non-pathological regions (no change in parameters from baseline), where the solid lines show the mean and shaded regions show the standard deviation of the parameters.

It can be seen from Figure 9A that the estimation accuracy for α_up was lower than the other connectivity gains due to the stochastic input. The estimated interconnectivity gains that were associated with inputs to region 1 (the epileptic region), α₂₁ and α₄₁, also do not quite converge (Figures 9F,G) the actual values. This could be due to the much smaller magnitude of these gains compared with the corresponding interconnectivity gains in the other regions. From Figure 9D, it can also be noted that the estimation accuracy of inhibitory to pyramidal connectivity, α_ip, did not converge to the actual value in first part of the simulation (alpha rhythm), which was also consistent with previous results. However, the estimates of α_ip converged to actual values during the seizure and had a lower estimation standard deviation, which can be attributed to the higher signal-to-noise ratio during larger amplitude oscillations. If this method of estimation can be translated for use on real data, it has the potential to provide valuable insight into the cause and spread of seizures and enable more informed treatment measures for epilepsy patients.

4. Discussion

This paper presented a framework for model inversion that facilitates estimation and imaging of the physiological properties of the brain using electrocorticography (ECoG) data, under the assumption that the model captures the key features of the cortical circuits of interest. Tracking of the mean membrane potentials of the various neural populations and connectivity parameters (within and between cortical regions) may provide a clear picture of the causal relationships between cortical dynamics and seizures. The link between physiological parameters and data will undoubtedly improve detection and treatment outcomes across a range of pathologies.

We have demonstrated that is possible to reliably track the post-synaptic potentials and estimate the connectivity parameters of a large-scale neural population model. This demonstration highlights the power of combining the prior information we have about neural dynamics and cortical structure (that is encoded in the computational model) to estimate the parameters of interest. For the single region case, the average prediction bias for connectivity parameters is less than 8% and the average RMS error in the mean post-synaptic potential estimates within the local circuit was less than 0.4 mV (the peak to peak potential of a typical post-synaptic potential was approximately 20 mV). We demonstrated that the framework can be scaled up to a larger-scale model (of four cortical regions) with more realistic measurements without a major decrease in estimation accuracy. The average estimation error remained less than 10% except for three parameters (errors in α_ip, α_jk, and α_kj were less than 15%). The tracking of post-synaptic potentials in the four-region model had mean RMS error of less than 1 mV. Importantly, we demonstrated the ability to track slow changes in the connectivity parameters, that led to transitions to and from seizures. Traditionally, functional neuroimaging methods have been very successful, but limited to determining where and when seizures occur. This new method can be used with ECoG data to also determine the mechanisms. This knowledge will provide opportunities to develop new therapies.

Traditionally, amplitude, frequency and phase correlations in neuroimaging data have been used as features to study connectivity. While these techniques imply a causal relationship, they can be misleading. For instance, correlations that arise between multiple microelectrode neural recordings could be the result of neurons independently responding to a common stimulus or could be caused by synaptic coupling between neural populations (Friston, 1994). Other possibilities that need to be taken into account are neural populations receiving a common modulatory input from another unobserved region of the brain, or indirect coupling between neural populations where connectivity is affected via multiple regions (Friston, 1994). Questions about the sources of correlation in neural recordings are difficult to disambiguate without resorting to more invasive methods of measurement. On the other hand, computational models can directly infer cortical connectivity patterns and neural dynamics from data, providing the probable cause of empirical observations. The degree to which such causal relationships correspond to the true state of the cortex is limited by the model uncertainty, just as correlations identified using other types of neuroimaging are limited by spatial and/or temporal resolution constraints. However, model uncertainty can be quantified, which is a highly useful property for many classification applications.

Under a Gaussian assumption, the Kalman filter provides estimates of the probability distributions of the states and parameters of the population model, which is updated as new measurements become available. If the Gaussian assumption holds, the Kalman filter provides the minimum variance estimate of the states and parameters (Simon, 2006). However, the nonlinearities in the model lead to non-Gaussian states. Nevertheless, the Gaussian approximation leads to good estimation results, as demonstrated by the Monte Carlo simulations. However, these results do not guarantee that the state and parameter estimates will not eventually diverge from the actual values, given a measurement times series of a longer duration. This is due to the approximations of the unscented transform. Possible improvements in the estimation results could come from using sequential Monte Carlo (SMC) filtering methods, when the Gaussian assumption can be relaxed. However, SMC methods impose a much larger computation burden that may make them prohibitive for imaging large-scale neural systems.

The derivation of the analytic a-priori (prediction through the model) state and parameter estimates provided in this paper gives an exact solution for the expected value for a Gaussian transformed by a sigmoid, regardless of the shape of the resultant distribution. This improves on the the unscented or extended Kalman filters, which have previously been used in a similar context (Voss et al., 2004; Schiff and Sauer, 2008; Liu and Gao, 2013). The Gaussian approximation of the uncertainty in the state and parameter estimates that are predicted by the model is maintained in our framework using the unscented transform.

The implementation of the unscented transform with large covariance matrices is a well established limitation of the filter (Wan and Van Der Merwe, 2000; Simon, 2006; Särkkä, 2013). While scaling up the size of the model did not significantly increase the estimation bias in this case, it does exponentially increase the computation time to the point where it becomes impractical for real-time applications. For increasing numbers of variables to be estimated, the covariance matrix eventually becomes so large that the use of the unscented transform becomes computationally infeasible. The extended Kalman filter is one possible alternative for approximating the covariance, but estimation accuracy is compromised (for the sigmoid nonlinearity). A possible direction of future research is improved methods of covariance estimation.

A probabilistic (Bayesian) approach is also used in the dynamic causal modeling (DCM) framework, which utilizes an expectation maximization algorithm. However, in the DCM framework, individual distributions of states and parameters are not estimated, where uncertainty is placed over the full model including the measurement function. DCM fits a range of candidate models with various inter-region connectivity structures, and then selects the most appropriate candidate using an information theoretic criterion (Daunizeau et al., 2009). DCM has been applied across a range of data from fMRI (David et al., 2008), ECoG time series (David, 2007) and EEG spectral response (Moran et al., 2008), as well as different phenomena such as seizure prediction (Aarabi and He, 2013) and auditory habituation (Wang and Knösche, 2013). A possible advantage of the Kalman (and sequential) filtering approaches over the DCM framework and other similar methods (such as genetic algorithms) is the ability to track slowly changing parameters in real time, which is likely to be particularly important when investigating transitions observed in data, such as epileptic seizures.

The algorithm presented in this paper utilized known constraints of physiological variables. Enforcing constraints on states and parameters greatly improved the convergence properties of the filter. Without any bounds applied to the distributions of parameter estimates, the results typically did not converge to a steady value within the simulation time-frame. There are a number of alternative and more theoretically rigorous approaches for constraining the parameter estimates. However, most constraint methods add a significant computational burden to the filter (Simon, 2006; Kandepu et al., 2008), rendering them impractical for implementation in large-scale systems. The large number of states and parameters to be estimated restricted the constraint method to clipping, which is computationally efficient to implement. Future work in this area should be to investigate effect of constraints on the estimation performance (such as the estimate variance).

The initialization of the filter, in particular the covariance matrix, is a notoriously inexact science (Wan and Nelson, 1997; Wan and Van Der Merwe, 2000; Simon, 2006; Schiff, 2012). In practice, significant tuning is often required to achieve stable and accurate estimation results. For this study, the initial covariance was based on knowledge obtained from forward simulations. A larger initial covariance was used when the number of hidden variables was increased. The initial uncertainty for parameters was increased by broadening the range of the constraints. Furthermore, when parameters to be estimated are dynamic rather than static (as would be the case for most parameters of interest in neural models), an additional constant error term is added to the covariance matrix to prevent an overestimate of confidence in the model (Voss et al., 2004). In this case it was found that additional uncertainty should be very small relative to the magnitude of the parameter. The amplitude of the additive uncertainty is analogous to a learning rate parameter in other algorithms. It can be relatively easily tuned by examining the convergence rate the parameters (i.e., see Figure 9).

The estimation framework presented in this paper can be naturally integrated with other existing imaging technologies and computational methods in the field of neuroscience. All methods of neuroimaging are essentially inversion problems, that rely on a transformation from the measurement space to the source space. An example is the transformation of magnetic radiation to the haemodynamic response in fMRI. Typically, measurements are transformed using a specific inversion technique to determine the state of the neural tissue. The framework presented in this paper applies the same philosophy. However, the transformation from the measurement to the source space is via a generative model. The generative model reflects the current state-of-the-art of our knowledge of the mesoscopic biophysics and anatomy of cortical circuits. By the same token, limitations and uncertainties in our current knowledge can also be quantified and incorporated into the model, making all predictions reflect probability distributions rather than scalar values. The mapping from neural population models to measurements can be readily adapted to describe different modalities, via alternative observation equations, enabling multiple sources of data to be combined to form a unifying model. The difficulty of measuring brain activity in a minimally invasive manner makes it imperative to use as much information as possible to predict neural states and inter-connectivities. A framework that combines patient-specific measurements with well accepted principles of brain structure and function, and importantly, knowledge of uncertainty, is an important step toward the lofty goal of reverse engineering the brain.

The estimation framework presented in this model could be used as the first stage of a seizure prediction system, providing the necessary features that are used as inputs to a classifier. It is necessary to represent neural data using representative features in order to reduce the dimensionality of the problem prior to applying a classification algorithm. In the past, efforts have focused on defining features that are correlated with ictal and pre-ictal periods and, as such, can be used in a predictive capacity (Andrzejak et al., 2001; Lehnertz et al., 2003). Recently a patient-specific seizure classifier for ECoG was implemented using parameters identified from a neural mass model (Aarabi and He, 2013). The advantages of using neural states and parameters as features for seizure classification is that they are naturally patient-specific (since they are directly relatable to the neural activity) and may also provide clues as to the underlying cause of seizures, which could inform treatment strategies.

The capability of neural models to be tailored to an individual patient's data is particularly relevant to the investigation and treatment of epilepsy, since it is a highly patient-specific disorder. The mechanisms for seizure onset and propagation vary significantly between patients (Wendling et al., 2005; Mormann et al., 2007; Coombes and Terry, 2012). Ideally, information about neural interconnectivity should be obtained on a case-by-case basis using an individualized model (Blenkinsop et al., 2012; Nevado-Holgado et al., 2012). A reliable model inversion framework will enable more precise targeting of therapies. The information provided by a model-based framework could also predict the response to drug treatments or electrical stimulation in a simulated environment, sparing a patient the negative side effects that may arise from a trial-and-error approach. Models can also be used to provide feedback for deep brain stimulators for robust prevention of seizures (Mormann et al., 2007; Adhikari et al., 2009).

This paper presented a framework rather than a specific method. Within the framework, the level of realism of the model can be increased to include more neural population subtypes and the spatial extent can increased to model larger cortical networks. The end goal is to provide the tools to create patient-specific models that use all of the available patient-specific neuroimaging data. Existing studies have demonstrated that this framework is capable of being extended to describe more complex phenomena through the inclusion of, for example; more populations and regions (Babajani-Feremi and Soltanian-Zadeh, 2010; Wang and Knösche, 2013), self feedback connections (Ursino et al., 2010) and firing rate modulated plasticity/habituation of synapses (Deco et al., 2008; Moran et al., 2013) or spatially dependent dynamics (Freestone et al., 2011; Aram et al., 2013). As the model size and complexity increases, there will be new parameters that need to be estimated as they are not directly measurable by other means. There are a number of potential directions that should be investigated to address the problem of dimensionality, such as model reduction, improved methods of covariance approximation or linearization techniques. Finally, further validation of the proposed estimation framework on patient data is necessary to evaluate the true predictive capability of this method.

Author Contributions

Dean R. Freestone and Philippa J. Karoly contributed to all aspects of the paper, including conception of ideas, derivation of new analytic results, software development, and testing, interpretation of results, and writing and editing of the manuscript. Philippa J. Karoly led the software development and simulation experiments. Dean R. Freestone led the model and estimation derivations. David B. Grayden, Dragan Nešić, Parham Aram, and Mark J. Cook all contributed toward conceiving the ideas and drafting the manuscript. All authors have provided final approval and are accountable for all aspects of the research.

Funding

This work was funded by the Australian Research Council (Linkage Project LP100200571).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Thanks to Richard Balson, Amirhossein Jafarian, Saeed Ahmadizadeh, Omid Monfred, Elmira Karami, Andre Peterson, Alan Lai, Anthony Burkitt, Tianlin (Stella) Ying, Benjamin Guo, Tatiana Kameneva, Raymond Boston, and Tim Esler, who all contributed to this paper either by providing feedback, stimulating discussions, and/or provided support.

References

Aarabi, A., and He, B. (2013). Seizure prediction in hippocampal and neocortical epilepsy using a model-based approach. Clin. Neurophysiol. 125, 930–940. doi: 10.1016/j.clinph.2013.10.051

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Adhikari, M. H., Heeroma, J. H., di Bernardo, M., Krauskopf, B., Richardson, M. P., Walker, M. C., et al. (2009). Characterisation of cortical activity in response to deep brain stimulation of ventral–lateral nucleus: modelling and experiment. J. Neurosci. Methods 183, 77–85. doi: 10.1016/j.jneumeth.2009.06.044

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Andrzejak, R. G., Lehnertz, K., Mormann, F., Rieke, C., David, P., and Elger, C. E. (2001). Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. Phys. Rev. E 64, 1–8. doi: 10.1103/PhysRevE.64.061907

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Aram, P., Freestone, D., Dewar, M., Scerri, K., Jirsa, V., Grayden, D. B., et al. (2013). Spatiotemporal multi-resolution approximation of the amari type neural field model. Neuroimage 66, 88–102. doi: 10.1016/j.neuroimage.2012.10.039

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Arcak, M., and Nešić, D. (2004). A framework for nonlinear sampled-data observer design via approximate discrete-time models and emulation. Automatica 40, 1931–1938. doi: 10.1016/j.automatica.2004.06.004

CrossRef Full Text | Google Scholar

Babajani-Feremi, A., and Soltanian-Zadeh, H. (2010). Multi-area neural mass modeling of eeg and meg signals. Neuroimage 52, 793–811. doi: 10.1016/j.neuroimage.2010.01.034

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Biswal, B., Zerrin Yetkin, F., Haughton, V. M., and Hyde, J. S. (1995). Functional connectivity in the motor cortex of resting human brain using echo-planar mri. Magn. Reson. Med. 34, 537–541. doi: 10.1002/mrm.1910340409

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Blenkinsop, A., Valentin, A., Richardson, M. P., and Terry, J. R. (2012). The dynamic evolution of focal-onset epilepsies–combining theoretical and clinical observations. Eur. J. Neurosci. 36, 2188–2200. doi: 10.1111/j.1460-9568.2012.08082.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bokde, A. L., Tagamets, M.-A., Friedman, R. B., and Horwitz, B. (2001). Functional interactions of the inferior frontal cortex during the processing of words and word-like stimuli. Neuron 30, 609–617. doi: 10.1016/S0896-6273(01)00288-4

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Breakspear, M., Roberts, J., Terry, J. R., Rodrigues, S., Mahant, N., and Robinson, P. (2006). A unifying explanation of primary generalized seizures through nonlinear brain modeling and bifurcation analysis. Cereb. Cortex 16, 1296–1313. doi: 10.1093/cercor/bhj072

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Coombes, S., and Terry, J. R. (2012). The dynamics of neurological disease: integrating computational, experimental and clinical neuroscience. Eur. J. Neurosci. 36, 2118–2120. doi: 10.1111/j.1460-9568.2012.08185.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Crick, F. C., and Koch, C. (2005). What is the function of the claustrum? Philos. Trans. R. Soc. B Biol. Sci. 360, 1271–1279. doi: 10.1098/rstb.2005.1661

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cui, D., Li, X., Ji, X., and Liu, L. (2011). Multi-channel neural mass modelling and analyzing. Sci. China Inform. Sci. 54, 1283–1292. doi: 10.1007/s11432-011-4216-9

CrossRef Full Text | Google Scholar

da Costa, N. M., and Martin, K. A. (2010). Whose cortical column would that be? Front. Neuroanat. 4:16. doi: 10.3389/fnana.2010.00016

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Da Silva, F. L., Hoeks, A., Smits, H., and Zetterberg, L. (1974). Model of brain rhythmic activity. Kybernetik 15, 27–37. doi: 10.1007/BF00270757

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Daunizeau, J., David, O., and Stephan, K. E. (2011). Dynamic causal modelling: a critical review of the biophysical and statistical foundations. Neuroimage 58, 312–322. doi: 10.1016/j.neuroimage.2009.11.062

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Daunizeau, J., Friston, K., and Kiebel, S. (2009). Variational bayesian identification and prediction of stochastic nonlinear dynamic causal models. Physica D 238, 2089–2118. doi: 10.1016/j.physd.2009.08.002

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

David, O. (2007). Dynamic causal models and autopoietic systems. Biol. Res. 40, 487–502. doi: 10.4067/S0716-97602007000500010

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

David, O., Cosmelli, D., and Friston, K. J. (2004). Evaluation of different measures of functional connectivity using a neural mass model. Neuroimage 21, 659–673. doi: 10.1016/j.neuroimage.2003.10.006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

David, O., and Friston, K. J. (2003). A neural mass model for meg/eeg:: coupling and neuronal dynamics. Neuroimage 20, 1743–1755. doi: 10.1016/j.neuroimage.2003.07.015

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

David, O., Guillemain, I., Saillet, S., Reyt, S., Deransart, C., Segebarth, C., et al. (2008). Identifying neural drivers with functional mri: an electrophysiological validation. PLoS Biol. 6:e315. doi: 10.1371/journal.pbio.0060315

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Deco, G., Jirsa, V. K., Robinson, P. A., Breakspear, M., and Friston, K. (2008). The dynamic brain: from spiking neurons to neural masses and cortical fields. PLoS Comput. Biol. 4:e1000092. doi: 10.1371/journal.pcbi.1000092

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Deng, B., Wang, J., and Che, Y. (2009). A combined method to estimate parameters of neuron from a heavily noise-corrupted time series of active potential. Chaos 19:015105. doi: 10.1063/1.3092907

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Douglas, R. J., and Martin, K. A. (2004). Neuronal circuits of the neocortex. Ann. Rev. Neurosci. 27, 419–451. doi: 10.1146/annurev.neuro.27.070203.144152

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Douglas, R. J., Martin, K. A., and Whitteridge, D. (1989). A canonical microcircuit for neocortex. Neural Comput. 1, 480–488. doi: 10.1162/neco.1989.1.4.480

CrossRef Full Text | Google Scholar

Freeman, W. J. (1975). Mass Action in the Nervous System. New York, NY: Academic Press.

Google Scholar

Freeman, W. J. (1987). Simulation of chaotic eeg patterns with a dynamic model of the olfactory system. Biol. Cybern. 56, 139–150. doi: 10.1007/BF00317988

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Freestone, D., Aram, P., Dewar, M., Scerri, K., Grayden, D. B., and Kadirkamanathan, V. (2011). A data-driven framework for neural field modelling. Neuroimage 56, 1043–1058. doi: 10.1016/j.neuroimage.2011.02.027

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Freestone, D., Kuhlmann, L., Chong, M., Nesic, D., Grayden, D. B., Aram, P., et al. (2013). “Patient-specific neural mass modelling: stochastic and deterministic methods,” in Recent Advances in Predicting and Preventing Epileptic Seizures, eds R. Tetzlaff, C. E. Elger, and K. Lehnertz (Dresden: World Scientific Publishing Company), 63–82.

Google Scholar

Friston, K. J. (1994). Functional and effective connectivity in neuroimaging: a synthesis. Hum. Brain Mapp. 2, 56–78. doi: 10.1002/hbm.460020107

CrossRef Full Text | Google Scholar

Friston, K. J., Harrison, L., and Penny, W. (2003). Dynamic causal modelling. Neuroimage 19, 1273–1302. doi: 10.1016/S1053-8119(03)00202-7

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Goodfellow, M., Schindler, K., and Baier, G. (2011). Intermittent spike wave dynamics in a heterogeneous, spatially extended neural mass model. Neuroimage 55, 920–932. doi: 10.1016/j.neuroimage.2010.12.074

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Haeusler, S., Schuch, K., and Maass, W. (2009). Motif distribution, dynamical properties, and computational performance of two data-based cortical microcircuit templates. J. Physiol. Paris 103, 73–87. doi: 10.1016/j.jphysparis.2009.05.006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Horwitz, B. (2003). The elusive concept of brain connectivity. Neuroimage 19, 466–470. doi: 10.1016/S1053-8119(03)00112-5

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Horwitz, B., McIntosh, A. R., Haxby, J. V., Furey, M., Salerno, J. A., Schapiro, M. B., et al. (1995). Network analysis of pet-mapped visual pathways in alzheimer type dementia. Neuroreport 6, 2287–2292. doi: 10.1097/00001756-199511270-00005

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Jansen, B. H., and Rit, V. G. (1995). Electroencephalogram and visual evoked potential generation in a mathematical model of coupled cortical columns. Biol. Cybern. 73, 357–366. doi: 10.1007/BF00199471

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Julier, S. J., and Uhlmann, J. K. (1997). “A new extension of the kalman filter to nonlinear systems,” in Proceedings of the SPIE: Signal Processing, Sensor Fusion, and Target Recognition VI, Vol. 3068, ed I. Kadar (Orlando, FL: SPIE). doi: 10.1117/12.280797

CrossRef Full Text

Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. J. Basic Eng. 82, 35–45. doi: 10.1115/1.3662552

CrossRef Full Text | Google Scholar

Kandepu, R., Imsland, L., and Foss, B. A. (2008). “Constrained state estimation using the unscented kalman filter,” in Proceedings of the 16th Mediterranean Conference on Control and Automation (Ajaccio: Citeseer), 1453–1458.

Kiebel, S. J., David, O., and Friston, K. J. (2006). Dynamic causal modelling of evoked responses in eeg/meg with lead field parameterization. Neuroimage 30, 1273–1284. doi: 10.1016/j.neuroimage.2005.12.055

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lehnertz, K., Mormann, F., Kreuz, T., Andrzejak, R., Rieke, C., David, P., et al. (2003). Seizure prediction by nonlinear eeg analysis. Eng. Med. Biol. Mag. 22, 57–63. doi: 10.1109/MEMB.2003.1191451

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Liu, X., and Gao, Q. (2013). Parameter estimation and control for a neural mass model based on the unscented kalman filter. Phys. Rev. E 88:042905. doi: 10.1103/PhysRevE.88.042905

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Moran, R., Pinotsis, D. A., and Friston, K. (2013). Neural masses and fields in dynamic causal modeling. Front. Comput. Neurosci. 7:57. doi: 10.3389/fncom.2013.00057

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Moran, R. J., Stephan, K. E., Kiebel, S. J., Rombach, N., O'Connor, W., Murphy, K., et al. (2008). Bayesian estimation of synaptic physiology from the spectral responses of neural masses. Neuroimage 42, 272–284. doi: 10.1016/j.neuroimage.2008.01.025

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mormann, F., Andrzejak, R., Elger, C., and Lehnertz, K. (2007). Seizure prediction: the long and winding road. Brain 130, 314–333. doi: 10.1093/brain/awl241

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Nevado-Holgado, A. J., Marten, F., Richardson, M. P., and Terry, J. R. (2012). Characterising the dynamics of eeg waveforms as the path through parameter space of a neural mass model: application to epilepsy seizure evolution. Neuroimage 59, 2374–2392. doi: 10.1016/j.neuroimage.2011.08.111

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Nunez, P. L., and Srinivasan, R. (2006). Electric Fields of the Brain: The Neurophysics of EEG, 2nd Edn. New York, NY: Oxford University Press. doi: 10.1093/acprof:oso/9780195050387.001.0001

CrossRef Full Text | Google Scholar

Särkkä, S. (2013). Bayesian Filtering and Smoothing, 3rd Edn. Cambridge, MA: Cambridge University Press. doi: 10.1017/CBO9781139344203

CrossRef Full Text | Google Scholar

Schiff, S. J. (2012). Neural Control Engineering: The Emerging Intersection Between Control Theory and Neuroscience. Cambridge, MA: The MIT Press.

Google Scholar

Schiff, S. J., and Sauer, T. (2008). Kalman filter control of a model of spatiotemporal cortical dynamics. J. Neural Eng. 5, 1–8. doi: 10.1088/1741-2560/5/1/001

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Simon, D. (2006). Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches, 1st Edn. Hoboken, NJ: John Wiley and Sons. doi: 10.1002/0470045345

CrossRef Full Text | Google Scholar

Sporns, O. (2013). The human connectome: origins and challenges. Neuroimage 80, 53–61. doi: 10.1016/j.neuroimage.2013.03.023

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Ursino, M., Cona, F., and Zavaglia, M. (2010). The generation of rhythms within a cortical region: analysis of a neural mass model. Neuroimage 52, 1080–1094. doi: 10.1016/j.neuroimage.2009.12.084

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Van Essen, D. C., Smith, S. M., Barch, D. M., Behrens, T. E., Yacoub, E., and Ugurbil, K. (2013). The wu-minn human connectome project: an overview. Neuroimage 80, 62–79. doi: 10.1016/j.neuroimage.2013.05.041

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Voss, H. U., Timmer, J., and Kurths, J. (2004). Nonlinear dynamical system identification from uncertain and indirect measurements. Int. J. Bifurcation Chaos 14, 1905–1933. doi: 10.1142/S0218127404010345

CrossRef Full Text | Google Scholar

Wan, E. A., and Nelson, A. T. (1997). Dual kalman filtering methods for nonlinear prediction, smoothing, and estimation. Adv. Neural Inform. Process. Syst. 9, 793–799.

Wan, E. A., and Van Der Merwe, R. (2000). “The unscented kalman filter for nonlinear estimation,” in Adaptive Systems for Signal Processing, Communications, and Control Symposium (Lake Louise, AB: IEEE), 153–158.

Google Scholar

Wan, E. A., and Van Der Merwe, R. (2001). “The unscented kalman filter,” in Kalman Filtering and Neural Networks, ed S. Haykin (New York, NY: John Wiley & Sons, Inc.), 221–280.

Wang, P., and Knösche, T. R. (2013). A realistic neural mass model of the cortex with laminar-specific connections and synaptic plasticity–evaluation with auditory habituation. PLoS ONE 8:e77876. doi: 10.1371/journal.pone.0077876

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wendling, F., Bartolomei, F., Bellanger, J., and Chauvel, P. (2000). Relevance of nonlinear lumped-parameter models in the analysis of depth-eeg epileptic signals. Biol. Cybern. 83, 367–378. doi: 10.1007/s004220000160

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wendling, F., Bartolomei, F., Bellanger, J., and Chauvel, P. (2002). Epileptic fast activity can be explained by a model of impaired gabaergic dendritic inhibition. Eur. J. Neurosci. 15, 1499–1508. doi: 10.1046/j.1460-9568.2002.01985.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wendling, F., Hernandez, A., Bellanger, J.-J., Chauvel, P., and Bartolomei, F. (2005). Interictal to ictal transition in human temporal lobe epilepsy: insights from a computational model of intracerebral eeg. J. Clin. Neurophysiol. 22, 343–356.

Pubmed Abstract | Pubmed Full Text | Google Scholar

Wilson, H. R., and Cowan, J. D. (1972). Excitatory and inhibitory interactions in localized populations of model neurons. Biophys. J. 12, 1–24. doi: 10.1016/S0006-3495(72)86068-5

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

5. Appendix

5.1. Discretization

To begin, we start with the exact continuous time system

\begin{matrix} \dot{ξ} = {[\begin{matrix} \dot{x} & \dot{θ} \end{matrix}]}^{⊤} & (A 1) \end{matrix}

\begin{matrix} = {[\begin{matrix} f^{e} (x, θ, u) & 0 \end{matrix}]}^{⊤} & (A 2) \end{matrix}

\begin{matrix} = F (ξ, u) . & (A 3) \end{matrix}

Discretization is performed using the Euler method, where the integration time step is denoted by δ by

\begin{matrix} F_{δ}^{a} (ξ, u) ≜ ξ + δ F (ξ, u) . & (A 4) \end{matrix}

The approximate discrete time system can be written in the compact form

\begin{matrix} ξ_{t + 1}^{a} = F_{δ}^{a} (ξ_{t}, u_{t}), & (A 5) \end{matrix}

where a denotes approximate and the subscript δ indicates that the model is parametrized by integration step size. Now, if we let the discrete time system that corresponds to an exact solution to the continuous system at the integration steps be f^e_δ(x_t,u_t), then under reasonable conditions it can be proven that the solution to the approximate discrete time system is consistent, such that

\begin{matrix} | F_{δ}^{e} (ξ_{t}, u_{t}) - F_{δ}^{a} (ξ_{t}, u_{t}) | \leq δ ρ (δ), & (A 6) \end{matrix}

where ρ(·) is a class-K function that has a dependance on size of the set of ξ and u (see Arcak and Nešić, 2004 for details). In the body of this paper, we will drop the subscript δ for notational convenience. However, we stress that the discrete time model is an approximation of the continuous system and is parameterized by the integration time step.

5.2. Definition of Matrices A, B, C, and D

The continuous time system can be written as

\begin{matrix} \dot{ξ} = A ξ + B ξ ◦ g (C ξ) + D (u) ξ & (A 7) \end{matrix}

where the matrices A, B, C, and D(u) ∈ ℝ^{n_ξ × n_ξ} and n_ξ = 3(N + K). For a fixed integration time step, δ, the discrete time model can be written in the form

\begin{matrix} ξ_{t + 1} = A_{δ} ξ_{t} + B_{δ} ξ_{t} ◦ g (C ξ_{t}) + D_{δ} (u) ξ_{t} & (A 8) \end{matrix}

where A_δ, B_δ, and D_δ(u) have the same dimension as their continuous time counterparts. (Note ◦ is the element-wise vector product)

In this appendix, we define all the matrices in Equations A7 and A8 and show the relationship between the models. The model contains (N + K) synaptic connections (N local connections and K inter-regional connections). Therefore, the number of parameters (connectivity coefficients) is defined as n_θ = (N + K) and the number of states (PSPs and their derivatives) is defined as n_x = 2(N + K).

The matrix A has a block diagonal structure that is comprised of two sub-matrices,

\begin{matrix} A = [\begin{matrix} Ψ & 0 \\ 0 & I_{n_{θ}, n_{θ}} \end{matrix}], & (A 9) \end{matrix}

where I_{n_θ, n_θ} ∈ ℝ^{n_θ × n_θ} is the identity matrix and Ψ ∈ ℝ^{n_x × n_x} is also composed of the sub-matrices;

\begin{matrix} Ψ = diag (Ψ_{j}) & (A 10) \end{matrix}

\begin{matrix} Ψ_{j} = [\begin{matrix} 0 & 1 \\ - \frac{1}{τ_{j}^{2}} & - \frac{2}{τ_{j}} \end{matrix}], & (A 11) \end{matrix}

where j = 1, …, N + K indexes connections.

The discrete time version A_δ is related to A by

\begin{matrix} A_{δ} = [\begin{matrix} I + δ Ψ & 0 \\ 0 & I \end{matrix}] . & (A 12) \end{matrix}

The matrix B has the form

\begin{matrix} B = [\begin{matrix} 0_{n_{x}, n_{x}} & Θ \\ 0_{n_{θ}, n_{x}} & 0_{n_{θ}, n_{θ}} \end{matrix}], & (A 13) \end{matrix}

where 0_{n_θ, n} ∈ ℝ^{n_θ × n} are zero matrices (for n = n_x, n_θ). Θ ∈ ℝ^{n_x × n_θ} maps the connectivity gains to the relevant sigmoidal activation function and is of the form

\begin{matrix} Θ = [\begin{matrix} 0 & \dots & 0 \\ \frac{b_{1}}{τ_{1}} & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & 0 \\ 0 & \dots & \frac{b_{N + K}}{τ_{N + K}} \end{matrix}], & (A 14) \end{matrix}

where b_j = 1 if the relevant connectivity gain is associated with an internal connection, otherwise b_j = 0 (where u_j ≠ 0) and the input is from an external population and is captured in the matrix D_δ(u), which is described below. The discrete time version is simply

\begin{matrix} B_{δ} = δ B . & (A 15) \end{matrix}

The adjacency matrix C is the same for both the continuous and discrete version of the model. It has a block diagonal structure where

\begin{matrix} C = diag (Γ, 0_{n_{θ}, n_{θ}}) & (A 16) \end{matrix}

and Γ ∈ ℝ^{n_x × n_x} sums the relevant post-synaptic potentials to form the mean membrane potentials then maps them to the activation function and is of the form

\begin{matrix} Γ = [\begin{matrix} 0 & 0 & \dots & 0 & 0 \\ γ_{2, 1} & 0 & γ_{1, n_{x} - 1} & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & 0 \\ γ_{n_{x}, 1} & 0 & \dots & γ_{n_{x}, n_{x} - 1} & 0 \end{matrix}] . & (A 17) \end{matrix}

The rows of Γ, which we will denote by γ_j, index the PSPs that contribute to the mean membrane potential of the presynaptic populations.

The input matrix D(u) has the structure

\begin{matrix} D (u) = [\begin{matrix} 0_{n_{x}, n_{x}} & U \\ 0_{n_{θ}, n_{x}} & 0_{n_{θ}, n_{θ}} \end{matrix}], & (A 18) \end{matrix}

where the matrix U ∈ ℝ^{n_x, n_θ} is given by

\begin{matrix} U = [\begin{matrix} 0 & \dots & 0 \\ \frac{u_{1}}{τ_{1}} & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & 0 \\ 0 & \dots & \frac{u_{N + K}}{τ_{N + K}} \end{matrix}] . & (A 19) \end{matrix}

The inputs u_m are zero for the majority of the elements, where there is only one external input per region in the current formulation. Each active input is a constant value. The discrete time version is

\begin{matrix} D_{δ} (u) = δ D (u) . & (A 20) \end{matrix}

5.3. Expectation of a Gaussian Membrane Potential Transformed by a Sigmoid

The prediction step in Kalman filter for the neural population model can be solved analytically given the solution of the expected value of the Gaussian membrane potential that is transformed by the nonlinear sigmoidal activation function. The solution for this problem is provided in this appendix. In order to provide the most concise derivation as possible, we will let mean firing threshold parameter v₀ = 0 and firing threshold variance ς = 1. The solution is provided for an arbitrary v₀ and ς, which can be found via the same sequence of steps in the derivation.

Let our Gaussian random variable, v, be described by the probability density function

\begin{matrix} p (v) = \frac{1}{σ \sqrt{2 π}} \exp (- \frac{{(v - μ)}^{2}}{2 σ^{2}}) . & (A 21) \end{matrix}

The expected value of the Gaussian random variable transformed by the sigmoid is defined by

\begin{matrix} 𝔼 [g (v)] = \int_{- \infty}^{\infty} g (v) p (v) d v & (A 22) \end{matrix}

\begin{matrix} = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} \int_{- \infty}^{v} \exp (- \frac{z^{2}}{2}) p (v) d z d v . & (A 23) \end{matrix}

To proceed, we can make the substitution z = w − v to get v out of the integral terminal giving

\begin{matrix} 𝔼 [g (v)] = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} \int_{- \infty}^{0} \exp (- \frac{{(w - v)}^{2}}{2}) p (v) d w d v . & (A 24) \end{matrix}

Next we substitute in the equation for the probability density function of the membrane potential and switch the order of integration, which can be changed without altering the limits of integration giving

\begin{matrix} 𝔼 [g (v)] = \frac{1}{2 π σ} \int_{- \infty}^{0} \int_{- \infty}^{\infty} & (A 25) \end{matrix}

\begin{matrix} \exp (- \frac{{(w - v)}^{2}}{2} - \frac{{(v - μ)}^{2}}{2 σ^{2}}) d v d w & (A 26) \end{matrix}

Now we need to integrate out v, so we collect all the v-related terms

\begin{matrix} \begin{array}{l} 𝔼 [g (v)] = \frac{1}{2 π σ} \int_{- \infty}^{0} \exp (- \frac{1}{2 σ^{2}} (σ^{2} w^{2} + μ^{2})) \\ \times \int_{- \infty}^{\infty} \exp (- \frac{σ^{2} + 1}{2 σ^{2}} v^{2} + \frac{σ^{2} w + μ}{σ^{2}} v) d v d w . \end{array} & (A 27) \end{matrix}

Integrating out v in the second term we get

\begin{matrix} \begin{array}{l} \int_{- \infty}^{\infty} \exp (- \frac{σ^{2} + 1}{2 σ^{2}} v^{2} + \frac{σ^{2} w + μ}{σ^{2}} v) d v \\ = \frac{\sqrt{2 π} σ}{\sqrt{σ^{2} + 1}} \exp (\frac{{(σ^{2} w + μ)}^{2}}{2 σ^{2} (σ^{2} + 1)}) . \end{array} & (A 28) \end{matrix}

The solution in Equation A28 is then recombined with Equation A27. After rearranging and simplifying, the expected value becomes

\begin{matrix} 𝔼 [g (v)] = \frac{1}{2 π} \frac{\sqrt{2 π}}{\sqrt{σ^{2} + 1}} \int_{- \infty}^{0} \exp (- \frac{{(w - μ)}^{2}}{2 (σ^{2} + 1)}) d w . & (A 29) \end{matrix}

To solve this last integral, we perform a change of variables

\begin{matrix} z = \frac{w - μ}{\sqrt{σ^{2} + 1}}, \frac{d z}{d w} = \frac{1}{\sqrt{σ^{2} + 1}} & (A 30) \end{matrix}

\begin{matrix} d w = \sqrt{σ^{2} + 1} d z, & (A 31) \end{matrix}

giving the final result,

\begin{matrix} \begin{array}{l} 𝔼 [g (v)] = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\frac{μ}{\sqrt{σ^{2} + 1}}} \exp (- \frac{z^{2}}{2}) d z \\ = \frac{1}{2} (erf (\frac{μ}{\sqrt{2 (σ^{2} + 1)}}) + 1) . \end{array} & (A 32) \end{matrix}

The more general solution for an arbitrary mean firing threshold, v₀, and firing threshold variance, ς, is

\begin{matrix} 𝔼 [g (v)] = \frac{1}{2} (erf (\frac{μ - v_{0}}{\sqrt{2 (ς^{2} + σ^{2})}}) + 1) . & (A 33) \end{matrix}

5.4. Unscented Transform

The sigma vectors are defined as

where κ is a constant that can be tuned which determines the spread of the sigma vectors around the mean and β is a parameter that can be used to incorporate information about the distribution of the states (2 is optimal for Gaussians) (Wan and Van Der Merwe, 2001). The vector ${(\sqrt{(n_{x} + κ) {\hat{P}}_{t - 1}^{+}})}_{i}$ is the i^th column of the matrix square root (e.g., the lower triangular matrix that can be computed using the Cholesky decomposition), where i = 1, …, n_x.

The weights, W_i, for the unscented transform are calculated as

\begin{matrix} W_{0} = \frac{κ}{n_{x} + κ} + β & (A 37) \end{matrix}

\begin{matrix} W_{i} = \frac{1}{2 (n_{x} + κ)} i = 1, \dots, 2 n_{x} . & (A 38) \end{matrix}

For the initialization of the Kalman filter in this paper, algorithm values were

\begin{matrix} β = 2 & (A 39) \end{matrix}

\begin{matrix} κ = 3 - 2 n_{x}, & (A 40) \end{matrix}

where N is the number of synapses.

5.5. Algorithm Initialization

To initialize the filter, $\hat{ξ}$ ⁺₀ and off-diagonal elements of $\hat{P}$ ⁺₀ were set to zero. The diagonal elements of $\hat{P}$ ⁺₀ corresponding to fast states (PSPs and their derivatives) were set to the variances of the states obtained from forward simulations. The initial variance estimate for the slow states (connectivity parameters) were set by recognizing that the variance of each PSP in the state vector is proportional to the amplitude of the connectivity parameter that is associated with that particular connection. Therefore, the initial estimation variance for each connectivity parameter was set to be proportional (by a scaling parameter) to the variance of the associated PSP obtained from forward simulation. Scaling parameters were chosen for each connection subtype to reflect the different orders of magnitude of the connectivity strengths (shown in Table A1). The weighting for the slow state $\hat{P}$ ⁺₀ values was determined by normalizing across all the regions for connection specific PSPs; i.e., let

\begin{matrix} \begin{array}{l} β ≜ [\begin{matrix} var (v_{u p}^{1}) & var (v_{e p}^{1}) & var (v_{p i}^{1}) & var (v_{i p}^{1}) & var (v_{p e}^{1}) & var (v_{j k}^{1}) & var (v_{k j}^{1}) \\ ⋮ & ⋮ \\ var (v_{u p}^{J}) & \dots & var (v_{k j}^{J}) \end{matrix}] \\ = [\begin{matrix} Σ_{v}^{1} \\ ⋮ \\ Σ_{v}^{J} \end{matrix}] \end{array} & (A 41) \end{matrix}

for J cortical regions. The normalized matrix is given by

\begin{matrix} M = diag (∥ Σ_{v}^{1} ∥_{\infty}^{- 1}, \dots, ∥ Σ_{v}^{J} ∥_{\infty}^{- 1}) β, & (A 42) \end{matrix}

where we are normalizing using the L_∞ norm of each of the rows of β, which are denoted by Σ^j_v. The resultant matrix M is scaled to form the initial values of the variances for the connectivity estimates. The scaling values to set the values of $\hat{P}$ ⁺₀ are shown in Table A1.

TABLE A1

Table A1. Initial values for the elements of $\hat{P}$ ⁺₀ that correspond to connectivity gain estimates.

To initialize the filter values for the model and measurement variance in the Kalman filter equations (denoted Σ and R, respectively) knowledge of the forward simulation was used. The measurement variance was set to

\begin{matrix} R = σ_{y}^{2} I_{n_{y}, n_{y}}, & (A 43) \end{matrix}

where σ_y is the standard deviation of the additive measurement noise used in the forward simulation for the ECoG signal, which was 1 mV. I_{n_y, n_y} is the identity matrix and n_y is the number of measurements (i.e., the number of regions in this case).

The model uncertainty was set to

\begin{matrix} Σ = {\begin{array}{l} 10^{- 16} I_{n_{ξ}, n_{ξ}} + Q & for static parameters \\ 10^{- 16} I_{n_{ξ}, n_{ξ}} + Q + Q^{θ} & for parameter tracking \end{array}, & (A 44) \end{matrix}

where the first term on the left hand side is for numerical stability, Q is the known covariance matrix of process noise, w_t, that was used in the forward simulations, and the Q^θ term represents a constant additive covariance for parameter tracking purposes,

\begin{matrix} Q^{θ} = diag (0_{n_{x}, n_{x}}, Σ^{θ}) . & (A 45) \end{matrix}

When the filter is used to track parameter dynamics, Σ^θ is used to capture the unexpected changes (this is not necessary for the state as their dynamics are modeled, whereas parameters are assumed to be static by the filter). Σ^θ was a diagonal matrix, where for j = 1 … n_θ,

The yes notation shows that the uncertainty is proportional to the order of the connectivity gain (α_j). The coefficients can be tuned to adjust the rate of estimation convergence. The smaller value for α_up was the result of tuning based on the estimation results.

Keywords: functional connectivity, neural mass model, model inversion, Kalman filter, epilepsy, seizures, parameter estimation, effective connectivity

Citation: Freestone DR, Karoly PJ, Nešić D, Aram P, Cook MJ and Grayden DB (2014) Estimation of effective connectivity via data-driven neural modeling. Front. Neurosci. 8:383. doi: 10.3389/fnins.2014.00383

Received: 14 July 2014; Accepted: 09 November 2014;
Published online: 28 November 2014.

Edited by:

Patrick William Carney, The Florey Institute of Neuroscience and Mental Health, Australia

Reviewed by:

Klaus Lehnertz, University of Bonn, Germany
Bruce Gluckman, Penn State University, USA

Copyright © 2014 Freestone, Karoly, Nešić Aram, Cook and Grayden. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dean R. Freestone, Department of Medicine, St. Vincent's Hospital Melbourne, The University of Melbourne, 19 Regent St., Fitzroy, VIC 3065, Australia e-mail:ZGVhbnJmQHVuaW1lbGIuZWR1LmF1

^†These authors have contributed equally to this work and share first authorship.

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.