A State Space Approach to Dynamic Modeling of Mouse-Tracking Data

Calcagnì, Antonio; Lombardi, Luigi; D'Alessandro, Marco; Freuli, Francesca

doi:10.3389/fpsyg.2019.02716

ORIGINAL RESEARCH article

Front. Psychol., 17 December 2019

Sec. Quantitative Psychology and Measurement

Volume 10 - 2019 | https://doi.org/10.3389/fpsyg.2019.02716

A State Space Approach to Dynamic Modeling of Mouse-Tracking Data

1. Department of Developmental and Social Psychology, University of Padova, Padova, Italy
2. Department of Psychology and Cognitive Science, University of Trento, Trento, Italy

Article metrics

View details

Citations

7,4k

Views

1,4k

Downloads

Abstract

Mouse-tracking recording techniques are becoming very attractive in experimental psychology. They provide an effective means of enhancing the measurement of some real-time cognitive processes involved in categorization, decision-making, and lexical decision tasks. Mouse-tracking data are commonly analyzed using a two-step procedure which first summarizes individuals' hand trajectories with independent measures, and then applies standard statistical models on them. However, this approach can be problematic in many cases. In particular, it does not provide a direct way to capitalize the richness of hand movement variability within a consistent and unified representation. In this article we present a novel, unified framework for mouse-tracking data. Unlike standard approaches to mouse-tracking, our proposal uses stochastic state-space modeling to represent the observed trajectories in terms of both individual movement dynamics and experimental variables. The model is estimated via a Metropolis-Hastings algorithm coupled with a non-linear recursive filter. The characteristics and potentials of the proposed approach are illustrated using a lexical decision case study. The results highlighted how dynamic modeling of mouse-tracking data can considerably improve the analysis of mouse-tracking tasks and the conclusions researchers can draw from them.

1. Introduction

Over the last decades, the study of computer-mouse trajectories has brought to light new perspectives into the investigation of a wide range of cognitive processes [e.g., for a recent review see Freeman (2017)]. Unlike traditional behavioral measures, such as reaction times and accuracies, mouse trajectories may offer a valid and cost-effective way to measure the real-time evolution of ongoing cognitive processes during experimental tasks (Friedman et al., 2013). This has also been supported by recent researches investigating mouse-tracking in association to more consolidated experimental devices, such as eye-tracking and fMRI (e.g., Quétard et al., 2016; Stolier and Freeman, 2017). In a typical mouse-tracking experiment, participants are presented with a computer-based interface showing the stimulus at the bottom of the screen and two competing categories on the left and right top corners. Participants are asked to select the most appropriate label given the task instruction and stimulus while the x-y trajectories are instantaneously recorded. The main idea is that trajectories of reaching movements can unfold the decision process underlying the hand movement behavior. For instance, the curvature of computer-mouse trajectories might reveal competing processes activated in discriminating the two categories. Mouse-tracking has been successfully applied in several cognitive research studies, including lexical decision (Incera et al., 2017; Ke et al., 2017), social categorization (Carraro et al., 2016; Freeman et al., 2016), numerical cognition (Faulkenberry, 2014, 2016), memory (Papesh and Goldinger, 2012), moral decision (Koop, 2013), and lie detection (Monaro et al., 2017). Moreover, the availability of specialized and freely-available software for mouse-tracking experiments have strongly contributed to the wide-spread application of such a methodology in the more general psychological domain (Freeman and Ambady, 2010; Kieslich and Henninger, 2017). Recently, the debate on the nature of cognitive processes tracked by this type of reaching trajectories have also received attention from the motor control literature (Van Der Wel et al., 2009; Friedman et al., 2013).

So far, mouse-tracking data have been analyzed using simple strategies based on the conversion of x-y trajectories into summary measures, such as maximum deviation, area under the curve, response time, initiation time (Hehman et al., 2015). Although these steps are still meaningful in case of simple and well-behaved x-y trajectories, they can also provide biased results if applied to more complex and possibly noisy data. To circumvent these problems, other approaches have been proposed more recently (Cox et al., 2012; Calcagǹı et al., 2017; Krpan, 2017; Zgonnikov et al., 2017). However, also the more recent proposals require modeling empirical trajectories before the data-analysis. Although these approaches potentially provide informative results in many empirical cases, they can also suffer from a number of issues, which revolve around the reduction of x-y data to simple scalar measures. For instance, problems may arise in the case of trajectories showing multiple phases, averaging with non-homogeneous curves, and signal-noise discrimination (Calcagǹı et al., 2017). As far as we know, a proper framework to simultaneously model and analyse mouse-tracking data in a unified way is still lacking.

In this paper we describe an alternative perspective based on a state-space approach with the aim to simultaneously model and analyse mouse-tracking data. State-space models are very general time-series methods that allow estimating unobserved dynamics which gradually evolve over discrete time. As for diffusion models, which are widely used in modeling the temporal evolution of cognitive decision processes (Smith and Ratcliff, 2004), they belong to the general family of stochastic processes and offer optimal discrete approximation to many continuous differential systems used to represent dynamics with autoregressive patterns (Cox, 2017). In particular, we used a non-linear and discrete-time model that represents mouse trajectories as a function of some typical experimental manipulations. The model is estimated under a Bayesian framework, using a conjunction of a non-linear recursive filter and a Metropolis-Hastings algorithm. Data analyses is then performed using posterior distributions of model parameters (Gelman et al., 2014).

The reminder of this article is organized as follows. In section 2 we motivate our proposal by reviewing the main issues of a typical mouse-tracking experiment. In section 3 we present our proposal and describe its main characteristics. In section 4 we describe the application of our method to a psycholinguistic case study. Section 5 provides a general discussion of the results, comments and suggestions for further investigations. Section 6 concludes the article by summarizing its main findings.

2. A Motivating Example

To begin with, consider a two-choice semantic categorization task (Dale et al., 2007), in which participants have to classify semantic stimuli (e.g., name of animals) into their corresponding categories (e.g., mammal, fish). In the most typical implementation of a mouse-tracking task, participants would sit in front of a computer screen showing a resting frame (see Figure 1A). They start a trial by clicking a starting button at the bottom-center of the screen, after which they are presented with a given stimulus (e.g., hen, see Figure 1B). To finalize the trial, participants move the cursor on the screen by means of a well-tuned computer-mouse in order to reach and select one of the two labels presented on the top-left and top-right corners of the screen (e.g., mammal vs. bird, see Figure 1C). In the meanwhile, x-y coordinates, initiation time, and final clicking time are registered for each participant and trial. The basic idea is that x-y trajectories reflect the extent to which the real-time categorization response is affected by the experimental manipulation. More precisely, as a result of the assumption that co-activation of competing processes continuously drive the explicit hand response (Spivey and Dale, 2006), one would suppose to see more curved—or generally irregular—trajectories in association with stimuli showing higher ambiguity. In our case, for instance, it would be expected that atypical exemplars, such as hen, dolphin, and penguin, globally produce more curved or irregular trajectories than typical exemplars like dog, rabbit, and lion (see Figures 1D,E).

Figure 1

In the mouse-tracking literature, data analysis commonly proceeds summarizing the recorded trajectories by means of few indices, which are then used as input to standard statistical techniques. In the current example, for instance, the typicality manipulation could be tested by assessing whether the distribution of maximum deviations (i.e., the maximum curvature showed by trajectories) over trials and participants is bimodal or not (Freeman and Dale, 2013). In a similar way, linear models could be employed to test whether the typicality effect varies as a function of external covariates, such as psycholinguistic variables.

However, the two-step approach does have some issues. For instance, it lacks a way to represent both the experimental variability—that is induced by task manipulations—and individual variability—that is instead produced by individual-specific motor programs. Likewise, in some cases, it might neglect relevant characteristics of x-y data, with the consequence that similar classes of trajectories are treated as if they were different. Still, a two-step approach does ignore the data generation process underlying observed trajectories. This does not allow, for example, making predictions or simulations on new data given the experimental settings.

In the next section, we will present a dynamic probabilistic model that handles mouse-tracking data in a unified way. Our proposal is based upon a Bayesian non-linear state space approach, which offers a good compromise between model flexibility and model simplicity while overcoming many drawbacks of the standard mouse-tracking analyses.

3. State-Space Modeling of Mouse-Tracking Data

A state-space model is a mathematical description used to represent linear or generally non-linear dynamic models. In their general form, state-space systems consist of (i) a measurement density f_y(y_n; z_n, θ_y) that describes how the observed vector of data y_n at time step n is linked to a possibly underlying process z_n and (ii) a state density f_z(z_n; θ_z) describing the transition dynamics that drive the vector of states z_n. Temporal dynamics can be discrete or continuous and, in the latter case, stochastic differential equations are used to model the transition dynamics. By and large, there are two aims of any analysis involving state-space models. The first is to infer the unobserved process given the data Y = (y₀, …, y_N). This task is usually accomplished by means of filtering and smoothing procedures (Jazwinski, 2007). The second aim regards estimating the parameters (θ_y, θ_z) given the complete set of data . This is commonly performed using gradient-based methods on the likelihood of the model (Shumway and Stoffer, 1982). Although state space models were originally used in the area of aerospace modeling (Kalman, 1960), they are now applied in a wide variety of domains, including control theory, remote sensing, economics, and statistics (Hamilton, 1994; Shumway and Stoffer, 2006). Recently, there has also been an increasing interest in psychology, where state-space models have been used to analyse, for example, dyadic interactions (Song and Ferrer, 2009), affective dynamics (Lodewyckx et al., 2011; Bringmann et al., 2017), facial electromyography data (Yang and Chow, 2010), individual differences (Hamaker and Grasman, 2012; Chow and Zhang, 2013), and path analysis (Gu et al., 2014).

In line with this, we developed a state-space representation to simultaneously model and analyse mouse-tracking data. In particular, our proposal is to represent the empirical collection of computer-mouse trajectories as a function of two independent sub-models, one representing the experimental manipulations (stimuli equation) and the other capturing the main features of the mouse movement process (states equation). Thus, the goal of our analysis is 2-fold: (i) to determine the states equation for each participant over a set of experimental trials, (ii) to estimate the parameters governing the stimuli equation. The first goal will provide information on how participants differ from each other in terms of movement dynamics. By contrast, the second goal will find out to what extent the experimental manipulations affect the individual variations in producing mouse-tracking responses.

3.1. Data

Let S be a I (individuals) × J (trials) array representing the observed data. The element s_ij of S defines the sub-array containing the streaming of Cartesian coordinates of the computer mouse movements:

with 0 and N_ij being the first and the last coordinates for the i-th participant in the j-th trial. The coordinates in s_ij are temporally ordered (0 < … < n < … < N_ij) because they are usually collected while the computer-mouse is moving along its surface with a constant sampling rate. Further, to make the observed data comparable, we rescale and normalize s_ij as a function of a common ordered scale, which indicates the cumulative amount of progressive time from 0% to N = 100% (e.g., Tanawongsuwan and Bobick, 2001; Ramsay and Silverman, 2007). Thus, the final trajectories s_ij lie on the real plane defined by the hyper-rectangles [−1, 1]^N × [0, 1]^N, with the first movement being equal to by convention. Since we are interested in studying the co-activation of competing processes as reflected in some spatial properties of the response—such as location, direction, and amplitude of the action dynamics (Spivey and Dale, 2006; Freeman, 2017)—we need to simplify the original data structure so that these properties can easily emerge. Inspired by earlier work on this problem (Gowayyed et al., 2013; Kapsouras and Nikolaidis, 2014; Calcagǹı et al., 2017), we reduce the dimensionality of the data by projecting s_ij in a proper lower-dimensional subspace of movement via the restricted four-quadrant inverse tangent mapping [atan2, see Burger and Burge (2010)] from the real coordinates to the interval [0, π]^N as follows:

where y₀ is the angle at the beginning of the process whereas y_N is the angle at the end of the process. Figure 1F shows a graphical example of the atan2 function for a hypothetical movement path. Finally, the array of angles y_ij is the input for our state-space model.

3.2. Model Representation

The unobserved states equation of the model is a AR(1) Gaussian model Z_{i, n}|Z_{i, n−1} with transition density equal to:

which models how the movement process of the i-th subject changes from the step n−1 to the next step n. The stochastic dynamics for the i-th subject is constrained by the variance parameter that represents the uncertainty about the future location z_{i, n + 1} given the current state z_{i, n}.

The measurement equation for the observations y_ij = (y₀, …, y_n, …, y_N) is modeled by means of a two-component von-Mises mixture distribution with density equal to:

where the generic density is the standard von-Mises law:

In the density formula, the term is the exponentially scaled Bessel function of order zero (Abramowitz and Stegun, 1972). The parameters of the mixture density are mapped to the experimental interface of the two-choice categorization task (see Figures 1D,E). In particular, the means are mapped to the label categories C1 and C2 whereas the concentrations indicate how the observations are spread around the means. Since {μ₁, μ₂} are determined by the fixed and known positions of the labels C1 and C2 on the screen, they are not treated as parameters to be estimated. Finally, the terms π_ijn and (1 − π_ijn) are the probabilities to activate the first and second density of the von-Mises components and are expressed as function of the latent states z_{i,0 :N} and some additional covariates. The model is Markovian, in the sense that the unobserved states {Z_n; n > 1} form a Markov sequence and the measurements {Y_n; n > 1} are conditionally independent given the unobserved states.

To further characterize our state-space representation, the probability π_ijn is defined according to a logistic function:

with β _j ∈ ℝ being the intercept of the model. Equation (3) can be interpreted as the probability for the i-th subject at step n to categorize the j-th stimuli as belonging to C1 (π_ijn tends to 1) or C2 (π_ijn tends to 0). In addition, when the categories C1 and C2 are expressed in terms of distractor and target (Freeman, 2017), the sequences π_{ij, 0:N} can be interpreted as the attraction probability that the distractor has exerted on the trajectory process z_{i,0 :N}.

The state-space representation is completed by linearly expanding the intercept term β_j as follows:

where , x_j is an element of the array x ∈ ℝ^J, whereas d_jk is an element of the (Boolean) partition matrix D_{J × K}, with d_jk = 1 indicating whether the j-th stimulus belongs to the k-th level of the variable D. Note that the matrix D satisfies the property , for all j = 1, …, J. In our model representation, Equation (4) is the stimuli equation and conveys information about the experiment. It consists of three main terms. (i) A categorical term describing how the stimuli = {1, …, j, …, J} have been arranged into K < J distinct levels of a categorical variable D. (ii) A continuous term x_jη that expresses the stimuli as a function of a continuous variable X weighted by the coefficient η. (iii) An interaction term between the levels of D and X, where δ_k ∈ ℝ and δ₁ = 0. This definition allows for modeling all the cases implied by an univariate experimental design with at most one covariate variable. Indeed, for η = 0 and δ_K = 0_K this formulation boils down to the simplest experimental case with a single categorical variable D. By contrast, for δ_K = 0_K and γ_K = 0_K it reduces to the case where stimuli are simply paired with a continuous predictor X. Finally, when D_{J × K} = I_{J × J}, the stimuli equation reduces to the most simple case where we have as many parameters as trials¹. Figure 2 shows a graphical representation of state-space model whereas Figure 3 illustrates the inner-working of the model for the simplest design with a two-level experimental factor.

Figure 2

Figure 3

In our model representation, the observed movement angles y_{ij, 0:N} are sampled from C1 (resp. C2) with probabilities equal to π_{ij, 0:N} (resp. ), which in turn are expressed as a function of the AR(1) latent trajectory z_{i,0 :N}. Hence, an increase in the latent process z_{i, n} > 0 will also increase the probability that y_ijn is sampled from the hemispace C1 (e.g., π_ijn > 0.5). By contrast, a decrease in the latent process z_{i, n} < 0 will increase the chance to sample y_ijn from C2 (e.g., π_ijn < 0.5). As a result of Equation (4) such an increasing (or decreasing) pattern can be modulated by the stimuli component β_J. Moreover, as the coefficients β_J are decomposed as a function of η, γ_K, and δ_K, we can also analyse the effect of β_J on π_{ij, 0:N} in terms of the experimental manipulation D, the covariate X, or the interaction term DX. Figure 3 shows a conceptual representation of the modeling steps involved by our approach. Panel (A) shows an example of the random-walk used to represent the movement process (Equation 3). Instead, panel (B) shows the logistic function used to form the stimuli equation (Equation 3) for two typical cases of β_J. Panel (C) represents the probability π_{ij, 0:N} to activate the distractor C1 (upper panel) and the probability to activate the target C2 (lower panel) as a function of z_{i,0 :N} and β_J. Finally, panel (D) depicts two cases of observed radians that are associated to π_{ij, 0:N} and . In particular, the upper panel shows an example of data with a pronounced attraction toward C1, which is in turn reflected by the blue probability curve of the panels (B,C). By contrast, the lower panel represents data with little attraction toward C1, as also reflected by the red probability curve of the panels (B,C). In this sense, as Equation (3) represents an intercept model, the parameter β_J does not affect the shape of the movement dynamics z_{i,0 :N}. On the contrary, it acts by shifting the movement dynamics upward (β < 0) or downward (β > 0) toward the C1 or C2 hemispaces, respectively.

3.3. Model Identification

State-space model identification consists of inferring the unobserved sequence of states by means of filtering and smoothing algorithms and estimating the model's parameters via Likelihood-based approximations (Shumway and Stoffer, 2006; Särkkä, 2013). For instance, in the simplest linear gaussian case, where both the states and measurement equations are linear with additive gaussian noise, inference of latent states is usually performed via Kalman filter whereas parameter's estimation is realized with the Expectation-Maximization algorithm. In our case, as Equations (3) and (4) describe a more complex non-linear model, we adopted a recursive Gaussian approximation filter for the inference problem (Smith and Brown, 2003), coupled with a marginal Metropolis-Hastings MCMC for the parameters estimation (Andrieu et al., 2010)².

To formulate the problem more precisely, let:

be the arrays representing all the J × 2 unknown parameters and I × N unobserved states that characterize the model's behavior. In this context, can be set to 1_I without loss of model adequacy³. The joint log-density of the complete-data given the array of parameters and the observed data is defined as follows:

and the state and measurement equations are given as in (1) and (2) whereas the term f(z_ij0|θ_Z₀) is the a-priori density function for the initial state of the process. Note that the factorization (9) is due to the Markovian properties of the model. By adopting the Bayesian perspective, we perform inference conditional on the observed sample of angles Y, with Θ being an unknown term. The posterior density f(Z, Θ|Y) of hidden states and parameters is as follows:

where f(Θ) is a prior density ascribed on the vector of model's parameters Θ. Note that Equation (10) comes from the standard conditional definition f(Z, Θ, Y) = f(Z, Θ, Y)/f(Y), where the joint density f(Z, Θ, Y) is re-arranged by factorization using the Markovian properties of the model (e.g., see Andrieu et al., 2010). Since our aim is to get samples from the posterior f(Z, Θ|Y), we proceed by jointly updating Θ and Z using a marginal Metropolis-Hastings. This alternates between proposing a candidate sample Θ^(t) given Θ^(t−1) and filtering the sequences Z^(t) conditioned on Θ^(t). Finally, the candidate couple (Θ^(t), Z^(t)) is jointly evaluated by the Metropolis-Hasting ratio.

The evaluation of both the densities f(Z|Y) and f(Θ|Y) involve computing the expression in Equation (11). To do so, we derived the first term by means of filtering and smoothing procedures (Jazwinski, 2007) whereas the second term was evaluated by implementing a Metropolis-Hasting algorithm. All the technical steps for the model identification are included in Appendices A–C whereas all the algorithms are freely available at https://github.com/antcalcagni/SSM_mousetracking.

3.4. Model Evaluation

The state-space model formulated can be evaluated in many ways under the Bayesian framework of analysis (Shiffrin et al., 2008; Gelman et al., 2014). For instance, adequacy of the algorithm can be assessed via standard diagnostic measures, such as traceplot of the chains, autocorrelation measures, and the Gelman-Rubin statistics whereas the recovery of the true model structure can be done by simulations from the priors ascribed to the model (Gelman et al., 2014). Similarly, the adequacy of the model to reproduce the observed data can be assessed by means of simulation-based procedures (e.g., posterior predictive checks) where the fitted model is used to generate new simulated datasets that are then compared to the observed data (e.g., see Gelman et al., 1996; Cook et al., 2006). In our context, the robustness of the model formulation in recovering the true model structure as well as the goodness of fit to the observed data are assessed by adopting a simulation-based approach. Technical details on this procedure are available in Supplementary Materials.

3.5. Model Summary

Table 1 shows a summary of the components of the complete state-space model used throughout the paper, including observed and latent variables, parameters and their support spaces.

Table 1

i ∈ {1, …, I}	Set index for individuals
j ∈ {1, …, J}	Set index for trials
n ∈ {0, …, N}	Set index for (discrete) time points
	Observed N × 1 array of mouse-tracking data
	N × 1 Array of latent states to be inferred
x ∈ ℝ^J	Observed covariate of the experiment
Y_ijn ~ mix-vonMises (μ₁, μ₂, κ₁, κ₂)	Random variable governing the realization of y_ijn
Z_in\|Z_{i, n−1} ~ N(Z_{i, n−1}, σ_i)	AR(1) random process governing the realization of z_in
	Fixed parameters of the mixture von-Mises law (true locations of the stimuli)
	Parameters of the von-Mises law (precision)
	Fixed parameter of the Gaussian law (standard deviation)
	N × 1 array of attraction probability (i.e., probability to activate distractor vs. target hemi-space)
β ∈ ℝ^J	J × 1 array of coefficients (intercepts) modeling the experimental design
β = Dγ + x(η + Dγ)	Linear equation governing the experimental design in terms of additive and interaction effects
D ∈ ℝ^{J × K}	J × K Boolean partition matrix for the experimental design
γ ∈ ℝ^K	K × 1 array of coefficients for the additive components of the experimental design
(η, δ) ∈ ℝ^{K + 1}	(K + 1) × 1 array of coefficients for the interaction components of the experimental design
	Complete array of parameters to be estimated (some of them can be set to zero, depending on the experimental design)

Model summary: observed and latent variables, parameters, and equations of the state-space model formulated for the analysis of mouse-tracking trajectory.

4. Application

In this section, we will present an application of the model to the analysis of an already published lexical decision dataset (Barca and Pezzulo, 2012). The state-space modeling framework will be evaluated via three different instances of model representation with an increasing level of complexity. Note that the application we report here has only an illustrative purpose with the main goal to introduce and highlight the interpretation of the model's parameters and the flexibility of its representation with dynamic data. All the models were estimated using 20 (chains) × 10,000 (iterations), with a burning-in period of 2500 iterations. Starting values θ₀ for the MH algorithm were determined by maximizing the observed likelihood of the model in Equation (2). Similarly, the starting covariance matrix Σ⁽⁰⁾ was computed by using the Hessian of the observed likelihood at θ₀. The adaptive phase of the MH algorithm was performed at fixed interval t + H (with H = 25) to prevent the degeneracy of the adaptation. For each model, the prior densities were defined as f(θ) = (μ = 0, 1σ² = 25), where the variance was sufficiently large to cover the natural range of the model parameters. The adequacy of the model to reproduce the data was evaluated with a simulation-based approach, where a series of M = 5, 000 new datasets were generated through the fitted model and compared with the observed data Y (e.g., see Cook et al., 2006). The goodness of fit was evaluated overall (i.e., the adequacy of the model to reproduce the complete observed matrix Y) and subject-based (i.e., the adequacy of the model to reproduce for each subject i = 1, …, I the observed matrix Y_i). Comparisons were computed by means of 0–100% normalized measures, with 0% indicating bad fit and 100% optimal fit. Technical details as well as extended graphical results are included in Supplementary Materials.

4.1. General Context and Motivation

Lexical decision is one of the most known and widely used task to study visual world recognition and reading in the cognitive psycholinguistic literature (Norris and Kinoshita, 2008; Yap et al., 2008; Hawkins et al., 2012). Generally, this task is very simple and versatile and provides an ideal context for applying the state-space modeling framework when lexical decision data are collected via the mouse tracking paradigm. In this application, we evaluated the extent to which the parameters of the state-space model reflect eventual differences associated with the manipulation of a stimulus type factor composed by words (with either high-frequency or low-frequency) and random strings (i.e., random sequence of letters that are phonotactically illegal in the language) in the lexical decision task. Moreover, we will take advantage of this psycholinguistic case study to show how our state-space model can deal with both categorical and (pseudo)quantitative predictive variables considered either individually or in interaction in the model. In particular, the first model instance will illustrate the application of our modeling framework when a simple categorical variable (stimulus type factor) is considered to affect the observed mouse-tracking trajectories collected using the lexical decision task. By contrast, the second model will be based on a simple regression-type model with a single quantitative independent variable (bigram frequency) used to predict the attraction toward the distractor category. Finally, the third model will integrate these two variables (stimulus type factor and bigram frequency) into a unified model including the main effects of the two variables as well as their interaction. In our context, the first two models will be considered as simple toy examples to illustrate the main features of the state-space model representation when applied to real data, whereas the third model will be discussed in more details according to a group analysis evaluation as well as an individual analysis representation.

4.2. Model 1

4.2.1. Data Structure and Variables

In the original work by Barca and Pezzulo (2012), the lexical decision experiment was run in Italian and based only on one stimulus type factor with four different levels: Words of high written frequency (HF, e.g., acqua “water”), words of low written frequency (LF, e.g., cervo “deer”), pseudowords (PW, e.g., “dorto”), and strings of letters that are orthographically illegal in Italian (NW, e.g., “btfpr”). In their study, participants saw a total of 96 stimuli, one at the time, and were required to categorize each stimulus as either a word or a non-word by using the mouse-tracking paradigm. Trajectories were recorded using the Mouse Tracker software (Freeman and Ambady, 2010) with sampling rate of ~70 Hz (Barca and Pezzulo, 2012). As usual, raw trajectories were normalized into N = 101 time steps using linear interpolation with equal spaces between coordinate samples. However, for our analysis we preferred to select only three of the four levels of the experimental factor (that is to say, HF,LF, and NW) for a total of 72 stimuli equally distributed within each level⁴. Finally, the dependent variable Y of Model 1 consisted of the movement angles array associated with the mouse-movement trajectory recorded for each distinct stimulus in the stimulus set.

4.2.2. Data Analysis and Results

In this first model the term β_j in the stimuli equation boils down to the simple expression:

where the indices k = 1, 2, 3 refer to HF, LF, and NW stimuli. The MCMC convergences of the algorithm are reported in Supplementary Materials. The model fitted the data very satisfactorily, with overall fit of 84% and subject-based fit of 74% (see Table 2). The posterior quantiles (5, 50, and 95%) are reported in Table 3 whereas Figure 4A shows the probability graph, that is to say, the probability to activate the distractor cue for each of the three levels HF, LF, NW as a function of the latent variable Z.

Table 2

	Overall (%)	By-subject (%)
Model 1	84	78
Model 2	73	70
Model 3	75	71

Application: adequacy of the model to reproduce the observed matrices Y (overall fit) and Y_i (by-subject fit).

All the measures are normalized in the range 0% (bad fit)–100% (optimal fit). See Supplementary Materials for technical details.

Table 3


Model 1	q_0.05	1.224	1.234	1.211
	μ	1.323	1.337	1.310
	q_0.975	1.443	1.457	1.432
		1.003	1.002	1.003
Model 2	q_0.05				0.063
	μ				0.078
	q_0.975				0.091
					1.001
Model 3	q_0.05	0.083	1.130	1.217	0.305	−0.505	−0.437
	μ	0.341	1.300	1.314	0.402	−0.385	−0.336
	q_0.975	0.605	1.468	1.411	0.500	−0.269	−0.235
		1.008	1.013	1.012	1.011	1.013	1.010
All the models

Application: posterior means (μ), 95% posterior intervals ([q_0.05, q_0.975]), and Gelman-Rubin index for the estimated parameters of Models 1–3.

Figure 4

The results of this first analysis clearly show that the dynamics of the state-space model were unaffected by the different categories represented in the recoded experimental factor. This pattern finds further support in the post-hoc comparisons between the three experimental conditions (Figure 4B). In sum, these findings indicate that for a dynamic model represented according to a state-space modeling framework, the three stimulus categories (HF, LF, and NW) were all processed in a very similar way, as the original trajectories were not sufficiently different among the three stimulus categories. In substantive terms, the results of the categorical model showed how the attraction probability toward the distractor was definitively modest in all the three experimental conditions. This is evident from a direct inspection of Figure 4B where the probability activation function (logistic function) is shifted toward right (Z > 0) which in turn means that the average activation of the distractor category was relatively poor in HF, LF, and NW items. In this respect, the results of our simple spatial model were partially at odds with the outcomes observed using temporal measures (response time variables) (Barca and Pezzulo, 2012).

4.3. Model 2

4.3.1. Data Structure and Variables

Also for the second model, the dependent variable was represented by the movement angles array Y. However, unlike model 1, in model 2 the original independent categorical variable (stymulus type factor) was replaced with a quantitative psycholinguistic variable called bigram frequency. Bigram frequency is defined as the frequency with which adjacent pairs of letters (bigrams) occur in printed texts; for its characteristics, it may be considered as a measure of orthographic typicality (e.g., see Hauk et al., 2006). In this second application, only bigram frequency was used as quantitative variable, since it was the only psycholinguistic variable that could be computed for all the 72 stimuli in the stimulus set. This second model instance nicely provides a simple but effective example of application of our state-space model when a continuous variable is considered to predict the attraction toward distractor.

4.3.2. Data Analysis and Results

In model 2 the term β_j simply reduces to:

as the first and third terms in formula (4) cancel out. In this case, the variable x_j denotes the value of the bigram frequency for stimulus j in the stimulus set. For the model results, the posterior quantiles are reported in Table 3 whereas MCMC convergences of the algorithm are reported in Supplementary Materials. Also in this case, the model fitted the data very well, with overall fit of 73% and subject-based fit of 70% (see Table 2). Figure 5 shows the probability graph for model 2. This graph represents the probability to activate the distractor hemispace at three representative levels of the variable, i.e., the lowest, the medium, and the highest values of bigram. As evident from the graph, bigram frequency affects the probability to activate the target, with a higher probability for stimuli with low bigram frequency.

Figure 5

In substantive terms, the results of the quantitative model supported the evidence that the attraction probability toward the distractor was slightly affected by the specific value of the quantitative predictor (bigram frequency). In particular, low-level bigram frequencies were characterized by an average larger activation probability (0.55) for the distractor, whereas medium or large frequencies were associated with a logistic function slightly shifted toward positive values of the latent space Z > 0, thus reflecting a lower chance for the distractor category (average activation probability of 0.45). Moreover, by an inspection of the contingency table for the joint representation of bigram frequency (as a transformed categorical variable) and stimulus type, we noted that low bigram frequency values were mainly characterized by string letters (NW: 94%) whereas high bigram frequency values were predominantly associated with high frequency words (HF: 55%) or low frequency words (LF: 44%).

4.4. Model 3

4.4.1. Data Structure and Variables

The final and more complex model included both the three-level categorical predictor (stimulus type factor: HF, LF, STR) and the continuous predictor (bigram frequency) as well as the interaction term between these two variables. The dependent variable was the movement angles array Y.

4.4.2. Data Analysis and Results

The stimuli equation which characterizes the third model is defined as follows:

The MCMC diagnostics together with the estimated marginal posterior densities for the model's parameters are reported in Supplementary Materials. The model fit was good, with an overall fit of 75% whereas the subject-based fit was equal to 71% (see Table 2). The posterior quantiles are reported in Table 3. Figure 6 shows the probability graph for model 3. This graph represents the probability to activate the distractor hemispace for each of the three levels HF, LF, NW of the categorical factor as a function of the latent variable Z and three distinct levels (high, medium, and low) for bigram frequency. The inspection of Figure 6 shows a clear interaction between stimulus type factor and bigram frequency indicating that the impact of stimulus category, in particular word frequency, increases with the decrease of stimulus bigram frequency. In other words, at high level of bigram frequency, the probability to activate the distractor is similarly low in all conditions (0.17 ≤ p-distractor ≤ 0.2). By contrast, when bigram frequency decreases—that is stimuli become orthographically atypical—the probability of distractor activation increases, but only for the more lexically-familiar stimuli, i.e., words of high frequency (e.g., p-distractor raises from 0.17 to 0.70, in low and high bigram frequency condition, respectively).

Figure 6

Finally, it is also worth mentioning the emergence of the main effect of stimulus category which was instead missing in model 1. By a quick inspection of Figure 7, one may clearly observe that HF words differ from both LF words and letter strings (NW), whereas LF words and letter strings do not differ with respect to the probability of activation of the distractor hemispace. Interestingly, the addition of the covariate bigram frequency in the model allowed the main effect of stimulus category to show up. Indeed, while at the medium and high levels of bigram frequency the results are in line with those observed at a sample level in the original study (see Figures 1, 2, 5 in the paper Barca and Pezzulo, 2012) and in a recent re-analysis (see Table 2 in the paper Calcagǹı et al., 2017), in the case of low bigram the probability to activate the distractor increases with respect to high frequency words (HF). This might be somewhat related to a moderate difficulty in the orthographic processing of low frequency bigram words (e.g., see Rastle and Davis, 2008) even in the case of stimuli with richer lexical representation.

Figure 7

4.5. Profiles Analysis

To further investigate the dynamic characteristics involved in the lexical decision task, we extend here the results of the third model to include also a profiles analysis. Figure 8 shows the estimated latent movement states Z_{I × N} for all the participants involved in the study. The profiles appear regular, as they evolve smoothly toward the target cue (T). We grouped the dynamics into four well-separated clusters (Figure 8, smallest panels on the right) according to their functional similarities (Ramsay and Silverman, 2007). Particularly, the first group is characterized by a higher exploration of the distractor's hemispace, especially in the first 30% of the process. The same applies to the third and fourth groups, although they show a gradual activation of the distractor. Finally, the second group clearly represents those profiles with no uncertainty in the categorization process, as they show no activation of the distractor's hemispace at all. Although well-separated among them, these clusters still show some level of inner heterogeneity (for example, see group 1 and 4). To study this latter issue in terms of experimental manipulations, we focused on group 1 and considered the low vs. high frequency conditions (HF vs. LF). We also selected the middle phase of the process (Δ = 30−50%), where it is expected to observe larger cognitive competitions in the categorization (Barca and Pezzulo, 2012). Figure 9 shows the participants' profiles in terms of attraction probability π_{4 × N} for the two lexical conditions. As expected, the profiles differ between these conditions, with LF eliciting higher attraction probability. This is in line with the fact that low frequency words have a weaker lexical representation than high frequency stimuli and consequently they are more difficult to process (Barca and Pezzulo, 2012). Interestingly, the individual profiles also differ in the way they activate the distractor. For instance, the participant 6 had higher probability in both LF (p_Δ(D) = 0.67) and HF (p_Δ(D) = 0.54) conditions whereas the participant 7 had a more pronounced activation just in the LF condition (p_Δ(D) = 0.57) than HF (p_Δ(D) = 0.43). Similarly, participants 6 and 7 seemed to prolong the competing dynamics up to the 50% of the process, by contrast participants 8 and 15 seemed to resolve the lexical competition earlier as showed by the abrupt decreasing of their curves. We complete our analysis by evaluating how individual profiles are linked to empirical measurements. Figure 10 represents this scenario for two stimuli belonging to HF and LF conditions. As we can notice, the curves present the same dynamics (due to the latent states z_{i,0 :N}) although they clearly differ in terms of attraction exerted by the stimulus (due to the β component of the model). In this case, the LF stimulus produced larger conflict than HF in the lexical categorization. This is evident when we turn back to the observed data: as expected, the rose diagrams of LF showed larger directions in the distractor's hemispace.

Figure 8

Figure 9

Figure 10

5. Discussion

We have described a new approach to model and analyse dynamic data coming from mouse-tracking experiments. Our proposal took the advantages of a state-space representation, in which the observed data Y were thought as being function of two independent sub-models, one representing the movement process and its properties (Z) and the second modeling the two-choice experimental task (β) according to which the data were collected. These sub-models were integrated by means of an inverse-logit function (π) that expressed how the experimental manipulations acted on the movement processes in selecting the final correct response against the competing one. This formulation was flexible enough to take into account the complexity of some dynamic behaviors showed by the reaching trajectories. Moreover, it allowed for separately accounting for the motor heterogeneity and experimental variability in Y. Indeed, when β = 1β₀ our state-space representation simply reduced to a model where the experimental manipulations had no relevant effect in reproducing the observed data. This instance has been illustrated in section 4.3 (Model 1). In this case, as Z = 0 was not allowed in our model formulation, all the variability of Y can be ascribed to Z. This is relevant in view of the fact that movement variability may reflect only individual motor executions in absence of any experimental manipulations (Yu et al., 2007). The movement component Z was modeled to be Markovian with gaussian transition density.

Although more complex models can be used to represent movement dynamics, simple random walks still allows a great deal of flexibility in modeling reaching trajectories under weak assumptions on the movement behavior (e.g., see Yu et al., 2007; Paninski et al., 2010). In particular, in the case of mouse-tracking tasks, they allow representations of the following three properties: (i) Each movement is goal-oriented as individuals have to finalize the action by clicking on one of the two categories shown on the screen. (ii) Mouse-tracking trajectories generally start at rest, proceed out in the movement space, and end at rest. (iii) Hand trajectories tend to be smooth during the reaching process, i.e., small changes in the interval [n, n + 1] are more likely than large and abrupt changes (Brockwell et al., 2004; Spivey et al., 2010). The stimuli component β was defined to be a linear combination of information typically involved in a univariate design, namely a categorical variable D containing the levels of the experimental factor and a continuous covariate X. This gave researchers the opportunity to additionally analyse which component of the experimental design is relevant in producing the effect of β on Y. The data-generation process was defined according to a mixture of two von-Mises distributions representing the categories of a two-choice categorization task. Among others, we chose this distribution because it provides a flexible representation for angular ordered data, especially because it simplifies mathematical computations involved in the model's derivation (e.g., see McClintock et al., 2012; Mulder and Klugkist, 2017).

There are other existing methods that offer alternative ways to model mouse-tracking data. For instance, Van Der Wel et al. (2009) proposed the use of the movement superposition model (Henis and Flash, 1995) to model and analyse the typical two-choice lexical decision task. In particular, they modeled mouse-tracking trajectories by representing the complete hand movement as a summation of sub-movements, which were obtained by the solution of the minimum-jerk equation for the standard reaching trajectory (i.e., a movement characterized by a bell-shaped speed profile that minimizes the sum of the squared rates of jerks over the movement duration). Similarly, Friedman et al. (2013) discussed how an intermittent model of arms movement can be used for reaching trajectories in random-dot experiments. They used both Wiener's diffusion process and Flash and Hogan's movement equation to predict reaction times (RTs) and movement data. Their goal was to assess the link between movement trajectories and underlying cognitive processing. Our model differs in some respects from these works. With regards to Van Der Wel et al. (2009), for instance, we used a stochastic state-space approach to model the movement trajectories instead of deterministic equations. Instead, with respect to Friedman et al. (2013), we tailor-made our model to a typical two-choice categorization task, making use of few assumptions on the nature of the movement process [as those implied by the Gaussian AR(1) process]. By and large, our goal was not to provide a mathematical representation of the cognitive components underpinning mouse-trajectories since the model does not describe the cognitive processing per se. By contrast, we simply provided a statistical model for the analysis of mouse-tracking data, which can offer a good compromise between data modeling and data analysis.

5.1. Model's Advantages and Limitations

Our non-linear state-space model has several advantages. For instance, when comparing with the standard approaches, our proposal provides a unified analytic framework to simultaneously model and analyse trajectories data. By modeling movement heterogeneity and task variability together, we can evaluate how experimental variables directly act on the observed series of trajectories, with no need to use any kind of summary measures. An additional advantage of our model concerns the study of individual differences in terms of latent dynamics. While this is impractical in standard two-step approaches, in our proposal researchers can assess individual variations by studying the movement profiles once they are estimated. For instance, they can be analyzed in terms of similarity/dissimilarity with regards to external individual covariates (e.g., vocabulary knowledge and bilingualism in psycholinguistic experiments; IQ, risk-taking propensity, or more generally clinical variables in decision-making tasks). Still, individual dynamics can be compared each other qualitatively in terms of chance to activate the distractor or target cues. As the dynamics are normalized on a common cumulative scale, researchers can assess whether the chance to activate the distractor cue at a certain percentage of the process and for an experimental manipulation, is particularly higher in a sub-group of participants (this case, for example, has been described in section 4.6).

As for any modeling approach, also the current proposal can potentially suffer from some limitations. A first limitation concerns the only-intercept model π(Z, β) used to integrate individual dynamics and experimental information. Although this was enough to represent whether or not certain stimuli can increase the probability to select the distractor cue, we may want to known whether some experimental manipulations can modify the individual dynamics as well. However, this would particularly pronounce the computational costs required for the model identification (especially with regards to filtering), as we need to appropriately generalize Equation (3) to include more parameters. Lastly, in the current study we used univariate non-linear state-space models to represent individual dynamics for the sake of parsimony. However, more complicated situations may require models including further movement characteristics like step-length, velocity, acceleration, and jerk (Kulkarni and Paninski, 2008), which may be modeled as statistical constraints of the model (Ciavolino and Calcagǹı, 2014; Calcagǹı et al., 2017).

5.2. Further Extensions

Our non-linear state-space model can be improved in many aspects. For instance, the stimuli equation (4) can be generalized to cope with more complex experimental designs, like those involving multiple factors and covariates together with high-order interaction terms. Likewise, the current model restrictions can be relaxed to allow changes in slopes of π(Z, β) as a function of the experimental stimuli. Further, the development of a hierarchical representation of the model, with a random-effect component in the state Equation (3), would offer a way to model the inter-individual variations as resulting from an underlying common population. Still, the development of a multivariate state-space model to include other movement components will surely be considered a future extension of the present work. Further studies may lead to generalize the AR(1) process used for the movement dynamics to include former knowledge on the deterministic constraints of the hand movement as those used, for instance, by Van Der Wel et al. (2009) and Friedman et al. (2013). Moreover, further studies may also lead to generalize the AR(1) process used for the movement dynamics to include former knowledge on the deterministic constraints of the hand movement as those used, for instance, by Van Der Wel et al. (2009) and Friedman et al. (2013). Finally, an open issue which deserves greater consideration in future investigations is the need for a formal comparative framework with which we may eventually contrast and compare spatial modeling perspectives (like the one presented in this contribution) and currently used methods for analyzing mouse tracking data based on descriptive statistics (e.g., see Freeman, 2017).

6. Conclusions

In this paper we presented a novel and comprehensive analytic framework for modeling and analyse mouse-tracking trajectories. In particular, a non-linear state-space approach was used to model the observed trajectories as a function of both individual movement dynamics and experimental variables. Model identification was performed under the umbrella of Bayesian methods, in which a Metropolis-Hastings algorithm was coupled with a recursive gaussian approximation filter to get posterior distributions of model parameters. For the sake of illustration, we applied our new approach to a real mouse-tracking dataset concerning a two-choice lexical categorization task. The results indicated how our proposal can provide valuable insights to assess the dynamics involved in the decision task and identify how the experimental variables significantly contributed to the observed movement heterogeneity. Moreover, the analysis of individual profiles allowed for comprehensive and reliable identification of individual and group-based differences in the dynamics of decision making.

In conclusion, we think that this work yielded interesting findings in the development of computational models able to capture the unfolding high-level cognitive processes as reflected by motor executions which are typically involved in mouse-tracking tasks. To our knowledge, this is the first time that mouse-tracking data are fully modeled and analyzed within a process-oriented approach. We believe our contribution will offer a novel strategy that may help cognitive researchers to understand the roles of cognition and action in mouse-tracking based experiments.

Statements

Data availability statement

The datasets analyzed in this manuscript are not publicly available. Requests to access the datasets should be directed to Laura Barca, laurabarcapst@gmail.com.

Author contributions

AC and LL contributed to the conceptualization and statistical modelization. AC, MD'A, and LL performed the main statistical analyses. AC and MD'A performed the supplementary statistical analyses. FF contributed to the statistical analyses and revised supplementary statistical analyses. MD'A contributed to the section Model Evaluation. FF contributed to the section Application. AC wrote the first draft of the study. AC, LL, MD'A, and FF revised the final manuscript.

Acknowledgments

The authors thank Dr. Simone Sulpizio for many helpful discussions on earlier versions of the manuscript and Dr. Laura Barca for providing the dataset used in the case study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2019.02716/full#supplementary-material

Footnotes

1.^To understand the meaning of the stimuli equation, consider the case of an experiment with a two-level manipulated factor A and B, each with twenty stimuli. In this case, K = 2 and J = 20 whereas D_20×2 is the design matrix codifying which stimulus belongs to level A (d_j1 = 1, d_j2 = 0) or level B (d_j1 = 0, d_j2 = 1). In the simple additive case, the stimuli equation is β_20×1 = D_20×2γ_2×1 where γ contains the coefficients associated to the experimental levels A and B. If we also have an external covariate x on the stimuli, we can include this information in the stimuli equation in two ways: (i) as additive component β_20×1 = D_20×2γ_2×1 + ηx_20×1, (ii) by including an interaction term β_20×1 = D_20×2γ_2×1 + ηx_20×1 + (x_20×1 ⊙ D_20×2δ_2×1), where δ now codifies the interaction between the covariate and the levels A and B included in D (note that ⊙ is the element-wise product). For further information on how codify categorical and continuous variables, see Fox (1997).

2.^Interestingly, this version of the MCMC algorithm can be subsumed into the more general family of particle-Metropolis Hasting (PMH) which, in turns, is a special case of Multiple Try Metropolis (MTM) techniques. For a broader review of these connections, see Martino (2018).

3.^Indeed, the constraint still guarantees the mapping π_ijn:ℝ → [0, 1] to cover the needed time-to-time variability of the random walk, as Equation (3) acts as a shrinkage operator on the support of the r.vs {Z_{i, 0}, …, Z_{i, n}}. This has also been confirmed by several pilot simulations we ran on our model. Note that this assumption is not overly limiting, since our state-space representation is built under the smoothness assumption on the movement behavior of the hand, according to which large abrupt changes in the small interval [n, n + 1] are not allowed (Yu et al., 2007).

4.^The motivation for this selection was due to some technical reasons regarding the lack of design balance in the original dataset, as the PW level showed a large number of errors when compared with the other three categories. In addition, the three-level representation of the stimulus type factor simplifies the interpretation of the results when we consider the full model with interaction.

References

1
AbramowitzM.StegunI. A. (1972). Handbook of Mathematical Functions With Formulas, Graphs, and Mathematical Tables. New York, NY: Dover.
- Google Scholar
2
AndrieuC.DoucetA.HolensteinR. (2010). Particle markov chain monte carlo methods. J. R. Stat. Soc. B Stat. Methodol.72, 269–342. 10.1111/j.1467-9868.2009.00736.x
- CrossRef
- Google Scholar
3
AndrieuC.RobertsG. O. (2009). The pseudo-marginal approach for efficient monte carlo computations. Ann. Stat.37, 697–725. 10.1214/07-AOS574
- CrossRef
- Google Scholar
4
AnsleyC. F.KohnR. (1982). A geometrical derivation of the fixed interval smoothing algorithm. Biometrika69, 486–487.
- Google Scholar
5
BanerjeeA.DhillonI. S.GhoshJ.SraS. (2005). Clustering on the unit hypersphere using von mises-fisher distributions. J. Mach. Learn. Res.6, 1345–1382.
- Google Scholar
6
BarcaL.PezzuloG. (2012). Unfolding visual lexical decision in time. PLoS ONE7:e35932. 10.1371/journal.pone.0035932
7
BringmannL. F.HamakerE. L.VigoD. E.AubertA.BorsboomD.TuerlinckxF. (2017). Changing dynamics: time-varying autoregressive models using generalized additive modeling. Psychol. Methods22, 409–425. 10.1037/met0000085
8
BrockwellA. E.RojasA. L.KassR. (2004). Recursive bayesian decoding of motor cortical signals by particle filtering. J. Neurophysiol.91, 1899–1907. 10.1152/jn.00438.2003
9
BurgerW.BurgeM. J. (2010). Principles of Digital Image Processing: Fundamental Techniques. London: Springer Science & Business Media.
- Google Scholar
10
CabrasS.NuedaM. E. C.RuliE. (2015). Approximate bayesian computation by modelling summary statistics in a quasi-likelihood framework. Bayesian Anal.10, 411–439. 10.1214/14-BA921
- CrossRef
- Google Scholar
11
CalcagnìA.LombardiL.SulpizioS. (2017). Analyzing spatial data from mouse tracker methodology: an entropic approach. Behav. Res. Methods49, 2012–2030. 10.3758/s13428-016-0839-5
12
CarraroL.CastelliL.NegriP. (2016). The hand in motion of liberals and conservatives reveals the differential processing of positive and negative information. Acta Psychol.168, 78–84. 10.1016/j.actpsy.2016.04.006
13
ChowS.-M.ZhangG. (2013). Nonlinear regime-switching state-space (RSSS) models. Psychometrika78, 740–768. 10.1007/s11336-013-9330-8
14
CiavolinoE.CalcagnìA. (2014). A generalized maximum entropy (GME) approach for crisp-input/fuzzy-output regression model. Qual. Quant.48, 3401–3414. 10.1007/s11135-013-9963-9
- CrossRef
- Google Scholar
15
CookS. R.GelmanA.RubinD. B. (2006). Validation of software for Bayesian models using posterior quantiles. J. Comput. Graph. Stat.15, 675–692. 10.1198/106186006X136976
- CrossRef
- Google Scholar
16
CoxD. R. (2017). The Theory of Stochastic Processes. New York, NY: Routledge.
- Google Scholar
17
CoxG.KachergisG.ShiffrinR. (2012). Gaussian process regression for trajectory analysis, inProceedings of the Cognitive Science Society, Austin, TX: Cognitive Science Society, Vol. 34.
- Google Scholar
18
DaleR.KehoeC.SpiveyM. J. (2007). Graded motor responses in the time course of categorizing atypical exemplars. Mem. Cogn.35, 15–28. 10.3758/BF03195938
19
De JongP. (1988). The likelihood for a state space model. Biometrika75, 165–169.
- Google Scholar
20
FaulkenberryT. J. (2014). Hand movements reflect competitive processing in numerical cognition. Can. J. Exp. Psychol.68, 147–151. 10.1037/cep0000021
21
FaulkenberryT. J. (2016). Testing a direct mapping versus competition account of response dynamics in number comparison. J. Cogn. Psychol.28, 825–842. 10.1080/20445911.2016.1191504
- CrossRef
- Google Scholar
22
FoxJ. (1997). Applied Regression Analysis, Linear Models, and Related Methods.Thousand Oaks, CA: Sage Publications, Inc.
- Google Scholar
23
FreemanJ. B. (2017). Doing psychological science by hand. Curr. Dir. Psychol. Sci.27, 315–323. 10.1177/0963721417746793
24
FreemanJ. B.AmbadyN. (2010). Mousetracker: software for studying real-time mental processing using a computer mouse-tracking method. Behav. Res. Methods42, 226–241. 10.3758/BRM.42.1.226
25
FreemanJ. B.DaleR. (2013). Assessing bimodality to detect the presence of a dual cognitive process. Behav. Res. Methods45, 83–97. 10.3758/s13428-012-0225-x
26
FreemanJ. B.PaukerK.SanchezD. T. (2016). A perceptual pathway to bias: interracial exposure reduces abrupt shifts in real-time race perception that predict mixed-race bias. Psychol. Sci.27, 502–517. 10.1177/0956797615627418
27
FriedmanJ.BrownS.FinkbeinerM. (2013). Linking cognitive and reaching trajectories via intermittent movement control. J. Math. Psychol.57, 140–151. 10.1016/j.jmp.2013.06.005
- CrossRef
- Google Scholar
28
GelmanA.CarlinJ. B.SternH. S.DunsonD. B.VehtariA.RubinD. B. (2014). Bayesian Data Analysis, Vol. 2. Boca Raton, FL: CRC Press.
- Google Scholar
29
GelmanA.MengX.-L.SternH. (1996). Posterior predictive assessment of model fitness via realized discrepancies. Stat. Sinica6, 733–760.
- Google Scholar
30
GowayyedM. A.TorkiM.HusseinM. E.El-SabanM. (2013). Histogram of oriented displacements (HOD): describing trajectories of human joints for action recognition, in Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence (AAAI Press), 1351–1357.
- Google Scholar
31
GuF.PreacherK. J.FerrerE. (2014). A state space modeling approach to mediation analysis. J. Educ. Behav. Stat.39, 117–143. 10.3102/1076998614524823
- CrossRef
- Google Scholar
32
HaarioH.SaksmanE.TamminenJ. (2001). An adaptive metropolis algorithm. Bernoulli7, 223–242. 10.2307/3318737
- CrossRef
- Google Scholar
33
HamakerE.GrasmanR. (2012). Regime switching state-space models applied to psychological processes: handling missing data and making inferences. Psychometrika77, 400–422. 10.1007/s11336-012-9254-8
- CrossRef
- Google Scholar
34
HamiltonJ. D. (1994). State-space models. Handb. Econometr.4, 3039–3080.
- Google Scholar
35
HaukO.DavisM. H.FordM.PulvermüllerF.Marslen-WilsonW. D. (2006). The time course of visual word recognition as revealed by linear regression analysis of ERP data. Neuroimage30, 1383–1400. 10.1016/j.neuroimage.2005.11.048
36
HawkinsG.BrownS. D.SteyversM.WagenmakersE.-J. (2012). Decision speed induces context effects in choice. Exp. Psychol.59, 206–215. 10.1027/1618-3169/a000145
37
HehmanE.StolierR. M.FreemanJ. B. (2015). Advanced mouse-tracking analytic techniques for enhancing psychological science. Group Process. Intergr. Relat.18, 384–401. 10.1177/1368430214538325
- CrossRef
- Google Scholar
38
HenisE. A.FlashT. (1995). Mechanisms underlying the generation of averaged modified trajectories. Biol. Cybernet.72, 407–419.
- Google Scholar
39
InceraS.ShahA. P.McLennanC. T.WetzelM. T. (2017). Sentence context influences the subjective perception of foreign accents. Acta Psychol.172, 71–76. 10.1016/j.actpsy.2016.11.011
40
JazwinskiA. H. (2007). Stochastic Processes and Filtering Theory. New York, NY: Courier Corporation.
- Google Scholar
41
KalmanR. E. (1960). A new approach to linear filtering and prediction problems. J. Basic Eng.82, 35–45.
- Google Scholar
42
KapsourasI.NikolaidisN. (2014). Action recognition on motion capture data using a dynemes and forward differences representation. J. Visual Commun. Image Represent.25, 1432–1445. 10.1016/j.jvcir.2014.04.007
- CrossRef
- Google Scholar
43
KeA. H.EpsteinS. D.LewisR.PiresA. (2017). The quantificational domain of DOU: an experimental study. J. Psycholinguist. Res.47, 537–556. 10.1007/s10936-017-9532-9
44
KieslichP. J.HenningerF. (2017). Mousetrap: an integrated, open-source mouse-tracking package. Behav. Res. Methods49, 1652–1667. 10.3758/s13428-017-0900-z
45
KoopG. J. (2013). An assessment of the temporal dynamics of moral decisions. Judg. Decis. Making8, 527–539.
- Google Scholar
46
KrpanD. (2017). Behavioral priming 2.0: enter a dynamical systems perspective. Front. Psychol.8:1204. 10.3389/fpsyg.2017.01204
47
KulkarniJ. E.PaninskiL. (2008). State-space decoding of goal-directed movements. IEEE Signal Process. Mag.25, 78–86. 10.1109/MSP.2008.4408444
- CrossRef
- Google Scholar
48
LodewyckxT.TuerlinckxF.KuppensP.AllenN.SheeberL. (2011). A hierarchical state space approach to affective dynamics. J. Math. Psychol.55, 68–83. 10.1016/j.jmp.2010.08.004
49
LuengoD.MartinoL. (2013). Fully adaptive gaussian mixture metropolis-hastings algorithm, in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE), 6148–6152.
- Google Scholar
50
MarinJ.-M.MengersenK.RobertC. P. (2005). Bayesian modelling and inference on mixtures of distributions. Handb. Stat.25, 459–507. 10.1016/S0169-7161(05)25016-2
- CrossRef
- Google Scholar
51
MartinoL. (2018). A review of multiple try mcmc algorithms for signal processing. Digital Signal Process.75, 134–152. 10.1016/j.dsp.2018.01.004
- CrossRef
- Google Scholar
52
McClintockB. T.KingR.ThomasL.MatthiopoulosJ.McConnellB. J.MoralesJ. M. (2012). A general discrete-time modeling framework for animal movement using multistate random walks. Ecol. Monogr.82, 335–349. 10.1890/11-0326.1
- CrossRef
- Google Scholar
53
MendelJ. M. (1995). Lessons in Estimation Theory for Signal Processing, Communications, and Control. Upper Saddle River, NJ: Pearson Education.
- Google Scholar
54
MonaroM.GamberiniL.SartoriG. (2017). The detection of faked identity using unexpected questions and mouse dynamics. PLoS ONE12:e0177851. 10.1371/journal.pone.0177851
55
MulderK.KlugkistI. (2017). Bayesian estimation and hypothesis tests for a circular generalized linear model. J. Math. Psychol.80, 4–14. 10.1016/j.jmp.2017.07.001
- CrossRef
- Google Scholar
56
NorrisD.KinoshitaS. (2008). Perception as evidence accumulation and bayesian inference: insights from masked priming. J. Exp. Psychol. Gen.137, 434–455. 10.1037/a0012799
57
PaninskiL.AhmadianY.FerreiraD. G.KoyamaS.Rahnama RadK.VidneM.et al. (2010). A new look at state-space models for neural data. J. Comput. Neurosci.29, 107–126. 10.1007/s10827-009-0179-x
58
PapeshM. H.GoldingerS. D. (2012). Memory in motion: movement dynamics reveal memory strength. Psychonom. Bull. Rev.19, 906–913. 10.3758/s13423-012-0281-3
59
QuétardB.QuintonJ. C.MermillodM.BarcaL.PezzuloG.ColombM.et al. (2016). Differential effects of visual uncertainty and contextual guidance on perceptual decisions: evidence from eye and mouse tracking in visual search. J. Vis.16, 28–28. 10.1167/16.11.28
60
RamsayJ. O.SilvermanB. W. (2007). Applied Functional Data Analysis: Methods and Case Studies. Springer.
- Google Scholar
61
RastleK.DavisM. H. (2008). Morphological decomposition based on the analysis of orthography. Lang. Cogn. Process.23, 942–971. 10.1080/01690960802069730
- CrossRef
- Google Scholar
62
SärkkäS. (2013). Bayesian Filtering and Smoothing, Vol. 3. New York, NY: Cambridge University Press.
- Google Scholar
63
ShampineL. F. (2008). Vectorized adaptive quadrature in matlab. J. Comput. Appl. Math.211, 131–140. 10.1016/j.cam.2006.11.021
- CrossRef
- Google Scholar
64
ShiffrinR. M.LeeM. D.KimW.WagenmakersE.-J. (2008). A survey of model evaluation approaches with a tutorial on hierarchical bayesian methods. Cogn. Sci.32, 1248–1284. 10.1080/03640210802414826
65
ShumwayR. H.StofferD. S. (1982). An approach to time series smoothing and forecasting using the em algorithm. J. Time Series Anal.3, 253–264.
- Google Scholar
66
ShumwayR. H.StofferD. S. (2006). Time Series Analysis and Its Applications: With R Examples. Springer Science & Business Media.
- Google Scholar
67
SmithA. C.BrownE. N. (2003). Estimating a state-space model from point process observations. Neural Comput.15, 965–991. 10.1162/089976603765202622
68
SmithP. L.RatcliffR. (2004). Psychology and neurobiology of simple decisions. Trends Neurosci.27, 161–168. 10.1016/j.tins.2004.01.006
69
SongH.FerrerE. (2009). State-space modeling of dynamic psychological processes via the kalman smoother algorithm: rationale, finite sample properties, and applications. Struct. Equat. Model.16, 338–363. 10.1080/10705510902751432
- CrossRef
- Google Scholar
70
SpiveyM. J.DaleR. (2006). Continuous dynamics in real-time cognition. Curr. Dir. Psychol. Sci.15, 207–211. 10.1111/j.1467-8721.2006.00437.x
- CrossRef
- Google Scholar
71
SpiveyM. J.DaleR.KnoblichG.GrosjeanM. (2010). Do curved reaching movements emerge from competing perceptions? A reply to van der wel et al. (2009). J. Exp. Psychol.36, 251–254. 10.1037/a0017170
72
StolierR. M.FreemanJ. B. (2017). A neural mechanism of social categorization. J. Neurosci.37, 5711–5721. 10.1523/JNEUROSCI.3334-16.2017
73
TanawongsuwanR.BobickA. (2001). Gait recognition from time-normalized joint-angle trajectories in the walking plane, in Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001. CVPR 2001, Vol. 2.
- Google Scholar
74
TannerM. A. (1991). Tools for Statistical Inference, Vol. 3. New York, NY: Springer.
- Google Scholar
75
Van Der WelR. P.EderJ. R.MitchelA. D.WalshM. M.RosenbaumD. A. (2009). Trajectories emerging from discrete versus continuous processing models in phonological competitor tasks: a commentary on Spivey, Grosjean, and Knoblich (2005). J. Exp. Psychol.35, 588–594. 10.1037/0096-1523.35.2.588
76
YangM.ChowS.-M. (2010). Using state-space model with regime switching to represent the dynamics of facial electromyography (EMG) data. Psychometrika75, 744–771. 10.1007/s11336-010-9176-2
- CrossRef
- Google Scholar
77
YapM. J.BalotaD. A.TseC.-S.BesnerD. (2008). On the additive effects of stimulus quality and word frequency in lexical decision: evidence for opposing interactive influences revealed by RT distributional analyses. J. Exp. Psychol. Learn. Mem. Cogn.34, 495–513. 10.1037/0278-7393.34.3.495
78
YuB. M.KemereC.SanthanamG.AfsharA.RyuS. I.MengT. H.et al. (2007). Mixture of trajectory models for neural decoding of goal-directed movements. J. Neurophysiol.97, 3763–3780. 10.1152/jn.00482.2006
79
ZgonnikovA.AleniA.PiiroinenP.O'HoraD.di BernardoM. (2017). Decision landscapes: visualizing mouse-tracking data. Open Sci.4:170482. 10.1098/rsos.170482

Summary

Keywords

mouse tracking, state space modeling, dynamic systems, categorization task, aimed movements, Bayesian filtering

Citation

Calcagnì A, Lombardi L, D'Alessandro M and Freuli F (2019) A State Space Approach to Dynamic Modeling of Mouse-Tracking Data. Front. Psychol. 10:2716. doi: 10.3389/fpsyg.2019.02716

Received

05 August 2019

Accepted

18 November 2019

Published

17 December 2019

Volume

10 - 2019

Edited by

Pietro Cipresso, Italian Auxological Institute (IRCCS), Italy

Reviewed by

Raydonal Ospina, Federal University of Pernambuco, Brazil; Petri Ala-Laurila, University of Helsinki, Finland; Luca Martino, Rey Juan Carlos University, Spain

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Antonio Calcagnì antonio.calcagni@unipd.it

This article was submitted to Quantitative Psychology and Measurement, a section of the journal Frontiers in Psychology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

A State Space Approach to Dynamic Modeling of Mouse-Tracking Data

Abstract

1. Introduction

2. A Motivating Example

3. State-Space Modeling of Mouse-Tracking Data

3.1. Data

3.2. Model Representation

3.3. Model Identification

3.4. Model Evaluation

3.5. Model Summary

4. Application

4.1. General Context and Motivation

4.2. Model 1

4.2.1. Data Structure and Variables

4.2.2. Data Analysis and Results

4.3. Model 2

4.3.1. Data Structure and Variables

4.3.2. Data Analysis and Results

4.4. Model 3

4.4.1. Data Structure and Variables

4.4.2. Data Analysis and Results

4.5. Profiles Analysis

5. Discussion

5.1. Model's Advantages and Limitations

5.2. Further Extensions

6. Conclusions

Statements

Data availability statement

Author contributions

Acknowledgments

Conflict of interest

Supplementary material

Footnotes

References

Summary

Outline

Figures

Cite article

Share article

Article metrics