Fast mental states decoding in mixed reality

De Massari, Daniele; Pacheco, Daniel; Malekshahi, Rahim; Betella, Alberto; Verschure, Paul F. M. J.; Birbaumer, Niels; Caria, Andrea

doi:10.3389/fnbeh.2014.00415

ORIGINAL RESEARCH article

Front. Behav. Neurosci., 27 November 2014

Sec. Learning and Memory

Volume 8 - 2014 | https://doi.org/10.3389/fnbeh.2014.00415

This article is part of the Research TopicLearned brain self-regulation for emotional processing and attentional modulation: from theory to clinical applicationsView all 28 articles

Fast mental states decoding in mixed reality

Daniele De Massari^1,2^†

Daniel Pacheco³^†

Rahim Malekshahi^1,4

Alberto Betella³

Paul F. M. J. Verschure^3,5

Niels Birbaumer^1,2

Andrea Caria^1,2^*

¹Institut für Medizinische Psychologie und Verhaltensneurobiologie, Universität Tübingen, Tübingen, Germany
²Fondazione Ospedale San Camillo, Istituto di Ricovero e Cura a Carattere Scientifico, Venezia, Italy
³SPECS - Laboratory of Synthetic Perceptive, Emotive and Cognitive Systems, Department of Technology, Center of Autonomous Systems and Neurorobotics, Universitat Pompeu Fabra, Barcelona, Spain
⁴Graduate School of Neural & Behavioural Sciences, International Max Planck Research School, Tübingen, Germany
⁵Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain

The combination of Brain-Computer Interface (BCI) technology, allowing online monitoring and decoding of brain activity, with virtual and mixed reality (MR) systems may help to shape and guide implicit and explicit learning using ecological scenarios. Real-time information of ongoing brain states acquired through BCI might be exploited for controlling data presentation in virtual environments. Brain states discrimination during mixed reality experience is thus critical for adapting specific data features to contingent brain activity. In this study we recorded electroencephalographic (EEG) data while participants experienced MR scenarios implemented through the eXperience Induction Machine (XIM). The XIM is a novel framework modeling the integration of a sensing system that evaluates and measures physiological and psychological states with a number of actuators and effectors that coherently reacts to the user's actions. We then assessed continuous EEG-based discrimination of spatial navigation, reading and calculation performed in MR, using linear discriminant analysis (LDA) and support vector machine (SVM) classifiers. Dynamic single trial classification showed high accuracy of LDA and SVM classifiers in detecting multiple brain states as well as in differentiating between high and low mental workload, using a 5 s time-window shifting every 200 ms. Our results indicate overall better performance of LDA with respect to SVM and suggest applicability of our approach in a BCI-controlled MR scenario. Ultimately, successful prediction of brain states might be used to drive adaptation of data representation in order to boost information processing in MR.

Introduction

Mixed Reality (MR) is a type of virtual reality-related technology where real and virtual worlds are merged so that real-time interaction with both physical and digital objects (Milgram, 1994; Bohil et al., 2011) is achievable. A particularly promising MR system is the eXperience Induction Machine (XIM) (Bernardet et al., 2011; Omedas et al., 2014). This technology permits to model representational elements analog to real phenomena as well as highly abstract non-representation forms describing complex high-dimensional data in a controlled environment. The exploration of data in XIM is conceptualized as an integrative narrative of varying forms where implicit and explicit responses as well as neurophysiological signals from the user can be utilized to modulate data representation (Bernardet et al., 2011; Lessiter et al., 2011; Verschure, 2011; Omedas et al., 2014).

It has been proposed that the combination of Brain-Computer Interface (BCI) technology, allowing online monitoring and decoding of mental states (Muller et al., 2008; Blankertz et al., 2010), with virtual reality systems may help to shape and guide implicit and explicit learning using ecological scenarios (Lécuyer et al., 2008; Lotte et al., 2013). Online analysis of specific brain activity has mainly been used in BCI applications for communication and control of external devices (Birbaumer and Cohen, 2007; Daly and Wolpaw, 2008), as well as for shaping behavior through neurofeedback paradigms (Delorme and Makeig, 2003; Shibata et al., 2011; Yoo et al., 2011; Caria et al., 2012; Scharnowski et al., 2012). Alternatively, the information acquired with BCI can be used to support human-computer interaction, in applications where its content is adapted to user's implicit interest, as well as for adaptive automation, affective computing, or video games (George, 2010). BCI-based real-time analysis of brain signals, with no need of participants to learn their control (sometimes referred to as “passive” BCI), can additionally be utilized to manipulate behavioral response by delivering information according to specific mental states.

Using this approach, enhancing and depressing learning and memory formation was demonstrated by triggering stimuli presentation during brain states favoring or reducing learning, which were assessed through online detection of activity in the bilateral parahippocampal areas with real-time fMRI (Yoo et al., 2011). In a similar fashion, presentation of external inputs during specific phases of neuroelectric activity might enhance or reduce participant's response.

Successful electroencephalographic (EEG)-based detection of brain states predicting participants' errors during complex cognitive decision tasks has been shown (Eichele et al., 2010). Several studies also demonstrated that brain states preceding stimulus presentation significantly affect perception (Arieli et al., 1996; Boly et al., 2007; Fox and Raichle, 2007; Fox et al., 2007; Busch et al., 2009; Mathewson et al., 2009). Building on these results, real-time information of ongoing brain states might be exploited for controlling data presentation in MR in order to boost information processing. For instance, detection of specific brain states might be used to drive changes in the level of complexity of presented information to facilitate participants' perception.

Toward this aim, we assessed to what extent multiple brain states can be discriminated during MR experience. Previous studies in the visual domain showed that real-time analysis of visual evoked potentials can detect fluctuations of perceptual dominance of each eye during binocular rivalry (Brown and Norcia, 1997). In the field of motor imagery-based BCI Millán and colleagues proposed a simple local neural classifier for the recognition of multiple mental tasks from on-line spontaneous EEG signal that achieved a recognition rate of 70% in distinguishing between relaxation, left and right hand movement imagination (Millán et al., 2002).

However, to date, clear evidence of classification of brain states during different cognitive tasks for BCI control of virtual and MR environments is still lacking. Most of studies on the integration of BCI with virtual reality focused on motor imagery, P300 and steady-state visual evoked potentials (SSVEPs) (Lotte et al., 2013). In particular, SSVEPs, permitting high information transfer rates and minimal training, seem to be suitable for BCI in virtual and MR (Martinez et al., 2007; Faller et al., 2010). Though, BCI based on continuous EEG decoding might be more flexible for monitoring brain activity during natural behavior in virtual and MR applications.

In our study, we tested classification of continuous EEG signal during spatial navigation, calculation and reading toward the implementation of BCI-controlled XIM-based MR. Spatial navigation represents a typical category of actions in virtual reality (Lécuyer et al., 2008; Lotte et al., 2013), while calculation and reading are fundamental tasks performed during information processing and data mining, and are also common cognitive processes used for mental workload assessment (Kohlmorgen et al., 2007). In particular, we performed EEG data classification using supervised classifiers based on linear discriminant analysis (LDA) and support vector machine (SVM). Furthermore, we examined predictive accuracy of our classifiers of mental workload in XIM. Increased mental workload was expected during calculation and reading as compared to spatial navigation because of larger involvement of working memory (Mayes and Koonce, 2001; Destefano, 2004; Imbo et al., 2007).

Based on previous studies showing large inter-individual differences in single-trial EEG classification of mental states in real operational environments (Kohlmorgen et al., 2007), we have used a flexible approach and calibrated our classifiers to each participant. Dynamic single trial classification was conducted using a sliding time-window shifting every 200 ms to permit applicability in a BCI-controlled MR scenario.

Materials and Methods

Five participants (29.60 ± 6.73 mean age ± SD, 1 female) underwent two consecutive sessions in a MR environment during which EEG signal was acquired continuously. The experiment was performed in the XIM (Bernardet et al., 2011; Omedas et al., 2014). The XIM architecture is an integrated framework that combines a sensing system to evaluate and measure complex psychological states with a number of actuators and effectors to coherently react to the user's actions (Figure 1). The internal processing of XIM is based on three main components. First, adaptive data mining that defines what data is presented to the user. Second, spatio-temporal structuring of the presented content in the form of narratives generated by the composition engines, and third an intentional, sentient agent, who controls the XIM interface and guides data exploration. XIM covers a surface area of 5.5 × 5.5 m, with a height of 4 m. Eight video projectors display the scenarios into four projection screens (2.25 × 5 m) surrounding the MR room. During each session participants experienced three different conditions, all involving the visual system: spatial navigation (SPN), reading (MER), and calculation (MEC). The user, sitting on a chair positioned in the middle of the XIM room, could navigate the virtual space by pressing the arrow keys of a keyboard. Participants were required to navigate a squared spiral labyrinth until the central point (indicated by a yellow sphere) (Figure 2). Nine different targets represented by red spheres were placed in alternating corners of the path. Proximity to red spheres triggered the beginning of a different condition. In the first session the condition consisted of a 30 s calculation task. When the participant reached the red sphere, screen went black and a random 3-digit number was displayed in the graphical interface. The participant was asked to iteratively subtract 17 from a given number. After 30 s, the black screen faded out and the participant was asked to continue spatial navigation. In the second session, the condition consisted of a 30 reading. The introduction of a scientific article was displayed and the participant was required to read it and press the space keyboard command when finished. In each session 9 SPN conditions were alternated to 8 MER or MEC conditions. An immersive 180° virtual reality application was developed and projected into the back screens of the XIM room. The VR application was developed using the Unity3D Game Engine, and adapted to fit the displays of XIM (Bernardet et al., 2011; Omedas et al., 2014). A virtual maze was modeled using Autodesk Maya (Autodesk Inc., San Rafael, CA, USA). The labyrinth size was 10 × 10 VR units (VR units permit to assign any type of units to objects' properties, e.g., weight, distance, etc., in our case they are defined as meters). The environment was constructed as an extension of the real physical space of the XIM—wall, floor, and other virtual objects were modeled so that they were perceived of real size (i.e., the point of view of the participant experiencing MR was equivalent to that of a person with average height). All participants were appropriately instructed about the experimental procedure. This study was approved by the ethics committee of the University of Tübingen.

FIGURE 1

Figure 1. The eXperience Induction Machine (XIM) architecture is a mixed reality integrated framework that combines a sensing system to evaluate and measure complex physiological and psychological states with a number of actuators and effectors to coherently react to the user's actions. It is mainly constituted of an immersive room that covers a surface area of 5.5 × 5.5 m, with a height of 4 m. Eight video projectors display the scenarios into four projection screens (2.25 × 5 m) surrounding the MR room. Reprinted with permission from Betella et al. (2014).

FIGURE 2

Figure 2. Top: The immersive XIM modeling the virtual maze used in the experiment (left). Center: View from the top of the labyrinth and the nine different targets (red spheres) that were placed in alternating corners of the path. The labyrinth size was 10 × 10 VR units (meters). Participants were required to navigate the squared spiral labyrinth until the central point (yellow sphere). Proximity to red spheres triggered the beginning of a different condition. In the first session the condition consisted of a 30 s calculation task. When the participant reached the red sphere, screen went black and a random 3-digit number was displayed in the graphical interface. The participant was asked to iteratively subtract 17 from a given number. After 30 s, the black screen faded out and the participant was asked to continue spatial navigation. In the second session, the condition consisted of a 30 s reading. The introduction of a scientific article was displayed and the participant was required to read it and press the space keyboard command when finished. Bottom: First person perspective of the labyrinth and a red sphere.

EEG Data Acquisition and Preprocessing

EEG data were recorded using a 64-channels BrainAmp amplifier (Brain Products GmbH, Munich Germany). An actiCap 64-channels EEG cap (modified 10–20 system, Brain Products GmbH, Munich Germany) was used for data acquisition, referenced to the FCz, and grounded anteriorly to Fz. Only 28 surface active electrodes at the following locations were used: Fp1, Fp2, F7, F3, Fz, F4, F8, Fc5, Fc1, Fc2, Fc6, T7, C3, Cz, C4, T8, Cp5, Cp1, Cp2, Cp6, P7, P3, Pz, P4, P8, O1, Oz, O2. Electrodes impedance was reduced to 15 kΩ before data recording. EEG signals were sampled at 250 Hz.

EEG signal was first visually inspected to exclude channels affected by artifacts. Spectral analysis was then conducted on each channel to prevent our classifiers from being affected by large muscle and eye artifacts. To this aim we explored differences between SPN (low cognitive load) and MER + MEC (high cognitive load) conditions focusing on the frequencies above 20 Hz (typical of muscles artifacts) and below 6 Hz (typical of eye artifacts) (Kohlmorgen et al., 2007). The channels showing a significant difference between different workload conditions in the selected frequency bands were discarded (Kohlmorgen et al., 2007). As in Kohlmorgen et al. (2007), for each subject a customized feature selection—channel subsets, spatial filtering, frequency bands, and window lengths—was performed based on the following set of parameters: EEG channels subset {FC#, C#, P#, CP#}, {F#, FC#, C#, P#, CP#, O#}, {F#, FC#, C#, P#, CP#, O#, T7, T8}, {FC#, C#, P#, CP#, T7, T8}; spatial filter: common median reference or none; frequency band for spectral estimation: 3–15, 7–15, 10–15, 3–10 Hz; window length: 2 or 5 s. Feature extraction was performed by computing a spectral estimation within a dynamic sliding window approach shifting every 200 ms. EEG data analysis and classification were performed using MATLAB (The Mathworks, Natick, MA).

Classification and Physiological Analyses

A SVM (from LIBSVM library, http://www.csie.ntu.edu.tw/~cjlin/libsvm/faq.html#f203) with non-linear kernel (radial basis function, rbf) and a LDA classifier were tested on each subject. For SVM classification, the regularization parameter C was set to 0.6 in order to prevent over-fitting, (Cherkassky and Ma, 2004). Two different classification schemes were used. In the first scheme, the classifiers aimed to distinguish between three different classes: spatial navigation, reading and calculation. The most common decomposing strategies for multiclass SVM are the “one against one” and “one against all” binary classification approaches. The former implies the training of k*(k–1)/2 different binary classifiers (where k is the number of classes); each new test sample is then labeled according to the class selected by the majority of classifiers. The latter implies to build a classifier per each class to distinguish the samples of the selected class from the samples of all remaining classes; a new test sample is labeled according to the maximum outcome of all trained SVMs. A comparison of several multi-class SVM methods showed similar performance for the “one-against-all,” “one-against-one” and directed acyclic graph SVM (DAGSVM) (Hsu and Lin, 2002). However, the authors suggested that one-against-one and DAG approaches are more suitable for practical use due to reduced training time. Accordingly, we adopted the “one against one” strategy. For each subject 8 MEC, 8 MER, and 18 SPN blocks were recorded in the two sessions. A leave-one-out cross validation (CV) was used for performing feature selection and for measuring classifiers performance (see Figure 3). One MEC, one MER, and two SPN blocks were pseudo-randomly selected and retained for testing classifiers performance, while the remaining blocks were fed into a 7-fold cross-validation scheme for performing parameter selection. This procedure satisfies the requirement of independency between parameter selection samples and testing samples for classifier performance assessment. The 7-fold CV scheme was performed for each combination of parameters (i.e., channel subset, spatial filter, frequency band, and window length) to select the parameters' combination providing the best 7-fold CV accuracy (see Table 1). In each fold one MEC, one MER, and two SPN blocks were retained to compute the accuracy of the model trained on the other 6 MEC, 6 MER, and 14 SPN blocks. As duration of SPN blocks was approximately half the duration of MEC, a comparable number of samples was obtained by using for SPN double the number of MEC or MER blocks. For measuring classifiers performance the CV procedure was repeated 7 times with different training subset for each class. The 7 accuracy values were then averaged to provide the final CV accuracy.

FIGURE 3

Figure 3. Flow diagram depicting feature selection and estimation of classification performance for scheme 1. A similar flow diagram was used for scheme 2 but using a different numbers of blocks and classes (HW and LW conditions).

TABLE 1

Table 1. Selected parameters in scheme 1 (top) and scheme 2 for each subject (bottom).

A second classification scheme aimed to test generalization capability of the classifier in discriminating between high (MER and MEC) and low (SPN) workload (LW and HW) independently of the type of workload. This additional scheme consisted in the application of a 11-fold CV on the merged MEC and MER datasets. The merged dataset was divided into two parts: four blocks (1 LW and 1 HW block from the MEC dataset and 1 LW and 1 HW block from the MER dataset) were pseudo-randomly selected for estimating classifier performance; the remaining 16 LW and 14 HW were fed into a 11-fold CV scheme aiming to select the best parameter setting for each subject. During each fold 14 LW and 12 HW blocks were used for training and the remaining 2 LW and 2 HW blocks for testing. The 11-fold CV scheme was performed for each combination of parameters (i.e., channel subset, spatial filter, frequency band, and window length). The parameters' combination providing the best 11-fold CV accuracy was then employed to test classifier performance on the retained four pseudo-randomly blocks (see Table 1). Features selection for the first and second classification scheme included a 5 s window length and the common median reference (in 15 out of 20 cases).

Feature selection process allowed to determine two most discriminant channels' subsets out of the four considered: {F#, FC#, C#, P#, CP#, O#} and {F#, FC#, C#, P#, CP#, O#, T7, T8} (see Table 1).

The Matthews correlation coefficient (MCC) was additionally computed, as it guarantees more robustness to performance variability in binary classification accuracy by taking into account differences in data dimensionality (Baldi et al., 2000). MCC ranges between -1 and 1, from total disagreement to agreement between prediction and observation, respectively, and with 0 indicating completely random prediction.

EEG spectral differences among the three conditions were inspected considering 3–7 and 8–12 Hz bands. Spectral differences between HW and LW conditions were assessed using a non-parametric Wilcoxon signed-rank test considering each of the 26 channels of all participants for 3–7 and 8–12 Hz bands separately.

Results

Classification performance of continuous EEG data during the three mental states SPN, MEC and MER is reported in Table 2. The LDA based classifier generated on average the highest accuracy (83.30%, MCC = 0.72) across all subjects, with peaks of 89.72% for accuracy and 0.84 for MCC in subject 2. The SVM based classifier generated on average the lower accuracy (65.68%, MCC = 0.45). The results of the classification between HW and LW are reported in Table 3. As for the first scheme, LDA yielded on average the highest accuracy (88.56%, MCC = 0.74) across all subjects, with peaks of 96.92% for accuracy and 0.92 for MCC in subject 1. SVM yielded on average a lower accuracy (86.59%) and a lower MCC value (0.63). Visual inspection of EEG power spectrum at representative discriminative channels (Fz and Pz, as they are usually less affected by muscle artifacts) showed power changes at theta frequencies (3–7 Hz) and at alpha band (8–12 Hz) among all three conditions (see Figure 4A). Moreover, increased alpha band power in frontal regions and theta band power in the frontal and parietal regions was measured during reading and calculation with respect to spatial navigation (see Figure 4B). Topographic distribution of the 8–12 Hz band power difference comparing HW to LW conditions showed positive peaks of neuroelectric activity in the left frontal area, F7, and bilateral parietal regions, Cp1 and Cp2 (p < 0.001, see larger dots in Figure 5, right). Topographic distribution of the 3–7 Hz band power difference comparing HW to LW conditions shows a significant positive peak in the left frontal area (F7) and several negative peaks in central and parietal areas (p < 0.001, see larger dots in Figure 5, left), except channels F8, Fc6, T7, C3, C4, T8, Cp1, Pz.

TABLE 2

Table 2. Results of the classification of spatial navigation, reading, and calculation.

TABLE 3

Table 3. Results of the classification between LW and HW.

FIGURE 4

Figure 4. (A) Power spectra (grand average across all subjects) of two discriminative channels of all conditions in frontal and parietal areas (Fz and Pz). Solid, dashed, and dotted lines represent the grand average power spectrum for SPN, MER, and MEC tasks, respectively. Fz shows a clear power difference among all conditions in the 3–7 Hz band, whereas Pz shows a clear power difference in the 8–12 Hz band. (B) Power spectra (grand average across all subjects) of channels showing positive peaks during high (HW) as compared to low (LW) mental workload (left frontal area, F7, and central parietal region, Cp2).

FIGURE 5

Figure 5. Topographic distribution of the spectral difference in the 3–7 Hz band (left) and in the 8–12 Hz band (right) between HW and LW (grand average across all subjects). Larger dots indicate those channels where a significant difference was measured (p < 0.001). Distribution of the 8–12 Hz band power difference shows significant positive peaks in the left frontal area (F7) and bilateral parietal regions (Cp1 and Cp2). Distribution of the 3–7 Hz band power shows a significant positive peak in the frontal area (F7), and several negative peaks in central and parietal areas.

Discussion

Our study investigated brain states classification during the execution of multiple cognitive processes in MR. To this end we explored continuous EEG data decoding during performance of MR relevant tasks such as spatial navigation, calculation and reading in XIM, using LDA and SVM based classifiers. Results of our first classification scheme, aiming to discriminate spatial navigation, calculation and reading, showed high performance of both LDA and SVM classifiers, with an average accuracy of around 83% for LDA and of around 65% for SVM. Results of our second classification scheme, aiming to decode mental workload independently on the workload task, showed high classification performance of both SVM and LDA (on average both above 86%). Successful decoding of all mental states was achieved considering a 5 s time-window shifting every 200 ms, permitting online applications with a bit-rate of about 5 bits/s.

Previous EEG-BCI mainly investigated mental states decoding considering different states separately, for instance motor imagery, attention, performance capability, emotional arousal, or brain activity prefiguring behavioral errors (Millán et al., 2002; Muller et al., 2008; Schubert et al., 2009; Eichele et al., 2010). However, the capability of decoding multiple brain states is particularly important for BCI-controlled MR as it would allow the implementation of more flexible scenarios with less behavioral constraints. A previous study aiming to implement a BCI for the recognition of multiple mental tasks from on-line spontaneous EEG signal reported a recognition rate of 70% in distinguishing between relaxation, left and right hand movement imagination using a simple local neural classifier (Millán et al., 2002). In addition, the authors performed a preliminary analysis in one subject to test generalization of two local neural classifiers in discriminating between three tasks—relaxation, arithmetic subtraction, and left hand movement imagination, as well as relaxation, cube rotation and left hand movement imagination; performance accuracy reached over 90% of correct prediction on the combined task (Millán et al., 2002). A more recent EEG classification study reported successful offline multi-class discrimination of several conditions, such as resting, mental calculation, mental writing and rotation, by combining wavelet transform decomposition for feature selection and a feed-forward neural network with one-step secant algorithm (Upadhyay, 2013).

Here, we tested a simpler approach using LDA and SVM, also on the basis of the results of a comparative analysis of multi-class EEG classifiers for BCI, such as LDA, Nearest Neighbor Classifier (NNC) and linear SVM, indicating that LDA provides the highest classification accuracy with low dimensional feature space (Lee et al., 2005). In line with these results both our classification schemes showed better performance of LDA with respect to SVM.

Increased accuracy of SVM classifier would also be achievable through additional optimization procedure applied to its parameters (i.e., C and γ), typically via cross-validation, but this would result in a substantial increase of the computational time. On the other hand, SVM can achieve higher performance as compared to LDA with high dimensional feature vectors (Lee et al., 2005).

Because of large intersubject variability of EEG data, subject-dependent classifiers, as those used here, guarantee better performance than subject-independent classifiers (Lotte and Ang, 2009) but they require an initial offline calibration during which the participants need to evoke specific mental state by performing appropriate tasks (supervised classifier). However, a subject-independent BCI system with no need of training, implemented using a combination of large datasets of subject-dependent classifiers into a single subject-independent classifier, demonstrated performance similar to that of subject-dependent methods (Fazli et al., 2009). In addition, Vidaurre and colleagues investigating co-adaptive learning using machine learning techniques implemented a subject-independent supervised classifier with no need of offline calibration procedure that showed good performance even in participants that are not able to control conventional BCI (Vidaurre et al., 2010). These promising findings suggest the possibility to extend to use of subject-independent classifiers in BCI applications.

Our subject-dependent LDA-based classifier provided the highest accuracy mostly using 3–10 and 3–15 Hz frequency power bands. Spectral analysis indicated changes at theta band (3–6 Hz) as well as at alpha band (8–12 Hz) between different mental states in frontoparietal regions. Alpha band increase in the frontal area was observed for all conditions, whereas both theta and alpha bands increase in the parietal regions was larger for reading and calculation with respect to spatial navigation. Accordingly, high workload, that included both reading and calculation, compared to low workload—spatial navigation—showed significant power differences at 3–6 and 8–12 Hz bands. In particular theta band maximum was observed in the left frontal area, while alpha band peaked in the left frontal and bilateral parietal regions.

Despite intersubject variability and our small sample size, the observed alpha band increase in the bilateral parietal areas is in line with previous results reporting alpha changes in bilateral parietal and occipital brain regions associated with mental workload, task engagement or attention (Humphrey and Kramer, 1994; Pope et al., 1995; Kohlmorgen et al., 2007). The measured changes in the theta band power are also in line with previous studies indicating that theta oscillations are related to spatial navigation as well as encoding and retrieval of spatial information (Kahana et al., 1999; Bischof and Boulanger, 2003). In particular, high amplitude theta activity, mainly in the left frontal and right temporal cortices, has been measured during navigation in a virtual maze (Kahana et al., 1999). Other studies corroborated these results and showed that the frequency of theta episodes is directly associated with the difficulty of maze navigation (Bischof and Boulanger, 2003). In light of previous studies our results indicate that indeed the here adopted reading and calculation tasks required increased allocation of mental resources with respect to spatial navigation.

In addition, we observed delta frequencies (3 Hz) power changes during all conditions in frontal regions. These results are in line with previous studies indicating increased EEG oscillations in the range 1–3.5 Hz in frontal regions associated with different cognitive processes (Harmony, 2013), in particular during internal concentration and calculation (Fernandez et al., 1995).

Our mental states classifiers can equally be employed for real-time analysis of frequency bands. Online monitoring of alpha and theta bands power would be important for assessing participants' performance as these bands reflect cognitive and memory processing (Klimesch, 1999). In online MR-combined BCI applications the information stream provided to the user could be adapted to the current workload as indicated by alpha and theta oscillations. Kohlmorgen et al. (2007) proposed the use of ratios of activity in alpha (8–12 Hz) or theta (3–7 Hz) bands to compute an index of the user task engagement.

Alpha band is also critical for visual perception, in particular medium and lower amplitudes can reflect improved performance in somatosensory and visual discrimination tasks (Pfurtscheller and Lopes Da Silva, 1999; Hanslmayr et al., 2005; Palva and Palva, 2007; Van Dijk et al., 2008). Moreover, decrease in the alpha frequencies (8–12 Hz) before target onset was associated with augmented visual target detection (Ergenoglu et al., 2004). Further studies confirmed this observation by showing that the amplitude of prestimulus ~10 Hz oscillations correlated with the detection of the upcoming target: the smaller the amplitude, the more likely the target would be detected (Van Dijk et al., 2008; Busch et al., 2009; Mathewson et al., 2009). On the contrary, stronger pre-stimulus alpha frequency band amplitude has been linked to increased cognitive performance (Neubauer and Freudenthaler, 1995; Klimesch, 1999). Thus, online inspection of alpha wave oscillations might be used for triggering stimuli presentation so as to optimize stimulus detection, as well as for improving interpretation of novel information and data mining since alpha band activity has also been associated with learning (Klimesch, 1999).

In conclusion, our LDA classifier is sufficiently flexible and powerful for the implementation of a MR-combined BCI system. Successful classification of mental states based on subject-specific single trials EEG indicates the possibility to combine BCI technology with the XIM so that brain activity could drive the adaptation of data representation. A possible way to refine our BCI in MR is the use of an asynchronous modality where participants do not need to follow a fixed repetitive scheme to switch from one mental task to another one. Asynchronous BCI, allowing individuals to decide when to perform a mental task and when to stop it and switch to another one, are more flexible and adaptive to different scenarios (Millán et al., 2002; Millan Jdel and Mourino, 2003). Ultimately, this approach would permit to model the user experience in the XIM as common product between the initial data representations and the changes made interactively as consequences of users' neurophysiological signals associated with spontaneous behavior.

Conflict of Interest Statement

The Review Editor Dr. Emanuele Pasqualotto declares that, despite having collaborated with some of the authors, the review process was handled objectively. The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The present study was supported by EU grants: FP7-ICT-2009-258749 CEEDs: The Collective Experience of Empathic Data Systems; FP7-ICT-2013- 609593 BNCI Horizon 2020. The Future of Brain/Neural Computer Interaction: Horizon 2020; Italian Ministry of Health, GR-2009-1591908.

References

Arieli, A., Sterkin, A., Grinvald, A., and Aertsen, A. (1996). Dynamics of ongoing activity: explanation of the large variability in evoked cortical responses. Science 273, 1868–1871. doi: 10.1126/science.273.5283.1868

CrossRef Full Text | Google Scholar

Baldi, P., Brunak, S., Chauvin, Y., Andersen, C. A., and Nielsen, H. (2000). Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16, 412–424. doi: 10.1093/bioinformatics/16.5.412

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bernardet, U., Valjamae, A., Inderbitzin, M., Wierenga, S., Mura, A., and Verschure, P. F. (2011). Quantifying human subjective experience and social interaction using the eXperience Induction Machine. Brain Res. Bull. 85, 305–312. doi: 10.1016/j.brainresbull.2010.11.009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Betella, A., Cetnarski, R., Zucca, R., Arsiwalla, X. D., Martinez, E., Omedas, P., et al. (2014). “BrainX3: embodied exploration of neural data,” in Proceedings of the 2014 Virtual Reality International Conference Article No. 37 (New York, NY: ACM). doi: 10.1145/2617841.2620726

CrossRef Full Text | Google Scholar

Birbaumer, N., and Cohen, L. G. (2007). Brain-computer interfaces: communication and restoration of movement in paralysis. J. Physiol. 579, 621–636. doi: 10.1113/jphysiol.2006.125633

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bischof, W. F., and Boulanger, P. (2003). Spatial navigation in virtual reality environments: an EEG analysis. Cyberpsychol. Behav. 6, 487–495. doi: 10.1089/109493103769710514

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Blankertz, B., Tangermann, M., Vidaurre, C., Fazli, S., Sannelli, C., Haufe, S., et al. (2010). The Berlin brain-computer interface: non-medical uses of BCI technology. Front. Neurosci. 4:198. doi: 10.3389/fnins.2010.00198

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bohil, C. J., Alicea, B., and Biocca, F. A. (2011). Virtual reality in neuroscience research and therapy. Nat. Rev. Neurosci. 12, 752–762. doi: 10.1038/nrn3122

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Boly, M., Balteau, E., Schnakers, C., Degueldre, C., Moonen, G., Luxen, A., et al. (2007). Baseline brain activity fluctuations predict somatosensory perception in humans. Proc. Natl. Acad. Sci. U.S.A. 104, 12187–12192. doi: 10.1073/pnas.0611404104

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Brown, R. J., and Norcia, A. M. (1997). A method for investigating binocular rivalry in real-time with the steady-state VEP. Vision Res. 37, 2401–2408. doi: 10.1016/S0042-6989(97)00045-X

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Busch, N. A., Dubois, J., and Vanrullen, R. (2009). The phase of ongoing EEG oscillations predicts visual perception. J. Neurosci. 29, 7869–7876. doi: 10.1523/JNEUROSCI.0113-09.2009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Caria, A., Sitaram, R., and Birbaumer, N. (2012). Real-time fMRI: a tool for local brain regulation. Neuroscientist 18, 487–501. doi: 10.1177/1073858411407205

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cherkassky, V., and Ma, Y. (2004). Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw. 17, 113–126. doi: 10.1016/S0893-6080(03)00169-2

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Daly, J. J., and Wolpaw, J. R. (2008). Brain-computer interfaces in neurological rehabilitation. Lancet Neurol. 7, 1032–1043. doi: 10.1016/S1474-4422(08)70223-0

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Delorme, A., and Makeig, S. (2003). EEG changes accompanying learned regulation of 12-Hz EEG activity. IEEE Trans. Neural Syst. Rehabil. Eng. 11, 133–137. doi: 10.1109/TNSRE.2003.814428

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Destefano, D. L. J. (2004). The role of working memory in mental arithmetic. Eur. J. Cogn. Psychol. 16, 353–386. doi: 10.1080/09541440244000328

CrossRef Full Text | Google Scholar

Eichele, H., Juvodden, H. T., Ullsperger, M., and Eichele, T. (2010). Mal-adaptation of event-related EEG responses preceding performance errors. Front. Hum. Neurosci. 4:65. doi: 10.3389/fnhum.2010.00065

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Ergenoglu, T., Demiralp, T., Bayraktaroglu, Z., Ergen, M., Beydagi, H., and Uresin, Y. (2004). Alpha rhythm of the EEG modulates visual detection performance in humans. Brain Res. Cogn. Brain Res. 20, 376–383. doi: 10.1016/j.cogbrainres.2004.03.009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Faller, J. M. L.-P. G., Schmalstieg, D., and Pfurtscheller, G. (2010). “An application framework for controlling an avatar in a desktop based virtual environment via a software SSVEP brain-computer interface,” in Presence: Teleoperators and Virtual Environments, Vol. 19 (Cambridge, MA: MIT Press), 25–34.

Google Scholar

Fazli, S., Popescu, F., Danoczy, M., Blankertz, B., Muller, K. R., and Grozea, C. (2009). Subject-independent mental state classification in single trials. Neural Netw. 22, 1305–1312. doi: 10.1016/j.neunet.2009.06.003

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Fernandez, T., Harmony, T., Rodriguez, M., Bernal, J., Silva, J., Reyes, A., et al. (1995). EEG activation patterns during the performance of tasks involving different components of mental calculation. Electroencephalogr. Clin. Neurophysiol. 94, 175–182. doi: 10.1016/0013-4694(94)00262-J

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Fox, M. D., and Raichle, M. E. (2007). Spontaneous fluctuations in brain activity observed with functional magnetic resonance imaging. Nat. Rev. Neurosci. 8, 700–711. doi: 10.1038/nrn2201

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Fox, M. D., Snyder, A. Z., Vincent, J. L., and Raichle, M. E. (2007). Intrinsic fluctuations within cortical systems account for intertrial variability in human behavior. Neuron 56, 171–184. doi: 10.1016/j.neuron.2007.08.023

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

George, L. L. A. (2010). “An overview of research on “passive” brain-computer interfaces for implicit human-computer interaction,” in International Conference on Applied Bionics and Biomechanics ICABB (Venice).

Google Scholar

Hanslmayr, S., Sauseng, P., Doppelmayr, M., Schabus, M., and Klimesch, W. (2005). Increasing individual upper alpha power by neurofeedback improves cognitive performance in human subjects. Appl. Psychophysiol. Biofeedback 30, 1–10. doi: 10.1007/s10484-005-2169-8

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Harmony, T. (2013). The functional significance of delta oscillations in cognitive processing. Front. Integr. Neurosci. 7:83. doi: 10.3389/fnint.2013.00083

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hsu, C. W., and Lin, C. J. (2002). A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13, 415–425. doi: 10.1109/72.991427

CrossRef Full Text | Google Scholar

Humphrey, D. G., and Kramer, A. F. (1994). Toward a psychophysiological assessment of dynamic changes in mental workload. Hum. Factors 36, 3–26.

Pubmed Abstract | Pubmed Full Text | Google Scholar

Imbo, I., Vandierendonck, A., and De Rammelaere, S. (2007). The role of working memory in the carry operation of mental arithmetic: number and value of the carry. Q. J. Exp. Psychol. (Hove) 60, 708–731. doi: 10.1080/17470210600762447

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Kahana, M. J., Sekuler, R., Caplan, J. B., Kirschen, M., and Madsen, J. R. (1999). Human theta oscillations exhibit task dependence during virtual maze navigation. Nature 399, 781–784. doi: 10.1038/21645

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Klimesch, W. (1999). EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis. Brain Res. Brain Res. Rev. 29, 169–195. doi: 10.1016/S0165-0173(98)00056-3

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Kohlmorgen, J. D. G., Braun, M., Blankertz, B., Müller, K. R., Curio, G., Hagemann, K., et al. (2007). “Improving human performance in a real operating environment through real-time mental workload detection,” in Toward Brain-Computer Interfacing, eds J. D. R. M. G. Dornhege, T. Hinterberger, D. McFarland, and K.-R. Müller (Cambridge: MIT press), 409–422.

Google Scholar

Lécuyer, A., Lotte, F., Reilly, R. B., Leeb, R., Hirose, M., and Slater, M. (2008). Brain–computer interfaces, virtual reality, and videogames. Computer 41, 66–72. doi: 10.1109/MC.2008.410

CrossRef Full Text | Google Scholar

Lee, F. S. R., Leeb, R., Neuper, C., Bischof, H., and Pfurtscheller, G. (2005). “A comparative analysis of multi-class EEG classification for brain computer interface,” in Computer Vision Winter Workshop CVWW (Graz: Austrian Computer Society).

Google Scholar

Lessiter, J., Miotto, A., Freeman, J., Verschure, P., and Bernardet, U. (2011). CEEDs: unleashing the power of the subconscious. Proc. Comput. Sci. 7, 214–215. doi: 10.1016/j.procs.2011.09.069

CrossRef Full Text | Google Scholar

Lotte, F. F. J., Guger, C., Renard, Y., Pfurtscheller, G., Lécuyer, A., and Leeb, R. (2013). “Combining BCI with virtual reality: towards new applications and improved BCI,” in Towards Practical Brain-Computer Interfaces, eds S. D. Brendan, Z. Allison, R. Leeb, J. Del R. Millán, and A. Nijholt (Berlin; Heidelberg: Springer), 197–220.

Google Scholar

Lotte, F. G. C., and Ang, K. K. (2009). Comparison of designs towards a subject-independent brain–computer interface based on motor imagery. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2009, 4543–4546. doi: 10.1109/IEMBS.2009.5334126

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Martinez, P., Bakardjian, H., and Cichocki, A. (2007). Fully online multicommand brain-computer interface with visual neurofeedback using SSVEP paradigm. Comput. Intell. Neurosci. 2007:94561. doi: 10.1155/2007/94561

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mathewson, K. E., Gratton, G., Fabiani, M., Beck, D. M., and Ro, T. (2009). To see or not to see: prestimulus alpha phase predicts visual awareness. J. Neurosci. 29, 2725–2732. doi: 10.1523/JNEUROSCI.3963-08.2009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mayes, D. K. S. V. K., and Koonce, J. M. (2001). Comprehension and workload differences for VDT and paper-based reading. Int. J. Ind. Ergon. 28, 367–378. doi: 10.1016/S0169-8141(01)00043-9

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Milgram, P. K. A. F. (1994). Taxonomy of mixed reality visual displays. IEICE Trans. Inf. Syst. E77-D, 1321–1329.

Google Scholar

Millán Jdel, R., and Mouriño, J. (2003). Asynchronous BCI and local neural classifiers: an overview of the Adaptive Brain Interface project. IEEE Trans. Neural Syst. Rehabil. Eng. 11, 159–161. doi: 10.1109/TNSRE.2003.814435

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Millán, J. D. R., Mouriño, J., Franzé, M., Cincotti, F., Varsta, M., Heikkonen, J., et al. (2002). A local neural classifier for the recognition of EEG patterns associated to mental tasks. IEEE Transa. Neural Netw. 13, 678–686. doi: 10.1109/TNN.2002.1000132

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Muller, K. R., Tangermann, M., Dornhege, G., Krauledat, M., Curio, G., and Blankertz, B. (2008). Machine learning for real-time single-trial EEG-analysis: from brain-computer interfacing to mental state monitoring. J. Neurosci. Methods 167, 82–90. doi: 10.1016/j.jneumeth.2007.09.022

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Neubauer, A. C., and Freudenthaler, H. H. (1995). Ultradian rhythms in cognitive performance: no evidence for a 1.5-h rhythm. Biol. Psychol. 40, 281–298. doi: 10.1016/0301-0511(95)05121-P

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Omedas, P. B. A., Zucca, R., Arsiwalla, X., Pacheco, D., Wagner, J., Lingenfelser, F., et al. (2014). “XIM-Engine: a software framework to support the development of interactive applications that uses conscious and unconscious reactions in immersive mixed reality,” in Virtual Reality International Conference (VRIC) (Laval).

Google Scholar

Palva, S., and Palva, J. M. (2007). New vistas for alpha-frequency band oscillations. Trends Neurosci. 30, 150–158. doi: 10.1016/j.tins.2007.02.001

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Pfurtscheller, G., and Lopes Da Silva, F. H. (1999). Event-related EEG/MEG synchronization and desynchronization: basic principles. Clin. Neurophysiol. 110, 1842–1857. doi: 10.1016/S1388-2457(99)00141-8

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Pope, A. T., Bogart, E. H., and Bartolome, D. S. (1995). Biocybernetic system evaluates indices of operator engagement in automated task. Biol. Psychol. 40, 187–195. doi: 10.1016/0301-0511(95)05116-3

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Scharnowski, F., Hutton, C., Josephs, O., Weiskopf, N., and Rees, G. (2012). Improving visual perception through neurofeedback. J. Neurosci. 32, 17830–17841. doi: 10.1523/JNEUROSCI.6334-11.2012

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schubert, R., Haufe, S., Blankenburg, F., Villringer, A., and Curio, G. (2009). Now you'll feel it, now you won't: EEG rhythms predict the effectiveness of perceptual masking. J. Cogn. Neurosci. 21, 2407–2419. doi: 10.1162/jocn.2008.21174

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Shibata, K., Watanabe, T., Sasaki, Y., and Kawato, M. (2011). Perceptual learning incepted by decoded fMRI neurofeedback without stimulus presentation. Science 334, 1413–1415. doi: 10.1126/science.1212003

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Upadhyay, D. (2013). Classification of eeg signals under different mental tasks using wavelet transform and neural network with one step secant algorithm. Int. J. Sci. Eng. Technol. 2, 256–259.

Google Scholar

Van Dijk, H., Schoffelen, J. M., Oostenveld, R., and Jensen, O. (2008). Prestimulus oscillatory activity in the alpha band predicts visual discrimination ability. J. Neurosci. 28, 1816–1823. doi: 10.1523/JNEUROSCI.1853-07.2008

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Verschure, P. F. (2011). “The complexity of reality and human computer confluence: stemming the data deluge by empowering human creativity,” in Proceedings of the 9th ACM SIGCHI Italian Chapter International Conference on Computer-Human Interaction: Facing Complexity (New York, NY: ACM), 3–6.

Google Scholar

Vidaurre, C., Sannelli, C., Muller, K. R., and Blankertz, B. (2010). Machine-learning-based coadaptive calibration for brain-computer interfaces. Neural Comput. 23, 791–816. doi: 10.1162/NECO_a_00089

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Yoo, J. J., Hinds, O., Ofen, N., Thompson, T. W., Whitfield-Gabrieli, S., Triantafyllou, C., et al. (2011). When the brain is prepared to learn: enhancing human learning using real-time fMRI. Neuroimage 59, 846–852. doi: 10.1016/j.neuroimage.2011.07.063

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Keywords: mental states decoding, EEG, mixed reality, XIM

Citation: De Massari D, Pacheco D, Malekshahi R, Betella A, Verschure PFMJ, Birbaumer N and Caria A (2014) Fast mental states decoding in mixed reality. Front. Behav. Neurosci. 8:415. doi: 10.3389/fnbeh.2014.00415

Received: 07 June 2014; Accepted: 12 November 2014;
Published online: 27 November 2014.

Edited by:

Nuno Sousa, University of Minho, Portugal

Reviewed by:

Mamiko Koshiba, Saitama Medical University, Japan
Emanuele Pasqualotto, Université Catholique de Louvain, Belgium

Copyright © 2014 De Massari, Pacheco, Malekshahi, Betella, Verschure, Birbaumer and Caria. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Andrea Caria, Institute of Medical Psychology and Behavioural Neurobiology, Eberhard-Karls-University of Tübingen, Silcherstrasse 5, D-72076 Tübingen, Germany e-mail:YW5kcmVhLmNhcmlhQHVuaS10dWViaW5nZW4uZGU=

^†These authors have contributed equally to this work.

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.