Neural rhythmic symphony of human walking observation: Upside-down and Uncoordinated condition on cortical theta, alpha, beta and gamma oscillations

Biological motion observation has been recognized to produce dynamic change in sensorimotor activation according to the observed kinematics. Physical plausibility of the spatial-kinematic relationship of human movement may play a major role in the top-down processing of human motion recognition. Here, we investigated the time course of scalp activation during observation of human gait in order to extract and use it on future integrated brain-computer interface using virtual reality (VR). We analyzed event related potentials (ERP), the event related spectral perturbation (ERSP) and the inter-trial coherence (ITC) from high-density EEG recording during video display onset (−200–600 ms) and the steady state visual evoked potentials (SSVEP) inside the video of human walking 3D-animation in three conditions: Normal; Upside-down (inverted images); and Uncoordinated (pseudo-randomly mixed images). We found that early visual evoked response P120 was decreased in Upside-down condition. The N170 and P300b amplitudes were decreased in Uncoordinated condition. In Upside-down and Uncoordinated conditions, we found decreased alpha power and theta phase-locking. As regards gamma oscillation, power was increased during the Upside-down animation and decreased during the Uncoordinated animation. An SSVEP-like response oscillating at about 10 Hz was also described showing that the oscillating pattern is enhanced 300 ms after the heel strike event only in the Normal but not in the Upside-down condition. Our results are consistent with most of previous point-light display studies, further supporting possible use of virtual reality for neurofeedback applications.


INTRODUCTION
Neuronal processing of the visual system allows us to perceive objects, movements, colors, contrasts, and to represent the space around us with a very high resolution. In addition to the classical dichotomy between the ventral stream (the "What" pathway) supporting object vision and a dorsal stream (the "Where" pathway), a more recent conception based on clinical evidence (Kravitz et al., 2011) divides the dorsal stream into three sub-pathways projecting on to the premotor (supporting visually-guided actions), the prefrontal and the medial temporal lobes (supporting spatial working memory) both directly and through the posterior cingulate and retrosplenial areas (supporting navigation). This emphasizes the contribution of numerous functionally specialized, hierarchically organized visual areas giving rise to a conscious perception of the different attributes of the visual scene (Zeki et al., 1991;Singer, 1999). The discovery of the phase-locking mechanism at the level of the cortical neurons producing gamma oscillation (Gray et al., 1989) constitutes a strong scientific foundation for the binding by synchrony hypothesis (Singer, 1999) and has also paved the way for noninvasive investigation of the implicated mechanisms by electroencephalography (EEG) and event-related potentials (ERP) (Makeig et al., 2002;Cheron et al., 2007Cheron et al., , 2014Cebolla et al., 2009Cebolla et al., , 2014. New approaches of signal analysis (Delorme and Makeig, 2004) have permitted to better understand the genesis of the sensory evoked responses including visual motion in VR environment (Gramann et al., 2009;Cheron et al., 2014) and the origin of the movement gating of sensory evoked responses (Cebolla et al., 2009).
The discovery of mirror neurons responding similarly when the monkey performs an action and when it observes the experimenter performing the same action (Rizzolatti et al., 1996) has led to human studies of visual processes involved in recognition (Blake and Shiffrar, 2007;Avanzini et al., 2013;Di Dio et al., 2013), prediction of others' movements (Csibra, 2007;Kilner et al., 2007), and their implication in social cognition (Jacob and Jeannerod, 2005;Schütz-Bosbach and Prinz, 2007;Heyes, 2010;Press et al., 2011). Behavioral, neuroimaging and neurophysiological data have demonstrated a high sensitivity to reference frame (Pavlova and Sokolov, 2000;Pavlova et al., 2004;McGlothlin et al., 2012), human body form (Downing et al., 2001), kinematics of human movement (Avanzini et al., 2012;McAleer et al., 2014), gender and personal traits (Pollick et al., 2005;Troje et al., 2005;McGlothlin et al., 2012). Several studies demonstrated that shape and motion information are treated separately by ventral and dorsal visual streams, and converge to the posterior portion of superior temporal sulcus (Vaina et al., 2001;Giese and Poggio, 2003;Blake and Shiffrar, 2007). Moreover, the motor theory of perception, based on the fact that movement perception is influenced by the implicit knowledge about the working principles of the motor control system (Viviani and Stucchi, 1992;Rizzolatti and Craighero, 2004), give a critical place to ventral premotor cortex in biological motion perception processes (Saygin et al., 2004).
By extending Darwin's evolutionary perspective about face emotion (Darwin, 1872) to human locomotion, we may advance that the recognition of the human primate by its bipedal locomotion already present in early hominid before stone tools and large brains (Leakey and Walker, 1997) is probably one of the most vital activities of human in a selection retrospective view. Moreover, such human gestures may represent a constitutive element of the emotional body language (De Gelder, 2006, for a review). This whole body movement of homo sapiens is characterized by distinctive patterns of smooth, regular, alternated lower and upper limbs movements performed around a relatively fixed and erected posture of the head and trunk segment (Pozzo et al., 1990). Another remarkable element is the heelstrike considered as an acquired character of African and Asian apes linked closely to knuckle walking quadrupedalism (Thorpe et al., 2007;Crompton et al., 2010). These highly recognizable elements would implicate that human mirror neuron systems should be active when watching somebody else walk (Cheng et al., 2005).
Here, we studied the ERP and dynamics of theta, alpha, beta and gamma oscillations induced by the observation of an animated avatar in a virtual reality (VR). We hypothesized that physical plausibility of the spatial-kinematic of human locomotion plays a major role in different modes of neural processing (bottom-up and top-down) implicated in locomotion recognition: we expect that these processes will be reflected in different contributions of rhythmic power and phase-locked perturbation in different frequency bands and cortical areas. To address this question, we used an animation representing a human mannequin during walking action performed in normal, Uncoordinated kinematics and in Upside-down views. This will offer the possibility to extract the dynamic signature of human walking observation with respect to the neural activity evoked by the same image content but in an unusual frame of reference (Upside-down view) or without respect to normal kinematics (Uncoordinated walk) from the EEG signals.

ARTICIPANTS
Sixteen healthy volunteers took part in this study (ten males, mean age 25.9 years, range 18-35 years). All subjects were righthanded, had Normal or corrected-to-Normal vision, were naive with respect to the purpose of the experiment, and gave informed consent. The experiment was performed with the approval of the ethics committee of Université Libre de Bruxelles and realized in accordance with the ethical standards of the 1964 Declaration of Helsinki.

STIMULI
Visual stimuli consisted of an animation in the center of the screen representing human walking mannequin (from Cal3D Library) that was presented in three different ways: Normal walking (N), Upside-down (U), and Uncoordinated (J) (Figure 1). Each animation was organized in 22 blocks of 10 s duration interspersed by 7-12 s of random periods of gray screen. This allowed us to obtain a repetitive control of the baseline state throughout the whole recording session and facilitated the recording of the averaged responses (see below). For the Normal walking condition, each block was initiated by an image representing the heel strike event of the right leg and was ended by the last image preceding the next heels strike of the right leg picture ( Figure 1A). The animation had a 10 Hz frequency. In the Upside-down condition the same successive pictures were merely inverted, keeping the same kinematics sequences. In the Uncoordinated condition the first image remained the same as the Normal condition while all the others were randomly mixed giving rise to an incoherent Uncoordinated movement. The random sequence was conserved for each trial and each subject ( Figure 1B).
We have calculated the luminosity of each image in the three colors (RGB) by using "imread" MATLab function. This function stores in a matrix the composition in red, green and blue of each pixel of an image. For each frame of the animation, we subtracted this matrix to that of the previous frame. We have then computed the mean of the resulting matrix and called this composition "dynamic contrast." This represents overall change between two successive frames of the composition in luminosity of the same located pixels.

EXPERIMENTAL SETUP
The recording was realized in a single session. EEG was recorded in 128 channels (ANT system, The Netherlands) at a sampling frequency of 2048 Hz and with a resolution of 22 bits (71.5 nV per bit). An active-shield cap using 128 Ag/AgCl sintered ring electrodes and shielded co-axial cables (5-10 electrode system placements) was comfortably adjusted to subject head. All electrodes were referred to left earlobe. Impedance was lowered below 10 k for each electrode and checked before each recording. Displays were presented on a 17 Dell computer screen. Participants looked straight ahead at the computer screen through a form-fitting facemask and a circular barrel (cylinder). The screen was centered on the line of gaze at a distance of ∼30 cm from the eyes. Viewing through the barrel removed any external visual references. In addition, subjects had earplugs to isolate from external hearing disturbance. For eight subjects, we presented in three successive sessions the Normal walking condition, then Upside-down walking and then Uncoordinated one (No-Random group). For eight other subjects, conditions were presented randomly in three successive sessions (Random group). We made a pause between each session in order to limit the effect of fatigue. As the aim of this study was to evaluate the effect of purposeless perception on the brain rhythms, no particular attentional task was required. However, the state of awareness was continuously checked by online EEG (absence of slow rhythm linked to drowsiness) and EOG, for which we placed electrodes above, below, right and left of the right eye. In particular, we checked that the blink number and configuration remained unchanged throughout the experimental session. We used an in-house script that counts the number of blinks by incrementing an index for each potential higher than 250 μV, and calculates the interval between blinks to provide a view of their configuration over time.
We then calculated the number of saccades by EOG derived function.

DATA TREATMENT
Off-line data treatment and analysis was performed by means of EEGLAB software (Delorme and Makeig, 2004;Brunner et al., 2013) and in-house MATLAB-based tools . DC offset was removed, then band pass filter 0.1-80 Hz and notch filter around 50 Hz (47.5-52.5 Hz) were applied to attenuate electrical artifacts. Portions of data and defective electrodes (max. 6%) were removed by careful visual inspection. Ocular (blink and saccade) and any other remaining artifacts (muscular, cardiac) were isolated by ICA algorithm decomposition. We used the scalp topography, temporal activity localization and spectra magnitude criterion to identify ICA related to artifact. In case of doubt the rejection occurred only if all experimenters involved in data treatment reached agreement. After ICA rejection, defective electrodes were spherically interpolated. Two analyses were performed: animation onset analysis and SSVEP analysis. In the animation onset analysis, data were organized in epochs corresponding to intervals [−1000; 3000] ms, centered on animation onset. We rejected epochs according to ±100 μV threshold criterion, and we made a visual review to confirm epoch rejection. In total, we obtained 17 ± 5 epochs per subject (n = 16) and per condition (n = 3). A study design was used to average data from subjects for each condition. A time window of 1000 ms before stimulus onset was used as baseline.
In SSVEP analysis, data were organized in epochs corresponding to intervals [−200; 600] ms, centered on each heel strike except for the first and the last one. As in preceding analysis, we applied ±100 μV threshold criterion confirmed by a visual review. In total, we obtained 159 ± 16) epochs per subject (n = 16) and per condition (n = 3). We performed a grand average study including 2065 ± 26 trials for each condition. The interval [−200; 0] ms was used as baseline, and the first and last heel strikes were excluded from the analysis In this case, the SSVEP analysis was independent of the neutral black screen periods allowing to join mixed Random and Non-Random groups.

EVENT-RELATED POTENTIAL
The first noticeable ERP component referenced to the earlobe elicited after the onset of the VR-animation was the P120 component recorded in occipito-parietal electrodes. Analysis revealed a peak reduction in Upside-down condition with respect to Normal condition over occipito-parietal regions. The next component was the N170 mainly recorded over the occipito-parietal regions (Figure 2). We observed a significant decrease in Uncoordinated condition over occipital and parieto-lateral regions. This was followed by a large P300 component ending at about 500 ms in occipital regions and 400 ms in frontal regions. In order to simplify the description of condition effect, we subdivided P300 in the two classical P300a and P300b components (Figure 2). The P300a component was significantly smaller in the Upside-down condition over parietal regions with respect to the Normal situation, whereas the P300b component remained the same. On the contrary in the Uncoordinated condition, the P300a component was not modified whereas the P300b showed a significant decrease in occipital and parietal regions ( Table 1). Comparison between Random and No-Random group revealed no significant difference between their respective conditions except in the P300 amplitude in parietal region which was greater for the No-Random group for the 3 conditions.

EVENT-RELATED SPECTRAL PERTURBATION AND INTER-TRIAL COHERENCE
Whatever the observed condition (Figures 3-5), the video onset triggered throughout the scalp a theta ERS in the [0; 500] ms interval followed by an alpha ERD initiated at about 200 ms and maintained during all the duration of the video. This alpha ERD was accompanied by a beta ERD which was more pronounced in the Uncoordinated condition ( Figure 5). The Normal presentation induced a gamma ERD at about 700 ms and maintained for all the duration of the video in the left sensori-motor region (Figure 3, CP5 electrode). This gamma ERD was not present in the two others conditions (Figures 4, 5).
In the [−200;600] ms time-window, the Normal condition was characterized by the following observations (Figures 6-9): (1) at the latency of the P120, ERSP plots showed an earlier alpha ERS reaching maximal value in the parietal region and extending to the beta band in the parieto-central region. With respect to the Normal condition, in the Upside-down condition, three significant changes were noted (Figures 6-9): (1) a decrease in alpha-beta power between 100 and 500 ms over occipital to frontal regions, resulting in a lack of ERS in alpha bands at the latency of the P120, and an increased alpha-beta ERD at about 200-500 ms; (2) a decrease of the theta phase-locking in the parieto-central regions; (3) an increase in (gamma) 40 and 60 Hz ERS over occipito-central regions, respectively at about 150 and 350 ms latency ( Table 1).
With respect to the Normal condition, in the Uncoordinated condition the following was observed (Figures 6-9): (1) a decrease of alpha-beta band at the latency of the P120, followed by an earlier and greater alpha-beta ERD over the occipitoparietal region; (2) a reduction of theta phase-locking. However, in contrast to the Upside-down and Normal condition, (3) the Uncoordinated animation produced a gamma 40-60 Hz ERD at about 200-500 ms over occipito-central regions ( Table 1). The analysis of the dynamic contrast of the image showed that the Uncoordinated condition presented an increased contrast between the third and the fourth image (400 ms after the onset) with respect to the two other conditions.

STEADY STATE VISUAL EVOKED POTENTIALS
When the heel strike of the right leg was used as trigger, the average trace corresponded to an oscillatory pattern peaking at about 9 Hz. (8.86 Hz). This was observed for all subjects and conditions and may be considered as a SSVEP induced by the frequency of the video. Figure 10 illustrates the SSVEP traces resulting from a grand average of all the 16 subjects which conserve the 9 Hz oscillating pattern presented in each single subject. However, the amplitude of the grand average oscillation was not constant throughout the time period. The first negative peak occurred close to 100 ms after the heel strike in any of the 3 different conditions ( Figure 10A). For Normal and Upside-Down condition the SSVEP amplitude increased after the heel strike and culminated at a latency of 300 ms (negative peak) only in the Normal condition. Thereafter, the oscillating pattern decreased in Normal and Upside-down condition but was maintained in the Uncoordinated condition.
The Upside-down condition showed a significant decrease of the third negative SSVEP component at 300 ms with respect to the Normal condition over parieto-occipital regions ( Figure 10A left part; Table 1). The reduction of the ascending phase slope of this negativity in the Upside-Down condition was accompanied over parieto-occipital regions by a power increase in beta-gamma band at about 250 ms and later by a power increase in gamma band (60 Hz) at about 400 ms ( Figure 10B, left part). These effects were preceded by a decrease of theta-alpha bands ITC at about 120 ms with respect to the Normal condition ( Figure 10B, left part).
The Uncoordinated condition showed a significant increase of the SSVEP negativity at about 400 ms with respect to the Normal condition ( Figure 10A, right part; Table 1). This was preceded by an ERD in the theta-alpha band ( Figure 10B) over parieto-occipital regions. While the higher amplitude of the first negative peak recorded in this condition remained under the significance level ( Figure 10A, right part) it was accompanied by a significant increase of theta-alpha bands power at about 100 ms with respect to the Normal condition. These effects were followed by successive decrease and then increase of theta-alpha ITC respectively at about 350 and 450 ms (Figure 10B, right part).

DISCUSSION
To our knowledge, this work represents the first study on the dynamic neural response elicited by VR-animation of human walking. The presentation of the VR-animation elicits ERP components classically described in response to a visual stimulation (Jeffreys, 1996). We demonstrate significant changes in the amplitude of the P120 and P300a when the avatar was upside-down, and the N170 and P300b when the walking sequence was perturbed (Uncoordinated) with respect to the Normal condition. The presence of the early alpha ERS characterize the Normal condition. The alpha-beta ERD was reinforced and the theta phase-locking was disturbed in Upside-down and Uncoordinated condition. As regards gamma oscillation, a contrasting situation was seen as its power was increased in association with the Upside-down animation and decreased with the Uncoordinated animation.
An SSVEP-like response oscillating at about 9 Hz was also described when the heel strike event was used as trigger, showing that the oscillating pattern is enhanced 300 ms after the heel strike event only in the Normal but not in the Upside-Down condition.

ERP
The present ERP components evoked by avatar observation are in accordance with those recorded in previous studies using point-light paradigm (Johansson, 1973). The timing of P120, N170, and P300 are commonly regarded as corresponding to the main three components P1 around 130 ms, N1 at 200 ms and N2 (P3 depending of the type and the placement of reference electrode) at 300-400 ms described in point-light walking studies (Hirai et al., 2003(Hirai et al., , 2005(Hirai et al., , 2009(Hirai et al., , 2013Jokisch et al., 2005;Krakowski et al., 2011;Buzzell et al., 2013).
Although point-light and VR displays concern the same biological motion, VR display induced a stronger visual representation, including form and color than point-light walking. VR display conserved body structure in the Normal, Upside-down, and Uncoordinated conditions. This was not the case in point-light studies, where inverted condition associate form recognition to motion, and scramble condition deconstruct body form. In this context, the present VRparadigm offers the possibility to dissociate recognition of body form occurring at the first video frame, and motion coherence (smoothness, coordination, speed, etc.) produced by the successive frames. This allows to focus specifically on the effect induced by changing the frame of reference in Upsidedown and the global coherency of the walking motion in Uncoordinated. The similarity of P120 evoked by the Normal and the Uncoordinated conditions is consistent with the fact that the first image is the same in these conditions. In contrast, pointlight studies showed a delay and a decrease in P1 response elicited by scramble condition with respect to upright condition (Hirai et al., 2009(Hirai et al., , 2013Krakowski et al., 2011). These results support common interpretation that first component reflects a global representation coding of form (Baccus et al., 2009;Buzzell et al., 2013;White et al., 2014). However, the finding of decrease P120 in the Upside-down condition with respect to the Normal condition was not reported by previous point-light studies. The origin of this early alteration can be explained by both bottom-up and/or top-down process. Distinctions between these influences are not easy. The Normal presentation of the avatar may unconsciously induce an easy visual representation than the Upside-down mannequin which implies a mental transformation of the reference frame. In parallel, the repeated presentation of normal or inverted locomotion can predictively influence this early response by a top-down effect exerted by the frontal cortex to the primary visual cortex (Peyrin et al., 2010;Cardin et al., 2011;Zanto et al., 2011;Ramalingam et al., 2013). However, previous studies suggest that explicitly attended tasks process does not appear to influence the earlier activity at about 100 ms (Krakowski et al., 2011;Buzzell et al., 2013).
Concerning later activation, N1 and N2 were generally related to integration of form and motion (Baccus et al., 2009;Buzzell et al., 2013;White et al., 2014). It is interesting to note that the effects of Uncoordinated condition on N170 (analogous to N1 in point-light studies) and of point-light scramble on N1 are comparable, as N170 was reduced in Uncoordinated condition as N1 in scramble condition (Hirai et al., 2003(Hirai et al., , 2013Jokisch et al., 2005). This result suggests that N170 was mainly related to motion. In contrast, P300a (analogous to N2 in point-light studies) amplitude was the same in Normal and Uncoordinated conditions, whereas N2 was larger in upright point-light walker than point-light scramble (Jokisch et al., 2005;Hirai et al., 2013). Moreover, P300a amplitude was smaller in Upside-down than in Normal. This could also be related to the alteration of the SSVEP pattern occurring at this latency in the Upside-down vs. Normal condition. Taken together these results suggest that P300a was mainly related to the global form of walking.
Finally, the effect we recorded on P300b and in the SSVEP at about the same latency in the Uncoordinated condition is comparable to the late phase describe by Krakowski et al. (2011), which was characterized by a greater positivity in response to upright and inverted point-light walker than point-light scramble. This last phase after 400 ms is generally considered as indexing a highorder representation coding (Krakowski et al., 2011). This effect observed in Uncoordinated suggests that P300b component is sensitive to coherence of motion rather to mere recognition of walking.

Alpha ERS/ERD
The significant alpha ERS occurring at about 120 ms in the occipito-parietal regions characterized the Normal walking observation. It was followed by an alpha-beta ERD at 200 ms, which extends throughout the video. This was significantly more pronounced in Upside-down and Uncoordinated conditions than in Normal.
The first alpha ERS is in accordance with recent studies showing similar transient alpha increase in response to upright facial motion (Girges et al., 2014). The suppression of the early alpha ERS in Upside-down suggest that the inversion of the body presentation rapidly affect the early visual process. However, similar alteration found for the Uncoordinated condition while the first image was exactly the same suggests a top-down influence. Alpha oscillation has been interpreted as reflecting global inhibition of the cortex, improving behavioral performance by facilitation of the cognitive control (Klimesch et al., 1996(Klimesch et al., , 2003Klimesch, 1999Klimesch, , 2012Cheron et al., 2006;Haegens et al., 2010). Thus, increase in alpha power (ERS) may participate to a general clearance of noise or distracting event in order to selectively update relevant incoming information (Sadaghiani et al., 2012), and access to memory (Klimesch, 2012). In this context, suppression of alpha ERS in Upside-down and Uncoordinated conditions would be correlated with increase of attention to motion cue and involvement of cognitive resources. The next alpha ERD are in line with previous research showing decrease in alpha band power during perception of human motion (Cochin et al., 1998;Ulloa and Pineda, 2007). This was generally related to the desynchronization of mirror neurons activity as studied with EEG and fMRI combination (Arnstein et al., 2011), and would reflect a release from inhibition. However, it is interesting to note that our results are in contrast to studies of face perception (Girges et al., 2014), which report a greater alpha ERD in response to upright facial motion than in inverted condition. This difference between face and body motion recognition may be explain by high specialization of the brain to face recognition, and in particular to treatment of semantic content of facial gesture (Rojas et al., 2011). According to Klimesch et al. (1997Klimesch et al. ( , 1999Klimesch et al. ( , 2012, the alpha ERD increased as a function of the semantic content of retrieved information from the storage system. In our study, each stimulus has the same semantic content as a walking avatar. In this context, amplification of alpha ERD would indicate a recruitment of the mirror neurons system in order to recognize or predicted observed motion, by transformation of reference frame (in Upside-down) and reconstruction of motion (in Uncoordinated). The enhancement of the alpha ERD in Upside-down and Uncoordinated condition might then facilitate a dynamical process throughout the neural network involved in alpha rhythm generation evoked by the Normal walking avatar.

Theta ERS
The present ITC analysis shows that phase locking occur mainly in the theta range (peaked at ∼5 Hz). However, as it is classically the case, this is not a pure phase locking because it was accompanied by theta ERS throughout all electrodes. Indeed, the visual evoked potentials (P100-N200) elicited by the classical checkerboard pattern or by more complex visual stimuli were accompanied by a clear theta ERS and related ITC (Klimesch et al., 2004;Cheron et al., 2014). Although present in each of the three present conditions in the 100-400 ms time period, the theta ITC was significantly perturbed in both Upside-down and Uncoordinated condition, while theta ERS were not significantly different. This indicates that the recognition of Normal walking is accompanied by a stronger theta phase locking peaking between 200 and 300 ms. In humans the theta EEG rhythm (4-7 Hz) was initially defined as an intermediate rhythm between delta and alpha (Walter and Dovey, 1944;Mitchell et al., 2008). Later, the term FM-theta (FM for fronto-midline) was introduced by Ishihara and Yoshi (1972) when EEG was recorded during arithmetic task (Ishihara and Yoshi, 1972). Later, the presence of FM-theta during arithmetic and musical activities was demonstrated with MEG (Sasaki et al., 1996a,b,c). The midline frontal areas, such as the anterior cingulate cortex encompassing the lateral part of the prefrontal cortex are commonly cited as potential generators of the FM-theta (Gevins et al., 1997;Mizuhara et al., 2004;Sauseng et al., 2007).
In rat hippocampal regions, theta oscillation (3-9 Hz) is recognized to play an important role in the phase precession of the place cells firing assuming cued recall of the coming positions along the locomotion path of the rat (O' Keefe and Recce, 1993). The intrinsic theta generator of the hippocampal cortex is reinforced by the extrinsic theta pacemaker situated in the medial septal nucleus and allows a large-scale synchronization of theta oscillations in the hippocampus (Kocsis et al., 1999;Buzsáki, 2002). Theta oscillation is not restricted to the hippocampus but also emerges in different cortical areas in the rat (Leung and Borst, 1987;Silva et al., 1991). The ability of different cortical regions to produce theta is supported by slice recording demonstrating that theta oscillation may be produced by the activation of the NMDA receptors of the layer 5 (Silva et al., 1991;Flint and Connors, 1996) as well as by cholinergic activation of interneurons (Blatow et al., 2003).
Although human theta rhythm is not as robust as in the rat hippocampus, the ability of the human cortex to produce theta oscillation is now well recognized. It has been related to sensorimotor integration , navigation (Kahana et al., 1999), memory load  and working memory (Raghavachari et al., , 2006Liebe et al., 2012). Interestingly, all the different phases of virtual movement during a navigation game induced an increase of 4-8 Hz oscillation in both the hippocampus and neocortex in human (Ekstrom et al., 2005). Although the present experiment involves the observation of human locomotion it cannot be assumed that the recorded theta oscillations are specifically related to locomotion per se. Indeed, theta oscillations are now considered as a basic physiological element involved in global oscillatory synchronization processes linking together multiple brain regions (Buzsáki and Draguhn, 2004;Fries, 2005). For example, the multiplicity of functional roles for this oscillation was demonstrated by the fact that the amplitude of theta power recorded over the temporal and frontal cortex predicted the behavioral performance of the subject (Sederberg et al., 2003). A recent MEG study demonstrated that hippocampal-prefrontal theta synchronization plays a mnemonic guidance in human decision-making (Guitart-Masip et al., 2013). Single neurons and local field potential recordings in the human medial temporal lobe show that theta phase locking reflects a global activation providing a temporal window for the conscious recognition (Rey et al., 2014). At a lower hierarchical level closer to the present observational task, theta oscillation is related to the perception of color shape of object and visual attention (Fries et al., 2001b). It is also involved in different sensory modalities to provide meaningful chunks of neuronal signals allowing subsequent decoding for an enhanced perception. In our case, such theta oscillation may thus be viewed as taking part of time-division multiplexing mechanism representing sequential information upon which a neuronal code may emerge by cross-frequency interaction with faster (gamma) oscillation (Akam and Kullmann, 2014).

Gamma modulation
Gamma oscillation (30-100 Hz) occupies a privileged position in cognitive neuroscience. The current understanding of gamma oscillation points to its emergence from the synchronous activity of a large ensemble of firing neurons (Eckhorn et al., 1988;Gray et al., 1989;Jensen and Colgin, 2007). It is central to the binding theory, in which gamma oscillations combine different features in a visual scene to form a coherent percept (Singer, 1999). Unexpectedly, our results show that Upside-down condition elicited gamma power increase at about 150 and 400 ms and a gamma ERD at the same latency in the Uncoordinated condition. This contrasting behavior of gamma oscillation is interesting because these oscillations are considered to underlie perception of coherent stimuli. These data are in accordance to previous MEG study showing enhancements in gamma rhythm at 100 ms after display onset in upright and inverted pointlight walker (Pavlova et al., 2004). However, in the latter study additional gamma peak appeared only for upright point-light walker at 130 and 170 ms. Another study of the same group reported increased gamma activity in the left parieto-occipital region at 80 ms, with additional peaks in attended point-light walker on the right parietal and temporal cortex at 120 and 155 ms, respectively (Pavlova et al., 2006). The present results are in agreement with the data of Tallon-Baudry et al. (1996), where they demonstrated the presence of non-phase-locked gamma activation (60 Hz) at about 300-400 ms after the presentation of an illusory Kanizsa triangle figure (Tallon-Baudry et al., 1996). In this experiment, the gamma activity was stronger when the recognition task required additional mental reconstruction (stronger gamma oscillation for illusory triangle than normal triangle). In our present study, a non-phase-locked 60 Hz power increase occurred at the same latency only when the walking avatar was presented in Upside-down configuration. The subject was not instructed to perform any mental task but implicit recognition can recruit gamma activity for unconscious and conscious neuronal process (Aru et al., 2012;Vidal et al., 2014). The complex interplay between these neuronal qualia occupies a central position in cognitive neuroscience (Kandel, 2013). In the context of the Global Workspace Theory, serial and parallel processing take part from the widespread treatment of unconscious information to the emergence of consciousness (Baars, 1997;Dehaene and Naccache, 2001;Baars et al., 2013;Dehaene et al., 2014).
From a physiological perspective, experiments and modeling have demonstrated that gamma rhythms emerge from the interaction between local excitation and inhibition (Traub et al., 1997;Brunel and Wang, 2003;Kang et al., 2010), in which the gap junctions between interneurons play a pivotal role in ensuring gamma oscillation coherence Whittington and Traub, 2003). In macaque, high density electrocorticography recording  demonstrated that natural viewing induced a strong gamma oscillation (50-80 Hz) over most of the recorded visual cortex including V1 and V4 but not over most of the remaining cortex extending from superior temporal sulcus to the anterior part of the arcuate sulcus. The functional link between neuronal spikes and local field potential oscillation has been well documented in different preparations, demonstrating that spike-field coherence in the gamma-band frequency is accompanied by power enhancement of the gamma rhythm (Fries et al., 2001a(Fries et al., ,b, 2002(Fries et al., , 2008. It was also demonstrated that when visual stimuli are moving smoothly, the visual cortex produces neuronal synchronization in the gamma-frequency band (Friedman-Hill et al., 2000). This gamma synchronization is considered as a key element for signal transmission to postsynaptic targets and to assume the continuity of the visual message (Fries, 2009). In this context, the gamma ERD recorded during the Uncoordinated condition could be explained by a previous experience of Kruse and Eckhorn (1996) realized in the primary visual cortex of the cat. When a smooth movement of the visual field was presented it induced gamma oscillation, but when the smooth movement was intermingled with sudden random acceleration in and against the original direction of the smooth movement the gamma oscillations disappeared (Kruse and Eckhorn, 1996). This latter situation corresponds to the present Uncoordinated condition where gamma ERD replace gamma ERS present in Normal and Upside-down condition. The smoothness aspect of the walking video for both Normal and Upside-down presentations induces gamma oscillation while the sudden "Uncoordinated" image desynchronizes the neuronal population responsible for the gamma oscillation. In addition, Kruse and Eckhorn (1996) have demonstrated an inverse relationship between the decrease in gamma power and an increase in the stimulus-locked responses in lower frequency band (Kruse and Eckhorn, 1996).
Following the canonical microcircuit model (Bastos et al., 2012) based on intracellular recordings in cat visual cortex incorporating the neuronal sources of forward and backward connections in cortical hierarchies, it was proposed that the superficial pyramidal neurons generate gamma responses whereas deep pyramidal neurons generate alpha and beta dynamics. The visual cortex has been suggested to act as a dynamic filter of the visual input where stimulus properties like movement, contrast, localization and size of visual cues may modify the configuration of gamma oscillation (Gray et al., 1989;Ray and Maunsell, 2010;Brunet et al., 2013;Roberts et al., 2013). Among these stimulus properties, contrast is able to enhanced the signal-to-noise ratio of the sensory input inducing an increase in the postsynaptic gain of superficial pyramidal cells implicate in gamma oscillation (Feldman and Friston, 2010). Although, the same avatar was used here in the three different conditions the kinematic change of the Uncoordinated condition induced a significant increase in the dynamic contrast at the latency of 400 ms and may thus explain the late gamma ERD present in this condition. The spatial summation and the receptive field organization in V1 depend on contrast stimulus (Sceniak et al., 2002). The effects of contrast on the induced rhythms are complex and specifically influence the postsynaptic gain of neuronal populations, the strength of intrinsic and horizontal connectivity which can be differentially engaged depending on stimulus properties (Pinotsis et al., 2014). These authors have reported that the increase in visual contrast induces an increase of gamma peak frequency (from 46 to 58 Hz) accompanied by a decrease in gamma power. This contrast effect on the gamma power must be taken in account in the present gamma ERD and is complementary to the previous Kruse and Eckhorn's (1996) reported effect on the gamma power when the visual movement is Uncoordinated.

SSVEP
In order to strengthen the ERP, ERSP, and ITC studies of the transient presentation of the walking video, a SSVEP approach was made by using the heel strike events as the synchronized item of the video images occurring at every 100 ms. SSVEP offer many advantages in comparison to ERP, including better signal-to-noise ratio with a clear peak in the FFT occurring at the frequency of interest and some of its harmonics, and greater number of averaged items in a shorter period of time. SSVEP are classically obtained by using neutral LED or LCD image flashing between 1 and 100 Hz inducing resonance phenomena. In the present study, the SSVEP was not obtained by directly triggering all of the images occurring at 10 Hz but by using a specific event of the avatar locomotion corresponding to the initiation of the step cycle. SSVEP is classically considered as an oscillatory response of the visual cortex evoked by contrast or luminance-modulated stimuli that drive the neural response at the imposed frequency of the constant peripheral stimulation (Regan, 1966;Müller et al., 1998). SSVEP are not only imposed by the physical properties of the stimulus but also depends on the brain state, task related-process, bottom-up and top-down influences (Müller et al., 1998;Keil et al., 2003;Andersen and Müller, 2010). This oscillatory pattern is a strong steady state potential that mainly arises from the occipital area, with strong contribution from the early visual cortex but also from more extended parts of the visual system including higher visual areas (Müller et al., 1997;Di Russo et al., 2007). This partly explains why SSVEP approach is increasingly used in cognitive and affective neurosciences to study face processing including face identification and decoding of facial emotional expressions (McTeague et al., 2011;Ales et al., 2012;Gruss et al., 2012;Rossion et al., 2012). To our knowledge, the present study is the first to use a walking avatar video for inducing SSVEP-like response. It showed specific amplitude enhancement of the oscillatory pattern and the related spectral perturbation at a precise time in relation to a kinematic event. Although no direct comparison can be made between SSVEP and ERP results, it is interesting to highlight the convergence of both types of results with regard to significant changes in the EEG brain rhythms at about the latency of 300 ms when the same avatar video was presented in Upside-down vs. Normal condition. The reported differences in the SSVEP configuration and rhythmic alteration (early theta-alpha ERS and late gamma ERS) in the Uncoordinated condition can be due to the higher dynamic contrast of this condition with respect to the other two conditions.