Impact Factor 2.990 | CiteScore 3.5
More on impact ›


Front. Psychol., 23 March 2015 |

The self in conflict: actors and agency in the mediated sequential Simon task

  • 1Helsinki Institute for Information Technology HIIT, Aalto University, Espoo, Finland
  • 2Department of Computer Science, University of Helsinki, Helsinki, Finland
  • 3Department of Social Research, University of Helsinki, Helsinki, Finland
  • 4School of Business, Aalto University, Helsinki, Finland

Executive control refers to the ability to withstand interference in order to achieve task goals. The effect of conflict adaptation describes that after experiencing interference, subsequent conflict effects are weaker. However, changes in the source of conflict have been found to disrupt conflict adaptation. Previous studies indicated that this specificity is determined by the degree to which one source causes episodic retrieval of a previous source. A virtual reality version of the Simon task was employed to investigate whether changes in a visual representation of the self would similarly affect conflict adaptation. Participants engaged in a mediated Simon task via 3D “avatar” models that either mirrored the participants’ movements, or were presented statically. A retrieval cue was implemented as the identity of the avatar: switching it from a male to a female avatar was expected to disrupt the conflict adaptation effect (CAE). The results show that only in static conditions did the CAE depend on the avatar identity, while in dynamic conditions, changes did not cause disruption. We also explored the effect of conflict and adaptation on the degree of movement made with the task-irrelevant hand and replicated the reaction time pattern. The findings add to earlier studies of source-specific conflict adaptation by showing that a visual representation of the self in action can provide a cue that determines episodic retrieval. Furthermore, the novel paradigm is made openly available to the scientific community and is described in its significance for studies of social cognition, cognitive psychology, and human–computer interaction.


Cognitive control refers to the ability to withstand temptation and avoid distraction in order to reach certain goals. This is true for definitions from both social and clinical studies – in which such goals are generally longer term, abstract and self-referencing (Baumeister et al., 2000) – and cognitive science – in which they tend to be short term (“in the next block”), very specific (“press a button as cued by the center of the stimulus, not its flankers”) and referencing a specific task designed by the experimenter (here Eriksen and Eriksen, 1974). Despite these differences, cognitive control is commonly portrayed as a kind of limited resource that allows us to handle conflicts and interferences: should the resource run low, we may fail to act quickly or correctly.

This somewhat dualistic characterization of control is reflected in models formalizing conflict and control in terms of models featuring two routes. A stimulus can trigger, quickly or automatically, responses that are typical for our normal functioning: the urge is to deal with this token stimulus as with any other of its kind. A secondary type of processing works its slow, willful way top–down from a goal level toward the more complex processing of the stimulus. For example, in the popular Stroop task (Stroop, 1935), in which we are asked to respond “green” if the word “red” is written in green, we are almost overwhelmed by the automatic reaction to repeat our well-rehearsed training and “read out loud” the word, rather than mind the coloring. Thus, the conflict is between competing responses of the two routes, while the executive control is supposed to suppress the incorrect response.

Despite the apparent simplicity of dual-route models, they do elegantly account for a more recently found effect called conflict adaptation. The effect has also been referred to as Gratton effect, or sequential conflict modulation effect, and refers to the observation that after experiencing one instance of conflict, subsequent conflict becomes easier. The effect seems to extend across diverse conflict tasks, including the Stroop task (Egner and Hirsch, 2005b; Spapé and Hommel, 2008), the Simon task (Simon and Rudell, 1967; Hommel et al., 2004) and the Eriksen Flanker Task (Eriksen and Eriksen, 1974; Gratton et al., 1992). To improve clarity, we shall refer to the conflict adaptation effect (CAE) independently from the specific paradigm in which it is encountered, formulating it as:


In which capital C s and I s denote currently compatible (congruent, non-conflicting) and incompatible (incongruent, conflicting) trials, whereas lower case c s and i s refer to preceding (often termed N-1) compatibility and incompatibility. The formula thus quantifies the effect as the reduction of conflict-effects as a function of preceding trials.

Dual-route models of executive control account for the CAE by suggesting that a conflicting trial – the word “red” in green – triggers the recruitment of attentional resources to cope with the response uncertainty (Botvinick et al., 2001). Depending on the preferred model, this would mean for our example either that task-relevant route (the color-response association) is facilitated, or that part of the irrelevant stimulus processing route (the word-response association) is suppressed. The result is more or less the same: if, on a subsequent trial, the word “green” is presented in red, the system should be able to cope with ease: both our enhanced color-route, or our attenuated verbal route leaves us well-prepared for correct action.

However, recent observations suggest dual-route models may not adequately account for localized, or context dependent conflict adaptation. For example, if attentional resources are generically recruited after experiencing conflict, one should predict smaller subsequent conflict effects, independent of the task – which is not always the case (Notebaert and Verguts, 2008). Furthermore, even within a task, changing a task-irrelevant feature between two Stroop (Spapé and Hommel, 2008) or Simon (Spapé et al, 2011) displays, critically reduces the CAE. Finally, the outcome of conflict in terms of reward has also been shown to affect the CAE (Van Steenbergen et al., 2009). It seems, then, that a unitary, limited resource type of executive control would fail to account for these observations.

Sequences of conflict, however, involve many more cognitive functions than just executive control. To understand what happens in any kind of task repetitions, it is necessary to take a more detailed look at the specific features involved in sequences of conflict. For one, it has been argued that if conflict changes (i.e., cI and iC sequences), some part of the stimulus or response must be different as well, whereas if the conflict does not change (in cC and iI), there is usually a proportion of trials in which the whole stimulus-response scenario is repeated. In other words, priming – rather than cognitive control – was pointed out to be at least partly responsible for the CAE pattern (Mayr et al., 2003).

Further aggravating the situation was the observation by Hommel et al. (2004) who showed that increased errors and reaction latencies observed in cI and iC sequences could be traced back to their constituent features partly repeating. Following in the footsteps of Kahneman et al. (1992), they provided evidence that if one scenario (e.g., an arrow left pointing to the left) is similar to a previous representation in that features are repeated (an arrow left pointing to the right), an episodic retrieval effect ensues. This is problematic for two reasons: (1) the repeated feature (the location of the arrow) thus prompts a no longer relevant and indeed conflicting response; and (2) the partial overlap itself may be problematic for the cognitive system (Treisman, 1996; Hommel et al., 2001).

It is thus possible that the workings of episodic retrieval, memory and a type of pattern recognition may account for both the CAE and the context dependency of the CAE. This “stronger” account suggests that the data can fully be accounted for by referring to the “lower-level” functions involved in priming (Mayr et al., 2003), episodic retrieval (Hommel et al., 2004) and contingency learning (Schmidt and Besner, 2008). Thus, there would be very little theoretical need to postulate the extra limited resource to sometimes come to our aid and cognitive control is reduced to an illusory epiphenomenon of free will.

Alternatively, a mechanism featuring episodic retrieval causing conflict adaptation could reconcile “pure control” with context dependency effects. As we have argued before (Spapé and Hommel, 2008, 2014), it is possible that the similarity of situations between two trials may not only retrieve the previous episodes in terms of their constituent features, but also in terms of control parameters. Thus, tasks involving an amount of similarity, because, e.g., a Simon stimulus gradually rotated into its new position, causing updated episodic memory (Spapé and Hommel, 2010) or a voice presenting an auditory Stroop stimulus is repeated (Spapé and Hommel, 2008), may result in conflict adaptation. Conversely, gradually rotating the Simon display to the wrong position or presenting a stimulus in a different tone of voice may interfere with retrieval of executive control (for a similar proposal, see Egner, 2014).

Present Study

The mapping of contingencies of conflict adaptation thus remains important while the debate concerning the status of conflict adaptation continues. The present study was somewhat inspired by the earlier cited observation of the context dependency of the CAE (Spapé and Hommel, 2008). In that study, the words “high” and “low” were mixed with high and low tones, and participants were asked to judge the pitch of the tones and ignore the words. A type of Stroop effect was observed—participants found it difficult to not imitate the voice—as well as conflict adaptation—the Stroop effect was smaller after incompatible trials. The context dependency was in the voice: although it was entirely irrelevant to the task, changing the voice from one gender to the other caused interference with the CAE.

A visual version of this task was designed for the present study, with one critical change: the degree of ownership over the contextual change. Rather than changing something entirely irrelevant as in the original study, or changing the task itself (Notebaert and Verguts, 2008), we set out to change the degree to which the change was related to the person involved in the task. Participants were engaged in the task in two conditions: directly or mediated by a visual representation of themselves, which we will refer to as the “avatar.” Similar to the original study, this avatar served as a contextual cue, and could either alternate or repeat between two genders. Although entirely irrelevant to the task, changes in avatar identity should, according to the episodic retrieval account of the CAE, affect the conflict-control pattern. That is, repeating the avatar should act as a cue, prompting retrieval of the preceding trial and possibly its conflict-related aspects. Changes in the identity of the avatar should, conversely, interfere with retrieval and thereby reduce the CAE.

However, to go beyond previous studies related to the context-dependency of the CAE, we investigated whether the relationship between the participant and their virtual identity would have an effect on conflict and control. By using a motion tracking device, we established a sense of agency over the avatar, projecting it as standing in front of the participant and mimicking the participants’ gestures. Previous studies used similar techniques in order to manipulate the representation of the self toward the virtual identity (Lenggenhager et al., 2007). In the present experiment, we contrast this “dynamic” condition in which the avatar is displayed as co-acting the participant’s gestures, with a “static,” control condition in which the avatar did not move.

On the one hand, creating a sense of agency over the avatar by making it respond to the task necessarily increases the degree to which the avatar is task-relevant. Given that conflict-resolution has previously been found to work on task-relevant features (Egner and Hirsch, 2005a), a conflict-control point of view would predict changes in a task-related avatar’s identity to be of greater impact than changes in a static, and therefore neutral and irrelevant, picture. On the other hand, however, the degree of agency over the avatar could create the impression that the avatar is “part of” the participant. Thus, a superficial change in the visual appearance of the self-related object should be negated by the sense that it acts as a pointer toward the distal representation: the participant him or herself.

The motion tracking device furthermore enabled us to go a step beyond the traditional reaction times (RTs). Recent studies used single-handed pointing movements (Buetti and Kerzel, 2008) and mouse pointer trajectories (Scherbaum et al., 2010) and analyzed movement trajectories in order to dissociate conflict mechanisms underlying the Simon effect. In these studies, the spatial location of a stimulus was found to cause a shift in movement trajectory toward the stimulus (Buetti and Kerzel, 2009). Here, we explored whether this continuous, “visuomotor” Simon effect (Wiegand and Wascher, 2005) could similarly be observed in a gesture-based, two-handed paradigm. Similar to these studies, we expected the visual location of the stimulus to evoke unintentional movement toward that location. However, in this two-handed study, such movement should occur in the other hand, even though it is irrelevant for executing the desired gesture. To our knowledge, there are as yet no studies directly testing the conflict dependency of the CAE on this type of movement trajectory measure, but we expected the pattern of the irrelevant movement (IM) to largely follow that of traditional RT.

Materials and Methods


We partly based the number of participants on similar episodic studies, such as Spapé and Hommel (2008), who observed a sizable effect size of identity switches on conflict control of ηp2 = 0.56 with 14 subjects. However, given the unknown, additional factor of avatar animation, and the novel apparatus in use, we ultimately recruited 18 volunteers (seven female). They were 27.1 ± 3.2 years of age and took part in the study in exchange for cinema tickets. Before signing informed consent, they were informed of their rights in accordance with the Declaration of Helsinki. One (female) participant could not complete the study and was removed from further analysis.

Apparatus and Stimuli

The Xbox-360 Kinect (Microsoft, Redmond, WA, USA) is a motion sensing input device that uses a depth camera to track up to six persons and estimate full skeletal tracking information of two persons. Its sensor has a frame rate of 30 Hz, a field of view of 57° × 43°, and 27° of vertical tilt range, to obtain information for estimating the 3D spatial position of 20 joints for each body. In the study we used it for tracking the position of both hands relative to the torso. Furthermore, we calculated the participant’s joint orientation. In the dynamic condition of the present study, the detected joint orientation was projected onto the avatar, giving it participant-avatar congruence in bodily motion.

Figure 1 shows the basic characteristics of the Simon task, which was displayed on a 95.17 cm × 57.10 cm virtual screen which itself was projected on a 254 cm × 142.875 cm Screenline real screen. All task related stimuli – the circles, stars, and fixation crosshair – were 28.55 cm × 28.55 cm. Left and right locations were defined as occurring at, respectively, 28.58 cm left and right from the center of the screen. The 3D character, referred to as the “avatar,” was presented at a location below and slightly overlapping the central fixation, as to give the impression that it was standing in between the participant and the virtual screen. It was 25.32 cm × 105.51 cm in size (of which the lower ca. 30 cm not visible) and was of either male or female gender.


FIGURE 1. Schematic display of the trial procedure for compatible and incompatible conditions. Participants were instructed to keep their hands together, until the target stimulus (a circle or star) was displayed, prompting a left or right handed action, respectively. Both positive (pictured) and negative feedback was displayed in the first 16 trials, whereas during the rest of the experiment, either negative feedback – following incorrect reactions – or blank virtual screens – following correct actions – were used. The Simon effect refers to the effect of incompatibility between stimulus- and response-location, as is the case in the lower middle panel.


After reading written instructions, participants witnessed a demonstration of the experiment involving one of the authors undertaking 16 trials to show the task. Participants were then asked to stand at a distance between 2.5 and 3.5 m from the screen with the arms spread wide, while the instruments were calibrated. If participants had no further questions, they were asked to move their hands together to start the first trial of the experiment.

Every trial started with a fixation crosshair, displayed for ca. 1 s of stable identification of both participant’s hands remaining near the center of their body. Then, a star or circle was presented to the left or right of the virtual screen. Participants were instructed to move their left arm left if a circle was shown and their right arm right if a star was shown, irrespective of the location of the stimulus. Movements were detected if the participant moved either hand 20 cm lateral to their shoulders, at which point the star or circle was removed from the screen. Only once the participant moved both their hands back together would the next trial begin. Avatars were presented throughout the experiment as either “static” or “dynamic,” the latter case referring to the scenario that the movements of the participants were reflected in the movements of the avatar.

Design and Measurements

The general design of the experiment was based on 2 (locations, left vs. right) × 2 (shapes requiring left vs. right responses) × 2 (avatar identities) × 2 (animations) × 16 = 256 trials with one block of 128 trials for each type of animation, presented in counter-balanced order with equal numbers of compatible (location = response) and incompatible (location ≠ response) trials. The analysis was based on two four-way repeated measures ANOVAs with animation (static vs. dynamic), avatar repetition (vs. alternation), previous compatibility (vs. incompatibility), and current compatibility (vs. incompatibility) as factors. Within each block, a restricted random sampling procedure was used to generate at least 12 occurrences for each design cell.

Two measurements were tested independently: RT and incorrect movement (IM) velocity. The RT was measured as the difference between the onset of the target stimulus (i.e., the circle or star) and the time at which a displacement of either of the participant’s hand was detected at least 20 cm relative to the corresponding shoulder. The IM was measured as the peak velocity of the average movement trajectory of the inactive hand prior to the final movement (occurring on average at 601 ± 25 ms after target onset). The movement of the correct hand was also recorded, but not analyzed, as it is confounded with RT (see Figure 2).


FIGURE 2. Measurements and results. (A) Grand Averages of the compatible-after compatible (cC) and incompatible-after-compatible (cI) conditions for the correct and incorrect hand over time. The two arrows indicate the difference between the two dependent variables used in (B). (B) Conflict adaptation effect (CAE) for each dependent variable and each combination of avatar animation and repetition. Vertical bars indicate one standard error.


The first eight trials as well as the first trial in each block were considered still part of training and removed from analysis. All trials with slow (RT > 1000 ms) or incorrect reactions were also removed, as well as the first trial directly after such scenarios, constituting 9.1 ± 6.3% of trials.

In repeated measures ANOVAs with animation of the avatar (static vs. dynamic), the repetition of the avatar (repeated vs. alternated), the previous compatibility (vs. incompatibility), and current compatibility (vs. incompatibility) on RT and IM, current compatibility significantly affected both RT, F(1,15) = 194.64, MSE = 785.36, p < 0.001, ηp2 = 0.92, and IM, F(1,15) = 26.01, MSE = 26.52, p < 0.001, ηp2 = 0.62. This suggested a robust Simon effect, with incompatible conditions being associated with slower RTs (ca. 47 ms) and more IM than compatible ones. Previous compatibility also significantly affected RT, F(1,16) = 29.31, MSE = 158.26, p < 0.001, ηp2 = 0.65, and IM F(1,16) = 13.10, MSE = 14.98, p = 0.002, ηp2 = 0.45, with compatibility in the preceding trial resulting in faster RTs, but less IM.

Neither of the other main effects was significant for RT, p s ‖ 0.59, and IM, p s ‖ 0.20. In general, the IM measure showed a pattern similar to the RT, with interacting variables significantly affecting either both RT and IM, or neither. However, one effect was uniquely observed for one measure: compatibility significantly interacted with avatar identity, F(1,17) = 4.60, MSE = 10.70, p = 0.048, ηp2 = 0.22, for IM only. This indicated that the compatibility effect was larger (C-I = 40.4 pts) after repeated than after alternated (23.3 pts) avatar identities.

Critically, a significant interaction effect between previous and current compatibility was observed for both measures, RT F(1,15) = 80.31, MSE = 545.71, p < 0.001, ηp2 = 0.83; IM F(1,15) = 13.02, MSE = 16.88, p = 0.002, ηp2 = 0.45. This showed a clear replication of a CAE, with the effect of incompatibility being reduced following incompatibility, for both RT (cC – cI = 73 ms, iC – iI = 22 ms) and IM (cC – cI = 49.8 pts, iC – iI = 13.9 pts). Finally, a significant four-way interaction suggested conflict adaptation to be dependent on both the repetition of the avatar, and its animation, RT F(1,15) = 5.25, MSE = 84.36, p = 0.04, ηp2 = 0.25, and IM F(1,15) = 10.37, MSE = 8.60, p = 0.005, ηp2 = 0.39.

To better understand the significant four-way interaction, we calculated the interaction term for each individual combination of avatar animation and avatar repetition. These CAE scores represent the decrease in the conflict effect as a function of preceding trial and are summarized in Figure 2. As can be seen from the figure, a maximal CAE was observed in repeated, static conditions for both RT and IM, indicating a replication of a standard CAE or Gratton effect (Gratton et al., 1992; Botvinick et al., 2001). CAEs were lower during static, alternated trials, with the CAE in IM turning to insignificance (4.15 ± 16.58 pts), replicating previous observations of the context dependency of the CAE. However, this context dependency itself was modulated by the animation of the avatar as, with dynamic conditions, the alternated avatar identities no longer caused a disruption of the CAE.


The results show that both the identity of the avatar, and its relation with the participant, affect cognitive performance. In general, participants suffered from a smaller conflict effect after conflict was repeated. Replicating previous studies suggesting conflict adaptation acts locally, or depends critically on irrelevant cues, the CAE was found to be disrupted if the identity of the avatar was changed. In other words, despite the avatar itself being entirely irrelevant to the task, a subtle change in its appearance reduced the CAE. This could be due to the change in cue disrupting recall of the preceding episode, disrupting feature integration and perhaps recall of control-related parameters.

One might imagine, as we sketched in the introduction, that perceiving the avatar as actively mimicking the participant’s actions would make it necessarily related to the task, as opposed to, as in the static case, an accidental bystander. Consequently, a change in the mirror image could constitute a particularly disrupting, if not disturbing event: after all, such an imaginary change in self-perception is a classic motif in horror stories (Dietrich, 1992) and a symptom in psychiatry (Maack and Mullen, 1983). Whether frightful or merely task-relevant, the predicted effect of avatar changes should from this perspective be larger in animated than in static conditions.

However, this prediction clearly did not hold. Conditions in which the avatar was displayed dynamically, with its movements mimicking those of the participant, showed no longer the disruptive effect of identity changes on the CAE. Indeed, if anything, the effect sometimes even seemed to increase after a change.

One way to account for this could be in terms of an integration process that makes the avatar similar to “a tool” as held by the participant. In the rubber-hand illusion, seeing an object being stroked and feeling the sensation on the real hand brings about the perception that the virtual object is part of oneself (Botvinick and Cohen, 1998). Here, a virtual persona is likewise presented in synchrony with the participant’s actions. By acting consistently in concert with the subject, it is likely that a bi-directional association is formed (Hommel, 1996), between one’s own intentions and the behavior carried out by the avatar. Such bidirectional association has recently been shown to elicit a certain unity between model and imitator, as shown by facilitated action execution if a model anticipates imitation rather than counter-imitation (Pfister et al., 2013).

Thus, if perceiving the dynamic avatar results in similar co-representation, the result could be that in the dynamic condition, the avatar is not necessarily an aspect of the task anymore, but an aspect of the agent. This, in turn, should have a critical effect on control in the degree to which the new and the old trial relate: the superficial identity of the avatar may have changed, but it should still point toward the same distal (Hommel, 2009) property. The repetition would then act as an episodic recall cue for the preceding trial, in which the same agent (i.e., the participant him- or herself) was present. In other words, different task-related, whether relevant or irrelevant, features may retrieve preceding, potentially partially overlapping trials, but changes in the avatar still relate to the self-same agent, who was always present in the preceding trial as well.

A competing explanation for the findings could be that the dynamically portrayed avatar made it more difficult to see changes affecting the identity of the avatar. However, this seems to run counter previous studies showing effects on conflict adaptation to remain even with stimulus displays featuring dynamic contextual cues (Spapé and Hommel, 2014). Alternatively, the animation itself was not critical in disturbing the context dependency of the CAE, but the fact that the animation was congruent with the participant’s own movement. This form of agency could perhaps counteract the effect by inducing a type of “change-blindness” (Simons and Levin, 1997) to the changes in identity. In the end, however, this forward-interfering account seems presently difficult to distinguish from the earlier, retrieval-based one.

Finally, we would like to discuss some novel aspects of the platform and methodology used in the experiment, as with the publication of this article, we release it as open source, freely available (source1) to the academic community. The compressed archive contains source, binaries and a short documentation file (see README.txt inside archive). Notice that, apart from the dynamic and static conditions referred to in the present manuscript, the platform also allows pre-programmed avatar animations with an onset equal to the average RT of the participant. We decided not to use these animations for the present study, as we had no predictions for model-imitator incongruency at the time (but see Pfister et al., 2013), but we could well-imagine this option could be of potential interest to fellow researchers.

The first aspect to note, particularly of interest for studies of conflict control, could be in the use of motion tracking. Although the field remains dominated by simple RTs and 2–4 alternatives forced-choice paradigms, current theoretical models, neuroscience methods and motor control paradigms (Scherbaum et al., 2010; Spapé and Serrien, 2010; Serrien and Spapé, 2011) indicate that focusing on the far end-point of an action – the time at which a button is fully pressed – ignores valuable data. Although previous studies found compatibility affecting response force as well as RT (van der Lubbe et al., 2001), the present study goes further to show the time-course of response conflict in the irrelevant response modality. It is possible that the other hand provides a more optimal indicator of conflict than the correct hand, as it is presumably less affected by early control operations that may partially negate the final RT. Of course, previous studies have circumvented the issue by providing measures related to the activation of the irrelevant motor cortex (Valle-Inclán, 1996) and muscles (Hasbroucq et al., 1999). However, the presented IM measure has the advantage of being very directly related to irrelevant response tendency as well as being rather cost-effective in terms of expenses of consumer grade apparatus and the time involved for participants and researchers (no recording preparation or calibration requirements).

The second aspect of the study that merits further discussion is the virtualized design. The experiment in a wider setting may provide a relatively low-cost virtual reality platform for studies of cognition and social identity. Here, we showed effects of changing one’s identity, implying that the setup can be a useful tool for the study of social and virtual identity. Social psychological effects, such as social facilitation (Zajonc, 1965) and conformity (Asch, 1951) can be easily tested without relying on confederates by adding extra avatars and operating them remotely (see Blascovich et al., 2002 for an overview of the benefits of immersive virtual environments). Tests of implicit stereotyping and embodied cognition could involve the adjustment of the shape of the avatar to enable identification with various cultural stereotypes. In sum, the study demonstrates that the present design (open source code1) may provide an interesting, new way for a variety of researchers and fields of study.

Finally the study blends the fields of executive control and conflict with the study of human–computer interaction (HCI). Given the growing diversity of input techniques and the heterogeneity of user interfaces, basic psychological studies can inform design by taking into account how different interaction techniques inflict conflict or provide control. User interfaces, such as employed in the study are increasingly becoming part of everyday consumer products such as game consoles (Harper and Mentis, 2013) and public displays (Kuikkaniemi et al., 2011). This has prompted research in HCI to reconsider embodied interaction with virtual representations (Wilson et al., 2012). The study also demonstrates self-representing avatars may positively contribute to interfaces designed for scenarios with common distraction and a high demand for attentional control. This should motivate further investigation of effects of avatars on various persuasion phenomena on a wide range of different application contexts.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


This work was supported by the Academy of Finland, project number 268999.


  1. ^


Asch, S. E. (1951). “Effects of group pressure upon the modification and distortion of judgments,” in Groups, Leadership, and Men, ed. H. Guetzkow (Pittsburgh: Carnegie Press), 177–190.

Google Scholar

Baumeister, R. F., Muraven, M., and Tice, D. M. (2000). Ego depletion: a resource model of volition, self-regulation, and controlled processing. Soc. Cogn. 18, 130–150. doi: 10.1521/soco.2000.18.2.130

CrossRef Full Text | Google Scholar

Blascovich, J., Loomis, J., Beall, A. C., Swinth, K. R., Hoyt, C. L., and Bailenson, J. N. (2002). Immersive virtual environment technology as a methodological tool for social psychology. Psychol. Inq. 13, 103–124. doi: 10.1207/S15327965PLI1302_01

CrossRef Full Text | Google Scholar

Botvinick, M. M., Braver, T. S., Barch, D. M., Carter, C. S., and Cohen, J. D. (2001). Conflict monitoring and cognitive control. Psychol. Rev. 108, 624–652. doi: 10.1037/0033-295X.108.3.624

CrossRef Full Text | Google Scholar

Botvinick, M. M., and Cohen, J. (1998). Rubber hands’ feel’touch that eyes see. Nature 391, 756–756. doi: 10.1038/35784

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Buetti, S., and Kerzel, D. (2008). Time course of the Simon effect in pointing movements for horizontal, vertical, and acoustic stimuli: evidence for a common mechanism. Acta Psychol. 129, 420–428. doi: 10.1016/j.actpsy.2008.09.007

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Buetti, S., and Kerzel, D. (2009). Conflicts during response selection affect response programming: reactions toward the source of stimulation. J. Exp. Psychol. Hum. Percept. Perform. 35, 816–834. doi: 10.1037/a0011092

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Dietrich, B. D. (1992). The ghost of the corpus callosum. Semiotics 279–287. doi: 10.5840/cpsem199211

CrossRef Full Text | Google Scholar

Egner, T. (2014). Creatures of habit (and control): a multi-level learning perspective on the modulation of congruency effects. Cognition 5, 1247. doi: 10.3389/fpsyg.2014.01247

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Egner, T., and Hirsch, J. (2005a). Cognitive control mechanisms resolve conflict through cortical amplification of task-relevant information. Nat. Neurosci. 8, 1784–1790. doi: 10.1038/nn1594

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Egner, T., and Hirsch, J. (2005b). The neural correlates and functional integration of cognitive control in a Stroop task. Neuroimage 24, 539–547. doi: 10.1016/j.neuroimage.2004.09.007

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Eriksen, B. A., and Eriksen, C. W. (1974). Effects of noise letters upon the identification of a target letter in a nonsearch task. Percept. Psychophys. 16, 143–149. doi: 10.3758/BF03203267

CrossRef Full Text | Google Scholar

Gratton, G., Coles, M. G., and Donchin, E. (1992). Optimizing the use of information: strategic control of activation of responses. J. Exp. Psychol. Gen. 121, 480–506. doi: 10.1037/0096-3445.121.4.480

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Harper, R., and Mentis, H. (2013). “The mocking gaze: the social organization of kinect use,” in Proceedings of the 2013 Conference on Computer Supported Cooperative Work, New York, NY, 167–180. Available at: = 2441797

Google Scholar

Hasbroucq, T., Possamaï, C.-A., Bonnet, M., and Vidal, F. (1999). Effect of the irrelevant location of the response signal on choice reaction time: an electromyographic study in humans. Psychophysiology 36, 522–526. doi: 10.1017/S0048577299001602

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Hommel, B. (1996). The cognitive representation of action: automatic integration of perceived action effects. Psychol. Res. 59, 176–186. doi: 10.1007/BF00425832

CrossRef Full Text | Google Scholar

Hommel, B. (2009). Action control according to TEC (theory of event coding). Psychol. Res. PRPF 73, 512–526. doi: 10.1007/s00426-009-0234-2

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Hommel, B., Müsseler, J., Aschersleben, G., and Prinz, W. (2001). The Theory of Event Coding (TEC): a framework for perception and action planning. Behav. Brain Sci. 24, 849–878. doi: 10.1017/S0140525X01000103

CrossRef Full Text

Hommel, B., Proctor, R. W., and Vu, K.-P. L. (2004). A feature-integration account of sequential effects in the Simon task. Psychol. Res. 68, 1–17. doi: 10.1007/s00426-003-0132-y

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Kahneman, D., Treisman, A., and Gibbs, B. J. (1992). The reviewing of object files: object-specific integration of information. Cogn. Psychol. 24, 175–219. doi: 10.1016/0010-0285(92)90007-O

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Kuikkaniemi, K., Jacucci, G., Turpeinen, M., Hoggan, E., and Muller, J. (2011). From space to stage: how interactive screens will change urban life. Computer 44, 40–47. doi: 10.1109/MC.2011.135

CrossRef Full Text | Google Scholar

Lenggenhager, B., Tadi, T., Metzinger, T., and Blanke, O. (2007). Video ergo sum: manipulating bodily self-consciousness. Science 317, 1096–1099. doi: 10.1126/science.1143439

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Maack, L. H., and Mullen, P. E. (1983). The doppelgänger, disintegration and death: a case report. Psychol. Med. 13, 651–654. doi: 10.1017/S0033291700048066

CrossRef Full Text | Google Scholar

Mayr, U., Awh, E., and Laurey, P. (2003). Conflict adaptation effects in the absence of executive control. Nat. Neurosci. 6, 450–452.

Google Scholar

Notebaert, W., and Verguts, T. (2008). Cognitive control acts locally. Cognition 106, 1071–1080. doi: 10.1016/j.cognition.2007.04.011

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Pfister, R., Dignath, D., Hommel, B., and Kunde, W. (2013). It takes two to imitate anticipation and imitation in social interaction. Psychol. Sci. 24, 2117–2121. doi: 10.1177/0956797613489139

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Scherbaum, S., Dshemuchadse, M., Fischer, R., and Goschke, T. (2010). How decisions evolve: the temporal dynamics of action selection. Cognition 115, 407–416. doi: 10.1016/j.cognition.2010.02.004

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Schmidt, J. R., and Besner, D. (2008). The Stroop effect: why proportion congruent has nothing to do with congruency and everything to do with contingency. J. Exp. Psychol. Learn. Mem. Cogn. 34, 514–523. doi: 10.1037/0278-7393.34.3.514

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Serrien, D. J., and Spapé, M. M. (2011). Motor awareness and dissociable levels of action representation. Neurosci. Lett. 494, 145–149. doi: 10.1016/j.neulet.2011.02.077

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Simon, J. R., and Rudell, A. P. (1967). Auditory SR compatibility: the effect of an irrelevant cue on information processing. J. Appl. Psychol. 51, 300–304. doi: 10.1037/h0020586

CrossRef Full Text | Google Scholar

Simons, D. J., and Levin, D. T. (1997). Change blindness. Trends Cogn. Sci. 1, 261–267. doi: 10.1016/S1364-6613(97)01080-2

CrossRef Full Text | Google Scholar

Spapé, M. M., Band, G. P. H., and Hommel, B. (2011). Compatibility-sequence effects in the Simon task reflect episodic retrieval but not conflict adaptation: evidence from LRP and N2. Biol. Psychol. 88, 116–123. doi: 10.1016/j.biopsycho.2011.07.001

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Spapé, M. M., and Hommel, B. (2008). He said, she said: episodic retrieval induces conflict adaptation in an auditory Stroop task. Psychon. Bull. Rev. 15, 1117–1121. doi: 10.3758/PBR.15.6.1117

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Spapé, M. M., and Hommel, B. (2010). Actions travel with their objects: evidence for dynamic event files. Psychol. Res. 74, 50–58. doi: 10.1007/s00426-008-0219-6

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Spapé, M. M., and Hommel, B. (2014). Sequential modulations of the Simon effect depend on episodic retrieval. Front. Psychol. 5:855. doi: 10.3389/fpsyg.2014.00855

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Spapé, M. M., and Serrien, D. J. (2010). Interregional synchrony of visuomotor tracking: perturbation effects and individual differences. Behav. Brain Res. 213, 313–318. doi: 10.1016/j.bbr.2010.05.029

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Stroop, J. R. (1935). Studies of interference in serial verbal reactions. J. Exp. Psychol. 18, 643–662. doi: 10.1037/h0054651

CrossRef Full Text | Google Scholar

Treisman, A. (1996). The binding problem. Curr. Opin. Neurobiol. 6, 171–178. doi: 10.1016/S0959-4388(96)80070-5

CrossRef Full Text | Google Scholar

Valle-Inclán, F. (1996). The locus of interference in the Simon effect: an ERP study. Biol. Psychol. 43, 147–162. doi: 10.1016/0301-0511(95)05181-3

CrossRef Full Text | Google Scholar

van der Lubbe, R. H., Jaśkowski, P., Wauschkuhn, B., and Verleger, R. (2001). Influence of time pressure in a simple response task, a choice-by-location task, and the Simon task. J. Psychophysiol. 15, 241–255. doi: 10.1027//0269-8803.15.4.241

CrossRef Full Text | Google Scholar

Van Steenbergen, H., Band, G. P., and Hommel, B. (2009). Reward counteracts conflict adaptation evidence for a role of affect in executive control. Psychol. Sci. 20, 1473–1477. doi: 10.1111/j.1467-9280.2009.02470.x

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Wiegand, K., and Wascher, E. (2005). Dynamic aspects of stimulus-response correspondence: evidence for two mechanisms involved in the Simon effect. J. Exp. Psychol. Hum. Percept. Perform. 31, 453–464. doi: 10.1037/0096-1523.31.3.453

PubMed Abstract | Full Text | CrossRef Full Text | Google Scholar

Wilson, A., Benko, H., Izadi, S., and Hilliges, O. (2012). “Steerable augmented reality with the beamatron,” in Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology, New York, NY, 413–422. doi: 10.1145/2380116.2380169

CrossRef Full Text | Google Scholar

Zajonc, R. B. (1965). Social facilitation. Science 149, 269–274. doi: 10.1126/science.149.3681.269

CrossRef Full Text | Google Scholar

Keywords: cognitive control, conflict adaptation, feature integration, mediated interaction, episodic retrieval

Citation: Spapé MM, Ahmed I, Jacucci G and Ravaja N (2015) The self in conflict: actors and agency in the mediated sequential Simon task. Front. Psychol. 6:304. doi: 10.3389/fpsyg.2015.00304

Received: 19 November 2014; Accepted: 03 March 2015;
Published online: 23 March 2015.

Edited by:

Snehlata Jaswal, Indian Institute of Technology Jodhpur, India

Reviewed by:

Peter König, University of Osnabrück, Germany
Roland Pfister,Julius Maximilians University of Würzburg, Germany

Copyright © 2015 Spapé, Ahmed, Jacucci and Ravaja. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Michiel M. Spapé, Helsinki Institute for Information Technology HIIT, Aalto University, Open Innovation House, Otaniementie 19-21, 002150 Espoo, Finland