How “Social” is the social Simon effect?

Dolk, Thomas; Hommel, Bernhard; Colzato, Lorenza  S; Schütz-Bosbach, Simone; Prinz, Wolfgang; Liepelt, Roman

doi:10.3389/fpsyg.2011.00084

ORIGINAL RESEARCH article

Front. Psychol., 06 May 2011

Sec. Cognition

volume 2 - 2011 | https://doi.org/10.3389/fpsyg.2011.00084


Thomas Dolk¹*
Bernhard Hommel²
Lorenza S. Colzato²
Simone Schütz-Bosbach³
Wolfgang Prinz¹
Roman Liepelt¹,⁴

1. Department of Psychology, Max Planck Institute for Human Cognitive and Brain Sciences Leipzig, Germany
2. Cognitive Psychology Unit, Leiden Institute for Brain and Cognition, Leiden University Leiden, Netherlands
3. Independent Research Group “Body and Self”, Max Planck Institute for Human Cognitive and Brain Sciences Leipzig, Germany
4. Department of Psychology, Junior Group “Neurocognition of Joint Action”, Westfälische Wilhelms-University Münster, Germany


Abstract

In the standard Simon task, participants carry out spatially defined responses to non-spatial stimulus attributes. Responses are typically faster when stimulus location and response location correspond. This effect disappears when a participant responds to only one of the two stimuli and reappears when another person carries out the other response. This social Simon effect (SSE) has been considered as providing an index for action co-representation. Here, we investigated whether joint-action effects in a social Simon task involve mechanisms of action co-representation, as measured by the amount of incorporation of another person's action. We combined an auditory social Simon task with a manipulation of the sense of ownership of another person's hand (rubber hand illusion). If the SSE is established by action co-representation, then the incorporation of the other person's hand into one's own body representation should increase the SSE (synchronous > asynchronous stroking). However, we found the SSE to be smaller in the synchronous as compared to the asynchronous stroking condition (Experiment 1), suggesting that the SSE reflects the separation of spatial action events rather than the integration of the other person's action. This effect is independent of the active involvement (Experiment 2) and the presence of another person (Experiment 3). These findings suggest that the “social” Simon effect is not really social in nature but is established when an interaction partner produces events that serve as a spatial reference for one's own actions.

Introduction

Many activities we perform in daily life are carried out together with other people. But how do we mentally represent other people's actions and how does this affect our own behavior?

Recent research suggests that joint action can lead to the representation of one's own and other's actions. This “action co-representation” is thought to facilitate action prediction and coordination of one's own actions with those of others (Sebanz et al., 2006). Evidence for this view stems from the “social Simon task” developed by Sebanz et al. (2003). In the standard Simon task (Simon and Rudell, 1967; Simon, 1990), participants typically carry out spatially defined responses (e.g., left and right key presses) to non-spatial stimulus attributes (e.g., auditory pitch or visual color) that randomly appear on the left or right. For example, participants are required to press a right key whenever they perceive a high-pitched tone and a left key in response to a low-pitched tone. Although stimulus location is completely irrelevant in this task, responses are typically faster when they spatially correspond to the stimulus signaling them. That is, spatial stimulus–response compatibility facilitates task performance, a phenomenon that has come to be known as the Simon effect. Commonly, this effect disappears when a participant responds to only one of the two stimuli, rendering the task a “go–nogo task” (Hommel, 1996). However, if the same go–nogo task is shared between two participants so that each of them operates one of the two responses, a Simon effect is observed (Sebanz et al., 2003) – the “social Simon effect” (SSE).

According to the dimensional overlap model (Kornblum et al., 1990) the standard Simon effect can be explained by a match between the spatially irrelevant dimension of the stimulus and the relevant response dimension (Hommel et al., 2001). Accordingly, responses are assumed to be automatically activated if the stimulus spatially corresponds to the correct response and thus facilitate task performance, whereas a lack of correspondence between stimulus–response pairs leads to response competition.

It is fair to say that the mechanisms underlying the SSE are poorly understood. Some authors have claimed that, due to the fundamentally social nature of perception and action, people automatically co-represent other people's actions (Knoblich and Sebanz, 2006). However, a finding that does not seem to be completely in line with the idea of action co-representation being social, automatic, and mandatory is that the SSE is fully present in autistic participants (Sebanz et al., 2005), who can be assumed to have difficulties processing social information. According to Guagnano et al. (2010), the major role of the co-actor in the social Simon task might be to provide a spatial reference frame that allows coding of one's own action as left or right relative to the other person – just as one's own action alternatives provide a reference frame for relative response coding (Hommel, 1996). Guagnano et al. (2010) further claimed that this reference frame can only be used if the other person is located within a participant's peripersonal space. In line with a spatial reference explanation for the SSE, the authors were able to show that the SSE breaks down if the two co-actors are seated outside of arm's reach. However, this approach does not easily explain why an individual's bad mood (Kuhbandner et al., 2010) or negative relationship with the co-actor (Hommel et al., 2009) eliminates the effect.

In the present study, we make a further attempt to clarify what the notion of action co-representation might mean, what it refers to, and in which sense it might account for the SSE. In essence, it may be possible to distinguish between three concepts of action co-representation, ranging from strong to weak. According to the first, strong concept, the SSE is assumed to be functionally similar to the effect obtained when one person is taking care of both responses (Sebanz et al., 2003). Following this line of reasoning, the SSE is due to the cognitive integration of the co-actor and his/her actions into the actor's body schema. The second, intermediate concept assumes that actors represent information about their co-actor and his/her actions without integrating it with representations of their own body and actions. This co-representation of the self and other provides a reference frame for the (e.g., spatial) coding of an individual's own actions relative to the other person and his/her actions (Guagnano et al., 2010). Thus, rather than being incorporated into the actor's body schema, the co-actor is represented as a social agent who is responsible for the alternative action and is kept separate from one's own body and action. According to the third, weak concept, the co-actor does not function as a social being but mainly by virtue of producing particular events (actions with perceivable effects), which serve as a reference for coding one's own action.

Our experiments proceeded from testing the strongest to the weakest concept. In Experiment 1, we tested whether the SSE is affected by the perceived ownership of another person's hand as suggested by a strong conceptualization of action co-representation (Sebanz et al., 2003; Knoblich and Sebanz, 2006). A reliable paradigm to experimentally manipulate the sense of ownership of another person's hand is the rubber hand illusion (RHI; Botvinick and Cohen, 1998). Here, a rubber hand (or another person's hand) is stroked either synchronously or asynchronously. During synchronous stroking, the subject commonly feels the illusion that the seen rubber (or foreign) hand becomes a part of his/her own body.

We experimentally combined the RHI with an auditory social Simon task. In Experiments 2 and 3, we gradually de-socialized the task situation. In Experiment 2, we tested if we could find evidence of a SSE without the active involvement of the co-actor. In Experiment 3, we excluded the co-actor from the task setting altogether to test the weak concept of action co-representation.

Experiment 1

The aim of Experiment 1 was to investigate whether the SSE relies on or varies as a function of action co-representation induced by the RHI. Participants performed an auditory social Simon task while the perceived ownership of another person's hand (i.e., synchronous vs. asynchronous stroking) was manipulated.

The RHI is assumed to arise from a multimodal conflict between vision, touch, and proprioception (Ehrsson et al., 2004; Tsakiris and Haggard, 2005; Kammers et al., 2009). As vision usually dominates touch and proprioception (Costantini and Haggard, 2007), the RHI emerges as a consequence of synchronous but not asynchronous stroking. When stroking is synchronous, the sense of ownership is strong. As a result, the activity of the other hand should be more strongly attributed to one's own body and thus induce an integration of another person into one's own action representation. Conversely, in the asynchronous stroking condition, the other hand is more likely to be attributed to a different actor (Botvinick and Cohen, 1998) and thus clearly separated from one's own action. This condition was hypothesized to work against the strong concept of action co-representation.

If the SSE relies on the cognitive integration of the co-actor and his/her actions into the actor's body schema (strong concept), synchronous stroking should create a more pronounced SSE compared to asynchronous stroking. However, if an actor tends to represent the co-actor as separate from him/herself and not integrate the other's actions into their own body schema (intermediate concept), synchronous stroking might actually lead to a smaller, rather than a larger SSE than asynchronous stroking does. This is because the asynchronous stroking might increase the saliency of the other person's hand and its actions, and thereby provide a stronger spatial reference for coding the actor's own action.

Methods

Participants

Forty healthy undergraduate students (20 female; 20–25 years of age, mean age = 23.8) with no history of neurological or hearing problems participated in Experiment 1. Twenty served as actual participants (henceforth called actors) and 20 as co-actors (see Figure 1). The participants were all right-handed as assessed by the Edinburgh Inventory (Oldfield, 1971), had normal or corrected-to-normal vision, were naive with regard to the hypothesis of the experiment, and were paid €14 for participating.

Figure 1

Apparatus and stimuli

An auditory go–nogo Simon task was used. In each trial, one of two sounds designed by van Steenbergen (2007), chosen as go (sound A) and nogo (sound B), was presented at approximately 60 dB via two loudspeakers, placed 1 m apart, on either the left or the right side of both participants.

To experimentally induce a sense of ownership of the other person's hand, we made use of the RHI. This involved stimulating the actor's and the co-actor's hand mechanically by means of two computer-controlled stepper motors, each with two identical paintbrushes attached, allowing the precise control of onset, direction, speed, and duration of both steppers independently. Following Lloyd (2007), the distance between both stroking devices was about 22.5 cm.

Subjective measures

Participants rated the perceived strength of the RHI by working through nine statements directly after each induction and experimental phase. The statements were translated from the original RHI Questionnaire (Botvinick and Cohen, 1998) and participants were to agree or disagree on a visual analog scale from left (0 = “completely disagree”) to right (10 = “completely agree”). The first three statements are suggested to capture the core of the illusion (Botvinick and Cohen, 1998; Schütz-Bosbach et al., 2008; Kammers et al., 2009): (1) “It seemed as if I were feeling the touch of the paintbrush in the location where I saw the rubber hand touched”; (2) “It seemed as though the touch I felt was caused by the paintbrush touching the rubber hand”; (3) “I felt as if the rubber hand were my hand.” A successful RHI induction would be indicated by higher ratings after synchronous than asynchronous visual–tactile stimulation.

Task and procedure

The experiment consisted of two consecutive sessions, each including an induction and an experimental phase. To avoid carryover effects, both sessions were separated by a 5-min mandatory break. Prior to the induction phase, participants were seated next to each other. The actual participant (see Figure 1) was always seated on the right and was asked to place his/her left index finger under the stroking device, so that the paintbrush could stimulate the occluded index finger from the knuckle to the fingertip or vice versa. His/her right index finger rested on the right response button. Randomly chosen co-actors, whose performance was not analyzed, were always seated on the left. They rested their left index finger on the left response button (80 cm between the two response buttons) directly under the left stroking device and their right hand on their lap under the table. After participant and co-actor were seated and had placed their hands in the correct positions, a white towel was placed over their shoulders and arms to obscure everything on the table except the co-actor's left and the participant's right hand (see Figure 1).

The experiment started with the induction phase. The stimulation was delivered mechanically by two stepper motors to which paintbrushes were attached. The amount of stimulation (onset, direction, speed, and duration) was precisely matched across conditions. To avoid habituation effects, the speed and direction of the paintbrushes were unpredictable and changed randomly every 5 s. In the synchronous condition, the participant's and the co-actor's left index fingers were stroked in synchrony, with identical location, timing, and trajectory parameters. In the asynchronous condition, the parameters differed between the two stroked fingers, while the total amount of stimulation for both index fingers was the same as in the synchronous condition. Thus, the synchronous and asynchronous stroking conditions differed only in the phase of the temporal structures of visual and tactile stimulation. The stroking procedure in each induction phase lasted for about 3 min. After the stimulation, both participant and co-actor were asked to fill out the RHI Questionnaire.

After the questionnaires had been completed, the experimental phase started. There were four blocks of 64 trials for each participant and co-actor (32 with spatially compatible stimulus–response relationships and 32 with spatially incompatible relationships). Each trial began with the presentation of a warning sound. After 1000 ms, the critical sound – either sound A or B – was presented to the right or the left side of both the participant and co-actor, requiring a response as quickly and as accurately as possible. Participant and co-actor were instructed to fixate on the other's hand and to respond exclusively to the sound assigned to them, irrespective of its location. Each response was followed by a 1000 ms inter-stimulus interval (ISI) and 3000 ms of stroking, which always matched the stroking type of the corresponding induction period (either synchronous or asynchronous) to refresh the RHI (see Figure 2).
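The block structure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the software used in the study: it generates the 64 go trials of one block for the actor, who sat on the right and pressed the right button, so that a sound from the right counts as compatible and a sound from the left as incompatible. The function name `make_block`, the dictionary keys, and the use of a seeded shuffle are hypothetical; only the trial counts and the three durations are taken from the text.

```python
import random

# Durations taken from the procedure description (in ms).
WARNING_TO_TARGET_MS = 1000   # warning sound to critical sound
ISI_MS = 1000                 # inter-stimulus interval after each response
REFRESH_STROKING_MS = 3000    # stroking that refreshes the illusion

def make_block(n_trials=64, seed=None):
    """Return a shuffled list of go trials for the actor (hypothetical sketch).

    Assumption: the actor responds with the right button, so a trial is
    compatible when the sound comes from the right loudspeaker.
    """
    rng = random.Random(seed)
    half = n_trials // 2
    trials = []
    for side, count in (("right", half), ("left", n_trials - half)):
        for _ in range(count):
            trials.append({
                "stimulus_side": side,
                "response_side": "right",
                "compatible": side == "right",
            })
    rng.shuffle(trials)  # random order within the block
    return trials

block = make_block(seed=1)
n_compatible = sum(trial["compatible"] for trial in block)
print(f"{len(block)} trials, {n_compatible} compatible")
```

The nogo trials for the actor (the co-actor's go trials) are deliberately left out, because the text does not state how the two trial types were interleaved.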

Figure 2

Feedback [mean reaction time (RT) and percentage correct] as well as a 2-min break were provided at the end of each block. After completing the first four blocks, participants were asked to fill in the RHI Questionnaire again, which was followed by a 5-min break to avoid carryover effects to the second session of the experiment. After the break, the second session started. The procedure was the same as in the first session except for the type of stimulation, which was always different from that in the first session. The order of stimulation type (synchronous followed by asynchronous stroking or vice versa) was counterbalanced across participants.

Results

In the following, only data from the actual participants (actors) were analyzed.

Rubber hand questionnaire

Participants experienced the co-actor's hand as their own hand as a consequence of synchronous but not asynchronous stroking during both the induction and experimental phase: The RHI was significantly stronger after synchronous than after asynchronous stroking (RHI-related questions 1–2 after the induction and 1–3 after the experimental phase; two-tailed paired-sample t-tests; all ps < 0.05).

Simon task

Reaction times

Responses were coded as compatible (stimulus ipsilateral to the correct response side) and incompatible (stimulus contralateral to the correct response side). Mean RTs on the auditory social go–nogo Simon task for the 20 actual participants were submitted to a 2 (Compatibility: compatible, incompatible) × 2 (Stroking: synchronous, asynchronous) repeated measures analysis of variance (ANOVA). The analysis showed a significant main effect of Compatibility [F(1,19) = 25.46, p < 0.001, η² = 0.57], indicating that responses were faster with spatially compatible (mean RT = 291 ms) than with incompatible stimulus–response relationships (mean RT = 313 ms). More importantly, the compatibility effect varied with stroking, as indicated by a significant interaction of Compatibility × Stroking [F(1,19) = 5.88, p < 0.05, η² = 0.24; see Figure 3]. The compatibility effect was reliable in both conditions but was larger in the asynchronous stroking condition [29 ms; F(1,19) = 25.17, p < 0.001, η² = 0.57] than in the synchronous stroking condition [15 ms; F(1,19) = 10.82, p < 0.01, η² = 0.36; see Figure 3]. The main effect of Stroking was not significant [F(1,19) < 1, η² = 0.01]. To check for possible task order effects, we performed an additional ANOVA with Order as a between-subjects factor, but the three-way interaction was not significant [F(1,18) = 1.40, p > 0.05, η² = 0.07].
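As a consistency check on the values reported above, the effect sizes can be recomputed from the F statistics. The sketch below assumes that the reported η² is partial eta squared, which the text does not state explicitly; under that assumption it equals F · df_effect / (F · df_effect + df_error). The function and variable names are illustrative only.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)

# (F, df_effect, df_error, reported eta squared), copied from the text above.
reported = {
    "Compatibility main effect": (25.46, 1, 19, 0.57),
    "Compatibility x Stroking interaction": (5.88, 1, 19, 0.24),
    "Compatibility, asynchronous stroking": (25.17, 1, 19, 0.57),
    "Compatibility, synchronous stroking": (10.82, 1, 19, 0.36),
    "Three-way interaction with Order": (1.40, 1, 18, 0.07),
}

for label, (f_value, df_effect, df_error, eta) in reported.items():
    computed = partial_eta_squared(f_value, df_effect, df_error)
    print(f"{label}: computed {computed:.2f}, reported {eta:.2f}")

# The two condition-wise effects (29 ms and 15 ms) average to the overall
# difference between the incompatible and compatible means (313 - 291 ms).
mean_of_condition_effects = (29 + 15) / 2
overall_effect = 313 - 291
print(mean_of_condition_effects, overall_effect)
```

Under this assumption, each recomputed value rounds to the reported one, and the condition-wise compatibility effects are consistent with the overall 22 ms difference.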

Figure 3

Error rates

We observed a significant main effect of Compatibility [F(1,19) = 12.67, p < 0.01, η² = 0.40], indicating higher error rates for incompatible (1.0%) than for compatible trials (0.3%). The interaction of Compatibility × Stroking was far from significance [F(1,19) < 1, η² = 0.01], which rules out a speed–accuracy trade-off.

Discussion

The aim of this experiment was to test predictions of a strong concept of action co-representation accounting for the SSE. In particular, we investigated whether the SSE is mediated by the degree to which the active hand of a co-actor is perceived to be a part of the actor's own body.

First of all, we were able to replicate the findings of Sebanz et al. (2003), confirming that our particular setup was sufficiently sensitive to elicit the SSE in both the synchronous and the asynchronous stroking conditions. Second, we found the effect to be smaller, rather than larger, with synchronous than with asynchronous stroking. Thus, the incorporation of another person's hand into one's own body schema through the RHI (induced by synchronous stroking) reduces the SSE as compared to a condition where the co-actor is represented as a separate actor (induced by asynchronous stroking). This interpretation is supported by the subjective rating of the sense of ownership of the co-actor's hand in the synchronous stroking condition, which indicates that the experimental RHI manipulation was successful across all phases of the experiment.

These results provide considerable evidence against a strong concept of action co-representation as a mechanism underlying the SSE. That is, the SSE seems to occur even though actors represent their own action and the action of their co-actor separately. Emphasizing the difference between the two actions – or the related effectors – leads to a more pronounced SSE. This increase of the SSE in the asynchronous stroking condition is in line with the assumption that the SSE is established by the coding of one's action in reference to other actions (intermediate concept) or salient events (weak concept). Referential coding is known to be a basic principle operating in the Simon task (Hommel, 1993). Stimuli have been shown to be spatially coded relative to other stimuli that are either voluntarily attended to (Nicoletti and Umiltà, 1989) or that are salient enough to attract attention involuntarily (Treccani et al., 2006). With respect to action, response location has been shown to be coded in reference to other possible or recent responses (Hommel, 1996), in particular on spatial dimensions that help to discriminate between response alternatives (Ansorge and Wühr, 2004).

Given that most authors agree that the Simon effect is due to some sort of match or mismatch between spatial stimulus and response codes (Kornblum et al., 1990; Prinz, 1990; Hommel et al., 2001), the effect can only occur if stimulus location and response location are coded on the same dimension – as left and right in our case. In a standard Simon task, where the same participant performs both responses, this is very likely to happen, as the left–right dimension is particularly salient and provides the best discriminability between the two responses. In the social Simon task, however, participants operate only one response, so there is no actual need for spatial coding. Yet, if a co-actor (or perhaps another event) is sufficiently salient, people may nevertheless tend to code their response in reference to the spatial location of the other person or event (cf. Guagnano et al., 2010).

According to this reasoning, the social aspect of the joint-action situation created by the social Simon task may be just one of perhaps many factors that attract attention to other events and thereby induce the referential coding of one's own action, thus creating or enhancing the SSE. One implication of this possibility is that the active involvement of the co-actor in the present task might not be necessary to induce referential response coding and elicit the SSE. To test this possibility, we performed a second experiment that included a now inactive but still salient “co-actor.”

Experiment 2

The aim of Experiment 2 was to test whether the SSE can also be obtained with an inactive co-actor (to whom we will nevertheless keep referring as “co-actor” for the sake of convenience). To do so, we replicated Experiment 1 but now de-socialized the task to some degree: The co-actor no longer responded but sat passively next to the actual participant. If the co-actor provides a spatial reference frame for the coding of one's own action as left or right relative to the other person, one should expect a SSE even with an inactive co-actor. By contrast, if the active participation of the co-actor as a responding agent is crucial for the SSE to emerge, as the original approach of Guagnano et al. (2010) suggests, the Simon effect should disappear.