Action-Effect Bindings and Ideomotor Learning in Intention- and Stimulus-Based Actions

Herwig, Arvid; Waszak, Florian

doi:10.3389/fpsyg.2012.00444

ORIGINAL RESEARCH article

Front. Psychol., 25 October 2012

Sec. Cognition

Volume 3 - 2012 | https://doi.org/10.3389/fpsyg.2012.00444

Arvid Herwig ^1,2^*
Florian Waszak ^3,4^

1. Department of Psychology, Bielefeld University, Bielefeld, Germany
2. Department of Psychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
3. Université Paris Descartes, Sorbonne Paris Cité, Paris, France
4. CNRS, Laboratoire Psychologie de la Perception, UMR 8158, Paris, France

Abstract

According to ideomotor theory, action-effect associations are crucial for voluntary action control. Recently, a number of studies started to investigate the conditions that mediate the acquisition and application of action-effect associations by comparing actions carried out in response to exogenous stimuli (stimulus-based) with actions selected endogenously (intention-based). There is evidence that the acquisition and/or application of action-effect associations is boosted when acting in an intention-based action mode. For instance, bidirectional action-effect associations were diagnosed in a forced choice test phase if participants previously experienced action-effect couplings in an intention-based but not in a stimulus-based action mode. The present study aims at investigating effects of the action mode on action-effect associations in more detail. In a series of experiments, we compared the strength and durability of short-term action-effect associations (binding) immediately following intention- as well as stimulus-based actions. Moreover, long-term action-effect associations (learning) were assessed in a subsequent test phase. Our results show short-term action-effect associations of equal strength and durability for both action modes. However, replicating previous results, long-term associations were observed only following intention-based actions. These findings indicate that the effect of the action mode on long-term associations cannot merely be a result of accumulated short-term action-effect bindings. Instead, only those episodic bindings are selectively perpetuated and retrieved that integrate action-relevant aspects of the processing event, i.e., in case of intention-based actions, the link between action and ensuing effect.

Introduction

Humans either carry out actions to achieve desired effects in the environment or to accommodate to environmental demands. For instance, pressing the cappuccino button on a coffee dispenser is primarily based on the agent’s intention to have a hot cup of coffee. In contrast, flooring the brake pedal at a red traffic light is chiefly performed in response to a prior stimulus event. These two types of action have been labeled voluntary, operant, or intention-based, on the one side, and reaction, response, or stimulus-based, on the other side.

Neuroscientific evidence suggests that intention- and stimulus-based actions have distinct neural bases (e.g., Goldberg, 1985; Passingham, 1993; Praamstra et al., 1995; Deiber et al., 1996; Waszak et al., 2005, 2012; Mueller et al., 2007; Haggard, 2008). This distinction is further supported by clinical observations showing a selective impairment of one type of action and thereby implying dissociation between intention- and stimulus-based action control (e.g., Lhermitte, 1983; Shallice et al., 1989).

However, the actual processes that guide these two types of actions are still not well understood. One obvious functional difference between intention- and stimulus-based actions is the role of external stimuli either preceding or following the action. According to ideomotor reasoning (e.g., Harleß, 1861; James, 1890; for recent reviews, see Nattkemper et al., 2010; Shin et al., 2010; Pfister and Janczyk, 2012) intention-based actions are primarily directed at and selected by the effects following the action whereas there is a less obvious connection to preceding stimuli. On the contrary, stimulus-based actions, by definition, crucially depend on preceding stimuli whereas stimuli following the action are often less important. Thus, it has been suggested that intention-based actions rely more strongly on action-effect associations specifying which action produces which effect, whereas stimulus-based actions rely more strongly on stimulus-response associations specifying which motor routines action-relevant stimuli habitually require (Waszak et al., 2005, 2012; Herwig et al., 2007, 2013; Pfister et al., 2010).

Ideomotor learning

The purported functional difference is supported by a number of recent studies directly comparing the long-term consequences of actions carried out in response to exogenous stimuli (stimulus-based) with actions selected endogenously (intention-based). For instance, Herwig et al. (2007) investigated ideomotor learning, that is, the spontaneous acquisition of action-effect associations, in intention- and stimulus-based actions. Ideomotor learning can be assessed in a paradigm conceived by Elsner and Hommel (2001). These authors had participants first undergo an acquisition phase, in which a self-selected key press always produced a particular tone (e.g., left key press/high-pitch tone; right key press/low-pitch tone). After participants had performed about 200 key presses, the same tones were presented as imperative stimuli for a speeded choice response in a subsequent test phase. Elsner and Hommel observed that the speeded choice responses were faster in response to the tone that the action had previously produced (e.g., compatible group: low-pitch tone/right key press) than to a tone that had been produced by the alternative action (e.g., incompatible group: high-pitch tone/right key press). This result demonstrates that, during the acquisition phase, participants acquire long-lasting bidirectional associations between the motor code of the action and the perceptual code of the auditory effect (i.e., action-effect associations). Presenting the effects as imperative stimuli in a later test phase leads to the retrieval of the previously acquired action-effect association, which either speeds up or slows down the speeded choice task depending on whether the retrieved actions are compatible or incompatible with the instructed response.

Importantly, the effect of action-effect associations on a subsequent speeded choice task depends on the action mode during acquisition (Herwig et al., 2007; Herwig and Waszak, 2009; Herwig and Horstmann, 2011; but see Pfister et al., 2011, for different results with a free choice test). That is, in the studies of Herwig and colleagues, a compatibility effect only occurred if, in the acquisition phase, participants freely selected between left and right key presses (intention-based acquisition), whereas there was no compatibility effect if the actions were triggered by external stimulus events (stimulus-based acquisition). This dependency on the action mode holds for effect and action modalities as different as auditory effects and manual actions (Herwig et al., 2007; Herwig and Waszak, 2009) and visual effects and oculomotor actions (Herwig and Horstmann, 2011). Moreover, guiding participants’ attention away from or toward the effect did not influence the pattern of results (Herwig and Waszak, 2009). Thus, the observed differences between intention- and stimulus-based actions are not simply due to differences in the allocation of attention to the action-effect event. Instead, the results suggest that one and the same action-effect event results in different long-term consequences depending on the action mode: if actions are performed in the intention-based mode, ideomotor learning occurs, that is, new action-effect associations are acquired and later retrieved upon effect presentation. In contrast, if actions are selected in the stimulus-based mode, sensorimotor learning occurs, that is, stimulus-response associations are established while action-effect associations are much harder to detect subsequently.

It has to be noted that it is still under debate why action-effect associations are much harder to detect following stimulus-based actions, and different hypotheses have been proposed. Herwig et al. (2007) suggested that the action mode affects the acquisition of action-effect associations. Accordingly, action-effect associations are weaker following a stimulus-based acquisition compared to an intention-based acquisition, which in turn hampers their later detection. However, the different-acquisition hypothesis was recently put into question by two studies showing ideomotor learning also following stimulus-based actions (Pfister et al., 2011; Wolfensteller and Ruge, 2011). In the study of Pfister et al. (2011) ideomotor learning was assessed in a free choice test phase, in which participants were presented with randomly selected action-effects, which merely served as a trigger to carry out a self-chosen response. Under these test conditions participants preferred the selection of the action that had previously produced the effect, regardless of the action mode during acquisition. To account for the differences between their own results and the results of Herwig et al. (2007), the authors proposed the different-application hypothesis (for converging evidence that the action mode can affect the application of action-effect associations, see Pfister et al., 2010; Herwig and Horstmann, 2011). According to this hypothesis, action-effect associations are acquired irrespective of the action mode, but are applied during the test phase only if an intention-based mode is adopted. Importantly, adopting an intention- or a stimulus-based mode depends not only on the current task in the test phase (free choice vs. forced choice) but also on the previous task in the acquisition phase (free choice vs. forced choice). However, the relationship between these two determining factors seems to be quite complex. According to Pfister et al. (2011) the intention-based mode is quickly adopted if participants carry out self-chosen responses (either during acquisition or test) and once adopted they will stick to this action mode even in a forced choice test phase. In contrast, participants slowly adopt a stimulus-based mode during a forced choice acquisition phase but remain in this mode only if they continue to perform forced choice actions in the test phase. Finally, Wolfensteller and Ruge (2011) suggested a third hypothesis to explain the observed effect of the action mode on ideomotor learning. In their study participants had to constantly switch between stimulus-based acquisition phases of varying lengths and forced choice test phases in which the effects were presented together with the imperative stimuli¹. The results showed a small but reliable compatibility effect after only 12 action-effect episodes, which seems to depend on contextual stability (i.e., on a consistent stimulus-response mapping). Therefore Wolfensteller and Ruge proposed the different-context hypothesis, which states that action-effect associations following a stimulus-based acquisition are contextualized by means of their imperative stimuli (i.e., stimulus-action-effect episode). Such a contextualization can in principle hamper the retrieval of action-effect associations if the context (i.e., the imperative stimuli) changes between acquisition and test (cf., Godden and Baddeley, 1975).

Unfortunately, to date none of the three hypotheses, that is the different-acquisition, the different-application, and the different-context hypothesis, can satisfactorily explain all of the divergent results concerning the effect of the action mode on ideomotor learning. Thus, one main aim of the present study was to take a closer look at the emergence of action-effect associations against the background of the different-acquisition hypothesis proposed by Herwig et al. (2007).

Action-effect binding

Up to now, we have focused on the influence of the action mode on the compilation of action-effect associations that may be retrieved at least a couple of minutes after the acquisition (i.e., long-term associations or learning, hereafter). However, the build-up of long-term memory traces is not the only type of perceptuomotor integration that takes place when humans interact with the environment. The other type refers to a much shorter timescale and is related to one of the main characteristics of the primate brain: distributed coding (i.e., short-term integration or binding, hereafter)². Distributed coding applies not only to features in the visual domain (e.g., shape, color, and location, see Cowey, 1985; Felleman and van Essen, 1991) and in the auditory domain (e.g., periodicity, location, and spectral shape, Brown and Wang, 2006) but also to the features of to-be-performed actions (e.g., direction, amplitude, and duration, Wickens et al., 1994).

Importantly, distributed coding creates numerous binding problems (Treisman, 1996), which call for some kind of integration mechanism that binds together the distributed codes belonging to the same object (e.g., color, shape, and motion of an object). Hommel (1998) argued that the binding problem holds for perceptuomotor processing as well. That is, perceptual and motor codes belonging to the same event need to be integrated, too (Hommel, 2004). Following previous work addressing the creation of “object files” (Kahneman et al., 1992), the temporarily stored outcome of this integration process was termed “event file” (Hommel, 1998).

Bindings of stimulus and action features can be assessed in the prime-probe stimulus-response task of Hommel (1998). In this paradigm each trial comprises two subtasks. In the first subtask, participants perform simple, precued left- or right key presses (R1) to the mere presence of a “Go” signal (S1) that varies randomly in form, color, and location. The effects of bindings created between S1- and R1-features on later performance are assessed in a second subtask, which is a binary-choice reaction (R2) to a pre-instructed feature (e.g., color) of a second stimulus (S2). The typical result of this type of paradigm is that performance is impaired in partial repetition trials, that is, if only the stimulus (or only the response) is repeated, compared to when both stimulus and response are repeated or when both change. This pattern of results suggests that a temporary binding of the respective codes is compiled when stimuli and actions co-occur. Repeating one feature reactivates also the associated fellow code, which, in partial repetition trials, creates a mismatch and, therefore, induces a time-consuming re-binding process (for a review, see Hommel, 2004).
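The partial repetition logic described above can be made concrete with a small computation. The following Python sketch is illustrative only: the function names and the trial tuples are hypothetical and are not taken from Hommel (1998) or from the present study. It assumes that each trial is coded by whether the stimulus feature and the response repeat from the first to the second subtask, and it expresses the binding effect as the mean RT of the two partial repetition cells minus the mean RT of the two complete cells (both repeated, both alternated).

```python
from statistics import mean


def cell_means(trials):
    """Average RT per (stimulus_repeated, response_repeated) cell."""
    cells = {}
    for stimulus_repeated, response_repeated, rt_ms in trials:
        cells.setdefault((stimulus_repeated, response_repeated), []).append(rt_ms)
    return {cell: mean(rts) for cell, rts in cells.items()}


def partial_repetition_cost(trials):
    """Mean RT of partial repetitions minus mean RT of complete
    repetitions/alternations; positive values are read as binding."""
    m = cell_means(trials)
    partial = (m[(True, False)] + m[(False, True)]) / 2
    complete = (m[(True, True)] + m[(False, False)]) / 2
    return partial - complete


# Hypothetical RTs in ms, invented for illustration only.
demo_trials = [
    (True, True, 600), (True, True, 620),   # stimulus and response repeated
    (False, False, 610),                    # both alternated
    (True, False, 650),                     # only the stimulus repeated
    (False, True, 640),                     # only the response repeated
]
demo_cost = partial_repetition_cost(demo_trials)
```

A cost near zero would indicate that repeating one feature does not reactivate the other, whereas a positive cost corresponds to the crossover pattern that is taken as evidence for an event file.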

Transient perceptuomotor bindings have been shown to emerge quickly (after 300 ms or less) and to remain intact for at least 4 s (Hommel and Colzato, 2004). Moreover, the temporal order of S1 and R1 does not seem to be important for perceptuomotor binding. Hommel (2005, Experiment 2) showed that stimulus features were bound to response features even if S1 follows R1, which suggests that the time window for feature integration might be rather broadly defined. Thus, temporary feature binding across perception and action may take place not only in events where the perceptual stimulus triggers the action (stimulus-based actions), but also in events where the action triggers the perceptual event (intention-based actions). This opens up the possibility to investigate the immediate binding between actions and their effects in stimulus- and intention-based actions.

The present study

As outlined above, Herwig and colleagues (Herwig et al., 2007; Herwig and Waszak, 2009; Herwig and Horstmann, 2011) proposed that the acquisition of action-effect associations (i.e., ideomotor learning) is affected by the action mode. The present study investigates whether the action mode also influences temporary feature bindings. Although there is already some evidence that action-effect bindings can be observed following intention-based actions (Dutzi and Hommel, 2009) and stimulus-based actions (see Hommel, 2005, Experiment 2), a direct comparison of the strength and durability of action-effect bindings following intention- and stimulus-based actions is lacking. As a consequence, it is unknown whether temporary action-effect bindings, too, are affected by the action mode, and one main aim of the present study was to address this gap in the literature.

We ran three experiments that compare strength as well as durability of action-effect bindings between the two action modes. Experiments 1 and 2 were designed to test how the two types of integration, that is, binding and learning, are related. Based on the different-acquisition hypothesis proposed by Herwig et al. (2007), we see three possible relationships (see Colzato et al., 2006, for similar considerations). First, binding and learning are tightly linked (strong dependence hypothesis). Binding via synchronization may cause long-term modifications of synaptic efficacy as suggested by Fell et al. (2003). In this scenario, temporary bindings strengthen the association between two features mediated through Hebbian learning (i.e., neurons that fire together, wire together; Hebb, 1949), each time making the memory trace more durable. The strong dependence hypothesis assumes that the difference in ideomotor learning between intention- and stimulus-based actions, as shown by Herwig et al. (2007), is due to a difference in action-effect binding between the two modes of movement. That is, if action and effect do not wire together (ideomotor learning) in stimulus-based actions, then this might be due to the fact that action and effect features do not always fire together (temporary bindings) in the first place.

Second, ideomotor learning is completely independent of the formation of temporary action-effect bindings. Although such a non-dependence hypothesis is rather radical, it is not so unlikely, since binding and learning act on different time-scales and are thought to solve different problems, with bindings being involved in the problem of distributed coding and ideomotor learning being involved in the control of voluntary actions. Under this view temporary feature binding constitutes a representational level which is mainly used for the perception of the current event. Action-effect associations underlying ideomotor learning, however, constitute a different representational level at which integrated feature assemblies are stored for the purpose of future guidance of behavior. Accordingly, there might be two crucial distinctions between both levels of representation. First, bindings as part of short-term memory depend on the actual presentation of an external effect, whereas action-effect associations as part of long-term memory depend on the internal generation of the effect. As a consequence, both representational levels might fundamentally differ in the level of detail and concreteness they are able to provide. Second, while action-effect associations underlying ideomotor learning presuppose contingent action-effect relationships (Elsner and Hommel, 2004), short-term bindings are also engaged in the perception of ever-changing action-effect relationships – just think of the different sounds one produces while talking with the mouth empty vs. full or the different ball trajectories one produces while playing pinball. Accordingly, both levels might fundamentally differ in the range of events they are able to incorporate. The non-dependence hypothesis thus assumes that the difference in ideomotor learning between intention- and stimulus-based actions (Herwig et al., 2007) does not have to be reflected in short-term bindings.

Third, binding and learning may not be as rigidly connected as assumed under the strong dependence hypothesis and not as independent as under the non-dependence hypothesis. In daily life, the particular effect that an action achieves depends tremendously on the current context. It would appear inefficient to perpetuate all episodes, that is, even those which are not needed anymore once the particular event is finished. This should especially hold true for non-contingent action-effects, which cannot be reliably used for action planning. On the weak dependence view, binding and learning do not take place on fundamentally different levels. Instead, bindings are the building blocks for long-term associations, but only those bindings which are reliable and thus worthwhile to be preserved are further processed to form a more durable memory trace (see Colzato et al., 2006). The weak dependence hypothesis thus assumes that binding and learning are related only in case of contingency. Therefore, the difference in ideomotor learning between intention- and stimulus-based actions (Herwig et al., 2007) should only be reflected in short-term bindings of contingent action-effects, whereas it should not be reflected in short-term bindings of non-contingent action-effects.

Experiments 1 and 2 are designed to pit these three accounts against each other. The crucial difference between the experiments is that in Experiment 1 the features of the action-effect were not contingent on the action (as is usually the case in this type of experiment), whereas in Experiment 2 they were contingent. The strong dependence hypothesis assumes that a difference in action-effect binding is the reason for the difference in ideomotor learning between intention- and stimulus-based actions. Consequently, this hypothesis predicts that intention-based actions result in stronger binding effects than stimulus-based actions in both experiments. The non-dependence hypothesis predicts that intention-based actions result in stronger binding than stimulus-based actions in neither Experiment 1 nor Experiment 2, since under this view learning and binding constitute two different representational levels. Finally, the weak dependence hypothesis predicts that intention-based actions result in stronger binding than stimulus-based actions only in Experiment 2, but not in Experiment 1. This is because contingent action-effects can be used as building blocks for long-term associations only in Experiment 2 but not in Experiment 1. Experiment 3 complements Experiments 1 and 2 by directly comparing binding and ideomotor learning within one experiment.
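The three sets of predictions can be tabulated compactly. The Python sketch below is an illustrative aid, not part of the original analysis. The dictionary keys and the helper `consistent_hypotheses` are hypothetical names, and each entry simply encodes whether a hypothesis predicts stronger binding after intention-based than after stimulus-based actions in a given experiment.

```python
# True = the hypothesis predicts stronger binding for intention-based actions.
PREDICTIONS = {
    "strong_dependence": {1: True, 2: True},
    "non_dependence": {1: False, 2: False},
    "weak_dependence": {1: False, 2: True},
}


def consistent_hypotheses(observed):
    """Return the hypotheses whose predictions match every observed outcome.

    `observed` maps an experiment number (1 or 2) to True if binding was
    stronger for intention-based actions, and to False otherwise.
    """
    return [
        name
        for name, predicted in PREDICTIONS.items()
        if all(predicted[exp] == outcome for exp, outcome in observed.items())
    ]


# A null effect of action mode in Experiment 1 alone cannot separate
# the non-dependence from the weak dependence hypothesis.
after_exp1 = consistent_hypotheses({1: False})
```

The tabulation makes explicit why both experiments are needed: only the contingent condition of Experiment 2 distinguishes the weak dependence from the non-dependence account.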

To sum up, the present study addresses two research questions. First, are temporary bindings between action and effect features modulated by the action mode? Second, how are short-term bindings and long-term ideomotor learning related?

Experiment 1

To investigate the influence of the action mode on temporary action-effect bindings, we slightly modified the original prime-probe stimulus-response task, which comprises two subtasks (see above; Hommel, 1998). In the first subtask, the first response (R1) to a neutral go signal was either freely selected (intention-based trials) or precued (stimulus-based trials). In both cases it triggered one out of four auditory effect stimuli (S1; see Figure 1). The second subtask was a speeded forced choice response (R2) to a second stimulus (S2). Moreover, we manipulated the stimulus-onset asynchrony (SOA) between S1 and S2 (1000 vs. 2000 vs. 6000 ms) to assess binding durability.

Figure 1

To assess the binding between features of R1 and S1, our focus was on interactions between stimulus and response repetition effects. On the basis of earlier findings regarding perceptuomotor binding (Hommel, 2005, Experiment 2), we expected that performance is impaired on partial repetition trials, in which either the response feature is repeated while the stimulus feature is alternated, or the stimulus feature is repeated while the response feature is alternated (partial repetition costs). By contrast, alternating both stimulus and response between the two subtasks of one trial should yield a performance level in the second subtask that is as good as when both are repeated. Such a pattern of results points to action-effect binding, since it implies that reactivating one feature tends to also activate the fellow feature. This, in turn, causes conflict in case of partial repetitions.

The crucial question was whether this interaction would be modulated by the action mode (intention- vs. stimulus-based). Under the strong dependence hypothesis of the relation between learning and binding, one would expect action-effect bindings to be weaker or less durable for stimulus-based than for intention-based actions. In this case the fragility of action-effect bindings in stimulus-based actions could be considered to be responsible for the effect of the action mode on ideomotor learning (see Herwig et al., 2007; Herwig and Waszak, 2009). Under the weak dependence hypothesis as well as the non-dependence hypothesis, binding should not be influenced by the action mode.

Materials and methods

Participants

Sixteen adults (mean age: 24.9 years) participated. They reported having normal or corrected-to-normal vision and audition and were not familiar with the purpose of the experiment. Informed consent was obtained from all subjects.

Apparatus and stimuli

The experiment was controlled by a standard PC, interfaced to a 17″ monitor. The viewing distance was about 70 cm. Visual stimuli were displayed on a black background. In stimulus-based trials, two white left- or right-pointing arrows (mean extension: 0.4° × 0.7°) served as response cues and were presented in the center of the screen. In intention-based trials, the response cue was replaced by the free choice cue, i.e., two arrows pointing in different directions (<>) requesting participants to prepare a left or right key press depending on their own choice. A white rectangle (mean extension: 0.7° × 1.0°) served as a go signal for the execution of the precued/prepared response. Auditory stimuli were the English numbers “2” and “10” vocalized by a male or female voice (duration 200–300 ms). The words were presented simultaneously through the left and right speaker of a headphone. Responses were made by pressing the left or right of two keys mounted in a horizontal distance of 13.5 cm on a board with the left or right index finger.

Procedure and design

Each trial comprised two speeded responses. The first response (R1) was always a simple reaction to the go signal. The type of response (i.e., left or right key press) was either indicated by the response cue (stimulus-based trials) or depended on participants’ own choice (intention-based trials). R1 triggered the presentation of the first auditory effect stimulus (S1). Whether the stimulus was the number 2 or 10 vocalized by a male or a female voice was determined randomly. The second response (R2) was always a binary-choice reaction to the number feature of the second stimulus (S2). S2 was again either the number 2 or 10 vocalized by either a male or a female voice, randomly determined. Half of the participants responded to the number 2 and 10 by pressing the left and right key, respectively, whereas the other half responded according to the opposite mapping.

The sequence of events in each trial is shown in Figure 1. Following an intertrial interval of 2000 ms, a response cue or a free choice cue was presented for 1500 ms, followed by the go signal that was presented until the first response was executed. R1 triggered the presentation of S1 (50-ms onset asynchrony between R1 and S1). If R1 was not executed within 1000 ms (counted as omission) a visual warning message (too slow) was presented for 800 ms and the trial started from the beginning. If R1 was incorrect (only possible in stimulus-based trials) or anticipatory (RT < 80 ms) a visual warning message (wrong key, too fast, respectively) was presented for 800 ms and the trial continued. S2 appeared 1000, 2000, or 6000 ms after the onset of S1. Responses to S2 that were incorrect, premature (RT < 80 ms) or omitted (RT > 2000 ms) triggered presentation of the corresponding visual warning message.
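The response criteria in this procedure can be summarized as a simple classification rule. The following Python sketch is a hedged illustration and not the original experiment code: the function name, the argument names, and the order in which the checks are applied are assumptions, and treating an RT exactly at the deadline as still valid is likewise an assumption. Only the thresholds (80 ms, 1000 ms for R1, 2000 ms for R2) and the warning labels come from the text.

```python
from typing import Optional

ANTICIPATION_MS = 80     # RTs below this count as anticipatory/premature
R1_DEADLINE_MS = 1000    # R1 later than this counts as an omission
R2_DEADLINE_MS = 2000    # R2 later than this counts as an omission


def classify_response(rt_ms: Optional[float], key: Optional[str],
                      required_key: Optional[str], deadline_ms: int) -> str:
    """Label one response; `required_key` is None for free-choice R1."""
    if rt_ms is None or rt_ms > deadline_ms:
        return "too slow"    # omission
    if rt_ms < ANTICIPATION_MS:
        return "too fast"    # anticipation
    if required_key is not None and key != required_key:
        return "wrong key"   # incorrect response
    return "ok"


# Hypothetical example trials.
free_choice_r1 = classify_response(60, "left", None, R1_DEADLINE_MS)
cued_r1 = classify_response(450, "right", "left", R1_DEADLINE_MS)
late_r2 = classify_response(1500, "left", "left", R2_DEADLINE_MS)
```

Passing `None` as the required key reflects the fact that an incorrect R1 is only possible in stimulus-based trials.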

The experiment was divided into four parts which were done in 1 day. Two of the four parts consisted of 3 blocks of 96 randomly ordered intention-based trials each and the remaining two parts of 3 blocks of 96 randomly ordered stimulus-based trials each. The order of the four parts was counterbalanced across participants. Participants performed 24 randomly selected practice trials at the beginning of the experiment and prior to the first switch of the action mode. That is, all in all the experiment comprised 48 practice trials and 1152 experimental trials which took approximately 4 h. Each block was composed of a factorial combination of S2 number (2 vs. 10, corresponding to left vs. right R2) and S2 gender (male vs. female), the possible relationships between S1 and S2 (repetition vs. alternation) regarding number and gender, the SOA between S1 and S2 (1000 vs. 2000 vs. 6000 ms), and the two possible relationships between R1 and R2 (repetition vs. alternation). In intention-based blocks, in contrast, the relationship between R1 and R2 could not be determined a priori because R1 depends on participants’ free choice. In these blocks participants were instructed to use the left and right key for the first response about equally often and in a random order. Participants could take a break after each block.
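The trial counts follow directly from the factorial structure. The Python sketch below enumerates the design cells as a check. It assumes one trial per cell per block, which the text does not state explicitly but which matches the reported block length, and the factor names are hypothetical labels. In intention-based blocks the response relation is an outcome of participants' choices rather than a controlled factor, so the enumeration describes the fully controlled case.

```python
from itertools import product

# Factor levels as described in the text; the key names are illustrative.
factors = {
    "s2_number": ("2", "10"),
    "s2_gender": ("male", "female"),
    "number_relation": ("repetition", "alternation"),
    "gender_relation": ("repetition", "alternation"),
    "soa_ms": (1000, 2000, 6000),
    "response_relation": ("repetition", "alternation"),
}

# One dict per design cell, e.g. {"s2_number": "2", ..., "soa_ms": 1000, ...}
cells = [dict(zip(factors, levels)) for levels in product(*factors.values())]

trials_per_block = len(cells)                      # 2*2*2*2*3*2
experimental_trials = 4 * 3 * trials_per_block     # 4 parts x 3 blocks
practice_trials = 2 * 24                           # start + first mode switch
```

The enumeration reproduces the 96 trials per block, 1152 experimental trials, and 48 practice trials reported above.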

Results

For the sake of clarity and in line with our main question (i.e., action-effect bindings for intention- and stimulus-based actions), we present only the results of subtask 2 and, specifically, only the reliable effects in the main text. The Appendix presents the results of subtask 1 as well as two tables which provide a detailed overview of the means (see Table A1 in Appendix) and ANOVA outcomes (see Table A2 in Appendix) for RTs and error rates obtained for subtask 2. After excluding trials in which R2 was anticipated or omitted (0.2%), R2 data were analyzed as a function of the action mode (intention- vs. stimulus-based), SOA (1000 vs. 2000 vs. 6000 ms), and repetition vs. alternation of stimulus number, gender, and response. Analyses of variance (ANOVAs) with the factors Action mode (intention- vs. stimulus-based), Response (repetition vs. alternation), Number (repetition vs. alternation), Gender (repetition vs. alternation), and SOA (1000 vs. 2000 vs. 6000 ms) were performed on error rates and error-free RTs by using a five-way design for repeated measures. Violations of sphericity were corrected using the Huynh–Feldt ε. The significance criterion was set to p < 0.05 for all analyses.

Reaction times

The RT analysis yielded five reliable effects and importantly, none of these effects interacted with the action mode (ps > 0.24). There were main effects of SOA, response, and gender. These main effects indicated faster responses with increasing SOA (661, 637, and 603 ms for SOA of 1000, 2000, and 6000 ms, respectively), for response alternations (643 and 624 ms for response repetitions and alternations, respectively), and for gender repetitions (623 and 644 ms for gender repetitions and alternations, respectively). The main effect of gender was further modified by an interaction with number, indicating an integration of the auditory stimulus features number and gender.

More importantly, the main effect of response was modified by an interaction with number, indicating action-effect binding. Figure 2 shows the relative repetition benefit for each stimulus dimension (i.e., the mean RT difference between number/gender alternation and number/gender repetition; note that the values depicted in Figure 2 are differences of averaged values given in Table A1 in Appendix) as a function of the relationship between R1 and R2 separated for intention- and stimulus-based trials and the three SOAs. A positive difference indicates that participants responded faster for stimulus repetitions than alternations, whereas a negative difference indicates faster reactions for stimulus alternations than repetitions. As Figure 2 clearly shows, repeating stimulus number produces a benefit if, and only if, the response is also repeated. If the response is alternated, the repetition benefit turns into an alternation benefit. This was true for all three SOAs.

Figure 2

Error rates

The error rates overall mirrored the RTs but produced some additional effects. Importantly, once again none of the reported effects was modified by the action mode (ps > 0.27). As for the main effects, participants committed fewer errors with increasing SOA and with response alternations. However, in contrast to the RT data, participants committed fewer errors with gender alternations (3.0 and 2.5% for gender repetitions and alternations, respectively). Thus a speed-accuracy trade-off can be excluded for the factors SOA and response, but not for gender. The main effect of response was modified by an interaction with SOA, indicating an increased alternation benefit with the medium SOA of 2000 ms.

Of importance, response interacted with number as well as with gender, indicating that each stimulus dimension was separately integrated with the response. Repeating both the number and the response or alternating both (1.8 and 0.9%, respectively) decreased the error rate, whereas the error rate increased if only one, but not the other, was repeated (number repeated: 3.5%; response repeated: 4.9%). Likewise, a response repetition was easier if gender was also repeated than alternated (3.2 and 3.4%, respectively), whereas a response alternation was easier if gender was also alternated than repeated (1.7 and 2.8%, respectively). Moreover, action-effect bindings for both effect features interacted with SOA. Separate ANOVAs for each SOA showed both interactions to be significant only for the SOAs of 1000 [response × number: F(1,15) = 36.80, p < 0.001; response × gender: F(1,15) = 12.06, p = 0.003] and 2000 ms [response × number: F(1,15) = 24.30, p < 0.001; response × gender: F(1,15) = 4.83, p = 0.044] but not for the SOA of 6000 ms (ps > 0.143).
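The response × number pattern in the error rates can be made concrete with a small arithmetic sketch. It applies the difference score defined for Figure 2 (alternation minus repetition of the stimulus feature) to the four error rates reported above, and additionally derives a partial repetition cost as the mean of the two partial-repetition cells minus the mean of the two complete cells. The function names and the data layout are illustrative assumptions, not part of the original analysis, and the cost score is one common way of summarizing such an interaction rather than a statistic reported in the paper.

```python
# Error rates (%) for subtask 2 as reported in the text, indexed by
# (response relation, number relation) between subtask 1 and subtask 2.
ERROR_RATES = {
    ("repeat", "repeat"): 1.8,        # response and number both repeated
    ("alternate", "alternate"): 0.9,  # response and number both alternated
    ("alternate", "repeat"): 3.5,     # only the number repeated
    ("repeat", "alternate"): 4.9,     # only the response repeated
}


def number_repetition_benefit(rates, response):
    """Number alternation minus number repetition for one response relation.

    Positive values mean fewer errors when the number repeats,
    negative values mean fewer errors when the number alternates.
    """
    return rates[(response, "alternate")] - rates[(response, "repeat")]


def partial_repetition_cost(rates):
    """Mean of the partial-repetition cells minus mean of the complete cells."""
    partial = (rates[("alternate", "repeat")] + rates[("repeat", "alternate")]) / 2
    complete = (rates[("repeat", "repeat")] + rates[("alternate", "alternate")]) / 2
    return partial - complete


for response in ("repeat", "alternate"):
    benefit = number_repetition_benefit(ERROR_RATES, response)
    print(f"response {response}: number repetition benefit = {benefit:.1f} points")

print(f"partial repetition cost = {partial_repetition_cost(ERROR_RATES):.2f} points")
```

With the reported values, the benefit is positive when the response repeats and negative when it alternates, which is the crossover pattern that the response × number interaction describes.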

Discussion

As shown in Figure 2, the effect of stimulus repetition was clearly dependent on whether or not the response was also repeated. Thus, Experiment 1 suggests that the co-occurrence of action and auditory codes triggered by the action results in the temporary binding between the involved perceptual and motor features. Comparable to studies investigating perceptuomotor binding (e.g., Hommel, 1998), action-effect bindings were pronounced for the task relevant stimulus feature (i.e., number). Moreover, the analysis of RTs of Experiment 1 showed action-effect bindings to remain intact for at least six seconds – a finding that extends the results regarding the durability of perceptuomotor bindings by 2 s (Hommel and Colzato, 2004).

More importantly, Experiment 1 did not show any influence of the action mode on the strength or durability of the action-effect bindings. That is, short-term action-effect bindings were comparably strong and durable following intention- and stimulus-based actions. This observation is in contrast to the predictions derived from the strong dependence hypothesis of binding and learning. Thus, the finding of Herwig et al. (2007; see also Herwig and Waszak, 2009; Herwig and Horstmann, 2011) that ideomotor learning is affected by the action mode does not seem to be due to an elementary difference in action-effect binding.

However, the dissociation of the effect of the action mode on binding and learning is in accord with the non-dependence as well as the weak dependence hypothesis. If binding and learning actually represent two independent representational levels (as suggested by the non-dependence hypothesis), one would not expect binding and learning to be influenced by the same factors. According to the weak dependence hypothesis, action-effect binding is a necessary, but not a sufficient, precursor for long-term ideomotor learning. In this scenario, the action mode might determine whether or not the repeated formation of identical transient bindings forms a memory trace. Metaphorically speaking, bindings may be regarded as building blocks that are constructed whenever an action produces an effect in close temporal contiguity, regardless of whether the action was externally or internally selected. However, only intention-based actions, but not stimulus-based actions, may provide the glue necessary to agglutinate these building blocks to form a durable memory trace.

This notion can only be tested if one effect feature is produced contingently by one and not the other action. In Experiment 1, each effect feature was produced by each action with the same probability. Consequently, distinct action-effect relations could not be established. Therefore, we implemented contingent action-effect mappings in Experiments 2 and 3.

Experiment 2

As pointed out above, one reason for the missing influence of the action mode on the formation and durability of action-effect bindings might be related to the fact that each effect feature was produced by each action with the same probability. It is possible that due to this missing contingency between action and effect features binding and learning are unrelated as suggested by the weak dependence hypothesis. To address this issue, Experiment 2 was conducted, in which each action (R1) contingently produced one specification of the irrelevant effect feature of S1 (i.e., gender). For example, pressing the left key led to the auditory presentation of the number “2” or “10” spoken by a female voice, whereas pressing the right key resulted in the presentation of the number “2” or “10” spoken by a male voice.

Such a contingency manipulation should in principle enhance ideomotor learning (Elsner and Hommel, 2004). Importantly, if the weak dependence hypothesis holds (i.e., if binding and learning are only related in case of contingent action-effect relationships), this enhancement should be reflected in partial repetition costs as well. This is because in Experiment 2 R2 may be affected by two factors: the event file compiled during the first subtask of each trial and the memory trace emerging through the repeated experience of the contingent action-effect mapping. Both factors should entail RT costs if only the gender or the response is repeated while the other feature is alternated (i.e., partial repetition costs). Consider, for instance, an action-effect mapping for R1–S1 that links a left key press with a female voice (F) and a right key press with a male voice (M). Moreover, let the stimulus-response mapping rule for S2–R2 be to respond to the number two (2) and ten (10) by pressing the right and left key, respectively. If S2 is the number two spoken by a female voice (2F), this might lead to a conflict in initiating R2 because female may automatically activate the left response due to the compiled memory trace, whereas 2 calls for a right response due to the instructed mapping. Likewise, if S2 is 10M, 10 calls for a left whereas male calls for a right response. In contrast, no conflict arises if S2 is 2M or 10F, because the number as well as the gender feature call for the same response. Importantly, in the given example, 2F and 10M would also be the partial repetitions with respect to R1–S1, because a left R1 always triggers S1 spoken with a female voice and a right R1 always triggers S1 spoken with a male voice (leaving 2M and 10F as complete repetitions or complete alternations).
Accordingly, if contingency determines whether binding and learning are related or not, one would expect R2 to be influenced by the previously compiled event file and the accumulating memory trace only for intention-based actions. In contrast, for stimulus-based actions R2 should be affected solely by the event file, resulting in a three-way interaction of response, gender, and action mode.
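The worked example above (left key → female voice, right key → male voice; "2" → right key, "10" → left key) can be checked by enumerating all S2 stimuli. The following sketch is illustrative only: the dictionary and function names are hypothetical, the mappings are the ones from the example rather than the counterbalanced mappings of the actual experiment, and R2 is assumed to be the correct response to S2.

```python
from itertools import product

# Example mappings taken from the text (assumed fixed for this sketch).
EFFECT_OF_R1 = {"left": "F", "right": "M"}        # R1 key -> voice gender of S1
RESPONSE_TO_NUMBER = {"2": "right", "10": "left"}  # S2 number -> instructed R2 key

# Response that a voice gender would activate via the learned action-effect link.
RESPONSE_PRIMED_BY_GENDER = {gender: key for key, gender in EFFECT_OF_R1.items()}


def has_conflict(number, gender):
    """True if the learned gender link and the instructed number rule disagree."""
    return RESPONSE_TO_NUMBER[number] != RESPONSE_PRIMED_BY_GENDER[gender]


def transition(r1, number, gender):
    """Classify the R1-S1 -> S2-R2 transition, assuming a correct R2."""
    response_repeated = RESPONSE_TO_NUMBER[number] == r1
    gender_repeated = gender == EFFECT_OF_R1[r1]
    if response_repeated and gender_repeated:
        return "complete repetition"
    if not response_repeated and not gender_repeated:
        return "complete alternation"
    return "partial repetition"


for number, gender in product(("2", "10"), ("F", "M")):
    kinds = sorted({transition(r1, number, gender) for r1 in EFFECT_OF_R1})
    label = "conflict" if has_conflict(number, gender) else "no conflict"
    print(f"S2 = {number}{gender}: {label}; transitions: {', '.join(kinds)}")
```

Under these mappings, the stimuli that produce a conflict between the learned link and the instructed rule (2F and 10M) are exactly those that form partial repetitions whichever key was pressed as R1, which is why the event file and the memory trace are expected to affect R2 in the same direction.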