Gesturing Meaning: Non-action Words Activate the Motor System

Bach, Patric; Griffiths, Debbra; Weigelt, Matthias; Tipper, Steven

doi:10.3389/fnhum.2010.00214

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 04 November 2010

Sec. Motor Neuroscience

Volume 4 - 2010 | https://doi.org/10.3389/fnhum.2010.00214

This article is part of the Research TopicUnderstanding human intentional communicationView all 9 articles

Gesturing meaning: non-action words activate the motor system

Patric Bach^1,2*

Debra Griffiths¹

Matthias Weigelt³

Steven P. Tipper¹

¹ School of Psychology, Bangor University, Bangor, UK
² School of Psychology, University of Plymouth, Plymouth, UK
³ Institute of Sport Science, Saarland University, Saarbrücken, Germany

Across cultures, speakers produce iconic gestures, which add – through the movement of the speakers’ hands – a pictorial dimension to the speakers’ message. These gestures capture not only the motor content but also the visuospatial content of the message. Here, we provide first evidence for a direct link between the representation of perceptual information and the motor system that can account for these observations. Across four experiments, participants’ hand movements captured both shapes that were directly perceived, and shapes that were only implicitly activated by unrelated semantic judgments of object words. These results were obtained even though the objects were not associated with any motor behaviors that would match the gestures the participants had to produce. Moreover, implied shape affected not only gesture selection processes but also their actual execution – as measured by the shape of hand motion through space – revealing intimate links between implied shape representation and motor output. The results are discussed in terms of ideomotor theories of action and perception, and provide one avenue for explaining the ubiquitous phenomenon of iconic gestures.

Introduction

When people talk, even on the phone, they often find themselves producing iconic gestures: without any conscious intention, their hand movements capture the actions and objects they speak about (e.g., Iverson and Goldin-Meadow, 1998). Despite its unconscious and non-strategic nature, the production of these gestures is not without its uses. Gesturing enhances verbal fluency (Krauss et al., 1996; Kita, 2000) and transmits information that complements the information transmitted by language (e.g., Melinger and Levelt, 2004), even when not consciously accessible by the speaker (Goldin-Meadow and Wagner, 2005). On the side of the receiver, observing gestures helps language comprehension, as the gestural content is automatically integrated with the accompanying sentence (e.g., Holle and Gunter, 2007; Wu and Coulson, 2007a,b) and engages the same semantic system as language itself (e.g., Gunter and Bach, 2004; Özyürek et al., 2007).

Despite these important functions for communication, the phenomenon of iconic gestures has not been convincingly explained. Why is gesturing such an effortless process, considering that it happens alongside the complex act of speaking? Why does gesturing even improve the verbal fluency of the speaker, and, conversely, why does preventing people from gesturing interfere with speaking (Rauscher et al., 1996)?

Two frameworks provide potential explanations for these phenomena. One avenue for explanation is provided by “simulation” or “motor resonance” accounts of language that presuppose that any mental representation of a bodily movement relies on the sensorimotor structures that are involved in its actual production (e.g., Buccino et al., 2005; for a review, see Fischer and Zwaan, 2008; for neuroimaging data, see Pulvermüller and Fadiga, 2010). Various studies have demonstrated that seeing actions primes similar actions in the observer and engages motor-related brain structures (di Pellegrino et al., 1992; Brass et al., 2000; Bach and Tipper, 2007; Griffiths and Tipper, 2009). Important for explaining gesturing in a language context, these effects are also observed for movements that are only available mentally, not visually. For instance, reading words denoting actions (e.g., “close the drawer”) primes the associated movements in the same way as seeing the actions themselves (e.g., pushing one’s hand forward, Glenberg and Kaschak, 2002; for similar results, see Buccino et al., 2005; Zwaan and Taylor, 2006), and object words evoke the actions that are typically performed with them, such as appropriate hand configurations for grasping or use (e.g., Glover et al., 2004; Tucker and Ellis, 2004; Bub et al., 2008). That implicit activation of action information suffices to induce motor activation has also been demonstrated with face stimuli. Identifying famous soccer and tennis players affects responses with the effectors involved in their sports, even when these athletes were identified from their faces and no direct visual cues to their sports were present (Bach and Tipper, 2006; Candidi et al., 2010; Tipper and Bach, 2010).

These findings indicate that the actions associated with words or objects are automatically retrieved and flow into the motor system, suggesting that gesturing may be a direct consequence of representing the action content of a message. Yet, there is reason to believe that such motor resonance accounts are too limited. Iconic gestures are not only driven by action content. Many iconic gestures refer to the spatial content of language and capture, for instance, spatial layouts, motion through space, or object shape (e.g., Graham and Argyle, 1975; Seyfeddinipur and Kita, 2001; Kita and Özyürek, 2003; for a review, see Alibali, 2005). Another class of theories – so-called ideomotor or common coding views of cognition (e.g., Greenwald, 1970; Prinz, 1990; Hommel et al., 2001; Knoblich and Prinz, 2005) – is not affected by these limitations and can account also for the spatial content transmitted by gestures. According to these theories, the motor priming effects discussed above are only one instantiation of a more general principle. Movements are planned on a perceptual level, in terms of their directly perceivable consequences (their effects). The same representations are therefore used to describe stimuli in the environment and to plan movements toward them. For example, the same code that represents the orientation of a bar in front of you would also be used as possible goal state for the orientation of your own hand, allowing visual stimuli to directly evoke appropriate movements toward them. There are numerous studies that demonstrate the automatic translation of simple spatial stimulus features into motor actions, such as laterality (Simon, 1969) or orientation (Craighero et al., 1999), but also complex features, such as hand postures (Stürmer et al., 2000), or body parts (Bach et al., 2007; Gillmeister et al., 2008; Tessari et al., 2010; for a recent review, see Knoblich and Prinz, 2005).

Ideomotor accounts therefore differ from motor simulation accounts in that they assume that motor output can also be driven by perceptual rather than motor content of a message. They predict that not only action, but any spatial content – even such complex cues as an object’s shape – should drive the motor system, irrespective of whether it directly relates to action. In fact, the motor system may be influenced by words denoting objects, which we have never interacted with before (e.g., “moon”), as long as their representation carries strong visual shape information. This study directly tests this hypothesis. It investigates whether mentally representing an object automatically activates iconic movements that capture the object’s shape. Importantly, and in contrast to previous research investigating motor effects in word processing (e.g., Glenberg and Kaschak, 2002; Tucker and Ellis, 2004; Zwaan and Taylor, 2006; Bub et al., 2008), it investigates gestures that are far removed from the actual way in which we typically interact with the objects, minimizing affordance-based contributions to gestural output.

We used the following strategy. The first experiment uses a stimulus–response compatibility (SRC) paradigm to establish a technique that can then be applied to study word processing. Participants saw abstract geometrical shapes – circles or squares – appearing on the screen, and were instructed to either produce a bimanual round or a square gesture in the vertical space in front of their body (Figure 1), depending on the color of the shapes. We found that the similarity of seen and to-be produced shapes affected both the time to initiate the gesture and the actual time required to perform the gestures. This study therefore demonstrates that even such complex cues as geometrical shape are mapped onto the motor system and facilitate the selection and performance of corresponding gestures. The second experiment takes this logic further, and investigates whether shape information also flows into the motor system when this shape information is not directly available from the stimulus, but is carried by words referring to either round (e.g., “carousel”) or square objects (e.g., “billboard”), which have no direct links to action. We tested whether such influences on gesturing requires explicit semantic processing of the words. Experiment 3 replicates these effects with non-words that were, in a previous learning task, arbitrarily associated with objects. Finally, Experiment 4 investigates the actual performance of the gestures with motion tracking techniques and directly demonstrates that processing words that represent objects of particular shapes influences the actual performance of gestures – the shape of hand motion through space – and not just gesture selection processes. The experiment therefore reveals intimate links between the representation of implied shape and actual motor control.

FIGURE 1

Figure 1. The two gestures that participants were instructed to perform in all experiments. Black arrows show the actual shapes, gray arrows show movements back to the rest keys.

Experiment 1: Visual Shapes

Experiment 1 investigates whether seeing abstract geometrical shapes affects ongoing behavior, even when irrelevant to current behavioral goals. Participants produced bimanual square or circle gestures in the vertical plane in front of their body (Figure 1) in response to the color of squares and circles. From a motor simulation perspective, there should be minimal grounds for predicting effects of the perceived shapes on motor behavior. First, in contrast to the 3D photographs of real objects with strong affordances used in prior research (Tucker and Ellis, 1998, 2001; Bub et al., 2008), we presented abstract geometrical shapes as two-dimensional abstract wire frames, which are unlikely be associated with specific motor responses (e.g., Symes et al., 2007). Second, the bimanual gestures required by the participants do not mirror any object-directed behavior and are not part of the typical set of affordances of commonplace objects. Yet, if the same spatial codes represent both shapes in the environment and actions one can performed, then these hand movements should nevertheless be influenced by the presented shapes. In particular, participants should be quicker to initiate and produce square gestures in response to seeing a square and slower when seeing a circle, and vice versa. Such findings would confirm that not only simple perceptual properties such as laterality (Simon, 1969) are represented in common perceptual-motor codes, but that the same is true for more complex geometrical shape information.

Materials and Methods

Participants

Ten participants (seven females), all students at Bangor University, Wales, took part in the experiment. They ranged in age from 18 to 22 years and had normal or corrected-to-normal vision. The participants received course credits for their participation. They satisfied all requirements in volunteer screening and gave informed consent approved by the School of Psychology at Bangor University, Wales, and the North-West Wales Health Trust, and in accordance with the Declaration of Helsinki.

Materials and apparatus

The experiment was controlled by the software Presentation run on a 3.2-GHz PC running Windows XP. The stimulus set consisted of seven pictures: a fixation cross (generated by the + sign of the Trebuchet MS font in white on a black background), a green circle, a green square, a red circle and a red square, plus a white square and a white circle. The circles and squares were line drawings (line strength 4.5) generated in PowerPoint and subsequently exported to bitmap (.bmp) format. The circles and squares were presented on a black background and were 12 cm tall and wide.

Procedure and design

The participants were seated in a dimly lit room facing a color monitor at an approximate distance of 60 cm. The participants received a computer driven instruction and their response assignment (i.e., whether the red color cued square gestures and green color cued round gestures, or vice versa). They then performed 12 training trials. When both experimenter and participant were satisfied that the task was understood, the main experiment started. It consisted of four blocks of 60 trials each, separated by a short pause that participants could terminate by pressing the space bar. Altogether, there were 12 different trial types, resulting from the factorial combinations of two movements (square, round), two shapes (square, round), and three SOAs (100, 200, 500 ms).

Participants initiated a trial by resting their index finger of the left and right hands on previously designated “rest” buttons (the V and N keys on the computer keyboard). After the presentation of a fixation cross (500 ms.), and a short blank (500 ms.), one of the two neutral stimuli was presented (white square or circle). After an SOA of 100, 200, or 500 ms, the circle turned either green or red. This served as the imperative cue for the participants to perform one of the previously designated gestures. Depending on the color of the stimulus, the participant would lift their fingers from the rest keys and either gesture a circle or a square with both hands (index fingers extended) in the vertical space in front of their body (black arrows in Figure 1). After finishing the gesture participants were instructed to bring the fingers back as quickly as possible, and in a straight line, to the rest keys (gray arrows in Figure 1). The assignment of gestures to colors was counterbalanced across participants. As soon as the fingers left the keys, the displayed shape was removed and replaced by a blank screen that remained on screen for the duration of the movement. As soon as the fingers returned to the rest keys the next trial started.

Both the times from stimulus onset to release of the rest keys (response times/RTs) and times from movement onset to return to the rest keys (movement times/MTs) were measured. RTs and MTs were calculated from the average RTs and MTs of both hands. Please note that the equipment used in Experiment 1 does not allow us to measure gesturing accuracy. The issue of gesturing accuracy is specifically addressed in Experiment 4, in which motion tracking was used to record the participants’ movements through 3D space.

Results

For the analysis of RTs (Figure 2A), only trials were considered in which the release of both rest keys followed by the return to both rest keys was detected. All trials that lay beyond three standard deviations of the condition means were excluded (2.9% of trials). The remaining RTs were analyzed with 2 × 2 repeated measurement ANOVA with the factors Shape (circle, square) and Gesture (circle, square). Entering SOA as a further variable in the ANOVA did not reveal any additional interactions (F < 1); data were therefore collapsed across the three SOAs. The analysis revealed neither a main effect of Gesture (F[1, 9] < 1), nor a main effect of Shape (F[1, 9] = 2.25; p = 0.17), but the predicted interaction of both factors was significant (F[1, 9] = 27.23; p < 0.001, η² = 0.752). Post hoc t-tests showed that participants initiated a circle more quickly when seeing a circle than a square (t[9] = 5.21; p < 0.001), but that they initiated a square more quickly when they saw a square than a circle (t[9] = 2.55; p = 0.031).

FIGURE 2

Figure 2. Response times (A) and movement times (B) in Experiment 1. Error bars show the standard error of the mean.

Movement times (Figure 2B) were analyzed in an analogous manner. As SOA did not interact with compatibility effects (F = 1.470, p = 0.259), we again collapsed the data across this factor. The analysis did not reveal a main effect of Shape (F[1, 9] < 1), but a main effect of Gesture (F[1, 9] = 32.44; p < 0.001), with participants gesturing circles more quickly than squares. The predicted interaction of Shape and Gesture just failed to reach full significance (F[1, 9] = 4.85; p = 0.055, η² = 0.350). Participants gestured circles more quickly when the stimulus was a circle that when it was a square (t[9] = 1.89; p = 0.091), but gestured a square more quickly when responding to a square than to a circle (t[9] = 1,79; p = 0.11), though the effect failed to reach significance.

Discussion

The experiment demonstrated that seeing geometrical shapes facilitates the production of similar gestures. Squares were gestured more efficiently when responding to a square, and circles were gestured more efficiently when responding to a circle. These effects were found even though the shapes did not belong to recognizable objects and even though they did not have to be encoded for the participant’s color decision task. Moreover, the required movements of the participants were not goal directed and did not match the usage patterns of commonplace objects. Finally, typical object-based affordance effects are usually not observed when responses are based on an action-irrelevant feature, such as color (e.g., Tipper et al., 2006). That robust effects on gesture output were nevertheless detected therefore indicates that the shape of viewed objects feeds directly into the motor system, affecting the production of ongoing movements, as predicted by ideomotor accounts of action control. Our data therefore put shape information on a similar level as perceptual-motor features, such as laterality (i.e., the Simon effect, Simon, 1969; for a review, see Hommel, 2010) or magnitude (i.e., the SNARC effect, Dehaene et al., 1993), for which similar perception-action links have been reported.

Although the effect was less robust, the compatibility of shape and movement appeared to not only affect gesture selection (as measured by RTs), but also their actual execution (MTs). As the geometrical shapes disappeared as soon as the participants released the rest key and started the movement, this MT effect cannot be attributed to direct perceptual interference with gesture production. Rather, it appears to have a more cognitive origin, suggesting that the shape codes generated during shape observation have a long lasting effect on both gesture selection and actual execution.

Experiment 2: Object Words

Experiment 1 demonstrated that simply seeing abstract shapes – circles or squares – may elicit gestures of the same shape. Although no language stimuli were utilized, the data are nevertheless relevant to understanding gesture. In many situations, people have to coordinate their actions and attention to share information or achieve a goal. Often, this coordination is achieved not only through language, but though iconic gestures that transmit the communicative intent by capturing the objects’ attributes (e.g., Morsella and Krauss, 2005). Experiment 1 provides evidence for a natural and highly automatic pathway that can explain gesturing in such circumstances.

To explain gesturing in a language context, it is important to demonstrate that similar processes also occur for content that is available only mentally, not visually. Experiment 2 addresses this question, by activating shape information through words denoting either round or square objects. Previous work has demonstrated motor system activation for words directly representing actions, such as “kick” or “push” (Hauk et al., 2004; Kemmerer et al., 2008), or objects that have become associated with specific motor responses (e.g., Tucker and Ellis, 2004; Bub et al., 2008). Here, we go beyond these findings to demonstrate that motor codes are activated even when the words do not describe actions or objects associated with specific behaviors. The word “billboard,” for example, possesses no directly available shape information (the word itself has no square shape), does not directly describe any specific behavior (unlike words such a “push”), and it is not linked to a prior history of associated motor behaviors. Moreover, as in Experiment 1, the bimanual gestures participants had to produce were not goal directed and did not correspond to the typical usage patterns of commonplace objects. Yet, if (a) reading object words activates a visual representation of the object’s shape, and (b) shape information is represented in terms of motor codes, then compatibility effects analogous to Experiment 1 should nevertheless be observed. Both MTs and RTs should again be slowed when participants have to gesture a circle in response to a word denoting a square object compared to a round object, and vice versa when participants have to gesture a square.

A second issue is how deeply the word has to be encoded for these effects to take place. If gestural effects are driven by visual shape information associated with the semantic knowledge of the word, then motor codes should only be activated when the word is processed semantically. We therefore compared two conditions, tested in separate experimental blocks. In one condition, participants had to select the gesture to perform – round or square – on the basis of the color of the presented word, minimizing the requirement of semantic or even lexical analysis of the word. In the other condition, participants selected the gesture based on semantic information carried by the word. If the referent object was typically found inside the house (e.g., fridge) participants would have to perform one gesture and the other gesture if the object was typically found outside the house (e.g., billboard).