Metaphors are Embodied, and so are Their Literal Counterparts

Santana, Eduardo; De Vega, Manuel

doi:10.3389/fpsyg.2011.00090

ORIGINAL RESEARCH article

Front. Psychol., 10 May 2011

Sec. Cognition

volume 2 - 2011 | https://doi.org/10.3389/fpsyg.2011.00090

Eduardo Santana

Manuel de Vega*

Psicología Cognitiva, University of La Laguna, La Laguna, Spain

This study investigates whether understanding up/down metaphors as well as semantically homologous literal sentences activates embodied representations online. Participants read orientational literal sentences (e.g., she climbed up the hill), metaphors (e.g., she climbed up in the company), and abstract sentences with similar meaning to the metaphors (e.g., she succeeded in the company). In Experiments 1 and 2, participants were asked to perform a speeded upward or downward hand motion while they were reading the sentence verb. The hand motion either matched or mismatched the direction connoted by the sentence. The results showed a meaning-action effect for metaphors and literals, that is, faster hand motion responses in the matching conditions. Notably, the matching advantage was also found for homologous abstract sentences, indicating that some abstract ideas are conceptually organized in the vertical dimension, even when they are expressed by means of literal sentences. In Experiment 3, participants responded to an upward or downward visual motion associated with the sentence verb by pressing a single key. In this case, the facilitation effect for matching visual motion-sentence meaning faded, indicating that the visual motion component is less important than the action component in conceptual metaphors. Most up and down metaphors convey emotionally positive and negative information, respectively. We suggest that metaphorical meaning elicits upward/downward movements because such metaphors are grounded in the bodily expression of the corresponding emotions.

Introduction

People use language to refer literally to perceptual objects or events, in sentences such as “the balloon rose.” However, they can also refer to abstract events and entities using the indirect pathway of metaphors. For example, “his mood rose” expresses the abstract concept of “arriving at a good mood” in terms of the more concrete concept of “rising.” In other words, metaphors help us to understand abstract or relatively unclear concepts such as mental states in terms of concrete sensory-motor experiences. Two related features emerge from the current literature on metaphorical meaning: metaphors are conceptual, and metaphors are embodied. The conceptual nature of metaphors means that metaphorical expressions are tied to metaphorical concepts in a systematic way (Lakoff and Johnson, 1980; Johnson, 1987; Lakoff, 1987). In other words, concepts are primarily metaphorical (rather than symbolic or propositional) and linguistic metaphors are systematically derived from these concepts. On the other hand, conceptual metaphors are grounded in embodied representations; that is to say, we use sensory-motor experience to conceptualize abstract domains such as time, feelings, interpersonal relationships, etc. (Lakoff and Johnson, 1980; Gibbs, 1994, 2006; Johnson and Lakoff, 2002; Casasanto, 2009). Thus, given the prominence of space in our perceptual and motor experience, spatial dimensions are frequently used to support rich metaphorical conceptual systems (Lakoff and Johnson, 1980). For instance, speakers of English and other languages have been found to conceptualize time by mapping the future in front and the past behind themselves (e.g., Casasanto and Boroditsky, 2008; Sell and Kaschak, 2010) or, alternatively, the future to the right and the past to the left (Torralbo et al., 2006; Santiago et al., 2007). There is also a rich orientational metaphor system in the up/down spatial dimension, which is the main concern of this paper. 
According to Lakoff and Johnson (1980), some metaphorical notions (good, virtue, happiness, consciousness, health, wealth, high status, power, etc.) are mapped onto the “up” pole of the vertical dimension, whereas the opposite notions (evil, vice, sadness, unconsciousness, illness, poverty, low status, etc.) are mapped onto the “down” pole.

Some recent studies have supported the embodied character of up/down conceptual metaphors. Schubert (2005) found evidence that the concept of power is partially mapped onto the physical vertical dimension; in other words, the “control is up – lack of control is down” conceptual metaphor is embodied. When participants were asked to judge which of two groups was the more powerful (e.g., master or servant), their response was faster if the word for the powerful group was in the upper part of the screen than if it was in the lower part. Furthermore, Moeller et al. (2008), using a spatial attention paradigm, found that individuals with dominant personalities favored the vertical dimension of space more than individuals with low dominance, being faster to respond to probes along a vertical axis, while both groups performed similarly along a horizontal axis.

In the same vein, the “more is up – less is down” orientational metaphor was investigated by Joseph et al. (1994), who found that participants’ judgments of their performance in a proofreading task were influenced by the size of the pile of documents they were required to read. Individuals who were required to read pages inside journals (high pile) judged that they had done more work and a better job than those who were asked to read single pages (low pile), despite the fact that the two conditions demanded the same amount of work. Similarly, Langston (2002) found that texts that violate the “more is up – less is down” conceptual metaphor are more difficult to comprehend than texts that are consistent with that metaphor, yielding slower response times and higher error rates in a semantic task. Moreover, Ito and Hatta (2004) reported a SNARC-like effect (Spatial-Numerical Association of Response Codes) for the vertical dimension: they found that participants were faster to respond to large numbers with the top choice key than with the bottom choice key, whereas the reverse was true for small numbers.

Finally, concerning the “positive is up – negative is down” metaphor, Meier and Robinson (2004) reported that evaluations of positive words were faster when the words appeared in the upper rather than the lower position on the screen, whereas for negative words they found the opposite pattern. They also found that participants with higher neuroticism or depressive symptoms responded faster to spatial attention targets placed at the bottom, which suggests that negative affect biases selective attention in a direction that favors lower regions of physical space (Meier and Robinson, 2006). Memory tasks are also sensitive to the “positive is up – negative is down” metaphor. Crawford et al. (2006) asked participants to remember images with an emotionally positive or negative valence, and found that the remembered locations of positive images were biased upward, while those of negative images were biased downward. Casasanto and Dijkstra (2010) also reported that participants retrieved more positive memories of their life when instructed to move marbles up, and more negative memories when instructed to move them down, demonstrating that positive and negative life experiences are implicitly associated with schematic representations of upward and downward motion.

The above studies investigated how the up/down metaphorical conceptualization modulates different semantic tasks, such as semantic classifications in bipolar categories, but they did not directly investigate whether motions in the vertical dimension are activated online during ordinary comprehension of metaphors. To test the embodied meaning approach to language comprehension, some researchers have used an action-sentence compatibility effect (ACE) paradigm. The basic ACE procedure consists of asking participants to listen to or read sentences describing motor events while they perform a motor task designed to match or mismatch the meaning of the sentences (Glenberg and Kaschak, 2002; Buccino et al., 2005; Borreggine and Kaschak, 2006; Zwaan and Taylor, 2006; Glenberg et al., 2008; Kaschak and Borreggine, 2008; de Vega et al., in revision). In most cases a facilitatory ACE has been reported in the literature; that is, the meaning-action matching condition produces faster motor responses than the mismatching condition. For example, Glenberg and Kaschak (2002) asked people to judge the sensibility of sentences describing a transfer motion toward or away from them (e.g., “Andy delivered the pizza to you” or “You delivered the pizza to Andy”). The judgment time for sensible sentences was faster for the matching conditions (e.g., sentences describing a transfer toward oneself with the “yes” response being a hand motion toward oneself) than for the mismatching conditions.

Surprisingly, in spite of the abundant literature on metaphors, no study has yet addressed whether the comprehension of up/down orientational metaphors activates embodied representations online. To our knowledge, the only ACE study on metaphorical comprehension to date demonstrated that the comprehension of “future is in front – past is behind” spatial metaphors activates body action simulations in the front/back spatial dimension (Sell and Kaschak, 2010). The present research aims to fill the gap, testing whether the comprehension of up/down metaphors activates vertical body actions or visual motions online. Given that orientational metaphors are ubiquitous in everyday language, it is possible that they might have become “frozen” or “dead”; if such were the case, they would not activate embodied representations online. Conversely, if embodied representations were part of metaphorical meaning, then they would be activated in online comprehension. Moreover, not only would explicit orientational metaphors activate embodied representations but, as we will argue, abstract literal sentences that describe the same conceptual domains as orientational metaphors would also activate embodied representations in the same way. Assuming that metaphors do activate embodied representations online, another goal of this study was to determine whether these representations have a motor component, a visuo-spatial component, or both. Notice that most up/down orientational metaphors employ verbs of motion (e.g., rising, going down, falling, jumping, etc.), and these motions could potentially be understood as visual or motor events. For instance, the metaphor “made him rise with victory” could be represented either as an upward visual motion (e.g., as watching a movie), as an upward body action, or as a combination of the two.

In this research, the ACE procedure was modified to test how orientational metaphors modulate vertical body actions (Experiments 1 and 2). Participants performed either upward or downward hand motions while they read orientational metaphors and other types of sentences. If orientational metaphors involved a simulation of a vertical action, then they would interact with the enactment of a matching vertical body motion. Thus, one might expect faster responses in matching as compared to mismatching conditions: up metaphors (“climbing up in the company”) would facilitate upward hand motions, and down metaphors (“falling into depression”) would facilitate downward hand motions. Moreover, given that the conceptual system is supposedly itself metaphoric (Lakoff and Johnson, 1980), the same meaning-action facilitation should be observed with literal sentences referring to the same concepts as orientational metaphors. For instance, “succeeding in business” would facilitate upward hand motions and “becoming sick” would facilitate downward hand motions. By contrast, if the ACE were restricted to orientational metaphors, this would suggest that it is a lexical phenomenon triggered by the motion verb, rather than a conceptual phenomenon.

The purpose of Experiment 3 was to test whether metaphorical meaning involves a purely visual motion component. Rather than using the ACE paradigm of the previous experiments, participants were asked to press a single key in response to either an upward or a downward visual animation. In this way, the visual motion matched or mismatched the orientational meaning of the sentences, but participants did not perform any upward or downward motion. In spite of this, it is possible that a visual motion-sentence compatibility effect would emerge; namely, we might expect faster responses for matching conditions (e.g., up metaphors and upward visual animation) than for mismatching conditions. In fact, some studies have used a dual-task paradigm to demonstrate a visual motion-sentence compatibility effect. For instance, Kaschak et al. (2005) asked participants to make semantic judgments on auditory sentences that described motions toward them (e.g., “the car approached you”) or away from them (e.g., “the horse ran away from you”), while simultaneously viewing dynamic stimuli that produced an illusory motion toward or away from them. Semantic judgments were faster in the mismatching conditions than in the matching conditions, suggesting that processing a given visual motion (e.g., away from you) and processing the meaning of the sentence compete for the same neuronal resources. Furthermore, a neuroimaging study observed that during the comprehension of visual motion-related sentences, there was activation in a brain region responsible for processing dynamic visual stimuli (Rueschemeyer et al., 2010).

To test these ideas, three kinds of sentences were created (see examples in Table 1). First, literals describing upward or downward physical movements were included as a baseline condition, in which an ordinary ACE should be expected. The second group comprised metaphors, including upward motion verbs (e.g., “climbing,” “flying,” “jumping”), and downward motion verbs (e.g., “falling,” “sinking,” “burying”). Typically, upward metaphors referred to abstract positive events like “success,” “gain,” “improvement,” or “happiness,” while downward metaphors referred to negative events such as “defeat,” “loss,” “worsening,” or “sadness.” The third group, abstract sentences, referred to concepts similar to those of the orientational metaphors, although in this case employing literal verbs such as “succeeding,” “improving,” “failing,” or “winning.” While reading the critical verb (Table 1, in bold), participants were asked to perform a hand motion that either matched or mismatched the direction of the motion described by the sentence. The reaction times in the matching and mismatching conditions provided the main ACE measure. In Experiment 1, the cue prompting the motor response was a visual upward or downward animation of the target word. In Experiment 2, the cue prompting the motor response was a color switch (from black to red or blue) of the target word, with the purpose being to observe the meaning-action interaction in the absence of visual motion. In Experiment 3 there was again an upward/downward animation of the target word, but in this case participants were not asked to move their hand in these directions. They simply performed a go/no-go task: pressing a single key when the target word moved and not pressing any key otherwise.

Table 1. Examples of the original Spanish materials, and their English translation.

Experiment 1

In this experiment, the motor responses were cued by an upward/downward visual animation of the sentence verb, which was easily interpreted by participants as a prompt to immediately move their finger in the same direction. The visual animation started 200 ms after the verb onset, because action-verbs more strongly activate the motor cortex within this temporal range, according to magnetoencephalography studies (e.g., Pulvermüller et al., 2005). Furthermore, in a previous study with literal transfer sentences, de Vega et al. (in revision) found that this interval was optimal for obtaining an ACE. In addition to the motor responses, participants were asked to respond to a simple yes/no memory question after reading each sentence, providing later measures of meaning-action effects.

The procedure used here was the same as the one employed by de Vega et al. (in revision), and differs from that of typical ACE studies (e.g., Glenberg and Kaschak, 2002; Glenberg et al., 2008), in which the directional motor responses were associated with sensibility judgments and were collected after the whole sentence had been understood. By contrast, here the motor action was a simple psychophysical response to a visual cue, and did not carry the burden of a semantic judgment. The procedure allows meaning-action effects to be collected at a relatively early stage of sentence processing (at the verb), and also allows the motor effects (measured on the motor response) to be dissociated from the semantic effects (measured on the memory task).

Method

Participants

Sixty students of Psychology of the University of La Laguna, all native speakers of Spanish, took part in the experiment in exchange for academic credits.

Materials

Ninety-six experimental sentences (32 literals, 32 metaphors, and 32 abstract sentences) were constructed, as well as 50 filler sentences describing static scenarios (e.g., “The delivery van was near the door”), and eight practice sentences. The experimental and the practice sentences, illustrated in Table 1, shared the following structure: subject/noun complement/verbal periphrasis/main verb/verb complement. The main verb, which described an upward or downward motion or an abstract event, was always the fourth segment. Each of the five segments had approximately the same number of words across sentences: 2 in the first, 2 or 3 in the second and fifth, and 1 or 2 in the third and fourth, depending on the particular periphrasis. This periphrastic structure was not suitable for the filler sentences, which described static situations. As a result, the filler sentences had four segments and the critical verb was placed in the third position. Finally, each sentence had a corresponding yes/no question that could refer to any segment.

Design and Procedure

The experiment manipulated 3 Sentence types (literals, metaphors, and abstract sentences) × 2 Motor directions (upward, downward) × 2 Semantic directions (upward, downward) in a factorial within-participant design. The experimental session started with instructions for the task, followed by eight practice trials similar to the experimental ones; finally, the 96 experimental trials, mixed with the 50 fillers, were presented in random order in two blocks. The experimental trials included eight sentences for each of the 12 experimental conditions resulting from the combination of the three factors: upward literal-upward motor, upward metaphor-upward motor, upward abstract-upward motor, downward literal-upward motor, downward metaphor-upward motor, downward abstract-upward motor, upward literal-downward motor, upward metaphor-downward motor, upward abstract-downward motor, downward literal-downward motor, downward metaphor-downward motor, downward abstract-downward motor. There were two counterbalanced conditions, in which the assignment of motor direction to trials was reversed.
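
The factorial structure described above can be sketched in a few lines of Python. This is only an illustration of the 3 × 2 × 2 crossing and of the reversal of motor direction across counterbalanced lists; the names `build_conditions` and `counterbalance` are hypothetical and do not correspond to the software actually used to run the experiment.

```python
from itertools import product

SENTENCE_TYPES = ("literal", "metaphor", "abstract")
SEMANTIC_DIRECTIONS = ("upward", "downward")
MOTOR_DIRECTIONS = ("upward", "downward")
SENTENCES_PER_CONDITION = 8  # eight sentences per cell, as stated in the text


def build_conditions():
    """Return the 12 cells of the 3 x 2 x 2 design, flagging matching cells."""
    return [
        {
            "sentence_type": stype,
            "semantic_direction": semantic,
            "motor_direction": motor,
            "match": semantic == motor,
        }
        for stype, semantic, motor in product(
            SENTENCE_TYPES, SEMANTIC_DIRECTIONS, MOTOR_DIRECTIONS
        )
    ]


def counterbalance(conditions):
    """Reverse the motor direction of every cell (hypothetical second list)."""
    flip = {"upward": "downward", "downward": "upward"}
    reversed_cells = []
    for cell in conditions:
        motor = flip[cell["motor_direction"]]
        reversed_cells.append(
            {
                **cell,
                "motor_direction": motor,
                "match": cell["semantic_direction"] == motor,
            }
        )
    return reversed_cells


conditions = build_conditions()
total_trials = len(conditions) * SENTENCES_PER_CONDITION  # 12 cells x 8 sentences
```

Half of the 12 cells are matching cells, and reversing the motor direction turns every matching cell into a mismatching one, so a given sentence contributes to both conditions across the two lists.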

An ordinary computer keyboard was used for the recording of motor responses. The keyboard was fixed in an upright position, remaining thus during the whole experimental session, with all keys removed except the letters A, G, and L, which were placed in a downward, central, and upward position, respectively. Their distances to the table surface were 10, 18, and 26 cm, respectively. Upward and downward keys were covered by a 5-mm red square with icon arrows of the corresponding directions. A set of concentric circles, like a small target, covered the central (resting) key. The rest of the keyboard was covered with white cardboard. Participants were seated in front of the computer screen, with their elbows resting on the table, and were instructed to use the response keyboard with their dominant hand, and the mouse with the other.

The experimental trials consisted of the following sequence: a fixation point in the middle of the screen prompted participants to press the resting key and keep it pressed while the first three segments appeared automatically on the screen, each remaining for 800 ms. Then, the fourth segment with the critical verb appeared and remained static for 200 ms, before “jumping” upward or downward. This apparent motion was a cue for participants to release the resting key and move their dominant hand’s index finger to press the key either above or below it. After the motor response, the final segment was presented, and remained on the screen for 800 ms. Finally, a memory question referring to the sentence was given, and participants responded yes or no by using one of the two mouse buttons with their non-dominant hand. The practice trials were similar to the experimental ones, except that the former were followed by motor response feedback.
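
The timing of a trial can be laid out as a simple timeline. The sketch below uses only the durations given above (800 ms per segment, 200 ms before the verb moves); it assumes, for illustration, that there are no blank intervals between events and that the final segment appears at the moment of the key press. The function name `trial_timeline` is hypothetical.

```python
SEGMENT_MS = 800      # duration of segments 1-3 and of the final segment
VERB_STATIC_MS = 200  # the verb stays still this long before "jumping"


def trial_timeline(motor_rt_ms):
    """Return (event, onset_ms) pairs, timed from the onset of segment 1.

    motor_rt_ms is the participant's motor response time (release time plus
    key-pressing time), measured from the onset of the verb animation.
    Assumption: no gaps between events.
    """
    events = []
    t = 0
    for index in (1, 2, 3):
        events.append((f"segment_{index}", t))
        t += SEGMENT_MS
    events.append(("verb_onset", t))
    t += VERB_STATIC_MS
    events.append(("verb_jump_cue", t))
    t += motor_rt_ms
    events.append(("final_segment", t))
    t += SEGMENT_MS
    events.append(("memory_question", t))
    return events


timeline = dict(trial_timeline(motor_rt_ms=750))
```

Under these assumptions the movement cue always arrives 2600 ms after the first segment, whereas the onsets of the final segment and of the memory question depend on how quickly the participant responds.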

Results

Four participants were discarded from the analyses because they gave more than 10% wrong responses in the memory task. The average error rate for the motor response was very small (less than 1%). Reaction times exceeding the mean by more than two SDs were also excluded from the analyses as outliers (4.2% of data). Several measures were collected for analysis. The releasing time (from the animation onset to the release of the resting key) and the key-pressing time (from the release to the pressing of one directional key) are components of the same response, and so we decided to use the sum of the two as a single dependent measure, which we called motor response time. We also analyzed the time and accuracy of the responses to the memory questions. Repeated measures analyses of variance (ANOVAs) including Sentence type, Semantic direction, and Motor direction were performed for each of the above dependent measures, both by participant (F₁) and by item (F₂). Additional Semantic direction × Motor direction ANOVAs were also run for each Sentence type separately, as well as t-tests between pairs of conditions sharing the same directional motor response. We will only report significant effects.
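
The dependent measure and the outlier criterion can be illustrated as follows. This is a minimal sketch rather than the analysis script used in the study: it assumes that the two-SD cutoff applies only to the upper tail and is computed over whatever set of reaction times is passed in (the text does not specify whether trimming was done per participant or per condition), and it uses the sample SD. The function names are hypothetical.

```python
from statistics import mean, stdev


def motor_response_time(release_ms, keypress_ms):
    """Sum of the releasing time and the key-pressing time, in ms."""
    return release_ms + keypress_ms


def trim_outliers(rts, n_sd=2.0):
    """Drop reaction times more than n_sd sample SDs above the mean.

    Assumption: only slow responses are trimmed, and the cutoff is computed
    over the list supplied by the caller.
    """
    if len(rts) < 2:
        return list(rts)
    cutoff = mean(rts) + n_sd * stdev(rts)
    return [rt for rt in rts if rt <= cutoff]


# Hypothetical reaction times (ms) with one very slow response.
example_rts = [700, 720, 740, 760, 780, 800, 2000]
trimmed = trim_outliers(example_rts)
```

In this made-up example the 2000 ms response lies more than two SDs above the mean and is removed, while the remaining six values are kept.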

Motor Response Times

There was a main effect of Motor direction [F₁(1, 55) = 40.188, MSe = 86877.74; F₂(1, 90) = 9.15, MSe = 19773.01, p ≤ 0.0001]; specifically, downward responses were faster (M = 749 ms) than upward responses (M = 785 ms). However, this effect was modulated by the important Motor direction × Semantic direction interaction: F₁(1, 55) = 21.79, MSe = 87347.48, p < 0.001; F₂(1, 90) = 9.15, MSe = 19773.01, p ≤ 0.003. This interaction consists of a matching < mismatching pattern, as shown in Figure 1. Moreover, the Motor direction × Semantic direction interaction was significant in the by-participant analysis for each type of sentence analyzed separately: literals [F₁(1, 55) = 4.58, MSe = 4798.17, p ≤ 0.037; F₂(1, 30) = 2.13, n.s.], metaphors [F₁(1, 55) = 10.63, MSe = 4090.46, p ≤ 0.002; F₂(1, 30) = 6.62, MSe = 2444.53, p ≤ 0.015], and abstract sentences [F₁(1, 55) = 6.99, MSe = 3438.67, p ≤ 0.01; F₂(1, 30) = 1.70, n.s.], indicating that the matching < mismatching pattern was shared by the three types of sentences, although it was reliable by items only for metaphors. Further t-tests performed for each pair of matching-mismatching conditions revealed significant effects for both upward [t(55) = 2.38, p ≤ 0.021] and downward [t(55) = 2.60, p ≤ 0.012] motor responses in metaphors, for upward motor responses [t(55) = 2.181, p ≤ 0.033] in abstract sentences, and for upward motor responses [t(55) = 2.332, p ≤ 0.023] in literals.
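
The matching < mismatching pattern amounts to a difference between two means. The sketch below shows how such a matching advantage could be computed from trial-level records; the four reaction times are invented purely for illustration and are not data from the experiment, and `matching_effect` is a hypothetical helper.

```python
from statistics import mean


def matching_effect(trials):
    """Mean mismatching RT minus mean matching RT, in ms.

    A positive value means that responses were faster when the motor
    direction matched the semantic direction of the sentence.
    """
    matching = [t["rt"] for t in trials if t["semantic"] == t["motor"]]
    mismatching = [t["rt"] for t in trials if t["semantic"] != t["motor"]]
    return mean(mismatching) - mean(matching)


# Invented illustration data: one trial per Semantic x Motor cell.
example_trials = [
    {"semantic": "up", "motor": "up", "rt": 760},
    {"semantic": "down", "motor": "down", "rt": 730},
    {"semantic": "up", "motor": "down", "rt": 770},
    {"semantic": "down", "motor": "up", "rt": 800},
]
effect = matching_effect(example_trials)
```

Because the effect is defined on the Semantic direction × Motor direction crossing, it is separate from the main effect of Motor direction: in the invented data downward responses are faster overall, yet the matching cells are still faster than the mismatching ones.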

Figure 1. Experiment 1: Mean motor response times as a function of Sentence type, Semantic direction, and Motor direction. The vertical lines indicate the SDs, and the stars (*) mark significant matching-mismatching pairwise comparisons (p < 0.05).

Response Times in the Memory Task

The most important result was the significant Semantic direction × Motor direction interaction [F₁(1, 55) = 3.83, MSe = 284478.48, p ≤ 0.05; F₂(1, 90) = 19.19, MSe = 14988.06, p ≤ 0.0001]. This interaction was significant for literals [F₁(1, 55) = 6.07, MSe = 17154.01, p < 0.02; F₂(1, 30) = 11.22, MSe = 14821.93, p ≤ 0.002] and, in the by-item analysis, for abstract sentences [F₂(1, 30) = 6.63, MSe = 8353.65, p ≤ 0.016], showing the matching < mismatching pattern. The t-tests confirmed this pattern for upward motor direction [t(55) = 3.48, p ≤ 0.001] in metaphors, for downward motor direction [t(55) = 2.08, p ≤ 0.042] in abstract sentences, and for both upward [t(55) = 3.58, p ≤ 0.001] and downward [t(55) = 2.08, p ≤ 0.042] motor direction in literals. These results are shown in Table 2.

Table 2. Experiment 1. Mean response times (in ms) in the memory task, and mean percent errors.

Other results of less theoretical interest for this paper included a main effect of Sentence type [F₁(2, 110) = 17.77, MSe = 20289.54, p ≤ 0.0001], as abstract sentences produced faster responses (M = 1342 ms) than metaphors (M = 1404 ms) and literals (M = 1404 ms). This effect, however, was modulated by the Sentence type × Semantic direction interaction [F₁(2, 110) = 9.122, MSe = 195866.86, p ≤ 0.0001] and by the Sentence type × Motor direction interaction [F₁(2, 110) = 6.15, MSe = 582186.81, p ≤ 0.01].

Errors in the Memory Task

The mean percentages of errors are shown in Table 2. The ANOVA revealed a Semantic direction × Motor direction interaction [F₁(1, 55) = 5.64, MSe = 58.53, p ≤ 0.021], consisting of a smaller number of errors for matching than for mismatching conditions. Separate analyses performed for each sentence type confirmed a significant Semantic direction × Motor direction interaction only for metaphors [F₁(1, 55) = 9.78, MSe = 59.44, p ≤ 0.003]. The t-tests confirmed statistically significant differences for upward [t(55) = 2.63, p ≤ 0.01] and downward motor responses [t(55) = 2.12, p ≤ 0.04] in metaphors.

Discussion

Using a dual-task paradigm, Experiment 1 obtained a robust ACE. Several aspects of the results are remarkable. First, the ACE follows the standard pattern observed in other studies: the matching conditions resulted in better performance than the mismatching conditions, confirming that reading the experimental sentences activates embodied representations of vertical motions online. Second, the meaning-action effects were observed at two different moments, modulating the speed of the finger motion task that immediately followed the sentence verb, as well as the speed and accuracy of the memory task recorded at the end of the sentence. This fact suggests a symmetric meaning-action modulation, as will be argued in the general discussion. In other words, not only does the sentence meaning modulate the performance of a motor action, but the motor action also modulates the comprehension of the sentence. Third, the ACE was obtained in all three types of sentences: in literal sentences describing upward/downward motions, in metaphors, and even in abstract sentences that shared meaning with metaphors, indicating that the effects were not specifically associated with the action-verbs, but with the metaphorical domain underlying these sentences. Interestingly, the ACE was apparently more conspicuous in metaphors (significant for both upward and downward movements) than in the other types of sentences (significant only for upward movements), although the Sentence type × Motor direction × Semantic direction interaction was not significant [F(1, 55) < 1].

Experiment 2

Experiment 1 confirmed our predictions for meaning-action effects, supporting the embodied cognition approach to metaphorical meaning. Moreover, the activation of embodied representations was found to occur online while participants were reading the sentence verb and extended to the end of the sentence. Notice, however, that an apparent motion of the sentence verb cued the upward/downward finger motion in the same direction. Consequently, the visual motion effects and the motor response effects were potentially confounded. The apparent motion might produce a compatibility effect itself, indistinguishable from the meaning-action interaction. In this respect, some papers have reported that the apparent motion of visual stimuli can affect the comprehension of a simultaneously presented sentence whose meaning matches or mismatches the direction of the visual motion (e.g., Kaschak et al., 2005). To avoid this confound, in Experiment 2 the upward/downward finger motion was prompted by a color change of the critical verb, rather than by its apparent motion. This ensured that the static visual cue could not produce any compatibility effect itself, and the obtained effects, if any, could only be attributed to meaning-action compatibility.