Motivation Modulates Visual Attention: Evidence from Pupillometry

Wykowska, Agnieszka; Anderl, Christine; Schubö, Anna; Hommel, Bernhard

doi:10.3389/fpsyg.2013.00059

ORIGINAL RESEARCH article

Front. Psychol., 12 February 2013

Sec. Cognition

Volume 4 - 2013 | https://doi.org/10.3389/fpsyg.2013.00059

Motivation modulates visual attention: evidence from pupillometry

Agnieszka Wykowska¹

Christine Anderl^1,2,3

Anna Schubö⁴

Bernhard Hommel²*

¹Ludwig Maximilian University, Munich, Germany
²Leiden University, Leiden, Netherlands
³Goethe University, Frankfurt, Germany
⁴Philipps University, Marburg, Germany

Increasing evidence suggests that action planning does not only affect the preparation and execution of overt actions but also “works back” to tune the perceptual system toward action-relevant information. We investigated whether the amount of this impact of action planning on perceptual selection varies as a function of motivation for action, which was assessed online by means of pupillometry (Experiment 1) and visual analog scales (VAS, Experiment 2). Findings replicate the earlier observation that searching for size-defined targets is more efficient in the context of grasping than in the context of pointing movements (Wykowska et al., 2009). As expected, changes in tonic pupil size (reflecting changes in effort and motivation) across the sessions, as well as changes in motivation-related scores on the VAS were found to correlate with changes in the size of the action-perception congruency effect. We conclude that motivation and effort might play a crucial role in how much participants prepare for an action and activate action codes. The degree of activation of action codes in turn influences the observed action-related biases on perception.

Introduction

Human attention is traditionally considered a mechanism that allows prioritizing the processing of information that is behaviorally or emotionally relevant (e.g., Hansen and Hansen, 1988; Öhman et al., 2001; Feldmann-Wüstefeld et al., 2011), task-relevant (Folk et al., 1992; Wolfe, 1994; Müller et al., 2009; Wykowska and Schubö, 2010, 2011), signaled to be potentially relevant (Posner, 1980; Müller and Rabbitt, 1989; Friesen and Kingstone, 1998), or simply salient (e.g., Itti and Koch, 2000; Theeuwes, 2010). However, almost none of the available attentional theories consider the further use of the attentional mechanisms beyond perceptual judgment and decision-making. And yet, recent evidence suggests that attentional processes play a major role in action control, that is, in the processes that are following perception and action selection (e.g., Bekkering and Neggers, 2002; Fagioli et al., 2007; Wykowska et al., 2009). For instance, Fagioli et al. (2007) demonstrated that preparing for a manual reaching movement facilitates the detection of location-defined visual oddball stimuli while preparing for a manual grasp facilitates detection of size-defined oddball stimuli. Along similar lines, Wykowska et al. (2009) showed that preparing for a particular action might sensitize the perceptual system to information suited to guide that action. Biasing perception toward action-relevant dimensions would make it easier for motor-control operations to identify the perceptual parameters suited to specify the open parameters of online control, such as hand aperture (Hommel, 2010).

In the paradigm used by Wykowska et al. (2009, see also Wykowska et al., 2011, 2012) participants had to first prepare for a grasping or a pointing movement (as indicated by a cue picture representing a grasping/pointing hand), then detect and report a target in a visual search display (size or luminance pop-out item), and only then carry out the prepared movement on an indicated object (see Figure 1, which depicts an adapted version of the task used in Wykowska et al., 2009). Importantly, the movement task and the visual search task were perceptually and motorically unrelated: the visual search display was presented on a computer screen and the response was to be made on a mouse key with the dominant hand while the movement was to be executed with the other hand on one of the items of a movement execution device (Wykowska et al., 2009, 2011) or on one of three cups positioned below the computer screen (Wykowska et al., 2011, 2012). The design consisted of two action-perception congruent pairs: grasping and size (visual search target defined by size) and pointing and luminance (visual search target defined by luminance), as it was assumed that size is a potentially relevant dimension for a grasping movement while luminance is related to localizing – which is inherently linked to pointing. Results showed action-perception congruency effects: detection of a given dimension was facilitated when a congruent movement was being prepared, relative to the incongruent movement. In more detail, detection of size targets was faster when grasping movement was prepared, as compared to the pointing movement; and the reverse pattern was observed for detection of luminance targets. The authors concluded that visual selection is biased by a so-called intentional weighting mechanism (Wykowska et al., 2009, 2012; Hommel, 2010; Memelink and Hommel, in press), which prioritizes perceptual processing in order to deliver potentially action-relevant perceptual dimensions for open parameters of online action control, such as hand aperture (Hommel, 2010). Given that in the paradigm of Wykowska and colleagues the movement object was indicated only after the search task, all parameters of the prepared action could not be fully specified before the search task. Therefore, the intentional weighting mechanism prioritized processing of those perceptual dimensions that might have been necessary for efficient online action control.

FIGURE 1

Figure 1. Trial sequence of Experiment 1 and 2. Trials started with a fixation mark (in Experiment 1 it was the continuous valid pupil signal of 300+ 300 ms), followed by one of the cues (pointing/grasping; 800 ms), which informed participants which movement they should prepare. After another fixation mark (600 ms), the search display (target/no target) appeared on the screen (100 ms), and was followed by another fixation mark. Four hundred milliseconds after response to the search task, the movement position cue (400 ms) appeared and participants performed the prepared movement on the respective paper cup.

At the same time, however, in the paradigm of Wykowska and colleagues, preparing the whole action plan was not strictly necessary immediately after the movement cue. Therefore, the less motivated participants might have “kept in mind” what action to do and might have engaged in (more complete) preparation only after completing the search task. This strategy would be expected to reduce or prevent action control processes from taking place before the onset of the visual search display, which should reduce or eliminate congruency effects. The aim of the present study was to characterize the role of action control in visual attention by exploiting individual differences in motivation and effort invested in the task. Re-analyses of data from pilot studies (Anderl, 2009), together with informal observations, have suggested considerable individual differences not so much with respect to the initial motivation for the task but in the maintenance of motivation during the experimental session. A loss of motivation, we reasoned, would be likely to affect the (effortful) preparation of the movement, which in turn could affect action-perception congruency effects. Therefore, in the present study, we predicted that individuals with a greater loss of motivation/effort should show a (more) reduced effect of congruency between to-be-prepared action and target dimension.

To provide a reliable but unobtrusive measure of the individual motivational level (so to avoid any impact of the act of measurement on the participants’ motivational state) and its possible change over time, we recorded tonic pupil size. The sympathetic nervous system is known to both modulate pupil size (Loewenfeld, 1993) and regulate arousal, so that pupil diameter has often been taken to reflect motivation for, or effort spent on, a task (Ahern and Beatty, 1979; see also Steinhauer and Hakerem, 1992). Indeed, pupil size has been shown to be highly correlated with the level of cognitive effort, with more effort (due to task demands) being reflected in a larger pupil diameter (Hess and Polt, 1964; Beatty and Kahneman, 1966; Loewenfeld, 1993; see Beatty, 1982 for review on pupillometry as a measure of task-related mental effort; Kahneman, 1973 on the idea of effort theory of attention relating task demands to pupil dilation; and Granholm and Steinhauer, 2004 on pupillometry as measure of normal and abnormal cognitive processes). Moreover, changes in pupil diameter have been associated also with shorter-term changes in motivation, as induced by performance-based reward (e.g., Heitz et al., 2008).

Even though most studies have concentrated on phasic changes in pupil diameter related to a given task/stimulus, and have indicated that pupil dilation is related to cognitive effort in many domains such as lexical decision (Kuchinke et al., 2007), attention allocation (Karatekin et al., 2004), or load on attentional capacity (Kahneman, 1973), working memory load (Granholm et al., 1996; Van Gerven et al., 2004), or face perception (Goldinger et al., 2009), tonic pupil size has also been found to be an indicator of mental effort and arousal (Kahneman, 1973; Gilzenrat et al., 2010; see Laeng et al., 2012 for review), alertness and fatigue (Lowenstein and Loewenfeld, 1962; Merritt et al., 2004), or control state (Gilzenrat et al., 2010).

Experiment 1

Materials and Methods

Participants

Fifteen university students aged from 18 to 32 years participated in this study (age: M = 23.3, six males, one left-handed) for partial fulfillment of course credit or a financial reward. All of them reported normal or corrected-to-normal vision. Two participants were excluded from the analysis of pupil data (due to technical problems during recording) but remained in all behavioral analyses. APA ethical standards were followed throughout the study. The experiment was undertaken with the understanding and consent of each participant.

Apparatus and stimuli

Stimuli were presented on a standard 17′′ TFT monitor of a remote eye tracker system (Tobii T120, Tobii Technology, Stockholm, Sweden) with a refresh rate of 75 Hz. Stimulus presentation was controlled by E-Prime presentation software (Psychology Software Tools, Pittsburgh, PA, USA).

Participants were seated in central position relative to the midpoint of the screen. Head positions were stabilized with a chin rest at a viewing distance of approximately 50 cm. Room illumination was kept at the level of 100 lux. An asterisk (0.7° of visual angle, presented at the central position of the screen) served as fixation mark. The type of movement required in a trial was indicated by a cue (see Figure 1), which was a black and white photograph (18.4° × 23.7° of visual angle), showing a left hand performing a pointing or a grasping movement of a white paper cup. These cues were also presented centrally on the screen.

The search display (see Figure 1) consisted of 28 gray circles (2.4° of visual angle), which were presented on a white background. They were positioned on three imaginary circular arrays with diameters of 10.4°, 14.1°, and 17.7° of visual angle. Targets were defined as larger circles (3.3° of visual angle) and could appear at the lateralized positions (three left, three right) of the middle circle. Target present trials and target absent trials were randomly intermixed but were presented with equal probability (50%) each. Note that we used only one target dimension (size) to simplify the design for this experiment (similarly to Wykowska et al., 2011; Wykowska et al., 2012, Experiment 1), although the first studies of Wykowska et al. (2009) showed congruency effects for both size and luminance targets when the dimensions were blocked. As size is a relevant perceptual dimension for grasping but not for pointing movements, trials with grasping cues will be referred to as congruent whereas trials with pointing cues will be referred to as incongruent.

For each trial, the movement-relevant cup was indicated by a yellow asterisk (1.4° of visual angle; CIE L*a*b color coordinates: 87/5/82), which could appear at one of three different positions on the screen (10.0° of visual angle below the imaginary midline of the screen in vertical direction and −11.6°, 0°, and 11.6° of visual angle measured from the midline of the screen in horizontal direction). The positions were randomly intermixed and equally likely (33.3% each).

Directly below the three possible positions of the yellow asterisk, three white paper cups were positioned on a board that was installed at 20 cm below the computer screen. Participants were instructed to perform the prepared movement (pointing or grasping) on the indicated paper cup. The cups were identical in height (6.2 cm) but differed in diameter measured at 3.1 cm height (small: 5.3 cm, medium: 6.6 cm, large: 7.6 cm). Positions of the cups (left, middle, right position) were randomized between participants.

Participants were to indicate whether or not they had detected a target by pressing a mouse key with the index and middle finger of their dominant hand. The assignment between mouse keys and target present/absent trials was balanced between participants. The movement task (pointing vs. grasping of a cup) was carried out with the non-dominant hand to allow for simultaneous movement preparation (non-dominant hand) and response to the search task (dominant hand). The experimenter monitored the performed movements with a camera and coded their correctness online with a mouse key.

Procedure

Participants attended a 30-min practice session to become familiar with the movement task (180 trials). The experiment proper was conducted no earlier than 2 h and no later than 2 days after the practice session. It started with a five-point calibration and validation procedure of the eye signal. Subsequently, two practice (80 trials each) and two experimental blocks (240 trials each) were performed.

Trial sequence and timing are depicted in Figure 1. Trials started with a fixation asterisk (continuous valid pupil signal of 300 + 300 ms), followed by one of the cues (pointing/grasping; 800 ms), which informed participants which movement they should prepare. After another fixation mark (600 ms), the search display (target/no target) appeared on the screen (100 ms), and participants were supposed to respond to the visual search display as fast as possible. Those speeded responses to the search display were given by pressing the left or right mouse key for target present or absent trials respectively (or vice versa). Reaction times were measured as the time between the onset of the visual search display and key press. Upon the visual search response, another fixation mark was presented for 400 ms. Subsequently, an asterisk signaling which object to grasp/point to (400 ms) appeared, and participants performed the prepared movement on the respective paper cup. The movement task was not speeded but accuracy was stressed. Each trial ended with the registration of the movement type by the experimenter, followed by a 100-ms intertrial interval. Importantly, participants were instructed to prepare for the movement indicated by the cue but not to perform it until one of the yellow asterisks appeared on the screen to indicate the movement’s object. This was done in order to make sure that the movement representation would be active while participants were performing the visual search task.

Data Analysis¹

Behavioral analysis

For RT analyses, correct movement and correct search trials were taken into account. For the analysis of error rates in the search task incorrect movement trials were excluded, and for the analyses of error rates in the movement task, incorrect trials in the search task were excluded. Moreover, RT outliers (±3 SD from the overall mean RT of correct trials for each participant and each experimental block separately) were excluded from the RT analysis. Two separate ANOVAs with the factors congruency of movement (congruent vs. incongruent), display type (target present vs. absent), and block (1 vs. 2) were conducted for both mean RTs and error rates.

Analysis of pupil data

Pupil data were preprocessed to exclude blinks and other noise by means of a program developed by Henk van Steenbergen (Leiden University). It replaced missing values by the value measured for the other eye or, when data points were missing for both eyes, an interpolation between the pupil size of the last valid value before and the first valid value after the blink or otherwise missing data point.

For calculating the mean tonic pupil sizes for both experimental blocks, the mean values of data points recorded during a 600-ms interval directly preceding the movement cue onset of each trial (for similar procedure, see, e.g., Heitz et al., 2008) were calculated across all trials of block 1 and 2 separately. We chose an interval of 600 ms since it was identical to the minimum time the fixation mark stayed on the screen before each trial and because the baseline interval is commonly chosen in the range between 100 ms (e.g., Verney et al., 2004) and 1000 ms (e.g., Porter et al., 2007).

To assess the individual changes in motivation during the experimental session, we calculated the change in tonic pupil size (Δ_psize) from Block 1 to Block 2 by subtracting, for each participant, the average trial-baseline pupil size in the latter from the average trial-baseline pupil size in the former. Positive values therefore, denote decrease in pupil size. Pearson correlations between Δ_psize and four other difference measures were analyzed: the change in overall performance across blocks [Δ_performance = overall mean RT (or error rate respectively) in block 2 minus overall mean RT (or error rate respectively) in block 1] and the change in congruency effect [Δ_congruency = congruency effect in Block 1 minus congruency effect in Block 2]. Congruency effects were calculated as follows: Mean RT (or error rate respectively) in congruent trials were subtracted from Mean RT (or error rate respectively) in incongruent trials. Also in these subtracted scores, positive values denote decrease in congruency effect. As it is a common pattern in visual search literature to find different effects for target present and target absent trials (see Chun and Wolfe, 1996 as well as Schubö et al., 2004, 2007), and as our previous results showed differential congruency effects for target present and target absent trials (Wykowska and Schubö, 2012; Wykowska et al., 2012, Experiment 2; Wykowska et al., 2009, Experiment 3), we considered only target present trials for correlational analyses.

Results

Behavioral results

RTs from the search task were analyzed as a function of congruency, display type (target present or absent), and block (see Figure 2). A repeated measures ANOVA revealed significant main effects of congruency, F(1, 14) = 4.81, p < 0.05, $η_{P}^{2} = 0.26$ with responses in the search task being faster for the congruent (M = 535 ms, SEM = 29 ms) than the incongruent condition (M = 549 ms, SEM = 34 ms); display type, F(1, 14) = 6.36, p < 0.05, $η_{P}^{2} = 0.31$ with faster responses to target present displays (M = 526 ms, SEM = 34 ms) compared to target absent displays (M = 559 ms, SEM = 29 ms); and Block, F(1, 14) = 6.29, p < 0.05, $η_{P}^{2} = 0.31$ with slower responses in the first (M = 563 ms, SEM = 29 ms) than the second block (M = 522 ms, SEM = 35 ms). None of the interactions reached the level of significance, all Fs < 1, ps > 0.4.

FIGURE 2

Figure 2. Mean RTs as a function of congruency and block in Experiment 1. Congruent (white bars) and incongruent condition (gray bars) for target present displays in the first block (left) and second block (right). Error bars indicate the standard errors of the mean, adapted to within-participants designs, according to procedure described in Cousineau (2005).

The repeated measures ANOVA on error rates revealed only a significant main effect of display type, F(1, 14) = 11.72, p < 0.005, $η_{P}^{2} = 0.46$ with a higher error rate in target present displays (Misses; M = 8.9%, SEM = 1.9%) than in target absent displays (False alarms; M = 2.8%, SEM = 0.8%). No other effects or interactions reached the level of significance, all Fs < 2, ps > 1. The pattern of error rates however, was in line with the results in RT data: congruent trials yielded smaller error rates (M = 5.7%, SEM = 1.3) than incongruent trials (M = 5.9%, SEM = 1.1.); and therefore, there was no speed-accuracy trade-off observed.

Error rates in the movement task

Comparison of accuracy across the two types of movements revealed no difference in performance for pointing and grasping movements, t(14) < 1, p > 0.36 with pointing movements yielding 2.3% of errors on average and grasping movements 2.8% of errors on average.

Pupil data

Individual congruency effects and pupil sizes for each block separately are presented in Table 1.

TABLE 1

Table 1. Individual average pupil sizes and congruency effects in Block 1 and Block 2 of Experiment 1, sorted according to increasing pupil size.

Overall performance did not correlate with overall pupil size, r(13) = 0.001; p > 0.9. Similarly, changes in general level of performance (Δ_performance) did not correlate with changes in tonic pupil size (Δ_psize), neither for RT data, r(13) = −0.062; p > 0.8 nor for error rates, r(13) = 0.306; p > 0.3. Most importantly, however, Δ_psize was strongly correlated with changes in the congruency effect (Δ_congruency) in both RTs, r(13) = 0.77, p < 0.005, and error rates, r(13) = 0.59, p < 0.05 (see Figure 3).

FIGURE 3

Figure 3. Scatter plots and linear regression curves indicating the correlation between changes in pupil size and congruency effects (in RTs left, and in error rates right) across the two experimental blocks in Experiment 1. The changes in pupil size were calculated as a Mean Pupil Size_Block1 − Mean Pupil Size_Block2. Therefore, positive values indicate decrease in pupil sizes. Changes in congruency effects were calculated as Congruency effect in RT/Error rate_Block1 − Congruency Effect in RT/Error rate_Block2. Positive values denote decrease in congruency effects across blocks.

Median split analysis

In order to examine whether the effects of interest depended on individual differences in overall pupil size, we split the sample into two groups based on median pupil size (averaged across both experimental blocks). Pupil size (large vs. small) was then entered into a 2 × 2 mixed ANOVA as a between participants factor, with the within-participants factor of congruency. The analysis was conducted for target present trials only. Main effect of congruency was observed, F(1, 11) = 5.19, p < 0.05, but no interaction with pupil size, F < 0.2, p > 0.7. Subsequently, we tested whether the change in the congruency effect across blocks (Congruency_Block1 − Congruency_Block2) correlated with the change in pupil size (Pupil size_Block1 − Pupil size_Block2) for one of the groups more than for the other. Indeed, the correlation was significant only for the group of participants that had an overall larger pupil size, r(7) = 0.93, p < 0.01. The correlation was not observed for the group of participants with smaller pupil size r(6) < 0.55, p > 0.23. The change in overall performance (Mean RT_Block1 − Mean RT_Block2) did not correlate with the change in pupil size in either of the two groups, both rs < 0.45, ps > 0.4.

Discussion

The aim of Experiment 1 was to examine the influence of individual differences in capacity for maintenance of motivation and effort throughout the experimental session on the size of action-perception congruency effects. We reasoned that losses of motivation/effort might affect the extent to which the required action plan is activated and this in turn might affect the intentional weighting mechanism, i.e., the action-related biases on perceptual selection of action-relevant characteristics. Results indeed showed that changes in pupil size – a marker of individual motivation/effort – correlated with changes in congruency effects. In particular, the more the pupil size decreased over time (reflecting a decrease in motivation/effort), the more the size of congruency effects decreased as well. This suggests that maintenance of motivation/effort throughout an experimental session has a specific impact on the degree to which action plans are activated and the mechanism of intentional weighting is employed.

However, pupil size is only an indirect measure of motivation/effort. Therefore, Experiment 2 was conducted with the aim of examining the relationship between individual capacity for motivation maintenance and congruency effects with a more direct measure of motivation.

Experiment 2

Experiment 2 was conducted in order to test whether the correlation between fluctuations in levels of motivation/engagement throughout the experiment and the changes in size of the congruency effects would also be observed when a more direct measure of motivation is applied: the level of motivation as measured with a visual analog scale (VAS, see Bond and Lader, 1974; see also Kleih et al., 2010, for a similar methodology to assess individual levels of motivation).

Method

The paradigm of Experiment 2 remained similar to Experiment 1 except that instead of measuring pupil size during the experiments, the VAS was administered before the Experiment, between Block 1 and Block 2 and at the end of Experiment.