Tracking the allocation of attention using human pupillary oscillations

Naber, Marnix; Alvarez, George  A; Nakayama, Ken

doi:10.3389/fpsyg.2013.00919

ORIGINAL RESEARCH article

Front. Psychol., 10 December 2013

Sec. Cognition

Volume 4 - 2013 | https://doi.org/10.3389/fpsyg.2013.00919

Tracking the allocation of attention using human pupillary oscillations

Marnix Naber^1,2^*

George A. Alvarez¹

Ken Nakayama¹

¹Vision Sciences Laboratory, Department of Psychology, Harvard University, Cambridge, MA, USA
²Social and Behavioural Sciences, Cognitive Psychology Unit, Leiden University, Leiden, Netherlands

The muscles that control the pupil are richly innervated by the autonomic nervous system. While there are central pathways that drive pupil dilations in relation to arousal, there is no anatomical evidence that cortical centers involved with visual selective attention innervate the pupil. In this study, we show that such connections must exist. Specifically, we demonstrate a novel Pupil Frequency Tagging (PFT) method, where oscillatory changes in stimulus brightness over time are mirrored by pupil constrictions and dilations. We find that the luminance–induced pupil oscillations are enhanced when covert attention is directed to the flicker stimulus and when targets are correctly detected in an attentional tracking task. These results suggest that the amplitudes of pupil responses closely follow the allocation of focal visual attention and the encoding of stimuli. PFT provides a new opportunity to study top–down visual attention itself as well as identifying the pathways and mechanisms that support this unexpected phenomenon.

Introduction

Paying attention to items and events outside ones central gaze is a key cognitive skill (James, 1890; Posner, 1980). For instance, a driver's main focus is the road, but attention may need to be diverted to the pedestrians on the sidewalk as well. Visual attention is the cognitive process of (pre)allocating mental resources to particular locations, features, or objects in a visual scene (e.g., Scholl, 2001; Naber et al., 2011) to improve sensory processing of the selected information (Corbetta et al., 1990; Motter, 1993; Desimone and Duncan, 1995; Hillyard et al., 1998; Roelfsema et al., 1998; Somers et al., 1999; Kastner and Ungerleider, 2000; Treue, 2001; Silver et al., 2007). However, observers cannot attend to everything in their surroundings at the same time because the visual system has serious limitations in processing capacity (Broadbent, 1958; Neisser, 1967; Schneider and Shiffrin, 1977; Tsotsos, 1990; Verghese and Pelli, 1992). Therefore, attention needs to be divided between many competing features, some of which automatically attract more resources than others (e.g., Treisman, 1969; Eriksen and Eriksen, 1974; Duncan, 1984). Hence, there can be parts of the visual scene that receive focused attention and parts that receive none or fewer attentional resources. The perception of the latter is extremely limited (Rensink et al., 1997; Mack and Rock, 1998; Most et al., 2005; Cohen et al., 2011) and consequently attentional slips sometimes lead to undesirable events such as accidents (Reason, 1990). As attentional competition and capacity limitations can have serious repercussions for everyday life, it is important to investigate their underlying mechanisms.

Visual attention is usually measured by assessing performance outcomes on a task. In a typical experiment, observers are cued to attend a particular object, which leads to faster and more accurate report of its properties as compared to unattended objects (Averbach and Coriell, 1961; Eriksen and Hoffman, 1972; Posner, 1980; Nakayama and Mackeben, 1989). The deployment of attention is, however, considerably variable over time (e.g., Martínez et al., 2001) and it has been a challenge for researchers to measure its deployment throughout a single experimental trial (Bennett and Pratt, 2001; Tse et al., 2003). To successfully relate small and short-term changes in attention to behavior, we need to be able to measure its dynamics on-line. Here we present a novel pupillometric method that serves as a tool to measure attention over time and to predict behavioral performance on a trial-by-trial basis.

We demonstrate that attention enhances not only performance on a task, but also pupil responses. We employ a method similar to steady-state visual evoked potentials (SSVEP) used in MEG/EEG studies (Regan, 1989; Morgan et al., 1996; Müller et al., 2003; Störmer et al., 2013). However, rather than using electrophysiological signals to track the dynamics of attention, we use frequency tagged pupillary responses. Specifically, we induce pupil oscillations by modulating luminance levels of target objects and distractor objects at different frequencies, and show that the amplitude of these pupil oscillations track focal attention allocated to a specific flickering object. We have termed this novel attentional tracking method Pupil Frequency Tagging (PFT) and demonstrate its application and potential in three experiments.

Experiment 1

The PFT method requires repetitive oscillations in the brightness of stimuli (dark-light-dark-light…), in combination with continuous measures of pupil diameter using an eye-tracker. If a stimulus is relatively brighter than its background, then its appearance will trigger pupil constriction and its disappearance will trigger pupil dilation. Our question was simple: Would the amplitudes of these pupil responses be modulated by attention? Before measuring possible effects of attention, we first determined the highest frequencies where satisfactory pupil responses could be obtained by presenting a full-screen flickering stimulus.

Materials and Methods

Observers

Thirteen students participated in Experiment 1. All participants had normal or corrected-to-normal vision, were naïve to the purpose of the experiment, and gave informed written consent before the experiment. The experiments conformed to the ethical principles of the Declaration of Helsinki and were approved by the local ethics commission of Harvard.

Stimuli and apparatus

To measure the effects of changes in perceived brightness on pupil size, observers viewed a blank screen that flickered at either 0.3, 0.7, 1.0, 1.7, 2.3, or 3.4 Hz (Figure 1A). The monitor screen was 30 by 24 in visual degrees and the fixation point was 0.25° in diameter. The screen, fixation, and backgrounds were either black (1.65 cd/m²), gray (16.46 cd/m²), or white (61.10 cd/m²).

FIGURE 1

Figure 1. Pupillary responses to a range of screen flicker rates. (A) Observers viewed full monitor screens that flickered at a particular frequency rate (0.3, 0.7, 1.0, 1.7, 2.3, or 3.4 Hz) while their pupil size was recorded with a camera. (B) Examples of pupil size of a selected observer as a function of time in six separate trials with distinct flicker frequencies. The solid and dashed vertical lines indicate the onsets of white and black screens, respectively. (C) Average spectrum of FFT power per flicker frequency across all observers.

Stimuli were presented on a 21″ CRT screen at a fixed viewing distance of 70cm. Observers' heads were supported by a chin- and forehead-rest. The resolution and refresh rate of the screen was 1600 × 1200 pixels and 85 Hz. Observer's pupil size of one eye was tracked with an infrared sensitive camera at a rate of 1000 Hz.

Procedure

Observers viewed a full screen that alternated between black and white at a specific frequency while their pupil size was recorded. Observers were instructed to fixate at the center dot but pay close attention to the flicker rates. A different screen alternation frequency was randomly selected per trial (2 trials per frequency). Observers could take breaks between trials and start each trial by pressing a button. The experiment consisted of 12 trials of 10 s each.

Analysis

The strength of pupil oscillations was analyzed by conducting a Fast Fourier Transform (FFT) that produces a power spectrum across frequencies. The EyeLink pupil tracking system outputs pupil size in arbitrary units that depend on variable factors such as the camera's pupil detection parameters and the observer's viewing distance to the screen. Nonetheless, we could roughly estimate that a pupil size unit of 100 corresponded to a pupil diameter of approximately 6 mm and a unit of 40 to 3 mm (see Figure 1B). Pupil size and gaze location was interpolated with a cubic spline fit during blinks. Pupil size recorded in the first second of each trial was removed from analysis to control for confounding effects on pupil size due to transient onset responses and because observers needed some time to become oriented after trial onset.

Results and Discussion

In Experiment 1, observers viewed a full-screen flickering stimulus, where the flicker frequency varied across trials (0.3, 0.7, 1.0, 1.7, 2.3, and 3.4 Hz; Figure 1A). As shown by the continuous changes in pupil size synchronous to the flicker rate of the stimulus in Figure 1B, most flicker frequencies induced consistent pupillary oscillations. Next, we determined whether a FFT frequency spectrum analysis on the pupil oscillations accurately which frequency was presented on each trial. As shown in Figure 1C, the power magnitudes in the FFT frequency spectrum were selectively enhanced for the presented flicker frequencies. The power of each present frequency was significantly larger than the power of absent frequencies across observers [t₍₁₂₎ >= 2.73, p <= 0.018; for all statistical comparisons, see Table A1]. The peak in power of the highest flicker frequency (3.4 Hz) was also discernibly higher than other frequencies on most trials, except for 3 out of 13 observers whose pupillary responses were too noisy to get reliable magnitudes at that frequency. Hence, we conclude that flicker frequencies up to 2.3 Hz induce consistent, measurable pupillary oscillations in all observers. In the following experiment, we use this frequency to investigate whether we can measure attentional effects on pupil responses at a relatively high temporal resolution.

Experiment 2

Having established that an FFT spectrum analysis of pupil oscillations accurately indicates visual flicker frequencies up to ~2.5 Hz, we investigated whether attention modulates oscillations amplitudes. To do so, we presented four separate stimuli with distinct locations and flicker frequencies to observers while recording pupil responses as a function of attended location. The idea was that each flicker frequency left its own oscillatory trace in the pupil and that the strength of this oscillation can be measured by determining the peak power in the FFT spectrum analysis.