Attention and Visuospatial Working Memory Share the Same Processing Resources

Feng, Jing; Pratt, Jay; Spence, Ian

doi:10.3389/fpsyg.2012.00103

ORIGINAL RESEARCH article

Front. Psychol., 18 April 2012

Sec. Cognition

Volume 3 - 2012 | https://doi.org/10.3389/fpsyg.2012.00103

Attention and visuospatial working memory share the same processing resources

Jing Feng¹*

Jay Pratt²

Ian Spence²

¹ Rotman Research Institute, Baycrest Centre for Geriatric Care, Toronto, ON, Canada
² Department of Psychology, University of Toronto, Toronto, ON, Canada

Attention and visuospatial working memory (VWM) share very similar characteristics; both have the same upper bound of about four items in capacity and they recruit overlapping brain regions. We examined whether both attention and VWM share the same processing resources using a novel dual-task costs approach based on a load-varying dual-task technique. With sufficiently large loads on attention and VWM, considerable interference between the two processes was observed. A further load increase on either process produced reciprocal increases in interference on both processes, indicating that attention and VWM share common resources. More critically, comparison among four experiments on the reciprocal interference effects, as measured by the dual-task costs, demonstrates no significant contribution from additional processing other than the shared processes. These results support the notion that attention and VWM share the same processing resources.

Introduction

According to Baddeley’s model (Baddeley and Hitch, 1974; Baddeley, 1986, 1992, 2000, 2003; Baddeley and Della Sala, 1996), working memory consists of a central executive and several distinct slave storage systems, including the phonological loop, the visuospatial sketchpad, and the episodic buffer. The central executive oversees working memory; it manages many critical processes, including the direction of attention to relevant information and the suppression of irrelevant information, the supervision of information integration, and the coordination of the slave storage systems. This central executive is thought to depend heavily on the function of selective attention (Engle, 2002) and there is considerable evidence to support this view. For example, attention is important for the process of binding features into a perceptual object representation (Treisman and Gelade, 1980), for maintenance of feature binding and feature values (Wheeler and Treisman, 2002; Fougnie and Marois, 2009; Brown and Brockmole, 2010) in visuospatial working memory (VWM), and for successful change detection (Makovski et al., 2006). Attention also supports the maintenance of these bound representations (Wheeler and Treisman, 2002), and it assists the transfer of perceptual information into working memory (Averbach and Coriell, 1961; Hollingworth and Henderson, 2002; Schmidt et al., 2002).

Attention also plays an important role in the visuospatial sketchpad. Information processing in this system is thought to involve three major processes: encoding, maintenance, and retrieval (e.g., Jonides et al., 2008), with the encoding process modulated by attention which performs the selection of pertinent perceptual items. In fact, attention may have direct and immediate access to the process of encoding (Trick and Pylyshyn, 1994; McElree, 1998; Cowan, 2000), and when new items enter the focus of attention, they displace other items (McElree, 1998; Nairne, 2002). The processing capacities in attention and working memory encoding are managed by common underlying neural mechanisms (Fusser et al., 2011). The maintenance process keeps item representations alive and provides protection against interfering irrelevant stimuli or intruding thoughts (e.g., Funahashi et al., 1989; Pasternak and Greenlee, 2005; Postle, 2006; Ranganath, 2006). Finally, the retrieval process returns items to the focus by switching attention to them (McElree, 2006; Jonides et al., 2008). Thus, attention plays a significant role in the storage of visuospatial information and not just in the various processes of the central executive.

While several stages of processing in working memory where attention plays an important role have been identified, it is still not clear what role – if any – attention plays in determining VWM capacity. Information processing bottlenecks have been observed in both VWM and attentional tasks, and the processing capacities are remarkably similar. In VWM, up to about four items can be stored and manipulated at one time (Luck and Vogel, 1997; Cowan, 2000; Rouder et al., 2008). This “magical number four” has also been cited in attentional tasks. For instance, participants can enumerate up to about four targets in a parallel fashion (Trick and Pylyshyn, 1993, 1994), and they can track about four targets simultaneously (e.g., Pylyshyn and Storm, 1988; Sears and Pylyshyn, 2000; Cavanagh and Alvarez, 2005). However, counting items does not capture all of the various aspects of capacity, and processing capacities are probably more appropriately characterized by considering the information load in addition to the number of items. Indeed, this more nuanced conception of capacity has been proposed for both VWM (Alvarez and Cavanagh, 2004) and attention (Davis et al., 2001).

Since the processing of visuospatial information in working memory seems to depend heavily on attention-based processes, the similarity in the capacities of spatial attention and VWM may be a direct consequence of a limited attentional resource which, in turn, constrains the capacity of VWM (Cowan, 1995, 2000; Awh and Jonides, 2001; Tuholski et al., 2001; Engle, 2002). In Cowan’s (1988, 1995, 2000, 2005a,b) model of VWM capacity, processing of information from long term memory is subject to a limited attentional resource. The constraint on processing ability is observed in working memory tasks and defined as VWM capacity. This attention-based model is supported by evidence from a variety of sources. For example, attention and VWM capacities are highly correlated over individuals (Tuholski et al., 2001; Bleckley et al., 2003). In addition, at the neural level, overlapping brain regions are recruited during attentional and VWM processes (Awh and Jonides, 1998; Mayer et al., 2007), including these regions thought to mediate processing capacities (Xu and Chun, 2006; Fusser et al., 2011). Furthermore, working memory capacity and the ability to control attention have been linked to the same gene (Feng et al., 2005; Söderqvist et al., 2010). These findings, however, along with most other supporting evidence that is correlational, do not provide direct support. While these data indicate that attention and VWM processes involve overlapping underlying mechanisms, they do not demonstrate that the two processes are constrained by access to the same processing resources.

The lack of a definitive demonstration has led various researchers to propose multiple stage interaction models (e.g., Wheeler and Treisman, 2002; Fougnie and Marois, 2006). These models suggest that although the capacity in attention and VWM are largely constrained by shared mechanisms, there is significant contribution from components other than the shared ones (Wheeler and Treisman, 2002; Delvenne and Bruyer, 2004; Fougnie and Marois, 2006). A good example was provided by Fougnie and Marois (2006), who combined a primary VWM task with either an attentional task or a secondary VWM task to form a dual-task. They reasoned that if a limited attentional resource is the only constraint on the capacity of VWM, then both the attentional task and the secondary VWM task, when equally loaded, should produce the same amount of interference on performance in the primary VWM task. However, increasing the load on the attentional task did not produce as much interference as did the secondary VWM task. Fougnie and Marois interpreted this finding as evidence against the shared resource hypothesis.

The dual-task paradigm, such as the one used by Fougnie and Marois (2006), is well-suited for examining the nature of VWM capacity. It is known that attention and VWM interfere with each other, as indicated by slowed responses and increased errors (e.g., Woodman and Luck, 2004). If the loads on the attentional process and the secondary VWM process were equalized, and if each produced comparable interference effects on the primary VWM task, this would suggest that attention and VWM were both constrained by the same processing resources. If the interference effects were not comparable, this would suggest the significant involvement of other processes.

There is, however, a major difficulty with a dual-task approach. Previous studies (e.g., Awh et al., 1998; Oh and Kim, 2004; Woodman and Luck, 2004; Fougnie and Marois, 2006) have focused on comparing task performances which depend critically on the specific tasks and stimuli used. Appropriate load-matching is essential to achieve a fair comparison. Since capacities in both attention and VWM are a function of both the information load and the number of items (Davis et al., 2001; Alvarez and Cavanagh, 2004), non-equivalent interference can occur if the secondary tasks are not equally matched, and simple counting of items is not sufficient to establish comparability. Given the importance of being able to make fair comparisons in the dual-task paradigm (Cowan and Morey, 2007), an accurate match between the secondary attentional and VWM tasks, in terms of the information load, is essential. Achieving such a match is neither simple nor straightforward. Fougnie and Marois (2006) have tackled the problem by equating the number of items and task accuracies on both the attention and working memory tasks. Our approach also uses a dual-task paradigm, but does not rely on load-matching and consequently avoids the potential pitfalls of equating loads.

In our experiments, instead of comparing performance over combinations of tasks and stimuli, we concentrated on the dual-task cost. The dual-task cost is the performance difference between the single-task condition and the dual-task condition. We focused on the pattern of change in dual-task costs, avoiding the tricky problem of matching loads across the primary and secondary tasks. We examined how this cost changes when an attentional task changes from easy (low load) to difficult (high load) and when a working memory task changes from easy (low load) to difficult (high load). The logic of this approach is described below.

A Shared Processing Model

We view capacity as the brain’s ability to manage information. The probability of error when managing increasing amounts of information in either working memory or attention does not increase suddenly at some fixed limit. After a slow initial increase, the error function rises quickly, but not abruptly. Similarly, at very high levels the function decelerates as it approaches asymptote (Bachelder, 2000, p. 116, Figure 1; Vetter et al., 2008, p. 4, Figure 3). When more than one working memory or attentional load is processed at the same time, the notion of shared capacity means that processing or information management of the two loads depends on shared resources in the brain.

FIGURE 1

Figure 1. Illustration of the model: (A,B) show the effects of the logit transformation; the transformed differences in probabilities are now proportional to equal differences in loads, (C,D) show the various quantities that are used in the model and how they relate to each other.

Consistent with our belief that the same processing resource supports both working memory and attention, we assume that there is no essential difference between processing a working memory load (m_i) and an attentional load (a_j). Let the probability of error on a visual working memory task with load m_i be p_i = P(m_i). The analogous probability for an attentional task with load a_j is p_j = P(a_j). The functions for working memory and attentional loads are both approximately ogival and may be linearized (Figures 1A,B) by a logit transformation (Berkson, 1944; Finney, 1947):

y_{i} = logit (p_{i}) = log (\frac{p_{i}}{1 - p_{i}}) = log (p_{i}) - log (1 - p_{i}) .

In practice, the linearization may not be strictly necessary, since low working memory or attentional loads yield a point on the error function close to the lower bend. Similarly, combinations of high loads are not likely to be far from the upper bend, unless the loads are so high as to make the task impossible for the participant. Between the bends, the function is close to linear. Nonetheless, for completeness, and to make differences in error rates comparable, we assume a logit-linearized error function. After the linearizing transformation, the error probability associated with a load may be considered to be proportional to the magnitude of the load and is thus an indirect measure of the magnitude of the load. This reasoning is similar to the argument used by Fougnie and Marois(2006, p. 529) when they used error rates to establish the equivalence of loads. Since we assume that working memory and attentional loads are processed by the same resource, the term “load” can refer to either function; this, however, does not assume that loads for the two different functions are necessarily equivalent. To avoid repetition in the development below, we omit the qualification “transformed” when we refer to the various probabilities of error.

The dual-task cost, z_i|j, of adding a load, i, while a participant is simultaneously processing another load, j, is:

z_{i | j} = y_{i | j} - y_{i},

where y_i|j is the probability of error for load i on the dual-task while load j is processed simultaneously and y_i is the probability of error on the single-task with load i (Figures 1C,D). In the dual-task, load j consumes some of the shared processing resource and, consequently, load i has access to a smaller resource than was available for the single-task. In addition, some overhead is likely charged to the processing resource for managing two loads simultaneously, thus increasing the probability of error for load i in the dual-task.

Since we have assumed that error probabilities are proportional to the magnitudes of loads, we make the further assumption that the overall dual-task probability of error, y_ij, for loads i and j, is the sum of the single-task probabilities, y_i and y_j, plus a possible processing overhead, w_ij, that is also charged to the shared resource; thus y_ij = y_i + y_j + w_ij (Figures 1C,D). The overall error probability, y_ij, is not directly measurable, but y_i|j, the probability of error for load i on the dual-task is observable. Since we have assumed that probabilities are proportional to loads, and that – like the loads – they may be added and subtracted, y_i|j = y_ij − y_i. Hence y_i|j = y_i + w_ij and thus z_i|j = w_ij. Hence the dual-task costs are identical to the processing overheads, w_ij. If z_i|j − z_{i|j ′} − z_i′|j + z_{i′|j ′} = 0, for all i, j, i′, and j ′, the dual-task costs will be additive.

Although an infinite number of configurations of the w_ij (and hence the z_i|j) may produce additivity, parsimony suggests that the processing overheads are either constant or related to the magnitudes of the loads. Other than in the trivial case where all w_ij = 0, the w_ij will be additive if (a) w_ij = c, or (b) w_ij = c_i + c_j, for all i, j pairs. This implies additivity if (a) the processing overheads, w_ij, are zero or constant, or (b) the processing overheads are constants proportional to the sizes of the individual loads. While there are other ways of defining the magnitudes of the overheads that could also produce additivity of the w_ij, such definitions would be increasingly complicated and hence increasingly unlikely.

If z_i|j − z_{i|j ′} − z_i′|j + z_{i′|j ′} = w_ij − w_ij′ − w_i′j + w_{i′j ′} ≠ 0, both the dual-task costs and the w_ij would be non-additive. Non-additivity is inconsistent with a model that assumes a shared processing resource since it would imply that processing of the loads would differ depending on the particular combinations, i and j, of the two loads. In our view, this is only likely to be true if other brain resources, separate and unique to working memory and attention, were involved. If, however, the dual-task costs are additive, this is consistent with a model which assumes that working memory and attentional loads are managed by the same shared processing resource and, furthermore, that some of the same shared resource is allocated to a processing overhead associated with the two loads.

Testing the Model

We further illustrate the shared processing model in Figure 2. In our attention–VWM dual-task paradigm, the response variable of interest is the dual-task cost in either attentional performance (Figure 2A) at low and high attentional loads with low or high VWM loads, or in VWM performance (Figure 2B) at low and high VWM loads with low or high attentional loads. In each panel of Figure 2, two lines connect the dual-task costs in each pair of conditions. If the lines are parallel for the attentional performance cost, increasing the VWM load produces the same effects on the attentional process regardless of the existing attentional load level. If the same pattern is observed for both the attentional and VWM costs (i.e., parallel slopes as in both panels of Figure 2). This suggests that the reciprocal interference effects on the two processes are additive, and thus supports the hypothesis that attention and working memory share the same processing resources. However, if the slopes of the lines in each panel of Figure 2 were to differ, this would imply the involvement of other processes other than shared processes that constrain attentional and VWM capacities.

FIGURE 2

Figure 2. (A) Hypothetical dual-task costs (percentage error) on the attentional task under the 2 × 2 (high/low attentional load × high/low VWM load) conditions if attention is the sole significant limit on VWM capacity. (B) Dual-task costs (percentage error) on the VWM task under the 2 × 2 (high/low attentional load × high/low VWM load) conditions if attention is the sole significant limit on VWM capacity.

Four experiments are required to implement our dual-task costs approach to examining the question of whether attention and VWM share the same processing resources. Each experiment corresponds to one of the four conditions created by combining attentional loads at both low and high levels with VWM loads at low and high levels. First, the baseline dual-task costs incurred are established when both attention and VWM are subject to low loads (Experiment 1). Then the load is increased on either VWM (Experiment 2) or attention (Experiment 3) or both (Experiment 4). In each of the four experiments, we expect some level of interference. If attention and VWM involve common processes – no matter whether partially or completely – we should observe interference between the attentional and VWM processes when both are executed at the same time. Furthermore, the interference should intensify when the load on either process increases. But the critical comparison is the pattern of dual-task costs across the four experiments.

To establish whether attention and VWM completely share the same processing resources, it is necessary to examine how increasing the load on either process contributes to the change in the dual-task cost. If attention and VWM share the same resources, there should be no interaction between the attentional load change and the VWM load change, as illustrated in Figure 2. If, however, the capacities in attention and VWM only partially share common mechanisms, increasing the load on one process should produce differential effects on the other process depending on the existing load on the other process. In statistical terminology, there will be an interaction between attentional load change and VWM load change. In graphs of the changes in the dual-task costs, the cost lines will not be parallel, indicating non-additivity. This may be summarized in the following sub-hypotheses:

1. There will be interference between the attentional and VWM processes;

2. The interference will be reciprocal;

2.1 When the load on either process increases, the reciprocal interference will increase;

2.2 The greatest interference will occur when both processes are subjected to high loads;

2.3 If attention and VWM are supported by completely overlapping mechanisms, the dual-task costs for attention, and VWM will be additive.

If all hypotheses are supported, an additive model would be appropriate and would suggest that attention and VWM share the same processing resources. If, however, 2.1 and 2.2 are supported, but not 2.3, this would suggest that the processing resources are only partially overlapping.

Experiment 1

We examined the interference effects produced when a low-load VWM task (change detection) and a low-load attentional task (enumeration) are performed concurrently (low attentional load + low VWM load). We chose enumeration as the attentional task since: (1) it demonstrates a clear attentional capacity (Trick and Pylyshyn, 1993, 1994); and (2) both the subitizing (within capacity) and counting (beyond capacity) processes rely on attention (Vetter et al., 2008). We chose a change detection task for the VWM task because it is a well established paradigm for investigating working memory capacity (e.g., Luck and Vogel, 1997; Johnson et al., 2008).

Materials and Methods

Participants

Four males and 12 females (aged from 18 to 35) at University of Toronto participated for course credit.

Stimuli

In the attentional task, each trial began with a fixation cross (0.24° × 0.24°) in the center of the screen for 200 ms. A number (one to six) of black squares (each subtending 0.84° × 0.84° with a minimum distance of 0.36° between adjacent pairs) appeared within an invisible square area (light gray, 5.96° × 5.96°) centered on the screen for 50 ms. Participants enumerated the squares and respond by pressing the appropriate number key on the 3 × 3 number keypad. Speed and accuracy were emphasized equally.

A change detection paradigm was used for the VWM task. On each trial, a fixation cross (0.24° × 0.24°) appeared in the center of the screen for 200 ms. A number (one to six) of colored squares (each subtending 0.84° × 0.84° with a minimum distance of 0.36° between adjacent pairs) then appeared at random locations within an invisible square area (light gray, 7.16° × 7.16°) centered on the screen for 50 ms. The colors of the squares were randomly selected from a pool (blue, green, red, violet, white, yellow) with no more than two squares sharing the same color. Participants had to remember the color of each square in the memory array. After a retention interval of 2500 ms, the test array appeared. The two arrays were identical except that on half the trials, one randomly chosen square changed its color subject to the restriction that no more than two squares in the test array could share the same color. Participants reported a color change by pressing the “1” key (change) or the “2” key (no change). Participants pressed the appropriate key on the 3 × 3 number keypad. Speed and accuracy were emphasized equally.

In the dual-task, enumeration (the attentional task) was performed during the retention interval of the change detection task (Figure 3A). Participants enumerated the number of black squares while remembering the memory array of the VWM task. After responding in the attentional task, participants viewed the test array in the VWM task and indicated whether a color change had taken place.

FIGURE 3

Figure 3. The display sequence for a dual-task trial combines two tasks: (A) in Experiment 1, a change detection task and an enumeration task with no distractors; (B) in Experiment 2, a location recall task and an enumeration task with no distractors; (C) in Experiment 3 a change detection task and an enumeration task with distractors; (D) in Experiment 4, a location recall task and an enumeration task with distractors.

Design and procedure

Both the attentional and VWM single tasks used 96 trials evenly distributed across set sizes varying from one to six, with 16 trials for each set size condition in a single-task. In half the VWM trials, one square changed its color; otherwise the memory and test arrays were identical. There were 216 trials in the dual-task with three trials for each treatment combination (enumeration set size × VWM set size × change/non-change in VWM). Participants maintained fixation on the center of the screen during each trial and they were instructed to devote equal effort to both tasks.

Results

The percentage of errors and log transformed RT in both the attentional and VWM tasks were analyzed in a 6 × 2 (set size × condition) ANOVA. Untransformed percentage of errors were used given the following analyses do not depend on the assumption that requires logit transformation. Performance on enumeration in both the single-task and dual-task conditions clearly revealed (Figure 3A) a subitizing processing (four items or fewer) and a counting process (more than four items; e.g., Trick and Pylyshyn, 1993, 1994): set size effect, F(5, 75) = 16.01, p < 0.01. Participants made more errors when VWM was loaded (upper left panels in Figure 5A): condition effect, F(1, 15) = 6.23, p < 0.05. In the VWM task, performance also depended on the set size in both the single-task and dual-task conditions, revealing a capacity limit pattern, such as that noted by Cowan (2000): set size effect, F(5, 75) = 43.21, p < 0.01. Participants made more errors in the dual-task condition (upper left panel in Figure 5C): condition effect, F(1, 15) = 59.61, p < 0.01. In addition, the capacity breakpoint shifted toward smaller set sizes in the dual-task condition (Figure 4A): set size × condition effect, F(5, 75) = 5.87, p < 0.01. There was no difference in RT in the VWM task (upper left panel in Figure 5D).

FIGURE 4

Figure 4. Error rates (upper) and reaction times (RT; lower) for attentional tasks in: (A) experiment 1 (low attentional load); (B) experiment 2 (low attentional load); (C) experiment 3 (high attentional load); (D) Experiment 4 (high attentional load). “S” indicates the enumeration data for single-task trials and “D” indicates the enumeration data for dual-task trials. The error bars represent ± 1 SE computed using Loftus and Masson’s (1994) method for within-subject designs.

FIGURE 5

Figure 5. Error rates (upper) and reaction times (RT; lower) for: (A) the VWM task (change detection) in Experiment 1; (B) the VWM task (location recall) in Experiment 2; (C) the VWM task (change detection) in Experiment 3; (D) the VWM task (location recall) in Experiment 4. “S” indicates the VWM single-task trials and “D” indicates the VWM performance in the dual-task trials. The error bars represent ± 1 SE computed using Loftus and Masson’s (1994) method for within-subject designs.

Discussion

The results replicated typical performance patterns in both enumeration (Trick and Pylyshyn, 1993, 1994) and change detection (Luck and Vogel, 1997). Even though the loads for both attention and VWM were low, they were sufficient to produce reciprocal interference in the dual-task condition (Brown, 1997). Participants made more errors in the enumeration task (upper panel in Figure 4B and upper left panel in Figure 6A) and their VWM accuracies were reduced (upper panel in Figure 5A and upper left panel in Figure 6C). These results demonstrate interference between the attentional and VWM processes and are similar to those reported in previous studies (Oh and Kim, 2004; Woodman and Luck, 2004; Fougnie and Marois, 2006).

FIGURE 6

Figure 6. (A) Error rates for the attentional tasks in four experiments (Experiment 1-upper left, Experiment 2-upper right, Experiment 3-lower left, Experiment 4-lower right). “Single” indicates the enumeration data for single-task trials and “Dual” indicates the enumeration data for dual-task trials. (B) Reaction Time (RT) for the attentional tasks in four experiments. (C) Percentage Error for the VWM task in four experiments. (D) Reaction Time (RT) for the VWM task in four experiments. The error bars represent ± 1 SE computed using Loftus and Masson’s (1994) method for within-subject designs.

Experiment 2

The load on attention remained the same as in Experiment 1, while the load on VWM was increased by using a location recall task (low attentional load + high VWM load). The location recall task differs from the change detection task by requiring the participant to recall and report the location of one of the memorized items.