Shape and spatial working memory capacities are mostly independent

Sanada, Motoyuki; Ikeda, Koki; Hasegawa, Toshikazu

doi:10.3389/fpsyg.2015.00581

ORIGINAL RESEARCH article

Front. Psychol., 20 May 2015

Sec. Cognitive Science

Volume 6 - 2015 | https://doi.org/10.3389/fpsyg.2015.00581

This article is part of the Research TopicDynamic control of representations in visual working memoryView all 10 articles

Shape and spatial working memory capacities are mostly independent

Motoyuki Sanada^1,2*†

Koki Ikeda^2,3†

Toshikazu Hasegawa¹

¹Department of Cognitive and Behavioral Sciences, Graduate School of Arts and Science, The University of Tokyo, Tokyo, Japan
²Japan Society for the Promotion of Science, Tokyo, Japan
³Department of Psychology, Chukyo University, Nagoya, Japan

Whether visual working memory (WM) consists of a common storage resource or of multiple subsystems has been a controversial issue. Logie (1995) suggested that it can be divided into visual (for color, shape, objects, etc.) and spatial WM (for location). However, a recent study reported evidence against this hypothesis. Using a dual task paradigm, Wood (2011) showed interference between shape and spatial WM capacities, suggesting that they share a common resource limitation. We re-examined this finding controlling possible confounding factors, including the way to present spatial location cues, task order, and type of WM load to be manipulated. The same pattern of results was successfully reproduced, but only in a highly powered experiment (N = 90), and therefore the size of interference was estimated to be quite small (d = 0.24). Thus, these data offer a way to reconcile seemingly contradicting previous findings. On the one hand, some part of the storage system is genuinely shared by shape and spatial WM systems, confirming the report of Wood (2011). On the other hand, the amount of the overlap is only minimal, and therefore the two systems should be regarded as mostly independent from each other, supporting the classical visuo-spatial separation hypothesis.

Introduction

It is well accepted that working memory (WM) is separated into two systems, namely, phonological (the phonological loop) and visual information storages (the visuo-spatial sketchpad; Baddeley and Hitch, 1974). More controversial is the next assumption that the latter can be further divided into two substructures for visual (colors, shapes, object, etc.) and spatial (location) information processing (Logie, 1995), which are usually referred as visual and spatial WM, respectively.

Evidence for this visuo-spatial separation hypothesis has been mixed so far (Luck, 2008). On the one hand, dual task experiments have provided some supporting data. In a typical dual task paradigm, a cognitive task is inserted during the retention interval of a WM task, thus participants have to perform the task and WM maintenance simultaneously. Studies found that the interference between the two tasks became significantly larger when they were related to the same domain (e.g., visual task and visual WM) than when they were related to different domains (Logie and Marchetti, 1991; Tresch et al., 1993; Woodman et al., 2001; Woodman and Luck, 2004). For example, Tresch et al. (1993) found that spatial WM performance was selectively disrupted by a movement discrimination task (i.e., a spatial cognitive task) but not by a color discrimination task (i.e., a feature-based visual cognitive task), whereas the opposite pattern of results was obtained for a shape WM task.

Other studies have suggested that a more nuanced argument might be required for this issue. For example, Wheeler and Treisman (2002) hypothesized that keeping spatial information may not be necessary for maintaining simple features (e.g., “blue” or “triangle”) but critical for conjunctive objects (e.g., “blue triangle”; see, however, Zhang et al., 2012). Furthermore, using a change detection task in which the test display was either exactly the same as the memorized one or differed from it in one item, Jiang et al. (2000) reported that performances of simple feature WM tasks (i.e., color or shape) were impaired when item locations were changed between to-be-remembered and to-be-matched stimuli, even though the spatial information was totally task-irrelevant. Although these studies did not directly test the visuo-spatial WM separation hypothesis, they suggested that the proposed dichotomy might be rather simplistic and further investigations were required.

Wood (2011) tackled this problem by investigating the dual task paradigm again, but now in a thoroughly systematic way. In addition, he investigated the consequences of directly combining two WM tasks. This was a notable attempt, since the majority of the previous studies that had utilized the dual task paradigm had only focused on the interference between a WM and a cognitive task (e.g., a movement discrimination task; Tresch et al., 1993), which does not examine the interferences between two WM tasks and therefore might not be a direct test of the visuo-spatial separation hypothesis. In contrast, Wood (2011) combined spatial WM tasks with various types of visual WM tasks with the following basic design. White dots appeared on a computer screen at the beginning of each trial, and participants were requested to remember their locations. Next, items with simple color, shape, or conjunctive features were presented, and participants had to remember their identities, too. After a brief blank, a test array was presented, which could be matched with either the spatial WM locations or the visual WM items that were remembered previously. In most of the experiments, the so-called “single probe” task was adopted to test visual WM, where only one to-be-matched item is presented on the test display. The number of to-be-remembered locations and items were manipulated to examine whether and when interference occurred between the two domains.

Despite his effort for an inclusive examination, the results of Wood (2011) only added further complications to the issue. In Experiment 2, he found that increasing the number of spatial cues did not disrupt the performance of the color, but did interfere with the shape and object (color–shape conjunction) tasks. No previous theories and studies are fully consistent with these new data. Firstly, these data clearly contradict the traditional hypothesis of visuo-spatial WM separation, which would have predicted no interference between visual (including shape) and spatial WM (Tresch et al., 1993; cf. Woodman et al., 2001). Secondly, based on the theory of Wheeler and Treisman (2002), overloading spatial WM would have been predicted to deplete the capacity for spatial information maintenance, and therefore interfere with object but not simple feature (e.g., shape) WM. On the other hand, the data of Jiang et al. (2000) would have suggested that spatial WM load would generally affect the spatial information maintenance and therefore interact with both shape and color WM.

What are the causes of these discrepancies? The first possibility is the differences in task designs, especially between the change detection and single probe tasks. Wood (2011) reported that the interference between color (i.e., a feature) and spatial WM was found only in the change detection, but not in the single probe task. He therefore speculated that in the change detection task, but not in the single probe task, spatial configural information is employed to retain not only spatial, but also visual WM including simple features. In accordance with this hypothesis, Jiang et al. (2000) utilized the change detection paradigm and observed impairments even in feature WM performance (i.e., color and shape) when the locations of items were changed during a trial, lending some credibility to the argument. This is, however, not sufficient to account for the interference between shape and spatial WM found in Wood (2011), because the interference was detected not only in the change detection, but also in the single probe task.

In order to reconcile the shape-spatial WM interference reported by Wood (2011) with the previous literature, the current study tried to replicate this finding while controlling some possibly confounding factors observed in the original study. Our hypothesis was that these factors might have caused the discrepancy, and therefore a clear conclusion could be obtained if they were fully controlled. The first factor was related to a specific methodological detail Wood adopted, which has already been discussed in some previous studies (Woodman and Luck, 2004; Lecerf and De Ribaupierre, 2005). That is, since multiple white dots appeared simultaneously on the computer screen in Wood (2011), the participants might have encoded them as a shape formed by these white dots rather than separate spatial locations. If this was the case, the observed shape-spatial interference could be interpreted as having occurred between two shape WM tasks. We examined this possibility in Experiments 1a,b,c. Next, we also examined the effect of task order (Experiment 2) and types of WM load to be manipulated (Experiment 3). These factors were not, or only minimally manipulated in the original study. To foreshadow the results, none of the controls altered the results. Moreover, no statistically significant evidence of between-domain interference was found in any of these experiments, seemingly disconfirming the observations of Wood (2011). Importantly, however, we found a very small, but consistent trend of interference in all experiments regardless of the different settings, suggesting the obtained null results were simply due to under-powered designs. Therefore, we conducted an omnibus test including four of these experiments and found a small, but statistically significant effect. We also conducted another replication following the design of Wood (2011) more precisely (Experiment 4), in which we collected data from 90 participants to sufficiently increase statistical power. A significant effect of interference was observed again, but its effect size remained to be quite small. We concluded that, although there was an overlap between spatial and shape WM processing, the size of this effect was small, and therefore the two systems should be regarded as mostly independent from each other.

Experiment 1

The purpose of Experiments 1a,b,c was to test if the results of Experiment 2 in Wood (2011) were due to the specific methods that the study adopted for the spatial WM cue presentation or data analysis. Sequential cue presentation was used in Experiments 1a,b, and simultaneous cue presentation in Experiment 1c. We examined the interaction between task type and load manipulation as a measure of interference in all three experiments.

Experiment 1a

Methods

Participants

Thirty volunteers (male: 15; female: 15; mean age: 19.93 years, SD: 2.00 years) participated in the experiment. They provided informed consent before commencing the experiment and were compensated monetarily.

Stimuli and procedure

All stimuli were presented on a black screen of a 17 inch CRT monitor, and E-prime 2.0 (Psychology Software Tools, Inc., Sharpsburg, PA, USA) was used to program the experiment. The viewing distance was about 60 cm.

At the beginning of a trial, two alphabet letters (white, bold, and 45 point Courier New font) were randomly selected and presented for 1,000 ms at the center of the screen. Participants had to pronounce these letters repeatedly for articulately suppression until they responded to the test array (see below), in order to prevent the spatial and shape stimuli from being verbalized. To confirm if participants correctly followed this instruction, we recorded their voices all through the experiment by a voice recorder. Participants were informed about this recording procedure beforehand. After the letter presentation, the word “Ready” (white, bold, and 45 point Courier New font) appeared for 500 ms at the center of the screen, followed by a 500 ms blank and then a spatial memory array.

Unlike Wood (2011), the spatial memory array was presented in a sequence. A 5 × 5 grid (width 17.6° × length 14.7°) with white borders appeared for 400 ms at the center of the screen, and consecutively white dots (2.2° × 2.2°) were randomly presented one by one, each for 300 ms, in one of the cells in the grid. The white dots in one trial never appeared in the same cell. There was no interval between dot presentations, thus the entire presentation time changed according to the set size of the memory array; they were 300, 900, and 1,500 ms for the set size 1, 3, and 5, respectively. Participants had to remember all locations, but not the order of presentation. This spatial memory task was followed by a 800 ms blank and the shape memory array (Figure 1A).

FIGURE 1

FIGURE 1. Schematic diagrams of the experimental procedures used in Experiments 1a (A), 1b (B), and 1c (C). In all experiments, a trial consisted of the instruction of articulatory suppression, spatial working memory (WM) cues, first blank, shape WM array, second blank, and the test probe. Spatial WM was tested in the half of trials, and shape WM in the rest of trials. The load size of spatial WM randomly changed from 1, 3, to 5, but that of the shape WM was fixed to 4. The only difference between Experiments 1a,b was the way to present spatial WM cues. The total time to present spatial WM cues changed in accordance with the number of the load in Experiment 1a, whereas fixed to 1,500 ms in Experiment 1b. (D) Shows the shape stimuli which were used for the shape WM task in Experiments 1a – c, 2, and 4.

The shape memory array consisted of four white shapes randomly selected from seven distinguishable items (star, square, pentagon, triangle, diamond, spiral, and cross, see Figure 1D), all of which had a size of 3.2° × 3.2°. They remained on the screen for 500 ms. The locations were fixed on the corners of a width 10.1° × length 6.1° rectangle appearing on the center of the screen. The participants had to remember the shapes but not their locations. After the shape memory array, the word “Test” (white color, bold, 45 point Courier New font) appeared for 1,000 ms, and was followed by the memory test.

Two different versions of WM test were used; that is, one for the spatial and another for shape memory, each occurring with a probability of 50%. Note that participants had to retain both spatial and shape information in all trials, since the selection of test type was totally random. For testing spatial WM, a white dot appeared in one of the 5 × 5 grid cells. In half of trials (i.e., 25% of all trials), the dot appeared at one of the locations where the to-be-memorized items had been presented previously (the same condition), and in one of the remaining locations in the rest of trials (the different condition). For assessing shape WM, a shape was selected from the aforementioned seven shapes (see Figure 1C) and presented at the center of the screen. It matched with one of the to-be-memorized shape items in half of the trials (the same condition) but not in the rest (the different condition). In both versions, participants had to answer whether the test item matched with one of the items retained in memory, by pressing the “f” or “j” key on the keyboard (the key-response correspondence was counterbalanced across participants). The test item remained on the screen until the response. A 300 ms blank was inserted as inter-trial interval before the next trial started.

The experiment comprised 30 blocks, each of which contained 12 trials, thus the total number of trials was 360. At the end of each block, accuracy rates of spatial and shape memory test were presented on the monitor. Participants conducted two practice blocks with trial-by-trial accuracy feedback before starting the experiment.

Results and Discussion

The results of Experiment 1a showed no evidence of interference between shape and spatial WM. Whereas higher spatial WM load significantly impaired the spatial WM score, it did not affect the shape WM performance (Figure 2A). We conducted a within-subject ANOVA on the accuracy data with the two factors, spatial WM load size (1, 3, or 5) and test type (spatial or shape). In addition to a significant main effects of spatial WM load size and test type [F(2,58) = 25.63, p < 0.001, $η_{p}^{2}$ = 0.47, F(1,29) = 24.03, 24.03, p < 0.001, $η_{p}^{2}$ = 0.45, respectively), the interaction between the two factors was also significant [F(2,58) = 16.79 p < 0.001, $η_{p}^{2}$ = 0.37]. Post hoc analyses showed that, as spatial WM load increased, accuracy in the spatial WM task decreased significantly (92.8, 85.7, and 79.3% for the load 1, 3, and 5, respectively; for each difference, ps < 0.001), but shape WM performance did not (80.1, 79.0, and 78.8% for the load 1, 3, and 5, respectively. ps > 0.995 for the difference between each load size). Finally, note that the insensitivity of shape WM score to the change of spatial WM load was apparently not due to a floor effect, because the mean performance in all conditions (around 80%) was far better than what would have been expected based on random guesses (50%).

FIGURE 2

FIGURE 2. The results for Experiments 1a (A), 1b (B), and 1c (C). The broken lines with empty squares and the solid lines with filled circles indicate the accuracy (% correct) of the spatial and the shape WM test trials, respectively. In all experiments, as the spatial WM load size increased, the performance of the spatial WM clearly decreased; the shape WM performance were, however, almost constant. The error bars indicate the SEM.

Experiment 1b

The procedure of Experiment 1b was almost the same as Experiment 1a, except that it adopted an alternative way to present the spatial cue sequence. In this experiment, the presentation interval was fixed to 1,500 ms (Figure 1B). This procedure eliminated the influence of the difference of interval length between conditions, and made it possible to test more precisely whether the consumption of spatial WM capacity disrupted the shape WM processing.

Method