Does Attentional Selectivity in the Flanker Task Improve Discretely or Gradually?

Hübner, Ronald; Töbel, Lisa

doi:10.3389/fpsyg.2012.00434

ORIGINAL RESEARCH article

Front. Psychol., 26 October 2012

Sec. Cognitive Science

Volume 3 - 2012 | https://doi.org/10.3389/fpsyg.2012.00434


Ronald Hübner*

Lisa Töbel

Fachbereich Psychologie, Universität Konstanz, Konstanz, Germany

An important question is whether attentional selectivity improves discretely or continuously during stimulus processing. In a recent study, Hübner et al. (2010) found that the discrete Dual-Stage Two-Phase (DSTP) model accounted better for flanker-task data than various continuous-improvement models. However, in a subsequent study, White et al. (2011) introduced the continuous shrinking-spotlight (SSP) model and showed that it was superior to the DSTP model. From this result they concluded that attentional selectivity improves continuously rather than discretely. Because different stimuli and procedures were used in these two studies, though, we questioned whether the superiority of the SSP model holds generally. Therefore, we fit the SSP model to Hübner et al.’s data and found that the DSTP model was again superior. A series of four experiments revealed that model superiority depends on the response-stimulus interval. Together, our results demonstrate that methodological details can be crucial for model selection, and that further comparisons between the models are needed before it can be decided whether attentional selectivity improves continuously or discretely.

Introduction

Selective spatial attention is an important control mechanism for goal-directed behavior. Accordingly, it has been investigated intensively during the last decades. One idea of how specific information is selected from the visual field is to assume some kind of spatial attentional filtering. For instance, based on results obtained with the spatial-cueing paradigm, some researchers proposed that such filtering proceeds like an attentional spotlight, i.e., that visual attention can be allocated to a certain location and that items at that location are processed more intensively than items at other locations (Posner, 1980; Posner et al., 1980). Further important properties of spatial attention have also been revealed by the flanker task (Eriksen and Eriksen, 1974), in which participants have to categorize a target stimulus as fast and as accurately as possible, while ignoring irrelevant flanker stimuli. The flankers are usually congruent, i.e., associated with the same response as the target, or incongruent, i.e., associated with the opposite response. The degree to which the flankers can be ignored or filtered out is assessed by the difference between the performance for congruent and incongruent stimuli, which is called the flanker congruency effect. Usually, responses to congruent stimuli are faster and more reliable than responses to incongruent stimuli, and the sizes of the differences in response time (RT) and error rate (ER) are considered as measures of the efficiency of selective attention. Results obtained with the flanker task have led to the attentional zoom-lens metaphor, which generalizes the spotlight idea by not only assuming a variable position of the attentional filter, but also a variable size and form (Eriksen and Schultz, 1979; Eriksen and St James, 1986).

The regularly observed flanker congruency effect clearly indicates that selectivity is limited. Moreover, Gratton et al. (1988) analyzed distributional data and found that this limit changes in time. Usually, accuracy on incongruent trials is much higher for slow than for fast responses, indicating that attentional selectivity improves during the course of processing. In view of such results it has been hypothesized that stimulus processing is unselective in a first phase of processing, but then, after some time, enters a second phase with relatively high selectivity (e.g., Gratton et al., 1992). In more recent models, it has also been assumed that the increase in selectivity is controlled by some conflict monitoring mechanism. Accordingly, attentional selectivity is increased only after a response conflict is detected, which also leads to an unselective and a selective phase, at least for incongruent stimuli (e.g., Davelaar, 2008; Yu et al., 2009). Yet, as the zoom-lens metaphor already suggests, a discrete and stage-like improvement of selectivity is not the only way to account for the dynamics of selective attention. It is also possible that selectivity increases continuously with processing time by a gradually narrowing attentional focus on the target item (e.g., Heitz and Engle, 2007). Because the continuous account seems plausible and is relatively easy to formalize, it has been implemented in the frameworks of neural-networks (e.g., Cohen et al., 1992; Liu et al., 2008), of Bayesian observers (e.g., Yu et al., 2009), and of diffusion processes (e.g., Liu et al., 2009).

Recently, however, the idea of a discrete and stage-like improvement of selectivity has also been formalized by Hübner et al. (2010). Their Dual-Stage Two-Phase (DSTP) model relies on the assumption of two discrete stages of stimulus selection, an early stage of low selectivity and a late stage of high selectivity. The information provided by these two stages drives response selection in a first and second phase, respectively. Both phases are modeled by a diffusion process (cf. Ratcliff, 1978). Such processes are basically characterized by a drift rate reflecting the evidence available for responses A and B, and two corresponding thresholds A and −B. Hübner et al. (2010) assumed that in the first phase of response selection the rate is simply the sum of two component rates μ_ta and μ_fl for target and flankers, respectively. If the flankers are incompatible, then μ_fl is negative, which reduces the overall rate. Because the magnitudes of these component rates are modulated by attentional weights (attentional filtering), this part of the model represents early selection. Additionally, though, a late stimulus-selection process runs in parallel with response selection. It is also implemented as a diffusion process with drift rate μ_SS, and selects the target or flanker depending on whether the accumulated evidence first reaches threshold C or −D, respectively. If the target is selected before a response, then the rate of response selection increases to a value μ_RS2, which accounts for the improved accuracy of slower responses. It should be noted that the DSTP model accounts for the dynamics of selectivity within a trial without the assumption of conflict monitoring. An outline of the model with an example process is shown in Figure 1.
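The two-phase mechanism described above can be illustrated with a small simulation. The following is a minimal sketch, not the authors' implementation: it assumes symmetric thresholds (A = B, C = D), a unit diffusion coefficient, and a simple Euler discretization, and it assumes that selecting a flanker sets the Phase 2 rate to −μ_RS2 on incongruent trials (the text above only specifies the case in which the target is selected). The function names and the parameter values are hypothetical and chosen only for illustration.

```python
import math
import random


def simulate_dstp_trial(mu_ta, mu_fl, mu_ss, mu_rs2, a, c, t_er,
                        congruent, rng, dt=0.001, max_t=5.0):
    """Simulate one trial; returns (rt_in_seconds, correct) or (None, None)."""
    # Phase 1 rate: sum of the target and flanker component rates.
    # On incongruent trials the flanker component counts against the target.
    rate = mu_ta + (mu_fl if congruent else -mu_fl)
    x_rs = 0.0          # response-selection evidence, thresholds +a / -a
    x_ss = 0.0          # stimulus-selection evidence, thresholds +c / -c
    selected = False    # has late stimulus selection finished?
    sd = math.sqrt(dt)  # assumption: diffusion coefficient of 1
    t = 0.0
    while t < max_t:
        t += dt
        if not selected:
            x_ss += mu_ss * dt + sd * rng.gauss(0.0, 1.0)
            if x_ss >= c:
                # Target selected: Phase 2 starts with the higher rate.
                rate, selected = mu_rs2, True
            elif x_ss <= -c:
                # Assumption: a selected flanker drives the response it is
                # associated with (wrong response on incongruent trials).
                rate, selected = (mu_rs2 if congruent else -mu_rs2), True
        x_rs += rate * dt + sd * rng.gauss(0.0, 1.0)
        if x_rs >= a:
            return t + t_er, True
        if x_rs <= -a:
            return t + t_er, False
    return None, None   # no response within max_t


def accuracy(congruent, n_trials=400, seed=1):
    """Proportion of correct responses for illustrative parameter values."""
    rng = random.Random(seed)
    outcomes = [simulate_dstp_trial(1.0, 0.8, 3.0, 8.0, 0.7, 0.8, 0.25,
                                    congruent, rng)
                for _ in range(n_trials)]
    finished = [ok for rt, ok in outcomes if rt is not None]
    return sum(finished) / len(finished)
```

With these illustrative values, errors on incongruent trials arise mainly from responses that finish in Phase 1, before stimulus selection has completed, which is the pattern the model uses to explain why slow responses are more accurate.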

FIGURE 1

Figure 1. Outline of the two phases of response selection in the DSTP model. The upper graph represents the response-selection process, whereas the lower graph depicts the stimulus-selection process. In this example stimulus selection (late selection) is successful and selects the correct stimulus. Because response selection has not finished yet at that time, the stimulus selection has the effect that the rate of evidence accumulation for response selection increases, which defines the beginning of Phase 2 of response selection. The slope of the arrows represents the respective rate. The trajectories represent examples of single sample paths.

Hübner et al. (2010) compared the DSTP model to several continuous-improvement models, including the neural-network model of Cohen et al. (1992), which was also implemented as a diffusion-process model (Liu et al., 2008). Accordingly, the improvement of attentional selectivity in the alternative models was generally realized by a continuously increasing drift rate for response selection. However, the function describing how the rate increased with time differed between the models. Fitting the different models to various distributional flanker-task data revealed that the DSTP model was superior, suggesting that attentional selectivity improves discretely rather than continuously. However, a general problem is that there are infinitely many ways in which selectivity can increase continuously in time, so that Hübner et al. (2010) may simply not have found an optimal member of this model class. Indeed, White et al. (2011) questioned whether the assumption of discrete selectivity generally explains data better than continuous selectivity, and proposed a specific shrinking-spotlight (SSP) model, also implemented as a diffusion process.

In the SSP model the overall rate for a given stimulus is also computed from the weighted evidence provided by each sub-component or item. It is assumed that all items provide the same amount of perceptual evidence p. However, the attentional weight for each item is determined by the proportion of the “spotlight” that falls on the item’s location in the display. Selectivity, and consequently the drift rate for incongruent stimuli, increases gradually as the width of the target-centered spotlight shrinks over time at a linear rate, r_d, from sd₀ to a minimum.
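The time course of the SSP drift rate can be sketched as follows. This is a hedged illustration under several assumptions that are not stated in the paragraph above: the spotlight is modeled as a normal density centered on the target, the target occupies one unit of space (±0.5 around the center), all attention that does not fall on the target falls on the flankers, and time is measured in the same unit as the shrinking rate r_d. The function names, the floor `sd_min`, and the parameter values used in any call are hypothetical.

```python
import math


def normal_cdf(x, sd):
    """Cumulative distribution of a zero-mean normal with standard deviation sd."""
    return 0.5 * (1.0 + math.erf(x / (sd * math.sqrt(2.0))))


def spotlight_sd(t, sd0, r_d, sd_min=0.001):
    """Spotlight width: shrinks linearly from sd0 at rate r_d down to a floor."""
    return max(sd0 - r_d * t, sd_min)


def ssp_drift(t, p, sd0, r_d, congruent, half_width=0.5):
    """Drift rate at time t for perceptual evidence p per item."""
    sd = spotlight_sd(t, sd0, r_d)
    # Attentional weight of the target: share of the spotlight on its location.
    a_target = normal_cdf(half_width, sd) - normal_cdf(-half_width, sd)
    # Assumption: the remaining share is distributed over the flankers.
    a_flankers = 1.0 - a_target
    sign = 1.0 if congruent else -1.0
    return p * (a_target + sign * a_flankers)
```

Under these assumptions the congruent drift rate stays at p throughout, whereas the incongruent drift rate starts low (possibly negative) and rises toward p as the spotlight narrows, which is the gradual improvement in selectivity that the model proposes.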

White et al. (2011) applied the SSP model together with discrete selection models, including a simplified version of the DSTP model, to flanker-task data and found that their model was superior. Based on this result, they concluded that processing in the flanker task is better described by gradual than by discrete attentional narrowing, which is contrary to the conclusion of Hübner et al. (2010). Therefore, from our perspective, the crucial question was whether the superiority of the SSP model to the DSTP model holds generally. Because White et al. used a different experimental method than Hübner et al. (2010), it was possible that the SSP model is superior to the DSTP model only under specific conditions.

It is certainly impossible to compare the two models under all possible methodological conditions. However, if selectivity improves continuously, as proposed by White et al. (2011), then one would expect the SSP model to be superior also in accounting for Hübner et al.’s (2010) flanker-task data. Therefore, in a first step we implemented the SSP model and fit it to the distributional data from the three experiments (eight conditions) of Hübner et al.’s (2010) study using the same fitting procedure as in that study (for details see also below). The obtained goodness-of-fit measures are shown in Table 1. For comparison, not only the values for the SSP model are listed, but also those for the DSTP model and the best fitting continuous model from Hübner et al. (2010). The latter model has a non-linearly increasing rate and can be considered as equivalent to the neural-network model of Cohen et al. (1992).

TABLE 1

Table 1. Fit statistics of different models for the three experiments and corresponding conditions in Hübner et al. (2010).

As can be seen in Table 1, with respect to the G² (Wilks likelihood ratio chi-square) values, which represent a basic measure of fit (cf. Ratcliff and Smith, 2004), the DSTP model is superior for the data in all experiments and conditions. The table also shows the Bayesian information criterion (BIC) model-selection statistic (Schwarz, 1978), which takes the number of model parameters into account. According to this statistic, the model with the smaller BIC should be preferred. Considering these values in Table 1, the BIC for the SSP model is slightly smaller than that for the DSTP model (61.0 versus 61.6) only in the 20%-congruent condition. In all other conditions, though, the DSTP model still yields better results. Compared to the non-linear increase model, the BIC for the SSP model was always superior except for the wide condition. Thus, the SSP model is a parsimonious model that, in terms of BIC, is more successful than the best continuous-improvement model considered in Hübner et al. (2010). However, it is still less successful than the DSTP model.

These comparisons show that the SSP model is not generally superior to the DSTP model. Accordingly, the conclusion that attentional selectivity improves continuously is no longer justified. Rather, with respect to the different models, it is obvious that their superiority depends on methodological details. The method applied in Hübner et al. (2010) seems to be favorable for the DSTP model, whereas that in White et al. (2011) is advantageous for the SSP model.

Thus, a further aim of the present study was to investigate which details of the applied methods are crucial for model superiority. Of the differences between the studies, those with respect to stimuli and tasks were most striking. Although both studies used a flanker task, Hübner et al. required parity judgments on numerals, whereas White et al.’s (2011) participants had to indicate the pointing direction of arrows. Therefore, after verifying in our first experiment that White et al.’s (2011) result was replicable in our lab, we combined in Experiment 2 the arrow stimuli and the corresponding task with the procedure in Hübner et al. (2010). As a result, the DSTP model was now better than the SSP model, which indicated that some other variable must be crucial for model superiority. The experiments still differed in stimulus duration, response-stimulus interval (RSI), error feedback, and responding. Because there were striking differences in the variance of the latencies between the experiments, we speculated that stimulus duration might be an important factor. However, its variation in Experiment 3 had no effect on model superiority. Therefore, we next examined the effect of the RSI, because this factor is known to affect automatic as well as controlled processes (e.g., Soetens et al., 1985). Indeed, the result of Experiment 4, combined with that of Experiment 3, shows that a relatively long RSI leads to data that are fit better by the DSTP than by the SSP model, whereas the opposite holds for a relatively short RSI.

Experiment 1

In our first experiment we tried to replicate White et al.’s (2011) results. To this end, we collected data by applying the same stimuli and procedure as in that study. Specifically, we used vertically arranged arrows as stimuli and a “left” or “right” decision as task, and also adopted the other procedural details from White et al. (2011). If the task and procedure matter for model superiority, then the SSP model should again fit the data better than the DSTP model.

Method

Participants

Eighteen participants (mean age 24.4 years, five male) with normal or corrected-to-normal vision participated in the study. They were recruited at the Universität Konstanz and were paid 8 €/h.

Apparatus and stimuli

Stimuli were presented on a 19″-monitor with a resolution of 1280 × 1024 pixels, and a personal computer (PC) served for controlling stimulus presentation and response registration. The item set was the same as in White et al. (2011) and consisted of left or right pointing arrows (<, >). Participants were seated at a distance of about 45 cm from the screen, so that the width and height of the arrows subtended a visual angle of approximately 0.7°. Stimuli were presented in white on a black background. The target arrow always appeared at the center of the screen. Flanker arrows (two above, and two below the target) were arranged vertically as in White et al. (2011). The separation between the items was always 0.4°. For congruent stimuli, the flanker arrows pointed in the same direction as the target arrow, whereas for incongruent stimuli the flankers pointed in the opposite direction.

Procedure

Stimuli were presented at the center of the screen and remained on the display until response. The task was to decide whether the target arrow pointed to the left or to the right, and to indicate the decision by pressing the corresponding key, “y” or “–” on the keyboard (German layout), with the index finger of the left or right hand, respectively. Stimuli were congruent on half of the trials and incongruent on the other half. One second after the response, the next trial began. No error feedback was given. After an RSI of 350 ms the next stimulus appeared.

Participants first performed a 48-trial practice block, and then worked through 16 test blocks of 64 trials each in a 45 min session. Outliers were controlled by eliminating the fastest and slowest responses. Cut-offs were chosen in such a way that less than 1% of the data were excluded (cf. Ulrich and Miller, 1994). For the present experiment this means that responses faster than 250 ms or slower than 1500 ms were excluded from analysis (<0.9% of the data).

Model fitting

To examine model performance, response-time distributions for correct and incorrect responses in each condition (congruent, incongruent) were constructed by quantile-averaging the data at the 0.1, 0.3, 0.5, 0.7, and 0.9 quantiles. By this procedure, the data of each condition were sorted into six bins comprising 10, 20, 20, 20, 20, and 10% of the data, respectively. One exception was the error responses for congruent stimuli. Because they occurred rarely, only the 0.5 quantile was used for representing the corresponding RTs, as in White et al. (2011), which produced only two bins (50, 50%). Computer-simulation versions of the DSTP and the SSP model were then fit to these distributions with the same fit procedure as in Hübner et al. (2010). Specifically, the PRAXIS algorithm (Brent, 1973; Gegenfurtner, 1992) was applied to find parameter values for a given model that minimized the G² statistic (cf. Ratcliff and Smith, 2004):
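The binning step can be sketched in a few lines. This is an illustrative sketch only: the helper names are hypothetical, and the linear-interpolation rule used for the quantiles is an assumption, since the text does not say which quantile estimator was used.

```python
def quantile(sorted_values, q):
    """Linearly interpolated quantile of an ascending list (0 <= q <= 1)."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


def averaged_quantiles(rts_by_participant, probs=(0.1, 0.3, 0.5, 0.7, 0.9)):
    """Compute each participant's RT quantiles, then average across participants."""
    per_person = [[quantile(sorted(rts), q) for q in probs]
                  for rts in rts_by_participant]
    n = len(per_person)
    return [sum(column) / n for column in zip(*per_person)]


def bin_proportions(probs=(0.1, 0.3, 0.5, 0.7, 0.9)):
    """Share of responses between successive quantiles (six bins for five cuts)."""
    edges = (0.0,) + tuple(probs) + (1.0,)
    return [edges[i + 1] - edges[i] for i in range(len(edges) - 1)]


def proportions_between(values, cutoffs):
    """Share of (e.g., simulated) RTs falling into the bins defined by cutoffs."""
    counts = [0] * (len(cutoffs) + 1)
    for v in values:
        counts[sum(1 for c in cutoffs if v > c)] += 1
    return [c / len(values) for c in counts]
```

In such a scheme, the averaged quantiles of the observed data serve as bin boundaries, and `proportions_between` applied to the model's simulated RTs would give the predicted share of responses in each bin.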

G^{2} = 2 \sum_{i = 1}^{J} N p_{i} \ln\left(\frac{p_{i}}{\pi_{i}}\right),

In this equation J is the number of bins, p_i is the proportion of observations in the i^th bin, and π_i is the proportion in this bin predicted by the considered model. N is the number of all observations¹. Because the congruent and incongruent conditions were fit together, we had J = 20 bins (six for correct responses in the congruent condition, two for errors in the congruent condition, six for correct responses in the incongruent condition, and six for errors in the incongruent condition).
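For concreteness, the G² statistic can be computed from observed and predicted bin proportions as in the following sketch. The function name and the small floor `eps`, which guards against a predicted proportion of exactly zero, are assumptions added for illustration and are not part of the procedure described in the text.

```python
import math


def g_squared(observed_props, predicted_props, n_obs, eps=1e-10):
    """G^2 = 2 * sum_i N * p_i * ln(p_i / pi_i) over all bins."""
    total = 0.0
    for p_i, pi_i in zip(observed_props, predicted_props):
        if p_i > 0.0:  # empty observed bins contribute nothing to the sum
            total += n_obs * p_i * math.log(p_i / max(pi_i, eps))
    return 2.0 * total
```

The statistic is zero when the predicted proportions match the observed ones exactly and grows as the two sets of proportions diverge.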

Assuming symmetric thresholds (A = B, C = D), there were seven parameters for the DSTP model, including one parameter (t_er) for representing the non-decisional time. The SSP model had five parameters. Let J_c and J_i be the number of bins for the congruent and incongruent condition, respectively, and M the number of model parameters, then the degrees of freedom (df) are calculated by df = (J_c − 1) + (J_i − 1) − M.
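As a worked instance of this formula, the bin counts given above imply J_c = 6 + 2 = 8 and J_i = 6 + 6 = 12, so the degrees of freedom for the two models follow directly:

```latex
\begin{aligned}
df_{\mathrm{DSTP}} &= (8 - 1) + (12 - 1) - 7 = 11,\\
df_{\mathrm{SSP}}  &= (8 - 1) + (12 - 1) - 5 = 13.
\end{aligned}
```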

We simulated 8 × 10⁵ trials for each condition and fit cycle. To reduce the risk that the obtained parameter estimates represent merely a local minimum, the fit procedure was repeated several times with different sets of initial parameter values.
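The logic of repeating the fit from different starting values can be sketched as follows. Note that this is not PRAXIS: a simple compass (pattern) search stands in for the local optimizer purely for illustration, and the function names, the bounds, and the toy objective used to exercise it are hypothetical.

```python
import random


def compass_search(objective, start, step=0.5, tol=1e-6, max_iter=10_000):
    """Derivative-free local search: try +/- step on each coordinate."""
    x = list(start)
    fx = objective(x)
    for _ in range(max_iter):
        if step < tol:
            break
        improved = False
        for i in range(len(x)):
            for delta in (step, -step):
                candidate = list(x)
                candidate[i] += delta
                fc = objective(candidate)
                if fc < fx:
                    x, fx, improved = candidate, fc, True
        if not improved:
            step *= 0.5  # shrink the step once no neighbor is better
    return x, fx


def multi_start_fit(objective, bounds, n_starts=10, seed=0):
    """Run the local search from random starting points and keep the best fit."""
    rng = random.Random(seed)
    best_x, best_f = None, float("inf")
    for _ in range(n_starts):
        start = [rng.uniform(lo, hi) for lo, hi in bounds]
        x, fx = compass_search(objective, start)
        if fx < best_f:
            best_x, best_f = x, fx
    return best_x, best_f
```

In a model-fitting setting, `objective` would map a parameter vector to the G² value obtained by simulating the model with those parameters. A single local search can stop in whichever basin its starting point happens to lie in, which is why several starts are compared.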

Results and Discussion

Mean performance

The latencies of correct responses were analyzed by a one-factor ANOVA for repeated measures on the factor congruency (congruent, or incongruent). The analysis revealed a significant congruency effect, F(1, 18) = 126, p < 0.001. Responses were faster for congruent than for incongruent stimuli (Table 2). The mean ER was 7.28%. The ERs were subjected to an ANOVA of the same type as for the RTs. It revealed a significant effect of congruency, F(1, 18) = 42.2, p < 0.001, indicating that congruent stimuli produced a smaller ER than incongruent ones (Table 2).

TABLE 2

Table 2. Mean response times and their SD for correct responses, mean response times for error responses, and mean error rates for the different conditions in the four experiments.

These results show the same pattern as those in White et al.’s (2011) first experiment. However, the responses in the present experiment were numerically faster (474 versus 505 ms), and the congruency effect was smaller in RT (Δ38 versus Δ78 ms) as well as in ER (Δ5.81 versus Δ7.6%).

Model fits

The parameters and goodness-of-fit values obtained from fitting the DSTP and the SSP model to the distributional data are shown in Table 3. The table also shows BIC model-selection values (Schwarz, 1978), which represent goodness-of-fit as well but additionally take the number of model parameters into account. Accordingly, the model with the smaller BIC should be preferred. As can be seen, although the pure goodness-of-fit (G²) was slightly better for the DSTP model, the BIC value favors the SSP model (i.e., is smaller for it) because that model has fewer parameters.
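How a model with a slightly worse G² can still obtain the smaller BIC is easy to see numerically. The text does not spell out the BIC formula, so the sketch below assumes a form commonly used with G²-based fits, BIC = G² + M ln N. The function name and all numbers are hypothetical and are not taken from Table 3.

```python
import math


def bic_from_g2(g2, n_params, n_obs):
    """Assumed form: BIC = G^2 + M * ln(N); each parameter adds a penalty."""
    return g2 + n_params * math.log(n_obs)


# Hypothetical values: the seven-parameter model fits slightly better (lower G^2),
# but the five-parameter model has the smaller BIC because its penalty is lower.
bic_seven = bic_from_g2(28.0, 7, 1000)
bic_five = bic_from_g2(30.0, 5, 1000)
```

Under this assumed form, each additional parameter must lower G² by more than ln N to be worth including.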

TABLE 3

Table 3. Parameter estimates and goodness-of-fit measures obtained by fitting the DSTP model and the SSP model to quantile-averaged response-time distributions for the different congruent and incongruent conditions.

Thus, by applying the task and procedure of White et al. (2011), and by fitting the models to the data we have to conclude that the SSP model is indeed superior to the DSTP model, at least under these specific conditions. The fact that the DSTP model is superior under other experimental conditions suggests that procedural differences produced the inconclusive results with respect to model superiority. The question now was which methodological details were responsible for the advantage of the SSP model in the present experiment. To answer this question, we conducted further experiments.

Experiment 2

In this experiment we examined the role of stimulus type and task for model superiority. The hypothesis was that data obtained with arrow stimuli and the corresponding task might generally be better accounted for by the SSP model. If this is the case, then this model should also be superior to the DSTP model when arrow stimuli are combined with the procedure of Hübner et al. (2010). To test this hypothesis, we used the same stimuli and task as in Experiment 1, but applied the procedure of Hübner et al. Specifically, stimuli were presented for only 165 ms, and participants had to indicate their decision by pressing the corresponding key with the index or middle finger of their right hand. Errors were signaled by a tone, and the RSI was 2000 ms. Moreover, whereas the flanking arrows had always been arranged vertically in White et al. (2011), we also included a condition with horizontally arranged items.

If the observed difference in fit performance between the DSTP and the SSP model was due to the applied stimulus type and task, then the SSP model should again be superior, at least for the condition with vertically arranged items. However, if the difference depended on other procedural differences between White et al.’s (2011) and Hübner et al.’s (2010) studies, then the DSTP model should now be better.