Ego Depletion in Real-Time: An Examination of the Sequential-Task Paradigm

Arber, Madeleine M.; Ireland, Michael J.; Feger, Roy; Marrington, Jessica; Tehan, Joshua; Tehan, Gerald

doi:10.3389/fpsyg.2017.01672

ORIGINAL RESEARCH article

Front. Psychol., 26 September 2017

Sec. Personality and Social Psychology

Volume 8 - 2017 | https://doi.org/10.3389/fpsyg.2017.01672

Ego Depletion in Real-Time: An Examination of the Sequential-Task Paradigm

Joshua Tehan

School of Psychology and Counselling and Institute for Resilient Regions, University of Southern Queensland, Springfield, QLD, Australia

Current research into self-control that is based on the sequential task methodology is currently at an impasse. The sequential task methodology involves completing a task that is designed to tax self-control resources which in turn has carry-over effects on a second, unrelated task. The current impasse is in large part due to the lack of empirical research that tests explicit assumptions regarding the initial task. Five studies test one key, untested assumption underpinning strength (finite resource) models of self-regulation: Performance will decline over time on a task that depletes self-regulatory resources. In the aftermath of high profile replication failures using a popular letter-crossing task and subsequent criticisms of that task, the current studies examined whether depletion effects would occur in real time using letter-crossing tasks that did not invoke habit-forming and breaking, and whether these effects were moderated by administration type (paper and pencil vs. computer administration). Sample makeup and sizes as well as response formats were also varied across the studies. The five studies yielded a clear and consistent pattern of increasing performance deficits (errors) as a function of time spent on task with generally large effects and in the fifth study the strength of negative transfer effects to a working memory task were related to individual differences in depletion. These results demonstrate that some form of depletion is occurring on letter-crossing tasks though whether an internal regulatory resource reservoir or some other factor is changing across time remains an important question for future research.

Introduction

Self-regulation refers to dynamic efforts to monitor and adapt behavior, attention, emotions, and cognitive strategies in a goal-directed way (Carver and Scheier, 2000). The focus of this paper is concerned with one theory of self-regulation, but is aimed at addressing some of the current “crises” besetting self-regulation research. We will first outline the theory and methodology employed to test the theory and then examine what have been labeled the “replication crisis” in psychology (Pashler and Harris, 2012) and the “conceptual crisis” associated with self-regulation research in particular (Lurquin and Miyake, 2017).

While there are a number of different theories that address self-regulation, some based upon shifts in motivation (Inzlicht and Schmeichel, 2012), and some based on notions of cognitive control (Dang et al., 2017), the current experiments deal specifically with the strength model of self-regulation which still exerts influence on current research in spite of doubts concerning the veracity or utility of the model (e.g., Inzlicht and Berkman, 2015). While we focus on this model, we would argue that the issues raised have implications for other models as well.

The strength theory of “ego depletion” is an account of processes believed to underlie self-regulation, and importantly, explain regulatory failures (Baumeister et al., 1998, 2000). The ego depletion framework posits a strength-model or resource-model of self-regulation, whereby the ability to execute regulatory functions mirrors the familiar processes of a muscle temporarily fatiguing with use. A finite supply of internal psychological resources is hypothesized to be available to support regulatory actions and these resources are “spent” in the act of performing them. More precisely, the capacity to carry out the higher-order executive functions that underpin self-regulation and self-control (e.g., concentration and attention regulation, impulse control, emotion regulation, and behavioral inhibition) is governed by the availability of a finite internal psychological resource. Ego depletion refers to a state in which internal resources have become diminished, executive function capacity is reduced, and the likelihood of self-regulatory failure is enhanced (Baumeister and Alquist, 2009).

In the laboratory, ego depletion effects are typically investigated using the “sequential-task paradigm” (or “dual-task” paradigm). As the name suggests, this experimental paradigm involves testing for performance deficits on the second of two tasks (the outcome task) that result from completing an initial task designed to tax self-regulation resources (the depletion task). Participants' performance following a depletion task is then compared to control subjects that have not spent resources on the initial depleting task, with the expectation that those in the experimental group will show poorer performance on the outcome task than the control group. From the perspective of the strength model, two key assumptions underpin the use and interpretations of the dual-task paradigm: firstly, that engagement in the depleting task consumes self-regulatory resources; and secondly, that the decline of self-regulation resources causes the observable deficits on a subsequent self-regulatory task. Many studies have tested and provided supporting evidence for the predictions of the dual-task paradigm (see Hagger et al., 2010 for a review); however this line of evidence has exclusively focused on measuring group differences in carry-over effects on the second task without scrutinizing the actual changes in performance occurring within the depleting task itself.

“Replication Crisis”

In spite of a coherent body of confirmatory evidence, including meta-analyses, suggesting that depletion effects are reliable and moderately sized [d = 0.62, (95% CI: 0.57, 0.67)], more recent meta-analyses have cast doubt on the true magnitude of these effects. Carter and McCullough (2014) suggested that the effect size might be an over-estimate of the true size given publication biases to positive results and the increased likelihood of obtaining such positive effects in experiments that have small numbers of participants. In a subsequent meta-analysis Carter et al. (2015) examined effect size as a function of the outcome task used, showing that carry-over depletion effects differed across tasks, but again, when bias-correction techniques were adopted, effect sizes were not distinguishable from zero. Moreover, the most recent meta-analysis focused solely on the Stroop Task and found little evidence to support the strength model, and what evidence there was, was contaminated by publication bias (Dang et al., 2017).

Just as the meta-analyses cast doubt on the veracity of ego-depletion, there are now a number of highly publicized failures to replicate the phenomenon (Xu et al., 2014) or found it to be substantially smaller in size than reported in meta-analytic syntheses (Tuk et al., 2015). In one large large-N, multi-site study involving 23 different laboratories across English speaking and non-English speaking countries (Hagger and Chatzisarantis, 2016) the same protocol was administered, consisting of a letter crossing manipulation task and the Multi-Source Interference Task as the outcome task. Of the 23 replications approximately half produced positive outcomes and half produced negative outcomes of differing strengths. Overall the small positive effect could not be distinguished from zero. Thus, corrections for small study and publication biases plus failures to replicate question whether ego-depletion is a real phenomenon (Hagger and Chatzisarantis, 2016; Lurquin et al., 2016).

“Conceptual Crisis”

While the meta-analysis and failures to replicate suggest that the strength model has been largely discredited, Lurquin and Miyake (2017) have argued that the ego depletion literature as a whole suffers from a conceptual crisis as well as a replication crisis. They argue that there is a lack of clear operational definitions of self-control; a lack of independent empirical validation for self-control tasks; and a lack of well-specified models that make unambiguous, falsifiable predictions. The lack of independent validation of self-control tasks is readily seen in Baumeister and Vohs (2016) response to the failed multi-site replication experiment. They argued that depletion only occurs under a limited set of task parameters. The depletion task must first set up a habitual response, and then change the task requirements such that this habitual response must be resisted. They argued that failures in replication resulted from an absence of first creating a habitual response. The depletion task could not (rather than failed to) induce ego depletion. A second reason provided for the failed multi-site replication was the assertion that the use of computer-administered tasks was sub-optimal for inducing depletion and that pencil and paper administrations of the depletion task were more potent manipulations. These are not the only parameters that have been proposed to explain different outcomes. The time on task and the level of difficulty have often been used as post-hoc explanations for observed patterns of performance. Moreover, Dang et al. (2013), in one of the few studies that has explored performance on the depletion task across time, raise the interesting possibility that with extended time on the depletion task participants can adapt to the task (in this instance a Stroop task) and “replenish” resources. This later study questions one other fundamental assumption of the strength model that resource depletion occurs over time.

While debate continues about what tasks are truly depleting and why, these events highlight some fundamental considerations about the nature and effect of the depleting task itself. What happens in the depletion task is crucial, as changes in performance, or lack of them, can falsify theories, constrain them or provide confirming evidence for one and disconfirming evidence for another. For example, a demonstration that no change in behavior occurred in the depletion task over time but carry-over effects did emerge, would present strong disconfirming evidence for the strength model (Baumeister et al., 1998, 2007). On the other hand such an outcome would not be problematic for models that attribute depletion effects to task switching aspects of cognitive control, where it is the nature of the two tasks that is important, not what happens within each task (Dang et al., 2013, 2017). Moreover, examining what happens in the depletion task is important from a methodological perspective. Most experiments employ a “hard” version of the depletion task in the experimental group (crossing out letters according to a complex set of rules) and an “easy” version of the task in the control group (crossing out every letter) without ever determining (a) that the hard version produces decrements in performance, (b) that the simple version does not produce decrements in performance, and, most importantly, (c) that decrements in performance on the hard task are more substantial than in the easy version of the task. However, if performance deteriorated in similar ways in both hard and easy versions of the depletion task, then much of the literature that has been cited to invalidate the strength model would itself be called into question if there were no differences between experimental and control groups on the depletion task. Lastly, the Dang et al. (2013) demonstration that people can adapt to the depletion tasks suggests the possibility of individual differences in depletion. If this were the case, it might be possible to identify a group of participants for whom depletion effects are minimal and a group who show severe depletion. From a strength model, carry-over effects would expected to be more pronounced for the second group than the first. We assert that much of the ambiguity of regarding self-regulation research could be eliminated if performance on the depletion task was monitored over time.

In the absence of a consensus about what tasks are actually depleting, we propose an empirical approach to assessing the importance of the above factors in inducing depletion both within the depletion task itself, and carryover effects on an outcome task. One approach to examining depletion is to track performance on the depleting task as it is being completed. If the task exhausts self-control resources then a testable consequence will be an observable decline in performance. The absence of any decline would be a clear indicator that the task does not induce depletion or participants can adapt to the task and replenish the resources (Dang et al., 2013). The observation of declining performance would provide prima facie evidence that depletion could be occurring. The first aim of the current experiments was to test this fundamental assumption of the strength model of ego depletion by measuring performance over time on a commonly-used depletion task. The second aim involved evaluating the claims that Baumeister and Vohs (2016) made regarding the conditions for depletion to be induced, specifically the need for a habit-forming stage and the effects of presentation modality, response modality time on task, and degree of task exposure. Thirdly, we test the utility of individual differences in depletion effects as a further means of testing the assumptions of the strength model.

One of the most commonly-used depletion induction activities is a letter crossing task (Hagger et al., 2010). This task requires participants to scan and identify words within text containing a target letter (commonly the letter “e”) and then identify words where the presence of the letter satisfies a set of conditions. We chose one of the simplest versions of the letter-crossing task similar to the one used by Hagger and Chatzisarantis (2016) that did not involve any pre habit-formation. By not including habit formation, we test a central claim of Baumeister and Vohs (2016) that depletion effects are not induced unless there is a habit-forming stage. One unlikely but possible interpretation of this claim, is that performance on the depletion task might not deteriorate across time, since self-regulation is not required. Alternatively, there might be deterioration across the task of some cognitive resources, but since self-control resources are not depleted, carry-over effects on an outcome task would not be expected. Finding such carry-over effects when there was no habit formation, when the stimuli were presented on a computer, and no motor response was required, would invalidate most of Baumeister and Vohs' claims.

The basic stimuli were held constant across the five studies and involved five short passages of text that varied in length between 150 and 400 words. The first study adopted a paper-pencil letter-crossing procedure using hand-written passages that the participant completed for 10 min (the most commonly used length of time for the letter-crossing task). In study two, the same stimuli and procedures were used but participants were not timed, rather they completed the full task regardless of how long it took. To examine whether computer administration would produce performance changes, in study three participants were once again not timed but were presented with the stories on a computer screen, and participants verbally identified target items. In study four participants were presented with a fully online version of the task where the stories were again presented on a single screen and participants were required to click on the actual letter “e” in the target word. In this version, the depletion task was restricted to 10 min. Study five was conducted to again document depletion over time, but to also confirm that depletion effects on the letter-e task had carryover effects on a working memory task. It also introduced an individual differences approach to understanding depletion effects where we compare performance of people who do not show changes in performance across the letter-e task to those who do show deteriorating performance across time.

Given Lurquin and Miyake (2017) comments regarding operational definitions of self-control, proponents of a strength model could argue that doing the letter-e task involves a series of steps, even without a habit forming stage. Detecting an e in a word is the first step in response chain. Having detected an e, participants need to compare surrounding letters and either inhibit a circling response if the letter is not surrounded by a vowel, or proceed to the next decision point if there is a neighboring vowel. Having determined that one of the surrounding letters is a vowel, the third step involves either proceeding with a response or inhibiting that response. The decision to not respond at two points in the process involves self-control. Each decision to not respond should deplete self-control resources such that participant scores on identifying target letters should decrease with increasing exposure to the task. Thus, from the perspective of the strength model, it is hypothesized that target detection should deteriorate across time on the letter-e task, and to the extent that this happens, this should produce carry-over effects on a subsequent outcome task.

Study 1

This study commenced the study of depletion effects in time by presenting the Letter-e task in a pencil and paper format. The five stories were presented across 13 pages of handwritten text, with each story being written by a different person.

Methods

Ethics Statement

The following studies were approved by the Human Research Ethics Committee at the host institution (H15REA055). Written informed consent was obtained from all participants in Studies 1 to 3, and Study 5. In Study 4, participants consented via entering a unique code into a consent field on the computer page.

Participants

Study one included a mixed sample of 111 students and community residents (40.7% women). The average age of the sample was 29.04 years (range = 15–64, SD = 11.15; one participant did not correctly enter their age).

Tasks and Scoring

The first version of the letter “e” task used five short stories sourced from the internet. These were first transcribed by hand onto 13 pages of lined paper and then photocopied for each participant. While there were differences in hand writing across stories, all were legible enough that participants could identify target items.

The same set of instructions was administered across all five studies, with modifications on how to produce a response (circle a word, say the word, click on the e). The initial instructions indicated that the task was challenging and required attention to detail. It gave a general overview of the task that indicated the requirement to find a specific target letter, “e,” among text words. The instructions then introduced the two rules that participants had to adhere to. Rule one required the “e” to be followed or preceded by another vowel in which case the word was circled. Rule two required that if the accompanying vowel was an “i,” the participant did not circle the word. The participants were then given some practice applying the rules to ensure that they understood the task requirements. The final instruction was to work as quickly as possible without making errors.

In this study the experimenter timed the participants and the task was restricted to 10 min. For each participant, the proportion of target words correctly detected on each page was measured.

Results and Discussion

Data for all five studies are available via the Open Science Framework (https://osf.io/5fhvm/). The data report accuracy across time as a function of the stories (1 through 5) and pages (1 through 13). The mean proportion of words correctly identified per page are summarized in Figure 1, and at the story level in Figure 2. It is clear that there is a deterioration in performance with time until the last page where there is an improvement on the task, consistent with the proposal that participants can conserve resources for a final push when the end point is known (Baumeister, 2001). Excluding the final page, 75% of the variance in target identification across pages is accounted for by a linear function. Because the time period for doing the depletion task was fixed, the number of data points contributing to the average varies by page. For this reason, accuracy from the first two pages, the last two pages completed (which varied depending on how many pages each participant finished), and an average of the pages in between, were compared for the 102 participants who met the threshold of completing at least five pages. For those that completed all 13 pages, the scores on page 12 were used as their final page. The mean proportion of targets correctly detected on each of these five “pages” in order were 0.89 (SD = 0.13), 0.90 (SD = 0.13), 0.82 (SD = 0.14), 0.74 (SD = 0.24), and 0.59 (SD = 0.30). A repeated measures ANOVA indicated there was a large and significant difference between the five page-groupings, F_{(4, 404)} = 57.89, p < 0.001, η_p² = 0.36. Post-hoc tests indicated that there was no significant difference between pages 1 and 2, but these were significantly different from the remaining pages, which all differed from each other (p < 0.05). The difference between the first and last pages produced a large standardized effect size of Cohen's d = 1.03.

FIGURE 1

Figure 1. Proportion of targets detected as a function of page number in Studies 1 and 2.

FIGURE 2

Figure 2. Proportion of targets detected as a function of story number in Studies 1–5.

Study 2

Study two was conducted to examine the extent to which results from Study 1 were merely an artifact of the artificial time constraints placed on the letter-crossing task. While the time constraint was imposed to ensure the tasks mirrored the method of implementation in prior experiments, we sought to determine whether the pattern of performance deterioration would be replicated after removing this constraint.