Knowledge of Previous Tasks: Task Similarity Influences Bias in Task Duration Predictions

Thomas, Kevin E.; König, Cornelius J.

doi:10.3389/fpsyg.2018.00760

ORIGINAL RESEARCH article

Front. Psychol., 24 May 2018

Sec. Cognition

Volume 9 - 2018 | https://doi.org/10.3389/fpsyg.2018.00760

Kevin E. Thomas¹*

Cornelius J. König²

¹Department of Psychology, Faculty of Science and Technology, Bournemouth University, Bournemouth, United Kingdom
²Department of Psychology, Faculty of Human and Business Sciences, Saarland University, Saarbrücken, Germany

Bias in predictions of task duration has been attributed to misremembering previous task duration and using previous task duration as a basis for predictions. This research sought to further examine how previous task information affects prediction bias by manipulating task similarity and assessing the role of previous task duration feedback. Task similarity was examined through participants performing two tasks 1 week apart that were the same or different. Duration feedback was provided to all participants (Experiment 1), its recall was manipulated (Experiment 2), and its provision was manipulated (Experiment 3). In all experiments, task similarity influenced bias on the second task, with predictions being less biased when the first task was the same task. However, duration feedback did not influence bias. The findings highlight the pivotal role of knowledge about previous tasks in task duration prediction and are discussed in relation to the theoretical accounts of task duration prediction bias.

Introduction

Predicting task duration has been the focus of much research, which has almost universally found that predictions are biased (e.g., Buehler et al., 1997; König et al., 2015). Prediction bias is evident on tasks as diverse as writing a college thesis (Buehler et al., 1994), shopping for gifts (Kruger and Evans, 2004), and solving abstract problems (Thomas et al., 2003). There is considerable support for the planning fallacy, which is the tendency to underestimate future task duration despite knowing that previous similar tasks did not finish on time (Buehler et al., 2010b).

The planning fallacy was identified by Kahneman and Tversky (1979) who suggest that two kinds of data are available when predicting task duration: singular and distributional information. Distributional information is essentially base-rate data (e.g., previous task performance), whereas singular information concerns aspects of a focal task (e.g., number of parts). Kahneman and Tversky (1979) claim that the planning fallacy occurs because singular information becomes the focus of attention whilst distributional information is ignored. This explanation has been termed the inside–outside account (Kahneman and Lovallo, 1993), as using singular or distributional information is like taking an ‘inside’ or ‘outside’ view of a focal task, respectively.

Inherent in the inside–outside account is the notion that people possess distributional information (e.g., they remember their performance on previous similar tasks) but typically ignore it when predicting task duration. Kahneman and Tversky (1979) suggest that when making a prediction on a task that has been encountered before people place less emphasis on what task information they remember and more emphasis on what they perceive differentiates the current task from previous similar tasks. Accordingly, through the use of verbal protocols (Ericsson and Simon, 1980), Buehler et al. (1994) found that predictions are typically based on aspects of focal tasks, but when people are prompted to consider their previous task performance just before making a time prediction, the temporal underestimation indicative of the planning fallacy is reduced. Buehler et al.’s (1994) research suggests that people tend to take an ‘inside’ perspective when making task duration predictions, but when they incorporate what information they remember about previous tasks into their predictions, they do a better job of judging when tasks will finish. Although the inside–outside account makes no prediction about the role of memory for previous task performance in the task duration prediction process, the work of Buehler et al. (1994) highlights the benefit of taking such an outside perspective when making time predictions. The role of memory for previous task performance is central to another account of the planning fallacy which claims that people use distributional information but inaccurately recall it when making time predictions (Roy et al., 2005). This memory-bias account states that people consider previous task information but their memory for such information is biased, which leads to biased predictions.
Roy and Christenfeld (2007) suggest that because people rarely closely monitor the durations of the tasks they perform in daily life, the planning fallacy (and the overestimation of future task duration) is a result of memory being for estimated or perceived duration rather than actual duration. Support for this claim comes from the retrospective time estimation literature, where autobiographical events (Burt, 1992) and public events (Burt and Kemp, 1991) are remembered as being shorter than was actually the case. There is growing support for the memory-bias account because the amount of prior experience of a focal task seems to matter when it comes to underestimating task duration (Roy and Christenfeld, 2007). Moreover, prediction bias is reduced when feedback about previous task duration is provided, thus correcting memory (Roy et al., 2008). Similar to research supporting the inside–outside account (Buehler et al., 1994), Roy et al.’s (2008) findings suggest that using previous task information can reduce prediction bias, suggesting some complementarity between the memory-bias and inside–outside accounts. However, Roy et al. (2008) found that such information had to be accurate to be beneficial whereas this was not examined by Buehler et al. (1994), suggesting an arguably subtle difference between the two accounts. A more obvious way in which the accounts differ though is in the link between the use of previous task information and prediction bias, with the inside–outside account predicting that bias is due to not using information and the memory-bias account predicting that bias occurs because such information is incorrectly used.

Similar to the memory-bias account, the anchoring account (Thomas et al., 2007) emphasizes the use of information about previous tasks. The account derives from the anchoring and adjustment heuristic (Tversky and Kahneman, 1974) and posits that the actual or perceived duration of previous tasks serves as a basis (anchor) for predictions which are typically insufficiently adjusted according to the demands of focal tasks. This anchoring and insufficient adjustment process results in underestimation when previous tasks are shorter than focal tasks and overestimation when previous tasks are longer than focal tasks. This difference in the direction of bias was found by Thomas et al. (2007) who varied the relative durations of previous and focal tasks. Further support for the anchoring account comes from studies where participants are shown numbers (anchors) representing the durations of previous tasks before predicting the durations of focal tasks. For example, König (2005) found that presenting an anchor concerning a shorter or longer duration on the same task as a focal task resulted in underestimation or overestimation, respectively. Similarly, Thomas and Handley (2008) found that the direction of bias differed in the same way when anchor values concerned a task that was the same as or different to a focal task. Moreover, bias was reduced among participants that reported taking account of their performance on previous similar tasks when making a prediction on the focal task, suggesting that prior task experience might reduce the impact of task duration anchors on prediction bias. However, prior task experience was not manipulated by Thomas and Handley (2008), meaning that the effect of the similarity between anchor value (previous) tasks and focal tasks is unclear.
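The direction-of-bias logic of anchoring and insufficient adjustment can be illustrated with a toy calculation. The sketch below is not a model proposed by any of the cited authors: the linear adjustment rule, the `adjustment` parameter, and the function names are all assumptions introduced purely for illustration, and the focal task's demand is assumed to equal its actual duration.

```python
def anchored_prediction(anchor_min, focal_demand_min, adjustment=0.5):
    """Hypothetical rule: start at the anchor (previous task duration) and
    move only a fraction of the way toward the focal task's demand.
    adjustment = 1.0 would be full adjustment; values below 1.0 are insufficient."""
    if not 0.0 <= adjustment <= 1.0:
        raise ValueError("adjustment must lie between 0 and 1")
    return anchor_min + adjustment * (focal_demand_min - anchor_min)


def bias_direction(predicted_min, actual_min):
    """Label a prediction relative to the actual duration."""
    if predicted_min < actual_min:
        return "underestimation"
    if predicted_min > actual_min:
        return "overestimation"
    return "no bias"


# Illustrative values only (minutes); not data from any experiment.
short_anchor = anchored_prediction(anchor_min=20, focal_demand_min=40)
long_anchor = anchored_prediction(anchor_min=60, focal_demand_min=40)
print(bias_direction(short_anchor, 40))  # shorter previous task
print(bias_direction(long_anchor, 40))   # longer previous task
```

Under these assumptions, any adjustment short of complete leaves the prediction on the anchor's side of the actual duration, which is the pattern the anchoring account describes.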

Anchoring is also posited as a factor in time prediction bias when information about previous task duration is not explicitly presented (e.g., in the form of anchor values). König et al. (2015) suggest that implicitly knowing that one misestimated the duration of a previous task is sufficient to reduce prediction bias on the next task through self-generated feedback on task duration. König et al. (2015) identified this self-learning effect by using two unrelated tasks of similar duration (coloring-in a drawing and building a toy bird) and having some participants estimate the duration of the first task retrospectively. They found that both tasks were underestimated but that the retrospective estimate reduced prediction bias on the second task relative to when no retrospective estimate was made and relative to the first task. König et al. (2015) suggest that making the retrospective estimate led participants to focus attention on how long they thought they took on the first task and they used this information as a self-generated anchor value that was adjusted according to the perceived demands of the second task. Consistent with this claim, self-generated anchor values have been shown to influence the extent of adjustment in judgments on tasks without a temporal element (e.g., general knowledge tests; Epley and Gilovich, 2005), suggesting that an explicit anchor value is not necessary to induce an anchoring and adjustment judgment strategy (Jacowitz and Kahneman, 1995). From their findings, König et al. (2015) proposed the self-learning account of task duration prediction bias.

The self-learning (König et al., 2015), anchoring (e.g., Thomas et al., 2007), memory-bias (e.g., Roy and Christenfeld, 2007), and inside–outside (Kahneman and Lovallo, 1993) accounts all imply that information about previous tasks can influence bias in predictions of task duration. However, unlike the inside–outside account (see e.g., Buehler et al., 1994), the other three accounts suggest that using such information does not necessarily reduce bias. The shared focus on the use of information about previous tasks not necessarily reducing bias implies that the memory-bias, self-learning, and anchoring accounts are complementary.

Although some factors concerning information about previous tasks have been found to influence prediction bias (e.g., temporal distance between actual and predicted duration; Roy and Christenfeld, 2007), a potentially important factor, task similarity, has yet to be studied. The similarity of previous and focal tasks is important and germane to the self-learning, anchoring, and memory-bias accounts because it concerns the relevance of previous task information. Moreover, manipulating task similarity allows the testing of predictions derived from these three accounts plus the inside–outside account. Thus, examining task similarity enhances our understanding of the mechanisms underlying task duration prediction bias and provides an important contribution to the extant literature. Although task similarity should not influence bias according to the inside–outside account (Kahneman and Lovallo, 1993) because information about previous tasks is typically ignored, incorporating remembered information about previous similar tasks into predictions would presumably reduce bias because of the relevance of that information. Thus, a prediction based on the inside–outside account would be that bias is less when tasks are similar, provided that people consider their previous task performance when making predictions (Buehler et al., 1994).

Predicting the effect of task similarity is clearer for the memory-bias account (Roy et al., 2005) because previous task performance is remembered, albeit incorrectly, when making predictions. This process implies that previous similar tasks are relevant to focal tasks and thus informative for predicting task duration, whereas memory for previous different tasks is irrelevant and so of little or no benefit. Furthermore, if people know how long they took on a previous similar task (through receiving feedback), bias should be reduced due to such information correcting memory (Roy et al., 2008). Thus, the memory-bias account would predict that bias is less when tasks are similar but only when the exact task duration of previous tasks is known.

The effect of task similarity can be predicted from the anchoring account because the use of previous task duration in the time prediction process is central to the account. Using previous task duration as an anchor for predictions can occur when previous and focal tasks are different and might or might not reduce bias depending on the relative durations of the tasks (e.g., Thomas et al., 2007). However, such anchoring should only reduce bias when previous and focal tasks are similar because of the relevance of the anchor information (Thomas and Handley, 2008). Thus, the anchoring account would predict that bias is less when previous and focal tasks are similar.

Similarly, because of the centrality of the use of previous task information in the self-learning account, the effect of task similarity can be predicted clearly. Although previous different tasks can influence predictions through learning that such tasks were misestimated and using this knowledge to adapt predictions on a focal task (König et al., 2015), when tasks are similar, this knowledge will be pertinent to the focal task and so reduce bias. Thus, the self-learning account would predict that task similarity reduces bias.

In conjunction with task similarity, examining the role of knowing the exact duration of a previous task through receiving feedback permits further evaluation of the memory-bias, anchoring, and self-learning accounts, all of which emphasize the role of previous tasks in influencing prediction bias. For the memory-bias account, being told how long a previous task took should correct memory, resulting in less prediction bias when previous and focal tasks are similar (Roy et al., 2008). For the anchoring account, basing predictions on the exact feedback duration of a previous similar task should reduce bias because of the relevance of that task information to the focal task (Thomas and Handley, 2008). For the self-learning account, thinking about one’s performance on a previous task where the exact duration is known should be useful only when the tasks are similar, thus reducing bias due to transferring the insight gained from the previous task to the focal task (König et al., 2015).

The present research comprised three experiments in which student participants performed two tasks in two sessions separated by a temporal interval (1 week) that is reflective of what happens in applied settings (e.g., workplaces). In such settings, there are rarely occasions when tasks are totally novel (i.e., when nothing remotely similar has been done before), with people typically undertaking fairly familiar tasks (Boltz et al., 1998; Hinds, 1999). To try to mimic this state of affairs, the focal task chosen here was one that involved the kind of skills that students would often utilize in the course of their studies (e.g., proofreading and editing assignments). In all experiments, the first and second tasks were the same (formatting an essay twice) or different (building a miniature castle first then formatting an essay second). Prediction bias on the second more familiar task was the measure of the effect of task similarity and previous task duration feedback. Feedback on the exact duration of the first task was provided at the end of the first session in Experiments 1 and 2. In Experiment 2, prompting the recall of that feedback at the start of the second session was manipulated. In Experiment 3, providing feedback on the duration of the first task was manipulated.

All experiments tested the hypothesis that bias on the more familiar second task would be less when the two tasks were identical. Experiment 2 also tested the hypothesis that prompting the recall of the feedback duration of the first task would correct memory, resulting in less biased predictions on the second task when the two tasks were identical. Similarly, Experiment 3 tested the hypothesis that having feedback on the exact duration of the first task would correct memory, thereby reducing bias on the second task when the two tasks were identical. Given the shared focus on the use of previous task information not necessarily reducing prediction bias among the memory-bias, self-learning, and anchoring accounts, additional analyses sought to test predictions from these accounts in all experiments.

Experiment 1

Participants performed either the same essay-formatting task at the first and second sessions or built a miniature castle at the first session and formatted the essay at the second session. Participants were told the exact duration of the task they performed at the first session just after they had finished that task.

Method

Participants

Forty psychology students (28 female and 12 male) at a large university in Southern England participated voluntarily in return for course credit. Participants were aged 18 to 33 (M = 20.90, SD = 3.46) years.

Design

A one-factor [Task Similarity (same vs. different)] between-groups design was used.

Materials

A computer-based essay formatting task was chosen as the focal task and the identical previous task because it is typical of the kind of academic task routinely performed by university students and has been used in similar research (Francis-Smythe and Robertson, 1999). The task was a 7-page, 2200-word essay, which was a template from a module on a History degree course. Its text was black, Times New Roman, 12-point font, and had 1.5 line-spacing. Each page was A4-sized with 1-inch margins all around. Formatting the unformatted essay involved making 200 changes ranging from correcting spelling and grammatical errors to changing the typeface (e.g., italicizing text).

A Playmobil® plastic toy was chosen as the different previous task because it has been used in task duration prediction research (Thomas et al., 2007). The task involved constructing a model multi-turreted castle comprising 68 components by following a set of pictorial sequential instructions presented in a booklet.

Procedure

Participants were recruited to an experiment on task performance and were not informed of the time estimation element of it until the end of the second session. At the first session, participants were randomly assigned to one of the two equal-sized first task conditions: essay and castle. At each session, participants were tested individually in a laboratory with no clock, and were instructed to remove their watches and place them out of sight so that they would not be a distraction during task performance.

At the first session, before predicting task duration, participants in the castle condition were informed that they could refer to the instruction booklet whilst building the castle. They were then given 2 min to view the instruction booklet and task components that were arranged on a table in front of them. At the first session, the essay condition was presented with a paper copy of the formatted and unformatted versions of the essay task and given 2 min to inspect them before predicting task duration. Afterwards, the paper copy of the unformatted task was removed from sight and a laptop computer with the unformatted task displayed on its screen (as a Microsoft® Word Version 2000 document) was placed on a table in front of these participants. Performing the task involved making the necessary changes to the on-screen unformatted task so that it looked exactly like the paper version of the formatted task that was located beside the computer and had to be followed.

Before performing each task, participants predicted task duration (in whole or part minutes) in writing. Participants were asked to make as accurate and realistic a prediction as possible and were allowed as much time as they needed to do this. The sheets on which predictions were made were removed from sight whilst the task was performed so that predictions could not be amended. To facilitate thorough task performance, after predicting duration, participants were informed that the task would be inspected for accuracy after completion. Task duration was recorded using a digital stopwatch and participants were informed of their completion times just after the task had finished.

This procedure was repeated for the second session, except that all participants performed the essay task from the first session and no feedback was given about its duration. At the second session, participants that performed the castle task at the first session were in the castle-first condition and participants that performed the essay task at the first session were in the essay-first condition. At the end of the second session, participants were fully debriefed about the study. Each session lasted between 40 and 50 min.

Results and Discussion

The data from Experiment 1 are available in Supplementary Data Sheet S1. At the first session, all participants in the castle condition built the castle correctly and participants in the essay condition made an average of 181.75 (SD = 8.87) changes to the essay. At the second session, the number of changes to the essay did not differ between the essay-first (M = 189.00, SD = 4.63) and castle-first conditions (M = 186.80, SD = 4.95), t(38) = 1.45, p = 0.155, d = 0.46, suggesting that any difference in prediction bias found at the second session is not due to the number of changes made to the essay.

Basic descriptive statistics are presented in Table 1, which shows that there was underestimation on the essay and overestimation on the castle at the first session. The tasks differed in duration, with the castle taking just over half as long as the essay, t(38) = 5.38, p < 0.001, d = 1.70. At the second session, Table 1 shows that there was overestimation in the essay-first condition and underestimation in the castle-first condition. Table 1 shows that the duration of the second essay task differed between the conditions, with the castle-first condition taking 20 to 25% longer than the essay-first condition, t(38) = -2.99, p = 0.005, d = 0.94.
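For readers relating the reported t and d values, the following sketch uses the standard identity between an independent-samples t statistic and Cohen's d with a pooled standard deviation. It assumes 20 participants per condition (40 participants in two equal-sized conditions) and is not a statement of how the effect sizes were actually computed; because the reported statistics are rounded, small discrepancies are to be expected. The function name is hypothetical.

```python
import math


def cohens_d_from_t(t, n1, n2):
    """Convert an independent-samples t statistic to Cohen's d
    (pooled-SD version): d = t * sqrt(1/n1 + 1/n2)."""
    return t * math.sqrt(1.0 / n1 + 1.0 / n2)


# Duration difference between castle and essay at the first session.
d_first = cohens_d_from_t(5.38, 20, 20)
print(round(abs(d_first), 2))
```

The sign of d follows the sign of t; the magnitudes are what is reported in the text.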

TABLE 1. Mean (standard deviation) predicted and actual duration and prediction bias (minutes), and proportional error scores per task and task similarity condition in Experiment 1.

As actual task durations were not similar between the conditions in both sessions, proportional error scores were computed to assess prediction bias. These scores measure prediction bias as a function of task duration and are calculated by subtracting actual from predicted task duration and dividing the difference by actual task duration, so that negative scores indicate underestimation and positive scores indicate overestimation. Proportional error scores are well-established as an appropriate dependent measure in the time estimation literature (Brown, 1997).
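As a minimal sketch of the proportional error score, the function below uses the sign convention implied by the reported tests, in which underestimation yields negative values. The function name and the example durations are illustrative assumptions, not data from the experiment.

```python
def proportional_error(predicted_min, actual_min):
    """Proportional error = (predicted - actual) / actual.
    Negative: underestimation; positive: overestimation; zero: no bias."""
    if actual_min <= 0:
        raise ValueError("actual duration must be positive")
    return (predicted_min - actual_min) / actual_min


# Hypothetical participant: predicted 30 min, took 40 min.
print(proportional_error(30, 40))  # -0.25, i.e., underestimated by 25%
```

Dividing by actual duration puts tasks of different lengths on a common scale, which is why the measure suits comparisons between the essay and the castle.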

On the first task, prediction bias was less on the castle, t(38) = -5.96, p < 0.001, d = 1.87. One-sample t-tests with values of zero (no bias) revealed significant underestimation on the essay, t(19) = -7.04, p < 0.001, and non-significant overestimation on the castle, t(19) = 1.89, p = 0.075. As the essay task involves the kind of skills that students are likely to use when preparing college assignments (e.g., proofreading), significant underestimation on the task could be due to participants recalling how they performed on previous similar tasks and using this knowledge as a basis for their prediction. If such knowledge was inaccurately recalled, then significant underestimation would be expected according to the memory-bias account (see Roy and Christenfeld, 2007). Similarly, a lack of knowledge of the seemingly less familiar castle task should result in less biased predictions because of the lack of task-specific memories to inaccurately recall. On tasks where prior task experience is low or absent, predictions would presumably be based on information concerning the task at hand such as the number of distinct, discrete elements the task entails (Thomas et al., 2003). On the second task, prediction bias was less in the essay-first condition, t(38) = 4.97, p < 0.001, d = 1.51, with significant underestimation in the castle-first condition, t(19) = -6.50, p < 0.001, and non-significant overestimation in the essay-first condition, t(19) = 0.88, p = 0.389. The effect of task similarity was further examined by comparing predictions on the second task. Predictions were longer in the essay-first condition than in the castle-first condition, t(38) = 2.69, p = 0.011, d = 0.85, suggesting that the effect was driven by the difference in predictions and actual durations (see above).
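The one-sample tests against zero can be sketched as follows. This is a generic textbook computation using only the Python standard library, not the authors' analysis script; the scores are hypothetical proportional error values chosen for illustration.

```python
import math
import statistics


def one_sample_t(scores, popmean=0.0):
    """Return (t, df) for a one-sample t-test of the mean of `scores`
    against `popmean`, where zero corresponds to no prediction bias."""
    n = len(scores)
    if n < 2:
        raise ValueError("need at least two scores")
    mean = statistics.mean(scores)
    sd = statistics.stdev(scores)  # sample SD (n - 1 denominator)
    t = (mean - popmean) / (sd / math.sqrt(n))
    return t, n - 1


# Hypothetical proportional error scores (all negative: underestimation).
t_value, df = one_sample_t([-0.3, -0.1, -0.2, -0.4])
print(round(t_value, 2), df)
```

A negative t indicates that mean bias lies below zero (underestimation) and a positive t that it lies above zero (overestimation), matching the direction of the results reported above.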

To assess whether predictions were based on previous task duration (Roy et al., 2005), the relationship between predictions on the second task and the actual, feedback duration of the first task was analyzed. Predictions were more highly correlated with actual durations in the essay-first condition, r(20) = 0.71, p < 0.001, than the castle-first condition, r(20) = 0.20, p = 0.405, and this difference was significant, Fisher z = 1.99, p = 0.046. These findings suggest that knowing the exact duration of a previous task is informative for predictions only when this knowledge is relevant to the focal task. In conjunction with the effect of task similarity, these findings indicate that feedback on previous task duration does not debias predictions when it pertains to a different task (Roy et al., 2008).
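The comparison of the two correlations can be sketched with the standard Fisher r-to-z test for independent samples. The code assumes 20 participants per condition and takes the rounded correlations from the text as inputs, so it only approximately reproduces the reported statistic; the function name is hypothetical.

```python
import math


def fisher_z_test(r1, n1, r2, n2):
    """Two-tailed test of the difference between two independent
    Pearson correlations using Fisher's r-to-z transformation."""
    z1, z2 = math.atanh(r1), math.atanh(r2)  # r-to-z transform
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    # Standard normal CDF via the error function.
    cdf = 0.5 * (1.0 + math.erf(abs(z) / math.sqrt(2.0)))
    p = 2.0 * (1.0 - cdf)
    return z, p


z, p = fisher_z_test(0.71, 20, 0.20, 20)
print(round(z, 3), round(p, 3))
```

With these rounded inputs the result is close to the reported Fisher z = 1.99, p = 0.046; any small difference reflects rounding of the correlations.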

To assess whether predictions were anchored on previous task duration, the difference between predictions on the second task and actual durations on the first task was analyzed per condition. The reason for this analysis is that anchoring involves using the feedback duration of the first task as a basis for predicting the duration of the second task. Thus, predictions should be close to actual durations if an anchoring strategy is used (Thomas et al., 2007). Predictions were found to be similar to actual durations in the essay-first condition, t(38) = 0.11, p = 0.915, d = 0.03, and the castle-first condition, t(38) = -0.32, p = 0.754, d = 0.07, suggesting that anchoring occurred. Using an anchoring strategy should result in slight overestimation in the essay-first condition because those participants tended to complete the second task faster than the first task. Conversely, in the castle-first condition, such anchoring should result in underestimation on the essay as the essay was longer than the castle. In conjunction with the task similarity effect, these findings provide support for the anchoring account but indicate that anchoring only reduces misestimation when a previous task is the same as a focal task (Thomas and Handley, 2008).

To assess whether learning from prediction bias on the first task and using this self-generated feedback as a basis for predictions on the second task (König et al., 2015) was influenced by task similarity, error scores on the first and second tasks were compared between the task similarity conditions. A 2 (task) × 2 (task similarity) mixed ANOVA produced an interaction, F(1,38) = 7.24, MSE = 0.12, p < 0.001, $η_{p}^{2}$ = 0.62, with error scores being higher on the first task than the second task in the essay-first condition (p < 0.001) and lower on the first task than the second task in the castle-first condition (p < 0.001). In conjunction with the task similarity effect, this finding indicates that participants learnt from their mistakes on the first task when it was the essay and used this self-generated informative feedback to make a less biased prediction when faced with the same task again. Thus, there is support for the self-learning account (König et al., 2015) and the idea that the transfer of insight gained on a previous task is only useful when that task is the same as a focal task.

This experiment highlights the effect of information about previous tasks on task duration prediction bias and provides support for the memory-bias, anchoring, and self-learning accounts. However, the effect of previous task information was examined indirectly through telling all participants how long they took on the first task. Thus, it was not possible to directly assess whether such feedback was used when predicting task duration. To provide a stronger test of the effect of previous task information, the explicit recall of the feedback duration of the first task was manipulated alongside task similarity in Experiment 2.

Experiment 2

Participants performed two tasks (formatting an essay twice or building a miniature castle then formatting the same essay) 1 week apart. At the end of the first session, the exact duration of the first task was fed back to all participants. At the second session, half of those participants that built the castle and half of those that formatted the essay at the first session were prompted to recall the feedback duration of the first task.

Following Experiment 1, it was hypothesized that: (1) prompting the recall of the duration of the first task would reduce prediction bias on the second task when the two tasks were identical; (2) predictions on the second task would be less biased when the two tasks were identical.