- School of Psychology, University of New York in Prague, Prague, Czechia
Introduction: Delayed rewards discounting describes the tendency to choose a smaller immediate rewards instead of a larger delayed rewards. Considering the central role of impulsivity in models accounting for criminal conduct in general and violent behavior, the relationship between delayed rewards discounting and crime is likely to be present. Thereby extending the reported association with the addictive behavior. However, it is unclear whether it should be treated as a risk or an etiological factor. Consequently, the current literature review aims to summarize the existing empirical research focused on this aspect of impulsive decision-making among those who have offended.
Methods: The review was performed in accordance with the 2021 Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. The literature search of the Web of Science, PubMed, and PsycINFO databases was conducted in February 2025.
Results: The initial search yielded 1,251 articles. After exclusion of 250 duplicates, 1,001 titles were screened for relevance, leading to 556 abstracts. After reading them, 162 full-text articles were inspected, leaving 25 articles included in the review.
Conclusion: This review demonstrates that although delayed rewards discounting is associated with general criminal conduct, the association with violence specifically is tenuous. Furthermore, several studies point out that influencing serotonergic functioning, behavioral modeling, or future representations have the potential to influence it. However, further detailed research is needed.
Introduction
Delay discounting refers to the decline in the perceived value of a rewards as the time required to obtain it increases. The longer the waiting time for the rewards the higher is the likelihood for abandoning it in favor of a smaller but immediately available one (Odum, 2011). Delay discounting has been consistently linked with a range of maladaptive behaviors, such as addictions (Weinsztok et al., 2021), dysregulated eating (Stojek and MacKillop, 2017), and attention deficit hyperactivity disorder (Jackson and MacKillop, 2016). Following an impulsive choice that favors a smaller, immediate rewards over a larger, delayed rewards is associated with conduct that can result in potential harm to the individual who makes such a decision.
Furthermore, delay discounting, assessed using computerized tasks where participants expected to receive selected amounts or expected an opponent to receive a negative stimulus, is associated with self-reported sexual coercion, antisocial behavior, and aggression (Bobova et al., 2009; Carrier Emond et al., 2018; West et al., 2022). Similarly, longitudinal research has shown that self-reported impulsivity is related to both life-course and adolescent-limited offending, especially violent behavior (Jolliffe et al., 2017; Farrington and Aguilar-Carceles, 2023). Furthermore, self-reported delayed rewards discounting has been consistently linked with substance use, particularly alcohol (Amlung et al., 2017), which in turn has a robust association with violence (Duke et al., 2018). This places opting for smaller, immediate rewards at the expense of larger, but delayed ones, as a potential risk factor, albeit an indirect one, for criminal conduct.
This aligns with the proposition from the General Aggression Model (Allen et al., 2018). According to it, through evaluative decision-making, aggression is enacted when it is judged to lead to desired values. Likewise, the I3 meta-theory of aggression (Finkel and Hall, 2018) places low self-control or a lack of inhibition among the three core factors, which manifest aggressive impulses into conduct. This meta-theory posits that aggression results from evoking proclivity to be aggressive by internal or external stimuli (instigators), which is then boosted or diminished by personal factors (impellance) and is not inhibited by cognitive processes (inhibitors). Indeed, a meta-analysis confirmed that such facets of self-reported impulsivity as negative urgency, sensation seeking, and lack of premeditation were consistently associated with general and physical aggression (Bresin, 2019). Nevertheless, response inhibition, assessed using go/no-go task, exerts different effects on short-lived and continuous aggression among students from the United States (Sebalo et al., 2024). Furthermore, as shown by Madole et al. (2019) behavioral response inhibition and emotional impulsivity exert two separate effects on aggressive behavior.
A potential reason for this is that violent offenders rely on cognitive structures that are different from those of the general population, of the United Kingdom and New Zealand (Polaschek et al., 2009; Sebalo et al., 2023). Thus, the estimated value of various actions for them is likely to be distinct. This would mean that for them, the immediate option of aggressive conduct would not be “the smaller immediate rewards” and being crime-free does not constitute a “larger delayed rewards.” If this is the case, then discounting delayed financial rewards would not translate into a high frequency of violent conduct. Then, delayed rewards discounting stops being an index of impulsivity. Indeed, West et al. (2022) have shown that in the United States, non-offenders high in aggressive or sadistic traits were less likely to discount the infliction of minor harm now in order to inflict more harm in the future.
Similarly, Åkerlund et al. (2016) reported that males who, at the age of 13, discounted larger but delayed hypothetical rewards in a task in favor of a smaller but immediate one, had a higher likelihood of obtaining a criminal conviction by the age of 31 for any crime, including property and violent ones, as compared to those who did not. However, the association weakened when the total number of crimes was ken into account. Likewise, Lee et al. (2017) have shown that although there is a correlation between delay discounting in a hypothetical task and self-reported engagement in any criminal activity, the interaction between these two variables is not straightforward. While the rate of discounting larger delayed rewards was associated with property crime a year later, it was not associated with property crime 2 years later. At the same time, for violent crime, the relationship was reversed. Self-reported violent behavior was associated with higher rates of choosing smaller and immediate rewards a year later, but not the other way around. These results highlight two aspects. First, delayed rewards discounting appears to be a risk factor for engaging in criminal conduct. Second, rather than being a trait characteristic, it is a preference, the rate of exercising, which can be affected by other behaviors.
Considering the high prevalence of impulsivity in forensic populations and its association with aggression (Tonnaer et al., 2016), steep delay discounting can be expected to be prevalent in this population. After all, despite certain reductionism, Gottfredson and Hirschi’s (1990) the general theory of crime posits a deficiency in self-control as the core reason for crime (Burt, 2020). However, a potential complication of the relationship between delayed reward discounting and criminal conduct arises from the realization that such conduct can be perceived as rewarding for those who engage in it. Moreover, given the well-founded argument for reconceptualizing impulsivity as a construct, a systematic investigation focused on this particular aspect of impulsiveness among those who have committed a crime is warranted.
Consequently, the current study aims to summarize the existing research studying delayed rewards discounting among adults who have committed any criminal offense. This allowed to verify whether choosing smaller immediate rewards over larger but delayed ones can be considered a precursor for criminal conduct of any magnitude or nature. When the included studies specified the violent or non-violent nature of the offense, the difference in patterns was captured. Furthermore, it provides a description for choice preferences and associated constructs specifically among those who were found legally guilty of an offense. The choice to focus on forensic1 rather than on the general population was based on the uniqueness of the former. Understanding the etiology and risk factors of criminal conduct necessitates studies involving those whose engagement in it was proven beyond a reasonable doubt.
Materials and methods
The literature review was conducted in accordance with PRISMA guidelines (Page et al., 2021). It was conducted the February 2025, when Wed of Science, PsychINFO and PubMed were searched using the following terms: Psychopathy OR Offend* OR Convict OR Criminal OR prisoners OR felons OR convicts OR inmates OR Antisocial” AND intertemporal choice OR delay discounting OR delayed rewards discounting OR temporal discounting OR delayed gratification OR delayed gratification OR impulsive choice OR impulsive decision making OR intertemporal decision making OR intertemporal choice OR discounting.
Studies were eligible for inclusion if they met the following criteria: were published in English; had a quantitative design; were published in a peer-reviewed journal; assessed delayed rewards discounting and criminal conduct after the age of 18.
Given the study’s aim, the review included only studies that had samples drawn from the forensic population, which is distinct from the general population. The primary criterion was the presence of a criminal conviction. Studies that relied on self-reports of behaviors considered to be criminal were not included, because self-reports do not equate with a legally determined sentence. For similar reasons, although the diagnosis of Antisocial Personality Disorder (ASPD) and psychopathy contains criminal versatility as one of the symptoms, unless the presence of criminal conviction among participants was explicitly stated, the study was excluded.
The initial search yielded 1,251 articles, with 250 duplicates. The remaining 1,001 titles were inspected for relevance. After removing 445 non-relevant titles, 556 abstracts were inspected. Of these, 162 were found to be relevant. Following a full-text inspection of the articles, with particular attention to the assessment of criminal conduct and delayed rewards discounting, 26 articles were found suitable for inclusion in the review. Four researchers reviewed the titles, abstracts and full texts, and performed a quality check. One study was identified as having poor quality; however, it was kept in the review to keep transparency and maintain completeness of the results. The details of the excluded studies are presented in Figure 1.
Study characteristics
The majority of studies were conducted in the United States of America (n = 13), followed by the United Kingdom (n = 5), and Australia (n = 2). Countries with only one study included China, the Netherlands, and New Zealand. There was also one cross-cultural study that was conducted in Italy and Poland. The majority of studies were cross-sectional quasi-experimental (n = 8), as they were conducted at one point in time but compared different groups. There was almost equal amount of purely cross-sectional (n = 7) and experimental (n = 6) studies, and two longitudinal ones. More detailed information on the included studies is provided in Table 1.
The quality of the included studies was assessed using Checklists for Analytical Cross-Sectional Studies, Cohort Studies, Quasi-Experimental Designs, and Randomized Control Studies from the Joanna Briggs Institute (JBI) at the University of Adelaide (Moola et al., 2020). For each item of the checklist, the rating was rated as either present, absent or not applicable, and then a sum score was calculated. One author (IS) rated the included studies separately from the three authors. Two disagreements were resolved in a discussion. There was almost an equal number of studies with excellent quality (n = 13), defined as having a maximum score or of 1 below, and good quality (n = 12), defined as having a score above the median plus 1 and below excellent. One study was of poor quality, but it was retained. As noted in the previous section, studies were only included if they captured criminal conviction, which in most cases was captured through examination of participants’ criminal records. There were only two studies where participants self-reported their incarceration or arrest.
Delayed rewards discounting measurements
All of the included studies assessed delayed rewards discounting by presenting participants with two options: one associated with a smaller but immediate gain and another with a larger rewards, but after a delay. However, the specific paradigms and instruments were different.
An amended version of the self-control task from Mazur (1987), titled the impulsivity paradigm (IMP) was the most widely used (n = 8). During it, two letters appeared on the screen. As soon as one is selected, it remains on the screen and, after a delay, starts to flash. Pressing on it again would yield a set rewards. One of the options started to flash after 5 seconds and yielded 5 cents, while the other one began to flash after 15 s and yielded 15 cents. Participants were paid based on the amount they had earned during the sessions.
A particular variation of the IMP was used in one study (Newman et al., 1992). Participants were presented with two buttons on the screen, where participants could choose either one. The rewards were always 5 cents (represented by a poker chip); however, the consistency and delay differed depending on the condition. In rewards, only the first group button was rewarded 40% of the time and could be pressed immediately, while the other button was rewarded 80% of the presses and could be pressed 10 s after the start of the trial. In the equal condition, the frequency of rewards was the same, but both options could be chosen 10 s after the start of the trial. Lastly, in the rewards and punishment group, one option was rewarded 90% of the presses and penalized (a subtraction of 5 cents) 10% of the trials, and could be pressed 10 s after the start. Meanwhile, the other option could be chosen 5 s after the start, but was rewarded in 70% of trials and punished in 30%.
The full version of the Monetary choice questionnaire (MCQ) from Kirby and Maraković (1996) was the second most consistently used (n = 4) measure. MCQ is a 27-item questionnaire where participants must choose one of two monetary options. Original MCQ includes three groups of rewards values: small ($30–$35), medium ($55–$65), and large ($70–$85). However, the currency and amount were adjusted based on the country where the study took place and the time it was conducted. For instance, “Would you prefer £27 today or £28 in 117 days?” (Hanoch et al., 2013). One choice is smaller but immediate, and another choice is larger but delayed. The magnitude of the rewards and the delay, ranging from 7 to 189 days, differ between trials. Using Mazur’s (1987) discounting equation,2 the rate of discounting can be calculated.
Jones et al. (2013) used only nine, larger rewards value questions from MCQ. Meanwhile, Jones et al. (2015) used a computerized version inspired by it. In this task, the delayed gain or loss was always 10$ or 100$. There were also studies (n = 5) that used similarly phrased questions, but the number of questions and the magnitudes of the rewards were different.
Additionally, a 16-item questionnaire was used by Arantes et al. (2013). In it, participants were asked to imagine they had won a lottery and could choose either the money now or a larger sum after a certain delay. The immediate rewards and delays also varied, with immediate rewards being $500, $1,000, $2,000, and $4,000 and delay options being 1, 2, 4, and 8 years. The main difference, though, was imagining a lottery win and asking to write the sum that would make both options equally attractive.
Rieser et al. (2019) used a computerized version of the delay discounting task (Wittmann et al., 2007), which operates on the same principles of varying magnitude of awards, with immediate rewards ranging from 0 to 476 euros and delayed rewards ranging from 476 to 524 euros and delays being either 5, 30, 180, 365, 1,095, or 3,650 days. It consists of 48 trials, and the outcome measure is the area under the discounting curve.
A slightly different version of the computerized delay discounting task was used by Hosking et al. (2017). It also included 48 trials, but both short- and long-term rewards varied in time and magnitude. Short-term rewards ranged between 5$ and 40$ and could be immediate, within 2 or within 4 weeks. Long-term rewards were dependent on the short-term ones and constituted 1%, 3%, 5%, 10%, 15%, 25%, 35%, or 50% increase. Their delay ranged from 2 to 4 weeks from the short-term one.
The Cambridge Gambling Task was used in two studies. During it, participants saw 10 boxes on a screen and needed to guess behind which one of them a token was hidden and place a bet, in the form of a percentage of the 100 points they had for each trial. The magnitude of the bet was presented either in descending or ascending order. Matching them allowed to determine those who will wait for a high bet in either circumstance and those who opt for the immediate option at the cost of a delayed one but promising higher benefit.
Lastly, one study employed a unique measure. Brown and Gutsch (1985) presented participants with choices concerning objects available in prison, specifically cigarettes, Coca-Cola, pens, and chocolate bars. Participants could choose either the smallest amount immediately or an increasing amount after 6 h, 2 days or 1 week. Participants were also informed that they would be given one of their chosen options selected at random.
Results
Group comparison
A third of the studies (n = 9) compared the patterns of delayed rewards discounting in offenders and those who had not committed any crime. The majority of them (n = 7) showed a clear and significant difference between these two populations. Those who have committed a crime showed higher discounting rates as they consistently preferred smaller short-term rewards to larger but delayed ones (Arantes et al., 2013; Hanoch et al., 2013; Jones et al., 2015; Peng et al., 2022; Torres et al., 2023). Neither of these studies has differentiated between specific types of offense, suggesting that committing one has a relationship with choosing immediate rewards, even if it is smaller than a delayed one. Although Sloan et al. (2014) focused specifically on comparing drunk drivers with other drinkers, the same patterns emerged.
Arantes et al. (2013) specify this preference as “on average, non-offenders reported that $2,520 delayed 1 year was equal in subjective value to $1,000 now, whereas the corresponding indifference point for offenders was $8,183” (p.247). Their study included both males and females from medium secure prisons and community participants and reported a medium to large effect size of the difference. Meanwhile, Peng et al. (2022) demonstrate that even when covariates, such as relative deprivation, sensitivity to rewards and punishment, and decision-making styles are present, delayed rewards discounting still has a significant relationship with unspecified offending. Hanoch et al. (2013) clarified this relationship further. Both those who are or were imprisoned, primarily for drug-related and violent offenses, showed steeper delayed rewards discounting than community participants. However, despite the significant differences, the effect size was small. Likewise, Jones et al. (2015) showed that those with a criminal history, the majority with convictions for theft and burglary, discount delayed rewards more steeply than those without it. Although the study focused on those from the drug court, discounting delayed rewards was not associated with illicit drug use (i.e., failure to comply with programme requirements). However, ex-prisoners discounted the delayed rewards with higher frequency than those currently in prison or community participants. This lends to a supposition that delayed rewards discounting is a habit that develops with experience rather than a stable trait. Indeed, Torres et al. (2023) show that imagining oneself in the future can reduce discounting among veterans with a history of incarceration, for any crime, as well as without. The authors did not report effect sizes, though.
However, there were also studies reporting non-significant differences between those with a criminal conviction, primarily for violent offenses, and those without (De Brito et al., 2013; Lijffijt et al., 2017). One potential reason for these deviations is the task chosen to assess delayed rewards discounting. Rather than asking participants to choose between a certain amount of money now vs. a differing amount later, Lijffijt et al. (2017) the amended version of the self-control task, Mazur (1987) titled the Two Choice Impulsivity Paradigm (TCIP), requires inhibiting an immediate response rather than expressing one’s preference, as was done by aforementioned studies. Meanwhile, De Brito et al. (2013) used gambling task where participants needed to wait before the bet increased in value, which again requires response inhibition rather than just preference expression. Given that these types of tasks assess the behavioral expression of delayed rewards discounting, the aforementioned difference may apply to the cognitions rather than conduct.
Offenders only
Slightly more than half of the studies included in this review (n = 16) focused exclusively on offenders when investigating the constructs associated with delayed rewards discounting. Their results showed potential associations with psychopathy (Brown and Gutsch, 1985; Newman et al., 1992; Hosking et al., 2017), stimulants (Pietras et al., 2003), incarcerations (Varghese et al., 2014), serotonin (Cherek and Lane, 1999a,2000, 2001; Cherek et al., 2002), and tenuous associations with violence (Cherek et al., 1997; Cherek and Lane, 1999b; Brennan et al., 2010), modeling (Stumphauzer, 1972) and substance use (Jones et al., 2013; Huddy et al., 2017). Although most of these relationships are based on a small number of studies, in some cases, even just one, they can serve as a potential guideline.
For example, Brown and Gutsch (1985) reported that male offenders, in a medium security prison, with primary psychopathy chose large rewards with lower delays when compared to those with secondary psychopathy or no such diagnosis. However, Newman et al. (1992) showed that male offenders, in a minimum security prison, diagnosed with psychopathy and low levels of anxiety discounted delayed rewards less frequently when compared to those without psychopathy but with similarly low anxiety. However, this difference was evident only when expressed preferences led to actual rewards. These two studies suggest that although psychopathy can be relevant to the pattern of delayed rewards discounting, it is unlikely to be the sole determinant. However, it is important to highlight that Newman et al.’s (1992) study was the only study of poor quality and that both studies did not report effect sizes. A more recent study has also reported no direct association between delay discounting rate and psychopathy symptoms or criminal convictions, among those currently in medium secure prison (Hosking et al., 2017). However, they did demonstrate the presence of weaker medial cortico-striatal functional connectivity among those with high psychopathy traits during delay discounting tasks, which was associated with criminal convictions.
Likewise, when it comes to ivolence, the results are not consistent. Brennan et al. (2010) reported that despite lower delay discounting rates among offenders with a history of weapon-involved violent crime as compared to those with a history of violent crime without a weapon or non-violent crime at all, these differences were not significant. Although the authors suggest that small sample size prevented them from detecting effects, prior studies with even smaller samples found the relationship to be significant for both male and female offenders on parole, but did not specify the effect size (Cherek et al., 1997; Cherek and Lane, 1999b). Specifically, violent parolees had higher delay discounting rates than non-violent ones. However, a significant moderate negative correlation was reported between delay discounting and aggression for non-violent females, but not for violent ones.
Meanwhile, for males, it was reversed. There was a significant strong positive correlation between delay discounting and aggression among violent males, but not among non-violent males. Taken together, these findings again point to the possible relevance of preferring short-term rewards over delayed ones in understanding violent behavior, but undermine its position as a core determinant. Nevertheless, the relationship with criminal offenses in general, described in the previous section, appears to extend to the number of criminal convictions, of varying severity, and reactive criminal thinking, assessed in a sample of prisoners released primarily from minimum-security prisons (Varghese et al., 2014).
The relationship with serotonin, however, was slightly more straightforward. Cherek and Lane, 1999a,2000) reported an apparent decrease in delay discounting following d,l-fenfluramine (a 5-HT releasing agent) administration among participants with a history of conduct disorder who were on parole, after having an unspecified conviction. However, the lack of sizes precludes the specification of the magnitude. Furthermore, in a subsequent study, despite the presence of a trend, this association was not significant (Cherek and Lane, 2001). Attributing the lack of significance to a smaller sample size, Cherek et al. (2002) showed that administration of paroxetine (a 5-HT reuptake inhibitor) also decreases, albeit to an unspecified magnitude, delay discounting, particularly at the last day of treatment, which took place 2 weeks after the baseline. Consequently, the positioning of serotonin as an active agent in determining the preference for short-term or delayed rewards appears justified. The results showing that serotonin plays an important role in facilitating the ability to prefer larger rewards later at the expense of a smaller rewards immediately align with Miyazaki et al.’s (2012) argument, based primarily on animal studies. Furthermore, they match the suggestion that serotonin affects impulsive aggression through its impact on threat reactivity and harm aversion (da Cunha-Bang and Knudsen, 2021). Administration of methylphenidate also resulted in unspecified decreased delay discounting among males who committed any crime and without a history of ADHD diagnosis or treatment (Pietras et al., 2003). This would also implicate the functioning of dopamine and norepinephrine.
Another common correlate of rewards discounting, namely drug use, was investigated only in two studies that included only participants with a criminal conviction. Huddy et al. (2017) showed that among those in high secure prisons, high delay discounting had a small bivariate association with opiate use, but in a regression model, it did not predict crack or cocaine use or personality problems. Meanwhile, Jones et al. (2013) have shown that low discounting decreases the chances of drug use more efficiently when there is intense supervision of those who have committed any offense than when there is supervision as usual. However, high discounters did not differ from low discounters when the amount of supervision was normal. Furthermore, the effect of intense supervision was varied as the odds ratio ranged from 0.36 to 0.99. Taken together, these results, again, do not show a uniform and robust relationship between substance use and delayed rewards discounting among prisoners.
Lastly, there was one dated study, which had acceptable quality and was among the two assessing a non-pharmacological attempt to influence delayed rewards discounting. Stumphauzer (1972) have shown that exposure to someone else choosing a delayed rewards leads to a significant increase of unspecified magnitude in delay with effect lasting for up to a month, among those currently in a medium secure prison. This highlights a promising technique for counteracting delayed rewards discounting and highlights the need to investigate the difference between capacity and motivation. The inability to control the preference for immediate rewards over larger ones later is not the same as a disinterest in evaluating both options and making a reasoned choice. More replications and further studies are needed, though.
Longitudinal
There were also two longitudinal studies examining delayed rewards discounting and offending behavior. However, both have opposing results. Rieser et al. (2019) followed 213 male offenders on parole with SUD diagnoses for 14.4 months on average. Their results showed no association between delay discounting rates at the baseline and substance use at the follow-up. Likewise, they have reported that delayed rewards discounting is a non-significant predictor of property crimes, violent crimes, or any crime. However, Piquero et al. (2018) followed the convictions of 411 males from the Cambridge Study in Delinquent Development (CSDD) from the age of 18 until 56, and delayed rewards discounting was assessed at the ages of 18, 32, and 48. They have found delayed rewards discounting rate to be a positive predictor of small magnitude for the number of convictions throughout adulthood, even when accounting for other individual and environmental variables (e.g., IQ, extraversion, low family income). Importantly, once other factors were controlled for the effect of delay discounting became more consistent as the low boundary of incidence rate ratio changed from 1.001 to 1.018. Considering the length and scope of the latter study, which included a wide range of offenders, it appears that a preference for immediate but smaller rewards over delayed but larger ones is associated with criminal conduct. However, while the former study assessed delayed rewards discounting using an extensive task, the latter only used three questions.
Discussion
This literature review aimed to summarize the empirical evidence about the delayed rewards discounting among adults with criminal convictions. All but one of the included studies were of good or excellent quality based on the evaluation forms from the Joanna Briggs Institute (JBI) at the University of Adelaide (Moola et al., 2020). They also included at least partially participants with a recorded criminal offense and assessed delayed rewards discounting by presenting two mutually exclusive options. Taken together, the results highlight three key aspects. First, adults who have committed at least one crime tend to opt for smaller, immediate rewards at the expense of larger, delayed ones. However, steeper delayed rewards discounting appears to be a predisposing rather than a sufficient factor of small magnitude in males for future offending, particularly over long periods. Second, although preference for smaller gains right now rather than larger ones after some time is associated with serotonergic functioning, the association with psychopathy, substance use, and violence or criminal thinking needs to be tested further, as the current results are conflicting and come from relatively small samples. Lastly, two studies (Stumphauzer, 1972; Torres et al., 2023) have shown that delayed rewards discounting among offenders can be reduced, suggesting the malleability of this aspect of impulsivity.
The association between delay rewards discounting and registered criminal offenses among adults (Arantes et al., 2013; Hanoch et al., 2013; Varghese et al., 2014; Jones et al., 2015; Peng et al., 2022; Torres et al., 2023) agrees with findings based on self-reported antisocial or offending behavior (Bobova et al., 2009; Carrier Emond et al., 2018; West et al., 2022). It also appears to lend support to the central positioning of cognitive structures and processing in the General Aggression Model (Allen et al., 2018) and I3 meta-theory (Finkel and Hall, 2018). However, considering weak evidence for the link between delayed rewards discounting and violent behavior (Cherek et al., 1997; Cherek and Lane, 1999b; Brennan et al., 2010), this support should be considered partial. A potential explanation for this inconsistency may lie in understanding the rewards magnitude and its nature. The hidden premise behind the suggested relationship of delayed rewards discounting and aggression is that perpetrating aggression yields a smaller but immediate rewards (e.g., maintaining status) that is chosen instead of a larger rewards of a crime-free life and maintenance of the social contract. However, these magnitudes are not necessarily shared by an offender. Indeed, Varghese et al. (2014) has shown that discounting of delayed monetary rewards was associated with reactive but not proactive criminal thinking. Furthermore, West et al. (2022) have shown that non-offenders with antagonistic traits are more likely to choose the delayed option if it promises larger harm. In other words, it is possible that for offenders, the tendency to discount larger but delayed financial rewards does not necessarily transfer to the tendency to discount larger but delayed intrinsic rewards that they can achieve through aggressive conduct.
Nevertheless, these findings clearly show that delayed financial rewards discounting is associated with criminal conduct, albeit with a small to moderate strength. The results from one of the longitudinal studies strengthen this relationship (Piquero et al., 2018). A possible inference from this is that for offenders, crime in general is more related to financial rewards than violence. This would support the proposition of Gottfredson and Hirschi’s (1990) general theory of crime. Furthermore, this aligns with Lee et al.’s (2017) findings demonstrating a relationship between delayed rewards discounting and self-reported property crimes rather than violent crimes. Nevertheless, the proposed separation of crime in general and violence does not fully agree with the findings of Åkerlund et al. (2016). They have reported that delayed rewards discounting during adolescence was associated with at least one violent crime within the next 28 years. Concerning Varghese et al. (2014) findings, it is possible that the violent crime was predominantly reactive rather than proactive. Another possibility for the differences is that Åkerlund et al. (2016) assessed delayed rewards discounting at the age of 13, possibly assessing general impulsivity rather than choice preference.
Nevertheless, the other longitudinal study included in the review did not find any association between delayed rewards discounting and either general or violent or property crime within 12 months (Rieser et al., 2019). The immediate reasons for this discrepancy appear to be the focus only on offenders with SUD diagnoses and a relatively short duration. Still, the lack of association places delayed rewards discounting into the risk factors category rather than the causes.
However, even as a risk factor, delayed rewards discounting is not the optimal strategy for social or financial situations, which raises the question of whether it can be influenced. A set of studies demonstrated that an increase in serotonin, potentially through an increase in inhibitions, can increase the preference for larger delayed rewards (Cherek and Lane, 1999a, 2000). Moreover, there was promising, albeit preliminary, evidence for the effectiveness of non-pharmacological interventions. Stumphauzer (1972) demonstrated that simply witnessing a person opting for the delayed option can increase similar choices among adult prisoners. Meanwhile, Torres et al. (2023) demonstrated that personalizing the date of the delayed rewards can facilitate the refusal of the immediate but smaller gain. Although replications and further investigation are necessary to verify the utility of such interventions, the changes itself is noteworthy. It suggests that delayed rewards discounting can change depending on the environment and the individual’s cognitive structures. This, in turn, lends support to the aforementioned proposition that the same person can express different discounting patterns depending on the nature of the rewards.
Recommendation for future research and practice
The adjustment of the rewards to the behavior postulated to be predicted by delayed rewards discounting should be implemented in further studies. When investigating precursors of financial behavior, the use of monetary rewards is likely to yield more accurate results. However, once the focus shifts to behavior with a potential for intrinsic rewards, then the type of rewards should be changed. Consequently, to address the inconsistencies related to the violent behavior, socially or intrinsically valued rewards should be given. Furthermore, such investigations into aggressive behavior should include covariates in the form of aggressiveness and hostility traits, as well as the potential to differentiate between reactive and proactive aggression.
Similarly, additional research is needed to verify the association between delayed rewards discounting and substance use, which could consider replacing monetary rewards. However, even with financial options, additional studies comparing forensic and general populations are required. Likewise, further investigation into psychopathy, specifically among adults who have offended, is necessary. Psychopathic traits potentially present in non-forensic populations are unlikely to exert the same effect on rewards choice as a cluster of entrenched symptoms found among forensic populations.
Nevertheless, from the perspective of maximizing utility, the primary aim of further research should be to identify interventions that can change the pattern of delayed rewards discounting among both forensic and the general population. Identifying them not only provides concrete tools that a person can use to alter their tendencies but also highlights the key mechanisms that affect the tendency to focus on short-term gains even when long-term ones outweigh them.
Although the current review could not find consistent evidence for delayed rewards discounting being a necessary or sufficient cause, it found support for treating such choice preference as a potential risk factor. Consequently, there are grounds to suggest the implementation of specific modules addressing preference for the small immediate rewards at the cost of larger but delayed ones. If such modules are implemented within existing treatments or interventions addressing cognitive structures supporting criminal conduct, then they can also address the potential problem of the forensic population not perceiving non-criminal life as a delayed but large rewards.
Limitations
The current review is not without its limitations. Although all the included articles assessed delayed rewards discounting by presenting participants with two choices —one that grants a smaller gain in the short term and another that grants a larger gain in the long term —the particulars differed. Studies differed in the number of choices presented to the participants and the differences observed. This undermined the generalization of the results to the forensic population and could have been the source of the discrepancies in the results. Furthermore, despite the inclusion of female adults with criminal convictions in some of the studies, the majority focused exclusively on males, restricting the inference even further.
Another explicit limitation is the clustering of studies in countries with Western values, which prevents the investigation of the relationships between delayed rewards discounting and different cultures. However, the only study from a non-Western culture (Peng et al., 2022) still showed an increased tendency to discount large rewards if they are delayed, suggesting that this specific link might be cross-cultural. Lastly, as this study was a systematic literature review rather than a meta-analysis, the relationships highlighted in it are based on the qualitative analysis of the sources rather than quantitative indices.
Conclusion
This review demonstrates that although delayed discounting of monetary rewards is associated with criminal conduct, the former can only be considered a risk factor, not a causal one. Not only does this relationship appear to be bidirectional, but the current evidence strongly supports it for crimes in general, but not for specific behaviors, such as violence. Nevertheless, the tendency to choose smaller immediate gains at the cost of larger ones in the future is not an entrenched algorithm applied to any two options. Instead, it is affected by the specific circumstances surrounding the choice as well as by the nature of the rewards and the individual’s cognitive structures. Thus, to accurately investigate its role in determining the behavior of adult offenders, individualized and conduct-relevant paradigms should be employed.
Author contributions
IS: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. DD: Data curation, Formal analysis, Investigation, Writing – original draft. SK: Data curation, Formal analysis, Investigation, Writing – original draft. JV: Data curation, Formal analysis, Investigation, Writing – original draft.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
1. ^In this article forensic population refers to those who have committed a criminal offence regardless of the presence of absence of a psychiatric diagnoses and the type of crime.
2. ^V = A / (1 – kD), where V is the perceived values of an outcome; A is an absolute value; D is delay; and k is the discount rate that determines the slope of the discounting function.
References
Åkerlund, D., Golsteyn, B. H., Grönqvist, H., and Lindahl, L. (2016). Time discounting and criminal behavior. Proc. Natl. Acad. Sci. U. S. A. 113, 6160–6165. doi: 10.1073/pnas.1522445113
Allen, J. J., Anderson, C. A., and Bushman, B. J. (2018). The general aggression model. Curr. Opin. Psychol. 19, 75–80. doi: 10.1016/j.copsyc.2017.03.034
Amlung, M., Vedelago, L., Acker, J., Balodis, I., and MacKillop, J. (2017). Steep delay discounting and addictive behavior: A meta-analysis of continuous associations. Addiction 112, 51–62. doi: 10.1111/add.13535
Arantes, J., Berg, M. E., Lawlor, D., and Grace, R. C. (2013). Offenders have higher delay-discounting rates than non-offenders after controlling for differences in drug and alcohol abuse. Legal Crim. Psychol. 18, 240–253. doi: 10.1111/j.2044-8333.2012.02052.x
Bobova, L., Finn, P. R., Rickert, M. E., and Lucas, J. (2009). Disinhibitory psychopathology and delay discounting in alcohol dependence: Personality and cognitive correlates. Exp. Clin. Psychopharmacol. 17, 51–61. doi: 10.1037/a0014503
Brennan, I. R., Moore, S. C., and Shepherd, J. P. (2010). Aggression and attitudes to time and risk in weapon-using violent offenders. Psychiatry Res. 178, 536–539. doi: 10.1016/j.psychres.2009.05.006
Bresin, K. (2019). Impulsivity and aggression: A meta-analysis using the UPPS model of impulsivity. Aggress. Violent Behav. 48, 124–140. doi: 10.1016/j.avb.2019.08.003
Brown, H. J. D., and Gutsch, K. U. (1985). ‘Cognitions associated with a delay of gratification task’. Crim. Justice Behav. 12, 453–462. doi: 10.1177/0093854885012004004
Burt, C. H. (2020). Self-Control and crime: Beyond gottfredson and Hirschi’s theory. Annu. Rev. Crim. 3, 43–73. doi: 10.1146/annurev-criminol-011419-041344
Carrier Emond, F., Gagnon, J., Nolet, K., Cyr, G., and Rouleau, J. L. (2018). What money can’t buy: Different patterns in decision making about sex and money predict past sexual coercion perpetration. Arch. Sex Behav. 47, 429–441. doi: 10.1007/s10508-017-1116-0
Cherek, D. R., and Lane, S. D. (1999a). Effects of d,l-fenfluramine on aggressive and impulsive responding in adult males with a history of conduct disorder. Psychopharmacology 146, 473–481. doi: 10.1007/pl00005493
Cherek, D. R., and Lane, S. D. (1999b). Laboratory and psychometric measurements of impulsivity among violent and nonviolent female parolees. Biol. Psychiatry 46, 273–280. doi: 10.1016/s0006-3223(98)00309-6
Cherek, D. R., and Lane, S. D. (2000). Fenfluramine effects on impulsivity in a sample of adults with and without history of conduct disorder. Psychopharmacology 152, 149–156. doi: 10.1007/s002130000530
Cherek, D. R., and Lane, S. D. (2001). Acute effects of D-fenfluramine on simultaneous measures of aggressive escape and impulsive responses of adult males with and without a history of conduct disorder. Psychopharmacology 157, 221–227. doi: 10.1007/s002130100812
Cherek, D. R., Lane, S. D., Pietras, C. J., and Steinberg, J. L. (2002). Effects of chronic paroxetine administration on measures of aggressive and impulsive responses of adult males with a history of conduct disorder. Psychopharmacology 159, 266–274. doi: 10.1007/s002130100915
Cherek, D. R., Moeller, F. G., Dougherty, D. M., and Rhoades, H. (1997). Studies of violent and nonviolent male parolees: Ii. Laboratory and psychometric measurements of impulsivity. Biol. Psychiatry 41, 523–529. doi: 10.1016/s0006-3223(96)00426-x
da Cunha-Bang, S., and Knudsen, G. M. (2021). The modulatory role of serotonin on human impulsive aggression. Biol. Psychiatry 90, 447–457. doi: 10.1016/j.biopsych.2021.05.016
De Brito, S. A., Viding, E., Kumari, V., Blackwood, N., and Hodgins, S. (2013). Cool and hot executive function impairments in violent offenders with antisocial personality disorder with and without psychopathy. PLoS One 8:e65566. doi: 10.1371/journal.pone.0065566
Dougherty, D. M., Bjork, J. M., Harper, R. A., Marsh, D. M., Moeller, F. G., Mathias, C. W., et al. (2003). Behavioral impulsivity paradigms: a comparison in hospitalized adolescents with disruptive behavior disorders. J. Child Psychol. Psychiatry. Allied Discip 44, 1145–1157. doi: 10.1111/1469-7610.00197
Duke, A. A., Smith, K. M., Oberleitner, L., Westphal, A., and McKee, S. A. (2018). Alcohol, drugs, and violence: A meta-meta-analysis. Psychol. Violence 8, 238–249. doi: 10.1037/vio0000106
Farrington, D. P., and Aguilar-Carceles, M. (2023). ‘The life course of impulsive males from childhood to adulthood’. J. Crim Psychol. 13, 224–238. doi: 10.1108/JCP-12-2022-0036
Finkel, E. J., and Hall, A. N. (2018). The I3 model: A metatheoretical framework for understanding aggression. Curr. Opin. Psychol. 19, 125–130. doi: 10.1016/j.copsyc.2017.03.013
Gottfredson, M., and Hirschi, T. (1990). A general theory of crime. Stanford, CA: Stanford University Press.
Hanoch, Y., Rolison, J., and Gummerum, M. (2013). ‘Good things come to those who wait: Time discounting differences between adult offenders and nonoffenders’. Personal. Individ. Differ. 54, 128–132. doi: 10.1016/j.paid.2012.08.025
Hosking, J. G., Kastman, E. K., Dorfman, H. M., Samanez-Larkin, G. R., Baskin-Sommers, A., Kiehl, K. A., et al. (2017). Disrupted prefrontal regulation of striatal subjective value signals in psychopathy. Neuron 95, 221–231.e4. doi: 10.1016/j.neuron.2017.06.030
Huddy, V., Kitchenham, N., Roberts, A., Jarrett, M., Phillip, P., Forrester, A., et al. (2017). Self-report and behavioural measures of impulsivity as predictors of impulsive behaviour and psychopathology in male prisoners. Personal. Individ. Differ. 113, 173–177. doi: 10.1016/j.paid.2017.03.010
Iozzino, L., Canessa, N., Rucci, P., Iommi, M., Dvorak, A., Heitzman, J., et al. (2022). Decision-making and risk-taking in forensic and non-forensic patients with schizophrenia spectrum disorders: A multicenter European study. Schizophr Res Cogn. 29:100257. doi: 10.1016/j.scog.2022.100257
Jackson, J. N., and MacKillop, J. (2016). Attention-Deficit/Hyperactivity disorder and monetary delay discounting: A meta-analysis of case-control studies. Biol. Psychiatry Cogn. Neurosci. Neuroimag. 1, 316–325. doi: 10.1016/j.bpsc.2016.01.007
Jolliffe, D., Farrington, D. P., Piquero, A. R., Loeber, R., and Hill, K. G. (2017). Systematic review of early risk factors for life-course-persistent, adolescence-limited, and late-onset offenders in prospective longitudinal studies. Aggress. Violent Behav. 33, 15–23. doi: 10.1016/j.avb.2017.01.009
Jones, C. G., Fearnley, H., Panagiotopoulos, B., and Kemp, R. I. (2015). Delay discounting, self-control, and substance use among adult drug court participants. Behav. Pharmacol. 26, 447–459. doi: 10.1097/FBP.0000000000000149
Jones, C. G. A., Kemp, R. I., and Chan, J. S. K. (2013). ‘The relationship between delay discounting, judicial supervision, and substance use among adult drug court clients’. Psychol. Public Pol. Law 19, 454–465. doi: 10.1037/a0034107
Kirby, K. N., and Maraković, N. N. (1996). Delay-discounting probabilistic rewards: Rates decrease as amounts increase. Psychon. Bull. Rev. 3, 100–104. doi: 10.3758/BF03210748
Lee, C. A., Derefinko, K. J., Milich, R., Lynam, D. R., and DeWall, C. N. (2017). Longitudinal and reciprocal relations between delay discounting and crime. Pers. Individ. Dif. 111, 193–198. doi: 10.1016/j.paid.2017.02.023
Lijffijt, M., Lane, S. D., Mathew, S. J., Stanford, M. S., and Swann, A. C. (2017). Heightened early-attentional stimulus orienting and impulsive action in men with antisocial personality disorder. Eur. Arch. Psychiatry Clin. Neurosci. 267, 697–707. doi: 10.1007/s00406-016-0734-1
Madole, J. W., Johnson, S. L., and Carver, C. S. (2019). A model of aggressive behavior: Early adversity, impulsivity, and response inhibition. J. Aggress. Maltreat. Trauma 29, 594–610. doi: 10.1080/10926771.2019.1591561
Mazur, J. E. (1987). “An adjusting procedure for studying delayed reinforcement,” in The effect of delay and of intervening events on reinforcement value, eds M. L. Commons et al. (Mahwah, NJ: Lawrence Erlbaum Associates, Inc).
Mischel, W., and Grusec, J. (1967). Waiting for rewards and punishments: Effects of time and probability on choice. J. Pers. Soc. Psychol. 5, 24–31. doi: 10.1037/h0024180
Miyazaki, K., Miyazaki, K. W., and Doya, K. (2012). The role of serotonin in the regulation of patience and impulsivity. Mol. Neurobiol. 45, 213–224. doi: 10.1007/s12035-012-8232-6
Moola, S. Z., Munn, Z., Tufanaru, C., Aromataris, E., Sears, K., Sfetcu, R., et al. (2020). Systematic reviews of etiology and risk. JBI Manual Evid. Synthesis 1, 217–269. doi: 10.46658/JBIMES-24-01
Newman, J. P., Kosson, D. S., and Patterson, C. M. (1992). Delay of gratification in psychopathic and nonpsychopathic offenders. J. Abnorm. Psychol. 101, 630–636. doi: 10.1037//0021-843x.101.4.630
Odum, A. L. (2011). Delay discounting: I’m a k, you’re a k. J. Exp. Anal. Behav. 96, 427–439. doi: 10.1901/jeab.2011.96-423
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., et al. (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 372:n71. doi: 10.1136/bmj.n71
Peng, J., Zhang, J., Yuan, W., Zhou, X., Tian, J., and Fang, P. (2022). The commission of crime from the perspective of decision-making differences. Front. Psychol. 13:937876. doi: 10.3389/fpsyg.2022.937876
Pietras, C. J., Cherek, D. R., Lane, S. D., Tcheremissine, O. V., and Steinberg, J. L. (2003). Effects of methylphenidate on impulsive choice in adult humans. Psychopharmacology 170, 390–398. doi: 10.1007/s00213-003-1547-2
Piquero, A. R., Farrington, D. P., and Jennings, W. G. (2018). Money now, money later: Linking time discounting and criminal convictions in the cambridge study in delinquent development. Int. J. Offender Ther. Comp. Crim. 62, 1131–1142. doi: 10.1177/0306624X16678938
Polaschek, D. L., Calvert, S. W., and Gannon, T. A. (2009). Linking violent thinking: Implicit theory-based research with violent offenders. J. Interpers. Violence 24, 75–96. doi: 10.1177/0886260508315781
Rieser, N. M., Shaul, L., Blankers, M., Koeter, M. W. J., Schippers, G. M., and Goudriaan, A. E. (2019). The predictive value of impulsivity and risk-taking measures for substance use in substance dependent offenders. Front. Behav. Neurosci. 13:192. doi: 10.3389/fnbeh.2019.00192
Sebalo, I., Bozzay, M., Verona, E., Chu, S., and Ireland, J. L. (2024). Aggressiveness, inhibitory control, and emotional states: A provocation paradigm. Aggress. Behav. 50:e22165. doi: 10.1002/ab.22165
Sebalo, I., Steene, L. M. B., Gaylor, L. L. E., and Ireland, J. L. (2023). A preliminary study exploring the nature of aggression supportive beliefs in a forensic psychiatric sample. J. Crim. Psychol. 13, 334–350. doi: 10.1108/JCP-11-2022-0031
Sloan, F. A., Eldred, L. M., and Xu, Y. (2014). The behavioral economics of drunk driving. J. Health Econ. 35, 64–81. doi: 10.1016/j.jhealeco.2014.01.005
Stewart, N., Chater, N., Stott, H. P., and Reimers, S. (2003). Prospect relativity: How choice options influence decision under risk. J. Exp. Psychol. Gen. 132, 23–46. doi: 10.1037/0096-3445.132.1.23
Stojek, M. M. K., and MacKillop, J. (2017). Relative reinforcing value of food and delayed reward discounting in obesity and disordered eating: A systematic review. Clin. Psychol. Rev. 55, 1–11. doi: 10.1016/j.cpr.2017.04.007
Stumphauzer, J. S. (1972). Increased delay in gratification in young prison inmates through imitation of high-delay peer models. J. Pers. Soc. Psychol. 21, 10–17. doi: 10.1037/h0032064
Tonnaer, F., Cima, M., and Arntz, A. (2016). Executive (Dys)Functioning and impulsivity as possible vulnerability factors for aggression in forensic patients. J. Nerv. Ment. Dis. 204, 280–286. doi: 10.1097/NMD.0000000000000485
Torres, T. M., Steinhauer, S. R., Forman, S. D., and Forster, S. E. (2023). Patients with cocaine use disorder exhibit reductions in delay discounting with episodic future thinking cues regardless of incarceration history. Addict. Behav. Rep. 18:100518. doi: 10.1016/j.abrep.2023.100518
Varghese, F. P., Charlton, S. R., Wood, M., and Trower, E. (2014). Temporal discounting and criminal thinking: Understanding cognitive processes to align services. Psychol. Serv. 11, 171–178. doi: 10.1037/a0035004
Weinsztok, S., Brassard, S., Balodis, I., Martin, L. E., and Amlung, M. (2021). Delay discounting in established and proposed behavioral addictions: A systematic review and meta-analysis. Front. Behav. Neurosci. 15:786358. doi: 10.3389/fnbeh.2021.786358
West, S. J., Lasko, E. N., Hall, C. J., Khan, N. G., and Chester, D. S. (2022). Some revenge now or more revenge later? Applying an intertemporal framework to retaliatory aggression. Motivat. Sci. 8, 33–55. doi: 10.1037/mot0000248
Keywords: delayed rewards discounting, prisoners, offenders, forensic, decision-making, impulsivity, value preference
Citation: Sebalo I, Darashkevich D, Kostelníková S and Voldřichová J (2025) Value preference in forensic population: a systematic literature review of delay discounting among those who have committed an offence. Front. Behav. Neurosci. 19:1666649. doi: 10.3389/fnbeh.2025.1666649
Received: 15 July 2025; Accepted: 22 October 2025;
Published: 19 November 2025.
Edited by:
Jean-Christophe Cassel, Université de Strasbourg, FranceReviewed by:
Erica Holliday, Kennesaw State University, United StatesLaurent Avila Chauvet, Instituto Tecnológico de Sonora (ITSON), Mexico
Copyright © 2025 Sebalo, Darashkevich, Kostelníková and Voldřichová. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ivan Sebalo, aXNlYmFsb0B1bnlwLmN6
Stela Kostelníková