A Decision-Theoretic Model of Behavior Change

Matsumori, Kaosu; Iijima, Kazuki; Koike, Yasuharu; Matsumoto, Kenji

doi:10.3389/fpsyg.2019.01042

HYPOTHESIS AND THEORY article

Front. Psychol., 21 May 2019

Sec. Health Psychology

Volume 10 - 2019 | https://doi.org/10.3389/fpsyg.2019.01042

Kaosu Matsumori¹,²*

Kazuki Iijima¹

Yasuharu Koike³

Kenji Matsumoto¹*

¹Tamagawa University Brain Science Institute, Machida, Japan
²Department of Information Processing, Tokyo Institute of Technology, Yokohama, Japan
³Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, Japan

Undesirable habitual or addictive behaviors are often difficult to change. The issue of “behavior change” has long been studied in various research fields. Several models for behavior change have converged to the hypothesis that attitudes, norms, and self-efficacy are important determinants of intentions and behavior. To improve the accuracy of behavior-change models, some researchers have tried to combine behavioral economics models with existing models for behavior change. However, these attempts have failed because the existing models [e.g., Theory of Planned Behavior (TPB)] are not consistent with Expected Utility Theory (EUT), which underlies various behavioral economics models. In the present paper, we clarify the corresponding components between existing models for behavior change and EUT, and propose a new model, the Decision-Theoretic Model of behavior change (DTM), which is a natural extension of ordinary EUT.

Introduction

It is often difficult for clinicians, trainers, or teachers to change people's undesirable habitual or addictive behaviors, such as overeating, excessive drinking, lack of exercise, and smoking. How can we help them change people's behavior for the better? The problem of “behavior change” has long been studied in various research fields such as psychology, pedagogy, nursing, public health, medicine, and health promotion (Fishbein and Ajzen, 2010). Several models for behavior change have converged to the hypothesis that attitudes, norms, and self-efficacy are important determinants of intentions and behavior (Sheeran et al., 2016). However, existing models for behavior change, such as “Social Cognitive Theory” and “Theory of Planned Behavior (TPB)” cannot sufficiently predict the occurrence probabilities of a considered behavior or its change through interventions (Sniehotta et al., 2014).

To improve the accuracy of predictive models for behavior change, some researchers have started to try to combine behavioral economics models with existing models for behavior change (Roberto and Kawachi, 2015). Because behavioral economics models consider various behavioral biases that affect the occurrence of a target behavior and/or its change through interventions, this combination was expected to be useful. However, existing models of behavior change are not consistent with Expected Utility Theory (EUT), which underlies a variety of behavioral economics models (Kahneman and Tversky, 1979; Schoemaker, 1982), and, therefore, this combination of models has been challenging.

In the present paper, by clarifying the corresponding components between TPB and EUT, we propose a new model, the Decision-Theoretic Model of behavior change (DTM), which is consistent with EUT (Figure 1). Specifically, in DTM, we add the components of subjective norm and self-efficacy to ordinary EUT.

Figure 1. (A) EUT. EUT is one of the most popular approaches for rational decision-making in a stochastic environment. An action set (A = {a₁: performing the target behavior, a₂: not performing the target behavior}) and a state set (S = {s₁, s₂, …}) are assumed. The agent holds the belief that each action causes any state with a certain probability in the corresponding action-state link (P(s_n|a_j)). When an action a_j is given, the expected value of subjective utility (E[U_self|a_j]) is calculated. EUT states that the agent chooses a_j, so as to maximize E[U_self|a_j]. (B) EUT-like schema of TPB. Intention to perform the target behavior (i₁) is additionally assumed. In TPB, the three determinants of the behavioral intention are attitude toward the behavior, subjective norm, and perceived self-efficacy. The attitude toward the behavior depends on P(s_n|a₁) and U_self(s_n), subjective norms appear as U_others(a₁), and perceived self-efficacy appears as P(a₁|i₁). (C) DTM. The intention set I = {i₁: intention to perform the target behavior, i₂: intention not to perform the target behavior} as well as the action set, and the state set are assumed. The agent holds the belief that each intention causes both actions with certain probabilities of the corresponding intention-action links (P(a_j|i_h)) in the same way as each action causes the states with certain probabilities of the corresponding action-state links (P(s_n|a_j)). When i_h is given, the expected value of subjective utility (E[(U_self + wU_others)|i_h]) is calculated, where w denotes the weight of U_others relative to U_self in calculating subjective utility. DTM states that the agent chooses intention i_h so as to maximize E[(U_self + wU_others)|i_h].

In the following sections, we first explain the details of EUT; second, we explain the details of TPB and reinterpret TPB in a decision-theoretic way; third, we describe our new model as a natural extension of EUT; fourth, we discuss the superiority of DTM; and finally, we summarize our arguments and discuss future research directions.

Expected Utility Theory (EUT)

EUT is one of the most popular approaches for rational decision-making in a stochastic environment (von Neumann and Morgenstern, 1947). When the state set (S = {s₁, s₂, …, s_n, …, s_N}), the action set (A = {a₁, a₂, …, a_j, …, a_J}), the subjective probability of a state s_n given an action a_j (P(s_n|a_j)), and the subjective utility of a state s_n (U_self(s_n)) are given, EUT states that the agent chooses an action a_j so as to maximize the expected value of subjective utility.

$$E[U_{\mathrm{self}} \mid a_j] = \sum_{n=1}^{N} P(s_n \mid a_j)\, U_{\mathrm{self}}(s_n) \tag{1}$$

In the present paper, we consider a case wherein the action set has two complementary elements (A = {a₁: performing the target behavior, a₂: not performing the target behavior}) (Figure 1A). In many empirical studies, it is assumed that the agent's action-selection rule is based on a sigmoidal function, e.g., the logistic function (Luce, 1959; Sutton and Barto, 1998).

$$P(a_1) = \mathrm{sigmoid}\left(\beta_1 \cdot \{E[U_{\mathrm{self}} \mid a_1] - E[U_{\mathrm{self}} \mid a_2]\} + \beta_0\right) \tag{2}$$

where the inverse temperature β₁ governs the randomness of action selection (the larger β₁ is, the more deterministic the choice), and the constant term β₀ denotes decision bias.

For example, consider the case with S = {s₁: health, s₂: disease} and A = {a₁: exercising, a₂: not exercising}, and suppose that the agent holds the beliefs P(s₁|a₁) = 0.8 and P(s₁|a₂) = 0.2 and the utilities U_self(s₁) = 1 and U_self(s₂) = 0. Then, the expected utilities of the two actions are:

$$\begin{aligned} E[U_{\mathrm{self}} \mid a_1] &= \sum_{n=1}^{2} P(s_n \mid a_1)\, U_{\mathrm{self}}(s_n) = 0.8 \cdot 1 + 0.2 \cdot 0 = 0.8 \\ E[U_{\mathrm{self}} \mid a_2] &= 0.2 \cdot 1 + 0.8 \cdot 0 = 0.2 \end{aligned}$$

When the agent's inverse temperature is β₁ = 1 and the constant term is β₀ = 0, EUT predicts that P(a₁) = sigmoid(0.8 − 0.2) ≈ 0.65 in this simple situation.
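The worked example above can be reproduced in a few lines of Python. This is only a minimal sketch: the function and variable names are illustrative and do not come from the paper, and the logistic function is used as the sigmoid, as the text suggests.

```python
import math


def sigmoid(x):
    """Logistic function, used here as the sigmoidal choice rule of Equation 2."""
    return 1.0 / (1.0 + math.exp(-x))


def expected_utility(p_state_given_action, utility):
    """Equation 1: sum over states of P(s_n | a_j) * U_self(s_n)."""
    return sum(p * u for p, u in zip(p_state_given_action, utility))


# States: s1 = health, s2 = disease (values taken from the example in the text).
u_self = [1.0, 0.0]
p_s_given_a = {
    "a1": [0.8, 0.2],  # exercising
    "a2": [0.2, 0.8],  # not exercising
}

eu = {a: expected_utility(p, u_self) for a, p in p_s_given_a.items()}

beta1, beta0 = 1.0, 0.0  # inverse temperature and decision bias
p_a1 = sigmoid(beta1 * (eu["a1"] - eu["a2"]) + beta0)

print(eu)              # expected utilities of a1 and a2
print(round(p_a1, 2))  # probability of exercising
```

Raising `beta1` in this sketch pushes `p_a1` toward 1, which is one way to see why β₁ is described as controlling the randomness of action selection.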

Theory of Planned Behavior

TPB is a typical model for behavior change, in which the behavioral intention (BI) for the target behavior (a₁) is determined by three factors: attitude toward the behavior, subjective norm, and perceived self-efficacy (Figure 2). At first glance, perceived self-efficacy is different from “perceived behavioral control,” which is the third factor of the original version of TPB, but these two concepts are treated as being the same in a newer version (Fishbein and Cappella, 2006). All behavior determinants are measured by questionnaire ratings for the target behavior. Table 1 shows the typical TPB questionnaire in the case that the target behavior is “Exercising for at least 20 min, three times per week for the next 3 months” (Fishbein and Ajzen, 2010).

Figure 2. TPB. TPB is a typical model for behavior change, in which the BI for the target behavior is determined by three factors: attitude toward the behavior, subjective norm, and perceived self-efficacy. Attitude toward the behavior (E[U_self|a₁]) is determined by aggregating the products of each behavioral belief strength (P(s_n|a₁)) and the evaluation of each outcome (U_self(s_n)) (violet). Subjective norm (U_others) is determined by aggregating the products of each normative belief (U_k(a₁)) and the motivation to comply (m_k) (green). Perceived self-efficacy is the belief about the probability of performing the target behavior successfully when the agent intends to perform it (P(a₁|i₁)) (orange). BI (blue) is determined by the weighted sum of attitude toward the behavior, subjective norm, and perceived self-efficacy. Occurrence of the behavior (red) is a function of BI and actual self-efficacy (P_actual(a₁|i₁)) (gray).

Table 1. A typical questionnaire for TPB.

Attitude toward the behavior is the agent's positive or negative evaluation of performing the target behavior a₁ (Ajzen, 1991; Fishbein and Ajzen, 2010), which is based on EUT in economics, or expectancy-value theory in psychology (Edwards, 1954; Ajzen, 1985). Attitude toward the behavior is determined by aggregating the products of behavioral beliefs and the evaluation of outcomes. As a behavioral belief is the belief (subjective probability) that performing the target behavior (a₁) will lead to a particular outcome state (s_n) in the state set under consideration, we denote the behavioral belief as P(s_n|a₁) (Ajzen, 1985). As the evaluation of an outcome is the expectation of an agent's utility when the outcome is obtained, we denote it as U_self(s_n) (Ajzen, 1985). Then, importantly, we can consider the attitude toward the behavior as the expected utility when a₁ is given (E[U_self|a₁] = Σ_{n=1}^{N} P(s_n|a₁) · U_self(s_n)) (Edwards, 1954; Ajzen, 1985; Fishbein and Ajzen, 2010). It is worth noting that both E[U_self|a₁] and E[U_self|a₂] are considered in EUT, but only E[U_self|a₁] is considered in TPB.

Because the agent's behavior could not be explained well merely by attitude toward the behavior, TPB has added two other factors, subjective norm and perceived self-efficacy.

Subjective norm is the perceived social pressure to engage or not engage in a behavior (Fishbein and Ajzen, 2010). Subjective norm is determined by aggregating the products of normative beliefs and the motivation to comply with other individuals (m_k; k = 1, 2, …, K). As a normative belief refers to the agent's belief about the degree to which a particular individual, k, thinks the agent should perform the target behavior a₁, we consider it as the agent's expectation of that individual's utility when the target behavior is performed, and denote it as U_k(a₁). Then, we can consider the subjective norm as the weighted sum of other individuals' utilities (U_others(a₁) = Σ_{k=1}^{K} m_k · U_k(a₁)) (Fishbein and Ajzen, 2010). It is worth noting that, in TPB, other individuals' utilities are a function of action, whereas the agent's utility in attitude toward the behavior is a function of state.
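As a small illustration of this weighted sum, the following sketch computes U_others(a₁) for three referent individuals. The numerical ratings are hypothetical and are not taken from the paper or from any questionnaire data; the function name is likewise illustrative.

```python
def subjective_norm(normative_beliefs, motivations_to_comply):
    """U_others(a1) = sum over k of m_k * U_k(a1)."""
    if len(normative_beliefs) != len(motivations_to_comply):
        raise ValueError("one motivation-to-comply weight is needed per referent")
    return sum(m * u for m, u in zip(motivations_to_comply, normative_beliefs))


# Hypothetical ratings for K = 3 referents (e.g., family, friends, physician).
u_k = [2.0, -1.0, 3.0]  # U_k(a1): how much each referent wants the behavior
m_k = [0.5, 1.0, 0.2]   # m_k: the agent's motivation to comply with each referent

u_others_a1 = subjective_norm(u_k, m_k)
print(u_others_a1)
```

A referent who disapproves of the behavior (negative U_k) lowers the subjective norm in proportion to how much the agent cares about that referent's opinion.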

(Perceived) self-efficacy, originally proposed by Bandura (Bandura, 1977), is a personal judgement of “how well one can execute courses of action required to deal with prospective situations” (Bandura, 1982). Bandura emphasized it as a determinant of human behavior in addition to outcome expectations (Figure 3). As perceived self-efficacy for the target behavior a₁ is the belief about the probability of performing the behavior successfully when the agent intends to perform the target behavior (i₁), we denote it as P(a₁|i₁). It is worth noting here that the outcome expectation corresponds to the behavioral beliefs mentioned above, because it is defined as an agent's estimate that a given behavior will lead to certain outcomes.

Figure 3. Bandura's schema. Perceived self-efficacy as well as outcome expectation are considered as determinants of human behavior. Perceived self-efficacy (P(a|i)) is the belief about the probability of performing the behavior successfully when the agent intends to perform it. Outcome expectation (P(s|a)) is the belief about the probability of a particular outcome, given the agent's target behavior.

The weighted sum of these three determinants—attitude toward the behavior, subjective norm, and perceived self-efficacy—determines BI (Figure 2).

$$BI(i_1) = w_1 E[U_{\mathrm{self}} \mid a_1] + w_2 U_{\mathrm{others}}(a_1) + w_3 P(a_1 \mid i_1) \tag{3}$$

where w₁, w₂, and w₃ denote the weights of attitude toward the behavior, subjective norm, and perceived self-efficacy, respectively. This equation can be rewritten as:

$$BI(i_1) = w_3 P(a_1 \mid i_1) + \sum_{n=1}^{N} P(s_n \mid a_1) \{w_1 U_{\mathrm{self}}(s_n) + w_2 U_{\mathrm{others}}(a_1)\} \tag{$3'$}$$

which allows us to compare it with DTM later [section Decision-Theoretic Model of Behavior Change (DTM)]. The second term of Equation 3 and the corresponding part of Equation 3′ about U_others are equivalent, because $\sum_{n=1}^{N} P(s_n \mid a_1) = 1$.
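The equivalence of the two forms of BI can be checked numerically. The sketch below evaluates both expressions on the same inputs; the weights and ratings are hypothetical values chosen only for illustration, and the function names are not from the paper.

```python
import math


def bi_eq3(w, p_s_given_a1, u_self, u_others_a1, p_a1_given_i1):
    """Equation 3: weighted sum of attitude, subjective norm, and self-efficacy."""
    w1, w2, w3 = w
    attitude = sum(p * u for p, u in zip(p_s_given_a1, u_self))
    return w1 * attitude + w2 * u_others_a1 + w3 * p_a1_given_i1


def bi_eq3_prime(w, p_s_given_a1, u_self, u_others_a1, p_a1_given_i1):
    """Equation 3': the subjective-norm term is moved inside the sum over states."""
    w1, w2, w3 = w
    inner = sum(p * (w1 * u + w2 * u_others_a1)
                for p, u in zip(p_s_given_a1, u_self))
    return w3 * p_a1_given_i1 + inner


# Hypothetical inputs; the state probabilities must sum to 1 for the
# two forms to agree.
args = dict(
    w=(1.0, 0.5, 2.0),
    p_s_given_a1=[0.8, 0.2],
    u_self=[1.0, 0.0],
    u_others_a1=0.6,
    p_a1_given_i1=0.7,
)

bi = bi_eq3(**args)
bi_prime = bi_eq3_prime(**args)
print(bi, bi_prime)
```

If the probabilities in `p_s_given_a1` did not sum to 1, the two functions would return different values, which is exactly the condition stated in the text.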

Here, we note that BI is not consistent with EUT, because subjective norm and perceived self-efficacy are simply added to E[U_self|a₁]. In other words, attempts to improve the model's accuracy by incorporating subjective norm and perceived self-efficacy in TPB are inconsistent with EUT, which underlies a variety of behavioral economics models (Kahneman and Tversky, 1979; Schoemaker, 1982). We tried to draw a schematic view of TPB while maintaining consistency with EUT, as much as possible (Figure 1B). In the EUT-like schema of TPB, the three determinants of behavioral intention can be identified. However, their summation does not mathematically provide the occurrence probability of the target behavior in the EUT-like schema.

When the target behavior is considered as a dichotomous variable ({a₁: performing the target behavior, a₂: not performing the target behavior}), logistic regression is commonly used to predict the agent's intention. This corresponds to the assumption that the agent's intention-selection rule is based on a sigmoidal function, e.g., the logistic function (Luce, 1959; Sutton and Barto, 1998).

$$P(i_1) = \mathrm{sigmoid}\left(\beta_1 \cdot BI(i_1) + \beta_0\right) \tag{4}$$

The occurrence probability of the target behavior (P(a₁)) is a function of P(i₁) and actual (not perceived) self-efficacy. As actual self-efficacy for the target behavior a₁ should be the objective probability of performing the behavior successfully when the agent intends to perform a₁, we denote it as P_actual(a₁|i₁). However, in many cases, actual self-efficacy is difficult to measure through questionnaires. In such cases, perceived self-efficacy is used as a proxy for actual self-efficacy. Then, the estimated occurrence probability of the target behavior is:

$$P(a_1) = P_{\mathrm{actual}}(a_1 \mid i_1) \cdot P(i_1) \approx P(a_1 \mid i_1) \cdot P(i_1) \tag{5}$$

Here, note that the TPB questionnaire (Table 1) does not include any questions regarding the belief about the probability of achieving the target behavior (a₁) when the agent intends not to perform the behavior (i₂). Calculating P(a₁) without considering P(a₁|i₂) (≈ P_actual(a₁|i₂)) is valid only when P(a₁|i₂) is assumed to be zero; under that assumption, P(a₁) can be calculated from P_actual(a₁|i₁) and P(i₁) alone (cf. Equation 8).

Thus, P(a₁), which requires the value of P(i₁) based on BI(i₁) to be calculated, is what researchers would like to predict in behavior change studies. Therefore, typical TPB questionnaires contain questions about P(s_n|a₁), U_self(s_n), U_k(a₁), m_k, and P(a₁|i₁), to predict P(a₁) (Table 1).
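The chain from BI to the predicted behavior (Equations 4 and 5) can be sketched as follows. The BI value and the self-efficacy rating are hypothetical, the logistic function stands in for the sigmoid, and perceived self-efficacy is used as the proxy for actual self-efficacy, as described above.

```python
import math


def sigmoid(x):
    """Logistic function used as the intention-selection rule."""
    return 1.0 / (1.0 + math.exp(-x))


def tpb_behavior_probability(bi, p_a1_given_i1, beta1=1.0, beta0=0.0):
    """Return (P(i1), P(a1)) following Equations 4 and 5.

    Implicitly assumes P(a1 | i2) = 0, as TPB does.
    """
    p_i1 = sigmoid(beta1 * bi + beta0)  # Equation 4
    p_a1 = p_a1_given_i1 * p_i1         # Equation 5
    return p_i1, p_a1


# Hypothetical inputs: BI(i1) = 2.5 and perceived self-efficacy = 0.7.
p_i1, p_a1 = tpb_behavior_probability(bi=2.5, p_a1_given_i1=0.7)
print(p_i1, p_a1)
```

Because Equation 5 scales P(i₁) by a probability, the predicted P(a₁) in this sketch can never exceed P(i₁).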

Decision-Theoretic Model of Behavior Change (DTM)

As we mentioned in the Introduction, some researchers have recently tried to combine behavioral economics models with existing models for behavior change (Roberto and Kawachi, 2015) to improve the accuracy of behavior prediction. However, this combination is difficult because the existing models of behavior change are not consistent with EUT.

Here, we propose a new model, DTM, which is consistent with EUT. In DTM, we add the components of subjective norm and self-efficacy to the ordinary EUT. To do so, we introduce an intention set (I = {i₁: intention to perform the target behavior, i₂: intention not to perform the target behavior}), in addition to the state set (S = {s₁, s₂, …, s_n, …,s_N}) and the action set (A = {a₁: performing the target behavior, a₂: not performing the target behavior}), which were already included in EUT (Figure 1C).

The occurrence of i_h (h = 1, 2) is determined by expected utility (E[U_total|i_h]) in DTM. E[U_total|i_h] is an aggregation of the products of the subjective probability of a state s_n given an intention i_h (P(s_n|i_h) = Σ_{j=1}^{2} P(s_n|a_j) · P(a_j|i_h)) and the total utility of a state (U_total(s_n)). We assume that total utility is a summation of the agent's utility and others' utility (U_total = U_self + wU_others), both of which are functions of state and behavior, where w denotes the weight of U_others relative to U_self in calculating subjective utility. Thus, expected utility E[U_total|i_h] is:

$$E[U_{\mathrm{total}} \mid i_h] = \sum_{j=1}^{2} P(a_j \mid i_h) \sum_{n=1}^{N} P(s_n \mid a_j) \{U_{\mathrm{self}}(a_j, s_n) + w U_{\mathrm{others}}(a_j, s_n)\} \tag{6}$$

Note that other individuals' utilities in subjective norm are functions of action, whereas the agent's utility in attitude toward the behavior is a function of state in TPB. Here, in DTM, we defined both the agent and other individuals' utilities as functions of action and state.
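The DTM expected utility of Equation 6 can be written as a short function. The sketch below is illustrative only: the array layout, the function name, and all numerical values (including the others' utilities and the intention-action probabilities) are assumptions made for the example, built on the exercise scenario used earlier.

```python
import math


def dtm_expected_utility(p_a_given_i, p_s_given_a, u_self, u_others, w):
    """Equation 6 for one intention i_h.

    p_a_given_i[j]     = P(a_j | i_h)
    p_s_given_a[j][n]  = P(s_n | a_j)
    u_self[j][n]       = U_self(a_j, s_n)
    u_others[j][n]     = U_others(a_j, s_n)
    """
    total = 0.0
    for j, p_a in enumerate(p_a_given_i):
        inner = sum(
            p_s * (u_self[j][n] + w * u_others[j][n])
            for n, p_s in enumerate(p_s_given_a[j])
        )
        total += p_a * inner
    return total


# Actions: a1 = exercising, a2 = not exercising; states: health, disease.
p_s_given_a = [[0.8, 0.2], [0.2, 0.8]]
u_self = [[1.0, 0.0], [1.0, 0.0]]    # here the agent's utility depends on state only
u_others = [[0.5, 0.5], [0.0, 0.0]]  # hypothetical: others value the action itself
w = 1.0

eu_i1 = dtm_expected_utility([0.7, 0.3], p_s_given_a, u_self, u_others, w)
eu_i2 = dtm_expected_utility([0.1, 0.9], p_s_given_a, u_self, u_others, w)
print(eu_i1, eu_i2)
```

Because both utilities are indexed by action and state, the same function covers the TPB-like special cases (U_self depending only on state, U_others only on action) as well as action costs or outcome-dependent social utilities.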

To compare with TPB, we expand Equation 6 as follows:

$$\begin{aligned} E[U_{\mathrm{total}} \mid i_h] &= P(a_1 \mid i_h) \sum_{n=1}^{N} P(s_n \mid a_1) \{U_{\mathrm{self}}(a_1, s_n) + w U_{\mathrm{others}}(a_1, s_n)\} \\ &\quad + P(a_2 \mid i_h) \sum_{n=1}^{N} P(s_n \mid a_2) \{U_{\mathrm{self}}(a_2, s_n) + w U_{\mathrm{others}}(a_2, s_n)\} \end{aligned} \tag{$6'$}$$

Equation 3′ of TPB and Equation 6′ of DTM are different in the following five ways (Figures 1B,C):

(1) E[U_total|i₁] is a kind of expected utility; E[U_total|i₁] in DTM is naturally extended from E[U_self|a₁] in EUT by adding the components of subjective norm and perceived self-efficacy. In contrast, BI(i₁) in TPB cannot be considered as expected utility.

(2) DTM considers not only the expected utility given i₁ (E[U_total|i₁]), but also the expected utility given i₂ (E[U_total|i₂]), whereas TPB considers behavioral intention only for i₁ (BI(i₁)). This difference is important when we consider P(i₁) and P(a₁) later in this section.

(3) U_self(a_j, s_n) and U_others(a_j, s_n) in DTM are more flexible functions than U_self(s_n) and U_others(a₁) in TPB. TPB cannot consider cases in which the agent's utility depends on his/her action cost, or other individuals' utilities depend on the consequences of their actions.

(4) E[U_total|i₁] in DTM considers the utility of the case in which the agent intends to perform the target behavior (i₁), but fails to perform it and instead, performs an alternative action (a₂). However, BI(i₁) in TPB cannot take this into account.

(5) Perceived self-efficacy (P(a_j|i_h)) is multiplied by expected utility given an action in DTM but is added to expected utility given a₁ in TPB.

We assume that the intention-selection rule is based on the sigmoidal function, as with EUT (Luce, 1959; Sutton and Barto, 1998).

$$P(i_1) = \mathrm{sigmoid}\left(\beta_1 \cdot \{E[U_{\mathrm{total}} \mid i_1] - E[U_{\mathrm{total}} \mid i_2]\} + \beta_0\right) \tag{7}$$

The difference between Equation 4 (TPB) and Equation 7 (DTM) is that E[U_total|i₂] is explicitly considered in Equation 7, but not in Equation 4. This difference is not important when E[U_total|i₂] is stable across subjects or contexts, because it is absorbed into the constant term. If E[U_total|i₂] varies across subjects or contexts, which is a plausible assumption, it significantly affects P(i₁).
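The absorption into the constant term can be made explicit. Suppose, as an assumption for this illustration, that E[U_total|i₂] takes the same value c for every subject and context. The argument of the sigmoid in Equation 7 then becomes:

```latex
\begin{aligned}
\beta_1 \cdot \{E[U_{\mathrm{total}} \mid i_1] - c\} + \beta_0
  &= \beta_1 \cdot E[U_{\mathrm{total}} \mid i_1] + (\beta_0 - \beta_1 c) \\
  &= \beta_1 \cdot E[U_{\mathrm{total}} \mid i_1] + \beta_0'
\end{aligned}
```

Here β₀′ = β₀ − β₁c is a symbol introduced only for this illustration. The result has the same functional form as Equation 4, with E[U_total|i₁] in place of BI(i₁), so a fitted intercept cannot distinguish the two models in this case. When c differs between subjects, no single intercept can absorb it.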

The estimated occurrence probability of the target behavior is:

$$\begin{aligned} P(a_1) &= P_{\mathrm{actual}}(a_1 \mid i_1) \cdot P(i_1) + P_{\mathrm{actual}}(a_1 \mid i_2) \cdot P(i_2) \\ &\approx P(a_1 \mid i_1) \cdot P(i_1) + P(a_1 \mid i_2) \cdot P(i_2) \end{aligned} \tag{8}$$

The difference between Equation 5 (TPB) and Equation 8 (DTM) is that Equation 8 explicitly considers the case in which the agent performs the target behavior despite the absence of an intention to do so. This difference is unimportant only if P_actual(a₁|i₂) and/or P(i₂) are zero, because Equation 5 (TPB) and Equation 8 (DTM) coincide in that case.
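Equations 7 and 8 can be combined into one small function. This is a sketch under stated assumptions: the two intentions are treated as complementary, so that P(i₂) = 1 − P(i₁); perceived probabilities are used as proxies for the actual ones; and the expected utilities and probabilities passed in are hypothetical numbers.

```python
import math


def sigmoid(x):
    """Logistic function used as the intention-selection rule."""
    return 1.0 / (1.0 + math.exp(-x))


def dtm_behavior_probability(eu_i1, eu_i2, p_a1_given_i1, p_a1_given_i2,
                             beta1=1.0, beta0=0.0):
    """Return (P(i1), P(a1)) following Equations 7 and 8."""
    p_i1 = sigmoid(beta1 * (eu_i1 - eu_i2) + beta0)          # Equation 7
    p_i2 = 1.0 - p_i1                                        # assumed complementary
    p_a1 = p_a1_given_i1 * p_i1 + p_a1_given_i2 * p_i2       # Equation 8
    return p_i1, p_a1


# Hypothetical expected utilities and intention-action probabilities.
p_i1, p_a1 = dtm_behavior_probability(0.97, 0.31, 0.7, 0.1)

# With P(a1 | i2) = 0, Equation 8 reduces to the TPB form of Equation 5.
p_i1_tpb, p_a1_tpb = dtm_behavior_probability(0.97, 0.31, 0.7, 0.0)

print(p_i1, p_a1, p_a1_tpb)
```

The gap between `p_a1` and `p_a1_tpb` is the contribution of behavior that occurs without the intention to perform it, which is the term TPB leaves out.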

Thus, the occurrence probability of the target behavior is predicted by using these equations (Equations 6–8) in DTM. Therefore, DTM needs some additional questions in its questionnaires (Table 2).

Table 2. A proposed questionnaire for DTM.

To summarize, DTM is a natural extension of EUT, which accounts for behavior change.

An Example Showing the Superiority of DTM

Here, we focus on the fifth difference between Equations 3′ and 6′ in section Decision-Theoretic Model of Behavior Change (DTM), to assert the superiority of DTM over TPB. As we noted above, perceived self-efficacy is multiplied by the weighted sum of attitude toward the behavior and subjective norm in DTM (Equation 6′), whereas it is added to these factors in TPB (Equation 3′).

Let us think about the case of opening a tight jar lid. For the sake of simplicity, let us assume that there is no other individual present. The target behavior (a₁) is “straining the wrist enough to open the jar lid.” Here, i₁ is “intention to strain the wrist enough to open the jar lid,” s₁ is “the lid was opened,” and s₂ is “the lid was not opened.”

In TPB, BI is determined by the following factors: (1) Attitude toward the behavior, which is governed by the value of the contents of the jar to oneself, (2) Subjective norm, which can be ignored in this case, because the absence of any other individual is assumed, (3) Perceived self-efficacy, which is the belief about the probability of straining the wrist enough to open the jar lid when one intends to do it. The estimated weight for attitude toward the behavior, and that for perceived self-efficacy are assumed to be positive in this case. Now, let us assume that this person injured his/her spinal cord and became totally paralyzed. Then, perceived self-efficacy would change to 0, but the attitude toward the behavior (or the subjective norm) would not change. Because BI of TPB is determined by the weighted sum of the attitude toward the behavior, the perceived self-efficacy, and the subjective norm (ignored here), TPB would predict that one will have the intention to strain the wrist enough to open the jar lid, regardless of her/his inability to move, in proportion to the value of the contents of the jar. This prediction is unrealistic, thus presenting a counterexample for TPB.

In contrast, DTM can properly predict that BI is consistently zero regardless of the value of the contents, because the weighted sum of attitude toward the behavior (and the subjective norm) is multiplied by perceived self-efficacy (= 0), showing the superiority of DTM.
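The jar-lid contrast can be illustrated numerically. The sketch below makes several assumptions that are not in the paper: the utility of an opened jar equals the value of its contents, an unopened jar has utility zero, P(s₁|a₁) = 0.9, P(s₁|a₂) = 0, P(a₁|i₂) = 0, and the TPB weights are 1. For DTM it reports the quantity that drives intention selection in Equation 7, E[U_total|i₁] − E[U_total|i₂].

```python
def tpb_bi(value, self_efficacy, w1=1.0, w3=1.0, p_open_given_a1=0.9):
    """Equation 3 without subjective norm: self-efficacy is ADDED to attitude."""
    attitude = p_open_given_a1 * value
    return w1 * attitude + w3 * self_efficacy


def dtm_intention_drive(value, self_efficacy, p_a1_given_i2=0.0,
                        p_open_given_a1=0.9, p_open_given_a2=0.0):
    """E[U|i1] - E[U|i2] from Equation 6': self-efficacy MULTIPLIES utility."""
    eu_a1 = p_open_given_a1 * value
    eu_a2 = p_open_given_a2 * value

    def eu_given_intention(p_a1):
        return p_a1 * eu_a1 + (1.0 - p_a1) * eu_a2

    return eu_given_intention(self_efficacy) - eu_given_intention(p_a1_given_i2)


# Total paralysis: perceived self-efficacy P(a1 | i1) = 0.
results = [
    (value, tpb_bi(value, 0.0), dtm_intention_drive(value, 0.0))
    for value in (1.0, 10.0, 100.0)
]
for value, bi, drive in results:
    print(value, bi, drive)
```

Under these assumptions the TPB score keeps growing with the value of the contents, while the DTM quantity stays at zero because the term weighted by self-efficacy vanishes.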

Discussion

In the present paper, we show that TPB could be considered as an attempt to improve the EUT's accuracy of predicting behavior change, by incorporating subjective norm and self-efficacy. Indeed, TPB has achieved great success, because it is a relatively simple model, and its three factors are actually effective in promoting behavior change (Sheeran et al., 2016). Applying TPB has allowed investigators to identify important psychological factors to understand, predict, and change human social behavior (Van Lange et al., 2011). Moreover, behavior change interventions applying TPB were actually effective in two-thirds of studies (Hardeman et al., 2002), indicating that TPB is appropriate for clinical application.

However, TPB has a serious problem. Because subjective norm and perceived self-efficacy are simply added to the standard expected utility in TPB, it is not consistent with EUT, and thus, cannot be connected with behavioral economics models. To overcome this problem, we propose a new behavior change model, DTM, which includes the components of subjective norm and self-efficacy as a natural extension of EUT.

As DTM is consistent with EUT, it can be easily extended in several ways. First, DTM can handle intertemporal choices by using temporally discounted utility. In particular, hyperbolic discounting, which is well-studied in behavioral economics, is important for behavior change because it can express procrastination (Story et al., 2014). Second, DTM can be easily extended to a Markov model by introducing a Markov decision process (MDP) framework. Markov models are useful when the situation is continuous over time, and important events may happen more than once (Sonnenberg and Beck, 1993; Sutton and Barto, 1998). Because most current neural models of the reward system are based on MDP, this extension enables us to combine behavior change models with pharmacological models of aberrant behavior such as addiction (Redish, 2004; Rangel et al., 2008). Third, we simply defined U_total as the weighted sum of U_self and U_others in the present paper, but other ways of formulating U_total are possible when considering various types of social preferences, such as inequality aversion, guilt aversion, and Rawlsian preferences (Fehr and Krajbich, 2014). Fourth, DTM could be applicable to studies about morality (Crockett, 2013). In DTM, we introduced a distinction between action and intention into EUT, and this distinction is an important feature of moral judgement (Cushman, 2008). Utility in DTM is suitable to represent moral values, because it could be a function of not only action and outcome, but also intention [i.e., U_self(i_h, a_j, s_n), U_others(i_h, a_j, s_n)].
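To make the first extension more concrete, the sketch below shows how hyperbolic discounting can produce the preference reversal often associated with procrastination. The paper does not specify a discount function; the form V = A / (1 + kD) used here is the commonly cited hyperbolic form and is an assumption of this example, as are the amounts, delays, and discount rate.

```python
def hyperbolic_value(amount, delay, k):
    """Discounted value V = amount / (1 + k * delay) (assumed hyperbolic form)."""
    if delay < 0 or k < 0:
        raise ValueError("delay and k must be non-negative")
    return amount / (1.0 + k * delay)


def prefers_later(small, large, gap, front_delay, k):
    """True if the larger-later reward is valued above the smaller-sooner one."""
    sooner = hyperbolic_value(small, front_delay, k)
    later = hyperbolic_value(large, front_delay + gap, k)
    return later > sooner


# Hypothetical choice: 50 units now vs. 100 units ten time steps later.
k = 0.2
choice_far_ahead = prefers_later(50, 100, gap=10, front_delay=30, k=k)
choice_at_hand = prefers_later(50, 100, gap=10, front_delay=0, k=k)
print(choice_far_ahead, choice_at_hand)
```

Viewed from a distance the agent in this sketch prefers the larger, later reward, but switches to the smaller, immediate one once it becomes available. In a DTM extension, such discounted values would replace the undiscounted utilities of delayed states.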

We hope that DTM leads to a better combination of existing models of behavior change and behavioral economics models.

Author Contributions

KaM conceptualized the data. KaM and KeM wrote the paper. KaM, KeM, KI, and YK revised the paper.

Funding

This research was partially supported by JSPS KAKENHI Grant Nos. JP15H03124, JP17H05929, and 18H05085, and by the program for Brain Mapping by Integrated Neurotechnologies for Disease Studies (Brain/MINDS) from the Japan Agency for Medical Research and Development (AMED).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank Yukihito Yomogida, Ayaka Sugiura, Ryuta Aoki, and the reviewers for helpful discussions and comments on previous versions of this manuscript.

References

Ajzen, I. (1985). “From intentions to actions: a theory of planned behavior,” in Action Control: From Cognition to Behavior, eds J. Kuhl and J. Beckmann (Berlin, Heidelberg: Springer Berlin Heidelberg), 11–39.

Ajzen, I. (1991). The theory of planned behavior. Organ. Behav. Hum. Decis. Process. 50, 179–211. doi: 10.1016/0749-5978(91)90020-t

Bandura, A. (1977). Self-efficacy - toward a unifying theory of behavioral change. Psychol. Rev. 84, 191–215. doi: 10.1037/0033-295x.84.2.191

Bandura, A. (1982). Self-efficacy mechanism in human agency. Am. Psychol. 37, 122–147. doi: 10.1037/0003-066x.37.2.122

Crockett, M. J. (2013). Models of morality. Trends Cogn. Sci. 17, 363–366. doi: 10.1016/j.tics.2013.06.005

Cushman, F. (2008). Crime and punishment: distinguishing the roles of causal and intentional analyses in moral judgment. Cognition 108, 353–380. doi: 10.1016/j.cognition.2008.03.006

Edwards, W. (1954). The theory of decision making. Psychol. Bull. 51, 380–417.

Fehr, E., and Krajbich, I. (2014). “Social preferences and the brain,” in Neuroeconomics, 2nd Edition. eds P. W. Glimcher and E. Fehr (San Diego, CA: Academic Press), 193–218.

Fishbein, M., and Ajzen, I. (2010). Predicting and Changing Behavior: The Reasoned Action Approach. New York, NY: Taylor and Francis.

Fishbein, M., and Cappella, J. N. (2006). The role of theory in developing effective health communications. J. Commun. 56, S1–S17. doi: 10.1111/j.1460-2466.2006.00280.x

Hardeman, W., Johnston, M., Johnston, D., Bonetti, D., Wareham, N., and Kinmonth, A. L. (2002). Application of the theory of planned behaviour in behaviour change interventions: a systematic review. Psychol. Health 17, 123–158. doi: 10.1080/08870440290013644a

Kahneman, D., and Tversky, A. (1979). Prospect theory: an analysis of decision under risk. Econometrica 47, 263–291.

Luce, R. D. (1959). Individual Choice Behavior. Oxford: John Wiley.

Rangel, A., Camerer, C., and Montague, P. R. (2008). A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci. 9, 545–556. doi: 10.1038/nrn2357

Redish, A. D. (2004). Addiction as a computational process gone awry. Science 306, 1944–1947. doi: 10.1126/science.1102384

Roberto, C. A., and Kawachi, I. (2015). Behavioral Economics and Public Health. Oxford: Oxford University Press.

Schoemaker, P. (1982). The expected utility model: its variants, purposes, evidence and limitations. J. Econ. Lit. 20, 529–563.

Sheeran, P., Maki, A., Montanaro, E., Avishai-Yitshak, A., Bryan, A., and Klein, W. M., et al. (2016). The impact of changing attitudes, norms, and self-efficacy on health-related intentions and behavior: a meta-analysis. Health Psychol. 35, 1178–1188. doi: 10.1037/hea0000387

Sniehotta, F. F., Presseau, J., and Araújo-Soares, V. (2014). Time to retire the theory of planned behaviour. Health Psychol. Rev. 8, 1–7. doi: 10.1080/17437199.2013.869710

Sonnenberg, F. A., and Beck, J. R. (1993). Markov models in medical decision making: a practical guide. Med. Decision Making 13, 322–338. doi: 10.1177/0272989x9301300409

Story, G. W., Vlaev, I., Seymour, B., Darzi, A., and Dolan, R. J. (2014). Does temporal discounting explain unhealthy behavior? A systematic review and reinforcement learning perspective. Front. Behav. Neurosci. 8:76. doi: 10.3389/fnbeh.2014.00076

Sutton, R. S., and Barto, A. G. (1998). Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.

Van Lange, P., Kruglanski, A., and Higgins, T. (2011). Handbook of Theories of Social Psychology. Thousand Oaks, CA: SAGE Publications Ltd.

von Neumann, J., and Morgenstern, O. (1947). Theory of Games and Economic Behavior. Princeton, NJ: Princeton University Press.

Keywords: Theory of Planned Behavior, self-efficacy, Social Cognitive Theory, expected utility theory, Markov decision process

Citation: Matsumori K, Iijima K, Koike Y and Matsumoto K (2019) A Decision-Theoretic Model of Behavior Change. Front. Psychol. 10:1042. doi: 10.3389/fpsyg.2019.01042

Received: 01 December 2018; Accepted: 23 April 2019;
Published: 21 May 2019.

Edited by:

Giada Pietrabissa, Catholic University of the Sacred Heart, Italy

Reviewed by:

Marcelo De Souza Lauretto, University of São Paulo, Brazil
Ken-Ichiro Tsutsui, Tohoku University, Japan

Copyright © 2019 Matsumori, Iijima, Koike and Matsumoto. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kaosu Matsumori; Kenji Matsumoto
