Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Psychol., 25 September 2025

Sec. Sport Psychology

Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1657506

This article is part of the Research TopicTowards a Psychophysiological Approach in Physical Activity, Exercise, and Sports-Volume VView all 20 articles

Psychological and technological predictors of the physical activity intention-behavior gap: an explainable machine learning analysis

Yirong LiYirong LiJianguang Cai
Jianguang Cai*
  • School of Physical Education, Hunan University of Science and Technology, Xiangtan, China

Objective: Intention is widely regarded as the most proximal predictor of behavior. But, physical activity (PA) intentions do not invariably translate into actual exercise behavior, leaving a intention-behavior (I-B) gap. The study integrates psychological and technological frameworks to examine the mechanisms that moderate the PA I-B gap.

Methods: Unlike traditional dichotomous measures of the PA I-B gap, this study employs baseline correction to derive a standardized continuous measure that quantifies the magnitude of the gap. Using survey data from 1,334 Chinese adults, we combined the Health Belief Model and the Unified Theory of Acceptance and Use of Technology within an explainable machine-learning framework to identify important predictors and their non-linear interactions.

Results: The machine learning based optimal XGBoost model (R2 = 0.647) significantly outperforms traditional regression approaches. Perceived barriers, self efficacy, intention to use smart tools and social support emerge as the four core predictors of the PA I-B gap. Higher levels of perceived barriers and late night frequency enlarge the gap whereas greater self efficacy, perceived exercise benefits, intention to use smart tools, social support, social influence and personal innovation narrow it. The psychological cognition dimension exhibits significantly stronger predictive power than smart sports tools. These tools function primarily as auxiliary resources, and their facilitative effects differ across distinct psychological cognition levels.

Conclusion: Psychological cognition and smart sports tools jointly predict the PA I-B gap. The study’s conclusions are constrained by its reliance on self-reported measures and its cross-sectional design. Future research should adopt longitudinal or experimental protocols, supplemented by objective data from wearable devices, to delineate causal pathways and illuminate the finer mechanisms underlying the gap.

1 Introduction

Regular physical activity (PA) is universally acknowledged as essential for enhancing health and preventing diseases. It plays a crucial role in both mitigating noncommunicable diseases such as cardiovascular conditions, cancers, and diabetes, and alleviating psychological issues like depression and anxiety. These benefits extend across various life stages, profoundly influencing overall well-being (World Health Organization, 2024). Despite a widespread intention to PA, translating these intentions into consistent PA poses a significant challenge. According to latest WHO data, 31% of adults worldwide do not achieve the recommended minimum of 150 min of moderate-intensity PA weekly (Strain et al., 2024). Despite strong PA intentions, individuals often refrain from acting on intended behaviors due to various barriers (e.g., high stress, other priorities, and waning motivation), which is a phenomenon widely recognized as the PA I-B gap (Rhodes and de Bruijn, 2013; Englert et al., 2023).

Scholars have been investigating the critical factors that influence the transition from PA intentions to actual behavior, recognizing the importance of bridging the I-B gap for enhancing PA levels and public health outcomes. Huang and Zhang (2024) empirical research on the I-B gap identified perception variables within the Health Belief Model and Theory of Planned Behavior as pivotal in converting PA intentions into tangible actions. Furthermore, the use of smart sports tools, including fitness applications (Anderberg et al., 2025) and wearable technology (Song et al., 2018), has been shown to bolster individual motivation and enhance athletic performance through monitoring and support. Healthy psychological cognition serves as an internal drive for behavior, while intelligent sports offer external support and behavioral interventions, collectively facilitating the successful translation of PA intentions into practice.

Current research predominantly employs traditional econometric methods to examine the relationships between influencing factors and the PA I-B gap. However, behavioral mechanisms frequently exhibit non-linear and high-dimensional interactive characteristics that simple parametric specifications cannot fully capture, potentially leading to biased estimates (Zhuo and Yunsong, 2025). In recent years, machine learning algorithms have become essential tools for prediction tasks in complex settings because of their capacity to handle high-dimensional covariates, capture non-linear relationships and reduce model bias. Within behavioral research, these algorithms have already been applied to learning (Su et al., 2022), internet addiction (Gan et al., 2025) and sports (Liu et al., 2023), establishing a new research paradigm. Nevertheless, machine learning prioritizes predictive accuracy, which is fundamentally different from the objective of causal inference—namely, the unbiased estimation of parameters or effects (Mullainathan and Spiess, 2017). Consequently, relying solely on predictive algorithms is insufficient for obtaining unbiased causal estimates of the relationship between independent and dependent variables.

To address this limitation, Double Machine Learning (DML) integrates high-dimensional machine learning techniques with traditional causal identification strategies by incorporating residual orthogonalization and cross-validation into a conventional causal framework. When numerous observable confounders are present, DML delivers causal effect estimates accompanied by confidence intervals. This approach demonstrates clear advantages in handling high-dimensional data, relaxing functional form assumptions, estimating conditional treatment effects, and enhancing the accuracy and reliability of causal inference (Chernozhukov et al., 2018).

Despite these strengths, complex algorithms are often perceived as black boxes, restricting the interpretability of their decision-making processes. The SHAP (SHapley Additive exPlanations) algorithm, grounded in Shapley value theory, decomposes model predictions into the marginal contributions of individual features, quantifying the influence of each variable on the outcome and presenting key pathways and threshold effects through intuitive visualizations (Lundberg and Lee, 2017).

Building on these considerations, our study first employs machine-learning algorithms to construct predictive models of the PA I-B gap and subsequently applies Double Machine Learning to conduct causal inference. SHAP is then introduced to provide transparent visual explanations. By systematically evaluating the predictive and causal impacts of psychological cognition and smart sports, this study aims to furnish rigorous evidence for the design of precise and actionable intervention strategies.

1.1 Physical activity intention-behavior gap

Within the domain of PA research, intention has consistently emerged as a paramount and dependable predictor, serving as a critical proximal determinant of behavioral enactment (Hagger and Chatzisarantis, 2009; Roberts et al., 2010). Nonetheless, while intentions hold significant predictive power, empirical assessments of their predictive efficacy have demonstrated that they seldom account for all variances in behavior. As highlighted in a meta-analysis by Rhodes and Dickau (2012), modest shifts in intent correspond to minimal behavioral alterations. This divergence between intended and actual behavior is termed the PA I-B gap, which constitutes the central focus of the present study.

In recent years, scholars have examined the mechanisms underlying the translation of PA intention into behavior from two complementary perspectives: the individual and the environment. At the individual level, research has primarily focused on the intrinsic properties of intention itself, including psychological attributes such as self-efficacy (Hou et al., 2022), habit strength (Di Maio et al., 2021) and personality traits (MacCann et al., 2015). Simultaneously, demographic variables including gender, age, educational attainment and health status have been demonstrated to exert differential influences on this relationship (Yang et al., 2025; Knapova et al., 2024). At the environmental level, earlier studies have shown that objective factors such as the availability and accessibility of sports facilities (Santinha et al., 2022), together with subjective factors such as social support (Schumacher et al., 2021) and cross-cultural variations between individualistic and collectivistic exercise cultures (Gurleyik et al., 2022), have moderated the I-B pathway. Collectively, these elements have formed a complex, intertwined network of potential confounders.

Despite this progress, extant research has predominantly relied on traditional statistical techniques, namely linear regression (Schumacher et al., 2021) and mediation or moderation models (Hou et al., 2022; Yang et al., 2025; Knapova et al., 2024). Although these approaches have established several significant associations, they have remained limited in their capacity to identify critical variables precisely within high-dimensional feature spaces and to characterize nonlinear relationships adequately. Moreover, minor alterations in model specifications have yielded divergent interpretations (Rebar et al., 2019), thereby undermining the robustness and generalisability of the findings.

Much of the existing research employs a categorical framework to measure the I-B gap, categorizing individuals into quadrants based on their intentions (intentioned/non-intentioned) and outcomes (successful/unsuccessful)(Sheeran, 2002). Additionally, some studies have evaluated the correlation between intention strength and behavior through fixed statements, such as I plan to engage in PA at least N times per week, coupled with response scales that have gauged the degree of agreement (Maltagliati et al., 2025). Although these methods have been prevalent in PA research, they have exhibited inherent limitations: they have failed to provide quantitative analyses of the gap’s magnitude and have struggled to capture individual nuances. For instance, existing measurement techniques have been unable to differentiate accurately and quantify the gap between individuals who have aimed for seven weekly PA sessions but have achieved only five and those who have intended to exercise once but have failed to do so (Burnett et al., 2018). Consequently, comparing I–B gaps across individuals with varying baseline levels has remained challenging, thereby constraining the precise assessment of the gap’s extent.

1.2 HBM-UTAUT theoretical model

This study centrally aims to investigate the discrepancy between individuals’ PA intentions and their actual behaviors. The majority of prior research has concentrated on intra-individual factors, including personality differences (Maltagliati et al., 2025), PA procrastination (Miao et al., 2024) and PA preferences (Rebar et al., 2016), often neglecting the impact of external environmental factors on the I-B relationship. Amidst the rising prevalence of digital lifestyles, the significance of smart sports tools as convenient and efficient instruments for enhancing PA intentions and PA has garnered increasing attention (Yang and Koenigstorfer, 2021). By integrating the Health Belief Model with the Unified Theory of Acceptance and Use of Technology (UTAUT), this study establishes a comprehensive theoretical framework that encapsulates both psychological cognition and technological support. This psycho-technology dual-path model offers a holistic view to elucidate the disparities between PA intentions and behaviors, thereby furnishing a theoretical foundation for devising effective intervention strategies.

The Health Belief Model (HBM) encompasses components such as perceived benefits, perceived barriers, perceived severity and self-efficacy, which are pivotal in assessing attitudes toward health conditions (Rahmati-Najarkolaei et al., 2015). It stands as one of the seminal theories in behavioral health. This model, renowned for its ability to identify, elucidate, and forecast health-related behaviors, as well as to guide preventive measures, has been extensively applied to anticipate and interpret the motivational factors driving PA across diverse demographics, including the elderly (Qiao et al., 2021), pregnant women (Shafieian and Kazemi, 2017) and college students (Sheng et al., 2023). Given its relevance to the study of individual PA, and considering its frequent application in examining the PA I-B gap, the HBM is integrated into the theoretical framework of the present research.

The Unified Theory of Acceptance and Use of Technology (UTAUT), introduced by Venkatesh et al. (2003), offers a comprehensive framework that synthesizes insights from eight pre-existing models. This theory primarily utilizes four core constructs—performance expectations, effort expectancy, social influence and facilitating conditions—to elucidate users’ intentions and behaviors regarding the adoption of specific technologies, alongside other significant factors. Smart sports refers to the use of smart devices, apps, virtual reality and other technical means to provide individuals with PA guidance, feedback and motivation. Studies have shown that UTAUT is a robust predictive model of technology acceptance, and it is widely used to study smart sports initiatives, including live sports video platforms (Xiang et al., 2024), fitness applications (Liu et al., 2019), and smart wearables (Seol et al., 2017). The UTAUT model goes beyond the binary analysis of technology use to delve into how various attributes of intelligent motion affect the motor I-B gap (Wei et al., 2021) and proves particularly good at assessing users’ propensity to accept new technologies. Therefore, this study integrates UTAUT into its theoretical framework to enhance the understanding of the adoption of intelligent motor technology and its impact on motor behavior.

1.3 Double machine learning

Over the past decade, machine-learning algorithms have been widely adopted by social scientists for data generation and prediction tasks. Across economics, sociology and psychology, causal identification has become the central concern of empirical inquiry (Abadie and Cattaneo, 2018). For machine-learning algorithms engaged in data prediction, it is sufficient to establish correlations among variables; causal relationships are not required. Yet Kleinberg points out that the principal goal of most machine-learning models is prediction: observable correlations between features and outcomes suffice, and causal structure is dispensable (Kleinberg et al., 2015). This orientation leads many algorithms to neglect the underlying causal chain, focusing solely on predictive performance and thereby creating a methodological gap with the mainstream causal-identification literature in the social sciences. Nevertheless, machine learning and causal inference are not inherently incompatible. Chernozhukov et al. (2018) integrate the high-dimensional flexibility of machine learning with the identification strategies of classical causal inference to propose Double Machine Learning (DML). By retaining the adaptability of machine-learning algorithms while leveraging residual orthogonalization and cross-validation, DML delivers unbiased estimates of causal effects and offers a novel and important technical pathway for causal identification in the social sciences.

DML is implemented in two stages. The first stage is a pure prediction task aimed at obtaining highly accurate forecasts of the outcome variable. Guided by variable types and data structure, researchers use cross-validation to select the best performing model from among SVM, Random Forest and XGBoost, and extract residuals for subsequent orthogonalization. The second stage estimates the causal effect of the treatment variable on the outcome. Given the potential non-linear relationships among variables, polynomial regression or non-parametric strategies such as causal random forests are typically employed. Non-parametric approaches avoid prespecified functional forms and allow confidence intervals to be constructed through bootstrap sampling, with traditional hypothesis tests or interval estimates providing the statistical basis for inference (Chernozhukov et al., 2018). It should be noted, however, that DML is not without limitations; in particular, algorithmic opacity and insufficiently revealed parameter heterogeneity remain concerns. Consequently, after obtaining causal-effect estimates, one can incorporate the SHAP explainable machine-learning framework to decompose model predictions into the marginal contributions of individual features and to conduct case-level heterogeneity analyses.

2 Methods

2.1 Data source

To ensure the representativeness and validity of the smart-sport dimension data, the sampling framework was aligned with the gender and age distribution of habitual smart-sport users. Under the constraint of an approximately balanced gender ratio, the target population was restricted to adults aged 18–45 years. Accordingly, questionnaires were distributed via Wenjuanxing to residents in Changsha, Zhuzhou and Xiangtan, Hunan Province, between 1 and 20 April 2025. The study protocol and online informed consent form were approved by the Ethics Committee of Hunan University of Science and Technology, and all participants gave written informed consent before they began the questionnaire. A total of 1,428 questionnaires were received; after excluding questionnaires completed in <5 min or >10 min and those with clearly consistent answers, 1,334 valid questionnaires were retained. The final sample comprised 49.1% males and 50.9% females; 86.9% were aged 18–45. Educational attainment ranged from below junior high to master’s degree, with 87.9% having completed senior high school or higher. The two most common occupational categories were private-sector employees (27.8%) and students (27.3%). The sample’s demographic composition closely mirrors the actual user profile of smart-sports technologies, indicating strong representativeness for the population of interest.

2.2 Variable description

2.2.1 Physical activity intention-behavior gap (PA I-B gap)

Based on the Physical Activity Rating Scale-3 (PARS-3) (Liang, 1994), operationalisation proceeded in two steps. First, each participant’s actual PA score was calculated as intensity × duration × frequency (maximum = 100). Second, under unconstrained conditions, each participant’s ideal PA score was computed as ideal intensity × ideal duration × ideal frequency (maximum = 100). Because baseline PA levels differ markedly between individuals, a simple raw difference can misrepresent the true gap: for example, an absolute discrepancy of 20 points is less consequential for a high-achiever (actual = 60, ideal = 80) than for a low-achiever (actual = 20, ideal = 40). Following standard practice in medical research, we applied baseline correction to convert the absolute difference into a relative index, ΔZ = (Ideal − Actual)/σ_actual, where σ_actual denotes the standard deviation of the actual scores across the entire sample. This index quantifies how many standard deviations an individual’s I-B gap deviates from the population mean gap, while preserving the validity of PARS-3 and eliminating baseline heterogeneity. The formula as follows:

ΔZ = Ideal Actual σ _ actual .

To evaluate the utility of ΔZ, a machine-learning model with ten-fold cross-validation was used to compare its predictive performance with that of the traditional absolute gap (Z). ΔZ achieved R2 = 0.647, significantly outperforming Z (R2 = 0.47), thereby demonstrating superior statistical performance and interpretability and establishing ΔZ as a robust, comparable core variable for subsequent analyses.

2.2.2 Health Belief Model (HBM) and the Unified Theory of Acceptance and Use of Technology (UTAUT)

Drawing on the PA health belief scale designed by Huang and Zhang (2024) and Dai et al. (2011), this study assessed individual HBM across five dimensions: perceived benefits (BEN), affective attitude (AT), perceived barriers (BAR), perceived severity (SEV), and self-efficacy (SE). Smart sports tools refer to intelligent products that enhance the athletic experience through the application of smart technology, which primarily include fitness apps, smart wearable devices and exercise-focused short videos. Building on the UTAUT (Unified Theory of Acceptance and Use of Technology) scale designs by Venkatesh et al. (2003) and Featherman and Pavlou (2003), this study measures respondents’ perceptions of smart sports tools across seven dimensions: performance expectancy (PE), effort expectancy (EE), social influence (SI), facilitating conditions (FC), personal innovation (PN), perceived risk (PR), social support(SS) and usage intention (UI). All items were evaluated using a 7-point Likert scale.

Reliability analysis revealed that the Cronbach’s alpha coefficients for all latent variables ranged from 0.72 to 0.89, exceeding the threshold of 0.70, indicating strong internal consistency of the scales. Exploratory factor analysis demonstrated that the KMO values for the HBM and UTAUT scales were 0.848 and 0.786, respectively (both above 0.60), and Bartlett’s test of sphericity yielded a significance level of p < 0.001, confirming the structural validity of the questionnaire for factor analysis. Confirmatory factor analysis indicated that the composite reliability (CR) for each factor was greater than 0.7, and the average variance extracted (AVE) was greater than 0.5, suggesting that all dimensions in this study have good construct reliability and convergent validity, aligning with theoretical expectations. The operationalization of the items for each variable and their validity assessment results are presented in Table 1. It should be noted that the standardized factor loading for BAR1 (The area where I reside lacks suitable sports facilities) is at the threshold level (loading = 0.504), yet it still exceeds the customary lower bound of 0.50. As a core measure of objective environmental accessibility, this item has been empirically demonstrated in the extant literature to significantly facilitate PA behavior by modulating psychological states (Xue and Li, 2023). Balancing empirical evidence with theoretical relevance, we retained BAR1 to enhance the model’s coverage of contextual factors.

Table 1
www.frontiersin.org

Table 1. Operationalized items and validity test results.

2.2.3 Characteristics of PA

Research indicates that PA I-B gap is shaped by a multifaceted profile of PA engagement, including: (1) the proficiency level in PA, which indicates an individual’s experience and skill in PA (Gao et al., 2021) and is a pivotal predictor of PA behavioral intentions; (2) modes of exercise engagement, which cover various participation formats such as individual workouts, partnered activities, and club-based sessions (Gut et al., 2020), influencing the frequency and consistency of PA; (3) exercise motivations, reflecting the range of reasons individuals have for engaging in sports, including health, leisure, socializing, weight management, and personal interests (Liu et al., 2023), with different motivations correlating with the vigor and persistence of PA; and (4) the economic investment in sports, which pertains to the financial outlay on sports gear and gym subscriptions (Chen et al., 2024). These dimensions collectively contribute to understanding and potentially narrowing the I-B gap in PA.

2.2.4 Basic demographic information

This study selects demographic and health-related behavioral variables to control for potential confounding factors. Demographic variables include gender, age, education level, current occupation and monthly income level; health-related characteristics primarily consist of perceived health status, frequency of staying up late, and Body Mass Index (BMI), all of which are incorporated into the statistical analysis. The assignment of values to each variable and descriptive statistical analysis are presented in Table 2.

Table 2
www.frontiersin.org

Table 2. Variables description and descriptive statistical analysis.

Based on the questionnaire design, 68 sub-variables were systematically encoded and integrated into 30 composite variables, consisting of one outcome variable and 29 predictor variables, with no missing observations. Among the predictors, continuous variables were standardized to a mean of zero and a standard deviation of one via the z-score method, whereas categorical variables were recoded using one-hot encoding. Subsequent Pearson correlation analyses revealed that all predictors were<0.60, confirming independence and absence of multicollinearity, which allowed the variables to be entered into the subsequent modelling procedures.

2.3 Double machine learning construction

Compared with traditional regression, Double Machine Learning (DML) offers clear advantages in handling high-dimensional covariates, multicollinearity and nonlinear relationships. Its core logic is a two-step procedure: first, a high-precision predictive model is constructed; second, causal effects are estimated using parametric or non-parametric strategies. Guided by this paradigm, the present study established the following analytical pipeline.

2.3.1 Construction of a high-precision predictive model

To quantify the PA I-B gap, we systematically evaluated the explanatory power of five algorithms—ordinary least squares (OLS), decision tree (D-Tree), support vector machine (SVM), random forest (R-Forest) and extreme gradient boosting (XGBoost)—against a high-dimensional, nonlinear data structure. The workflow comprised three sequential stages: training, validation, and hyper-parameter optimisation. After an 80:20 random split of the data into training and testing sets, a five-fold cross-validated grid search was performed on the training data. Model performance was adjudicated using mean squared error (MSE), root-mean-square error (RMSE), mean absolute error (MAE), and the coefficient of determination (R2).

Table 3 summarized the comparative results. XGBoost achieved the lowest MSE, RMSE, and MAE, while simultaneously registering the highest R2(0.613), thereby outperforming the alternative algorithms by a clear margin. Accordingly, the grid-optimized XGBoost model was selected as the base learner for the subsequent causal-inference pipeline.

Table 3
www.frontiersin.org

Table 3. Comparison of the performance of machine learning algorithms.

2.3.2 Causal effect estimation

Upon validating the superior predictive performance of the XGBoost model, we incorporated it into the ‌Causal Forest DML‌ framework to estimate the ‌Average Treatment Effect (ATE)‌ of key features on the ‌PA I-B gap‌. In this framework, the continuous outcome variable Y represents the PA I-B gap, the treatment variable T consists of the core features selected via XGBoost-SHAP importance analysis, and the confounding variables W include all observed covariates except the specific treatment variable under investigation. To eliminate the influence of confounding variables and ensure unbiased estimation of the causal effect θ of T on Y, the estimation process strictly follows the canonical two-stage procedure of DML, incorporating residual orthogonalization and cross-fitting techniques.

1. Cross-Fitting and Residual Estimation. By leveraging the XGBoost model to assess the impact of the confounding variable W on the dependent variable Y and the treatment variable T, the estimated values ĝ(W) and ĥ(W) are obtained, respectively. Subsequently, the residuals Ey and Et of the two predictive models are computed, with the specific formulas as follows:

E y = Y g ̂ ( W ) , g ̂ ( W ) = E ( Y W )
E t = Y h ̂ ( W ) , h ̂ ( W ) = E ( Y W )

2. ATE Estimation. The residual Et is employed to fit Ey, aiming to estimate the parameter θ, which serves as an unbiased estimator of the ATE between T and Y:‌

E y = θ 0 E t +

3. Mitigation of Overfitting Bias through Cross-Fitting. To ensure the robustness of the estimation, the K-Fold cross-validation method is adopted, where the data are partitioned into K folds, and the estimation process is repeated K times. This separation of model training and residual calculation effectively controls the systematic errors in the residuals. The final estimate of the causal effect is given by:

θ = 1 K 1 k θ k

In brief, the core procedure of DML consists of two stages: first, XGBoost is employed to model both the outcome variable Y and the treatment variable T, and the residuals of Y and T are obtained by subtracting the predicted values from the observed values; second, a non-parametric model is used to fit the relationship between these residuals, thereby estimating the causal effect of the key feature on Y. Repeating the above steps via K-fold cross-fitting effectively controls overfitting bias and yields a robust and interpretable estimate of the ATE along with its confidence interval.

3 Results

3.1 Construction of the optimal model

To ensure the robustness of the model, this study employed a method of multiple random splits to mitigate the stochastic effects of data partitioning. Specifically, the sample was divided using 20 distinct random seeds for proportional stratification, followed by a grid search with 10-fold cross-validation via GridSearchCV to ascertain the optimal hyperparameters. Through this process, the XGBoost model achieved a mean coefficient of determination (R2) of 0.613, which was significantly higher than that of other machine learning algorithms, reaffirming its status as the superior predictive model. With the optimal hyperparameter configuration, the R2 of the XGBoost model further increased to 0.647, indicating its high accuracy and generalizability in forecasting the PA I-B gap among college students. The optimal model hyperparameters are detailed in Table 4.

Table 4
www.frontiersin.org

Table 4. Parameter settings of the XGBoost algorithm.

3.2 Analysis of important features

Accurately identifying and ranking key predictors is a central task in machine learning forecasting. Using the optimal XGBoost algorithm and the SHAP framework, this study independently assessed the relative importance of features contributing to the PA I-B gap. Figure 1 presents the ten most influential variables obtained from each method; eight of the ten highest-ranked features overlap between algorithms, attesting to model stability and the persistent salience of these predictors. Although their exact ranks differ slightly, perceived barriers (BAR), self-efficacy (SE), intention to use smart tools (UI) and social support (SS) consistently occupy the top four positions and jointly account for more than 50% of the total SHAP value. Additionally, aggregating the SHAP contributions of these four variables reveals that the psychological-cognition dimension (BAR+SE) accumulates to 0.233, approximately 1.7 times the contribution of the smart-sport dimension (UI + SS = 0.140). Thus, compared with technological factors, psychological cognition exerts a substantially stronger influence on attenuating the gap, whereas smart-sport tools play a secondary, supportive role. Moreover, the frequency of staying up late, as an indicator of an unhealthy lifestyle, maintains a prominent rank with a stable SHAP value of approximately 0.05, further corroborating the significant predictive value of sleep behavior for the gap.

Figure 1
Side-by-side bar charts show the top ten feature importances. The left chart depicts XGBoost importances with

Figure 1. Ranking of feature importance.

3.3 Explanatory analysis of important features

Drawing on eight overlapping key features, this study employed the SHAP framework to decompose the average marginal contribution of each predictor to the XGBoost model’s output and to clarify its directional effect. Figure 2 presents a SHAP summary plot in which features are arranged in descending order of importance along the vertical axis; each dot represents an individual sample, its horizontal position indicating the marginal effect on the predicted outcome, while the blue-to-red color gradient denotes the corresponding feature value from low to high.

Figure 2
Dot plot showing SHAP values (impact on model output) for different features: BAR, SE, UI, SS, Midnight, BEN, SI, and PN. Colors range from blue (low value) to red (high value).

Figure 2. SHAP beeswarm plot of important features.

The results indicate that BAR constitute the foremost predictor, exerting a positive effect whereby higher values significantly widen the PA I-B gap. SE ranks second, with elevated values markedly narrowing the gap. Subsequently, UI and SS, key dimensions of smart sports technology, demonstrate negative effects, such that higher values are associated with a smaller gap. Late-night behavior exhibits a upward association, with increasing frequency corresponding to an enlarged gap. In contrast, BEN, SI and PN, although negatively related to the gap, display SHAP absolute values significantly lower than those of the top four predictors, indicating a relatively limited influence. Overall, psychological-cognitive variables exert a greater contribution to the I-B gap than smart-sports variables, and the directional effects are fully consistent with theoretical expectations.

3.4 Causal analysis of important features

After partialling out high-dimensional confounders, we estimated the Average Treatment Effects (ATEs) of eight focal variables on the PA I-B gap using Causal Forests. All estimates were obtained within a Double Machine Learning framework that combined residual orthogonalization with cross-fitting; standard errors and 95% confidence intervals (CI) were constructed via 5,000 bootstrap replications using normal approximation.

Table 5 shows that the ATEs for BAR and Midnight are significantly upward [BAR: 0.186, 95% CI = (0.160, 0.211), p < 0.001; Midnight: 0.108, 95% CI = (0.088, 0.129), p < 0.001]. A one-standard-deviation increase in either variable widens the PA I-B gap by 0.186 and 0.108 units, respectively. SE, UI, BEN, SS, SI and PN yield significantly downward ATEs. Among them, SE exhibits the largest effect size [−0.157, 95% CI = (−0.192, −0.122), p < 0.001], indicating the strongest causal impact on narrowing the gap. None of the 95% CIs include zero, reinforcing the robustness of these causal estimates.

Table 5
www.frontiersin.org

Table 5. ATE and 95% CI of important variables.

3.5 Dependency explanation of single-variable

Under the established causal significance, we generated SHAP dependence plots with the feature value on the horizontal axis and the SHAP value on the vertical axis; values above zero indicate an amplification of the PA I-B gap, whereas values below zero indicate a reduction. A LOWESS curve was superimposed to delineate the linear and non-linear associations with the gap.

Figure 3 shows that BAR, SE, BEN, Midnight, and PN follow smooth monotonic trajectories without inflection points, consistent with linear relationships. As BAR and Midnight increase, their positive contributions to SHAP values rise linearly, thereby widening the I-B gap; conversely, higher levels of SE, BEN, and PN linearly attenuate the gap. UI, SS, and SI exhibit non-linear patterns. UI displays an inverted U-shaped relationship: when the UI index falls below −2.5 or exceeds 0, SHAP values drop to low levels, suggesting that both minimal and intensive use of smart-sport tools can markedly reduce the gap. SS exhibits a clear threshold effect: SHAP values decline sharply once the SS index surpasses 0.5, indicating that high social support significantly shortens the gap. SI presents a mild U-shaped curve: SHAP values are lowest when the SI index lies between 0.4 and 1.2, implying that either insufficient or excessive social influence may enlarge the gap. Collectively, the coexistence of linear and non-linear patterns reveals a multidimensional and complex mechanism underlying the formation of the PA I-B gap.

Figure 3
Scatter plots labeled A to H show SHAP values against various features. Each plot includes blue scatter points and a red LOWESS curve. Key trends: A shows a positive correlation, B shows a negative correlation, C has a peak, D declines, E is negatively sloped, F is positively sloped, G shows a decline, and H has a U-shape pattern.

Figure 3. SHAP dependence plot of important features.

3.6 Heterogeneity analysis

Basing on the combined SHAP evidence, the psychological-cognition dimension exerts a significantly larger aggregate effect than the smart-sports-tools dimension, confirming its dominant role in narrowing the PA I-B gap, whereas the latter serves primarily an auxiliary function. To examine whether this dominan-auxiliary relationship varies across psychological-cognition levels, we dichotomised the sample at the median values of BAR and SE and estimated heterogeneous treatment effects using DML framework. The smallest subgroup (n = 611) exceeds the 250–500 cases threshold recommended for Causal-Forest subgroup analyses (Bonander and Svensson, 2021), ensuring adequate statistical power.

Table 6 reveals a consistent pattern of high-cognition activation yet low-cognition inhibition among the smart-sports-tool dimensions. Specifically, three findings emerge: (1) UI, SS, and PN constitute the core tool effects. All three variables significantly promote the translation of intention to behavior (p < 0.001; 95% CIs exclude zero). The effects are strongest when self-efficacy is low: PN = –0.100, SS = –0.100 and UI = –0.094. (2)Multi-dimensional activation among cognitively advantaged groups. In the low-BAR and high-SE subgroups, PE and EE exhibit significant positive effects (p < 0.05; 95%CIs exclude zero). Additionally, PR is significant only in the low-BAR group (ATE = 0.054, p < 0.05) and SI is significant only in the high-SE group (ATE = -0.039, p < 0.001). Overall, individuals with higher psychological-cognition levels are more able to leverage the diverse facilitative attributes of smart-sports tools. (3)Restricted tool functionality among cognitively disadvantaged groups. Beyond UI, SS, and PN, only FC is significant in the low-SE subgroup (ATE = −0.027, p < 0.05); all other dimensions carry confidence intervals that span zero. Thus, when psychological-cognition levels are low, the facilitative potential of smart-sports tools is markedly constrained.

Table 6
www.frontiersin.org

Table 6. Heterogeneity analysis by subgroups.

4 Discussion

The study analyzed 1,344 valid Chinese questionnaires, employing five machine learning algorithms (‌OLS, SVM, DT, RF and XGBoost‌) with hyperparameter optimization to construct the best-fitting prediction model for the ‌PA I-B gap‌. Building upon the optimal model, we estimated the ‌average treatment effect‌ of important predictors on the PA I-B gap by employing ‌residual orthogonalization and cross-validation‌ to account for high-dimensional observable confounders. ‌SHAP-based visualization‌ further elucidated the mechanisms underlying these effects. Compared to conventional regression methods, the ‌interpretable DML framework‌ significantly enhanced both predictive accuracy and causal interpretability, thereby establishing a novel methodological paradigm for PA behavior research.

Our findings indicate that perceived barriers, self-efficacy, intention to use smart tools and social support are the primary determinants of the PA I-B gap. These results provide empirical evidence for the synergistic role of psychological health cognition and smart sports support in moderating the gap. Unlike previous studies that have focused on either psychological cognition (Burnett et al., 2018; Miao et al., 2024) or smart sports (Wang and Dai, 2022) in isolation, our study emphasizes the combined effect of both factors. Among the top four predictors, the aggregated SHAP value of the psychological-cognition dimension (0.233) was approximately 1.7 times that of the smart-sports-tools dimension (0.140), indicating that the former exerts dominant predictive power over the PA I-B gap.

The Health Belief Model proposes that perceived barriers such as lack of energy or interest reduce exercise motivation (Wu et al., 2020). Smart sports tools can mitigate these barriers and reinforce exercise adherence through gamification and social interaction (Laranjo et al., 2021), yet their effectiveness hinges on users’ ability to overcome intrinsic laziness and self-imposed limitations (Gil-Píriz et al., 2021). Social cognitive theory further asserts that self-efficacy is the pivotal driver of behavioral change (Bandura, 2004). Although smart devices like activity trackers and fitness apps can sustain engagement over short periods (eg, up to 3 months) (Romeo et al., 2019), they cannot substitute for individuals’ inherent exercise volition and intrinsic motivation. Consequently, while smart-sports technologies may modestly reduce the gap, their influence is confined to reinforcing maintenance among those who already exercise, rather than initiating PA behavior (Gabarron et al., 2024). In contrast, psychological cognition constitute the decisive determinants that translate intention into behavior.

Drawing on the important variables identified through causal forest estimation, we find that all important predictors exhibit statistically significant causal associations with the PA I-B gap and that their directional effects align with theoretical expectations (Table 5). Perceived barriers, which yielded the largest average treatment effect (ATE = 0.186), indicate that constraints such as inadequate facility accessibility, lack of exercise methods, weak interpersonal support, diminished interest, and limited time and energy management considerably reduce individuals’ willingness to engage in PA (Lin et al., 2022), thereby widening the gap. The absolute ATEs for self-efficacy and perceived benefits both exceeded 0.10 with negative signs, implying that higher psychological cognition increases the likelihood of translating intention into action. This finding aligns with previous evidence that individuals who recognize the health value of PA and possess high self-efficacy are more likely to adopt effective strategies, invest greater effort, and maintain regular physical activity (Di Maio et al., 2021; Landais et al., 2023). Although the effect size of usage intention toward smart sports tools (ATE = −0.118) was smaller than that of the psychological variables, it was still significantly larger than the ancillary dimensions of social support, personal innovativeness and social influence (all |ATEs| < 0.10). This suggests that, in a mature digital environment, enhancing overall usage intention—by strengthening users’ sense of involvement and motivation—may be a more critical route to narrowing the I-B gap than simply optimizing functional modules. In addition, each one-standard-deviation increase in late-night frequency significantly enlarged the PA I-B gap (ATE = 0.108). Research has proposed that sleep deprivation can lead to depressed mood, fatigue, and impaired decision-making, thereby diminishing individuals’ willingness and capability to participate in exercise (Williams et al., 2024).

Additionally, we generated SHAP dependence plots to examine whether the effects of key features on the PA I-B gap are linear, threshold-based, or non-linear. The results show that perceived barriers, self-efficacy, perceived benefits, stay up late and smart sports tools of personal innovation display monotonic linear relationships with the gap, whereas smart sports tools of usage intention, social support and social influence present more complex,non-linear trajectories. The inverted-U link between usage intention and the PA I-B gap is best interpreted through self determination theory and habit strength. On the left branch, a subset of non-users of smart sports tools is characterized by high autonomy and strong habitual strength (Herrmann and Blackstone, 2020), so technological aids are unnecessary, thus the gap remains small. At the apex of the curve, moderate-intention users often rely on the competence and relatedness supports afforded by the technology for external regulation; when feedback or social incentives are inadequate, intrinsic motivation rapidly erodes (Ryan and Deci, 2000), so the gap widens. On the right branch, most habitual users are exercise enthusiasts with established routines who integrate frequent smart-tool cues into automated behavioral scripts; this transforms external prompts into internalized habits and converges intention with behavior, thereby narrowing the gap. Social support exhibits a clear threshold effect, when emotional support and social interaction from family and friends reach a sufficiently high level, the resulting emotional comfort and peer encouragement markedly reduce psychological barriers and promote sports (Zhao et al., 2024). The gentle U-shaped curve for social influence shows that sedentary adults with a large the gap are more likely to be encouraged by their social circles to adopt smart sports tools for supervised exercise (Sullivan and Lachman, 2016), demonstrating that social influence can provide supplementary facilitation even when cognitive resources are limited. Together, these patterns portray a multidimensional and complex regulatory mechanism in which psychological cognition dominates, while smart tools and social support act in synergy.

To delineate the boundary conditions of the technology acceptance-behavior transformation pathway, we dichotomised perceived barriers and self-efficacy at their respective medians and examined the multidimensional facilitative properties of smart-sports tools across distinct psychological-cognition strata. Across all subgroups, personalized innovation, social support and intention to use exerted significant positive effects. These findings corroborate previous research indicating that tailoring exercise prescriptions to users’ physical condition, training history, goals and preferences (Wu et al., 2020), and reinforcing enjoyment and a sense of belonging through peer interaction and support (Li et al., 2024), is a critical pathway for enhancing exercise maintenance. Ultimately, optimizing these attributes aims to consolidate continued usage intention, thereby enabling smart-sports tools to deliver larger-scale and longer-lasting effects in health management and PA promotion (Migliaccio et al., 2024). Among individuals with low perceived barriers and high self-efficacy, performance expectancy and effort expectancy also exert significant effects, indicating that those with favorable psychological-cognitive profiles are able to fully leverage the diverse functions of smart sports tools. Conversely, only personal innovation, social support and intention to use remained significant among participants with high BAR and low SE, and these effects were amplified in the low-SE group. This suggests that, under conditions of limited psychological-cognition resources, personal innovation, social support and use intention constitute the pivotal dimension for translating intention into behavior, whereas the overall facilitative potential of smart-sports tools is markedly constrained.

Based on the findings of this study, the following recommendations are proposed: Firstly, psychological interventions should be strengthened to reduce perceived barriers and enhance self-efficacy. By improving the allocation of sports resources, promoting scientific exercise methods and fostering a positive sports culture, individuals can be supported in integrating PA into daily life, thereby lowering the barriers to participation. In addition, guiding individuals to set attainable goals and to record their exercise achievements can further elevate self-efficacy and sustain motivation. Secondly, continuously monitor users’ psychological-cognitive states to enable precise alignment of smart-sports tools multifunction. For individuals with elevated perceived barriers or low self-efficacy, priority should be given to activating personal innovation and social-support modules that deliver adaptive exercise prescriptions and create supportive online and offline environments. Such targeted deployment can reduce participation thresholds, boost exercise-specific self-efficacy and facilitate the durable translation of intention into behavior. Third, technology-enablement strategies should be refined to expand the reach of smart-sports tools among psychologically advantaged users. Streamlined interfaces and reduced operational complexity lower technological barriers, while optimized hardware compatibility and environmental adaptation, such as integrated venue booking and weather alerts, enhance the overall exercise experience. Collectively, these measures will encourage sustained engagement, advance population-wide physical activity and contribute to comprehensive health promotion.

5 Conclusion

This study innovatively integrates an explainable double-machine-learning framework to examine important predictors of the physical activity intention-behavior gap, grounding the analysis in the Health Belief Model and the Unified Theory of Acceptance and Use of Technology. It introduces a novel relative I-B gap metric to enable cross-population comparisons. Results reveal that psychological cognition, including perceived barriers and self-efficacy, exerts a stronger predictive influence on the gap than smart sports tools, represented by usage intention and social support. Furthermore, the multidimensional attributes of smart sports tools exhibit heterogeneous effects across distinct psychological-cognition subgroups. These findings enrich the theoretical understanding of the I-B gap and provide empirical guidance for designing targeted interventions to bridge the gap and promote physical activity.

6 Limitations

While this study provides valuable insights into the PA I-B gap, several limitations should be acknowledged. First, although informed consent was obtained and questionnaires were collected anonymously, and measures such as reverse-worded items and attention checks were employed to partially mitigate response fatigue and consistency motivation, self-reported data remain susceptible to recall and social-desirability biases. Future studies should adopt a multi-method assessment strategy that integrates subjective reports with objective measures (e.g., wearable devices and app log files) to enhance data validity and reliability. Second, the relative I-B gap metric introduced herein is novel and may be subject to unexamined measurement error or limitations; subsequent investigations should employ alternative measurement strategies to corroborate its validity. Third, the cross-sectional design constrains causal inference regarding temporal ordering; longitudinal follow-ups are needed to capture the dynamic evolution of the PA I-B relationship. Moreover, the theoretical framework omits psychological traits beyond self-efficacy and objective environmental factors (e.g., walkability, socio-cultural context). While XGBoost outperformed alternative models, the R2 of 0.647 indicates that a substantial proportion of variance in the I-B gap remains unexplained; future studies should incorporate additional variables to improve predictive power. Finally, the sample is exclusively Chinese, which may limit external validity; cross-group validation in culturally diverse and international samples is warranted to enhance generalisability.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

YL: Conceptualization, Investigation, Methodology, Formal analysis, Writing – original draft, Writing – review & editing. JC: Funding acquisition, Project administration, Methodology, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the National Social Science Foundation of China (grant number 18BTY110) and the Hunan Provincial Natural Science Foundation (grant number 2024JJ7181).

Acknowledgments

We sincerely appreciate the reviewers’ insightful suggestions, which substantially improved the quality of this manuscript. We also thank all participants for their time and willingness to contribute to this research, as well as our research team for their perseverance throughout the study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abadie, A., and Cattaneo, M. D. (2018). Econometric methods for program evaluation. Ann. Rev. Econom. 10, 465–503. doi: 10.1146/annurev-economics-080217-053402

Crossref Full Text | Google Scholar

Anderberg, I., Kemps, E., and Prichard, I. (2025). Tracking every bite and step: associations between diet and fitness app use, disordered eating, body image concern and compulsive exercise. Psychol. Sport Exerc. 80:102890. doi: 10.1016/j.psychsport.2025.102890

Crossref Full Text | Google Scholar

Bandura, A. (2004). Health promotion by social cognitive means. Health Educ. Behav. 31:143. doi: 10.1177/1090198104263660

Crossref Full Text | Google Scholar

Bonander, C., and Svensson, M. (2021). Using causal forests to assess heterogeneity in cost-effectiveness analysis. Health Econ. 30, 1818–1832. doi: 10.1002/hec.4263

Crossref Full Text | Google Scholar

Burnett, R., Chen, H., Szyszkowicz, M., Fann, N., Hubbell, B., Pope, C. A., et al. (2018). Global estimates of mortality associated with long-term exposure to outdoor fine particulate matter. Proc. Natl. Acad. Sci. USA 115, 9592–9597. doi: 10.1073/pnas.1803222115

Crossref Full Text | Google Scholar

Chen, W., Zhou, B., Peng, B., and Li, L. (2024). The relationship between urban residents' physical exercise attitudes and sports consumption demands: the mediating role of physical activity level. Front Psychol. 15:1500150. doi: 10.3389/fpsyg.2024.1500150

Crossref Full Text | Google Scholar

Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., et al. (2018). Double/debiased machine learning for treatment and structural parameters. Econ. J. 21, C1–C68. doi: 10.1111/ectj.12097

Crossref Full Text | Google Scholar

Dai, X., Yin, M. H., and Zhu, L. (2011). Compilation and preliminary application of the scale of physical fitness health belief of college students. J. Beijing Sport University. 34, 72–74. doi: 10.19582/j.cnki.11-3785/g8.2011.12.020

Crossref Full Text | Google Scholar

Di Maio, S., Keller, J., Hohl, D. H., Schwarzer, R., and Knoll, N. (2021). Habits and self-efficacy moderate the effects of intentions and planning on physical activity. Br. J. Health Psychol. 26, 50–66. doi: 10.1111/bjhp.12452

Crossref Full Text | Google Scholar

Englert, C., Rebar, A., Rhodes, R. E., and Pfeffer, I. (2023). Editorial: new developments in the intention-behavior gap for physical activity - recent trends, controversies, and a critical outlook. Front. Psychol. 14:1119973. doi: 10.3389/fpsyg.2023.1119973

Crossref Full Text | Google Scholar

Featherman, M. S., and Pavlou, P. A. (2003). Predicting e-services adoption: a perceived risk facets perspective. International J. Human-Computer Studies 59, 451–474. doi: 10.1016/S1071-5819(03)00111-3

Crossref Full Text | Google Scholar

Gabarron, E., Larbi, D., Rivera-Romero, O., and Denecke, K. (2024). Human factors in AI-driven digital solutions for increasing physical activity: scoping review. JMIR Hum. Factors 11:e55964. doi: 10.2196/55964

Crossref Full Text | Google Scholar

Gan, Y., Kuang, L., Xu, X. M., Ai, M., He, J. L., Wang, W., et al. (2025). Application of machine learning in predicting adolescent internet behavioral addiction. Front. Psych. 15:1521051. doi: 10.3389/fpsyt.2024.1521051

Crossref Full Text | Google Scholar

Gao, Z., Wen, X., Fu, Y., Lee, J. E., and Zeng, N. (2021). Motor skill competence matters in promoting physical activity and health. Biomed. Res. Int. 2021:9786368. doi: 10.1155/2021/9786368

Crossref Full Text | Google Scholar

Gil-Píriz, D., Leyton-Román, M., Mesquita, S., and Jiménez-Castuera, R. (2021). Barriers to the practice of sport and physical activity from the perspective of self-determination theory. Sustainability. 13:7665. doi: 10.3390/su13147665

Crossref Full Text | Google Scholar

Gurleyik, D., Sen, C. K. N., Etnier, J. L., and Acar, I. H. (2022). Culture in physical activity: the contribution of basic psychological needs and goal orientation. Int. J. Environ. Res. Public Health 19:16691. doi: 10.3390/ijerph192416691

Crossref Full Text | Google Scholar

Gut, V., Schmid, J., and Conzelmann, A. (2020). The interaction of behavioral context and motivational-volitional factors for exercise and sport in adolescence: patterns matter. BMC Public Health 20:570. doi: 10.1186/s12889-020-08617-5

Crossref Full Text | Google Scholar

Hagger, M. S., and Chatzisarantis, N. L. (2009). Integrating the theory of planned behaviour and self-determination theory in health behaviour: a meta-analysis. Br. J. Health Psychol. 14, 275–302. doi: 10.1348/135910708X373959

Crossref Full Text | Google Scholar

Herrmann, L. K., and Blackstone, S. R. (2020). Feasibility and outcomes from development and testing of a health behavior theory-based fitness app. Health Technol. 4:4. doi: 10.21037/HT-20-17

Crossref Full Text | Google Scholar

Hou, B., Li, L., Zheng, L., Qi, Y., and Zhou, S. (2022). Linking exercise intention to exercise action: the moderating role of self-efficacy. Front. Psychol. 13:921285. doi: 10.3389/fpsyg.2022.921285

Crossref Full Text | Google Scholar

Huang, J., and Zhang, S. (2024). Predictive variables and combined paths of physical activity intention-behavior gap based on random forest approach. J. Shanghai Sport University 48, 19–28. doi: 10.16099/j.sus.2024.04.27.0002

Crossref Full Text | Google Scholar

Kleinberg, J., Ludwig, J., Mullainathan, S., and Obermeyer, Z. (2015). Prediction policy problems. Am. Econ. Rev. 105, 491–495. doi: 10.1257/aer.p20151023

Crossref Full Text | Google Scholar

Knapova, L., Cho, Y. W., Chow, S. M., Kuhnova, J., and Elavsky, S. (2024). From intention to behavior: within- and between-person moderators of the relationship between intention and physical activity. Psychol. Sport Exerc. 71:102566. doi: 10.1016/j.psychsport.2023.102566

Crossref Full Text | Google Scholar

Landais, L. L., Jelsma, J. G. M., Verhagen, E. A. L. M., Timmermans, D. R. M., and Damman, O. C. (2023). Awareness of a disconnect between the value assigned to health and the effort devoted to health increases the intention to become more physically active. Health Psychol. Behav. Med. 11:2242484. doi: 10.1080/21642850.2023.2242484

Crossref Full Text | Google Scholar

Laranjo, L., Ding, D., Heleno, B., Kocaballi, B., Quiroz, J. C., Tong, H. L., et al. (2021). Do smartphone applications and activity trackers increase physical activity in adults? Systematic review, meta-analysis and metaregression. Br. J. Sports Med. 55, 422–432. doi: 10.1136/bjsports-2020-102892

Crossref Full Text | Google Scholar

Li, H., Lee, J., and Cho, H. (2024). The effect of online social support from sport-related short video platforms on leisure-time physical activity intention. Int. J. Sport Psychol. 55, 523–543. doi: 10.7352/IJSP.2024.55.523

Crossref Full Text | Google Scholar

Liang, D. Q. (1994). The relationship between college students' stress levels and physical exercise. Chinese J. Mental Health. 8, 5–6.

Google Scholar

Lin, L., Liu, Q., Xiao, X., and Luo, Q. (2022). Perceived constraints on active recreational sport participation among residents in urban China. Int. J. Environ. Res. Public Health 19:14884. doi: 10.3390/ijerph192214884

Crossref Full Text | Google Scholar

Liu, H., Hou, W., Emolyn, I., and Liu, Y. (2023). Building a prediction model of college students' sports behavior based on machine learning method: combining the characteristics of sports learning interest and sports autonomy. Sci. Rep. 13:15628. doi: 10.1038/s41598-023-41496-5

Crossref Full Text | Google Scholar

Liu, D., Maimaitijiang, R., Gu, J., Zhong, S., Zhou, M., Wu, S., et al. (2019). Using the unified theory of acceptance and use of technology (UTAUT) to investigate the intention to use physical activity apps: cross-sectional survey. JMIR Mhealth Uhealth 7:e13127. doi: 10.2196/13127

Crossref Full Text | Google Scholar

Liu, Y., Zhai, X., Zhang, Y., Jiang, C., Zeng, J., Yang, M., et al. (2023). The promoting effect of exercise motivation on physical fitness in college students: a mediation effect model. BMC Public Health 23:2244. doi: 10.1186/s12889-023-17154-w

Crossref Full Text | Google Scholar

Lundberg, S. M., and Lee, S.-I. (2017). A unified approach to interpreting model predictions. Neural Information Processing Systems. 4, 4768–4777. doi: 10.5555/3295222.3295230

Crossref Full Text | Google Scholar

MacCann, C., Todd, J., Mullan, B. A., and Roberts, R. D. (2015). Can personality bridge the intention-behavior gap to predict who will exercise? Am. J. Health Behav. 39, 140–147. doi: 10.5993/AJHB.39.1.15

Crossref Full Text | Google Scholar

Maltagliati, S., Raichlen, D. A., Rhodes, R. E., and Cheval, B. (2025). Closing the intention-behaviour gap in physical activity: the moderating effect of individual differences in the valuation of physical effort. Br. J. Health Psychol. 30:e12790. doi: 10.1111/bjhp.12790

Crossref Full Text | Google Scholar

Miao, M., Chen, Y., Zhou, Z., Wen, J., and Zheng, L. (2024). Procrastination in the intention-behaviour gap: exercise procrastination and the moderating role of emotion. Psychol. Sport Exerc. 74:102672. doi: 10.1016/j.psychsport.2024.102672

Crossref Full Text | Google Scholar

Migliaccio, G. M., Padulo, J., and Russo, L. (2024). The impact of wearable technologies on marginal gains in sports performance: an integrative overview on advances in sports, exercise, and health. Appl. Sci. 14:6649. doi: 10.3390/app14156649

Crossref Full Text | Google Scholar

Mullainathan, S., and Spiess, J. (2017). Machine learning: an applied econometric approach. J. Econ. Perspect. 31, 87–106. doi: 10.1257/jep.31.2.87

Crossref Full Text | Google Scholar

Qiao, X., Ji, L., Jin, Y., Si, H., Bian, Y., Wang, W., et al. (2021). Development and validation of an instrument to measure beliefs in physical activity among frail older adults: an integration of the Health Belief Model and the theory of planned behavior. Patient Educ. Couns. 10, 2544–2551. doi: 10.1016/j.pec.2021.03.009

Crossref Full Text | Google Scholar

Rahmati-Najarkolaei, F., Tavafian, S. S., Gholami Fesharaki, M., and Jafari, M. R. (2015). Factors predicting nutrition and physical activity behaviors due to cardiovascular disease in Tehran university students: application of Health Belief Model. Iran Red Crescent Med J 17:e18879. doi: 10.5812/ircmj.18879

Crossref Full Text | Google Scholar

Rebar, A. L., Maher, J. P., Doerksen, S. E., Elavsky, S., and Conroy, D. E. (2016). Intention-behavior gap is wider for walking and moderate physical activity than for vigorous physical activity in university students. J. Sci. Med. Sport 19, 130–134. doi: 10.1016/j.jsams.2014.11.392

Crossref Full Text | Google Scholar

Rebar, A. L., Rhodes, R. E., and Gardner, B. (2019). How we are misinterpreting physical activity intention - behavior relations and what to do about it. Int. J. Behav. Nutr. Phys. Act. 16:71. doi: 10.1186/s12966-019-0829-y

Crossref Full Text | Google Scholar

Rhodes, R. E., and de Bruijn, G. J. (2013). How big is the physical activity intention-behaviour gap? A meta-analysis using the action control framework. Br. J. Health Psychol. 18, 296–309. doi: 10.1111/bjhp.12032

Crossref Full Text | Google Scholar

Rhodes, R. E., and Dickau, L. (2012). Experimental evidence for the intention-behavior relationship in the physical activity domain: a meta-analysis. Health Psychol. 31, 724–727. doi: 10.1037/a0027290

Crossref Full Text | Google Scholar

Roberts, V., Maddison, R., Magnusson, J., and Prapavessis, H. (2010). Adolescent physical activity: does implementation intention have a role? J. Phys. Act. Health 7, 497–507. doi: 10.1123/jpah.7.4.497

Crossref Full Text | Google Scholar

Romeo, A., Edney, S., Plotnikoff, R., Curtis, R., Ryan, J., Sanders, I., et al. (2019). Can smartphone apps increase physical activity? Systematic review and meta-analysis. J. Med. Internet Res. 21:e12053. doi: 10.2196/12053

Crossref Full Text | Google Scholar

Ryan, R. M., and Deci, E. L. (2000). Self-determination theory and the facilitation of intrinsic motivation, social development, and well-being. Am. Psychol. 55, 68–78. doi: 10.1037//0003-066x.55.1.68

Crossref Full Text | Google Scholar

Santinha, G., Oliveira, R., and Gonçalves, L. J. (2022). Contextual factors influencing young athletes' decision to do physical activity and choose a Sports' Club: the case of Portugal. Healthcare 10:347. doi: 10.3390/healthcare10020347

Crossref Full Text | Google Scholar

Schumacher, L. M., Thomas, C., Ainsworth, M. C., and Arigo, D. (2021). Social predictors of daily relations between college women's physical activity intentions and behavior. J. Behav. Med. 44, 270–276. doi: 10.1007/s10865-020-00166-x

Crossref Full Text | Google Scholar

Seol, S., Ko, D., and Yeo, I. (2017). Ux analysis based on TR and UTAUT of sports smart wearable devices. KSII Transactions Internet Information Systems. 11, 4162–4179. doi: 10.3837/tiis.2017.08.024

Crossref Full Text | Google Scholar

Shafieian, M., and Kazemi, A. (2017). A randomized trial to promote physical activity during pregnancy based on Health Belief Model. J. Educ. Health Promot. 6:40. doi: 10.4103/jehp.jehp_19_15

Crossref Full Text | Google Scholar

Sheeran, P. (2002). Intention-behavior relations: a conceptual and empirical review. Eur. Rev. Soc. Psychol. 12, 1–36. doi: 10.1080/14792772143000003

Crossref Full Text | Google Scholar

Sheng, J., Gong, L., and Zhou, J. (2023). Exercise Health Belief Model mediates the relationship between physical activity and peer support among Chinese college students: a cross-sectional survey. Front Psychol. 14:1103109. doi: 10.3389/fpsyg.2023.1103109

Crossref Full Text | Google Scholar

Song, J., Kim, J., and Cho, K. (2018). Understanding users’ continuance intentions to use smart-connected sports products. Sport Management Review. 21, 477–490. doi: 10.1016/j.smr.2017.10.004

Crossref Full Text | Google Scholar

Strain, T., Flaxman, S., Guthold, R., Semenova, E., Cowan, M., Riley, L. M., et al. (2024). National, regional, and global trends in insufficient physical activity among adults from 2000 to 2022: a pooled analysis of 507 population-based surveys with 5·7 million participants. Lancet Glob Health. 12, e1232–e1243. doi: 10.1016/S2214-109X(24)00150-5

PubMed Abstract | Crossref Full Text | Google Scholar

Su, Y. S., Lin, Y. D., and Liu, T. Q. (2022). Applying machine learning technologies to explore students' learning features and performance prediction. Front. Neurosci. 16:1018005. doi: 10.3389/fnins.2022.1018005

Crossref Full Text | Google Scholar

Sullivan, A. N., and Lachman, M. E. (2016). Behavior change with fitness Technology in Sedentary Adults: a review of the evidence for increasing physical activity. Front. Public Health 4:289. doi: 10.3389/fpubh.2016.00289

Crossref Full Text | Google Scholar

Venkatesh, V., Morris, M., Davis, G., and Davis, G. (2003). User acceptance of information technology: toward a unified view. MIS Q. 27, 425–478. doi: 10.2307/30036540

Crossref Full Text | Google Scholar

Wang, F., and Dai, J. (2022). The impact mechanism and measurement of mobile sports app features on college students' exercise intention in the digital age. Journal of Shenyang Sport University. 41, 71–77. doi: 10.12163/j.ssu.20220139

Crossref Full Text | Google Scholar

Wei, J., Vinnikova, A., Lu, L., and Xu, J. (2021). Understanding and predicting the adoption of fitness Mobile apps: evidence from China. Health Commun. 36, 950–961. doi: 10.1080/10410236.2020.1724637

Crossref Full Text | Google Scholar

Williams, T. B., Badariotti, J. I., Corbett, J., Miller-Dicks, M., Neupert, E., McMorris, T., et al. (2024). The effects of sleep deprivation, acute hypoxia, and exercise on cognitive performance: a multi-experiment combined stressors study. Physiol. Behav. 274:114409. doi: 10.1016/j.physbeh.2023.114409

Crossref Full Text | Google Scholar

World Health Organization. (2024). Global levels of physical inactivity in adults: off track for 2030 (Technical report). Geneva, Switzerland: World Health Organization. Available at: https://iris.who.int/handle/10665/378026 (Accessed 8 September 8, 2025).

Google Scholar

Wu, S., Feng, X., and Sun, X. (2020). Development and evaluation of the Health Belief Model scale for exercise. Int. J. Nursing Sci. 7, S23–S30. doi: 10.1016/j.ijnss.2020.07.006

Crossref Full Text | Google Scholar

Wu, Y., Ma, Z., Zhao, H., Li, Y., and Sun, Y. (2020). Achieve personalized exercise intensity through an intelligent system and cycling equipment: a machine learning approach. Appl. Sci. 10:7688. doi: 10.3390/app10217688

Crossref Full Text | Google Scholar

Xiang, T., Huang, Y. F., and Wu, H. (2024). Investigation Study on Older People’s Willingness to Use Webcast Exercise Guidance: Based on the UTAUT Model and the Perceived Risk Theory. Gen. Pract. 11, 1–7. Available at: https://link.cnki.net/urlid/13.1222.R.20240823.1330.010

Google Scholar

Xue, X., and Li, Y. (2023). Will the construction of sports facilities nudge people to participate in physical exercises in China? The moderating role of mental health. Healthcare. 11:219. doi: 10.3390/healthcare11020219

Crossref Full Text | Google Scholar

Yang, Y., and Koenigstorfer, J. (2021). Determinants of fitness app usage and moderating impacts of education-, motivation-, and gamification-related app features on physical activity intentions: cross-sectional survey study. J. Med. Internet Res. 23:e26063. doi: 10.2196/26063

Crossref Full Text | Google Scholar

Yang, Z., Li, J., and Liu, X. (2025). "inconsistency between words and deeds": a meta-analysis of the moderating and mediating mechanisms of bridging the exercise-intentional-behavior gap. Front. Psychol. 16:1586176. doi: 10.3389/fpsyg.2025.1586176

Crossref Full Text | Google Scholar

Zhao, G., Sun, K., Xue, Y., and Dong, D. (2024). A chain-mediated model of the effect of physical exercise on loneliness. Sci. Rep. 14:30798. doi: 10.1038/s41598-024-81059-w

Crossref Full Text | Google Scholar

Zhuo, C., and Yunsong, C. (2025). The application of double machine learning in causal inference for social Sciences. Zhejiang. Soc. Sci. 6, 72–158. doi: 10.14167/j.zjss.2025.06.016

Crossref Full Text | Google Scholar

Keywords: physical activity, intention-behavior gap, double machine learning, SHAP explainable method, Health Belief Model (HBM), unified theory of acceptance and use of technology (UTAUT)

Citation: Li Y and Cai J (2025) Psychological and technological predictors of the physical activity intention-behavior gap: an explainable machine learning analysis. Front. Psychol. 16:1657506. doi: 10.3389/fpsyg.2025.1657506

Received: 01 July 2025; Accepted: 15 September 2025;
Published: 25 September 2025.

Edited by:

Pedro Forte, Higher Institute of Educational Sciences of the Douro, Portugal

Reviewed by:

Rafael Peixoto, Instituto Superior de Ciências Educativas, Portugal
Soukaina Hattabi, University of Jendouba, Tunisia

Copyright © 2025 Li and Cai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jianguang Cai, MTI1MjMzOTI2M0BxcS5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.