ORIGINAL RESEARCH article
Front. Physiol.
Sec. Computational Physiology and Medicine
Volume 16 - 2025 | doi: 10.3389/fphys.2025.1549306
Prediction of Obesity Levels Based on Physical Activity and Eating Habits with a Machine Learning Model Integrated with Explainable Artificial Intelligence
Provisionally accepted- 1Department of Management Information Systems, Faculty of Economics and Administrative Sciences, Sivas Cumhuriyet University, Sivas, Sivas, Türkiye
- 2Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya, Türkiye
- 3Department of Sport Management, Faculty of Sport Sciences, Inonu University,, Inohu, Türkiye
- 4Yasar Oncan Secondary School, Ministry of National Education, 44900 Malatya, Malatya, Türkiye
- 5Department of Physical Education and Special Motricity, Faculty of Physical Education and Mountain Sports, Transilvania University of Brașov, Brasov, Brasov, Romania
- 6Federal University of Pernambuco, Recife, Brazil
- 7Keizo Asami Institute, Federal University of Pernambuco, Pernambuco, Brazil
- 8Department of Computer Science, Lakehead University, Thunder Bay, ON P7B 5E1, Canada, Ontario, Canada
- 9Department of Optometry, College of Applied Medical Sciences, King Saud University, Riyadh, Riyadh, Saudi Arabia
- 10Department of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology (NTNU), Alesund, Alesund, Norway
- 11Department of Sustainable Systems Engineering (INATECH), Albert Ludwigs University of Freiburg, Freiburg, Freiburg, Germany
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objectives: This study aims to build a machine learning (ML) prediction model integrated with explainable artificial intelligence (XAI) to categorize obesity levels from physical activity and dietary patterns. The inclusion of XAI methodologies facilitates a comprehensive understanding of the risk factors influencing the model predictions and thus increases transparency in the identification of obesity risk factors. Methods: Six ML models were used: Bernoulli Naive Bayes, CatBoost, Decision Tree, Extra Trees Classifier, Histogram-based Gradient Boosting and Support Vector Machine. For each model, hyperparameters were tuned by random search methodology and model effectiveness was evaluated by repeated holdout testing. SHAP (SHapley Additive Annotations) and LIME (Local Interpretable Model Independent Annotations) interpretability methods were used to generate local and global feature importance measures. Results: The CatBoost model exhibited the highest overall performance and achieved superior results in accuracy, precision, F1 score and AUC metrics. Nonetheless, other models such as Decision Tree and Histogram-based Gradient Boosting also yielded strong and competitive results. The results also highlighted age, weight, height and specific food patterns as key predictors of obesity. In terms of interpretability, LIME showed superior in fidelity,, whereas SHAP showed improved sparsity and consistency across models, facilitating a comprehensive understanding of trait importance. Conclusion: This research demonstrates that ML algorithms, when integrated with XAI technologies, can accurately predict obesity levels and explain important contributing risk factors. The use of SHAP and LIME increases model transparency, facilitating the identification of specific lifestyle patterns linked to obesity risk. These findings help to formulate more precise intervention techniques guided by a reliable and understandable predictive framework.
Keywords: Obesity prediction, machine learning, Explainable artificial intelligence, Physical Activity and Diet, Feature importance
Received: 30 Dec 2024; Accepted: 13 May 2025.
Copyright: © 2025 Görmez, Yagin, Yagin, Aygun, Boke, Badicu, De Sousa Fernandes, Alkhateeb, Basil A. Al-Rawi and Aghaei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Mohammadreza Aghaei, Department of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology (NTNU), Alesund, Alesund, Norway
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.