Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health

Sec. Children and Health

Volume 13 - 2025 | doi: 10.3389/fpubh.2025.1590689

This article is part of the Research TopicAI-Enabled Healthcare for Adolescents with Cyber DisordersView all articles

Explainable Machine Learning Prediction of Internet Addiction Among Chinese Primary and Middle School Children and Adolescents: A Longitudinal Study Based on Positive Youth Development Data (2019-2022)

Provisionally accepted
Jiahe  LiuJiahe Liu1,2Lang  ChenLang Chen1Yuxin  ChenYuxin Chen3Jingsong  LuoJingsong Luo3Kexin  YuKexin Yu4Linlin  FanLinlin Fan4Chan  YongChan Yong4Huiyu  HeHuiyu He4Simei  LiaoSimei Liao5Lihua  JiangLihua Jiang4,6*
  • 1School of Mathematics and Statistics, Faculty of Science, University of Melbourne, Parkville, Victoria, Australia
  • 2AIM for Health Lab, Faculty of IT, Monash University, Clayton, Australia
  • 3The Jockey Club School of Public Health and Primary Care, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong Region, China
  • 4West China School of Public Health, Sichuan University, Chengdu, Sichuan Province, China
  • 5School of Nursing, Capital Medical University, Beijing, Beijing Municipality, China
  • 6Teaching & Research Section of General Practice Medical Center, West China Hospital of Sichuan University, Chengdu, China

The final, formatted version of the article will be published soon.

Background: Internet Addiction (IA) has emerged as a critical concern, especially among school age children and adolescents, potentially stalling their physical and mental development. Our study aimed to examine the risk factors associated with IA among Chinese children and adolescents and leverage explainable machine learning (ML) algorithms to predict IA status at the time of assessment, based on Young’s Internet Addiction Test.Methods: The longitudinal data consisting of 8824 schoolchildren from the Chengdu Positive Child Development (CPCD) survey were analysed, where 33.3% of participants were identified with IA (Age: 10.97 ± 2.31, Male: 51.73%). IA was defined using Young’s Internet Addiction Test (IAT ≥ 40). Demographic variables such as age, gender, and grade level, along with key variables including scores of Cognitive Behavioural Competencies (CBC), Prosocial Attributes (PA), Positive Identity (PI), General Positive Youth Development Qualities (GPYDQ), Life Satisfaction (LS), Delinquent Behaviour (DB), Non-Suicidal Self-Injury (NSSI), Depression (DP), Anxiety (AX), Family Function Disorders (FF), Egocentrism (EG), Empathy (EP), Academic Intrinsic Value (IV), and Academic Utility Value (UV) were examined. Chi-square and Mann-Whitney U tests were employed to validate the significance of the mentioned predictors of IA. We applied six ML models: Extra Random Forest, XGBoost, Logistic Regression, Bernoulli Naïve Bayes, Multi-Layer Perceptron (MLP), and Transformer Encoder. Performance was evaluated via 10-fold cross-validation and held-out test sets across survey waves. Feature selection and SHapley Additive exPlanations (SHAP) analysis were utilised for model improvement and interpretability, respectively.Results: ExtraRFC achieved the best performance (Test AUC = 0.854, Accuracy = 0.798, F1 = 0.659), outperforming all other models across most metrics and external validations. Key predictors included grade level, delinquent behaviour, anxiety, family function, and depression scores. SHAP analysis revealed consistent and interpretable feature contributions across individuals.Conclusions: Depression, anxiety, and family dynamics as significant factors influencing IA in children. The Extra Random Forest model proves most effective in predicting IA, emphasising the importance of addressing these factors to promote healthy digital habits in children. This study presents an effective SHAP-based explainable ML framework for IA prediction in children and adolescents.

Keywords: Conceptualisation, Data curation, Formal analysis, methodology, visualisation, Writing -original draft, Writing -Review & Editing. Lang Chen: Conceptualisation, Writing -Review & Editing. Yuxin Chen: Conceptualisation

Received: 12 Mar 2025; Accepted: 25 Jun 2025.

Copyright: © 2025 Liu, Chen, Chen, Luo, Yu, Fan, Yong, He, Liao and Jiang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Lihua Jiang, Teaching & Research Section of General Practice Medical Center, West China Hospital of Sichuan University, Chengdu, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.