AUTHOR=Gan Yao , Kuang Li , Xu Xiao-Ming , Ai Ming , He Jing-Lan , Wang Wo , Hong Su , Chen Jian mei , Cao Jun , Zhang Qi TITLE=Application of machine learning in predicting adolescent Internet behavioral addiction JOURNAL=Frontiers in Psychiatry VOLUME=Volume 15 - 2024 YEAR=2025 URL=https://www.frontiersin.org/journals/psychiatry/articles/10.3389/fpsyt.2024.1521051 DOI=10.3389/fpsyt.2024.1521051 ISSN=1664-0640 ABSTRACT=ObjectiveTo explore the risk factors affecting adolescents’ Internet addiction behavior and build a prediction model for adolescents’ Internet addiction behavior based on machine learning algorithms.MethodsA total of 4461 high school students in Chongqing were selected using stratified cluster sampling, and questionnaires were administered. Based on the presence of Internet addiction behavior, students were categorized into an Internet addiction group (n=1210) and a non-Internet addiction group (n=3115). Gender, age, residence type, and other data were compared between the groups, and independent risk factors for adolescent Internet addiction were analyzed using a logistic regression model. Six methods—multi-level perceptron, random forest, K-nearest neighbor, support vector machine, logistic regression, and extreme gradient boosting—were used to construct the model. The model’s indicators under each algorithm were compared, evaluated with a confusion matrix, and the optimal model was selected.ResultThe proportion of male adolescents, urban household registration, and scores on the family function, planning, action, and cognitive subscales, along with psychoticism, introversion-extroversion, neuroticism, somatization, obsessive-compulsiveness, interpersonal sensitivity, depression, anxiety, hostility, paranoia, and psychosis, were significantly higher in the Internet addiction group than in the non-Internet addiction group (P < 0.05). No significant differences were found in age or only-child status (P > 0.05). Statistically significant variables were analyzed using a logistic regression model, revealing that gender, household registration type, and scores on planning, action, introversion-extroversion, psychoticism, neuroticism, cognitive, obsessive-compulsive, depression, and hostility scales are independent risk factors for adolescent Internet addiction. The area under the curve (AUC) for multi-level perceptron, random forest, K-nearest neighbor, support vector machine, logistic regression, and extreme gradient boosting models were 0.843, 0.817, 0.778, 0.846, 0.847, and 0.836, respectively, with extreme gradient boosting showing the best predictive performance among these models.ConclusionThe detection rate of Internet addiction is higher in males than in females, and adolescents with impulsive, extroverted, psychotic, neurotic, obsessive, depressive, and hostile traits are more prone to developing Internet addiction. While the overall performance of the machine learning models for predicting adolescent Internet addiction is moderate, the extreme gradient boosting method outperforms others, effectively identifying risk factors and enabling targeted interventions.