AUTHOR=Gan Yao , Kuang Li , Xu Xiao-Ming , Ai Ming , He Jing-Lan , Wang Wo , Hong Su , Chen Jian mei , Cao Jun , Zhang Qi TITLE=Research on prediction model of adolescent suicide and self-injury behavior based on machine learning algorithm JOURNAL=Frontiers in Psychiatry VOLUME=Volume 15 - 2024 YEAR=2025 URL=https://www.frontiersin.org/journals/psychiatry/articles/10.3389/fpsyt.2024.1521025 DOI=10.3389/fpsyt.2024.1521025 ISSN=1664-0640 ABSTRACT=ObjectiveTo explore the risk factors that affect adolescents’ suicidal and self-injurious behaviors and to construct a prediction model for adolescents’ suicidal and self-injurious behaviors based on machine learning algorithms.MethodsStratified cluster sampling was used to select high school students in Chongqing, yielding 3,000 valid questionnaires. Based on whether students had engaged in suicide or self-injury, they were categorized into a suicide/self-injury group (n=78) and a non-suicide/self-injury group (n=2,922). Gender, age, insomnia, and mental illness data were compared between the two groups, and a logistic regression model was used to analyze independent risk factors for adolescent suicidal and self-injurious behavior. Six methods—multi-level perceptron, random forest, K-nearest neighbor, support vector machine, logistic regression, and extreme gradient boosting—were used to build predictive models. Various model indicators for suicidal and self-injurious behavior were compared across the six algorithms using a confusion matrix to identify the optimal model.ResultIn the self-injury and suicide groups, the proportions of male adolescents, late adolescence, insomnia, and mental illness were significantly higher than in the non-suicide and self-injury groups (p <0.05). Compared with the non-suicidal self-injury group, this group also showed significantly increased scores in cognitive subscales, impulsivity, psychoticism, introversion–extroversion, neuroticism, interpersonal sensitivity, depression, anxiety, hostility, terror, and paranoia (p <0.05). These statistically significant variables were analyzed in a logistic regression model, revealing that gender, impulsivity, psychoticism, neuroticism, interpersonal sensitivity, depression, and paranoia are independent risk factors for adolescent suicide and self-injury. The logistic regression model achieved the highest sensitivity and specificity in predicting adolescent suicide and self-injury behavior (0.9948 and 0.9981, respectively). Performance of the random forest, multi-level perceptron, and extreme gradient models was acceptable, while the K-nearest neighbor algorithm and support vector machine performed poorly.ConclusionThe detection rate of suicidal and self-injurious behaviors is higher in women than in men. Adolescents displaying impulsiveness, psychoticism, neuroticism, interpersonal sensitivity, depression, and paranoia have a greater likelihood of engaging in such behaviors. The machine learning model for classifying and predicting adolescent suicide and self-injury risk effectively identifies these behaviors, enabling targeted interventions.