Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Psychiatry

Sec. Digital Mental Health

Volume 16 - 2025 | doi: 10.3389/fpsyt.2025.1584335

This article is part of the Research TopicAI Approach to the Psychiatric Diagnosis and Prediction Volume IIView all 3 articles

Predicting Suicidality in People Living with HIV in Uganda: A Machine Learning Approach

Provisionally accepted
  • 1African Center of Excellence in Bioinformatics and Data Intensive Science, Makerere University., Kampala, Uganda
  • 2AIDS Healthcare Foundation (Uganda), Kampala, Uganda
  • 3London School of Hygiene and Tropical Medicine Uganda Research Unit, Medical Research Council (Uganda), Entebbe, Uganda
  • 4Precision Healthcare University Research Institute, Queen Mary University of London, London, United Kingdom
  • 5Department of Non-Communicable Disease Epidemiology, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, United Kingdom

The final, formatted version of the article will be published soon.

BackgroundPeople living with HIV (PLWH) are more likely to experience suicidal thoughts and exhibit suicidal behavior than the general population. However, there are currently no effective methods of predicting who is likely to experience suicidal thoughts and behavior. Machine learning (ML) approaches can be leveraged to develop models that evaluate the complex etiology of suicidal behavior, facilitating the timely identification of at-risk individuals and promoting individualized treatment allocation. Materials and methodsThis retrospective case-control study used longitudinal sociodemographic, psychosocial, and clinical data of 1,126 PLWH from Uganda to evaluate the potential of ML in predicting suicidality. In addition, suicidality polygenic risk scores (PRS) were calculated for a subset of 282 study participants and incorporated as an additional feature in the model to determine if including genomic information improves overall model performance. The model’s performance was evaluated using the area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), sensitivity, specificity, and Mathew’s correlation coefficient (MCC).ResultsWe trained and evaluated eight different ML algorithms, including logistic regression, support vector machines, Naïve Bayes, k-nearest neighbors, decision trees, random forests, AdaBoost, and gradient-boosting classifiers. Cost-sensitive AdaBoost emerged as the best model, achieving an AUC of 0.79 (95% CI: 0.72–0.87), a sensitivity of 0.63, a specificity of 0.74, a PPV of 0.36, and an NPV of 0.89 on unseen baseline data. The model demonstrated good generalizability, predicting prevalent and incident suicidality at 12-month follow-up with an AUC of 0.75 (95% CI: 0.69–0.81 ) and 0.69 (95% CI: 0.62–0.76), respectively. Incorporating PRS as an additional feature in the model resulted in a 6% improvement in model sensitivity and a 9% reduction in specificity. A positive MDD diagnosis and high stress contributed the most to predicting suicidality risk.ConclusionA cost-sensitive AdaBoost model developed using the sociodemographic, psychosocial, and clinical data of PLWH in Uganda can predict suicidality risk, albeit with modest PPV. Incorporating suicidality PRS improved the overall predictive performance of the model. However, larger studies involving more diverse participants are needed to evaluate the potential of PRS in enhancing risk stratification and the clinical utility of the prediction model.

Keywords: Suicidality, prediction, machine learning, polygenic risk scores, HIV

Received: 27 Feb 2025; Accepted: 29 Jul 2025.

Copyright: © 2025 Mutema, Lillian, Jjingo, Fatumo, Kinyanda and Kalungi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Anthony Batamye Mutema, African Center of Excellence in Bioinformatics and Data Intensive Science, Makerere University., Kampala, Uganda
Allan Kalungi, London School of Hygiene and Tropical Medicine Uganda Research Unit, Medical Research Council (Uganda), Entebbe, Uganda

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.