Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Machine Learning and Artificial Intelligence

Volume 8 - 2025 | doi: 10.3389/frai.2025.1606245

Predictors of Mortality among Neonates in Lusaka, Zambia: A Comparative Analysis of Machine Learning and Traditional Survival Analysis Techniques

Provisionally accepted
  • 1Private, Pretoria, South Africa
  • 2Stellenbosch University, Stellenbosch, Western Cape, South Africa
  • 3Department of Pharmacy, School of Health Sciences, University of Zambia, Lusaka, Zambia

The final, formatted version of the article will be published soon.

Neonatal mortality remains a critical global health issue, with 2.3 million deaths in 2022. Sub-Saharan Africa bears 57% of under 5 deaths despite only 30% of global births, with Zambia ranking fourth highest in terms of neonatal mortality among neighbouring countries despite progress. While traditional survival analysis has identified neonatal mortality risk factors, the use of machine learning for prediction remains underexplored. This study aimed to determine factors associated with neonatal mortality as well as compare the predictive performance of traditional survival analysis and machine learning models in predicting mortality among neonates in Lusaka, Zambia (January 2018–Septem-ber 2019). Demographic and clinical data from 1018 neonates were analysed using seven models: Weibull, Lasso, Ridge, Elastic Net (regularized cox), Random Survival Forest, DeepSurv Neural Networks and Gradient Boosting Machine. Model performance was assessed using nested cross validation with hyperparameter tuning in the inner folds and unbiased performance estimation in the outer folds. Evaluation metrics included the concordance index , time dependent area under the curve at 7, 14, and 28 days, brier scores, and calibration plots. Kaplan-Meier plots illustrated survival probabilities over time. Of 1,018 neonates, 757 (74.3%) died. Hypoxic ischemic encephalopathy (TR = 0.71, 95% CI: 0.63–0.81) was strongly associated with reduced survival, while higher birth weight was protective (TR = 1.88, 95% CI: 1.60–2.20). Sepsis demonstrated a paradoxical association with longer survival (TR = 1.16, 95% CI: 1.04–1.30), which persisted after sensitivity analyses. Among predictive models, the random survival forest achieved the highest discrimination (C index = 0.731; AUC = 0.759, 0.665, and 0.728 at 7, 14, and 28 days, respectively; brier = 0.164–0.254), outperforming Weibull (C-index = 0.622) and penalized Cox mod-els (0.620). Elastic Net demonstrated the most stable calibration across time points, while DeepSurv underperformed (C-index = 0.553). Feature importance analysis from random survival forest identified birth weight as the dominant predictor, followed by sex, sepsis, and necrotizing enterocolitis. While the traditional Weibull model remains valuable for interpretability, the integration of machine learning approaches offers enhanced predictive accuracy. Hybrid modelling strategies may improve early risk identification and inform neonatal care in resource-limited settings.

Keywords: survival analysis, Weibull, machine learning, Elastic net regression, neonatal mortality, predictive modelling

Received: 04 Apr 2025; Accepted: 21 Oct 2025.

Copyright: © 2025 Mokoena, Mukosha, Zunza and Maposa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Tshepiso Mokoena, glandzm@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.