ORIGINAL RESEARCH article
Front. Public Health
Sec. Health Economics
Volume 13 - 2025 | doi: 10.3389/fpubh.2025.1549210
Exploring Machine Learning Classification for Community Based Health Insurance Enrollment in Ethiopia
Provisionally accepted- 1Department of Statistics, College of Natural and Computational Sciences, Debre Tabor, Ethiopia
- 2Department of Statistics, Faculty of Science, University of Johannesburg, Johannesburg, South Africa
- 3Department of Statistics, College of Science, Bahir Dar University, Bahirdar, Amhara Region, Ethiopia
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Community-based health insurance (CBHI) is a vital tool for achieving universal health coverage, a key global health priority outlined in the Sustainable Development Goals (SDGs). Sub-Saharan Africa continues to face challenges in achieving universal health coverage (UHC) and protecting individuals from the financial burden of disease. As a result, CBHI has become popular in low-and middle-income countries, including Ethiopia. Therefore, this study aimed to identify the ML algorithm with the best predictive accuracy for CBHI enrollment and to determine the most influential predictors among the dataset.The 2019 Ethiopian Mini Demographic and Health Survey (EMDHS) data were used.The CBHI were predicted using seven machine learning models: linear discriminant analysis (LDA), support vector machine with radial basis function (SVM), k-nearest neighbors (KNN), classification and regression tree (CART), and random forest (RF). Receiver operating characteristic curves and other metrics were used to evaluate each model's accuracy.The RF algorithm was determined to be the best machine learning model based on different performance assessments. The result indicates that age, wealth index, household members, and land usage all significantly affect CBHI in Ethiopia.Conclusions: This study found that RF machine learning models could improve the ability to classify CBHI in Ethiopia with high accuracy. Age, wealth index, household members, and land utilisation are some of the most significant variables associated with CBHI that were determined by feature importance. The results of the study can help health professionals and policymakers create focused strategies to improve CBHI enrollment in Ethiopia.
Keywords: machine learning, health insurance, random forest, accuracy, Ethiopia
Received: 20 Dec 2024; Accepted: 24 Jun 2025.
Copyright: © 2025 Yilema, Shiferaw, Moyehodie, Fenta, Belay, Fenta and Nigussie. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Seyifemickael Amare Yilema, Department of Statistics, College of Natural and Computational Sciences, Debre Tabor, Ethiopia
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.