Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health

Sec. Health Economics

This article is part of the Research TopicHealth Services and Economic Inequalities through the Lens of Sustainable DevelopmentView all articles

Determinants of Life Expectancy in High Longevity Countries: Evidence from Machine learning

Provisionally accepted
Hong  ZhangHong Zhang1Haixiang  QiaoHaixiang Qiao2Xiuping  LiXiuping Li2Ijaz  UddinIjaz Uddin3Xiaolan  ZhangXiaolan Zhang4*Ruitao  LiRuitao Li5*
  • 1Jinggangshan University, Ji'an, China
  • 2Qingdao Binhai University, Qingdao, China
  • 3Abdul Wali Khan University Mardan, Mardan, Pakistan
  • 4First Affiliated Hospital, Zhejiang Chinese Medical University, Hangzhou, China
  • 5Assumption University, Bangkok, Thailand

The final, formatted version of the article will be published soon.

Health is directly aligned with Sustainable Development Goal (SDG) 3: Good Health and Well-Being, which emphasizes ensuring healthy lives and promoting well-being for all at all ages. The present study investigates the determinants of life expectancy (LEX) by incorporating a comprehensive set of factors: CO₂ emissions as an environmental factor; GDP, health expenditure, and research and development (R&D) as economic factors; education and individual internet use as social factors; and rule of law and government effectiveness as institutional factors. Using panel data for the top 20 high-life-expectancy countries covering the period 2001–2023, this study applies both traditional econometric techniques namely, PMG, fixed effects, and FMOLS estimators and advanced machine learning approaches, specifically Gradient Boosting and Random Forest. The regression results reveal that CO₂ emissions negatively affect LEX, whereas GDP, health expenditure, education, internet use, rule of law, government effectiveness, and R&D exert positive influences. The machine learning results further indicate that GDP, health expenditure, and education are the three most critical predictors of LEX in both Gradient Boosting and Random Forest models, with GDP emerging as the most dominant factor. Institutional variables such as rule of law, government effectiveness, and R&D display moderate importance, while CO₂ emissions and individual internet use consistently rank as the least influential. In terms of predictive performance, Gradient Boosting outperforms Random Forest across evaluation metrics, demonstrating lower errors and higher explanatory power. In light of these findings, this study also provides important policy implications to enhance LEX.

Keywords: economic, social, Institutional, Life Expectancy, Machine leaning

Received: 18 Aug 2025; Accepted: 31 Oct 2025.

Copyright: © 2025 Zhang, Qiao, Li, Uddin, Zhang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Xiaolan Zhang, zhangxiaolan2025@126.com
Ruitao Li, ruitaohoho@outlook.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.