Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health

Sec. Occupational Health and Safety

This article is part of the Research TopicBioconvergence: A New Frontier for Understanding and Enhancing Human Adaptations to Extreme EnvironmentsView all 5 articles

Data-Driven Identification of Metabolic and Cardiovascular Biomarkers in High-Altitude Workers: A Machine Learning Approach

Provisionally accepted
Ricardo  JorqueraRicardo Jorquera1Guillermo  DroppelmannGuillermo Droppelmann2*Gonzalo  BlancoGonzalo Blanco1Max  DollmannMax Dollmann1Ignacio  AhumadaIgnacio Ahumada1Felipe  FeijooFelipe Feijoo3*
  • 1Workmed, Santiago, Chile
  • 2Clínica MEDS, Santiago, Chile
  • 3Pontificia Universidad Catolica de Valparaiso, Valparaíso, Chile

The final, formatted version of the article will be published soon.

Background: Workers in high-altitude mining settings face increased cardiometabolic risk due to chronic exposure to low oxygen levels. Traditional fitness-for-work (FFW) assessments often evaluate biomarkers in isolation, missing relevant health patterns. Aim: To improve the risk stratification of the FFW status in high-altitude workers by identifying relevant biomarkers through ML models. Methods: A retrospective cohort of 420,966 preemployment examination records, corresponding to 89,149 workers between 2021 and 2024 was analyzed. Workers were classified as fit or unfit for work, in each of their medical examinations, according to national guidelines. Several supervised ML models were applied, including random forests (RF), support vector machines, k-nearest neighbors, and decision trees, to identify relevant predictors of FFW. Logistic regression was performed to assess statistical associations between biomarkers and fitness outcomes. Results: Among the 420,966 preemployment examination records, 48,783 were particularly assessed for fitness for high-altitude work. Among these, 8% were classified as unfit for high-altitude work. Significant predictors included body mass index (BMI), blood glucose, triglycerides, and systolic blood pressure. The Random Forest (RF) model outperformed SVM and KNN, achieving the highest predictive performance with an accuracy of 0.89, sensitivity of 0.92, and specificity of 0.83. Multivariate logistic regression confirmed BMI as the strongest predictor (OR 2.640, p < 0.001), followed by glucose (OR 2.000, p < 0.001), triglycerides (OR 1.461, p < 0.001), systolic blood pressure (OR 1.380, p < 0.001), smoker (OR 1.125, p < 0.002). Conclusion: ML models can effectively identify critical health indicators related to FFW in high-altitude environments. These tools offer the potential to improve occupational health assessments and support preventive decision making in vulnerable worker populations.

Keywords: Artificial Intelligence1, cardiovascular biomarkers2, fitness-for-work3, high-altitude4, machine learning

Received: 23 Jun 2025; Accepted: 18 Nov 2025.

Copyright: © 2025 Jorquera, Droppelmann, Blanco, Dollmann, Ahumada and Feijoo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Guillermo Droppelmann, guillermo.droppelmann@meds.cl
Felipe Feijoo, felipe.feijoo@pucv.cl

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.