Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Nephrology

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1618222

This article is part of the Research TopicDisparities in Nephrotic Syndrome OutcomesView all 6 articles

Integration of Machine Learning and Large Language Models for Screening and Identifying Key Risk Factors of Acute Kidney Injury after Cardiac Surgery

Provisionally accepted
Zishan  LiZishan Li1Aiping  WuAiping Wu2Xunying  ZhangXunying Zhang1Tao  LiuTao Liu1*Lei  WangLei Wang3*
  • 1North China University of Science and Technology, Tangshan, China
  • 2School of Basic Medical Sciences, North China University of Science and Technology, Tangshan, Hebei Province, China
  • 3Affiliated Hospital of North China University of Science and Technology, Tangshan, Hebei Province, China

The final, formatted version of the article will be published soon.

Objective: This study aimed to identify critical risk factors for acute kidney injury (AKI) following cardiac surgery. By integrating patient data from the MIMIC-IV database with large language models (LLMs) and machine learning algorithms, we ensured the clinical relevance of the selected risk factors, providing robust insights for the early identification and intervention of postoperative AKI. Methods: ICU data of patients from the MIMIC-IV database undergoing cardiac surgery were analyzed. Lasso regression and random forest algorithms were used to select significant predictive features from high-dimensional data. Model evaluation involved 10-fold cross-validation and metrics including accuracy, sensitivity, specificity, and the area under the curve. To enhance clinical relevance, LLMs-simulated expert judgment in cardiology and nephrology, which was further validated through discussions with clinical experts. Results: In the cohort consisting of 4,565 patients, a total of 113 important and shared risk factors for AKI were identified, including variables such as anion gap, arterial partial pressure of oxygen (PaO₂), and fraction of inspired oxygen (FiO₂). Among these, 18 key variables were identified as postoperative AKI predictors via machine learning and LLMs-simulated expert validation. These included anchor age, Creatinine (serum), BUN (Blood Urea Nitrogen), Potassium (serum), Sodium (serum), Lactic Acid, Troponin-T, Furosemide (Lasix), Vancomycin (Random), Gentamicin (Trough), Albumin 5%, ART BP Mean, Cardiac Output (thermodilution), Brain Natriuretic Peptide (BNP), Absolute Count - Lymphs, Absolute Count - Monos, and Absolute Count - Neuts. The integration of LLMs with machine learning algorithms proved effective in accurately identifying clinically relevant risk factors. Conclusion: The proposed risk prediction approach for postoperative AKI following cardiac surgery, based on the collaborative analysis of machine learning and large language models (LLMs), effectively identified and validated key clinical risk factors. By simulating expert clinical reasoning, the LLMs significantly enhanced the medical relevance of feature selection and improved the clinical interpretability of the model. This approach provides a solid theoretical and practical foundation for the precise early identification and clinical intervention of postoperative AKI in cardiac surgery patients.

Keywords: acute kidney injury (AKI), Large Language Models (LLMs), LASSO regression, random forest, MIMIC- IV database

Received: 25 Apr 2025; Accepted: 23 Sep 2025.

Copyright: © 2025 Li, Wu, Zhang, Liu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Tao Liu, liutaocreate@gmail.com
Lei Wang, leilei5533@sina.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.