Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Systems Endocrinology

Volume 16 - 2025 | doi: 10.3389/fendo.2025.1599028

Establishment and Evaluation of a Model for Clinical Feature Selection and Prediction in Gout Patients with Cardiovascular Diseases: A Retrospective Cohort Study

Provisionally accepted
Bingbing  FanBingbing Fan1Yuqing  YeYuqing Ye1Zihan  WangZihan Wang2Yuanyuan  XuYuanyuan Xu3Meishan  LuMeishan Lu2Fang  MaFang Ma1*
  • 1Xiyuan Hospital, China Academy of Chinese Medical Sciences, Beijing, China
  • 2Beijing University of Chinese Medicine, Beijing, Beijing Municipality, China
  • 3Heilongjiang University of Chinese Medicine, Harbin, Heilongjiang Province, China

The final, formatted version of the article will be published soon.

Background: Gout is a chronic inflammatory condition increasingly recognized as a risk factor for cardiovascular events (CVE). Early identification of high-risk individuals is crucial for targeted prevention and management. However, conventional risk stratification approaches often fall short in accuracy and clinical utility. This study aimed to develop and validate a robust, interpretable machine learning (ML)-based model for predicting CVE in patients with gout. Results: Of the 686 patients, 263 experienced cardiovascular events during follow-up (incidence rate: 38.3%). A logistic regression model was constructed based on eight variables selected using the Boruta feature selection algorithm: sex, age, PLT, EOS, LYM, CO2, GLU and APO-B. Among the five models evaluated, the CatBoost classifier achieved the best performance, with the highest area under the ROC curve (AUC) of 0.976 and the recall of 0.971. Furthermore, SHAP (SHapley Additive exPlanations) values were employed to provide both global and individual-level interpretability of the CatBoost model. To assess the model's generalization performance, bootstrap resampling was performed 10 times. Based on these results, the standard error was improved using machine learning-based enhancement methods, thereby optimizing the model’s robustness and predictive stability.

Keywords: Gout, cardiovascular events, Prediction nomogram, machine learning (ML), Nomo gram

Received: 24 Mar 2025; Accepted: 15 Sep 2025.

Copyright: © 2025 Fan, Ye, Wang, Xu, Lu and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Fang Ma, anjanette49@126.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.