ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 05 January 2026

Sec. Precision Cardiology

Volume 12 - 2025 | https://doi.org/10.3389/fcvm.2025.1674287

Explainable machine learning model of coronary artery disease combined with diabetes: development and validation study

  • 1. Department of Emergency ICU, The Affiliated Hospital of Qinghai University, Xining, Qinghai, China

  • 2. Department of Cardiovascular Medicine, The First Hospital of Jilin University, Changchun, China

  • 3. Department of Neurology, The Second Hospital of Lanzhou University, Lanzhou, Gansu, China

  • 4. Department of Biobank, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China

Article metrics

View details

986

Views

68

Downloads

Abstract

Background:

Coronary artery disease (CAD) demonstrates a strong bidirectional association with diabetes mellitus, which not only elevates cardiovascular disease risk but also correlates with poorer clinical prognosis. Prognostication in patients with comorbid CAD and diabetes remains a critical clinical challenge, significantly influencing therapeutic decision-making. Leveraging readily available clinical parameters for predicting adverse outcomes in this population offers substantial clinical value. This investigation employs machine learning algorithms to develop predictive models for prognostic assessment in CAD patients with diabetes comorbidity.

Method:

We conducted a retrospective cohort study of 389 patients with comorbid coronary artery disease (CAD) and diabetes mellitus. The cohort was randomly allocated into a training set (n = 273) and an internal validation set (n = 116). Feature selection utilized LASSO regression followed by backward stepwise Cox regression analysis. A nomogram incorporating independent predictors was developed for clinical application. Model performance was assessed through discrimination metrics, calibration plots, and decision curve analysis (DCA). Random survival forest analysis validated the clinical significance of selected variables.

Result:

Our modeling approach employed a systematic methodology: LASSO regression for initial feature selection followed by backward stepwise Cox regression analysis, yielding eight independent predictors.The final model incorporated hemoglobin, INR, albumin, NT-proBNP, age, fibrinogen, diuretic use, and digitalis therapy. The integrated model demonstrated strong discriminative performance for mortality prediction across both training (AUC = 0.846, 0.838, 0.82) and validation cohorts (AUC = 0.824, 0.813, 0.798) at 3-, 5-, and 8-year intervals. Calibration plots and decision curve analysis confirmed model reliability and clinical utility over time. A nomogram was developed to facilitate individualized risk stratification. Kaplan–Meier analysis showed significant survival stratification by individual predictors, and restricted cubic spline analysis identified non-linear associations between continuous variables and mortality. Random survival forest analysis prioritized five key predictors (hemoglobin, INR, albumin, NT-proBNP, age). Comparative evaluation against the 9-variable model confirmed superior performance of the comprehensive model across all timepoints.

Conclusion:

Our multimodal prognostic model demonstrated robust performance in predicting all-cause mortality among patients with coronary artery disease and diabetes comorbidity. The nomogram's capacity for personalized risk estimation offers potential utility in clinical decision-making and patient stratification.

Introduction

The primary cause of death worldwide and a significant contributor to disability is coronary artery disease (CAD) (1). Diabetic patients are more likely to experience severe acute coronary syndromes and have worse outcomes than non-diabetic patients (2, 3). Although microvascular complications substantially impact diabetes-related morbidity, mortality primarily stems from macrovascular manifestations including CAD and cerebrovascular events (4). This clinical landscape underscores the critical need for accurate risk stratification tools to inform therapeutic strategies and healthcare resource utilization.

Conventional prognostic models have predominantly utilized isolated parameters including natriuretic peptide levels, glycated hemoglobin A1c (HbA1c), and New York Heart Association (NYHA) functional classification (5, 6). Established risk assessment tools include the Framingham Risk Score (FRS) in North America and the Systematic Coronary Risk Evaluation (SCORE) model in Europe (7, 8). However, these models demonstrate limited discriminative capacity for CAD mortality prediction and insufficient integration of modern prognostic factors for acute coronary syndrome (9, 10). Furthermore, FRS applicability shows significant variability across ethnic and geographic populations. Most current prognostic models focus on identifying high-risk subgroups rather than providing comprehensive risk stratification. Emerging biomarkers like the triglyceride-glucose (TyG) index show promise for predicting major adverse cardiovascular events (MACE) in post-CABG patients with CAD and diabetes, potentially enabling targeted monitoring and preventive interventions (11). Notably, validated clinical scoring systems for long-term prognosis assessment in comorbid CAD and diabetes populations remain largely unavailable.

The rapid advancement of artificial intelligence has enabled increasing machine learning applications in clinical prediction. Supervised learning algorithms enhance prognostic accuracy by iteratively optimizing model parameters against clinical outcome data (12). This technological evolution creates new opportunities for developing precision prognostic algorithms in diabetes-associated CAD management.

Our multimodal approach comprehensively integrates demographic profiles, clinical histories, pharmacological data, and echocardiographic parameters to enhance risk assessment in comorbid CAD-diabetes patients. We hypothesize this multidimensional strategy will surpass conventional single-parameter models in prognostic precision, ultimately improving clinical risk categorization and therapeutic decision-making.

Methods

Study design and participants

Out of 834 patients with CAD and diabetes, 389 met the inclusion and exclusion criteria and were included in the final analysis. This study analyzed clinical data from 389 CAD and diabetes patients treated between December 2014 and December 2024. The primary outcome was cardiovascular death. The study included 389 participants who satisfied the following inclusion criteria and had both diabetes and CAD: (1) aged over 18 years old; (2) diagnosed with CAD according to the European Society of Cardiology (ESC) diagnosis and treatment guidelines (13): a pathological process characterized by atherosclerotic plaque accumulation in the epicardial arteries, whether obstructive or non-obstructive; (3) fasting plasma glucose (FPG) ≥ 7 mmol/L (126 mg/dL) is introduced as the criterion for diagnosis of diabetes (14). Exclusion criteria consisted of (1) Acute or chronic infections are present concurrently; (2) the patient has a tumor, autoimmune illness, or hematological disease; (3) and the patient's information is lacking. Survival outcomes were verified through structured telephone interviews or clinical follow-up documentation.This study adhered to the Declaration of Helsinki and was approved by the Ethics Committee of the Second Hospital of Lanzhou University (2024A-1270), with a waiver of informed consent.

Data acquisition

At baseline, all participants underwent comprehensive demographic evaluation including sex, age, and clinical histories (hypertension, diabetes mellitus, atrial fibrillation, coronary heart disease, anemia, smoking status, and alcohol use). Medication profiles were documented, encompassing antiplatelet agents, anticoagulants, statins, nitrates, diuretics, and cardiac remodeling therapies. Standardized laboratory assessments included fasting glucose, hemoglobin, lipid profile (total cholesterol, triglycerides, HDL, LDL), renal function (serum creatinine), hepatic enzymes (AST, ALT), albumin, INR (International Normalized Ratio), CRP, and NT-proBNP (N-terminal pro B-type Natriuretic Peptide). Supplementary cardiovascular evaluations comprised 12-lead ECG and echocardiographic parameters.

In the feature selection process, we used LASSO regression to identify significant predictors of patient outcomes. We included time-dependent covariates and interaction terms in the LASSO regression. Conditions like diabetes and hypertension develop over time due to long-term factors. To account for this, we incorporated time-dependent variables to reduce confounding from the chronic nature of these diseases. Additionally, for medication use, we classified patients as “users” if they had been regularly prescribed medications for coronary artery disease and diabetes for six months or more, defining “non-users” as those who did not meet this threshold. This approach allowed us to effectively capture the temporal relationships and reduce potential biases in our analysis. Although time-dependent covariates were considered during variable selection, the final prognostic model was a baseline Cox proportional hazards model. Medication exposure variables (e.g., diuretics and digoxin) were constructed using prescription records from the first six months after the index date; patients who continuously received these medications for at least six months were classified as “users”, whereas those who did not meet this threshold were classified as “non-users”. Consequently, a formal time-dependent Cox model was not fitted and only baseline covariates were used in the final analysis.

The final cohort comprised 389 patients with confirmed CAD and diabetes. The training set was defined as 116 patients, and their early morning fasting peripheral blood samples were retained for the test of laboratory examination results. The training set consisted of 273 patients, as shown in Figure 1.

Figure 1

Flowchart detailing the study design for a coronary artery disease and diabetes mellitus retrospective study. It starts with 834 individuals with CAD. Exclusions are due to age, diagnosis, fasting plasma glucose, existing infections, tumors, autoimmune or hematologic diseases, and missing information, narrowing to a final cohort of 389. The cohort is divided into a training set of 273 and a validation set of 116. LASSO regression, Cox feature screening, and random survival forest analysis are used. Model validation assesses discrimination, calibration, and clinical usefulness for three, five, and eight-year survival.

Flow chart.

DStatistical analysis and model development

All statistical analyses and model construction were done using R software (version 4.3.1). The survival and rms packages were used for Cox proportional hazards modelling, random ForestSRC was used to construct the random survival forest, time-ROC and risk Regression were used to estimate time-dependent ROC curves and AUCs, rms and pec were used to generate calibration curves, and dcurves and rmda were used for decision-curve analysis. A summary of functions and packages is provided in Supplementary Table S7. Group comparisons employed χ2 tests for categorical variables and Mann–Whitney U-tests for nonparametric continuous data. Statistical significance was determined using two-tailed α = 0.05. for the handling of missing data and continuous predictors, continuous variables with a missing rate of less than 5% were imputed using the random forest method. Feature selection followed a two-step process: LASSO regression with 10-fold cross-validation was first applied to the training cohort to identify potential predictors, followed by backward stepwise Cox regression. The variance inflation factor (VIF) was used to assess multicollinearity among selected variable. Several methods were used to assess the model's performance. R software (version 4.3.1; R Foundation for Statistical Computing, Vienna, Austria) was used for all statistical analyses. Categorical variables were compared using χ2 tests, and continuous variables with non-normal distribution were compared using the Mann–Whitney U-test. A two-tailed alpha of 0.05 was considered statistically significant. Feature selection followed a two-step process: first, LASSO regression with 10-fold cross-validation was applied to identify predictors. These variables chosen by LASSO were then subjected to backward stepwise Cox regression analysis. The variance inflation factor (VIF) was used to evaluate multicollinearity among the final variables that were chosen. Several methods were used to assess the model's performance. Time-dependent receiver operating characteristic (ROC) curves for 3-, 5-, and 8-year mortality forecasts in both training and validation sets were used to evaluate discrimination. These time horizons were selected to reflect short−, intermediate− and long−term follow−up. The median follow−up (survival) time in our cohort was 4.68 years (interquartile range 0.55–8.71 years), so 5 years approximated the mid-point of follow−up and 8 years corresponded to the upper quartile. As shown in Supplementary Figure S6, follow−up durations ranged from less than 1 year to more than 10 years, with a small peak of patients completing long−term follow−up around 8 years. At 5 years, 184 participants were still under observation, and 111 participants remained at risk at 8 years (Supplementary Figure S6). Evaluating the model at 3, 5 and 8 years therefore captures short−, medium− and longer−term risk while maintaining adequate numbers at risk for reliable estimates. To evaluate the degree of agreement between expected and observed results, calibration curves were plotted. Decision curve analysis (DCA) was used to assess the model's clinical utility. Using Kaplan–Meier survival analyses, the prognostic value of categorical factors was evaluated. The “survminer” package was used to find the best cutoff values for continuous variables. Using (restricted cubic splines) RCS with three knots positioned at the 10th, 50th, and 90th percentiles to evaluate possible non-linear connections, the relationships between continuous variables and mortality were investigated. To offer a useful tool for predicting personal risk, a nomogram was created. The top 8 most significant factors from our 9 predictors were determined by additional analysis using a random survival forest with 1,000 trees. Iterative testing from 1 to 75 was used to establish the ideal node size. We next performed comparative studies between models built utilizing these top 8 variables and our 9-variable model. Time-dependent ROC curves, calibration plots, and decision curve analyses at 3, 5, and 8-year time points in both training and validation cohorts were used to compare the models. Internal validation was performed via bootstrap resampling: we estimated optimism−corrected concordance index (C−index) values and computed 95% bias−corrected and accelerated (BCa) confidence intervals from the bootstrap resamples.

Results

Patient characteristics and outcomes

The baseline clinical characteristics, treatment of the total cohort (N = 389) are detailed in Supplementary Table S1. Of the 389 patients with CAD combined with diabetes, the median age was 86.00 [82.00;90.00] years, including 299 (76.86%) males and 90 (23.14%) females. The median body mass index (BMI) was 24.46 [22.32;26.75] kg/m2. The most common comorbidities were hypertension (341, 87.66%), anemia (172, 44.22%), chronic heart failure (127, 32.65%), and chronic kidney disease (121, 31.11%). In terms of laboratory tests, the median NT-proBNP level was 0.5 × 103 pg/mL (IQR: 0.18–1.91), and the median HDL level was 1.01 mmol/L (0.84; 1.19). Baseline characteristics are shown in Table 1. The cohort was randomly allocated into a training set (n = 273) and an internal validation set (n = 116). No substantial differences were observed between the training group and the validation group across the majority of features, it indicates that the baseline characteristics of the training cohort and the validation cohort are generally comparable. Supplementary Figure S6 illustrates the distribution of follow-up durations and the number of participants at risk: follow-up durations ranged from less than 1 year to more than 10 years, with a median follow-up time of 4.68 years; 184 participants remained under follow-up at 5 years, and 111 participants remained at risk at 8 years.

Table 1

Characteristics ALL N = 389 Validation group N = 116 Training group N = 273 P value
Death 0.336
 No 90 (23.14%) 31 (26.72%) 59 (21.61%)
 Yes 299 (76.86%) 85 (73.28%) 214 (78.39%)
Survival time (year) 4.68 [0.55;8.71] 5.51 [0.81;8.77] 3.77 [0.50;8.23] 0.233
Gender 1.000
 Female 45 (11.57%) 13 (11.21%) 32 (11.72%)
 Male 344 (88.43%) 103 (88.79%) 241 (88.28%)
Age 86.00 [82.00;90.00] 86.00 [84.00;89.00] 87.00 [82.00;90.00] 0.941
BMI (kg/m2) 24.46 [22.32;26.75] 23.88 [21.73;26.33] 24.62 [22.53;27.05] 0.077
HR (bpm) 72.00 [65.00;80.00] 72.00 [65.00;80.00] 72.00 [65.00;80.00] 0.996
mSBP (mmHg) 135.40 (13.41) 132.98 (12.51) 136.43 (13.66) 0.016
mDBP (mmHg) 68.60 [62.80;73.60] 68.75 [61.60;73.23] 68.60 [63.33;74.00] 0.335
Smoke 0.364
 Never/ Former smoker 243 (62.47%) 68 (58.62%) 175 (64.10%)
 Current smoker 146 (37.53%) 48 (41.38%) 98 (35.90%)
Hypertension 0.950
 No 48 (12.34%) 15 (12.93%) 33 (12.09%)
 Yes 341 (87.66%) 101 (87.07%) 240 (87.91%)
CHD 0.889
 No 268 (68.89%) 81 (69.83%) 187 (68.50%)
 Yes 121 (31.11%) 35 (30.17%) 86 (31.50%)
CHF 0.426
 No 262 (67.35%) 82 (70.69%) 180 (65.93%)
 Yes 127 (32.65%) 34 (29.31%) 93 (34.07%)
CKD 1.000
 No 240 (61.70%) 72 (62.07%) 168 (61.54%)
 Yes 149 (38.30%) 44 (37.93%) 105 (38.46%)
AF 0.993
 No 317 (81.49%) 94 (81.03%) 223 (81.68%)
 Yes 72 (18.51%) 22 (18.97%) 50 (18.32%)
Anemia 0.533
 No 217 (55.78%) 68 (58.62%) 149 (54.58%)
 Yes 172 (44.22%) 48 (41.38%) 124 (45.42%)
Glucose 5.80 [4.98;7.18] 5.53 [4.90;6.69] 5.84 [5.12;7.40] 0.053
NT_proBNP_1,000 (pg/mL/1,000) 0.50 [0.18;1.91] 0.32 [0.14;1.77] 0.56 [0.20;2.00] 0.062
Hb (g/L) 122.00 [107.00;134.00] 122.00 [106.75;134.00] 121.00 [107.00;134.00] 0.690
TC (mmol/L) 3.65 [3.16;4.28] 3.59 [3.06;4.34] 3.69 [3.20;4.28] 0.496
TG (mmol/L) 1.32 [0.99;1.92] 1.32 [0.92;1.97] 1.32 [1.01;1.82] 0.898
HDL (mmol/L) 1.01 [0.84;1.19] 1.02 [0.82;1.23] 1.01 [0.85;1.18] 0.914
LDL (mmol/L) 2.03 [1.54;2.52] 2.01 [1.58;2.52] 2.03 [1.51;2.51] 0.994
Albumin (g/L) 38.50 [35.00;40.60] 38.50 [35.88;41.32] 38.40 [35.00;40.50] 0.220
Scr (μmol/L) 90.80 [72.00;123.00] 89.00 [71.25;126.88] 91.10 [72.10;122.70] 0.960
UA (μmol/L) 330.50 [237.60;420.00] 336.55 [235.92;418.52] 328.60 [241.20;420.00] 0.846
Na (mmol/L) 139.00 [136.00;141.00] 139.50 [136.00;142.00] 139.00 [136.00;141.00] 0.191
K (mmol/L) 4.00 [3.78;4.31] 4.00 [3.75;4.38] 4.00 [3.79;4.31] 0.898
Ca (mmol/L) 2.23 [2.13;2.32] 2.24 [2.14;2.34] 2.22 [2.12;2.31] 0.211
P (mmol/L) 1.10 [0.96;1.25] 1.10 [0.94;1.21] 1.10 [0.97;1.27] 0.402
NLratio 2.48 [1.80;4.09] 2.40 [1.76;4.09] 2.62 [1.80;4.04] 0.457
Fibrinogen (g/L) 3.49 [2.93;4.22] 3.37 [2.87;4.18] 3.57 [2.97;4.23] 0.218
INR 1.08 [1.01;1.15] 1.08 [1.02;1.13] 1.08 [1.01;1.16] 0.904
CRP (mg/L) 0.68 [0.18;2.73] 0.46 [0.12;3.11] 0.74 [0.22;2.70] 0.212
eGFR (mL/min/1.73 m2) 67.35 (24.40) 66.67 (23.81) 67.64 (24.68) 0.716
IVS (mm) 11.00 [10.00;12.00] 11.00 [10.00;12.00] 11.00 [10.00;11.00] 0.352
LVPW (mm) 10.00 [10.00;11.00] 10.00 [10.00;11.00] 10.00 [10.00;11.00] 0.948
LVESD (mm) 33.00 [31.00;35.00] 33.00 [31.00;35.00] 33.00 [31.00;35.00] 0.975
LVEDD (mm) 49.00 [47.00;51.00] 49.00 [46.00;51.00] 49.00 [47.00;51.00] 0.425
LVEF (%) 59.00 [55.00;62.00] 59.00 [55.75;62.00] 59.00 [54.00;62.00] 0.595
LVMI (g/m2) 32.00 [29.00;34.00] 32.00 [29.75;34.00] 32.00 [29.00;33.00] 0.544
Aspirin 0.916
 No 208 (53.47%) 63 (54.31%) 145 (53.11%)
 Yes 181 (46.53%) 53 (45.69%) 128 (46.89%)
Clopidogrel 0.430
 No 144 (37.02%) 39 (33.62%) 105 (38.46%)
 Yes 245 (62.98%) 77 (66.38%) 168 (61.54%)
β-blocker 0.825
 No 106 (27.25%) 33 (28.45%) 73 (26.74%)
 Yes 283 (72.75%) 83 (71.55%) 200 (73.26%)
CCB 0.695
 No 111 (28.53%) 31 (26.72%) 80 (29.30%)
 Yes 278 (71.47%) 85 (73.28%) 193 (70.70%)
Nitrate 0.391
 No 50 (12.85%) 18 (15.52%) 32 (11.72%)
 Yes 339 (87.15%) 98 (84.48%) 241 (88.28%)
ACEI/ARB 0.931
 No 174 (44.73%) 51 (43.97%) 123 (45.05%)
 Yes 215 (55.27%) 65 (56.03%) 150 (54.95%)
Statins 0.808
 No 129 (33.16%) 40 (34.48%) 89 (32.60%)
 Yes 260 (66.84%) 76 (65.52%) 184 (67.40%)
Diuretics 0.738
 No 151 (38.82%) 47 (40.52%) 104 (38.10%)
 Yes 238 (61.18%) 69 (59.48%) 169 (61.90%)
Digitalis 0.180
 No 318 (81.75%) 100 (86.21%) 218 (79.85%)
 Yes 71 (18.25%) 16 (13.79%) 55 (20.15%)

Baseline characteristics of the training and validation groups.

Prediction model built based on lasso-cox regression

Parameters were screened using Lasso regression, and Figure 2A displayed the variation features of the variables' coefficients. After applying the 10-fold cross-validation approach to the iterative analysis, a model with a low number of variables and outstanding performance was achieved when λ was 0.063 (Log λ = −1.20) (Figure 2B). Age, CRP, Hb, ALB, INR, NT-proBNP, fibrinogen, digitalis, and diuretics were among the factors that were screened. Lasso regression-screened parameters were used to further develop the Cox regression model (Table 2). Detailed coefficients of all selected variables are presented in Supplementary Table S2. Variance inflation factor analysis confirmed the absence of significant multicollinearity among these predictors (Supplementary Table S3).

Figure 2

Graph A shows a coefficient path diagram with various colored lines representing coefficients against log Lambda. Graph B displays a plot of partial likelihood deviance versus log Lambda, with red dots indicating changes and vertical dotted lines marking specific log Lambda values.

Feature selection using LASSO regression. (A) The variation characteristics of the coefficient of variables; (B) the selection process of the optimum value of the parameter λ in the Lasso regression model by cross-validation method.

Table 2

Characteristics B SE HR CI Z P
Age 0.054 0.012 1.056 1.031–1.081 4.483 0
Hb −0.015 0.004 0.985 0.977–0.993 −3.525 0
Albumin −0.044 0.021 0.957 0.918–0.998 −2.053 0.04
Fibrinogen 0.201 0.064 1.222 1.079–1.385 3.146 0.002
INR 0.574 0.183 1.775 1.241–2.539 3.14 0.002
Diuretics 0.559 0.163 1.749 1.27–2.408 3.423 0.001
Digitalis 0.406 0.195 1.501 1.024–2.201 2.082 0.037
NT_proBNP_1,000 0.027 0.015 1.027 0.997–1.058 1.762 0.078

Cox proportional hazards regression to predict prognosis based on Lasso regression.

The predictive accuracy of the new prognostic model

Time-dependent receiver operating characteristic (ROC) curves for 3-, 5-, and 8-year death forecasts were used to assess the model's discriminative ability (Figure 3). These results were confirmed in the validation cohort, with AUCs of 0.824 (95% CI: 0.734–0.915), 0.813 (95% CI: 0.73–0.896), and 0.798 (95% CI: 0.72–0.877) for 3-, 5-, and 8-year mortality (Figure 3B). The training set's C-index was 0.753 (95% CI: 0.722–0.784), and the validation set's was 0.74 (95% CI: 0.685–0.795). These C-index estimates and 95% confidence intervals are optimism-corrected values derived using BCa bootstrap resampling; the validation cohort therefore represents an internal validation set.

Figure 3

ROC curves comparing model performance at three, five, and eight-year intervals. Chart A shows AUC values of 0.846, 0.838, and 0.82, while chart B shows AUC values of 0.824, 0.813, and 0.798. The x-axis represents 1-Specificity, and the y-axis represents Sensitivity. A dashed diagonal line indicates random chance.

The predictive accuracy and discrimination of the novel prognostic model. Time-dependent ROC in the training cohort (A) and validation cohort (B).

Model calibration assessment

Calibration plots comparing expected and observed survival probabilities at 3-, 5-, and 8-year time intervals were used to assess the model's calibration (Figure 4). All three time points’ calibration curves nearly matched the 45-degree diagonal line, showing that the anticipated probability and the actual observations agreed well.

Figure 4

Two scatter plots labeled A and B compare observed versus predicted overall survival (OS) at 3, 5, and 8 years. Plot A shows lines in red, green, and blue with error bars for each period. Plot B displays similar data but with a more linear trend. Both plots include a dashed diagonal reference line.

Calibration plots of the nomogram for 3-, 5-, and 8-year survival prediction. (A) Training cohort. (B) Validation cohort.

Clinical decision curve analysis

Within a reasonable range of threshold probabilities (roughly 0.1 to 0.6), the model outperformed both “treat all” and “treat none” strategies in the training cohort, demonstrating consistent net benefit across a range of threshold probabilities at 3-year (Figure 5A), 5-year (Figure 5C), and 8-year (Figure 5E) follow-up. At the 3-year (Figure 5B), 5-year (Figure 5D), and 8-year (Figure 5F) time points, the validation cohort showed comparable patterns of net benefit, indicating its possible therapeutic utility in directing treatment choices.

Figure 5

Graphs labeled A to F depict decision curve analyses. Each graph shows three lines representing \"Nomogram,\" \"Treat All,\" and \"Treat None\" against net benefit and threshold probability. Generally, \"Nomogram\" displays a curved path, while \"Treat All\" slopes down, and \"Treat None\" remains flat along the x-axis. Differences across graphs are subtle variations in slopes and curve shapes.

Decision curve analysis for the prediction model. Decision curves for 3-year (A,B), 5-year (C,D), and 8-year (E,F).

Construction of a prediction nomogram in training cohort

A nomogram for predicting 3-, 5-year and 8-years overall survival of coronary artery disease combined with diabetes patients that integrated the above 8 independent prognostic factors is shown in Figure 6. The sum of the points allotted to each predictor—which ranged from 0 to 400 points—was used to determine the total points. The nomogram score of age ranged from 60 to 100, making the largest contribution to the prognosis of coronary artery disease combined with diabetes.

Figure 6

A nomogram depicting various clinical factors including age, hemoglobin (Hb), albumin, fibrinogen, INR, diuretics, digitalis, and NT-proBNP levels. It calculates total points leading to a linear predictor score, followed by three-, five-, and eight-year survival probabilities.

The nomogram for predicting overall survival of coronary artery disease combined with diabetes patients.

Kaplan–Meier analysis

In the Kaplan–Meier analysis, the model displayed good capability in coronary artery disease combined with diabetes patients into high-risk and low-risk groups (Log-rank P < 0.0001) in the training set (Figure 7).

Figure 7

Kaplan-Meier survival curve comparing two groups: Group 2 High (red) and Group 2 Low (blue). The x-axis represents time, and the y-axis shows survival probability. Group 2 Low demonstrates higher survival probability over time. A significant difference between the groups is indicated with a p-value less than 0.0001. Below the graph, a table displays the number at risk for each group at various time points.

Kaplan–meier survival curves for continuous variables in the final model.

Random survival forest analysis

We use a training cohort to construct a random survival forest model in order to evaluate the significance of the variables and compare them with the Cox model. Hemoglobin and INR were the best predictors, according to variable importance analysis, followed by albumin, NT-proBNP, patient age, fibrinogen, diuretics and digitalis (Figure 8). After roughly 400 trees, the error rate tends to settle, suggesting that the model has sufficiently converged. In our analysis, we employed Random Survival Forest (RSF) to assess the contribution of each feature to predicting patient outcomes. Additionally, C-index values were compared for both the training and test datasets for both models (Supplementary Figure S3A). The results revealed comparable discriminatory performance, with RSF showing similar or slightly improved performance in both training and test sets. To further evaluate model performance, we compared the Brier scores of Random Survival Forest (RSF) and Cox proportional hazards (CoxPH) models, which are shown in Supplementary Figure S3B. The Brier score comparison demonstrated that RSF had slightly better calibration than CoxPH. To ensure the transparency and interpretability of the model, we utilized SHAP (Shapley Additive Explanations) values, which allow us to explain how each feature influences the model's output. Specifically, we used SHAP summary bar plots (Supplementary Figure S4A) and beeswarm plots (Supplementary Figure S4B) to visualize feature importance. Additionally, we generated SHAP partial dependence plots (Supplementary Figure S5) to explore the non-linear relationships between continuous features and the model's predictions. These plots provide insight into how variations in feature values affect model output, capturing the complex interactions that are often overlooked in traditional linear models. Hyperparameter settings of the random forest are provided in Supplementary Table S5.

Figure 8

Two graphs are displayed. The left graph is a line chart showing error rate decreasing with the increase in number of trees, indicating improved accuracy. The right chart is a bar graph displaying variable importance, with Hb, INR, and Albumin having the highest importance, followed by NT-proBNP_1000, Age, Fibrinogen, Diuretics, and Digitalis, each with decreasing importance.

Variable importance analysis from random survival forest model.

Model comparison across different time horizons

We conducted a comprehensive comparison of mortality prediction models across 3-, 5-, and 8-year intervals (Figure 9). In the training cohort, Model A (incorporating all 8 variables) demonstrated superior discriminative performance for 3-year mortality prediction compared to single-variable models (AUC = 0.846, 95% CI: 0.798–0.894). This advantage persisted for 5-year (AUC = 0.838, 95% CI: 0.792–0.884) and 8-year predictions (AUC = 0.82, 95% CI: 0.768–0.872). Model A maintained consistent performance in the validation cohort across all time points: 3-year AUC = 0.824 (95% CI: 0.734–0.915), 5-year AUC = 0.813 (95% CI: 0.73–0.896), and 8-year AUC = 0.798 (95% CI: 0.72–0.877). (Supplementary Figures S1, S2) Calibration plots demonstrated strong agreement between predicted and observed probabilities at all time intervals. Decision curve analysis consistently showed that Model A provided superior net benefit across a wider range of threshold probabilities compared to both simplified models and default strategies.

Figure 9

Graphs A and B show ROC curves for different models, illustrating varied AUC values. Graphs C and D depict concordance index trends over time. Graphs E and F present calibration plots comparing predicted to observed outcomes. Graphs G and H are decision curves showing net benefit across threshold probabilities for each model.

Discriminative performance of different models for 3-year mortality prediction. Model A: age + hemoglobin + albumin + fibrinogen + INR + diuretics + digoxin + NT-ProBNP/1,000, Model B: hemoglobin, Model C: INR, Model D: albumin, Model E: NT-ProBNP/1,000, and Model F: age. (A,B) time-dependent ROC curves; (C,D) C-index over time; (E,F) calibration plots; (G,H) decision curve analysis; (A,C,E,G) training cohort; (B,D,F,H) validation cohort.

Survival analysis based on key predictors

Kaplan–Meier analysis was used to evaluate survival differences, with stratification according to key predictors identified in the nomogram analysis (Figure 10). For continuous variables, optimal cutoff values were determined using statistical methods to maximize separation between risk groups (Supplementary Table S4). NT-proBNP (cutoff: 487.2 pg/mL), INR (cutoff: 1.12), age (cutoff: 85 years), fibrinogen (cutoff: 3.72 g/L), and hemoglobin (cutoff: 116 g/L) all demonstrated significant survival outcome stratification (P < 0.0001 for all). For categorical variables, the use of diuretics and digoxin also showed strong prognostic stratification (P < 0.0001).

Figure 10

Seven Kaplan-Meier survival plots labeled A to G compare survival probabilities over time for different groups, indicated in red and blue. Each plot includes shaded confidence intervals and p-values. A table below each plot displays the number at risk over time.

Kaplan–meier survival curves for continuous variables in the final model. Kaplan-Meier survival curves stratified by (A) NT-proBNP, (B) INR, (C) age, (D) fibrinogen, (E) hemoglobin, (F) diuretic use, and (G) digoxin use.

Nonlinear association analysis

To investigate the nonlinear relationships between continuous predictors and mortality risk, we conducted adjusted analyses of key predictors identified in multivariable Cox regression models, sequentially excluding each variable under investigation (Figure 11). In adjusted analyses, NT-proBNP, albumin, and INR demonstrated significant nonlinear associations with mortality (global P < 0.001, nonlinearity P < 0.001). Age and hemoglobin exhibited strong linear associations with mortality (global P < 0.001) with marginally significant nonlinear components (nonlinearity P = 0.541 and 0.182, respectively). These patterns remained consistent across both unadjusted and adjusted analyses, confirming the robustness of these relationships independent of other clinically significant predictors.

Figure 11

Five graphs (A to E) show hazard ratios with 95% confidence intervals for different variables: NT proBNP, INR, Albumin, age, and hemoglobin (Hb). Each graph has a red line indicating the hazard ratio and a pink shaded area representing the confidence interval. The y-axis label is \"Hazard ratio (95% CI)\", and the x-axis labels vary according to the variable. P-values for overall and nonlinear trends are noted in each graph.

Unadjusted restricted cubic spline analysis of continuous variables and mortality risk. Restricted cubic spline plots for (A) NT-proBNP, (B) INR, (C) albumin, (D) age, and (E) hemoglobin.

Discussion

In this comprehensive investigation of outcomes in patients with coronary artery disease and diabetes comorbidity, we developed and validated a multimodal prediction model. The model demonstrated robust predictive performance, with notable mortality discrimination accuracy and consistent calibration across various time intervals. The complementary nature of these parameters suggests that multimodal approaches may more effectively capture the complex pathophysiology underlying disease progression in this comorbid population. To facilitate clinical decision-making, we developed a nomogram based on our model, which allows clinicians to estimate a patient's survival probabilities at 3, 5, and 8 years. The nomogram incorporates key predictors such as age, hemoglobin levels, albumin, INR, and NT-proBNP, which are factors that clinicians commonly encounter in daily practice. By assigning a score to each of these variables, the nomogram calculates a total points score that can be translated into predicted survival probabilities at different time intervals. This tool can be integrated into clinical workflows as a decision-support system, assisting clinicians in risk-adjusted treatment planning tailored to individual patients. For instance, when faced with a patient exhibiting specific risk factors (e.g., advanced age or low albumin levels), a clinician can use the nomogram to estimate the patient's long-term survival prognosis, which may influence decisions regarding aggressive interventions, frequency of follow-up visits, or the need for specialized care.

Our study identified anemia as a significant prognostic predictor in patients with comorbid coronary artery disease and diabetes. Anemia, a known contributor to ischemic conditions, reduces blood oxygen capacity, causing myocardial hypoxia. Compensatory mechanisms such as increased heart rate and cardiac output exacerbate pre-existing myocardial ischemia (15). Furthermore, anemia frequently coexists with chronic kidney disease (CKD). Impaired renal erythropoietin (EPO) production in CKD patients significantly elevates anemia risk (16). This creates a pathophysiological cycle where anemia worsens renal hypoxia, accelerating kidney function deterioration. Importantly, renal insufficiency promotes toxin accumulation, intensifying systemic inflammatory responses and oxidative stress that compound cardiovascular damage (17). Current evidence demonstrates no conclusive benefit of anemia correction for improving clinical outcomes in cardiac patients (18). Robust clinical data indicate that erythropoiesis-stimulating agents (ESAs) provide neither symptomatic relief nor outcome improvement in those with mild-to-moderate anemia, while potentially increasing serious adverse risks (19).

Inflammation serves as a key driver in the initiation and progression of atherosclerosis (20). In diabetic patients, inflammatory responses exacerbate vascular endothelial damage, accelerating lipid deposition and plaque formation. Furthermore, these responses activate the coagulation system, elevating clotting factor levels and platelet activity, thereby increasing thrombotic risk. This reciprocal interaction between coagulation and inflammation creates a self-reinforcing cycle that accelerates atherosclerotic progression (21, 22). INR levels reflect underlying coagulation dysfunction and correlate with clinical outcomes. Elevated INR levels demonstrate strong prognostic value, showing consistent associations with adverse outcomes in cardiovascular disease patients (2325). The conversion of fibrinogen to insoluble fibrin and subsequent formation of stable clots represents the terminal phase of the coagulation cascade. Impaired fibrin clot architecture is widely recognized as both a prognostic marker in CAD patients and a potential indicator of thromboembolic risk (26, 27). Furthermore, hyperfibrinogenemia may constitute a key mechanism underlying elevated cardiovascular risk in T2DM populations (28, 29).

Albumin, the most abundant circulating protein in humans, exhibits critical physiological functions including anti-inflammatory effects, antioxidant activity, and inhibition of platelet aggregation (30). As the predominant antioxidant in whole blood, serum albumin derives its antioxidant capacity primarily from free cysteine residues. The prognostic significance of serum albumin in cardiovascular diseases stems principally from its associations with nutritional status and systemic inflammation (31). Albumin additionally modulates inflammatory and immune responses while maintaining vascular integrity (32). Novel albumin-based inflammatory indices such as the C-reactive protein-albumin-lymphocyte (CALLY) index (33) and C-reactive protein/albumin ratio (CAR) (34) enhance prognostic assessment by quantifying systemic inflammatory burden. Reduced serum albumin levels demonstrate associations with ischemic heart disease, heart failure, atrial fibrillation, stroke, and venous thromboembolism (35). In diabetic patients, albumin's antioxidant properties become compromised, acquiring pro-oxidant characteristics (36). Elena The work of Succurro et al. (37) reveals that reduced albumin in diabetes may upregulate Nox2 activity—a mediator of pro-inflammatory responses and vasoconstriction—potentially contributing to cardiovascular events.

Cardiac contraction primarily relies on energy derived from mitochondrial fatty acid oxidation, a process supported by the abundance of mitochondria in myocardial tissue. NT-proBNP serves as a key prognostic biomarker for risk stratification in heart failure patients (38). Pathological cardiac hypertrophy is characterized by reduced fatty acid oxidation alongside enhanced glucose uptake and glycolytic activity. In T2DM, insulin resistance restricts myocardial glucose utilization for energy production, while increased fatty acid oxidation, lipid accumulation, and oxygen consumption collectively impair cardiac function (39). The Malachias et al. study identified NT-proBNP as a significant predictor of mortality and cardiovascular events in diabetic populations (40). Incorporating both BNP and NT-proBNP into optimized risk prediction models enhances discriminative capacity beyond established cardiovascular outcome predictors (41). Serial NT-proBNP measurements demonstrate superior predictive value over single assessments for forecasting cardiovascular mortality or heart failure hospitalization in patients with coronary artery disease and T2DM (42).

Our study revealed that diuretic and digoxin use in patients with comorbid coronary artery disease and diabetes was associated with poorer clinical outcomes, likely attributable to their more complex comorbidities and disease progression toward heart failure. However, prior research has demonstrated that loop diuretic use in coronary artery disease patients without systolic heart failure or renal impairment correlates with increased cardiovascular and non-cardiovascular mortality (43). These findings emphasize the necessity for judicious prescription of these medications in patients lacking clear clinical indications.

Given the multifactorial, multisystem complexity of coronary artery disease (CAD) with comorbid diabetes, there is an urgent clinical need for precise prognostic models to guide therapeutic decision-making. Machine learning demonstrates significant potential in risk prediction, early diagnosis, and personalized management of CAD with diabetes. By analyzing extensive clinical datasets, these algorithms can identify complex patterns potentially overlooked by conventional statistical methods, offering novel insights for early intervention (4446). Our investigation employed a comprehensive multimodal approach with rigorous validation protocols and clinically relevant time frames. The model demonstrated robust discriminative performance and calibration accuracy across temporal intervals, suggesting practical utility for both short- and long-term risk stratification. However, limitations include the single-center study design with a restricted sample size (n = 389), which may limit generalizability, as well as the lack of external validation using multi-ethnic data. Future validation in larger, multicenter cohorts is required to confirm clinical applicability. Given the relatively small sample size of 389 patients and the inclusion of multiple predictors, the risk of overfitting is a concern, especially in a longitudinal study with potential patient loss to follow-up. To mitigate this, we employed Random Survival Forest (RSF), which reduces overfitting through ensemble learning, enhances model generalizability, and allows for the interpretability of predictor contributions. Another limitation is that we did not implement a formal time-dependent Cox model; medication exposures were aggregated over the first six months and treated as baseline covariates, so residual immortal time bias may remain. In addition to the methods used in this study, we acknowledge the potential benefits of exploring auxiliary model-based optimization methods and hybrid nonlinear identification algorithms, such as those discussed by Ali et al. (47, 48), which could offer further improvements in predictive accuracy. These advanced techniques are particularly beneficial for handling complex, non-linear relationships in clinical data, and could enhance the model's robustness and adaptability to diverse patient populations (49, 50). Specifically, the Runge-Kutta optimization algorithm for fractional input nonlinear output error systems, as well as Chameleon Swarm Optimization and Mountain Gazelle optimization algorithms, offer biologically inspired, auxiliary model-driven frameworks that can improve both interpretability and dynamic response estimation of medical survival models, enabling more precise and reliable predictions in clinical settings. We are actively considering these approaches for future iterations of our model to improve its performance in more dynamic clinical settings.

Conclusion

The advancement of machine learning has established random forest models as a robust methodology for developing medical prediction tools. Integration of refined biomarkers with artificial intelligence-driven models shows significant potential to enhance prognostic assessment in patients with coronary artery disease and comorbid diabetes. This study conducted a comprehensive analysis of clinical parameters associated with mortality risk in coronary artery disease patients with diabetes comorbidity. Our findings highlight the critical predictive value of hemoglobin, INR, albumin, NT-proBNP, age, fibrinogen, diuretic use, and digitalis therapy in risk stratification. The developed model demonstrates reliable short-term prognostic accuracy using a parsimonious set of clinical variables. This advancement holds promise for enabling personalized therapeutic strategies to optimize clinical outcomes in this high-risk population.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by The Ethics Committee of the Second Hospital of Lanzhou University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

XW: Formal analysis, Writing – review & editing, Writing – original draft, Funding acquisition, Conceptualization. SW: Writing – review & editing, Investigation, Writing – original draft. XL: Data curation, Writing – original draft. RX: Resources, Writing – original draft. SL: Writing – original draft, Software. BW: Validation, Writing – review & editing. YZ: Supervision, Writing – review & editing, Visualization.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was supported by the 2025 Graduate Research and Practice Innovation Project of Qinghai University. Item number was 2025-GMKY-14.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2025.1674287/full#supplementary-material

References

  • 1.

    Roth GA Mensah GA Johnson CO Addolorato G Ammirati E Baddour LM et al Global burden of cardiovascular diseases and risk factors, 1990–2019: update from the GBD 2019 study. J Am Coll Cardiol. (2020) 76(25):29823021. 10.1016/j.jacc.2020.11.010

  • 2.

    Li Y Liu Y Cui J Zhu M Wang W Chen K et al Oral-gut microbial transmission promotes diabetic coronary heart disease. Cardiovasc Diabetol. (2024) 23(1):123. 10.1186/s12933-024-02217-y

  • 3.

    Marx N Federici M Schütt K Müller-Wieland D Ajjan RA Antunes MJ et al 2023 ESC guidelines for the management of cardiovascular disease in patients with diabetes. Eur Heart J. (2023) 44(39):4043140. 10.1093/eurheartj/ehad192

  • 4.

    Goodarzi MO Rotter JI . Genetics insights in the relationship between type 2 diabetes and coronary heart disease. Circ Res. (2020) 126(11):152648. 10.1161/circresaha.119.316065

  • 5.

    Kragelund C Grønning B Køber L Hildebrandt P Steffensen R . N-terminal pro-B-type natriuretic peptide and long-term mortality in stable coronary heart disease. N Engl J Med. (2005) 352(7):66675. 10.1056/NEJMoa042330

  • 6.

    American Diabetes Association Professional Practice Committee. 2. Diagnosis and classification of diabetes: standards of care in diabetes-2024. Diabetes Care. (2024) 47(1):S2042. 10.2337/dc24-S002

  • 7.

    D'Agostino RB Sr. Vasan RS Pencina MJ Wolf PA Cobain M Massaro JM et al General cardiovascular risk profile for use in primary care: the framingham heart study. Circulation. (2008) 117(6):74353. 10.1161/circulationaha.107.699579

  • 8.

    Pavlović J Bos D Ikram MK Ikram MA Kavousi M Leening MJG . Guideline-directed application of coronary artery calcium scores for primary prevention of atherosclerotic cardiovascular disease. JACC Cardiovasc Imaging. (2025) 18(4):46575. 10.1016/j.jcmg.2024.12.008

  • 9.

    Cook NR Paynter NP Eaton CB Manson JE Martin LW Robinson JG et al Comparison of the framingham and reynolds risk scores for global cardiovascular risk prediction in the multiethnic Women’s health initiative. Circulation. (2012) 125(14):174856, s1–11. 10.1161/circulationaha.111.075929

  • 10.

    Selvarajah S Kaur G Haniff J Cheong KC Hiong TG van der Graaf Y et al Comparison of the framingham risk score, SCORE and WHO/ISH cardiovascular risk prediction models in an Asian population. Int J Cardiol. (2014) 176(1):2118. 10.1016/j.ijcard.2014.07.066

  • 11.

    Zhang H Chong H Li Z Li K Zhang B Xue Y et al Triglyceride-glucose index in the prediction of major adverse cardiovascular events in patients with type 2 diabetes mellitus after coronary artery bypass surgery: a retrospective cohort study. Front Endocrinol. (2022) 13:1015747. 10.3389/fendo.2022.1015747

  • 12.

    Wei S Guo X He S Zhang C Chen Z Chen J et al Application of machine learning for patients with cardiac arrest: systematic review and meta-analysis. J Med Internet Res. (2025) 27:e67871. 10.2196/67871

  • 13.

    Knuuti J Wijns W Saraste A Capodanno D Barbato E Funck-Brentano C et al 2019 ESC guidelines for the diagnosis and management of chronic coronary syndromes. Eur Heart J. (2020) 41(3):40777. 10.1093/eurheartj/ehz425

  • 14.

    Alberti KG Zimmet PZ . Definition, diagnosis and classification of diabetes mellitus and its complications. Part 1: diagnosis and classification of diabetes mellitus provisional report of a WHO consultation. Diabet Med. (1998) 15(7):53953. 10.1002/(sici)1096-9136(199807)15:7%3C539::Aid-dia668%3E3.0.Co;2-s

  • 15.

    da Silveira AD Ribeiro RA Rossini AP Stella SF Ritta HA Stein R et al Association of anemia with clinical outcomes in stable coronary artery disease. Coron Artery Dis. (2008) 19(1):216. 10.1097/MCA.0b013e3282f27c0a

  • 16.

    Zafrir B Leviner DB Saliba W Sharoni E . Prognostic interplay of chronic kidney disease, Anemia, and diabetes in coronary bypass surgery. Ann Thorac Surg. (2021) 111(1):94101. 10.1016/j.athoracsur.2020.04.124

  • 17.

    Carracedo J Alique M Vida C Bodega G Ceprián N Morales E et al Mechanisms of cardiovascular disorders in patients with chronic kidney disease: a process related to accelerated senescence. Front Cell Dev Biol. (2020) 8:185. 10.3389/fcell.2020.00185

  • 18.

    Maraey A Elzanaty A Elsharnoby H Younes M Salem M Khalil M . Restrictive vs liberal blood transfusion strategy in patients with symptomatic coronary artery disease: meta-analysis from randomized controlled trials. Eur Heart J. (2021) 42(1):ehab724.1257. 10.1093/eurheartj/ehab724.1257

  • 19.

    Kansagara D Dyer E Englander H Fu R Freeman M Kagen D . Treatment of anemia in patients with heart disease: a systematic review. Ann Intern Med. (2013) 159(11):74657. 10.7326/0003-4819-159-11-201312030-00007

  • 20.

    Soehnlein O Libby P . Targeting inflammation in atherosclerosis—from experimental insights to the clinic. Nat Rev Drug Discovery. (2021) 20(8):589610. 10.1038/s41573-021-00198-1

  • 21.

    Bryk AH Prior SM Plens K Konieczynska M Hohendorff J Malecki MT et al Predictors of neutrophil extracellular traps markers in type 2 diabetes mellitus: associations with a prothrombotic state and hypofibrinolysis. Cardiovasc Diabetol. (2019) 18(1):49. 10.1186/s12933-019-0850-0

  • 22.

    Babes EE Bustea C Behl T Abdel-Daim MM Nechifor AC Stoicescu M et al Acute coronary syndromes in diabetic patients, outcome, revascularization, and antithrombotic therapy. Biomed Pharmacother. (2022) 148:112772. 10.1016/j.biopha.2022.112772

  • 23.

    Tang Y Sun J Yu Z Liang B Peng B Ma J et al Association between prothrombin time-international normalized ratio and prognosis of post-cardiac arrest patients: a retrospective cohort study. Front Public Health. (2023) 11:1112623. 10.3389/fpubh.2023.1112623

  • 24.

    Delgado GE Zirlik A Gruber R Scheffold T Krämer BK März W et al The association of high-normal international-normalized-ratio (INR) with mortality in patients referred for coronary angiography. PLoS One. (2019) 14(8):e0221112. 10.1371/journal.pone.0221112

  • 25.

    Liu L Ying M Chen S Li Q Chen G Li H et al The association between prothrombin time-international normalized ratio and long-term mortality in patients with coronary artery disease: a large cohort retrospective study with 44,662 patients. BMC Cardiovasc Disord. (2022) 22(1):297. 10.1186/s12872-022-02619-4

  • 26.

    Ząbczyk M Ariëns RAS Undas A . Fibrin clot properties in cardiovascular disease: from basic mechanisms to clinical practice. Cardiovasc Res. (2023) 119(1):94111. 10.1093/cvr/cvad017

  • 27.

    Larsen JB Hvas AM . Fibrin clot properties in coronary artery disease: new determinants and prognostic markers. Pol Arch Intern Med. (2021) 131(11):16113. 10.20452/pamw.16113

  • 28.

    Ravi Kiran M Harshavardhan L . Fibrinogen levels in type 2 diabetes Mellitus and it’s correlation with microvascular microvascular complications. J Assoc Physicians India. (2022) 70(4):112. eng. Epub 2022/04/22. Cited in: Pubmed; PMID 35443328.

  • 29.

    Bruno G Cavallo-Perin P Bargero G Borra M D'Errico N Pagano G . Association of fibrinogen with glycemic control and albumin excretion rate in patients with non-insulin-dependent diabetes mellitus. Ann Intern Med. (1996) 125(8):6537. 10.7326/0003-4819-125-8-199610150-00005

  • 30.

    Manolis AA Manolis TA Melita H Mikhailidis DP Manolis AS . Low serum albumin: a neglected predictor in patients with cardiovascular disease. Eur J Intern Med. (2022) 102:2439. 10.1016/j.ejim.2022.05.004

  • 31.

    Don BR Kaysen G . Serum albumin: relationship to inflammation and nutrition. Semin Dial. (2004) 17(6):4327. 10.1111/j.0894-0959.2004.17603.x

  • 32.

    Tufoni M Baldassarre M Zaccherini G Antognoli A Caraceni P . Hemodynamic and systemic effects of albumin in patients with advanced liver disease. Curr Hepatol Rep. (2020) 19(3):14758. 10.1007/s11901-020-00521-1

  • 33.

    Ji H Luo Z Ye L He Y Hao M Yang Y et al Prognostic significance of C-reactive protein-albumin-lymphocyte (CALLY) index after primary percutaneous coronary intervention in patients with ST-segment elevation myocardial infarction. Int Immunopharmacol. (2024) 141:112860. 10.1016/j.intimp.2024.112860

  • 34.

    Di Rosa M Sabbatinelli J Giuliani A Carella M Magro D Biscetti L et al Inflammation scores based on C-reactive protein and albumin predict mortality in hospitalized older patients independent of the admission diagnosis. Immun Ageing. (2024) 21(1):67. 10.1186/s12979-024-00471-y

  • 35.

    Arques S . Human serum albumin in cardiovascular diseases. Eur J Intern Med. (2018) 52:812. 10.1016/j.ejim.2018.04.014

  • 36.

    Roche M Rondeau P Singh NR Tarnus E Bourdon E . The antioxidant properties of serum albumin. FEBS Lett. (2008) 582(13):17837. 10.1016/j.febslet.2008.04.057

  • 37.

    Succurro E Andreozzi F Carnevale R Sciacqua A Cammisotto V Cassano V et al Nox2 up-regulation and hypoalbuminemia in patients with type 2 diabetes mellitus. Free Radical Biol Med. (2021) 168:15. 10.1016/j.freeradbiomed.2021.03.026

  • 38.

    Jhund PS Anand IS Komajda M Claggett BL McKelvie RS Zile MR et al Changes in N-terminal pro-B-type natriuretic peptide levels and outcomes in heart failure with preserved ejection fraction: an analysis of the I-preserve study. Eur J Heart Fail. (2015) 17(8):80917. 10.1002/ejhf.274

  • 39.

    Malachias MVB Wijkman MO Bertoluci MC . NT-proBNP as a predictor of death and cardiovascular events in patients with type 2 diabetes. Diabetol Metab Syndr. (2022) 14(1):64. 10.1186/s13098-022-00837-6

  • 40.

    Malachias MVB Jhund PS Claggett BL Wijkman MO Bentley-Lewis R Chaturvedi N et al NT-proBNP by itself predicts death and cardiovascular events in high-risk patients with type 2 diabetes Mellitus. J Am Heart Assoc. (2020) 9(19):e017462. 10.1161/jaha.120.017462

  • 41.

    Wolsk E Claggett B Pfeffer MA Diaz R Dickstein K Gerstein HC et al Role of B-type natriuretic peptide and N-terminal prohormone BNP as predictors of cardiovascular morbidity and mortality in patients with a recent coronary event and type 2 diabetes Mellitus. J Am Heart Assoc. (2017) 6(6):e004743. 10.1161/jaha.116.004743

  • 42.

    Wolsk E Claggett B Diaz R Dickstein K Gerstein HC Køber L et al Risk estimates of imminent cardiovascular death and heart failure hospitalization are improved using serial natriuretic peptide measurements in patients with coronary artery disease and type 2 diabetes. J Am Heart Assoc. (2022) 11(8):e021327. 10.1161/jaha.121.021327

  • 43.

    Schartum-Hansen H Loeland KH Svingen GFT Nordrehaug JE Nygaard O . Use of loop diuretics is strongly associated with increased risk of death in coronary patients without systolic heart failure or renal impairment. Eur Heart J. (2013) 34(1):878. 10.1093/eurheartj/eht308.878

  • 44.

    Gautam N Mueller J Alqaisi O Gandhi T Malkawi A Tarun T et al Machine learning in cardiovascular risk prediction and precision preventive approaches. Curr Atheroscler Rep. (2023) 25(12):106981. 10.1007/s11883-023-01174-3

  • 45.

    Oikonomou EK Khera R . Machine learning in precision diabetes care and cardiovascular risk prediction. Cardiovasc Diabetol. (2023) 22(1):259. 10.1186/s12933-023-01985-3

  • 46.

    Ji Y Shang H Yi J Zang W Cao W . Machine learning-based models to predict type 2 diabetes combined with coronary heart disease and feature analysis-based on interpretable SHAP. Acta Diabetol. (2025) 62(10):163146. 10.1007/s00592-025-02496-1

  • 47.

    Samadi ME Guzman-Maldonado J Nikulina K Mirzaieazar H Sharafutdinov K Fritsch SJ et al A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals. Sci Rep. (2024) 14(1):5725. 10.1038/s41598-024-55577-6

  • 48.

    Ali MA Chaudhary NI Khan TA Mao W-L Lin C-C Raja MAZ . Design of key term separated identification model for fractional input nonlinear output error systems: auxiliary model based Runge Kutta optimization algorithm. Chaos, Solitons Fractals. (2024) 189:115696. 10.1016/j.chaos.2024.115696

  • 49.

    Ali MA Chaudhary NI Lin C-C Mao W-L . Mountain gazelle optimization algorithm for identification of nonlinear hammer-stein output error systems. J Innov Technol. (2025) 7(1):11724. 10.29424/JIT.202503_7(1).0010d

  • 50.

    Ali MA Chaudhary NI Khan TA Mao W-L Lin C-C Khan ZA et al Auxiliary model-based chameleon swarm optimization for robust parameter estimation of fractional order nonlinear hammerstein systems. J Comput Nonlinear Dyn. (2025) 20(9):091005. 10.1115/1.4068718

Summary

Keywords

coronary artery disease, diabetes mellitus, machine learning, mortality, prognostic model

Citation

Wang X, Wang S, Liu X, Xie R, Liang S, Wang B and Zhang Y (2026) Explainable machine learning model of coronary artery disease combined with diabetes: development and validation study. Front. Cardiovasc. Med. 12:1674287. doi: 10.3389/fcvm.2025.1674287

Received

27 July 2025

Revised

01 December 2025

Accepted

04 December 2025

Published

05 January 2026

Volume

12 - 2025

Edited by

Arvind Mukundan, National Chung Cheng University, Taiwan

Reviewed by

Chien-Wei Chuang, Fu Jen Catholic University, Taiwan

Tariq Mahmood, International Islamic University, Islamabad, Pakistan

Updates

Copyright

* Correspondence: Biyun Wang Yuke Zhang

†These authors have contributed equally to this work

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics