Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 17 November 2025

Sec. Coronary Artery Disease

Volume 12 - 2025 | https://doi.org/10.3389/fcvm.2025.1611709

This article is part of the Research TopicCoronary Artery Disease in Young Adults: Epidemiology, Clinical Insights and ManagementView all articles

Construction of diagnostic and prognostic models for premature coronary artery disease based on multiple machine algorithms


Yu-Chan He,,&#x;Yu-Chan He1,2,†Ye Li,,&#x;Ye Li1,2,†Xiu-Jin Qin,Xiu-Jin Qin1,2Zhong-Hai Bi,Zhong-Hai Bi1,2Si-Yao Li,Si-Yao Li1,2Yan-Li Liu,
Yan-Li Liu1,2*Liu Miao,

Liu Miao1,2*
  • 1Department of Cardiology, Liuzhou People’s Hospital, Affiliated of Guangxi Medical University, Liuzhou, Guangxi, China
  • 2The Key Laboratory of Coronary Atherosclerotic Disease Prevention and Treatment of Liuzhou, Liuzhou People's Hospital, Liuzhou, Guangxi, China

Objective: To evaluate the diagnostic and prognostic predictive value of the pan-immune-inflammation value (PIV) and triglyceride-glucose (TyG) index in premature coronary artery disease (PCAD).

Methods: This study analyzed data from 26,883 patients admitted with chest pain at Liuzhou People's Hospital (January 2014 to December 2020), with 5,653 patients included after screening. Multiple machine learning algorithms, including Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Lasso regression, Random Forest (RF), and logistic regression, were applied to identify PCAD-related variables, which were integrated into a decision tree model. Propensity score matching (PSM) ensured cohort comparability. The Mime1 package facilitated ensemble feature selection and visualization, while optimal PIV and TyG cutoff values were determined via Receiver Operating Characteristic (ROC) analysis for 36-month survival subgroup analysis.

Results: Logistic regression identified PIV [odds ratio [OR] 2.651, 95% CI [to be specified], P < 0.001] and TyG [OR 1.003, 95% CI (to be specified), P < 0.001] as PCAD risk factors. The decision tree model, incorporating PIV, TyG, and white blood cell count (WBC), achieved an accuracy of 0.88 and an area under the ROC curve (AUC) of 0.86 for PCAD diagnosis. Survival analysis over 36 months revealed that low PIV and TyG levels were associated with reduced all-cause mortality, whereas elevated levels correlated with poorer prognosis (P < 0.001), with TyG showing a pronounced effect.

Conclusion: The combined evaluation of PIV, TyG, and WBC offers robust diagnostic and prognostic value for PCAD, with elevated PIV and TyG levels indicating a poor prognosis, underscoring their potential as clinical biomarkers.

1 Introduction

Coronary artery disease (CAD) remains the leading cause of morbidity and mortality worldwide, despite advances in prevention and treatment strategie (1, 2). Premature coronary artery disease (PCAD), defined as CAD occurring in men younger than 55 years and women younger than 65 years, represents a distinct clinical entity. Compared with CAD in older individuals, PCAD often presents more acutely, carries a higher lifetime risk of recurrent events, and imposes a substantial socioeconomic burden (3, 4).

Traditional risk factors such as smoking, hypertension, dyslipidemia, and diabetes contribute to PCAD development, but emerging evidence highlights the role of inflammation and metabolic dysregulation (57). Atherosclerosis is increasingly recognized as a chronic inflammatory condition driven by endothelial injury, lipid accumulation, and immune responses (8, 9). Consequently, inflammatory and metabolic biomarkers have attracted interest for risk stratification in younger CAD populations.

The pan-immune-inflammation value (PIV), derived from neutrophil, platelet, monocyte, and lymphocyte counts, has been validated as a prognostic marker in cancer and cardiovascular disease (1012). Similarly, the triglyceride-glucose (TyG) index, an established surrogate of insulin resistance, has been shown to predict cardiovascular events across diverse populations (13, 14). Both markers are readily obtainable from routine laboratory tests, making them practical for clinical application.

However, limited studies have examined the combined prognostic value of PIV and TyG in patients with PCAD. Given the unique metabolic and inflammatory profile of younger patients, evaluating these indices may provide insights into risk stratification and prognosis.

2 Method

2.1 Research design

This study was designed to investigate risk factors linked to premature coronary artery disease (PCAD) and their prognostic importance through the application of multiple machine learning approaches. The analysis utilized a comprehensive dataset of clinical records to provide a foundation for early screening and primary prevention strategies. A schematic overview of the study workflow is shown in Figure 1, depicting the sequential process of data acquisition, variable selection, model development, and validation.

Figure 1
Workflow diagram illustrating data collection and processing steps for chest pain patients from 2014 to 2020. It includes exclusion criteria, enrollment of 4,983 PCAD and 670 non-PCAD patients, feature selection using machine learning methods, and validation with a decision tree showing an AUC of 0.86. Subgroup survival analysis uses K-M curves. The process ends with visualization and interpretation of results.

Figure 1. Model development and validation flowchart. PCAD, premature coronary artery disease; PIV, pan-immune inflammatory value; TyG, triglyceride-glucose index; WBC, white blood cell count; PSM, propensity score matching; GBM, gradient boosting machine; XGboost, extreme gradient boosting; SVM, support vector machine; RF, random forest; RSF, random survival forest; ROC, receiver operating characteristic.

2.2 Study population

This study evaluated clinical data from 26,883 patients consecutively admitted to Liuzhou People's Hospital from January 2014 to December 2020. Patients were eligible for inclusion if they met the following criteria: (1) presented with chest pain and underwent coronary angiography; (2) were males younger than 55 years or females younger than 65 years; and (3) were diagnosed with coronary artery disease (CAD) based on ≥50% stenosis in at least one major coronary artery (left main, left anterior descending, circumflex, or right coronary artery). Patients were classified into premature CAD (PCAD) or non-PCAD groups according to coronary angiographic results. Patients were excluded if they met any of the following criteria: (1) had conditions anticipated to limit life expectancy to less than one year (e.g., severe hepatic or renal insufficiency, malignant neoplasms); (2) lacked essential clinical data; (3) exhibited abnormal bone marrow hematopoietic function or severe infections (e.g., sepsis, septic shock, rheumatoid arthritis); or (4) were pregnant or lactating. Ultimately, 4,983 PCAD patients and 670 non-PCAD patients were enrolled. Patients were followed up regularly at 3, 6, 9, and 12 months after enrollment through telephone follow-up, readmission follow-up, and outpatient follow-up. The total follow-up period was 36 months. The observational endpoint was defined as all-cause mortality, including cardiovascular death (death due to myocardial infarction, heart failure, arrhythmias, or other cardiovascular causes), with follow-up concluding at the time of death. All-cause mortality served as the measure of prognosis.

2.3 Data collection

A comprehensive dataset was constructed from patient medical records, encompassing baseline characteristics including sex, age, body mass index (BMI), blood pressure, history of smoking and alcohol consumption, and fasting blood glucose concentrations. Furthermore, the initial blood sample collected upon admission was assessed for a range of biomarkers and indices, including white blood cell count (WBC), red blood cell count (RBC), hemoglobin (Hb), platelet (PLT), hematocrit (Hct), eosinophil count, lymphocyte, monocyte, basophil, neutrophil, total cholesterol (TC), triglycerides (TG), low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), creatinine (Scr), urea nitrogen (BUN), homocysteine (Hcy), aspartate aminotransferase (AST), alanine aminotransferase (ALT), waistline(WC), glycated hemoglobin (HbA1), triglyceride-to-HDL ratio (THR),triglyceride-glucose index (TyG), PIV, Charlson Comorbidity Index (CCI), Body Roundness Index (BRI), Visceral Adiposity Index (VAI), and Castelli Risk Indices 1 and 2 (CRI-1, CRI-2). The TyG index was calculated as ln[fasting triglycerides (mg/dl) × fasting glucose (mg/dl)/2], and the PIV as (neutrophil count × platelet count × monocyte count)/lymphocyte count.

2.4 Statistical analysis

All statistical analyses were performed using R software. Continuous variables are presented as means ± standard deviations, with between-group comparisons conducted using analysis of variance (ANOVA). Categorical variables are reported as counts and percentages, with group differences evaluated using the chi-square test or Fisher's exact test, depending on applicability. A P-value < 0.05 was deemed statistically significant. Propensity score matching (PSM) was employed to adjust for potential confounders and ensure comparability between the premature coronary artery disease (PCAD) and non-PCAD cohorts. Variables predictive of PCAD diagnosis were determined using a range of machine learning methods, including gradient boosting machine (GBM), extreme gradient boosting (XGBoost), support vector machine (SVM), Lasso regression, random forest, and logistic regression. A decision tree model was subsequently developed to formulate a diagnostic algorithm for PCAD. For patients fulfilling the PCAD diagnostic criteria, receiver operating characteristic (ROC) curves were used to establish optimal cutoff values for the PIV and the TyG index. These cutoff values were used to stratify the population into four subgroups: low PIV, high PIV, low TyG, and high TyG. Kaplan–Meier survival curves were constructed to assess prognostic differences among these subgroups.

3 Result

3.1 Patient characteristics

A total of 26,683 patients admitted with chest pain were enrolled. Of these, 21,030 patients were excluded for not meeting the inclusion criteria. Consequently, 5,653 patients were included in the final analysis.

3.2 Initial search for characteristic variables

The study comprised 5,653 participants, including 4,983 patients with PCAD and 670 with non-PCAD. Following PSM, the cohorts were balanced, yielding 654 participants per group. Before PSM, significant differences emerged between the PCAD and non-PCAD groups for numerous variables (all P < 0.001), including sex, white WBC, RBC, Hb, Hct, eosinophil count, lymphocyte count, PLT, CRP, HDL-C, CRI-1, CRI-2, PIV, CCI, TG, LDL-C, BRI and THR. After PSM, differences remained significant for a subset of variables (all P < 0.001), WBC, RBC, Hb, neutrophil count, PLT, CRP, TyG, PIV, and CCI. Notably, the TyG index, PIV, and WBC consistently differed between the PCAD and non-PCAD groups both before and after PSM. These results indicate that these variables may contribute to PCAD (Table 1).

Table 1
www.frontiersin.org

Table 1. General characteristics of patients before and after PSM.

3.3 Filtering feature variables through machine learning

Machine learning analyses were performed on the 37 previously described feature variables using R.

Gradient Boosting Machine (GBM): GBM, an ensemble learning algorithm, sequentially integrates multiple weak learners to reduce prediction error. Using GBM, variable importance was assessed, identifying the top five variables as the CCI, WBC, sex, PIV, and TyG (Figure 2A).

Figure 2
(A) Bar chart showing variable ranks by relative importance, with various markers. (B) LASSO regression plot indicating coefficients against lambda values, showing where Log(lambda.min) is optimal. (C) Two plots with points denoting mean decrease in accuracy and Gini for different variables. (D) Line graph illustrating RMSE in cross-validation decreasing as the number of variables increases. (E) Bar chart reflecting feature importance with PIV, WBC, and TyG ranked in descending importance.

Figure 2. Multiple machine learning results. (A) Gradient Boosting Machine (GBM): The top five feature variables based on relative importance ranking are CCI, WBC, Gender, PIV, and TyG. (B) Lasso regression: at the optimal lambda value (lambda = −7.48), 32 variables were retained, including TyG, PIV, and WBC. (C) Random forest: based on Gini importance evaluation, the five most important features are CCI, WBC, PIV, TyG, and PLT. (D) Support vector machines (SVM): The model achieved the lowest Root Mean Square Error (RMSE) of 0.3027, indicating optimal performance, with the top five contributing variables being CCI, WBC, RBC, PIV, and TyG. (E) Extreme Gradient boosting (XGBoost): Based on the feature importance scores, the variables are ranked as follows: PIV (0.80), WBC (0.18), and TyG (0.02).

Lasso Regression: Lasso regression was applied for feature selection and model regularization. With an optimal lambda value of −7.48, key variables were identified and ranked in descending order of importance: age, alanine aminotransferase (ALT), aspartate aminotransferase (AST), BMI, BRI, blood urea nitrogen (BUN), CCI, CRI-1, CRI-2, CRP, diastolic blood pressure (DBP), alcohol consumption, sex, glucose (Glu), Hb, glycated hemoglobin (HbA1c), Hct, homocysteine (Hcy), HDL-C, LDL-C, PIV, PLT, RBC, systolic blood pressure (SBP), serum creatinine (Scr), smoking status, TC, TG, THR, TyG, WBC, and waist circumference (WC) (Figure 2B).

Random Forest (RF): The RF algorithm improves prediction accuracy, generalization, and resistance to overfitting through the aggregation of multiple decision trees (via averaging for regression or majority voting for classification). Risk factors for PCAD were evaluated for importance, with higher values denoting greater significance. The top five variables identified were the CCI, WBC, PIV, TyG, and PLT (Figure 2C).

Support Vector Machine (SVM): The SVM classification algorithm was applied to classify the data. Iterative model training and sequential removal of less significant variables enabled the identification of the most predictive features. The model was optimized by minimizing root mean squared error (RMSE), resulting in the selection of 21 variables, with the top five being the CCI, WBC, RBC, PIV, and TyG (Figure 2D).

Logistic Regression: Logistic regression was employed to model the binary classification of PCAD diagnosis (PCAD = 1, non-PCAD = 0). A stepwise method was used to select variables, yielding a final model with nine key predictors: RBC, WBC, PLT, PIV, TyG, sex, Hb, Hct and CRP (Table 2).

Table 2
www.frontiersin.org

Table 2. Logistic regression.

Extreme Gradient Boosting (XGBoost): The XGBoost algorithm was applied to evaluate the relative importance of feature variables, identifying the top three as the PIV, TyG, and WBC (Figure 2E).

These machine learning results are presented in Figure 2. Notably, the consistent prominence of the feature variables—specifically the TyG, PIV and WBC—corresponds with their observed differences between the PCAD and non-PCAD groups both before and after PSM.

3.4 Diagnostic prediction model

A decision tree model was constructed to predict PCAD based on key feature variables namely, the PIV, WBC and TyG identified through multiple machine learning algorithms. The decision tree applies conditional rules to classify PCAD, outlined as follows: (1) If TyG ≥ 4.85 and WBC > 4.165 × 10⁹ /L: (a) TyG > 5.13 indicates PCAD; (b) 4.85 ≤ TyG ≤ 5.03 with WBC > 4.165 × 10⁹/L indicates PCAD. (2) If 3.56 < TyG < 4.85: (a) PIV > 368.56 suggests PCAD. (3) If TyG < 4.85 and PIV < 368.56: (a) WBC > 5.62 × 10⁹/L suggests PCAD. The model's diagnostic performance was evaluated, demonstrating an accuracy of 0.88 and an area under the receiver operating characteristic curve (AUC) of 0.86. A schematic of the decision tree is presented in Figure 3.

Figure 3
Decision tree diagram with nodes labeled WBC, PIV, TyG, leading to outcomes labeled Control and PCAD. The tree includes numeric thresholds. Below the tree, a heatmap with rows for PCAD, PIV, WBC, and TyG displays varying colors. Performance metrics include accuracy 0.887, balanced accuracy 0.665, Kappa 0.379, ROC AUC 0.862, and PR AUC 0.770.

Figure 3. Decision tree. The accuracy of the model is 0.88 and the confidence level is 0.86. WBC, white blood cells; TyG, triglyceride-glucose index; PIV, pan-immunoinflammatory value.

3.5 Feature variable selection via machine learning

Machine learning, bolstered by computational advancements, is widely utilized in medical research for analyzing complex datasets (1517). Ensemble machine learning models constitute a notable advance, offering robust tools for predicting patient prognosis based on input variables and cohort data. These models substantially alleviate the burden on researchers by automating feature variable selection for prognostic modeling.

In this study, the Mime1 package in R was used to construct an ensemble model for predicting the prognosis of PCAD. More than 100 predictive models were developed within a k-fold cross-validation framework, with the concordance index (C-index) computed for each model across validation datasets (Figure 4). Among these models, the StepCox [forward]-RSF ensemble model exhibited the highest mean C-index (0.99) across both training and validation cohorts, establishing it as the optimal model.

Figure 4
Heatmap comparing various statistical models based on their C-index values across two datasets. Models are listed on the left, with C-index values represented by color gradients. Legends indicate cohort datasets and mean C-index values in different cohorts, with blue and orange colors distinguishing the datasets.

Figure 4. An ensemble model for the prognosis of PCAD. The StepCox[forward]-RSF ensemble model exhibited the highest mean C-index (0.99) across both training and validation cohorts. RSF, random survival forest.

The 37 feature variables were subsequently evaluated using machine learning methods implemented in R. The Random Survival Forest (RSF) ranked the top five variables as LDL-C, TyG, PIV, CRI-1 and CRI-2, in descending order of importance (Figure 5). Furthermore, the StepCox [forward] method identified 10 key feature variables: PLT, Scr, Hcy, Glu, Smoke, TyG, PIV, CCI, WBC and CRP (Table 3). The consistent identification of TyG and PIV across both methods indicates their strong association with the prognosis of PCAD.

Figure 5
Graph A shows error rates over 500 trees, with error decreasing rapidly initially, then stabilizing. Graph B features two bar charts displaying variable importance. The left chart shows the mean decrease in accuracy, and the right chart shows the mean decrease in the Gini index. Both highlight different variables and their importance in the model.

Figure 5. Random survival forest. (A) Show that the error rate of the model stabilizes when the number of classification trees exceeds 200. (B) RSF: Based on Gini importance evaluation, the five most important features are LDL, TyG, PIV, CRI1, and CRI2. RSF, random survival forest; LDL, low-density lipoprotein cholesterol; TyG, triglyceride-glucose index; PIV, pan-immunoinflammatory value; CRI1and CRI2, castelli risk index 1 and 2.

Table 3
www.frontiersin.org

Table 3. Univariate regression analysis of PCAD.

3.6 ROC analysis to determine optimal cutoff value

ROC curve analysis was conducted to determine optimal cutoff values for the PIV and TyG index, key predictors of prognosis in patients with PCAD. These cut-off values were established to optimize the prognostic model's sensitivity and specificity, delineating thresholds beyond which mortality risk in PCAD patients markedly rises. For PIV, the optimal cutoff value was determined as 229.5. At this threshold, the prognostic model achieved a specificity of 65.9% and a sensitivity of 68.9% (Figure 6A). Likewise, for the TyG index, the optimal cutoff value was established at 4.9, with a specificity of 92.1% and a sensitivity of 52.2% (Figure 6B). These results underscore that PIV and TyG serve as effective biomarkers for stratifying mortality risk in PCAD patients, with the established thresholds balancing diagnostic precision and clinical applicability.

Figure 6
Two graphs compare the PIV and TyGthresholds' confidence intervals. Both graphs plot sensitivityversus specificity, with diagonal reference lines. Graph A shows a threshold at 229.5 with sensitivity 68.9% and specificity 65.9%. 9%. Graph B shows a threshold at 4.9 with sensitivity 52.2% and specificity 92.1%.

Figure 6. ROC curve. (A) The optimal threshold for the PIV index predicted by the PCAD prognostic model was determined to be 229.5, at which the specificity and sensitivity of the PCAD prognostic model were 65.9% and 68.9%. (B) The optimal critical value of the TyG index predicts that the PCAD prognostic model is 4.9, at which the specificity and sensitivity of the PCAD prognostic model are 92.1% and 52.2%. TyG, triglyceride-glucose index; PIV, pan-immunoinflammatory value.

3.7 Kaplan–Meier survival analysis

This analysis assessed the impact of varying TyG and PIV levels on the survival outcomes of patients with PCAD over three years. The study population was divided into groups based on cutoff values of 229.5 for PIV and 4.9 for TyG.

Analysis by PIV Levels: The population was categorized into two groups according to the PIV threshold of 229.5: (1) Low PIV group (PIV < 229.5); (2) High PIV group (PIV ≥ 229.5). Kaplan–Meier survival curves (Figure 7A) demonstrated a markedly lower survival rate in the high PIV group compared to the low PIV group over three years. This indicates that elevated PIV levels correlate with a poorer prognosis in PCAD patients.

Figure 7
Three Kaplan-Meier survival curves are depicted, illustrating survival probability over time based on different variables. Chart A compares high and low PIV groups, Chart B compares high and low TyG groups, with both showing significant survival differences (log-rank p<0.0001). Chart C presents data stratified by gender for four strata: TyG high/PIV high, TyG high/PIV low, TyG low/PIV high, and TyG low/PIV low, also showing significant survival differences. The number of subjects at risk is indicated below each chart.

Figure 7. Kaplan–Meier survival analysis. (A) The follow-up of patients with different PIV index groups for 36 months. (B) The follow-up of patients with different TyG index group for 36 months. (C) Men and women were followed up for 36 months in different PIV + TyG groups. TyG, triglyceride-glucose index; PIV, pan-immunoinflammatory value.

Analysis by TyG Levels: Likewise, the population was divided into two groups based on the TyG threshold of 4.9: (1) Low TyG group (TyG < 4.9); (2) High TyG group (TyG ≥ 4.9). Figure 7B shows that the high TyG group exhibited a lower survival rate than the low TyG group, suggesting that elevated TyG levels are linked to heightened mortality risk in PCAD.

Combined effects of PIV and TyG: The combined influence of PIV and TyG levels on survival rates was examined, with outcomes stratified by gender (Figure 7C). Four groups were evaluated: (1) High PIV + High TyG: This group displayed the lowest survival rates across genders, highlighting the heightened risk associated with elevated levels of both indices. (2) Low PIV + Low TyG: Conversely, this group exhibited the highest survival rates, indicating a protective effect when both indices fall below their thresholds. (3) High PIV + Low TyG and (4) Low PIV + High TyG: The high PIV + low TyG group demonstrated a higher survival rate than the low PIV + high TyG group. This suggests that the TyG index is a stronger prognostic indicator in PCAD, with lower TyG levels mitigating the adverse effects of elevated PIV.

Gender-Specific Survival Trends: In the low PIV + low TyG group, modest gender differences emerged: (1) Females exhibited higher survival rates than males in the early three-year period; (2) Subsequently, female survival rates decreased, falling below male rates in later stages. The reasons for this gender-specific trend are addressed in the discussion section.

Kaplan–Meier survival analysis revealed that elevated PIV and TyG levels correlate with reduced survival in PCAD patients. The high PIV + high TyG combination yielded the poorest outcomes, whereas the low PIV + low TyG combination was associated with the most favorable prognosis. Notably, the TyG index exerts a greater influence on survival, as demonstrated by the superior outcomes in the high PIV + low TyG group compared to the low PIV + high TyG group. These results, alongside gender-specific patterns, provide critical insights for risk stratification and prognostic evaluation in PCAD.

4 Discussion

The prevalence of PCAD has risen in recent years, with a worse prognosis than CAD in older adults. In contrast to CAD in older populations, PCAD is characterized by fewer conventional risk factors yet manifests more acutely, often as an initial acute coronary syndrome in most patients. This acute onset elevates mortality rates among affected individuals. Moreover, the prolonged life expectancy of younger patients increases the likelihood of long-term complications, including recurrent cardiovascular events or heart failure. Thus, identifying reliable prognostic indicators, implementing effective prevention strategies, and developing targeted early interventions are critical to improving survival rates and quality of life for PCAD patients.

This study employs diverse machine learning algorithms to examine risk factors linked to PCAD and their prognostic impact, providing a foundation for early detection and primary prevention. A predictive model was constructed using various machine learning algorithms to forecast both the diagnosis and prognosis of PCAD. The diagnostic model attained an accuracy of 0.88, indicating robust performance in identifying PCAD. Furthermore, the PIV and TyG index emerged as key prognostic indicators for PCAD. Receiver operating characteristic (ROC) curve analysis established optimal cutoff values for these indices (PIV > 229.5, TyG > 4.9) and evaluated their sensitivity and specificity for prognostic stratification in PCAD. Survival analysis confirmed that elevated PIV and TyG levels correlate with a poorer prognosis, with the TyG index exhibiting a notably strong association.

Extensive research has established atherosclerosis as a low-grade, non-infectious inflammatory condition (1821). This condition is characterized by inflammation-induced endothelial dysfunction, initiating key processes: (1) Lipoprotein accumulation beneath the impaired endothelium; (2) Leukocyte recruitment to the inflammatory site; (3) Increased vascular permeability, worsening the condition. Recruited monocytes differentiate into macrophages, adopting pro-inflammatory or anti-inflammatory phenotypes based on the local microenvironment. The balance between these phenotypes critically governs atherosclerosis progression or resolution (22).

Systemic and localized inflammation, affecting the entire body and specific vessels respectively, are pivotal in the development and progression of cardiovascular disease. Thus, early and precise detection of inflammation, combined with timely intervention, is crucial for enhancing outcomes in patients with PCAD.

Although single biomarkers provide limited predictive value for cardiovascular risk assessment, recent focus has shifted to multi-biomarker panels. These biomarker combinations demonstrate substantial potential for improving the precision of cardiovascular disease outcome predictions.

The PIV is a systemic inflammation biomarker derived from specific blood cell subgroup counts. It combines neutrophil, monocyte, lymphocyte, and platelet counts—cell types integral to inflammation—to yield a composite measure of immune-inflammatory status (2325). PIV is increasingly valued for its prognostic utility across conditions such as cancers, cardiovascular diseases, and metabolic syndrome (15, 16, 26). Given the established inflammation-CAD link, PIV's ability to predict CAD severity and complications has attracted considerable interest. Ayşe Irem Demirtola et al. (17) reported that elevated PIV levels correlate with increased atherosclerotic lesion severity in CAD patients. Likewise, Bektas Murat et al. (20) noted that higher PIV levels correlate with elevated long-term mortality in ST-elevation myocardial infarction (STEMI) patients, although no significant association emerged with in-hospital mortality. These results support the notion that systemic inflammation drives CAD progression. Moreover, PIV outperforms other inflammatory indices—such as the systemic immune-inflammation index (SII), platelet-to-lymphocyte ratio (PLR), and neutrophil-to-lymphocyte ratio (NLR)—in predicting post-percutaneous coronary intervention (PCI) prognosis and coronary artery stenosis extent in STEMI patients (21).

Despite these insights, PIV's role in PCAD, a CAD subset in younger individuals, remains largely unexamined. Current studies have not fully explored the PIV-PCAD relationship, underscoring a significant literature gap. Given PCAD's distinct profile—fewer conventional risk factors yet a more acute course—examining PIV's predictive utility in this population could provide critical prognostic and therapeutic insights.

The TyG index, derived from plasma triglycerides and fasting glucose levels, offers a simple, non-invasive method to assess insulin resistance (IR). It is widely applied to evaluate IR, predict metabolic dysregulation, and assess cardiovascular disease (CVD) risk. Hyperglycemia and hypertriglyceridemia are established drivers of CVD, contributing to endothelial dysfunction, inflammation, and atherosclerosis—central mechanisms in cardiovascular complications. In healthy individuals, insulin regulates glucose and lipid metabolism by facilitating glucose uptake, promoting glycolysis, and maintaining lipid homeostasis. However, in IR, these processes falter, leading to impaired glucose uptake, reduced glycolysis, and dyslipidemia, which are closely tied to adverse cardiovascular outcomes (27). The TyG index provides a practical alternative to traditional IR measures like the hyper insulinemic-euglycemic clamp or HOMA-IR (28). Studies consistently link a higher TyG index to increased risks of myocardial infarction, stroke, and heart failure, even after adjusting for conventional risk factors such as age, smoking, and hypertension. Its integration of triglycerides and glucose enhances its robustness in predicting CVD risk, particularly in metabolic syndrome, where it often outperforms other IR markers.

Despite its established role in CVD (2932), the TyG index's association with PCAD remains underexplored. Given PCAD's unique metabolic and inflammatory profile, investigating the TyG index in this context could refine risk stratification and guide early interventions.

The advent of machine learning (ML) has transformed vast datasets into actionable models, significantly enhancing diagnostic precision. While ML models for cardiovascular disease likelihood and prognosis are emerging (33, 34), many are limited by small sample sizes or a focus on older populations, with few large-scale studies addressing PCAD. Our study bridges this gap by developing a diagnosis and prognosis model tailored to PCAD patients. We employed multiple ML methods with cross-validation to select key variables, diverging from traditional training-validation splits. Our findings confirm a significant association between PIV, the TyG index, and PCAD occurrence, with elevated levels linked to poorer prognosis, reinforcing inflammation's foundational role in atherosclerosis. Notably, in the low PIV + low TyG cohort, women exhibited lower survival rates than men over time, possibly due to post-menopausal estrogen decline reducing atherosclerosis resistance.

5 Conclusions

The combined evaluation of PIV, TyG, and WBC offers robust diagnostic and prognostic value for PCAD, with elevated PIV and TyG levels indicating a poor prognosis, underscoring their potential as clinical biomarkers.

6 Limitations

Despite these advances, limitations persist. A limitation of our study is that our definition of PCAD was based on angiographic stenosis ≥50%, which may not capture patients with early subclinical atherosclerosis. Future studies should aim to establish a consensus definition to improve comparability. This study was conducted based on data from a single center, which may limit the generalizability of our findings and raises the possibility of overfitting in the predictive model. Although the internal performance was robust, external validation using multicenter or publicly available datasets is warranted to confirm the reliability and broader applicability of the model in different populations and clinical settings. To enhance clinical applicability, it is worth noting that the components of PIV and TyG are typically part of routine laboratory tests, available within a short time frame in most emergency or outpatient settings. Therefore, their integration into clinical decision-making pathways is feasible in many healthcare systems. Nonetheless, in resource-limited environments or when immediate lab access is not possible, simplified models based on clinical features or point-of-care tests may be explored, though potentially at the expense of diagnostic accuracy. Further studies are needed to validate such surrogate models. Minor CommentsSingle-time-point measurements of glucose and lipid levels overlook temporal variations, and the retrospective design may introduce confounding and bias. Futhermore, the follow-up information in this retrospective study was restricted to all-cause mortality, and detailed coronary events such as PCI, re-PCI, or myocardial infarction were not uniformly available. In addition, data on peripheral atherosclerosis and related vascular events were lacking. These factors may have limited our ability to comprehensively evaluate cardiovascular outcomes. Future prospective multicenter studies with standardized collection of interventional and peripheral vascular events are needed to validate and extend our findings. Future research should leverage larger, multicenter cohorts, extended follow-ups, and randomized controlled trials to enhance predictive accuracy and validate these findings.

Data availability statement

The data supporting the findings of this study are available from the corresponding author upon reasonable request.

Ethics statement

The studies involving humans were approved by Ethics Review Committee of Liuzhou People's Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

Y-CH: Writing – original draft, Methodology. YL: Data curation, Writing – original draft. X-JQ: Data curation, Writing – original draft. Z-HB: Investigation, Writing – original draft. S-YL: Investigation, Writing – original draft. Y-LL: Supervision, Project administration, Writing – review & editing. LM: Writing – review & editing, Conceptualization.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. The authors acknowledge the essential role of the project of Liuzhou Science and Technology (2024RA0102A002 and 2022CAC0202), the funding of National Natural Science Foundation of China (NSFC: 82060072), the project of Liuzhou people’s Hospital (LYRGCC202203), Guangxi self-financing research projects (Z20210459), Guangxi Medical and health key discipline construction project and The Key Laboratory of Coronary Atherosclerotic Disease Prevention and Treatment of Liuzhou.

Acknowledgments

The authors acknowledge the essential role of the project of Liuzhou Science and Technology (2024RA0102A002 and 2022CAC0202), the funding of National Natural Science Foundation of China (NSFC: 82060072), the project of Liuzhou people's Hospital (LYRGCC202203), Guangxi self-financing research projects (Z20210459), Guangxi Medical and health key discipline construction project and The Key Laboratory of Coronary Atherosclerotic Disease Prevention and Treatment of Liuzhou.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Bauersachs R, Zeymer U, Brière JB, Marre C, Bowrin K, Huelsebeck M. Burden of coronary artery disease and peripheral artery disease: a literature review. Cardiovasc Ther. (2019) 2019:8295054. doi: 10.1155/2019/8295054

PubMed Abstract | Crossref Full Text | Google Scholar

2. Timmis A, Vardas P, Townsend N, Torbica A, Katus H, De Smedt D, et al. European Society of cardiology: cardiovascular disease statistics 2021. Eur Heart J. (2022) 43(8):716–99. doi: 10.1093/eurheartj/ehab892

PubMed Abstract | Crossref Full Text | Google Scholar

3. Kuruvila M, Maddineni K, Koppula SR, Patel B, Ratnani T, Reddy AS, et al. Exploring the factors influencing coronary heart disease prevalence in the US population: a retrospective observational study. Cureus. (2024) 16(6):e62741. doi: 10.7759/cureus.62741

PubMed Abstract | Crossref Full Text | Google Scholar

4. Le A, Peng H, Golinsky D, Di Scipio M, Lali R, Paré G. What causes premature coronary artery disease? Curr Atheroscler Rep. (2024) 26(6):189–203. doi: 10.1007/s11883-024-01200-y

PubMed Abstract | Crossref Full Text | Google Scholar

5. Gupta R, Wood DA. Primary prevention of ischaemic heart disease: populations, individuals, and health professionals. Lancet. (2019) 394(10199):685–96. doi: 10.1016/S0140-6736(19)31893-8

PubMed Abstract | Crossref Full Text | Google Scholar

6. Stone NJ, Smith SC Jr, Orringer CE, Rigotti NA, Navar AM, Khan SS, et al. Managing atherosclerotic cardiovascular risk in young adults: JACC state-of-the-art review. J Am Coll Cardiol. (2022) 79(8):819–36. doi: 10.1016/j.jacc.2021.12.016

PubMed Abstract | Crossref Full Text | Google Scholar

7. Dong W, Gong Y, Zhao J, Wang Y, Li B, Yang Y. A combined analysis of TyG index, SII index, and SIRI index: positive association with CHD risk and coronary atherosclerosis severity in patients with NAFLD. Front Endocrinol (Lausanne). (2024) 14:1281839. doi: 10.3389/fendo.2023.1281839

PubMed Abstract | Crossref Full Text | Google Scholar

8. Tian X, Zhang N, Tse G, Li G, Sun Y, Liu T. Association between lipoprotein(a) and premature atherosclerotic cardiovascular disease: a systematic review and meta-analysis. Eur Heart J Open. (2024) 4(3):oeae031. doi: 10.1093/ehjopen/oeae031

PubMed Abstract | Crossref Full Text | Google Scholar

9. Fioranelli M, Bottaccioli AG, Bottaccioli F, Bianchi M, Rovesti M, Roccia MG. Stress and inflammation in coronary artery disease: a review psychoneuroendocrineimmunology-based. Front Immunol. (2018) 9:2031. doi: 10.3389/fimmu.2018.02031

PubMed Abstract | Crossref Full Text | Google Scholar

10. Geovanini GR, Libby P. Atherosclerosis and inflammation: overview and updates. Clin Sci (2018) 132:1243–52. doi: 10.1042/CS20180306

PubMed Abstract | Crossref Full Text | Google Scholar

11. Zhang TY, Chen HL, Shi Y, Jin Y, Zhang Y, Chen Y. The relationship between system inflammation response index and coronary heart disease: a cross-sectional study (NHANES 2007–2016). Front Cardiovasc Med. (2024) 11:1439913. doi: 10.3389/fcvm.2024.1439913

PubMed Abstract | Crossref Full Text | Google Scholar

12. Golia E, Limongelli G, Natale F, Fimiani F, Maddaloni V, Pariggiano I, et al. Inflammation and cardiovascular disease: from pathogenesis to therapeutic target. Curr Atheroscler Rep. (2014) 16(9):435. doi: 10.1007/s11883-014-0435-z

PubMed Abstract | Crossref Full Text | Google Scholar

13. Wang C, Yan W, Ren M, Zhong L. Screening significance of systemic immune-inflammation index (SII) and systemic inflammation response index (SIRI) in coronary heart disease of symptomatic youth. Immun Inflammation Dis. (2024) 12:e1369. doi: 10.1002/iid3.1369

PubMed Abstract | Crossref Full Text | Google Scholar

14. Dziedzic EA, Gąsior JS, Tuzimek A, Paleczny J, Junka A, Dąbrowski M, et al. Investigation of the associations of novel inflammatory biomarkers-systemic inflammatory Index (SII) and systemic inflammatory response Index (SIRI)-with the severity of coronary artery disease and acute coronary syndrome occurrence. Int J Mol Sci. (2022) 23(17):9553. doi: 10.3390/ijms23179553

PubMed Abstract | Crossref Full Text | Google Scholar

15. Ramasamy J, Murugiah V, Dhanapalan A, Balasubramaniam G. Diagnostic utility of pan-immune-inflammation value (PIV) in predicting insulin resistance: results from the national health and nutrition examination survey (NHANES) 2017–2020. EJIFCC. (2024) 35(2):100–10.39247661

PubMed Abstract | Google Scholar

16. Hai-Jing Y, Shan R, Jie-Qiong X. Prognostic significance of the pretreatment pan-immune-inflammation value in cancer patients: an updated meta-analysis of 30 studies. Front Nutr. (2023) 10:1259929. doi: 10.3389/fnut.2023.1259929

PubMed Abstract | Crossref Full Text | Google Scholar

17. Demirtola Aİ, Erdöl MA, Mammadli A, Göktuğ Ertem A, Yayla Ç, Akçay AB. Predicting coronary artery severity in patients undergoing coronary computed tomographic angiography: insights from pan-immune inflammation value and atherogenic index of plasma. Nutrition, Metabolism, and Cardiovascular Diseases: NMCD. (2024) 34(10):2289–97. doi: 10.1016/j.numecd.2024.05.015

PubMed Abstract | Crossref Full Text | Google Scholar

18. Libby P. The changing landscape of atherosclerosis. Nature. (2021) 592:524–33. doi: 10.1038/s41586-021-03392-8

PubMed Abstract | Crossref Full Text | Google Scholar

19. Libby P, Hansson GK. From focal lipid storage to systemic inflammation. J Am Coll Cardiol. (2019) 74:1594–607. doi: 10.1016/j.jacc.2019.07.061

PubMed Abstract | Crossref Full Text | Google Scholar

20. Murat B, Murat S, Ozgeyik M, Bilgin M. Comparison of pan-immune-inflammation value with other inflammation markers of long-term survival after ST -segment elevation myocardial infarction. Eur J Clin Invest. (2023) 53:e13872. doi: 10.1111/eci.13872

PubMed Abstract | Crossref Full Text | Google Scholar

21. Yang L, Guo J, Chen M, Wang Y, Li J, Zhang J. Pan-Immune-Inflammatory value is superior to other inflammatory indicators in predicting inpatient Major adverse cardiovascular events and severe coronary artery stenosis after percutaneous coronary intervention in STEMI patients. Rev Cardiovasc Med. (2024) 25(8):294. doi: 10.31083/j.rcm2508294

PubMed Abstract | Crossref Full Text | Google Scholar

22. Henein MY, Vancheri S, Longo G, Vancheri F. The role of inflammation in cardiovascular disease. Int J Mol Sci. (2022) 23:12906. doi: 10.3390/ijms232112906

PubMed Abstract | Crossref Full Text | Google Scholar

23. Yang XC, Liu H, Liu DC, Tong C, Liang XW, Chen RH. Prognostic value of pan-immune-inflammation value in colorectal cancer patients: a systematic review and meta-analysis. Front Oncol. (2022) 12:1036890. doi: 10.3389/fonc.2022.1036890

PubMed Abstract | Crossref Full Text | Google Scholar

24. Lin F, Zhang LP, Xie SY, Huang HY, Chen XY, Jiang TC, et al. Pan-Immune-Inflammation value: a new prognostic Index in operative breast cancer. Front Oncol. (2022) 12:830138. doi: 10.3389/fonc.2022.830138

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wu B, Zhang C, Lin S, Zhang Y, Ding S, Song W. The relationship between the pan-immune-inflammation value and long-term prognoses in patients with hypertension: national health and nutrition examination study, 1999–2018. Front Cardiovasc Med. (2023) 10:1099427. doi: 10.3389/fcvm.2023.1099427

PubMed Abstract | Crossref Full Text | Google Scholar

26. Su Z, Tang J, He Y, Zeng WH, Yu Q, Cao XL, et al. Pan-immune-inflammation value as a novel prognostic biomarker in nasopharyngeal carcinoma. Oncol Lett. (2024) 27(6):252. doi: 10.3892/ol.2024.14385

PubMed Abstract | Crossref Full Text | Google Scholar

27. Ormazabal V, Nair S, Elfeky O, Aguayo C, Salomon C, Zuñiga FA. Association between insulin resistance and the development of cardiovascular disease. Cardiovasc Diabetol. (2018) 17(1):122. doi: 10.1186/s12933-018-0762-4

PubMed Abstract | Crossref Full Text | Google Scholar

28. Vasques AC, Novaes FS, de Oliveira Md, Souza JR, Yamanaka A, Pareja JC, et al. Tyg index performs better than HOMA in a Brazilian population: a hyperglycemic clamp validated study. Diabetes Res Clin Pract. (2011) 93(3):e98–e100. doi: 10.1016/j.diabres.2011.05.030

PubMed Abstract | Crossref Full Text | Google Scholar

29. Li S, Guo B, Chen H, Shi Z, Li Y, Tian Q, et al. The role of the triglyceride (triacylglycerol) glucose index in the development of cardiovascular events: a retrospective cohort analysis. Sci Rep. (2019) 9(1):7320. doi: 10.1038/s41598-019-43776-5

PubMed Abstract | Crossref Full Text | Google Scholar

30. Barzegar N, Tohidi M, Hasheminia M, Azizi F, Hadaegh F. The impact of triglyceride-glucose index on incident cardiovascular events during 16 years of follow-up: Tehran lipid and glucose study. Cardiovasc Diabetol. (2020) 19:155. doi: 10.1186/s12933-020-01121-5

PubMed Abstract | Crossref Full Text | Google Scholar

31. Wang A, Tian X, Zuo Y, Chen S, Meng X, Wu S, et al. Change in triglyceride-glucose index predicts the risk of cardiovascular disease in the general population: a prospective cohort study. Cardiovasc Diabetol. (2021) 20(1):113. doi: 10.1186/s12933-021-01305-7

PubMed Abstract | Crossref Full Text | Google Scholar

32. Sánchez-Íñigo L, Navarro-González D, Fernández-Montero A, Pastrana-Delgado J, Martínez JA. The TyG index may predict the development of cardiovascular events. Eur J Clin Investig. (2016) 46(2):189–97. doi: 10.1111/eci.12583

PubMed Abstract | Crossref Full Text | Google Scholar

33. Jiang L, Chen S, Wu Y, Zhou D, Duan L. Prediction of coronary heart disease in gout patients using machine learning models. Math Biosci Eng. (2023) 20(3):4574–91. doi: 10.3934/mbe.2023212

PubMed Abstract | Crossref Full Text | Google Scholar

34. van Dalen JA, Koenders SS, Metselaar RJ, Vendel BN, Slotman DJ, Mouden M, et al. Machine learning based model to diagnose obstructive coronary artery disease using calcium scoring, PET imaging, and clinical data. J Nucl Cardiol. (2023) 30(4):1504–13. doi: 10.1007/s12350-022-03166-3

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: premature coronary artery disease, pan-Immune-Inflammation value, triglyceride-glucose index, machine learning, diagnostic model, prognostic model, survival analysis

Citation: He Y-C, Li Y, Qin X-J, Bi Z-H, Li S-Y, Liu Y-L and Miao L (2025) Construction of diagnostic and prognostic models for premature coronary artery disease based on multiple machine algorithms. Front. Cardiovasc. Med. 12:1611709. doi: 10.3389/fcvm.2025.1611709

Received: 14 April 2025; Accepted: 20 October 2025;
Published: 17 November 2025.

Edited by:

Tommaso Gori, Johannes Gutenberg University Mainz, Germany

Reviewed by:

Rafal Adam Januszek, Andrzej Frycz Modrzewski Krakow University, Poland
Panagiotis Siogkas, University of Ioannina, Greece
Kaisaierjiang Kadier, First Affiliated Hospital of Xinjiang Medical University, China

Copyright: © 2025 He, Li, Qin, Bi, Li, Liu and Miao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yan-Li Liu, Z3hseWxAMTI2LmNvbQ==; Liu Miao, ZHIubWlhb2xpdUBxcS5jb20=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.