Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Nutr., 11 December 2025

Sec. Clinical Nutrition

Volume 12 - 2025 | https://doi.org/10.3389/fnut.2025.1681169

Machine learning-based prediction of early-onset peritoneal dialysis-associated peritonitis: the role of the CONUT score

Hua Zhou&#x;Hua Zhou1Chunlei Yao&#x;Chunlei Yao2Kai Song&#x;Kai Song3Shuya Zhao&#x;Shuya Zhao1Ye YuanYe Yuan4Xiangyin ChenXiangyin Chen1Youqi MaYouqi Ma1Huiyue HuHuiyue Hu1Min Yang
Min Yang1*
  • 1Department of Nephrology, The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu, China
  • 2Department of Nephrology, The Affiliated Taizhou Second People’s Hospital of Yangzhou University, Taizhou, Jiangsu, China
  • 3Department of Nephrology, The Second Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
  • 4Department of Oncology, The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu, China

Background: Peritoneal dialysis-associated peritonitis (PDAP) remains a major complication of peritoneal dialysis (PD). The controlling nutritional status (CONUT) score, which reflects the immune-nutritional state, may offer predictive value in identifying patients at risk. This study aimed to evaluate the utility of machine learning models in predicting early-onset PDAP and to assess the prognostic importance of baseline CONUT score, 6-month CONUT score, and their dynamic changes.

Methods: In this multicenter prospective cohort study, 675 patients initiating PD were enrolled. Multivariable logistic regression was performed to identify clinical predictors of early-onset peritonitis, while Kaplan–Meier survival analysis was used to compare peritonitis-free survival among patients with no peritonitis, early-onset peritonitis, and late-onset peritonitis. To enhance predictive performance, machine learning models including XGBoost, LightGBM, and their ensemble were constructed. Feature selection was based on SHapley Additive exPlanations (SHAP) values derived from an initial XGBoost model. The top 10 SHAP-ranked features were used to train all models. Model performance was assessed using area under the receiver operating characteristic curve (AUC), and SHAP summary plots were generated to interpret feature contributions.

Results: Over a median follow-up period of 41.8 months, 82 patients developed early-onset PDAP. Multivariable logistic regression identified baseline total cholesterol, neutrophil-to-lymphocyte ratio, and 6-month CONUT score as independent predictors of early-onset PDAP (vs. no PDAP; p < 0.05). In comparisons between early- and late-onset PDAP, older age, longer PD duration, and lower 6-month CONUT score were independently associated with a decreased likelihood of early-onset PDAP (p < 0.05). Using the top 10 SHAP-ranked features, three models (XGBoost, LightGBM, and an ensemble) were trained. For distinguishing early-onset PDAP from no PDAP, LightGBM performed best (AUC = 0.717), followed by the ensemble (0.698) and XGBoost (0.670). In differentiating early- from late-onset PDAP, LightGBM showed the highest AUC (0.781), outperforming the ensemble (0.744) and XGBoost (0.691). SHAP summary plots consistently identified the 6-month CONUT score as the important feature across both classification tasks.

Conclusion: The 6-month CONUT score is an independent predictor of early-onset PDAP and was among the top contributing features in multiple machine learning models. Integrating SHAP-based feature selection with gradient boosting improved model accuracy and interpretability. Dynamic monitoring of nutritional-immune status may aid in early risk stratification and guide personalized prevention strategies in patients undergoing PD.

Introduction

Peritoneal dialysis (PD) is a common modality of renal replacement therapy, offering patients with end-stage renal disease (ESRD) a home-based, flexible, and cost-effective alternative to hemodialysis (HD). Over the past decades, PD has seen substantial growth globally, particularly in low- and lower-middle income countries, due to its relative affordability and independence from centralized infrastructure (1). Advances in catheter design, dialysate composition, and automated PD systems have further improved its accessibility and clinical outcomes (2). Despite these developments, PD remains constrained by several clinical limitations, the most critical of which is the risk of peritoneal dialysis-associated peritonitis (PDAP). PDAP is one of the most frequent and serious complications of PD, leading to technique failure, hospitalization, catheter loss, and increased morbidity and mortality (3). Early or recurrent episodes can result in peritoneal membrane dysfunction, rendering patients unsuitable for continued PD and necessitating a switch to HD (4).

Traditional risk factors associated with PDAP include poor catheter exit-site care, nasal carriage of Staphylococcus aureus, and improper exchange techniques (5). Although these factors guide preventive strategies, they are insufficient for reliably identifying patients at highest risk. Therefore, there is a pressing need for accurate and clinically applicable methods to predict PDAP. Accurate early prediction of PDAP could enable timely interventions, leading to improved clinical outcomes and preservation of peritoneal function. To achieve this, novel biomarkers or risk models capable of stratifying patients based on their susceptibility are required. Compared to traditional statistical methods, machine learning models offer enhanced predictive performance by integrating high-dimensional clinical data, making them valuable tools for individualized PDAP risk stratification. Patients’ nutritional and immune status—and dynamic changes therein—have emerged as important determinants of PDAP risk (68). The controlling nutritional status (CONUT) score, a composite index based on serum albumin, total cholesterol (TC), and lymphocyte count, has been validated as a prognostic marker in various clinical settings (911). Our previous studies have demonstrated the utility of baseline CONUT score in predicting outcomes in patients undergoing PD (12). However, it remains unclear whether CONUT score measured during follow-up, or temporal changes in CONUT score, offers superior predictive value for PDAP compared to baseline assessments alone.

The present study aimed to compare the predictive value of the baseline CONUT score, the 6-month CONUT score, and changes in CONUT score during the first 6 months of PD in forecasting the development of PDAP. In addition, we applied machine learning models integrated with SHapley Additive exPlanations (SHAP) to enhance risk stratification by identifying the most informative nutritional predictors and interpreting their contributions to early PDAP risk.

Materials and methods

Study design and patients

This prospective cohort study included 1,241 patients who initiated PD at the Third and Second Affiliated Hospitals of Soochow University and the Affiliated Taizhou Second People’s Hospital of Yangzhou University between December 1, 2012, and June 30, 2022. Patients were followed until June 30, 2023. Exclusion criteria were as follows: (1) age < 18 years; (2) PD duration < 6 months; (3) presence of active infection; (4) malignant tumors; (5) estimated glomerular filtration rate (eGFR) (CKD-EPI) (13) ≥ 15 mL/min/1.73 m2; and (6) missing data on baseline CONUT score, or 6-month CONUT score. A total of 675 patients were ultimately enrolled. Following catheter insertion, all patients or their caregivers underwent standardized PD training and successfully passed competency assessments. Glucose-based dialysate was used in all cases, and none of the patients used automated PD. The study was conducted in compliance with the principles of the Declaration of Helsinki and approved by the Ethics Committee of the Third Affiliated Hospital of Soochow University (approval number: #52/2016). Written informed consent was obtained from all participants in the study.

Data collection

Baseline demographic and clinical characteristics were collected at the initiation of PD from the electronic medical records of the three participating centers. Collected demographic data included age, gender, duration of PD, body mass index (BMI), and primary kidney disease. Clinical variables included comorbidities such as hypertension, diabetes, cardiovascular disease, smoking status, autoimmunity disease, and glucocorticoids use. Systolic blood pressure (SBP) and diastolic blood pressure (DBP) were also recorded. Laboratory parameters included hemoglobin, platelet count, neutrophil count, lymphocyte count, potassium, sodium, fasting blood glucose (FBG), blood urea nitrogen (BUN), serum creatinine (SCr), uric acid (UA), eGFR, corrected calcium, magnesium, phosphorus, alkaline phosphatase (ALP), bile acid, serum albumin, prealbumin, C-reactive protein (CRP), TC, triglycerides (TG), intact parathyroid hormone (iPTH), and ferritin. Follow-up laboratory indicators included lymphocyte count, serum albumin, and TC. The CONUT score, based on these parameters, was calculated as previously described (11). Baseline CONUT score referred to the value obtained before PD catheter insertion, and the 6-month CONUT score referred to the value assessed within 1 month before or after 6 months of PD initiation. CONUT score change was defined as the 6-month score minus the baseline score. Patients were stratified into two groups based on the median CONUT score: < 4 and ≥ 4. Patients were classified based on the median CONUT score change: those with a change < 0 and those with a change ≥ 0. The systemic immune-inflammation index (SII) was calculated as (platelet count × neutrophil count) / lymphocyte count. The neutrophil-to-lymphocyte ratio (NLR) was calculated as the neutrophil count divided by the lymphocyte count. The causative organism was identified and recorded at the time of PDAP diagnosis. Causative organism categories include Gram-positive, Gram-negative, polymicrobial, fungus, and culture-negative.

Study outcome

The primary outcome was the first episode of PDAP, regardless of the causative organism. Competing events included transfer to HD, combined PD and HD, kidney transplantation, death, or loss to follow-up. The diagnosis of PDAP was established when at least two of the following three criteria were met: (1) clinical symptoms such as fever, abdominal pain, cloudy dialysate effluent, nausea, or vomiting; (2) peritoneal effluent white blood cell count exceeding 100/mm3 with more than 50% neutrophil count; and (3) positive bacterial identification in the peritoneal effluent by culture or Gram staining (14). Early-onset PDAP was defined as the occurrence of PDAP within 12 months of initiating PD, whereas late-onset PDAP was defined as an episode occurring after 12 months.

Statistical analyses

Statistical analyses were performed using SPSS software (version 26.0, IBM Corp.) and R software. The normality of continuous variables was evaluated using the Kolmogorov–Smirnov test. Only serum albumin was normally distributed and is presented as mean ± standard deviation; all other continuous variables are presented as medians with interquartile ranges, while categorical variables are summarized as frequencies and proportions.

Patients were classified into three groups based on the occurrence and timing of peritonitis: no PDAP, early-onset PDAP, and late-onset PDAP. For group comparisons, one-way analysis of variance (ANOVA) was employed for the comparison of normally distributed variables, whereas the Kruskal–Wallis test was applied to variables that did not follow a normal distribution. The chi-square test was used for categorical variables.

To develop and evaluate predictive models for peritonitis, the dataset was randomly partitioned into training (80%) and validation (20%) sets using stratified sampling to preserve the proportion of outcome classes. Categorical variables were encoded using dummy variables to ensure compatibility with machine learning algorithms, and feature alignment was performed to maintain consistency between the training and validation sets. Scale_pos_weight parameter was applied during model training for class imbalance. Feature selection was performed by identifying the top 10 most predictive features based on SHAP values derived from an initial XGBoost model. These selected features were subsequently used to train XGBoost, LightGBM, and their ensemble model.

Receiver operating characteristic (ROC) curves were generated for all models, with corresponding AUC values displayed on the plots. Model performance was further evaluated using a comprehensive suite of metrics including accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1-score, and Youden index.

Missing data were addressed using variable-type–specific imputation strategies. For continuous variables, missing values were imputed using the predictive mean matching (PMM) method via the multiple imputation by chained equations (MICE) algorithm implemented in the mice package in R. For categorical variables, missing values were imputed using the mode, defined as the most frequently occurring value within each variable.

Kaplan–Meier (K-M) analysis was used to compare survival between subgroups stratified by the 6-month CONUT score (< 4 and ≥ 4) and CONUT score change (< 0 and ≥ 0). Survival curves were generated for comparisons between: (1) no PDAP vs. early-onset PDAP, stratified by 6-month CONUT score; (2) no PDAP vs. early-onset PDAP, stratified by CONUT score change; (3) early- vs. late-onset PDAP, stratified by 6-month CONUT score; and (4) early- vs. late-onset PDAP, stratified by CONUT score change. Group differences were assessed using the log-rank test.

Cumulative incidence function (CIF) analysis was used to account for competing events, and Gray’s test was applied for between-group comparisons. All statistical tests were two-sided, and a p value < 0.05 was considered statistically significant.

Results

Characteristics of the participants

A total of 675 patients undergoing PD were included in the analysis. Over a median follow-up period of 41.8 months, 253 patients (37.5%) developed PDAP. Patients were categorized into three groups: no PDAP (n = 422), early-onset PDAP (n = 82), and late-onset PDAP (n = 171). Comparisons of baseline demographic and clinical characteristics across groups are summarized in Table 1. Significant differences among the three groups were observed in age, duration of PD, smoking status, baseline SBP, DBP, sodium, CRP, TC, NLR, 6-month albumin, 6-month TC, 6-month CONUT score, and CONUT score change (p < 0.05).

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of three groups based on the occurrence and timing of PDAP.

Among patients who developed PDAP, Gram-positive bacteria were the most common causative organisms in the first episode (n = 103, 40.7%), followed by Gram-negative bacteria (n = 56, 22.1%). By the end of follow-up, 138 patients had transferred to HD, 28 had been treated with combined PD and HD, 54 had undergone kidney transplantation, 117 had died, and 7 were lost to follow-up.

Univariable and multivariable logistic regression analyses: no PDAP vs. early-onset PDAP

Univariable logistic regression identified multiple factors associated with early-onset PDAP compared to no PDAP, including smoking status, autoimmune disease, baseline SBP, serum albumin, TC, SII, NLR, 6-month lymphocyte, 6-month albumin, 6-month CONUT score, and CONUT score change (p < 0.1, Table 2). Smoking was excluded from the final multivariable model because none of the female patients in the cohort reported smoking status, indicating a sex-specific association.

Table 2
www.frontiersin.org

Table 2. Univariable and multivariable logistic regression of prognostic factors for early-onset PDAP in patients with no PDAP and early-onset PDAP.

In the final multivariable model, baseline TC [odds ratio (OR) 1.354, 95% confidence interval (CI) 1.098–1.670; p = 0.005], NLR (OR 1.073, 95% CI 1.005–1.146; p = 0.035), and the 6-month CONUT score (OR 1.239, 95% CI 1.126–1.364; p < 0.001) remained independent predictors of early-onset PDAP (Table 2). Sensitivity analyses using the imputed dataset yielded consistent findings, confirming the robustness of the final model.

Univariable and multivariable logistic regression analyses: early-onset vs. late-onset PDAP

Univariable logistic regression analysis comparing early- and late-onset PDAP identified age, duration of PD, smoking status, glucocorticoids use, baseline SBP, DBP, sodium, albumin, 6-month lymphocyte, 6-month albumin, 6-month CONUT score, and CONUT score change as factors associated with late-onset PDAP (p < 0.1, Table 3). These variables except smoking status were included in the multivariable model.

Table 3
www.frontiersin.org

Table 3. Univariable and multivariable logistic regression of prognostic factors for late-onset PDAP in patients with early-onset PDAP and late-onset PDAP.

In the multivariable analysis, older age (OR 1.027, 95% CI 1.006–1.048; p = 0.011), longer PD duration (OR 1.027, 95% CI 1.015–1.040; p < 0.001), and lower 6-month CONUT score (OR 0.806, 95% CI 0.721–0.902; p < 0.001) were independently associated with an increased likelihood of late-onset PDAP (Table 3). Sensitivity analyses using imputed data confirmed these associations.

Machine learning-based classification of early-onset vs. no peritonitis

To distinguish between patients who did not develop peritonitis and those who experienced early-onset peritonitis, machine learning models were constructed using the top 10 most predictive features identified via SHAP values. As shown in the SHAP summary plot (Figure 1A), the most influential features contributing to model predictions included 6-month CONUT score (SHAP value = 1.041), CRP (0.698), NLR (0.564), SII (0.559), BUN (0.454), platelet count (0.454), SBP (0.450), magnesium (0.348), ferritin (0.337), and calcium (0.291).

Figure 1
Chart A displays SHAP value impacts on model output for various features, with `6_month_CONUT_score` having the highest impact. Chart B presents ROC curves for the top 10 SHAP features, with AUCs for XGBoost, LightGBM, and their combination being 0.670, 0.717, and 0.698, respectively.

Figure 1. (A) Summary plot of SHAP values highlighting the top 10 most influential features for differentiating early-onset from no PDAP; (B) ROC curves of three models trained using the top SHAP-ranked features to predict early-onset PDAP. SHAP, SHapley Additive exPlanations; PDAP, peritoneal dialysis-associated peritonitis; ROC, receiver operating characteristic.

Model performance was assessed on the validation set using ROC analysis. Among the three models, LightGBM achieved the highest area under the curve (AUC = 0.717, 95% CI 0.581–0.853), followed by the ensemble model combining XGBoost and LightGBM (AUC = 0.698, 95% CI 0.556–0.841), and the XGBoost model (AUC = 0.670, 95% CI 0.526–0.814) (Figure 1B). Table 4 summarizes the performance metrics of the machine learning models used to differentiate early-onset PDAP from patients without PDAP. These results indicate that LightGBM performing slightly better than the others in distinguishing early-onset peritonitis. However, all three models demonstrated relatively low sensitivity (0.200–0.250), which may limit their ability to identify patients at risk of early-onset PDAP. Conversely, specificity was consistently high across models (0.933–0.950), suggesting good accuracy in excluding non-PDAP cases. Among these models, XGBoost yielded the highest F1-score (0.345) and Youden index (0.183), indicating a more balanced trade-off between sensitivity and specificity.

Table 4
www.frontiersin.org

Table 4. Machine learning model metrics for predicting early-onset PDAP vs. no PDAP.

Machine learning-based classification of early-onset vs. late-onset peritonitis

To explore the discriminatory features between early- and late-onset peritonitis, SHAP values were used to rank feature importance. As shown in the SHAP summary plot (Figure 2A), the most influential variables in the predictive model included duration of peritoneal dialysis (SHAP value = 0.968), age (0.731), potassium (0.528), 6-month CONUT score (0.523), FBG (0.475), bile acid (0.437), SBP (0.416), BMI (0.385), ALP (0.350), and CRP (0.346).

Figure 2
Panel A shows a SHAP plot with features like Duration of PD and Age having the highest impact on model output, colored by feature value from low to high. Panel B displays ROC curves for the top ten SHAP features, comparing XGBoost, LightGBM, and a combined model. XGBoost shows an AUC of 0.691, LightGBM 0.781, and the combined model 0.744.

Figure 2. (A) Summary plot of SHAP values highlighting the top 10 most influential features for differentiating early-onset from late-onset PDAP; (B) ROC curves of three models trained using the top SHAP-ranked features to predict early-onset PDAP. SHAP, SHapley Additive exPlanations; PDAP, peritoneal dialysis-associated peritonitis; ROC, receiver operating characteristic.

Three machine learning models were trained using these top 10 features. ROC analysis on the validation set demonstrated that LightGBM achieved the best performance with an AUC of 0.781 (95% CI 0.637–0.925), followed by the ensemble model of XGBoost and LightGBM (AUC = 0.744, 95% CI 0.586–0.902), and XGBoost alone (AUC = 0.691, 95% CI 0.510–0.872) (Figure 2B). The performance metrics of machine learning models for predicting early-onset PDAP vs. late-onset PDAP are presented in Table 5. XGBoost demonstrated higher sensitivity (0.533) and specificity (0.840), together with the best F1-score (0.593) and Youden index (0.373), suggesting a superior balance between sensitivity and specificity.

Table 5
www.frontiersin.org

Table 5. Machine learning model metrics for predicting early-onset PDAP vs. late-onset PDAP.

Impact of CONUT score on early-onset PDAP

Among patients with no PDAP and early-onset PDAP, K-M survival analyses were performed to evaluate the association of the 6-month CONUT score and CONUT score change with the risk of early-onset PDAP. Patients with a 6-month CONUT score < 4 had a significantly lower risk of early-onset PDAP compared to those with a score ≥ 4 (p < 0.001, Figure 3A). Likewise, patients with a CONUT score change < 0 exhibited a lower risk of early-onset PDAP than those with a change ≥ 0 (p = 0.001, Figure 3B).

Figure 3
Four graphs comparing survival probability over time in months for different PDAP groups. Graphs A and B show “No PDAP & Early-onset PDAP”; C and D represent “Early-onset PDAP & Late-onset PDAP”. Graphs A and C compare 6-month CONUT score groups <4 and ≥4; graphs B and D show score change <0 and ≥0. P-values are noted in each graph: A (P<0.001), B (P=0.001), C (P<0.001), D (P=0.002). Dotted and solid lines indicate different score groups.

Figure 3. Kaplan–Meier curves for overall survival of PDAP (A) No PDAP & early-onset PDAP according to 6-month CONUT score groups (B) No PDAP & early-onset PDAP according to CONUT score change groups (C) Early-onset & late-onset PDAP according to 6-month CONUT score groups (D) Early-onset & late-onset PDAP according to CONUT score change groups. PDAP, peritoneal dialysis-associated peritonitis; CONUT, controlling nutritional status.

In a separate analysis restricted to patients with early- or late-onset PDAP, K-M curves revealed significant differences in the timing of PDAP onset according to both 6-month CONUT score (p < 0.001, Figure 3C) and CONUT score change (p = 0.002, Figure 3D).

However, in the competing risk analysis using CIF curves among patients without PDAP and those with early-onset PDAP, a significant difference in the cumulative incidence of early-onset PDAP was observed between patients with a 6-month CONUT score < 4 and those with a score ≥ 4 (p = 0.028, Figure 4A; p = 0.138, Figure 4B). In contrast, no significant differences in cumulative incidence were found when comparing early- and late-onset PDAP according to either the 6-month CONUT score or CONUT score change (p > 0.05, Figures 4C,D).

Figure 4
Four cumulative incidence graphs labeled A, B, C, and D compare groups based on CONUT scores. A and C show increasing trends with higher scores for “Early-onset PDAP” (A: P=0.028, C: P=0.633). B and D show similar patterns for “No PDAP” and “Early-onset PDAP” with CONUT score changes (B: P=0.138, D: P=0.841). Solid and dotted lines represent groups by score threshold.

Figure 4. Cumulative incidence function curves of PDAP considering competing risks (A) No PDAP & early-onset PDAP according to 6-month CONUT score groups (B) No PDAP & early-onset PDAP according to CONUT score change groups (C) Early-onset & late-onset PDAP according to 6-month CONUT score groups (D) Early-onset & late-onset PDAP according to CONUT score change groups. PDAP, peritoneal dialysis-associated peritonitis; CONUT, controlling nutritional status.

Discussion

This multicenter study evaluated the prognostic value of the dynamic CONUT score in predicting early-onset PDAP. Multivariable logistic regression identified the 6-month CONUT score as an independent risk factor for early-onset PDAP. K-M analyses revealed significant differences in the incidence of early-onset PDAP among groups stratified by both the 6-month CONUT score and its temporal changes. In addition, machine learning models incorporating the top 10 SHAP-ranked features demonstrated that the 6-month CONUT score was an important predictor for early-onset PDAP, further highlighting its clinical relevance. Among the models, LightGBM achieved the highest predictive performance, supporting the utility of machine learning-based approaches for individualized risk stratification. Taken together, these results underscore the importance of longitudinal nutritional and immunologic assessment, as well as advanced predictive modeling, in identifying patients at elevated risk of PDAP.

In our study, the 6-month CONUT score emerged as an independent predictor of early-onset PDAP. Although each component has previously been associated with outcomes in patients undergoing PD, the composite score offers a synergistic assessment. Serum albumin, a well-established negative acute-phase reactant, serves as a marker for both inflammation and nutritional reserves. Prior studies have demonstrated that low serum albumin correlates with adverse outcomes in PDAP (8). Lymphocyte count reflects immune competence and has been linked to both susceptibility to PDAP and recovery (7). Hypocholesterolemia is frequently observed during systemic inflammatory responses and may reflect the severity of acute illness (15). Previous studies have similarly demonstrated that objective nutritional indexes are associated with technique failure in patients undergoing PD (16). This reinforces the notion that multidimensional scoring systems provide a more comprehensive assessment of patient vulnerability. Our findings support the routine use of the CONUT score as part of risk stratification tools in PD populations and highlight the value of integrated nutritional-immunologic surveillance in clinical practice.

Interestingly, our analysis revealed that the baseline CONUT score—measured prior to the initiation of PD—was not significantly associated with the risk of early-onset PDAP. This finding suggests that pre-dialysis nutritional and immune status may inadequately reflect the dynamic physiological stress and environmental exposures encountered after the initiation of PD. Dialysis initiation marks a profound metabolic and immunologic transition. During the early months of PD, nutritional status and systemic inflammation are highly dynamic and are influenced by factors such as dialysate glucose absorption, peritoneal protein loss, fluid imbalance, and early inflammatory responses (1720). As such, a static, pre-dialysis assessment may fail to capture the evolving risk landscape following the start of therapy. These findings support the incorporation of serial CONUT assessments—particularly within the first 6 months of PD—as a more meaningful approach to predicting infectious complications.

K-M survival analyses revealed significant differences in early-onset PDAP incidence based on both the 6-month CONUT score and its temporal changes. The 6-month CONUT score—rather than the baseline CONUT score—was recognized as an independent risk factor for early-onset PDAP. Patients with persistently elevated CONUT scores at 6 months were at substantially higher risk, underscoring the clinical relevance of dynamic nutritional assessment over static, baseline evaluation. This finding is consistent with previous studies demonstrating that the increment and trajectory of serum albumin levels, particularly during the early phase of PD, may serve as a more critical determinant of patient outcomes (21). While K-M analyses revealed significant associations between both the 6-month CONUT score and its temporal change with peritonitis risk, only the 6-month CONUT score remained independently predictive in the multivariable logistic regression model. This suggests that changes in the CONUT score over time may reflect overlapping risk information already captured by the absolute score at 6 months. In addition, CIF curves found a significant difference only between patients with 6-month CONUT scores <4 vs. ≥ 4 among those with no PDAP and early-onset PDAP. No significant difference was seen between early- and late-onset PDAP. This discrepancy may result from competing events, which CIF accounts for but K-M treats as censored. These results suggest that the 6-month CONUT score better predicts early-onset PDAP risk, but its ability to distinguish early vs. late-onset cases is limited when considering competing risks.

Furthermore, we applied interpretable machine learning to distinguish peritonitis onset patterns in PD patients, with SHAP analysis identifying key predictors and improving clinical interpretability. SHAP analysis identified the 6-month CONUT score as the most influential predictor distinguishing patients without peritonitis from those with early-onset PDAP. Other top-ranking features—CRP, NLR, and SII—highlighted the role of inflammatory and nutritional disturbances, underscoring the importance of immune-nutritional status in infection risk (22). In distinguishing early- from late-onset peritonitis, dialysis duration and age were the most impactful variables, suggesting that longer exposure and older age are key determinants of infection timing, alongside biochemical markers such as potassium, bile acid, and FBG. SHAP plots enhanced model transparency by quantifying the contribution of individual variables to each prediction. Compared to traditional statistical methods, the machine learning approach offers improved accuracy, automatic feature selection, and better handling of multicollinearity—particularly when combined with SHAP. These advantages highlight its potential for developing clinically applicable tools for early risk stratification and personalized care in PD populations. Moreover, LightGBM demonstrated superior discriminative ability, while XGBoost provided a more balanced performance across sensitivity, specificity, and composite metrics, making it potentially more applicable in clinical practice.

This study has several strengths. Unlike prior investigations that assessed nutritional status at a single time point, our longitudinal analysis revealed that the trajectory of the CONUT score serves as a more robust predictor of early-onset PDAP, underscoring the importance of dynamic nutritional and immunologic monitoring. These findings suggest that patients exhibiting a worsening CONUT trajectory may benefit from early, targeted interventions, such as intensified nutritional support, immunomodulatory therapy, and closer clinical surveillance. The multicenter design enhances the external validity of our findings and supports their applicability across diverse patient populations. In addition, the use of interpretable machine learning models, particularly with SHAP-based explanation, enabled the identification of key predictors and provided insights into variable contributions, enhancing both predictive accuracy and clinical relevance. Moreover, the application of competing risk analysis and sensitivity analyses provide a methodologically rigorous framework for modeling infection risk.

However, this study has several limitations. First, partial missing data for the 6-month CONUT score may have introduced bias in evaluating longitudinal nutritional trends. Second, as an observational study, causal relationships cannot be established, and no interventional strategies were employed to modify nutritional or immune status; therefore, we were unable to assess whether such interventions could reduce the risk of PDAP. Third, important clinical parameters, including peritoneal membrane function and dialysis adequacy, were not included in the analysis and may represent potential confounders. Future prospective studies incorporating these parameters and evaluating the impact of targeted nutritional and immunologic interventions are warranted to confirm and expand upon these findings.

Conclusion

To our knowledge, this is the first study to demonstrate the prognostic value of both the 6-month CONUT score and its trajectory in predicting early-onset PDAP. Dynamic monitoring of nutritional and immune status may support risk-based prevention strategies to improve outcomes and extend technique survival. Furthermore, machine learning models confirmed the 6-month CONUT score as the strong predictor, with SHAP analysis highlighting the added value of inflammatory and biochemical markers. These findings support the development of individualized prediction tools, and future prospective studies are needed to validate their clinical utility.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of Third Affiliated Hospital of Soochow University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

HZ: Conceptualization, Formal analysis, Funding acquisition, Visualization, Writing – original draft. CY: Data curation, Investigation, Writing – review & editing. KS: Data curation, Investigation, Writing – review & editing. SZ: Data curation, Formal analysis, Investigation, Methodology, Writing – original draft. YY: Formal analysis, Writing – original draft. XC: Investigation, Writing – original draft. YM: Investigation, Writing – original draft. HH: Investigation, Methodology, Writing – original draft. MY: Conceptualization, Investigation, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by grants from the Jiangsu Provincial Health Commission (M2024070), the Top Talent of Changzhou “The 14th Five-Year Plan” High-Level Health Talents Training Project (2022CZBJ003), Changzhou Sci & Tech Program (CJ20241111), and Changzhou Key Medical Discipline (2022).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Nkunu, V, Tungsanga, S, Diongole, HM, Sarki, A, Arruebo, S, Caskey, FJ, et al. Landscape of kidney replacement therapy provision in low- and lower-middle income countries: a multinational study from the ISN-GKHA. PLOS Glob Public Health. (2024) 4:e0003979. doi: 10.1371/journal.pgph.0003979,

PubMed Abstract | Crossref Full Text | Google Scholar

2. Sanchez, JE, Ulloa, C, Bueno, CM, Astudillo, E, and Rodriguez-Suarez, C. Impact of peritoneal dialysis strategy on technique and patient survival. Clin Kidney J. (2023) 16:2523–9. doi: 10.1093/ckj/sfad155,

PubMed Abstract | Crossref Full Text | Google Scholar

3. Wang, AH, Sawyer, K, and Shah, AD. Persistent peritonitis in peritoneal dialysis: a comphrenesive review of recurrent, relapsing, refractory, and repeat peritonitis. Int Urol Nephrol. (2024) 56:583–95. doi: 10.1007/s11255-023-03731-w,

PubMed Abstract | Crossref Full Text | Google Scholar

4. Ito, Y, Sun, T, Tawada, M, Kinashi, H, Yamaguchi, M, Katsuno, T, et al. Pathophysiological mechanisms of peritoneal fibrosis and peritoneal membrane dysfunction in peritoneal Dialysis. Int J Mol Sci. (2024) 25:8607. doi: 10.3390/ijms25168607,

PubMed Abstract | Crossref Full Text | Google Scholar

5. Segal, JH, and Messana, JM. Prevention of peritonitis in peritoneal dialysis. Semin Dial. (2013) 26:494–502. doi: 10.1111/sdi.12114,

PubMed Abstract | Crossref Full Text | Google Scholar

6. Tang, J, Wang, D, Chen, Y, and Feng, J. The association between new inflammation markers and frequent peritoneal dialysis-associated peritonitis. BMC Nephrol. (2024) 25:81. doi: 10.1186/s12882-024-03496-z,

PubMed Abstract | Crossref Full Text | Google Scholar

7. He, Y, Huang, X, Zhang, J, Liao, J, Huang, H, He, Y, et al. Decreased peripheral blood lymphocyte count predicts poor treatment response in peritoneal Dialysis-associated peritonitis. J Inflamm Res. (2023) 16:5327–38. doi: 10.2147/JIR.S438674,

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zha, D, Yang, X, and Xi, H. Association of hypoalbuminemia with the risk of peritoneal dialysis-associated peritonitis in peritoneal dialysis patients: a meta-analysis. Blood Purif. (2025) [Preprint] 1–23. doi: 10.1159/000543693,

PubMed Abstract | Crossref Full Text | Google Scholar

9. Tsai, MH, Yang, CH, Tsai, YT, Lin, YT, Chao, CY, Chuang, HC, et al. Controlling nutritional status score as a survival prognosticator in patients with head and neck cancer. J Formos Med Assoc. (2025). doi: 10.1016/j.jfma.2025.04.017,

PubMed Abstract | Crossref Full Text | Google Scholar

10. Park, S, Choi, GW, Lee, I, Seo, Y, Chae, YS, Yun, WG, et al. Impact of nutritional changes on the prognosis in pancreatic Cancer patients underwent curative surgery after neoadjuvant chemotherapy. Nutrients. (2025) 17:647. doi: 10.3390/nu17040647,

PubMed Abstract | Crossref Full Text | Google Scholar

11. Ignacio de Ulibarri, J, Gonzalez-Madrono, A, de Villar, NG, Gonzalez, P, Gonzalez, B, Mancha, A, et al. CONUT: a tool for controlling nutritional status. First validation in a hospital population. Nutr Hosp. (2005) 20:38–45.

Google Scholar

12. Zhou, H, Chao, W, Cui, L, Li, M, Zou, Y, and Yang, M. Controlling nutritional status (CONUT) score as immune-nutritional predictor of outcomes in patients undergoing peritoneal dialysis. Clin Nutr. (2020) 39:2564–70. doi: 10.1016/j.clnu.2019.11.018,

PubMed Abstract | Crossref Full Text | Google Scholar

13. Levey, AS, Stevens, LA, Schmid, CH, Zhang, YL, Castro, AF 3rd, Feldman, HI, et al. A new equation to estimate glomerular filtration rate. Ann Intern Med. (2009) 150:604–12. doi: 10.7326/0003-4819-150-9-200905050-00006,

PubMed Abstract | Crossref Full Text | Google Scholar

14. Li, PK, Chow, KM, Cho, Y, Fan, S, Figueiredo, AE, Harris, T, et al. ISPD peritonitis guideline recommendations: 2022 update on prevention and treatment. Perit Dial Int. (2022) 42:110–53. doi: 10.1177/08968608221080586,

PubMed Abstract | Crossref Full Text | Google Scholar

15. Golucci, A, Marson, FAL, Ribeiro, AF, and Nogueira, RJN. Lipid profile associated with the systemic inflammatory response syndrome and sepsis in critically ill patients. Nutrition. (2018) 55-56:7–14. doi: 10.1016/j.nut.2018.04.007,

PubMed Abstract | Crossref Full Text | Google Scholar

16. Yang, Y, Xu, Y, Zhang, P, Zhou, H, Yang, M, and Xiang, L. Predictive value of objective nutritional indexes in technique failure in peritoneal Dialysis patients. J Ren Nutr. (2022) 32:605–12. doi: 10.1053/j.jrn.2021.09.005,

PubMed Abstract | Crossref Full Text | Google Scholar

17. Malho Guedes, A, Calcas Marques, R, Ribeiro, B, Fernandes, MT, Faisca, M, Silva, AP, et al. Peritoneal protein loss, inflammation, and nutrition: refuting myths. Front Med. (2022) 9:884061. doi: 10.3389/fmed.2022.884061,

PubMed Abstract | Crossref Full Text | Google Scholar

18. Dong, J, Chen, Y, Luo, S, Xu, R, and Xu, Y. Peritoneal protein leakage, systemic inflammation, and peritonitis risk in patients on peritoneal dialysis. Perit Dial Int. (2013) 33:273–9. doi: 10.3747/pdi.2011.00326,

PubMed Abstract | Crossref Full Text | Google Scholar

19. Trionfetti, F, Marchant, V, Gonzalez-Mateo, GT, Kawka, E, Marquez-Exposito, L, Ortiz, A, et al. Novel aspects of the immune response involved in the peritoneal damage in chronic kidney disease patients under dialysis. Int J Mol Sci. (2023) 24:5763. doi: 10.3390/ijms24065763,

PubMed Abstract | Crossref Full Text | Google Scholar

20. Yung, S, and Chan, TM. Pathophysiological changes to the peritoneal membrane during PD-related peritonitis: the role of mesothelial cells. Mediat Inflamm. (2012) 2012:484167. doi: 10.1155/2012/484167,

PubMed Abstract | Crossref Full Text | Google Scholar

21. Chiu, PF, Tsai, CC, Wu, CL, Yang, TY, Liou, HH, Chen, HL, et al. Trajectories of serum albumin predict survival of peritoneal Dialysis patients: a 15-year follow-up study. Medicine (Baltimore). (2016) 95:e3202. doi: 10.1097/MD.0000000000003202,

PubMed Abstract | Crossref Full Text | Google Scholar

22. Liu, M, Lu, Q, Sun, J, Dai, S, Yan, F, Yang, C, et al. Establishment of a prediction model for long-term infections in patients undergoing peritoneal dialysis. Front Med. (2025) 12:1596403. doi: 10.3389/fmed.2025.1596403,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: controlling nutritional status, machine learning, peritonitis, peritoneal dialysis, SHapley Additive exPlanations

Citation: Zhou H, Yao C, Song K, Zhao S, Yuan Y, Chen X, Ma Y, Hu H and Yang M (2025) Machine learning-based prediction of early-onset peritoneal dialysis-associated peritonitis: the role of the CONUT score. Front. Nutr. 12:1681169. doi: 10.3389/fnut.2025.1681169

Received: 07 August 2025; Revised: 01 October 2025; Accepted: 18 November 2025;
Published: 11 December 2025.

Edited by:

Zhen Li, Qilu Hospital of Shandong University, China

Reviewed by:

Shi Xue Dai, Guangdong Provincial People's Hospital, China
Priyanka Gavade, KLE Technological University Dr. M. S. Sheshgiri Campus, India

Copyright © 2025 Zhou, Yao, Song, Zhao, Yuan, Chen, Ma, Hu and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Min Yang, eWFuZ21pbjE1MTZAc3VkYS5lZHUuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.