Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 27 January 2026

Sec. Gastroenterology

Volume 13 - 2026 | https://doi.org/10.3389/fmed.2026.1712846

This article is part of the Research TopicAdvancing Gastrointestinal Disease Diagnosis with Interpretable AI and Edge Computing for Enhanced Patient CareView all 14 articles

Development and evaluation of a machine learning model to predict unplanned readmission risk in patients with ulcerative colitis

  • 1First Clinical Medical College, Shandong University of Traditional Chinese Medicine, Jinan, Shandong, China
  • 2Department of Gastroenterology, Affiliated Hospital of Shandong University of Traditional Chinese Medicine, Jinan, Shandong, China
  • 3Department of Gastroenterology, Hangzhou Hospital of Traditional Chinese Medicine, Hangzhou, Zhejiang, China

Objective: Ulcerative colitis (UC), a chronic inflammatory bowel disease marked by recurrent flares and remissions, often necessitates repeated hospitalization owing to disease variability. However, commonly used risk-scoring systems have limited predictive accuracy for hospital readmission. This study aimed to develop and validate a machine learning (ML)-based model to predict the risk of unplanned readmission within 1 year in patients with UC.

Methods: Unplanned readmission within 1 year was defined as an endpoint event, and a predictive model was developed using a retrospective cohort (n = 324) and externally validated using an independent prospective cohort (n = 137). Demographic characteristics, medical history, medication use, clinical symptoms, laboratory findings, and endoscopic data were integrated as input variables. The optimal feature subset was selected using Recursive Feature Elimination (RFE), and eight ML models were constructed. All models were optimized via five-fold cross-validation, and the best-performing model was selected as the final predictive tool and was subjected to external validation. Shapley additive explanation plots were used to interpret the predictive model.

Results: The RFE algorithm identified five critical predictors: C-reactive protein, erythrocyte sedimentation rate, red blood cell count, increased frequency of bowel movements, and platelet count. All ML models achieved an AUC above 0.75 in the training cohort, demonstrating their robust predictive capability. The random forest (RF) model consistently outperformed the others across the training, internal validation, and external validation cohorts, with AUCs of 0.936, 0.815, and 0.813, respectively, reflecting excellent stability and generalization. Building upon the RF model, an online risk prediction platform was developed to estimate the probability of unplanned readmission in patients with UC.

Conclusion: The RF-based model showed strong predictive accuracy for assessing the 1-year risk of unplanned readmission in UC patients. The corresponding web-based risk calculator offers clinicians a valuable tool for personalized risk evaluation and enhanced patient management.

1 Introduction

Ulcerative colitis (UC) is a chronic and relapsing inflammatory bowel disease that predominantly involves the colonic mucosa, with its etiology strongly linked to immune dysregulation, environmental exposure, and genetic predisposition (1, 2). As of 2023, UC affects approximately 5 million people globally, and its incidence continues to rise, with Europe showing the highest rates, Norway reporting 505 cases per 100,000 population (1, 3). The advent of biologics and immunomodulators in recent years has led to substantial improvements in disease control for certain patients with UC, especially those with moderate to severe activity or poor response to conventional treatments (4, 5). Nonetheless, UC remains a disease characterized by a protracted course and high relapse rate, posing a considerable burden on patient’s quality of life and healthcare systems (6).

The readmission rate (7) serves as a key metric for assessing disease control and management quality in UC. Evidence indicates that patients with UC face a heightened risk of readmission during disease flares and treatment transitions, highlighting the critical value of timely intervention within 1 year (8, 9). Consequently, accurate prediction of readmission risk enables clinicians to identify high-risk individuals and implement tailored follow-up and intervention plans, ultimately enhancing disease control and reducing the burden of rehospitalization.

While current prognostic models for UC offer preliminary insight into unplanned readmission risk, they are limited by incomplete incorporation of clinical variables, such as laboratory results, medication profiles, and comorbidities, as well as by the lack of external validation and cross-model comparative analysis (10, 11). Thus, a practical and widely applicable predictive tool is urgently needed to assist in managing the risk of readmission among patients with UC.

In summary, this study sought to leverage multiple machine learning (ML) algorithms and comprehensive clinical data to build and externally validate predictive models, ultimately developing an online tool for assessing 1-year unplanned readmission risk in patients with UC to enhance their clinical management.

2 Methods

2.1 Study design and subjects

The training dataset comprised a retrospective cohort of 324 patients with UC who were hospitalized at the Affiliated Hospital of Shandong University of Traditional Chinese Medicine between September 2015 and September 2020. The external validation cohort included 137 patients with UC with prospectively collected data who were hospitalized at the same center between October 2020 and October 2023. All patients underwent 1-year follow-up using telephone interviews, outpatient evaluations, and inpatient medical records.

The inclusion criteria were as follows: (1) diagnosis in accordance with the American College of Gastroenterology (ACG) guidelines (12); (2) age ≥ 18 years and ≤ 80 years, regardless of sex.

The exclusion criteria were as follows: (1) presence of severe dysfunction of the heart, liver, or kidneys; serious infections; hematologic diseases; or malignancies; (2) concurrent autoimmune diseases such as rheumatoid arthritis or systemic lupus erythematosus; (3) a history of colectomy in patients with UC; (4) substantial missing data in medical records (> 15%); and (5) poor treatment compliance or loss to follow-up.

This study complied with the Declaration of Helsinki and was approved by the Ethics Committee of the Affiliated Hospital of Shandong University of Traditional Chinese Medicine (Approval No. 2024-152-ky). All participants or their legal guardians provided informed consent by signing a consent form.

2.2 Outcomes

Endpoint events were defined as unplanned readmissions within 1 year. Unplanned readmission is defined as an unexpected hospitalization resulting from sudden changes in a patient’s condition or inadequate management, occurring outside of scheduled routine follow-up monitoring, enteral nutrition therapy, medication infusion, or other planned interventions (13). Each case of suspected unplanned readmission was independently assessed by two senior physicians, each with over 10 years of clinical experience. Any discrepancies in the evaluation were resolved through discussion until a consensus was reached.

2.3 Candidate predictors and data collection

Candidate predictors included only variables collected at the time of initial hospital admission. The following data were collected: (1) demographic factors, including age, sex, smoking history, and alcohol use; (2) past medical history, such as hypertension, diabetes, cardiovascular disease, and anemia; (3) medication regimen, including 5-aminosalicylic acid drugs, corticosteroids, immunosuppressants, and probiotics; (4) clinical manifestations, such as increased bowel frequency; (5) laboratory tests, including blood counts, liver and kidney function, electrolytes, and coagulation; (6) endoscopic data, including colonoscopy results and mucosal histology; and (7) clinical assessment tools, primarily the Mayo score (12), the Mayo Endoscopic Subscore (MES) (14), the Degree of Ulcerative Colitis Burden of Luminal Inflammation (DUBLIN) score (15), and the Ulcerative Colitis Endoscopic Index of Severity (UCEIS) (16). Two data managers independently collected and cross-checked all clinical data to ensure accuracy and consistency.

2.4 Statistical analysis and model construction

All statistical analyses were performed using R software (version 4.3.2, https://www.R-project.org). Multiple imputation was used to address missing data, and variables with over 15% missingness were excluded from the analysis. Descriptive statistics are presented as n (%), mean ± SD, or median (Q1, Q3), as appropriate. Group comparisons were conducted using Student’s t-test for normally distributed continuous variables and the Mann–Whitney U test for those not normally distributed. Categorical data were analyzed using Pearson’s χ2 test or Fisher’s exact test, depending on the expected frequencies. Statistical significance was set at p < 0.05 (two-tailed).

The optimal feature subset was selected using Recursive Feature Elimination (RFE) and applied to the training and validation of each ML model. Eight different ML models were trained and tested, including Random Forest (RF), Decision Tree (DT), K-Nearest Neighbors (KNN), Light Gradient Boosting Machine (Light GBM), Logistic Regression (LR), Multilayer Perceptron (MLP), Support Vector Machine (SVM), and Extreme Gradient Boosting (XG Boost). The training dataset was used for model development, and five-fold cross-validation was performed for hyperparameter tuning and internal validation. An independent prospective cohort was used for external validation, and the best-performing model was selected as the final model. The model performance was evaluated using metrics including the area under the receiver operating characteristic curve (AUC), accuracy, F1 score, recall, precision, sensitivity, and specificity. In addition, the optimal model was compared with the Mayo score, the MES, the DUBLIN score, and the UCEIS in the training and external validation cohorts. The classification performance was assessed using the precision-recall (PR) curve, and the clinical net benefit was evaluated using Decision Curve Analysis (DCA).

Shapley additive explanation (SHAP) was used to analyze the contribution and direction of each predictor. A web-based risk prediction platform was developed based on the optimal model and feature importance of the predictive variables.

3 Results

3.1 Comparison of clinical characteristics

This study included 461 eligible patients, of whom 324 were assigned to the training cohort and 137 to the validation cohort. Table 1 summarizes the baseline characteristics and missing data patterns of the included patients. After removing variables with missing rates over 15%, 45 variables were included in the statistical analysis (p > 0.05). Details of the excluded fecal calprotectin data are presented in Supplementary Table 1. The training and validation cohorts were comparable in terms of major clinical features, with no statistically significant differences. Compared with the training cohort, the external validation cohort showed significantly lower mean corpuscular hemoglobin concentration, mean corpuscular volume, serum potassium, and apolipoprotein A1 levels, but higher mean platelet volume and serum chloride levels (p < 0.05).

Table 1
www.frontiersin.org

Table 1. Baseline characteristics and outcomes of the training and validation cohort.

3.2 Selection of key predictors

RFE was used to identify key features from 45 candidate variables in the training cohort. The highest accuracy (80.61%) was achieved when the model selected an optimal subset of five clinical predictors (Supplementary Figure 1). The five selected features were C-reactive protein (CRP) level, erythrocyte sedimentation rate (ESR), red blood cell count (RBC), increased bowel movement frequency (IFBM, based on the Mayo score), and platelet count (PLT).

3.3 Multiple ML model performance

In the training dataset, all models, except the DT model (AUC = 0.779), achieved an AUC exceeding 0.80. The RF model exhibited the highest performance, with an AUC of 0.936 (Figure 1A). In internal validation, the RF model continued to demonstrate superior predictive power (AUC = 0.815) via five-fold cross-validation, whereas the remaining models also showed good performance, with AUCs above 0.70. In the external validation cohort, all models yielded AUCs above 0.75, reflecting their strong robustness and generalization capability. Notably, the RF model continued to outperform the others, achieving an AUC of 0.813 (Figure 1B). The RF model consistently outperformed the others across all datasets (training, internal, and external), demonstrating excellent predictive accuracy and clinical utility (Table 2). The calibration plot indicates that the RF model shows reasonably good calibration (Supplementary Figure 2). Accordingly, the RF model was chosen as the final predictive tool for assessing the risk of unplanned readmission in patients with UC.

Figure 1
ROC curves for two cohorts. Panel A shows ROC curves for the training cohort with various models: RF (AUC = 0.936), DT (AUC = 0.779), KNN (AUC = 0.908), Light GBM (AUC = 0.839), Logistic (AUC = 0.811), MLP (AUC = 0.865), SVM (AUC = 0.832), XGBoost (AUC = 0.822). Panel B displays ROC curves for the validation cohort with models: RF (AUC = 0.813), DT (AUC = 0.752), KNN (AUC = 0.719), Light GBM (AUC = 0.802), Logistic (AUC = 0.791), MLP (AUC = 0.81), SVM (AUC = 0.775), XGBoost (AUC = 0.804).

Figure 1. AUC comparison of eight machine learning models in predicting unplanned readmission of UC in (A) training cohort and (B) validation cohort. DT, decision tree; KNN, k-nearest neighbors; Light GBM, light gradient boosting machine; MLP, multilayer perceptron; RF, random forest; SVM, support vector machine, XG Boost, extreme gradient boosting.

Table 2
www.frontiersin.org

Table 2. Predictive performance of eight machine learning models.

DCA revealed that the model provided positive clinical benefits across a wide range of threshold probabilities in both the training and external validation cohorts (Figure 2). Notably, the RF model showed persistently higher net benefits in both cohorts, especially at lower to intermediate probability thresholds, suggesting greater clinical utility in guiding decision-making.

Figure 2
Two Decision Curve Analysis (DCA) graphs comparing models for training and validation cohorts. Both graphs display net benefit against threshold probability for models such as Random Forest (RF), Decision Tree (DT), K-Nearest Neighbors (KNN), LightGBM, Logistic, Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), and XGBoost. The legend includes

Figure 2. Decision curve analysis (DCA) of different machine learning models in the training cohort (A) and external validation cohort (B).

PR curve analysis revealed that the RF model performed well in both the training and external validation cohorts (Figure 3). In the training cohort, the PR curve showed a stable trend, suggesting that the RF model consistently achieved high precision and recall across various thresholds, indicating its strong robustness. Although slight fluctuations were observed in the PR curve at very low recall levels (0.00–0.10) in the external validation cohort, the model maintained a strong classification performance within clinically relevant decision thresholds, underscoring its practical value in clinical settings.

Figure 3
Two precision-recall curves labeled A and B. A represents the training cohort, and B represents the validation cohort, both for the RF model. Precision is plotted against recall, showing a generally increasing trend with varying fluctuations.

Figure 3. The precision recall (PR) curves of the random forest (RF) model in the training cohort (A) and validation cohort (B).

3.4 Comparison of the performance of the optimal model and the Mayo score, MES, DUBLIN score, and UCEIS

Using ROC curve analysis, we compared the ability of the optimal RF model, the Mayo score, the MES, the DUBLIN score, and the UCEIS to predict unplanned readmission within 1 year (Figure 4). The results showed that the RF model achieved the best predictive performance, with AUCs of 0.936 and 0.813 in the training and validation cohorts, respectively. The UCEIS had the second-best performance, with AUCs of 0.734 and 0.779 in the training and validation cohorts, respectively. The Mayo score showed modest predictive ability in the validation cohort (AUC = 0.720), but its performance in the training cohort was relatively lower (AUC = 0.698). The predictive performance of the MES and DUBLIN scores was suboptimal, with AUCs of 0.637 and 0.625 in the training cohort and 0.647 and 0.676 in the validation cohort, respectively.

Figure 4
Two ROC curve graphs comparing model performances for training and validation cohorts. Graph A shows RF with AUC 0.936, Mayo 0.698, MES 0.637, DUBLIN 0.625, UCEIS 0.734. Graph B shows RF with AUC 0.813, Mayo 0.72, MES 0.647, DUBLIN 0.676, UCEIS 0.779. Sensitivity vs. 1-Specificity is plotted.

Figure 4. AUC comparison of the RF model and clinical scores (Mayo, MES, DUBLIN, UCEIS) for predicting unplanned readmission of UC in the training cohort (A) and validation cohort (B). MES, Mayo Endoscopic Subscore; DUBLIN, Degree of Ulcerative Colitis Burden of Luminal Inflammation; UCEIS, Ulcerative Colitis Endoscopic Index of Severity.

3.5 Feature importance and model explainability

To gain deeper insights into the decision logic of the model, SHAP plots were employed to visualize the RF model outputs (Figure 5). The top contributing features, in descending order of importance, were CRP, ESR, RBC, IFBM, and PLT. SHAP analysis revealed that CRP, ESR, IFBM Severe, PLT, and IFBM Moderate had predominantly positive SHAP values-indicating that increases in these features (as shown in red) contributed positively to predicted readmission risk and were thus identified as risk factors. In contrast, RBC and IFBM Mild had predominantly negative SHAP values, indicating that higher levels of these features were associated with a lower readmission risk and could be considered protective factors.

Figure 5
SHAP summary plot showing the impact of features on model output. Features include CRP, ESR, RBC, IFBM Severe, PLT, IFBM Moderate, and IFBM Mild. The color gradient indicates feature value, with red representing high values and blue representing low values. SHAP values range from -0.2 to 0.4, indicating the contribution of each feature to prediction.

Figure 5. SHAP analysis of RF model used for predicting UC unplanned readmission. CRP, C-reactive protein; ESR, erythrocyte sedimentation rate; RBC, red blood cell; IFBM, increased frequency of bowel movements; PLT, platelet count.

3.6 Implementation of web calculator

Based on five key indicators—CRP, ESR, RBC, IFBM, and PLT—a web-based calculator was developed to provide an individualized prediction of unplanned readmission risk in patients with UC, with the goal of supporting precision management and clinical intervention1 (Figure 6).

Figure 6
Readmission Predictor interface showing input parameters and a bar chart of feature importance measured by SHAP values. Inputs include increased defecation and CRP, ESR, RBC, and PLT levels. The bar chart highlights CRP as the most important feature, followed by ESR, RBC, and others. The server status indicates readiness for calculation.

Figure 6. Web-based calculator for predicting unplanned readmission of UC patients.

4 Discussion

Patients with ulcerative colitis face a high risk of unplanned readmission during disease flare-ups, which presents an urgent public health concern. Unplanned readmission typically indicates worsening health status post-discharge, exacerbated clinical symptoms, increased difficulty in medication management, and heightened risks, including opportunistic infections (such as Clostridioides difficile and cytomegalovirus infections), rapid disease progression, and severe complications such as toxic megacolon, massive hemorrhage, or perforation (17). Inadequate early detection and delayed intervention may cause sudden clinical decline, elevating both mortality and long-term disability risks in some patients (18, 19). Thus, the timely identification of high-risk patients for unplanned readmissions is critical for improving clinical outcomes. However, the complex and individualized nature of UC, driven by factors such as inflammatory burden, treatment response, comorbid conditions, and psychological state, makes it difficult for even seasoned clinicians to reliably predict disease progression over a 1-year horizon. Hence, constructing a predictive model based on multidimensional clinical variables and implementing an accessible online tool is vital for reducing the risk of unplanned readmissions and optimizing the long-term prognosis of patients with UC.

This study established an ML-driven predictive platform that incorporates a wide range of clinical variables—demographics, medical history, medication profiles, symptoms, lab tests, and endoscopic findings—to estimate the 1-year unplanned readmission risk in patients with UC, enabling a holistic assessment of their clinical condition. The RF model demonstrated superior predictive performance compared with the other seven algorithms, achieving AUCs of 0.936, 0.815, and 0.813 in the training, internal validation, and 0.813 in external validation cohort, respectively. DCA results confirmed that the RF model offered a higher net clinical benefit within most clinically relevant threshold ranges, underscoring its practical utility and scalability in real-world settings. Some fluctuations were observed in the PR curve of the RF model at very low recall levels (recall <0.10) in the external validation cohort, likely due to the limited number of positive cases and uneven class distribution, which may have impacted the stability in identifying rare high-risk patients. Overall, the RF model consistently exhibited a strong classification performance across thresholds, reflecting its robustness and ability to generalize to unseen data. In summary, the online prediction platform built using the RF model demonstrated strong feasibility. Its risk prediction capabilities help patients gain clearer insights into their health status and enhance their self-management awareness through risk alerts. For high-risk patients, clinicians can intensify follow-up, promptly adjust treatment plans, and strengthen nursing monitoring. This personalized intervention not only effectively reduces unplanned readmission rates but also improves treatment outcomes and minimizes the waste of healthcare resources.

This study identified five critical predictors incorporated into the RF model: CRP, ESR, RBC, IFBM, and PLT, which represent the core clinical domains of UC, such as systemic inflammation, symptomatic severity, bleeding tendency, and nutritional condition. CRP and ESR are widely recognized markers of inflammation; their elevation reflects enhanced systemic inflammatory responses and correlates strongly with disease activity and mucosal injury (20, 21). When the intestinal mucosa is damaged, the immune system is activated, particularly local intestinal T cells and macrophages, which release inflammatory mediators (such as TNF-α and IL-6). This exacerbates the local inflammatory response and induces systemic acute-phase inflammatory reactions via the bloodstream, leading to elevated CRP and ESR levels (22, 23). RBCs serve as indirect indicators of persistent disease activity. In patients with UC, intestinal inflammation, malabsorption, and the release of inflammatory cytokines may lead to RBC destruction, resulting in malnutrition and anemia (2427). As the most clinically intuitive symptom variable, IFBM reflects intestinal barrier impairment and inflammation-induced motility abnormality. An increased stool frequency is significantly associated with the risk of readmission (28, 29). PLTs are among the first cells recruited to the vascular endothelium at the sites of inflammation and infection (30). PLTs and platelet activation can trigger a cascade of inflammatory responses by increasing vascular permeability and promoting leukocyte migration, thereby exacerbating intestinal mucosal injury (31). PLTs not only serve as markers of inflammation but may also indicate the activation of the coagulation cascade, a known contributor to higher complication rates in patients with IBD (32). In summary, these five variables characterize the clinical status of patients with UC from different dimensions, providing highly representative information inputs for the model. This ensures its discriminative power and clinical interpretability for risk prediction.

In recent years, multiple studies have attempted to develop clinical indicator-based models to predict the risk of hospital readmission in patients with UC. Xiang et al. (33) developed a nomogram incorporating the AHRQ Elixhauser index, regular follow-up status, corticosteroid history, CRP level, and the UC endoscopic index of severity (UCEIS) to estimate readmission risk in UC patients. The model demonstrated an AUC of 0.764 at the 52-week follow-up in the external validation. Sobotka et al. (34) developed a logistic regression model incorporating surgical factors—including race, medication history, preoperative severity, surgery type, and postoperative complications—to predict 30-day readmission risk. In the validation cohort, the model achieved an AUC of 0.71 (95% CI: 0.66–0.75). However, these models were constrained by small sample sizes, suboptimal accuracy, and limited feature diversity, thus falling short of the requirements for personalized risk assessment. Moreover, the LR model by Sobotka et al. was not externally validated, which restricts its applicability in broader clinical settings. The RF model outperformed the existing clinical scores in both the training and validation cohorts. Mayo mainly focuses on disease activity (35). MES is the endoscopic component of the Mayo score and is simpler (36). UCEIS assesses endoscopic severity in three domains: vascular pattern, bleeding, and erosions/ulcers (37). DUBLIN scores is mainly used to quantify the inflammatory burden of ulcerative colitis (38). However, the lack of integration of comprehensive clinical data in these scores limits their ability to predict multifactorial outcomes, such as readmission. In contrast, the RF model integrates diverse clinical features to provide a comprehensive view of patient status, thereby improving its capacity to accurately discriminate high-risk readmission.

Our study has several strengths. First, we conducted a comprehensive screening of multidimensional clinical variables to identify key predictors, minimizing the potential omission of relevant features and enhancing the model’s predictive accuracy. Second, the model was externally validated using a prospectively collected independent cohort, confirming its satisfactory applicability in clinical practice. Third, the optimal model was selected by comparing the performance of multiple ML algorithms, and an online risk prediction platform was developed based on the top-performing RF model. This web-based risk calculator is freely available to the public, providing clinical decision support for physicians in assessing readmission risk, while also aiding patients in scheduling follow-ups and improving self-management. Looking forward, we will explore integration of this tool into electronic health record systems to enable automated data transfer and real-time risk updates, thereby enhancing its clinical utility in real-world settings.

4.1 Limitations and prospects

However, this study had certain limitations. First, this was a single-center study. Although the applicability of the model was preliminarily validated in a prospective cohort, multicenter prospective studies are still needed to further assess its external generalizability. Second, the model was developed using retrospective data. Although multiple clinical dimensions were considered, the early initiation of the study led to high missing rates in fecal calprotectin, other histologic indices, and psychosocial variables, which prevented some potentially informative predictors from being included in the model. To address this limitation, we have designed a new prospective cohort study using predesigned case report forms and standardized data collection procedures, in which the above variables will be systematically incorporated into predictor selection and model development, with the aim of further improving model performance. Third, the study population consisted only of hospitalized patients with UC, and it remains unclear whether the model is applicable to UC management in the outpatient setting. In the future, we plan to integrate the RF model more broadly into the clinical decision-making process for patients with UC and to continuously accumulate case data across different clinical contexts in order to further evaluate and optimize its predictive performance.

5 Conclusion

This study established a RF model based on five clinically accessible predictors—CRP, ESR, RBC, IFBM, and PLT—with reliable predictive performance for 1-year outcomes in patients with UC. An online platform was developed based on the RF model to assist clinicians in performing individualized risk assessments, thereby optimizing treatment strategies and reducing the risk of unplanned readmission.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation. Requests to access these datasets should be directed to the first author, Tianqi Wang, wwwsunlight77@163.com.

Ethics statement

The studies involving humans were approved by the Research Ethics Committee of Shandong University of Traditional Chinese Medicine Affiliated Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

TW: Conceptualization, Data curation, Methodology, Writing – original draft. YZ: Methodology, Writing – original draft. XZ: Data curation, Writing – original draft. JiZ: Data curation, Writing – original draft. JuZ: Conceptualization, Formal analysis, Methodology, Writing – original draft. DW: Conceptualization, Funding acquisition, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was funded by the National Natural Science Foundation of China (Grant No. 82174177), Shandong Province Natural Science Foundation Joint Fund Project (Grant No. ZR2022LZY012), Qilu Health and Health Outstanding Youth Talent Program, and Open Project of the Key Laboratory of Traditional Chinese Medicine Classic Theory of the Ministry of Education (Grant No. School Experiment Letter (2023) No. 15).

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2026.1712846/full#supplementary-material

Footnotes

References

1. Le Berre, C, Honap, S, and Peyrin-Biroulet, L. Ulcerative colitis. Lancet. (2023) 402:571–84. doi: 10.1016/s0140-6736(23)00966-2,

PubMed Abstract | Crossref Full Text | Google Scholar

2. Voelker, R. What is ulcerative colitis? JAMA. (2024) 331:716. doi: 10.1001/jama.2023.23814

Crossref Full Text | Google Scholar

3. Ng, SC, Shi, HY, Hamidi, N, Underwood, FE, Tang, W, Benchimol, EI, et al. Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies. Lancet. (2017) 390:2769–78. doi: 10.1016/s0140-6736(17)32448-0,

PubMed Abstract | Crossref Full Text | Google Scholar

4. Khan, AU, Ali, M, and Wahab, MA. Comparative efficacy of pharmacologic interventions in ulcerative colitis: a network meta-analysis. Inflammopharmacology. (2025) 33:2679–87. doi: 10.1007/s10787-025-01723-z,

PubMed Abstract | Crossref Full Text | Google Scholar

5. Singh, S, Loftus, EV Jr, Limketkai, BN, Haydek, JP, Agrawal, M, Scott, FI, et al. AGA living clinical practice guideline on pharmacological management of moderate-to-severe ulcerative colitis. Gastroenterology. (2024) 167:1307–43. doi: 10.1053/j.gastro.2024.10.001,

PubMed Abstract | Crossref Full Text | Google Scholar

6. Wangchuk, P, Yeshi, K, and Loukas, A. Ulcerative colitis: clinical biomarkers, therapeutic targets, and emerging treatments. Trends Pharmacol Sci. (2024) 45:892–903. doi: 10.1016/j.tips.2024.08.003,

PubMed Abstract | Crossref Full Text | Google Scholar

7. Kunkle, B, Singh, H, Abraham, D, Asamoah, N, Barrow, J, and Mattar, M. Independent predictors of 90-day readmission in patients with inflammatory bowel disease: a nationwide retrospective study. J Crohns Colitis. (2025) 19:jaf034. doi: 10.1093/ecco-jcc/jjaf034.,

PubMed Abstract | Crossref Full Text | Google Scholar

8. Colombel, JF, D'Haens, G, Lee, WJ, Petersson, J, and Panaccione, R. Outcomes and Strategies to Support a Treat-to-target Approach in Inflammatory Bowel Disease: A Systematic Review. J Crohns Colitis. (2020) 14:254–66. doi: 10.1093/ecco-jcc/jjz131,

PubMed Abstract | Crossref Full Text | Google Scholar

9. Yoon, JY, Cha, JM, Lee, CK, Park, YS, Huh, KC, Shin, JE, et al. Early course of newly diagnosed moderate-to-severe ulcerative colitis in Korea: Results from a hospital-based inception cohort study (MOSAIK). J Gastroenterol Hepatol. (2021) 36:2149–56. doi: 10.1111/jgh.15435,

PubMed Abstract | Crossref Full Text | Google Scholar

10. Takenaka, K, Ohtsuka, K, Fujii, T, Oshima, S, Okamoto, R, and Watanabe, M. Deep Neural Network Accurately Predicts Prognosis of Ulcerative Colitis Using Endoscopic Images. Gastroenterology. (2021) 160:2175–2177.e3. doi: 10.1053/j.gastro.2021.01.210,

PubMed Abstract | Crossref Full Text | Google Scholar

11. Noguchi, T, Ando, T, Emoto, S, Nozawa, H, Kawai, K, Sasaki, K, et al. Artificial intelligence program to predict p53 mutations in ulcerative colitis-associated cancer or dysplasia. Inflamm Bowel Dis. (2022) 28:1072–80. doi: 10.1093/ibd/izab350,

PubMed Abstract | Crossref Full Text | Google Scholar

12. Rubin, DT, Ananthakrishnan, AN, Siegel, CA, Barnes, EL, and Long, MD. ACG Clinical Guideline Update: Ulcerative Colitis in Adults. Am J Gastroenterol. (2025) 120:1187–224. doi: 10.14309/ajg.0000000000003463,

PubMed Abstract | Crossref Full Text | Google Scholar

13. Conilione, P, Jessup, R, and Gust, A. Novel machine learning model for predicting multiple unplanned hospitalisations. BMJ Health Care Inform. (2023) 30:e100682. doi: 10.1136/bmjhci-2022-100682,

PubMed Abstract | Crossref Full Text | Google Scholar

14. Pagnini, C, Menasci, F, Desideri, F, Corleto, VD, Delle Fave, G, and Di Giulio, E. Endoscopic scores for inflammatory bowel disease in the era of 'mucosal healing': old problem, new perspectives. Dig Liver Dis. (2016) 48:703–8. doi: 10.1016/j.dld.2016.03.006,

PubMed Abstract | Crossref Full Text | Google Scholar

15. Rowan, CR, Cullen, G, Mulcahy, HE, Sheridan, J, Moss, AC, Ryan, EJ, et al. DUBLIN [Degree of Ulcerative colitis Burden of Luminal Inflammation] Score, a Simple Method to Quantify Inflammatory Burden in Ulcerative Colitis. J Crohns Colitis. (2019) 13:1365–71. doi: 10.1093/ecco-jcc/jjz067,

PubMed Abstract | Crossref Full Text | Google Scholar

16. Travis, SP, Schnell, D, Krzeski, P, Abreu, MT, Altman, DG, Colombel, JF, et al. Developing an instrument to assess the endoscopic severity of ulcerative colitis: the Ulcerative Colitis Endoscopic Index of Severity (UCEIS). Gut. (2012) 61:535–42. doi: 10.1136/gutjnl-2011-300486,

PubMed Abstract | Crossref Full Text | Google Scholar

17. Rivière, P, Li Wai Suen, C, Chaparro, M, De Cruz, P, Spinelli, A, and Laharie, D. Acute severe ulcerative colitis management: unanswered questions and latest insights. Lancet Gastroenterol Hepatol. (2024) 9:251–62. doi: 10.1016/s2468-1253(23)00313-8,

PubMed Abstract | Crossref Full Text | Google Scholar

18. Honap, S, Jairath, V, Sands, BE, Dulai, PS, Danese, S, and Peyrin-Biroulet, L. Acute severe ulcerative colitis trials: the past, the present and the future. Gut. (2024) 73:1763–73. doi: 10.1136/gutjnl-2024-332489,

PubMed Abstract | Crossref Full Text | Google Scholar

19. Attauabi, M, Madsen, GR, Bendtsen, F, Seidelin, JB, and Burisch, J. Multidimensional Patient-Reported Outcomes and Quality of Life at Diagnosis of IBD: A Population-Based Inception Cohort Study. Clin Gastroenterol Hepatol. (2025) 23:1418–27. doi: 10.1016/j.cgh.2024.08.047,

PubMed Abstract | Crossref Full Text | Google Scholar

20. Langhorst, J, Boone, J, Lauche, R, Rueffer, A, and Dobos, G. Faecal Lactoferrin, Calprotectin, PMN-elastase, CRP, and White Blood Cell Count as Indicators for Mucosal Healing and Clinical Course of Disease in Patients with Mild to Moderate Ulcerative Colitis: Post Hoc Analysis of a Prospective Clinical Trial. J Crohns Colitis. (2016) 10:786–94. doi: 10.1093/ecco-jcc/jjw044,

PubMed Abstract | Crossref Full Text | Google Scholar

21. Croft, A, Lord, A, and Radford-Smith, G. Markers of Systemic Inflammation in Acute Attacks of Ulcerative Colitis: What Level of C-reactive Protein Constitutes Severe Colitis? J Crohns Colitis. (2022) 16:1089–96. doi: 10.1093/ecco-jcc/jjac014,

PubMed Abstract | Crossref Full Text | Google Scholar

22. Tatiya-Aphiradee, N, Chatuphonprasert, W, and Jarukamjorn, K. Immune response and inflammatory pathway of ulcerative colitis. J Basic Clin Physiol Pharmacol. (2018) 30:1–10. doi: 10.1515/jbcpp-2018-0036,

PubMed Abstract | Crossref Full Text | Google Scholar

23. Katsandegwaza, B, Horsnell, W, and Smith, K. Inflammatory Bowel Disease: A Review of Pre-Clinical Murine Models of Human Disease. Int J Mol Sci. (2022) 23:9344. doi: 10.3390/ijms23169344,

PubMed Abstract | Crossref Full Text | Google Scholar

24. Bai, X, Bai, G, Tang, L, Liu, L, Li, Y, and Jiang, W. Changes in MMP-2, MMP-9, inflammation, blood coagulation and intestinal mucosal permeability in patients with active ulcerative colitis. Exp Ther Med. (2020) 20:269–74. doi: 10.3892/etm.2020.8710,

PubMed Abstract | Crossref Full Text | Google Scholar

25. Głąbska, D, Guzek, D, Kanarek, B, and Lech, G. Analysis of association between dietary intake and red blood cell count results in remission ulcerative colitis individuals. Medicina (Kaunas). (2019) 55:96. doi: 10.3390/medicina55040096,

PubMed Abstract | Crossref Full Text | Google Scholar

26. Iolascon, A, Andolfo, I, Russo, R, Sanchez, M, Busti, F, Swinkels, D, et al. Recommendations for diagnosis, treatment, and prevention of iron deficiency and iron deficiency anemia. Hema. (2024) 8:e108. doi: 10.1002/hem3.108,

PubMed Abstract | Crossref Full Text | Google Scholar

27. Fan, Z, Zhou, H, Zhang, J, Liu, X, Wu, T, Shi, Y, et al. Opportunistic infections changed before and after SARS-CoV-2 infection in inflammatory bowel disease patients: a retrospective single-center study in China. Front Med (Lausanne). (2024) 11:1461801. doi: 10.3389/fmed.2024.1461801,

PubMed Abstract | Crossref Full Text | Google Scholar

28. Pan, SM, Wang, CL, Hu, ZF, Zhang, ML, Pan, ZF, Zhou, RY, et al. Baitouweng decoction repairs the intestinal barrier in DSS-induced colitis mice via regulation of AMPK/mTOR-mediated autophagy. J Ethnopharmacol. (2024) 318:116888. doi: 10.1016/j.jep.2023.116888,

PubMed Abstract | Crossref Full Text | Google Scholar

29. Restellini, S, Chao, CY, Martel, M, Barkun, A, Kherad, O, Seidman, E, et al. Clinical Parameters Correlate With Endoscopic Activity of Ulcerative Colitis: A Systematic Review. Clin Gastroenterol Hepatol. (2019) 17:1265–1275.e8. doi: 10.1016/j.cgh.2018.12.021,

PubMed Abstract | Crossref Full Text | Google Scholar

30. Tan, S, Yang, X, Mu, X, Liu, S, Wang, Y, Li, Y, et al. The predictive role of peripheral serum inflammatory markers NLR, PLR, and LMR in ulcerative colitis and Crohn's disease: a systematic review and meta-analysis. Front Immunol. (2025) 16:1623899. doi: 10.3389/fimmu.2025.1623899,

PubMed Abstract | Crossref Full Text | Google Scholar

31. Endo, K, Satoh, T, Yoshino, Y, Kondo, S, Kawakami, Y, Katayama, T, et al. Neutrophil-to-Lymphocyte and Platelet-to-Lymphocyte Ratios as Noninvasive Predictors of the Therapeutic Outcomes of Systemic Corticosteroid Therapy in Ulcerative Colitis. Inflamm Intest Dis. (2021) 6:218–24. doi: 10.1159/000520523,

PubMed Abstract | Crossref Full Text | Google Scholar

32. Li, HY, and Liu, ™. Platelet indices and inflammatory bowel disease: a Mendelian randomization study. Front Immunol (2024) 15:1377915. doi:doi: 10.3389/fimmu.2024.1377915,

PubMed Abstract | Crossref Full Text | Google Scholar

33. Xiang, Y, Yuan, Y, Liu, J, Xu, X, Wang, Z, Hassan, S, et al. A nomogram based on clinical factors to predict calendar year readmission in patients with ulcerative colitis. Ther Adv Gastroenterol. (2023) 16:17562848231189124. doi: 10.1177/17562848231189124,

PubMed Abstract | Crossref Full Text | Google Scholar

34. Sobotka, LA, Husain, SG, Krishna, SG, Hinton, A, Pavurula, R, Conwell, DL, et al. A risk score model of 30-day readmission in ulcerative colitis after colectomy or proctectomy. Clin Transl Gastroenterol. (2018) 9:175. doi: 10.1038/s41424-018-0039-y,

PubMed Abstract | Crossref Full Text | Google Scholar

35. Hashash, JG, Yu Ci Ng, F, Farraye, FA, Wang, Y, Colucci, DR, Baxi, S, et al. Inter- and Intraobserver Variability on Endoscopic Scoring Systems in Crohn's Disease and Ulcerative Colitis: A Systematic Review and Meta-Analysis. Inflamm Bowel Dis. (2024) 30:2217–26. doi: 10.1093/ibd/izae051,

PubMed Abstract | Crossref Full Text | Google Scholar

36. Xu, W, Liu, F, Hua, Z, Gu, Y, Lian, L, Cui, L, et al. Comparison of The Toronto IBD Global Endoscopic Reporting (TIGER) score, Mayo endoscopic score (MES), and ulcerative colitis endoscopic index of severity (UCEIS) in predicting the need for ileal pouch-anal anastomosis in patients with ulcerative colitis. Int J Color Dis. (2023) 38:53. doi: 10.1007/s00384-023-04347-3,

PubMed Abstract | Crossref Full Text | Google Scholar

37. Meringer, H, Kayal, M, Jairath, V, Qasim, A, Macdonald, JK, Yuan, Y, et al. Scoring indices for assessing endoscopic disease activity in acute severe ulcerative colitis: a systematic review. J Crohns Colitis. (2025) 19:jjaf126. doi: 10.1093/ecco-jcc/jjaf126,

PubMed Abstract | Crossref Full Text | Google Scholar

38. Zhang, XF, Li, P, Ding, XL, Chen, H, Wang, SJ, Jin, SB, et al. Comparing the clinical application values of the Degree of Ulcerative Colitis Burden of Luminal Inflammation (DUBLIN) score and Ulcerative Colitis Endoscopic Index of Severity (UCEIS) in patients with ulcerative colitis. Gastroenterol Rep (Oxf). (2021) 9:533–42. doi: 10.1093/gastro/goab026,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: machine learning, online platform, predictive model, ulcerative colitis, unplanned readmission

Citation: Wang T, Zhao Y, Zhao X, Zhu J, Zhan J and Wang D (2026) Development and evaluation of a machine learning model to predict unplanned readmission risk in patients with ulcerative colitis. Front. Med. 13:1712846. doi: 10.3389/fmed.2026.1712846

Received: 25 September 2025; Revised: 22 December 2025; Accepted: 06 January 2026;
Published: 27 January 2026.

Edited by:

Sri Krishnan, Toronto Metropolitan University, Canada

Reviewed by:

Duolong Zhu, Baylor College of Medicine, United States
Apurva Patel, Gujarat Cancer & Research Institute, India

Copyright © 2026 Wang, Zhao, Zhao, Zhu, Zhan and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dongli Wang, d2FuZ2RsMjAyNTA1QDE2My5jb20=; Junyi Zhan, emhhbmp1bnlpNTY2QGdtYWlsLmNvbQ==

These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.