- 1Department of Infectious Disease, The Second Hospital of Dalian Medical University, Dalian, China
- 2Department of Gastroenterology, Shanxi Bethune Hospital, Taiyuan, China
- 3Department of Nephrology, The Second Hospital of Dalian Medical University, Dalian, China
Background and aims: Acute-on-chronic liver failure (ACLF) is a severe clinical syndrome marked by rapid progression and high short-term mortality. In the Asia-Pacific region, where the hepatitis B virus (HBV) is endemic, HBV-related ACLF (HBV-ACLF) represents the most common subtype. The aim of this study was to develop and validate a concise and accurate nomogram for predicting 28-day mortality in patients with HBV-related ACLF.
Methods: A total of 159 patients with HBV-ACLF were enrolled, including 113 in the training cohort and 46 in the validation cohort. Clinical characteristics, routine laboratory parameters, and inflammatory cytokine levels were collected. In the training cohort, least absolute shrinkage and selection operator (LASSO) regression was applied to select variables, followed by multivariate logistic regression to construct a nomogram model (ICI). Model performance was evaluated in both cohorts using receiver operating characteristic (ROC) curves, calibration plots, and decision curve analysis (DCA), and compared with COSSH-ACLF II, CLIF-C ACLF, AARC, CLIF-OFs, MELD, and CTP scores.
Results: LASSO regression identified ln(INR), ln(Cr), ln(WBC), and ln(IL-6) as candidate predictors. Multivariate logistic regression further confirmed that ln(INR), ln(Cr), and ln(IL-6) were independent risk factors. The new model (ICI) demonstrated good discriminative ability with an area under the receiver operating characteristic curve (AUC) of 0.826 in the training cohort and 0.814 in the validation cohort. Calibration analysis showed excellent consistency, and decision curve analysis (DCA) indicated that the ICI model provided higher net clinical benefit across different threshold probabilities. Furthermore, compared with traditional models, the ICI model exhibited significant advantages in terms of discriminative ability and clinical benefit.
Conclusion: The newly developed model (ICI) showed superior predictive performance for short-term prognosis in patients with HBV-ACLF, outperforming conventional scoring systems. It is anticipated that the ICI model will serve as a valuable instrument in complementing the conventional scoring system for the HBV-ACLF population in the future.
Introduction
Acute-on-chronic liver failure (ACLF) is a clinical syndrome characterized by rapid progression to hepatic decompensation occurring in patients with underlying chronic liver disease or cirrhosis, following acute precipitating events such as infections, variceal bleeding, or drug-induced liver injury. ACLF is marked by rapid deterioration of liver function, often accompanied by multiple organ dysfunction syndrome (MODS), manifesting significant clinical heterogeneity and extremely high short-term mortality rates (1–3). Due to its rapid progression, unpredictable organ failure trajectory, and narrow therapeutic window, ACLF is widely recognized as one of the most challenging critical conditions in hepatology (4). The rising prevalence of chronic liver diseases has correspondingly increased the incidence of ACLF annually (5, 6). Globally, ACLF occurs in approximately 35% of patients with decompensated cirrhosis (7), and a retrospective cohort study reported a 28-day mortality rate of about 40% in ACLF patients (8). Its dual characteristics of high incidence and mortality have progressively made ACLF a major public health challenge worldwide. In Asian regions, particularly China, hepatitis B virus (HBV) infection remains the primary etiology, accounting for the majority of ACLF cases (9, 10).
In terms of pathogenesis, current research indicates systemic inflammation and immune dysregulation as key drivers in ACLF onset and progression. Acute insults rapidly activate the hepatic innate immune response, leading to elevated proinflammatory cytokines, such as interleukin-6 (IL-6) and tumor necrosis factor-alpha (TNF-α), forming a sustained and intense cytokine release state (“cytokine storm”). This process not only exacerbates hepatocyte necrosis but also significantly contributes to MODS through mechanisms including increased vascular permeability and disrupted microcirculatory perfusion (11–13).
Currently, several prognostic scoring systems for ACLF have been widely utilized in clinical practice, including MELD (14), Child-Turcotte-Pugh (CTP) (15), CLIF-Organ Failure score (CLIF-OFs) (16), CLIF Consortium ACLF (CLIF-C ACLF) (17), Asian Pacific Association for the Study of the Liver ACLF Research Consortium (AARC) (2), and the Chinese Group on the Study of Severe Hepatitis B COSSH-ACLF II (18). MELD and CLIF-C scoring systems were developed primarily from European and American populations, predominantly comprising white patients with alcohol-related liver disease and hepatitis C virus infection. Significant differences exist in etiology and population characteristics compared with HBV-dominated ACLF populations in Asia, thus limiting their generalizability and predictive performance in HBV-ACLF patients. Although the COSSH scoring system was developed specifically for Chinese HBV-ACLF patients and is widely used, its predictive model relies heavily on COSSH-defined ACLF diagnostic criteria, which emphasize existing organ failure and a higher total bilirubin threshold (TBil ≥12 mg/dL). Compared to the broader APASL criteria (TBil ≥5 mg/dL), the COSSH system tends to identify patients in mid-to-late disease stages (2, 18, 19). Consequently, it has limitations in identifying and stratifying early-stage HBV-ACLF patients, thus restricting its applicability for early therapeutic interventions.
This study aims to construct a concise and practical 28-day mortality risk prediction model integrating routine laboratory indicators and inflammatory cytokines in HBV-ACLF patients diagnosed under the APASL criteria. By comparing its performance with conventional scoring systems, this model aims to enhance early risk stratification and support clinical decision-making.
Methods
Study population
This study was a dual-center retrospective cohort study. Patients with HBV-ACLF who met the inclusion criteria from January 2020 to January 2025 at the Second Affiliated Hospital of Dalian Medical University were enrolled as the training cohort (n = 113). Additionally, eligible patients from Shanxi Bethune Hospital recruited between March 2022 and September 2024 were designated as the validation cohort (n = 46), a total of 159 patients were included in the final analysis (Figure 1).
The study was conducted in accordance with the ethical principles outlined in the Declaration of Helsinki and received approval from the Ethics Committee of the Second Affiliated Hospital of Dalian Medical University (approval number: KY2025-485-01). In light of the retrospective nature of the study, the requirement for written informed consent was waived by the Ethics Committee.
Inclusion and exclusion criteria
Inclusion criteria were as follows:
1. Age ≥18 years;
2. Confirmed chronic hepatitis B virus (HBV) infection (20);
3. Diagnosis of ACLF based on the criteria of the Asian Pacific Association for the Study of the Liver (APASL) (2);
4. Completion of key laboratory tests within 48 h of admission.
Exclusion criteria included:
1. Coexisting liver diseases of other etiologies (e.g., alcoholic, autoimmune, drug-induced, etc.);
2. Presence of malignant tumors or immunodeficiency disorders;
3. History of liver transplantation;
4. Death within 48 h of admission;
5. Incomplete clinical or laboratory data.
Data collection
Clinical data were extracted from the hospital’s electronic medical records system within 48 h of patient admission, including age, sex, body mass index (BMI), and blood pressure. Laboratory parameters assessed included white blood cell count, hemoglobin, platelet count, serum albumin, alanine aminotransferase (ALT), aspartate aminotransferase (AST), total bilirubin, serum creatinine, serum sodium, international normalized ratio (INR), and inflammatory cytokines, among others. Additionally, complications such as hepatic encephalopathy, infection, ascites, electrolyte disturbances, and whether the patient received artificial liver support therapy were documented.
All enrolled patients were followed from the day of admission until death or the end of the 28-day follow-up period to assess short-term outcomes. Survival status was confirmed through both electronic medical records and telephone follow-up.
Calculation of conventional ACLF scores
The following established prognostic scores were calculated for all enrolled patients: MELD = 3.78 × ln [TBil (mg/dL)] + 11.2 × ln [INR] + 9.57 × ln [Creatinine (mg/dL)] + 6.43 (14); CLIF-C ACLF = 10 × [0.33 × CLIF-OFs + 0.04 × age + 0.63 × ln (WBC count) – 2] (17); COSSH-ACLF II = 1.649 × ln (INR) + 0.457 × grade of hepatic encephalopathy (1: grade 0; 2: grade I–II; 3: grade III–IV) + 0.425 × ln [neutrophil count (×109/L)] + 0.396 × ln [TBil (μmol/L)] + 0.576 × ln [blood urea (mmol/L)] + 0.033 × age (18); AARC (2); CTP (15); CLIF-OFs (16).
Data preprocessing
Before constructing the predictive model, normality tests were performed on all continuous variables using the Shapiro–Wilk test to assess the distribution of the data. For variables that significantly deviated from normality, a natural logarithm transformation was applied to make the data more normal-looking, thereby better satisfying the assumptions of statistical models such as linear regression. Based on the transformed data, Pearson correlation coefficients were calculated to evaluate the correlations between pairs of variables, and a correlation heatmap was generated. For pairs of variables with absolute correlation coefficients greater than 0.7, one variable from each pair was retained for subsequent analysis. Factors such as clinical relevance and generalisability were considered in order to reduce redundancy and minimize the risk of multicollinearity, thereby enhancing the stability of the model.
Model development
In the training cohort, candidate variables were selected using the least absolute shrinkage and selection operator (LASSO) regression, with the optimal penalty parameter (λ) determined through 10-fold cross-validation. These variables were then incorporated into a multivariate logistic regression model and a nomogram was constructed to predict 28-day mortality based on the regression results for variables with a p-value less than 0.05.
Model performance evaluation and comparison
The performance of the model was comprehensively assessed in both the training and validation cohorts. Discriminatory ability was evaluated by calculating the area under the receiver operating characteristic (ROC) curve (AUC). The model’s calibration was assessed using several metrics, including the Brier score, calibration curve, calibration slope, and calibration intercept. Additionally, the clinical utility of the model was evaluated through decision curve analysis (DCA) to estimate the net benefit across various threshold probabilities. To further validate the model’s effectiveness, this study systematically compared the predictive performance of the ICI model with six conventional scoring systems in both cohorts. The comparison was conducted using ROC curves, net reclassification improvement (NRI), integrated discrimination improvement (IDI), probability density function (PDF) analysis, and clinical decision curve analysis.
Statistical analysis
All statistical analyses were performed using R software (version 4.3.3). The primary R packages used were glmnet, rms, pROC, ggplot2 and rmda. The normality of continuous variables was assessed using the Shapiro–Wilk test. For normally distributed data, the mean ± standard deviation was reported and compared using a t-test. For non-normally distributed data, the median (interquartile range) was reported and compared using the Wilcoxon rank-sum test. Categorical variables were presented as frequencies (percentages) and compared using the chi-squared test (when the expected frequency was greater than or equal to 5) or Fisher’s exact test (when the expected frequency was less than 5). LASSO regression with 10-fold cross-validation was used for variable selection. The selected variables were then incorporated into a multivariate logistic regression model to construct a nomogram. The model’s performance was evaluated in both the training and validation cohorts using methods including receiver operating characteristic (ROC) curves, calibration plots and decision curve analysis (DCA). The model’s performance was then compared with that of six conventional scoring systems: COSSH-ACLF II, CLIF-C ACLF, MELD, AARC, CLIF-OFs and CTP. All hypothesis tests were two-sided and a p-value of less than 0.05 was considered statistically significant.
Results
Baseline characteristics
The mean age of the training cohort was 54.92 years (SD 11.35) and 32.7% of patients were female. The incidence rates for hepatic encephalopathy (HE), ascites, electrolyte imbalance (EI), infection, hepatorenal syndrome (HRS) and hepatopulmonary syndrome (HPS) were 60.2, 75.2, 60.2, 59.3, 23.9 and 15.0%, respectively. The baseline characteristics of patients in the validation cohort were largely similar to those in the training cohort, with no significant differences observed for most variables (p > 0.05). Detailed information is provided in Table 1.
Data preprocessing
In the training cohort, Pearson correlation coefficients were calculated for the candidate variables and a correlation heatmap was generated (Figure 2). This analysis revealed significant linear correlations between several variables, including ln(ALT) and ln(AST) (r = 0.911, p < 0.01), ln (PT) and PTA (r = −0.938, p < 0.01), DBP and MAP (r = 0.930, p < 0.01) and ln(NLR) and ln(NEUT) (r = 0.756, p < 0.01). These results suggest the presence of multicollinearity. Based on clinical relevance and generalisability, one variable from each highly correlated pair was retained and the redundant variables were excluded. For instance, both ALT and AST are biochemical markers of hepatocellular injury; however, since ALT is more specific to the liver and reflects the degree of liver damage in hepatitis B-related liver failure more accurately, it was selected as the model variable while AST was excluded. Following this selection process, seven redundant variables were removed, leaving 34 variables for subsequent LASSO regression analysis.
Figure 2. Correlation heatmap of clinical variables. The heatmap shows the correlation between variables, where red indicates strong positive correlation, blue indicates strong negative correlation, and white indicates no correlation. Asterisks (*) represent statistically significant correlations (p < 0.05).
Variable selection and model construction
In the training cohort, 10-fold cross-validation combined with LASSO regression analysis was applied to select from 34 candidate variables. Using the optimal penalty parameter (λ = 0.08958842), four variables with non-zero coefficients were identified: ln(INR), ln(CR), ln(WBC), and ln(IL-6) (Figures 3A,B). These variables were subsequently included in a multivariate logistic regression analysis. The results indicated that ln(INR) [OR = 1.7259, 95% CI (1.0241, 2.9087), p = 0.0404], ln(CR) [OR = 1.9799, 95% CI (1.1735, 3.3406), p = 0.0105], and ln(IL-6) [OR = 2.3684, 95% CI (1.3262, 4.2298), p = 0.0036] were independent predictors of 28-day mortality, whereas ln(WBC) [OR = 1.2998, 95% CI (0.7572, 2.2312), p = 0.3416] did not show statistical significance (Table 2). Based on these findings, a predictive model was developed: ICI = 1.73 × ln[INR] + 1.98 × ln[Cr (μmol/L)] + 2.37 × ln[IL-6 (pg/mL)]. A nomogram (Figure 3C) was subsequently constructed to provide a visual risk assessment tool for clinical decision-making.
Figure 3. Lasso regression and nomogram for predicting risk. (A) Coefficients of predictors across different values of log(λ). (B) Binomial deviance plot for selecting optimal λ. (C) Nomogram for predicting risk, showing points for each predictor (ln_INR, ln_CR, ln_IL6), total points, linear predictor, and predicted risk. “ln_” indicates the natural logarithm transformation of the variable.
Discrimination of the model
To evaluate the ICI model’s discriminative ability, receiver operating characteristic (ROC) curves were plotted for both the training and validation cohorts, and the area under the curve (AUC) was calculated. In the training cohort (Figure 4A), the model demonstrated good discriminatory ability, achieving an AUC of 0.826 and a 95% confidence interval (CI) of [0.751, 0.901]. At the optimal cut-off value of 0.314, the sensitivity and specificity were 0.821 and 0.716, respectively. In the independent validation cohort (Figure 4B), the AUC was 0.814, with a 95% CI of [0.666, 0.962], indicating consistent and robust predictive performance across different datasets. At the optimal threshold of 0.293, the sensitivity and specificity were 0.917 and 0.647, respectively. Taken together, these results suggest that the model demonstrates strong generalisability and robustness.
Figure 4. Model performance evaluation. (A) ROC curve for the training set; (B) ROC curve for the validation set; (C) calibration curve for the training set; (D) calibration curve for the test set; (E) decision curve analysis (DCA) for the training set; (F) decision curve analysis (DCA) for the test set.
Calibration of the ICI model
To evaluate the calibration performance of the ICI model, calibration curves were plotted for both the training and validation cohorts. In the training cohort, the calibration slope was 1.000, the intercept was 0.000, the Brier score was 0.162, and the average and maximum absolute errors (Eavg and Emax) were 0.038 and 0.128, respectively (Figure 4C), indicating a strong agreement between the predicted probabilities and the actual outcomes. In the independent validation cohort (Figure 4D), the calibration slope and intercept remained 1.000 and 0.000, respectively, while the Brier score slightly increased to 0.143. The Eavg and Emax values also improved to 0.030 and 0.113, respectively, reflecting the model’s stability across different datasets. The nonparametric calibration curve closely overlapped with the ideal 45-degree reference line, further confirming the ICI model’s excellent external calibration performance.
The Hosmer-Lemeshow test also supported the model’s good calibration in both datasets. For the training cohort, the χ2 statistic was 5.7411 with a p-value of 0.6762, and for the validation cohort, the χ2 statistic was 7.1307 with a p-value of 0.5226. Both p-values exceeded 0.05, indicating no significant difference between the predicted probabilities and actual outcomes, which corroborates the model’s calibration accuracy across both cohorts.
Decision curve analysis of the ICI model
To further evaluate the clinical utility of the ICI model across a range of threshold probabilities, decision curve analysis (DCA) was performed to plot net benefit curves for both the training and validation cohorts (Figures 4E,F). The results demonstrated that in both cohorts, the ICI model provided favorable net clinical benefit across most threshold ranges, indicating a stable decision-making advantage under various clinical risk thresholds.
Comparison with traditional scoring systems
By comparing the ICI model with six commonly used ACLF scoring systems, we further validated the superiority of the new model. ROC curve analysis (Figures 5A,B) showed that the ICI model demonstrated significantly higher AUCs than traditional models such as COSSH-ACLF II (0.744/0.712), CLIF-C (0.734/0.707), and MELD (0.706/0.682) in both the training and validation cohorts. The DeLong test results (Table 3) indicated that the differences in AUC between the ICI model and all traditional models were statistically significant (p < 0.05) in both cohorts, suggesting that the ICI model has strong discriminatory ability and stable predictive performance. Moreover, Net Reclassification Improvement (NRI) and Integrated Discrimination Improvement (IDI) (Figure 6) showed that the ICI model outperformed traditional models in terms of reclassification ability and predictive accuracy. According to PDF analysis (Figure 7), the overlap coefficient of the ICI model in the training and validation cohorts was 58.5 and 60.1%, respectively, significantly higher than that of other traditional models, such as AARC (68.6%/68.5%), CLIF-C (64.0%/67.4%), COSSH-ACLF II (70.4%/66.8%), CTP (81.6%/71.0%), and MELD (70.2%/67.3%). This indicates that the ICI model has superior discriminatory power in predicting 28-day survival. Finally, the clinical decision curve results demonstrated that the ICI model provides better clinical benefit compared to traditional models (Figures 5C,D).
Figure 5. Performance evaluation of risk prediction models. Panels (A,B) display the ROC curves for the training and validation sets, respectively, comparing multiple models with their corresponding AUC values. Panels (C,D) present the decision curve analysis (DCA) for the training and test sets, respectively, illustrating the net benefit of each model at various high-risk thresholds.
Figure 6. NRI and IDI for model comparison. Panels (A,B) show the net reclassification improvement (NRI) of the ICI and traditional models in the training and validation sets, respectively. Panels (C,D) display the integrated discrimination improvement (IDI) for the same models in the training and validation sets.
Figure 7. Probability density and overlap of prognostic scores. Panels show the probability density of raw prognostic scores for surviving and non-surviving patients in the derivation and validation cohorts. The grey ribbon indicates the overlap between the two groups, with the overlap percentage (OVL%) displayed at the centroid.
Discussion
Early identification of high-risk patients with HBV-ACLF is essential for improving clinical outcomes, as it not only supports timely treatment decisions but may also reduce the substantial short-term mortality associated with the disease (19, 21). In this study, we developed and validated a concise and practical nomogram—the ICI model—based on three routinely available indicators: INR, Cr, and IL-6. This model was designed to predict 28-day mortality in patients with HBV-related ACLF. The ICI model demonstrated good discriminative ability and calibration consistency in both the training and external validation cohorts. Decision curve analysis further confirmed its superior net clinical benefit across a wide range of clinically relevant threshold probabilities. Compared with conventional scoring systems, the ICI model exhibited advantages in both predictive performance and accessibility of variables.
Currently, prognostic scoring systems such as MELD, CLIF-C, and COSSH-ACLF II are widely used for risk stratification in patients with ACLF. However, due to the lack of a unified definition of ACLF, these models differ significantly in terms of development background, variable composition, and target populations, and a universally applicable standardized model remains unavailable (22, 23). For example, the MELD score is structurally simple and based on total bilirubin, INR, and creatinine (14), and was originally developed for organ allocation in end-stage liver disease (24). However, studies have indicated that MELD has limited accuracy in predicting short-term mortality in ACLF patients (17, 25). The CLIF-C series, derived from the CANONIC study, focuses on multi-organ failure scoring and is primarily suited for alcohol-related ACLF in European populations (16). Nonetheless, its complexity and reduced performance in HBV-ACLF limit its applicability in HBV-endemic regions (16, 17, 26). The COSSH-ACLF scoring systems were developed from a large Chinese HBV cohort and demonstrate good predictive performance within this population. However, because their diagnostic criteria emphasize severe hepatic injury and organ failure, their ability to detect early, subclinical deterioration is limited, which affects generalizability (16, 18). The AARC score, derived from an etiologically mixed cohort, may not adequately capture the distinct progression patterns of HBV-ACLF, thereby restricting its utility in this specific subgroup (2).
An increasing body of evidence recognizes ACLF as an inflammation-driven syndrome, characterized not only by hepatic parenchymal injury but more importantly by systemic immune activation that leads to a cascade of multi-organ failure (27–31). Acute triggers such as infections or drug-induced liver injury can provoke an overwhelming immune response, marked by a rapid surge in pro-inflammatory cytokines like IL-6 and TNF-α. This “cytokine storm” contributes to microcirculatory dysfunction and extrahepatic organ impairment (12, 32). Among these cytokines, IL-6 has shown strong correlations with short-term mortality in ACLF patients and is emerging as a valuable prognostic biomarker (33–35). Beyond its role as a classical inflammatory mediator, IL-6 exerts pathogenic effects through the JAK/STAT3 signaling pathway, promoting monocyte–macrophage chemotaxis, upregulation of adhesion molecules, and secondary cytokine cascades that amplify tissue injury (36, 37). Moreover, IL-6 has been shown to impair mitochondrial function and disrupt cellular bioenergetics, contributing to immune cell exhaustion during ACLF progression (38–40). Therapeutic antibodies targeting the IL-6/IL-6R, such as tocilizumab, have demonstrated long-term efficacy and safety in diseases like rheumatoid arthritis and COVID-19 (41–43). These findings suggest that targeting IL-6-mediated pathways could offer a promising strategy for precision treatment in ACLF. In our study, IL-6 was incorporated into the prognostic model not only to reflect its central role in ACLF pathogenesis but also to improve the identification of “subclinical inflammatory risk phenotypes” that are often overlooked by traditional scoring systems.
The ICI model integrates a mechanism-oriented and data-driven approach by incorporating three key variables — INR, Cr, and IL-6 — to construct a predictive framework that reflects hepatic synthetic capacity, renal compensatory function, and systemic inflammatory status. This design improves the identification of patients at risk for early progression. INR and Cr are already core components of several established prognostic scores, highlighting their clinical significance in ACLF. Specifically, INR quantifies coagulation function and serves as a surrogate marker for hepatic synthetic function in patients with liver disease. Cr is a key indicator for assessing renal function. Its elevation is often associated with decreased renal perfusion caused by systemic inflammation and haemodynamic disturbances, acute kidney injury (AKI) or even haemorrhagic renal syndrome (HRS), even mild elevations are significantly associated with increased mortality (44, 45). IL-6, as a representative marker of systemic inflammation, further enhances the mechanistic interpretability of the model. Its pivotal role in the pathogenesis of ACLF has been discussed previously. In comparison with complex models that rely on multi-organ failure scoring systems, the ICI model features a simplified structure and strong practicality, demonstrating promising potential for clinical translation.
Because inflammatory biomarker selection directly affects a model’s interpretability and reproducibility, we performed a systematic comparison of candidate inflammatory biomarkers and conducted sensitivity analyses to evaluate how key variable choices influenced overall model performance, using data from the training cohort (Supplementary Tables S1,S2). In univariate analyses, ln(IL-6) showed a stable positive association with 28-day mortality (OR = 2.113, 95% CI 1.398–3.194; p < 0.001) and demonstrated the strongest overall discrimination among candidate biomarkers (AUC = 0.716, 95% CI 0.615–0.817; DeLong comparisons in Supplementary Table S1). Notably, ln(WBC) was also associated with mortality (OR = 3.637, 95% CI 1.674–7.903; p = 0.001), with an AUC of 0.696 (95% CI 0.589–0.803), which did not differ significantly from that of ln(IL-6) (DeLong p = 0.771). ln(WBC) was also retained as a candidate predictor during initial LASSO screening. To quantify the incremental predictive value of ln(WBC), we further developed and compared candidate models with and without ln(WBC) (Supplementary Table S2). The model including ln(WBC) showed discrimination comparable to the final ICI model (AUC: 0.833 vs. 0.826; DeLong p = 0.421) with nearly identical prediction error (Brier score: 0.160 vs. 0.162). Adding ln(WBC) to the ICI model did not significantly improve model fit (LRT p = 0.335), and information criteria (AIC/BIC) favored the more parsimonious model. Moreover, the model substituting ln(WBC) for ln(IL-6) showed overall inferior performance (AUC: 0.787; Brier score: 0.174; higher AIC/BIC than the ICI model), although the AUC difference versus the ICI model was not statistically significant (DeLong p = 0.237). Importantly, a non-significant DeLong test (p > 0.05) does not imply equivalence or interchangeability. Considering discrimination, prediction error, and model parsimony together, ln(IL-6) was prioritized as the inflammation-related predictor in the final model.
While these analyses serve to strengthen the evidence base for variable selection and model simplification, it is important to note that several limitations of the study remain. Firstly, the sample size and the source of the validation cohort are relatively limited, and further validation of the model’s stability is required using multicentre data from heterogeneous populations. Secondly, the model was constructed based on a single time point at admission and does not account for the trend of changes in the indicators. It is recommended that subsequent studies investigate the potential for the model framework to be updated in a dynamic manner. Thirdly, the impact of the model on actual clinical decision-making (e.g., ICU admission, artificial liver support, liver transplantation prioritization) has not yet been evaluated, and prospective studies are needed to validate its clinical utility.
Conclusion
In summary, the ICI model demonstrates excellent performance in terms of mechanistic rationality, predictive performance and clinical feasibility. It is anticipated that the ICI model will serve as a valuable instrument in complementing the conventional scoring system for the HBV-ACLF population in the future.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the UDS Institutional Review Board. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
HD: Conceptualization, Data curation, Software, Visualization, Writing – original draft. RB: Conceptualization, Data curation, Validation, Writing – original draft. YL: Conceptualization, Data curation, Visualization, Writing – original draft. HX: Writing – original draft, Data curation. RL: Conceptualization, Methodology, Writing – review & editing. JY: Writing – review & editing, Methodology, Data curation, Conceptualization.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Acknowledgments
We thank the authors of the primary studies for their timely and helpful responses to our information requests.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1688479/full#supplementary-material
References
1. Choudhury, A, Kulkarni, AV, Arora, V, Soin, AS, Dokmeci, AK, Chowdhury, A, et al. Acute-on-chronic liver failure (ACLF): the ‘Kyoto consensus’—steps from Asia. Hepatol Int. (2025) 19:1–69. doi: 10.1007/s12072-024-10773-4,
2. Sarin, SK, Choudhury, A, Sharma, MK, Maiwall, R, Al Mahtab, M, Rahman, S, et al. Acute-on-chronic liver failure: consensus recommendations of the Asian Pacific association for the study of the liver (APASL): an update. Hepatol Int. (2019) 13:353–90. doi: 10.1007/s12072-019-09946-3
3. Karvellas, CJ, Bajaj, JS, Kamath, PS, Napolitano, L, O’Leary, JG, Solà, E, et al. AASLD practice guidance on acute-on-chronic liver failure and the management of critically ill patients with cirrhosis. Hepatology. (2024) 79:1463–502. doi: 10.1097/HEP.0000000000000671,
4. Moreau, R, Tonon, M, Krag, A, Angeli, P, Berenguer, M, Berzigotti, A, et al. EASL clinical practice guidelines on acute-on-chronic liver failure. J Hepatol. (2023) 79:461–91. doi: 10.1016/j.jhep.2023.04.021,
5. Devarbhavi, H, Asrani, SK, Arab, JP, Nartey, YA, Pose, E, and Kamath, PS. Global burden of liver disease: 2023 update. J Hepatol. (2023) 79:36990226:516–37. doi: 10.1016/j.jhep.2023.03.017
6. Gan, C, Yuan, Y, Shen, H, Gao, J, Kong, X, Che, Z, et al. Liver diseases: epidemiology, causes, trends and predictions. Sig Transduct Target Ther. (2025) 10:33. doi: 10.1038/s41392-024-02072-z,
7. Mezzano, G, Juanola, A, Cardenas, A, Mezey, E, Hamilton, JP, Pose, E, et al. Global burden of disease: acute-on-chronic liver failure, a systematic review and meta-analysis. Gut. (2022) 71:148–55. doi: 10.1136/gutjnl-2020-322161,
8. Mahmud, N, Kaplan, DE, Taddei, TH, and Goldberg, DS. Incidence and mortality of acute-on-chronic liver failure using two definitions in patients with compensated cirrhosis. Hepatology. (2019) 69:2150–63. doi: 10.1002/hep.30494,
9. Gu, WY, Xu, BY, Zheng, X, Chen, J, Wang, XB, Huang, Y, et al. Acute-on-chronic liver failure in China: rationale for developing a patient registry and baseline characteristics. Am J Epidemiol. (2018) 187:1829–39. doi: 10.1093/aje/kwy083,
10. Ganem, D, and Prince, AM. Hepatitis B virus infection--natural history and clinical consequences. N Engl J Med. (2004) 350:15014185:1118–29. doi: 10.1056/NEJMra031087
11. Hou, W, Wei, X, Liang, J, Fang, P, Ma, C, Zhang, Q, et al. HMGB1-induced hepatocyte Pyroptosis expanding inflammatory responses contributes to the pathogenesis of acute-on-chronic liver failure (ACLF). JIR. (2021) 14:7295–313. doi: 10.2147/JIR.S336626,
12. Khanam, A, and Kottilil, S. Abnormal innate immunity in acute-on-chronic liver failure: immunotargets for therapeutics. Front Immunol. (2020) 11:11. doi: 10.3389/fimmu.2020.02013,
13. Taru, V, Szabo, G, Mehal, W, and Reiberger, T. Inflammasomes in chronic liver disease: hepatic injury, fibrosis progression and systemic inflammation. J Hepatol. (2024) 81:895–910. doi: 10.1016/j.jhep.2024.06.016,
14. Kamath, PS, Wiesner, RH, Malinchoc, M, Kremers, W, Therneau, TM, Kosberg, CL, et al. A model to predict survival in patients with end–stage liver disease. Hepatology. (2001) 33:464. doi: 10.1053/jhep.2001.22172,
15. Pugh, RNH, Murray-Lyon, IM, Dawson, JL, Pietroni, MC, and Williams, R. Transection of the oesophagus for bleeding oesophageal varices. Br J Surg. (1973) 60:646–9. doi: 10.1002/bjs.1800600817,
16. Jalan, R, Saliba, F, Pavesi, M, Amoros, A, Moreau, R, Ginès, P, et al. Development and validation of a prognostic score to predict mortality in patients with acute-on-chronic liver failure. J Hepatol. (2014) 61:24950482:1038–47. doi: 10.1016/j.jhep.2014.06.012
17. Hernaez, R, Solà, E, Moreau, R, and Ginès, P. Acute-on-chronic liver failure: an update. Gut. (2017) 66:541–53. doi: 10.1136/gutjnl-2016-312670,
18. Li, J, Liang, X, You, S, Feng, T, Zhou, X, Zhu, B, et al. Development and validation of a new prognostic score for hepatitis B virus-related acute-on-chronic liver failure. J Hepatol. (2021) 75:1104–15. doi: 10.1016/j.jhep.2021.05.026,
19. Wu, T, Li, J, Shao, L, Xin, J, Jiang, L, Zhou, Q, et al. Development of diagnostic criteria and a prognostic score for hepatitis B virus-related acute-on-chronic liver failure. Gut. (2018) 67:28928275:2181–91. doi: 10.1136/gutjnl-2017-314641
20. Terrault, N, Lok, A, McMahon, B, Chang, K, Hwang, J, Jonas, M, et al. Update on prevention, diagnosis, and treatment and of chronic hepatitis B: AASLD 2018 hepatitis B guidance. Hepatology. (2018) 67:29405329:1560–99. doi: 10.1002/hep.29800
21. Liang, X, Li, P, Jiang, J, Xin, J, Luo, J, Li, J, et al. Transcriptomics unveils immune metabolic disruption and a novel biomarker of mortality in patients with HBV-related acute-on-chronic liver failure. JHEP Rep. (2023) 5:37425213. doi: 10.1016/j.jhepr.2023.100848,
22. Ge, J, Kim, WR, and Kwong, AJ. Common definitions and variables are needed for the United States to join the conversation on acute-on-chronic liver failure. Am J Transplant. (2024) 24:1755–60. doi: 10.1016/j.ajt.2024.06.021,
23. Kulkarni, AV, and Sarin, SK. Acute-on-chronic liver failure – steps towards harmonization of the definition! J Hepatol. (2024) 81:360–6. doi: 10.1016/j.jhep.2024.03.036,
24. Brown, RS Jr, Kumar, KS, Russo, MW, Kinkhabwala, M, Rudow, DL, Harren, P, et al. Model for end-stage liver disease and child-Turcotte-Pugh score as predictors of pretransplantation disease severity, posttransplantation outcome, and resource utilization in united network for organ sharing status 2A patients. Liver Transpl. (2002) 8:278–84. doi: 10.1053/jlts.2002.31340,
25. Zhang, Y, Nie, Y, Liu, L, and Zhu, X. Assessing the prognostic scores for the prediction of the mortality of patients with acute-on-chronic liver failure: a retrospective study. PeerJ. (2020) 8:e9857. doi: 10.7717/peerj.9857,
26. Li, P, Liang, X, Luo, J, Li, J, Xin, J, Jiang, J, et al. Predicting the survival benefit of liver transplantation in HBV-related acute-on-chronic liver failure: an observational cohort study. Lancet Reg Health West Pac 2022;32:100638. doi:doi: 10.1016/j.lanwpc.2022.10063836793753.
27. Kou, K, Sun, X, Tian, G, Zhi, Y, Fan, Z, and Lv, G. The mechanisms of systemic inflammatory and immunosuppressive acute-on-chronic liver failure and application Prospect of single-cell sequencing. J Immunol Res. (2022) 2022:1–18. doi: 10.1155/2022/509127536387424,
28. Choudhury, A, Kumar, M, Sharma, BC, Maiwall, R, Pamecha, V, Moreau, R, et al. Systemic inflammatory response syndrome in acute-on-chronic liver failure: relevance of ‘golden window’: a prospective study. J Gastroenterol Hepatol. 32:1989–1997. doi: 10.1111/jgh.13799,
29. Laleman, W, Claria, J, Van der Merwe, S, Moreau, R, and Trebicka, J. Systemic inflammation and acute-on-chronic liver failure: too much, not enough. Can J Gastroenterol Hepatol. (2018) 2018:1027152. doi: 10.1155/2018/1027152,
30. Bernardi, M, Moreau, R, Angeli, P, Schnabl, B, and Arroyo, V. Mechanisms of decompensation and organ failure in cirrhosis: from peripheral arterial vasodilation to systemic inflammation hypothesis. J Hepatol. (2015) 63:1272–84. doi: 10.1016/j.jhep.2015.07.004,
31. Solé, C, Solà, E, Morales-Ruiz, M, Fernàndez, G, Huelin, P, Graupera, I, et al. Characterization of inflammatory response in acute-on-chronic liver failure and relationship with prognosis. Sci Rep. (2016) 6:32341. doi: 10.1038/srep32341,
32. Huang, C, Li, Q, Huang, Y, and Chen, L. Role of immune dysfunction in acute-on-chronic liver failure: from pathogenesis to clinical prognosis. Discovery Med. (2021) 31:21–9.
33. Zhou, C, Zhang, N, He, TT, Wang, Y, Wang, LF, Sun, YQ, et al. High levels of serum interleukin-6 increase mortality of hepatitis B virus-associated acute-on-chronic liver failure. World J Gastroenterol. (2020) 26:4479–88. doi: 10.3748/wjg.v26.i30.4479,
34. Zhang, X, He, J, He, X, Lun, Y, He, C, and Xu, Z. Golgi protein 73 and IL-6 as strong predictors of short-term prognosis in patients with HBV-ACLF. Sci Rep. (2025) 15:22191. doi: 10.1038/s41598-025-06472-1,
35. Zhu, B, Gao, F, Li, Y, Shi, K, Hou, Y, Chen, J, et al. Serum cytokine and chemokine profiles and disease prognosis in hepatitis B virus-related acute-on-chronic liver failure. Front Immunol. (2023) 14:14. doi: 10.3389/fimmu.2023.1133656,
36. Hirano, T, and Kishimoto, T. Interleukin-6. Peptide Growth Factors Their Receptors I. (1991):633–65. doi: 10.1007/978-1-4612-3210-0_14
37. Aliyu, M, Zohora, FT, Anka, AU, Ali, K, Maleknia, S, Saffarioun, M, et al. Interleukin-6 cytokine: An overview of the immune regulation, immune dysregulation, and therapeutic approach. Int Immunopharmacol. (2022) 111:109130. doi: 10.1016/j.intimp.2022.109130,
38. Maheshwari, D, Kumar, D, Jagdish, RK, Nautiyal, N, Hidam, A, Kumari, R, et al. Bioenergetic failure drives functional exhaustion of monocytes in acute-on-chronic liver failure. Front Immunol. (2022) 13:856587. doi: 10.3389/fimmu.2022.856587,
39. Romano, M, Sironi, M, Toniatti, C, Polentarutti, N, Fruscella, P, Ghezzi, P, et al. Role of IL-6 and its soluble receptor in induction of chemokines and leukocyte recruitment. Immunity. (1997) 6:315–25. doi: 10.1016/s1074-7613(00)80334-9,
40. Xu, J, Wakai, M, Xiong, K, Yang, Y, Prabakaran, A, Wu, S, et al. The pro-inflammatory cytokine IL6 suppresses mitochondrial function via the gp130-JAK1/STAT1/3-HIF1α/ERRα axis. Cell Rep. (2025) 44:115403. doi: 10.1016/j.celrep.2025.115403,
41. Parisi, S, Ditto, MC, Ghellere, F, Panaro, S, Piccione, F, Borrelli, R, et al. Update on tocilizumab in rheumatoid arthritis: a narrative review. Front Immunol. (2025) 16:1470488. doi: 10.3389/fimmu.2025.1470488,
42. Abidi, E, El Nekidy, WS, Alefishat, E, Rahman, N, Petroianu, GA, El-Lababidi, R, et al. Tocilizumab and COVID-19: timing of administration and efficacy. Front Pharmacol. (2022) 13:825749. doi: 10.3389/fphar.2022.825749,
43. Choy, EH, De Benedetti, F, Takeuchi, T, Hashizume, M, John, MR, and Kishimoto, T. Translating IL-6 biology into effective treatments. Nat Rev Rheumatol. (2020) 16:335–45. doi: 10.1038/s41584-020-0419-z,
44. Nadim, MK, Kellum, JA, Forni, L, Francoz, C, Asrani, SK, Ostermann, M, et al. Acute kidney injury in patients with cirrhosis: acute disease quality initiative (ADQI) and International Club of Ascites (ICA) joint multidisciplinary consensus meeting. J Hepatol. (2024) 81:163–83. doi: 10.1016/j.jhep.2024.03.031,
Keywords: acute-on-chronic liver failure, hepatitis B virus, interleukin-6, prognosis, SIRS
Citation: Deng H, Bao R, Lu Y, Xu H, Li R and Yao J (2026) Development of a short-term prognostic model for hepatitis B-related acute-on-chronic liver failure in a dual-center cohort. Front. Med. 12:1688479. doi: 10.3389/fmed.2025.1688479
Edited by:
Muhammad Sohaib Asghar, AdventHealth, United StatesReviewed by:
Shreya Banerjee, Boston University, United StatesNianhua Tan, The First Hospital of Hunan University of Chinese Medicine, China
Copyright © 2026 Deng, Bao, Lu, Xu, Li and Yao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Rongkuan Li, ZGFsaWFubHJrQDEyNi5jb20=; Jia Yao, eWFvamlhQHN4YnFlaC5jb20uY24=
‡ORCID: Yanxue Lu, https://orcid.org/0009-0001-0574-6261
†These authors have contributed equally to this work
Ruili Bao2†