Machine learning-based algorithms for the prediction of 90-day survival in patients with liver failure receiving artificial liver therapy

Deng, Bo; Bai, Chengzhi; Xu, Huaqian; Zhang, Xue; Deng, Ying

doi:10.3389/fphys.2025.1687860

ORIGINAL RESEARCH article

Front. Physiol., 27 October 2025

Sec. Gastrointestinal Sciences

Volume 16 - 2025 | https://doi.org/10.3389/fphys.2025.1687860

Machine learning-based algorithms for the prediction of 90-day survival in patients with liver failure receiving artificial liver therapy

Bo Deng^1,2*

Chengzhi Bai¹

Huaqian Xu¹

Xue Zhang¹

Ying Deng³*

¹Department of Gastroenterology, The General Hospital of Western Theater Command, Chengdu, Sichuan, China
²Graduate School of Chengdu Medical University, Chengdu, Sichuan, China
³Integrated Care Management Center, Institute of Respiratory Health, West China Hospital, Sichuan University, Chengdu, China

Background: Liver failure is associated with high short-term mortality, and the predictive value of clinical factors for patients undergoing artificial liver therapy is uncertain. We aim to develop prognostic models using several machine learning algorithms to predict 90-day survival in patients with liver failure undergoing artificial liver therapy.

Methods: We retrospectively enrolled hospitalized patients with liver failure who received artificial liver therapy in our center between December 2017 and December 2021. Prognostic characteristics were chosen by the least absolute shrinkage and selection operator (LASSO) regression and independent predictors by stepwise logistic regression analysis. Five machine learning algorithms—logistic regression (LR), random forest (RF), support vector machine (SVM), eXtreme Gradient Boosting (XGBoost), and k-nearest neighbor (KNN)—were used to build and validate models to predict 90-day survival following Artificial liver support systems. The model performance was assessed by the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value, and negative predictive value.

Results: A total of 197 patients were included in this study. LASSO regression, based on patient admission data, identified the top 15 prognostic features, and stepwise LR analysis determined that the age, direct bilirubin, retinol, alpha-fetoprotein, and thrombin time were independent predictors. Among the five machine learning models, LR achieved the highest predictive performance with an AUC of 0.884 and accuracy of 75.0%, followed by RF (AUC = 0.797), KNN (AUC = 0.788), XGBoost (AUC = 0.769), and SVM (AUC = 0.732). The predictive performance of LR models based on longitudinal data using patient characteristics from the day before treatment had an AUC of 0.869, and from the day after treatment, it had an AUC of 0.859.

Conclusion: Machine learning models showed promising performance in predicting 90-day survival in liver failure patients receiving artificial liver support therapy, potentially supporting individualized prognostic assessment.

Introduction

Liver failure is a life-threatening syndrome characterized by rapid liver dysfunction, severe coagulopathy, and multi-organ failure, presenting as acute liver failure (ALF), acute-on-chronic liver failure (ACLF), or chronic liver failure (CLF) depending on the etiology and disease course (Perez Ruiz de Garibay et al., 2022; Wang X et al., 2024). Liver failure significantly contributes to augmented morbidity and mortality worldwide, accounting for more than 2 million deaths yearly, with deaths due to liver disease increasing by 50% over the last 3 decades and predicted to double over the next 20 years (GBD, 2017 Cirrhosis Collaborators, 2020; Zwirner et al., 2024). Specific physiological defects contribute to disease progression in the varying forms of liver failure. In ALF, failure of the liver to eliminate toxins from the body causes systemic inflammation, coagulation abnormalities, and kidney damage, with the mortality greater than 50% (Tujios et al., 2022). CLF, following cirrhosis of the liver, causes slow deterioration of the liver’s capabilities, and it is accountable for more than 1.5 million deaths each year, with a 1-year mortality of 40%–60% at the late stages of the disease (Wu et al., 2024). In ACLF, failure of the liver to maintain physiological control causes multiple organ failure, with greater than 90-day mortality of 50% or greater (European Association for the Study of the Liver, 2023). Thus, it is important to develop predictive models for the prognosis of liver failure to prevent delayed progression, limit complications, and enhance the overall management.

Artificial liver support systems (ALSSs) have been utilized to manage liver failure. They are extracorporeal therapies utilized to eliminate toxins, restore plasma components, and temporarily support the liver, facilitating liver regeneration or allowing time to wait for liver transplantation (Lan et al., 2025). Among these, plasma exchange (PE) and its combination with double plasma molecular adsorption system (DPMAS) are commonly used. PE removes bilirubin, ammonia, and inflammatory mediators, and DPMAS specifically eliminates protein-bound toxins and cytokines, minimizing donor plasma requirements (Tan et al., 2020). Clinical trials have attested that PE markedly attenuates the systemic inflammatory response syndrome (SIRS) in ALF patients and enhances their survival rates (Maiwall et al., 2020). Furthermore, PE in combination with DPMAS improves the coagulation function, decreases organ failure scores, and enhances short-term survival, especially in hepatitis B virus (HBV)-related ACLF patients (Guo et al., 2020).

Previous studies have shown that the development and progression of hepatic failure are regulated by a multitude of physiological mechanisms and clinical variables. Hepatic encephalopathy (HE) results from the accumulation of the body’s metabolic byproducts, leading to neurological malfunction and causing systemic inflammation that contributes to worsening hepatic damage. Low pre-albumin levels are indicative of impaired hepatic synthetic capabilities and malnutrition, signifying a decline in liver physiological capabilities, while higher levels of serum creatinine coincide with acute kidney injury (AKI), additionally worsening liver failure (Figueira et al., 2021; Zhang et al., 2023). In addition, studies have shown that systemic inflammation is responsible for organ functional failure, while renal functional failure also impairs the processes of detoxification. These factors are responsible for multiorgan failure and accelerate liver damage, which are independent predictive factors of in-hospital mortality among patients with ALF (Thuluvath et al., 2021; Tong et al., 2019). Although the usual predictive models such as MELD and the Child–Pugh scoring system are widely practiced in clinical scenarios, they are unable to reflect the complexities of such physiological processes effectively and, hence, have limited predictive capabilities, specifically within situations of complicated liver failure. Recently, machine learning models, such as decision trees, random forest (RF), support vector machine (SVM), and deep learning, have been demonstrated to have superior predictive power in short- and long-term outcome prediction (Panackel et al., 2024; Qiu et al., 2024). Research on prognostic prediction after ALSS is sparse, especially so with machine learning algorithms.

Because of the high mortality of liver failure, the restrictedness of conventional prognostic scores, and the promise of machine learning in precision medicine, we sought to establish and validate prognostic models to predict 90-day survival for liver failure patients undergoing ALSS. With LASSO-selected variables and a variety of machine learning algorithms, we sought to determine the crucial prognostic factors and compare their predictive value for facilitating personalized treatment planning.

Methods

Study design and patients

This was a retrospective cohort study, and it was carried out after being approved by the Ethics Review Committee of the General Hospital of Western Theater Command (No. 2020ky005). The study was in accordance with the principles of the Helsinki Declaration, conformed to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines, and was in compliance with all relevant national laws. Due to the retrospective, anonymous nature of the study and its low risk, requirement of informed consent was waived.

We chose patients with liver failure who underwent artificial liver therapy in our hospital between December 2017 and December 2021. The inclusion criteria were as follows: 1) age >18 years old; 2) clinically diagnosed with liver failure; 3) undergoing artificial liver treatment (PE or PE plus DPMAS); 4) integrity of clinical data. The exclusion criteria were as follows: 1) autoimmune liver disease, drug-induced hepatitis, alcoholic liver injury, hepatocellular carcinoma, or other liver malignancies; 2) combined with severe heart, lung, or kidney disease, other cancers, or other serious diseases influencing the prognosis; 3) severe mental or cognitive illness or pregnant women; 4) lack of follow-up data.

Definition of liver failure

Liver failure is defined as a syndrome resulting from severe liver damage caused by numerous factors that cause prominent decompensation or abundant dysfunction of its physiological functions such as detoxification, biotransformation, metabolism, and synthesis. Clinically, it manifests primarily as a syndrome of hepatorenal syndrome, jaundice, coagulation disorders, hepatic encephalopathy, and ascites (Liver Failure and Artificial Liver Group, 2019). ALF is a clinical syndrome that results in the sudden onset of liver failure in 2 weeks in the form of grade II or higher HE (under grade IV classification); fast progressive jaundice with serum TBIL of at least 10 times the upper limit of normal (ULN) or a daily rise of at least 17.1 μmol/L; bleeding tendencies with PTA of 40% or less (or INR of 1.5 or more); extreme exhaustion; marked anorexia; abdominal distension, nausea, vomiting, and other extreme gastrointestinal symptoms; and progressive shrinking of the liver. CLF is characterized as slow liver failure and decompensation based on cirrhosis, occurring in the form of raised TBIL (usually <10 times ULN), low albumin, low platelet count, and PTA of 40% or less (or INR of 1.5 or more), along with refractory ascites/portal hypertension and HE. ACLF is a syndrome of acute/subacute liver decompensation based on pre-existing chronic liver disease, occurring in the form of rapidly progressive jaundice, serum TBIL of at least 10 times ULN (or a daily rise of at least 17.1 μmol/L), and bleeding with PTA of 40% or less (or INR of 1.5 or more).

Artificial liver therapy

For PE therapy, the femoral vein is catheterized with a double-lumen catheter to establish extracorporeal circulation, connecting to the artificial liver support system. The plasma separator removes 2,500 mL–3,000 mL of fresh plasma, and an equivalent volume of fresh frozen plasma is substituted. The blood flow rate is 100 mL/min, with a plasma exchange flow rate of 20 mL/min. Each session lasts 2 h–3 h and is repeated every 3 to 5 days. PE is also used in patients with severe HE, refractory hyperbilirubinemia, hepatorenal syndrome, and toxin-induced liver damage, particularly in the presence of coagulopathy, severe jaundice, or multi-organ failure (Agrawal et al., 2025; Yuan et al., 2018). PE in combination with DPMAS therapy is particularly used in patients with toxin-induced liver damage, refractory hyperbilirubinemia, or in patients needing to remove protein-bound toxins and cytokines, offering premium liver support along with effective detoxification (Yao et al., 2019). In PE + DPMAS, separated plasma is subjected to a bilirubin adsorber and blood perfusion unit under dual adsorption, which eliminates bulk molecule toxins, cytokines, bilirubin, and numerous detrimental compounds, further enhancing liver performance. The duration of each session is 3 h–4 h, repeated every 3 to 5 days.

Data collection

We retrieved two main categories of information from the de-identified patient records, including data from three key time points: at the time of admission, the day before ALSS treatment, and the day after ALSS treatment. Clinical data included patient demographics such as age, sex, type of liver failure, number of complications, hepatitis B virus (HBV) positivity, and presence of cirrhosis. The complications include HE ascites, AKI, variceal bleeding, electrolyte imbalance, and spontaneous bacterial peritonitis (SBP). HE is confirmed based on the West Haven criteria. The presence of ascites is confirmed by ultrasound assessment. AKI is identified based on the serum creatinine levels, with a rise of ≥0.3 mg/dL being qualified as AKI. Variceal bleeding is confirmed by upper gastrointestinal endoscopy. Electrolyte imbalance is confirmed by hematological examination. Spontaneous bacterial peritonitis is confirmed by culturing the ascitic fluid. Meanwhile, laboratory test results include alpha-fetoprotein (AFP), albumin, pre-albumin, total bilirubin (TBIL), direct bilirubin (DBIL), indirect bilirubin (IBIL), alanine aminotransferase (ALT), aspartate aminotransferase (AST), γ-glutamyl transferase (γGGT), alkaline phosphatase (ALP), total bile acid (TBA), cholinesterase, retinol, urea, creatinine, cystatin C, endogenous creatinine clearance, uric acid (UA), prothrombin time (PT), INR, fibrinogen, activated partial thromboplastin time (APTT), thrombin time (TT), white blood cell (WBC) count, red blood cell (RBC) count, hemoglobin, platelet count, neutrophil count, lymphocyte count, monocyte count, potassium, sodium, chloride, and C-reactive protein (CRP). The outcome measure is the 90-day survival rate, and the patient survival status is confirmed by an interview over the phone and home visit. Follow-up time points are at 90 days post-discharge.

Statistical analysis

Statistical analysis was conducted with Python 3.13. The variables were all classified based on their type. Continuous variables that were normally distributed were presented as mean ± standard deviation (SD), and non-normally distributed continuous variables were presented as the median (interquartile range, Q1–Q3). These were compared across groups using Welch’s t-test or the Mann–Whitney U test, accordingly. Categorical variables were presented as counts and percentages and compared using the Chi-square test or Fisher’s exact test, accordingly. A two-tailed p-value <0.05 was regarded as statistically significant. Power analysis was conducted to assess the statistical power for detecting the survival differences among different liver failure types.

Feature selection was carried out using the least absolute shrinkage and selection operator (LASSO) regression approach by applying L1 regularization along with 5-fold cross-validation in order to select the best regularization parameter (λ), thus minimizing the possibility of overfitting. Stepwise LR was performed, where statistically insignificant variables were progressively selected or removed to identify the optimal combination of the variables. The coefficients (Coef), standard errors, z-values, p-values, and 95% confidence intervals (CI) of each variable were estimated to identify the independent predictors of survival. Machine learning models such as logistic regression (LR), RF, SVM, eXtreme Gradient Boosting (XGBoost), and K-nearest neighbor (KNN) were then utilized for developing risk prediction models based on these independent risk factors. The dataset was then randomly divided into a training set (70%) and a validation set (30%), and hyperparameter tuning was carried out in the training set using grid search with cross-validation. Regularization and early stopping techniques were utilized to avoid overfitting. The performance of the models was assessed using the single validation set, with evaluation metrics including the area under the ROC curve (AUC) and its 95% CI, accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1 score. Shapley additive explanation (SHAP) was utilized in the interpretation of model predictions and in the identification of the most influential contributors toward survival. Based on the MELD and Child–Pugh scores, we constructed prediction models to compare the performance of machine learning models with that of traditional models. We also conducted a longitudinal data analysis based on patient characteristics measured on the day before treatment and the day after treatment.

Results

Characteristics of the participants at the time of admission

Table 1 shows the summary of baseline characteristics of the patients at the time of admission involved in this study. This study included 197 subjects (Figure 1). The survival group cohort included 154 patients, while the non-survival group included 43 patients. The male gender was predominant in both cohorts (83.12% vs. 79.07%), and no statistically significant difference was found between the survival and non-survival groups (p = 0.70). The survival group members were significantly younger (46.88 ± 11.79 years) than those in the non-survival group (55.02 ± 10.73 years), with a statistically significant difference (p < 0.05). The largest percentage of patients in both groups were diagnosed with ACLF (72.73% vs. 62.79%), and most individuals in both groups received PE treatment (55.19% vs. 58.14%). Additionally, the non-survival group had higher levels of TBIL, DBIL, and IBIL, while the survival group had significantly higher levels of albumin and pre-albumin (p < 0.05). In terms of coagulation function, the non-survival group had significantly higher PT, INR, and APTT, and the fibrinogen level was significantly lower. In addition, the non-survival group had lower platelet and RBC counts (p < 0.05).

Table 1

Table 1. Characteristics of the included liver failure patients.

Figure 1

Flowchart showing patient selection for a study. Initially, 619 patients with liver failure who received artificial liver treatment from 2017 to 2021. Excluded are 422 patients: 251 non-liver failure, 109 not receiving treatment, 18 with incomplete clinical data, 13 missing 90-day follow-up data, and 31 other reasons. The final cohort has 197 enrolled patients.

Figure 1. Flow diagram depicting the participant selection process.

Survival rate

At the 90-day follow-up, 85 of 112 patients with ACLF survived (75.89%), 30 of 33 patients with ALF survived (90.91%), and 12 of 25 patients with CLF survived (48.00%). The survival curve reveals that patients with ACLF have the best survival outcomes, with most patients surviving the 90-day duration. Patients with ALF also have relatively favorable survival, but a proportion of patients still die. Patients with CLF have the worst survival, with the curve declining sharply, suggesting that the majority of the patients die within 90 days. The log-rank test p-value of 0.00 suggests that the differences in survival among the types of liver failure are statistically significant (Figure 2). The power analysis results showed that the effect size between ALF and CLF was 0.429 with a power of 0.52, that between ACLF and ALF was 0.103 with a power of 0.08, and that between ACLF and CLF was 0.326 with a power of 0.45.

Figure 2

Kaplan-Meier survival curve illustrating survival probabilities over 90 days for different liver failure types. The orange line represents acute liver failure, maintaining higher survival rates. The green line shows acute-on-chronic liver failure with moderate decline, and the blue line indicates chronic failure with significant decline. The log-rank test shows a significant difference among groups with a p-value of 0.000.

Figure 2. Comparison of 90-day survival curves for patients with different types of liver failure.

Feature screening results based on patients’ admission data

Within the context of LASSO regression, the optimal λ identified using cross-validation was log(λ) = 0.73, which retained the top 15 features with coefficients (Supplementary Figure S1). The features that were retained included age, AFP, HVB, liver failure type, cirrhosis, DBIL, TBIL, CRP, retinol, pre-albumin, platelets, PT, cholinesterase, TT, and monocytes (Figure 3). Stepwise LR using the features selected by LASSO indicated that the age (Coef = −0.064, 95% CI: −0.103 to 0.81, and p = 0.001), DBIL (Coef = −0.007, 95% CI: −0.011 to −0.013, and p = 0.000), retinol (Coef = 0.091, 95% CI: 0.012 to 0.171, and p = 0.024), AFP (Coef = 0.008, 95% CI: 0.002 to 0.003, and p = 0.007), and TT (Coef = −0.185, 95% CI: −0.318 to −0.051, and p = 0.007) were independent predictors (Table 2).

Figure 3

Bar chart showing features with their coefficient values. Positive coefficients in red include AFP, Platelets, and Prealbumin. Negative coefficients in blue include Age, TBIL, and liver failure type. The values range from -0.5 to 0.8.

Figure 3. LASSO regression coefficients of the selected features for predicting 90-day survival in patients with liver failure based on patient admission data.

Table 2

Table 2. Result of the stepwise logistic regression analysis.

Comparison of different prediction models

On the basis of these independent risk factors, we constructed predictive models with various machine learning algorithms to predict the 90-day survival rate of liver failure patients following artificial liver treatment. The LR model showed optimal predictive power; the model had an AUC of 0.884 (0.786–0.960) and accuracy of 75.0%. The other models also showed good predictive power, with the RF model having an AUC of 0.797 (0.663–0.914), the KNN model having an AUC of 0.788 (0.642–0.907), the XGBoost model having an AUC of 0.769 (0.585–0.918), and the SVM model having an AUC of 732 (0.527–899) (Figure 4A; Supplementary Table S1). Across the five models, LR showed the most balanced performance with better discrimination, net benefit, and calibration. RF and SVM achieved moderate benefits in the medium probability range but showed decreased performance at higher thresholds. XGBoost exhibited a similar trend to RF and SVM, with moderate benefits but a decline at higher thresholds. KNN was less stable overall, with greater fluctuations and deviations from the ideal calibration line (Figures 4B, C; Supplementary Figure S2). We also established traditional machine learning models by using the MELD score and the Child–Pugh score, and their AUCs were 0.574 and 0.673, respectively; both of them are less than that of the machine learning model (Supplementary Figure S3).

Figure 4

Panel (A) shows a ROC curve comparing logistic, RF, KNN, SVM, and XGB models with respective AUCs. Panel (B) displays a calibration plot of predicted vs. observed probability for the same models. Panel (C) features a decision curve analysis, illustrating net benefits across threshold probabilities for each model and baseline strategies.

Figure 4. Comparison of 90-day survival prediction performance in liver failure patients across different machine learning models. (A) Comparison of ROC curves for different machine learning models illustrating the performance in predicting 90-day survival. (B) Decision curve analysis for different machine learning models evaluating the net clinical benefit of each model at varying threshold probabilities. (C) Calibration curve for different machine learning models showing the agreement between the predicted survival probabilities and the observed outcomes.

SHAP-based model interpretability analysis

SHAP analysis findings indicated that AFP was the highest predictor in ascertaining the 90-day survival in patients undergoing artificial liver support therapy. Other significant predictors, such as age, DBIL, TT, and retinol, also had significant contributions in terms of model output (Supplementary Figure S4).

Assessment of predictive ability using longitudinal data sets

Patient characteristic analysis was also performed on a day before and after treatment. The LASSO and stepwise LR identified significant variables of age (Coef = −0.504), albumin (Coef = 1.17), IBIL (Coef = −0.874), CRP (Coef = −0.673), and PT (Coef = −0.920) before undergoing ALSS; meanwhile, after a day, the significant variables were age (Coef = −0.754), TBIL (Coef = −0.797), PT (Coef = −0.755), and RBC (Coef = 0.738) (Supplementary Table S2). The predictive power of the models based on the data of patients from a day before treatment produced an AUC of 0.869, while data from after a day produced an AUC of 0.859 (Supplementary Table S3; Supplementary Figure S5).

Discussion

This study examined the 90-day survival rates of patients with liver failure treated by ALSS, demonstrating that survival outcomes varied considerably across different liver failure categories. Patients with ACLF and ALF had relatively good survival rates, while those with CLF had much poorer outcomes. LASSO regression, based on patient admission data, identified the top 15 prognostic features, and stepwise LR analysis revealed that AFP, age, DBIL, retinol, and TT were the independent predictors of 90-day survival. Based on these essential variables, predictive models were established using several machine learning methods, and their performance was compared systematically. Among the five models considered, LR showed the best overall predictive power, outperforming RF, SVM, XGBoost, and KNN in terms of discrimination accuracy and calibration reliability. The SHAP analysis identified AFP as the most influential factor in predicting 90-day survival in patients undergoing artificial liver support therapy. The confusion matrix, decision curve, and calibration plot analyses showed that the LR model accurately classified survivors and non-survivors, offering the greatest clinical benefit and closely aligning predictions with observed outcomes, particularly at high probabilities. The longitudinal analysis based on data from the day before and the day after treatment also demonstrated good predictive performance for the LR model.

The prognostic research of liver failure has gained extensive attention, and earlier studies have primarily targeted general patient populations or etiologies (Li W et al., 2024; Zhu et al., 2024). In recent years, increasing interest has been shown in the prognostic and survival risk factors of patients treated with artificial liver support therapy. A study on HBV–ACLF patients showed that a nomogram based on independent prognostic factors such as age, mid-to-late stage liver failure, HE, upper gastrointestinal bleeding, and the mode of artificial liver therapy (PE + DPMAS) demonstrated good AUC (Wang F et al., 2024). Another study demonstrated that total bilirubin, international normalized ratio (INR), serum creatinine, and age were prognostic factors for the 28-day survival of ACLF patients treated with PE therapy and also obtained good predictive performance (Huang et al., 2019). Du et al. developed the PALS prognostic model based on cirrhosis, TBIL, INR, infection, and hepatic encephalopathy, which accurately predicts the 90-day mortality risk in liver failure patients undergoing PE (Du et al., 2021). Compared to previous studies that used traditional models, adopting a machine learning approach resulted in a moderate increase in the predictive ability (Shi et al., 2024). In terms of applying machine learning models, researchers established an artificial neural network model based on clinical information of HBV–ACLF patients that could predict the mortality risk at 90 days and significantly outperform traditional models (Hou et al., 2020). XGB-CV and decision tree models also markedly outperformed traditional standard models in predicting short-term outcomes among patients with ACLF (Verma et al., 2023). These findings are consistent with our results.

While ALSS can enhance the prognosis of liver failure patients, significant survival disparities still exist among patients, and the determination of independent risk factors is essential (Zhang et al., 2024). Age, DBIL, and TT are crucial physiological parameters that demonstrate liver function and physiological status. With increasing age, the capacity of the liver to regenerate diminishes, and physiological buffers are depleted. Comorbidities and immune dysfunction continue to worsen, lowering the body’s tolerance against circulatory demands and anticoagulation hazards of ALSS and other treatments that could elevate the threat of mortality (Ma et al., 2024; Wu et al., 2018). In case of liver failure, compromised hepatocellular functions impede the clearance of normally secreted bile, leading to an acute elevation of DBIL. Increased DBIL indicates the liver’s compromised detoxifying capacity, aggravated toxin accumulation, systemic inflammation, and hepatocellular injury that results in multi-organ failure (Liang et al., 2022; Xiang et al., 2024). Moreover, an elevated TT signifies an extended coagulation process, which may indicate a disrupted coagulation cascade resulting from the impaired synthesis of coagulation factors in cases of liver failure. This impairment can elevate the risk of hemorrhage, thereby exacerbating the clinical prognosis for individuals with liver failure (Roy et al., 2024).

In the present work, LR also indicated greater values of AUC than other machine learning algorithms. This finding could be explained by the consideration that selected predictors possess approximate linear relationships with survival end-point results consistent with the underlying assumptions of LR. Moreover, on relatively limited clinical data with few predictors, LR will tend to yield more consistent robust performance, such that tree-based and kernel methods are more prone to issues of underfitting or overfitting unless the hyperparameters are optimally fine-tuned. Smooth monotonic probability estimation by LR also permits more stable risk ranking at varying thresholds than stepwise or ill-calibrated probability outputs of ensemble or nonparametric methods (Lynam et al., 2020; Nusinovici et al., 2020). Based on the analysis of the model of LR by SHAP, AFP is the most contributory predictor. Increased AFP levels indicate the damaged regeneration capacity of hepatocytes and the clearance of harmful agents, such that they are indicative of severe liver damage with poor repair capacity of the liver, both of which are consistent with poor prognosis with more severe liver disease pathophysiology (Li C et al., 2024). Retinol exhibits properties that are anti-inflammatory, antioxidant, and immune-regulatory in nature. It has the capacity to mitigate liver damage and enhance hepatic metabolism and regeneration by curbing the excessive activation of hepatic stellate cells and modulating bilirubin metabolism. Specifically, in patients suffering from ACLF and ALF, increased concentrations of retinol may decelerate the progression of liver fibrosis, alleviate hepatic burden, and possibly diminish the mortality risk (Chen et al., 2025; Romeo and Valenti, 2016).

This study’s comparison of 90-day survival across various types of liver failure revealed that ALF and ACLF patients had significantly better survival compared to CLF patients. In patients with ACLF, while there is an underlying chronic liver disease, hepatic function may still recover partially if the acute precipitating factors are effectively controlled early (Li et al., 2023). ALF typically occurs in the setting of previously normal liver function, and their hepatocyte regenerative capability is relatively intact, allowing for better response to active supportive care despite the risk of early rapid deterioration (Ocak, 2023). CLF patients, however, are frequently in a phase of irreversible hepatic parenchymal injury, along with portal hypertension, malnutrition, and various comorbidities. Their hepatocyte regenerative capability is poor, and the benefit of ALSS is minimal, which together may account for the steep decline seen in this group’s survival curve (Saliba et al., 2022). Treatment strategies and monitoring schedules, thus, need to be individualized based on the type of liver failure to ensure the optimization of intervention benefits.

Limitations

Our study has several limitations. (1) The single-center, retrospective design may introduce selection bias and limit the generalizability of findings. The absence of an external validation cohort also calls for prospective multicenter validation to assess model robustness and clinical applicability. (2) The sample size was inadequate to reveal distinctions among different types of liver failure, particularly within the ALF and ACLF cohorts, and power analysis verified that the study lacked sufficient power, thus constraining its generalizability. (3) The dataset imbalance (154 survivors versus 43 non-survivors) might skew the model toward the predominant class. Although several machine learning algorithms were utilized, methods such as SMOTE or class weighting were not implemented, and the performance was predominantly evaluated using AUC-ROC. (4) A solitary validation set for model assessment, which, despite being computationally efficient and appropriate for extensive datasets, may fail to encompass the full range of variability in model performance. (5) Only baseline data were used for prediction, without considering dynamic changes during treatment. Future research could incorporate longitudinal data to enhance the predictive performance.

Conclusion

Machine learning algorithms that incorporate significant clinical and laboratory parameters enhanced the precision of 90-day survival prediction in patients with liver failure undergoing artificial liver support therapy, among which LR showed the best performance. Such models can promote personalized treatment strategy and offer more robust evidence for clinical decisions.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by The Ethics Review Committee of the General Hospital of Western Theater Command. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because of the retrospective, anonymous nature of the study and its low risk.

Author contributions

BD: Conceptualization, Writing – original draft, Formal Analysis, Validation, Data curation. CB: Data curation, Writing – review and editing, Investigation. HX: Software, Writing – review and editing, Data curation, Visualization. XZ: Writing – review and editing, Data curation. YD: Conceptualization, Writing – review and editing, Supervision, Methodology, Project administration.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2025.1687860/full#supplementary-material

References

Agrawal D., Ariga K. K., Gupta S., Saigal S. (2025). Therapeutic plasma exchange in hepatology: indications, techniques, and practical application. J. Clin. Exp. hepatology 15 (1), 102410. doi:10.1016/j.jceh.2024.102410

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen J., Zhang Y., Deng Z., Zhu Y., Xu C., Gao B., et al. (2025). Integrated Cascade antioxidant nanozymes-Cu5.4O@CNDs combat acute liver injury by regulating retinol metabolism. Theranostics 15 (12), 5592–5615. doi:10.7150/thno.106811

PubMed Abstract | CrossRef Full Text | Google Scholar

Du L., Ma Y., Zhou S., Chen F., Xu Y., Wang M., et al. (2021). A prognostic score for patients with acute-on-chronic liver failure treated with plasma exchange-centered artificial liver support system. Sci. Rep. 11 (1), 1469. doi:10.1038/s41598-021-81019-8

PubMed Abstract | CrossRef Full Text | Google Scholar

European Association for the Study of the Liver (2023). EASL clinical Practice Guidelines on acute-on-chronic liver failure. J. hepatology 79 (2), 461–491. doi:10.1016/j.jhep.2023.04.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Figueira E. R. R., Rocha-Filho J. A., Lanchotte C., Nacif L. S., de Paiva Haddad L. B., Assalin A. R., et al. (2021). Creatinine-lactate score predicts mortality in non-acetaminophen-induced acute liver failure in patients listed for liver transplantation. BMC Gastroenterol. 21 (1), 252. doi:10.1186/s12876-021-01830-5

PubMed Abstract | CrossRef Full Text | Google Scholar

GBD 2017 Cirrhosis Collaborators (2020). The global, regional, and national burden of cirrhosis by cause in 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. Gastroenterology & Hepatology 5 (3), 245–266. doi:10.1016/S2468-1253(19)30349-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo X., Wu F., Guo W., Zhang J., Yang Y., Lu Y., et al. (2020). Comparison of plasma exchange, double plasma molecular adsorption system, and their combination in treating acute-on-chronic liver failure. J. Int. Med. Res. 48 (6), 300060520932053. doi:10.1177/0300060520932053

PubMed Abstract | CrossRef Full Text | Google Scholar

Hou Y., Zhang Q., Gao F., Mao D., Li J., Gong Z., et al. (2020). Artificial neural network-based models used for predicting 28- and 90-day mortality of patients with hepatitis B-associated acute-on-chronic liver failure. BMC Gastroenterol. 20 (1), 75. doi:10.1186/s12876-020-01191-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang K., Ji F., Xie Z., Wu D., Xu X., Gao H., et al. (2019). Artificial liver support system therapy in acute-on-chronic hepatitis B liver failure: classification and regression tree analysis. Sci. Rep. 9 (1), 16462. doi:10.1038/s41598-019-53029-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Lan X., Hong C., Zhang X., Zhou L., Li Y., Zhang C., et al. (2025). Artificial liver support System improves one-year prognosis of patients with hepatitis B virus-associated acute-on-chronic liver failure. J. Gastroenterology Hepatology 40 (4), 940–948. doi:10.1111/jgh.16883

PubMed Abstract | CrossRef Full Text | Google Scholar

Li C., Hu H., Bai C., Xu H., Liu L., Tang S. (2024). Alpha-fetoprotein and APRI as predictive markers for patients with Type C hepatitis B-related acute-on-chronic liver failure: a retrospective study. BMC Gastroenterol. 24 (1), 191. doi:10.1186/s12876-024-03276-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Li G., Zhang P., Zhu Y. (2023). Artificial liver support systems for hepatitis B virus-associated acute-on-chronic liver failure: a meta-analysis of the clinical literature. J. Viral Hepat. 30 (2), 90–100. doi:10.1111/jvh.13767

PubMed Abstract | CrossRef Full Text | Google Scholar

Li W., Liu W., Rong Y., Li D., Zhu B., Yang S., et al. (2024). Development and validation of a new prognostic model for predicting survival outcomes in patients with acute-on-chronic liver failure. J. Clin. Transl. hepatology 12 (10), 834–844. doi:10.14218/JCTH.2024.00316

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang C., Yu Z., Bai L., Hou W., Tang S., Zhang W., et al. (2022). Association of Serum Bilirubin with Metabolic Syndrome and non-alcoholic fatty liver disease: a systematic review and meta-analysis. Front. Endocrinol. 13, 869579. doi:10.3389/fendo.2022.869579

PubMed Abstract | CrossRef Full Text | Google Scholar

Liver Failure and Artificial Liver Group (2019). Guideline for diagnosis and treatment of liver failure (2018). Chin. J. Clin. Infect. Dis. 35 (1), 38–44. doi:10.3969/j.issn.1001-5256.2019.01.007

CrossRef Full Text | Google Scholar

Lynam A. L., Dennis J. M., Owen K. R., Oram R. A., Jones A. G., Shields B. M., et al. (2020). Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults. Diagnostic prognostic Res. 4, 6. doi:10.1186/s41512-020-00075-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma Y., Xu Y., Du L., Bai L., Tang H. (2024). Association between systemic immune inflammation index and short term prognosis of acute on chronic liver failure. Sci. Rep. 14 (1), 21535. doi:10.1038/s41598-024-72447-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Maiwall R., Bajpai M., Singh A., Agarwal T., Kumar G., Bharadwaj A., et al. (2020). Standard-Volume plasma exchange improves outcomes in patients with acute liver failure: a randomized controlled trial. Clin. Gastroenterol. Hepatol. 20 (4), e831–e854. doi:10.1016/j.cgh.2021.01.036

PubMed Abstract | CrossRef Full Text | Google Scholar

Nusinovici S., Tham Y. C., Chak Yan M. Y., Wei Ting D. S., Li J., Sabanayagam C., et al. (2020). Logistic regression was as good as machine learning for predicting major chronic diseases. J. Clin. Epidemiol. 122, 56–69. doi:10.1016/j.jclinepi.2020.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Ocak I. (2023). Single-center experience in 127 adult patients, mono or dual artificial liver support therapy, in patients with acute liver failure. Front. Med. 10, 1190067. doi:10.3389/fmed.2023.1190067

PubMed Abstract | CrossRef Full Text | Google Scholar

Panackel C., Raja K., Fawas M., Jacob M. (2024). Prognostic models in acute liver failure-historic evolution and newer updates “prognostic models in acute liver failure”. Clin. Gastroenterol. 73, 101957. doi:10.1016/j.bpg.2024.101957

PubMed Abstract | CrossRef Full Text | Google Scholar

Perez Ruiz de Garibay A., Kortgen A., Leonhardt J., Zipprich A., Bauer M. (2022). Critical care hepatology: definitions, incidence, prognosis and role of liver failure in critically ill patients. Crit. care London, Engl. 26 (1), 289. doi:10.1186/s13054-022-04163-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiu S., Zhao Y., Hu J., Zhang Q., Wang L., Chen R., et al. (2024). Predicting the 28-day prognosis of acute-on-chronic liver failure patients based on machine learning. Dig. liver Dis. official J. Italian Soc. Gastroenterology Italian Assoc. Study Liver 56 (12), 2095–2102. doi:10.1016/j.dld.2024.06.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Romeo S., Valenti L. (2016). Regulation of retinol-binding protein 4 and retinol metabolism in fatty liver disease. Hepatol. Baltim. Md 64 (5), 1414–1416. doi:10.1002/hep.28722

PubMed Abstract | CrossRef Full Text | Google Scholar

Roy A., Kumar Y., Verma N. (2024). Coagulopathy in acute liver failure. Clin. Gastroenterol. 73, 101956. doi:10.1016/j.bpg.2024.101956

PubMed Abstract | CrossRef Full Text | Google Scholar

Saliba F., Bañares R., Larsen F. S., Wilmer A., Parés A., Mitzner S., et al. (2022). Artificial liver support in patients with liver failure: a modified DELPHI consensus of international experts. Intensive Care Med. 48 (10), 1352–1367. doi:10.1007/s00134-022-06802-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi S., Yang Y., Liu Y., Chen R., Jia X., Wang Y., et al. (2024). Development and validation of a machine learning model to predict prognosis in liver failure patients treated with non-bioartificial liver support system. Front. Med. 11, 1368899. doi:10.3389/fmed.2024.1368899

PubMed Abstract | CrossRef Full Text | Google Scholar

Tan E. X., Wang M. X., Pang J., Lee G. H. (2020). Plasma exchange in patients with acute and acute-on-chronic liver failure: a systematic review. World J. Gastroenterology 26 (2), 219–245. doi:10.3748/wjg.v26.i2.219

PubMed Abstract | CrossRef Full Text | Google Scholar

Thuluvath P. J., Alukal J. J., Zhang T. (2021). Acute liver failure in Budd-Chiari syndrome and a model to predict mortality. Hepatol. Int. 15 (1), 146–154. doi:10.1007/s12072-020-10115-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Tong J. J., Zhao W., Mu X. Y., Xu X., Su H. B., Liu X. Y., et al. (2019). Predictive value of the Chinese group on the study of severe hepatitis B-acute-on-chronic liver failure score in the short-term prognosis of patients with hepatitis B virus-related acute-on-chronic liver failure. Chin. Med. J. 132 (13), 1541–1549. doi:10.1097/CM9.0000000000000298

PubMed Abstract | CrossRef Full Text | Google Scholar

Tujios S., Stravitz R. T., Lee W. M. (2022). Management of Acute Liver failure: update 2022. Seminars Liver Dis. 42 (3), 362–378. doi:10.1055/s-0042-1755274

PubMed Abstract | CrossRef Full Text | Google Scholar

Verma N., Choudhury A., Singh V., Duseja A., Al-Mahtab M., Devarbhavi H., et al. (2023). APASL-ACLF research consortium-artificial intelligence (AARC-AI) model precisely predicts outcomes in acute-on-chronic liver failure patients. Liver Int. official J. Int. Assoc. Study Liver 43 (2), 442–451. doi:10.1111/liv.15361

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang X,, Yang Z., Pu Z., Zheng Y., Chen H., Huang Y., et al. (2024). Development and validation of a novel prognostic nomogram for hepatitis B virus-related acute-on-chronic liver failure patients receiving artificial liver therapy. Eur. J. Med. Res. 29 (1), 556. doi:10.1186/s40001-024-02141-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang X., Zheng M. Y., He H. Y., Zhu H. L., Zhao Y. F., Chen Y. H., et al. (2024). Quality evaluation of guidelines for the diagnosis and treatment of liver failure. Crit. Care Med. 52 (10), 1624–1632. doi:10.1097/CCM.0000000000006346

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu T., Li J., Shao L., Xin J., Jiang L., Zhou Q., et al. (2018). Development of diagnostic criteria and a prognostic score for hepatitis B virus-related acute-on-chronic liver failure. Gut 67 (12), 2181–2191. doi:10.1136/gutjnl-2017-314641

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu X. N., Xue F., Zhang N., Zhang W., Hou J. J., Lv Y., et al. (2024). Global burden of liver cirrhosis and other chronic liver diseases caused by specific etiologies from 1990 to 2019. BMC Public Health 24 (1), 363. doi:10.1186/s12889-024-17948-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiang Y., Li R., Cai J., Jiang Q. (2024). Three artificial liver models of treatment of acute-on-chronic liver failure. Ther. Clin. Risk Manag. 20, 731–740. doi:10.2147/TCRM.S485620

PubMed Abstract | CrossRef Full Text | Google Scholar

Yao J., Li S., Zhou L., Luo L., Yuan L., Duan Z., et al. (2019). Therapeutic effect of double plasma molecular adsorption system and sequential half-dose plasma exchange in patients with HBV-Related acute-on-chronic liver failure. J. Clin. Apher. 34 (4), 392–398. doi:10.1002/jca.21690

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan S., Qian Y., Tan D., Mo D., Li X. (2018). Therapeutic plasma exchange: a prospective randomized trial to evaluate 2 strategies in patients with liver failure. Transfus. Apher. Sci. official J. World Apher. Assoc. official J. Eur. Soc. Haemapheresis 57 (2), 253–258. doi:10.1016/j.transci.2018.02.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang H., Yang K., Wang Q., Jin L., Wang L. M., Fan X. Y., et al. (2023). Prealbumin as a predictor of short-term prognosis in patients with HBV-related acute-on-chronic liver failure. Infect. Drug Resist. 16, 2611–2623. doi:10.2147/IDR.S402585

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang L., Ma Y., Wang X., Ma L. N., Ma W., Ding X. C. (2024). Comparative efficacy of double plasma molecular adsorption system combined with plasma exchange versus plasma exchange in treating acute-on-chronic liver failure due to hepatitis B: a meta-analysis. J. Clin. Apher. 39 (4), e22140. doi:10.1002/jca.22140

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu Z. Y., Huang X. H., Jiang H. Q., Liu L. (2024). Development and validation of a new prognostic model for patients with acute-on-chronic liver failure in intensive care unit. World J. Gastroenterology 30 (20), 2657–2676. doi:10.3748/wjg.v30.i20.2657

PubMed Abstract | CrossRef Full Text | Google Scholar

Zwirner S., Abu Rmilah A. A., Klotz S., Pfaffenroth B., Kloevekorn P., Moschopoulou A. A., et al. (2024). First-in-class MKK4 inhibitors enhance liver regeneration and prevent liver failure. Cell. 187 (7), 1666–1684.e26. doi:10.1016/j.cell.2024.02.023

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: liver failure, artificial liver therapy, survival, machine learning, predictive value

Citation: Deng B, Bai C, Xu H, Zhang X and Deng Y (2025) Machine learning-based algorithms for the prediction of 90-day survival in patients with liver failure receiving artificial liver therapy. Front. Physiol. 16:1687860. doi: 10.3389/fphys.2025.1687860

Received: 18 August 2025; Accepted: 30 September 2025;
Published: 27 October 2025.

Edited by:

Hongxiang Hui, Monterrey Park, United States

Reviewed by:

Suyavaran Arumugam, Yale University, United States
Hirotaka Tashiro, National Hospital Organization Kure Medical Center, Japan

Copyright © 2025 Deng, Bai, Xu, Zhang and Deng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Bo Deng, Ym9kMjk0OTNAZ21haWwuY29t; Ying Deng, NjI2NDkxMzQwQHFxLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.