Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Endocrinol., 09 January 2026

Sec. Thyroid Endocrinology

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1748907

Association between thyroid dysfunction and prognosis in patients with liver failure: a systematic review and meta-analysis

Shaoyin Bao&#x;Shaoyin Bao1†Yin Pan&#x;Yin Pan2†Yanyun RuanYanyun Ruan1Hozeifa Mohamed HassanHozeifa Mohamed Hassan1Hongsheng LuHongsheng Lu3Qi Chen*Qi Chen1*
  • 1Precision Medicine Center, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, China
  • 2Department of Surgical Oncology, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, China
  • 3Department of Pathology, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, China

Objective: To systematically evaluate the association between thyroid dysfunction and prognosis in patients with liver failure.

Methods: We systematically searched PubMed, Embase, Web of Science (WOS), and China National Knowledge Infrastructure (CNKI) till August 2025 to identify prospective or retrospective studies assessing the relationship between thyroid function—specifically, thyroid-stimulating hormone (TSH), free triiodothyronine (FT3), and free thyroxine (FT4)—and outcomes in adult patients with liver failure, including mortality, intensive care unit (ICU) admission, and organ failure. Study quality was evaluated using the Newcastle–Ottawa Scale (NOS). Pooled relative risks (RRs) with 95% confidence intervals (CIs) were calculated, and heterogeneity, subgroup, and sensitivity analyses were conducted.

Results: Eleven studies, including 3,595 patients from four countries, were included. Thyroid dysfunction was significantly associated with increased mortality in patients with liver failure (RR = 3.56, 95% CI: 2.77–4.57; = 36%). Subgroup analyses by sex, FT3 cut-off, and etiology (hepatitis B virus [HBV] vs. non-hepatitis B virus acute-on-chronic liver failure [non-HBV ACLF]) showed consistent associations. Additionally, a subgroup analysis comparing FT3 concentrations between survival and death groups in liver failure patients revealed a significant difference in FT3 levels between the two groups (mean difference [MD] = 0.89 [0.41, 1.37] for short-term mortality and MD = 0.42 [0.23, 0.61] for long-term mortality). Sensitivity analyses confirmed the robustness of the results. No substantial publication bias was observed.

Conclusions: Thyroid dysfunction, particularly low FT3 levels, is associated with higher mortality rates in patients with liver failure. These findings suggest that routine thyroid function assessment may help identify high-risk patients. Further well-designed prospective studies are needed to clarify the underlying mechanisms and evaluate the potential therapeutic implications.

1 Introduction

Liver failure is a severe clinical condition characterized by sudden deterioration of hepatic function, systemic inflammatory activation, and progression to multiple organ dysfunction (1, 2). Despite substantial advances in intensive care and liver transplantation, the mortality rate remains extremely high (3). Accurate prognostic assessment is essential for clinical decision-making and resource allocation. However, the currently available scoring systems, such as the Model for End-Stage Liver Disease (MELD) and the Chronic Liver Failure–Sequential Organ Failure Assessment (CLIF-SOFA), exhibit limited predictive accuracy (4, 5). The identification of novel prognostic biomarkers is therefore of considerable importance.

The thyroid-liver axis is pivotal for systemic metabolic regulation: the liver mediates thyroid hormone deiodination, conjugation, and clearance, while thyroid hormones regulate hepatic lipid, glucose, and protein metabolism (68). In liver failure, this axis is disrupted, most notably resulting in non-thyroidal illness syndrome (NTIS), a condition characterized by abnormal thyroid function tests in the absence of primary thyroid disease, including low free triiodothyronine (FT3), normal or low free thyroxine (FT4), and normal thyroid-stimulating hormone (TSH) (9, 10). This syndrome differs from primary hypothyroidism (where TSH is elevated) and is recognized as a marker of systemic illness severity.

The distinct trajectories of thyroid hormones in liver failure are mechanistically driven: FT3 declines early and consistently from​ reduced hepatic type 1 deiodinase (D1) activity and impaired peripheral conversion of thyroxine (T4) to FT3, a process exacerbated by systemic inflammation (11, 12); FT4 typically remains normal or mildly decreased, as hepatic clearance of FT4 is partially preserved and binding to thyroid-binding globulin is altered but not abolished (9, 12); TSH is usually within the normal range, reflecting intact hypothalamic-pituitary feedback, which is a critical distinction between NTIS and central hypothyroidism (13). These alterations have been linked to metabolic disturbances, immune dysfunction, and multi-organ failure progression.

A growing body of research has explored thyroid dysfunction as a prognostic factor in liver failure, but critical gaps persist: several studies report low FT3 as a prognostic marker for mortality (14), while others show varying or inconclusive findings (15, 16). These discrepancies may stem from small sample sizes, heterogeneous definitions of thyroid dysfunction (FT3-based vs. TSH-based), variable etiological spectra (HBV-related vs. non-HBV liver failure), and differing follow-up durations.

Given these uncertainties, a more robust synthesis of available data is necessary. To ensure methodological consistency and comparability of prognostic outcomes across studies, we focused specifically on cohort designs. Cohort studies provide a clear temporal sequence between thyroid dysfunction and subsequent clinical outcomes and are generally considered more reliable for estimating prognostic associations. To address the limitations of existing evidence, we performed a systematic review and meta-analysis of cohort studies to investigate the relationship between thyroid dysfunction and prognosis in patients with liver failure. Subgroup analyses were also carried out to examine the influence of sex, FT3 cut-off values, underlying etiologies, and the association between FT3 concentrations and survival outcomes in patients with liver failure, specifically comparing the FT3 concentrations between survival and death groups. Clinically, this work may complement existing prognostic tools by helping to identify high-risk patients, guide closer monitoring, and support the design of future interventional studies, with the ultimate goal of improving outcomes in this high-mortality population.

2 Methods

2.1 Literature search

A systematic literature search was conducted in the PubMed, Embase, Web of Science (WOS), and China National Knowledge Infrastructure (CNKI) databases til August 2025. The objective of this search was to identify studies that assess the association between thyroid function and prognosis in liver failure patients. Thyroid function was defined by levels of TSH, FT3, FT4, and related hormones, with a focus on prognostic outcomes including mortality, intensive care unit (ICU) admission, and organ failure.

The keywords covered terms related to liver failure, thyroid function and hormones, and prognosis or risk factors. Terms from different categories were combined using the Boolean operator AND, whereas terms within the same category were combined with OR. Searches were limited to English-language studies (Chinese allowed for CNKI) and adult populations only. The reference lists of relevant reviews were manually screened to identify additional eligible studies. Potential bias from non-English literature exclusion: Non-English studies outside CNKI were excluded due to limited access to language-specific databases.

Titles and abstracts were independently screened by two reviewers, and full texts were assessed for eligibility. Disagreements regarding inclusion were resolved through discussion. The detailed electronic search strategy is presented in the Supplementary File.

2.2 Inclusion and exclusion criteria

This study followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines for systematic reviews and meta-analyses (17).

Studies were included if they met all of the following conditions (1): prospective or retrospective cohort design (2); adult patients diagnosed with liver failure according to accepted clinical criteria (3); thyroid function assessed at baseline or during hospitalization using FT3, FT4, TSH, or their predefined categories; and (4) reported at least one prognostic outcome (mortality, ICU admission, organ failure) together with effect estimates or extractable data. The exclusion criteria were as follows (1): non-original studies, such as reviews, meta-analyses, conference abstracts, editorials, letters, or commentaries (2); studies involving participants under 18 years of age, pregnant women, or pediatric/adolescent populations (3); eligible studies that included prospective and retrospective cohort studies; case-control and cross-sectional studies were excluded (4); studies including patients with malignancy or severe infections, or those that did not report risk estimates, such as odds ratios (OR) or relative risks (RR), for thyroid function or hormone levels in relation to liver failure outcomes; and (5) studies that reported thyroid function in formats that could not be harmonized and for which effect estimates were not obtainable.

Duplicate records were removed, and titles and abstracts were screened to exclude irrelevant studies. The remaining articles were assessed through full-text review based on the inclusion and exclusion criteria. Two independent investigators (Shaoyin Bao and Hozeifa M. Hassan) evaluated all eligible studies, and any disagreements were resolved by discussion with a third investigator (Qi Chen).

2.3 Data extraction and quality assessment

Two investigators (Shaoyin Bao and Hozeifa M. Hassan) independently extracted data from the included studies. The following information was collected: first author, year of publication, country or region, study design, sample size, baseline characteristics of participants (age and sex distribution), type of liver failure, type and definition of thyroid dysfunction, follow-up duration, primary outcomes (mortality, ICU admission, and incidence of organ failure), and effect estimates (OR and RR with 95% CI). When effect sizes were not directly reported, they were calculated from available data. Additionally, a standardization of serum FT3 and FT4 concentrations to pmol/L was performed to ensure comparability across studies.

The quality of included cohort studies was assessed using the Newcastle–Ottawa Scale (NOS), which evaluates participant selection, comparability of groups, and outcome assessment. Full details of the scoring process are provided in the Supplementary File. The scale ranges from 0 to 9 points, with a score of >7 considered high quality. Data extraction and quality assessment were conducted independently by two reviewers, and any discrepancies were resolved through discussion with a third investigator (Qi Chen).

2.4 Statistical analysis

All retrieved records were imported into EndNote (version 21, Clarivate Analytics, Philadelphia, PA, USA https://endnote.com) for reference management. Duplicate publications were initially removed using EndNote’s automated deduplication function and subsequently checked manually to ensure accuracy. All statistical analyses were performed using RevMan (Review Manager version 5.4, Cochrane Collaboration, Copenhagen, Denmark https://training.cochrane.org/online-learning/core-software/revman) and R software (version 4.4.1, R Foundation for Statistical Computing, Vienna, Austria https://www.r-project.org). The relative risk (RR) was used to assess the association between thyroid dysfunction and liver failure outcomes. The mean difference (MD) was applied to compare the average differences in continuous outcomes between patient groups.​​ The 95% confidence interval (CI) indicated the precision of estimates. The P-value determined statistical significance, while the Chi-squared (Chi²) test assessed heterogeneity, and the degrees of freedom (df) were calculated based on the analysis categories. The Z-score was reported to evaluate the strength of the effect. RRs with corresponding 95% CIs were calculated as the pooled effect size. Heterogeneity across studies was assessed using the Cochran’s Q test and quantified by the statistic. An > 50% or P < 0.10 for the Q test was considered indicative of substantial heterogeneity, in which case a random-effects model was applied; otherwise, a fixed-effects model was used.

Analyses were conducted at both the overall population level and across several predefined subgroups (1): sex-based subgroup analysis (male vs. female), which was performed using sex-stratified data extracted from the included studies. To reduce ecological bias, analyses were based on within-study sex-specific comparisons, and studies without sex-stratified data were excluded from subgroup pooling (2); cut-off subgroup analysis, categorized as Low cut-off subgroup (FT3 ≤ 2.8 pmol/L, the established​ cut-off value derived from a previously published study (18)) and High cut-off subgroup (FT3 > 2.8 pmol/L), restricted to acute-on-chronic liver failure (ACLF) patients; and (3) liver failure etiology subgroup analysis, limited to hepatitis B virus (HBV)-related ACLF and ACLF of other causes due to sample size constraints; and (4) subgroup analysis comparing FT3 concentrations between survival and death groups in liver failure patients.

Sensitivity analyses were performed in two ways (1): by sequentially omitting one study at a time (leave-one-out method) to assess the stability of pooled results and (2) by excluding studies rated as low quality (NOS score ≤ 7); and (3) a pre-planned sensitivity analysis was conducted to address heterogeneity in the definition of thyroid dysfunction. As reduced TSH reflects hypothalamic–pituitary axis dysfunction rather than the peripheral low FT3 pattern typical of NTIS in liver failure, studies defining thyroid dysfunction solely by TSH were excluded from the primary meta-analysis to maintain clinical and pathophysiological consistency.

Publication bias was evaluated when ≥ 10 studies were included. Funnel plots were visually inspected, and both Egger’s regression test and Begg’s rank correlation test were conducted using R software. A two-tailed P < 0.05 was considered statistically significant.

3 Results

3.1 Study selection

Using both electronic database searches and manual screening, a total of 235 records were initially identified. After removing 36 duplicates, 199 unique records remained. Based on the titles and abstracts, 177 articles were excluded owing to their content not matching the review. The full texts of the remaining 22 articles were reviewed in detail. Of these, 11 were excluded for the following reasons: conference abstracts without sufficient data (n = 3), outcome not relevant (n = 5), duplicate data from the same cohort (n = 2), and early study with insufficient methodological quality (published in 1987, n = 1). Finally, 11 studies (14, 15, 1927) met the inclusion criteria and were included in the meta-analysis (Figure 1).

Figure 1
Flowchart depicting the identification and screening process of studies. Initially, 235 records were identified from various databases. Thirty-six duplicate records were removed, leaving 199 screened records. Of these, 177 were excluded. Twenty-two full-text articles were assessed for eligibility, with eleven studies included in the review. Exclusions were due to insufficient data, irrelevance, duplicate data, or methodological quality issues.

Figure 1. Flow chart of article selection.

3.2 Study characteristics and quality

We identified a total of 11 cohort studies from 4 countries (China, Austria, Germany, and Brazil) in the final analysis of the association between thyroid dysfunction and prognosis of liver failure, comprising 3595 participants. The sample sizes of the included studies ranged from 60 to 1862, and the methodological quality was moderate to high, with NOS scores ranging from 6 to 9. Most studies (n = 10) assessed mortality as the primary outcome, with follow-up periods ranging from 28 days to 2 years. One study additionally reported 28-day liver transplantation requirement. The baseline characteristics and quality assessment of the included studies are summarized in Table 1.

Table 1
www.frontiersin.org

Table 1. Study characteristics and quality assessment of 11 cohort studies included in the meta-analysis.

3.3 Overall meta-analysis

As illustrated in Figure 2, the initial analysis showed substantial heterogeneity ( = 82%). After excluding the study that defined thyroid dysfunction as low TSH, the overall meta-analysis of the remaining studies demonstrated that thyroid dysfunction was significantly associated with an increased risk of mortality in patients with liver failure (RR = 3.56, 95% CI: 2.77–4.57; = 36%, P < 0.00001). This exclusion was justified by both statistical and pathophysiological considerations: the excluded study’s TSH-based definition reflected central thyroid axis dysfunction, whereas the remaining studies used FT3-based definitions consistent with liver failure-induced NTIS, eliminating cross-phenotype heterogeneity.

Figure 2
Forest plot showing risk ratio comparisons between thyroid abnormalities and normal conditions across multiple studies. Each study's risk ratio, confidence interval, and weight are displayed. The overall effect size is indicated at the bottom with a diamond, representing a risk ratio of 3.56 with a confidence interval from 2.77 to 4.57, suggesting a higher risk for thyroid abnormalities. Risk ratios are presented on a logarithmic scale, with the plot favoring thyroid abnormality on the left and normal on the right. Heterogeneity analysis details are provided below the plot.

Figure 2. Pooled analysis of the association between thyroid dysfunction and mortality risk in liver failure patients, after excluding the study with low TSH definition.

The funnel plot (Figure 3) showed a symmetrical distribution of effect sizes, suggesting no substantial publication bias. This was further supported by Begg’s test (P = 0.788) and Egger’s test (P = 0.188), indicating the robustness of the pooled results.

Figure 3
Funnel plot displaying studies on a log risk ratio (RR) scale. The plot includes points representing individual studies, with dashed lines forming a symmetrical triangle, indicating a typical distribution for the standard error of the log RR.

Figure 3. Funnel plot evaluating potential publication bias in the included studies. The plot demonstrates a visually symmetrical distribution of effect sizes, indicating a low likelihood of publication bias.

3.4 Subgroup analyses

Subgroup analysis stratified by sex showed that thyroid dysfunction was significantly associated with higher mortality in both male and female individuals with liver failure (Figure 4). Among male patients, the combined RR reached 4.95 (95% CI: 2.56-9.58; = 0%, P < 0.00001), whereas in females it was 3.02 (95% CI: 1.28-7.10; = 0%, P = 0.01). The test for subgroup differences did not reveal significant heterogeneity between sexes (P for interaction = 0.37), suggesting a consistent association across genders.

Figure 4
Forest plot showing risk ratios of thyroid abnormalities versus normal conditions, divided by gender and studies. For males, the risk ratio is 4.95, and for females, it is 3.02, with a total risk ratio of 4.25. Weight percentages and confidence intervals are provided for each study. Overall effect and heterogeneity statistics are included for both genders and combined data, with a significant overall effect.

Figure 4. Subgroup analysis stratified by sex evaluating the association between thyroid dysfunction and mortality in patients with liver failure.

Subgroup analysis based on FT3 cut-off values among patients with ACLF (Figure 5) revealed that low FT3 levels (≤ 2.8 pmol/L) were significantly associated with increased mortality (RR = 3.44, 95% CI: 1.38-8.58; = 0%; P = 0.008). Similarly, in patients with higher FT3 levels (> 2.8 pmol/L), thyroid dysfunction remained significantly correlated with mortality (RR = 2.17, 95% CI: 1.15-4.13; = 0%; P = 0.02). Between-subgroup comparison indicated no significant heterogeneity (P for interaction = 0.42).

Figure 5
Forest plot showing the risk ratio of events comparing thyroid abnormalities to normal, divided into low and high cut-off subgroups. The low cut-off subgroup has a risk ratio of 3.44, while the high cut-off subgroup has a risk ratio of 2.17. Overall risk ratio for all studies is 2.60, with no significant heterogeneity. The plot includes blue squares representing individual study estimates and black diamonds indicating pooled estimates, with confidence intervals extending from 0.01 to 100.

Figure 5. Subgroup analysis stratified by FT3 cut-off values evaluating the association between thyroid dysfunction and mortality in patients with liver failure.

Subgroup analysis by etiology (Figure 6) indicated that thyroid dysfunction was significantly linked to elevated mortality in HBV-ACLF patients (RR = 1.55, 95% CI: 1.41–1.71; = 40%; P < 0.00001). In contrast, among non-HBV-ACLF patients, this association did not reach statistical significance (RR = 1.88, 95% CI: 0.90–3.95; = 0%; P = 0.09). When data from all etiologies were combined, thyroid dysfunction continued to show a significant correlation with mortality (RR = 1.56, 95% CI: 1.42–1.72; = 22%; P < 0.00001). Although the point estimate suggested a potentially stronger association in the HBV-ACLF subgroup (RR = 1.55) compared to the non-HBV-ACLF subgroup (RR = 1.88), the test for subgroup differences was not statistically significant (P for interaction = 0.61), indicating that the effect of thyroid dysfunction may not differ substantially between these etiologies.

Figure 6
Forest plot comparing risk ratios for thyroid abnormalities in non-HBV ACLF and HBV-ACLF groups across various studies. Diamonds represent summary estimates with confidence intervals. Subtotals show increased risk ratios for both groups, favoring thyroid abnormality presence. Heterogeneity and effect tests are included.

Figure 6. Subgroup analysis stratified by etiology evaluating the association between thyroid dysfunction and mortality in patients with liver failure.

Subgroup analysis by survival status (Figure 7) showed a significant association between FT3 concentrations and mortality in liver failure patients. The overall analysis revealed that FT3 levels were significantly lower in the death group compared to the survival group (Total MD = 0.70 [0.36, 1.04]; = 78%; P < 0.0001). Further subgroup analysis (Figure 8) indicated that in the short-term mortality group, FT3 concentrations were significantly lower, with a mean difference of 0.89 [0.41, 1.37] ( = 76%; P = 0.0003). In contrast, the long-term mortality group (Figure 9) showed a smaller but still significant difference in FT3 levels (MD = 0.42 [0.23, 0.61]; = 34%; P < 0.0001). These findings suggest that low FT3 levels are associated with both short-term and long-term mortality, with a stronger correlation in the short-term mortality group. Considerable heterogeneity was observed in both the overall analysis ( = 78%) and the short-term mortality subgroup ( = 76%). To explore potential sources of heterogeneity, we reviewed study characteristics including liver failure etiology, timing of FT3 measurement, measurement platforms, and patient severity at baseline. Several factors appear to contribute. For instance, Hong-Ling Feng_2020 included a mixed population consisting of acute liver failure, subacute liver failure, and acute-on-chronic liver failure, whereas Nardin_2024 focused exclusively on patients with acute decompensation. Variability in laboratory measurement platforms may also contribute to heterogeneity. Hong-Ling Feng_2020 used the Roche Cobas e411 electrochemiluminescence immunoassay, whereas Nardin_2024 measured thyroid hormones using the Siemens ADVIA Centaur XP chemiluminescent microparticle immunoassay. Taken together, the diversity in liver failure etiologies and the inconsistency in assay methods across studies are likely to account for part of the substantial heterogeneity observed in the survival vs. death comparisons.

Figure 7
Forest plot displaying survival versus death mean differences across four studies. Each study is represented by a green square with a horizontal line indicating the confidence interval. Combined total mean difference is 0.70 with a 95% confidence interval of 0.36 to 1.04. Heterogeneity statistics include Tau² = 0.09, Chi² = 13.53 (P = 0.004), I² = 78%. Overall effect Z value is 4.04 (P < 0.0001).

Figure 7. Subgroup analysis based on survival status comparing FT3 concentrations between survival and death groups in liver failure patients.​​ The overall pooled analysis demonstrates significantly lower FT3 levels in the death group compared to the survival group.

Figure 8
Forest plot displaying mean differences in survival and death between two studies, Lin Lin_2024 and Liuxing_2023. Lin Lin_2024 shows a mean difference of 0.70 with a 14.9 percent weight, while Liuxing_2023 shows a mean difference of 0.37 with an 85.1 percent weight. The overall effect size is 0.42, confidence interval 0.23 to 0.61. Heterogeneity chi-squared is 1.52, and P-value is 0.22, indicating 34 percent heterogeneity. Overall effect significance Z equals 4.40 with a P-value less than 0.0001.

Figure 8. Subgroup analysis comparing FT3 levels between survival and short-term mortality groups in liver failure patients.​​ The pooled analysis demonstrates significantly lower FT3 levels in the short-term mortality group.

Figure 9
Forest plot comparing survival and death rates from two studies, Hong-Ling Feng 2020 and Nardin 2024. The plot displays mean differences with a ninety-five percent confidence interval. The overall mean difference is 0.89, favoring survival. Heterogeneity statistics indicate variability among studies.

Figure 9. Subgroup analysis comparing FT3 concentrations between survival and long-term mortality groups in liver failure patients.​​ The pooled analysis shows a significant but smaller reduction in FT3 levels in the long-term mortality group compared to the survival group.

3.5 Sensitivity analysis

To evaluate the robustness of the results, sensitivity analyses were conducted using both the leave-one-out approach and restriction to studies with higher methodological quality (NOS >7). As summarized in Table 2, omitting individual studies in turn yielded pooled RRs ranging from 3.01 to 3.58, indicating that no single study drove the overall effect. Notably, the excluding of Chenjunfeng_2022 resulted in a substantial decline in heterogeneity, with decreasing from 82% to 36%, while the association with mortality remained statistically significant. When only high-quality studies were included, the summary estimate was comparable (RR = 3.30, 95% CI: 2.03–5.35), reinforcing the stability of the findings.

Table 2
www.frontiersin.org

Table 2. Sensitivity analysis results.

4 Discussion

The aim of our meta-analysis was to evaluate the relationship between thyroid dysfunction and prognosis in patients with liver failure. We found that low FT3 was consistently associated with increased mortality, and the risk remained stable across subgroup analyses by sex, FT3 cut-off, and etiology. Sensitivity analyses further confirmed the robustness of this association. Notably, the exclusion of Chenjunfeng_2022 substantially reduced heterogeneity, likely due to the distinct definition of thyroid dysfunction in this study (low TSH rather than low FT3), reflecting central rather than peripheral dysregulation. This distinction is not merely semantic: low TSH reflects potential central hypothalamic-pituitary dysfunction, which is different in pathophysiology from the NTIS caused by liver failure.

From a mechanistic perspective, reduced FT3 is a core feature of NTIS, may reflect the impaired hepatic clearance capacity and altered deiodinase activity under liver injury conditions (14, 28). These hormonal abnormalities may be linked to, rather than definitively cause, metabolic disorders, systemic inflammation, and immune dysfunction, thereby potentially contributing to multi-organ failure and increasing mortality death (29). Given the observational nature of the included studies, the biological explanations should be viewed as hypotheses rather than established mechanisms. Our subgroup analysis further revealed that the association between FT3 concentrations and mortality differed between short-term and long-term outcomes. FT3 levels were significantly lower in the death group compared to the survival group, particularly in the short-term mortality group. This finding suggests that FT3 might serve as a potential prognostic marker for liver failure short-term outcomes, potentially due to its role in rapidly modulating metabolic processes and inflammation during acute liver failure (14). It is critical to acknowledge that the observed association does not establish causality, and FT3 levels should not be interpreted as a direct causal determinant of mortality.

In the etiology subgroup analysis, a significant association was observed in HBV-ACLF patients, while the result in non-HBV-ACLF patients did not reach statistical significance. However, the test for subgroup differences was not significant (P for interaction = 0.61), indicating a lack of statistical evidence for a true etiological difference. Therefore, the observed numerical difference should be interpreted with caution. The non-significance in the non-HBV-ACLF subgroup is likely attributable to its smaller sample size and greater etiological heterogeneity. Although it is physiologically plausible that immune-mediated injury in HBV-ACLF could amplify the consequences of thyroid dysfunction (30, 31), our findings do not confirm this hypothesis. The results underscore the need for confirmation in larger, prospective cohorts with adequate power to detect true etiological interactions.

From a clinical perspective, the study demonstrates the feasibility of thyroid function evaluation in risk stratification for liver failure. Monitoring FT3 and other thyroid parameters may help identify patients at high risk of liver failure who could benefit from intensified surveillance or tailored interventions (20, 22). Clinically, measuring thyroid hormones is inexpensive, widely available, and can be incorporated into routine laboratory panels, which supports its potential use in early risk flagging. However, its application has important limitations: thyroid abnormalities in liver failure are highly dynamic, influenced by non-thyroidal illness, and may not reflect intrinsic thyroid gland dysfunction. The observation that FT3 concentrations were more strongly associated with short-term mortality highlights the potential for using FT3 as an early biomarker for predicting acute liver failure outcomes. Nonetheless, the absence of interventional evidence means that correcting low FT3 cannot currently be recommended as a therapeutic strategy. Whether thyroid hormone supplementation improves prognosis remains unknown, and any clinical applications should rely on future randomized trials (15).

Future research should focus on large-scale prospective cohort studies to validate the prognostic value of FT3 and other thyroid parameters in diverse liver failure populations. Mechanistic studies are also needed to clarify how thyroid hormone alterations contribute to systemic inflammation, metabolic dysfunction, and multiorgan failure. In addition, whether therapeutic modulation of thyroid hormones can improve clinical outcomes warrants rigorous evaluation through randomized controlled trials. The development of etiology-specific prognostic models incorporating thyroid function markers may further enhance individualized risk stratification in patients with liver failure.

Some limitations of this analysis need to be addressed. Primary, differences in study design, patient populations, and definitions of thyroid dysfunction may have influenced the pooled estimates. Additionally, the predominance of retrospective studies introduces potential selection and reporting biases. Our literature search focused on PubMed, Embase, Web of Science, and CNKI, excluding the Cochrane Library, Google Scholar, and other study registries. This may have resulted in the omission of eligible studies, particularly gray literature or unpublished data, thereby contributing to potential selection bias. Notably, we didn’t register a protocol with PROSPERO (International Prospective Register of Systematic Reviews) or INPLASY (International Platform of Registered Systematic Review and Meta-analysis Protocols) beforehand, which may lead to publication bias or duplication of efforts. Furthermore, subgroup analyses were unable to thoroughly investigate potential effect modifiers such as age, liver failure severity, or regional differences. Lastly, the observational nature of the included studies precludes causal conclusions, and residual confounding cannot be excluded.

5 Conclusion

In conclusion, our findings confirm that thyroid dysfunction, particularly low FT3 levels, is a significant prognostic factor linked to increased mortality in patients with liver failure. Subgroup analysis further revealed that FT3 concentrations were more strongly associated with short-term mortality compared to long-term mortality, suggesting its potential as an early biomarker for predicting acute liver failure outcomes. These results highlight the potential benefit of incorporating thyroid function assessment into routine clinical practice for improved risk stratification. However, further large-scale prospective studies are needed to establish causality and explore the therapeutic potential of addressing thyroid dysfunction.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

SB: Methodology, Software, Writing – original draft, Writing – review & editing, Validation, Investigation, Data curation, Formal analysis, Visualization. YP: Data curation, Validation, Writing – original draft, Writing – review & editing, Investigation, Formal analysis, Visualization. YR: Data curation, Writing – original draft, Writing – review & editing, Investigation, Visualization, Validation. HH: Data curation, Writing – original draft, Writing – review & editing, Validation, Visualization, Formal analysis. HL: Data curation, Writing – original draft, Writing – review & editing, Validation, Visualization, Project administration. QC: Funding acquisition, Project administration, Writing – original draft, Writing – review & editing, Supervision, Conceptualization, Resources, Visualization, Validation.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This study was supported by the Natural Science Foundation of Zhejiang Province (TGY24H200013) and the Zhejiang Provincial Medical and Health Science and Technology Plan (2023KY1330, 2024KY537).

Acknowledgments

The authors gratefully acknowledge the Precision Medicine Center, Taizhou Central Hospital (Taizhou University Hospital) for their generous support of this study, and especially thank the reviewers for allowing the authors to improve the manuscript.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1748907/full#supplementary-material

Abbreviations

ACLD, advanced chronic liver disease; ACLF, acute-on-chronic liver failure; AD, acute decompensation; ALF, acute liver failure; CC, compensated cirrhosis; Chi², chi-squared test; CI, confidence interval; CLIF-SOFA, chronic liver failure–sequential organ failure assessment; CNKI, China national knowledge infrastructure; DC, decompensated cirrhosis; df, degrees of freedom; FT3, free triiodothyronine; FT4, free thyroxine; HBV, hepatitis B virus; HE, hepatic encephalopathy; ICU, intensive care unit; INPLASY, International Platform of Registered Systematic Review and Meta-analysis Protocols; MD, mean difference; MELD, model for end-stage liver disease; NA, not available; NOS, newcastle–ottawa scale; NTIS, non-thyroidal illness syndrome; OR, odds ratio; PRISMA, preferred reporting items for systematic reviews and meta-analyses; PROSPERO, international prospective register of systematic reviews; RR, relative risk; rT3, reverse triiodothyronine; SALF, subacute liver failure; TSH, thyroid-stimulating hormone; TT3, total triiodothyronine; WOS, web of science.

References

1. Maiwall R, Kulkarni AV, Arab JP, and Piano S. Acute liver failure. Lancet. (2024) 404:789–802.

Google Scholar

2. Shingina A, Mukhtar N, Wakim-Fleming J, Alqahtani S, Wong RJ, Limketkai BN, et al. Acute liver failure guidelines. Am J Gastroenterol. (2023) 118:1128–53. doi: 10.14309/ajg.0000000000002340

PubMed Abstract | Crossref Full Text | Google Scholar

3. Zhou X, Chen X, Du H, Ye Y, Miu Y, Su T, et al. Antithrombin III activity is associated with prognosis, infection, and inflammation in patients with hepatitis B virus-related acute-on-chronic liver failure. Eur J Gastroenterol Hepatol. (2023) 35:914–20. doi: 10.1097/MEG.0000000000002571

PubMed Abstract | Crossref Full Text | Google Scholar

4. Cheng GW, Fang Y, Xue LY, Zhang Y, Xie XY, Qiao XH, et al. Nomogram based on liver stiffness and spleen area with ultrasound for posthepatectomy liver failure: A multicenter study. World J Gastroenterol. (2024) 30:3314–25. doi: 10.3748/wjg.v30.i27.3314

PubMed Abstract | Crossref Full Text | Google Scholar

5. Lee M, Lee JH, Oh S, Jang Y, Lee W, Lee HJ, et al. CLIF-SOFA scoring system accurately predicts short-term mortality in acutely decompensated patients with alcoholic cirrhosis: a retrospective analysis. Liver Int. (2015) 35:46–57. doi: 10.1111/liv.12683

PubMed Abstract | Crossref Full Text | Google Scholar

6. Wang C, Niu Q, Lv H, Li Q, Ma Y, Tan J, et al. Elevated TPOAb is a strong predictor of autoimmune development in patients of type 2 diabetes mellitus and non-alcoholic fatty liver disease: A case-control study. Diabetes Metab Syndr Obes. (2020) 13:4369–78. doi: 10.2147/DMSO.S280231

PubMed Abstract | Crossref Full Text | Google Scholar

7. Sinha RA, Bruinstroop E, and Yen PM. Actions of thyroid hormones and thyromimetics on the liver. Nat Rev Gastroenterol Hepatol. (2025) 22:9–22. doi: 10.1038/s41575-024-00991-4

PubMed Abstract | Crossref Full Text | Google Scholar

8. Soares De Oliveira L and Ritter MJ. Thyroid hormone and the liver. Hepatol Commun. (2025) 9. doi: 10.1097/HC9.0000000000000596

PubMed Abstract | Crossref Full Text | Google Scholar

9. Gao S, Ma W, Huang S, Lin X, and Yu M. Impact of low triiodothyronine syndrome on long-term outcomes in patients with myocardial infarction with nonobstructive coronary arteries. Ann Med. (2021) 53:741–9. doi: 10.1080/07853890.2021.1931428

PubMed Abstract | Crossref Full Text | Google Scholar

10. Warner MH and Beckett GJ. Mechanisms behind the non-thyroidal illness syndrome: an update. J Endocrinol. (2010) 205:1–13. doi: 10.1677/JOE-09-0412

PubMed Abstract | Crossref Full Text | Google Scholar

11. Ön Ş, Acar S, Demir K, Abacı A, Öztürk Y, Kahveci Çelik S, et al. Evaluation of thyroid function tests in children with chronic liver diseases. J Clin Res Pediatr Endocrinol. (2020) 12:143–9. doi: 10.4274/jcrpe.galenos.2019.2019.0029

PubMed Abstract | Crossref Full Text | Google Scholar

12. Bohinc BN, Michelotti G, Xie G, Pang H, Suzuki A, Guy CD, et al. Repair-related activation of hedgehog signaling in stromal cells promotes intrahepatic hypothyroidism. Endocrinology. (2014) 155:4591–601.

PubMed Abstract | Google Scholar

13. Fliers E, Bianco AC, Langouche L, and Boelen A. Thyroid function in critically ill patients. Lancet Diabetes Endocrinol. (2015) 3:816–25. doi: 10.1016/S2213-8587(15)00225-9

PubMed Abstract | Crossref Full Text | Google Scholar

14. Feng HL, Li Q, Cao WK, and Yang JM. Changes in thyroid function in patients with liver failure and their clinical significance: A clinical study of non-thyroidal illness syndrome in patients with liver failure. Hepatobiliary Pancreat Dis Int. (2020) 19:561–6. doi: 10.1016/j.hbpd.2020.05.001

PubMed Abstract | Crossref Full Text | Google Scholar

15. Chen JF, Weng WZ, Huang M, Peng XH, Zhang J, Xiong J, et al. The impact of serum thyroid-stimulation hormone levels on the outcome of hepatitis B virus related acute-on-chronic liver failure: an observational study. BMC Gastroenterol. (2022) 22:330. doi: 10.1186/s12876-022-02406-7

PubMed Abstract | Crossref Full Text | Google Scholar

16. Wu Y, You S, Zang H, Liu H, Mao Y, Mao P, et al. Usefulness of serum thyroid-stimulation hormone (TSH) as a prognostic indicator for acute-on-chronic liver failure. Ann Hepatol. (2015) 14:218–24. doi: 10.1016/S1665-2681(19)30784-7

PubMed Abstract | Crossref Full Text | Google Scholar

17. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. Bmj. (2021) 372:n71.

Google Scholar

18. Rong F, Dai H, Wu Y, Li J, Liu G, Chen H, et al. Association between thyroid dysfunction and type 2 diabetes: a meta-analysis of prospective observational studies. BMC Med. (2021) 19:257. doi: 10.1186/s12916-021-02121-2

PubMed Abstract | Crossref Full Text | Google Scholar

19. Huang X, Jiang S, Fan X, Jiang Y, Wu L, Li F, et al. Low-free triiodothyronine is associated with poor prognosis of portal hypertension in cirrhosis. Eur J Gastroenterol Hepatol. (2020) 32:1358–63. doi: 10.1097/MEG.0000000000001658

PubMed Abstract | Crossref Full Text | Google Scholar

20. Lin L, Huang ZY, Liu K, Tong XC, Zhang ZX, and Xue Y. The free triiodothyronine, gamma-glutamyl transpeptidase and spontaneous bacterial peritonitis index: A novel model for predicting 1-year mortality in patients with HBV-related hepatic encephalopathy. Hepat Med. (2024) 16:1–9. doi: 10.2147/HMER.S450638

PubMed Abstract | Crossref Full Text | Google Scholar

21. Liu X, Kong M, Hua X, Yang Y, Xu M, Bi Y, et al. Association of energy metabolism with serum thyroid hormone levels in patients with liver failure and their impact on prognosis. J Clin Hepatology. (2023) 39:137–41.

Google Scholar

22. Hartl L, Simbrunner B, Jachs M, Wolf P, Bauer DJM, Scheiner B, et al. Lower free triiodothyronine (fT3) levels in cirrhosis are linked to systemic inflammation, higher risk of acute-on-chronic liver failure, and mortality. JHEP Rep. (2024) 6:100954. doi: 10.1016/j.jhepr.2023.100954

PubMed Abstract | Crossref Full Text | Google Scholar

23. Mona-May L, Alina B, Sabrina G, Sebastian H, Denise Z, Lars M, et al. Thyroid hormone alterations are associated with decompensated liver cirrhosis and acute-onchronic liver failure. Z fur Gastroenterologie. (2022) 60:e17.

Google Scholar

24. Nardin G, Colombo BDS, Ronsoni MF, Silva P, Fayad L, Wildner LM, et al. Thyroid hormone profile is related to prognosis in acute decompensation of cirrhosis. Arch Endocrinol Metab. (2024) 68:e230249. doi: 10.20945/2359-4292-2023-0249

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wu Y, Wan Z, You S, and Xin S. Usefulness of serum T3, and TSH as prognostic indicators for patients of acute-on-chronic liver failure with hepatitis B virus infection. Hepatol Int. (2014) 8:S3.

Google Scholar

26. Zhang J, Zhang L, Fu X, Chen Y, Duan Z, and Tian G. The value of dynamic changes in FT3 level for predicting 90-day prognosis of HBV-ACLF patients. Eur J Med Res. (2024) 29:288. doi: 10.1186/s40001-024-01770-2

PubMed Abstract | Crossref Full Text | Google Scholar

27. Anastasiou O, Sydor S, Sowa JP, Manka P, Katsounas A, Syn WK, et al. Higher thyroid-stimulating hormone, triiodothyronine and thyroxine values are associated with better outcome in acute liver failure. PloS One. (2015) 10:e0132189. doi: 10.1371/journal.pone.0132189

PubMed Abstract | Crossref Full Text | Google Scholar

28. Plikat K, Langgartner J, Buettner R, Bollheimer LC, Woenckhaus U, Schölmerich J, et al. Frequency and outcome of patients with nonthyroidal illness syndrome in a medical intensive care unit. Metabolism. (2007) 56:239–44.

PubMed Abstract | Google Scholar

29. Mosca A, Manco M, Braghini MR, Cianfarani S, Maggiore G, Alisi A, et al. Environment, endocrine disruptors, and fatty liver disease associated with metabolic dysfunction (MASLD). Metabolites. (2024) 14. doi: 10.3390/metabo14010071

PubMed Abstract | Crossref Full Text | Google Scholar

30. Zhou C, Zhang N, He TT, Wang Y, Wang LF, Sun YQ, et al. High levels of serum interleukin-6 increase mortality of hepatitis B virus-associated acute-on-chronic liver failure. World J Gastroenterol. (2020) 26:4479–88. doi: 10.3748/wjg.v26.i30.4479

PubMed Abstract | Crossref Full Text | Google Scholar

31. Wang X and Ning Q. Immune mediated liver failure. Excli J. (2014) 13:1131–44.

Google Scholar

Keywords: free triiodothyronine, liver failure, meta-analysis, mortality, thyroid dysfunction

Citation: Bao S, Pan Y, Ruan Y, Hassan HM, Lu H and Chen Q (2026) Association between thyroid dysfunction and prognosis in patients with liver failure: a systematic review and meta-analysis. Front. Endocrinol. 16:1748907. doi: 10.3389/fendo.2025.1748907

Received: 18 November 2025; Accepted: 15 December 2025; Revised: 15 December 2025;
Published: 09 January 2026.

Edited by:

Chukwuka Elendu, Federal University Teaching Hospital, Nigeria

Reviewed by:

Emmanuel Kokori, University of Ilorin, Nigeria
Adetola Babalola, Temple University, United States

Copyright © 2026 Bao, Pan, Ruan, Hassan, Lu and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qi Chen, Y2hlbnFAdHpjLmVkdS5jbg==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.