Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 12 January 2026

Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1735042

Construction of a Bayesian network-based risk prediction model for hepatocellular carcinoma in cirrhotic patients

Ni MaNi Ma1Jingwei SongJingwei Song1Yuqing Yang*Yuqing Yang2*
  • 1School of Public Health, Xinjiang Medical University, Urumqi, China
  • 2The Second Affiliated Hospital of Xi’an Jiaotong University Xinjiang Hospital, People’s Hospital of Xinjiang Uygur Autonomous Region Bainiaohu Hospital, Urumqi, China

Objective: To investigate clinical data of hospitalised cirrhosis patients, identify risk factors for cirrhosis progression to hepatocellular carcinoma, establish a risk prediction model, and provide scientific basis for early identification of high-risk patients.

Methods: Hospitalised cirrhosis patients treated at Xinjiang Uygur Autonomous Region People’s Hospital between January 2019 and December 2023 were selected. Their inpatient examination records were retrieved, including medical record summaries alongside results for coagulation function, complete blood count, liver function tests, urinalysis, renal function tests, tumour markers, comprehensive hepatitis panel, comprehensive thyroid function panel, comprehensive biochemical panel, glucose series, and lipid series. Patients diagnosed with hepatic malignancy during subsequent hospitalisations (excluding the initial admission) formed the cancer progression group, while those without hepatic malignancy constituted the control group. Univariate and multivariate analyses identified risk factors for hepatocellular carcinoma (HCC) progression in cirrhosis patients. A predictive model for HCC development in cirrhosis patients was constructed using a combined Lasso regression model and Bayesian network model.

Results: This study enrolled 1,204 individuals, including 1,128 cirrhosis patients, of whom 76 progressed to liver malignancy. Multivariate logistic regression analysis indicated that female gender was a protective factor against cirrhosis progression to liver malignancy(OR = 0.532, OR95% CI = 0.297-0.952); while hepatitis B, elevated total cholesterol, and reduced antithrombin III activity were risk factors for progression to hepatocellular carcinoma (OR = 4.080, OR95%CI = 2.443-6.814;OR = 2.308, OR95%CI = 1.132-4.707, OR = 2.982, OR 95% CI = 1.389-6.402) (P < 0.05); The LASSO regression model ultimately identified 14 variables most significantly associated with the transformation of cirrhosis into malignant liver tumours: PT, TT, MAO, ALP, PDW, CRP, CK-MB, Ca, P, TC, AFP, FT4, SCC, AT3. The results of the Bayesian network model indicate that DOI, AT3, SCC, CRP, and TC have a direct connection with the occurrence of hepatic malignancy; Gender, FT4, hepatitis type, and Ca indirectly influence the development of hepatic malignancy. Using sensitivity analysis, we examined the probability of cirrhosis patients progressing to hepatocellular carcinoma under specific conditions. The results indicated that the probability of liver cirrhosis patients progressing to hepatocellular carcinoma was 0.0749 (7.5%) when AT3 levels were normal, 0.0709 (7.1%) when AT3 was low, and 0.0850 (8.5%) when AT3 was high. When TC levels were normal, the probability was 0.0763 (7.6%), and it increased to 0.1309 (13.1%) when TC was elevated. The performance of the Bayesian network model (AUC = 0.857, Brier Score = 0.052, Accuracy = 0.940, C-index = 0.833 (95% CI: 0.784–0.882), Sensitivity = 0.836, Specificity=0.744) was superior to that of the logistic regression model (AUC = 0.780, Brier Score = 0.056, Accuracy = 0.934, C-index = 0.785 (95% CI: 0.726–0.843), Sensitivity = 0.727, Specificity = 0.766). Ten-fold cross-validation showed an average accuracy of 0.93. After balancing sensitivity and specificity, the optimal threshold was determined by maximizing the Youden index (0.052), with a predicted probability >0.052 indicating progression from liver cirrhosis to hepatic malignancy. Based on the ROC curve, threshold 1 was set at 0.2 and threshold 2 at 0.8, establishing risk stratification as follows: low risk (predicted probability <0.2), intermediate risk (0.2 ≤ predicted probability ≤ 0.8), and high risk (predicted probability >0.8). This resulted in 318 patients classified as low risk, 42 as intermediate risk, and 1 as high risk.

Conclusions: Gender, hepatitis B, TC, and AT3 constitute risk factors for hepatocellular carcinoma in cirrhotic patients; Gender, hepatitis type, DOI, FT4, AT3, SCC, CRP, MAO, and Ca are associated with the progression of liver cirrhosis to malignant liver tumours either directly or indirectly. The risk prediction model constructed by combining LASSO regression with Bayesian networks demonstrates good predictive value.

1 Introduction

Liver cancer, specifically hepatocellular carcinoma (HCC), is a malignant tumour originating from liver cells, primarily comprising two types: hepatocellular carcinoma and cholangiocarcinoma. It ranks among the most prevalent malignant tumours globally, exhibiting high incidence and mortality rates. According to 2020 estimates, hepatocellular carcinoma ranks sixth among the most common malignant tumours worldwide, third among causes of malignant tumour mortality (1), and second among causes of premature cancer death (2). Hepatocellular carcinoma presents insidiously, with a poor prognosis in advanced stages, a median survival of less than one year, and a five-year survival rate of merely 7% (3, 4). Substantial evidence indicates that cirrhosis constitutes the primary pathological basis for the development of hepatocellular carcinoma (HCC), with over 80% of HCC cases originating from this population (5). Consequently, it has become an international consensus to identify cirrhosis patients as the core target group for primary prevention and early monitoring of HCC. Current clinical practice guidelines recommend regular ultrasound screening for all patients with liver cirrhosis. However, significant heterogeneity exists within this cohort, and the risk of progression to hepatocellular carcinoma (HCC) is not uniform. Therefore, precise risk stratification of cirrhosis patients to identify a truly “extremely high-risk” subgroup and develop individualised enhanced monitoring protocols for them is crucial for optimising HCC prevention and control systems and improving patient outcomes. This study aims to identify risk factors for the progression of liver cirrhosis to hepatocellular carcinoma by analysing diagnostic data from hospitalised cirrhosis patients, and to establish a predictive risk model. This endeavour seeks to provide scientific evidence for the early identification of high-risk individuals for hepatocellular carcinoma among cirrhosis patients.

2 Research content and methods

2.1 Research subjects

Patients hospitalised for liver cirrhosis at the Xinjiang Uygur Autonomous Region People’s Hospital between January 2019 and December 2023 were selected. Exclusion criteria were: ① Patients hospitalised for less than 24 hours; ② Patients with newly diagnosed malignant liver tumours during their first admission; ③ Patients with secondary liver cancer; ④ Patients with concomitant other tumours.

2.2 Research content

An analysis of the general characteristics and examination results of patients with cirrhosis and those who progressed from cirrhosis to hepatocellular carcinoma was conducted. This identified the factors influencing the development of hepatocellular carcinoma in cirrhosis patients and enabled the construction of a risk prediction model for the progression from cirrhosis to hepatocellular carcinoma.

2.3 Research methodology

The hospital information department retrieved medical records for cirrhosis patients using their hospital admission numbers as identifiers. This included the front pages of medical records and test results for coagulation function, complete blood count, liver function tests, urinalysis, renal function tests, tumour markers, comprehensive hepatitis panel, comprehensive thyroid function panel, comprehensive biochemical panel, glucose series, and lipid series. A total of 97 variables were selected, with incomplete or missing data removed from the records.

Patients with liver cirrhosis who were diagnosed with hepatic malignancy during subsequent hospitalisations (excluding the initial admission) constituted the malignancy group, while those without hepatic malignancy served as the control group. The progression of liver cirrhosis to hepatic malignancy was confirmed through clinical presentation, imaging studies, and pathological diagnosis. This study was approved by the hospital’s ethics committee (KY2024052420).

Diagnostic criteria for malignant liver tumours: Guidelines for the Diagnosis and Treatment of Hepatocellular Carcinoma (2019 Edition) (6).

Diagnostic criteria for cirrhosis: Evidence-based clinical practice guidelines for Liver Cirrhosis (7).

2.4 Statistical methods

1. Use Excel software to perform preliminary data processing.

2. Univariate analysis and factor analysis were conducted using SPSS 26.0 software. Numerical variables were expressed as mean ± standard deviation, with intergroup comparisons performed using the independent samples t-test. Categorical variables were represented by frequency and proportion, with intergroup comparisons conducted using the chi-square test. Factor analysis employed a binary logistic regression model. The significance level was set at P < 0.05.

3. Handling Missing Values and Outliers: Missing values were detected using the summary function in R. If any missing values were found, the Multiple Imputation method was employed for their imputation. Outliers were detected using boxplots, where values below Q1-1.5 * IQR or above Q3 + 1.5 * IQR were considered outliers. If an outlier resulted from an operational error during data recording, the data were re-verified and corrected. If the outlier was a natural component of the population, the winsorization method was applied to adjust the outlier to the boundary value of the corresponding percentile.

4. Employing the glmnet package within R software for LASSO regression variable selection, cross-validation was utilised to identify the optimal λ. Model λ screening primarily encompasses lambda.min and lambda.1se; this study employed lambda.min to select variables most strongly associated with the outcome.

5. Bayesian network modelling was conducted using the bnleam package within R software. Divide the data into training and test sets in a 7:3 ratio. First, use the training set data with the hc function to train the network, obtaining a preliminary network architecture. The bn.fit_MLE function was then used to learn the network parameters. The cpquery function facilitated Bayesian network inference and the calculation of conditional probability distribution tables. Subsequently, nodes and directed links may be added to the Bayesian network model based on existing research findings and decision tree models, employing the set.arc function for this purpose. Finally, Netica software was used to construct the Bayesian network model. The built-in node inference function of Netica was employed to calculate the probability of outcome changes caused by the cumulative effects of various factors. Additionally, sensitivity analysis was conducted using the “grain”package in R to analyse the probability of the outcome under specific single-factor conditions.

6. Model Evaluation: The “ggplot2” package in R was used to plot calibration curves. By comparing the Bayesian network model with the logistic regression model, the predictive performance of the Bayesian network model was assessed through a comprehensive comparison of several evaluation metrics, including Brier Score, AUC, Accuracy, C-index (95% CI), Sensitivity, and Specificity. k−fold cross-validation was employed to evaluate the accuracy rate of the model.

7. Model Prediction Threshold and Risk Stratification Strategy: The trained Bayesian network model was applied to the test set to compute risk probabilities. After evaluating the model’s performance, the ROC curve and AUC were calculated. The optimal threshold was determined from the ROC curve by maximizing the Youden index. Based on the predicted probabilities and actual classifications, two thresholds were defined: Threshold 1 (the upper limit for low risk—the threshold at which specificity reaches 90%) and Threshold 2 (the lower limit for high risk—the threshold at which sensitivity reaches 90%). The curve was then segmented into three parts corresponding to low, intermediate, and high risk. Finally, the relationships between risk strata and predicted probabilities, risk strata and actual cancer status, as well as a histogram of predicted probability distribution, were plotted.

3 Result

3.1 General overview

A total of 1,204 individuals were included in this study, comprising 1,128 patients with liver cirrhosis, with no missing values identified. Among them, 76 patients with liver cirrhosis developed hepatic malignancy. There were 724 males and 480 females. The proportion of males progressing to liver malignancy (8.3%) was higher than that of females (3.3%). Patients with cirrhosis and hepatitis B showed the highest incidence of liver malignancy progression (13.3%). Patients with a disease duration of ≥50 years had a higher incidence of liver malignancy progression than those with other durations. Patients with normal BMI exhibited the highest incidence of liver malignancy (8.3%); cirrhosis patients with elevated total cholesterol levels demonstrated a higher incidence of liver malignancy (11.8%); cirrhosis patients with reduced antithrombin III activity showed a higher incidence of liver malignancy (8.6%) (P < 0.05), as shown in Table 1.

Table 1
www.frontiersin.org

Table 1. Univariate analysis of hepatocellular carcinoma development in patients with liver cirrhosis.

3.2 Risk factor analysis

The results of the multivariable logistic regression model indicate that being female is a protective factor against the progression of liver cirrhosis to hepatocellular carcinoma (OR = 0.532, OR 95% CI = 0.297-0.952); Hepatitis B, elevated total cholesterol, and reduced antithrombin III activity were identified as risk factors for the progression of cirrhosis to malignant liver tumours (OR = 4.080, OR 95% CI = 2.443–6.814, OR = 2.308, OR 95% CI = 1.132–4.707, OR = 2.982, OR 95% CI = 1.389–6.402) (P < 0.05), as shown in Table 2.

Table 2
www.frontiersin.org

Table 2. Analysis of risk factors for hepatocellular carcinoma in patients with liver cirrhosis.

3.3 LASSO regression model

The 97 collected laboratory test results were incorporated into the LASSO model. After 100 rounds of cross-validation, the minimum average error lambda value was determined as lambda.min=0.008638309. This lambda value was then applied to the LASSO model, ultimately identifying 14 variables most significantly associated with the progression from liver cirrhosis to malignant liver tumours. These variables are: monoamine oxidase, alkaline phosphatase, platelet distribution width, C-reactive protein, creatine kinase isoenzymes, total calcium, phosphorus, total cholesterol, alpha-fetoprotein, free thyroxine, squamous cell carcinoma-associated antigen, and antithrombin III activity. See Figures 1, 2, and Table 3.

Figure 1
Plot showing the coefficients of multiple linear regression models against the logarithm of the regularization parameter Lambda. As Lambda increases from left to right, many coefficients converge towards zero, indicating more regularization.

Figure 1. Variable selection in the LASSO-based predictive model 1.

Figure 2
A line graph illustrating the relationship between Log(lambda) and AUC, with red dots indicating data points. The AUC values range from 0.55 to 0.75, showing a fluctuating pattern across different lambda values, particularly around Log(lambda) values of negative ten to negative four, with variability depicted by vertical error bars.

Figure 2. Variable selection in the LASSO-based predictive model 2.

Table 3
www.frontiersin.org

Table 3. Coefficients and odds ratios for variables selected in the LASSO model prediction.

3.4 Construction of Bayesian network models

Based on variables selected via LASSO regression and combined with risk factor analysis results, an initial Bayesian network disease prediction model comprising 15 nodes and 12 directed edges was constructed. The conditional probabilities for each node are shown in Figure 3. By constructing a decision tree model, relationships among some variables were identified (see Figure 4). Integrating this with the Bayesian network model from Figure 3, the set.arc function was employed to add edges, yielding a Bayesian network model comprising 15 nodes and 17 directed edges (see Figure 5). Figure 5 demonstrates that variables including gender, hepatitis type, disease duration, free thyroxine, antithrombin III activity, squamous cell carcinoma-associated antigen, C-reactive protein, monoamine oxidase, and total calcium establish connections with the progression of liver cirrhosis to malignant liver tumours through complex network relationships. Disease duration, antithrombin III activity, squamous cell carcinoma-associated antigen, C-reactive protein, and total cholesterol are directly associated with the occurrence of hepatic malignancy; gender, free thyroxine, hepatitis type, and total calcium exert an indirect influence on the development of hepatic malignancy.

Figure 3
Flowchart depicting relationships between medical variables. Starting with “Ca,” arrows lead to “CRP” and “TC,” then branching to “BMI,” “P,” “PT,” and others, ending with “Group.” “Gender,” “Types of hepatitis,” and “Course of the disease” are shown separately leading into “Group.” “TT” and “FT4” are isolated.

Figure 3. Bayesian network model.

Figure 4
Flowchart showing decision-making for liver cirrhosis and hepatocellular carcinoma (HCC) based on hepatitis types, free thyroxine levels, and antithrombin III levels. It displays percentages for cirrhosis and HCC in various scenarios.

Figure 4. Decision tree model.

Figure 5
Flowchart visualizing data distributions across various medical parameters with labeled nodes. Categories include gender, type, FT4, DOI, AT3, SCC, Ca, CRP, TC, PT, P, BMI, MAO, TT, and group. Each node lists percentage values with corresponding bar graphs, indicating distributions such as male-female ratio, low/normal/high levels, and specific medical categories. Arrows connect relevant nodes, illustrating relationships and transitions between categories.

Figure 5. Bayesian network risk prediction model for hepatocellular carcinoma in patients with liver cirrhosis.

3.5 Model inference

Sensitivity analysis revealed the probability of progression from liver cirrhosis to hepatocellular carcinoma under specific conditions: when AT3 levels were normal, the probability was 0.0749 (7.5%); when AT3 was low, it was 0.0709 (7.1%); and when AT3 was high, the probability increased to 0.0850 (8.5%). Regarding TC levels, the probability was 0.0763 (7.6%) when TC was normal, but rose significantly to 0.1309 (13.1%) when TC was elevated. The Netica node cumulative inference results showed that for a cirrhosis patient with hepatitis B, whose antithrombin III activity was at a low level and squamous cell carcinoma antigen, total cholesterol, and C-reactive protein were at high levels, the probability of progressing to hepatic malignancy was 52.1%. However, if antithrombin III activity, squamous cell carcinoma antigen, total cholesterol, and C-reactive protein were maintained at normal levels, the probability of developing hepatic malignancy would decrease to 7.5%, as shown in Figure 6.

Figure 6
Flowchart displaying two datasets comparing health parameters. Top section shows data like gender distribution, various health markers, and types like AH, HAV, HBV, and outcome groups HCC and Cirrhosis with percentages. Bottom section mirrors this with different percentage values. Arrows indicate relationships between parameters.

Figure 6. Bayesian network prediction model inference for hepatocellular carcinoma development in patients with liver cirrhosis.

3.6 Model evaluation

The results showed that the performance of the Bayesian network model (AUC = 0.857, Brier Score = 0.052, Accuracy = 0.940, C-index (95% CI)=0.833(0.784–0.882), Sensitivity = 0.836, Specificity = 0.744) was superior to that of the logistic regression model (AUC = 0.780, Brier Score = 0.056, Accuracy = 0.934, C-index (95% CI) = 0.785 (0.726–0.843), Sensitivity = 0.727, Specificity = 0.766), as detailed in Table 4 and Figure 7. The calibration curve results indicated that the predicted points of the Bayesian network model were evenly distributed around the diagonal line, suggesting good predictive performance (see Figure 8). Ten-fold cross-validation was performed, and the results showed that in nine out of the ten validation runs, the accuracy exceeded 0.9, with an average accuracy of 0.93 (see Table 5).

Table 4
www.frontiersin.org

Table 4. Comparison of diagnostic performance between Bayesian network model and logistic regression model.

Figure 7
ROC curve comparison graph displaying sensitivity versus specificity. Two models are compared: Bayesian (red line) and Logistic (blue line). The graph includes a diagonal reference line indicating random performance.

Figure 7. Comparison of the area under the ROC curve between the two models.

Figure 8
Calibration curves compare logistic regression (blue) and Bayesian Network (red) models by plotting predicted probability against observed proportion. The dashed line represents perfect calibration. Circle sizes indicate sample sizes of 100, 200, 300, and 400.

Figure 8. Comparison of the calibration curves between the two models.

Table 5
www.frontiersin.org

Table 5. Results of k-fold (10-fold) cross-validation.

3.7 Model prediction threshold and risk stratification strategy for the Bayesian network model

Based on the predicted values, the ROC curve of the Bayesian network model was established (see Figure 9). The area under the curve (AUC) was 0.857. After balancing sensitivity and specificity, the optimal threshold was determined by maximizing the Youden index (0.052). A predicted probability > 0.052 indicates progression from liver cirrhosis to hepatic malignancy. Based on the ROC curve, threshold 1 was set at 0.2 and threshold 2 at 0.8 to establish a risk stratification system: low risk (predicted probability < 0.2), intermediate risk (0.2 ≤ predicted probability ≤ 0.8), and high risk (predicted probability > 0.8). Accordingly, 318 patients were classified as low risk, 42 as intermediate risk, and 1 as high risk (see Table 6). Monitoring frequency recommendations are as follows: low-risk patients: annual routine check-ups; intermediate-risk patients: regular monitoring every 6 months; high-risk patients: close monitoring every 3 months, with consideration for further diagnostic procedures. The relationships between risk strata and predicted probabilities, risk strata and actual cancer status, and the predicted probability distribution are shown in Figure 10.

Figure 9
ROC curve chart showing Sensitivity versus Specificity. A blue line represents the model performance with an AUC (Area Under Curve) of 0.857. The curve slopes steeply upwards and then levels off, indicating good model discrimination ability.

Figure 9. ROC curve of the Bayesian network model.

Table 6
www.frontiersin.org

Table 6. Risk stratification of the Bayesian network model.

Figure 10
Three charts illustrate the prediction and proportion of cancer cases by risk level. The top left box plot shows predicted probability distribution for low, medium, and high risk levels. The top right bar chart depicts the proportion of cancer and non-cancer cases at each risk level, with increased cancer cases at higher risk levels. The bottom left histogram represents the distribution of predicted probabilities with a threshold of 0.05, showing most low probabilities relate to non-cancer cases.

Figure 10. The relationship between risk strata and predicted probabilities, the relationship between risk strata and actual cancer status, and the predicted probability distribution.

4 Discussion

Hepatocellular carcinoma exhibits high mortality rates and poor prognosis, constituting a major disease affecting human health and quality of life. In China, over 80% of hepatocellular carcinoma cases develop from cirrhosis. Consequently, close monitoring of cirrhosis patients to prevent progression to hepatocellular carcinoma is paramount. This study enrolled 1,204 cirrhosis patients, among whom 76 progressed to hepatocellular carcinoma. Male cirrhosis patients exhibited a higher probability of developing hepatocellular carcinoma than female patients, a finding consistent with the gender incidence ratio observed in China, South Korea, and Japan (8). This may be related to the fact that men engage in unhealthy lifestyle habits such as smoking and excessive alcohol consumption more frequently than women (9, 10). A single-centre cohort study in the United States indicates that men are diagnosed with liver cancer at a later stage than women, presenting with larger tumours and a greater number of lesions. This may be attributed to men undergoing routine screening for liver cancer less frequently. Consequently, it is recommended that male patients with hepatitis, cirrhosis, and even those in good health undergo regular health check-ups to facilitate early detection and diagnosis of liver cancer. Prompt treatment is essential to enhance therapeutic outcomes. At the same time, men should place greater emphasis on lifestyle and dietary habits, refraining from smoking and excessive drinking, adopting a healthy diet, engaging in regular exercise, and enhancing their physical fitness.

This study found that patients with cirrhosis complicated by hepatitis B virus infection exhibited a higher risk of developing hepatocellular carcinoma compared to other patients. Furthermore, multivariate analysis revealed that the presence of hepatitis B virus infection constitutes a risk factor for the progression of cirrhosis to hepatocellular carcinoma. In China, the incidence of hepatitis B-related hepatocellular carcinoma remains persistently high, with studies predicting an increasing trend in this incidence over the forthcoming period (11, 12). Since the formulation of the Guidelines for the Prevention and Treatment of Chronic Hepatitis B in 2005, the prevalence of hepatitis B in China has shown a marked downward trend (13). Nevertheless, China remains one of the countries bearing the heaviest global burden of hepatitis B virus infection (14), posing an extremely serious threat to the lives and property of its people. Consequently, the prevention and control of hepatitis B cannot be delayed. Regular testing for hepatitis B antibodies and timely vaccination with the hepatitis B vaccine are effective methods for preventing infection with the hepatitis B virus. Rejecting drugs, avoiding the sharing of syringes, and maintaining healthy lifestyle and dietary habits to boost immunity can all effectively prevent hepatitis B infection.

Among hepatitis patients, the longer the duration of illness, the greater the likelihood of developing hepatocellular carcinoma. This finding is consistent with the results of a Chinese age-time cohort study on hepatocellular carcinoma (10). Rocio I.R. Macias et al. (15) propose that with increasing age and disease duration, factors such as genomic instability, telomere attrition, epigenetic alterations, impaired protein homeostasis, and mitochondrial dysfunction collectively promote the development of hepatocellular carcinoma. The liver possesses remarkable regenerative potential; however, over time, the accumulation of risk factors can lead to irreversible damage to liver function. Consequently, early detection, diagnosis, and treatment are essential measures for reducing the incidence of liver cancer. National and societal bodies should establish comprehensive liver cancer surveillance systems to promptly identify high-risk populations. Communities and hospitals must undertake public education responsibilities, providing health awareness campaigns to inform the public about the dangers of liver cancer and the necessity of timely diagnosis and treatment.

The relationship between serum cholesterol and the risk of hepatocellular carcinoma remains controversial. Findings from studies by Cai X et al. indicate a negative correlation between serum cholesterol levels and the incidence of hepatocellular carcinoma (16, 17), whereas other research suggests that elevated cholesterol levels may contribute to the development of this malignancy (1820). Takuma Tsuchida et al. (21) observed in mouse studies that a high-cholesterol diet readily induces non-alcoholic fatty liver disease, subsequently leading to hepatocellular carcinoma. Further research indicates that dietary cholesterol drives hepatocellular carcinoma development by inducing alterations in the gut microbiota and metabolites of mice (19). This study found that patients with elevated cholesterol levels also exhibited a higher incidence of liver cancer. Analysis of influencing factors similarly indicated that elevated cholesterol constitutes a risk factor for the progression of cirrhosis to liver cancer, potentially linked to these patients’ high-cholesterol diets. Patients with cirrhosis require adequate nutrition to control disease progression, yet excessive cholesterol intake proves counterproductive. Given that high-fat, high-cholesterol foods constitute a significant proportion of the Xinjiang diet, promoting balanced nutrition and increasing physical activity are essential strategies for preventing the occurrence of liver cancer.

Antithrombin III, synthesised by the liver, vascular endothelial cells and megakaryocytes, is a protease inhibitor in plasma with anticoagulant properties. It can inhibit hepatitis viruses and reduce the likelihood of developing cirrhosis and hepatocellular carcinoma by modulating multiple host cell signalling factors associated with these conditions (22). It may also regulate the development of hepatocellular carcinoma by inhibiting thrombin (22), and can prevent the metastasis of cancer cells (23). This study has identified reduced antithrombin III activity as a risk factor for the progression of liver cirrhosis to hepatocellular carcinoma. This may be attributed to the more severe hepatic damage in cirrhotic patients, which diminishes their capacity to produce antithrombin III. Consequently, plasma antithrombin III levels decline, impairing the body’s ability to inhibit hepatocellular carcinoma development. Consequently, close monitoring of coagulation function is essential in daily life for cirrhosis patients. Regular medical examinations and adherence to prescribed medication are crucial for restoring antithrombin III activity. Concurrently, appropriate supplementation with iron- and folate-rich foods—such as animal blood, liver, and egg yolks—is advised to correct anaemia.

Bayesian networks can illustrate the relationship between various influencing factors and the progression of cirrhosis to hepatocellular carcinoma. The model indicates that, apart from phosphorus levels, BMI, thrombin time, and prothrombin time, other factors are associated with the development of hepatocellular carcinoma through direct or indirect pathways. Gender influences the type and duration of hepatitis, thereby contributing to HCC development. A 2019 study on the demographic characteristics of hepatitis B virus infection in China indicated that males are more susceptible to viral hepatitis than females (24). Patients with hepatitis B virus infection typically experience more severe disease and greater treatment challenges; the likelihood of developing HCC increases with disease progression.

This study further revealed that hepatitis type can influence free thyroxine levels, thereby affecting antithrombin III activity and ultimately contributing to hepatocellular carcinoma. Research indicates that nearly half of patients with viral hepatitis undergoing interferon therapy develop thyroid disorders, presenting as hypothyroidism (25). Prolonged hypothyroidism increases the risk of hepatocellular carcinoma (26), potentially through hypothyroidism-induced coagulation disorders leading to haemorrhage (27).

Squamous cell carcinoma antigen is a serine protease inhibitor present in normal squamous epithelium. It correlates with the invasion, metastasis, recurrence, and prognosis of squamous cell malignancies, serving as a crucial tumour marker reflecting the biological characteristics of squamous cell carcinoma. Research indicates elevated SCC-related antigen levels in the serum of hepatocellular carcinoma patients, whereas non-hepatocellular carcinoma patients exhibit lower serum SCC-related antigen concentrations (28). The Bayesian network model indicates a pathway from antithrombin III to SCC-A. However, no direct evidence currently exists demonstrating that antithrombin III activity directly influences SCC-A. This discrepancy may explain the elevated SCC-A levels observed in patients with abnormal antithrombin III activity in the present study.

Research indicates (29) that alterations in calcium ion concentration may influence the production and secretion of C-reactive protein. Under normal circumstances, hypocalcaemia (low blood calcium levels) may be associated with elevated C-reactive protein levels. This occurs because hypocalcaemia can stimulate the release of certain cytokines, thereby promoting C-reactive protein production. C-reactive protein is a non-specific inflammatory marker serving as an indicator of sensitivity and specificity in predicting tumour development (30). Existing research has identified an association between C-reactive protein and the occurrence of hepatocellular carcinoma, with higher C-reactive protein levels correlating with increased risk of liver cancer (31, 32). This aligns with the pathway demonstrated in this study’s model, wherein serum total calcium influences C-reactive protein, which in turn affects hepatocellular carcinoma development.

This study examined 1,204 patients with liver cirrhosis admitted to our hospital between 2019 and 2023, identifying 76 cases where cirrhosis progressed to hepatocellular carcinoma. Multivariate analysis revealed that hepatitis B infection, elevated total cholesterol levels, and reduced antithrombin III activity constitute risk factors for the development of malignant liver tumours in cirrhotic patients. Using the LASSO regression model for variable selection, fourteen variables most significantly associated with the progression from cirrhosis to malignant liver tumours were identified. These selected variables were incorporated into a Bayesian network. Combining literature evidence and decision tree model results, a risk prediction model for the progression from cirrhosis to hepatocellular carcinoma was constructed. In summary, gender, hepatitis type, disease duration, total calcium, C-reactive protein, total cholesterol, antithrombin III activity, free thyroxine, and carcinoembryonic antigen are closely associated with the progression of liver cirrhosis to hepatocellular carcinoma. This study has certain limitations. Firstly, the data utilised in this investigation was sourced from a single centre, namely the Xinjiang Uygur Autonomous Region People’s Hospital. Consequently, the distribution of study subjects may lack representativeness. Future research should involve collaboration with multiple hospitals across the region to enhance the representativeness of the study population. Secondly, all variables included in this study were extracted from patients’ hospital admission records and inpatient examination results, without considering variables such as lifestyle habits or dietary patterns. This may introduce bias into the model results. Questionnaire surveys or similar methods should be incorporated to broaden the scope of variables covered. Finally, the progression from cirrhosis to hepatocellular carcinoma is a protracted process. This study’s five-year observation period yielded a relatively small number of patients who developed hepatocellular carcinoma, potentially reducing the model’s accuracy. Consequently, subsequent research should combine retrospective and prospective studies to extend the observation period and minimise research error.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by People’s Hospital of Xinjiang Uygur Autonomous Region. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

NM: Validation, Data curation, Methodology, Writing – original draft. JS: Writing – review & editing. YY: Supervision, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. Natural Science Foundation of Xinjiang Uygur Autonomous Region (2024D01C84) and the “Tianshan Talent” Medical and Health High-level Talent Training Program (TSYC202301A063).

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

PubMed Abstract | Crossref Full Text | Google Scholar

2. Rumgay H, Arnold M, Ferlay J, Lesi O, Cabasag CJ, Vignat J, et al. Global burden of primary liver cancer in 2020 and predictions to 2040. J Hepatol. (2022) 77:1598–606. doi: 10.1016/j.jhep.2022.08.021

PubMed Abstract | Crossref Full Text | Google Scholar

3. Zhang X, Guan L, Tian H, Zeng Z, Chen J, Huang D, et al. Risk factors and prevention of viral hepatitis-related hepatocellular carcinoma. Front Oncol. (2021) :686962. doi: 10.3389/fonc.2021.686962

PubMed Abstract | Crossref Full Text | Google Scholar

4. Sayiner M, Golabi P, and Younossi ZM. Disease burden of hepatocellular carcinoma: A global perspective. Dig Dis Sci. (2019) 64:910–7. doi: 10.1007/s10620-019-05537-2

PubMed Abstract | Crossref Full Text | Google Scholar

5. Perisetti A, Goyal H, Yendala R, Thandassery RB, and Giorgakis E. Non-cirrhotic hepatocellular carcinoma in chronic viral hepatitis: Current insights and advancements. World J Gastroenterol. (2021) 27:3466–82. doi: 10.3748/wjg.v27.i24.3466

PubMed Abstract | Crossref Full Text | Google Scholar

6. Zhou J, Sun H, Wang Z, Cong W, Wang J, Zeng M, et al. Guidelines for the diagnosis and treatment of hepatocellular carcinoma (2019 edition). Liver Cancer. (2020) 9:682–720. doi: 10.1159/000509424

PubMed Abstract | Crossref Full Text | Google Scholar

7. Yoshiji H, Nagoshi S, Akahane T, Asaoka Y, Ueno Y, Ogawa K, et al. Evidence-based clinical practice guidelines for Liver Cirrhosis 2020. J Gastroenterol. (2021) 56:593–619. doi: 10.1007/s00535-021-01788-x

PubMed Abstract | Crossref Full Text | Google Scholar

8. Dee EC, Laversanne M, Bhoo-Pathy N, Ho FDV, Feliciano EJG, Eala MAB, et al. Cancer incidence and mortality estimates in 2022 in southeast Asia: a comparative analysis. Lancet Oncol. 26:516–28. doi: 10.1016/S1470-2045(25)00017-8

PubMed Abstract | Crossref Full Text | Google Scholar

9. El-Serag HB and Kanwal F. Epidemiology of hepatocellular carcinoma in the United States: where are we? Where do we go? Hepatology. (2014) 60:1767–75. doi: 10.1002/hep.27222

PubMed Abstract | Crossref Full Text | Google Scholar

10. Richmond R, Kehoe L, Heather N, Wodak A, and Webster I. General practitioners’ promotion of healthy life styles: what patients think. Aust N Z J Public Health. (1996) 20:195–200. doi: 10.1111/j.1753-6405.1996.tb01818.x

PubMed Abstract | Crossref Full Text | Google Scholar

11. Zheng Y, Wu J, Ding C, Xu K, Yang S, and Li L. Disease burden of chronic hepatitis B and complications in China from 2006 to 2050: an individual-based modeling study. Virol J. (2020) 17:132. doi: 10.1186/s12985-020-01393-z

PubMed Abstract | Crossref Full Text | Google Scholar

12. Meng L, Yang R, Su N, Shen T, Qu X, and Sun H. Spatial epidemiological analysis of the burden of liver cancer in China. Transl Cancer Res. (2024) 13:363–70. doi: 10.21037/tcr-23-1240

PubMed Abstract | Crossref Full Text | Google Scholar

13. Yan R, Sun M, Yang H, Du S, Sun L, and Mao Y. 2024 latest report on hepatitis B virus epidemiology in China: current status, changing trajectory, and challenges. Hepatobil Surg Nutr. (2025) 14:66–77. doi: 10.21037/hbsn-2024-754

PubMed Abstract | Crossref Full Text | Google Scholar

14. Polaris Observatory Collaborators. Global prevalence, treatment, and prevention of hepatitis B virus infection in 2016: a modelling study. Lancet Gastroenterol Hepatol. (2018) 3:383–403. doi: 10.1016/S2468-1253(18)30056-6

PubMed Abstract | Crossref Full Text | Google Scholar

15. Macias RIR, Monte MJ, Serrano MA, González-Santiago JM, Martín-Arribas I, Simão AL, et al. Impact of aging on primary liver cancer: epidemiology, pathogenesis and therapeutics. Aging (Albany NY). (2021) 13:23416–34. doi: 10.18632/aging.203620

PubMed Abstract | Crossref Full Text | Google Scholar

16. Cai X, Peng S, Xiao X, Huang Z, and Zhang P. Serum ApoB/ApoA1 ratio in patients with CHB and the occurrence of HBV related cirrhosis and HBV related hepatocellular carcinoma. Sci Rep. (2024) 14:10996. doi: 10.1038/s41598-024-61820-x

PubMed Abstract | Crossref Full Text | Google Scholar

17. Kasmari AJ, Welch A, Liu G, Leslie D, McGarrity T, and Riley T. Independent of cirrhosis, hepatocellular carcinoma risk is increased with diabetes and metabolic syndrome. Am J Med. (2017) 130:746.e1–7. doi: 10.1016/j.amjmed.2016.12.029

PubMed Abstract | Crossref Full Text | Google Scholar

18. Wang Y, Wang J, Li X, Xiong X, Wang J, Zhou Z, et al. N1-methyladenosine methylation in tRNA drives liver tumourigenesis by regulating cholesterol metabolism. Nat Commun. (2021) 12:6314. doi: 10.1038/s41467-021-26718-6

PubMed Abstract | Crossref Full Text | Google Scholar

19. Zhang X, Coker OO, Chu ES, Fu K, Lau HCH, Wang YX, et al. Dietary cholesterol drives fatty liver-associated liver cancer by modulating gut microbiota and metabolites. Gut. (2021) 70:761–74. doi: 10.1136/gutjnl-2019-319664

PubMed Abstract | Crossref Full Text | Google Scholar

20. Liang JQ, Teoh N, Xu L, Pok S, Li X, Chu ESH, et al. Dietary cholesterol promotes steatohepatitis related hepatocellular carcinoma through dysregulated metabolism and calcium signaling. Nat Commun. (2018) 9:4490. doi: 10.1038/s41467-018-06931-6

PubMed Abstract | Crossref Full Text | Google Scholar

21. Tsuchida T, Lee YA, Fujiwara N, Ybanez M, Allen B, Martins S, et al. A simple diet- and chemical-induced murine NASH model with rapid progression of steatohepatitis, fibrosis and liver cancer. J Hepatol. (2018) 69:385–95. doi: 10.1016/j.jhep.2018.03.011

PubMed Abstract | Crossref Full Text | Google Scholar

22. Asmal M, Seaman M, Lin W, Chung RT, Letvin NL, and Geiben-Lynn R. Inhibition of HCV by the serpin antithrombin III. Virol J. (2012) 9:226. doi: 10.1186/1743-422X-9-226

PubMed Abstract | Crossref Full Text | Google Scholar

23. Meyer-Siegler KL, Cox J, Leng L, Bucala R, and Vera PL. Macrophage migration inhibitory factor anti-thrombin III complexes are decreased in bladder cancer patient serum: Complex formation as a mechanism of inactivation. Cancer Lett. (2010) 290:49–57. doi: 10.1016/j.canlet.2009.08.025

PubMed Abstract | Crossref Full Text | Google Scholar

24. Meng TT, Miao N, Wang FZ, Zheng H, Yin ZD, Liang XF, et al. Analysis on hepatitis B cases reported from surveillance points in China, 2019. Zhonghua Liu Xing Bing Xue Za Zhi. (2021) 42:1532–6. doi: 10.3760/cma.j.cn112338-20210319-00233

PubMed Abstract | Crossref Full Text | Google Scholar

25. Menconi F, Hasham A, and Tomer Y. Environmental triggers of thyroiditis: hepatitis C and interferon-α. J Endocrinol Invest. (2011) 34:78–84. doi: 10.1007/BF03346699

PubMed Abstract | Crossref Full Text | Google Scholar

26. Lin YH, Lin KH, and Yeh CT. Thyroid hormone in hepatocellular carcinoma: cancer risk, growth regulation, and anticancer drug resistance. Front Med (Lausanne). (2020) 7:174. doi: 10.3389/fmed.2020.00174

PubMed Abstract | Crossref Full Text | Google Scholar

27. Anagnostis P, Efstathiadou ZA, Slavakis A, Selalmatzidou D, Poulasouchidou M, Katergari S, et al. The effect of L-thyroxine substitution on lipid profile, glucose homeostasis, inflammation and coagulation in patients with subclinical hypothyroidism. Int J Clin Pract. (2014) 68:857–63. doi: 10.1111/ijcp.12394

PubMed Abstract | Crossref Full Text | Google Scholar

28. Yu J, Wang ZJ, Chen LH, and Dong WZ. Diagnostic value of serum squamous cell carcinoma antigen for hepatocellular carcinoma: a systematic review and meta-analysis. Scand J Clin Lab Invest. (2017) 77:8–14. doi: 10.1080/00365513.2016.1238504

PubMed Abstract | Crossref Full Text | Google Scholar

29. Kong F, You H, Zheng K, Tang R, and Zheng C. The crosstalk between pattern-recognition receptor signaling and calcium signaling. Int J Biol Macromol. (2021) 192:745–56. doi: 10.1016/j.ijbiomac.2021.10.014

PubMed Abstract | Crossref Full Text | Google Scholar

30. She S, Xiang Y, Yang M, Ding X, Liu X, Ma L, et al. C-reactive protein is a biomarker of AFP-negative HBV-related hepatocellular carcinoma. Int J Oncol. (2015) 47:543–54. doi: 10.3892/ijo.2015.3042

PubMed Abstract | Crossref Full Text | Google Scholar

31. Song M, Liu T, Liu H, Zhang Q, Zhang Q, Wang Y, et al. Association between metabolic syndrome, C-reactive protein, and the risk of primary liver cancer: a large prospective study. BMC Cancer. (2022) 22:853. doi: 10.1186/s12885-022-09939-w

PubMed Abstract | Crossref Full Text | Google Scholar

32. Siyin ST, Liu T, Li W, Yao N, Xu G, Qu J, et al. A prospective follow-up study of the relationship between high-sensitivity C-reactive protein and primary liver cancer. BMC Cancer. (2020) 20:1168. doi: 10.1186/s12885-020-07665-9

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Bayesian network mode, cirrhosis, hepatocellular carcinoma, influencing factors, risk prediction

Citation: Ma N, Song J and Yang Y (2026) Construction of a Bayesian network-based risk prediction model for hepatocellular carcinoma in cirrhotic patients. Front. Oncol. 15:1735042. doi: 10.3389/fonc.2025.1735042

Received: 30 October 2025; Accepted: 19 December 2025; Revised: 14 December 2025;
Published: 12 January 2026.

Edited by:

Gang Ye, Sichuan Agricultural University, China

Reviewed by:

Yuanyin Teng, Zhejiang University, China
Siqi Zhao, National Institute of Dental and Craniofacial Research (NIH), United States

Copyright © 2026 Ma, Song and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuqing Yang, NDU1NTkyMjY5QHFxLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.