- 1Department of Pancreatic Center, Guangdong Provincial Hospital of Chinese Medicine, Guangzhou, Guangdong, China
- 2Department of Hepatobiliary and Pancreatic Surgery, The Second Clinical Medical College of Jinan University, Shenzhen People’s Hospital, Shenzhen, Guangdong, China
Objective: This study aims to develop models for predicting microvascular invasion (MVI) in hepatocellular carcinoma (HCC) patients prior to surgery using two-dimensional (2D) and three-dimensional (3D) radiomics features from contrast-enhanced computed tomography (CT). The study compares the predictive performance of various models and explores the potential of radiomics to capture tumor spatial heterogeneity.
Materials and methods: A total of 150 hepatocellular carcinoma (HCC) patients who underwent contrast-enhanced CT examination and curative resection were included in this study. 2D features from the largest cross-sectional slice, as well as 3D radiomic features, were extracted from the non-contrast (NC), arterial phase (AP), portal venous phase (PVP), and balanced phase (BP) images. Feature selection was performed using the least absolute shrinkage and selection operator (LASSO) algorithm, and predictive models were constructed using logistic regression and XGBoost machine learning algorithms. The predictive performance of the models was evaluated using the area under the receiver operating characteristic curve (AUC).
Results: The 2D BP model (AUC = 0.801) and 3D PVP model (AUC = 0.876) showed superior performance among single-sequence models. The 2D multi-sequence model (AUC = 0.851) outperformed the 3D combined model (AUC = 0.811). Radiomics-based models outperformed clinical feature-based models, and combining radiomics scores with clinical features improved prediction accuracy. However, 3D models did not significantly outperform 2D models.
Conclusion: Both 2D and 3D radiomics models are effective for predicting MVI in HCC patients preoperatively. While the 3D model captures spatial heterogeneity, the 2D model excels at capturing local texture features. This study provides new insights into radiomics in HCC, contributing to its clinical application and standardization.
Introduction
Hepatocellular carcinoma (HCC) ranks as the sixth most common malignancy and the third leading cause of cancer-related mortality worldwide (1). The World Health Organization predicts that there will be over 1 million new liver cancer cases by 2025 (2). Surgical resection remains the most effective treatment for early-stage HCC; however, postoperative recurrence occurs in up to 70% of patients, significantly impacting long-term survival (3). Predicting HCC recurrence preoperatively and implementing targeted interventions are pressing and clinically significant.
Plenty of studies have demonstrated that microvascular invasion (MVI) is an independent risk factor for postoperative recurrence of HCC (4). As a highly vascularized tumor characterized by dual blood supply, HCC is prone to MVI, which represents the most critical pathological mechanism underlying postoperative recurrence and metastasis. MVI is also one of the key factors in predicting HCC recurrence (5). Furthermore, MVI is associated with patient prognosis and serves as an independent risk factor for both postoperative survival and extrahepatic metastasis in HCC patients (6).
In recent years, with the deepening of research on MVI and advances in radiomics, the precise preoperative prediction of MVI has become feasible. Radiomics, an emerging field, was first introduced in 2012 by Dutch scholar Lambin et al. (7). Over the past decade, radiomics has rapidly developed to be a bridge between medical imaging and precision medicine.
Enhanced computed tomography (CT) is one of the most commonly employed diagnostic tools for preoperative evaluation in HCC patients. Prediction models based on radiomic features derived from enhanced CT images have been widely recognized for their efficiency in preoperatively predicting MVI in HCC (8, 9). However, there are discrepancies in specific methodologies and detailed approaches in existing studies.
In radiomics-based region of interest (ROI) delineation, two main methods are used: contouring all tumor-involved slices [three-dimensional(3D)] and contouring only the single axial slice with the largest tumor area [two-dimensional(2D)]. The advantage of 3D ROI is evident, as it enables comprehensive analysis of all tumor cell populations, including spatial distribution within the tumor, which has been the predominant method in most previous studies. In contrast, 2D ROI provides clearer tumor boundaries and greater reproducibility (10). Moreover, 2D ROI is easier to obtain and requires significantly less workload and computational resources compared to 3D ROI. Given the extensive spatial heterogeneity observed within HCC tumors (11), it remains unclear whether single-layer images representing the largest tumor cross-sectional area can adequately reflect the overall tumor characteristics in radiomics studies. To date, no studies in the field of HCC have reported performance differences between 2D and 3D feature-based models, and further investigations are warranted.
The aim of this study was to develop radiomics-based predictive models using 2D and 3D features derived from different phases of enhanced CT imaging to preoperatively predict MVI in HCC patients. Additionally, the study sought to compare the predictive performance of these models and explore whether radiomics can capture tumor spatial heterogeneity. This work aimed to standardize the workflow of radiomics research in HCC and address the methodological inconsistencies observed in prior radiomics studies.
Materials and methods
Patient selection
This retrospective study utilized clinical information and CT images obtained from the electronic medical record system and imaging data system of Shenzhen People’s Hospital. The study was approved by the Institutional Review Board of Shenzhen People’s Hospital, with a waiver of informed consent. Consecutive patients who underwent contrast-enhanced CT scans at our hospital between July 2015 and June 2022 were included. The sample size was determined based on feasibility. All patients who met the eligibility criteria were retrospectively collected. The inclusion criteria required that all imaging data be uniformly reformatted using a B-spline interpolation resampling algorithm (12)with a voxel size of 1.0 mm × 1.0 mm × 1.0 mm. The exclusion criteria were as follows: (1) patients receiving preoperative antitumor treatments; (2) patients with a history of recurrent HCC or concurrent primary tumors; (3) patients with distant metastases associated with HCC; (4) poor image quality; and (5) incomplete clinical or pathological information.
“Poor image quality” was defined as the presence of significant motion artifacts, low signal-to-noise ratio, incomplete coverage of the region of interest (ROI), or other technical issues that hindered accurate segmentation and feature extraction. Image quality was independently assessed by two experienced radiologists, and any discrepancies were resolved via consensus.
After screening, a total of 150 HCC patients were included in the study (120 males and 30 females, mean age: 57 years, range: 29–84 years), comprising 46 MVI-positive cases and 104 MVI-negative cases. Clinical characteristics, including gender, age, carcinoembryonic antigen (CEA), alpha fetoprotein (AFP), CA125, and 44 other variables, were extracted from the electronic medical record system. Patients were randomized to the training and validation cohorts in a 70:30 ratio. Demographic and clinical characteristics of all patients are summarized in Table 1.
CT imaging and ROI segmentation
All CT examinations were performed using spiral CT scanners from the Radiology Department of Shenzhen People’s Hospital, with identical scanning parameters. The imaging protocol adhered to the standards recommended by the LI-RADS guidelines (13). 3D manual segmentation was conducted using 3D Slicer software by a hepatobiliary surgeon with nine years of clinical experience. ROIs were delineated along the visible boundaries of the lesions in all non-contrast (NC), arterial phase (AP), portal venous phase (PVP), and balanced phase (BP) images, encompassing the entire lesion volume. Incorporating peritumoral regions into the ROIs has been demonstrated in multiple studies to enhance the predictive performance of radiomics models (8, 14). Accordingly, a 3-mm margin was automatically expanded using 3D Slicer, with manual removal of any regions extending beyond the liver volume to ensure the final ROI encompassed the peritumoral area. The final segmentation results were validated by a senior hepatobiliary surgeon with 14 years of clinical experience. Subsequently, the software automatically generated 2D images of the tumor’s largest cross-sectional area. To assess intra-observer reproducibility, a subset of ROIs was randomly re-annotated by the same hepatobiliary surgeon on the imaging dataset. Intraclass correlation coefficient (ICC) analysis demonstrated good consistency between the two sets of annotations.
Radiomics feature extraction
Radiomics features were extracted using the Radiomics plugin package in 3D Slicer 5.4.0 for each phase: NC, AP, PVP, and BP. The B-spline interpolation resampling algorithm (12) was applied to standardize the image format, with a voxel spacing of 1.0 mm × 1.0 mm × 1.0 mm. The same extraction procedure was applied to wavelet-transformed derivative images (15) to enhance the dimensionality of the image data.
Each dataset comprised 1,130 features, with a one-to-one correspondence established to ensure comparability. These features included first-order shape features, first-order statistical features, second-order texture features, higher-order features, and wavelet-transformed features, quantitatively representing the imaging information of the corresponding tumor regions. Prior to further processing, the extracted features were dimensionless. Before further processing, all extracted features were standardized by non-dimensionalization. The Min-Max scaler function in Python was applied to normalize the training dataset, and the same scaling parameters were subsequently applied to the validation set. This procedure was employed to accelerate model training and to mitigate the potential impact of disproportionately scaled features on model performance.
Radiomics feature selection: embedded least absolute shrinkage and selection operator
Most of the extracted radiomic features were not associated with the outcome. LASSO was employed to select a small subset of features most relevant to the outcome from the vast number of extracted features. Additionally, among the outcome-associated features, multiple highly correlated features might exist. By introducing L1 regularization, LASSO reduces the regression coefficients of some features to zero, thereby automatically selecting key features and effectively preventing overfitting, addressing the issues arising from multicollinearity (16).
2D and 3D features from NC, AP, PVP, and BP images in the eight radiomic groups were selected via the LASSO algorithm. The most predictive features were selected for model construction. In addition, we combined all four imaging phases and separately identified the optimal features from the hybrid ROIs across 2D and 3D datasets. These selected features were then applied to the validation cohort to confirm model generalizability.
Model construction and evaluation
Eight predictive models were initially constructed using the 2D and 3D features selected from NC, AP, PVP, and BP. Both traditional logistic regression and XGBoost machine learning algorithms were used. To optimize the hyperparameters of the XGBoost model, a grid search strategy was employed. The parameter set yielding the highest mean AUC on the validation cohort was selected for the final model. The model’s predictive performance for MVI was evaluated using the receiver operating characteristic (ROC) curve and the area under the curve (AUC). Subsequently, a combined radiomics model was constructed, which incorporated the 2D and 3D features from all four phases.
Afterward, the optimal 2D and 3D radiomics-based models were selected from all the models. Radiomics scores were constructed using LASSO regression coefficients, and a combined model based on radiomics clinical features was further developed by combining the patient’s demographic and laboratory data, aiming to explore the best preoperative prediction approach for MVI. After the training phase, the model’s performance was evaluated in the test cohort. Shapley values (17) were then utilized to rank features and determine their contribution and importance. These results provided evidence for evaluating the intratumoral heterogeneity of HCC (Figure 1).

Figure 1. This flowchart illustrates the research process of predicting hepatocellular carcinoma (HCC) using CT radiomics. The study first collected preoperative contrast-enhanced CT data from 150 patients and performed tumor segmentation using 3D Slicer to label 2D and 3D regions. Then, shape, histogram, and texture features were extracted from both original and wavelet-filtered images. The most predictive features were selected using the LASSO method, and models were constructed using XGBoost and logistic regression with both single-modality (radiomics and clinical features) and multimodal models (combining 3D and 2D data). Finally, the model’s predictive performance was evaluated using ROC curves, heatmaps, and feature importance rankings.
In this study, all statistical analyses were performed using the R language (version 4.4.1). Data preprocessing and descriptive statistics were conducted using the tidyverse and CBCgrps packages. Feature selection was performed through LASSO regression via the glmnet package, while cross-validation was performed using the caret package. The prediction models were constructed using xgboost, while hyperparameter tuning and model evaluation were carried out through caret. Model performance was assessed via AUC, and internal validation and calibration analysis were performed using the rms package. Model interpretation was facilitated through SHAP values, while the models were visualized using pheatmap and ggplot2. The 10-fold cross-validation was used for all analyses to ensure the robustness and generalizability of the models.
Results
Patient characteristics
A total of 150 eligible patients were included in this study. Table 1 presents the statistical data and clinical characteristics of the training cohort (n = 106) and the testing cohort (n = 44). Histopathological examination revealed that the MVI status was balanced across the training and testing cohorts (p = 1).
Extracted radiomic features
A total of 1130 features were extracted from eight sets of ROIs across four sequences in both 2D and 3D configurations. An unsupervised clustering algorithm was employed to explore the potential correlations among these features. Heatmaps were subsequently generated to visualize the correlations between 2D and 3D regions, as shown in Figure 2.

Figure 2. Illustrates the correlation of features in NC, AP, PVP, and BP in both 2D and 3D imaging modalities, with red indicating strong correlations and blue representing the inverse relationship. The features analyzed include Shape, First Order, GLCM (Gray Level Co-occurrence Matrix), GLDM (Gray Level Dependence Matrix), GLRLM (Gray Level Run Length Matrix), GLSZM (Gray Level Size Zone Matrix), and NGTDM (Neighboring Gray Tone Difference Matrix).
Our results indicated that the overall correlation between 2D and 3D features was weak; however, a small number of features from all four sequences demonstrated strong correlations. The correlation was strongest during the balance phase of 2D and 3D features, whereas the correlation during the PVP was weakest. These findings provided further insight into the interpretation of subsequent prediction model outcomes.
Feature dimensionality reduction and selection
In the 2D ROI, seven, nine, six, and twelve MVI-related features were identified for the NC, AP, PVP, and BP, respectively. For the 3D ROI, the MVI-related features identified across these four phases were ten, six, six, and seven. Among the 63 features selected from the eight sets of imaging data, 41 were wavelet features.
Performance and evaluation of predictive models
Both logistic regression and machine learning methods were applied to all models. The performance of all 24 predictive models is summarized in Table 2, and the ROC curves of these models are depicted in Figure 3. In the models utilizing features extracted from single-sequence images in 16 cases, the 2D single-sequence model achieved the best AUC in the test set at the BP (logistic regression, AUC = 0.801, 95% CI: 0.663−0.94). The 3D single-sequence model exhibited the best AUC for the PVP (logistic regression, AUC = 0.876, 95% CI: 0.753−0.999).

Figure 3. In all 12 groups of 24 logistic regression and XGBoost model test sets, the optimal modeling method was selected. Figures NC/AP/PVP/BP demonstrate the discriminative ability of different single-sequence 2D and 3D models, while the 2D-3D figure presents the ROC curves of the 2D and 3D combined radiomics models. The final figure shows the ROC curves for the C: Clinical Indicator Model and C-R: Clinical Indicator and Radiomics Combined Model.
Logistic regression and XGBoost models were constructed using features extracted from all four sequences in both 2D and 3D imaging. The 2D combined model yielded an AUC of 0.851 (logistic regression, 95% CI: 0.726−0.976) in the test set, while the 3D combined model achieved an AUC of 0.811 (XGBoost, 95% CI: 0.648−0.975). These results indicated that both the 2D and 3D models demonstrated strong predictive performance for MVI. However, 3D radiomics did not provide a significant advantage and, in some cases, performed worse than the 2D model, particularly in the combined models.
Among the 44 different clinical features, LASSO regression identified four features (fibrinogen, CEA, AFP, and tumor diameter) for constructing a predictive model. The model based only on clinical features achieved an AUC of 0.71 in the test set (XGBoost, 95% CI: 0.535−0.886).
Finally, the 3D PVP and 2D BP models exhibiting the best performance were selected. These models were compressed into an imaging score, R-score, based on the λ coefficient. A predictive model was constructed that incorporated the R-score along with four clinical features identified. The clinical-radiomics combined model achieved an AUC of 0.818 in the test set (logistic regression, 95% CI: 0.654−0.981). Our findings suggested that while clinical features were able to predict MVI preoperatively, their predictive performance was inferior to that of radiomics. Furthermore, incorporating radiomics score and clinical features in models improved the predictive power. However, the combined model was not the most efficient.
Among the twelve groups of twenty-four models, eight logistic regression models outperformed others, while four XGBoost models demonstrated superior results. Overall, although XGBoost showed better performance in the training set, it did not exhibit an advantage over the logistic regression models in the validation.
Subsequently, Shapley analysis was performed on both the 2D and 3D combined models. Figure 4 presents the key radiomic features selected for constructing the models, along with their classification and contribution rankings. The results showed that the 2D combined model predominantly selected features from BP, whereas the 3D model favored features from the PVP. In contrast, among the single-sequence models, the performance for both 2D and 3D models was the best. Furthermore, there was significant feature overlap across different sequences, suggesting substantial collinearity among the selected radiomic features from various sequences. This indicated that the optimal features identified by LASSO largely originated from a single sequence. This finding contrasted with the results illustrated in the heatmap for the overall feature correlations.

Figure 4. Illustrates the effects of each feature value on the model output and the importance of the features that contribute most to predicting MVI, with the total contribution scores of all features summing to 1. The combined 2D and 3D models are used to interpret the selected features for model construction. The upper part of the figure shows the 2D combined model and the lower part shows the 3D combined model. The SHAP values along the x-axis are color-coded, with high positive SHAP values (red) indicating a positive correlation with the prediction of MVI, and low negative SHAP values (blue) indicating a negative correlation. The point cloud plot displays the SHAP values of each feature for every individual sample. The horizontal axis represents the SHAP values, where positive values contribute to an increased prediction and negative values contribute to a decreased prediction, and the vertical axis lists the features ranked by importance. The color of each point reflects the magnitude of the feature value for the corresponding sample, with red indicating high feature values and blue suggesting low feature values. The numbers following the features represent the sequence from which they are derived: none indicates BP, 1 indicates PVP, 2 indicates AP, and 3 indicates unenhanced phase.
In the 2D combined model, the contributions of various features were relatively balanced. However, in the 3D combined model, a single feature, wavelet-LLH-glszm-Small-Area-Emphasis, contributed over 60%. One of the dominant features in the model was derived from wavelet decomposition and GLSZM, which typically reflects the presence of numerous small and homogeneous grayscale zones at specific scales and orientations. This radiomic feature represents subtle intratumoral texture heterogeneity, which is closely associated with pathological characteristics such as necrosis, fibrosis, and MVI. These factors are generally indicative of tumor aggressiveness and poor prognosis. Therefore, the predominance of this texture feature in the radiomics model may be attributed to its capacity to capture the intricate spatial heterogeneity within the tumor, thereby enhancing the model’s ability to predict MVI accurately. However, over-reliance on a single feature can undermine the model’s robustness. If this feature is influenced by external conditions (e.g., image quality, data acquisition errors, or patient condition variations), the model’s performance may drastically decline. This could be one reason why the performance of the combined model incorporating 3D features did not meet expectations.
Discussion
In the single-sequence radiomics models, the overall performance between 2D and 3D models was similar. A possible reason for this is the overlapping data characteristics between 2D and 3D ROIs. In liver cancer radiomics, some features across different slices of certain sequences may be quite similar. Although the 3D model processes all slices, it does not capture significantly different information. The imaging features of MVI may already be adequately reflected in the local regions of the tumor. Additionally, a large number of texture features and wavelet transform features were extracted, and SHAP analysis also indicated (Figure 3) that for some sequences, MVI prediction mainly relied on texture features rather than spatial features, thereby reducing the need for spatial information. As a result, the spatial information contribution of the 3D model may be weakened.
In the field of liver cancer, no related research has discussed the advantages and disadvantages of 2D and 3D ROIs. However, similar conclusions have been reached in comparable studies in other cancers. In lung cancer, Zhang et al. (18) reported that the predictive performance of 2D and 3D models using the CT plain scan phase did not differ across various tasks (such as lymphovascular invasion (LVI), pleural invasion (VI), and pT staging prediction) (19). Another study compared the performance of 2D and 3D MRI features in meningioma grading and found similar predictive performance between the two (20). A multi-center study compared the performance of 2D and 3D CT radiomics in gastric cancer characterization and concluded that 2D features were slightly superior in most tasks. These studies noted common disadvantages of 3D models. 3D image annotation is performed layer-by-layer, which is time-consuming, labor-intensive, and noisy. Manual delineation of 3D tumors often results in unclear boundaries, which can affect predictive accuracy. Additionally, the Z-axis thickness in 3D ROI delineation is typically coarser than the planar resolution along the X and Y axes, leading to loss of detailed information due to inconsistent spatial resolution, while 2D imaging can capture the clearest information in a single maximal cross-section.
However, the aforementioned reasons cannot explain why the 2D ROI model demonstrates better predictive performance than the 3D model during BP, while the 3D ROI model outperforms the 2D model during the PVP. A correlation analysis of both 2D and 3D features was conducted, as shown in Figure 3. The correlation between 2D and 3D features was strongest during the BP and weakest during the PVP, which aligned with the results of the predictive models. Based on this correlation analysis, it can be speculated that the observed findings may be attributed to the intrinsic characteristics of HCC. Compared to other tumors, HCC may be more heterogeneous. Its growth pattern is typically irregular and infiltrating, often extends beyond a single plane and tends to invade surrounding tissues (1). Additionally, HCC is often highly vascularized (21), especially during the arterial and PVPs, where the tumor shows significant enhancement in blood flow. These vascular characteristics are expressed in a 3D distribution across different layers. The PVP images better reflect the spatial structure, density variations, microvascular distribution, and perfusion differences inherent to HCC heterogeneity. The 3D ROI can provide more comprehensive information than a single 2D plane.
During the BP, the 2D model outperformed the 3D model. This may be due to the unique features of the BP in predicting MVI: the BP better reflects the tissue permeability characteristics associated with MVI (22), where the contrast agent achieves relative balance between the tumor and surrounding tissues. At this point, the tissue permeability of the tumor and microvascular characteristics become more prominent. In 2D images, these local texture and density variations can be more clearly presented, effectively offering features required for MVI prediction. Additionally, the potential “information dilution” issue of 3D integration may be another explanation. The 3D model combines features from multiple slices, which may dilute the predictive power of local features during the BP.
Regardless of the 2D or 3D model, wavelet transform features played a crucial role. This suggests that both models rely on local texture details from certain frequency sub-bands. The stronger results from the 2D model imply that these texture details may not exclusively come from the local regions where MVI occurs; texture details from other areas of the tumor can also reflect the presence of MVI. From a radiomic perspective, this provides further evidence for the heterogeneity of liver cancer.
In comparison, a model based on clinical features was developed using the same method. The model also demonstrated predictive ability for MVI preoperatively. Among 44 clinical indicators, tumor size, and AFP had the most significant predictive power, which is consistent with most prior studies (23, 24). However, the clinical feature-based model did not outperform the radiomics-based models from any sequence.
Next, the optimal radiomic features from the 2D BP and the 3D PVP in the single-sequence models were combined with clinical features to construct a multimodal radiomics-clinical combined model. Our results indicate that incorporating radiomic scores and clinical features in a model enhances the predictive ability. This conclusion aligns with findings from several previous studies. Xia et al. (25) constructed a clinical-radiomics semantic feature-radiomic score prediction model, which predicted MVI status with an AUC of 0.86 (95% CI: 0.79, 0.92), and the external test set yielded an AUC of 0.84 (95% CI: 0.78, 0.91). Compared to the pure radiomic model, the combined model provided a greater net benefit within a reasonable threshold probability range. A multicenter study by Xu et al. Xu (9) has suggested that the accuracy of the radiomic model in predicting MVI outperforms the clinical logistic regression model based on AST, tumor size, and AFP (AUCs in the validation cohort were 0.750 and 0.648, respectively). However, in this study, it was not the optimal predictive approach, which may be attributed to various factors, including sample size and the weaker noise resistance of the 3D model.
In this study, LASSO was employed to select 63 features from a total of eight sets of imaging data, with a significant number of wavelet transform features identified. Notably, some of these wavelet features appeared repeatedly across different sequences of 2D and 3D images. This suggests that the prediction of radiomics-based model for MVI is highly dependent on local texture features and that there exists a strong collinearity between certain features across different sequences of contrast-enhanced imaging. A correlation analysis of the features was conducted, and a heatmap was plotted to further substantiate this observation.
The findings by Ni et al. (26) indicate that the dimensionality reduction and model construction methods employed for radiomic features can impact the predictive performance of the resulting radiomics-based models. LASSO dimensionality reduction combined with gradient boosting decision trees (GBDT) yields the highest prediction accuracy, and the LASSO + GBDT method outperforms other approaches when the threshold probability exceeds 0.22. Both GBDT and XGBoost are ensemble methods that use multiple weak learners (typically decision trees) to enhance model performance. XGBoost is an extension of GBDT that focuses on improving performance and efficiency. It incorporates several enhancements over the GBDT algorithm, including regularization terms, more efficient node splitting, and learning rate decay, which result in superior speed and performance compared to traditional GBDT (27).
Based on Ni et al.’s conclusion, a LASSO + XGBoost predictive model was constructed, with a logistic regression model used for comparison. However, in this study, although XGBoost demonstrated better performance on the training set, it did not show an advantage over the logistic regression model in the validation set. This result may be attributed to the complexity of XGBoost’s tree model structure, which showed slight overfitting in the validation set. If the relationship between CT imaging features and liver cancer MVI is not highly nonlinear, the advantages of XGBoost may not be fully realized, potentially even leading to reduced predictive performance. Logistic regression might be more suitable for this relatively simple feature representation. Notably, in comparison with the logistic regression model, the XGBoost model often performed better with the 3D models, likely due to the higher complexity of the 3D datasets. On the other hand, XGBoost models involve numerous parameters (such as learning rate, tree depth, and regularization parameters), and achieving optimal performance requires careful parameter tuning. Since the investigators in this study were clinicians with limited experience in tuning complex machine learning models, this may have contributed to the suboptimal performance of the machine learning models.
Overall, the radiomics models based on various sequences of contrast-enhanced CT images demonstrated a certain degree of predictive ability. The radiomics combined model outperformed some single-sequence models, though it did not achieve the best results. Among single-sequence models, the 3D PVP model exhibited the highest accuracy, while the 2D BP model showed the highest AUC. Ma et al. (28) conducted a study involving 157 patients with histologically confirmed HCC, with or without MVI, and found that radiomics features from AP, PVP, and BP of enhanced CT imaging could all be used to construct predictive models for liver cancer MVI They found that the PVP offered superior predictive performance compared to AP and BP, as well as multi-sequence models. Our findings are partially consistent with the conclusions by Ma et al.; however, in their study, the combined model directly combined multi-sequence radiomic features, achieving high AUC in the training set, but with suboptimal accuracy, specificity, sensitivity, and AUC in the validation set. A preoperative radiomics model for MVI prediction (29) was similarly constructed, but the training and testing datasets showed considerable differences, which may have been caused by a mismatch in the number of features and the sample size of the training set, leading to model overfitting. Their conclusions are not sufficiently rigorous.
This study has several limitations. The single-center retrospective design inevitably leads to sample selection bias. Additionally, with a relatively small sample size, the advantages of more complex combined models are restricted due to the limited training data. Furthermore, our predictive model did not incorporate semantic features based on imaging interpretations by physicians. Moreover, variations in clinical issues, as well as image acquisition and reconstruction protocols, may influence the results. Therefore, further validation is required to improve the generalizability of the conclusions to other diseases and clinical contexts.
Conclusion
This study successfully developed preoperative predictive models for MVI in HCC patients based on 2D and 3D radiomic features from contrast-enhanced CT imaging. The performance of different ROI delineation methods and imaging sequences was systematically compared. The results indicate that the 2D PVP model excels in capturing local texture features, while the 3D AP model demonstrates greater advantages in reflecting tumor spatial heterogeneity. Overall, the radiomic model outperforms traditional clinical-based predictive models, although the 3D model did not show a significantly better overall predictive performance than the 2D model.
The study further highlights the sensitivity of radiomic features to tumor heterogeneity, particularly in the expression of texture and spatial characteristics. Wavelet transform features are extensively selected and significantly influence prediction results. This underscores the pivotal role of radiomics in capturing the complexity and heterogeneity within tumors. This research provides robust evidence for radiomic studies in HCC, advancing preoperative MVI prediction techniques and offering critical insights for personalized and precise diagnosis and treatment of liver cancer.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by The Ethics Review Committee of the Second Clinical Medical College of Jinan University. The studies were conducted in accordance with the local legislation and institutional requirements. The need for informed consent from individual patients was waived because of the retrospective and observational nature of the study.
Author contributions
ZY: Data curation, Formal Analysis, Methodology, Software, Validation, Visualization, Writing – original draft. MW: Conceptualization, Writing – review & editing. YL: Data curation, Investigation, Resources, Writing – review & editing. ZL: Data curation, Investigation, Resources, Writing – review & editing. SB: Project administration, Supervision, Writing – review & editing. LL: Project administration, Supervision, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Llovet JM, Kelley RK, Villanueva A, Singal AG, Pikarsky E, Roayaie S, et al. Hepatocellular carcinoma. Nat Rev Dis Primers. (2021) 7:6. doi: 10.1038/s41572-020-00240-3
2. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660
3. EASL. EASL Clinical Practice Guidelines: Management of hepatocellular carcinoma. J Hepatol. (2018) 69:182–236. doi: 10.1016/j.jhep.2018.03.019
4. Zhang ZH, Jiang C, Qiang ZY, Zhou YF, Ji J, Zeng Y, et al. Role of microvascular invasion in early recurrence of hepatocellular carcinoma after liver resection: A literature review. Asian J Surg. (2024) 47:2138–43. doi: 10.1016/j.asjsur.2024.02.115
5. Sun B, Ji WD, Wang WC, Chen L, Ma JY, Tang EJ, et al. Circulating tumor cells participate in the formation of microvascular invasion and impact on clinical outcomes in hepatocellular carcinoma. Front Genet. (2023) 14:1265866. doi: 10.3389/fgene.2023.1265866
6. Jun L, Zhenlin Y, Renyan G, Yizhou W, Xuying W, Feng X, et al. Independent factors and predictive score for extrahepatic metastasis of hepatocellular carcinoma following curative hepatectomy. Oncol. (2012) 17:963–9. doi: 10.1634/theoncologist.2011-0447
7. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer (Oxford England: 1990). (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036
8. Zhong X, Long H, Su L, Zheng R, Wang W, Duan Y, et al. Radiomics models for preoperative prediction of microvascular invasion in hepatocellular carcinoma: a systematic review and meta-analysis. Abdominal Radiol (New York). (2022) 47:2071–88. doi: 10.1007/s00261-022-03496-3
9. Xu X, Zhang HL, Liu QP, Sun SW, Zhang J, Zhu FP, et al. Radiomic analysis of contrast-enhanced CT predicts microvascular invasion and outcome in hepatocellular carcinoma. J Hepatol. (2019) 70:1133–44. doi: 10.1016/j.jhep.2019.02.023
10. Liu R, Elhalawani H, Radwan Mohamed AS, Elgohari B, Court L, Zhu H, et al. Stability analysis of CT radiomic features with respect to segmentation variation in oropharyngeal cancer. Clin Trans Radiat Oncol. (2020) 21:11–8. doi: 10.1016/j.ctro.2019.11.005
11. Ho DW, Tsui YM, Sze KM, Chan LK, Cheung TT, Lee E, et al. Single-cell transcriptomics reveals the landscape of intra-tumoral heterogeneity and stemness-related subpopulations in liver cancer. Cancer Lett. (2019) :459:176–85. doi: 10.1016/j.canlet.2019.06.002
12. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–e7. doi: 10.1158/0008-5472.Can-17-0339
13. Chernyak V, Fowler KJ, Do RKG, Kamaya A, Kono Y, Tang A, et al. LI-RADS: looking back, looking forward. Radiology. (2023) 307:e222801. doi: 10.1148/radiol.222801
14. Hu F, Zhang Y, Li M, Liu C, Zhang H, Li X, et al. Preoperative prediction of microvascular invasion risk grades in hepatocellular carcinoma based on tumor and peritumor dual-region radiomics signatures. Front Oncol. (2022) 12:853336. doi: 10.3389/fonc.2022.853336
15. Fornacon-Wood I, Mistry H, Ackermann CJ, Blackhall F, McPartlin A, Faivre-Finn C, et al. Reliability and prognostic value of radiomic features are highly dependent on choice of feature extraction platform. Eur Radiol. (2020) 30:6241–50. doi: 10.1007/s00330-020-06957-9
17. Ali S, Akhlaq F, Imran AS, Kastrati Z, Daudpota SM, and Moosa M. The enlightening role of explainable artificial intelligence in medical & healthcare domains: A systematic literature review. Comput Biol Med. (2023) 166:107555. doi: 10.1016/j.compbiomed.2023.107555
18. Zhang X, Zhang G, Qiu X, Yin J, Tan W, Yin X, et al. Radiomics under 2D regions, 3D regions, and peritumoral regions reveal tumor heterogeneity in non-small cell lung cancer: a multicenter study. La Radiol Med. (2023) 128:1079–92. doi: 10.1007/s11547-023-01676-9
19. Duan C, Li N, Liu X, Cui J, Wang G, and Xu W. Performance comparison of 2D and 3D MRI radiomics features in meningioma grade prediction: A preliminary study. Front Oncol. (2023) 13:1157379. doi: 10.3389/fonc.2023.1157379
20. Meng L, Dong D, Chen X, Fang M, Wang R, Li J, et al. 2D and 3D CT radiomic features performance comparison in characterization of gastric cancer: A multi-center study. IEEE J Biomed Health Inform. (2021) 25:755–63. doi: 10.1109/jbhi.2020.3002805
21. Zheng N, Zhang S, Wu W, Zhang N, and Wang J. Regulatory mechanisms and therapeutic targeting of vasculogenic mimicry in hepatocellular carcinoma. Pharmacol Res. (2021) 166:105507. doi: 10.1016/j.phrs.2021.105507
22. Taskaeva I and Bgatova N. Microvasculature in hepatocellular carcinoma: An ultrastructural study. Microvascular Res. (2021) 133:104094. doi: 10.1016/j.mvr.2020.104094
23. Cucchetti A, Piscaglia F, Grigioni A, Ravaioli M, Cescon M, Zanello M, et al. Preoperative prediction of hepatocellular carcinoma tumour grade and micro-vascular invasion by means of artificial neural network: a pilot study. J Hepatol. (2010) 52:880–8. doi: 10.1016/j.jhep.2009.12.037
24. Zhang C, Zhao R, Chen F, Zhu Y, and Chen L. Preoperative prediction of microvascular invasion in non-metastatic hepatocellular carcinoma based on nomogram analysis. Transl Oncol. (2021) 14:100875. doi: 10.1016/j.tranon.2020.100875
25. Xia TY, Zhou ZH, Meng XP, Zha JH, Yu Q, Wang WL, et al. Predicting microvascular invasion in hepatocellular carcinoma using CT-based radiomics model. Radiology. (2023) 307:e222729. doi: 10.1148/radiol.222729
26. Ni M, Zhou X, Lv Q, Li Z, Gao Y, Tan Y, et al. Radiomics models for diagnosing microvascular invasion in hepatocellular carcinoma: which model is the best model? Cancer Imaging: Off Publ Int Cancer Imaging Soc. (2019) 19:60. doi: 10.1186/s40644-019-0249-x
27. Chen M, Wang Z, Zhao Z, Zhang W, Guo X, Shen J, et al. Task-wise split gradient boosting trees for multi-center diabetes prediction. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining; Virtual Event. Association for Computing Machinery, Singapore (2021). p. 2663–73.
28. Ma X, Wei J, Gu D, Zhu Y, Feng B, Liang M, et al. Preoperative radiomics nomogram for microvascular invasion prediction in hepatocellular carcinoma using contrast-enhanced CT. Eur Radiol. (2019) 29:3595–605. doi: 10.1007/s00330-018-5985-y
29. Famularo S, Penzo C, Maino C, Milana F, Oliva R, Marescaux J, et al. Preoperative detection of hepatocellular carcinoma’s microvascular invasion on CT-scan by machine learning and radiomics: A preliminary analysis. Eur J Surg Oncol: J Eur Soc Surg Oncol Br Assoc Surg Oncol. (2024) 24:108274. doi: 10.1016/j.ejso.2024.108274
Keywords: microvascular invasion, intratumoral heterogeneity, radiomics, two-dimensional and three-dimensional models, machine learning
Citation: Yin Z, Wu M, Li Y, Li Z, Bao S and Liu L (2025) Exploring 2D and 3D radiomic models for predicting microvascular invasion in hepatocellular carcinoma: a novel perspective on tumor heterogeneity. Front. Oncol. 15:1590655. doi: 10.3389/fonc.2025.1590655
Received: 10 March 2025; Accepted: 21 July 2025;
Published: 17 September 2025.
Edited by:
Francisco Tustumi, University of São Paulo, BrazilReviewed by:
Yu Gu, Jiangsu Province Hospital of Chinese Medicine, ChinaRenguo Guan, Guangdong Provincial People’s Hospital, China
Copyright © 2025 Yin, Wu, Li, Li, Bao and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shiyun Bao, YmFvLnNoaXl1bkBzemhvc3BpdGFsLmNvbQ==; Liping Liu, bGl1bGlwaW5nQG1haWwuc3VzdGVjaC5lZHUuY24=