Predicting the recurrence risk of liver metastasis from colorectal cancer: a study based on preoperative CT intratumoral and peritumoral radiomics features

Zhang, Dongying; Li, Peiheng; Wei, Yong; Xue, Mingmei; Guo, Fangfang; Li, Chenguang

doi:10.3389/fonc.2025.1662354

ORIGINAL RESEARCH article

Front. Oncol., 23 September 2025

Sec. Cancer Imaging and Image-directed Interventions

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1662354

This article is part of the Research TopicPrecision Medical Imaging for Cancer Diagnosis and Treatment Volume IIIView all 9 articles

Predicting the recurrence risk of liver metastasis from colorectal cancer: a study based on preoperative CT intratumoral and peritumoral radiomics features

Dongying Zhang^*

Peiheng Li

Yong Wei

Mingmei Xue

Fangfang Guo

Chenguang Li

Department of Radiology, The First Affiliated Hospital of Xinxiang Medical University, Weihui, China

Objective: This study aims to explore the value of predicting the recurrence risk of colorectal cancer liver metastasis (CRLM) based on preoperative CT intratumoral and peritumoral radiomics features.

Methods: This study utilized retrospectively collected preoperative CT data of 201 CRLM patients, comprising 145 cases from the hospital one and 56 cases from an external hospital two. Liver metastases were precisely segmented via manual annotation. Subsequently, the intratumoral region of interest (ROI_Intra) was isotropically dilated to radii of 2 mm, 4 mm, and 6 mm, resulting in peri-tumoral ROIs (ROI_Peri2mm, ROI_Peri4mm and ROI_Peri6mm). We established the prediction models based on support vector machine (SVM), random forest (RF), and multilayer perceptron (MLP) algorithms. The area under the subject operating characteristic curve (AUC) was used to evaluate the predictive performance.

Results: Compared with SVM and RF, MLP demonstrated superior predictive performance for estimating the recurrence risk of CRLM patients. The best radiomics signatures for predicting the recurrence risk of CRLM were ROI_{Intra+Peri4mm} model, and the AUCs of the ROI_Intra model, ROI_{Intra+Peri2mm} model, ROI_{Intra+Peri4mm} model, and ROI_{Intra+Peri6mm} model constructed by MLP are 0.758 (95% confidence interval (CI), 0.621 - 0.865), 0.815 (95% CI, 0.684 - 0.908), 0.855 (95% CI, 0.731 - 0.936), and 0.825 (95% CI, 0.696 - 0.915), respectively, in the external test set.

Conclusion: Preoperative CT-based radiomics features extracted from intra-tumoral (ROI_Intra) and peritumoral (ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm}) regions can effectively predict recurrence risk in CRLM patients.

Introduction

Colorectal cancer ranks as the third most prevalent cancer worldwide, accounting for approximately 10% of all cancer cases, and represents the second leading cause of cancer-related mortality globally (1, 2). Approximately 25% of colorectal cancer patients develop liver metastases, for which hepatic resection remains the primary curative treatment (3, 4). Despite advances in surgical techniques and medical oncology that have increased resectability rates, multiple studies have demonstrated that the pooled recurrence rate after liver resection remains alarmingly high at 60-80%, constituting the leading cause of mortality in these patients (5, 6). Precise prediction of recurrence risk is therefore critically important for developing personalized therapeutic strategies and accurate prognostic assessments in patients with colorectal liver metastases (CRLM) (7–9). Clinically, the diagnostic utility of liver biopsy is often constrained by its invasive nature, the potential for procedural complications, and the elevated risk of false-negative results attributable to inadequate tissue sampling or sampling errors. As a routine imaging modality for patients with CRLM, preoperative computed tomography (CT) not only provides detailed anatomical information, including the size, location, and morphology of hepatic lesions, but also captures subtle, indirect indicators of tumor biology (10, 11). These microscale features, imperceptible to the human eye, can be systematically extracted by radiomics through the identification of intratumoral patterns, textural heterogeneities, and spatial relationships within CT images, ultimately transforming them into quantifiable and mineable data for enhanced diagnostic and prognostic insights (12–14). Support vector machine (SVM), random forest (RF), and multilayer perceptron (MLP), as established machine learning algorithms with robust feature extraction capabilities, enable the development of predictive models for CRLM recurrence risk (15–18). By integrating radiomics - based CT feature extraction with SVM, RF, and MLP modeling frameworks, this approach offers a novel, non-invasive solution for personalized risk stratification in CRLM management.

Current radiomics - based studies on preoperative CT imaging for CRLM have predominantly focused on risk prediction (19, 20), chemotherapy response (12, 21), and prognosis (22, 23). However, there remains a critical gap in the development of predictive models for postoperative recurrence risk, which constitutes a major determinant of long-term survival in this patient population. Additionally, the evolution and progression of tumors are affected by the interactions between cells within the tumor and constituents in the peritumoral region (24). Previous research indicates that tumors are composed not only of malignant cells but also stromal components, immune elements and inflammatory elements. These factors induce the stromal remodeling, creating a microenvironment conducive to tumor progression (25). The tumor necrosis factor signaling pathway, which is connected to abnormal blood - vessel formation driven by cancer cells, as well as invasion and metastasis, is related to the features of the peritumoral area (26, 27). In light of this, both the tumor’s intratumoral and peritumoral microenvironments are likely reservoirs of pivotal biological signals and CRLM recurrence indicators.

In this study, we develop and validate a radiomics - based recurrence risk prediction model for CRLM using advanced machine learning algorithms, including SVM, RF, and MLP, applied to preoperative CT imaging. By integrating quantitative imaging features extracted from both intratumoral and peritumoral regions, with the optimal peritumoral area associated with recurrence risk identified, our approach bridges the current gap in postoperative recurrence prediction. Furthermore, by incorporating clinical parameters, we establish a novel, non-invasive prognostic tool to guide postoperative surveillance and personalized treatment strategies, ultimately improving clinical outcomes for CRLM patients.

Materials and methods

Patient collective

For this retrospective, multiple-centers, IRB-approved study, 201 patients with proven colorectal cancer and CRLM were identified, including 148 CRLM patients from hospital one and 53 patients from hospital two (Table 1). The ethics committee of this institution approved the study and waived informed consent. Inclusion criteria: 1). Pathologically confirmed CRLM; 2). Availability of histopathological reports for both liver tumor and non-tumorous hepatic parenchyma; 3). Preoperative portal venous phase contrast-enhanced CT imaging acquired within 6 weeks prior to hepatic resection; 4). The follow-up duration is at least 24 months. The follow-up process involved regular clinical evaluations, including serum tumor marker testing and imaging assessments every year. Exclusion criteria: 1). Preoperative hepatic arterial infusion chemotherapy; 2). Prior local tumor ablation therapy or more than three wedge resections of the liver; 3). No visible tumor on preoperative imaging. The flowchart of the patient selection process is presented in Figure 1. The clinical information available in the datasets included: gender, age, body mass index at operation, carcinoembryonic antigen test, lymph node status, colon primary status, presence of multiple lobes, presence of major comorbidity, maximum tumor size and the liver recurrence status. Recurrence was defined based on a combination of imaging and pathological evidence. Detection of new lesions on cross-sectional imaging with typical radiological features of metastatic disease, e.g., enhancing hepatic nodules, extrahepatic masses, that were not present at baseline and persisted or enlarged on subsequent scans. Histopathological verification of malignant cells from biopsy or surgical resection of suspected recurrent lesions, which served as the gold standard when available.

Table 1

Table 1. Clinical characteristics of CRLM patients in the training and test sets.

Figure 1

Flowchart showing the selection process for a study on CRLM patients. Two hospitals’ patients are screened. Inclusion criteria: confirmed CRLM, histopathological reports, preoperative CT imaging, 24-month follow-up. Exclusion criteria: prior treatments or no visible tumors. Total included: 201 patients. Training set: 148 patients from hospital one; 87 with recurrence, 61 without. Test set: 53 patients from hospital two; 27 with recurrence, 26 without.

Figure 1. Subject selection flowchart for this experiments.

Imaging protocols

All study participants underwent a standardized contrast-enhanced CT scan according to predefined imaging protocols. Abdominal imaging was performed using a multidetector CT scanner (Lightspeed 16 and VCT; GE Healthcare, Madison, WI, USA) with the following key settings: autoMA ranged from 220 to 380, a noise index of 12 to 14, a rotation time of 0.7 to 0.8 milliseconds, and a scan delay of 80 seconds (28).

Intratumoral segmentation and peritumoral dilation

Intratumoral segmentation was performed on CT images using the ITK-SNAP software (Version 3.60, http://www.itk-snap.org). Two radiologists with 8 and 10 years of experience in abdominal imaging diagnosis, respectively, independently carried out the segmentation process. If the region of interest (ROI) defined by two radiologists showed a discrepancy ≥5%, a senior radiologist with two decades of expertise conducted re-segmentation to finalize the ROIs. The segmented ROI served as the ROI_Intra, which was expanded by 2 mm, 4 mm, and 6 mm into the peritumoral region using standard morphological operations, yielding ROI_Peri2mm, ROI_Peri4mm and ROI_Peri6mm, respectively. The ROI_{Peri k mm} excluded skin, air, and muscles.

Data preprocessing

Prior to the extraction of radiomics features, all CT images underwent resampling to achieve a uniform voxel resolution of 1×1×1 mm³. Subsequently, the intensity histograms of the images were discretized using a bin width of 25, ensuring consistent and standardized feature extraction across the dataset. This preprocessing step is crucial for maintaining the comparability and reliability of the extracted radiomics features.

Radiomics feature extraction

Radiomics features were extracted from the ROI using the open-source software PyRadiomics (Version 2.20, https://github.com/Radiomics/pyradiomics). A comprehensive set of 12 filters, including Original, AdditiveGaussianNoise, Binomial, Normalize, LaplacianSharpening, CurvatureFlow, wavelet, ShotNoise, BoxMean, LoG, DiscreteGaussian, and BoxSigmaImage, were applied to enhance the extraction of radiomics features. A total of 201 patients in this study yielded 1197 high-dimensional radiomics features. To reduce the complexity or bias inherent in the radiomics feature set, dimensionality reduction techniques were employed. The primary objective of feature dimensionality reduction is to simplify the feature space while retaining the most informative features. In the training dataset, we first performed feature dimensionality reduction using the least absolute shrinkage and selection operator (LASSO) regression analysis, with the regularization parameter α set to 0.001 (29). Subsequently, the top 15 features with the highest correlation were selected based on the maximum relevance and minimum redundancy (mRMR) method (30). These methods help in selecting the most discriminative features, thereby improving the performance of subsequent predictive models.

Model construction

Using radiomics features extracted from ROI_Intra and its peritumoral extensions (ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm}), predictive models were developed using machine learning algorithms, including SVM, RF, and MLP, to predict recurrence risk in CRLM. The radiomics workflow is shown in Figure 2. The details of these classifiers were shown in Table 2.

Figure 2

Flowchart illustrating the process from CT acquisition to prediction model. CT images and clinical data are used to segment intratumoral and peritumoral areas. Features are extracted, including first order, shape, texture, and wavelet. Radiomics and clinical features are compiled. Feature selection employs LASSO and mRMR methods, leading to model construction for predictions.

Figure 2. Radiomics analysis and machine learning workflow for predicting the recurrence risk in CRLM patients.

Table 2

Table 2. The details of different classifiers.

Statistical analysis

In the analysis of count data, comparisons between groups were performed using the Chi-square test, which is appropriate for categorical variables. For continuous data, comparisons between groups were conducted using either the Mann-Whitney U test or the independent samples t-test, depending on the normality assumptions of the data. The performance of the predictive models was evaluated using several metrics, including the area under the subject operating characteristic curve (AUC), sensitivity, and specificity. Decision curve analysis (DCA) and calibration curve were employed to independently evaluate the stability and clinical net benefit of the predictive model, respectively. These metrics provide a comprehensive assessment of the model’s ability to distinguish between different outcomes. All statistical analyses were conducted using the R software (Version 4.3.3). A significance level of P < 0.05 was adopted to indicate statistically significant differences.

Results

Statistical analysis of clinical features

A total of 114 patients with recurrent CRLM and 87 patients without recurrence were included. Among the clinical characteristics, significant differences were observed between the two groups in terms of presence of multiple lobes and the maximum tumor diameter. Consequently, presence of multiple lobes and the maximum tumor diameter were incorporated as clinical indicators into the predictive model (Table 1). The two variables were selected due to their statistical significance, suggesting their potential importance in influencing the recurrence risk of CRLM. Including these clinical indicators enhances the model’s ability to accurately predict recurrence by accounting for relevant patient-specific factors.

Radiomics feature selection and predictive performance

From the CT-based ROIs (ROI_Intra, ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm}), 15 radiomics features were ultimately selected based on their significant association with recurrence risk. The extracted radiomics features by ROI_Intra, ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm}, which are presented in Tables 3, 4, 5, and 6. The results of recurrence risk prediction for CRLM patients, achieved by integrating radiomics features with clinical characteristics and constructing predictive models using SVM, RF, and MLP, are presented in Table 3, 4. The predictive performance of the three machine learning models (SVM, RF, and MLP) improved consistently when incorporating peritumoral regions into the radiomics signature. The optimal performance was observed with the MLP model using ROI_{Intra+Peri4mm}, achieving the highest AUC of 0.905 (95% CI: 0.846 - 0.947) in the training set and 0.855 (95% CI: 0.731 - 0.936) in the test set. The significance level P for all nine models was less than 0.0001 (Table 7).

Table 3

Table 3. The key radiomics features selected out through ROI_Intra.

Table 4

Table 4. The key radiomics features selected out through ROI_{Intra+Peri2mm}.

Table 5

Table 5. The key radiomics features selected out through ROI_{Intra+Peri4mm}.

Table 6

Table 6. The key radiomics features selected out through ROI_{Intra+Peri6mm}.

Table 7

Table 7. Performance metrics SVM, RF, and MLP models across different ROI radiomics signature in the training set.

The ROC curves (Figure 3) illustrate that models constructed using CT intratumoral and peritumoral radiomics features are effective in predicting the recurrence risk of CRLM, with all models achieving AUC values exceeding 0.735 in the test set. Based on the H-L test, the P-values for the ROI_Intra model, ROI_{Intra+Peri2mm} model, ROI_{Intra+Peri4mm} model, and ROI_{Intra+Peri6mm} model by SVM compared to actual observations were 0.2262, 0.2298, 0.3631, and 0.0110, respectively. The P-values for the ROI_Intra model, ROI_{Intra+Peri2mm} model, ROI_{Intra+Peri4mm} model, and ROI_{Intra+Peri6mm} model by RF compared to actual observations were 0.4532, 0.3038, 0.3190, and 0.1586, respectively. The P-values for the ROI_Intra model, ROI_{Intra+Peri2mm} model, ROI_{Intra+Peri4mm} model, and ROI_{Intra+Peri6mm} model by MLP compared to actual observations were 0.3068, 0.3845, 0.8556, and 0.4218, respectively. The decision curve analysis evaluated the clinical utility of all the predictive models (ROI_Intra, ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm}) across a range of threshold probabilities (Figure 4). The calibration curves assessed the agreement between predicted probabilities and observed outcomes (Figure 5). The results demonstrated that all models provided net benefit, indicating their potential clinical applicability for predicting recurrence risk in CRLM patients (Table 8).

Figure 3

Four ROC curve plots labeled A, B, C, and D. Each graph compares the performance of SVM (dashed line), RF (dotted line), and MLP (solid line) classifiers. The x-axis represents 100-Specificity and the y-axis shows Sensitivity, ranging from 0 to 100. All curves indicate classifier performance with varying levels of sensitivity and specificity. A diagonal line represents random chance in each plot.

Figure 3. ROC of the predictive models constructed by different radiomics signatures in the test set. (A) ROI_Intra, (B) ROI_{Intra+Peri2mm}, (C) ROI_{Intra+Peri4mm}, (D) ROI_{Intra+Peri6mm}.

Figure 4

Four graphs labeled A, B, C, and D display net benefit against risk threshold for SVM, RF, and MLP models, compared to no model. Each graph features three colored lines for the models and a black dashed line for the “None” category. A green shaded area indicates positive net benefit.

Figure 4. DCA of the predictive models constructed by different radiomics signatures in the test set. (A) ROI_Intra, (B) ROI_{Intra+Peri2mm}, (C) ROI_{Intra+Peri4mm}, (D) ROI_{Intra+Peri6mm}.

Figure 5

Four line graphs labeled A, B, C, and D compare actual versus predicted probabilities for SVM, RF, and MLP models, with a reference line for perfect calibration. Each graph displays varying levels of calibration for the models, indicated by the proximity of colored lines to the dotted reference line across different predicted probabilities.

Figure 5. Calibration curve of the predictive models constructed by different radiomics signatures in the test set. (A) ROI_Intra, (B) ROI_{Intra+Peri2mm}, (C) ROI_{Intra+Peri4mm}, (D) ROI_{Intra+Peri6mm}.

Table 8

Table 8. Performance metrics SVM, RF, and MLP models across different ROI radiomics signature in the test set.

Discussion

Radiomics - based analyses of preoperative CT imaging have been extensively validated for predicting metastatic risk, chemotherapy response, and prognosis in CRLM patients. For example, a study by Jing et al. showed that a radiomics model based on CT for preoperative prediction of liver metastases after surgery for colorectal cancer, with AUC of 0.761 in the test set (19). Karagkounis et al. constructed the radiomics prediction model using CT features of 85 Patients who underwent resection for CRLM, and the results showed that this model can provide a valuable reference for pathological response assessment (13). However, the potential of radiomics for recurrence risk stratification remains relatively underexplored in this patient population. This knowledge gap is particularly concerning given that recurrence patterns in CRLM are highly heterogeneous, with liver metastases and local hepatic recurrences exhibiting distinct clinical behaviors. Meanwhile, existing studies predominantly focus on intralesional features, overlooking the recurrence value of peritumoral texture patterns that may reflect microenvironmental changes associated with tumor dissemination. To address these gaps, our study systematically evaluated the predictive performance of radiomics models for recurrence risk by constructing separate SVM, RF, and MLP algorithms using four distinct radiomics feature sets derived from the intra-tumoral ROI (ROI_Intra) and its peritumoral extensions (ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm}).

Biologically, the peritumoral region plays a critical role in tumor progression, as it is where cancer cells interact with stromal tissue, immune cells, and blood vessels, processes that drive local invasion and recurrence (13). Studies on hepatic metastases have shown that pathological changes extend beyond the visible tumor boundary, with distinct biological signatures observed at varying distances from the tumor edge (31). Specifically, the 2mm zone primarily reflects immediate tumor-stroma interactions, including early invasive activity and extracellular matrix remodeling. The 4mm zone captures broader paracrine effects and immune responses that mediate tumor survival and spread. The 6mm zone encompasses more distant microenvironmental changes, such as hepatic sinusoidal remodeling, which can facilitate micrometastasis formation. For instance, Shang et al. predicted the invasiveness of lung adenocarcinoma by analyzing radiomic features extracted from both tumor cores and 4mm peritumoral regions on CT imaging (32). Qin et al. established through MRI analysis that systematic evaluation of peritumoral ROIs expanded by 2mm, 4mm, and 6mm beyond tumor margins provides clinically significant predictive value for assessing pathological treatment response in locally advanced rectal cancer patients following neoadjuvant chemoradiotherapy (33). Clinically, these distances align with previous radiomic studies on malignancies, where 2 - 6mm peritumoral regions have been associated with recurrence risk and treatment response. We selected 2mm, 4mm, and 6mm to span this clinically relevant range, allowing us to capture both proximal and distal microenvironmental influences on recurrence.

The extraction of radiomics features is based on the principles of quantitative imaging analysis, which systematically quantifies the spatial and intensity distributions of voxel patterns within medical images. These features capture tumor heterogeneity and microenvironmental characteristics, thereby possessing significant biological relevance in medical research. As shown in Table 3, the first column displays the categories corresponding to the radiomics features, the second column lists the names of the selected features, and the third column presents the mRMR correlation coefficients between the radiomics features and the recurrence risk. We have analyzed the biological significance of the 3 radiomics features that are most strongly associated with the risk of recurrence. Original_glrlm_LongRunHighGrayLevelEmphasis measures the distribution of long contiguous pixel runs with high gray-level intensity values. The high value may correspond to highly ordered proliferative regions within the tumor or high cell density zones around necrotic areas, and is associated with the degree of tumor differentiation. Original_firstorder_Mean represents arithmetic mean of all pixel intensities within the tumor ROI. Low mean values may indicate necrotic areas, while high values suggest vascularized tumor regions. Original_shape_Sphericity measures how closely the tumor shape approximates a perfect sphere. Lower sphericity correlates with infiltrative growth patterns and desmoplastic reaction.

The radiomics features extracted by ROI_{Intra+Peri2mm} are shown in Table 4. The top 3 radiomic features most associated with the risk of recurrence are wavelet-LLH_glcm_Correlation, wavelet-LLH_glcm_DifferenceEntropy, wavelet-LLH_gldm_DependenceNonUniformity, respectively. Wavelet-LLH_glcm_Correlation quantifies how correlated a pixel is to its neighbor in the LLH wavelet space. High values indicate organized tissue structures, e.g., regular tumor stroma, and low values suggest chaotic tissue patterns. Wavelet-LLH_glcm_DifferenceEntropy calculates the entropy of gray-level difference distribution. Higher values indicate more heterogeneous tissue patterns. Wavelet-LLH_gldm_DependenceNonUniformity measures the variability of gray-level dependencies in local neighborhoods. It reflects complex microenvironment interactions in the CRLM tumor.

The radiomics features extracted by ROI_{Intra+Peri4mm} are shown in Table 5. The top 3 radiomic features most associated with the risk of recurrence are wavelet-HHL_glszm_HighGrayLevelZoneEmphasis,wavelet-HHL_glszm_LargeAreaEmphasis, wavelet-HHL_glszm_ZoneEntropy, respectively. Wavelet-HHL_glszm_HighGrayLevel ZoneEmphasis measures the relative distribution of high gray-level zones and emphasizes zones with higher gray-level values within the HHL wavelet-transformed image space. It may correlate with regions of active tumor metabolism or hypervascularized subregions. Wavelet-HHL_glszm_LargeAreaEmphasis quantifies the distribution of large homogeneous zones and emphasizes larger zone sizes within the HHL wavelet space. It may relate to areas of tumor stability or defined growth patterns. Wavelet-HHL_glszm_ZoneEntropy assesses the randomness or disorder in the distribution of zone sizes throughout the image, specifically within the HHL wavelet-transformed space. High values suggest a more irregular and heterogeneous distribution of tumor zones, indicating complex microenvironmental interactions or varied cellular compositions. Low values imply a more uniform and organized arrangement of tumor zones.

The radiomics features extracted by ROI_{Intra+Peri6mm} are shown in Table 6. The top 3 radiomic features most associated with the risk of recurrence are wavelet-LLL_glrlm_GrayLevelNonUniformityNormalized, wavelet-LLL_glrlm_LongRunEmphasis, log-sigma-5-mm-3D_glcm_JointEntropy, respectively. Wavelet-LLL_glrlm_GrayLevelNonUniformityNormalized quantifies gray-level distribution uniformity within runs. It may be related to infiltrative growth patterns with irregular cellular density. Wavelet-LLL_glrlm_LongRunEmphasis detects large-scale and spatially coherent tissue regions in LLL-filtered images. Higher values may correlate with well-organized tumor architectures. Log-sigma-5-mm-3D_glcm_JointEntropy enhances edges at 5 mm resolution and captures mid-range heterogeneity.

The inclusion of peritumoral regions (Peri_2mm, Peri_4mm, Peri_6mm) consistently improved model performance compared to intralesional features alone (ROI_Intra). For example, in the training set, MLP’s AUC increased from 0.872 (ROI_Intra) to 0.929 (ROI_{Intra+Peri4mm}), while in the test set, the best-performing model (MLP with ROI_{Intra+Peri4mm}) achieved an AUC of 0.855, compared to 0.735 for SVM with ROI_Intra alone. The performance of recurrence risk prediction models improved significantly with the inclusion of peritumoral radiomics features, confirming that tumor-stromal interactions in the peritumoral microenvironment contribute to metastatic progression. Among the evaluated radiomic regions - ROI_Intra, ROI_{Intra+Peri2mm}, ROI_{Intra+Peri4mm}, and ROI_{Intra+Peri6mm} demonstrated the best balance between sensitivity and specificity across all machine learning models. Although the SVM model demonstrated a marginal improvement in AUC (+0.002) when expanding the peritumoral region from ROI_{Intra+Peri4mm} to ROI_{Intra+Peri6mm} in the test set, both the RF and MLP models exhibited superior performance with the smaller ROI_{Intra+Peri4mm} signature. This suggests that, for these algorithms, the inclusion of excessive peritumoral information may introduce noise or redundancy, thereby diminishing predictive accuracy. The optimal peritumoral expansion size appears to be context-dependent, with smaller regions (4 mm) potentially striking a better balance between feature informativeness and model generalizability. In contrast, the minimal peritumoral inclusion (ROI_{Intra+Peri2mm}) failed to capture sufficient prognostic information, underscoring the importance of an optimized radiomic capture radius. ,

Machine learning model selection significantly influenced predictive accuracy. The selection of SVM, RF, and MLP was based on their distinct strengths and wide applicability in radiomic research. SVM was used for its robustness in handling high-dimensional data and its ability to find optimal hyperplanes for classification, which is valuable given the high dimensionality of our radiomic feature set. RF was selected due to its superiority in capturing non-linear relationships and interactions between features, as well as its built-in feature importance evaluation. MLP was included to account for potential complex, non-linear patterns in the data that traditional statistical models might miss, leveraging its capability to model hierarchical feature representations. The performance of SVM, RF, and MLP models varied significantly across different radiomics signatures, both in the training and test sets. MLP demonstrated superior performance in both training and testing, particularly when incorporating peritumoral features, likely due to its ability to model hierarchical feature interactions. RF showed consistent performance across signatures but generally lagged behind MLP, suggesting that ensemble methods may not fully exploit the radiomics feature space in this context. SVM, while computationally efficient, exhibited the greatest performance decline in testing (e.g., AUC dropped from 0.893 to 0.806 for ROI_{Intra+Peri4mm}), highlighting its vulnerability to overfitting in high-dimensional radiomics data. The superior performance of the MLP can be attributed to its unique ability to capture the complex, non-linear relationships and hierarchical feature interactions inherent in our radiomic dataset, particularly the combined intratumoral and peritumoral features. Unlike SVM, which focuses on finding optimal hyperplanes for binary classification, or RF, which relies on ensemble decision trees, MLP’s multi-layered neural network structure allows it to model intricate patterns across high-dimensional radiomic features. In our dataset, recurrence risk is influenced by a confluence of factors, including tumor size, margin status, and peritumoral inflammatory changes, features that manifest as non-linear associations in imaging data. Additionally, the MLP’s capacity for incremental learning allowed it to adapt to the subtle variations in imaging protocols between our training and external validation cohorts, contributing to its stable performance across both datasets.

In real-world clinical settings, for patients newly diagnosed with CRLM, the model can predict the risk of postoperative recurrence before surgery. For instance, patients at high recurrence risk may require more aggressive treatment strategies, such as expanding the scope of liver resection, combining with radiofrequency ablation, or administering preoperative neoadjuvant chemotherapy to reduce the activity of micrometastases. In contrast, patients at low risk can adopt more conservative surgical approaches, such as local hepatic segment resection, to avoid the risk of complications caused by overtreatment. Postoperative follow-up is crucial for reducing the mortality rate of recurrent CRLM. This study can assist physicians in formulating differentiated follow-up plans for patients with different risk stratifications. For example, patients at high risk need shorter follow-up intervals and should be prioritized for more sensitive monitoring methods, patients at low risk can have extended follow-up cycles, which reduces unnecessary medical interventions and the economic burden on patients while optimizing the allocation of medical resources.

To promote the transformation of the model from the research stage to a routine clinical tool, the following steps need to be completed in phases: 1) The current model is built based on data from two centers. The next step will involve conducting external validation in centers across different regions and with varying levels of medical resources to verify the stability of the model under different CT equipment parameters. For any biases identified during validation, the model’s adaptability can be optimized through standardized image preprocessing or transfer learning algorithms. 2) The model needs to be embedded into the hospital’s existing information systems to realize an automated process of image upload, feature extraction, and risk score generation. 3) Multidisciplinary training is required to help medical staff understand the model’s principles, scope of application, and limitations, thereby avoiding over-reliance on the model or misjudgment of risks. 4) To promote the model as a clinical decision-making tool, it is necessary to complete the application process in accordance with medical device regulatory requirements and provide data on its efficacy and safety. The goal is to gradually transition the model from a research tool to a routine clinical auxiliary method, ultimately supporting the individualized management of CRLM patients and improving their prognosis.

Our study has several limitations that warrant consideration. First, the retrospective design inherently introduces potential biases, such as selection bias in patient enrollment and variability in clinical data collection across different time periods. These factors may constrain the generalizability of the model’s predictive performance, as the retrospective setting cannot fully simulate real-world clinical scenarios where patient management is dynamic and influenced by evolving clinical practices. Second, while we performed external validation to test the model’s robustness, the sample size of the external cohort was relatively small, which limits the statistical power to detect subtle differences in predictive accuracy. A larger external validation cohort, ideally from multiple centers with diverse patient populations and clinical practices, would be necessary to confirm the model’s scalability and adaptability across different healthcare settings. Additionally, the absence of molecular genetic data such as KRAS or BRAF mutation in the dataset precluded deeper exploration of the biological underpinnings of recurrence risk, thereby restricting the study’s translational impact. To address these limitations, we plan to conduct prospective, multi-center studies with larger sample sizes in the future. These studies will standardize data collection protocols to minimize bias, and incorporate comprehensive molecular profiling to bridge radiomic features with their biological bases. Expanding the external validation cohort to include more diverse patient populations will further enhance the model’s clinical applicability. Through these efforts, we aim to refine the model and strengthen its potential for translation into clinical practice.

Conclusion

In conclusion, MLP - based models using ROI_{Intra+Peri4mm} radiomics signatures may offer the best trade-off between predictive accuracy and generalization for recurrence risk stratification in CRLM. This study is expected to have a positive impact on the level of personalized diagnosis and treatment for CRLM patients, as well as on the accuracy of predicting recurrence risk, ultimately enhancing patients’ survival benefits.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the First Affiliated Hospital of Xinxiang Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

DZ: Writing – original draft, Writing – review & editing. PL: Formal analysis, Data curation, Writing – review & editing. YW: Writing – review & editing. CL: Writing – review & editing. MX: Writing – review & editing. FG: Software, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. The Key Project of Henan Province Medical Science and Technology Project (Nos. LHGJ20210512).

Acknowledgments

We acknowledge the support received from Henan Province Medical Science and Technology Project.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, and Wallace MB. Colorectal cancer. Lancet. (2019) 394:1467–80. doi: 10.1016/S0140-6736(19)32319-0

Google Scholar

2. Abedizadeh R, Majidi F, Khorasani HR, Abedi H, and Sabour D. Colorectal cancer: a comprehensive review of carcinogenesis, diagnosis, and novel strategies for classified treatments. Cancer Metastasis Rev. (2024) 43:729–53. doi: 10.1007/s10555-023-10158-3

PubMed Abstract | Crossref Full Text | Google Scholar

3. Shin AE, Giancotti FG, and Rustgi AK. Metastatic colorectal cancer: mechanisms and emerging therapeutics. Trends Pharmacol Sci. (2023) 44:222–36. doi: 10.1016/j.tips.2023.01.003

PubMed Abstract | Crossref Full Text | Google Scholar

4. Cañellas-Socias A, Sancho E, and Batlle E. Mechanisms of metastatic colorectal cancer. Nat Rev Gastroenterol Hepatol. (2024) 21:609–25. doi: 10.1038/s41575-024-00934-z

PubMed Abstract | Crossref Full Text | Google Scholar

5. Patel RK, Rahman S, Schwantes IR, Bartlett A, Eil R, Farsad K, et al. Updated management of colorectal cancer liver metastases: scientific advances driving modern therapeutic innovations. Cell Mol Gastroenterol Hepatol. (2023) 16:881–94. doi: 10.1016/j.jcmgh.2023.08.012

PubMed Abstract | Crossref Full Text | Google Scholar

6. Griffiths CD, Karanicolas P, Gallinger S, Wei AD, Francescutti V, and Serrano PE. Health-related quality of life following simultaneous resection for synchronous colorectal cancer liver metastases. Ann Surg Oncol. (2023) 30:1331–8. doi: 10.1245/s10434-022-12696-6

PubMed Abstract | Crossref Full Text | Google Scholar

7. Cañellas-Socias A, Cortina C, Hernando-Momblona X, Palomo-Ponce S, Mulholland EJ, Turon G, et al. Metastatic recurrence in colorectal cancer arises from residual EMP1+ cells. Nature. (2022) 611:603–13. doi: 10.1038/s41586-022-05402-9

PubMed Abstract | Crossref Full Text | Google Scholar

8. Dueland S, Smedman TM, Syversveen T, Grut H, Hagness M, and Line PD. Long-term survival, prognostic factors, and selection of patients with colorectal cancer for liver transplant: A nonrandomized controlled trial. JAMA Surg. (2023) 158:e232932. doi: 10.1001/jamasurg.2023.2932

PubMed Abstract | Crossref Full Text | Google Scholar

9. Wong GYM, Mol B, Bhimani N, de Reuver P, Diakos C, Molloy MP, et al. Recurrence patterns predict survival after resection of colorectal liver metastases. ANZ J Surg. (2022) 92:2149–56. doi: 10.1111/ans.17835

PubMed Abstract | Crossref Full Text | Google Scholar

10. Wesdorp NJ, Kemna R, Bolhuis K, van Waesberghe JHTM, Nota IMGC, Struik F, et al. Dutch colorectal liver expert panel. Interobserver variability in CT-based morphologic tumor response assessment of colorectal liver metastases. Radiol Imaging Cancer. (2022) 4:e210105. doi: 10.1148/rycan.210105

PubMed Abstract | Crossref Full Text | Google Scholar

11. Baghdadi A, Mirpour S, Ghadimi M, Motaghi M, Hazhirkarzar B, Pawlik TM, et al. Imaging of colorectal liver metastasis. J Gastrointest Surg. (2022) 26:245–57. doi: 10.1007/s11605-021-05164-1

PubMed Abstract | Crossref Full Text | Google Scholar

12. Wang Y, Ma L, Guo H, Wang X, Ye Z, Fan S, et al. Efficiency of CT radiomics model in assessing the microsatellite instability of colorectal cancer liver metastasis. Curr Med Imaging. (2023) 20(000). doi: 10.2174/1573405620666230825113524

PubMed Abstract | Crossref Full Text | Google Scholar

13. Karagkounis G, Horvat N, Danilova S, Chhabra S, Narayan RR, Barekzai AB, et al. Computed tomography-based radiomics with machine learning outperforms radiologist assessment in estimating colorectal liver metastases pathologic response after chemotherapy. Ann Surg Oncol. (2024) 31:9196–204. doi: 10.1245/s10434-024-15373-y

PubMed Abstract | Crossref Full Text | Google Scholar

14. Wang Q, Nilsson H, Xu K, Wei X, Chen D, Zhao D, et al. Exploring tumor heterogeneity in colorectal liver metastases by imaging: Unsupervised machine learning of preoperative CT radiomics features for prognostic stratification. Eur J Radiol. (2024) 175:111459. doi: 10.1016/j.ejrad.2024.111459

PubMed Abstract | Crossref Full Text | Google Scholar

15. Chen L, Zhang P, Shen L, Zhu H, Wang Y, Xu K, et al. Adoption value of support vector machine algorithm-based computed tomography imaging in the diagnosis of secondary pulmonary fungal infections in patients with Malignant hematological disorders. Open Life Sci. (2023) 18:20220765. doi: 10.1515/biol-2022-0765

PubMed Abstract | Crossref Full Text | Google Scholar

16. Ye JY, Fang P, Peng ZP, Huang XT, Xie JZ, and Yin XY. A radiomics-based interpretable model to predict the pathological grade of pancreatic neuroendocrine tumors. Eur Radiol. (2024) 34:1994–2005. doi: 10.1007/s00330-023-10186-1

PubMed Abstract | Crossref Full Text | Google Scholar

17. Zhao J, Sun Z, Yu Y, Yuan Z, Lin Y, Tan Y, et al. Radiomic and clinical data integration using machine learning predict the efficacy of anti-PD-1 antibodies-based combinational treatment in advanced breast cancer: a multicentered study. J Immunother Cancer. (2023) 11:e006514. doi: 10.1136/jitc-2022-006514

PubMed Abstract | Crossref Full Text | Google Scholar

18. Liu J, Sun L, Zhao X, and Lu X. Development and validation of a combined nomogram for predicting perineural invasion status in rectal cancer via computed tomography-based radiomics. J Cancer Res Ther. (2023) 19:1552–9. doi: 10.4103/jcrt.jcrt_2633_22

PubMed Abstract | Crossref Full Text | Google Scholar

19. Jing HH, Hao D, Liu XJ, Cui MJ, Xue KJ, Wang DS, et al. Development and validation of a radiopathomics model for predicting liver metastases of colorectal cancer. Eur Radiol. (2025) 35:3409–17. doi: 10.1007/s00330-024-11198-1

PubMed Abstract | Crossref Full Text | Google Scholar

20. Rocca A, Brunese MC, Santone A, Avella P, Bianco P, Scacchi A, et al. Early diagnosis of liver metastases from colorectal cancer through CT radiomics and formal methods: A pilot study. J Clin Med. (2021) 11:31. doi: 10.3390/jcm11010031

PubMed Abstract | Crossref Full Text | Google Scholar

21. Miyamoto Y, Nakaura T, Ohuchi M, Ogawa K, Kato R, Maeda Y, et al. Radiomics-based machine learning approach to predict chemotherapy responses in colorectal liver metastases. J Anus Rectum Colon. (2025) 9:117–26. doi: 10.23922/jarc.2024-077

PubMed Abstract | Crossref Full Text | Google Scholar

22. Saber R, Henault D, Messaoudi N, Rebolledo R, Montagnon E, Soucy G, et al. Radiomics using computed tomography to predict CD73 expression and prognosis of colorectal cancer liver metastases. J Transl Med. (2023) 21:507. doi: 10.1186/s12967-023-04175-7

PubMed Abstract | Crossref Full Text | Google Scholar

23. Xing Q, Cui Y, Liu M, Gu XL, Li XT, Xing BC, et al. Preoperative CT-based morphological heterogeneity for predicting survival in patients with colorectal cancer liver metastases after surgical resection: a retrospective study. BMC Med Imaging. (2024) 24:343. doi: 10.1186/s12880-024-01524-w

PubMed Abstract | Crossref Full Text | Google Scholar

24. Tsilimigras DI, Ntanasis-Stathopoulos I, and Pawlik TM. Molecular mechanisms of colorectal liver metastases. Cells. (2023) 12:1657. doi: 10.3390/cells12121657

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wang Y, Zhong X, He X, Hu Z, Huang H, Chen J, et al. Liver metastasis from colorectal cancer: pathogenetic development, immune landscape of the tumour microenvironment and therapeutic approaches. J Exp Clin Cancer Res. (2023) 42:177. doi: 10.1186/s13046-023-02729-7

PubMed Abstract | Crossref Full Text | Google Scholar

26. He Y, Han Y, Fan AH, Li D, Wang B, Ji K, et al. Multi-perspective comparison of the immune microenvironment of primary colorectal cancer and liver metastases. J Transl Med. (2022) 20:454. doi: 10.1186/s12967-022-03667-2

PubMed Abstract | Crossref Full Text | Google Scholar

27. Niu Y, Yang W, Qian H, and Sun Y. Intracellular and extracellular factors of colorectal cancer liver metastasis: a pivotal perplex to be fully elucidated. Cancer Cell Int. (2022) 22:341. doi: 10.1186/s12935-022-02766-w

PubMed Abstract | Crossref Full Text | Google Scholar

28. Simpson AL, Peoples J, Creasy JM, Fichtinger G, Gangai N, Keshavamurthy KN, et al. Preoperative CT and survival data for patients undergoing resection of colorectal liver metastases. Sci Data. (2024) 11:172. doi: 10.1038/s41597-024-02981-2

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zhou TH, Zhou XX, Ni J, Ma YQ, Xu FY, Fan B, et al. CT whole lung radiomic nomogram: a potential biomarker for lung function evaluation and identification of COPD. Mil Med Res. (2024) 11:14. doi: 10.1186/s40779-024-00516-9

PubMed Abstract | Crossref Full Text | Google Scholar

30. Zhou T, Guan Y, Lin X, Zhou X, Mao L, Ma Y, et al. CT-based whole lung radiomics nomogram for identification of PRISm from non-COPD subjects. Respir Res. (2024) 25:329. doi: 10.1186/s12931-024-02964-2

PubMed Abstract | Crossref Full Text | Google Scholar

31. An SX, Yu ZJ, Fu C, Wei MJ, and Shen LH. Biological factors driving colorectal cancer metastasis. World J Gastrointest Oncol. (2024) 16:259–72. doi: 10.4251/wjgo.v16.i2.259

PubMed Abstract | Crossref Full Text | Google Scholar

32. Shang Y, Zeng Y, Luo S, Wang Y, Yao J, Li M, et al. Habitat imaging with tumoral and peritumoral radiomics for prediction of lung adenocarcinoma invasiveness on preoperative chest CT: a multicenter study. AJR Am J Roentgenol. (2024) 223:e2431675. doi: 10.2214/AJR.24.31675

PubMed Abstract | Crossref Full Text | Google Scholar

33. Qin S, Lu S, Liu K, Zhou Y, Wang Q, Chen Y, et al. Radiomics from mesorectal blood vessels and lymph nodes: a novel prognostic predictor for rectal cancer with neoadjuvant therapy. Diagnostics (Basel). (2023) 13:1987. doi: 10.3390/diagnostics13121987

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: computed tomography, radiomics, colorectal neoplasms, liver, neoplasm metastasis

Citation: Zhang D, Li P, Wei Y, Xue M, Guo F and Li C (2025) Predicting the recurrence risk of liver metastasis from colorectal cancer: a study based on preoperative CT intratumoral and peritumoral radiomics features. Front. Oncol. 15:1662354. doi: 10.3389/fonc.2025.1662354

Received: 09 July 2025; Accepted: 08 September 2025;
Published: 23 September 2025.

Edited by:

Laura Curiel, University of Calgary, Canada

Reviewed by:

Özgür Akgül, Ankara City Hospital, Türkiye
Fatma Alshohoumi, Sultan Qaboos University, Oman

Copyright © 2025 Zhang, Li, Wei, Xue, Guo and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dongying Zhang, emR5MTEyMnlpbmdAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.