- Department of Radiology, The First Affiliated Hospital of Xinxiang Medical University, Weihui, China
Objective: This study aims to explore the value of predicting the recurrence risk of colorectal cancer liver metastasis (CRLM) based on preoperative CT intratumoral and peritumoral radiomics features.
Methods: This study utilized retrospectively collected preoperative CT data of 201 CRLM patients, comprising 145 cases from the hospital one and 56 cases from an external hospital two. Liver metastases were precisely segmented via manual annotation. Subsequently, the intratumoral region of interest (ROIIntra) was isotropically dilated to radii of 2 mm, 4 mm, and 6 mm, resulting in peri-tumoral ROIs (ROIPeri2mm, ROIPeri4mm and ROIPeri6mm). We established the prediction models based on support vector machine (SVM), random forest (RF), and multilayer perceptron (MLP) algorithms. The area under the subject operating characteristic curve (AUC) was used to evaluate the predictive performance.
Results: Compared with SVM and RF, MLP demonstrated superior predictive performance for estimating the recurrence risk of CRLM patients. The best radiomics signatures for predicting the recurrence risk of CRLM were ROIIntra+Peri4mm model, and the AUCs of the ROIIntra model, ROIIntra+Peri2mm model, ROIIntra+Peri4mm model, and ROIIntra+Peri6mm model constructed by MLP are 0.758 (95% confidence interval (CI), 0.621 - 0.865), 0.815 (95% CI, 0.684 - 0.908), 0.855 (95% CI, 0.731 - 0.936), and 0.825 (95% CI, 0.696 - 0.915), respectively, in the external test set.
Conclusion: Preoperative CT-based radiomics features extracted from intra-tumoral (ROIIntra) and peritumoral (ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm) regions can effectively predict recurrence risk in CRLM patients.
Introduction
Colorectal cancer ranks as the third most prevalent cancer worldwide, accounting for approximately 10% of all cancer cases, and represents the second leading cause of cancer-related mortality globally (1, 2). Approximately 25% of colorectal cancer patients develop liver metastases, for which hepatic resection remains the primary curative treatment (3, 4). Despite advances in surgical techniques and medical oncology that have increased resectability rates, multiple studies have demonstrated that the pooled recurrence rate after liver resection remains alarmingly high at 60-80%, constituting the leading cause of mortality in these patients (5, 6). Precise prediction of recurrence risk is therefore critically important for developing personalized therapeutic strategies and accurate prognostic assessments in patients with colorectal liver metastases (CRLM) (7–9). Clinically, the diagnostic utility of liver biopsy is often constrained by its invasive nature, the potential for procedural complications, and the elevated risk of false-negative results attributable to inadequate tissue sampling or sampling errors. As a routine imaging modality for patients with CRLM, preoperative computed tomography (CT) not only provides detailed anatomical information, including the size, location, and morphology of hepatic lesions, but also captures subtle, indirect indicators of tumor biology (10, 11). These microscale features, imperceptible to the human eye, can be systematically extracted by radiomics through the identification of intratumoral patterns, textural heterogeneities, and spatial relationships within CT images, ultimately transforming them into quantifiable and mineable data for enhanced diagnostic and prognostic insights (12–14). Support vector machine (SVM), random forest (RF), and multilayer perceptron (MLP), as established machine learning algorithms with robust feature extraction capabilities, enable the development of predictive models for CRLM recurrence risk (15–18). By integrating radiomics - based CT feature extraction with SVM, RF, and MLP modeling frameworks, this approach offers a novel, non-invasive solution for personalized risk stratification in CRLM management.
Current radiomics - based studies on preoperative CT imaging for CRLM have predominantly focused on risk prediction (19, 20), chemotherapy response (12, 21), and prognosis (22, 23). However, there remains a critical gap in the development of predictive models for postoperative recurrence risk, which constitutes a major determinant of long-term survival in this patient population. Additionally, the evolution and progression of tumors are affected by the interactions between cells within the tumor and constituents in the peritumoral region (24). Previous research indicates that tumors are composed not only of malignant cells but also stromal components, immune elements and inflammatory elements. These factors induce the stromal remodeling, creating a microenvironment conducive to tumor progression (25). The tumor necrosis factor signaling pathway, which is connected to abnormal blood - vessel formation driven by cancer cells, as well as invasion and metastasis, is related to the features of the peritumoral area (26, 27). In light of this, both the tumor’s intratumoral and peritumoral microenvironments are likely reservoirs of pivotal biological signals and CRLM recurrence indicators.
In this study, we develop and validate a radiomics - based recurrence risk prediction model for CRLM using advanced machine learning algorithms, including SVM, RF, and MLP, applied to preoperative CT imaging. By integrating quantitative imaging features extracted from both intratumoral and peritumoral regions, with the optimal peritumoral area associated with recurrence risk identified, our approach bridges the current gap in postoperative recurrence prediction. Furthermore, by incorporating clinical parameters, we establish a novel, non-invasive prognostic tool to guide postoperative surveillance and personalized treatment strategies, ultimately improving clinical outcomes for CRLM patients.
Materials and methods
Patient collective
For this retrospective, multiple-centers, IRB-approved study, 201 patients with proven colorectal cancer and CRLM were identified, including 148 CRLM patients from hospital one and 53 patients from hospital two (Table 1). The ethics committee of this institution approved the study and waived informed consent. Inclusion criteria: 1). Pathologically confirmed CRLM; 2). Availability of histopathological reports for both liver tumor and non-tumorous hepatic parenchyma; 3). Preoperative portal venous phase contrast-enhanced CT imaging acquired within 6 weeks prior to hepatic resection; 4). The follow-up duration is at least 24 months. The follow-up process involved regular clinical evaluations, including serum tumor marker testing and imaging assessments every year. Exclusion criteria: 1). Preoperative hepatic arterial infusion chemotherapy; 2). Prior local tumor ablation therapy or more than three wedge resections of the liver; 3). No visible tumor on preoperative imaging. The flowchart of the patient selection process is presented in Figure 1. The clinical information available in the datasets included: gender, age, body mass index at operation, carcinoembryonic antigen test, lymph node status, colon primary status, presence of multiple lobes, presence of major comorbidity, maximum tumor size and the liver recurrence status. Recurrence was defined based on a combination of imaging and pathological evidence. Detection of new lesions on cross-sectional imaging with typical radiological features of metastatic disease, e.g., enhancing hepatic nodules, extrahepatic masses, that were not present at baseline and persisted or enlarged on subsequent scans. Histopathological verification of malignant cells from biopsy or surgical resection of suspected recurrent lesions, which served as the gold standard when available.
Imaging protocols
All study participants underwent a standardized contrast-enhanced CT scan according to predefined imaging protocols. Abdominal imaging was performed using a multidetector CT scanner (Lightspeed 16 and VCT; GE Healthcare, Madison, WI, USA) with the following key settings: autoMA ranged from 220 to 380, a noise index of 12 to 14, a rotation time of 0.7 to 0.8 milliseconds, and a scan delay of 80 seconds (28).
Intratumoral segmentation and peritumoral dilation
Intratumoral segmentation was performed on CT images using the ITK-SNAP software (Version 3.60, http://www.itk-snap.org). Two radiologists with 8 and 10 years of experience in abdominal imaging diagnosis, respectively, independently carried out the segmentation process. If the region of interest (ROI) defined by two radiologists showed a discrepancy ≥5%, a senior radiologist with two decades of expertise conducted re-segmentation to finalize the ROIs. The segmented ROI served as the ROIIntra, which was expanded by 2 mm, 4 mm, and 6 mm into the peritumoral region using standard morphological operations, yielding ROIPeri2mm, ROIPeri4mm and ROIPeri6mm, respectively. The ROIPeri k mm excluded skin, air, and muscles.
Data preprocessing
Prior to the extraction of radiomics features, all CT images underwent resampling to achieve a uniform voxel resolution of 1×1×1 mm³. Subsequently, the intensity histograms of the images were discretized using a bin width of 25, ensuring consistent and standardized feature extraction across the dataset. This preprocessing step is crucial for maintaining the comparability and reliability of the extracted radiomics features.
Radiomics feature extraction
Radiomics features were extracted from the ROI using the open-source software PyRadiomics (Version 2.20, https://github.com/Radiomics/pyradiomics). A comprehensive set of 12 filters, including Original, AdditiveGaussianNoise, Binomial, Normalize, LaplacianSharpening, CurvatureFlow, wavelet, ShotNoise, BoxMean, LoG, DiscreteGaussian, and BoxSigmaImage, were applied to enhance the extraction of radiomics features. A total of 201 patients in this study yielded 1197 high-dimensional radiomics features. To reduce the complexity or bias inherent in the radiomics feature set, dimensionality reduction techniques were employed. The primary objective of feature dimensionality reduction is to simplify the feature space while retaining the most informative features. In the training dataset, we first performed feature dimensionality reduction using the least absolute shrinkage and selection operator (LASSO) regression analysis, with the regularization parameter α set to 0.001 (29). Subsequently, the top 15 features with the highest correlation were selected based on the maximum relevance and minimum redundancy (mRMR) method (30). These methods help in selecting the most discriminative features, thereby improving the performance of subsequent predictive models.
Model construction
Using radiomics features extracted from ROIIntra and its peritumoral extensions (ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm), predictive models were developed using machine learning algorithms, including SVM, RF, and MLP, to predict recurrence risk in CRLM. The radiomics workflow is shown in Figure 2. The details of these classifiers were shown in Table 2.

Figure 2. Radiomics analysis and machine learning workflow for predicting the recurrence risk in CRLM patients.
Statistical analysis
In the analysis of count data, comparisons between groups were performed using the Chi-square test, which is appropriate for categorical variables. For continuous data, comparisons between groups were conducted using either the Mann-Whitney U test or the independent samples t-test, depending on the normality assumptions of the data. The performance of the predictive models was evaluated using several metrics, including the area under the subject operating characteristic curve (AUC), sensitivity, and specificity. Decision curve analysis (DCA) and calibration curve were employed to independently evaluate the stability and clinical net benefit of the predictive model, respectively. These metrics provide a comprehensive assessment of the model’s ability to distinguish between different outcomes. All statistical analyses were conducted using the R software (Version 4.3.3). A significance level of P < 0.05 was adopted to indicate statistically significant differences.
Results
Statistical analysis of clinical features
A total of 114 patients with recurrent CRLM and 87 patients without recurrence were included. Among the clinical characteristics, significant differences were observed between the two groups in terms of presence of multiple lobes and the maximum tumor diameter. Consequently, presence of multiple lobes and the maximum tumor diameter were incorporated as clinical indicators into the predictive model (Table 1). The two variables were selected due to their statistical significance, suggesting their potential importance in influencing the recurrence risk of CRLM. Including these clinical indicators enhances the model’s ability to accurately predict recurrence by accounting for relevant patient-specific factors.
Radiomics feature selection and predictive performance
From the CT-based ROIs (ROIIntra, ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm), 15 radiomics features were ultimately selected based on their significant association with recurrence risk. The extracted radiomics features by ROIIntra, ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm, which are presented in Tables 3, 4, 5, and 6. The results of recurrence risk prediction for CRLM patients, achieved by integrating radiomics features with clinical characteristics and constructing predictive models using SVM, RF, and MLP, are presented in Table 3, 4. The predictive performance of the three machine learning models (SVM, RF, and MLP) improved consistently when incorporating peritumoral regions into the radiomics signature. The optimal performance was observed with the MLP model using ROIIntra+Peri4mm, achieving the highest AUC of 0.905 (95% CI: 0.846 - 0.947) in the training set and 0.855 (95% CI: 0.731 - 0.936) in the test set. The significance level P for all nine models was less than 0.0001 (Table 7).

Table 7. Performance metrics SVM, RF, and MLP models across different ROI radiomics signature in the training set.
The ROC curves (Figure 3) illustrate that models constructed using CT intratumoral and peritumoral radiomics features are effective in predicting the recurrence risk of CRLM, with all models achieving AUC values exceeding 0.735 in the test set. Based on the H-L test, the P-values for the ROIIntra model, ROIIntra+Peri2mm model, ROIIntra+Peri4mm model, and ROIIntra+Peri6mm model by SVM compared to actual observations were 0.2262, 0.2298, 0.3631, and 0.0110, respectively. The P-values for the ROIIntra model, ROIIntra+Peri2mm model, ROIIntra+Peri4mm model, and ROIIntra+Peri6mm model by RF compared to actual observations were 0.4532, 0.3038, 0.3190, and 0.1586, respectively. The P-values for the ROIIntra model, ROIIntra+Peri2mm model, ROIIntra+Peri4mm model, and ROIIntra+Peri6mm model by MLP compared to actual observations were 0.3068, 0.3845, 0.8556, and 0.4218, respectively. The decision curve analysis evaluated the clinical utility of all the predictive models (ROIIntra, ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm) across a range of threshold probabilities (Figure 4). The calibration curves assessed the agreement between predicted probabilities and observed outcomes (Figure 5). The results demonstrated that all models provided net benefit, indicating their potential clinical applicability for predicting recurrence risk in CRLM patients (Table 8).

Figure 3. ROC of the predictive models constructed by different radiomics signatures in the test set. (A) ROIIntra, (B) ROIIntra+Peri2mm, (C) ROIIntra+Peri4mm, (D) ROIIntra+Peri6mm.

Figure 4. DCA of the predictive models constructed by different radiomics signatures in the test set. (A) ROIIntra, (B) ROIIntra+Peri2mm, (C) ROIIntra+Peri4mm, (D) ROIIntra+Peri6mm.

Figure 5. Calibration curve of the predictive models constructed by different radiomics signatures in the test set. (A) ROIIntra, (B) ROIIntra+Peri2mm, (C) ROIIntra+Peri4mm, (D) ROIIntra+Peri6mm.

Table 8. Performance metrics SVM, RF, and MLP models across different ROI radiomics signature in the test set.
Discussion
Radiomics - based analyses of preoperative CT imaging have been extensively validated for predicting metastatic risk, chemotherapy response, and prognosis in CRLM patients. For example, a study by Jing et al. showed that a radiomics model based on CT for preoperative prediction of liver metastases after surgery for colorectal cancer, with AUC of 0.761 in the test set (19). Karagkounis et al. constructed the radiomics prediction model using CT features of 85 Patients who underwent resection for CRLM, and the results showed that this model can provide a valuable reference for pathological response assessment (13). However, the potential of radiomics for recurrence risk stratification remains relatively underexplored in this patient population. This knowledge gap is particularly concerning given that recurrence patterns in CRLM are highly heterogeneous, with liver metastases and local hepatic recurrences exhibiting distinct clinical behaviors. Meanwhile, existing studies predominantly focus on intralesional features, overlooking the recurrence value of peritumoral texture patterns that may reflect microenvironmental changes associated with tumor dissemination. To address these gaps, our study systematically evaluated the predictive performance of radiomics models for recurrence risk by constructing separate SVM, RF, and MLP algorithms using four distinct radiomics feature sets derived from the intra-tumoral ROI (ROIIntra) and its peritumoral extensions (ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm).
Biologically, the peritumoral region plays a critical role in tumor progression, as it is where cancer cells interact with stromal tissue, immune cells, and blood vessels, processes that drive local invasion and recurrence (13). Studies on hepatic metastases have shown that pathological changes extend beyond the visible tumor boundary, with distinct biological signatures observed at varying distances from the tumor edge (31). Specifically, the 2mm zone primarily reflects immediate tumor-stroma interactions, including early invasive activity and extracellular matrix remodeling. The 4mm zone captures broader paracrine effects and immune responses that mediate tumor survival and spread. The 6mm zone encompasses more distant microenvironmental changes, such as hepatic sinusoidal remodeling, which can facilitate micrometastasis formation. For instance, Shang et al. predicted the invasiveness of lung adenocarcinoma by analyzing radiomic features extracted from both tumor cores and 4mm peritumoral regions on CT imaging (32). Qin et al. established through MRI analysis that systematic evaluation of peritumoral ROIs expanded by 2mm, 4mm, and 6mm beyond tumor margins provides clinically significant predictive value for assessing pathological treatment response in locally advanced rectal cancer patients following neoadjuvant chemoradiotherapy (33). Clinically, these distances align with previous radiomic studies on malignancies, where 2 - 6mm peritumoral regions have been associated with recurrence risk and treatment response. We selected 2mm, 4mm, and 6mm to span this clinically relevant range, allowing us to capture both proximal and distal microenvironmental influences on recurrence.
The extraction of radiomics features is based on the principles of quantitative imaging analysis, which systematically quantifies the spatial and intensity distributions of voxel patterns within medical images. These features capture tumor heterogeneity and microenvironmental characteristics, thereby possessing significant biological relevance in medical research. As shown in Table 3, the first column displays the categories corresponding to the radiomics features, the second column lists the names of the selected features, and the third column presents the mRMR correlation coefficients between the radiomics features and the recurrence risk. We have analyzed the biological significance of the 3 radiomics features that are most strongly associated with the risk of recurrence. Original_glrlm_LongRunHighGrayLevelEmphasis measures the distribution of long contiguous pixel runs with high gray-level intensity values. The high value may correspond to highly ordered proliferative regions within the tumor or high cell density zones around necrotic areas, and is associated with the degree of tumor differentiation. Original_firstorder_Mean represents arithmetic mean of all pixel intensities within the tumor ROI. Low mean values may indicate necrotic areas, while high values suggest vascularized tumor regions. Original_shape_Sphericity measures how closely the tumor shape approximates a perfect sphere. Lower sphericity correlates with infiltrative growth patterns and desmoplastic reaction.
The radiomics features extracted by ROIIntra+Peri2mm are shown in Table 4. The top 3 radiomic features most associated with the risk of recurrence are wavelet-LLH_glcm_Correlation, wavelet-LLH_glcm_DifferenceEntropy, wavelet-LLH_gldm_DependenceNonUniformity, respectively. Wavelet-LLH_glcm_Correlation quantifies how correlated a pixel is to its neighbor in the LLH wavelet space. High values indicate organized tissue structures, e.g., regular tumor stroma, and low values suggest chaotic tissue patterns. Wavelet-LLH_glcm_DifferenceEntropy calculates the entropy of gray-level difference distribution. Higher values indicate more heterogeneous tissue patterns. Wavelet-LLH_gldm_DependenceNonUniformity measures the variability of gray-level dependencies in local neighborhoods. It reflects complex microenvironment interactions in the CRLM tumor.
The radiomics features extracted by ROIIntra+Peri4mm are shown in Table 5. The top 3 radiomic features most associated with the risk of recurrence are wavelet-HHL_glszm_HighGrayLevelZoneEmphasis,wavelet-HHL_glszm_LargeAreaEmphasis, wavelet-HHL_glszm_ZoneEntropy, respectively. Wavelet-HHL_glszm_HighGrayLevel ZoneEmphasis measures the relative distribution of high gray-level zones and emphasizes zones with higher gray-level values within the HHL wavelet-transformed image space. It may correlate with regions of active tumor metabolism or hypervascularized subregions. Wavelet-HHL_glszm_LargeAreaEmphasis quantifies the distribution of large homogeneous zones and emphasizes larger zone sizes within the HHL wavelet space. It may relate to areas of tumor stability or defined growth patterns. Wavelet-HHL_glszm_ZoneEntropy assesses the randomness or disorder in the distribution of zone sizes throughout the image, specifically within the HHL wavelet-transformed space. High values suggest a more irregular and heterogeneous distribution of tumor zones, indicating complex microenvironmental interactions or varied cellular compositions. Low values imply a more uniform and organized arrangement of tumor zones.
The radiomics features extracted by ROIIntra+Peri6mm are shown in Table 6. The top 3 radiomic features most associated with the risk of recurrence are wavelet-LLL_glrlm_GrayLevelNonUniformityNormalized, wavelet-LLL_glrlm_LongRunEmphasis, log-sigma-5-mm-3D_glcm_JointEntropy, respectively. Wavelet-LLL_glrlm_GrayLevelNonUniformityNormalized quantifies gray-level distribution uniformity within runs. It may be related to infiltrative growth patterns with irregular cellular density. Wavelet-LLL_glrlm_LongRunEmphasis detects large-scale and spatially coherent tissue regions in LLL-filtered images. Higher values may correlate with well-organized tumor architectures. Log-sigma-5-mm-3D_glcm_JointEntropy enhances edges at 5 mm resolution and captures mid-range heterogeneity.
The inclusion of peritumoral regions (Peri2mm, Peri4mm, Peri6mm) consistently improved model performance compared to intralesional features alone (ROIIntra). For example, in the training set, MLP’s AUC increased from 0.872 (ROIIntra) to 0.929 (ROIIntra+Peri4mm), while in the test set, the best-performing model (MLP with ROIIntra+Peri4mm) achieved an AUC of 0.855, compared to 0.735 for SVM with ROIIntra alone. The performance of recurrence risk prediction models improved significantly with the inclusion of peritumoral radiomics features, confirming that tumor-stromal interactions in the peritumoral microenvironment contribute to metastatic progression. Among the evaluated radiomic regions - ROIIntra, ROIIntra+Peri2mm, ROIIntra+Peri4mm, and ROIIntra+Peri6mm demonstrated the best balance between sensitivity and specificity across all machine learning models. Although the SVM model demonstrated a marginal improvement in AUC (+0.002) when expanding the peritumoral region from ROIIntra+Peri4mm to ROIIntra+Peri6mm in the test set, both the RF and MLP models exhibited superior performance with the smaller ROIIntra+Peri4mm signature. This suggests that, for these algorithms, the inclusion of excessive peritumoral information may introduce noise or redundancy, thereby diminishing predictive accuracy. The optimal peritumoral expansion size appears to be context-dependent, with smaller regions (4 mm) potentially striking a better balance between feature informativeness and model generalizability. In contrast, the minimal peritumoral inclusion (ROIIntra+Peri2mm) failed to capture sufficient prognostic information, underscoring the importance of an optimized radiomic capture radius. ,
Machine learning model selection significantly influenced predictive accuracy. The selection of SVM, RF, and MLP was based on their distinct strengths and wide applicability in radiomic research. SVM was used for its robustness in handling high-dimensional data and its ability to find optimal hyperplanes for classification, which is valuable given the high dimensionality of our radiomic feature set. RF was selected due to its superiority in capturing non-linear relationships and interactions between features, as well as its built-in feature importance evaluation. MLP was included to account for potential complex, non-linear patterns in the data that traditional statistical models might miss, leveraging its capability to model hierarchical feature representations. The performance of SVM, RF, and MLP models varied significantly across different radiomics signatures, both in the training and test sets. MLP demonstrated superior performance in both training and testing, particularly when incorporating peritumoral features, likely due to its ability to model hierarchical feature interactions. RF showed consistent performance across signatures but generally lagged behind MLP, suggesting that ensemble methods may not fully exploit the radiomics feature space in this context. SVM, while computationally efficient, exhibited the greatest performance decline in testing (e.g., AUC dropped from 0.893 to 0.806 for ROIIntra+Peri4mm), highlighting its vulnerability to overfitting in high-dimensional radiomics data. The superior performance of the MLP can be attributed to its unique ability to capture the complex, non-linear relationships and hierarchical feature interactions inherent in our radiomic dataset, particularly the combined intratumoral and peritumoral features. Unlike SVM, which focuses on finding optimal hyperplanes for binary classification, or RF, which relies on ensemble decision trees, MLP’s multi-layered neural network structure allows it to model intricate patterns across high-dimensional radiomic features. In our dataset, recurrence risk is influenced by a confluence of factors, including tumor size, margin status, and peritumoral inflammatory changes, features that manifest as non-linear associations in imaging data. Additionally, the MLP’s capacity for incremental learning allowed it to adapt to the subtle variations in imaging protocols between our training and external validation cohorts, contributing to its stable performance across both datasets.
In real-world clinical settings, for patients newly diagnosed with CRLM, the model can predict the risk of postoperative recurrence before surgery. For instance, patients at high recurrence risk may require more aggressive treatment strategies, such as expanding the scope of liver resection, combining with radiofrequency ablation, or administering preoperative neoadjuvant chemotherapy to reduce the activity of micrometastases. In contrast, patients at low risk can adopt more conservative surgical approaches, such as local hepatic segment resection, to avoid the risk of complications caused by overtreatment. Postoperative follow-up is crucial for reducing the mortality rate of recurrent CRLM. This study can assist physicians in formulating differentiated follow-up plans for patients with different risk stratifications. For example, patients at high risk need shorter follow-up intervals and should be prioritized for more sensitive monitoring methods, patients at low risk can have extended follow-up cycles, which reduces unnecessary medical interventions and the economic burden on patients while optimizing the allocation of medical resources.
To promote the transformation of the model from the research stage to a routine clinical tool, the following steps need to be completed in phases: 1) The current model is built based on data from two centers. The next step will involve conducting external validation in centers across different regions and with varying levels of medical resources to verify the stability of the model under different CT equipment parameters. For any biases identified during validation, the model’s adaptability can be optimized through standardized image preprocessing or transfer learning algorithms. 2) The model needs to be embedded into the hospital’s existing information systems to realize an automated process of image upload, feature extraction, and risk score generation. 3) Multidisciplinary training is required to help medical staff understand the model’s principles, scope of application, and limitations, thereby avoiding over-reliance on the model or misjudgment of risks. 4) To promote the model as a clinical decision-making tool, it is necessary to complete the application process in accordance with medical device regulatory requirements and provide data on its efficacy and safety. The goal is to gradually transition the model from a research tool to a routine clinical auxiliary method, ultimately supporting the individualized management of CRLM patients and improving their prognosis.
Our study has several limitations that warrant consideration. First, the retrospective design inherently introduces potential biases, such as selection bias in patient enrollment and variability in clinical data collection across different time periods. These factors may constrain the generalizability of the model’s predictive performance, as the retrospective setting cannot fully simulate real-world clinical scenarios where patient management is dynamic and influenced by evolving clinical practices. Second, while we performed external validation to test the model’s robustness, the sample size of the external cohort was relatively small, which limits the statistical power to detect subtle differences in predictive accuracy. A larger external validation cohort, ideally from multiple centers with diverse patient populations and clinical practices, would be necessary to confirm the model’s scalability and adaptability across different healthcare settings. Additionally, the absence of molecular genetic data such as KRAS or BRAF mutation in the dataset precluded deeper exploration of the biological underpinnings of recurrence risk, thereby restricting the study’s translational impact. To address these limitations, we plan to conduct prospective, multi-center studies with larger sample sizes in the future. These studies will standardize data collection protocols to minimize bias, and incorporate comprehensive molecular profiling to bridge radiomic features with their biological bases. Expanding the external validation cohort to include more diverse patient populations will further enhance the model’s clinical applicability. Through these efforts, we aim to refine the model and strengthen its potential for translation into clinical practice.
Conclusion
In conclusion, MLP - based models using ROIIntra+Peri4mm radiomics signatures may offer the best trade-off between predictive accuracy and generalization for recurrence risk stratification in CRLM. This study is expected to have a positive impact on the level of personalized diagnosis and treatment for CRLM patients, as well as on the accuracy of predicting recurrence risk, ultimately enhancing patients’ survival benefits.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the First Affiliated Hospital of Xinxiang Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
DZ: Writing – original draft, Writing – review & editing. PL: Formal analysis, Data curation, Writing – review & editing. YW: Writing – review & editing. CL: Writing – review & editing. MX: Writing – review & editing. FG: Software, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research and/or publication of this article. The Key Project of Henan Province Medical Science and Technology Project (Nos. LHGJ20210512).
Acknowledgments
We acknowledge the support received from Henan Province Medical Science and Technology Project.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, and Wallace MB. Colorectal cancer. Lancet. (2019) 394:1467–80. doi: 10.1016/S0140-6736(19)32319-0
2. Abedizadeh R, Majidi F, Khorasani HR, Abedi H, and Sabour D. Colorectal cancer: a comprehensive review of carcinogenesis, diagnosis, and novel strategies for classified treatments. Cancer Metastasis Rev. (2024) 43:729–53. doi: 10.1007/s10555-023-10158-3
3. Shin AE, Giancotti FG, and Rustgi AK. Metastatic colorectal cancer: mechanisms and emerging therapeutics. Trends Pharmacol Sci. (2023) 44:222–36. doi: 10.1016/j.tips.2023.01.003
4. Cañellas-Socias A, Sancho E, and Batlle E. Mechanisms of metastatic colorectal cancer. Nat Rev Gastroenterol Hepatol. (2024) 21:609–25. doi: 10.1038/s41575-024-00934-z
5. Patel RK, Rahman S, Schwantes IR, Bartlett A, Eil R, Farsad K, et al. Updated management of colorectal cancer liver metastases: scientific advances driving modern therapeutic innovations. Cell Mol Gastroenterol Hepatol. (2023) 16:881–94. doi: 10.1016/j.jcmgh.2023.08.012
6. Griffiths CD, Karanicolas P, Gallinger S, Wei AD, Francescutti V, and Serrano PE. Health-related quality of life following simultaneous resection for synchronous colorectal cancer liver metastases. Ann Surg Oncol. (2023) 30:1331–8. doi: 10.1245/s10434-022-12696-6
7. Cañellas-Socias A, Cortina C, Hernando-Momblona X, Palomo-Ponce S, Mulholland EJ, Turon G, et al. Metastatic recurrence in colorectal cancer arises from residual EMP1+ cells. Nature. (2022) 611:603–13. doi: 10.1038/s41586-022-05402-9
8. Dueland S, Smedman TM, Syversveen T, Grut H, Hagness M, and Line PD. Long-term survival, prognostic factors, and selection of patients with colorectal cancer for liver transplant: A nonrandomized controlled trial. JAMA Surg. (2023) 158:e232932. doi: 10.1001/jamasurg.2023.2932
9. Wong GYM, Mol B, Bhimani N, de Reuver P, Diakos C, Molloy MP, et al. Recurrence patterns predict survival after resection of colorectal liver metastases. ANZ J Surg. (2022) 92:2149–56. doi: 10.1111/ans.17835
10. Wesdorp NJ, Kemna R, Bolhuis K, van Waesberghe JHTM, Nota IMGC, Struik F, et al. Dutch colorectal liver expert panel. Interobserver variability in CT-based morphologic tumor response assessment of colorectal liver metastases. Radiol Imaging Cancer. (2022) 4:e210105. doi: 10.1148/rycan.210105
11. Baghdadi A, Mirpour S, Ghadimi M, Motaghi M, Hazhirkarzar B, Pawlik TM, et al. Imaging of colorectal liver metastasis. J Gastrointest Surg. (2022) 26:245–57. doi: 10.1007/s11605-021-05164-1
12. Wang Y, Ma L, Guo H, Wang X, Ye Z, Fan S, et al. Efficiency of CT radiomics model in assessing the microsatellite instability of colorectal cancer liver metastasis. Curr Med Imaging. (2023) 20(000). doi: 10.2174/1573405620666230825113524
13. Karagkounis G, Horvat N, Danilova S, Chhabra S, Narayan RR, Barekzai AB, et al. Computed tomography-based radiomics with machine learning outperforms radiologist assessment in estimating colorectal liver metastases pathologic response after chemotherapy. Ann Surg Oncol. (2024) 31:9196–204. doi: 10.1245/s10434-024-15373-y
14. Wang Q, Nilsson H, Xu K, Wei X, Chen D, Zhao D, et al. Exploring tumor heterogeneity in colorectal liver metastases by imaging: Unsupervised machine learning of preoperative CT radiomics features for prognostic stratification. Eur J Radiol. (2024) 175:111459. doi: 10.1016/j.ejrad.2024.111459
15. Chen L, Zhang P, Shen L, Zhu H, Wang Y, Xu K, et al. Adoption value of support vector machine algorithm-based computed tomography imaging in the diagnosis of secondary pulmonary fungal infections in patients with Malignant hematological disorders. Open Life Sci. (2023) 18:20220765. doi: 10.1515/biol-2022-0765
16. Ye JY, Fang P, Peng ZP, Huang XT, Xie JZ, and Yin XY. A radiomics-based interpretable model to predict the pathological grade of pancreatic neuroendocrine tumors. Eur Radiol. (2024) 34:1994–2005. doi: 10.1007/s00330-023-10186-1
17. Zhao J, Sun Z, Yu Y, Yuan Z, Lin Y, Tan Y, et al. Radiomic and clinical data integration using machine learning predict the efficacy of anti-PD-1 antibodies-based combinational treatment in advanced breast cancer: a multicentered study. J Immunother Cancer. (2023) 11:e006514. doi: 10.1136/jitc-2022-006514
18. Liu J, Sun L, Zhao X, and Lu X. Development and validation of a combined nomogram for predicting perineural invasion status in rectal cancer via computed tomography-based radiomics. J Cancer Res Ther. (2023) 19:1552–9. doi: 10.4103/jcrt.jcrt_2633_22
19. Jing HH, Hao D, Liu XJ, Cui MJ, Xue KJ, Wang DS, et al. Development and validation of a radiopathomics model for predicting liver metastases of colorectal cancer. Eur Radiol. (2025) 35:3409–17. doi: 10.1007/s00330-024-11198-1
20. Rocca A, Brunese MC, Santone A, Avella P, Bianco P, Scacchi A, et al. Early diagnosis of liver metastases from colorectal cancer through CT radiomics and formal methods: A pilot study. J Clin Med. (2021) 11:31. doi: 10.3390/jcm11010031
21. Miyamoto Y, Nakaura T, Ohuchi M, Ogawa K, Kato R, Maeda Y, et al. Radiomics-based machine learning approach to predict chemotherapy responses in colorectal liver metastases. J Anus Rectum Colon. (2025) 9:117–26. doi: 10.23922/jarc.2024-077
22. Saber R, Henault D, Messaoudi N, Rebolledo R, Montagnon E, Soucy G, et al. Radiomics using computed tomography to predict CD73 expression and prognosis of colorectal cancer liver metastases. J Transl Med. (2023) 21:507. doi: 10.1186/s12967-023-04175-7
23. Xing Q, Cui Y, Liu M, Gu XL, Li XT, Xing BC, et al. Preoperative CT-based morphological heterogeneity for predicting survival in patients with colorectal cancer liver metastases after surgical resection: a retrospective study. BMC Med Imaging. (2024) 24:343. doi: 10.1186/s12880-024-01524-w
24. Tsilimigras DI, Ntanasis-Stathopoulos I, and Pawlik TM. Molecular mechanisms of colorectal liver metastases. Cells. (2023) 12:1657. doi: 10.3390/cells12121657
25. Wang Y, Zhong X, He X, Hu Z, Huang H, Chen J, et al. Liver metastasis from colorectal cancer: pathogenetic development, immune landscape of the tumour microenvironment and therapeutic approaches. J Exp Clin Cancer Res. (2023) 42:177. doi: 10.1186/s13046-023-02729-7
26. He Y, Han Y, Fan AH, Li D, Wang B, Ji K, et al. Multi-perspective comparison of the immune microenvironment of primary colorectal cancer and liver metastases. J Transl Med. (2022) 20:454. doi: 10.1186/s12967-022-03667-2
27. Niu Y, Yang W, Qian H, and Sun Y. Intracellular and extracellular factors of colorectal cancer liver metastasis: a pivotal perplex to be fully elucidated. Cancer Cell Int. (2022) 22:341. doi: 10.1186/s12935-022-02766-w
28. Simpson AL, Peoples J, Creasy JM, Fichtinger G, Gangai N, Keshavamurthy KN, et al. Preoperative CT and survival data for patients undergoing resection of colorectal liver metastases. Sci Data. (2024) 11:172. doi: 10.1038/s41597-024-02981-2
29. Zhou TH, Zhou XX, Ni J, Ma YQ, Xu FY, Fan B, et al. CT whole lung radiomic nomogram: a potential biomarker for lung function evaluation and identification of COPD. Mil Med Res. (2024) 11:14. doi: 10.1186/s40779-024-00516-9
30. Zhou T, Guan Y, Lin X, Zhou X, Mao L, Ma Y, et al. CT-based whole lung radiomics nomogram for identification of PRISm from non-COPD subjects. Respir Res. (2024) 25:329. doi: 10.1186/s12931-024-02964-2
31. An SX, Yu ZJ, Fu C, Wei MJ, and Shen LH. Biological factors driving colorectal cancer metastasis. World J Gastrointest Oncol. (2024) 16:259–72. doi: 10.4251/wjgo.v16.i2.259
32. Shang Y, Zeng Y, Luo S, Wang Y, Yao J, Li M, et al. Habitat imaging with tumoral and peritumoral radiomics for prediction of lung adenocarcinoma invasiveness on preoperative chest CT: a multicenter study. AJR Am J Roentgenol. (2024) 223:e2431675. doi: 10.2214/AJR.24.31675
Keywords: computed tomography, radiomics, colorectal neoplasms, liver, neoplasm metastasis
Citation: Zhang D, Li P, Wei Y, Xue M, Guo F and Li C (2025) Predicting the recurrence risk of liver metastasis from colorectal cancer: a study based on preoperative CT intratumoral and peritumoral radiomics features. Front. Oncol. 15:1662354. doi: 10.3389/fonc.2025.1662354
Received: 09 July 2025; Accepted: 08 September 2025;
Published: 23 September 2025.
Edited by:
Laura Curiel, University of Calgary, CanadaReviewed by:
Özgür Akgül, Ankara City Hospital, TürkiyeFatma Alshohoumi, Sultan Qaboos University, Oman
Copyright © 2025 Zhang, Li, Wei, Xue, Guo and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Dongying Zhang, emR5MTEyMnlpbmdAMTYzLmNvbQ==