Predicting early recurrence of colorectal cancer liver metastases: an integrative approach using radiomics and machine learning

Lin, Yanzong; Huang, Yunxia; Liu, Zhaohui; Feng, Xiaobin; Yang, Chunkang

doi:10.3389/fonc.2025.1613093

ORIGINAL RESEARCH article

Front. Oncol., 14 November 2025

Sec. Gastrointestinal Cancers: Colorectal Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1613093

This article is part of the Research Topic: Radiomics and AI-Driven Deep Learning for Cancer Diagnosis and Treatment

Yanzong Lin¹,²

Yunxia Huang³

Zhaohui Liu²

Xiaobin Feng⁴*

Chunkang Yang¹*

¹Department of Colorectal Surgery, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, China
²Department of General Surgery, First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
³Department of Radiation Oncology, First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
⁴Hepatopancreatobiliary Center, Beijing Tsinghua Changgung Hospital, Institute for Precision Medicine, Key Laboratory of Digital Intelligence Hepatology (Ministry of Education), Tsinghua University, Beijing, China

Background: The overall incidence of liver metastasis in colorectal cancer is as high as 50%, and surgery remains the only potentially curative approach for the metastatic disease. The recurrence rate of liver metastases within one year after surgery is still 60%-70% in clinical practice. Whether we can accurately predict the early recurrence of patients after surgery is one of the most important considerations in formulating the overall treatment strategy.

Methods: We combined radiomics feature extraction with machine learning classification to predict early intrahepatic recurrence from CT imaging. We constructed and systematically evaluated multiple machine learning models, validated them on a test set, and selected the model with the highest predictive accuracy.

Results: The integration of radiomics and machine learning methods demonstrated significant potential in predicting intrahepatic recurrence within one year after surgery in patients with colorectal cancer liver metastases. The Gradient Boosting, LightGBM, and Random Forest models all achieved classification accuracies (ACC) exceeding 65% across all classification tasks. Notably, the Random Forest model exhibited the best performance; while its classification accuracy was 65.52% in the imaging-only group, it increased to 75.86% when both imaging and clinical information were combined, with an area under the receiver operating characteristic curve (AUC) of 70.83%, indicating strong predictive capability. These findings suggest that these models have potential application value in supporting the diagnostic work of clinical radiologists, potentially helping to reduce workload and decrease the risk of misdiagnosis.

Conclusions: The radiomics model and the combined radiomics-clinical model have good predictive efficacy for the recurrence of colorectal cancer liver metastases within one year. They can be used to assist in the clinical stratification of postoperative patients and to identify high-risk factors for poor prognosis.

1 Introduction

Colorectal cancer (CRC) ranks as the third most frequently diagnosed cancer, with more than 1.92 million new cases and 903,800 deaths worldwide in 2022 (1), and its high mortality is mainly attributed to metastasis (2, 3). The liver is the most common site of metastasis, accounting for approximately 50% of colorectal cancer metastases (4). Liver metastasis is one of the major causes of death in CRC, with a median survival of 6.9 months in patients receiving only palliative care (5, 6). Despite decades of advances in systemic and local therapies, hepatectomy remains the only curative treatment for patients with resectable colorectal liver metastases (7, 8). However, around 30% of patients experience intrahepatic recurrence within a year after hepatectomy (9), and these patients have worse outcomes than those with late recurrence (10, 11). Therefore, accurate predictive models for early recurrence risk in patients with colorectal liver metastases are of significant clinical value.

With the rise of artificial intelligence technology, a growing number of studies apply radiomics and machine learning to predict the prognosis of colorectal cancer liver metastasis, especially recurrence (12–14). Specifically, Lam CSN et al. employed a machine learning model for post-hepatectomy prognostication in colorectal liver metastasis, which showed better predictive ability than the Fong Clinical Risk Score (12). Mühlberg A et al. established an imaging-based prediction model for 1-year survival of patients with colorectal liver metastasis, which showed better discriminative performance than clinical models (15).

This study aimed to develop and validate a predictive model that integrates radiomics features with machine learning algorithms to forecast intrahepatic recurrence within one year after surgery in patients with colorectal cancer liver metastases. We utilized CT radiomics data combined with an expert-annotated patient dataset, extracting radiomics features from tumor regions delineated by specialists on CT images. Two models were constructed: one using only radiomics features, and another that combines radiomics features with clinical information. Fifteen machine learning algorithms were employed for feature recognition and classification, and the most effective model was identified through performance comparison. This approach aims to predict the likelihood of recurrence in patients using radiomics technology. The workflow of this study is illustrated in Figure 1.

Figure 1

Flowchart illustrating a process in medical imaging and decision-making. CT images undergo ROI delineation, followed by feature extraction of shape and intensity. Feature selection is performed before machine learning analyzes the data. Clinical information is integrated, leading to clinical decisions indicating if a condition is normal with a check mark or recurrent with an exclamation mark.

Figure 1. The workflow of this study.

2 Method

2.1 Data acquisition

2.1.1 Ethical approval

All research procedures were conducted in accordance with the Declaration of Helsinki (revised in 2013). This study was approved by the Institutional Review Committee of the Clinical Oncology School of Fujian Medical University.

2.1.2 Data source and variables

The data used in this study were sourced from The Cancer Imaging Archive (TCIA), a public database dedicated to cancer research that contains medical imaging and clinical data for a variety of cases. Patients were excluded for the following reasons: (I) liver metastases assessed as unresectable, (II) inability to tolerate surgery or perioperative death, and (III) extrahepatic recurrence. Basic patient information was collected, including age, gender, major comorbidity, body mass index, regional lymph node status, multiple metastases, carcinoembryonic antigen (CEA) level, maximum tumor size, lobar involvement, and preoperative portal vein embolization. This study included a total of 197 patients with colorectal cancer liver metastasis after surgery, including 117 males (59.4%) and 80 females (40.6%); 110 patients (55.8%) were aged ≥60 years; 41 patients (20.8%) had liver metastatic tumors with a diameter of ≥5 cm; 122 patients (61.9%) received systemic chemotherapy before surgery; and 23 patients (11.7%) received portal vein embolization therapy before surgery. During the follow-up period, 122 patients experienced intrahepatic recurrence, and 37 patients experienced intrahepatic recurrence within one year after surgery (Table 1).

Table 1

Table 1. Demographic and tumor characteristics of patients with colorectal liver metastasis.

2.1.3 Follow-up criteria

The day of liver metastasis surgery was used as the starting point for follow-up, which was conducted through clinic visits and telephone contact. Early intrahepatic recurrence was defined as intrahepatic recurrence within one year after resection of liver metastases without extrahepatic recurrence. Patients with intrahepatic recurrence within one year after surgery were included in the recurrence group, and patients without it were included in the non-recurrence group. Follow-up was conducted every 3 months after surgery and included serum markers, abdominal B-mode ultrasound, abdominal CT, magnetic resonance imaging of the liver, and colonoscopy.

2.2 Data preprocessing

Prior to feature extraction, the acquired CT images were systematically preprocessed using the SimpleITK package in Python. Histogram equalization was applied first to enhance image contrast: by adjusting the grayscale distribution, it brings out details in low-contrast areas and improves the visibility and analytical quality of the images. The images were then processed with a Gaussian high-pass filter, which suppresses low-frequency noise and enhances the edges and texture features that are crucial for extracting boundary information. Here, “low-frequency noise” refers to areas in the image where the background grayscale changes slowly, such as large areas of uniform blur, brightness drift, or artifacts, which often obscure high-frequency details such as edges and textures. The resulting boundary details were added back to the original image to further enhance the representation of texture features. These preprocessing steps enhanced the expression of detail information in the CT images, thereby improving the model’s performance.
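The two preprocessing steps can be illustrated with a minimal sketch on a small 2D grayscale grid. This is plain Python rather than the SimpleITK pipeline used in the study; the function names, the kernel radius, and the sigma value are assumptions chosen for illustration, and pixel values are assumed to be integers in the range 0 to 255.

```python
import math

def equalize_histogram(image, levels=256):
    """Remap integer gray levels through the normalized cumulative histogram."""
    flat = [v for row in image for v in row]
    hist = [0] * levels
    for v in flat:
        hist[v] += 1
    cdf, running = [], 0
    for count in hist:
        running += count
        cdf.append(running)
    cdf_min = next(c for c in cdf if c > 0)
    total = len(flat)
    if total == cdf_min:  # constant image: nothing to equalize
        return [row[:] for row in image]
    scale = (levels - 1) / (total - cdf_min)
    return [[round((cdf[v] - cdf_min) * scale) for v in row] for row in image]

def gaussian_kernel(sigma, radius):
    """Normalized 1D Gaussian weights of length 2 * radius + 1."""
    weights = [math.exp(-(i * i) / (2 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def blur_rows(image, kernel):
    """Convolve each row with the kernel, clamping indices at the borders."""
    radius = len(kernel) // 2
    width = len(image[0])
    return [
        [sum(k * row[min(max(x + j - radius, 0), width - 1)]
             for j, k in enumerate(kernel))
         for x in range(width)]
        for row in image
    ]

def transpose(image):
    return [list(col) for col in zip(*image)]

def gaussian_blur(image, sigma=1.0, radius=2):
    """Separable Gaussian low-pass: blur rows, then columns."""
    kernel = gaussian_kernel(sigma, radius)
    return transpose(blur_rows(transpose(blur_rows(image, kernel)), kernel))

def high_pass_enhance(image, sigma=1.0, radius=2, levels=256):
    """Add the high-pass residual (image minus blur) back onto the image."""
    low = gaussian_blur(image, sigma, radius)
    return [
        [min(max(round(v + (v - l)), 0), levels - 1)
         for v, l in zip(row, low_row)]
        for row, low_row in zip(image, low)
    ]
```

On a vertical step edge, `high_pass_enhance` leaves flat regions unchanged and overshoots on the bright side of the edge, which is the edge-emphasis effect described above.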

Furthermore, to address class imbalance in the dataset, this study applied data augmentation. The Synthetic Minority Over-sampling Technique (SMOTE) was used to augment the training data by generating new minority-class samples, balancing the number of samples across classes. This mitigated the challenges posed by class imbalance and improved the model’s generalization ability and predictive performance. SMOTE in this study was limited to the radiomics feature space: after ROIs were delineated on the CT images and features were extracted, the extracted feature vectors were oversampled, and no new images were synthesized at the image or texture level.
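The core idea of SMOTE in feature space can be sketched as follows. This is a simplified illustration, not the implementation used in the study: the function name `smote`, the neighbour count `k`, and the fixed seed are assumptions, and at least two minority samples are required.

```python
import random

def smote(minority, n_new, k=3, seed=0):
    """Create n_new synthetic feature vectors by interpolating between a
    randomly chosen minority sample and one of its k nearest minority
    neighbours (squared Euclidean distance)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic
```

Each synthetic vector lies on a line segment between two real minority samples, so no new images are created. To avoid leaking test information, oversampling of this kind is normally applied to the training split only.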

2.3 Manual region of interest annotation

Images of colorectal cancer liver metastases were exported from the TCIA database and imported into ITK-SNAP software in DICOM format. Regions of interest for lesions on the images were manually delineated layer by layer by two radiologists with 5–10 years’ experience in abdominal imaging diagnosis. In case of differing opinions, the decision was made by a radiology department deputy chief physician with over 15 years’ experience. All physicians involved in the delineation were unaware of the patient’s prognosis (Figure 2).

Figure 2

Two CT scan images of the liver. The left image is labeled “Normal,” showing a typical liver appearance. The right image is labeled “Metastases,” displaying liver with abnormal growths indicating metastases. Both images have marked crosshairs in red, aligned with blue dashed lines, and anatomical orientation labels (A, P, R, L).

Figure 2. The schematic of colorectal-liver-metastases database.

2.4 Feature extraction and selection

In this study, feature extraction used the pyradiomics library in Python, an open-source tool designed to extract comprehensive radiomic features from two-dimensional (2D) and three-dimensional (3D) medical imaging data, encompassing shape, intensity, and texture (16). Because pyradiomics extracts a large number of radiomic features, a series of feature selection steps was implemented to optimize the feature set. First, features that were not significant at the 95% confidence level in t-tests were excluded to reduce dimensionality and noise, thereby decreasing the computational burden of subsequent feature selection stages. This step helped the feature selection model identify features that contribute significantly to the predictive target and also enhanced the model’s performance and predictive accuracy. After this initial reduction, lasso regression was employed for further feature selection. This L1-regularized technique shrinks the coefficients of weakly contributing features to zero and retains the significantly contributive ones, which helps prevent overfitting and enhances the interpretability and performance of subsequent machine learning models. Additionally, a combined clinical+radiomic group was established by integrating postoperative one-year clinical information of patients with colorectal cancer liver metastases with radiomic features. This approach was designed to use a more extensive set of data for model building, combining a wide range of features to facilitate personalized and precision medicine and ultimately improve model performance.
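The univariate t-test filter can be sketched as below. This is an illustrative approximation rather than the study's code: it uses Welch's t statistic and, as a stated simplification, a normal approximation for the two-sided p-value instead of the exact t distribution. The function names and the label coding (1 for recurrence, 0 for non-recurrence) are assumptions.

```python
import math
from statistics import NormalDist, mean, variance

def welch_t(a, b):
    """Welch's t statistic for two independent samples."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    if se == 0:  # both groups constant: no evidence of a difference
        return 0.0
    return (mean(a) - mean(b)) / se

def approx_p_value(t):
    """Two-sided p-value under a normal approximation (an assumption;
    an exact test would use the t distribution)."""
    return 2 * (1 - NormalDist().cdf(abs(t)))

def select_features(rows, labels, alpha=0.05):
    """Return indices of features whose p-value is below alpha."""
    kept = []
    for j in range(len(rows[0])):
        pos = [r[j] for r, y in zip(rows, labels) if y == 1]
        neg = [r[j] for r, y in zip(rows, labels) if y == 0]
        if approx_p_value(welch_t(pos, neg)) < alpha:
            kept.append(j)
    return kept
```

Features that pass this filter would then be handed to the lasso step, which performs the multivariate selection.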

2.5 Construction of machine learning models

This research employed fifteen machine learning models to process and classify the selected radiomics features for predicting intrahepatic recurrence within one year post-surgery in patients with colorectal cancer liver metastases: Random Forest (RF), LightGBM, Gradient Boosting, K-Nearest Neighbors (KNN), Decision Tree, AdaBoost, Extra Trees, Latent Dirichlet Allocation (LDA), Bootstrap Aggregating (Bagging), Bernoulli Naive Bayes (Bernoulli NB), Calibrated Classifier, Gaussian Naive Bayes (GNB), Logistic Regression, Multilayer Perceptron (MLP), and Quadratic Discriminant Analysis (QDA).

The Random Forest model leverages ensemble learning techniques, incorporating randomness during the training phase by training each tree on a slightly different subset of the data and selecting the best feature from a randomly chosen subset at each split node. This significantly enhances model diversity, reduces overfitting risks, and improves generalization capabilities (17). LightGBM, an efficient gradient boosting framework, improves on the speed and memory efficiency of traditional gradient-boosted decision trees (GBDT) using a histogram-based decision tree algorithm, while maintaining high accuracy. Gradient Boosting models iteratively train decision trees to minimize a loss function, with each tree learning from the prediction residuals of the previous trees, thereby reducing model bias and enhancing accuracy on training data (18).

The KNN algorithm, a basic yet widely used method for classification and regression, predicts the value of unknown data points by referencing information from the nearest samples. In classification tasks, the category of a sample is determined by one or more of its nearest neighbors (19). Decision Tree, a non-parametric supervised learning method applicable for classification and regression tasks, predicts target values by learning decision rules from features, offering excellent interpretability due to its ability to visualize and simulate human decision-making processes (20).

AdaBoost is an ensemble learning algorithm that aims to build a strong learner by combining multiple weak learners. It reweights training samples in each round to increase the weights of previously misclassified samples, thus focusing subsequent learners on more challenging samples (21). Extra Trees, a variant of Random Forest, introduces additional randomness by selecting thresholds randomly for each feature rather than calculating the optimal thresholds, thereby enhancing tree diversity and reducing both model variance and, typically, computation time (22).

Although LDA is inherently an unsupervised learning model primarily used for identifying topic distributions in document collections, it can also be indirectly applied to classification tasks by transforming documents into topic-based representations and using these topic distributions as features for classifier training (23). Bootstrap Aggregating is a classic ensemble learning method that combines multiple models to enhance the accuracy of predictions and reduce model variance, using multiple training subsets to stabilize and improve the accuracy of final predictions (24).

BernoulliNB is a specific type of Naive Bayes classifier, designed for binary feature classification tasks. It performs well in text data handling, such as document classification or other binary classification problems, by assuming conditional independence of features within each class, simplifying computation. The Calibrated Classifier is an adjusted classifier that refines prediction probabilities to more accurately reflect the likelihood of actual events, providing more reliable probability estimates that enhance decision-making and risk assessment (25).

GNB is a classification algorithm based on Naive Bayes theory, particularly suitable for scenarios where features adhere to a Gaussian (normal) distribution. It excels in medical predictions and other areas, effectively handling continuous data and providing classification results based on normal distributions. Logistic Regression is a widely used linear classification algorithm that predicts the probability of sample membership in a category. As a simple yet effective model, it performs well in many classification tasks, especially when the relationship between features and the target variable is approximately linear (26).

MLP is a feed-forward neural network used for both classification and regression problems. By learning complex patterns and features through multiple hidden layers (“multilayers”), MLP handles highly nonlinear prediction tasks and finds broad applications in image recognition, natural language processing, and more (27). QDA generalizes linear discriminant analysis by allowing a different covariance matrix for each class, which yields quadratic rather than linear decision boundaries and is particularly suitable for predictions where classes exhibit varying covariance structures (28).

These models have been extensively studied and applied in the field of radiomics, demonstrating their excellent performance and high generalization capabilities (29–31). By utilizing these models for the prediction and classification of radiomics features, we achieved prediction of intrahepatic recurrence in colorectal cancer liver metastasis patients within one year post-surgery, explored the application of integrated models in clinical decision support systems, and aimed to reduce misdiagnosis rates and patient loss through high-precision radiomics grading methods.

2.6 Evaluation metrics

This research employs a suite of evaluation metrics to assess the machine learning models objectively and thoroughly: Accuracy (ACC), Precision (PRE), Recall (REC), and the Receiver Operating Characteristic (ROC) curve with the Area Under the Curve (AUC). These metrics, based on True Positives (TP), True Negatives (TN), False Positives (FP), and False Negatives (FN), offer a multifaceted evaluation of model performance. ACC, PRE, and REC are defined in Equations 1, 2, and 3, respectively:

\begin{equation}
\mathrm{ACC} = \frac{TP + TN}{TP + FP + TN + FN} \tag{1}
\end{equation}

\begin{equation}
\mathrm{PRE} = \frac{TP}{TP + FP} \tag{2}
\end{equation}

\begin{equation}
\mathrm{REC} = \frac{TP}{TP + FN} \tag{3}
\end{equation}
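As a worked illustration of Equations 1 to 3, the following sketch counts TP, TN, FP, and FN from binary labels and predictions. The function name and the convention that 1 denotes recurrence are assumptions; returning 0.0 when a denominator is zero is a choice made here for simplicity.

```python
def classification_metrics(y_true, y_pred):
    """Compute ACC, PRE, and REC from binary labels (1 = positive class)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    acc = (tp + tn) / (tp + fp + tn + fn)
    pre = tp / (tp + fp) if (tp + fp) else 0.0  # no positive predictions
    rec = tp / (tp + fn) if (tp + fn) else 0.0  # no positive labels
    return {"ACC": acc, "PRE": pre, "REC": rec}
```

For example, with TP = 2, TN = 4, FP = 1, and FN = 1, the accuracy is 6/8 = 0.75, while precision and recall are both 2/3.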

3 Results

3.1 Experimental set-up

To rigorously evaluate the models developed in this study, the dataset was split into training and test sets at an 80:20 ratio so that model efficacy could be tested independently. During the feature selection phase, a t-test was applied to retain only features significant at the 0.05 level (95% confidence), ensuring the statistical significance of the features and reducing the computational demands of high dimensionality. Feature selection was further refined using the Lasso method with an alpha of 0.65, chosen to balance model complexity against performance. Additionally, the iterations of the various machine learning models were adjusted so that they trained to convergence without overfitting; each model was run for 50 iterations with a batch size of 512.
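An 80:20 split can be sketched as follows. The source does not state whether the split was stratified by class; stratification is an assumption made here because the recurrence class is the minority, and the function name and seed are likewise illustrative.

```python
import random

def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_indices, test_indices), keeping the class ratio
    approximately equal in both parts."""
    rng = random.Random(seed)
    train, test = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        n_test = round(len(idx) * test_fraction)
        test.extend(idx[:n_test])
        train.extend(idx[n_test:])
    return sorted(train), sorted(test)
```

Any oversampling or feature selection would then be fitted on the training indices only and applied unchanged to the test indices.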

All experiments were conducted on Windows 11 Professional using Python 3.6.0. Model design and validation were supported by packages such as Pyradiomics 3.0.1, Scikit-learn 0.24.2, scipy 1.5.4, and matplotlib. The hardware setup included an Intel Core i7-10750H CPU (base frequency 2.6 GHz, turbo up to 5 GHz, six cores/twelve threads) and an NVIDIA GeForce RTX 4060 Ti GPU (16 GB memory, 128-bit memory bus).

3.2 Results of feature extraction and selection

In this study, we leveraged expert-annotated three-dimensional CT images of patients with colorectal cancer liver metastases to extract radiomic features. Utilizing the Pyradiomics package in Python, we initially extracted 995 radiomic features from the raw CT images. However, due to the large volume and potential irrelevance of many features to the model’s decision-making process, feature selection was imperative to reduce noise and improve model performance.

Initially, features that did not reach a significance level of 0.05 in t-tests were excluded, retaining the features significantly related to the target variable. This step yielded 72 features. However, some of the remaining features could still be irrelevant to the machine learning models and degrade performance. Further feature refinement was therefore conducted using LASSO regression, identifying key predictive features through 10-fold cross-validation. The cross-validation curve and regression coefficient path for LASSO are illustrated in Figure 3.
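For reference, the L1-penalized objective behind LASSO selection can be written as below. This assumes the least-squares form used by Scikit-learn's Lasso estimator, with n samples, feature matrix X, target vector y, and penalty weight α (set to 0.65 in Section 3.1). The source does not state which implementation was used, and the binomial deviance axis described for Figure 3 would instead be consistent with a logistic loss in place of the squared error term.

```latex
\hat{\beta} = \arg\min_{\beta}\; \frac{1}{2n}\,\lVert y - X\beta \rVert_2^2 + \alpha\,\lVert \beta \rVert_1,
\qquad
\text{selected features} = \{\, j : \hat{\beta}_j \neq 0 \,\}
```

A larger α drives more coefficients exactly to zero, so fewer features survive; cross-validation is the usual way to choose the value that minimizes held-out error.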

Figure 3

Graph A shows the relationship between binomial deviance and log lambda, with deviance decreasing as log lambda increases, stabilizing around -4. Graph B illustrates coefficient paths against log lambda, with multiple lines diverging as log lambda increases, indicating variable coefficients' impact on the model.

Figure 3. The results of lasso feature selection. (A) The cross-validation curve of LASSO. (B) The regression coefficient path of LASSO.

Ultimately, LASSO regression selected 8 essential features, as depicted in the feature contribution curve in Figure 4. These features were adequate for training the machine learning model, effectively eliminating the interference from noise and irrelevant features, thus significantly enhancing the model’s performance.

Figure 4

Bar graph titled “Feature Contributions” shows various feature contributions on a scale from negative one to one. Features include “wavelet.HHL_firstorder_InterquartileRange” with the highest positive contribution in yellow, and “wavelet.LHL_firstorder_10Percentile” with the largest negative contribution in dark purple. A vertical color scale on the right represents contribution levels from dark purple to yellow.

Figure 4. The feature contribution of selected features.

3.3 Results of intrahepatic recurrence models

To build more comprehensive models for predicting intrahepatic recurrence within one year post-surgery in patients with colorectal cancer liver metastases, we developed and compared two methodologies: a radiomics-only model and a combined radiomics-clinical model. The radiomics-only model employs radiomic features extracted from CT images, which are input into a machine learning model after feature selection. Conversely, the combined radiomics-clinical model incorporates selected radiomic features with clinical attributes as model inputs. These clinical attributes include patient age, BMI, tumor size, and preoperative chemotherapy status. This integrative approach enables the model to simultaneously consider patient information and clinical indicators of tumor condition, thus enhancing the model’s interpretability and improving its predictive performance.

3.3.1 Results of radiomics models

In this research, features extracted via radiomics were used to train and validate machine learning models for predicting intrahepatic recurrence within one year after surgery in patients with colorectal cancer liver metastases. The performance of these radiomics models is detailed in Table 2.

Table 2

Table 2. The result of radiomics models.

The Gradient Boosting classifier showed the highest accuracy among the evaluated models, achieving an average ACC of 72.41% and an AUC of 66.15%. In contrast, the Calibrated Classifier model recorded the lowest misdiagnosis rate, with ACC, PRE, and AUC values of 65.52%, 93.75%, and 71.46%, respectively. The ROC curves and Calibration curves of the radiomics models, depicted in Figure 5, reveal that the ROC curve of the Calibrated Classifier model is closest to the upper left corner, while its Calibration curve nears the ideal 45-degree line, demonstrating its outstanding clinical utility.
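The two curve-based summaries discussed here can be computed from predicted probabilities as in the following sketch. It is illustrative rather than the study's code: AUC is computed through its rank interpretation (the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half), and the calibration curve uses equal-width probability bins. The function names and default bin count are assumptions.

```python
def roc_auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def calibration_bins(y_true, scores, n_bins=5):
    """Return (mean predicted probability, fraction of positives) for each
    non-empty equal-width bin over [0, 1]."""
    bins = [[] for _ in range(n_bins)]
    for y, s in zip(y_true, scores):
        idx = min(int(s * n_bins), n_bins - 1)  # s == 1.0 goes in the last bin
        bins[idx].append((y, s))
    curve = []
    for members in bins:
        if members:
            mean_pred = sum(s for _, s in members) / len(members)
            frac_pos = sum(y for y, _ in members) / len(members)
            curve.append((mean_pred, frac_pos))
    return curve
```

A well-calibrated model produces points near the diagonal, where the mean predicted probability in each bin matches the observed fraction of positives.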

Figure 5

Panel A shows a Receiver Operating Characteristic (ROC) curve with various models, each with a different area under the curve (AUC) value, indicating performance. Panel B displays calibration curves for the same models, comparing fraction of positives against mean predicted probability. Both graphs include multiple color-coded lines representing different models like MLP, AdaBoost, and DecisionTree.

Figure 5. The visualization of model result of radiomics models. (A) The ROC curve of radiomics models. (B) The Calibration curves of the radiomics models.

3.3.2 Results of radiomics-clinical models

To utilize more comprehensive information for training models, we established a radiomics-clinical dataset for training machine learning models. This dataset significantly enhanced model performance compared to using solely radiomic features, as shown in Table 3.

Table 3

Table 3. The result of radiomics-clinical models.

In the radiomics-clinical model, the Random Forest classifier exhibited the best performance, achieving an ACC of 75.86% and AUC of 70.83%. It was closely followed by the Gradient Boosting, LightGBM, and Extra Trees models. The ROC curves and Calibration curves of the radiomics-clinical models, displayed in Figure 6, show that the ROC curve of the Random Forest model is nearer to the upper left corner and its Calibration curve is closer to the ideal 45-degree line, demonstrating higher reliability and clinical suitability. The use of radiomics-clinical features in the Random Forest model has promising prospects for clinical application, potentially assisting doctors in diagnosis, reducing the workload of radiologists, and minimizing the potential harm caused by diagnostic errors and omissions.

Figure 6

Two graphs compare multiple models. Graph A shows ROC curves with true positive rate against false positive rate. Each model is represented by a colored line, with AUC values noted. Graph B displays calibration curves with fraction of positives against mean predicted probability, also represented by colored lines. A key lists models such as MLP, AdaBoost, and DecisionTree.

Figure 6. The visualization of model result of radiomics-clinical models. (A) The ROC curve of radiomics-clinical models. (B) The Calibration curves of the radiomics-clinical models.

4 Discussion

Radiomics facilitates the automatic analysis of numerous image features in a short duration, many of which are difficult to assess visually (32). Integrating radiomics with machine learning models allows for objective and rapid disease classification. This combination provides clinicians with a valuable tool, aiding in the selection of appropriate treatment plans and avoiding unnecessary interventions (33). In this study, the Pyradiomics library was used to extract radiomic data from one-year postoperative CT images of colorectal cancer liver metastasis patients, combined with clinical data, to predict intrahepatic recurrence within one year post-surgery. Fifteen machine learning models were developed and evaluated. The radiomics-clinical model, particularly the Random Forest classifier, demonstrated the highest predictive performance, with an accuracy (ACC) of 75.86% and an area under the curve (AUC) of 70.83%. These promising results have potential clinical applications, helping physicians in diagnosis, reducing the workload of radiologists, identifying features not easily visible to the eye, lowering the rate of misdiagnoses, and ultimately reducing harm to patients.

Colorectal cancer has become the third most common malignant tumor worldwide (1). Liver metastasis is the most common form of distant metastasis in patients with advanced colorectal cancer. Between 20% and 25% of patients with colorectal cancer have liver metastases at the time of diagnosis, and up to 50% will develop metachronous liver metastases after resection of the primary tumor (34). Radical surgical resection of liver metastases is currently the best way to cure colorectal cancer liver metastases (35).

In clinical practice, some patients with colorectal cancer liver metastases still have a high recurrence rate after surgery, and the benefits of surgery are not obvious. The clinical and pathological characteristics of these patients, including patient characteristics, preoperative treatment, primary tumor and liver metastasis characteristics, and surgical factors, are all related to postoperative recurrence. Currently, the Clinical Risk Score (CRS) is the most commonly used evaluation system in clinical practice to predict recurrence and survival in patients with colorectal cancer liver metastases. It is of great value in guiding the timing of surgery and the choice of perioperative treatment, but it is not sufficient to predict the risk of recurrence after resection of liver metastases (36).

Radiomics is widely used in the diagnosis, grading and staging, efficacy evaluation, and prognosis prediction of tumors; it extracts a large number of features from medical images and analyzes image information in detail (37). This study combined imaging and clinical characteristics to establish a preliminary visual machine learning prediction model for early intrahepatic recurrence after resection of liver metastases in patients with colorectal cancer liver metastases, thereby providing a basis for developing more accurate individualized treatment plans. For patients identified by the model as being at high risk of early recurrence, more active surveillance and treatment should be considered to improve disease control.

This study has several limitations. First, recurrence was evaluated only within a fixed 1-year timeframe; extending follow-up duration and incorporating diverse evaluation criteria would strengthen future analyses. Second, the absence of key biomarkers (RAS/BRAF mutations, MSI status, HER-2 expression) precluded assessment of their prognostic impact. Third, the single-center retrospective design with limited sample size constrains external validity. While genomic integration remains clinically imperative, its implementation requires resource-adaptive methodologies. We propose targeted genetic profiling (e.g., RAS/BRAF PCR) coupled with imaging biomarkers as a pragmatic solution. Our planned pilot study (N = 50-80) will validate this approach using propensity-weighted methods to establish scalable multimodal frameworks.

5 Conclusion

CT radiomics combined with clinical parameters can predict the risk of early intrahepatic recurrence after surgery in patients with colorectal cancer liver metastases, showing high sensitivity and specificity. It can be used to stratify the risk of recurrence in this group of patients, and more active examination measures and adjuvant treatment can be considered for patients in the high-risk group. In addition, a prospective prediction model combining multiple omics may achieve higher accuracy, which is a direction for future research on prediction models.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

YL: Software, Data curation, Writing – original draft, Writing – review & editing. YH: Data curation, Formal Analysis, Writing – original draft. ZL: Writing – review & editing. XF: Project administration, Writing – review & editing. CY: Writing – review & editing, Supervision.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by the National Natural Science Foundation of China (12326618) and the Xiamen Medical and Health Guidance Project (3502Z20244ZD1040 and 3502Z20244ZD1050).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, and Jemal A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

2. Biller LH and Schrag D. Diagnosis and treatment of metastatic colorectal cancer: A review. JAMA. (2021) 325:669–85. doi: 10.1001/jama.2021.0106

3. Ciardiello F, Ciardiello D, Martini G, Napolitano S, Tabernero J, and Cervantes A. Clinical management of metastatic colorectal cancer in the era of precision medicine. CA Cancer J Clin. (2022) 72:372–401. doi: 10.3322/caac.21728

4. Giannis D, Sideris G, Kakos CD, Katsaros I, and Ziogas IA. The role of liver transplantation for colorectal liver metastases: A systematic review and pooled analysis. Transplant Rev (Orlando). (2020) 34:100570. doi: 10.1016/j.trre.2020.100570

5. Stewart CL, Warner S, Ito K, Raoof M, Wu GX, Kessler J, et al. Cytoreduction for colorectal metastases: liver, lung, peritoneum, lymph nodes, bone, brain. When does it palliate, prolong survival, and potentially cure? Curr Probl Surg. (2018) 55:330–79. doi: 10.1067/j.cpsurg.2018.08.004

6. Siegel RL, Miller KD, and Jemal A. Cancer statistics, 2018. CA Cancer J Clin. (2018) 68:7–30. doi: 10.3322/caac.21442

7. Xu F, Tang B, Jin TQ, and Dai CL. Current status of surgical treatment of colorectal liver metastases. World J Clin Cases. (2018) 6:716–34. doi: 10.12998/wjcc.v6.i14.716

8. Boudjema K, Locher C, Sabbagh C, Ortega-Deballon P, Heyd B, Bachellier P, Métairie S, et al. Simultaneous versus delayed resection for initially resectable synchronous colorectal cancer liver metastases: A prospective, open-label, randomized, controlled trial. Ann Surg. (2021) 273:49–56. doi: 10.1097/SLA.0000000000003848

9. Jones RP, Jackson R, Dunne DFJ, Malik HZ, Fenwick SW, Poston GJ, et al. Systematic review and meta-analysis of follow-up after hepatectomy for colorectal liver metastases. Br J Surg. (2012) 99:477–86. doi: 10.1002/bjs.8667

10. Concors S HA and Vauthey JN. Modeling the prediction of early treatment failure after hepatectomy for colorectal liver metastases. Ann Surg Oncol. (2023) 30:3182–3. doi: 10.1245/s10434-023-13343-4

11. Imai K, Allard MA, Benitez CC, Vibert E, Sa Cunha A, Cherqui D, et al. Early recurrence after hepatectomy for colorectal liver metastases: what optimal definition and what predictive factors? Oncologist. (2016) 21:887–94. doi: 10.1634/theoncologist.2015-0468

12. Lam CSN, Bharwani AA, Chan EHY, Chan VHY, Au HLH, Ho MK, et al. A machine learning model for colorectal liver metastasis post-hepatectomy prognostications. Hepatobiliary Surg Nutr. (2023) 12:495–506. doi: 10.21037/hbsn-21-453

13. Amygdalos I, Müller-Franzes G, Bednarsch J, Czigany Z, Ulmer TF, Bruners P, et al. Novel machine learning algorithm can identify patients at risk of poor overall survival following curative resection for colorectal liver metastases. J Hepatobiliary Pancreat Sci. (2023) 30:602–14. doi: 10.1002/jhbp.v30.5

14. Zhang Y, Zhang Z, Wei L, and Wei S. Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer. Front Public Health. (2022) 10. doi: 10.3389/fpubh.2022.1008137

15. Mühlberg A, Holch JW, Heinemann V, Huber T, Moltz J, Maurus S, et al. The relevance of CT-based geometric and radiomics analysis of whole liver tumor burden to predict survival of patients with metastatic colorectal cancer. Eur Radiol. (2021) 31:834–46. doi: 10.1007/s00330-020-07192-y

16. Van Griethuysen JJ, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. doi: 10.1158/0008-5472.CAN-17-0339

17. Breiman L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324

18. Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. LightGBM: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems (2017) 30:3146–54.

19. Cover T and Hart P. Nearest neighbor pattern classification. IEEE Transact Inform Theor. (1967) 13:21–7. doi: 10.1109/TIT.1967.1053964

20. Song Y-Y and Lu Y. Decision tree methods: applications for classification and prediction. Shanghai Arch Psych. (2015) 27:130. doi: 10.11919/j.issn.1002-0829.215044

21. Freund Y and Schapire RE. A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci. (1997) 55:119–39. doi: 10.1006/jcss.1997.1504

22. Geurts P, Ernst D, and Wehenkel L. Extremely randomized trees. Mach Learn. (2006) 63:3–42. doi: 10.1007/s10994-006-6226-1

23. Blei DM, Ng AY, and Jordan MI. Latent Dirichlet allocation. J Mach Learn Res. (2003) 3:993–1022.

24. Lin J, Chen H, Li S, Liu Y, Li X, and Yu B. Accurate prediction of potential druggable proteins based on genetic algorithm and Bagging-SVM ensemble classifier. Artif Intell Med. (2019) 98:35–47. doi: 10.1016/j.artmed.2019.07.005

25. Silva Filho T, Song H, Perello-Nieto M, Santos-Rodriguez R, Kull M, and Flach P. Classifier calibration: a survey on how to assess and improve predicted class probabilities. Mach Learn. (2023) 112:3211–60. doi: 10.1007/s10994-023-06336-7

26. Chati PM, Raghunathan K, and Thiagarajah JR. Mo1799 identification of transcriptional signatures to classify disease in ulcerative colitis via an iterative gaussian naive bayes model. Gastroenterology. (2024) 166:S–1116. doi: 10.1016/S0016-5085(24)03038-5

27. Popescu M-C, Balas VE, Perescu-Popescu L, and Mastorakis N. Multilayer perceptron and neural networks. WSEAS Trans Circuits Syst. (2009) 8:579–88.

28. Sidel JL, Bleibaum RN, and Tao KC. Quantitative descriptive analysis. Descriptive Anal sensory Eval. (2018) 287–318. doi: 10.1002/9781118991657.ch8

29. Zhou M, Scott J, Chaudhury B, Hall L, Goldgof D, Yeom KW, et al. Radiomics in brain tumor: image assessment, quantitative feature descriptors, and machine-learning approaches. Am J Neuroradiology. (2018) 39:208–16. doi: 10.3174/ajnr.A5391

30. Farook TH and Dudley J. Automation and deep (machine) learning in temporomandibular joint disorder radiomics: A systematic review. J Oral Rehabil. (2023) 50:501–21. doi: 10.1111/joor.13440

31. Karabacak M, Ozkara BB, Ozturk A, Kaya B, Cirak Z, Orak E, et al. Radiomics-based machine learning models for prediction of medulloblastoma subgroups: a systematic review and meta-analysis of the diagnostic test performance. Acta Radiologica. (2023) 64:1994–2003. doi: 10.1177/02841851221143496

32. Algohary A, Viswanath S, Shiradkar R, Ghose S, Pahwa S, Moses D, et al. Radiomic features on MRI enable risk categorization of prostate cancer patients on active surveillance: Preliminary findings. J Magnetic Resonance Imaging. (2018) 48:818–28. doi: 10.1002/jmri.25983

33. Fütterer JJ, Briganti A, De Visschere P, Emberton M, Giannarini G, Kirkham A, et al. Can clinically significant prostate cancer be detected with multiparametric magnetic resonance imaging? A systematic review of the literature. Eur Urol. (2015) 68:1045–53. doi: 10.1016/j.eururo.2015.01.013

34. Ma ZH, Wang YP, Zheng WH, Ma J, Bai X, Zhang Y, et al. Prognostic factors and therapeutic effects of different treatment modalities for colorectal cancer liver metastases. World J Gastrointest Oncol. (2020) 12:1177–94. doi: 10.4251/wjgo.v12.i10.1177

35. Avella P, Vaschetti R, Cappuccio M, Gambale F, Meis LDE, Rafanelli F, et al. The role of liver surgery in simultaneous synchronous colorectal liver metastases and colorectal cancer resections: a literature review of 1730 patients underwent open and minimally invasive surgery. Minerva Surg. (2022) 77:582–90. doi: 10.23736/S2724-5691.22.09716-7

36. Tan MC, Butte JM, Gonen M, Kemeny N, Fong Y, Allen PJ, et al. Prognostic significance of early recurrence: a conditional survival analysis in patients with resected colorectal liver metastasis. HPB (Oxford). (2013) 15:803–13. doi: 10.1111/hpb.12136

37. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

Keywords: colorectal liver metastasis, CT radiomics, machine learning, intrahepatic recurrence, prediction

Citation: Lin Y, Huang Y, Liu Z, Feng X and Yang C (2025) Predicting early recurrence of colorectal cancer liver metastases: an integrative approach using radiomics and machine learning. Front. Oncol. 15:1613093. doi: 10.3389/fonc.2025.1613093

Received: 16 April 2025; Accepted: 29 August 2025;
Published: 14 November 2025.

Edited by:

Arka Bhowmik, Memorial Sloan Kettering Cancer Center, United States

Reviewed by:

Lisheng Wang, Shanghai Jiao Tong University, China
Long Wu, Affiliated Hospital of Guizhou Medical University, China
Yen Cho Huang, Chang Gung Memorial Hospital, Taiwan

Copyright © 2025 Lin, Huang, Liu, Feng and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiaobin Feng, fengxiaobin200708@aliyun.com; Chunkang Yang, chunkang129@fjmu.edu.cn
