- 1Department of Colorectal Surgery, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, China
- 2Department of General Surgery, First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
- 3Department of Radiation Oncology, First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
- 4Hepatopancreatobiliary Center, Beijing Tsinghua Changgung Hospital, Institute for Precision Medicine, Key Laboratory of Digital Intelligence Hepatology (Ministry of Education), Tsinghua University, Beijing, China
Background: The overall incidence of liver metastasis in colorectal cancer is as high as 50%, and surgery remains the only potentially curative approach for the metastatic disease. The recurrence rate of liver metastases within one year after surgery is still 60%-70% in clinical practice. Whether we can accurately predict the early recurrence of patients after surgery is one of the most important considerations in formulating the overall treatment strategy.
Methods: In this study, we combined radiomics feature extraction with machine learning classification methods to develop a novel strategy for predicting intrahepatic metastases based on imaging radiomics and machine learning. We constructed and systematically evaluated multiple machine learning models to assess their performance. By validating these models on a test set, we determined the effectiveness of each predictive model and selected the one with the highest predictive accuracy.
Results: The integration of radiomics and machine learning methods demonstrated significant potential in predicting intrahepatic recurrence within one year after surgery in patients with colorectal cancer liver metastases. The Gradient Boosting, LightGBM, and Random Forest models all achieved classification accuracies (ACC) exceeding 65% across all classification tasks. Notably, the Random Forest model exhibited the best performance; while its classification accuracy was 65.52% in the imaging-only group, it increased to 75.86% when both imaging and clinical information were combined, with an area under the receiver operating characteristic curve (AUC) of 70.83%, indicating strong predictive capability. These findings suggest that these models have potential application value in supporting the diagnostic work of clinical radiologists, potentially helping to reduce workload and decrease the risk of misdiagnosis.
Conclusions: The imaging omics model and the combined model have good predictive efficacy for the recurrence of colorectal cancer liver metastases within one year, and can be used to assist in the clinical stratification of postoperative patients and identify high-risk factors for poor prognosis.
1 Introduction
Colorectal cancer (CRC) ranks as the third most frequently diagnosed cancer, with more than 1.92 million newly diagnosed cases and 903,800 deaths worldwide in 2022 (1), and its high mortality rate is mainly attributed to metastasis (2, 3). The liver is the most common site of metastasis, accounting for approximately 50% of colorectal cancer metastases (4). Liver metastasis is one of the major causes of death for CRC with 6.9 months of median survival receiving only palliative care (5, 6). Despite decades of advances in systemic and local therapies, hepatectomy remains the only curative treatment for patients with resectable colorectal liver metastases (7, 8). However, around 30% patients experience intrahepatic recurrence within a year of post-hepatectomy (9), which have worse outcomes than that late recurrence (10, 11). Therefore, accurate predictive models for early recurrence risk in patients with colorectal liver metastases are of significant clinical value.
With the advance rise of artificial intelligence technology, there are more and more researches on the application of imaging omics and machine learning to predict the prognosis of colorectal cancer liver metastasis, especially recurrence (12–14). Specifically, Lam CSN et al. employed a machine learning model for colorectal liver metastasis post-hepatectomy prognostications, presenting a better prediction ability compared to Fong Clinical Risk Score (12). Mühlberg A et al. established a imaging-based prediction model for 1-year survival of colorectal liver metastasis patients, which showed a better discriminative performance than clinical models (15).
This study aimed to develop and validate a predictive model that integrates radiomics features with machine learning algorithms to forecast intrahepatic recurrence within one year after surgery in patients with colorectal cancer liver metastases. We utilized CT radiomics data combined with an expert-annotated patient dataset, extracting radiomics features from tumor regions delineated by specialists on CT images. Two models were constructed: one using only radiomics features, and another that combines radiomics features with clinical information. Fifteen machine learning algorithms were employed for feature recognition and classification, and the most effective model was identified through performance comparison. This approach aims to predict the likelihood of recurrence in patients using radiomics technology. The workflow of this study is illustrated in Figure 1.
2 Method
2.1 Data acquisition
2.1.1 Ethical approval
All research processes were conducted in accordance with Helsinki Declaration (revised in 2013). This study was approved by the Institutional Review Committee of the Clinical Oncology School of Fujian Medical University.
2.1.2 Data source and variables
The data utilized in the experiment is sourced from the Cancer Imaging Archive dataset(TCIA). TCIA is a public database dedicated to cancer research, encompassing medical imaging and clinical data of various cases. Patients were excluded for the following:(I) liver metastasis assessment unresectable,(II) patients who cannot tolerate surgery or died perioperatively, and (III) extrahepatic recurrence. Collect the basic information of the patient, including age, gender, major comorbidity, body mass index, regional lymph node, multiple metastase, carcinoembryonic antigen(CEA) levels, max tumor size, lobar involvement and preoperative portal vein embolization. This study included a total of 197 patients with colorectal cancer liver metastasis after surgery, including 117 males (59.4%) and 80 females (40.6%); 110 patients (55.8%) were aged ≥60 years; 41 patients (20.8%) had liver metastatic tumors with a diameter of ≥5 cm; 122 patients (61.9%) received systemic chemotherapy before surgery; and 23 patients (11.7%) received portal vein embolization therapy before surgery. During the follow-up period, 122 patients experienced intrahepatic recurrence, and 37 patients experienced intrahepatic recurrence within one year after surgery (Table 1).
2.1.3 Follow-up criteria
The day of liver metastasis surgery is used as the starting point for follow-up, and follow-up is conducted by means of follow-up visits and telephone follow-up. Early intrahepatic recurrence is defined as intrahepatic recurrence within one year after resection of liver metastases, and no extra-hepatic recurrence. Patients with intrahepatic recurrence within one year after surgery are included in the recurrence group. Patients without intrahepatic recurrence within one year after surgery are included in the non-recurrence group. Follow-up is conducted every 3 months after surgery, including serum markers, abdominal B-ultrasound, abdominal CT, magnetic resonance imaging of the liver, and colonoscopy.
2.2 Data preprocessing
Prior to feature extraction, this study systematically preprocessed the acquired CT images using the SimpleITK package in Python. Histogram equalization was applied as a technique aimed at enhancing the contrast of images, making the details more pronounced and clear. By adjusting the grayscale distribution, histogram equalization enhances details in low contrast areas, thereby improving the visibility and analytical quality of the images. Subsequently, the images were processed using a Gaussian high-pass filter. This filter eliminates low-frequency noise, enhancing the visibility of edges and texture features within the image, which are crucial for extracting boundary information. “low-frequency noise” refers to areas in the image where the background grayscale changes slowly, such as large areas of uniform blur, brightness drift, or artifacts, which often obscure high-frequency details such as edges and textures. These boundary details were reintegrated into the original image to further enhance the representation of texture features. These preprocessing steps significantly enhanced the expression of detail information in CT images, thereby effectively improving the model’s performance.
Furthermore, to address the issue of class imbalance in the dataset, this study implemented data augmentation. Synthetic Minority Over-sampling Technique (SMOTE) was used to enhance the training data by generating new samples to balance the number of samples across different classes. This process effectively mitigated the challenges posed by class imbalance, enhancing the model’s generalization ability and predictive performance. SMOTE in this study is limited to the radiomics feature space. Specifically, after delineating and extracting features from CT image ROIs, the extracted structural features are oversampled, rather than synthesizing new images at the original image or texture level.
2.3 Manual region of interest annotation
Images of colorectal cancer liver metastases were exported from the TCIA database and imported into ITK⁃SNAP software in DICOM format. Regions of interest for lesions on the images were manually delineated layer by layer by two radiologists with 5–10 years’ experience in abdominal imaging diagnosis. In case of differing opinions, the decision was made by a radiology department deputy chief physician with over 15 years’ experience. All physicians involved in the delineation were unaware of the patient’s prognosis (Figure 2).
2.4 Feature extraction and selection
In this study, the feature extraction process utilized the pyradiomics library in Python, an advanced open-source tool designed to extract comprehensive radiomic features from two-dimensional (2D) and three-dimensional (3D) medical imaging data, encompassing aspects such as shape, intensity, and texture (16). Given the large number of radiomic features directly extracted by the pyradiomics library, a series of feature selection steps were implemented to optimize the feature set. Initially, features outside the 95% confidence interval were excluded via T-tests to reduce dimensionality and noise, thereby decreasing the computational burden in subsequent feature selection stages. This approach not only enabled the feature selection model to more effectively identify features that significantly contribute to the predictive target but also enhanced the model’s performance and predictive accuracy. Subsequent to this initial reduction, lasso regression was employed for further feature selection. This autoregressive technique filtered out significantly contributive features, effectively preventing overfitting and enhancing the interpretability and performance of subsequent machine learning models. Additionally, a combined clinical+radiomic group was established by integrating postoperative one-year clinical information of patients with colorectal cancer liver metastases with radiomic features. This approach was designed to utilize a more extensive set of data for model building, combining a wide range of features to facilitate personalized and precision medicine, ultimately aiming to improve model performance.
2.5 Construction of machine learning models
This research employed fifteen machine learning models—Random Forest (RF), Light GBM, Gradient Boosting, K-Nearest Neighbors (KNN), Decision Tree, AdaBoost, Extra Trees, Latent Dirichlet Allocation (LDA), Bootstrap Aggregating (Bagging), Bernoulli Naive Bayes (Bernoulli NB), Calibrated Classifier, Gaussian Naive Bayes (GNB), Logistic Regression, Multilayer Perceptron (MLP), and Quadratic Discriminant Analysis (QDA)—to process and classify selected radiomics features for predicting intrahepatic recurrence within one year post-surgery in patients with colorectal cancer liver metastases.
The Random Forest model leverages ensemble learning techniques, incorporating randomness during the training phase by training each tree on slightly different subsets of data and selecting the best features from randomly chosen subsets at split nodes. This significantly enhances model diversity, reduces overfitting risks, and improves generalization capabilities (17). Light GBM, an efficient gradient boosting framework, optimizes the speed and memory efficiency of traditional GBDT using a histogram-based decision tree algorithm, while maintaining high accuracy. Gradient Boosting models iteratively train decision trees to minimize loss functions, with each tree learning from the prediction residuals of the previous tree, thereby reducing model bias and enhancing accuracy on training data (18).
The KNN algorithm, a basic yet widely used method for classification and regression, predicts the value of unknown data points by referencing information from the nearest samples. In classification tasks, the category of a sample is determined by one or more of its nearest neighbors (19). Decision Tree, a non-parametric supervised learning method applicable for classification and regression tasks, predicts target values by learning decision rules from features, offering excellent interpretability due to its ability to visualize and simulate human decision-making processes (20).
AdaBoost is an ensemble learning algorithm that aims to build a strong learner by combining multiple weak learners. It reweights training samples in each round to increase the weights of previously misclassified samples, thus focusing subsequent learners on more challenging samples (21). Extra Trees, a variant of Random Forest, introduces additional randomness by selecting thresholds randomly for each feature rather than calculating the optimal thresholds, thereby enhancing tree diversity and reducing both model variance and, typically, computation time (22).
Although LDA is inherently an unsupervised learning model primarily used for identifying topic distributions in document collections, it can also be indirectly applied to classification tasks by transforming documents into topic-based representations and using these topic distributions as features for classifier training (23). Bootstrap Aggregating is a classic ensemble learning method that combines multiple models to enhance the accuracy of predictions and reduce model variance, using multiple training subsets to stabilize and improve the accuracy of final predictions (24).
BernoulliNB is a specific type of Naive Bayes classifier, designed for binary feature classification tasks. It performs well in text data handling, such as document classification or other binary classification problems, by assuming conditional independence of features within each class, simplifying computation. The Calibrated Classifier is an adjusted classifier that refines prediction probabilities to more accurately reflect the likelihood of actual events, providing more reliable probability estimates that enhance decision-making and risk assessment (25).
GNB is a classification algorithm based on Naive Bayes theory, particularly suitable for scenarios where features adhere to a Gaussian (normal) distribution. It excels in medical predictions and other areas, effectively handling continuous data and providing classification results based on normal distributions. Logistic Regression is a widely used linear classification algorithm that predicts the probability of sample membership in a category. As a simple yet effective model, it performs well in many classification tasks, especially when the relationship between features and the target variable is approximately linear (26).
MLP is a feed-forward neural network used for both classification and regression problems. By learning complex patterns and features through multiple hidden layers (“multilayers”), MLP handles highly nonlinear prediction tasks and finds broad applications in image recognition, natural language processing, and more (27). QDA extends LDA by allowing different covariance matrices for each class, effectively managing cases with complex nonlinear boundaries between categories, particularly suitable for predictions where classes exhibit varying covariance structures (28).
These models have been extensively studied and applied in the field of radiomics, demonstrating their excellent performance and high generalization capabilities (29–31). By utilizing these models for the prediction and classification of radiomics features, we achieved prediction of intrahepatic recurrence in colorectal cancer liver metastasis patients within one year post-surgery, explored the application of integrated models in clinical decision support systems, and aimed to reduce misdiagnosis rates and patient loss through high-precision radiomics grading methods.
2.6 Evaluation metrics
This research employs a comprehensive suite of evaluation metrics to ensure objective and thorough assessment of machine learning models, utilizing Accuracy (ACC), Precision (PRE), Recall (REC), and the Receiver Operating Characteristic (ROC) curve with the Area Under the Curve (AUC). These metrics, based on True Positives (TP), True Negatives (TN), False Positives (FP), and False Negatives (FN), offer a multifaceted evaluation of model performance. The formula of ACC (Equation 1), PRE (Equation 2) and REC (Equation 3):
3 Results
3.1 Experimental set-up
To rigorously evaluate the models developed in this study, the dataset was divided into an 80:20 training-to-test ratio for independent testing of model efficacy. During the feature selection phase, to ensure the statistical significance of the features and mitigate computational demands associated with high-dimensionality, a T-test was applied to retain data within a 95% confidence interval. Feature selection was further refined using the Lasso method with an alpha setting of 0.65, aimed at balancing model complexity against performance for optimal feature dimensionality. Additionally, careful adjustments were made to the iterations of various machine learning models to train them to convergence without overfitting, setting each model to run for 50 iterations with a batch size of 512 to ensure comprehensive learning and efficient allocation of computational resources.
All experiments were conducted on a Windows 11 Professional Edition, using Python 3.6.0. The architecture design and validation of models were supported by packages such as Pyradiomics 3.0.1, Scikit-learn 0.24.2, scipy 1.5.4, and matplotlib. The hardware setup included an Intel Core i7 10750H CPU (base frequency 2.6GHz, turbo up to 5GHz, six cores/twelve threads) and an NVIDIA GeForce GTX 4060Ti GPU (16GB memory, 128-bit memory bus).
3.2 Results of feature extraction and selection
In this study, we leveraged expert-annotated three-dimensional CT images of patients with colorectal cancer liver metastases to extract radiomic features. Utilizing the Pyradiomics package in Python, we initially extracted 995 radiomic features from the raw CT images. However, due to the large volume and potential irrelevance of many features to the model’s decision-making process, feature selection was imperative to reduce noise and improve model performance.
Initially, features with a significance level below 0.05 were excluded via T-tests, filtering out features significantly related to the target variable. This T-test yielded 72 features. However, some remaining features irrelevant to the machine learning models could still impact performance. Further feature refinement was conducted using LASSO regression, identifying key predictive features through 10-fold cross-validation. The cross-validation curve and regression coefficient path for LASSO are illustrated in Figure 3.
Figure 3. The results of lasso feature selection. (A) The cross-validation curve of LASSO. (B) The regression coefficient path of LASSO.
Ultimately, LASSO regression selected 8 essential features, as depicted in the feature contribution curve in Figure 4. These features were adequate for training the machine learning model, effectively eliminating the interference from noise and irrelevant features, thus significantly enhancing the model’s performance.
3.3 Results of intrahepatic recurrence models
To build more comprehensive models for predicting intrahepatic recurrence within one year post-surgery in patients with colorectal cancer liver metastases, we developed and compared two methodologies: a radiomics-only model and a combined radiomics-clinical model. The radiomics-only model employs radiomic features extracted from CT images, which are input into a machine learning model after feature selection. Conversely, the combined radiomics-clinical model incorporates selected radiomic features with clinical attributes as model inputs. These clinical attributes include patient age, BMI, tumor size, and preoperative chemotherapy status. This integrative approach enables the model to simultaneously consider patient information and clinical indicators of tumor condition, thus enhancing the model’s interpretability and improving its predictive performance.
3.3.1 Results of radiomics models
In this research, features extracted via radiomics were used to train and validate machine learning models for predicting intrahepatic recurrence within one year after surgery in patients with colorectal cancer liver metastases. The performance of these radiomics models is detailed in Table 2.
The Gradient Boosting classifier showed the highest accuracy among the evaluated models, achieving an average ACC of 72.41% and an AUC of 66.15%. In contrast, the Calibrated Classifier model recorded the lowest misdiagnosis rate, with ACC, PRE, and AUC values of 65.52%, 93.75%, and 71.46%, respectively. The ROC curves and Calibration curves of the radiomics models, depicted in Figure 5, reveal that the ROC curve of the Calibrated Classifier model is closest to the upper left corner, while its Calibration curve nears the ideal 45-degree line, demonstrating its outstanding clinical utility.
Figure 5. The visualization of model result of radiomics models. (A) The ROC curve of radiomics models. (B) The Calibration curves of the radiomics models.
3.3.2 Results of radiomics-clinical models
To utilize more comprehensive information for training models, we established a radiomics-clinical dataset for training machine learning models. This dataset significantly enhanced model performance compared to using solely radiomic features, as shown in Table 3.
In the radiomics-clinical model, the Random Forest classifier exhibited the best performance, achieving an ACC of 75.86% and AUC of 70.83%. It was closely followed by the Gradient Boosting, LightGBM, and Extra Trees models. The ROC curves and Calibration curves of the radiomics-clinical models, displayed in Figure 6, show that the ROC curve of the Random Forest model is nearer to the upper left corner and its Calibration curve is closer to the ideal 45-degree line, demonstrating higher reliability and clinical suitability. The use of radiomics-clinical features in the Random Forest model has promising prospects for clinical application, potentially assisting doctors in diagnosis, reducing the workload of radiologists, and minimizing the potential harm caused by diagnostic errors and omissions.
Figure 6. The visualization of model result of radiomics-clinical models. (A) The ROC curve of radiomics-clinical models. (B) The Calibration curves of the radiomics-clinical models.
4 Discussion
Radiomics facilitates the automatic analysis of numerous image features in a short duration, many of which are difficult to assess visually (32). Integrating radiomics with machine learning models allows for objective and rapid disease classification. This combination provides clinicians with a valuable tool, aiding in the selection of appropriate treatment plans and avoiding unnecessary interventions (33). In this study, the Pyradiomics library was used to extract radiomic data from one-year postoperative CT images of colorectal cancer liver metastasis patients, combined with clinical data, to predict intrahepatic recurrence within one year post-surgery. Fifteen machine learning models were developed and evaluated. The radiomics-clinical model, particularly the Random Forest classifier, demonstrated the highest predictive performance, with an accuracy (ACC) of 75.86% and an area under the curve (AUC) of 70.83%. These promising results have potential clinical applications, helping physicians in diagnosis, reducing the workload of radiologists, identifying features not easily visible to the eye, lowering the rate of misdiagnoses, and ultimately reducing harm to patients.
Colorectal cancer has become the third most common malignant tumor worldwide (1).Liver metastasis is the most common distant metastasis pathway in patients with advanced colorectal cancer. 20% to 25% of patients with colorectal cancer have liver metastases at the time of diagnosis, and up to 50% of patients with colorectal cancer will develop synchronous liver metastases after resection of the primary tumor (34). Surgical radical resection of liver metastases is currently the best way to cure colorectal cancer liver metastases (35).
In clinical practice, some patients with colorectal cancer liver metastases still have a high recurrence rate after surgery, and the benefits of surgery are not obvious. The clinical and pathological characteristics of patients with colorectal cancer liver metastases, including patient characteristics, preoperative treatment, primary tumor and liver metastasis characteristics, and surgical factors, are all related to postoperative recurrence. Currently, the CRS score is the most commonly used evaluation system in clinical practice to predict recurrence and survival in patients with colorectal cancer liver metastases. It is of great value in guiding the timing of surgery and the choice of perioperative treatment, but it is not sufficient to predict the risk of recurrence after resection of liver metastases (36).
Imaging informatics is widely used in the diagnosis, grading and staging, efficacy evaluation and prognosis prediction of tumors by extracting a large number of imaging features from imaging images and analyzing image information in detail (37). This study combined imaging and clinical characteristics to initially establish a visual machine learning prediction model that can predict early intrahepatic recurrence after resection of liver metastases in patients with colorectal cancer liver metastases, thereby providing an effective basis for developing more accurate individualized treatment plans for patients with colorectal cancer liver metastases. For patients in the high-risk group for early recurrence identified by the model, we should take more active measures to check and treat the disease to increase disease control.
This study has several limitations. First, recurrence was evaluated only within a fixed 1-year timeframe; extending follow-up duration and incorporating diverse evaluation criteria would strengthen future analyses. Second, the absence of key biomarkers (RAS/BRAF mutations, MSI status, HER-2 expression) precluded assessment of their prognostic impact. Third, the single-center retrospective design with limited sample size constrains external validity. While genomic integration remains clinically imperative, its implementation requires resource-adaptive methodologies. We propose targeted genetic profiling (e.g., RAS/BRAF PCR) coupled with imaging biomarkers as a pragmatic solution. Our planned pilot study (N = 50-80) will validate this approach using propensity-weighted methods to establish scalable multimodal frameworks.
5 Conclusion
CT image omics combined with clinical parameters can predict the risk of early intrahepatic recurrence after surgery in patients with colorectal cancer liver metastases, showing high sensitivity and specificity. It can be used to stratify the risk of recurrence in this group of patients, and more active examination measures and adjuvant treatment can be considered for patients in the high-risk group. In addition, a prospective prediction model combining multiple omics may have higher accuracy, which is also the direction of future research on prediction models.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.
Author contributions
YL: Software, Data curation, Writing – original draft, Writing – review & editing. YH: Data curation, Formal Analysis, Writing – original draft. ZL: Writing – review & editing. XF: Project administration, Writing – review & editing. CY: Writing – review & editing, Supervision.
Funding
The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by National Natural Science Foundation of China (12326618),Xiamen Medical and Health Guidance Project(3502Z20244ZD1040 and 3502Z20244ZD1050).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Bray F LM, Sung H, Ferlay J, Siegel RL, Soerjomataram I, and Jemal A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834
2. Biller LH and Schrag D. Diagnosis and treatment of metastatic colorectal cancer: A review. Jama. (2021) 325:669–85. doi: 10.1001/jama.2021.0106
3. Ciardiello F, Ciardiello D, Martini G, Napolitano S, Tabernero J, and Cervantes A. Clinical management of metastatic colorectal cancer in the era of precision medicine. CA Cancer J Clin. (2022) 72:372–401. doi: 10.3322/caac.21728
4. Giannis D, Sideris G, Kakos CD, Katsaros I, and Ziogas IA. The role of liver transplantation for colorectal liver metastases: A systematic review and pooled analysis. Transplant Rev (Orlando). (2020) 34:100570. doi: 10.1016/j.trre.2020.100570
5. Stewart CL, Warner S, Ito K, Raoof M, Wu GX, Kessler J, et al. Cytoreduction for colorectal metastases: liver, lung, peritoneum, lymph nodes, bone, brain. When does it palliate, prolong survival, and potentially cure? Curr Probl Surg. (2018) 55:330–79. doi: 10.1067/j.cpsurg.2018.08.004
6. Siegel RL, Miller KD, and Jemal A. Cancer statistics, 2018. CA Cancer J Clin. (2018) 68:7–30. doi: 10.3322/caac.21442
7. Xu F TB, Jin TQ, and Dai CL. Current status of surgical treatment of colorectal liver metastases. World J Clin cases. (2018) 6:716–34. doi: 10.12998/wjcc.v6.i14.716
8. Boudjema K LC, Sabbagh C, Ortega-Deballon P, Heyd B, Bachellier P, Métairie S, et al. Simultaneous versus delayed resection for initially resectable synchronous colorectal cancer liver metastases: A prospective, open-label, randomized, controlled trial. Ann Surg. (2021) 273:49–56. doi: 10.1097/SLA.0000000000003848
9. Jones RP, Jackson R, Dunne DFJ, Malik HZ, Fenwick SW, Poston GJ, et al. Systematic review and meta-analysis of follow-up after hepatectomy for colorectal liver metastases2. Br J Surg. (2012) 99:477–86. doi: 10.1002/bjs.8667
10. Concors S HA and Vauthey JN. Modeling the prediction of early treatment failure after hepatectomy for colorectal liver metastases. Ann Surg Oncol. (2023) 30:3182–3. doi: 10.1245/s10434-023-13343-4
11. Imai K, Allard MA, Benitez CC, Vibert E, Sa Cunha A, Cherqui D, et al. Early recurrence after hepatectomy for colorectal liver metastases: what optimal definition and what predictive factors? Oncologist. (2016) 21:887–94. doi: 10.1634/theoncologist.2015-0468
12. Lam CSN, Bharwani AA, Chan EHY, Chan VHY, Au HLH, Ho MK, et al. A machine learning model for colorectal liver metastasis post-hepatectomy prognostications. Hepatobiliary Surg Nutr. (2023) 12:495–506. doi: 10.21037/hbsn-21-453
13. Amygdalos I, Müller-Franzes G, Bednarsch J, Czigany Z, Ulmer TF, Bruners P, et al. Novel machine learning algorithm can identify patients at risk of poor overall survival following curative resection for colorectal liver metastases. J Hepatobiliary Pancreat Sci. (2023) 30:602–14. doi: 10.1002/jhbp.v30.5
14. Zhang Y, Zhang Z, Wei L, and Wei S. Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer. Front Public Health. (2022) 10. doi: 10.3389/fpubh.2022.1008137
15. Mühlberg A, Holch JW, Heinemann V, Huber T, Moltz J, Maurus S, et al. The relevance of CT-based geometric and radiomics analysis of whole liver tumor burden to predict survival of patients with metastatic colorectal cancer. Eur Radiol. (2021) 31:834–46. doi: 10.1007/s00330-020-07192-y
16. Van Griethuysen JJ, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. doi: 10.1158/0008-5472.CAN-17-0339
18. Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems (2017) 30:3146–54.
19. Cover T and Hart P. Nearest neighbor pattern classification. IEEE Transact Inform Theor. (1967) 13:21–7. doi: 10.1109/TIT.1967.1053964
20. Song Y-Y and Lu Y. Decision tree methods: applications for classification and prediction. Shanghai Arch Psych. (2015) 27:130. doi: 10.11919/j.issn.1002-0829.215044
21. Freund Y and Schapire RE. A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci. (1997) 55:119–39. doi: 10.1006/jcss.1997.1504
22. Geurts P, Ernst D, and Wehenkel L. Extremely randomized trees. Mach Learn. (2006) 63:3–42. doi: 10.1007/s10994-006-6226-1
23. Blei DM, Ng AY, and Jordan MI. Latent dirichlet allocation. J Mach Learn Res. (2003) 3:993–1022.
24. Lin J, Chen H, Li S, Liu Y, Li X, and Yu B. Accurate prediction of potential druggable proteins based on genetic algorithm and Bagging-SVM ensemble classifier. Artif Intell Med. (2019) 98:35–47. doi: 10.1016/j.artmed.2019.07.005
25. Silva Filho T, Song H, Perello-Nieto M, Santos-Rodriguez R, Kull M, and Flach P. Classifier calibration: a survey on how to assess and improve predicted class probabilities. Mach Learn. (2023) 112:3211–60. doi: 10.1007/s10994-023-06336-7
26. Chati PM, Raghunathan K, and Thiagarajah JR. Mo1799 identification of transcriptional signatures to classify disease in ulcerative colitis via an iterative gaussian naive bayes model. Gastroenterology. (2024) 166:S–1116. doi: 10.1016/S0016-5085(24)03038-5
27. Popescu M-C, Balas VE, Perescu-Popescu L, and Mastorakis N. Multilayer perceptron and neural networks. WSEAS Trans Circuits Syst. (2009) 8:579–88.
28. Sidel JL, Bleibaum RN, and Tao KC. Quantitative descriptive analysis. Descriptive Anal sensory Eval. (2018) 287–318. doi: 10.1002/9781118991657.ch8
29. Zhou M, Scott J, Chaudhury B, Hall L, Goldgof D, Yeom KW, et al. Radiomics in brain tumor: image assessment, quantitative feature descriptors, and machine-learning approaches. Am J Neuroradiology. (2018) 39:208–16. doi: 10.3174/ajnr.A5391
30. Farook TH and Dudley J. Automation and deep (machine) learning in temporomandibular joint disorder radiomics: A systematic review. J Oral Rehabil. (2023) 50:501–21. doi: 10.1111/joor.13440
31. Karabacak M, Ozkara BB, Ozturk A, Kaya B, Cirak Z, Orak E, et al. Radiomics-based machine learning models for prediction of medulloblastoma subgroups: a systematic review and meta-analysis of the diagnostic test performance. Acta Radiologica. (2023) 64:1994–2003. doi: 10.1177/02841851221143496
32. Algohary A, Viswanath S, Shiradkar R, Ghose S, Pahwa S, Moses D, et al. Radiomic features on MRI enable risk categorization of prostate cancer patients on active surveillance: Preliminary findings. J Magnetic Resonance Imaging. (2018) 48:818–28. doi: 10.1002/jmri.25983
33. Fütterer JJ, Briganti A, De Visschere P, Emberton M, Giannarini G, Kirkham A, et al. Can clinically significant prostate cancer be detected with multiparametric magnetic resonance imaging? A systematic Rev literature Eur Urol. (2015) 68:1045–53. doi: 10.1016/j.eururo.2015.01.013
34. Ma ZH, Wang YP, Zheng WH, Ma J, Bai X, Zhang Y, et al. Prognostic factors and therapeutic effects of different treatment modalities for colorectal cancer liver metastases. World J Gastrointest Oncol. (2020) 12:1177–94. doi: 10.4251/wjgo.v12.i10.1177
35. Avella P, Vaschetti R, Cappuccio M, Gambale F, Meis LDE, Rafanelli F, et al. The role of liver surgery in simultaneous synchronous colorectal liver metastases and colorectal cancer resections: a literature review of 1730 patients underwent open and minimally invasive surgery. Minerva Surg. (2022) 77:582–90. doi: 10.23736/S2724-5691.22.09716-7
36. Tan MC, Butte JM, Gonen M, Kemeny N, Fong Y, Allen PJ, et al. Prognostic significance of early recurrence: a conditional survival analysis in patients with resected colorectal liver metastasis. HPB (Oxford). (2013) 15:803–13. doi: 10.1111/hpb.12136
Keywords: colorectal liver metastasis, CT radiomics, machine learning, intrahepatic recurrence, prediction
Citation: Lin Y, Huang Y, Liu Z, Feng X and Yang C (2025) Predicting early recurrence of colorectal cancer liver metastases: an integrative approach using radiomics and machine learning. Front. Oncol. 15:1613093. doi: 10.3389/fonc.2025.1613093
Received: 16 April 2025; Accepted: 29 August 2025;
Published: 14 November 2025.
Edited by:
Arka Bhowmik, Memorial Sloan Kettering Cancer Center, United StatesReviewed by:
Lisheng Wang, Shanghai Jiao Tong University, ChinaLong Wu, Affiliated Hospital of Guizhou Medical University, China
Yen Cho Huang, Chang Gung Memorial Hospital, Taiwan
Copyright © 2025 Lin, Huang, Liu, Feng and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xiaobin Feng, ZmVuZ3hpYW9iaW4yMDA3MDhAYWxpeXVuLmNvbQ==; Chunkang Yang, Y2h1bmthbmcxMjlAZmptdS5lZHUuY24=
Yunxia Huang3