Machine learning-based radiomics approach assessing preoperative non-contrast CT for microsatellite instability prediction in colon cancer

Ren, Dongming; Wang, Yingjuan; Chen, Luda; He, Jianfeng; Shen, Tao

doi:10.3389/fphys.2025.1672636

ORIGINAL RESEARCH article

Front. Physiol., 29 September 2025

Sec. Computational Physiology and Medicine

Volume 16 - 2025 | https://doi.org/10.3389/fphys.2025.1672636

This article is part of the Research TopicMedical Knowledge-Assisted Machine Learning Technologies in Individualized Medicine Volume IIView all 29 articles

Machine learning-based radiomics approach assessing preoperative non-contrast CT for microsatellite instability prediction in colon cancer

Dongming Ren¹^†

Yingjuan Wang²^†

Luda Chen¹

Jianfeng He¹*

Tao Shen³*

¹Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
²Department of Radiology, The third Affiliated Hospital of Kunming Medical University, Kunming, China
³Department of Colorectal Surgery, The third Affiliated Hospital of Kunming Medical University, Kunming, China

Objectives: To assess the feasibility of non-contrast CT-based radiomics model for predicting microsatellite instability (MSI) status in colon cancer.

Methods: Leveraging non-contrast abdominal CT imaging data from 57 retrospectively enrolled patients with balanced class distribution (training cohort: n = 38, 19 non-MSI-H and 19 MSI-H; test cohort: n = 19, 9 non-MSI-H and 10 MSI-H), we implemented a voxel volume-based tumor feature selection method. Feature selection integrated four feature selection filters—correlation analysis, univariate logistic regression, least absolute shrinkage and selection operator (LASSO), and recursive feature elimination (RFE). We comparatively evaluated multiple classifiers using cross-validation combined with accuracy for choosing the best classifier.

Results: A multilayer perceptron-based classification model was developed, achieving average multifold accuracy of 0.871 in cross-validation on the training cohort. In the test cohort, the model achieved an AUC of 0.944 (95% CI 0.841–1.000) with accuracy of 0.842, while maintaining sensitivity of 0.889 and specificity of 0.800, demonstrating excellent and comparable performance to previous contrast-enhanced CT-based radiomics models.

Conclusion: We validated the feasibility of non-contrast CT for MSI prediction in colon cancer with radiomics analysis, highlighting its potential as a flexible and cost-effective preliminary screening tool. This approach, which does not require supplementary medical examination, may enhance clinical decision-making by providing a valuable tool for identifying MSI-H molecular subtypes in colon cancer patients.

1 Introduction

Colon cancer ranks among the most prevalent malignancies globally, with epidemiological data documenting 1.14 million new cases and 530,000 associated deaths in 2022, reflecting a concerning upward trajectory in both incidence and mortality rates (Bray et al., 2024; Smith et al., 2002). Microsatellite instability has emerged as a pivotal molecular biomarker in colon oncology, with substantial evidence demonstrating differential therapeutic responses to immunotherapy and chemotherapy across the three MSI subcategories—Microsatellite Stable (MSS), Microsatellite Instability-Low (MSI-L), and Microsatellite Instability-High (MSI-H) (Zhao et al., 2019; Le et al., 2015). Notably, patients with MSI-H exhibit enhanced responsiveness to immune checkpoint inhibitors (ICIs) and more favorable prognostic (André et al., 2020), thereby demonstrating the value of MSI detection in guiding therapeutic strategies and prognostic evaluation for colon cancer management (Sinicrope and Sargent, 2012). Clinical practice guidelines recommend testing for MSI in all colon cancer patients (Benson et al., 2021).

Clinical methods predominantly used for MSI detection include polymerase chain reaction (PCR) and immunohistochemistry (IHC) (Söreide et al., 2006; Lindor et al., 2002). IHC evaluates mismatch repair (MMR) protein expression loss in tissue specimens and carries risks of false-negative interpretations, requiring 2–7 days for completion. While PCR remains the gold standard through direct microsatellite region length analysis (Umar et al., 2004), its implementation necessitates over a week of processing time and a higher cost. Both methods are expensive, depending on invasive biopsy sampling coupled with complex molecular techniques (Sepulveda et al., 2017; Harada and Morlote, 2020). The contradiction between the clinical demand for MSI testing and the high cost of conventional detection methods has created an urgent need for a non-invasive, low-cost approach that is desired by both patients and clinicians.

Radiomics-based feature analysis can capture the heterogeneous manifestations of tumors in medical imaging, providing a non-invasive and objective auxiliary tool for clinical diagnosis (Lambin et al., 2017). Existing studies have shown the feasibility of modeling using contrast-enhanced CT (CECT) imaging combined with radiomics to predict MSI status in colon cancer (Cao et al., 2021; Golia Pernicka et al., 2019; Bian et al., 2024; Li et al., 2021). Cao et al. integrated carcinoembryonic antigen (CEA), carbohydrate antigen 199 (CA199), and carbohydrate antigen 125 (CA125) with triphasic CECT for nomogram development (Cao et al., 2021). Golia et al. incorporated clinical information (e.g., lymph node positivity, mucinous adenocarcinoma status, KRAS mutation profiles) with venous-phase CECT in random forest modeling (Golia Pernicka et al., 2019). Bian et al. aggregated monophasic or multiphasic CECT with laboratory indices including CEA, CA199, white blood cell count, etc (Bian et al., 2024). Similarly, Li et al. incorporated venous-phase CT features with CEA and CA199 for machine learning model development (Li et al., 2021). These studies rely on multiphasic or venous-phase CECT and invasive auxiliary examinations to incorporate additional clinical features for enhancing the performance of the combination model. However, these supplementary tests not only increase prediction costs and procedural complexity but also diminish the non-invasive advantage of the method while amplifying data collection and integration challenges. These combined factors restrict the clinical implementation and widespread adoption of such models.

Notably, previous studies on the MSI status of colon cancer have paid insufficient attention to radiomic analysis of non-contrast CT (Wang et al., 2023). The use of non-contrast CT in tumor-related radiomics studies is common across various areas, including rectal cancer (Yuan et al., 2020), Hodgkin’s lymphoma (Jensen et al., 2022), adrenal tumors (Yuan et al., 2023), and gastrointestinal stromal tumors (Palatresi et al., 2022; Zhang et al., 2020; Xie et al., 2023), covering a wide range of tissue types. Yuan et al. conducted radiomics analysis on non-contrast CT scans of rectal cancer cases undergoing treatment to assess tumor regression grading (Yuan et al., 2020). Laura J. et al. also explored the feasibility of using non-contrast CT to evaluate the metabolic activity of Hodgkin’s lymphoma (Jensen et al., 2022). Additionally, researchers have attempted to use radiomic features extracted from non-contrast CT to differentiate between solitary micronodular adrenal hyperplasia and lipid-poor adenomas (Yuan et al., 2023). Non-contrast CT has also been used in multiple studies on gastrointestinal stromal tumors (Palatresi et al., 2022; Zhang et al., 2020; Xie et al., 2023). However, it remains unclear whether radiomics features extracted from non-contrast CT can be used to assess the MSI status of colon cancer currently.

Therefore, the aim of this study was to assess the feasibility of non-contrast CT-based radiomics model for predicting MSI status in colon cancer. We further validated this non-invasive approach, which integrates non-contrast CT imaging with routine clinical features to provide clinicians with a highly accessible and low-cost screening tool for detecting MSI status in colon cancer.

2 Materials and methods

2.1 Patient cohort

This study involving human participants was approved by the Medical Ethics Committee of the Third Affiliated Hospital of Kunming Medical University (Approval No. KYLX2023-198). All procedures were conducted in accordance with relevant guidelines and regulations, including the Declaration of Helsinki. As a retrospective study, informed consent was waived by the ethics committee.

The study cohort included patients with PCR-confirmed diagnoses (October 2020–January 2022). The collected data comprised preoperative CT scans, MSI status (non-MSI-H or MSI-H), and four clinical variables: age, sex, colon laterality (left/right), and subsite (ascending/transverse/descending/sigmoid). Feature selection prioritized readily accessible parameters, deliberately excluding histopathological, genomic, and even basic medical history data to minimize diagnostic complexity, clinician-patient communication, and resource utilization.

Exclusion criteria comprised: 1) Rectal carcinomas based on distinct imaging and physiological characteristics compared to colonic malignancies (n = 4); 2) Suboptimal CT imaging quality per predefined criteria due to metal artifacts or motion (n = 1). The final cohort included 57 patients with colon cancer, stratified into training and test cohorts through randomized allocation with balanced classes, as detailed in Figure 1.

Figure 1

Flowchart of cohort selection for a study with 62 colorectal cancer patients. One excluded due to poor image, four due to rectal cases, leaving 57. These are divided into training cohort (38 patients: 19 non-MSI-H, 19 MSI-H) and test cohort (19 patients: 9 non-MSI-H, 10 MSI-H).

Figure 1. The inclusion and exclusion criteria for patients and the sample cohort were partitioned. Both the training and test cohorts exhibited a non-MSI-H to MSI-H class ratio of approximately 1:1, ensuring a balanced distribution of positive and negative classes.

2.2 Assessment of MSI status

The PCR-based MSI detection protocol, considered the gold standard for MSI assessment in colon cancer, demonstrates high diagnostic accuracy with sensitivity and specificity ranges of 67%–100% and 61%–100%, respectively (Vilar and Gruber, 2010; Zhang and Li, 2013). Tumor and matched normal DNA samples were analyzed using fluorescence-labeled primers targeting five consensus microsatellite loci (BAT-25, BAT-26, D2S123, D5S346, D17S250) in our study, followed by capillary electrophoresis (Umar et al., 2004). MSI status was classified as MSI-H (≥ two unstable loci), MSI-L (single unstable locus), or MSS (no instability). All MSI assessments in this study were definitively determined by PCR, ensuring biologically validated labels for radiomics modeling.

2.3 CT imaging acquisition and segmentation

All imaging data were acquired using a standardized protocol on the same Siemens SOMATOM Definition AS+ 64-detector row CT scanner (128-slice configuration). Scanning parameters included: 120 kV tube voltage, automated tube current modulation via CareDose 4D technology, helical pitch 0.6, isotropic reconstruction at 2 mm slice thickness, and voxel spacing 0.8 × 0.8 × 2 mm (x, y, z). Anatomical coverage extended from 2 cm superior to the diaphragmatic dome to the inferior pubic symphysis, encompassing the entire abdominal cavity. All non-contrast imaging data were archived in DICOM format and retrieved through the institutional Picture Archiving and Communication System (PACS).

Region of interest (ROI) segmentation was conducted by one radiologist (15+ years’ experience) employing 3D Slicer (v5.6.2, https://www.slicer.org/) under blinded conditions to the pathology information of each patient. Following imaging quality verification, manual segmentation encompassed: primary tumor mass, adjacent bowel segments (5 mm margin), mesenteric fat infiltration zones, iliac vascular territories, and retroperitoneal nodal stations. The DICOM files with ROI were saved for subsequent radiomics analysis.

2.4 Feature extraction, selection, and model building

Preprocessing was implemented for non-contrast CT imaging comprising: 1) optimal windowing (width/level) configuration for abdominal soft tissue visualization; 2) intensity normalization (0–255 grayscale range). Preprocessed data were converted into integer discrete gray-level values with a bin width of 25 and then analyzed using PyRadiomics (version 3.1.0; http://www.radiomics.io/) (Van Griethuysen et al., 2017), an extensively used tool for quantitative imaging biomarker extraction. These biomarkers characterize tumor phenotypes, including intralesional heterogeneity patterns and spatial infiltration characteristics. For instance, the original_shape_VoxelVolume calculates three-dimensional tumor burden using the equation:

o r i g i n a l_s h a p e_V o x e l V o l u m e = \sum_{K = 0}^{N} V_{k}

This biomarker is calculated by multiplying segmented voxel count by voxel-wise volume unit $V_{k}$ , equivalent to summing all tumor voxel volumes. Therefore, original_shape_VoxelVolume was used as the primary measure of tumor voxel in our study. Non-numeric radiomics features were removed after extraction. Each retained feature was Z-score standardized to reduce the scale differences between features. Since the MSI-H and non-MSI-H groups had similar sample sizes in the training cohort, no setting for class imbalance was applied in subsequent model development.

Previous studies have shown that the MSI status in colon cancer exhibits strong correlation with texture-related features (Cao et al., 2021; Wang et al., 2023). This study employed a systematic approach to identify texture-related and key features from non-contrast CT, with the selection process summarized in Figure 2. The workflow began by selecting tumors based on volumetric criteria (10,000 - 110,000 voxels). Features extracted from these selected tumors were then processed through three parallelized filtering methods with continuous clinical parameters. The filtering strategies comprised: 1) correlation coefficient analysis (absolute value of correlation coefficient is greater than 0.3), 2) univariate logistic regression (p < 0.05), and 3) LASSO (non-zero regression coefficient). Each filtering method independently screened the extracted features; in other words, a feature only needed to meet the criteria of one filter to pass the first-level screening. Then, the union of the results from the three methods was taken and subjected to a second-level screening using RFE to determine the optimal features. This second-level screening process also reduced the dimensionality and mitigated the risk of overfitting.

Figure 2

Flowchart depicting a machine learning workflow. Clinical and radiomics features are processed through correlation, logistic regression, and LASSO, followed by dimensionality reduction. The train cohort undergoes five-fold cross-validation using multiple classifiers: multilayer perceptron, support vector machine, random forest, gradient boosting decision trees, and k-nearest neighbors. The best classifier is selected. The process involves 7 radiomics features and 1 clinical feature, including age.

Figure 2. The workflow of feature selection and model building.

Five classical machine learning classifiers (support vector machines (SVM), gradient boosting decision trees (GBDT), k-nearest neighbors (KNN), random forest (RF), and multilayer perceptron (MLP)) were incorporated in this study. Each algorithm has strengths and limitations in handling data distributions, feature complexity, and noise (Caruana and Niculescu-Mizil, 2006; Fernández-Delgado et al., 2014), as shown in Table 1. Comparative evaluation facilitates the identification of classifiers that achieve an optimal balance between accuracy and computational efficiency, supporting model building that aligns with research requirements.

Table 1

Table 1. The advantages and disadvantages of the five classifiers.

Consequently, all classifiers were evaluated using the Stratified K-Fold algorithm with 5-fold cross-validation on the training cohort. During cross-validation, we used accuracy as a metric, combined with grid search, to determine the optimal hyperparameters for each classifier. The average accuracy across the five folds was used as the initial performance metric for the models. Subsequently, a more comprehensive comparison of the five classifiers was conducted on the test cohort using additional metrics to determine the final classifier to be used.

2.5 Statistical analysis

This study implemented a complete analytical workflow using Python (v3.8.19) with third-party libraries (scikit-learn v1.3.2, pandas v2.0.3) for data processing and model development. Clinical features were summarized as mean or interquartile range. Radiomics features were extracted via PyRadiomics. Feature selection combined correlation analysis (correlation value >0.30), LASSO, and univariate logistic regression (statistically significant differences P < 0.05). The dimensionality reduction was performed using RFE. The performance of each classifier was evaluated across the fivefold cross-validation with multifold mean accuracy. The final model performance was evaluated using accuracy, sensitivity, specificity, F1-score, precision, recall, and ROC analysis (AUC with 95% CI via DeLong test). This systematic approach ensured methodological rigor and reliable predictive performance.

3 Results

3.1 Patient profiles

This study enrolled 57 colon cancer patients (28 males, 29 females), divided into non-MSI-H and MSI-H groups. The MSI-H group was younger (50.7 vs. 61.8 years) and had a higher proportion of males. Most tumors were right-sided (71.9%), with the ascending colon as the primary subsite (57.9% overall; 58.6% in non-MSI-H). No significant differences were observed in subsite distribution (ascending/transverse/descending/sigmoid) or colon laterality (right/left) between groups, indicating similar spatial patterns. See Table 2 for complete clinical comparison.

Table 2

Table 2. Patient profiles.

3.2 Feature selection

These eight features exhibited variable prioritization across the three linear selection methodologies (Figure 3). Different screening methods have their own criteria. In LASSO, four features with non-zero regression coefficients are more important, while the remaining features are indicated by short vertical lines (Figure 3A). From the perspective of correlation analysis, five features are highly correlated with MSI-H (absolute value of the correlation coefficient exceeds 0.3), making them more important than the other three features (Figure 3B). Univariate logistic regression (Figure 3C) further emphasizes five statistically significant features (p < 0.05), indicating that these features are more predictive of MSI status. The correlations between the features are shown in Figure 3D.

Figure 3

Panels display various statistical analyses. Panel A shows LASSO regression coefficients for features F1 to F8, with non-zero coefficients marked. Panel B displays feature correlation coefficients, indicating strong and weak correlations. Panel C presents univariate logistic regression results, highlighting significant and non-significant p-values. Panel D illustrates a feature correlation matrix, where circle size represents correlation strength, with a gradient color scale indicating coefficient values.

Figure 3. The performance of the 8 final modeling features in selection and correlation matrix (F1: wavelet-LLH_glcm_InverseVariance, F2: squareroot_glszm_SmallAreaLowGrayLevelEmphasis, F3: wavelet-HHL_glszm_GrayLevelVariance, F4: exponential_ngtdm_Contrast, F5: square_ngtdm_Strength, F6: wavelet-LLL_ngtdm_Strength, F7: squareroot_ngtdm_Complexity, F8: age). (A) In LASSO, four features showed non-zero coefficients, with the remaining features indicated by short vertical lines. (B) Correlation analysis revealed five features with strong correlations exceeding 0.3 and three with weaker correlations below 0.3. (C) Univariate logistic regression identified five statistically significant features, while three features remained non-significant. (D) The correlation between features.

The three linear methods—correlation analysis, univariate logistic regression, and LASSO—were applied to analyze 1,416 extracted features. The detailed results of these three linear filters are presented in the Supplementary Material. Each method identified distinct but interpretable associations with clinical outcomes. Univariate logistic regression selected 30 features, including 10 unique to this approach, by evaluating the individual contribution of each feature. LASSO chose 28 features with 19 method-exclusive identifiers, effectively controlling model complexity to prevent overfitting. Correlation analysis produced the most conservative results, retaining only 20 features. Implementing these methods in parallel leveraged their complementary strengths, enabling a comprehensive exploration of the feature space while reducing redundancy for improved model generalizability. The outputs from three linear filters were integrated with categorical clinical features, resulting in a pool of 54 features. Guided by established engineering standards to prevent overfitting—specifically applying the 20% dimensionality rule relative to the training cohort size (Guyon and Elisseeff, 2003) —and based on our training cohort size of 39, we performed RFE to reduce the feature pool to 8 relevant predictors: wavelet-LLH_glcm_InverseVariance, squareroot_glszm_SmallAreaLowGrayLevelEmphasis, wavelet-HHL_glszm_GrayLevelVariance, exponential_ngtdm_Contrast, square_ngtdm_Strength, wavelet-LLL_ngtdm_Strength, squareroot_ngtdm_Complexity, and age.

Traditional sequential screening may risk information loss through progressive elimination. The parallel integration of three selection techniques enabled synergistic feature evaluation while circumventing information attrition typical of serialized or single-method implementations. The resultant 8-feature set thus embodies complementary strengths from multiple selection paradigms. Notably, age emerged as the most discriminative parameter, demonstrating the highest performance in both correlation analysis (absolute coefficient: 0.516) and univariate logistic regression (P < 0.001), surpassing all radiomics features. This finding aligned with its significance in subsequent ablation studies, confirming its biological and clinical relevance.

3.3 Cross-validation

Figure 4 displays the five-fold cross-validation accuracy metrics for five classifiers trained on the 8-feature subset. The white line inside each box indicates the multifold mean accuracy (numerical values are displayed in brackets above the x-axis), and the black line represents the median. Three models—KNN, SVM, and MLP—demonstrated superior predictive capabilities with mean accuracy thresholds exceeding 0.85, indicating robust consistency in their high performance. Random Forest and Gradient Boosting underperformed slightly in contrast, with mean accuracies of 0.821 and 0.846, respectively. The fold-specific prediction accuracy of each classifier during 5-fold cross-validation is presented in Supplementary Material S2, revealing inter-fold variability that reflects the differential adaptability of models.

Figure 4

Box plot comparing classifier accuracy, displaying KNN, SVM, Random Forest, Gradient Boosting, and MLP. KNN, SVM, and MLP show similar accuracy around 0.871. Random Forest has the lowest median, while Gradient Boosting is slightly higher at 0.846.

Figure 4. Comparison of multiple classifiers on the training cohort.

After determining the optimal hyperparameters for each classifier, we conducted a more comprehensive evaluation of the five classifiers and presented the results in Table 3. In terms of the AUC metric on the training set, SVM performed approximately 5% lower than MLP and KNN, leading to its exclusion. Although KNN had a 0.002 higher AUC than MLP, considering the size of the training set, we believe this slight difference is not significant enough to indicate that KNN has a decisive advantage over MLP, and further comparison is needed.

Table 3

Table 3. Metrics of five classifiers on the training and test cohorts.

Furthermore, according to the metrics on the test set, MLP demonstrated stronger generalization ability, showing more than a 5% advantage in accuracy, AUC, F1-score, and sensitivity. Considering multiple factors such as performance metrics, generalization ability, and runtime efficiency, MLP can meet the needs of real-world medical applications. Therefore, we chose MLP as the best classifier for the final test.

3.4 Performance of model

Two predictive models were developed using MLP classifier: the combination model (including age) with all eight features, and the radiomics model using seven radiomics features (excluding age). Both models were tested on the internal test cohort, with results presented in ROC curves (Figure 5A) and confusion matrices (Figure 5B). Two models were based on the same classifier and differed only in the inclusion of a single clinical feature (patient age); thus, these results also serve as an ablation experiment to target the impact of patient age.

Figure 5

Panel A shows a Receiver Operating Characteristic (ROC) curve comparing a combination model with an AUC of 0.94 and a radiomics model with an AUC of 0.82. Panel B presents two confusion matrices. The left matrix for the combination model shows true labels versus predicted labels: non-MSI-H with eight true positives and two false negatives, MSI-H with one false positive and eight true positives. The right matrix for the radiomics model displays non-MSI-H with seven true positives and three false negatives, MSI-H with one false positive and eight true positives.

Figure 5. Test results for the combination model (including age) and radiomics model. (A) and (B) respectively display the AUC curves and confusion matrices of two models on the internal test cohort.

The combination model performed better, with AUC 0.944 (95% CI 0.841–1.000) and accuracy 0.842, compared to AUC 0.822 and accuracy 0.789 for the radiomics model on the test cohort. Confusion matrix analysis showed the combined model made three errors with the higher specificity, while the radiomics model made four errors (misclassifying 3 Non-MSI-H cases). A more detailed performance comparison between the two prediction models is illustrated in Table 4: Apart from the sensitivity metric on the test cohort, the combination model outperforms the radiomics model in all other metrics: the combination model exhibits stronger discriminative power (an increase of 7.9% in AUC on the training cohort and 12.2% on the test cohort) and higher classification accuracy (an improvement of 5.0% in accuracy on the training cohort and 5.3% on the test cohort). These comparisons confirm that age significantly improves predictive reliability, aligning with the results of feature selection.

Table 4

Table 4. Performance comparison between the combination model (including age) and the radiomics model.

4 Discussion

We developed a non-invasive method to predict MSI status (MSI-H vs. Non-MSI-H) using preoperative non-contrast CT scan and patient age in colon cancer. By analyzing tumor imaging features alongside routinely available clinical variables, this approach eliminates biopsy-related risks (e.g., infection, bleeding) and reduces human error in traditional tissue analysis. Age became the key clinical factor in the final model. The final model combining radiomics features and patient age achieved 84.2% accuracy and matching F1-score, proving our method worked reliably.

Our predictive method exhibits competitive potential compared with similar non-invasive studies utilizing CECT and eliminates the need for tissue biopsy, demonstrating its non-invasive nature. Cao et al. developed a radiomics model based on delayed-phase CECT, which achieved AUC of 0.953 and accuracy of 0.852 in the validation cohort (Cao et al., 2021). Ma et al. also developed a non-invasive predictive model using venous phase CECT, achieving AUC values of 0.903 and 0.852 in the training and test cohorts, respectively (Ma et al., 2024). The proposed non-contrast CT-based model achieved a multifold mean accuracy of 0.871 in the training cohort, with test cohort metrics including AUC 0.944, accuracy 0.842, sensitivity 0.889, and specificity 0.800. These results indicate comparable performance with CECT-based approaches while confirming the technical feasibility of non-contrast CT in predictive applications.

In the field of radiomics for oncology, the use of non-contrast CT has produced promising results. Yuan et al. developed and validated a radiomics nomogram to distinguish between solitary micronodular adrenal hyperplasia and lipid-poor adenomas, creating both a non-contrast radiomics model and a triphasic contrast-enhanced radiomics model. In an external cohort, there was no significant difference in AUC between the two models (0.838 vs. 0.843, p = 0.949) (Yuan et al., 2023). Although the clinical gold standard for diagnosing gastrointestinal stromal tumors is CECT (Palatresi et al., 2022), Zhang et al. demonstrated that radiomics models based on both non-contrast CT and CECT yielded similar AUC values for diagnosing high-malignant-potential gastrointestinal stromal tumors (Zhang et al., 2020). Similar findings were also reported by Xie et al. (2023) . These studies suggest that combining non-contrast CT with radiomics is a broadly feasible approach and, in some cases, non-contrast CT does not significantly underperform compared to CECT. For colon cancer, clinical practice tends to rely on CECT as the standard for diagnosis. We fully respect the views and standards of clinicians and do not think that non-contrast CT can replace CECT in clinical settings. However, for radiomics analysis alone, non-contrast CT may serve as an alternative option for feature extraction.

Previous radiomics research about MSI status in colon cancer thus lacks comparative validation between CECT and non-contrast CT, failing to disprove the feasibility of non-contrast CT for MSI prediction in colon cancer. Golia et al. first explored CT-based radiomics for predicting MSI status of colon cancer in 2019, choosing venous-phase CT for analysis based on standard care for TNM staging (Golia Pernicka et al., 2019). In 2021, Cao et al. speculated that the use of contrast agent could improve tissue differentiation through increased iodine concentration and consequently enhance prediction performance, though non-contrast CT was not discussed (Cao et al., 2021). A recent study by Ma et al. indicated their choice of venous-phase imaging for feature extraction was influenced by prior literature and their preliminary trial (Ma et al., 2023). Most similar studies either omit explanations for choosing CECT in radiomics feature analysis or briefly mention it as a limitation. CECT exposes patients to higher radiation doses with increased carcinogenic risks (Linet et al., 2012), while contrast agents carry nephrotoxicity and allergic reaction risks (Lameire et al., 2013), making it unsuitable for multiple reexaminations. From perspectives of cost, time consumption, and safety, non-contrast CT proves more advantageous for rapid screening and holds greater potential for implementation in resource-limited regions. Although non-contrast CT did not provide incremental diagnostic value in clinical practice, its widespread availability facilitates the radiomics quantitative analysis of colon cancer.

On the other hand, previous studies utilizing CECT have provided us with significant insights. Multiple studies have consistently identified texture-related features as predominant predictors (Cao et al., 2021; Golia Pernicka et al., 2019; Bian et al., 2024; Li et al., 2021; Wang et al., 2023), which play a crucial role in colon cancer MSI prediction by effectively reflecting tumor heterogeneity (Cao et al., 2021; Meng et al., 2019). Therefore, this study selected medium-to small-volume tumors (voxel counts ranging from 10,000 to 110,000) from our samples for feature selection. Within this voxel count range, the mean voxel count of MSI-H tumors was 47,857.570, slightly higher than the 43,680.635 observed in non-MSI-H tumors, thereby reducing volumetric discrepancies. Our method intentionally biases feature selection toward texture- or structure-related characteristics. Univariate logistic regression identified 10 first-order features (10/30) and 10 wavelet-related features (10/30). Correlation analysis revealed 6 first-order features (6/20) and 5 wavelet-related features (5/20). LASSO selected 5 first-order features (5/28) and 15 wavelet-related features (15/28). Our predictive model incorporated 8 features, with wavelet filters contributing 3 (3/8) and the remaining 5 presenting strong texture associations. The prominence of first-order category and wavelet-derived features in non-contrast CT feature analysis aligns with findings from multiple CECT studies (Cao et al., 2021; Golia Pernicka et al., 2019; Bian et al., 2024; Li et al., 2021), reinforcing the critical importance of texture features in MSI classification and showing the efficacy of our feature selection method for non-contrast CT imaging in highlighting the texture features of tumors.

Leveraging the practicality of non-contrast CT, the informatics system incorporating this model could serve as a rapid screening tool to facilitate clinical workflows. The workflow consists of three key steps: 1) standard non-contrast CT acquisition, 2) manual tumor segmentation requiring approximately 20 min, and 3) feature extraction by PyRadiomics and model prediction completed within 30 s. The system will deliver MSI predictions in 30 min—far faster and more cost-effective than traditional testing. The automated segmentation module, currently under development, is engineered to achieve comprehensive automation within 5 min after CT scan completion, integrating both image acquisition and computational analysis phases to support repeat screenings. Given the widespread use of non-contrast CT in routine health examination, this predictive method could enable large-scale screening of MSI status in colon cancer.

4.1 Limitations

This study has limitations that warrant attention. First, the rarity of MSI-H status in colon cancer constrained the positive sample size, resulting in a small-sample, single-center, retrospective design—a common challenge in similar research (Wang et al., 2023) — but internal consistency was maintained. Furthermore, the current lack of large-scale external datasets precludes comprehensive external validation. We acknowledge this work represents a preliminary exploration; future studies will expand the dataset through multicenter collaborations or ongoing data collection to enhance the robustness of conclusions and model generalization. Second, the cohort primarily comprised long-term residents from a single geographic region. Although current evidence on geographic influences on colon cancer MSI status remains limited and its clinical significance unclear, this homogeneity constrains generalizability to more diverse populations. Third, similar to other studies in this field, the features we have extracted and analyzed are influenced by the subjectivity of radiologists during manual segmentation. In the future, we plan to explore deep learning-based automatic segmentation methods to minimize bias in the segmentation process.

5 Conclusion

This study validated the feasibility of non-contrast CT in MSI assessment by radiomics machine learning modeling for colon cancer. The prediction model requires only patient age and non-contrast CT imaging as inputs, offering a non-invasive and simplified workflow with low implementation costs. This demonstrates its potential clinical utility as an efficient adjunct screening tool.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Medical Ethics Committee of the Third Affiliated Hospital of Kunming Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participant’s; legal guardians/next of kin because This study is a retrospective analysis using anonymized data. The use of such data does not expose any identifiable information about the participants. According to relevant laws and ethical committee guidelines, the requirement for written informed consent can be waived in this context.

Author contributions

DR: Conceptualization, Formal Analysis, Investigation, Writing – original draft. YW: Data curation, Software, Writing – review and editing. LC: Conceptualization, Data curation, Investigation, Writing – review and editing. JH: Project administration, Validation, Writing – review and editing. TS: Methodology, Supervision, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study has received funding from the National Natural Science Foundation of China (No.82160347), the Yunnan Fundamental Research Projects (grant No.202301AY070001-251), and the Yunnan Province Young and Middle-Aged Academic and Technical Leaders Project (202305AC350007).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2025.1672636/full#supplementary-material

References

André T., Shiu K. K., Kim T. W., Jensen B. V., Jensen L. H., Punt C., et al. (2020). Pembrolizumab in microsatellite-instability–high advanced colorectal cancer. N. Engl. J. Med. 383 (23), 2207–2218. doi:10.1056/nejmoa2017699

PubMed Abstract | CrossRef Full Text | Google Scholar

Benson A. B., Venook A. P., Al-Hawary M. M., Arain M. A., Chen Y. J., Ciombor K. K., et al. (2021). Colon cancer, version 2.2021, NCCN clinical practice guidelines in oncology. J. Natl. Compr. Cancer Netw. 19 (3), 329–359. doi:10.6004/jnccn.2021.0012

PubMed Abstract | CrossRef Full Text | Google Scholar

Bian X., Sun Q., Wang M., Dong H., Dai X., Zhang L., et al. (2024). Preoperative prediction of microsatellite instability status in colorectal cancer based on a multiphasic enhanced CT radiomics nomogram model. BMC Med. Imaging 24 (1), 77. doi:10.1186/s12880-024-01252-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Bray F., Laversanne M., Sung H., Ferlay J., Siegel R. L., Soerjomataram I., et al. (2024). Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA a cancer J. Clin. 74 (3), 229–263. doi:10.3322/caac.21834

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao Y., Zhang G., Zhang J., Yang Y., Ren J., Yan X., et al. (2021). Predicting microsatellite instability status in colorectal cancer based on triphasic enhanced computed tomography radiomics signatures: a multicenter study. Front. Oncol. 11, 687771. doi:10.3389/fonc.2021.687771

PubMed Abstract | CrossRef Full Text | Google Scholar

Caruana R., Niculescu-Mizil A. (2006). An empirical comparison of supervised learning algorithms. InProceedings 23rd Int. Conf. Mach. Learn. 25, 161–168. doi:10.1145/1143844.1143865

CrossRef Full Text | Google Scholar

Fernández-Delgado M., Cernadas E., Barro S., Amorim D., Fernández-Delgado A. (2014). Do we need hundreds of classifiers to solve real world classification problems? J. Mach. Learn. Res. 15 (1), 3133–3181. Available online at: https://www.jmlr.org/papers/volume15/delgado14a/delgado14a.pdf?source=post_page.

Google Scholar

Golia Pernicka J. S., Gagniere J., Chakraborty J., Yamashita R., Nardo L., Creasy J. M., et al. (2019). Radiomics-based prediction of microsatellite instability in colorectal cancer at initial computed tomography evaluation. Abdom. Radiol. 44, 3755–3763. doi:10.1007/s00261-019-02117-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Guyon I., Elisseeff A. (2003). An introduction to variable and feature selection. J. Mach. Learn. Res. 3 (Mar), 1157–1182. Available online at: https://www.jmlr.org/papers/v3/guyon03a.html.

Google Scholar

Harada S., Morlote D. (2020). Molecular pathology of colorectal cancer. Adv. anatomic pathology 27 (1), 20–26. doi:10.1097/PAP.0000000000000247

PubMed Abstract | CrossRef Full Text | Google Scholar

Jensen L. J., Rogasch J. M. M., Kim D., Rießelmann J., Furth C., Amthauer H., et al. (2022). CT radiomics to predict Deauville score 4 positive and negative Hodgkin lymphoma manifestations. Sci. Rep. 12 (1), 20008. doi:10.1038/s41598-022-24227-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Lambin P., Leijenaar R. T., Deist T. M., Peerlings J., De Jong E. E., Van Timmeren J., et al. (2017). Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14 (12), 749–762. doi:10.1038/nrclinonc.2017.141

PubMed Abstract | CrossRef Full Text | Google Scholar

Lameire N., Kellum J. A.KDIGO AKI Guideline Work Group (2013). Contrast-induced acute kidney injury and renal support for acute kidney injury: a KDIGO summary (Part 2). Crit. Care 17, 205–3. doi:10.1186/cc11455

PubMed Abstract | CrossRef Full Text | Google Scholar

Le D. T., Uram J. N., Wang H., Bartlett B. R., Kemberling H., Eyring A. D., et al. (2015). PD-1 blockade in tumors with mismatch-repair deficiency. N. Engl. J. Med. 372 (26), 2509–2520. doi:10.1056/NEJMoa1500596

PubMed Abstract | CrossRef Full Text | Google Scholar

Li Z., Zhong Q., Zhang L., Wang M., Xiao W., Cui F., et al. (2021). Computed tomography-based radiomics model to preoperatively predict microsatellite instability status in colorectal cancer: a multicenter study. Front. Oncol. 11, 666786. doi:10.3389/fonc.2021.666786

PubMed Abstract | CrossRef Full Text | Google Scholar

Lindor N. M., Burgart L. J., Leontovich O., Goldberg R. M., Cunningham J. M., Sargent D. J., et al. (2002). Immunohistochemistry versus microsatellite instability testing in phenotyping colorectal tumors. J. Clin. Oncol. 20 (4), 1043–1048. doi:10.1200/JCO.2002.20.4.1043

PubMed Abstract | CrossRef Full Text | Google Scholar

Linet M. S., Slovis T. L., Miller D. L., Kleinerman R., Lee C., Rajaraman P., et al. (2012). Cancer risks associated with external radiation from diagnostic imaging procedures. CA A Cancer J. Clin. 62 (2), 75–100. doi:10.3322/caac.21132

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma Y., Xu X., Lin Y., Li J., Yuan H. (2023). An integrative clinical and CT-based tumoral/peritumoral radiomics nomogram to predict the microsatellite instability in rectal carcinoma. Abdom. Radiol. 49 (3), 783–790. doi:10.1007/s00261-023-04099-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma Y., Shi Z., Wei Y., Shi F., Qin G., Zhou Z. (2024). Exploring the value of multiple preprocessors and classifiers in constructing models for predicting microsatellite instability status in colorectal cancer. Sci. Rep. 14 (1), 20305. doi:10.1038/s41598-024-71420-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng X., Xia W., Xie P., Zhang R., Li W., Wang M., et al. (2019). Preoperative radiomic signature based on multiparametric magnetic resonance imaging for noninvasive evaluation of biological characteristics in rectal cancer. Eur. Radiol. 29, 3200–3209. doi:10.1007/s00330-018-5763-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Palatresi D., Fedeli F., Danti G., Pasqualini E., Castiglione F., Messerini L., et al. (2022). Correlation of CT radiomic features for GISTs with pathological classification and molecular subtypes: preliminary and monocentric experience. La Radiol. Medica. 127 (2), 117–128. doi:10.1007/s11547-021-01446-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Sepulveda A. R., Hamilton S. R., Allegra C. J., Grody W., Cushman-Vokoun A. M., Funkhouser W. K., et al. (2017). Molecular biomarkers for the evaluation of colorectal cancer: guideline from the American society for clinical pathology, college of American pathologists, association for molecular pathology, and American society of clinical oncology. Am. J. Clin. pathology 147 (3), 221–260. doi:10.1093/ajcp/aqw209

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinicrope F. A., Sargent D. J. (2012). Molecular pathways: microsatellite instability in colorectal cancer: prognostic, predictive, and therapeutic implications. Clin. cancer Res. 18 (6), 1506–1512. doi:10.1158/1078-0432.CCR-11-1469

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith G., Carey F. A., Beattie J., Wilkie M. J., Lightfoot T. J., Coxhead J., et al. (2002). Mutations in APC, Kirsten-ras, and p53—alternative genetic pathways to colorectal cancer. Proc. Natl. Acad. Sci. 99 (14), 9433–9438. doi:10.1073/pnas.122612899

PubMed Abstract | CrossRef Full Text | Google Scholar

Söreide K., Janssen E. A., Söiland H., Körner H., Baak J. P. (2006). Microsatellite instability in colorectal cancer. J. Br. Surg. 93 (4), 395–406. doi:10.1002/bjs.5328

PubMed Abstract | CrossRef Full Text | Google Scholar

Umar A., Boland C. R., Terdiman J. P., Syngal S., Chapelle A. D., Rüschoff J., et al. (2004). Revised Bethesda Guidelines for hereditary nonpolyposis colorectal cancer (Lynch syndrome) and microsatellite instability. J. Natl. Cancer Inst. 96 (4), 261–268. doi:10.1093/jnci/djh034

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Griethuysen J. J., Fedorov A., Parmar C., Hosny A., Aucoin N., Narayan V., et al. (2017). Computational radiomics system to decode the radiographic phenotype. Cancer Res. 77 (21), e104–e107. doi:10.1158/0008-5472.CAN-17-0339

PubMed Abstract | CrossRef Full Text | Google Scholar

Vilar E., Gruber S. B. (2010). Microsatellite instability in colorectal cancer—the stable evidence. Nat. Rev. Clin. Oncol. 7 (3), 153–162. doi:10.1038/nrclinonc.2009.237

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang Q., Xu J., Wang A., Chen Y., Wang T., Chen D., et al. (2023). Systematic review of machine learning-based radiomics approach for predicting microsatellite instability status in colorectal cancer. La Radiol. medica 128 (2), 136–148. doi:10.1007/s11547-023-01593-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie Z., Suo S., Zhang W., Zhang Q., Dai Y., Song Y., et al. (2023). Prediction of high Ki-67 proliferation index of gastrointestinal stromal tumors based on CT at non-contrast-enhanced and different contrast-enhanced phases. Eur. Radiol. 34 (4), 2223–2232. doi:10.1007/s00330-023-10249-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan Z., Frazer M., Zhang G. G., Latifi K., Moros E. G., Feygelman V., et al. (2020). CT-based radiomic features to predict pathological response in rectal cancer: a retrospective cohort study. J. Med. Imaging Radiat. Oncol. 64 (3), 444–449. doi:10.1111/1754-9485.13044

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan H., Kang B., Sun K., Qin S., Ji C., Wang X. (2023). CT-based radiomics nomogram for differentiation of adrenal hyperplasia from lipid-poor adenoma: an exploratory study. BMC Med. Imaging 23 (1), 4. doi:10.1186/s12880-022-00951-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang X., Li J. (2013). Era of universal testing of microsatellite instability in colorectal cancer. World J. Gastrointest. Oncol. 5 (2), 12–19. doi:10.4251/wjgo.v5.i2.12

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang Q. W., Zhou X. X., Zhang R. Y., Chen S. L., Liu Q., Wang J., et al. (2020). Comparison of malignancy-prediction efficiency between contrast and non-contract CT-based radiomics features in gastrointestinal stromal tumors: a multicenter study. Clin. Transl. Med. 10 (3), e291. doi:10.1002/ctm2.91

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao P., Li L., Jiang X., Li Q. (2019). Mismatch repair deficiency/microsatellite instability-high as a predictor for anti-PD-1/PD-L1 immunotherapy efficacy. J. Hematol. & Oncol. 12, 54–4. doi:10.1186/s13045-019-0738-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: colon cancer, microsatellite instability, non-contrast CT, radiomics, machine learning

Citation: Ren D, Wang Y, Chen L, He J and Shen T (2025) Machine learning-based radiomics approach assessing preoperative non-contrast CT for microsatellite instability prediction in colon cancer. Front. Physiol. 16:1672636. doi: 10.3389/fphys.2025.1672636

Received: 24 July 2025; Accepted: 15 September 2025;
Published: 29 September 2025.

Edited by:

Feng Gao, The Sixth Affiliated Hospital of Sun Yat-sen University, China

Reviewed by:

Yang Yang, Yunnan Normal University, China
Du Cai, The Sixth Affiliated Hospital of Sun Yat-sen University, China

Copyright © 2025 Ren, Wang, Chen, He and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jianfeng He, amZlbmdoZUBrdXN0LmVkdS5jbg==; Tao Shen, c2hlbnRhb0BrbW11LmVkdS5jbg==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.