Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Thoracic Oncology

This article is part of the Research TopicRadiomics and Artificial Intelligence in Oncology ImagingView all 23 articles

Predicting Ki-67 Expression Levels in Non-Small Cell Lung Cancer Using an Explainable CT-Based Deep Learning Radiomics Model

Provisionally accepted
Shize  QinShize Qin1Qing  JiaQing Jia2Chunmei  ZhangChunmei Zhang1Man  LiMan Li3Xiufu  ZhangXiufu Zhang1Xue  ZhouXue Zhou1Dan  SuDan Su1Yongying  LiuYongying Liu1Jun  ZhouJun Zhou1*
  • 1The Jiang Jin Central Hospital of Chongqing, Chongqing, China
  • 2Chongqing General Hospital, Chongqing, China
  • 3Shanghai United Imaging Intelligence Co Ltd, Shanghai, China

The final, formatted version of the article will be published soon.

Objective To predict Ki-67 expression levels in non-small cell lung cancer (NSCLC) using an interpretable model combining clinical-radiological, radiomic, and deep learning features. Methods This retrospective study included 259 NSCLC patients from Center 1 (training/validation sets) and 112 from Center 2 (independent test set). Patients were grouped by a 40% Ki-67 cutoff. Radiomic features and deep learning features were extracted from CT images, where the deep learning features were obtained via a deep residual network (ResNet18). The least absolute shrinkage and selection operator (LASSO) was used to select optimal features and compute radiomics (rad-score) and deep learning (deep-score) scores. Univariate and multivariate logistic regression were used to identify independent clinical-radiological predictors of Ki-67. Four support vector machine models were developed: a clinical-radiological model (based on independent clinical-radiological features), a radiomic model (using the rad-score), a deep learning model (using the deep-score), and a combined model (integrating all the above features). SHapley Additive exPlanations (SHAP) analysis was used to visualize feature contributions. Models' performance was assessed using receiver operating characteristic (ROC) curves and the integrated discrimination improvement (IDI) index. Results High Ki-67 expression occurred in 76 (42.0%), 38 (48.7%), and 33 (29.5%) patients in the training, validation, and independent test sets, respectively. In the independent test set, the combined model achieved the highest predictive performance, with an AUC of 0.892 (95% CI: 0.828–0.956). This improvement over the clinical-radiological (0.820, 95% CI: 0.721–0.918), radiomics (0.750, 95% CI: 0.655–0.844), and deep learning (0.817, 95% CI: 0.732–0.902) models was statistically significant (all p<0.05), as supported by IDI values of 0.115, 0.288, and 0.095, respectively. SHAP analysis identified the deep-score, histological type, and rad-score as key predictors. Conclusion The interpretable combined model can predict Ki-67 expression in NSCLC patients. This approach may provide imaging evidence to assist clinicians in optimizing personalized therapeutic strategies.

Keywords: Radiomics, deep learning, Interpretability, Non-small cell lung cancer, Ki-67 expression levels

Received: 28 Jun 2025; Accepted: 26 Nov 2025.

Copyright: © 2025 Qin, Jia, Zhang, Li, Zhang, Zhou, Su, Liu and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jun Zhou

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.