Predicting Ki-67 Expression Levels in Non-Small Cell Lung Cancer Using an Explainable CT-Based Deep Learning Radiomics Model

Qin, Shize; Jia, Qing; Zhang, Chunmei; Li, Man; Zhang, Xiufu; Zhou, Xue; Su, Dan; Liu, Yongying; Zhou, Jun

doi:10.3389/fonc.2025.1655714

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Thoracic Oncology

This article is part of the Research TopicRadiomics and Artificial Intelligence in Oncology ImagingView all 23 articles

Predicting Ki-67 Expression Levels in Non-Small Cell Lung Cancer Using an Explainable CT-Based Deep Learning Radiomics Model

Provisionally accepted

Shize Qin¹

Qing Jia²

Chunmei Zhang¹ Man Li

Man Li³

Xiufu Zhang¹

Xue Zhou¹ Dan Su

Dan Su¹

Yongying Liu¹

Jun Zhou^1*

¹The Jiang Jin Central Hospital of Chongqing, Chongqing, China
²Chongqing General Hospital, Chongqing, China
³Shanghai United Imaging Intelligence Co Ltd, Shanghai, China

The final, formatted version of the article will be published soon.

Objective To predict Ki-67 expression levels in non-small cell lung cancer (NSCLC) using an interpretable model combining clinical-radiological, radiomic, and deep learning features. Methods This retrospective study included 259 NSCLC patients from Center 1 (training/validation sets) and 112 from Center 2 (independent test set). Patients were grouped by a 40% Ki-67 cutoff. Radiomic features and deep learning features were extracted from CT images, where the deep learning features were obtained via a deep residual network (ResNet18). The least absolute shrinkage and selection operator (LASSO) was used to select optimal features and compute radiomics (rad-score) and deep learning (deep-score) scores. Univariate and multivariate logistic regression were used to identify independent clinical-radiological predictors of Ki-67. Four support vector machine models were developed: a clinical-radiological model (based on independent clinical-radiological features), a radiomic model (using the rad-score), a deep learning model (using the deep-score), and a combined model (integrating all the above features). SHapley Additive exPlanations (SHAP) analysis was used to visualize feature contributions. Models' performance was assessed using receiver operating characteristic (ROC) curves and the integrated discrimination improvement (IDI) index. Results High Ki-67 expression occurred in 76 (42.0%), 38 (48.7%), and 33 (29.5%) patients in the training, validation, and independent test sets, respectively. In the independent test set, the combined model achieved the highest predictive performance, with an AUC of 0.892 (95% CI: 0.828–0.956). This improvement over the clinical-radiological (0.820, 95% CI: 0.721–0.918), radiomics (0.750, 95% CI: 0.655–0.844), and deep learning (0.817, 95% CI: 0.732–0.902) models was statistically significant (all p<0.05), as supported by IDI values of 0.115, 0.288, and 0.095, respectively. SHAP analysis identified the deep-score, histological type, and rad-score as key predictors. Conclusion The interpretable combined model can predict Ki-67 expression in NSCLC patients. This approach may provide imaging evidence to assist clinicians in optimizing personalized therapeutic strategies.

Keywords: Radiomics, deep learning, Interpretability, Non-small cell lung cancer, Ki-67 expression levels

Received: 28 Jun 2025; Accepted: 26 Nov 2025.

Copyright: © 2025 Qin, Jia, Zhang, Li, Zhang, Zhou, Su, Liu and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jun Zhou

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.