Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Cancer Imaging and Image-directed Interventions

CT-Based Explainable Machine Learning for Predicting Benign and Malignant Thyroid Nodules: A Multi-center Study

Provisionally accepted
海军  何海军 何1,2Mingquan  LuoMingquan Luo1,2Kai  HuKai Hu1,2Tengfei  KeTengfei Ke3Juntao  YangJuntao Yang4Xinyue  ZiXinyue Zi5Tingting  JiangTingting Jiang6Liangping  YongLiangping Yong2Tong  ChenTong Chen2Jun  ChenJun Chen2Zhengliang  HeZhengliang He2Qiangrong  GaoQiangrong Gao2Zhoubin  LiangZhoubin Liang2Yang  JingYang Jing7*Bin  YangBin Yang8
  • 1Nanbu County People's Hospital, 南充市, China
  • 2Radiology Department, People's Hospital of Nanbu County, Nanchong City, Sichuan Province, China
  • 3Yunnan Cancer Hospital,The Third Affiliated Hospital of Kunming Medical University,Peking University Cancer Hospital Yunnan Campus,China, Kunming, China
  • 4Radiology Department, Dali Bai Autonomous Prefecture People's Hospital, Dali City, Yunnan Province,China, Dali, China
  • 5The First Affiliated Hospital of Dali University, Dali, China
  • 6lood Transfusion Department, People's Hospital of Nanbu County, Nanchong City, Sichuan Province, Nanchong, China
  • 7Huiying Medical Technology Co., Ltd, Beijing, China
  • 8Medical Imaging Centre,Kunming First People's Hospital Medical Imaging Centre, Kunming, China

The final, formatted version of the article will be published soon.

Objective: To construct a CT-based explainable machine learning model for pre operative prediction of thyroid nodule benignity or malignancy, aiming to provi de a more accurate tool for clinical decision-making and management. Materials and Methods: A retrospective study included 370 patients with thyroi d nodules confirmed by pathology from three centres, divided into a training s et (n=229) and an internal validation set (n=100) in a 7:3 ratio, with patients f rom the third centre serving as an external validation set (n=41). Radiomics fe atures were extracted from preoperative CT images, and the optimal features w ere selected to construct a radiomics score (Rad_Score). Clinical risk factors w ere identified using univariate and multivariate logistic regression. LR and SV M algorithms were used to establish three models: a clinical model, an imagin g model, and a combined model (integrating clinical factors and Rad_Score). T he combined model was visualized using SHAP(SHapley Additive exPlanations) analysis. Model performance was evaluated using receiver operating characteristi c (ROC) curves, calibration curves, and decision curve analysis (DCA). Results: Seventeen features were ultimately selected for Rad_Score calculation. The combined model demonstrated the best performance, with the LR combine d model achieving AUC values of 0.962, 0.913, and 0.914 in the training set, internal validation set, and external validation set, respectively, all higher than t he LR clinical model and LR radiomics model; and the LR combined model o utperforms the SVM combined model (0.953, 0.885, 0.842). SHAP analysis rev ealed the relative importance of the key feature (Rad_score) in model predictio n, enhancing model transparency. Conclusion: The combined model performs better under the LR algorithm. Com bined with SHAP explainabel analysis, it provides a non-invasive, efficient, and transparent tool for preoperative differentiation of benign and malignant thyroi d nodules, potentially optimising individualized clinical management.

Keywords: thyroid nodules, Benignity or malignancy, CT, machine learning, Shapley additive explanations

Received: 28 Aug 2025; Accepted: 17 Nov 2025.

Copyright: © 2025 何, Luo, Hu, Ke, Yang, Zi, Jiang, Yong, Chen, Chen, He, Gao, Liang, Jing and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yang Jing, 605413559@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.