ORIGINAL RESEARCH article
Front. Oncol.
Sec. Cancer Imaging and Image-directed Interventions
CT-Based Explainable Machine Learning for Predicting Benign and Malignant Thyroid Nodules: A Multi-center Study
Provisionally accepted- 1Nanbu County People's Hospital, 南充市, China
- 2Radiology Department, People's Hospital of Nanbu County, Nanchong City, Sichuan Province, China
- 3Yunnan Cancer Hospital,The Third Affiliated Hospital of Kunming Medical University,Peking University Cancer Hospital Yunnan Campus,China, Kunming, China
- 4Radiology Department, Dali Bai Autonomous Prefecture People's Hospital, Dali City, Yunnan Province,China, Dali, China
- 5The First Affiliated Hospital of Dali University, Dali, China
- 6lood Transfusion Department, People's Hospital of Nanbu County, Nanchong City, Sichuan Province, Nanchong, China
- 7Huiying Medical Technology Co., Ltd, Beijing, China
- 8Medical Imaging Centre,Kunming First People's Hospital Medical Imaging Centre, Kunming, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objective: To construct a CT-based explainable machine learning model for pre operative prediction of thyroid nodule benignity or malignancy, aiming to provi de a more accurate tool for clinical decision-making and management. Materials and Methods: A retrospective study included 370 patients with thyroi d nodules confirmed by pathology from three centres, divided into a training s et (n=229) and an internal validation set (n=100) in a 7:3 ratio, with patients f rom the third centre serving as an external validation set (n=41). Radiomics fe atures were extracted from preoperative CT images, and the optimal features w ere selected to construct a radiomics score (Rad_Score). Clinical risk factors w ere identified using univariate and multivariate logistic regression. LR and SV M algorithms were used to establish three models: a clinical model, an imagin g model, and a combined model (integrating clinical factors and Rad_Score). T he combined model was visualized using SHAP(SHapley Additive exPlanations) analysis. Model performance was evaluated using receiver operating characteristi c (ROC) curves, calibration curves, and decision curve analysis (DCA). Results: Seventeen features were ultimately selected for Rad_Score calculation. The combined model demonstrated the best performance, with the LR combine d model achieving AUC values of 0.962, 0.913, and 0.914 in the training set, internal validation set, and external validation set, respectively, all higher than t he LR clinical model and LR radiomics model; and the LR combined model o utperforms the SVM combined model (0.953, 0.885, 0.842). SHAP analysis rev ealed the relative importance of the key feature (Rad_score) in model predictio n, enhancing model transparency. Conclusion: The combined model performs better under the LR algorithm. Com bined with SHAP explainabel analysis, it provides a non-invasive, efficient, and transparent tool for preoperative differentiation of benign and malignant thyroi d nodules, potentially optimising individualized clinical management.
Keywords: thyroid nodules, Benignity or malignancy, CT, machine learning, Shapley additive explanations
Received: 28 Aug 2025; Accepted: 17 Nov 2025.
Copyright: © 2025 何, Luo, Hu, Ke, Yang, Zi, Jiang, Yong, Chen, Chen, He, Gao, Liang, Jing and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Yang Jing, 605413559@qq.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
