A predictive nomogram of thyroid nodules based on deep learning ultrasound image analysis

Li, Yuan; Li, Ting; He, Kai; Cui, Xiao-xiao; Zhang, Lu-lu; Wei, Xiu-liang; Liu, Zhi; Wu, Mei

doi:10.3389/fendo.2025.1504412

ORIGINAL RESEARCH article

Front. Endocrinol., 29 April 2025

Sec. Thyroid Endocrinology

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1504412

A predictive nomogram of thyroid nodules based on deep learning ultrasound image analysis

Yuan Li¹

Ting Li¹

Kai He²

Xiao-xiao Cui²

Lu-lu Zhang³

Xiu-liang Wei¹

Zhi Liu^2*

Mei Wu^1*

¹Department of Ultrasound, the Second Hospital, Cheeloo College of Medicine, Shandong University, Jinan, Shandong, China
²School of Information Science and Engineering, Shandong University, Qingdao, China
³Department of Pathology, the Second Hospital, Cheeloo College of Medicine, Shandong University, Jinan, Shandong, China

Objectives: The ultrasound characteristics of benign and malignant thyroid nodules were compared to develop a deep learning model, aiming to establish a nomogram model based on deep learning ultrasound image analysis to improve the predictive performance of thyroid nodules.

Materials and methods: This retrospective study analyzed the clinical and ultrasound characteristics of 2247 thyroid nodules from March 2016 to October 2023. Among them, 1573 nodules were used for training and testing the deep learning models, and 674 nodules were used for validation, and the deep learning predicted values were obtained. These 674 nodules were randomly divided into a training set and a validation set in a 7:3 ratio to construct a nomogram model.

Results: The accuracy of the deep learning model in 674 thyroid nodules was 0.886, with a precision of 0.900, a recall rate of 0.889, and an F1-score of 0.895. The binary logistic analysis of the training set revealed that age, echogenic foci, and deep learning predicted values were statistically significant (P<0.05). These three indicators were used to construct the nomogram model, showing higher accuracy compared to the China thyroid imaging reports and data systems (C-TIRADS) classification and deep learning models. Moreover, the nomogram model exhibited high calibration and clinical benefits.

Conclusion: Age, deep learning predicted values, and echogenic foci can be used as independent predictive factors to distinguish between benign and malignant thyroid nodules. The nomogram integrates deep learning and patient clinical ultrasound characteristics, yielding higher accuracy than the application of C-TIRADS or deep learning models alone.

1 Introduction

Thyroid nodules are common clinical findings and have a prevalence rate ranging from 34-66%, showing regional differences (1). In China, the prevalence of thyroid nodules is nearly 40%, with women exhibiting significantly higher rates than men (2). The incidence rate of thyroid cancer is about 7-15%. The most common pathological type is papillary thyroid cancer, accounting for 80-90% of all thyroid malignant tumors (3, 4). Since the 1980s, the incidence rate of thyroid cancer has gradually increased, but the mortality rate has remained relatively stable and the prognosis is usually good. Therefore, excessive diagnosis and treatment should be avoided (4, 5). At present, ultrasound remains the best imaging method for the thyroid gland and plays an essential role in the diagnosis of thyroid nodules (3). To standardize thyroid ultrasound results, various thyroid imaging reports and data systems (TI-RADS) have been proposed, such as the American Society of Radiology TI-RADS (ACR TI-RADS) (6), the European Thyroid Association TI-RADS (EU-TIRADS) (7), and the Korean Society of Thyroid Radiology TI-RADS (K-TIRADS) (8). In 2020, the Chinese Medical Association proposed the Chinese guidelines, also known as C-TIRADS, for ultrasound malignancy risk stratification of thyroid nodules, which were developed based on China’s national conditions and medical status (9). Traditionally, ultrasound examination has been shown to be highly subjective, depending on the operator’s ability. Different physicians may report different descriptions of the same ultrasound features (10, 11). However, fine needle aspiration for pathological examination of thyroid nodules is the gold standard for the diagnosis of thyroid nodules (12).

In recent years, due to the improvement in computing power and the availability of large-scale data, artificial intelligence, represented by deep learning has emerged as an essential tool in the field of medical imaging. Deep learning algorithms have promoted the development of precision medicine (13, 14). At present, deep learning has been widely applied in the ultrasound diagnosis of thyroid diseases, improving the classification, segmentation, and detection of thyroid images. These advances have facilitated the differential diagnosis of thyroid nodules (15, 16), the prediction of cervical lymph node metastasis (17, 18) or distant metastasis of thyroid cancer (19), and the analysis of the prognosis of thyroid cancer (20). Traditional convolutional neural networks (CNNs) represent an integral component of medical image analysis (21–25). However, due to the internal limitations of the algorithm, CNNs cannot model long-range dependencies. CNNs only focus on local pixels in the entire image, analyzing local features rather than learning global patterns (26). The Vision Transformer (ViT) model is a deep neural network based on an attention model proposed by Alexey Dosovitskiy. Its main feature is its ability to effectively store global structural information of images, which has been proven to be superior to the CNN models in the field of medical imaging (27, 28). This study uses ViT algorithm and aims to establish a nomogram based on deep learning ultrasound image analysis, integrating clinical data, ultrasound features, and deep learning results of thyroid nodules to assist in predicting the diagnosis of thyroid nodules.

2 Materials and methods

2.1 General information and grouping of patients

This retrospective study included 2247 cases of thyroid nodules (including 927 benign nodules and 1320 malignant nodules) treated at the Second Hospital of Shandong University from March 2016 to October 2023. All patients underwent thyroid ultrasound examinations before surgery or puncture. The digital ultrasound images were gathered from the ultrasound workstation.

Inclusion criteria (1): Preoperative or pre-puncture thyroid ultrasound examination; (2) Clear ultrasound image, with complete transverse and longitudinal images of the same nodule; (3) Complete clinical data; (4) Clear pathological diagnosis after surgery or puncture. Exclusion criteria: (1) Multiple (more than one) nodules on the same ultrasound section; (2) The required image section overlaps with measurement scales, Color Doppler Flow Imaging (CDFI) information, or elastography information, etc.; (3) Nodule puncture was performed before the ultrasound examination in our hospital; (4) Incomplete clinical data; (5) The pathological diagnosis is unclear.

All nodules were randomly divided in a 7:3 ratio, with 1573 nodules used for training and testing the deep learning model and 674 nodules used for validation. A external public database TN3K (29) was also used for testing; then the deep learning prediction results were obtained for the benign and malignant nodules. Thereafter, 674 nodules were divided into the benign group (308 cases) and the malignant group (366 cases) based on pathological results. About 70% of benign and malignant nodules were randomly selected as the training set to construct a nomogram chart, and 30% of benign and malignant nodules were assigned to the validation set to evaluate the nomogram chart. This study was approved by the Ethics Review Committee of the Second Hospital of Shandong University (KYLL2024752), and all patients provided signed informed consent. All procedures were conducted in compliance with the Helsinki Declaration. The flowchart of this study is shown in Figure 1A.

Figure 1

Figure 1. Flowchart. (A) Flowchart of the study. (B) Flowchart of ViT model.

2.2 Analysis of ultrasound images

The ultrasound diagnostic instruments included GE Logic E9 (linear probe, frequency 9-15MHz, Wauwatosa, America) and Mindray Resona 7S (linear probe, frequency 9-14MHz, Shenzhen, China). The patient was placed in the supine position with excessive neck extension, fully exposing the anterior cervical area, and a comprehensive scan of the thyroid gland was performed. Two physicians with over five years of experience in ultrasound diagnosis conducted a retrospective analysis of ultrasound images of thyroid nodules based on the C-TIRADS criteria (9), recording the size (maximum diameter of the nodule), location (upper lobe, middle lobe, lower lobe, and isthmus), orientation (vertical, horizontal), margin (clear, unclear), shape (regular, irregular), internal composition (solid, solid-cystic, cystic, spongiform), echogenicity (anechoic, hyperechoic, isoechoic, hypoechoic, markedly hypoechoic), echotexture (homogeneous, heterogeneous), echogenic foci (no echogenic foci or comet-tail artifacts, macrocalcifications and peripheral calcifications, microcalcifications and punctate echogenic foci), halo (absent halo, even thickness halo, uneven thickness halo), posterior feature (no posterior feature, enhancement, shadowing), and relationship with the capsule (distant, adjacent, or breakthrough). Moreover, the C-TIRADS classification of thyroid nodules was determined. Among them, nodules classified as class 3 or below by C-TIRADS were defined as benign; in contrast, nodules with a C-TIRADS classification of class 4a or above indicate ultrasound malignancy diagnosis. During this process, both physicians were blinded to the pathological results. Discrepancies in the ultrasound classification between the two physicians were settled by discussion until a consensus was reached to determine the final category of the nodule. While reviewing the images, the two doctors selected transverse and longitudinal images of each nodule to prepare for the training of the deep learning model.

2.3 Deep learning algorithms

In this study, we employed a transformer-based approach to classify thyroid nodules into two categories. Thyroid ultrasound images were first cropped to a uniform 224×224 pixel size and then processed using the ViT model (27), which divided each image into 16×16 pixel patches. These patches were flattened from the original (H×W×C) format into a sequence with shape N×(P²×C) (where N = HW/P²) and projected into D dimensions via a trainable linear layer, with a learnable embedding prepended to preserve spatial information. Thyroid ultrasound images were subsequently processed through Transformer blocks that leverage multi-head self-attention, layer normalization, and residual connections for deep feature extraction and contextual modeling. Finally, the combined outputs were passed through a two-layer MLP with GELU activation to generate global representations for effective binary classification.

To train the model, we used the Adam optimizer. This optimizer was configured with a momentum of 0.9 to ensure stable learning and a weight decay of 0.05 to help prevent overfitting. Additionally, we applied a dynamic learning rate strategy—starting at 0.0004 and gradually decreasing it following a cosine schedule—to facilitate a smoother training process. The training was conducted over 150 epochs with a batch size of 24 images per iteration.

All experiments were implemented using the PyTorch 2.1.1 and Timm 0.9.1 frameworks on an Ubuntu 18.04 system. The computational setup included an Intel Xeon Gold 6230 CPU running at 2.10 GHz and an NVIDIA GeForce RTX 3090 GPU with 24GB of memory, ensuring robust performance for our deep learning tasks. The flowchart of deep learning model is shown in Figure 1B.

2.4 Statistical methods

SPSS 21.0 statistical software was used for analysis. Categorical variables were presented as numbers and percentages and analyzed by the χ² test and Fisher’s exact test. Continuous variables were analyzed using a single sample K-S test to determine if the variables followed a normal distribution. Variables conforming to a normal distribution were analyzed using a t-test. In contrast, variables not conforming to a normal distribution were analyzed using the U-test. Multivariate analysis was conducted using logistic regression analysis. R (4.2.3) was used to construct and evaluate nomogram model. The receiver operating characteristic (ROC) curves of the model were plotted, and the area under the curve (AUC) and 95% confidence interval (CI) were calculated to evaluate the predictive ability of the models. Furthermore, the DeLong test was used to determine the statistical significance of differences in AUC between different models. A calibration curve was constructed to evaluate the calibration degree of the model and decision curve analysis (DCA) was performed to evaluate clinical benefits. The online interactive nomogram was constructed by Shinny. Violin plots were used to illustrate the age distribution differences between patients with benign and malignant nodules. P<0.05 indicated statistical significance.

3 Results

3.1 General features of deep learning model training and validation sets

Statistically significant differences (P<0.05) in age, size, and C-TIRADS classification were observed between benign and malignant nodules in both the deep learning model training set and the validation set. The general features of the deep learning model training and validation sets are shown in Table 1. The exact pathological types of all thyroid nodules are shown in Supplementary Table 1.

Table 1

Table 1. General features of deep learning model training and validation sets.

3.2 Prediction performance of the deep learning model

The accuracy of this model in the validation set of 674 nodules is 0.886, while the accuracy of TN3K is 0.825. This model showed high precision, recall rate, and F1-score in all nodules, benign nodules, and malignant nodules in the validation set respectively (Table 2). The confusion matrix of 674 nodules in the model is shown in Figure 2.

Table 2

Table 2. Prediction performance of deep learning models in validation sets.

Figure 2

Figure 2. The confusion matrix of deep learning models in the validation set. The horizontal axis represents the prediction results of the deep learning model, and the vertical axis represents the pathological results.

3.3 Clinical-ultrasound features of thyroid nodules identified from the nomogram training set and validation set

In the nomogram training set, 14 indicators, including age, size, orientation, location, internal composition, echogenicity, shape, margin, echotexture, posterior features, echogenic foci, halo, relationship with the capsule, and deep learning predicted values, showed statistically significant differences (P<0.05) between the benign and malignant groups, while gender showed no statistically significant difference. In the nomogram validation set, 13 indicators, including age, orientation, location, internal composition, echogenicity, shape, margin, echotexture, posterior features, echogenic foci, halo, relationship with the capsule, and deep learning predicted values showed statistically significant differences (P<0.05) between the benign and malignant groups. However, no statistically significant difference was observed in patient gender and size. The clinical-ultrasound characteristics analysis of thyroid nodules in the nomogram training set and validation set are shown in Table 3.

Table 3

Table 3. The clinical-ultrasound characteristics analysis of thyroid nodules in the nomogram training set and validation set.

3.4 Binary logistics regression analysis and establishment of the nomogram model

The significant indicators in the single factor analysis of the training set were included in the binary logistics regression analysis model, showing statistical significance (χ² = 499.321, P<0.001). The independent variables included in the model, age, deep learning predicted values, and echogenic foci (P<0.05). Specifically, the deep learning malignant predicted value, microcalcifications or punctate echogenic foci within nodules were indicative of malignant nodules. Moreover, every 1-year-old increase in age resulted in a 0.04 times reduction in the risk of malignant nodules. The age distribution of benign and malignant nodules, as depicted in the violin plots, followed a normal distribution with similar trends. Patients with malignant nodules were younger than those with benign nodules in both the training sets (Figure 3A) and validation sets (Figure 3B). The results of binary logistics regression analysis on the training set are shown in Table 4.

Figure 3

Figure 3. Violin plot of the age distribution of benign and malignant nodules. (A) in the training set; (B) in the validation set. The white solid line and box in the figure represent the quartiles of age.

Table 4

Table 4. Binary logistics regression analysis results of training set.

These indicators were incorporated into the nomogram model, and the malignant probability of nodules was predicted. The nomogram model was shown in Figure 4. Furthermore, an online interactive nomogram was developed(https://saprediction.shinyapps.io/DynNomapp/). The score of individual thyroid nodules can be obtained through each indicator, with the total score of each indicator corresponding to the probability of the nodule being diagnosed as malignant (Figure 5).

Figure 4

Figure 4. Nomogram model.

Figure 5

Figure 5. Example of application of the nomogram model. A 56-year-old female with a solid nodule in the middle of the right lobe of the thyroid gland, measuring 1.7x1.5cm. The nodule was vertically oriented, with unclear margins and microcalcifications. The C-TIRADS classification was 4c, and the deep learning model predicted a benign nodule. The total score of the nomogram model was about 70 points, corresponding to a malignant prediction probability of 0.17, and the prediction result was benign. Pathological result: nodular goiter. C-TIRADS China thyroid imaging reports and data systems.

3.5 Evaluation of the nomogram model

The ROC curves of the training and validation sets revealed that the model has good accuracy. The AUC of the training set (Figure 6A): C-TIRADS 0.715 (95% CI: 0.680-0.749), deep learning 0.898 (95% CI: 0.871-0.925), nomogram model 0.951 (95% CI: 0.932-0.969). Validation set AUC (Figure 6B): C-TIRADS 0.667 (95% CI: 0.612-0.723), deep learning 0.869 (95% CI: 0.818-0.919), nomogram model 0.898 (95% CI: 0.850-0.945). The Delong test showed statistical differences (P<0.05) between the ROC curves. The calibration curves of the training set (Figure 6C) and validation set (Figure 6D) showed that the model has good calibration accuracy, and DCA indicated that the model has clinical benefits in both the training set (Figure 6E, threshold 0-0.98) and validation set (Figure 6F, threshold 0-0.93).

Figure 6

Figure 6. Evaluation of the nomogram model. (A) ROC curve of training set; (B) ROC curve of validation set; (C) Nomogram model calibration curve of training set; (D) Nomogram model calibration curve of validation set; (E) Nomogram model DCA of training set; (F) Nomogram model DCA of validation set. ROC receiver operating characteristic, DCA decision curve analysis.

4 Discussion

In this study, a deep learning model was trained to comprehensively analyze the clinical and ultrasound characteristics of thyroid nodules. Based on the prediction results of the deep learning model for thyroid nodules, a nomogram model was developed and validated, which includes an online interactive nomogram, to predict the risk of malignancy of thyroid nodules. The nomogram showed good accuracy, calibration, and clinical value.

In this study, age was identified as a predictive factor for determining the benign or malignant nature of thyroid nodules. The violin plot of age distribution revealed that patients with malignant nodules were younger compared to those with benign nodules. However, the role of age in the differentiation of benign and malignant thyroid nodules remains controversial. In some previous studies, age was identified as an independent predictor of malignancy (30, 31), whereas other studies reported that age does not have statistical significance in the predictive models. This discrepancy may be attributed to the different attitudes of young and elderly patients toward the treatment of C-TIRADS 4a-5 nodules. Young patients may prefer surgical treatment, while elderly patients may prefer conservative treatment, resulting in missing pathological results (32). Therefore, the differential value of age in distinguishing benign and malignant thyroid nodules varies in different studies.

Ultrasound examination, as a non-invasive imaging modality, remains the most widely used initial examination method for thyroid assessment (3, 33). In our study, malignant and benign thyroid nodules showed statistical differences in nodule size, location, orientation, internal composition, echogenicity, shape, margin, echotexture, posterior feature, echogenic foci, halo, and capsule relationship, which is consistent with C-TIRADS (9). Among them, microcalcifications or punctate echogenic foci were found to be independent risk factors for malignant nodules. However, some benign nodules in this study also exhibited vertical growth, markedly hypoechoic features, irregular shape, unclear margin, microcalcifications, and posterior shadow, which may lead to higher C-TIRADS classification in ultrasound diagnosis, resulting in higher sensitivity and lower specificity of the C-TIRADS classification. Moreover, ultrasound examination involves a certain degree of subjectivity and relies heavily on the diagnostic experience of the physician (33). Therefore, more objective tools are required to eliminate potential observer bias, and assist in the ultrasound diagnosis of thyroid nodules. Deep learning constructs models by automatically learning features from data layers, which can automatically extract deep features from images without the need for manual intervention, reducing the subjective influence of doctors (34).

Nomogram is a commonly used visualization tool in medical research, which integrates different variables to generate the probability of clinical events, with accuracy and intuitiveness (35, 36). Previous studies have shown that a nomogram model that integrates clinical, ultrasound, and deep learning models is superior to ultrasound features or deep learning alone in identifying the nature of thyroid nodules (31, 34, 37). Du et al. (34) analyzed ultrasound images of 1076 cases of thyroid nodules and constructed a nomogram model based on deep learning, which showed high diagnostic performance (AUC>0.9). Zhang et al. (31) collected ultrasound images of 500 thyroid nodules in a similar retrospective study and performed deep learning of thyroid ultrasound images with the YOLOv3 model, and constructed a nomogram model to improve prediction ability. The model achieved 84% accuracy in identifying TI-RADS category 4 thyroid nodules. Zhong et al. (37) constructed a clinical-ultrasound-radiomics nomogram to differentiate between benign and malignant indeterminate cytology thyroid nodules, with higher accuracy than a single clinical or radiomics model. Our study incorporated clinical information of thyroid nodule patients, ultrasound features of nodules, and deep learning prediction results into a nomogram model. The ROC curve of the model reached an AUC of 0.898 in the validation set. The predictive ability of the model was improved compared to C-TIRADS and the application of deep learning models alone, which was consistent with previous research results. But compared with these, our study included a larger total number of ultrasound images and a larger sample size of ultrasound images was used for deep model training. Moreover, the present study developed an online interactive nomogram, directly displaying the malignancy probability of nodules, eliminating the step of adding scores in traditional nomograms. It can be used as a more convenient tool in clinical practice.

The diagnosis of thyroid diseases is facilitated by the comprehensive analysis of clinical, morphological, molecular, and epigenetic features using artificial intelligence algorithms (12). Therefore, the combination of deep learning models and clinical ultrasound features in this study is of great significance. Future studies can improve the predictive model of this study by incorporating pathological indicators and optimizing the model. Nevertheless, the limitations of the present study should be acknowledged. Firstly, as a single-center retrospective study, this study has certain biases and lacks validation with large sample data from multiple centers. This requires further improvement of multi-center data in our future research work. Secondly, this study only included two-dimensional grayscale ultrasound information and lacked multimodal ultrasound images and dynamic images, which will be further optimized in our future research.

5 Conclusion

Age, deep learning predicted values, and echogenic foci can be used as independent predictive factors for the benign or malignant judgment of thyroid nodules. The deep learning models showed superior diagnostic accuracy compared to the C-TIRADS classification. The nomogram integrates deep learning and clinical-ultrasound characteristics, yielding a higher accuracy than C-TIRADS or deep learning models alone. The online interactive nomogram provides a more convenient tool for clinical practice.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Review Committee of the Second Hospital of Shandong University (KYLL2024752). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

YL: Data curation, Formal Analysis, Writing – original draft. TL: Data curation, Formal Analysis, Writing – original draft. KH: Software, Validation, Writing – original draft. XC: Software, Validation, Writing – original draft. LZ: Data curation, Writing – original draft. XW: Data curation, Writing – original draft. ZL: Supervision, Writing – review & editing. MW: Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We thank Home for Researchers editorial team (www.home-for-researchers.com) for language editing service.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1504412/full#supplementary-material

References

1. Uppal N, Collins R, James B. Thyroid nodules: global, economic, and personal burdens. Front Endocrinol. (2023) 14:1113977. doi: 10.3389/fendo.2023.1113977

PubMed Abstract | Crossref Full Text | Google Scholar

2. Li Y, Jin C, Li J, Tong M, Wang M, Huang J, et al. Prevalence of thyroid nodules in China: A health examination cohort-based study. Front Endocrinol. (2021) 12:676144. doi: 10.3389/fendo.2021.676144

PubMed Abstract | Crossref Full Text | Google Scholar

3. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the American thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid. (2016) 26:1–133. doi: 10.1089/thy.2015.0020

PubMed Abstract | Crossref Full Text | Google Scholar

4. Lam AK. Papillary thyroid carcinoma: current position in epidemiology, genomics, and classification. Methods Mol Biol (Clifton NJ). (2022) 2534:1–15. doi: 10.1007/978-1-0716-2505-7_1

PubMed Abstract | Crossref Full Text | Google Scholar

5. Arias-Ortiz N, Rodríguez-Betancourt JD. Trends in cancer incidence and mortality in Manizales, Colombia, 2008-2017. Colombia Med (Cali Colombia). (2022) 53:e2044920. doi: 10.25100/cm.v53i1.4920

PubMed Abstract | Crossref Full Text | Google Scholar

6. Tessler FN, Middleton WD, Grant EG, Hoang JK, Berland LL, Teefey SA, et al. Acr thyroid imaging, reporting and data system (Ti-Rads): white paper of the Acr Ti-Rads committee. J Am Coll Radiology: JACR. (2017) 14:587–95. doi: 10.1016/j.jacr.2017.01.046

PubMed Abstract | Crossref Full Text | Google Scholar

7. Russ G, Bonnema SJ, Erdogan MF, Durante C, Ngu R, Leenhardt L. European thyroid association guidelines for ultrasound Malignancy risk stratification of thyroid nodules in adults: the Eu-Tirads. Eur Thyroid J. (2017) 6:225–37. doi: 10.1159/000478927

PubMed Abstract | Crossref Full Text | Google Scholar

8. Shin JH, Baek JH, Chung J, Ha EJ, Kim JH, Lee YH, et al. Ultrasonography diagnosis and imaging-based management of thyroid nodules: revised Korean society of thyroid radiology consensus statement and recommendations. Korean J Radiol. (2016) 17:370–95. doi: 10.3348/kjr.2016.17.3.370

PubMed Abstract | Crossref Full Text | Google Scholar

9. Zhou J, Yin L, Wei X, Zhang S, Song Y, Luo B, et al. 2020 Chinese guidelines for ultrasound Malignancy risk stratification of thyroid nodules: the C-Tirads. Endocrine. (2020) 70:256–79. doi: 10.1007/s12020-020-02441-y

PubMed Abstract | Crossref Full Text | Google Scholar

10. Grani G, Lamartina L, Cantisani V, Maranghi M, Lucia P, Durante C. Interobserver agreement of various thyroid imaging reporting and data systems. Endocrine connections. (2018) 7:1–7. doi: 10.1530/ec-17-0336

PubMed Abstract | Crossref Full Text | Google Scholar

11. Sych YP, Fadeev VV, Fisenko EP, Kalashnikova M. Reproducibility and interobserver agreement of different thyroid imaging and reporting data systems (Tirads). Eur Thyroid J. (2021) 10:161–7. doi: 10.1159/000508959

PubMed Abstract | Crossref Full Text | Google Scholar

12. Lebrun L, Salmon I. Pathology and new insights in thyroid neoplasms in the 2022 who classification. Curr Opin Oncol. (2024) 36:13–21. doi: 10.1097/cco.0000000000001012

PubMed Abstract | Crossref Full Text | Google Scholar

13. Egger J, Gsaxner C, Pepe A, Pomykala KL, Jonske F, Kurz M, et al. Medical deep learning-a systematic meta-review. Comput Methods programs biomedicine. (2022) 221:106874. doi: 10.1016/j.cmpb.2022.106874

PubMed Abstract | Crossref Full Text | Google Scholar

14. Yang WT, Ma BY, Chen Y. A narrative review of deep learning in thyroid imaging: current progress and future prospects. Quantitative Imaging Med Surg. (2024) 14:2069–88. doi: 10.21037/qims-23-908

PubMed Abstract | Crossref Full Text | Google Scholar

15. Wu GG, Lv WZ, Yin R, Xu JW, Yan YJ, Chen RX, et al. Deep learning based on Acr Ti-Rads can improve the differential diagnosis of thyroid nodules. Front Oncol. (2021) 11:575166. doi: 10.3389/fonc.2021.575166

PubMed Abstract | Crossref Full Text | Google Scholar

16. Luo P, Fang Z, Zhang P, Yang Y, Zhang H, Su L, et al. Radiomics score combined with Acr Ti-Rads in discriminating benign and Malignant thyroid nodules based on ultrasound images: A retrospective study. Diagnostics (Basel Switzerland). (2021) 11(6):1011. doi: 10.3390/diagnostics11061011

PubMed Abstract | Crossref Full Text | Google Scholar

17. Guang Y, Wan F, He W, Zhang W, Gan C, Dong P, et al. A model for predicting lymph node metastasis of thyroid carcinoma: A multimodality convolutional neural network study. Quantitative Imaging Med Surg. (2023) 13:8370–82. doi: 10.21037/qims-23-318

PubMed Abstract | Crossref Full Text | Google Scholar

18. Wang Z, Qu L, Chen Q, Zhou Y, Duan H, Li B, et al. Deep learning-based multifeature integration robustly predicts central lymph node metastasis in papillary thyroid cancer. BMC Cancer. (2023) 23:128. doi: 10.1186/s12885-023-10598-8

PubMed Abstract | Crossref Full Text | Google Scholar

19. Liu WC, Li ZQ, Luo ZW, Liao WJ, Liu ZL, Liu JM. Machine learning for the prediction of bone metastasis in patients with newly diagnosed thyroid cancer. Cancer Med. (2021) 10:2802–11. doi: 10.1002/cam4.3776

PubMed Abstract | Crossref Full Text | Google Scholar

20. An Y, Lu J, Hu M, Cao Q. A prediction model for the 5-year, 10-year and 20-year mortality of medullary thyroid carcinoma patients based on lymph node ratio and other predictors. Front Surg. (2022) 9:1044971. doi: 10.3389/fsurg.2022.1044971

PubMed Abstract | Crossref Full Text | Google Scholar

21. Li X, Zhang S, Zhang Q, Wei X, Pan Y, Zhao J, et al. Diagnosis of thyroid cancer using deep convolutional neural network models applied to sonographic images: A retrospective, multicohort, diagnostic study. Lancet Oncol. (2019) 20:193–201. doi: 10.1016/s1470-2045(18)30762-9

PubMed Abstract | Crossref Full Text | Google Scholar

22. Nguyen DT, Kang JK, Pham TD, Batchuluun G, Park KR. Ultrasound image-based diagnosis of Malignant thyroid nodule using artificial intelligence. Sensors (Basel Switzerland). (2020) 20(7):1822. doi: 10.3390/s20071822

PubMed Abstract | Crossref Full Text | Google Scholar

23. Zhu PS, Zhang YR, Ren JY, Li QL, Chen M, Sang T, et al. Ultrasound-based deep learning using the Vggnet model for the differentiation of benign and Malignant thyroid nodules: A meta-analysis. Front Oncol. (2022) 12:944859. doi: 10.3389/fonc.2022.944859

PubMed Abstract | Crossref Full Text | Google Scholar

24. Ajilisa OA, Jagathy Raj VP, Sabu MK. A deep learning framework for the characterization of thyroid nodules from ultrasound images using improved inception network and multi-level transfer learning. Diagnostics (Basel Switzerland). (2023) 13(14):2463. doi: 10.3390/diagnostics13142463

PubMed Abstract | Crossref Full Text | Google Scholar

25. Qi Q, Huang X, Zhang Y, Cai S, Liu Z, Qiu T, et al. Ultrasound image-based deep learning to assist in diagnosing gross extrathyroidal extension thyroid cancer: A retrospective multicenter study. EClinicalMedicine. (2023) 58:101905. doi: 10.1016/j.eclinm.2023.101905

PubMed Abstract | Crossref Full Text | Google Scholar

26. Li F, Zhou L, Wang Y, Chen C, Yang S, Shan F, et al. Modeling long-range dependencies for weakly supervised disease classification and localization on chest X-ray. Quantitative Imaging Med Surg. (2022) 12:3364–78. doi: 10.21037/qims-21-1117

PubMed Abstract | Crossref Full Text | Google Scholar

27. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Houlsby N. An image is worth 16x16 words: transformers for image recognition at scale. arXiv (2020). doi: 10.48550/arXiv.2010.11929

Crossref Full Text | Google Scholar

28. Chen W, Ayoub M, Liao M, Shi R, Zhang M, Su F, et al. A fusion of vgg-16 and vit models for improving bone tumor classification in computed tomography. J Bone Oncol. (2023) 43:100508. doi: 10.1016/j.jbo.2023.100508

PubMed Abstract | Crossref Full Text | Google Scholar

29. Gong H, Chen J, Chen G, Li H, Li G, Chen F. Thyroid region prior guided attention for ultrasound segmentation of thyroid nodules. Comput Biol Med. (2023) 155:106389. doi: 10.1016/j.compbiomed.2022.106389

PubMed Abstract | Crossref Full Text | Google Scholar

30. Wu X, Li J, Mou Y, Yao Y, Cui J, Mao N, et al. Radiomics nomogram for identifying sub-1 cm benign and Malignant thyroid lesions. Front Oncol. (2021) 11:580886. doi: 10.3389/fonc.2021.580886

PubMed Abstract | Crossref Full Text | Google Scholar

31. Zhang X, Jia C, Sun M, Ma Z. The application value of deep learning-based nomograms in benign-malignant discrimination of Ti-Rads category 4 thyroid nodules. Sci Rep. (2024) 14:7878. doi: 10.1038/s41598-024-58668-6

PubMed Abstract | Crossref Full Text | Google Scholar

32. Li M, Wei L, Li F, Kan Y, Liang X, Zhang H, et al. High risk thyroid nodule discrimination and management by modified Ti-Rads. Cancer Manage Res. (2021) 13:225–34. doi: 10.2147/cmar.S284370

PubMed Abstract | Crossref Full Text | Google Scholar

33. Chang L, Zhang Y, Zhu J, Hu L, Wang X, Zhang H, et al. An integrated nomogram combining deep learning, clinical characteristics and ultrasound features for predicting central lymph node metastasis in papillary thyroid cancer: A multicenter study. Front Endocrinol. (2023) 14:964074. doi: 10.3389/fendo.2023.964074

PubMed Abstract | Crossref Full Text | Google Scholar

34. Du H, Chen F, Li H, Wang K, Zhang J, Meng J, et al. Deep-learning radiomics based on ultrasound can objectively evaluate thyroid nodules and assist in improving the diagnostic level of ultrasound physicians. Quantitative Imaging Med Surg. (2024) 14:5932–45. doi: 10.21037/qims-23-1597

PubMed Abstract | Crossref Full Text | Google Scholar

35. Balachandran VP, Gonen M, Smith JJ, DeMatteo RP. Nomograms in oncology: more than meets the eye. Lancet Oncol. (2015) 16:e173–80. doi: 10.1016/s1470-2045(14)71116-7

PubMed Abstract | Crossref Full Text | Google Scholar

36. Liang J, Huang X, Hu H, Liu Y, Zhou Q, Cao Q, et al. Predicting Malignancy in thyroid nodules: radiomics score versus 2017 American college of radiology thyroid imaging, reporting and data system. Thyroid. (2018) 28:1024–33. doi: 10.1089/thy.2017.0525

PubMed Abstract | Crossref Full Text | Google Scholar

37. Zhong L, Shi L, Lai J, Hu Y, Gu L. Combined model integrating clinical, radiomics, braf(V600e) and ultrasound for differentiating between benign and Malignant indeterminate cytology (Bethesda iii) thyroid nodules: A bi-center retrospective study. Gland Surg. (2024) 13:1954–64. doi: 10.21037/gs-24-310

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: thyroid nodules, ultrasound, C-TIRADS, deep learning, nomogram

Citation: Li Y, Li T, He K, Cui X-x, Zhang L-l, Wei X-l, Liu Z and Wu M (2025) A predictive nomogram of thyroid nodules based on deep learning ultrasound image analysis. Front. Endocrinol. 16:1504412. doi: 10.3389/fendo.2025.1504412

Received: 30 September 2024; Accepted: 28 March 2025;
Published: 29 April 2025.

Edited by:

Serena Monti, National Research Council (CNR), Italy

Reviewed by:

Ricardo V. Garcia-Mayor, Instituto de Investigación Sanitaria Galicia Sur (IISGS), Spain
Cihan Atar, Osmaniye State Hospital, Türkiye

Copyright © 2025 Li, Li, He, Cui, Zhang, Wei, Liu and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mei Wu, YV9tYXkwMjEyQDE2My5jb20=; Zhi Liu, bGl1emhpQHNkdS5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.