- 1Department of Ultrasound, Tangshan People’s Hospital, Tangshan, China
- 2Department of Pathology, Tangshan People’s Hospital, Tangshan, China
Objective: To compare the diagnostic performance of shear wave elastography (SWE), Chinese Thyroid Imaging Reporting and Data System (C-TIRADS), and an artificial intelligence (AI)-assisted diagnostic system in differentiating thyroid nodules of different sizes.
Methods: A total of 103 thyroid nodules in 90 patients were prospectively analyzed and divided into two groups based on the maximum diameter: <10 mm and ≥10 mm. Each thyroid nodule was evaluated using three methods: conventional ultrasound for C-TIRADS scoring, shear wave elastography (SWE), and AI-assisted diagnosis. The diagnostic performance of individual methods and their combinations was assessed within each nodule size group using sensitivity, specificity, and area under the receiver operating characteristic (ROC) curve. For combined assessments, a nodule was considered positive if any constituent method indicated malignancy. Intergroup comparisons of AUC values were performed using DeLong’s test to evaluate the effect of nodule size on diagnostic performance.
Results: In nodules ≥1 cm, AI demonstrated excellent performance (AUC = 0.875, sensitivity = 96.43%, specificity = 77.78%), and C-TIRADS also performed well (AUC = 0.834, sensitivity = 96.55%, specificity = 70.37%). Among SWE parameters, Emax achieved the highest AUC (0.895). The diagnostic efficacy of AI combined with C-TIRADS (AUC = 0.852) was comparable to that of AI + C-TIRADS + Emax. In subcentimeter nodules, diagnostic performance decreased, with AI achieving an AUC of 0.654 and C-TIRADS an AUC of 0.524, whereas Emean retained moderate discriminative ability (AUC = 0.821).
Conclusion: AI combined with C-TIRADS provides an efficient and practical strategy for diagnosing thyroid nodules ≥1 cm. For subcentimeter nodules, Emean retains discriminative ability, indicating potential clinical value in assessment of small lesions.
Introduction
Thyroid nodules are a common clinical finding, with their global prevalence steadily increasing over the past decades (1–3). According to the Global Cancer Statistics (GLOBOCAN) 2022, thyroid cancer(TC) ascended to the third most prevalent cancer type, with an increase of 245,025 new cases compared to 2020 (1). China accounted for the largest proportion of TC cases worldwide (56.77%) and also reported the highest number of TC-related deaths, representing 24.35% of the global total (4). In the context of this growing epidemiologic burden, accurate and standardized evaluation of thyroid nodules has become a crucial component of clinical management.
Ultrasound is the first-line imaging modality for evaluating thyroid nodules due to its accessibility, non-invasiveness, and diagnostic utility (5–7). In clinical practice, several ultrasound-based risk stratification systems, such as the Chinese Thyroid Imaging Reporting and Data System (C-TIRADS), have been developed to predict malignancy risk and guide the decision for fine-needle aspiration cytology (FNAC) (8–11). These systems rely on a combination of sonographic features, including echogenicity, margin characteristics, shape, and presence of calcifications. However, ultrasound interpretation is inherently operator-dependent, and the diagnostic performance may vary significantly with the physician’s level of experience (12).
FNAC remains the gold standard for thyroid nodule diagnosis, but its accuracy is affected by lesion size and sampling technique (7, 13). Notably, a considerable number of nodules yield indeterminate results (e.g., Bethesda III or TIR3), posing diagnostic challenges and potentially resulting in unnecessary surgeries (14). These limitations have highlighted the need for adjunct diagnostic tools to improve risk stratification and reduce unnecessary interventions.
In recent years, advanced technologies such as shear wave elastography (SWE) and artificial intelligence (AI)-assisted diagnostic systems have emerged as promising tools in thyroid nodule evaluation (15–17). SWE provides a quantitative assessment of tissue stiffness, aiding in the identification of malignancies that typically exhibit increased elasticity (15). AI-assisted systems, based on deep learning algorithms, offer standardized image analysis and have demonstrated high diagnostic consistency, especially among junior physicians (18). Notably, the 2023 edition of the Chinese Guidelines for the Diagnosis and Treatment of Thyroid Nodules and Differentiated Thyroid Cancer (Second Edition) formally incorporated SWE and AI-assisted diagnostic tools into the diagnostic workflow, acknowledging their growing clinical relevance (19).
However, current evidence regarding the impact of nodule size on the diagnostic efficacy of these emerging technologies is heterogeneous and lacks consensus (20, 21). For example, the guidelines of the European Federation of Societies for Ultrasound in Medicine and Biology (EFSUMB) state that SWE is applicable to nodules of all sizes, provided appropriate adjustment of the ROI (22). In contrast, the 2021 guidelines of the World Federation for Ultrasound in Medicine and Biology (WFUMB) suggest that SWE performs more reliably in nodules larger than 10 mm, while its diagnostic value may be compromised in subcentimeter nodules (23). In addition, evidence remains insufficient regarding how SWE, TI-RADS, and AI-assisted systems, particularly across nodules of different sizes. Most previous studies evaluated these modalities independently, and size-specific diagnostic differences have not been systematically characterized (20, 21). These gaps highlight the need to clarify the comparative performance of these tools in subcentimeter and ≥10-mm nodules.
Therefore, the aim of this study was to systematically compare the diagnostic performance of SWE, C-TIRADS, and AI-assisted diagnostic system in assessing thyroid nodules across different size categories. The findings of this study may provide evidence to support the optimal application of these tools in clinical practice and contribute to more personalized and precise management strategies for thyroid nodules.
Materials and methods
Study sample
From July 2024 to March 2025,90 patients from the ultrasound department of a tertiary oncology hospital were prospectively collected and analyzed. This study was approved by the hospital’s ethics committee (approval No. RMYY-LLKS-2025309), and informed consent was obtained from all patients. The inclusion criteria were as follows: (1) Age 16–73 years, with no sex preference, (2) All cases presented with thyroid nodules, (3) Definite pathological diagnosis obtained from surgery or fine-needle aspiration (FNA), (4) Underwent conventional ultrasound, SWE, and AI-assisted examinations with clear and complete imaging data, (5) No history of thyroid surgery, biopsy, or thermal ablation therapy. The exclusion criteria were as follows: (1) Lack of pathological confirmation (no surgery or FNA performed, or indeterminate cytology without surgical follow-up), (2) History of other malignant tumors, (3) Poor general condition or inability to cooperate with breath-holding during examination, (4) Incomplete imaging records or clinical data.
Equipment, instruments, and AI-assisted diagnostic system
A color Doppler ultrasound system (EPIQ 7, Philips, Netherlands) equipped with a linear-array probe (eL18-4) and shear-wave elastography (SWE) software was used for image acquisition. The AI-assisted diagnostic system evaluated in this study, the ITS100 Ultrasound Imaging Intelligent System (Version 1.2; Sichuan Maide Intelligent Technology Co., Ltd., China), consists of a main processing unit and an AI-assisted display module. The system, which was developed using a multicenter dataset of more than 100,000 thyroid ultrasound images from Chinese patients with pathologically confirmed benign and malignant nodules.
Conventional ultrasound examination
The patient was positioned in the supine position. After identifying the lesion location using two-dimensional grayscale ultrasound, the tumor’s orientation, margin, composition, echogenicity, presence of microcalcifications, anatomical location, position, lymph node metastasis, and maximum diameter were recorded. Additionally, color doppler flow imaging (CDFI) was employed to assess the blood flow grading of the tumor. CDFI utilized the Adler semi-quantitative method, with the following classification (24):Level 0: No blood flow signals detected in the lesion; Level I: One or two “star-like” blood flow signals with a diameter less than 1 mm detected within the tumor; Level II: Three or four spot-like blood flow signals, or a major vessel traversing the tumor; Level III: Four or more blood vessels detected within the lesion.
Image assessments were independently performed by two ultrasound physicians, each with over five years of clinical experience. The interpretation of results was based on the C-TIRADS criteria (10). Each lesion was assigned a score to aid in differentiating benign from malignant lesions (Table 1). In cases of discordant opinions, a senior physician with 20 years of experience was consulted, and the final diagnosis was established through consensus, based on the majority opinion. All physicians involved were blinded to the final diagnoses of the thyroid nodules.
Real-time SWE technology acquisition and analysis
Real-time SWE was conducted on the EPIQ7 ultrasound system via its built-in elastography module. After obtaining the optimal imaging plane of the lesion using gray-scale ultrasound, the mode was switched to ElastQ, and SWE was selected for assessment. The probe was positioned perpendicularly and gently placed on the skin surface, ensuring adequate coupling gel. The sampling box was adjusted to encompass the entire lesion as much as possible, including the target region and surrounding tissues while avoiding the skin and carotid artery. To minimize the impact of respiratory motion on the image, patients were instructed to hold their breath. The best image with optimal color filling was selected for analysis.
The ROI Q-box was placed over the hardest area of the lesion (for heterogeneous lesions, the Q-box was placed within the lesion, avoiding cystic or calcified regions). Another ROI Q-box was placed on the normal thyroid tissue at a depth similar to that of the lesion. The maximum (Emax), mean (Emean), and median (Emed) values of Young’s modulus (in kPa) were recorded, where E represents the elasticity of the tissue (25).Each lesion was measured at least three times by a physician trained in SWE to ensure measurement consistency, and the average value was used.
The optimal cutoff values for SWE parameters (Emax, Emean, and Emed) were determined based on ROC curve analysis of the study dataset. Thresholds were selected using the Youden index and then applied to calculate diagnostic performance metrics, including sensitivity, specificity, PPV, NPV, and accuracy.
Dynamic AI-assisted diagnostic system
In the event of a detected nodule, the AI leverages deep learning technology to automatically identify key image features. Using the distinctive features of ultrasound images, it crafts a CNN architecture for diagnostic purposes. With the help of CNN’s convolution kernel, the system samples pixels from the input image, extracting global features specific to the thyroid nodule area. This process builds a high-throughput, multi-level characteristic space. Once the diagnostic pattern analyzes and calculates the features of the input nodule’s image, it provides two probability values. These values represent the probability that a thyroid nodule is benign or malignant, as assessed by the diagnostic algorithm. The system outputs a categorical prediction-”malignant” (indicated by a red “M”) or “benign” (indicated by a green “B”)-based on the identified imaging pattern. Nodules classified as “uncertain” (C-TIRADS 4A) by AI are represented with an alternating red and green color pattern. In this study, the single “uncertain” nodule was excluded from the calculation of AI diagnostic performance to avoid verification bias and overestimation of specificity (Figure 1).
Figure 1. Workflow of the AI-assisted diagnostic system for thyroid nodules. Stage 1: detection of thyroid nodules from ultrasound images using a CNN-based model to automatically localize the nodule area. Stage 2: classification of nodule characteristics through a neural network and classifier to differentiate malignant (M) from benign (B) lesions.
Statistical analysis
All statistical analyses were conducted using SPSS software, version 26.0 (IBM Corp., Armonk, NY, USA), with α = 0.05 as the significance level for inter-group comparisons. Continuous data were expressed as mean ± standard deviation ( ̄± s). Categorical and ordinal data were presented as frequencies and percentages (%). For normally distributed continuous data, comparisons between two groups were conducted using independent sample t-tests, whereas the Mann–Whitney U test was used for non-normally distributed data. For categorical data, inter-group comparisons were performed using the chi-square (χ2) test, while the rank-sum test was applied for ordinal variables. Receiver operating characteristic (ROC) curve analysis was performed to assess the diagnostic value of each indicator, determine optimal cut-off points, and evaluate overall diagnostic performance. For ROC analyses involving C-TIRADS, categories were dichotomized as follows: 1–3 = benign, 4A–4C = malignant. This dichotomization was consistently applied for the calculation of sensitivity, specificity, and AUC.
Results
Comparison of pathological and clinical data between different nodule size groups
A total of 103 thyroid nodules from 90 patients aged between 16 and 73 years were included in the analysis. All nodules were confirmed by postoperative pathological examination. Based on nodule size, patients were divided into two groups: <10 mm (n = 47) and ≥10 mm (n = 56). The diameters of nodules ranged from 2.9–9.8 mm in the <10 mm group (median = 6.7 mm) and 10.2–46.9 mm in the ≥10 mm group (median = 20.8 mm), with a significant difference between the two groups (Z = –8.714, P < 0.001, Mann–Whitney U test) (Table 2). There were no statistically significant differences between the two groups in terms of age (45.87 ± 10.24 vs. 47.86 ± 12.49 years, P = 0.386) or sex distribution (male: 23.41% vs. 21.43%, female: 76.59% vs. 78.57%, P = 0.810) (Table 2).
Among the 47 nodules with a maximum diameter of <10 mm, 41 (87.23%) were diagnosed as papillary thyroid carcinoma, 3 (6.38%) as nodular goiters, 2 (4.25%) as thyroid follicular nodular lesions with focal fibrosis, and 1 (2.12%) as subacute thyroiditis. In contrast, among the 56 nodules with a maximum diameter of ≥10 mm, 28 (50.00%) were papillary thyroid carcinoma, 25 (44.64%) were nodular goiters, 2 (3.57%) were thyroid follicular nodular lesions with focal fibrosis, and 1 (1.78%) was medullary thyroid carcinoma (Table 3). In total, 70 nodules were malignant, including 69 papillary thyroid carcinomas and 1 medullary thyroid carcinoma, while the remaining 33 nodules were benign (Table 3).
Comparison of US parameters between different nodule size groups
The conventional ultrasonographic features of thyroid nodules were compared between the <10 mm (n = 47) and ≥10 mm (n = 56) groups (Table 4). Significant differences were observed in several sonographic characteristics between the two groups.
Nodules <10 mm were more frequently characterized by a solid composition (97.87% vs. 62.50%, P < 0.001), ill-defined margins (87.23% vs. 50.00%, P < 0.001), a vertical orientation (89.36% vs. 46.42%, P<0.001), and hypoechoic echogenicity (97.87% vs. 62.50%, P < 0.001). Additionally, microcalcifications were more commonly observed in the <10 mm group (68.08% vs. 46.42%, P = 0.021). CDFI patterns also differed significantly between the groups (P < 0.001). Grade 3 vascularity, indicating rich internal blood flow, was significantly more prevalent in nodules ≥10 mm compared to those <10 mm (46.42% vs. 10.63%), whereas Grade 0 (no detectable blood flow) was more frequent in nodules <10 mm (34.04% vs.12.50%).
In terms of nodule location within the thyroid, a significantly higher proportion of nodules in the ≥10 mm group were located in the middle layer (50.00% vs. 25.53%, P < 0.001), whereas nodules <10 mm were more often located in the upper layer (48.93% vs. 21.42%) (Table 4). No significant differences were found between the two groups regarding lymph node metastasis (P = 0.394), lobe position (P = 0.491).
Comparison of SWE parameters between different nodule size groups
As shown in Table 5, all SWE parameters, including Emax, Emean, and Emed, were significantly higher in the ≥10 mm group than in the <10 mm group. Specifically, the Emax of nodules ≥10 mm was 74.50 ± 44.02 kPa, significantly higher than that of nodules <10 mm (56.44 ± 35.70 kPa, P = 0.026). Similarly, Emean and Emed values in the ≥10 mm group were 53.80 ± 29.79 kPa and 53.62 ± 28.94 kPa, respectively, both significantly greater than the corresponding values in the <10 mm group (41.92 ± 23.15 kPa and 41.53 ± 22.75 kPa; P = 0.028 and P = 0.022, respectively) (Figures 2, 3).
Figure 2. A 56-year-old patient with a thyroid nodule who underwent thyroidectomy. (A) Gray-scale ultrasound image shows a purely solid nodule measuring less than 10 mm in maximum diameter, with an irregular shape, non-parallel orientation (aspect ratio > 1), and an unclear margin, without microcalcifications. (B) Shear-wave elastography (SWE) image of the same nodule. A 2-mm circular region of interest (ROI 1) was positioned within the stiffest solid portion of the lesion, avoiding calcified or cystic components. A second ROI (ROI 2) was placed in the adjacent normal thyroid parenchyma as a reference. The measured stiffness values for ROI 1 were Emax = 29.4 kPa, Emean = 28.0 kPa, and Emed = 28.2kPa. (C) AI-assisted diagnostic system output, indicating the nodule as malignant.
Figure 3. A 45-year-old patient with a thyroid nodule who underwent a thyroidectomy. (A) Gray-scale ultrasound image shows a purely solid nodule measuring more than 10 mm in maximum diameter, with an irregular shape, non-parallel orientation (aspect ratio > 1), and an unclear margin, without microcalcifications. (B) Shear-wave elastography (SWE) image of the same nodule. A 2-mm circular region of interest (ROI 1) was positioned within the stiffest solid portion of the lesion, avoiding calcified or cystic components. A second ROI (ROI 2) was placed in the adjacent normal thyroid parenchyma as a reference. The measured stiffness values for ROI 1 were Emax = 125 kPa, Emean = 91.1kPa, and Emed = 89.4kPa. (C) AI-assisted diagnostic system output, indicating the nodule as malignant.
Comparison of diagnosis between AI assisted diagnostic system and C-TIRADS between different nodule size groups
Table 6 shows the diagnostic distribution of thyroid nodules using the AI system and C-TIRADS classification in the <10 mm and ≥10 mm groups. The AI-assisted system classified 93.61% (44/47) of <10 mm nodules as malignant, significantly higher than the 57.14% (32/56) in the ≥10 mm group (P < 0.001). In contrast, the proportion of nodules classified as benign by AI was notably higher in the ≥10 mm group (41.07%) compared to the <10 mm group (6.38%). Only one nodule in the ≥10 mm group was labeled as “uncertain”.
For C-TIRADS, no nodules were assigned to category 1 or 6 in either group. Most <10 mm nodules were rated as category 4A (36.17%),4B (38.29%) and 5(14.89%), with few in category 3 (4.25%) or 4C (6.38%). In the ≥10 mm group, however, nodules were more evenly distributed across categories 3 (33.92%), 4A (16.07%), 4B (23.21%) and 5(21.42%), with few in category 4C (3.57%). The difference in the C-TIRADS distribution between the two groups was statistically significant (P < 0.001), with a tendency toward lower C-TIRADS categories observed in larger nodules.
ROC analysis
Subgroup analyses based on maximum nodule diameter (<10 mm vs ≥10 mm) demonstrated notable differences in diagnostic performance across methods. In the overall cohort, AI showed the highest discriminatory ability (AUC = 0.834), followed by C-TIRADS (AUC = 0.766) and Emax (AUC = 0.747) (Table 7A). In nodules <10 mm, most methods exhibited reduced performance, with AI (AUC = 0.654) and C-TIRADS (AUC = 0.524) showing limited diagnostic value. Notably, SWE, particularly Emean, maintained relatively strong discrimination (AUC = 0.821), outperforming Emax (0.659) and Emed (0.652) (Table 7B).In nodules ≥10 mm, all methods demonstrated substantial improvement, with AI (AUC = 0.875), C-TIRADS (AUC = 0.834), and Emax (AUC = 0.895) achieving high diagnostic accuracy, the latter representing the best-performing SWE parameter (Table 7C).
To further evaluate diagnostic optimization, parallel testing was conducted to compare the two combination models (AI plus C-TIRADS and AI plus C-TIRADS plus Emax). Across the overall cohort and both size-based subgroups, the model integrating only AI and C-TIRADS achieved the same AUC as the triple combination (AI plus C-TIRADS plus Emax). DeLong’s test confirmed that no significant difference existed between the two approaches (P = 1.000) (Tables 7A–C).
Discussion
The clinical management of thyroid nodules, particularly in the context of varying lesion sizes, remains challenging due to the heterogeneity in diagnostic performance across different imaging modalities (19–22). In this study, we systematically compared the diagnostic efficacy of SWE, C-TIRADS, and AI-assisted diagnostic systems in distinguishing benign from malignant thyroid nodules, stratified by lesion size. Our results demonstrate that the sensitivity, specificity, and the AUC differed significantly both across diagnostic methods and between nodules <10 mm and ≥10 mm. highlighting the importance of adopting size-specific diagnostic strategies in clinical practice.
It should be noted, however, that the high proportion of malignant nodules—particularly among subcentimeter lesions—reflects the tertiary oncology setting of our cohort and may have inflated certain performance metrics such as the positive predictive value. Therefore, our findings should be interpreted as indicative of the relative discriminatory capacity of the evaluated methods, rather than their absolute predictive performance in a general or screening population.
In our study, subcentimeter thyroid nodules had a significantly higher rate of classic malignant ultrasound features, including irregular shape, ill-defined margins, vertical orientation, hypoechogenicity, and microcalcifications, compared to nodules ≥10 mm. These findings are highly consistent with the malignant features emphasized in the latest C-TIRADS guidelines (10) and may partially explain the higher malignancy rate observed in the <10 mm group (87.23% vs. 50.00%). Moreover, previous studies have reported an inverse relationship between thyroid nodule size and malignancy risk, with smaller nodules more frequently exhibiting malignant behavior—consistent with our findings (26). This epidemiological trend underscores the importance of carefully evaluating subcentimeter nodules, particularly those with suspicious imaging characteristics. These results further support the need for size-stratified diagnostic strategies in clinical settings.
Although conventional ultrasound provides essential morphological features of thyroid nodules, such as borders, echogenicity, calcification, and aspect ratio, the C-TIRADS system further classifies these features to assess malignancy risk, they are inherently limited in their ability to reflect the biomechanical properties of thyroid tissue (27). In this context, SWE, an advanced technique capable of quantifying tissue stiffness, offers complementary diagnostic information in the differentiation of benign and malignant thyroid nodules (28). By measuring the propagation speed of shear waves within the tissue to calculate the Young’s modulus, SWE indirectly reflects tissue elasticity, and has demonstrated promising diagnostic performance in the evaluation of various solid tumors (29).
After stratifying nodules by size, our study revealed that the diagnostic performance of SWE varied significantly across different size groups. Previous studies have reported that SWE demonstrates higher diagnostic efficiency in larger nodules and generally recommend Emax as the preferred parameter due to its greater sensitivity to focal stiffness changes (20). Consistent with these findings, our study showed that Emax achieved the highest diagnostic performance in nodules ≥10 mm (AUC = 0.895). Emean also exhibited comparable diagnostic efficacy in this subgroup (AUC = 0.885). In nodules <10 mm, SWE parameters declined overall (Emax 0.659, Emed 0.652), whereas Emean retained relatively strong discriminative performance (AUC = 0.821), highlighting its potential to reflect the overall stiffness pattern of nodules. However, because we did not evaluate measurement repeatability or variability, such as coefficients of variation, we could not further compare the stability or reproducibility of Emean and Emax. Therefore, we report only their diagnostic performance without inferring superiority. Future studies incorporating repeated measurements and larger samples are needed to clarify the reproducibility and clinical applicability of different SWE parameters.
The SWE parameters of the Young’s modulus also differed across nodules of different sizes, which is consistent with previous studies (20). These observed elastic changes may be related to tumor volume–associated pathological alterations such as tissue remodeling, fibrosis, increased cellular density, collagen deposition, local stromal reactions, and reduced vascular compliance (30, 31). These biological changes are particularly common in papillary thyroid carcinoma, which constituted the majority of malignant nodules in our cohort. In contrast, smaller nodules (<10 mm) tended to show lower stiffness parameters. This size-dependent difference may be attributed to a combination of biological and technical factors. Biologically, smaller malignant nodules, especially early-stage papillary carcinomas, may not yet exhibit significant fibrosis or stromal remodeling, resulting in lower tissue stiffness and thus reduced sensitivity of SWE (32). Technically, shear wave propagation may be less stable in small or superficial nodules, reducing the accuracy of stiffness measurements (20).Moreover, subcentimeter nodules are more susceptible to artifacts caused by adjacent anatomical structures or patient motion, which may further compromise image quality (33, 34).
SWE is a non-invasive technique with high operability, providing biomechanical insights into tissue stiffness. However, its diagnostic performance remains inconsistent across studies (35). The 2023 Chinese Guidelines for the Diagnosis and Treatment of Thyroid Nodules and Differentiated Thyroid Cancer (Second Edition) have incorporated elastography, but only weakly recommend it as an adjunct to conventional ultrasound, based on moderate-quality evidence (19). Given these limitations, the application of AI-assisted diagnostic systems in thyroid nodule assessment has been gaining increasing attention.
AI-assisted diagnostic systems have emerged as a promising tool in the evaluation of thyroid nodules, offering high diagnostic accuracy and consistency by leveraging deep learning algorithms trained on large ultrasound datasets (21, 36). In our study, AI demonstrated superior diagnostic performance in nodules ≥10 mm compared to those <10 mm, with AUC values of 0.875 and 0.654, respectively. This suggests that AI performs with greater accuracy in recognizing larger nodules. However, in the ≥10 mm subgroup, Emax achieved a slightly higher AUC than the AI-assisted system. This observation may be explained by the biomechanical properties of larger lesions and the sensitivity of SWE to localized stiffness variations. Moreover, the AI model was applied without additional local retraining, which may have influenced its relative performance. These real-world findings highlight the value of presenting objective results and indicate that further optimization and population-specific adaptation of AI algorithms may still be needed.
To further assess the clinical value of multimodal combinations, this study conducted a parallel analysis of the three parameters with the highest individual AUC performance: AI, C-TIRADS, and Emax. The results indicated that the diagnostic performance of AI + C-TIRADS + Emax was identical to that of AI + C-TIRADS in terms of diagnostic efficacy. This finding suggests a potential diagnostic saturation, where adding additional parameters (such as Emax) does not further enhance diagnostic efficacy. Accordingly, in clinical practice, combining AI with C-TIRADS alone may achieve comparable diagnostic results while avoiding additional resource consumption and equipment dependence. This approach aligns more closely with current healthcare resource allocation and cost-effectiveness considerations, facilitating broader application, especially in resource-limited settings. Notably, the combination of AI and C-TIRADS significantly improved sensitivity both overall and in nodules ≥10 mm, with a slight decrease in specificity; nevertheless, their complementary diagnostic dimensions still suggest a potential synergistic value.
Although the diagnostic performance of AI + C-TIRADS and AI + C-TIRADS + Emax was comparable in the parallel analysis, we further explored why Emax is the preferred parameter in clinical practice. Among the three SWE parameters assessed in this study, Emax demonstrated the best overall AUC performance. Compared to Emean and Emed, Emax more accurately reflects the maximal stiffness of the lesion, and its “peak” feature allows for the establishment of clear diagnostic thresholds, facilitating model integration and standardization. From both a biomechanical and clinical operational standpoint, Emax shows stronger potential for integration. Therefore, even though it did not further improve the performance of the combined model in this study, Emax remains one of the most valuable SWE parameters for future multimodal diagnostic strategies.
Limitation
This study has several limitations. First, it was conducted at a single center using one ultrasound system, and lacks multicenter or multi-device validation. Second, all participants were recruited from a tertiary oncology hospital, where the malignancy rate was substantially higher than in general populations. This case-mix may introduce selection bias and raise the pretest probability of malignancy, potentially leading to overestimation of prevalence-dependent metrics such as PPV and AUC. Third, the AI-assisted diagnostic system assessed in this study is specific to a single platform, and its performance may vary across different devices or software. Further multicenter studies involving diverse patient populations and equipment are needed to strengthen the generalizability of our findings.
Conclusion
In conclusion, nodule size significantly affects the diagnostic performance of AI, C-TIRADS, and SWE. In nodules ≥10 mm, both AI and C-TIRADS showed strong performance, with Emax being the most effective SWE parameter. The combination of AI and C-TIRADS yielded comparable efficacy to AI + C-TIRADS + Emax, suggesting limited added value of SWE in larger nodules. For subcentimeter nodules, diagnostic performance declined across all methods, including combined approaches; however, Emean retained some discriminative ability, indicating its potential to reflect overall stiffness patterns in small nodules. These results highlight the importance of selecting appropriate imaging parameters and implementing size-stratified diagnostic strategies.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Institutional Review Board of Tangshan People’s Hospital (Protocol No. RMYY-LLKS-2025309). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
YZ: Supervision, Formal Analysis, Conceptualization, Methodology, Software, Data curation, Writing – original draft, Resources, Visualization, Writing – review & editing, Project administration, Validation, Funding acquisition, Investigation. RL: Investigation, Resources, Writing – review & editing, Supervision, Data curation. WZ: Writing – review & editing, Investigation, Software, Methodology, Resources, Data curation. ZL: Writing – review & editing, Investigation, Data curation, Software, Resources. JY: Investigation, Data curation, Supervision, Writing – review & editing, Resources. LW: Supervision, Investigation, Data curation, Writing – review & editing, Resources.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the Medical Science Research Project of Hebei under Grant No. 20251223.
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Cao W, Qin K, Li F, and Chen W. Comparative study of cancer profiles between 2020 and 2022 using global cancer statistics (GLOBOCAN). J Natl Cancer Cent. (2024) 4:128–34. doi: 10.1016/j.jncc.2024.05.001
2. Vaccarella S and Dal Maso L. Challenges in investigating risk factors for thyroid cancer. Lancet Diabetes Endocrinol. (2021) 9:57–9. doi: 10.1016/S2213-8587(20)30426-5
3. Pizzato M, Li M, Vignat J, Laversanne M, Singh D, La Vecchia C, et al. The epidemiological landscape of thyroid cancer worldwide: GLOBOCAN estimates for incidence and mortality rates in 2020. Lancet Diabetes Endocrinol. (2022) 10:264–72. doi: 10.1016/S2213-8587(22)00035-3
4. Lyu Z, Zhang Y, Sheng C, Huang Y, Zhang Q, and Chen K. Global burden of thyroid cancer in 2022: Incidence and mortality estimates from GLOBOCAN. Chin Med J (Engl). (2024) 137:2567–76. doi: 10.1097/CM9.0000000000003284
5. Durante C, Grani G, Lamartina L, Filetti S, Mandel SJ, and Cooper DS. The diagnosis and management of thyroid nodules: A review. Jama. (2018) 319:914–24. doi: 10.1001/jama.2018.0898
6. Sipos JA. The history of thyroid ultrasound: past, present, and future directions. Endocr Pract. (2024) 30:1220–6. doi: 10.1016/j.eprac.2024.08.010
7. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the american thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid. (2016) 26:1–133. doi: 10.1089/thy.2015.0020
8. Russ G, Bonnema SJ, Erdogan MF, Durante C, Ngu R, and Leenhardt L. European thyroid association guidelines for ultrasound Malignancy risk stratification of thyroid nodules in adults: the EU-TIRADS. Eur Thyroid J. (2017) 6:225–37. doi: 10.1159/000478927
9. Tessler FN, Middleton WD, Grant EG, Hoang JK, Berland LL, Teefey SA, et al. ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J Am Coll Radiol. (2017) 14:587–95. doi: 10.1016/j.jacr.2017.01.046
10. Zhou J, Yin L, Wei X, Zhang S, Song Y, Luo B, et al. 2020 Chinese guidelines for ultrasound Malignancy risk stratification of thyroid nodules: the C-TIRADS. Endocrine. (2020) 70:256–79. doi: 10.1007/s12020-020-02441-y
11. Ha EJ, Chung SR, Na DG, Ahn HS, Chung J, Lee JY, et al. 2021 Korean thyroid imaging reporting and data system and imaging-Based management of thyroid nodules: korean society of thyroid radiology consensus statement and recommendations. Korean J Radiol. (2021) 22:2094–123. doi: 10.3348/kjr.2021.0713
12. Jeong EY, Kim HL, Ha EJ, Park SY, Cho YJ, and Han M. Computer-aided diagnosis system for thyroid nodules on ultrasonography: diagnostic performance and reproducibility based on the experience level of operators. Eur Radiol. (2019) 29:1978–85. doi: 10.1007/s00330-018-5772-9
13. Koo DH, Song K, Kwon H, Bae DS, Kim JH, Min HS, et al. Does tumor size influence the diagnostic accuracy of ultrasound-guided fine-needle aspiration cytology for thyroid nodules? Int J Endocrinol. (2016) 2016:3803647. doi: 10.1155/2016/3803647
14. Cantara S, Marzocchi C, Pilli T, Cardinale S, Forleo R, Castagna MG, et al. Molecular signature of indeterminate thyroid lesions: current methods to improve fine needle aspiration cytology (FNAC) diagnosis. Int J Mol Sci. (2017) 18:775. doi: 10.3390/ijms18040775
15. Bora Makal G and Aslan A. The diagnostic value of the american college of radiology thyroid imaging reporting and data system classification and shear-wave elastography for the differentiation of thyroid nodules. Ultrasound Med Biol. (2021) 47:1227–34. doi: 10.1016/j.ultrasmedbio.2021.01.023
16. Filho RHC, Pereira FL, and Iared W. Diagnostic accuracy evaluation of two-dimensional shear wave elastography in the differentiation between benign and Malignant thyroid nodules: systematic review and meta-analysis. J Ultrasound Med. (2020) 39:1729–41. doi: 10.1002/jum.15271
17. Tuo J, Si X, and Song H. Artificial intelligence technology enhances the performance of shear wave elastography in thyroid nodule diagnosis. Am J Transl Res. (2023) 15:6226–33.
18. Chung SR, Baek JH, Lee MK, Ahn Y, Choi YJ, Sung TY, et al. Computer-aided diagnosis system for the evaluation of thyroid nodules on ultrasonography: prospective non-inferiority study according to the experience level of radiologists. Korean J Radiol. (2020) 21:369–76. doi: 10.3348/kjr.2019.0581
19. Chinese Society of Endocrinology, Thyroid and Metabolic Surgery Group of the Chinese Society of Surgery, Head and Neck Tumor Committee of the Chinese Anti-Cancer Association, Chinese Society of Nuclear Medicine, Thyroid Cancer Committee of the Chinese Anti-Cancer Association, Thyroid Surgery Committee of the Chinese Medical Doctor Association, et al. Guidelines for the diagnosis and treatment of thyroid nodules and differentiated thyroid carcinoma (2nd edition). Int J Endocrinol Metab. (2023) 43:149–94.
20. Li H, Kang C, Xue J, Jing L, and Miao J. Influence of lesion size on shear wave elastography in the diagnosis of benign and Malignant thyroid nodules. Sci Rep. (2021) 11:21616. doi: 10.1038/s41598-021-01114-8
21. Wang B, Wan Z, Zhang M, Gong F, Zhang L, Luo Y, et al. Diagnostic value of a dynamic artificial intelligence ultrasonic intelligent auxiliary diagnosis system for benign and Malignant thyroid nodules in patients with Hashimoto thyroiditis. Quant Imaging Med Surg. (2023) 13:3618–29. doi: 10.21037/qims-22-889
22. Dietrich CF, Bamber J, Berzigotti A, Bota S, Cantisani V, Castera L, et al. EFSUMB guidelines and recommendations on the clinical use of liver ultrasound elastography, update 2017 (Long version). Ultraschall Med. (2017) 38:e16–47. doi: 10.1055/s-0043-103952
23. Cosgrove D, Barr R, Bojunga J, Cantisani V, Chammas MC, Dighe M, et al. WFUMB guidelines and recommendations on the clinical use of ultrasound elastography: part 4. Thyroid. Ultrasound Med Biol. (2017) 43:4–26. doi: 10.1016/j.ultrasmedbio.2016.06.022
24. Adler DD, Carson PL, Rubin JM, and Quinn-Reid D. Doppler ultrasound color flow imaging in the study of breast cancer: preliminary findings. Ultrasound Med Biol. (1990) 16:553–9. doi: 10.1016/0301-5629(90)90020-D
25. Tan R, Liu J, Tang Q, Chen H, and Zhang Z. Clinical diagnostic value of shear-wave elastography in detecting Malignant nipple retraction. J Comput Assist Tomogr. (2025) 49:73–9. doi: 10.1097/RCT.0000000000001653
26. Cavallo A, Johnson DN, White MG, Siddiqui S, Antic T, Mathew M, et al. Thyroid nodule size at ultrasound as a predictor of Malignancy and final pathologic size. Thyroid. (2017) 27:641–50. doi: 10.1089/thy.2016.0336
27. Gao XQ, Ma Y, Peng XS, Wang LL, Li HX, Zheng XL, et al. Diagnostic performance of C-TIRADS combined with SWE for the diagnosis of thyroid nodules. Front Endocrinol (Lausanne). (2022) 13:939303. doi: 10.3389/fendo.2022.939303
28. Chambara N, Lo X, Chow TCM, Lai CMS, Liu SYW, and Ying M. Combined shear wave elastography and EU TIRADS in differentiating Malignant and benign thyroid nodules. Cancers (Basel). (2022) 14:5521. doi: 10.3390/cancers14225521
29. Cheng KL, Lai PH, Su CL, Baek JH, and Lee HL. Impact of region-of-interest size on the diagnostic performance of shear wave elastography in differentiating thyroid nodules. Cancers (Basel). (2023) 15:5214. doi: 10.3390/cancers15215214
30. Xue JP, Kang XY, Miao JW, Zhang YX, Li HZ, Yao FC, et al. Analysis of the influence of thyroid nodule characteristics on the results of shear wave elastography. Front Endocrinol (Lausanne). (2022) 13:858565. doi: 10.3389/fendo.2022.858565
31. Fukuhara T, Matsuda E, Endo Y, Takenobu M, Izawa S, Fujiwara K, et al. Correlation between quantitative shear wave elastography and pathologic structures of thyroid lesions. Ultrasound Med Biol. (2015) 41:2326–32. doi: 10.1016/j.ultrasmedbio.2015.05.001
32. Ha EJ, Baek JH, Lee JH, Song DE, Kim JK, Shong YK, et al. Sonographically suspicious thyroid nodules with initially benign cytologic results: the role of a core needle biopsy. Thyroid. (2013) 23:703–8. doi: 10.1089/thy.2012.0426
33. Jasim S, Baranski TJ, Teefey SA, and Middleton WD. Investigating the effect of thyroid nodule location on the risk of thyroid cancer. Thyroid. (2020) 30:401–7. doi: 10.1089/thy.2019.0478
34. Yang X, Geng H, Wang X, Li L, An X, and Cong Z. Identification of lesion location and discrimination between benign and Malignant findings in thyroid ultrasound imaging. Sci Rep. (2024) 14:32118. doi: 10.1038/s41598-024-83888-1
35. Nattabi HA, Sharif NM, Yahya N, Ahmad R, Mohamad M, Zaki FM, et al. Is diagnostic performance of quantitative 2D-shear wave elastography optimal for clinical classification of benign and Malignant thyroid nodules?: A systematic review and meta-analysis. Acad Radiol. (2022) 29 Suppl 3:S114–s21. doi: 10.1016/j.acra.2017.09.002
Keywords: artificial intelligence-assisted diagnostic system, Chinese thyroid imaging reporting and data system, shear wave elastography, thyroid nodule, ultrasound
Citation: Zhou Y, Li R, Zheng W, Liu Z, Yang J and Wang L (2026) SWE, C-TIRADS and AI-assisted diagnostic systems in distinguishing thyroid nodules of different lesion size: Which has better diagnostic performance? Front. Endocrinol. 16:1691092. doi: 10.3389/fendo.2025.1691092
Received: 22 August 2025; Accepted: 09 December 2025; Revised: 05 December 2025;
Published: 07 January 2026.
Edited by:
Sandeep Kumar Mishra, Yale University, United StatesReviewed by:
Shahram Taeb, Gilan University of Medical Sciences, IranXiyue Yang, The Eighth Affiliated Hospital of Guangxi Medical University, China
Copyright © 2026 Zhou, Li, Zheng, Liu, Yang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yuwei Zhou, eXV3ZWl6aG91MTk4N0BvdXRsb29rLmNvbQ==
Ruifen Li1