- 1Department of Ultrasound, The First Affiliated Hospital of Dalian Medical University, Dalian, China
- 2College of Humanities and Social Sciences, Dalian Medical University, Dalian, China
- 3Department of Pathology, The First Affiliated Hospital of Dalian Medical University, Dalian, China
Objectives: Determining the nature of thyroid nodules through a single fine-needle aspiration (FNA) biopsy is not feasible for approximately one-third of patients. We developed a predictive model to assist FNA decision-making and reduce unnecessary FNAs.
Methods: This retrospective study consecutively included patients who underwent ultrasound-guided FNA between March 2018 and March 2023. Patients were divided into a training dataset (70%) and a validation dataset (30%). Univariate analysis was performed within the training dataset using Kruskal–Wallis test for continuous variables and chi-square test or Fisher’s exact test for categorical variables. Variables with significance were entered into multivariate logistic regression. The prediction model (B-Model) was constructed using a cascaded three-stage logistic regression framework: Stage I distinguished benign from non-benign nodules, Stage II differentiated malignant from non-malignant nodules, Stage III separated follicular neoplasm from indeterminate/atypia nodules. Model performance was assessed in the validation dataset using sensitivity (SEN), specificity (SPE), and accuracy (ACC). The reduction in repeat FNA facilitated by the B-Model was calculated.
Results: Training and validation datasets included 1,573 and 672 cases, respectively. The overall SEN, SPE and ACC of the B-Model were 84.7%, 76.7% and 60.1% in the validation dataset. The application of the B-Model reduced the number of patients requiring repeat FNA from 255 to 153, resulting in a 40.0% reduction.
Conclusion: The B-Model demonstrated robust predictive performance, facilitating the optimization of pre-FNA diagnostic workflows, significantly reducing unnecessary repeat FNAs, and advancing precision in thyroid nodule management.
1 Introduction
Thyroid nodules (TNs) are common in the general population, with a global incidence ranging from 19 to 68%. Most nodules are benign, with 7–15% being malignant (1–3). Given the differences in pathogenesis, biologic behavior, and clinical manifestations, there are significant variations in treatment and prognosis among different pathologic types and subtypes of TNs (4). In recent years, the advent and dissemination of treatment technologies, such as ablation, targeted therapy, immunotherapy, and traditional Chinese medicine, have revolutionized the management of TNs (5). To provide patients with more precise and personalized treatment strategies, accurate pathologic diagnosis of TNs is crucial.
Ultrasound (US)-guided fine-needle aspiration biopsy (FNA) is a safe and effective method for obtaining thyroid cells and is currently the preferred approach for diagnosing TNs (1, 6–8). The Bethesda System for Reporting Thyroid Cytopathology (BSRTC), which is widely adopted globally, aims to unify the terminology used in pathology reports and achieve standardized reporting (9–11). BSRTC II, V, and VI are distinctly labeled as benign, suspicious for malignancy, and malignant. Conversely, BSRTC I, III, and IV encompass nondiagnostic, atypia of undetermined significance, and follicular neoplasm, respectively, which lack definitive diagnoses and exhibit a potential occurrence range of 20–34% (10–13). Multiple guidelines suggest that comprehensive management should be performed based on clinical risk factors in accordance with the patient’s wishes. Repeat FNA (rFNA) is highly recommended for BSRTC I nodules. For BSRTC III, a range of options are advised, including rFNA, rFNA with molecular testing, diagnostic lobectomy, and surveillance. Concerning BSRTC IV, the recommended approach encompasses rRNA coupled with molecular testing or diagnostic lobectomy (1, 6, 14). Therefore, approximately one-third of patients may require two FNA procedures to achieve a more precise diagnosis. Even after undergoing two FNAs, some patients still confront diagnostic ambiguity, which ultimately requires thyroidectomy. This undoubtedly increases patient exposure to invasive procedures, prolongs waiting time, and imposes a significant financial burden.
This study aimed to devise a predictive model (B-Model) for BSRTC categorization of FNA that identifies nodules that cannot be determined solely through FNA so that we can minimize ineffective punctures, maximize the diagnostic efficiency of FNA, and ultimately promote precision medicine.
2 Materials and methods
2.1 Patients
This single-center retrospective study consecutively included patients who underwent US-FNA of TNs between March 2018 and March 2023 (n = 4,210). To evaluate temporal generalizability, the dataset was divided chronologically into two cohorts: March 2018 to February 2022 (training dataset) and March 2022 to March 2023 (validation dataset). Exclusions criteria included: absence of ultrasound images, pathology-confirmed non-thyroid lesions, operator experience <3 years, multiple punctures (only the last result retained), and missing biochemical data. After exclusions, the final study population consisted of 1,573 patients in the training dataset and 672 patients in the validation dataset, with an approximate ratio of 7:3 between the two cohorts. The overall study design and patient selection flow are illustrated in Figure 1.
 
  Figure 1. Study flow diagram of patient enrollment, dataset allocation, and B-Model development. Study flow diagram showing inclusion and exclusion criteria, patient enrollment, and dataset allocation into training and validation cohorts, with datasets divided chronologically (March 2018–February 2022 for training, March 2022–March 2023 for validation). Architecture of the cascaded logistic regression model (B-Model), in which three logistic regression equations were sequentially linked: Equation P1 distinguished benign from non-benign nodules (Group 1 vs. non-Group 1); Equation P2 differentiated malignant from non-malignant nodules (Group 4 vs. non-Group 4); and Equation P3 further separated follicular neoplasm from indeterminate/atypia nodules (Group 3 vs. Group 2). BSRTC, Bethesda System for Reporting Thyroid Cytopathology [Flowchart design: Boardmix Online Platform (https://boardmix.cn)].
2.2 Acquisition of clinical information and biochemical results
Clinical information and biochemical results for all patients were obtained from an electronic medical data management system. The following clinical features were recorded: patient’s age and sex. Biochemical results included free triiodothyronine (FT3), free thyroxine (FT4), thyroid-stimulating hormone (TSH), antithyroid peroxidase autoantibody (A-TPO), thyroglobulin antibody (A-TG), thyroglobulin (TG), and thyrotropin receptor antibody (TRAb). All biochemical tests were conducted within 1 month of the FNA.
2.3 Cytopathology acquisition and grouping
All cytopathologic examinations were performed by two pathologists with >8 years of thyroid cytopathology experience and subsequently reviewed by a senior pathologist with >15 years of experience. Findings were classified according to the 2023 revision of BSRTC into four groups: Group 1 (BSRTC II), Group 2 (BSRTC I/III), Group 3 (BSRTC IV), and Group 4 (BSRTC V/VI).
2.4 Ultrasound image acquisition and interpretation
Ultrasound data were retrieved from the institutional imaging system. Two US radiologists (>7 years of thyroid imaging experience) independently assessed thyroid echotexture, nodule position, capsule distance, size, volume, composition, echogenicity, echotexture, margin, shape, orientation, calcifications, posterior features, halo and Adler’s semiquantitative grading for nodule blood flow (Grades 0–3). Discrepancies were resolved by consensus with a senior radiologist (>20 years of experience).
2.5 Statistical analysis
SPSS statistical software (version 20.0; IBM Corporation, Armonk, NY, USA) was used for the statistical analysis. Baseline characteristics between the training and validation datasets were compared using the Mann–Whitney U test for continuous variables and the chi-square or Fisher’s exact test for categorical variables. Univariate analyses were further performed within the training dataset to identify factors associated with pathological classification, applying the Kruskal–Wallis test for continuous variables and the chi-square or Fisher’s exact test for categorical variables across the four groups. A p-value of <0.05 was considered statistically significant.
The prediction model (B-Model) was developed using multivariable logistic regression in SPSS based on training dataset, and it adopted a three-stage architecture as illustrated in Figure 1: (1) distinguished benign from non-benign nodules (Group 1 vs. non-Group 1) by Equation P1; (2) differentiated malignant from non-malignant nodules (Group 4 vs. non-Group 4) by Equation P2; (3) separated follicular neoplasm from indeterminate/atypia nodules (Group 3 vs. Group 2) by Equation P3. Each equation had two versions: one that included biochemical indicators as independent variables P(w), and another that did not include biochemical indicators as independent variables P(w/o). For other special circumstances, a supplementary version was designed P(c). Multivariable logistic regression analyses with backward stepwise selection were applied to identify independent variables x1-i. Based on clinical significance or published reports, we graded each risk factor, selected an appropriate grade as the baseline risk reference value, and recorded the score as 0 (1, 6, 13). β0-i is the regression coefficient of each independent variable. Using these parameters, we calculated P as the dependent variable corresponding to each risk factor classification using the following formula, where exp denotes the natural exponential function:
The dependent variable P in the equation above uses 0.5 as a threshold value. Similar cascaded/sequential logistic regression approaches have been applied in recent medical prediction studies to improve classification performance and manage class imbalance (15–17).
The data in the validation dataset were used to select the equations and validate the performance of the prediction models. By substituting the data into previously established equations and considering the actual pathologic results as the gold standard, the sensitivity (SEN), specificity (SPE), accuracy (ACC), positive predictive rate (PPV), negative predictive rate (NPV) and area under the receiver operating characteristic curve (AUC-ROC) of each equation were evaluated. Finally, the rate of reduction in rFNAs after the B-Model implementation was calculated using the following equation:
(FN: True Group 2/3 cases incorrectly classified as Group 1/4 by B-Model).
3 Results
3.1 Patient characteristics
In the training dataset, the final cohort included 1,573 patients [median age: 48 years (IQR: 38–57)] of the initial 2,818 patients, after the exclusion of 1,245 patients. In the validation dataset, the final cohort included 672 patients [median age: 50 years (IQR: 40–58)] of the initial 1,392 patients, after excluding 720 patients. The patient characteristics, US features, and biochemical results are shown in Table 1. Overall, no significant statistical differences were observed between two cohorts for most baseline characteristics except three laboratory indicators (FT4, A-TG, and A-TPO; p = 0.047, <0.001, and 0.002, respectively). These differences likely reflect case-mix shifts from time-based cohort division and variability in laboratory assays.
 
  Table 1. Comparison of baseline clinical characteristics and ultrasound features of thyroid nodules between the training and validation datasets a,b.
3.2 Factors influencing pathology
In the training dataset, univariate analysis identified significant differences (p < 0.05) in 2 patient characteristics, 15 US features, and 4 biochemical markers across the groups (Table 2). Specifically, thyroid echogenicity and A-TG levels were significantly different between Groups 1 and 3 (p = 0.047 and p = 0.046, respectively) whereas FT4 levels were significantly different between Groups 2 and 4 (p = 0.032). All significant variables were included as independent covariates in the subsequent multivariate analysis.
 
  Table 2. Patient clinical characteristics and ultrasound findings of the nodules associated with grouping in the training dataset a-c.
3.3 Construction of equations P1, P2, and P3
There versions of Equation P1 were derived: P1(w/o) (χ2 = 457.323, p < 0.001), P1(w) (χ2 = 300.627, p < 0.001), and P1(c) (χ2 = 300.627, p < 0.001). P1(c) was generated by cross-validation to address the absence of biochemical indicators in P1(w). Two versions of Equation P2 were developed: P2 (w/o) (χ2 = 324.479, p < 0.001) and P2 (w) (χ2 = 198.300, p < 0.001). Two versions of Equation P3 were established: P3 (w/o) (χ2 = 148.499, p < 0.001) and P3 (w) (χ2 = 98.663, p < 0.001).
3.4 Verification of equations P1, P2, and P3
The validation results showed that among the three Equation P1 variants, P1(c) demonstrated the highest SEN (88.3%), SPE (68.0%), ACC (83.1%), PPV (89.2%), and NPV (66.1%), while maintaining comparable ROC-AUC (0.830 vs. 0.842/0.842 in P1(w/o)/P1(w), all p < 0.001). The reduced variable count (from 10 to 6) enhanced clinical utility. In the final selected Equation P1, significant predictors included markedly hypoechoic feature (OR: 10.286, 95% CI: 6.118–17.296), hypoechoic feature (OR: 4.703, 95% CI: 3.190–6.932), irregular/extra-thyroidal extension (OR: 1.705, 95% CI: 1.180–2.463), enhanced posterior features (OR: 1.853, 95% CI: 1.265–2.715), and shadowing (OR: 2.809, 95% CI: 1.220–5.031), whereas lobulated shape showed nonsignificant association (OR: 1.122, 95% CI: 0.636–1.980). Isoechoic/hyperechoic pattern, oval-to-round shape, and absent posterior features were identified as independent protective factors for benign nodules.
Among the 498 non-Group 1 cases predicted by Equation P1, Equation P2 (w) demonstrated higher SEN (80.6% vs. 74.4%) and NPV (74.0% vs. 73.3%) compared to P2 (w/o), with comparable ROC-AUC (0.735 vs. 0.759, both p = 0.000). Thus, P2 (w) was selected to reduce missed diagnoses of malignancy. Key risk factors in Equation P2 included isthmus location (OR: 4.000, 95% CI: 1.475–10.843), size > 5 mm (highest risk at 5–10 mm; OR: 3.058, 95% CI: 1.671–5.596), markedly hypoechoic/hypoechoic features (OR: 20.203, 95% CI: 5.203–81.179), taller-than-wide shape (OR: 5.165, 95% CI: 2.889–9.235), microcalcifications/complex calcifications (OR: 1.199, 95% CI: 0.626–2.296), and elevated TRAb (OR: 1.628, 95% CI: 1.119–2.368). These were independent predictors of malignant nodules.
Among the 181 cases predicted as neither Group 1 nor 4 by Equation P1 and P2, Equation P3(w) showed higher SPE (96.0% vs. 95.4%) than P3(w/o) with similar SEN (both 37.5%), ACC (93.4% vs. 92.7%), PPV (both 2.9%), NPV (70.0% vs. 72.7%), and ROC-AUC (0.814 vs. 0.837, both p = 0.000). The predictive performance of Equations P1, P2 and P3 in the validation dataset are presented in Table 3 and Figure 2.
 
  Table 3. Predictive efficacy of equations P1 (w/o), P1 (w), P1 (c), P2 (w/o), P2 (w), P3 (w/o), and P3 (w) in the validation dataset.
 
  Figure 2. Receiver operating characteristic (ROC) curve analysis for three regression equations. (A) ROC curves comparing three designs (Equation P1) predicting Group 1 (BSRTC II). AUC values: P1(w/o): 0.842 (95% confidence interval [CI] 0.807–0.876), P1(W): 0.842 (95% CI 0.808–0.876), P1(C): 0.830 (95% CI 0.792–0.868). (B) ROC curves comparing two designs (Equation P2) predicting Group 4 (BSRTC V/VI). AUC values: P2(w/o): 0.759 (95% CI 0.717–0.801), P2(w): 0.735 (95% CI 0.691–0.779). (C) ROC curves comparing two designs (Equation P3) distinguishing Groups 2 (BSRTC I/III) and 3 (BSRTC IV). AUC values: P3(w/o): 0.837 (95% CI 0.650–1.000); P3(w): 0.814 (95% CI 0.599–1.000). BSRTC, Bethesda System for Reporting Thyroid Cytopathology (ROC curve plotting: SPSS 20.0, IBM; image editing: Adobe Photoshop CS5).
3.5 Overall efficacy of the B-Model
For the validation dataset, the number of cases correctly predicted by the B-Model were 115, 91, 3, and 195 in Groups 1, 2, 3, and 4, respectively. The prediction results of B-Model in the validation dataset are presented in Table 4. True Group 2/3 cases were 255, and true Group 2/3 cases incorrectly classified as Group 1/4 by B-Model was 153. The rFNA reduction rate was 40%.
4 Discussion
US remains the primary imaging tool for TN risk stratification. While certain US features are associated with malignancy, most nodules still require FNA for definitive diagnosis. This study bridges this gap by integrating clinical, biochemical, and US features into a cascaded multivariable logistic regression model (B-Model) for pre-FNA prediction of BSRTC categories.
Operationally, the B-Model links three logistic regression equations in sequence. At the point of use, clinicians input the available clinical, ultrasound, and biochemical variables; the model sequentially evaluates benign vs. non-benign (Equation P1), malignant vs. non-malignant (Equation P2), and follicular neoplasm vs. indeterminate/atypia (Equation P3). A fixed threshold of 0.5 is applied at each step, ensuring that every nodule is ultimately assigned to one, and only one, predicted BSRTC group.
As illustrated in Figure 3 this structured, pre-FNA assignment provides direct guidance for patient management. In contrasts to the conventional workflow, where indeterminate cytology (BSRTC I, III, IV) often necessitate rFNA and may ultimately proceed to diagnostic lobectomy, the B-Model enables early identification of nodules likely to yield indeterminate results. Such cases can be directly triaged to FNA plus molecular testing or diagnostic lobectomy, thereby avoiding redundant punctures. In the validation dataset, this approach reduced the rFNA by 40.0%, minimizing patient trauma and conserving healthcare resources. Importantly, the B-Model theoretically requires only a single FNA per nodule, representing a significant advancement in clinical efficiency.
 
  Figure 3. Diagnostic workflows for thyroid nodular diseases. (A) Conventional workflow based on fine-needle aspiration (FNA). Indeterminate results (BSRTC I, III, IV) require repeat FNA/and molecular testing, with unresolved nodules often proceeding to diagnostic lobectomy. (B) Proposed workflow using B-Model. Nodules are stratified into predicted BSRTC II/V/VI (direct FNA), BSRTC I/III (FNA + molecular testing), and BSRTC IV (molecular testing or direct diagnostic lobectomy), providing a more streamlined and individualized management strategy. Notably, in the B-Model, each nodule theoretically requires only a single FNA, avoiding repeated punctures. FNA, fine-needle aspiration; BSRTC, Bethesda System for Reporting Thyroid Cytopathology [Flowchart design: Boardmix Online Platform (https://boardmix.cn)].
A key methodological consideration was the reduction of cumulative errors inherent to cascaded regression. To mitigate this risk, BSRTC categories with similar clinical management strategies were merged (BSRTC I with III, and BSRTC V with VI), reducing six categories to four groups (1, 8, 11, 18). This consolidation balanced statistical robustness clinical practicality and minimized propagation error. Similar sequential or multi-step logistic regression strategies have been applied successfully in other medical domains, supporting both interpretability and transparency of the modeling process (19–21).
Although machine learning and deep learning methods such as convolutional neural networks (CNNs) have been increasingly applied in radiomics, they remain limited by several drawbacks (22–26). First, the ‘black-box’ nature of CNNs prevents transparent identification of the imaging features driving classification, thereby reducing interpretability. Second, overfitting may arise when models are over-parameterized, which undermines generalizability (26–29). In contrast, we selected a cascaded logistic regression model because it provides transparent and interpretable results that facilitate the training of junior clinicians; its sequential structure mimics a decision tree, which helps handle data imbalance while preserving a linear framework; and it also offers a necessary foundation for subsequent AI research, enabling insight into the underlying decision logic before moving toward more advanced algorithms (17, 20, 30).
Beyond diagnostic utility, the B-Model highlighted certain features that deserve further clinical attention. Equation P2 identified younger age, isthmus location, and small nodule size (particularly 5–10 mm) as predictors of malignancy. While some study have reported similar findings, one possible explanation for this observation in our cohort is the relatively high proportion of sub-centimeter and isthmus-located nodules (31–34). This indicated that conventional size–risk associations, which are largely derived from nodules ≥1 cm, may not fully capture the risk pattern of microcarcinomas. As a result, the diagnosis of microcarcinomas remains challenging, particularly for junior clinicians (35). By incorporating these features, our model provides intuitive “rules of thumb” that support structured image interpretation and enhance diagnostic confidence, especially for nodules ≤1 cm. Thus, the B-Model serves not only as a decision-support system but also as a valuable teaching aid.
This study has limitations. First, although the training and validation cohorts were largely comparable, differences were observed in FT4, A-TG, and A-TPO levels. These variations likely reflect case-mix shifts from time-based cohort division and assay-related variability in laboratory testing, but they were confined to biochemical indicators and did not affect model performance. Second, as a single-center study, variability in ultrasonography and pathologic interpretation may limit generalizability. Third, collinearity and potential confounding were not explicitly tested, though variables were selected based on clinical relevance and univariable screening, and regression coefficients remained stable. Finally, while the B-Model reduced rFNA by 40% under retrospective conditions, its real-world effectiveness and operational feasibility requires validation through prospective multicenter studies.
In conclusion, we developed a cascaded logistic regression model and demonstrated its effectiveness. By integrating clinical, ultrasound, and biochemical indicators, the B-Model enabled pre-FNA prediction of BSRTC categories, thereby optimizing the diagnostic workflow for TNs, reducing unnecessary FNAs, and advancing precision medicine in TN management.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The study protocol was approved by the Ethics Committee of the First Affiliated Hospital of Dalian Medical University (Approval No. PJ-KS-KY-2023-213) and registered with the Chinese Clinical Trial Registry (Registration ID: ChiCTR2400082395). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
SG: Methodology, Investigation, Writing – review & editing, Formal analysis, Software, Visualization, Data curation, Writing – original draft. BL: Formal analysis, Data curation, Software, Writing – original draft, Writing – review & editing, Investigation. MT: Formal analysis, Validation, Writing – original draft, Writing – review & editing. YZ: Investigation, Writing – review & editing, Visualization, Writing – original draft. LW: Writing – original draft, Writing – review & editing, Investigation, Visualization. LD: Writing – original draft, Writing – review & editing, Visualization, Investigation. CS: Writing – review & editing, Writing – original draft. MH: Writing – review & editing, Writing – original draft. YC: Conceptualization, Writing – original draft, Writing – review & editing, Methodology, Project administration, Supervision.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
We would like to sincerely thank Dr. Qigui Liu, formerly of the School of Public Health, Dalian Medical University, for his invaluable guidance.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1641266/full#supplementary-material
References
1. Haugen, BR, Alexander, EK, Bible, KC, Doherty, GM, Mandel, SJ, Nikiforov, YE, et al. 2015 American Thyroid Association management guidelines for adult patients with thyroid nodules and differentiated thyroid Cancer: the American Thyroid Association guidelines task force on thyroid nodules and differentiated thyroid Cancer. Thyroid. (2016) 26:1–133. doi: 10.1089/thy.2015.0020
2. Mu, C, Ming, X, Tian, Y, Liu, Y, Yao, M, Ni, Y, et al. Mapping global epidemiology of thyroid nodules among general population: a systematic review and meta-analysis. Front Oncol. (2022) 12:1029926. doi: 10.3389/fonc.2022.1029926
3. Vaccarella, S, Lortet-Tieulent, J, Colombet, M, Davies, L, Stiller, CA, Schüz, J, et al. Global patterns and trends in incidence and mortality of thyroid cancer in children and adolescents: a population-based study. Lancet Diabetes Endocrinol. (2021) 9:144–52. doi: 10.1016/S2213-8587(20)30401-0
4. Administration NHCotPsRoCMAaH. Guidelines for the diagnosis and treatment of thyroid carcinoma. Chin J Pract Surg. (2022) 42:1343–57.
5. Shonka, DC Jr, Ho, A, Chintakuntlawar, AV, Geiger, JL, Park, JC, Seetharamu, N, et al. American head and neck society endocrine surgery section and international thyroid oncology group consensus statement on mutational testing in thyroid cancer: defining advanced thyroid cancer and its targeted treatment. Head Neck. (2022) 44:1277–300. doi: 10.1002/hed.27025
6. Russ, G, Bonnema, SJ, Erdogan, MF, Durante, C, Ngu, R, and Leenhardt, LJETJ. European thyroid association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: the EU-TIRADS. Eur Thyroid J. (2017) 6:225–37. doi: 10.1159/000478927
7. Tessler, FN, Middleton, WD, Grant, EG, Hoang, JK, Berland, LL, Teefey, SA, et al. ACR thyroid imaging, reporting and data system (TI-RADS): White paper of the ACR TI-RADS Committee. J Am Coll Radiol. (2017) 14:587–95. doi: 10.1016/j.jacr.2017.01.046
8. Lee, YH, Baek, JH, Jung, SL, Kwak, JY, Kim, JH, Shin, JH, et al. Ultrasound-guided fine needle aspiration of thyroid nodules: a consensus statement by the korean society of thyroid radiology. Korean J Radiol. (2015) 16:391–401. doi: 10.3348/kjr.2015.16.2.391
9. Cibas, ES, and Ali, SZ. The Bethesda system for reporting thyroid cytopathology. Am J Clin Pathol. (2009) 132:658–65. doi: 10.1309/AJCPPHLWMI3JV4LA
10. Cibas, ES, and Ali, SZ. The 2017 Bethesda system for reporting thyroid cytopathology. J Am Soc Cytopathol. (2017) 6:217–22. doi: 10.1016/j.jasc.2017.09.002
11. Ali, SZ, Baloch, ZW, Cochand-Priollet, B, Schmitt, FC, Vielh, P, and VanderLaan, PA. The 2023 Bethesda system for reporting thyroid cytopathology. J Am Soc Cytopathol. (2023) 12:319–25. doi: 10.1016/j.jasc.2023.05.005
12. Todorovic, E, Sheffield, BS, Kalloger, S, Walker, B, and Wiseman, SMJC. Increased cancer risk in younger patients with thyroid nodules diagnosed as atypia of undetermined significance. Cureus. (2018) 10:e2348. doi: 10.7759/cureus.2348
13. Huang, J, Shi, H, Song, M, Liang, J, Zhang, Z, Chen, X, et al. Surgical outcome and malignant risk factors in patients with thyroid nodule classified as Bethesda category III. Front Endocrinol. (2021) 12:686849. doi: 10.3389/fendo.2021.686849
14. Zhou, J, Yin, L, Wei, X, Zhang, S, Song, Y, Luo, B, et al. 2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS. Endocrine. (2020) 70:256–79. doi: 10.1007/s12020-020-02441-y
15. Dal Negro, RW, Micheletto, C, Tognella, S, Visconti, M, Guerriero, M, and Sandri, MF. A two-stage logistic model based on the measurement of pro-inflammatory cytokines in bronchial secretions for assessing bacterial, viral, and non-infectious origin of COPD exacerbations. COPD. (2005) 2:7–16. doi: 10.1081/COPD-200050680
16. Zhu, Y, and Fang, J. Logistic regression-based trichotomous classification tree and its application in medical diagnosis. Med Decis Making. (2016) 36:973–89. doi: 10.1177/0272989X15618658
17. Van Holsbeke, C, Ameye, L, Testa, AC, Mascilini, F, Lindqvist, P, Fischerova, D, et al. Development and external validation of new ultrasound-based mathematical models for preoperative prediction of high-risk endometrial cancer. Ultrasound Obstet Gynecol. (2014) 43:586–95. doi: 10.1002/uog.13216
18. Chen, DW, Lang, BHH, McLeod, DSA, Newbold, K, and Haymart, MR. Thyroid cancer. Lancet. (2023) 401:1531–44. doi: 10.1016/S0140-6736(23)00020-X
19. Grover, SB, Patra, S, Grover, H, Mittal, P, and Khanna, G. Prospective revalidation of IOTA "two-step", "alternative two-step" and "three-step" strategies for characterization of adnexal masses - an Indian study focussing the radiology context. Indian J Radiol Imaging. (2020) 30:304–18. doi: 10.4103/ijri.IJRI_279_20
20. Landolfo, C, Bourne, T, Froyman, W, Van Calster, B, Ceusters, J, Testa, AC, et al. Benign descriptors and ADNEX in two-step strategy to estimate risk of malignancy in ovarian tumors: retrospective validation in IOTA5 multicenter cohort. Ultrasound Obstet Gynecol. (2023) 61:231–42. doi: 10.1002/uog.26080
21. Zhang, M, Li, S, Xue, M, and Zhu, Q. Two-stage classification strategy for breast cancer diagnosis using ultrasound-guided diffuse optical tomography and deep learning. J Biomed Opt. (2023) 28:086002. doi: 10.1117/1.JBO.28.8.086002
22. Yao, J, Wang, Y, Lei, Z, Wang, K, Feng, N, Dong, F, et al. Multimodal GPT model for assisting thyroid nodule diagnosis and management. NPJ Digit Med. (2025) 8:245. doi: 10.1038/s41746-025-01652-9
23. Wang, J, Dong, C, Zhang, YZ, Wang, L, Yuan, X, He, M, et al. A novel approach to quantify calcifications of thyroid nodules in US images based on deep learning: predicting the risk of cervical lymph node metastasis in papillary thyroid cancer patients. Eur Radiol. (2023) 33:9347–56. doi: 10.1007/s00330-023-09909-1
24. Chang, L, Zhang, Y, Zhu, J, Hu, L, Wang, X, Zhang, H, et al. An integrated nomogram combining deep learning, clinical characteristics and ultrasound features for predicting central lymph node metastasis in papillary thyroid cancer: a multicenter study. Front Endocrinol. (2023) 14:964074. doi: 10.3389/fendo.2023.964074
25. Yao, J, Zhang, Y, Shen, J, Lei, Z, Xiong, J, Feng, B, et al. AI diagnosis of Bethesda category IV thyroid nodules. iScience. (2023) 26:108114. doi: 10.1016/j.isci.2023.108114
26. Peng, S, Liu, Y, Lv, W, Liu, L, Zhou, Q, Yang, H, et al. Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study. Lancet Digit Health. (2021) 3:e250–9. doi: 10.1016/S2589-7500(21)00041-8
27. Buda, M, Wildman-Tobriner, B, Hoang, JK, Thayer, D, Tessler, FN, Middleton, WD, et al. Management of Thyroid Nodules Seen on US images: deep learning may match performance of radiologists. Radiology. (2019) 292:695–701. doi: 10.1148/radiol.2019181343
28. Wu, X, Li, M, Cui, XW, and Xu, G. Deep multimodal learning for lymph node metastasis prediction of primary thyroid cancer. Phys Med Biol. (2022) 67:035008. doi: 10.1088/1361-6560/ac4c47
29. Zhou, L, Zheng, Y, Yao, J, Chen, L, and Xu, D. Association between papillary thyroid carcinoma and cervical lymph node metastasis based on ultrasonic radio frequency signals. Cancer Med. (2023) 12:14305–16. doi: 10.1002/cam4.6107
30. Puggioni, G, Gelfand, AE, and Elmore, JG. Joint modeling of sensitivity and specificity. Stat Med. (2008) 27:1745–61. doi: 10.1002/sim.3186
31. Cavallo, A, Johnson, DN, White, MG, Siddiqui, S, Antic, T, Mathew, M, et al. Thyroid nodule size at ultrasound as a predictor of malignancy and final pathologic size. Thyroid. (2017) 27:641–50. doi: 10.1089/thy.2016.0336
32. Al-Hakami, HA, Alqahtani, R, Alahmadi, A, Almutairi, D, Algarni, M, and Alandejani, T. Thyroid nodule size and prediction of Cancer: a study at tertiary Care Hospital in Saudi Arabia. Cureus. (2020) 12:e7478. doi: 10.7759/cureus.7478
33. Dong, Y, Mao, M, Zhan, W, Zhou, J, Zhou, W, Yao, J, et al. Size and ultrasound features affecting results of ultrasound-guided fine-needle aspiration of thyroid nodules. J Ultrasound Med. (2018) 37:1367–77. doi: 10.1002/jum.14472
34. Lyu, YS, Pyo, JS, Cho, WJ, Kim, SY, and Kim, JH. Clinicopathological significance of papillary thyroid carcinoma located in the isthmus: a Meta-analysis. World J Surg. (2021) 45:2759–68. doi: 10.1007/s00268-021-06178-1
Keywords: fine-needle aspiration, logistic regression, ultrasound imaging, thyroid nodules, precision medicine
Citation: Gao S, Liu B, Tong M, Zhu Y, Wang L, Du L, Shi C, Han M and Che Y (2025) A cascaded clinical-ultrasound-biochemical model for precise prediction before thyroid nodule fine-needle aspiration biopsy. Front. Med. 12:1641266. doi: 10.3389/fmed.2025.1641266
Edited by:
Angelika Buczyńska, Medical University of Bialystok, PolandReviewed by:
Jincao Yao, University of Chinese Academy of Sciences, ChinaTianhan Zhou, Zhejiang Chinese Medical University, China
Copyright © 2025 Gao, Liu, Tong, Zhu, Wang, Du, Shi, Han and Che. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ying Che, Y2hleWluZ0BkbXUuZWR1LmNu
 Shuhang Gao1
Shuhang Gao1