Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 18 September 2025

Sec. Precision Medicine

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1641266

A cascaded clinical-ultrasound-biochemical model for precise prediction before thyroid nodule fine-needle aspiration biopsy

Shuhang GaoShuhang Gao1Bojia LiuBojia Liu2Mengying TongMengying Tong1Yalin ZhuYalin Zhu1Lina WangLina Wang1Linyao DuLinyao Du1Chang ShiChang Shi3Mei HanMei Han3Ying Che
Ying Che1*
  • 1Department of Ultrasound, The First Affiliated Hospital of Dalian Medical University, Dalian, China
  • 2College of Humanities and Social Sciences, Dalian Medical University, Dalian, China
  • 3Department of Pathology, The First Affiliated Hospital of Dalian Medical University, Dalian, China

Objectives: Determining the nature of thyroid nodules through a single fine-needle aspiration (FNA) biopsy is not feasible for approximately one-third of patients. We developed a predictive model to assist FNA decision-making and reduce unnecessary FNAs.

Methods: This retrospective study consecutively included patients who underwent ultrasound-guided FNA between March 2018 and March 2023. Patients were divided into a training dataset (70%) and a validation dataset (30%). Univariate analysis was performed within the training dataset using Kruskal–Wallis test for continuous variables and chi-square test or Fisher’s exact test for categorical variables. Variables with significance were entered into multivariate logistic regression. The prediction model (B-Model) was constructed using a cascaded three-stage logistic regression framework: Stage I distinguished benign from non-benign nodules, Stage II differentiated malignant from non-malignant nodules, Stage III separated follicular neoplasm from indeterminate/atypia nodules. Model performance was assessed in the validation dataset using sensitivity (SEN), specificity (SPE), and accuracy (ACC). The reduction in repeat FNA facilitated by the B-Model was calculated.

Results: Training and validation datasets included 1,573 and 672 cases, respectively. The overall SEN, SPE and ACC of the B-Model were 84.7%, 76.7% and 60.1% in the validation dataset. The application of the B-Model reduced the number of patients requiring repeat FNA from 255 to 153, resulting in a 40.0% reduction.

Conclusion: The B-Model demonstrated robust predictive performance, facilitating the optimization of pre-FNA diagnostic workflows, significantly reducing unnecessary repeat FNAs, and advancing precision in thyroid nodule management.

1 Introduction

Thyroid nodules (TNs) are common in the general population, with a global incidence ranging from 19 to 68%. Most nodules are benign, with 7–15% being malignant (13). Given the differences in pathogenesis, biologic behavior, and clinical manifestations, there are significant variations in treatment and prognosis among different pathologic types and subtypes of TNs (4). In recent years, the advent and dissemination of treatment technologies, such as ablation, targeted therapy, immunotherapy, and traditional Chinese medicine, have revolutionized the management of TNs (5). To provide patients with more precise and personalized treatment strategies, accurate pathologic diagnosis of TNs is crucial.

Ultrasound (US)-guided fine-needle aspiration biopsy (FNA) is a safe and effective method for obtaining thyroid cells and is currently the preferred approach for diagnosing TNs (1, 68). The Bethesda System for Reporting Thyroid Cytopathology (BSRTC), which is widely adopted globally, aims to unify the terminology used in pathology reports and achieve standardized reporting (911). BSRTC II, V, and VI are distinctly labeled as benign, suspicious for malignancy, and malignant. Conversely, BSRTC I, III, and IV encompass nondiagnostic, atypia of undetermined significance, and follicular neoplasm, respectively, which lack definitive diagnoses and exhibit a potential occurrence range of 20–34% (1013). Multiple guidelines suggest that comprehensive management should be performed based on clinical risk factors in accordance with the patient’s wishes. Repeat FNA (rFNA) is highly recommended for BSRTC I nodules. For BSRTC III, a range of options are advised, including rFNA, rFNA with molecular testing, diagnostic lobectomy, and surveillance. Concerning BSRTC IV, the recommended approach encompasses rRNA coupled with molecular testing or diagnostic lobectomy (1, 6, 14). Therefore, approximately one-third of patients may require two FNA procedures to achieve a more precise diagnosis. Even after undergoing two FNAs, some patients still confront diagnostic ambiguity, which ultimately requires thyroidectomy. This undoubtedly increases patient exposure to invasive procedures, prolongs waiting time, and imposes a significant financial burden.

This study aimed to devise a predictive model (B-Model) for BSRTC categorization of FNA that identifies nodules that cannot be determined solely through FNA so that we can minimize ineffective punctures, maximize the diagnostic efficiency of FNA, and ultimately promote precision medicine.

2 Materials and methods

2.1 Patients

This single-center retrospective study consecutively included patients who underwent US-FNA of TNs between March 2018 and March 2023 (n = 4,210). To evaluate temporal generalizability, the dataset was divided chronologically into two cohorts: March 2018 to February 2022 (training dataset) and March 2022 to March 2023 (validation dataset). Exclusions criteria included: absence of ultrasound images, pathology-confirmed non-thyroid lesions, operator experience <3 years, multiple punctures (only the last result retained), and missing biochemical data. After exclusions, the final study population consisted of 1,573 patients in the training dataset and 672 patients in the validation dataset, with an approximate ratio of 7:3 between the two cohorts. The overall study design and patient selection flow are illustrated in Figure 1.

Figure 1
Flowchart showing patient selection and dataset development for a study on ultrasound-guided fine needle aspiration of thyroid nodules from March 2018 to March 2023. Patients were split into two time frames: March 2018-February 2022 (2,818 patients) and March 2022-March 2023 (1,392 patients). Exclusion criteria include lack of ultrasound images, non-thyroid lesions, operator inexperience, multiple punctures, and missing biochemical results. After applying these criteria, datasets were divided into a training dataset (1,573 patients) and a validation dataset (672 patients). Model development and evaluation processes are detailed, including group categorization and equations (P1, P2, P3) for analysis.

Figure 1. Study flow diagram of patient enrollment, dataset allocation, and B-Model development. Study flow diagram showing inclusion and exclusion criteria, patient enrollment, and dataset allocation into training and validation cohorts, with datasets divided chronologically (March 2018–February 2022 for training, March 2022–March 2023 for validation). Architecture of the cascaded logistic regression model (B-Model), in which three logistic regression equations were sequentially linked: Equation P1 distinguished benign from non-benign nodules (Group 1 vs. non-Group 1); Equation P2 differentiated malignant from non-malignant nodules (Group 4 vs. non-Group 4); and Equation P3 further separated follicular neoplasm from indeterminate/atypia nodules (Group 3 vs. Group 2). BSRTC, Bethesda System for Reporting Thyroid Cytopathology [Flowchart design: Boardmix Online Platform (https://boardmix.cn)].

2.2 Acquisition of clinical information and biochemical results

Clinical information and biochemical results for all patients were obtained from an electronic medical data management system. The following clinical features were recorded: patient’s age and sex. Biochemical results included free triiodothyronine (FT3), free thyroxine (FT4), thyroid-stimulating hormone (TSH), antithyroid peroxidase autoantibody (A-TPO), thyroglobulin antibody (A-TG), thyroglobulin (TG), and thyrotropin receptor antibody (TRAb). All biochemical tests were conducted within 1 month of the FNA.

2.3 Cytopathology acquisition and grouping

All cytopathologic examinations were performed by two pathologists with >8 years of thyroid cytopathology experience and subsequently reviewed by a senior pathologist with >15 years of experience. Findings were classified according to the 2023 revision of BSRTC into four groups: Group 1 (BSRTC II), Group 2 (BSRTC I/III), Group 3 (BSRTC IV), and Group 4 (BSRTC V/VI).

2.4 Ultrasound image acquisition and interpretation

Ultrasound data were retrieved from the institutional imaging system. Two US radiologists (>7 years of thyroid imaging experience) independently assessed thyroid echotexture, nodule position, capsule distance, size, volume, composition, echogenicity, echotexture, margin, shape, orientation, calcifications, posterior features, halo and Adler’s semiquantitative grading for nodule blood flow (Grades 0–3). Discrepancies were resolved by consensus with a senior radiologist (>20 years of experience).

2.5 Statistical analysis

SPSS statistical software (version 20.0; IBM Corporation, Armonk, NY, USA) was used for the statistical analysis. Baseline characteristics between the training and validation datasets were compared using the Mann–Whitney U test for continuous variables and the chi-square or Fisher’s exact test for categorical variables. Univariate analyses were further performed within the training dataset to identify factors associated with pathological classification, applying the Kruskal–Wallis test for continuous variables and the chi-square or Fisher’s exact test for categorical variables across the four groups. A p-value of <0.05 was considered statistically significant.

The prediction model (B-Model) was developed using multivariable logistic regression in SPSS based on training dataset, and it adopted a three-stage architecture as illustrated in Figure 1: (1) distinguished benign from non-benign nodules (Group 1 vs. non-Group 1) by Equation P1; (2) differentiated malignant from non-malignant nodules (Group 4 vs. non-Group 4) by Equation P2; (3) separated follicular neoplasm from indeterminate/atypia nodules (Group 3 vs. Group 2) by Equation P3. Each equation had two versions: one that included biochemical indicators as independent variables P(w), and another that did not include biochemical indicators as independent variables P(w/o). For other special circumstances, a supplementary version was designed P(c). Multivariable logistic regression analyses with backward stepwise selection were applied to identify independent variables x1-i. Based on clinical significance or published reports, we graded each risk factor, selected an appropriate grade as the baseline risk reference value, and recorded the score as 0 (1, 6, 13). β0-i is the regression coefficient of each independent variable. Using these parameters, we calculated P as the dependent variable corresponding to each risk factor classification using the following formula, where exp denotes the natural exponential function:

y = β 0 + β 1 x 1 + β 2 x 2 + + βixi
P = exp y 1 + exp y = exp β 0 + β 1 x 1 + β 2 x 2 + + βixi 1 + exp β 0 + β 1 x 1 + β 2 x 2 + + βixi

The dependent variable P in the equation above uses 0.5 as a threshold value. Similar cascaded/sequential logistic regression approaches have been applied in recent medical prediction studies to improve classification performance and manage class imbalance (1517).

The data in the validation dataset were used to select the equations and validate the performance of the prediction models. By substituting the data into previously established equations and considering the actual pathologic results as the gold standard, the sensitivity (SEN), specificity (SPE), accuracy (ACC), positive predictive rate (PPV), negative predictive rate (NPV) and area under the receiver operating characteristic curve (AUC-ROC) of each equation were evaluated. Finally, the rate of reduction in rFNAs after the B-Model implementation was calculated using the following equation:

rFNA reduction rate ( % ) = True Group 2 / 3 B Model FN True Group 2 / 3 × 100 %

(FN: True Group 2/3 cases incorrectly classified as Group 1/4 by B-Model).

3 Results

3.1 Patient characteristics

In the training dataset, the final cohort included 1,573 patients [median age: 48 years (IQR: 38–57)] of the initial 2,818 patients, after the exclusion of 1,245 patients. In the validation dataset, the final cohort included 672 patients [median age: 50 years (IQR: 40–58)] of the initial 1,392 patients, after excluding 720 patients. The patient characteristics, US features, and biochemical results are shown in Table 1. Overall, no significant statistical differences were observed between two cohorts for most baseline characteristics except three laboratory indicators (FT4, A-TG, and A-TPO; p = 0.047, <0.001, and 0.002, respectively). These differences likely reflect case-mix shifts from time-based cohort division and variability in laboratory assays.

Table 1
www.frontiersin.org

Table 1. Comparison of baseline clinical characteristics and ultrasound features of thyroid nodules between the training and validation datasets a,b.

3.2 Factors influencing pathology

In the training dataset, univariate analysis identified significant differences (p < 0.05) in 2 patient characteristics, 15 US features, and 4 biochemical markers across the groups (Table 2). Specifically, thyroid echogenicity and A-TG levels were significantly different between Groups 1 and 3 (p = 0.047 and p = 0.046, respectively) whereas FT4 levels were significantly different between Groups 2 and 4 (p = 0.032). All significant variables were included as independent covariates in the subsequent multivariate analysis.

Table 2
www.frontiersin.org

Table 2. Patient clinical characteristics and ultrasound findings of the nodules associated with grouping in the training dataset a-c.

3.3 Construction of equations P1, P2, and P3

There versions of Equation P1 were derived: P1(w/o) (χ2 = 457.323, p < 0.001), P1(w) (χ2 = 300.627, p < 0.001), and P1(c) (χ2 = 300.627, p < 0.001). P1(c) was generated by cross-validation to address the absence of biochemical indicators in P1(w). Two versions of Equation P2 were developed: P2 (w/o) (χ2 = 324.479, p < 0.001) and P2 (w) (χ2 = 198.300, p < 0.001). Two versions of Equation P3 were established: P3 (w/o) (χ2 = 148.499, p < 0.001) and P3 (w) (χ2 = 98.663, p < 0.001).

3.4 Verification of equations P1, P2, and P3

The validation results showed that among the three Equation P1 variants, P1(c) demonstrated the highest SEN (88.3%), SPE (68.0%), ACC (83.1%), PPV (89.2%), and NPV (66.1%), while maintaining comparable ROC-AUC (0.830 vs. 0.842/0.842 in P1(w/o)/P1(w), all p < 0.001). The reduced variable count (from 10 to 6) enhanced clinical utility. In the final selected Equation P1, significant predictors included markedly hypoechoic feature (OR: 10.286, 95% CI: 6.118–17.296), hypoechoic feature (OR: 4.703, 95% CI: 3.190–6.932), irregular/extra-thyroidal extension (OR: 1.705, 95% CI: 1.180–2.463), enhanced posterior features (OR: 1.853, 95% CI: 1.265–2.715), and shadowing (OR: 2.809, 95% CI: 1.220–5.031), whereas lobulated shape showed nonsignificant association (OR: 1.122, 95% CI: 0.636–1.980). Isoechoic/hyperechoic pattern, oval-to-round shape, and absent posterior features were identified as independent protective factors for benign nodules.

Among the 498 non-Group 1 cases predicted by Equation P1, Equation P2 (w) demonstrated higher SEN (80.6% vs. 74.4%) and NPV (74.0% vs. 73.3%) compared to P2 (w/o), with comparable ROC-AUC (0.735 vs. 0.759, both p = 0.000). Thus, P2 (w) was selected to reduce missed diagnoses of malignancy. Key risk factors in Equation P2 included isthmus location (OR: 4.000, 95% CI: 1.475–10.843), size > 5 mm (highest risk at 5–10 mm; OR: 3.058, 95% CI: 1.671–5.596), markedly hypoechoic/hypoechoic features (OR: 20.203, 95% CI: 5.203–81.179), taller-than-wide shape (OR: 5.165, 95% CI: 2.889–9.235), microcalcifications/complex calcifications (OR: 1.199, 95% CI: 0.626–2.296), and elevated TRAb (OR: 1.628, 95% CI: 1.119–2.368). These were independent predictors of malignant nodules.

Among the 181 cases predicted as neither Group 1 nor 4 by Equation P1 and P2, Equation P3(w) showed higher SPE (96.0% vs. 95.4%) than P3(w/o) with similar SEN (both 37.5%), ACC (93.4% vs. 92.7%), PPV (both 2.9%), NPV (70.0% vs. 72.7%), and ROC-AUC (0.814 vs. 0.837, both p = 0.000). The predictive performance of Equations P1, P2 and P3 in the validation dataset are presented in Table 3 and Figure 2.

Table 3
www.frontiersin.org

Table 3. Predictive efficacy of equations P1 (w/o), P1 (w), P1 (c), P2 (w/o), P2 (w), P3 (w/o), and P3 (w) in the validation dataset.

Figure 2
Panel A shows an ROC curve with three lines representing different conditions with AUC values 0.842, 0.842, and 0.830. Panel B shows an ROC curve with two lines, AUC 0.759 and 0.735. Panel C shows an ROC curve with two lines, AUC 0.837 and 0.814. Each panel includes a dashed diagonal reference line.

Figure 2. Receiver operating characteristic (ROC) curve analysis for three regression equations. (A) ROC curves comparing three designs (Equation P1) predicting Group 1 (BSRTC II). AUC values: P1(w/o): 0.842 (95% confidence interval [CI] 0.807–0.876), P1(W): 0.842 (95% CI 0.808–0.876), P1(C): 0.830 (95% CI 0.792–0.868). (B) ROC curves comparing two designs (Equation P2) predicting Group 4 (BSRTC V/VI). AUC values: P2(w/o): 0.759 (95% CI 0.717–0.801), P2(w): 0.735 (95% CI 0.691–0.779). (C) ROC curves comparing two designs (Equation P3) distinguishing Groups 2 (BSRTC I/III) and 3 (BSRTC IV). AUC values: P3(w/o): 0.837 (95% CI 0.650–1.000); P3(w): 0.814 (95% CI 0.599–1.000). BSRTC, Bethesda System for Reporting Thyroid Cytopathology (ROC curve plotting: SPSS 20.0, IBM; image editing: Adobe Photoshop CS5).

3.5 Overall efficacy of the B-Model

For the validation dataset, the number of cases correctly predicted by the B-Model were 115, 91, 3, and 195 in Groups 1, 2, 3, and 4, respectively. The prediction results of B-Model in the validation dataset are presented in Table 4. True Group 2/3 cases were 255, and true Group 2/3 cases incorrectly classified as Group 1/4 by B-Model was 153. The rFNA reduction rate was 40%.

Table 4
www.frontiersin.org

Table 4. The prediction results of B-Model in the validation dataset.

4 Discussion

US remains the primary imaging tool for TN risk stratification. While certain US features are associated with malignancy, most nodules still require FNA for definitive diagnosis. This study bridges this gap by integrating clinical, biochemical, and US features into a cascaded multivariable logistic regression model (B-Model) for pre-FNA prediction of BSRTC categories.

Operationally, the B-Model links three logistic regression equations in sequence. At the point of use, clinicians input the available clinical, ultrasound, and biochemical variables; the model sequentially evaluates benign vs. non-benign (Equation P1), malignant vs. non-malignant (Equation P2), and follicular neoplasm vs. indeterminate/atypia (Equation P3). A fixed threshold of 0.5 is applied at each step, ensuring that every nodule is ultimately assigned to one, and only one, predicted BSRTC group.

As illustrated in Figure 3 this structured, pre-FNA assignment provides direct guidance for patient management. In contrasts to the conventional workflow, where indeterminate cytology (BSRTC I, III, IV) often necessitate rFNA and may ultimately proceed to diagnostic lobectomy, the B-Model enables early identification of nodules likely to yield indeterminate results. Such cases can be directly triaged to FNA plus molecular testing or diagnostic lobectomy, thereby avoiding redundant punctures. In the validation dataset, this approach reduced the rFNA by 40.0%, minimizing patient trauma and conserving healthcare resources. Importantly, the B-Model theoretically requires only a single FNA per nodule, representing a significant advancement in clinical efficiency.

Figure 3
Two diagrams compare workflows for diagnosing nodules. A: Conventional Workflow starts with FNA, leading to either a definitive diagnosis or further testing for indeterminate nodules. B: Proposed Workflow uses the B-Model before FNA, refining predictions and leading directly to diagnosis or further testing if needed, enhancing efficiency.

Figure 3. Diagnostic workflows for thyroid nodular diseases. (A) Conventional workflow based on fine-needle aspiration (FNA). Indeterminate results (BSRTC I, III, IV) require repeat FNA/and molecular testing, with unresolved nodules often proceeding to diagnostic lobectomy. (B) Proposed workflow using B-Model. Nodules are stratified into predicted BSRTC II/V/VI (direct FNA), BSRTC I/III (FNA + molecular testing), and BSRTC IV (molecular testing or direct diagnostic lobectomy), providing a more streamlined and individualized management strategy. Notably, in the B-Model, each nodule theoretically requires only a single FNA, avoiding repeated punctures. FNA, fine-needle aspiration; BSRTC, Bethesda System for Reporting Thyroid Cytopathology [Flowchart design: Boardmix Online Platform (https://boardmix.cn)].

A key methodological consideration was the reduction of cumulative errors inherent to cascaded regression. To mitigate this risk, BSRTC categories with similar clinical management strategies were merged (BSRTC I with III, and BSRTC V with VI), reducing six categories to four groups (1, 8, 11, 18). This consolidation balanced statistical robustness clinical practicality and minimized propagation error. Similar sequential or multi-step logistic regression strategies have been applied successfully in other medical domains, supporting both interpretability and transparency of the modeling process (1921).

Although machine learning and deep learning methods such as convolutional neural networks (CNNs) have been increasingly applied in radiomics, they remain limited by several drawbacks (2226). First, the ‘black-box’ nature of CNNs prevents transparent identification of the imaging features driving classification, thereby reducing interpretability. Second, overfitting may arise when models are over-parameterized, which undermines generalizability (2629). In contrast, we selected a cascaded logistic regression model because it provides transparent and interpretable results that facilitate the training of junior clinicians; its sequential structure mimics a decision tree, which helps handle data imbalance while preserving a linear framework; and it also offers a necessary foundation for subsequent AI research, enabling insight into the underlying decision logic before moving toward more advanced algorithms (17, 20, 30).

Beyond diagnostic utility, the B-Model highlighted certain features that deserve further clinical attention. Equation P2 identified younger age, isthmus location, and small nodule size (particularly 5–10 mm) as predictors of malignancy. While some study have reported similar findings, one possible explanation for this observation in our cohort is the relatively high proportion of sub-centimeter and isthmus-located nodules (3134). This indicated that conventional size–risk associations, which are largely derived from nodules ≥1 cm, may not fully capture the risk pattern of microcarcinomas. As a result, the diagnosis of microcarcinomas remains challenging, particularly for junior clinicians (35). By incorporating these features, our model provides intuitive “rules of thumb” that support structured image interpretation and enhance diagnostic confidence, especially for nodules ≤1 cm. Thus, the B-Model serves not only as a decision-support system but also as a valuable teaching aid.

This study has limitations. First, although the training and validation cohorts were largely comparable, differences were observed in FT4, A-TG, and A-TPO levels. These variations likely reflect case-mix shifts from time-based cohort division and assay-related variability in laboratory testing, but they were confined to biochemical indicators and did not affect model performance. Second, as a single-center study, variability in ultrasonography and pathologic interpretation may limit generalizability. Third, collinearity and potential confounding were not explicitly tested, though variables were selected based on clinical relevance and univariable screening, and regression coefficients remained stable. Finally, while the B-Model reduced rFNA by 40% under retrospective conditions, its real-world effectiveness and operational feasibility requires validation through prospective multicenter studies.

In conclusion, we developed a cascaded logistic regression model and demonstrated its effectiveness. By integrating clinical, ultrasound, and biochemical indicators, the B-Model enabled pre-FNA prediction of BSRTC categories, thereby optimizing the diagnostic workflow for TNs, reducing unnecessary FNAs, and advancing precision medicine in TN management.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The study protocol was approved by the Ethics Committee of the First Affiliated Hospital of Dalian Medical University (Approval No. PJ-KS-KY-2023-213) and registered with the Chinese Clinical Trial Registry (Registration ID: ChiCTR2400082395). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

SG: Methodology, Investigation, Writing – review & editing, Formal analysis, Software, Visualization, Data curation, Writing – original draft. BL: Formal analysis, Data curation, Software, Writing – original draft, Writing – review & editing, Investigation. MT: Formal analysis, Validation, Writing – original draft, Writing – review & editing. YZ: Investigation, Writing – review & editing, Visualization, Writing – original draft. LW: Writing – original draft, Writing – review & editing, Investigation, Visualization. LD: Writing – original draft, Writing – review & editing, Visualization, Investigation. CS: Writing – review & editing, Writing – original draft. MH: Writing – review & editing, Writing – original draft. YC: Conceptualization, Writing – original draft, Writing – review & editing, Methodology, Project administration, Supervision.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We would like to sincerely thank Dr. Qigui Liu, formerly of the School of Public Health, Dalian Medical University, for his invaluable guidance.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1641266/full#supplementary-material

References

1. Haugen, BR, Alexander, EK, Bible, KC, Doherty, GM, Mandel, SJ, Nikiforov, YE, et al. 2015 American Thyroid Association management guidelines for adult patients with thyroid nodules and differentiated thyroid Cancer: the American Thyroid Association guidelines task force on thyroid nodules and differentiated thyroid Cancer. Thyroid. (2016) 26:1–133. doi: 10.1089/thy.2015.0020

PubMed Abstract | Crossref Full Text | Google Scholar

2. Mu, C, Ming, X, Tian, Y, Liu, Y, Yao, M, Ni, Y, et al. Mapping global epidemiology of thyroid nodules among general population: a systematic review and meta-analysis. Front Oncol. (2022) 12:1029926. doi: 10.3389/fonc.2022.1029926

PubMed Abstract | Crossref Full Text | Google Scholar

3. Vaccarella, S, Lortet-Tieulent, J, Colombet, M, Davies, L, Stiller, CA, Schüz, J, et al. Global patterns and trends in incidence and mortality of thyroid cancer in children and adolescents: a population-based study. Lancet Diabetes Endocrinol. (2021) 9:144–52. doi: 10.1016/S2213-8587(20)30401-0

Crossref Full Text | Google Scholar

4. Administration NHCotPsRoCMAaH. Guidelines for the diagnosis and treatment of thyroid carcinoma. Chin J Pract Surg. (2022) 42:1343–57.

Google Scholar

5. Shonka, DC Jr, Ho, A, Chintakuntlawar, AV, Geiger, JL, Park, JC, Seetharamu, N, et al. American head and neck society endocrine surgery section and international thyroid oncology group consensus statement on mutational testing in thyroid cancer: defining advanced thyroid cancer and its targeted treatment. Head Neck. (2022) 44:1277–300. doi: 10.1002/hed.27025

PubMed Abstract | Crossref Full Text | Google Scholar

6. Russ, G, Bonnema, SJ, Erdogan, MF, Durante, C, Ngu, R, and Leenhardt, LJETJ. European thyroid association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: the EU-TIRADS. Eur Thyroid J. (2017) 6:225–37. doi: 10.1159/000478927

PubMed Abstract | Crossref Full Text | Google Scholar

7. Tessler, FN, Middleton, WD, Grant, EG, Hoang, JK, Berland, LL, Teefey, SA, et al. ACR thyroid imaging, reporting and data system (TI-RADS): White paper of the ACR TI-RADS Committee. J Am Coll Radiol. (2017) 14:587–95. doi: 10.1016/j.jacr.2017.01.046

PubMed Abstract | Crossref Full Text | Google Scholar

8. Lee, YH, Baek, JH, Jung, SL, Kwak, JY, Kim, JH, Shin, JH, et al. Ultrasound-guided fine needle aspiration of thyroid nodules: a consensus statement by the korean society of thyroid radiology. Korean J Radiol. (2015) 16:391–401. doi: 10.3348/kjr.2015.16.2.391

PubMed Abstract | Crossref Full Text | Google Scholar

9. Cibas, ES, and Ali, SZ. The Bethesda system for reporting thyroid cytopathology. Am J Clin Pathol. (2009) 132:658–65. doi: 10.1309/AJCPPHLWMI3JV4LA

PubMed Abstract | Crossref Full Text | Google Scholar

10. Cibas, ES, and Ali, SZ. The 2017 Bethesda system for reporting thyroid cytopathology. J Am Soc Cytopathol. (2017) 6:217–22. doi: 10.1016/j.jasc.2017.09.002

PubMed Abstract | Crossref Full Text | Google Scholar

11. Ali, SZ, Baloch, ZW, Cochand-Priollet, B, Schmitt, FC, Vielh, P, and VanderLaan, PA. The 2023 Bethesda system for reporting thyroid cytopathology. J Am Soc Cytopathol. (2023) 12:319–25. doi: 10.1016/j.jasc.2023.05.005

PubMed Abstract | Crossref Full Text | Google Scholar

12. Todorovic, E, Sheffield, BS, Kalloger, S, Walker, B, and Wiseman, SMJC. Increased cancer risk in younger patients with thyroid nodules diagnosed as atypia of undetermined significance. Cureus. (2018) 10:e2348. doi: 10.7759/cureus.2348

Crossref Full Text | Google Scholar

13. Huang, J, Shi, H, Song, M, Liang, J, Zhang, Z, Chen, X, et al. Surgical outcome and malignant risk factors in patients with thyroid nodule classified as Bethesda category III. Front Endocrinol. (2021) 12:686849. doi: 10.3389/fendo.2021.686849

PubMed Abstract | Crossref Full Text | Google Scholar

14. Zhou, J, Yin, L, Wei, X, Zhang, S, Song, Y, Luo, B, et al. 2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS. Endocrine. (2020) 70:256–79. doi: 10.1007/s12020-020-02441-y

PubMed Abstract | Crossref Full Text | Google Scholar

15. Dal Negro, RW, Micheletto, C, Tognella, S, Visconti, M, Guerriero, M, and Sandri, MF. A two-stage logistic model based on the measurement of pro-inflammatory cytokines in bronchial secretions for assessing bacterial, viral, and non-infectious origin of COPD exacerbations. COPD. (2005) 2:7–16. doi: 10.1081/COPD-200050680

PubMed Abstract | Crossref Full Text | Google Scholar

16. Zhu, Y, and Fang, J. Logistic regression-based trichotomous classification tree and its application in medical diagnosis. Med Decis Making. (2016) 36:973–89. doi: 10.1177/0272989X15618658

PubMed Abstract | Crossref Full Text | Google Scholar

17. Van Holsbeke, C, Ameye, L, Testa, AC, Mascilini, F, Lindqvist, P, Fischerova, D, et al. Development and external validation of new ultrasound-based mathematical models for preoperative prediction of high-risk endometrial cancer. Ultrasound Obstet Gynecol. (2014) 43:586–95. doi: 10.1002/uog.13216

Crossref Full Text | Google Scholar

18. Chen, DW, Lang, BHH, McLeod, DSA, Newbold, K, and Haymart, MR. Thyroid cancer. Lancet. (2023) 401:1531–44. doi: 10.1016/S0140-6736(23)00020-X

Crossref Full Text | Google Scholar

19. Grover, SB, Patra, S, Grover, H, Mittal, P, and Khanna, G. Prospective revalidation of IOTA "two-step", "alternative two-step" and "three-step" strategies for characterization of adnexal masses - an Indian study focussing the radiology context. Indian J Radiol Imaging. (2020) 30:304–18. doi: 10.4103/ijri.IJRI_279_20

PubMed Abstract | Crossref Full Text | Google Scholar

20. Landolfo, C, Bourne, T, Froyman, W, Van Calster, B, Ceusters, J, Testa, AC, et al. Benign descriptors and ADNEX in two-step strategy to estimate risk of malignancy in ovarian tumors: retrospective validation in IOTA5 multicenter cohort. Ultrasound Obstet Gynecol. (2023) 61:231–42. doi: 10.1002/uog.26080

PubMed Abstract | Crossref Full Text | Google Scholar

21. Zhang, M, Li, S, Xue, M, and Zhu, Q. Two-stage classification strategy for breast cancer diagnosis using ultrasound-guided diffuse optical tomography and deep learning. J Biomed Opt. (2023) 28:086002. doi: 10.1117/1.JBO.28.8.086002

PubMed Abstract | Crossref Full Text | Google Scholar

22. Yao, J, Wang, Y, Lei, Z, Wang, K, Feng, N, Dong, F, et al. Multimodal GPT model for assisting thyroid nodule diagnosis and management. NPJ Digit Med. (2025) 8:245. doi: 10.1038/s41746-025-01652-9

PubMed Abstract | Crossref Full Text | Google Scholar

23. Wang, J, Dong, C, Zhang, YZ, Wang, L, Yuan, X, He, M, et al. A novel approach to quantify calcifications of thyroid nodules in US images based on deep learning: predicting the risk of cervical lymph node metastasis in papillary thyroid cancer patients. Eur Radiol. (2023) 33:9347–56. doi: 10.1007/s00330-023-09909-1

PubMed Abstract | Crossref Full Text | Google Scholar

24. Chang, L, Zhang, Y, Zhu, J, Hu, L, Wang, X, Zhang, H, et al. An integrated nomogram combining deep learning, clinical characteristics and ultrasound features for predicting central lymph node metastasis in papillary thyroid cancer: a multicenter study. Front Endocrinol. (2023) 14:964074. doi: 10.3389/fendo.2023.964074

PubMed Abstract | Crossref Full Text | Google Scholar

25. Yao, J, Zhang, Y, Shen, J, Lei, Z, Xiong, J, Feng, B, et al. AI diagnosis of Bethesda category IV thyroid nodules. iScience. (2023) 26:108114. doi: 10.1016/j.isci.2023.108114

PubMed Abstract | Crossref Full Text | Google Scholar

26. Peng, S, Liu, Y, Lv, W, Liu, L, Zhou, Q, Yang, H, et al. Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study. Lancet Digit Health. (2021) 3:e250–9. doi: 10.1016/S2589-7500(21)00041-8

PubMed Abstract | Crossref Full Text | Google Scholar

27. Buda, M, Wildman-Tobriner, B, Hoang, JK, Thayer, D, Tessler, FN, Middleton, WD, et al. Management of Thyroid Nodules Seen on US images: deep learning may match performance of radiologists. Radiology. (2019) 292:695–701. doi: 10.1148/radiol.2019181343

PubMed Abstract | Crossref Full Text | Google Scholar

28. Wu, X, Li, M, Cui, XW, and Xu, G. Deep multimodal learning for lymph node metastasis prediction of primary thyroid cancer. Phys Med Biol. (2022) 67:035008. doi: 10.1088/1361-6560/ac4c47

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zhou, L, Zheng, Y, Yao, J, Chen, L, and Xu, D. Association between papillary thyroid carcinoma and cervical lymph node metastasis based on ultrasonic radio frequency signals. Cancer Med. (2023) 12:14305–16. doi: 10.1002/cam4.6107

PubMed Abstract | Crossref Full Text | Google Scholar

30. Puggioni, G, Gelfand, AE, and Elmore, JG. Joint modeling of sensitivity and specificity. Stat Med. (2008) 27:1745–61. doi: 10.1002/sim.3186

PubMed Abstract | Crossref Full Text | Google Scholar

31. Cavallo, A, Johnson, DN, White, MG, Siddiqui, S, Antic, T, Mathew, M, et al. Thyroid nodule size at ultrasound as a predictor of malignancy and final pathologic size. Thyroid. (2017) 27:641–50. doi: 10.1089/thy.2016.0336

PubMed Abstract | Crossref Full Text | Google Scholar

32. Al-Hakami, HA, Alqahtani, R, Alahmadi, A, Almutairi, D, Algarni, M, and Alandejani, T. Thyroid nodule size and prediction of Cancer: a study at tertiary Care Hospital in Saudi Arabia. Cureus. (2020) 12:e7478. doi: 10.7759/cureus.7478

PubMed Abstract | Crossref Full Text | Google Scholar

33. Dong, Y, Mao, M, Zhan, W, Zhou, J, Zhou, W, Yao, J, et al. Size and ultrasound features affecting results of ultrasound-guided fine-needle aspiration of thyroid nodules. J Ultrasound Med. (2018) 37:1367–77. doi: 10.1002/jum.14472

PubMed Abstract | Crossref Full Text | Google Scholar

34. Lyu, YS, Pyo, JS, Cho, WJ, Kim, SY, and Kim, JH. Clinicopathological significance of papillary thyroid carcinoma located in the isthmus: a Meta-analysis. World J Surg. (2021) 45:2759–68. doi: 10.1007/s00268-021-06178-1

PubMed Abstract | Crossref Full Text | Google Scholar

35. Babayid, Y, Gökçay Canpolat, A, Elhan, AH, Ceyhan, K, Çorapçıoğlu, D, and Şahin, M. Should there be a paradigm shift for the evaluation of isthmic thyroid nodules? J Endocrinol Investig. (2024) 47:2225–33. doi: 10.1007/s40618-024-02313-6

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: fine-needle aspiration, logistic regression, ultrasound imaging, thyroid nodules, precision medicine

Citation: Gao S, Liu B, Tong M, Zhu Y, Wang L, Du L, Shi C, Han M and Che Y (2025) A cascaded clinical-ultrasound-biochemical model for precise prediction before thyroid nodule fine-needle aspiration biopsy. Front. Med. 12:1641266. doi: 10.3389/fmed.2025.1641266

Received: 04 June 2025; Accepted: 08 September 2025;
Published: 18 September 2025.

Edited by:

Angelika Buczyńska, Medical University of Bialystok, Poland

Reviewed by:

Jincao Yao, University of Chinese Academy of Sciences, China
Tianhan Zhou, Zhejiang Chinese Medical University, China

Copyright © 2025 Gao, Liu, Tong, Zhu, Wang, Du, Shi, Han and Che. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ying Che, Y2hleWluZ0BkbXUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.