Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 21 November 2025

Sec. Cancer Imaging and Image-directed Interventions

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1681000

CT radiomics analysis facilitates preoperative risk stratification of central lymph node metastasis in papillary thyroid cancer: a multicenter study

Zeyong LiZeyong Li1Li ZhangLi Zhang1Shuangxin LiShuangxin Li1Guishen JiangGuishen Jiang1Zhenqi ZhangZhenqi Zhang1Dan LiuDan Liu1Hongwei LiHongwei Li2Bo XiaoBo Xiao1Jialin Yu*Jialin Yu3*
  • 1Department of Radiology, Bishan Hospital of Chongqing Medical University, Chongqing, China
  • 2Department of Radiology, The Affiliated Mianyang Hospital of Chongqing Medical University, Mianyang, China
  • 3Department of Radiology, The Second Affiliated Hospital of Army Medical University, Chongqing, China

Rationale and objectives: To develop a CT-based radiomics model to predict central lymph node metastasis (CLNM) in papillary thyroid cancer (PTC) patients and classify risk.

Materials and methods: 218 PTC patients from institution 1 were retrospectively enrolled and randomly assigned to a training set and an internal test set (ratio 7:3). Another 64 patients from institution 2 were assigned to an independent test set. Radiomics features were extracted from the arterial phase CT images of PTC. A radiomics signature (Rad-score) was developed using the least absolute shrinkage and selection operator (LASSO) method. Three models, combined model, clinical model, and Rad-score, were established by logistic regression analysis. These models were comprehensively assessed by the area under the receiver operating characteristic curve (AUC), the calibration curve, and the decision curve analysis (DCA). The improvement in predictive efficacy of the combined nomogram was evaluated using the integrated discrimination improvement index (IDI) and net reclassification improvement index (NRI). The defined threshold of the predicted risk score was set at 0.5, and the stratification effect of the combined nomogram was evaluated by subgroup analysis.

Results: The Rad-score and another three independent predictors (tumor margin, thyroid capsule state and tumor site) were integrated into a combined nomogram. The AUCs of the combined nomogram were 0.848, 0.858, and 0.840 in the training, internal test, and external test sets, respectively, which were greater than those of the clinical model and the Rad-score. The IDI and NRI were greater than 0 indicating better discriminatory accuracy of the combined nomogram than the clinical nomogram and Rad-score. The net benefit of the combined nomogram in the clinical setting was reflected in the DCA. The combined model allows for the effective stratification of patients in diverse risk subgroups.

Conclusion: Combining Rad-score and clinical predictors in an integrated model allow for more accurate prediction of CLNM in PTC patients and enables effective risk stratification.

1 Introduction

Papillary thyroid cancer (PTC) is the commonest pathological type of thyroid cancer (1, 2) and generally has a favorable prognosis, but central cervical lymph node metastasis (CLNM) occurs at an incidence of 30%-80% (3, 4), which is strongly associated with tumor recurrence and poor prognosis (5, 6). Therapeutic central lymph node dissection is therefore recommended, but controversy exists over whether prophylactic central lymph node dissection (pCND) should be performed in patients with low-risk PTC (4). pCND facilitates the detection of occult CLNM, provides accurate clinical staging, and reduces the risk of tumor recurrence and reoperation. Relatively, it increases the risk of postoperative complications and may lower the quality of life (710). In addition, owing to the high incidence and concealment of CLNM, 11.7% to 63.8% of cases with CLNM are not identified preoperatively (11). For these patients, more aggressive clinical management may be required. Therefore, accurately predicting CLNM preoperatively is crucial.

Ultrasound (US) and computed tomography (CT) are commonly used to assess CLNM among PTC patients. Nevertheless, the effectiveness of US is limited by the presence of the esophagus, trachea, and bones. The sensitivity of US for diagnosing CLNM is poor and can only affect the surgical procedure in 20% of patients (4, 12). CT provides higher sensitivity than US in CLNM evaluation, and the combination of CT and US further improves detection performance, but it is still poor, with a sensitivity of 40%-55% (1315). In addition, conventional imaging evaluations are susceptible to radiologists’ subjectivity. Moreover, patients with the same clinical features can exhibit different outcomes of CLNM. Thus, it is necessary to further explore methods to accurately predict CLNM and enable risk stratification.

Radiomics, a novel method based on medical imaging, has emerged as a noninvasive approach to assess tumor heterogeneity by quantifying the spatial distribution and gray variation of voxels (16, 17). Recently, it has attracted widespread attention and application in tumor research (18, 19). Several studies have illustrated the availability of CT image-based radiomics to predict CLNM in PTC, but mainly focused on patients with single or micro PTC (2024), which makes the clinical applicability limited because multifocal cases are quite common in clinical practice. Furthermore, the patients recruited in these studies might exhibit varying degrees of invasive risk, but no studies investigated whether radiomics could stratify CLNM risks in patients with diverse clinical characteristics. Hence, the aim of this research is to develop a preoperative nomogram utilizing CT radiomics and clinical characteristics to predict CLNM among PTC patients and to stratify the risk of CLNM for personalized clinical management.

2 Materials and methods

2.1 Patients

The research project was approved by the Ethics Committee of Bishan Hospital of Chongqing Medical University (ethical approval code: cqbykyll-20230705-10). The informed consents were waived due to the retrospective nature of this study. Patients with pathologically diagnosed PTC who underwent surgical resection at institution 1 from November 2020 to May 2024 and at institution 2 from August 2021 to December 2023 were retrospectively included in this study. This study adheres to the criteria of METhodological RadiomICs Score (METRICS) (25).

The inclusion criteria include (a) patients with PTC confirmed by pathology and no other concomitant malignancy; (b) patients who received thyroidectomy and central neck lymph node dissection. The exclusion criteria include (a) patients who had underwent radiofrequency ablation, radiotherapy, or any other antitumor treatments prior to the current operation; (b) patients without enhanced CT images within 2 weeks prior to surgery; (c) PTC was not clearly visible on CT images; (d) PTC was indistinguishable from nodular goiter or lymphocytic thyroiditis; (e) patients whose maximum tumor diameter was less than 0.4 cm; and (f) patients without complete clinical and pathological information. Ultimately, this study included 282 consecutive PTC patients. Among these, 218 patients recruited from institution 1 were randomized into a training set and an internal test set (ratio 7:3). A total of 64 patients from institution 2 served as an external test set. The patient recruitment process is depicted in Figure 1 and the study flow is depicted in Figure 2.

Figure 1
Flowchart detailing patient selection for a study on thyroid disease. Institution 1 started with 410 patients; 88 with benign lesions and 20 with other malignancies were excluded. From the remaining 302 diagnosed with PTC, further exclusions were made based on criteria like tumor visibility and data availability, leaving 218 patients. They were randomized into a training set of 153 and an internal test set of 65. Institution 2 began with 221 patients; 88 with benign lesions and 16 with other malignancies were excluded. Of the 117 patients diagnosed with PTC, exclusions were applied using similar criteria, resulting in 64 patients for an external test set.

Figure 1. Flowchart of patient recruitment process.

Figure 2
Flowchart illustrating five stages of CT image analysis. I: Image segmentation shows an arterial phase CT image with a highlighted region of interest. II: Feature extraction includes histograms for first-order features, a shape-based feature, and a textural features matrix. III: Feature selection features inter- and intraclass correlation and LASSO analysis plots. IV: Nomogram construction and evaluation includes a nomogram chart, calibration curves, and AUC analysis. V: Risk stratification displays a subgroup analysis table with odds ratios and p-values, alongside graphical summaries.

Figure 2. Workflow of the necessary steps in the present study.

2.2 Image acquisition and evaluation

CT scan procedures were conducted at institution 1 and institution 2 using one of the two 128-slice CT scanners (SOMATOM Definition Flash, SIEMENS; Discovery CT750HD, GE). The technical parameters include 120 kV, 300 mAs or automatic modulation of the tube current, 0.5 s rotation time, 0.6mm or 0.625 mm collimator width, 2.5mm or 5 mm scanning slice thickness, 512×512 matrix, and 0.625mm or 1mm reconstructed slice thickness. After the non-contrast CT scan, 70 mL or 60mL of iohexol (General Electric Pharmaceutical Co., Ltd) and 30 mL of saline were injected sequentially at a rate of 3.0 mL/s. The arterial phase was scanned with 30-second delay, the venous phase with 45-second delay, and the scan was performed from the base of the skull to the subclavian region.

Radiologists 1 (ZL with 7 years of experience in diagnostic radiology) and 2 (LZ with 9 years of experience in diagnostic radiology) independently evaluated the following semantic CT features: tumor site (unilateral or bilateral), tumor diameter (the maximum diameter on axial section), multifocal (yes or no), thyroid capsule (intrathyroid or capsule contact), tumor shape (round and irregular), tumor margin (smooth or unsmooth), tumor calcification (absent or present), and enhancement patterns (homogeneous or heterogeneous). Both radiologists made their assessments independently without knowledge of the patients’ clinical data or follow-up information, and any divergence was resolved through consultation.

2.3 Clinical characteristics

The clinical characteristics were as follows: age, sex (female and male), free triiodothyronine (FT3), free thyroxine (FT4), thyroid-stimulating hormone (TSH), thyroglobulin antibody (TGAb), and thyroid peroxidase antibody (TPOAb). The normal standards in institution 1 are as follows: TGAb (<115 IU/mL), TPOAb (< 34 IU/mL), FT3 (3.1–6.8 pmol/L), FT4 (12–22 pmol/L), and TSH (0.27–4.2 µIU/mL). The normal standards in institution 2 are as follows: TGAb (<115 IU/mL), TPOAb (0–5.61 IU/mL), FT3 (2.43–6.01 pmol/L), FT4 (9.01–19.05 pmol/L), and TSH (0.35–4.94 µIU/mL).

2.4 Imaging segmentation and radiomics feature extraction

The volume of each primary tumor on arterial phase CT images was manually delineated by Radiologist 1 using 3D Slicer (version 4.9.0; http://www.slicer.org). One month later, radiologists 1 and 2 randomly delineated the tumor of 30 patients to assess the inter- and intra-class consistency of the radiomics features. For patients with multifocal PTC, all visible tumors were delineated.

Radiomics features were extracted via PyRadiomics package (version 2.2.0), including first-order features, shape-based features, texture features, and derived features of the wavelet and Laplacian of Gaussian filters. Before extracting the features, we set normalization to true, the resampling pixel spacing to 1×1×1 mm, the bin width to 25, and the interpolator to sitkNearestNeighbor (18). The radiomics features were standardized by the z score.

2.5 Rad-score construction

A Rad-score was constructed based on the training set. Radiomics features with inter- and intra-class correlation coefficients (ICC) greater than 0.85 were screened for the subsequent univariate logistic regression analysis, and the features with a p-value less than 0.05 were selected. The logistic regression algorithm of least absolute shrinkage and selection operator (LASSO) was subsequently used to select non-zero coefficient features to construct the Rad-score.

2.6 Nomogram development and evaluation

Univariate logistic analyses were performed based on the training set data to select statistically different variables (p < 0.05), multivariate logistic analysis was subsequently performed to identify independent risk factors (p < 0.05). The variables with variance inflation factor (VIF) greater than 10 were excluded due to multicollinearity (26). The combined nomogram included the Rad-score, clinically and semantic CT independent risk factors, and the clinical model included only clinically and semantic CT independent risk factors.

The models’ performance was evaluated and compared. The predictive efficiency was assessed using the area under the receiver operating characteristic curve (AUC). The incremental predictive value of the combined model was evaluated via integrated discrimination improvement (IDI) and net reclassification improvement (NRI) (27). The concordance between predicted and actual probabilities was assessed by the calibration curve and Hosmer-Lemeshow test (H-L test), and the clinical utility was evaluated by the decision curve analysis (DCA) (28). The validation of the combined model was conducted on both the internal and external test sets.

2.7 Risk stratification of CLNM

The cutoff value for the predicted risk of the combined model was defined as 0.5. Patients with a predicted risk value > 0.5 were defined as high risk, and those with a value ≤ 0.5 were defined as low risk. Risk stratification analysis was performed based on the entire dataset, which is presented in the form of bar-risk plots and forest plots.

2.8 Statistical analysis

R software (version 4.1.3) was used for statistical analysis. The three datasets were compared using analysis of variance (ANOVA) or Kruskal-Wallis test for continuous variables and Chi-square test or Fisher’s exact test for categorical variables. LASSO analysis was conducted via the ‘glmnet’ package. Logistic regression analyses were performed with the ‘MASS’ package. The calibration curves were constructed via the ‘rms’ package. the AUC calculation and DeLong-test were performed using the ‘pROC’ package. The IDI and NRI were computed with the ‘PredictABEL’ package. The H-L test was performed with the ‘ResourceSelection’ package. The DCA was executed with the ‘rmda’ package. A two-tailed p-value less than 0.05 indicates statistical significance.

3 Results

3.1 Demographics characteristics

Ultimately, 282 patients were included in this study, including 153 in the training set (male: 36, female: 117, age range: 22–72), 65 in the internal test set (male: 12, female: 53, age range: 28–69), and 64 in the external test set (male: 15, female: 49, age range: 22–74). The clinical baseline data are presented in Table 1. The CLNM rates were 56.2% (86/153), 64.4% (42/65), and 68.8% (44/64) in the training, internal test, and external test sets (p = 0.178), respectively.

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of the training, internal test, and external test sets.

3.2 Rad-score construction

A total of 1070 radiomics features were calculated, of which 771 features with ICC > 0.85. 181 features with a p-value less than 0.05 were then selected by univariate analysis, and 5 features were ultimately screened via LASSO to generate the Rad-score (Figure 3). The arithmetic formula of the Rad-score is presented in Table 2.

Figure 3
Panel A displays a line graph showing coefficients versus log(lambda), with lines of various colors converging as lambda decreases. Panel B depicts a plot of binomial deviance against log(lambda), featuring a downward trend of red dots with error bars, stabilizing at lower lambda values. Both graphs have dashed lines marking specific log(lambda) points.

Figure 3. Radiomics features selection by using the logistic regression algorithm of LASSO. (A) Coefficients convergence plot of the radiomics features. (B) Penalty parameter selection by using 10-fold cross-validation via criteria of minimum bias variance. The dotted vertical line indicates the optimal Lambda value of 0.07145 (ln (Lambda)= -2.63875) resulting in five features with nonzero coefficients.

Table 2
www.frontiersin.org

Table 2. Radiomics features selected for Rad-score construction.

3.3 Nomogram construction

The outcomes of the logistic regression analyses, both univariate and multivariate, are presented in Table 3. The variables that were statistically significant in the univariate analysis include the Rad-score, tumor diameter, tumor site, thyroid capsule, tumor shape, tumor margin, multifocal and calcification. The independent predictors identified by multivariate analysis (p < 0.05) include the Rad-score, tumor margin, tumor site and thyroid capsule state (range of VIF: 1.00-1.05). The Rad-score, tumor margin, tumor site and thyroid capsule were integrated into a combined nomogram (Figures 4A, B). The tumor margin, tumor site and thyroid capsule were integrated into a clinical model.

Table 3
www.frontiersin.org

Table 3. Univariate and multivariate logistic regression analyses.

Figure 4
A composite image showing multiple panels related to a clinical study. Panel A: A forest plot detailing odds ratios and p-values for various tumor characteristics, with significant values marked with an asterisk. Panel B: A nomogram for predicting probabilities based on various tumor-related factors. Panels C-E: ROC curves for combined, clinical, and Rad-score models, with area under curve (AUC) values noted. Panels F-H: Calibration plots comparing predicted versus actual probability for the same models across different datasets or scenarios. Results emphasize the performance differences between the models.

Figure 4. Nomogram, AUC, and calibration curves. (A) The forest plot for multivariate logistic regression analysis. (B) The nomogram of the combined model. The AUC of the combined model, clinical model, and Rad-score in the training (C), internal test (D), and external test (E) sets. The calibration curves of the combined model, clinical model, and Rad-score in the training (F), internal test (G), and external test (H) sets.

3.4 Performance of the models

The AUCs of the combined model were 0.848 (95% confidence interval [CI]: 0.786–0.911), 0.858 (95% CI: 0.755–0.962), and 0.840 (95% CI: 0.728–0.952) in the training, internal test, and external test sets, respectively, which were greater than those of the clinical model and the Rad-score (Table 4 and Figures 4C–E). The IDI and NRI were greater than 0 indicating that the classification accuracy of the combined model was improved by combining clinical predictors and Rad-score (Table 5). The calibration curves revealed that the predicted probabilities of the combined model for CLNM were well consistent with the actual probabilities (H-L test >0.05) (Figures 4F–H). The DCA results indicated that the combined model had favorable clinical utility (Figure 5).

Table 4
www.frontiersin.org

Table 4. The AUC of models in the training set, internal test set, and external test set.

Table 5
www.frontiersin.org

Table 5. The NRI and IDI of Combined model compared with Clinical model and Rad-score.

Figure 5
Three line graphs labeled A, B, and C show standardized net benefit versus threshold probability for different models. The red line represents the combined model, blue the clinical model, green the Rad-score, gray all, and black none. Each graph shows varying performance across the threshold probabilities.

Figure 5. Decision curve analysis of the combined model, clinical model, and Rad-score in the training (A), internal test (B), and external test (C) sets.

3.5 Risk stratification

The CLNM rates were significantly different between the high- and low-risk groups of PTC patients in the training (p < 0.001; odds ratio [OR] = 13.35, 95% CI: 6.29–30.05), internal test (p < 0.001; OR = 13.71, 95% CI: 4.21–51.40), and external test (p < 0.001; OR = 9.53, 95% CI: 2.78–37.31) datasets. The subgroup analyses demonstrated that the combined model enables subgroup risk stratification of CLNM among PTC patients with different clinical characteristics. For instance, regardless of whether the tumor diameter was less than 1 cm (p < 0.001; OR = 9.84, 95% CI: 4.89–20.74) or greater than 1 cm (p = 0.005; OR = 5.93, 95% CI:1.68–20.76), or whether the PTC was intrathyroid (p = 0.002; OR = 5.80, 95% CI: 1.92–18.78) or thyroid capsule contact (p < 0.001; OR = 10.72, 95% CI:5.22–22.90), the combined model could effectively perform stratification (Figure 6).

Figure 6
Three graphs labeled A, B, and C show predicted risk for patients with a cutoff value of 0.50, with cyan for CLNM negative and red for CLNM positive. Panel D lists factors affecting risk, including sex, age, and tumor characteristics, with odds ratios and p-values. A forest plot on the right visualizes these data with confidence intervals.

Figure 6. Risk stratification of the combined model. Risk bar charts for the training (A), internal test (B), and external test (C) sets. The forest plot of the subgroup analysis for the risk stratification (D).

4 Discussion

In this study, a Rad-score based on arterial-phase CT images was recognized as a valid biomarker of CLNM in PTC and served to improve the predictive efficacy of the combined model. The combined model enables subgroup risk stratification of CLNM among PTC patients with different clinical characteristics.

For the decision-making process in clinical practice, accurate assessment of CLNM is crucial (5, 6). Consistent with several previous studies (2022), the Rad-score based on CT images serves as an independent predictor for CLNM, and helps improve the predictive efficacy of the nomogram. The reason for this is that radiomics can quantify tumor heterogeneity by characterizing the spatial distribution and gray variation of voxels, which strongly correlates with the biological aggressiveness of tumors (16, 17, 29, 30). In our study, five radiomics features were selected for Rad-score construction. These features characterize the randomness and uncertainty in the distribution of zone sizes, run lengths, and gray levels of voxels in the CT images, which reflects the spatial heterogeneity of tumors in multiple dimensions. And the close connection between the radiomics features and the biological characteristics of PTC has been revealed by previous studies at the genetic and molecular levels (21, 23, 31).

Although previous studies have suggested that radiomics analysis based on CT images is an effective method for predicting CLNM in PTC patients, relevant studies are still limited (2024). Peng et al. (20) reported the usefulness of CT radiomics in predicting CLNM for cN0 PTC patients, but the study was a single-center research with a small sample size and did not include clinical features. Several high-quality studies have further explored and validated the good predictive performance of the radiomics, while these studies mainly focused on patients with single or micro PTC and the role of the radiomics for risk stratification in subgroups of diverse clinical characteristics remains unclear (2124). In this study, we extended the use of radiomics to patients with multiple or bilateral lesions. The combined model based on Rad-score achieved good efficacy for CLNM prediction and outperforms clinical model and Rad-score. Furthermore, subgroup analysis in this study indicated that the combined model could effectively stratify CLNM risk among nearly all clinical characteristics. However, we noted that the risk stratification for multifocal and bilateral lesions lost its statistical efficacy. This is because most patients with multifocal or bilateral PTC in this study exhibited CLNM, and nearly all of them achieved accurate prediction and classification. This may also be related to selection bias or relatively limited sample size, ultimately resulting in a lack of sufficient low-risk control patients with multifocal or bilateral lesions. But this does not affect the model’s generalizability for both unifocal and multifocal PTC, as its good predictive performance was validated in an external validation group. While it is still necessary to conduct further research on multifocal PTC as an independent topic.

Furthermore, the tumor margin, thyroid capsule state and tumor site were independent predictors of CLNM in this study. Consistent with previous studies (22, 32, 33), unsmooth margin, thyroid capsule contact and bilateral PTC suggested a high risk of CLNM. An unsmooth margin reflects the more aggressive of malignant cells (32). Similarly, bilateral PTC is more aggressive than unilateral PTC, exhibiting a higher rate of lymph node metastasis and a worse prognosis (33). The sign of thyroid capsule contact is closely associated with extrathyroidal extension in PTC (34, 35), while extrathyroidal extension is a well-established prognostic biomarker for CLNM (11, 36). It is worth noting that tumor diameter, multifocal, tumor shape, and calcification were independent predictors of CLNM in previous studies (21, 23, 32, 37, 38), but these variables lost statistical significance in the multivariate analysis of this study. This is attribute to the difference of cases composition and the interaction among variables. For example, although tumor diameter greater than 1cm is generally considered as a risk factor for CLNM, a considerable portion of patients with tumor diameter less than 1cm still develop CLNM in clinical practice, and the incidence was reported to be 20.7%~62% (21). In our training dataset, the CLNM rate was 45.5% (46/101) in PTC patients with tumor diameter less than 1cm. Thus, the statistical significance of these variables were masked. And precisely the difference in patients’ characteristics and the interaction among variables further illustrate the importance and necessity of effective risk stratification for PTC patients.

There are several limitations in this study. First, this is a retrospective study, which may be associated with greater patient selection bias. Second, even though this is a multicenter study, the sample size is still relatively limited. Third, the pathological and molecular biological basis of radiomics features is not addressed in this paper. Therefore, further validation and improvement through large sample, prospective studies is necessary. It is also necessary to further reveal the molecular biology and pathological nature of radiomics.

5 Conclusion

Rad-score is a valid biomarker of CLNM in PTC patients and contributes to improving the predictive efficiency of the combined model. The combined model enables subgroup risk stratification of CLNM. This allows for personalized evaluation of CLNM risk preoperatively, thus facilitating personalized clinical management for PTC patients.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of Bishan Hospital of Chongqing Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The informed consent was waived due to the retrospective nature of this study.

Author contributions

ZL: Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing. LZ: Data curation, Investigation, Supervision, Writing – review & editing. SL: Data curation, Writing – review & editing. GJ: Data curation, Writing – review & editing. ZZ: Data curation, Writing – review & editing. DL: Data curation, Writing – review & editing. HL: Funding acquisition, Methodology, Writing – review & editing. BX: Conceptualization, Methodology, Writing – review & editing. JY: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Supervision, Validation, Writing – review & editing, Writing – original draft.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by the Social and Livelihood of Science and Technology project of Chongqing Bishan District Science and Technology Bureau (BSKJ2023006), the Scientific Research Project of Bishan Hospital of Chongqing, Bishan hospital of Chongqing Medical University (BYKY2024017), and the Scientific Research Project of Sichuan Provincial Medical and Health Promotion Association (KY2022QN0168).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

PubMed Abstract | Crossref Full Text | Google Scholar

2. Lim H, Devesa SS, Sosa JA, Check D, and Kitahara CM. Trends in thyroid cancer incidence and mortality in the United States, 1974-2013. JAMA. (2017) 317:1338–48. doi: 10.1001/jama.2017.2719

PubMed Abstract | Crossref Full Text | Google Scholar

3. Sherman SI. Thyroid carcinoma. Lancet. (2003) 361:501–11. doi: 10.1016/s0140-6736(03)12488-9

PubMed Abstract | Crossref Full Text | Google Scholar

4. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the american thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid. (2016) 26:1–133. doi: 10.1089/thy.2015.0020

PubMed Abstract | Crossref Full Text | Google Scholar

5. Lee YC, Na SY, Park GC, Han JH, Kim SW, and Eun YG. Occult lymph node metastasis and risk of regional recurrence in papillary thyroid cancer after bilateral prophylactic central neck dissection: A multi-institutional study. Surgery. (2017) 161:465–71. doi: 10.1016/j.surg.2016.07.031

PubMed Abstract | Crossref Full Text | Google Scholar

6. Chen L, Zhu Y, Zheng K, Zhang H, Guo H, Zhang L, et al. The presence of cancerous nodules in lymph nodes is a novel indicator of distant metastasis and poor survival in patients with papillary thyroid carcinoma. J Cancer Res Clin Oncol. (2017) 143:1035–42. doi: 10.1007/s00432-017-2345-2

PubMed Abstract | Crossref Full Text | Google Scholar

7. Kim SK, Woo JW, Lee JH, Park I, Choe JH, Kim JH, et al. Prophylactic central neck dissection might not be necessary in papillary thyroid carcinoma: analysis of 11,569 cases from a single institution. J Am Coll Surg. (2016) 222:853–64. doi: 10.1016/j.jamcollsurg.2016.02.001

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhao WJ, Luo H, Zhou YM, Dai WY, and Zhu JQ. Evaluating the effectiveness of prophylactic central neck dissection with total thyroidectomy for cN0 papillary thyroid carcinoma: An updated meta-analysis. Eur J Surg Oncol. (2017) 43:1989–2000. doi: 10.1016/j.ejso.2017.07.008

PubMed Abstract | Crossref Full Text | Google Scholar

9. Hughes DT, Rosen JE, Evans DB, Grubbs E, Wang TS, and Solórzano CC. Prophylactic central compartment neck dissection in papillary thyroid cancer and effect on locoregional recurrence. Ann Surg Oncol. (2018) 25:2526–34. doi: 10.1245/s10434-018-6528-0

PubMed Abstract | Crossref Full Text | Google Scholar

10. Lang BH, Ng SH, Lau LL, Cowling BJ, Wong KP, and Wan KY. A systematic review and meta-analysis of prophylactic central neck dissection on short-term locoregional recurrence in papillary thyroid carcinoma after total thyroidectomy. Thyroid. (2013) 23:1087–98. doi: 10.1089/thy.2012.0608

PubMed Abstract | Crossref Full Text | Google Scholar

11. Ma B, Wang Y, Yang S, and Ji Q. Predictive factors for central lymph node metastasis in patients with cN0 papillary thyroid carcinoma: A systematic review and meta-analysis. Int J Surg. (2016) 28:153–61. doi: 10.1016/j.ijsu.2016.02.093

PubMed Abstract | Crossref Full Text | Google Scholar

12. Zhao H and Li H. Meta-analysis of ultrasound for cervical lymph nodes in papillary thyroid cancer: Diagnosis of central and lateral compartment nodal metastases. Eur J Radiol. (2019) 112:14–21. doi: 10.1016/j.ejrad.2019.01.006

PubMed Abstract | Crossref Full Text | Google Scholar

13. Choi JS, Kim J, Kwak JY, Kim MJ, Chang HS, and Kim EK. Preoperative staging of papillary thyroid carcinoma: comparison of ultrasound imaging and CT. AJR Am J Roentgenol. (2009) 193:871–8. doi: 10.2214/ajr.09.2386

PubMed Abstract | Crossref Full Text | Google Scholar

14. Xing Z, Qiu Y, Yang Q, Yu Y, Liu J, Fei Y, et al. Thyroid cancer neck lymph nodes metastasis: Meta-analysis of US and CT diagnosis. Eur J Radiol. (2020) 129:109103. doi: 10.1016/j.ejrad.2020.109103

PubMed Abstract | Crossref Full Text | Google Scholar

15. Cho SJ, Suh CH, Baek JH, Chung SR, Choi YJ, and Lee JH. Diagnostic performance of CT in detection of metastatic cervical lymph nodes in patients with thyroid cancer: a systematic review and meta-analysis. Eur Radiol. (2019) 29:4635–47. doi: 10.1007/s00330-019-06036-8

PubMed Abstract | Crossref Full Text | Google Scholar

16. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | Crossref Full Text | Google Scholar

17. Bi WL, Hosny A, Schabath MB, Giger ML, Birkbak NJ, Mehrtash A, et al. Artificial intelligence in cancer imaging: Clinical challenges and applications. CA Cancer J Clin. (2019) 69:127–57. doi: 10.3322/caac.21552

PubMed Abstract | Crossref Full Text | Google Scholar

18. Li Z, Yu J, Li Y, Liu Y, Zhang M, Yang H, et al. Preoperative radiomics nomogram based on CT image predicts recurrence-free survival after surgical resection of hepatocellular carcinoma. Acad Radiol. (2023) 30:1531–43. doi: 10.1016/j.acra.2022.12.039

PubMed Abstract | Crossref Full Text | Google Scholar

19. Huang YQ, Liang CH, He L, Tian J, Liang CS, Chen X, et al. Development and validation of a radiomics nomogram for preoperative prediction of lymph node metastasis in colorectal cancer. J Clin Oncol. (2016) 34:2157–64. doi: 10.1200/jco.2015.65.9128

PubMed Abstract | Crossref Full Text | Google Scholar

20. Peng Y, Zhang ZT, Wang TT, Wang Y, Li CH, Zuo MJ, et al. Prediction of central lymph node metastasis in cN0 papillary thyroid carcinoma by CT radiomics. Acad Radiol. (2023) 30:1400–7. doi: 10.1016/j.acra.2022.09.002

PubMed Abstract | Crossref Full Text | Google Scholar

21. Mou Y, Han X, Li J, Yu P, Wang C, Song Z, et al. Development and validation of a computed tomography-based radiomics nomogram for the preoperative prediction of central lymph node metastasis in papillary thyroid microcarcinoma. Acad Radiol. (2024) 31:1805–17. doi: 10.1016/j.acra.2023.11.030

PubMed Abstract | Crossref Full Text | Google Scholar

22. Zhou Y, Su GY, Hu H, Tao XW, Ge YQ, Si Y, et al. Radiomics from primary tumor on dual-energy CT derived iodine maps can predict cervical lymph node metastasis in papillary thyroid cancer. Acad Radiol. (2022) 29:S222–s231. doi: 10.1016/j.acra.2021.06.014

PubMed Abstract | Crossref Full Text | Google Scholar

23. Li J, Wu X, Mao N, Zheng G, Zhang H, Mou Y, et al. Computed tomography-based radiomics model to predict central cervical lymph node metastases in papillary thyroid carcinoma: A multicenter study. Front Endocrinol (Lausanne). (2021) 12:741698. doi: 10.3389/fendo.2021.741698

PubMed Abstract | Crossref Full Text | Google Scholar

24. Wang C, Yu P, Zhang H, Han X, Song Z, Zheng G, et al. Artificial intelligence-based prediction of cervical lymph node metastasis in papillary thyroid cancer with CT. Eur Radiol. (2023) 33:6828–40. doi: 10.1007/s00330-023-09700-2

PubMed Abstract | Crossref Full Text | Google Scholar

25. Kocak B, Akinci D’Antonoli T, Mercaldo N, Alberich-Bayarri A, Baessler B, Ambrosini I, et al. METhodological RadiomICs Score (METRICS): a quality scoring tool for radiomics research endorsed by EuSoMII. Insights Imaging. (2024) 15:8. doi: 10.1186/s13244-023-01572-w

PubMed Abstract | Crossref Full Text | Google Scholar

26. O’brien RM. A caution regarding rules of thumb for variance inflation factors. Qual Quantity. (2007) 41:673–90. doi: 10.1007/s11135-006-9018-6

Crossref Full Text | Google Scholar

27. Pencina MJ, D’Agostino RB Sr, D’Agostino RB Jr, and Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat Med. (2008) 27:157–72. doi: 10.1002/sim.2929

PubMed Abstract | Crossref Full Text | Google Scholar

28. Vickers AJ and Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. (2006) 26:565–74. doi: 10.1177/0272989x06295361

PubMed Abstract | Crossref Full Text | Google Scholar

29. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | Crossref Full Text | Google Scholar

30. Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. (2014) 5:4006. doi: 10.1038/ncomms5006

PubMed Abstract | Crossref Full Text | Google Scholar

31. Dong L, Han X, Yu P, Zhang W, Wang C, Sun Q, et al. CT radiomics-based nomogram for predicting the lateral neck lymph node metastasis in papillary thyroid carcinoma: A prospective multicenter study. Acad Radiol. (2023) 30:3032–46. doi: 10.1016/j.acra.2023.03.039

PubMed Abstract | Crossref Full Text | Google Scholar

32. Tan L, Ji J, Sharen G, Liu Y, and Lv K. Related factor analysis for predicting large-volume central cervical lymph node metastasis in papillary thyroid carcinoma. Front Endocrinol (Lausanne). (2022) 13:935559. doi: 10.3389/fendo.2022.935559

PubMed Abstract | Crossref Full Text | Google Scholar

33. Wang W, Su X, He K, Wang Y, Wang H, Wang H, et al. Comparison of the clinicopathologic features and prognosis of bilateral versus unilateral multifocal papillary thyroid cancer: An updated study with more than 2000 consecutive patients. Cancer. (2016) 122:198–206. doi: 10.1002/cncr.29689

PubMed Abstract | Crossref Full Text | Google Scholar

34. Xu XQ, Zhou Y, Su GY, Tao XW, Ge YQ, Si Y, et al. Iodine maps from dual-energy CT to predict extrathyroidal extension and recurrence in papillary thyroid cancer based on a radiomics approach. AJNR Am J Neuroradiol. (2022) 43:748–55. doi: 10.3174/ajnr.A7484

PubMed Abstract | Crossref Full Text | Google Scholar

35. Seo YL, Yoon DY, Lim KJ, Cha JH, Yun EJ, Choi CS, et al. Locally advanced thyroid cancer: can CT help in prediction of extrathyroidal invasion to adjacent structures? AJR Am J Roentgenol. (2010) 195:W240–4. doi: 10.2214/ajr.09.3965

PubMed Abstract | Crossref Full Text | Google Scholar

36. Sun W, Lan X, Zhang H, Dong W, Wang Z, He L, et al. Risk factors for central lymph node metastasis in CN0 papillary thyroid carcinoma: A systematic review and meta-analysis. PloS One. (2015) 10:e0139021. doi: 10.1371/journal.pone.0139021

PubMed Abstract | Crossref Full Text | Google Scholar

37. Shi Y, Zou Y, Liu J, Wang Y, Chen Y, Sun F, et al. Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual application of SHAP. Front Oncol. (2022) 12:897596. doi: 10.3389/fonc.2022.897596

PubMed Abstract | Crossref Full Text | Google Scholar

38. Mao J, Zhang Q, Zhang H, Zheng K, Wang R, and Wang G. Risk factors for lymph node metastasis in papillary thyroid carcinoma: A systematic review and meta-analysis. Front Endocrinol (Lausanne). (2020) 11:265. doi: 10.3389/fendo.2020.00265

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: papillary thyroid cancer, radiomics, lymph node metastasis, computed tomography, nomogram

Citation: Li Z, Zhang L, Li S, Jiang G, Zhang Z, Liu D, Li H, Xiao B and Yu J (2025) CT radiomics analysis facilitates preoperative risk stratification of central lymph node metastasis in papillary thyroid cancer: a multicenter study. Front. Oncol. 15:1681000. doi: 10.3389/fonc.2025.1681000

Received: 06 October 2025; Accepted: 07 November 2025; Revised: 03 November 2025;
Published: 21 November 2025.

Edited by:

Orhan K. Oz, University of Texas Southwestern Medical Center, United States

Reviewed by:

Xin Fan, Nanjing University, China
Luchao Dong, Binzhou People’s Hospital, China

Copyright © 2025 Li, Zhang, Li, Jiang, Zhang, Liu, Li, Xiao and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jialin Yu, eXVqaWFsaW5AdG1tdS5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.