- 1Department of Ultrasound, First Hospital of Shanxi Medical University, Taiyuan, China
- 2College of Medical Imaging, Shanxi Medical University, Taiyuan, China
- 3Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Objective: This study aims to predict cervical lymph node metastasis in papillary thyroid carcinoma (PTC) patients with high accuracy. To achieve this, we introduce a novel deep learning model, DualSwinThyroid, leveraging multi-modal ultrasound imaging data for prediction.
Materials and methods: We assembled a substantial dataset consisting of 3652 multi-modal ultrasound images from 299 PTC patients in this retrospective study. The newly developed DualSwinThyroid model integrates various ultrasound modalities and clinical data. Following its creation, we rigorously assessed the model’s performance against a separate testing set, comparing it with established machine learning models and previous deep learning approaches.
Results: Demonstrating remarkable precision, DualSwinThyroid achieved an AUC of 0.924 and an 96.3% accuracy on the test set. The model efficiently processed multi-modal data, pinpointing features indicative of lymph node metastasis in thyroid nodule ultrasound images. It offers a three-tier classification that aligns each level with a specific surgical strategy for PTC treatment.
Conclusion: DualSwinThyroid, a deep learning model designed with multi-modal ultrasound radiomics, effectively estimates the degree of cervical lymph node metastasis in PTC patients. In addition, it also provides early, precise identification and facilitation of interventions for high-risk groups, thereby enhancing the strategic selection of surgical approaches in managing PTC patients.
1 Introduction
Papillary Thyroid Carcinoma (PTC) is the most common type of thyroid cancer, constituting 85-90% of malignant thyroid tumors. Although PTC progresses slowly with a generally favorable prognosis, the onset of cervical lymph node metastasis in patients can significantly increase the risk of recurrence and distant metastasis, ultimately leading to potential mortality. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer emphasized that the number of cervical lymph node metastases is a crucial factor in assessing the recurrence risk of thyroid cancer (1). An increase in the number of lymph node metastases corresponds to a poorer clinical outcome for the patient, with a consequent reduction in the 5-year survival rate (2–4).
In the rapidly advancing realm of medical imaging, two principal ultrasonography techniques have emerged as key in predicting cervical lymph node metastasis in thyroid cancer. The first method meticulously examines the primary tumor, while the second method assesses suspicious lymph nodes. The assessment of suspicious lymph nodes to gauge the aggressiveness of thyroid cancer is a direct strategy, but the intricate anatomy of the thyroid gland, coupled with imaging technology limitations, turns preoperative ultrasonography of cervical lymph nodes into a complex task that frequently obstructs the swift identification of suspicious nodes. Therefore, the bulk of research, informed by a pragmatic approach, is derived from studies of the primary tumor, investigating attributes closely linked to the spread of cancer to cervical lymph nodes (5, 6).
The recent technological renaissance has fostered the ascent of radiomics, transcending traditional paradigms of medical imaging analysis. Algorithms meticulously mine imaging data, unveiling hidden information and enabling a comprehensive evaluation of tumor heterogeneity. This forms a foundational framework for the development of precise diagnostic and treatment models, reinforcing the pillars of clinical decision-making. Deep learning stands at the forefront of this innovation, threading significant breakthroughs in computer vision into the fabric of Artificial Intelligence (AI). This technological wonder is extensively applied in medical imaging for tasks such as segmentation, localization, detection, and image fusion, thus elevating the diagnostic precision for pathological changes. Deep learning differs from traditional machine learning—which requires intensive image preprocessing and manual feature identification—by skillfully utilizing raw pixel values from images as input and iteratively refining its models through training (7). However, to our knowledge, there has yet to be any research employing multi-modal ultrasonic radiomics data to develop corresponding deep learning models for evaluating the lymph node status of primary lesions (8, 9).
This study presents DualSwinThyroid, a deep learning classification model meticulously designed for evaluating the invasiveness of thyroid nodules. The model’s ‘Dual’ structure processes multi-modal data, and its ‘Swin’ element utilizes the Swin-Transformer’s (10) advanced image processing capabilities for thorough analysis and extraction of features from ultrasound images. ‘Thyroid’ in its name highlights the model’s specific application to thyroid nodule assessment. DualSwinThyroid is predicated on the Swin-Transformer’s solid framework and is finely calibrated to harness not just its imaging strengths but also to Integrate clinical and ultrasonic data characteristics. Such integration sharpens diagnostic accuracy and enhances efficiency, leading to more targeted and evidence-based treatment plans for patients. Additionally, DualSwinThyroid transcends the conventional binary classification of nodules, introducing a tripartite categorization that corresponds with specific therapeutic approaches and provides a clear framework for complex clinical decision-making processes, aiding in the navigation of diverse treatment alternatives.
2 Methods
2.1 Patients
Approval for this retrospective study was obtained from the Ethics Committee of the First Hospital of Shanxi Medical University and the informed consent requirement was waived. Data were gathered from patients who underwent thyroid ultrasonography and subsequent surgical treatment in this hospital from July 2021 to June 2023. Through a rigorous selection process guided by predefined inclusion and exclusion criteria, 299 patients were enrolled, encompassing 339 thyroid nodules captured in 3652 ultrasound images. The postoperative pathological findings served to divide the data into three classes: Class I denoted no lymph node metastasis, Class II included cases with up to five metastatic lymph nodes, and Class III involved cases with more than five metastatic lymph nodes. Figure 1 displays the types of data images collected.
During data analysis, stringent inclusion and exclusion criteria were applied. The inclusion criteria consisted of: (1) patients who had a total or subtotal thyroidectomy with cervical lymph node dissection; (2) nodules with confirmed surgical pathological diagnoses of papillary thyroid carcinoma; (3) patients who underwent routine ultrasonography and elastography within two weeks before surgery, obtaining clear, complete, and original DICOM images. The exclusion criteria were: (1) patients who received radiofrequency ablation, radiation therapy, or chemotherapy before surgery; (2) ultrasound images of the target tumor marred by artifacts; (3) patients with other malignant tumors; and (4) patients with prior thyroid surgery.
2.2 Data collection
All ultrasound scans were performed using the Canon Aplio i800 Color Doppler Ultrasound Diagnostic Device, equipped with an i18LX5 linear wide-band probe with a frequency range of 5-18MHz and real-time ultrasound elastography technology. During routine examinations, patients were positioned supine to expose their necks for scanning. The physician conducted a thorough examination of the thyroid’s bilateral lobes and isthmus, focusing on capturing the echogenicity, dimensions, and vascular flow within the gland. Additionally, for each thyroid nodule, precise records were made of its location, size, composition, echogenicity, shape, margins, presence of hyper-echoic areas, and vascular flow characteristics.
In the ultrasonic elastography examination, the region of interest on the elastographic image was configured to include the entire thyroid lesion and adjacent normal tissue, typically extending 2-3 times beyond the nodule’s size. Patients were asked to hold their breath during the procedure. The physician then positioned the probe perpendicularly on the skin, exerted steady pressure with minimal vibration, and attentively monitored the color patterns displayed on the elastographic images, ensuring to archive the pertinent images.
2.3 Data processing
For the purposes of this study, data from 299 patients were included and subsequently randomized into training and testing datasets at a 7:3 ratio. The image data for each patient followed the same categorization protocol. The DualSwinThyroid model underwent training on this dataset and its performance was benchmarked against the Swin Transformer image processing model and the MLP clinical and ultrasound information classification model (11). The construction, training, and prediction of the models were carried out using Python (version 3.8.0). Statistical analysis of the data and calculations of relevant variables were performed using SPSS (version 26.0, IBM Corporation, Armonk, New York).
2.4 Statistical analysis
Normality and variance homogeneity tests were conducted for patient characteristics like age and nodule size. Subsequently, the Chi-Squared Test test was utilized to evaluate differences in ultrasonic and clinical features across patient cohorts. Multivariate ordinal logistic regression analysis was applied to determine independent risk factors influencing the extent of PTC lymph node metastasis, with statistical significance established at a two-tailed P-value of less than 0.05. ROC curves were then constructed based on these identified independent risk factors.
2.5 Model design and training
This study utilized data from 299 patients to train the model, with 209 allocated to the training set and 90 to the test set. Additionally, of the 3652 nodule images, 2556 were used for training and 1096 for testing, with the pathological outcomes as the labels for training. It is important to note that the test data were not used during the model’s training phase. The Adam optimizer was employed to train the model across 500 epochs, with a batch size of 16 and an initial learning rate set at 0.0001. The computational work was performed on a platform equipped with an i7-13900F CPU and an RTX 4080TI GPU, and the network architecture was developed on Pytorch 2.0.0+cuda1.18. For more information on the training process, please see Figure 2.
2.5.1 DualSwinThyroid model
In this research, the DualSwinThyroid model serves as a deep learning instrument for evaluating thyroid nodule invasiveness and risk levels, detailed in Figure 3. It utilizes the Swin-Transformer framework to adeptly process longitudinal sectional and transverse sectional, color Doppler ultrasound and elastographic images. The model operates through three primary image processing stages to extract features deeply and classify invasiveness with precision. Data, once categorized, enters the Data Fusion block, integrating with clinical data vetted through univariate analysis (p-value <0.05). After normalization, this combined data passes through a fully connected layer with a ReLu activation function, proceeds to a subsequent fully connected layer, and culminates in generating predictive probabilities for each category using the Softmax function. Significantly, DualSwinThyroid is capable of processing multiple ultrasound images from the same nodule to produce diagnostic predictions.
2.5.2 Single modality model
A Multi-Layer Perceptron (MLP) model with eight neurons was developed for the classification of clinical and ultrasound data, employing ReLu as the activation function and Softmax for the output layer’s classification purpose. Cross-entropy served as the loss function for optimization. In the training process, the model processed clinical and ultrasound data as inputs, with the features selected based on univariate analyses that produced p-values less than 0.05.
For the classification of images, the Swin-Transformer model was trained, utilizing its Swin-base as the pre-trained model. The input image data were primarily drawn from Regions of Interest (ROI) delineated by physicians during detailed scans and ultrasonic elastography of the thyroid’s bilateral lobes and isthmus.
2.6 Evaluation metrics
After training each model, Receiver Operating Characteristic (ROC) curves were plotted, and Area Under the Curve (AUC) values were computed for performance evaluation. Algorithms processed ultrasound images of thyroid nodules to accurately determine the extent of metastasis. The test set was then used to further assess the model’s predictive capabilities, including an examination of predicted outcomes and an evaluation of predictive accuracy. Graphs depicting the evolution of prediction accuracy and loss throughout the training were plotted for visual representation, as illustrated in Figure 4.
Figure 4 Delineates the training curves of three distinct models. The figure illustrates the training trajectories for multi-modal fusion, single image data, and single text data, each presented with three distinct curves: training loss, training accuracy, and ROC alongside AUC values. The training loss curve gradually stabilizes in the depicted region, contrasting the training accuracy curve, which exhibits significant oscillations. Comprehensively, the ROC curve and AUC values furnish insights into the model's true positive and false positive rates at varying thresholds, notably demonstrating that the multimodal model secures the highest AUC value.
3 Results
3.1 Data processing results
From July 2021 to June 2023, data from 504 patients who received thyroid ultrasound examinations were initially collected. After rigorous screening, exclusions were applied as follows: 114 cases for the absence of surgery, 20 cases for pathology results not confirming Papillary Thyroid Carcinoma (PTC), 3 cases for missing lymph node dissection records, and 68 cases for incomplete image data. Consequently, the study was narrowed down to 299 cases, which included 339 thyroid nodules and 3652 ultrasound images, along with 19 clinical and ultrasound features, detailed in Figure 5.
3.2 Statistical analysis results
Univariate analysis assessed clinical and ultrasonic features linked to the degree of cervical lymph node metastasis in Papillary Thyroid Carcinoma (PTC), examining 339 nodules. These were divided into three groups by metastasis count: 158 nodules in Group I, 120 in Group II, and 61 in Group III. The analysis revealed statistically significant differences in various factors, including age, gender, nodule location 2, maximum diameter, boundary, homogeneity, longitudinal-transverse ratio, halo sign, type of calcification, calcification ratio, capsular invasion, blood flow signal, and ultrasound-detectable suspicious lymph nodes. Each factor had P-values below 0.05, which are presented in Table 1.
Multivariate analysis using ordered logistic regression determined that variables like age over 45 years, a maximum nodule diameter of 1.0 cm or less, male gender, nodule position at the upper and lower poles, specific calcification types, and ultrasound-visible suspicious lymph nodes were statistically significant. Specifically, being over 45 years old, having nodules with a maximum diameter of 1.0 cm or less, microcalcification, coarse calcification, and a longitudinal-transverse ratio of 1 or less were identified as protective factors. In contrast, being male, nodules at the upper and lower poles, and suspicious lymph nodes on ultrasound were established as independent risk factors. These findings are elaborated in Table 2.
Table 2 Ordered regression analysis of features associated with the extent of cervical lymph node metastasis in PTC.
3.3 Model performance results
The development of the DualSwinThyroid model incorporated a 5-fold cross-validation approach to optimize hyperparameters. The model’s predictive performance was thoroughly evaluated, tracking not just the loss curve but also deriving accuracy metrics from the test set. The ROC curve was plotted, and the AUC value was calculated, as shown in Figure 4 (Training Curve). Additionally, essential metrics like sensitivity and specificity were analyzed.
In training the single-modality models, training accuracy and loss were also monitored, as shown in Figure 4. A significant observation was the Swin-Transformer’s classification results varying considerably with different image data types in the test set. Color Doppler ultrasound and elastography images yielded higher classification accuracy than transverse and longitudinal images, as evidenced by superior accuracy in Figure 6 (Classification Performance).
Figure 6 The prediction accuracy of different data in the corresponding model. The evaluation indicates that multi-modal data performs optimally within the test set. Specifically, when considering single modality data, the accuracy of Doppler ultrasound images and elastography images within the test set surpasses that of transverse and longitudinal images.
4 Discussion
In Papillary Thyroid Carcinoma (PTC), cervical lymph node metastasis is a common occurrence, with an estimated 40-90% of PTC patients potentially experiencing such metastasis (12, 13). In 2015, the American Thyroid Association issued management guidelines for adult thyroid nodules and differentiated thyroid cancer patients, highlighting the degree of cervical lymph node metastasis as a significant indicator of thyroid cancer recurrence. To curb the rapid progression of PTC, a crucial step is early identification of tumors with metastatic potential in clinically diagnosed PTC patients. Surgical removal is regarded as the primary treatment modality (14). For patients at risk of suspected lymph node metastasis, prophylactic central or lateral neck dissection is recommended. When performing prophylactic lymph node dissection and total thyroidectomy, the increased risk of postoperative complications, especially hypoparathyroidism, must be considered. The study by Henry et al. (15) reported that central neck lymph node dissection could escalate the risk of permanent hypoparathyroidism from 0% to 4%. However, research by Nixon I J et al. indicates that reoperation post PTC recurrence is relatively challenging, significantly elevating surgical complications and impacting the quality of life of patients (16). Hence, early identification of cervical lymph node metastasis in PTC not only aids in clinically selecting the appropriate surgical plan and scope, reducing the occurrence of postoperative complications, but also in minimizing recurrence risks, averting secondary surgeries, and proactively improving prognosis. Nonetheless, the sensitivity of solely relying on ultrasonographic characteristics to indicate lymph node metastasis remains insufficient, with some lymph node metastases exhibiting unremarkable ultrasonic features - a common scenario in clinical practice. Studies also revealed a mere 33% sensitivity of ultrasound in detecting central lymph node metastasis (17). Preoperative neck ultrasonography is inevitably influenced by inter-observer variability, thereby rendering the diagnostic outcome lacking in certain objectivity.
This study conducted a thorough investigation into the independent risk factors for cervical lymph node metastasis in Papillary Thyroid Carcinoma (PTC), analyzing 3,652 multi-modal ultrasonographic images and data from 299 patients. The results underscore the need for increased attention to cervical lymph node metastasis and recurrence risk, particularly in male patients, those aged 45 or younger, with thyroid cancer nodules larger than 1cm, nodules at the lower pole and center of the gland, presence of calcification, and suspicious lymph nodes detected by ultrasonography. To improve the accuracy of predicting cervical lymph node metastasis in PTC, the study introduced the DualSwinThyroid Model, a deep learning tool combining ultrasonographic images with clinical data. The model achieved a high AUC of 0.924, with an accuracy rate of 96.3%, and commendable sensitivity and specificity, highlighting its potential as an effective non-invasive assessment tool for lymph node involvement in PTC. Model performance is detailed in Table 3.
Earlier research has typically hinged on machine learning and statistical methods, analyzing image, clinical, and ultrasonic features in isolation, without an in-depth approach (18–23). Luchen Chang et al. employed deep learning, alongside ultrasonic and clinical data, to develop a composite nomogram for predicting central lymph node metastasis in PTC patients, using grayscale images for radiomics and six related features. However, the absence of multi-modal ultrasonic data and additional correlating factors slightly impeded the predictive accuracy (22). Fu Li et al. achieved promising results in forecasting cervical lymph node metastasis using conventional machine learning models (19). These models, however, sometimes fail to discern complex patterns in large datasets, particularly with fluctuating data distributions or noise, which can hinder their generalization. Additionally, they depend on manual feature processing, which can compromise performance if not done meticulously. Notably, most current studies focus on merely detecting cervical lymph node metastasis, which affects the precision of choosing surgical methods and predicting patient outcomes. Details on comparative studies can be found in Table 4.
The DualSwinThyroid model introduces a refined thyroid nodule management method by categorizing them into three different levels to enhance clinical decision-making accuracy. For Class I nodules without lymph node metastasis, unilateral lobectomy and isthmectomy are advised as the primary surgical path. In Class II cases with less than five metastatic lymph nodes, the model recommends total or near-total thyroidectomy and ‘selective’ cervical lymph node dissection, in conjunction with clinical judgment. The term ‘selective’ cervical lymph node dissection, as used here, refers to a process where the surgeon, integrating systematic preoperative evaluation with intraoperative biopsy pathology, determines the necessity of lymph node dissection and identifies the specific regions for dissection, thereby minimizing the risk of secondary surgical interventions. For Class III nodules with more than five metastatic lymph nodes, total or near-total thyroidectomy and extensive cervical lymph node dissection are advised, which is more extensive than standard elective central neck dissection, especially in cases of widespread metastasis. Physicians expand the scope of dissection based on a comprehensive assessment of other clinical indicators. This recommendation balances comprehensive treatment and the risk of overtreatment. The DualSwinThyroid model’s multi-tiered classification not only demonstrates diagnostic accuracy but also conforms to contemporary medical guidelines, enhancing patient-centered surgical decision-making.
Throughout the development of our model, several key findings emerged. Initially, it was apparent that the accuracy of predictions using single-modality data alone was relatively low, with image data outperforming clinical and ultrasonic data, highlighting the importance of imaging in predictive models. In the training dataset, accuracy confidence was higher for transverse and longitudinal images than for color Doppler and elastography. Yet, this pattern shifted in the testing phase, where color Doppler and elastography images achieved greater accuracy, revealing variable performance across imaging types during different phases. Moreover, integrating four types of images—transverse, longitudinal, color Doppler, and elastography—with clinical and ultrasonic data provided the most accurate predictions for individual cases. Adding more images did not improve but rather slightly reduced the predictive performance. These findings have been instrumental in fine-tuning the model and determining the most effective imaging techniques to increase accuracy, setting a course for future advancements in model enhancement.
This study successfully developed a multi-modal ultrasound radiomics deep learning model for predicting cervical lymph node metastasis in PTC. It aims to leverage machine learning to identify features related to metastasis, thereby improving surgical decision-making in PTC. However, the work is retrospective and exploratory in nature. Moreover, its scope is limited by its single-institution design, which might not fully represent a wider patient population. Currently, the final pathological assessment of the extent of lymph node metastasis still relies on the thoroughness and accuracy of surgical removal. While the prediction model offers a potential foundation for clinical decision support, its benefits are yet to be confirmed in prospective clinical trials. Therefore, the anticipated advantages, such as reducing lymph node dissections, financial burden, and supporting emerging practitioners, are promising but need further validation in various clinical settings. Future efforts will focus on expanding data sources and rigorously validating the model’s clinical utility in different institutions.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by First Hospital of Shanxi Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because retrospective nature of the study: if the study is retrospective, analyzing previously collected data, written informed consent may not be required as the data is often de-identified and does not pose a risk to the privacy of the individuals involved.
Author contributions
QL: Funding acquisition, Investigation, Methodology, Writing – original draft, Writing – review & editing. YL: Data curation, Methodology, Software, Visualization, Writing – original draft. YH: Investigation, Supervision, Writing – original draft. WF: Investigation, Supervision, Validation, Writing – review & editing. JL: Supervision, Validation, Writing – original draft. LL: Conceptualization, Formal analysis, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing. TL: Formal analysis, Supervision, Validation, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. Research reported in this publication was supported by Key Research and Development Program of Science and Technology Department of Shanxi Province (201903D321190) and Ministry of Human and Social Affairs Program for the selection of Science and Technology activities for Returned Overseas Scholar (2016-366).
Acknowledgments
We would like to express our gratitude to the members of our research team for their invaluable contributions to this study. We are deeply thankful to Key Research and Development Program of Science and Technology Department of Shanxi Province for providing financial support for this research.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the american thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid (2016) 26(1):1–133. doi: 10.1089/thy.2015.0020
2. Wang J, Zhanghuang C, Jin L, Zhang Z, Tan X, Mi T, et al. Development and validation of a nomogram to predict cancer-specific survival in elderly patients with papillary thyroid carcinoma: a population-based study. BMC Geriatr (2022) 22(1):736. doi: 10.1186/s12877-022-03430-8
3. Hay ID, Hutchinson ME, Gonzalez-Losada T, McIver B, Reinalda ME, Grant CS, et al. Papillary thyroid microcarcinoma: a study of 900 cases observed in a 60-year period. Surgery (2008) 144(6):980–7. doi: 10.1016/j.surg.2008.08.035
4. Pisanu A, Reccia I, Nardello O, Uccheddu A. Risk factors for nodal metastasis and recurrence among patients with papillary thyroid microcarcinoma: differences in clinical relevance between nonincidental and incidental tumors. World J Surg (2009) 33(3):460–8. doi: 10.1007/s00268-008-9870-8
5. Zhao H, Huang T, Li H. Risk factors for skip metastasis and lateral lymph node metastasis of papillary thyroid cancer. Surgery (2019) 166(1):55–60. doi: 10.1016/j.surg.2019.01.025
6. Gao X, Luo W, He L, Cheng J, Yang L. Predictors and a prediction model for central cervical lymph node metastasis in papillary thyroid carcinoma (cN0). Front Endocrinol (Lausanne) (2022) 12:789310. doi: 10.3389/fendo.2021.789310
7. Truhn D, Müller-Franzes G, Kather JN. The ecological footprint of medical AI. Eur Radiol (2023) 15. doi: 10.1007/s00330-023-10123-2
8. Peng S, Liu Y, Lv W, Liu L, Zhou Q, Yang H, et al. Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study. Lancet Digit Health (2021) 3(4):e250–9. doi: 10.1016/S2589-7500(21)00041-8
9. Wu X, Li M, Cui XW, Xu G. Deep multi-modal learning for lymph node metastasis prediction of primary thyroid cancer. Phys Med Biol (2022) 67(3). doi: 10.1088/1361-6560/ac4c47
10. Liu L, Lin Y, Cao Y, Hu H, Wei Y, Zhang Z, et al. Swin transformer: hierarchical vision transformer using shifted windows. Proceedings of the IEEE/CVF international conference on computer vision (2021). doi: 10.48550/arXiv.2103.14030
11. Zhang M, Wen G, Zhong J, Chen D, Wang C, Huang X, et al. MLP-like model with convolution complex transformation for auxiliary diagnosis through medical images. IEEE J BioMed Health Inform (2023) 27(9):4385–96. doi: 10.1109/JBHI.2023.3292312
12. Tong Y, Li J, Huang Y, Zhou J, Liu T, Guo Y, et al. Ultrasound-based radiomic nomogram for predicting lateral cervical lymph node metastasis in papillary thyroid carcinoma. Acad Radiol (2021) 28(12):1675–84. doi: 10.1016/j.acra.2020.07.017
13. Dadafarin S, Rodríguez TC, Carnazza MA, Tiwari RK, Moscatello A, Geliebter J. MEG3 expression indicates lymph node metastasis and presence of cancer-associated fibroblasts in papillary thyroid cancer. Cells (2022) 11(19):3181. doi: 10.3390/cells11193181
14. Sturgeon C, Yang A, Elaraj D. Surgical management of lymph node compartments in papillary thyroid cancer. Surg Oncol Clin N Am (2016) 25(1):17–40. doi: 10.1016/j.soc.2015.08.013
15. Henry JF, Gramatica L, Denizot A, Kvachenyuk A, Puccini M, Defechereux T. Morbidity of prophylactic lymph node dissection in the central neck area in patients with papillary thyroid carcinoma. Langenbecks Arch Surg (1998) 383(2):167–9. doi: 10.1007/s004230050111
16. Nixon IJ, Simo R, Newbold K, Rinaldo A, Suarez C, Kowalski LP, et al. Management of invasive differentiated thyroid cancer. Thyroid (2016) 26(9):1156–66. doi: 10.1089/thy.2016.0064
17. Zhao H, Li H. Meta-analysis of ultrasound for cervical lymph nodes in papillary thyroid cancer: Diagnosis of central and lateral compartment nodal metastases. Eur J Radiol (2019) 112:14–21. doi: 10.1016/j.ejrad.2019.01.006
18. Yu J, Deng Y, Liu T, Zhou J, Jia X, Xiao T, et al. Lymph node metastasis prediction of papillary thyroid carcinoma based on transfer learning radiomics. Nat Commun (2020) 11(1):4807. doi: 10.1038/s41467-020-18497-3
19. Li F, Pan D, He Y, Wu Y, Peng J, Li J, et al. Using ultrasound features and radiomics analysis to predict lymph node metastasis in patients with thyroid cancer. BMC Surg (2020) 20(1):315. doi: 10.1186/s12893-020-00974-7
20. Zhan J, Zhang LH, Yu Q, Li CL, Chen Y, Wang WP, et al. Prediction of cervical lymph node metastasis with contrast-enhanced ultrasound and association between presence of BRAFV600E and extrathyroidal extension in papillary thyroid carcinoma. Ther Adv Med Oncol (2020) 12:1758835920942367. doi: 10.1177/1758835920942367
21. Li WH, Yu WY, Du JR, Teng DK, Lin YQ, Sui GQ, et al. Nomogram prediction for cervical lymph node metastasis in multifocal papillary thyroid microcarcinoma. Front Endocrinol (Lausanne) (2023) 14:1140360. doi: 10.3389/fendo.2023.1140360
22. Chang L, Zhang Y, Zhu J, Hu L, Wang X, Zhang H, et al. An integrated nomogram combining deep learning, clinical characteristics and ultrasound features for predicting central lymph node metastasis in papillary thyroid cancer: A multicenter study. Front Endocrinol (Lausanne) (2023) 14:964074. doi: 10.3389/fendo.2023.964074
Keywords: papillary thyroid carcinoma (PTC), cervical lymph node metastasis, multi-modal ultrasound imaging, deep learning, DualSwinThyroid model
Citation: Liu Q, Li Y, Hao Y, Fan W, Liu J, Li T and Liu L (2024) Multi-modal ultrasound multistage classification of PTC cervical lymph node metastasis via DualSwinThyroid. Front. Oncol. 14:1349388. doi: 10.3389/fonc.2024.1349388
Received: 04 December 2023; Accepted: 29 January 2024;
Published: 15 February 2024.
Edited by:
Akira Sugawara, Tohoku University, JapanReviewed by:
Xiaolong Li, Fudan University, ChinaJared Shenson, Orlando Health Cancer Institute, United States
Copyright © 2024 Liu, Li, Hao, Fan, Liu, Li and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ting Li, bGl0aW5nQGJtZS5wdW1jLmVkdS5jbg==; Li-ping Liu, bGl1bGlwaW5nMTYwMEBzaW5hLmNvbQ==