Machine learning to predict the occurrence of thyroid nodules: towards a quantitative approach for judicious utilization of thyroid ultrasonography

Liang, Qijun; Qi, Zhenhong; Li, Yike

doi:10.3389/fendo.2024.1385836

ORIGINAL RESEARCH article

Front. Endocrinol., 07 May 2024

Sec. Thyroid Endocrinology

Volume 15 - 2024 | https://doi.org/10.3389/fendo.2024.1385836

Qijun Liang¹

Zhenhong Qi¹

Yike Li²*

¹Health Management Center, Foshan Hospital of Traditional Chinese Medicine, Foshan, Guangdong, China
²Department of Otolaryngology-Head and Neck Surgery, Vanderbilt University Medical Center, Nashville, TN, United States

Introduction: Ultrasound is instrumental in the early detection of thyroid nodules, which is crucial for appropriate management and favorable outcomes. However, there is a lack of clinical guidelines for the judicious use of thyroid ultrasonography in routine screening. Machine learning (ML) has been increasingly used on big data to predict clinical outcomes. This study aims to leverage the ML approach in assessing the risk of thyroid nodules based on common clinical features.

Methods: Data were sourced from a Chinese cohort undergoing routine physical examinations including thyroid ultrasonography between 2013 and 2023. Models were established to predict the 3-year risk of thyroid nodules based on patients’ baseline characteristics and laboratory tests. Four ML algorithms, including logistic regression, random forest, extreme gradient boosting, and light gradient boosting machine, were trained and tested using fivefold cross-validation. The importance of each feature was measured by the permutation score. A nomogram was established to facilitate risk assessment in clinical settings.

Results: The final dataset comprised 4,386 eligible subjects. Thyroid nodules were detected in 54.8% of individuals (n = 2,404) within the 3-year observation period. All ML models significantly outperformed the baseline regression model, successfully predicting the occurrence of thyroid nodules in approximately two-thirds of individuals. Age, high-density lipoprotein, fasting blood glucose and creatinine levels exhibited the highest impact on the outcome in these models. The nomogram showed consistency and validity, providing greater net benefits for clinical decision-making than other strategies.

Conclusion: This study demonstrates the viability of an ML-based approach in predicting the occurrence of thyroid nodules. The findings highlight the potential of ML models in identifying high-risk individuals for personalized screening, thereby guiding the judicious use of ultrasound in this context.

Introduction

Thyroid nodules are a prevalent condition detectable in up to 67% of individuals (1–4). Although the majority of cases involve benign and asymptomatic lesions that necessitate no intervention, approximately 5-15% of them are malignant or indicative of thyroid diseases, such as hyper- and hypo-thyroidism (5–7). These thyroid conditions, especially cancers, often exhibit a favorable prognosis when identified early. Early detection allows for timely and tailored treatment strategies, thereby increasing the likelihood of a successful outcome (8–10).

Ultrasound has become a widely utilized method for thyroid examination by virtue of its non-invasive nature, fair cost-effectiveness, and broad accessibility. Equipped with linear probes that deliver high-resolution detail of the thyroid gland, ultrasound exhibits a remarkable sensitivity in detecting early-stage lesions as small as a few millimeters (10, 11). Additionally, ultrasound can unveil patterns of vascularity, characterize the nature of the mass, aid in the assessment of adjacent tissues, and offer real-time guidance for biopsy. As a result, thyroid ultrasonography has emerged as the primary tool for thyroid imaging, playing a pivotal role in the global assessment of thyroid diseases (12).

Although broadly regarded as a preferred means of assessing the thyroid, the clinical indication for thyroid ultrasonography varies considerably across the global healthcare system. In some countries like China, thyroid ultrasonography is routinely included in regular health examinations. However, the excessive use of ultrasound may place additional strain on medical resources and substantially increase healthcare costs (13–15). Conversely, in regions where initial thyroid assessment relies primarily on palpation, ultrasound is typically not indicated until a thyroid mass grows into a palpable size or causes symptoms (7, 10). Therefore, a considerable number of thyroid nodules remain unrevealed at early stages, potentially delaying diagnosis and treatment and causing suboptimal outcomes (16, 17). In this regard, the judicious utilization of ultrasound for thyroid examination has gained increasing attention from healthcare professionals (18, 19). Nevertheless, there is still a lack of clinical guidelines providing recommendations for the appropriate circumstances in which ultrasound should be prescribed for screening thyroid nodules.

Machine learning (ML) is a subset of artificial intelligence empowering computers to learn from historical data and predict outcomes for new data based on acquired knowledge. With the advent of the big data era, ML has been increasingly applied to perform predictive modeling in medicine (20–22). ML models exhibit promising performance, often matching or surpassing human judgement across diverse tasks such as disease detection, diagnosis, and risk prediction (23, 24). One notable advantage of ML over traditional statistics is its ability to function effectively with minimal assumptions about data characteristics. This makes ML particularly valuable in situations where data lack a controlled arm or involve intricate nonlinear interactions among predictor variables (25).

The objective of this study is to establish an effective method for assessing the risk of thyroid nodules. The development of thyroid nodules is associated with a mixed combination of biological, lifestyle, and environmental factors, such as age and metabolism (2, 26–28). In this study, electronic health record data, encompassing demographics, anthropometrics, and common laboratory tests, were collected from a large single-center cohort undergoing routine physical examinations including thyroid ultrasonography. ML models were constructed to predict thyroid nodules based on commonly accessible clinical features. The findings from this study not only suggest a clinically feasible approach that can guide the judicious use of thyroid ultrasonography, but also provide insight into the important factors associated with the occurrence of thyroid nodules.

Materials and methods

Study cohort

This study was conducted in full accordance with Good Clinical Practice and the Declaration of Helsinki. Ethical approval was granted by the Ethics Committees at Foshan Hospital of Traditional Chinese Medicine (document number: KY-2022-151). Data were retrospectively collected from adults who received routine health examinations at the Health Management Center of this tertiary hospital between 2013 and 2023. Thyroid ultrasound, which was included in a comprehensive health examination package designed for early detection of health issues, was performed at patients’ discretion regardless of clinical indications. Individuals were excluded from the analysis if they (1) had a history of known thyroid diseases such as hyperthyroidism, hypothyroidism, subacute thyroiditis, and Hashimoto thyroiditis; or (2) had a history of thyroid therapy, including any medication, surgery, or radiotherapy; or (3) were pregnant or lactating; or (4) had any missing data at baseline. Patients were scheduled to follow up annually after their initial visits and were observed until December 31st, 2023.

Data acquisition

Candidate independent variables included patients’ baseline characteristics (sex, age, body mass index, waist circumference, and mean arterial pressure) and laboratory test results (fasting blood glucose [FBG], triglycerides, total cholesterol, low-density lipoprotein cholesterol [LDL-C], high-density lipoprotein cholesterol [HDL-C], uric acid, alanine transaminase, aspartate aminotransferase, γ-glutamyl transpeptidase, and creatinine). These variables were selected by availability and potential association with the development of thyroid nodules (29, 30). All predictor variables were obtained at baseline during the initial visit.

The dependent variable was the presence or absence of thyroid nodules assessed through ultrasound at each visit. Thyroid nodules were considered present if any discrete lesions within the thyroid gland appeared radiologically distinct from the surrounding parenchyma. These nodules could exhibit solid, spongiform, cystic, or mixed components. Ultrasound examinations were conducted on patients in a supine position by senior sonographers with over 10 years of experience, using a B-mode high-resolution tomographic ultrasound system (Esaote, Genova, Italy). All images were reviewed by at least one independent clinical expert before final reports were generated.

Data preprocessing

Logarithmic transformation was applied to continuous variables that exhibited skewed distribution. Time to event was coded as the number of years between the initial visit and the onset of thyroid nodules or the last follow-up visit, whichever occurred earlier. The ground truth label was determined based on the presence or absence of thyroid nodules by the end of a 3-year observation period. Subjects exhibiting thyroid nodules at baseline or in less than 3 years from the initial visit were labeled as nodule-positive, while those who remained disease-free or only exhibited nodules after 3 years were labeled as nodule-negative. Individuals who were lost to follow-up or had missing ultrasound data were discarded. A fivefold cross-validation method was applied to train and test the ML models. Specifically, the dataset was split randomly into a training set (80%) and a test set (20%). This process was repeated 5 times, resulting in 5 distinct test sets. During training, a random selection of 20% of the data from the training set was employed for model validation. Data were split in a stratified fashion so that each subset preserved the class distribution of the entire dataset.
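
The labeling rule and the stratified fivefold split described above can be sketched as follows. This is a minimal standard-library illustration, not the authors' code: the function names, the treatment of an onset at exactly 3 years as negative, and the way folds are carved out are all assumptions. In practice, scikit-learn's `StratifiedKFold` and `train_test_split` cover the same ground.

```python
import random
from collections import defaultdict


def label_three_year(years_to_nodule, horizon=3.0):
    """Return 1 if a nodule was seen at baseline or before `horizon` years, else 0.

    `years_to_nodule` is None for subjects who stayed nodule-free (hypothetical encoding).
    """
    return int(years_to_nodule is not None and years_to_nodule < horizon)


def stratified_folds(labels, n_splits=5, val_fraction=0.2, seed=0):
    """Yield (train, validation, test) index lists, one triple per fold.

    Each class is shuffled once and dealt into `n_splits` test folds, so every
    subject is tested exactly once and class proportions are preserved.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    for members in by_class.values():
        rng.shuffle(members)

    for k in range(n_splits):
        train, val, test = [], [], []
        for members in by_class.values():
            held_out = members[k::n_splits]          # this fold's test subjects
            held_out_set = set(held_out)
            rest = [i for i in members if i not in held_out_set]
            rng.shuffle(rest)
            n_val = round(len(rest) * val_fraction)  # 20% of training data for validation
            test += held_out
            val += rest[:n_val]
            train += rest[n_val:]
        yield train, val, test


# Toy usage with made-up labels (55 positives, 45 negatives).
toy_labels = [1] * 55 + [0] * 45
toy_folds = list(stratified_folds(toy_labels))
```

Dealing each class into folds separately is what keeps the positive fraction roughly constant across the training, validation, and test subsets.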

Model development

Models underwent training and testing in a binary classification task to assess the 3-year risk of thyroid nodules based on baseline features. Four ML algorithms were employed, including logistic regression, random forest, extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM). Specifically, random forest is a classic ensemble method that combines the outputs of multiple parallel decision trees for making predictions (31). XGBoost comprises a group of decision trees that are weak prediction learners connected in a sequential fashion. This gradient boosting structure allows a new learner to concentrate on areas where the existing learners are performing poorly, thereby reducing the error in the entire model. LightGBM has a structure similar to that of XGBoost but uses a different strategy to split the data. These ML models represent state-of-the-art ML techniques that show remarkable outcomes in a variety of tasks (32–35). Logistic regression served as a baseline model to allow unbiased performance assessment for these ML models.
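
The sequential idea behind gradient boosting can be illustrated with a deliberately small sketch: each new one-split tree (a "stump") is fitted to the residual errors left by the learners before it. The code assumes a single feature, squared-error loss, and a fixed learning rate. It omits the regularization and split-finding refinements of XGBoost and LightGBM and is not the configuration used in this study.

```python
def fit_stump(xs, residuals):
    """Find the one-threshold split on a 1-D feature that minimizes squared error."""
    best = None
    # Dropping the largest value guarantees both sides of every split are non-empty.
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = (sum((r - left_mean) ** 2 for r in left)
                 + sum((r - right_mean) ** 2 for r in right))
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    return best[1:]


def boost(xs, ys, n_rounds=20, learning_rate=0.5):
    """Add stumps one at a time, each fitted to the current residuals."""
    base = sum(ys) / len(ys)
    predictions = [base] * len(ys)
    stumps = []
    for _ in range(n_rounds):
        residuals = [y - p for y, p in zip(ys, predictions)]  # where the ensemble is still wrong
        threshold, left_value, right_value = fit_stump(xs, residuals)
        stumps.append((threshold, left_value, right_value))
        predictions = [
            p + learning_rate * (left_value if x <= threshold else right_value)
            for x, p in zip(xs, predictions)
        ]
    return base, stumps, predictions


def mean_squared_error(ys, predictions):
    return sum((y - p) ** 2 for y, p in zip(ys, predictions)) / len(ys)


# Toy data (made up): the outcome becomes more likely as the feature grows.
toy_x = [1, 2, 3, 4, 5, 6, 7, 8]
toy_y = [0, 0, 0, 1, 0, 1, 1, 1]
toy_base, toy_stumps, toy_pred = boost(toy_x, toy_y)
```

A random forest, by contrast, would fit its trees independently on resampled data and average them, rather than letting each tree correct its predecessors.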

A grid search was performed to determine the optimal hyperparameters for each algorithm. Each ML algorithm with the best hyperparameters was trained to convergence on the training set. The cutoff threshold of each model was determined at the top left corner of the receiver operating characteristic curve (i.e., the maximum sum of sensitivity and specificity) from the validation set and applied unchanged to the test set. All training and testing sessions were done in a Python 3.8 environment using scikit-learn v1.0.2, an open-source package for ML.
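
The cutoff rule in the preceding paragraph (the threshold that maximizes the sum of sensitivity and specificity, often called Youden's index) can be sketched as below. The function name and toy scores are illustrative assumptions, and ties are resolved in favor of the lowest qualifying cutoff, a choice the paper does not specify.

```python
def youden_threshold(y_true, scores):
    """Return (cutoff, sensitivity + specificity) for the best cutoff.

    A subject is predicted positive when its score is >= cutoff.
    """
    positives = sum(y_true)
    negatives = len(y_true) - positives
    best_cutoff, best_sum = None, -1.0
    for cutoff in sorted(set(scores)):
        true_pos = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= cutoff)
        true_neg = sum(1 for y, s in zip(y_true, scores) if y == 0 and s < cutoff)
        total = true_pos / positives + true_neg / negatives
        if total > best_sum:  # strict '>' keeps the lowest cutoff on ties
            best_cutoff, best_sum = cutoff, total
    return best_cutoff, best_sum


# Toy validation-set scores (made up for illustration).
val_labels = [0, 0, 1, 0, 1, 1]
val_scores = [0.10, 0.30, 0.35, 0.40, 0.70, 0.90]
cutoff, sens_plus_spec = youden_threshold(val_labels, val_scores)
```

Choosing the cutoff on the validation set and then freezing it keeps the test-set metrics free of any tuning on test data.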

Feature importance

The importance of each predictor variable in a model was quantified by the permutation score on the test set. This score is defined as the decrease in model performance [measured by the area under the receiver operating characteristic curve (AUROC)] when all values of a given variable are randomly shuffled. Essentially, this procedure breaks the relationship between the feature and the outcome, and the extent of performance reduction indicates the reliance of the model on that particular feature. This process was iterated 50 times for each variable in a model to obtain an average score.
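
The permutation score described above can be sketched in a few lines. This is a standard-library illustration under assumed names (`predict`, `rows`, `column`) and with a toy model. scikit-learn ships an equivalent routine, `sklearn.inspection.permutation_importance`, and the paper does not state whether that routine or custom code was used.

```python
import random


def auroc(y_true, scores):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def permutation_importance(predict, rows, y_true, column, n_repeats=50, seed=0):
    """Mean drop in AUROC after shuffling one feature column `n_repeats` times."""
    rng = random.Random(seed)
    baseline = auroc(y_true, [predict(r) for r in rows])
    drops = []
    for _ in range(n_repeats):
        shuffled = [r[column] for r in rows]
        rng.shuffle(shuffled)  # break the link between this feature and the outcome
        permuted = [r[:column] + [v] + r[column + 1:] for r, v in zip(rows, shuffled)]
        drops.append(baseline - auroc(y_true, [predict(r) for r in permuted]))
    return sum(drops) / n_repeats


# Toy example: a "model" that only looks at feature 0 (hypothetical).
toy_rows = [[float(i), float((7 * i) % 10)] for i in range(10)]
toy_labels = [int(i >= 5) for i in range(10)]
toy_model = lambda row: row[0]

importance_used = permutation_importance(toy_model, toy_rows, toy_labels, column=0)
importance_ignored = permutation_importance(toy_model, toy_rows, toy_labels, column=1)
```

Shuffling a feature the model never uses leaves its predictions unchanged, so that feature's score is zero. A feature the model depends on produces a positive drop.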

Statistical analysis

Descriptive statistics were applied to characterize the baseline features of this cohort. The performance of each ML model was evaluated based on a range of metrics, including accuracy, recall, specificity, precision, F1 score, AUROC, and area under the precision-recall curve. Results were averaged over 5 cross-validation folds and are presented as means with 95% confidence intervals. Cochran’s Q test was employed to evaluate differences in predictive performance among the models, with statistical significance determined at an alpha threshold of 0.05.
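
For readers unfamiliar with Cochran's Q test, the statistic can be computed as sketched below. The sketch assumes each row records, for one subject, whether each model classified that subject correctly (1) or not (0). The paper does not spell out this layout, so it is an assumption. Under the null hypothesis of equal performance, Q approximately follows a chi-square distribution with k − 1 degrees of freedom, where k is the number of models.

```python
def cochran_q(outcomes):
    """Cochran's Q statistic for a subjects-by-models table of 0/1 outcomes.

    Returns (Q, degrees_of_freedom). Q = (k-1) * (k * sum(C_j^2) - T^2) / (k*T - sum(R_i^2)),
    where C_j are column totals, R_i are row totals, and T is the grand total.
    """
    k = len(outcomes[0])                        # number of models
    column_totals = [sum(row[j] for row in outcomes) for j in range(k)]
    row_totals = [sum(row) for row in outcomes]
    grand_total = sum(row_totals)

    numerator = (k - 1) * (k * sum(c * c for c in column_totals) - grand_total ** 2)
    denominator = k * grand_total - sum(r * r for r in row_totals)
    if denominator == 0:                        # every subject got all-0 or all-1: no disagreement
        return 0.0, k - 1
    return numerator / denominator, k - 1


# Toy table (made up): 6 subjects, 3 models; 1 = correct prediction.
toy_outcomes = [
    [1, 1, 0],
    [1, 1, 0],
    [1, 0, 0],
    [1, 1, 1],
    [1, 0, 0],
    [0, 1, 0],
]
q_statistic, dof = cochran_q(toy_outcomes)
```

The resulting Q would then be compared against the chi-square critical value at the chosen alpha (0.05 in this study).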

To facilitate clinical application, a nomogram visually depicting the risk of thyroid nodules was developed using the simpleNomo package in Python (36). A calibration curve was utilized to measure the consistency between the predicted risks and the actual outcomes. The clinical benefit of the nomogram was evaluated by decision curve analysis (37, 38). All statistical analyses were conducted using Python 3.8 and Excel (Microsoft Corporation, Redmond, WA).
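
Decision curve analysis rests on the net benefit at a chosen probability threshold, commonly defined as TP/n − (FP/n) × pt/(1 − pt). A minimal sketch of that calculation follows. The function name and toy data are assumptions for illustration, and the snippet is not taken from the study's code.

```python
def net_benefit(y_true, risks, threshold):
    """Net benefit of referring everyone whose predicted risk is >= threshold.

    True positives count as benefit; false positives are penalized by the
    odds of the threshold, threshold / (1 - threshold).
    """
    n = len(y_true)
    true_pos = sum(1 for y, r in zip(y_true, risks) if r >= threshold and y == 1)
    false_pos = sum(1 for y, r in zip(y_true, risks) if r >= threshold and y == 0)
    return true_pos / n - false_pos / n * threshold / (1 - threshold)


# Toy data (made up): outcomes and model-predicted risks for 8 subjects.
toy_outcomes = [1, 1, 1, 0, 0, 0, 1, 0]
toy_risks = [0.90, 0.80, 0.60, 0.55, 0.30, 0.20, 0.40, 0.10]

model_nb = net_benefit(toy_outcomes, toy_risks, threshold=0.5)
# "Scan everyone" strategy: treat every subject as high risk.
scan_all_nb = net_benefit(toy_outcomes, [1.0] * len(toy_outcomes), threshold=0.5)
# "Scan no one" strategy has a net benefit of 0 by definition.
```

Repeating the calculation over a grid of thresholds and plotting each strategy's net benefit gives the decision curve.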

Results

Cohort characteristics

The final dataset comprised 4,386 individuals who met the inclusion criteria for the study (Figure 1). Subjects were predominantly male (58.8%) and generally in middle age (38.2 years) during their initial visits (Table 1). A total of 54.8% of individuals (n = 2,404) exhibited thyroid nodules, either at baseline (n = 1,841) or within the 3-year observation period (n = 563).

Figure 1 A flow chart outlining the study design.

Table 1 Baseline characteristics of the dataset.

Model performance

The optimal hyperparameters for each algorithm are presented in Table 2. Overall, these models successfully predicted the outcome in approximately two-thirds of individuals (Table 3). All ML models demonstrated superior performance compared to the baseline logistic regression model (p<0.001). Despite a modest difference in overall predictability, each model revealed similar recall and specificity scores, suggesting a balanced performance in identifying patients with and without thyroid nodules.

Table 2 Best hyperparameter settings for each model.

Table 3 Performance of each model in predicting the 3-year onset of thyroid nodules.

Critical predictors

The top 10 critical features influencing the development of thyroid nodules were largely consistent across all models (Figure 2). These pivotal factors encompassed age, HDL-C, FBG, creatinine, LDL-C, triglycerides, sex, and mean arterial pressure. Notably, age exhibited the most substantial impact on the outcome, maintaining the highest rank in every model. HDL-C, creatinine, and FBG were also identified as significant predictors, consistently appearing among the top four positions in all four models.

Figure 2 The top 10 most critical features for each model. Each bar represents the mean importance score, with the black horizontal line indicating the standard error of the mean. LightGBM, Light Gradient Boosting Machine; XGBoost, Extreme Gradient Boosting. BMI, Body Mass Index; WC, Waist Circumference; MAP, Mean Arterial Pressure; FBG, Fasting Blood Glucose; TG, Triglyceride; TCH, Total Cholesterol; LDL-C, Low-Density Lipoprotein Cholesterol; HDL, High-Density Lipoprotein Cholesterol; UA, Uric Acid; ALT, Alanine Transaminase; AST, Aspartate Aminotransferase; GGT, γ-Glutamyl Transpeptidase; Cr, Creatinine.

Nomogram

The nomogram was derived from the logistic regression model incorporating all features to optimize predictability (Figure 3). It demonstrated comparable performance in predicting the occurrence of thyroid nodules in both the training and validation sets, with accuracy scores of 0.67 and 0.66, and AUROC scores of 0.72 and 0.72, respectively. The calibration curve indicated a good agreement between the nomogram’s predicted probabilities and the actual observations. Furthermore, the decision curve analysis revealed that the nomogram offered greater net benefits in the evaluation of thyroid nodules compared to strategies that rely solely on age or use an all-or-none approach, especially at a probability threshold above 0.35 (Figure 4). Additionally, an Excel spreadsheet has been provided to facilitate the clinical implementation of this model on electronic devices for assessing the risk of thyroid nodules (Supplementary Material).
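
Because the nomogram is derived from a logistic regression model, the underlying risk calculation reduces to a single formula, which is what a spreadsheet implementation would evaluate. The sketch below shows that formula. The feature names, coefficients, and intercept in the usage example are placeholders invented for illustration and are NOT the fitted values from this study, which are available in the authors' repository and Supplementary Material.

```python
import math


def predict_risk(values, coefficients, intercept, log10_features=()):
    """Logistic risk estimate: 1 / (1 + exp(-(intercept + sum(coef * x)))).

    Features listed in `log10_features` are log10-transformed first, mirroring
    the variables marked "log" on the nomogram.
    """
    linear = intercept
    for name, coef in coefficients.items():
        x = values[name]
        if name in log10_features:
            x = math.log10(x)
        linear += coef * x
    return 1.0 / (1.0 + math.exp(-linear))


# PLACEHOLDER numbers for illustration only; not the study's coefficients.
demo_coefficients = {"age": 0.03, "tg": 0.5}
demo_risk = predict_risk(
    {"age": 40.0, "tg": 10.0},
    demo_coefficients,
    intercept=-1.7,
    log10_features={"tg"},
)
```

A nomogram presents the same arithmetic graphically: each feature's contribution to the linear term is rescaled to points, and the point total maps back to a probability.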

Figure 3 A nomogram for estimating the 3-year risk of thyroid nodules. Variables marked with “log” require logarithmic transformation with a base of 10 to obtain the proper scores. FBG, Fasting Blood Glucose; TG, Triglyceride; TCH, Total Cholesterol; LDL, Low-Density Lipoprotein Cholesterol; HDL, High-Density Lipoprotein Cholesterol; UA, Uric Acid; ALT, Alanine Transaminase; AST, Aspartate Aminotransferase; GGT, γ-Glutamyl Transpeptidase; Cr, Creatinine; BMI, Body Mass Index; MAP, Mean Arterial Pressure. This nomogram, along with other pre-trained models and code, is publicly accessible at the following repository: https://github.com/huntlylee/Thyroid-nodule.

Figure 4 Calibration plots and decision curve analysis of the nomogram. Calibration plots measure the accuracy of the nomogram’s predictions by comparing the average predicted risks against the actual observed probabilities, employing a bootstrapping technique in both the training (A) and validation (B) sets. The decision curve analysis quantifies the trade-off between the risk of taking unnecessary actions (i.e. unwarranted thyroid ultrasounds) and the advantages of appropriate interventions across various threshold levels for each assessment method in the training (C) and validation (D) cohorts.

Discussion

Evaluating the risk of thyroid nodule onset holds significant benefits for tailoring monitoring strategies and guiding the appropriate use of ultrasound in this context. This study demonstrated the feasibility of utilizing ML to forecast the 3-year risk of thyroid nodules based on readily accessible baseline features. All models were able to predict the occurrence of thyroid nodules in approximately two-thirds of individuals, displaying a sensitivity of up to 66%. This performance suggests clinical potential as a routine screening tool for thyroid nodules in the general population. In contrast to recent ML studies that primarily focused on the detection, classification, or prognosis of thyroid malignancies (39–42), this study concentrated on the onset of thyroid nodules, a more common clinical task that lies upstream of all these later-stage models and carries public health significance. The current dataset also represents a sizable cohort with ample observation time and reliable study endpoints. Employing a fivefold cross-validation approach and a comprehensive set of performance metrics allows unbiased evaluation of the ML models. Quantifying the feature importance also provides insightful findings regarding thyroid nodule pathogenesis for future mechanistic research. This study reinforces the potential of artificial intelligence to revolutionize healthcare in the era of big data. With its capacity to generate timely and reliable predictions in intricate tasks, ML is poised to become an integral part of routine clinical practice, notably advancing personalized medicine.

Although ultrasound has been proven to be a cost-effective approach for thyroid assessment, it is not routinely prescribed during health checkups in many countries. Instead, the thyroid is primarily evaluated through physical examination, with further assessments determined by physicians’ judgment. Palpation relies on clinicians’ experience and skill, which results in inter-operator variability and suboptimal sensitivity (3, 43, 44). Consequently, most thyroid nodules may only be identified after progressing to palpable sizes or causing perceivable symptoms, leading to a potential delay in diagnosis and treatment of the underlying conditions. Although certain risk factors, such as age, obesity, and smoking, are known to be associated with the occurrence of thyroid nodules, there is currently no specific guideline that offers instruction for screening based on these factors. This study addresses this gap by developing risk stratification models. These quantitative models demonstrate favorable performance in forecasting the 3-year risk of thyroid nodules based on common clinical features, suggesting an evidence-based approach to clinical decision-making that is deemed less biased than subjective judgment. First, these ML models may aid in estimating the need for further thyroid assessment during routine health examinations. A timely ultrasound is expected to allow detection of tiny nodules before they increase in size or cause symptoms, potentially facilitating the early diagnosis and treatment of thyroid diseases and improving outcomes. Second, these models can also spare low-risk individuals from unnecessary assessments, thus avoiding overtreatment or excess health spending. In China, with an annual estimate of 495 million individuals undergoing routine physical examination and a detection rate of 20% for thyroid nodules (45–47), these models are anticipated to substantially reduce thyroid ultrasonography by at least 285 million cases and save 4 billion dollars in costs per year.

The clinical viability of this quantitative approach is supported by the commendable model performance in identifying both nodule-positive and -negative patients, in addition to the net benefits of the nomogram over alternative strategies. These models can be developed into applications or integrated into electronic medical record systems for rapid risk assessment and clinical triage. In this study, all pre-trained models, along with a detailed instruction manual, have been shared in a public repository (https://github.com/huntlylee/Thyroid-nodule), allowing straightforward inference on user-provided data through simple command-line inputs. To aid in the clinical adoption of this model, a nomogram and a user-friendly spreadsheet have been provided, designed to support physicians across a range of ML expertise.

Identifying factors associated with disease onset is essential before considering focused treatment or preventive strategies. In line with prior studies, this research also identifies several patient characteristics, such as age, HDL-C, and FBG, as being associated with the development of thyroid nodules. Firstly, age is widely recognized as a significant risk factor for thyroid nodules (29, 48), potentially due to age-related oxidative stress and the involvement of vascular endothelial growth factor (49, 50). Evidence suggests that older adults are more likely to develop thyroid malignancies of high-risk histology, highlighting the need for early detection of thyroid nodules (26). Secondly, there is a noted prevalence of thyroid nodules in individuals with metabolic disorders like diabetes and hyperlipidemia (51, 52), which is corroborated by the identification of FBG, LDL-C, HDL-C, and triglycerides as critical predictors in this study. The prevailing theory suggests that metabolic disorders could promote thyroid cell growth through interactions between insulin and thyroid stimulating hormone (53, 54). Metabolic disorders might also trigger oxidative stress, causing cellular damage and affecting genomic stability in the thyroid (49, 50, 55–57). Additionally, creatinine levels have been found to be associated with thyroid nodules (29, 58), although the cause remains unclear. Creatinine might also act as a proxy marker for sex or other risk factors that exhibit sex discrepancies. Further research is required to elucidate the underlying mechanisms of these factors in the pathogenesis of thyroid nodules.

While previous studies have yielded inconclusive findings regarding the advantage of ML over traditional statistics in performing different clinical tasks (34, 35, 59, 60), it is generally believed that complex ML models often require big data to achieve optimal performance (61). This study involved a substantial dataset of more than 4,000 subjects with a balanced class distribution, where all three ML models showed significant improvements over the standard logistic regression, albeit to a modest extent. This finding reinforces the potential of ML in predicting medical or epidemiological outcomes when large datasets are available. Nonetheless, classic regression approaches may continue to play a pivotal role in these tasks by virtue of the model simplicity, which can mitigate bias and overfitting in scenarios with smaller or imbalanced datasets (35, 62). Moreover, the superior explainability of simple regression models over complex ML algorithms may make them more suitable for predicting clinical outcomes, as explainable equations may facilitate clinical adaptation. In this study, the logistic regression model was converted into a nomogram and a simple formula that can be implemented in an Excel spreadsheet, enabling effective utilization of this method by clinicians without the need for ML expertise.

Several limitations should be acknowledged in this study. Firstly, only a monocentric dataset was obtained. While this dataset comprises a substantial cohort with reliable ground truths, it may still be susceptible to certain biases related to race, region and sex. Notably, there was a minor gender disparity within the cohort, which may be attributed to a higher exclusion rate of female participants who are generally more susceptible to thyroid conditions. Hence, the generalizability of these models might necessitate further validation on a wider patient population. Secondly, only a limited number of variables were employed for prediction. Although these variables were chosen for their relevance and data availability, there are other risk factors not currently accessible in the database, such as smoking, family history, and radiation exposure. Including these features is likely to enhance the model performance. Thirdly, only a handful of models were tested in this early feasibility study, although they are representative of cutting-edge ML techniques. Given the rapid evolution in both medical research and data science, future studies will likely assess new ML approaches as they become available. Lastly, the critical features identified may indicate associations rather than causation due to the study’s retrospective nature. Although they offer insights for further investigation into disease pathogenesis, it will be essential to conduct mechanistic and prospective studies to understand the causal relationships and their roles in the development of thyroid nodules.

Future research should aim to address these limitations and facilitate model deployment in clinical settings. For example, additional variables linked to the onset of thyroid nodules will be collected to improve model performance. A broader dataset will be compiled from multiple independent hospitals to evaluate and enhance the generalizability of these models. This approach may also be extended to forecast other clinically significant outcomes, such as the trajectory, malignancy, or prognosis of thyroid nodules. These ML models will be integrated into existing electronic health record systems with user-friendly interfaces to facilitate human-machine interaction and enable efficient decision-making. Efforts are underway to collect more data and test these models in prospective studies. The ultimate objective of this research line is to establish a robust artificial intelligence system that can effectively support clinicians in the evaluation and management of thyroid diseases.

Conclusion

In conclusion, this study showed the feasibility of ML in predicting the occurrence of thyroid nodules. Age, HDL-C, FBG, and creatinine levels were identified as the critical factors associated with the outcome. These findings pave the way for a quantitative approach in guiding the judicious use of ultrasound for personalized screening. Future research will involve conducting external validation and enhancing the model by incorporating additional predictor variables.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation. All models and code are available via: https://github.com/huntlylee/Thyroid-nodule.

Ethics statement

The studies involving humans were approved by the Ethics Committees at Foshan Hospital of Traditional Chinese Medicine. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

QL: Conceptualization, Data curation, Funding acquisition, Investigation, Visualization, Writing – original draft. ZQ: Project administration, Resources, Writing – review & editing. YL: Conceptualization, Data curation, Formal analysis, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study is supported by the Science and Technology Bureau of Foshan City (Grant number: 2220001004516 to QL).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2024.1385836/full#supplementary-material

References

1. Gharib H, Papini E, Paschke R, Duick DS, Valcavi R, Hegedüs L, et al. American association of clinical endocrinologists, associazione medici endocrinologi, and European thyroid association medical guidelines for clinical practice for the diagnosis and management of thyroid nodules. Endocr Pract. (2010) 16:1–43. doi: 10.4158/10024.GL

2. Ross DS. Overview of thyroid nodule formation (2023). Available online at: https://www.uptodate.com/contents/overview-of-thyroid-nodule-formation.

3. Ezzat S, Sarti DA, Cain DR, Braunstein GD. Thyroid incidentalomas. Prevalence by palpation and ultrasonography. Arch Intern Med. (1994) 154:1838–40. doi: 10.1001/archinte.154.16.1838

4. Brander A, Viikinkoski P, Nickels J, Kivisaari L. Thyroid gland: US screening in a random adult population. Radiology. (1991) 181:683–7. doi: 10.1148/radiology.181.3.1947082

5. Tamhane S, Gharib H. Thyroid nodule update on diagnosis and management. Clin Diabetes Endocrinol. (2016) 2:17. doi: 10.1186/s40842-016-0035-7

6. Durante C, Grani G, Lamartina L, Filetti S, Mandel SJ, Cooper DS. The diagnosis and management of thyroid nodules: A review. JAMA. (2018) 319:914–24. doi: 10.1001/jama.2018.0898

7. Cooper DS, Doherty GM, Haugen BR, Kloos RT, Lee SL, Mandel SJ, et al. Revised American thyroid association management guidelines for patients with thyroid nodules and differentiated thyroid cancer. Thyroid. (2009) 19:1167–214. doi: 10.1089/thy.2009.0110

8. Roman S, Mehta P, Sosa JA. Medullary thyroid cancer: early detection and novel treatments. Curr Opin Oncol. (2009) 21:5–10. doi: 10.1097/CCO.0b013e32831ba0b3

9. Nabhan F, Dedhia PH, Ringel MD. Thyroid cancer, recent advances in diagnosis and therapy. Int J Cancer. (2021) 149:984–92. doi: 10.1002/ijc.33690

10. Alexander EK, Doherty GM, Barletta JA. Management of thyroid nodules. Lancet Diabetes Endocrinol. (2022) 10:540–8. doi: 10.1016/S2213-8587(22)00139-5

11. Sipos JA. Overview of the clinical utility of ultrasonography in thyroid disease (2021). Available online at: https://www.uptodate.com/contents/overview-of-the-clinical-utility-of-ultrasonography-in-thyroid-disease.

12. Grani G, Sponziello M, Pecce V, Ramundo V, Durante C. Contemporary thyroid nodule evaluation and management. J Clin Endocrinol Metab. (2020) 105:2869–83. doi: 10.1210/clinem/dgaa322

13. Janovsky CCPS, Bittencourt MS, de Novais MAP, Maciel RMB, Biscolla RPM, Zucchi P. Thyroid cancer burden and economic impact on the Brazilian public health system. Arch Endocrinol Metab. (2018) 62:537–44. doi: 10.20945/2359-3997000000074

14. Liu Y, Lai F, Long J, Peng S, Wang H, Zhou Q, et al. Screening and the epidemic of thyroid cancer in China: An analysis of national representative inpatient and commercial insurance databases. Int J Cancer. (2021) 148:1106–14. doi: 10.1002/ijc.33298

15. Cheng F, Xiao J, Shao C, Huang F, Wang L, Ju Y, et al. Burden of thyroid cancer from 1990 to 2019 and projections of incidence and mortality until 2039 in China: findings from global burden of disease study. Front Endocrinol (Lausanne). (2021) 12. doi: 10.3389/fendo.2021.738213

16. Krumeich LN, Kelz RR. Editorial: time to surgery and thyroid cancer survival in the United States. Ann Surg Oncol. (2021) 28:3459. doi: 10.1245/s10434-021-10101-2

17. Lopez B, Fligor SC, Randolph GW, James BC. Inequities in thyroid cancer care: populations most at risk for delays in diagnosis and treatment. Thyroid. (2023) 33:724–31. doi: 10.1089/thy.2022.0723

18. Edwards MK, Iñiguez-Ariza NM, Singh Ospina N, Lincango-Naranjo E, Maraka S, Brito JP. Inappropriate use of thyroid ultrasound: a systematic review and meta-analysis. Endocrine. (2021) 74:263–9. doi: 10.1007/s12020-021-02820-z

19. Kim DW, Lee EJ, Lee JH. Role of ultrasound diagnosis in assessing and managing thyroid nodules with inadequate cytology. Am J Roentgenol. (2011) 197:1213–9. doi: 10.2214/AJR.11.6418

20. Shah NH, Milstein A, Bagley SC. Making machine learning models clinically useful. JAMA. (2019) 322:1351–2. doi: 10.1001/jama.2019.10306

21. Obermeyer Z, Emanuel EJ. Predicting the future — Big data, machine learning, and clinical medicine. N Engl J Med. (2016) 375:1216–9. doi: 10.1056/NEJMp1606181

22. Yasmin F, Shah SMI, Naeem A, Shujauddin SM, Jabeen A, Kazmi S, et al. Artificial intelligence in the diagnosis and detection of heart failure: the past, present, and future. Rev Cardiovasc Med. (2021) 22:1095–113. doi: 10.31083/j.rcm2204121

23. De Fauw J, Ledsam JR, Romera-Paredes B, Nikolov S, Tomasev N, Blackwell S, et al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat Med. (2018) 24:1342–50. doi: 10.1038/s41591-018-0107-6

24. Wang YM, Li Y, Cheng YS, He ZY, Yang JM, Xu JH, et al. Deep learning in automated region proposal and diagnosis of chronic otitis media based on computed tomography. Ear Hear. (2020) 41:669–77. doi: 10.1097/AUD.0000000000000794

25. Bzdok D, Altman N, Krzywinski M. Statistics versus machine learning. Nat Methods. (2018) 15:233–4. doi: 10.1038/nmeth.4642

26. Ospina NS, Papaleontiou M. Thyroid nodule evaluation and management in older adults: A review of practical considerations for clinical endocrinologists. Endocr Pract. (2021) 27:261–8. doi: 10.1016/j.eprac.2021.02.003

27. Demetriou E, Fokou M, Frangos S, Papageorgis P, Economides PA, Economides A. Thyroid nodules and obesity. Life (Basel Switzerland). (2023) 13(6):1292. doi: 10.20944/preprints202305.0882.v1

28. Rezzonico J, Rezzonico M, Pusiol E, Pitoia F, Niepomniszcze H. Introducing the thyroid gland as another victim of the insulin resistance syndrome. Thyroid. (2008) 18:461–4. doi: 10.1089/thy.2007.0223

29. Moon JH, Hyun MK, Lee JY, Shim JI, Kim TH, Choi HS, et al. Prevalence of thyroid nodules and their associated clinical parameters: a large-scale, multicenter-based health checkup study. Korean J Intern Med. (2018) 33:753–62. doi: 10.3904/kjim.2015.273

30. Liu J, Wang C, Tang X, Fu S, Jing G, Ma L, et al. Correlation analysis of metabolic syndrome and its components with thyroid nodules. Diabetes Metab Syndr Obes. (2019) 12:1617–23. doi: 10.2147/DMSO

31. Breiman L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324

32. Zhang Y, Zhang Z, Wei L, Wei S. Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer. Front Public Heal. (2022) 10. doi: 10.3389/fpubh.2022.1008137

33. Peng Y, Zhao S, Zeng Z, Hu X, Yin Z. LGBMDF: A cascade forest framework with LightGBM for predicting drug-target interactions. Front Microbiol. (2023) 13. doi: 10.3389/fmicb.2022.1092467

34. Zhang D, Li Y, Kalbaugh CA, Shi L, Divers J, Islam S, et al. Machine learning approach to predict in-hospital mortality in patients admitted for peripheral artery disease in the United States. J Am Heart Assoc. (2022) 11(20):e026987. doi: 10.1161/JAHA.122.026987

35. Zhang Y, Zhang L, Li B, Ye T, Zhang Y, Yu Y, et al. Machine learning to predict occult metastatic lymph nodes along the recurrent laryngeal nerves in thoracic esophageal squamous cell carcinoma. BMC Cancer. (2023) 23(1):197. (In print). doi: 10.1186/s12885-023-10670-3

36. Hong H, Hong S. simpleNomo: A python package of making nomograms for visualizable calculation of logistic regression models. Heal Data Sci. (2023) 3:0023. doi: 10.34133/hds.0023

37. Vickers AJ, Van Calster B, Steyerberg EW. Net benefit approaches to the evaluation of prediction models, molecular markers, and diagnostic tests. BMJ. (2016) 352:i6. doi: 10.1136/bmj.i6

38. Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. (2006) 26:565–74. doi: 10.1177/0272989X06295361

39. Luong G, Idarraga AJ, Hsiao V, Schneider DF. Risk stratifying indeterminate thyroid nodules with machine learning. J Surg Res. (2022) 270:214–20. doi: 10.1016/j.jss.2021.09.015

40. Peng S, Liu Y, Lv W, Liu L, Zhou Q, Yang H, et al. Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study. Lancet Digit Heal. (2021) 3:e250–9. doi: 10.1016/S2589-7500(21)00041-8

41. Yu J, Deng Y, Liu T, Zhou J, Jia X, Xiao T, et al. Lymph node metastasis prediction of papillary thyroid carcinoma based on transfer learning radiomics. Nat Commun. (2020) 11:4807. doi: 10.1038/s41467-020-18497-3

42. Liu W, Wang S, Ye Z, Xu P, Xia X, Guo M. Prediction of lung metastases in thyroid cancer using machine learning based on SEER database. Cancer Med. (2022) 11:2503–15. doi: 10.1002/cam4.4617

43. Tarigan TJE, Anwar BS, Sinto R, Wisnu W. Diagnostic accuracy of palpation versus ultrasound-guided fine needle aspiration biopsy for diagnosis of malignancy in thyroid nodules: a systematic review and meta-analysis. BMC Endocr Disord. (2022) 22:181. doi: 10.1186/s12902-022-01085-5

44. Pinchera A. Thyroid incidentalomas. Horm Res. (2007) 68 Suppl 5:199–201. doi: 10.1159/000110625

45. Li Y, Teng D, Ba J, Chen B, Du J, He L, et al. Efficacy and safety of long-term universal salt iodization on thyroid disorders: epidemiological evidence from 31 provinces of mainland China. Thyroid. (2020) 30:568–79. doi: 10.1089/thy.2019.0067

46. Dong X, Li Y, Xie J, Li L, Wan Z, Kang Y, et al. The prevalence of thyroid nodules and its factors among Chinese adult women: A cross-sectional study. Front Endocrinol (Lausanne). (2022) 13:967380. doi: 10.3389/fendo.2022.967380

47. Liang Y, Li X, Wang F, Yan Z, Sang Y, Yuan Y, et al. Detection of thyroid nodule prevalence and associated risk factors in southwest China: A study of 45,023 individuals undergoing physical examinations. Diabetes Metab Syndr Obes. (2023) 16:1697–707. doi: 10.2147/DMSO.S412567

48. Kwong N, Medici M, Angell TE, Liu X, Marqusee E, Cibas ES, et al. The influence of patient age on thyroid nodule formation, multinodularity, and thyroid cancer risk. J Clin Endocrinol Metab. (2015) 100:4434–40. doi: 10.1210/jc.2015-3100

49. Zaki SM, Mohamed EA, Abdel Fattah S, Abdullah H, Kaszubowska L. Age-associated functional morphology of thyroid and its impact on the expression of vimentin, cytokeratins and VEGF. The role of nigella in refinement. Folia Histochem Cytobiol. (2018) 56:159–71. doi: 10.5603/FHC.a2018.0015

50. Yang H, Fang B, Wang Z, Chen Y, Dong Y. The timing sequence and mechanism of aging in endocrine organs. Cells. (2023) 12(7):982. doi: 10.3390/cells12070982

51. Wan Z, Li Y, Dong X, Kang Y, Luo J, Wang J, et al. Influence of metabolic syndrome and lifestyle factors on thyroid nodules in Chinese adult men: a cross-sectional study. Eur Thyroid J. (2023) 12(6):e230168. doi: 10.1530/ETJ-23-0168

52. Zhang F, Teng D, Tong N, Wang G, Li Y, Yu X, et al. Gender-specific associations between metabolic disorders and thyroid nodules: A cross-sectional population-based study from China. Thyroid. (2022) 32:571–80. doi: 10.1089/thy.2021.0686

53. Tsatsoulis A. The role of insulin resistance/hyperinsulinism on the rising trend of thyroid and adrenal nodular disease in the current environment. J Clin Med. (2018) 7(3):37. doi: 10.3390/jcm7030037

54. Yildirim Simsir I, Cetinkalp S, Kabalak T. Review of factors contributing to nodular goiter and thyroid carcinoma. Med Princ Pract. (2020) 29:1–5. doi: 10.1159/000503575

55. Franchini F, Palatucci G, Colao A, Ungaro P, Macchia PE, Nettore IC. Obesity and thyroid cancer risk: an update. Int J Environ Res Public Health. (2022) 19(3):1116. doi: 10.3390/ijerph19031116

56. Faam B, Ghadiri AA, Ghaffari MA, Totonchi M, Khorsandi L. Comparing oxidative stress status among Iranian males and females with malignant and non-malignant thyroid nodules. Int J Endocrinol Metab. (2021) 19(1):e105669. doi: 10.5812/ijem

57. Santos MCS, Louzada RAN, Souza ECL, Fortunato RS, Vasconcelos AL, Souza KLA, et al. Diabetes mellitus increases reactive oxygen species production in the thyroid of male rats. Endocrinology. (2013) 154:1361–72. doi: 10.1210/en.2012-1930

58. Bolat H, Erdoğan A. Benign nodules of the thyroid gland and 25-hydroxy-vitamin D levels in euthyroid patients. Ann Saudi Med. (2022) 42:83–8. doi: 10.5144/0256-4947.2022.83

59. Bai Q, Su C, Tang W, Li Y. Machine learning to predict end stage kidney disease in chronic kidney disease. Sci Rep. (2022) 12:8377. doi: 10.1038/s41598-022-12316-z

60. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. (2019) 110:12–22. doi: 10.1016/j.jclinepi.2019.02.004

61. Crown WH. Potential application of machine learning in health outcomes research and some statistical cautions. Value Health. (2015) 18:137–40. doi: 10.1016/j.jval.2014.12.005

62. Desai RJ, Wang SV, Vaduganathan M, Evers T, Schneeweiss S. Comparison of machine learning methods with traditional models for use of administrative claims with electronic medical records to predict heart failure outcomes. JAMA Netw Open. (2020) 3(1):e1918962. doi: 10.1001/jamanetworkopen.2019.18962

Keywords: machine learning, thyroid nodule, ultrasonography, precision medicine, artificial intelligence, nomograms, logistic models, random forest

Citation: Liang Q, Qi Z and Li Y (2024) Machine learning to predict the occurrence of thyroid nodules: towards a quantitative approach for judicious utilization of thyroid ultrasonography. Front. Endocrinol. 15:1385836. doi: 10.3389/fendo.2024.1385836

Received: 13 February 2024; Accepted: 15 April 2024;
Published: 07 May 2024.

Edited by:

Yuting Huang, Mayo Clinic Florida, United States

Reviewed by:

Yichen Wang, University of Pennsylvania, United States
Zhenyang Zhao, University of Michigan, United States
Pingjiang Ge, Guangdong Provincial People’s Hospital, China
Jiale Wang, Mayo Clinic Florida, United States

Copyright © 2024 Liang, Qi and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yike Li, yike.li.1@vumc.org
