ORIGINAL RESEARCH article
Front. Endocrinol.
Sec. Thyroid Endocrinology
Volume 16 - 2025 | doi: 10.3389/fendo.2025.1644396
Intelligent prediction of thyroid cancer in China based on GBD data and hospital electronic medical records: disease burden analysis combined with multiple machine learning models
Provisionally accepted- Shandong Provincial Third Hospital, Jinan, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
This study aims to conduct an in-depth analysis of the disease burden pattern and future trends of thyroid cancer in China, and constructed an intelligent prediction model in combination with hospital electronic medical record data. It comprehensively reveals the disease burden trend of thyroid cancer in China, predicts the mortality rate of thyroid cancer in China, and emphasizes the causal role of high BMI as an important controllable risk factor. And provided a high-precision prediction model for benign and malignant thyroid cancer.The results show that the prevalence of thyroid cancer in China has shown a significant upward trend from 1990 to 2021, especially among women, and the peak age of onset has shifted later. The mortality rate of men is on the rise, while that of women is on the decline. The risk of thyroid cancer mortality caused by high BMI significantly increases during this period, and MR analysis confirms that high BMI increases the risk of thyroid cancer. The ARIMA model predicts that the prevalence of thyroid cancer in China will continue to increase in the next ten years, while the mortality rate will remain relatively stable. Among the machine learning models, XGBoost achieved the highest predictive accuracy and identified BMI as the most influential clinical feature in distinguishing between benign and malignant thyroid tumors.This study provides a solid scientific basis for the development of more accurate and effective strategies for the prevention, early diagnosis, and management of thyroid cancer in China and even globally, and provides a feasible path for the use of artificial intelligence assisted diagnosis in clinical practice.
Keywords: thyroid cancer, disease burden, Body Mass Index, EMR, Mendelian randomization, machine learning
Received: 10 Jun 2025; Accepted: 31 Jul 2025.
Copyright: © 2025 Yang, Zhang, Wang, Yang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Mengya Chen, Shandong Provincial Third Hospital, Jinan, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.