Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Thyroid Endocrinology

Volume 16 - 2025 | doi: 10.3389/fendo.2025.1644396

Intelligent prediction of thyroid cancer in China based on GBD data and hospital electronic medical records: disease burden analysis combined with multiple machine learning models

Provisionally accepted
Lina  YangLina YangShixia  ZhangShixia ZhangXinguo  WangXinguo WangJianjun  YangJianjun YangMengya  ChenMengya Chen*
  • Shandong Provincial Third Hospital, Jinan, China

The final, formatted version of the article will be published soon.

This study aims to conduct an in-depth analysis of the disease burden pattern and future trends of thyroid cancer in China, and constructed an intelligent prediction model in combination with hospital electronic medical record data. It comprehensively reveals the disease burden trend of thyroid cancer in China, predicts the mortality rate of thyroid cancer in China, and emphasizes the causal role of high BMI as an important controllable risk factor. And provided a high-precision prediction model for benign and malignant thyroid cancer.The results show that the prevalence of thyroid cancer in China has shown a significant upward trend from 1990 to 2021, especially among women, and the peak age of onset has shifted later. The mortality rate of men is on the rise, while that of women is on the decline. The risk of thyroid cancer mortality caused by high BMI significantly increases during this period, and MR analysis confirms that high BMI increases the risk of thyroid cancer. The ARIMA model predicts that the prevalence of thyroid cancer in China will continue to increase in the next ten years, while the mortality rate will remain relatively stable. Among the machine learning models, XGBoost achieved the highest predictive accuracy and identified BMI as the most influential clinical feature in distinguishing between benign and malignant thyroid tumors.This study provides a solid scientific basis for the development of more accurate and effective strategies for the prevention, early diagnosis, and management of thyroid cancer in China and even globally, and provides a feasible path for the use of artificial intelligence assisted diagnosis in clinical practice.

Keywords: thyroid cancer, disease burden, Body Mass Index, EMR, Mendelian randomization, machine learning

Received: 10 Jun 2025; Accepted: 31 Jul 2025.

Copyright: © 2025 Yang, Zhang, Wang, Yang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Mengya Chen, Shandong Provincial Third Hospital, Jinan, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.