ORIGINAL RESEARCH article
Front. Endocrinol.
Sec. Pediatric Endocrinology
Development of Clinical Prediction Models for Girls with Central Precocious Puberty in China: Machine learning Approaches and Interpreting with Explainable Artificial Intelligence
Provisionally accepted- 1Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, China., Xuzhou, Jiangsu, China., China
- 2The Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu Province, China
- 3Anhui Provincial Children’s Hospital, Hefei, Anhui Province, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background and objective: The incidence of central precocious puberty (CPP) in girls has been steadily increasing. However, stimulation tests remain costly, time-consuming, and painful. Despite efforts to simplify these tests or identify alternative biomarkers, consensus has not been achieved. Our objective is to develop a machine learning (ML) model to predict the response of girls suspected of CPP to stimulation tests before injection of GnRHa. Methods: We conducted a retrospective review of 320 girls suspected of CPP who visited the Affiliated Hospital of Xuzhou Medical University and Anhui Provincial Children’s Hospital from 2019 to 2023. We utilized 11 variables (including essential clinical data, baseline hormone data, and pituitary length) to construct the first basic clinical model. Additionally, we developed a second model by incorporating 6 additional variables extracted from pelvic ultrasound reports into the first basic clinical model, aiming to evaluate the incremental value of pelvic ultrasound in the prediction of CPP. Furthermore, we utilized 3 explainable artificial intelligence (XAI) methods to elucidate the optimal model. Results: Among 320 girls suspected of CPP, 188 (58.75%) were diagnosed with CPP, while 132 (41.25%) were diagnosed with non-CPP. Across 9 ML models, the multilayer perceptron (MLP) emerged as the optimal model, delivering outstanding performance. Moreover, pelvic ultrasound demonstrated incremental value in predicting CPP, with uterine length contributing more significantly to prediction model than uterine volume. Consistent findings from Shapley additive explanations (SHAP), local interpretable model-agnostic explanations (LIME), and Break Down methods revealed that difference between bone age and chronological age (BA-CA), chronological age (CA), and basal luteinizing hormone (LH) were the three most important features. Conclusion: Our study is the first to integrate LIME, SHAP, and Break Down methods for interpreting the CPP prediction model. The MLP exhibited overall good performance, serving as a pre-screening tool to reduce unnecessary procedures and facilitate clinicians in making more informed decisions. LIME, SHAP, and Break Down demonstrated robust performance as XAI methods.
Keywords: 5-Fold cross-validation, central precocious puberty, Explainable artificial intelligence, gonadotropin-releasing hormone agonist, machine learning, Prediction model
Received: 24 Feb 2025; Accepted: 31 Dec 2025.
Copyright: © 2025 Zhu, Xue, Zhou, Qin, Zhao, Wu, Li and Jin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Yingliang Jin
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
