
ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Reproduction

Volume 16 - 2025 | doi: 10.3389/fendo.2025.1544724

This article is part of the Research Topic: Lifestyle and Environmental Factors and Human Fertility

Machine learning algorithm based on combined clinical indicators for the prediction of infertility and pregnancy loss

Provisionally accepted
Rui Zhang, Yuanbing Guo, Xiaonan Zhai, Juan Wang, Xiaoyan Hao, Liu Yang, Lei Zhou, Jiawei Gao, Jiayun Liu*
  • Xijing Hospital, Xi'an, China

The final, formatted version of the article will be published soon.

Background and objectives: Diagnosis and treatment of infertility and pregnancy loss are complicated by various factors. We aimed to develop a simpler, more efficient system for diagnosing infertility and pregnancy loss.

Methods: For modeling, this study included 333 female patients with infertility, 319 female patients with pregnancy loss, and 327 healthy individuals. For model validation, it included 1264 female patients with infertility, 1030 female patients with pregnancy loss, and 1059 healthy individuals. The average age and basic information were matched between the groups. Three methods were used to screen more than 100 clinical indicators, and five machine learning algorithms were used to develop and evaluate diagnostic models based on the most relevant indicators.

Results: Multivariate analysis revealed significant differences in several factors between the patients and the control group. 25-hydroxy vitamin D3 (25OHVD3) showed the most prominent difference, and most patients were deficient in this vitamin. In patients with infertility, 25OHVD3 was associated with blood lipids, hormones, thyroid function, human papillomavirus infection, hepatitis B infection, sedimentation rate, renal function, coagulation function, and amino acids. The model for infertility diagnosis included eleven factors and achieved an area under the curve (AUC) above 0.958, sensitivity above 86.52%, and specificity above 91.23%. The model for potential pregnancy loss was also developed using five machine learning algorithms and was based on seven indicators. On the testing set, the sensitivity was higher than 92.02%, the specificity was higher than 95.18%, the accuracy was higher than 94.34%, and the AUC was higher than 0.972.

The simplicity, good diagnostic performance, and high sensitivity of the models presented here may facilitate early detection, treatment, and prevention of infertility and pregnancy loss.
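For readers less familiar with the metrics quoted in the abstract, the following minimal sketch shows how sensitivity, specificity, accuracy, and AUC are conventionally computed for a binary classifier. It is not the authors' code: the labels, scores, and the 0.5 decision threshold are invented toy values, and the function names are hypothetical helpers chosen for illustration.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fn, tn, fp) for binary labels, where 1 = patient and 0 = control."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp, fn, tn, fp


def sensitivity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of true patients that the model flags as patients."""
    tp, fn, _, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fn)


def specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of true controls that the model labels as controls."""
    _, _, tn, fp = confusion_counts(y_true, y_pred)
    return tn / (tn + fp)


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of all subjects classified correctly."""
    tp, _, tn, _ = confusion_counts(y_true, y_pred)
    return (tp + tn) / len(y_true)


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random patient scores above a random control (ties count half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (assumed values, not taken from the study): 4 patients, 4 controls.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.2, 0.1, 0.4]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

print(sensitivity(y_true, y_pred))  # 0.75
print(specificity(y_true, y_pred))  # 0.75
print(accuracy(y_true, y_pred))     # 0.75
print(auc(y_true, scores))          # 0.875
```

Sensitivity, specificity, and accuracy depend on the chosen decision threshold, whereas AUC summarizes ranking quality across all thresholds, which is why the abstract reports them separately.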

Keywords: infertility, pregnancy loss, machine learning, 25OHVD3, miscarriage, fetal viability

Received: 13 Dec 2024; Accepted: 26 Jun 2025.

Copyright: © 2025 Zhang, Guo, Zhai, Wang, Hao, Yang, Zhou, Gao and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jiayun Liu, Xijing Hospital, Xi'an, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.