Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Pharmacol.

Sec. Pharmacoepidemiology

Risk Assessment of Drug-Associated Miscarriage Using XGBoost and SHAP Explainability: A Real-World Pharmacovigilance Analysis Based on the FAERS Database

Provisionally accepted
  • 1Shenzhen Traditional Chinese Medicine Hospital, Shenzhen, China
  • 2Guangzhou University of Chinese Medicine, Guangzhou, China

The final, formatted version of the article will be published soon.

Background: Miscarriage is a common and serious adverse pregnancy outcome. Assessing drug-associated miscarriage risk is essential for medication safety in pregnancy. Using the FDA Adverse Event Reporting System (FAERS), this study systematically mined adverse drug events (ADEs) related to miscarriage and combined machine learning with explainable Artificial Intelligence (AI) to evaluate potential high-risk drugs. Methods: We retrieved FAERS reports of miscarriage-associated ADEs from 2005 to 2024. Disproportionality analyses were conducted using the reporting odds ratio (ROR), proportional reporting ratio (PRR), Bayesian Confidence Propagation Neural Network (BCPNN), and Multi-Item Gamma Poisson Shrinker (MGPS), with subgroup analyses by age and body weight. An eXtreme Gradient Boosting (XGBoost) model was developed to predict miscarriage risk, and Shapley Additive exPlanations (SHAP) were used to interpret feature contributions. Finally, Weibull distribution modeling characterized the time-to-onset (TTO) from drug exposure to miscarriage. Results: A total of 36,389 ADEs were included. We identified several potential high-risk classes, notably immunomodulators, psychoactive/neuroactive agents, and antimicrobials. The XGBoost model showed favorable discrimination with a mean area under the curve (AUC) of 0.738. SHAP analysis reveals that immunomodulatory factors, such as adalimumab and infliximab, are significant predictors of miscarriage events in this model. The distribution of their SHAP values suggests a strong association between these drugs and miscarriage reports. TTO analyses suggested that most miscarriages occurred within two years after drug exposure, with marked heterogeneity in risk timing across agents; anti-Tumor Necrosis Factor-alpha drugs (TNF-α) exhibited a higher early risk. Conclusions: Machine learning and SHAP interpretability analysis based on the FAERS database effectively identified immunomodulators, antiviral drugs, and psychiatric/neuropsychiatric medications as potential risk signals associated with miscarriage. These findings underscore the need for individualized medication assessment that considers patient age and body weight, providing evidence-based guidance and early alerting for reference for drug risk assessment during pregnancy.

Keywords: Disproportionality analysis4, Drug-associatedmiscarriage1, FAERSdatabase3, immunomodulators2, XGBoost5

Received: 01 Oct 2025; Accepted: 10 Feb 2026.

Copyright: © 2026 Lin, Ma, Zhao, Peng, Zhang, Zhang, Li, Li and He. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Li He

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.