ORIGINAL RESEARCH article

Front. Toxicol.

Sec. Computational Toxicology and Informatics

Volume 7 - 2025 | doi: 10.3389/ftox.2025.1640612

A Deep-Learning Approach to Predict Reproductive Toxicity of Chemicals Using Communicative Message Passing Neural Network

Provisionally accepted
Owen He1*, Daoxing Chen2, Yimei Li3*
  • 1Deerfield Academy, Deerfield, United States
  • 2Wenzhou Medical University School of Pharmaceutical Sciences, Wenzhou, China
  • 3St Jude Children's Hospital Department of Biostatistics, Memphis, United States

The final, formatted version of the article will be published soon.

Reproductive toxicity is a critical concern for human health and chemical safety assessment. Recently, the U.S. Food and Drug Administration announced plans to assess toxicity with artificial intelligence-based computational models instead of animal studies in "a win-win for public health and ethics." In this study, we used a reproductive toxicity dataset in which 1091 reproductively toxic and 1063 non-toxic small-molecule compounds are represented as Simplified Molecular Input Line Entry System (SMILES) strings. A repeated nested cross-validation procedure was applied: in the outer loop, the dataset was randomly partitioned into five distinct folds, with each fold serving once as the test set; in the inner loop, a similar procedure was repeated five times, with ⅛ serving as the validation set each time. We first evaluated classical machine learning (ML) methods such as Random Forest and Extreme Gradient Boosting on predicting reproductive toxicity, using standard evaluation metrics: accuracy (ACC), the area under the receiver operating characteristic curve (AUC), and the F1 score. Our analyses indicate that the overall performance of these methods was mediocre and insufficient for high-throughput screening. To overcome these limitations, we adopted the Communicative Message Passing Neural Network (CMPNN) framework, which incorporates a communicative kernel and a message booster module. Our results show that the resulting ReproTox-CMPNN model outperforms the current best baselines in both embedding quality and predictive accuracy. On independent test sets, ReproTox-CMPNN achieved a mean AUC of 0.946, ACC of 0.857 and F1 score of 0.846, surpassing traditional algorithms and establishing a new state of the art in this field. These findings demonstrate that CMPNN's deep capture of multi-level molecular relationships offers an efficient and reliable computational tool for rapid chemical safety screening and risk assessment.
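The nested cross-validation scheme and the three evaluation metrics described above can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. The function names (`nested_cv_splits`, `accuracy`, `f1_score`, `roc_auc`), the random seed, and in particular the reading that the ⅛ validation set is drawn at random from the data left after removing the outer test fold are all assumptions made for illustration.

```python
import random


def nested_cv_splits(n_samples, n_outer=5, n_inner=5, val_fraction=1 / 8, seed=0):
    """Yield (outer, inner, train_idx, val_idx, test_idx) index splits.

    Hypothetical helper: the abstract does not specify how the inner
    validation set is drawn, so a fresh random subset is assumed here.
    """
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    # Outer loop: partition the shuffled indices into n_outer disjoint folds.
    folds = [indices[i::n_outer] for i in range(n_outer)]
    for outer, test_idx in enumerate(folds):
        # Everything outside the current test fold is available for fitting.
        remaining = [i for k, fold in enumerate(folds) if k != outer for i in fold]
        n_val = max(1, round(len(remaining) * val_fraction))
        for inner in range(n_inner):
            # Inner loop (assumption): reshuffle and hold out val_fraction.
            shuffled = remaining[:]
            rng.shuffle(shuffled)
            yield outer, inner, shuffled[n_val:], shuffled[:n_val], test_idx


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class (1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def roc_auc(y_true, scores):
    """AUC as the probability that a positive outranks a negative (ties = 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Dataset size taken from the abstract: 1091 toxic + 1063 non-toxic compounds.
splits = list(nested_cv_splits(1091 + 1063))

# Toy labels and scores (made up for illustration, not results from the study).
toy_true = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.6, 0.1]
toy_pred = [int(s >= 0.5) for s in toy_scores]
```

In such a setup, a model would be fitted on each `train_idx`, tuned against the matching `val_idx`, and scored once per outer fold on `test_idx`; averaging the per-fold metrics would then give mean values of the kind reported in the abstract.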

Keywords: reproductive, artificial intelligence (AI), deep learning, graph neural network, CMPNN, in silico

Received: 04 Jun 2025; Accepted: 10 Jul 2025.

Copyright: © 2025 He, Chen and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Owen He, Deerfield Academy, Deerfield, United States
Yimei Li, St Jude Children's Hospital Department of Biostatistics, Memphis, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.