REVIEW article
Front. Bioinform.
Sec. Network Bioinformatics
This article is part of the Research TopicAdvanced Computational Approaches For Data And Model Integration In Bioinformatic and Biomedical ResearchView all articles
Artificial Intelligence in Drug Discovery from Advanced Molecular Representation to Pipeline Applications
Provisionally accepted- 1Department of Nuclear Medicine, The Affiliated Huaian No. 1 People’s Hospital of Nanjing Medical, University, No. 1 Huanghe West Road, Huai'an, 223300, Jiangsu, China, Huai'an, China
- 2Jiangsu Vocational College of Finance and Economics, Huai'an, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
The pharmaceutical research and development (R&D) process is persistently challenged by high financial costs, protracted timelines, and remarkably low success rates. Artificial intelligence (AI) technology, by simulating complex biological systems, has accelerated the innovation of the entire drug discovery pipeline. This review positions AI as a pivotal technology for reengineering the R&D process by utilizing sophisticated molecular representations to predict pharmacodynamic (PD) and toxicological effects significantly earlier. The scope systematically covers the AI foundations in chemoinformatics, detailing how the performance of AI models is intrinsically linked to the quality of molecular representation. We elaborate on representations ranging from robust string-based methods to advanced topological models, including the five key categories of Graph Neural Networks (GNNs), three-dimensional (3D)-aware Geometric Deep Learning (GDL) and emerging Quantum Machine Learning (QML) as well as Hybrid Quantum-Classical Neural Networks (HQNNs). We analyzed the practical application of these models across the drug discovery pipeline, including de novo molecular design with biological foundation models and flow matching generative architectures, data scarcity solutions via Few-Shot Learning and meta-learning, and explainable AI (XAI) for transparent validation. We propose an integrated Q-BioFusion framework that synergizes quantum computing, autonomous experimentation, and generative models to address systemic R&D constraints. We hope future research will improve the geometric fidelity to achieve more accurate and faster 3D molecular prediction and generation, enhance data efficiency, and solve the inherent data sparsity problem in biological assays, and advance integrated XAI workflows. These efforts will ensure transparent, reliable and trustworthy guidance during the computer simulation process of drug design.
Keywords: ADME/Tox prediction, artificial intelligence, De novo design, Drug Discovery, models
Received: 27 Nov 2025; Accepted: 03 Feb 2026.
Copyright: © 2026 Tao and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Weijing Tao
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
