REVIEW article
Front. Pharmacol.
Sec. Experimental Pharmacology and Drug Discovery
Volume 16 - 2025 | doi: 10.3389/fphar.2025.1597351
Application of Artificial Intelligence Large Language Models in Drug Target Discovery
Provisionally accepted- 1Shandong Provincial Hospital, Jinan, China
- 2Shandong First Medical University, Tai'an, Shandong, China
- 3Linyi Central Hospital, Yishui, China
- 4Second People's Hospital of Dongying, Dongying, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Drug target discovery is a fundamental aspect of contemporary drug research and development. However, the use of conventional biochemical screening, omics analysis, and related approaches is constrained by substantial technical complexity and significant resource requirements. With the advancement of artificial intelligence-based large language models, notable progress has been achieved in drug target identification.During target mining, large language models with natural language comprehension capabilities can efficiently integrate literature data resources and systematically analyze disease-associated biological pathways and potential targets. Notably, models specifically designed for biomolecular "language" have demonstrated advantages across multiple aspects. The genomics-focused large language model has significantly enhanced the accuracy of pathogenic gene variant identification and gene expression prediction. In transcriptomics, large language models enable comprehensive reconstruction of gene regulatory networks. In proteomics, advancements have been made in protein structure analysis, function prediction, and interaction inference.Additionally, the single-cell multi-omics large language model facilitates data integration across different omics technologies. These technological advancements provide multi-dimensional biological evidence supporting drug target discovery and contribute to a more efficient screening process for candidate targets. The development of these models is generally based on deep neural networks of Transformer architecture, and powerful representation capabilities are obtained through large-scale unsupervised pre-training (such as mask language modeling, autoregressive prediction) combined with task-specific supervised fine-tuning. This review systematically examines recent advancements in the application of large language models in drug target discovery, emphasizing existing technical challenges and potential future research directions.
Keywords: Large Language Model, Drug target discovery, bioinformatics, Multi-omics integration, protein structure prediction
Received: 21 Mar 2025; Accepted: 27 Jun 2025.
Copyright: © 2025 Liu, Zhang, Wang, Teng, Wang and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
GuoYing Wang, Second People's Hospital of Dongying, Dongying, China
Xiaoming Zhou, Shandong Provincial Hospital, Jinan, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.