SYSTEMATIC REVIEW article

Front. Med.

Sec. Translational Medicine

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1594442

Progress and Trends on Machine Learning in Proteomics During 1997-2024: A Bibliometric Analysis

Provisionally accepted
  • Affiliated Hospital and Clinical Medical College of Chengdu University, Chengdu, Sichuan Province, China

The final, formatted version of the article will be published soon.

Objective: Despite growing interest in the application of machine learning (ML) in proteomics, a comprehensive and systematic mapping of this research domain has been lacking. This study addresses this gap by conducting the first large-scale bibliometric analysis focused exclusively on ML-driven proteomics, aiming to elucidate its knowledge structure, development trajectory, and emerging research trends.

Methods: A total of 5,156 publications were retrieved from the Web of Science Core Collection and analyzed. Bibliometric tools, including CiteSpace 6.4.R1, VOSviewer 1.6.18, Scimago Graphica, and the R package bibliometrix, were used to extract and visualize key bibliometric indicators. After data cleaning and deduplication, analyses were conducted on keyword co-occurrence, citation networks, leading journals, influential authors, and institutional collaboration patterns to construct a comprehensive landscape of ML applications in proteomics.

Results: The number of publications has grown exponentially since 2010, with an average annual growth rate of 12.53% and a notable surge of 65.14% between 2019 and 2020. The United States emerged as the most productive country, while the Chinese Academy of Sciences led among institutions. AlphaFold2-related research received the most citations, reflecting the transformative role of deep learning in protein structure prediction. Thematic clustering revealed key research foci, including deep learning algorithms, protein-protein interaction prediction, and integrative multi-omics analysis. The field is characterized by strong interdisciplinary convergence, involving computer science, molecular biology, and clinical research. High-impact journals and influential authors were also identified, providing benchmarks for academic influence and collaboration.

Conclusion: This study offers the first comprehensive bibliometric analysis of ML in proteomics, revealing key themes such as deep learning, pretrained models, and multi-omics integration. Future efforts should focus on building interpretable models, enhancing cross-disciplinary collaboration, and ensuring secure, standardized data use to advance precision medicine.
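
The abstract reports an average annual growth rate and a single-year surge without stating how either figure was computed. As a rough illustration only, the sketch below assumes the average rate is a compound annual growth rate and the surge is a simple year-over-year percentage change. The function names and the publication counts are hypothetical and are not taken from the study.

```python
def compound_annual_growth_rate(first_count, last_count, n_years):
    """Compound annual growth rate, in percent, over n_years intervals.

    Assumption: this is one common definition of "average annual growth
    rate"; the abstract does not say which formula the authors used.
    """
    if first_count <= 0 or n_years <= 0:
        raise ValueError("first_count and n_years must be positive")
    return ((last_count / first_count) ** (1 / n_years) - 1) * 100


def year_over_year_change(previous_count, current_count):
    """Percentage change between two consecutive yearly counts."""
    if previous_count <= 0:
        raise ValueError("previous_count must be positive")
    return (current_count - previous_count) / previous_count * 100


# Hypothetical yearly publication counts, for illustration only.
counts = {2018: 100, 2019: 110, 2020: 121}

years = sorted(counts)
cagr = compound_annual_growth_rate(
    counts[years[0]], counts[years[-1]], years[-1] - years[0]
)
surge = year_over_year_change(counts[2019], counts[2020])

print(f"Compound annual growth rate: {cagr:.2f}%")
print(f"Change from 2019 to 2020: {surge:.2f}%")
```

The two figures answer different questions: the compound rate smooths growth over the whole period, while the year-over-year change isolates one interval, which is why a single-year surge can be far larger than the long-run average.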

Keywords: machine learning, proteomics, bibliometric, visual analytics, trend

Received: 28 Mar 2025; Accepted: 25 Jul 2025.

Copyright: © 2025 Tan, Liu, Zhang, Liu, Ai, Wu, Jian, Song and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Yongyan Song, Affiliated Hospital and Clinical Medical College of Chengdu University, Chengdu, 610081, Sichuan Province, China
Jin Yang, Affiliated Hospital and Clinical Medical College of Chengdu University, Chengdu, 610081, Sichuan Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.