Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Bioinform.

Sec. Drug Discovery in Bioinformatics

Volume 5 - 2025 | doi: 10.3389/fbinf.2025.1687617

This article is part of the Research TopicAI in Drug DiscoveryView all 3 articles

Machine Learning-Guided Optimization of Triple Agonist Peptide Therapeutics for Metabolic Disease

Provisionally accepted
Anthony  WongAnthony Wong1*Sanskruthi  GuduriSanskruthi Guduri1TsungYen  ChenTsungYen Chen1,2Kunal  PatelKunal Patel1,2
  • 1Carle Illinois College of Medicine, Urbana, United States
  • 2Carle Foundation Hospital, Urbana, United States

The final, formatted version of the article will be published soon.

Multi-target peptide therapeutics targeting glucagon receptor (GCGR), glucagon-like peptide-1 receptor (GLP1R), and glucose-dependent insulinotropic polypeptide receptor (GIPR) represent a promising approach for treating diabetes and obesity. Triple agonist peptides demonstrate promising therapeutic potential compared to single-target approaches, yet rational design remains computationally challenging due to complex sequence-structure-activity relationships. Existing methods, primarily based on convolutional neural networks, impose limitations including fixed sequence lengths and inadequate representation of molecular topology. Graph Attention Networks(GAT) offer advantages in capturing molecular structures and variable-length peptide sequences while providing interpretable insights into receptor-specific binding determinants. A dataset of 234 peptide sequences with experimentally determined binding affinities was compiled from multiple sources. Peptides were represented as molecular graphs with seven-dimensional node features encoding physicochemical properties and positional information. The GAT architecture employed a shared encoder with task-specific prediction heads, implementing transfer learning to address limited GIPR training data. Performance was evaluated using 5-fold cross-validation and independent validation on 58 literature-derived sequences. A genetic algorithm framework was developed for peptide sequence optimization, incorporating multi-objective fitness evaluation based on predicted binding affinity, biological plausibility, and sequence novelty. Cross-validation demonstrated robust GAT performance across all receptors, with GCGR achieving high accuracy (AUC-ROC: 0.915 ± 0.050), followed by GLP1R (AUC-ROC: 0.853 ± 0.059), and GIPR showing acceptable performance despite limited data (AUC-ROC: 0.907 ± 0.083). Comparative analysis revealed receptor-specific advantages: GAT outperformed CNN for GCGR prediction (RMSE: 0.942 vs 1.209, p = 0.0013), while CNN maintained superior GLP1R performance (RMSE: 0.552 vs 0.723). Genetic algorithm optimization measurable improvement over baseline, with 4.0% fitness enhancement and generation of 20 candidates exhibiting mean binding probabilities exceeding 0.5 across all targets. The GAT-based framework provides a computational approach in computational peptide design, demonstrating receptor-specific advantages and robust optimization capabilities. Genetic algorithm optimization enables systematic exploration of sequence space within existing scaffolds while maintaining biological constraints. This approach provides a rational framework for prioritizing experimental validation efforts in triple agonist development.

Keywords: peptide design, machine learning, Bioactivity prediction, Drug Discovery, Graph attention networks

Received: 18 Aug 2025; Accepted: 22 Oct 2025.

Copyright: © 2025 Wong, Guduri, Chen and Patel. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Anthony Wong, awong16@illinois.edu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.