ORIGINAL RESEARCH article

Front. Genet.

Sec. RNA

Volume 16 - 2025 | doi: 10.3389/fgene.2025.1606016

This article is part of the Research TopicMicroRNA Secretion and Delivery: Mechanisms, Biomarkers, and Therapeutic InnovationsView all 3 articles

Integrating BERT Pre-training with Graph Common Neighbours for Predicting ceRNA Interactions

Provisionally accepted
Zhengxing  XieZhengxing Xie1Tianping  YinTianping Yin2Jing  GeJing Ge2Shiyang  LiangShiyang Liang3Junhua  LiuJunhua Liu4Lianghua  TangLianghua Tang2*
  • 1Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou Province, China
  • 2The Second Affiliated Hospital, Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou Province, China
  • 3The No. 944 Hospital of Joint Logistic Support Force of PLA, Gansu, China
  • 4School of Computing and Information Systems, Faculty of Engineering and Information Technology, University of Melbourne, Carlton, Victoria, Australia

The final, formatted version of the article will be published soon.

(ceRNAs), including long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs), is essential for understanding gene regulation. With the development of Graph Neural Networks, existing works have demonstrated the ability to capture information from interactions between microRNA and ceRNAs to predict unseen interactions. However, due to deep GNNs, which only leverage node-node pairwise features, existing methods neglect the information in their chains since different RNAs have chains of different lengths. To address this issue, we propose a novel model termed the BERT-based ceRNA Graph Predictor (BCGP), which leverages both RNA sequence information and the heterogeneous relationships between lncRNAs, circRNAs, and miRNAs. Our BCGP method employs a transformer-based model to generate contextualized representations that consider the global context of the entire RNA sequence. Subsequently, we enrich the RNA interaction graph using these contextualized representations generated by BERT. Furthermore, to improve the performance of association prediction, BCGP utilizes the Neural Common Neighbour (NCN) technique to capture more refined node features, leading to more informative and flexible representations. Through comprehensive experiments on two real-world datasets of lncRNA-miRNA and circRNA-miRNA associations, we demonstrate that BCGP outperforms competitive baselines across various evaluation metrics and achieves higher accuracy in association predictions.In our case studies on two types of miRNAs, we demonstrate BCGP's remarkable performance in predicting both miRNA-lncRNA and miRNA-circRNA associations.

Keywords: lncRNA, circRNA, miRNA, ceRNA, pre-train, Graph neural network

Received: 04 Apr 2025; Accepted: 08 Jul 2025.

Copyright: © 2025 Xie, Yin, Ge, Liang, Liu and Tang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Lianghua Tang, The Second Affiliated Hospital, Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.