ORIGINAL RESEARCH article
Front. Genome Ed.
Sec. Genome Editing Tools and Mechanisms
Volume 7 - 2025 | doi: 10.3389/fgeed.2025.1643888
This article is part of the Research TopicInsights in Genome Editing Tools and Mechanisms: 2024View all 4 articles
CRISPR-FMC: A Dual-Branch Hybrid Network for Predicting CRISPR-Cas9 On-Target Activity
Provisionally accepted- 1Zhejiang Agriculture and Forestry University, Hangzhou, China
- 2University of Electronic Science and Technology of China, Chengdu, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Accurately predicting the on-target activity of sgRNAs remains a key challenge in CRISPR-Cas9 applications, largely due to the limited generalization ability of existing models across diverse datasets, small-sample conditions, and complex sequence contexts. Many current approaches rely on shallow architectures or unimodal encodings, hindering their ability to capture the intricate dependencies underlying Cas9-mediated cleavage. To address these limitations, we propose CRISPR-FMC, a dual-branch hybrid neural network that begins with a data preprocessing stage combining One-hot encoding and contextual embeddings from a pre-trained RNA-FM model. This dual-input scheme enables the model to represent both explicit nucleotide composition and highlevel semantic features. The architecture incorporates multi-scale convolution (MSC), BiGRU, and Transformer blocks to extract hierarchical sequence features, and employs a bidirectional cross-attention mechanism with a residual feedforward network to enhance multimodal fusion and generalization. Evaluations on nine public CRISPR-Cas9 datasets show that CRISPR-FMC consistently outperforms existing baselines in both Spearman and Pearson correlation metrics, with particularly strong performance under low-resource and cross-dataset conditions. Ablation studies validate the contribution of each component, while base substitution analysis highlights the model's sensitivity to the PAM-proximal region, aligning with biological evidence. Overall, CRISPR-FMC provides a robust and interpretable framework for sgRNA activity prediction across heterogeneous genomic scenarios.
Keywords: CRISPR-Cas9, RNA-FM, deep learning, on-target, SgRNA
Received: 09 Jun 2025; Accepted: 08 Aug 2025.
Copyright: © 2025 Li, Li, Zou and Feng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Jian Li, Zhejiang Agriculture and Forestry University, Hangzhou, China
Hailin Feng, Zhejiang Agriculture and Forestry University, Hangzhou, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.