
REVIEW article

Front. Mol. Biosci., 11 February 2026

Sec. Structural Biology

Volume 12 - 2025 | https://doi.org/10.3389/fmolb.2025.1613399

This article is part of the Research Topic: Breakthroughs in Cryo-EM with Machine Learning and Artificial Intelligence.

Connecting the dots: deep learning-based automated model building methods in cryo-EM

  • 1Department of Molecular Pathobiology, NYU College of Dentistry, New York, NY, United States
  • 2NYU Pain Research Center, New York, NY, United States

The resolution revolution in single-particle cryo-electron microscopy (cryo-EM) has dramatically expanded our structural knowledge of large biomolecular complexes. While high-resolution cryo-EM density maps enable atomic model building, lower-resolution maps can still reveal secondary structures, folds, and domains. When combined with integrative modeling approaches, such data can provide meaningful insights into biomolecular structure and function. Constructing accurate models, however, remains challenging: at low resolutions it is difficult to interpret density map features reliably, and at high resolutions traditional model-building workflows can become a time-consuming bottleneck. Deep learning, which is transforming problem-solving across scientific domains, offers powerful new tools to automate and accelerate this process. In this review, we discuss deep learning-based methods developed to automate model building in cryo-EM density maps, assessing their impact on streamlining structure determination. Recognizing that biomacromolecular structures exhibit hierarchical organization, we classify these methods according to their ability to model primary, secondary, tertiary, and quaternary structures of biomolecules. Deep learning tools for building atomic models in cryo-EM density maps are further grouped as de novo, where the model is predicted directly from features learned from the cryo-EM density, or hybrid, where it is derived by integrating structural templates with these features. We outline current limitations, including the challenge of obtaining sufficiently large and diverse datasets for training networks to model different types of biomolecules in cryo-EM density maps, and the open challenge of constructing training sets that capture the conformational heterogeneity often observed in cryo-EM maps. We conclude by highlighting emerging directions for this rapidly advancing field, which promise to make automated, data-driven model building an integral part of structural biology.

1 Introduction

The prominent architect Louis Sullivan coined the phrase “form follows function”, asserting that an object’s shape should primarily relate to its intended purpose, and applied this design philosophy to pioneer the modern skyscraper. The principle quickly found applications in domains beyond architecture, including automobile design, product design, and software engineering, and it even describes the hierarchy of biological organization at the cellular, tissue, organ, individual, and ecosystem levels. At the molecular level, “form follows function” means that a particular arrangement of atoms, residues, and higher-order motifs in a biomacromolecule allows it to perform a specific function, e.g., enzyme active-site catalysis, membrane protein-assisted transport of solutes, or the flow of genetic information, underscoring the need to accurately determine macromolecular structure in order to elucidate its function (Carugo and Djinović-Carugo, 2023). Unlike skyscrapers, which are static, biomolecules can dynamically adapt their forms by spatially rearranging their constituent elements to perform a diverse array of functions in rapidly changing cellular environments, the basis for allosteric mechanisms (Astore et al., 2024).

Structure determination of protein–protein and protein–nucleic acid complexes is crucial for understanding cellular processes and developing therapeutics for challenging disorders (Anderson, 2003; de Oliveira et al., 2021; Bansia et al., 2021). Single-particle cryo-electron microscopy (cryo-EM) has become the method of choice for determining high-resolution structures of biomacromolecules, ranging from small proteins to large complex assemblies in their native context (Kühlbrandt, 2014; Nogales, 2016; de la Cruz and Eng, 2023; Chua et al., 2022). A three-dimensional (3D) density map in cryo-EM is represented as a 3D grid, with each voxel assigned a density value. A voxel is the 3D equivalent of a two-dimensional (2D) pixel, representing a value in a 3D grid just as a pixel represents a value in a 2D grid. A 3D cryo-EM density map is reconstructed from thousands of 2D projection images of randomly oriented, flash-frozen biomolecules (single particles) embedded in vitreous ice, providing an approximation of the electron scattering potential of the biomolecule (Wang and Moore, 2017; Marques et al., 2019). These density maps provide the structural framework for building and refining models of biomacromolecules, revealing molecular mechanisms and conformational states that may be difficult to capture with other structural techniques (Chang et al., 2021). The structure of biomacromolecules is often described in terms of hierarchical levels of complexity and organization, ranging from local, regular secondary structures such as β-sheets and α-helices to the global, complex 3D arrangement of atoms capturing interactions between regions that may be far apart in the primary structure. The resolution of the density map determines the level of detail, from overall molecular shapes and secondary structures at lower resolutions to the 3D arrangement of individual residues and atomic features at near-atomic resolution. Model building involves the construction of plausible models of the biomacromolecule within the interpretable density of the map while maintaining correct chemical, stereochemical, and geometrical properties of the biomacromolecule. The level of detail achievable in the model depends on the map resolution. Therefore, successful model building ultimately depends on the quality and resolution of the density maps. At modest to lower resolutions, model building becomes increasingly challenging, often leading to errors in models deposited in the Protein Data Bank (PDB) (Zardecki et al., 2016; Pintilie, 2023; Reggiano et al., 2023; Croll et al., 2021). Moreover, as the biomolecules studied by cryo-EM grow in size and complexity, a now common situation, maps frequently contain densities corresponding to regions without homologous templates in the PDB. This lack of structural precedent makes de novo atomic model reconstruction particularly difficult and time-consuming (Cianfrocco and Kellogg, 2020). Figure 1 shows that as of the end of 2024, the Electron Microscopy Data Bank (EMDB) (Consortium, 2023; Lawson et al., 2016) contained 9,329 entries, of which 6,877 (73.7%) fell within the model-building range (0–4.0 Å). However, only 5,791 entries had corresponding atomic models deposited in the PDB, representing just 62.1% of all EMDB submissions and 84.2% of those in the 0–4.0 Å resolution range (Lawson et al., 2016) (Figure 1).
A similar trend can be observed for 2025, where the 5,571 models deposited in the PDB correspond to 62.6% of all EMDB submissions (8,893) and 83.2% of those in the 0–4.0 Å resolution range (6,696) (Lawson et al., 2016) (Figure 1). While the reasons for this discrepancy may be multifaceted, it underscores the persistent challenge of generating complete atomic models even for high-resolution data. As such, accurate model building of target macromolecular assemblies from density maps is one of the most demanding tasks in any structure determination pipeline, requiring labor-intensive manual input reliant on complex judgements and clearly necessitating the development of advanced model-building tools in structural biology.


Figure 1. (A) Yearly distribution of total EMDB entries and associated PDB entries for the last decade. (B) Distribution of EMDB entries by resolution per year. Data sourced from EMDataResource (Lawson et al., 2016).

Although methods for reconstructing models (Terwilliger et al., 2018; Baker et al., 2011; Lindert et al., 2009; DiMaio et al., 2011; Terashi and Kihara, 2018), which iteratively optimize and refine structures by minimizing an energy function, have accelerated cryo-EM structure determination, they often require expert, informed manual input to derive accurate models. Traditional machine learning algorithms have been used to develop methods aimed at automating structure determination from cryo-EM density maps. These approaches are generally rule-based or rely on statistical learning techniques to guide macromolecular modeling from cryo-EM maps (Si et al., 2012; Lingyu et al., 2012; Chen et al., 2016). SSELearner (Si et al., 2012) automatically identifies helices and β-strands by using a support vector machine (SVM) to classify voxels in cryo-EM density maps. RENNSH (Lingyu et al., 2012) identifies only α-helices, using nested k-nearest neighbor (KNN) classifiers to distinguish helix from non-helix voxels. Pathwalking (Chen et al., 2016) combines k-means clustering, an unsupervised machine learning algorithm, with a Traveling Salesman Problem (TSP) solver, a combinatorial optimization algorithm, to automatically trace the protein backbone directly from cryo-EM maps at 3.0–6.0 Å resolution. However, the above methods are mostly limited to either identifying secondary structure elements or tracing the minimal protein backbone, thereby highlighting the need for effective atomic model-building methods from cryo-EM density maps to provide a complete modeling solution.

The resolution revolution in cryo-EM, driven by hardware and software advances, has led to the generation of a large number of high-resolution density maps (de la Cruz and Eng, 2023; Chua et al., 2022; Cheng, 2018). These advances, combined with parallel developments in computational hardware, software, and machine learning methodologies, have facilitated the development of deep learning-based tools to automate critical steps in the cryo-EM structure determination workflow. Deep learning is a branch of machine learning that employs artificial neural networks, modeled after biological neural networks, for prediction and classification tasks, with the term ‘deep’ referring to the use of multiple hidden layers in the network (Greener et al., 2022). Deep learning has revolutionized problem solving across science (LeCun et al., 2015), exemplified by AlphaFold2’s Nobel Prize-winning success in protein structure prediction (Jumper et al., 2021). Within the cryo-EM workflow itself, deep learning has already significantly improved steps such as particle picking, 3D density map reconstruction, and structural heterogeneity analysis (Si et al., 2022; DiIorio and Kulczyk, 2023; Farheen et al., 2025). Building on this momentum, a new wave of deep learning methods now targets the critical challenge of automating the reconstruction of models from cryo-EM density maps (Li S. et al., 2025; Giri et al., 2023; Zhang C. et al., 2025; Si et al., 2023) – the focus of this review.

We begin with a brief primer on key aspects of biomolecular structures, followed by an overview of the fundamentals of deep neural networks, with the aim of providing a general and accessible introduction for readers from diverse fields to the terminology and concepts used throughout the review. We then examine approximately 50 deep learning-based tools for modeling proteins, nucleic acids, and protein-nucleic acid complexes in cryo-EM density maps, analyzing their underlying architectures to identify common conceptual strategies, outlining their main stages from data preprocessing and training to feature learning from cryo-EM density maps, model building, and subsequent refinement. Recognizing the hierarchical organization of biomolecular structures, we classify these tools according to their ability to model the primary, secondary, tertiary, and quaternary structures of biomolecules. Tools for building atomic models in cryo-EM density maps are further grouped as de novo, where the atomic model is predicted directly from features learned from the cryo-EM density, or hybrid, where it is derived by integrating structural templates with these features. Subsequent sections address the assessment and validation of structural models built by these tools in cryo-EM density maps, the availability and applications of these tools, and their current limitations and potential future improvements. We complement the sections with figures and tables summarizing, for each tool, its training datasets, neural network architecture, prediction tasks, the types of biomolecules it builds, and its availability as servers or publicly accessible code. Our comprehensive review of deep learning-based methods for automated model building in cryo-EM density maps provides a timely, useful, and one-stop resource for structural biologists, researchers interested in applying these methods in cryo-EM workflows, and method developers aiming to design next-generation model-building tools.

2 A primer on biomacromolecular structure

Biomacromolecules are large, complex molecules typically formed by the polymerization of smaller repeating units called monomers. In biological systems, biomacromolecules are essential for life and most commonly include proteins, nucleic acids and carbohydrates. Nucleic acids - deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) - store and transmit the genetic information essential for life. Proteins are the workhorses of biological systems, playing critical roles in virtually every cellular process including catalyzing biochemical reactions as enzymes, providing structural support, transporting molecules, and regulating cellular processes. The structure of biomacromolecules is often described in terms of hierarchical levels of complexity and organization. For proteins and nucleic acids, these are commonly referred to as primary, secondary, tertiary, and sometimes quaternary structures.

Proteins are linear biopolymers of repeating units called amino acids. There are 20 different types of amino acids commonly found in proteins. An amino acid has a central α-carbon (Cα) atom to which four different groups are covalently attached: an amino group (-NH2), a carboxylic acid group (-COOH), a hydrogen atom (-H), and a side chain (-R group) unique to each amino acid. The side chain imparts distinct physicochemical properties to the amino acids (such as polar, non-polar, aromatic, aliphatic, acidic, basic) and thereby contributes to structural and functional diversity in proteins. Successive amino acids in a protein are covalently linked by peptide bonds, formed between the carboxyl group of one amino acid and the amino group of the next, creating the main chain or backbone (repeating -N-Cα-C-), to which distinctive side chains are attached (Figure 2A). Polypeptides are polymers of a series of amino acids linked by peptide bonds, and each amino acid within a polypeptide chain is referred to as a residue (Figure 2A). Proteins consist of one or more polypeptide chains. A polypeptide chain has directionality, with an amino-terminal (N-terminal), where the terminal residue has a free amino group, and a carboxy-terminal (C-terminal), where the terminal residue has a free carboxyl group. By convention, the sequence of amino acids in a polypeptide is written starting with the N-terminal residue. In successive amino acid residues within a polypeptide, the peptide bond is planar, restricting free rotation around the C–N bond. However, free rotation is permitted around the N–Cα and Cα–C bonds, which are quantified by the backbone dihedral (torsion) angles φ (phi) and ψ (psi), respectively (Figure 2A). These angles define the backbone conformation of a polypeptide, allowing proteins to fold into a variety of distinct structures. The polypeptide backbone is rich in hydrogen-bonding potential because each peptide bond contains both a hydrogen-bond donor (the NH group) and a hydrogen-bond acceptor (the CO group), allowing the backbone to stabilize structural elements within a protein. The primary structure of a protein refers to its linear sequence of amino acid residues, defined by the peptide bonds linking them along the polypeptide chain(s) (Figure 2A). For any given segment of a polypeptide chain, secondary structure refers to the local spatial arrangement of its main-chain (backbone) atoms. The secondary structure of a polypeptide segment is completely determined when the backbone dihedral angles, φ (phi) and ψ (psi), are known for each residue. When these angles adopt a repeating, consistent value throughout the segment, a recurring, periodic structural pattern arises, leading to regular secondary structures such as α-helices and β-strands (Figure 2B). In an α-helix, the polypeptide backbone adopts a right-handed helical conformation, with amino acid side chains projecting outward from the helical axis. The α-helix is stabilized by intra-helix hydrogen bonds between the backbone carbonyl oxygen of residue i and the backbone amide hydrogen of residue i+4 (Figure 2B). Compared to an α-helix, a β-strand represents an extended conformation of the polypeptide backbone. β-Sheets are composed of two or more β-strands arranged side-by-side in an antiparallel or parallel manner, having opposite or the same amino-to-carboxyl orientations, respectively (Figure 2B). 
In β-strands, the side chains alternate above and below the plane of the backbone, giving β-sheets their characteristic pleated appearance. Unlike the intra-helix backbone hydrogen bonding in α-helices, β-sheets are stabilized by inter-strand hydrogen bonds between the backbones of adjacent β-strands (Figure 2B). Random coils are examples of secondary structures in which no regular pattern exists. Turns or loops are secondary structure elements that connect successive runs of α-helices or β-strands, often facilitating a reversal of the polypeptide chain direction. An example is a β-turn, which connects the ends of two adjacent β-strands in an antiparallel β-sheet. Unlike secondary structure, which defines the local spatial arrangement of adjacent amino acid residues, tertiary structure describes the overall three-dimensional folding of the entire polypeptide chain, including interactions between residues that may be far apart in the primary structure and between different secondary structure elements (Figure 2E). Thus, tertiary structure captures long-range interactions. Secondary structure elements such as α-helices and β-sheets can combine in specific ways through connecting segments to form supersecondary structures, or motifs, such as the helix–turn–helix. Tertiary structure is characterized by distinct overall folding patterns, or folds, that describe the three-dimensional arrangement of these motifs within a single polypeptide chain. A protein domain is a region of a polypeptide chain that can fold independently into a stable structure and can often move as a single unit relative to the rest of the chain (Figure 2E). In large proteins, different domains may have distinct structural and functional properties. Many proteins are composed of more than one polypeptide chain, identical or different, associated non-covalently or through covalent disulfide bonds. Each polypeptide chain is called a subunit, and such proteins are referred to as multisubunit or multimeric proteins (Figure 2E). The spatial arrangement and interactions of these subunits define quaternary structure, which can range from simple dimers composed of identical subunits to large complexes containing many different subunits (Figure 2E).


Figure 2. Schematic illustrating the hierarchical organization of biomolecular structures (proteins and nucleic acids), as described in Section 2. (A,C) Primary structure. (B,D) Secondary structure. (E) Tertiary and quaternary structure. The illustration was generated in PyMOL (DeLano and Lam, 2005) using the cryo-EM structure of AsCas12f-sgRNA-target DNA ternary complex (PDB ID: 8J12).

Nucleic acids - deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) - are linear biopolymers of nucleotides. A nucleotide consists of three components: a pentose sugar, a nitrogenous (nitrogen-containing) base, and a phosphate group (Figure 2C). In RNA the sugar is D-ribose, whereas in DNA the sugar is 2′-deoxy-D-ribose, which lacks the 2′-hydroxyl group (Figure 2D). In a nucleotide, the phosphate group is attached to the 5′ carbon of the sugar through a phosphoester bond, whereas the nitrogenous base is covalently attached to the 1′ carbon of the sugar via a glycosidic bond (Figure 2C). A nucleotide without a phosphate group is called a nucleoside. In nucleic acids, there are five major nitrogenous bases, which are derivatives of either purines or pyrimidines (Figure 2D). Adenine (A) and Guanine (G) are the major purine bases in both RNA and DNA (Figure 2D). Among the three pyrimidine bases, Uracil (U) occurs in RNA, Thymine (T) is found in DNA, while Cytosine (C) is present in both DNA and RNA (Figure 2D). Successive nucleotides in nucleic acids are covalently linked through phosphodiester linkages, where a phosphate group bridges the 3′ and 5′ positions of the sugar moieties in adjacent nucleotides, forming the sugar-phosphate backbone to which the nitrogenous bases are attached as side groups (Figure 2C). Nucleotides that are part of nucleic acids are referred to as nucleotide residues. The primary structure of a nucleic acid refers to its linear sequence of nucleotide residues, distinguished by their nitrogenous bases, and defined by the phosphodiester bonds linking them along the polynucleotide chain. Polynucleotide chains in nucleic acids have directionality, with the 5′ end having a free phosphate group attached to the 5′ carbon of the terminal sugar and the 3′ end having a free hydroxyl group on the 3′ carbon of the terminal sugar (Figures 2C,D). The secondary structure of a nucleic acid refers to the regular, stable structures, such as double helices, hairpins, and loops, formed by hydrogen-bonding patterns between nitrogenous bases within a single polynucleotide chain or between two such chains. The most common form of DNA, known as B-DNA, consists of two helical polynucleotide chains wound around a common axis to form a right-handed double helix, with the nitrogenous bases positioned on the inside and the sugar–phosphate backbone on the outside of the double helix (Figure 2D). The two chains of the double helix have opposite directionality, making them antiparallel, and are held together by hydrogen bonding between complementary base pairs: A pairs with T and G pairs with C (Figure 2D). This specific base pairing ensures that the two strands are complementary: whenever G occurs in one chain, C is found in the other, and likewise for A and T. RNA molecules are mostly single-stranded and can fold into local double-helical regions or stem-loop secondary structures, also referred to as hairpin loops (Figure 2D). Similar to DNA, G pairs with C, and A pairs with U in RNA. Self-complementary sequences in RNA form hairpin loops, consisting of a complementary base-paired stem and an unpaired loop at the end (Figure 2D). The tertiary structure of a nucleic acid refers to the three-dimensional folding and spatial arrangement of its secondary structure elements, such as helices and hairpin loops, resulting in complex overall shapes (Figure 2E). Common examples include supercoiled cellular DNA and tRNA (transfer RNA).

3 A primer on deep neural networks (DNNs)

This primer aims to provide a concise introduction to deep learning concepts and neural network architectures employed in automated model-building methods for cryo-EM. For a more comprehensive understanding of machine learning concepts, readers are referred to the reference (Greener et al., 2022).

3.1 Artificial neurons and artificial neural networks

Artificial neural networks (ANNs), inspired by biological neural networks, are universal function approximators and can learn to model any mathematical function to a desired degree of accuracy (Greener et al., 2022). The fundamental information processing units of ANNs are interconnected artificial neurons, also inspired by biological neurons. A single artificial neuron is a mathematical function that processes and transforms input data (Figure 3A). Each neuron receives one or more inputs (xi), multiplies each input by a corresponding learnable weight term (wi), sums these weighted inputs, adds a learnable bias term (b), and passes the result through a non-linear activation function (f) to produce its output (y) (Figure 3A) (Greener et al., 2022). In other words, an artificial neuron computes a non-linear function of a weighted sum of its inputs.

y = f\left(\sum_{i=1}^{n} w_i x_i + b\right)


Figure 3. (A) Illustration of an artificial neuron showing its inputs, weights, bias, activation function, and output. Schematic overview of representative neural network architectures used by deep learning tools for model building in cryo-EM: (B) Multilayer perceptron, (C) Convolutional neural network, (D) U-Net, (E) Graph neural network, and (F) Recurrent neural network. These architectures are described in Section 3.

Weights control the strength or importance of the connections between neurons, and the bias allows the activation function to shift its output, enabling the model to learn diverse patterns. The activation function introduces non-linearity, transforming the linear operations (weighted sums and biases) into non-linear ones. This allows the network to model complex relationships and effectively act as a universal function approximator. Non-linear patterns are inherent in most real-world data, and activation functions enable neural networks to capture this non-linear structure. In an ANN, artificial neurons are arranged in layers, with neurons in one layer connected to neurons in adjacent layers, while neurons within a layer do not communicate with each other. Layers correspond to different stages of computation in a neural network, with the output of one layer serving as the input to the next.
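To make the computation above concrete, the following sketch implements a single artificial neuron in plain NumPy; the input values, weights, and bias are arbitrary illustrative numbers rather than parameters of any trained model.

```python
import numpy as np

def relu(z):
    # Rectified linear unit: a common non-linear activation function
    return np.maximum(0.0, z)

def neuron(x, w, b, activation=relu):
    # Weighted sum of inputs plus bias, passed through a non-linearity:
    # y = f(sum_i w_i * x_i + b)
    return activation(np.dot(w, x) + b)

# Illustrative values only
x = np.array([0.2, -1.3, 0.7])   # inputs x_i
w = np.array([0.5, 0.1, -0.4])   # learnable weights w_i
b = 0.05                         # learnable bias b
print(neuron(x, w, b))
```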

3.2 Neural network architecture

A basic network typically consists of three types of layers: an input layer, one or more hidden layers, and an output layer (Figure 3B). Deep learning (DL) is a branch of machine learning that employs ANNs for prediction and classification tasks, with the term ‘deep’ referring to the use of multiple hidden layers in the network (Figure 3B). Each neuron in the input layer typically corresponds to one input feature value calculated from the input data. Hidden layers, located between the input and output layers, process and transform the data: with multiple hidden layers, the network can learn hierarchical patterns and extract higher-level features. The output layer generates the network prediction, with the number of neurons determined by the specific task, such as the number of classes in a classification problem. A mathematical function that quantifies the disagreement between the predicted output from a neural network and the ground-truth values is known as a loss function (Greener et al., 2022); essentially, a loss function quantifies the error in the network’s predictions. Weights and biases are the adjustable, learnable parameters of a neural network, and they are optimized during the network training process to minimize the chosen loss function. Backpropagation (LeCun et al., 2015) is a fundamental algorithm used to train neural networks. It employs the chain rule of calculus to compute the gradients of the loss function with respect to all weights and biases, indicating the direction and magnitude by which they should be adjusted to reduce the error. Subsequently, an optimization algorithm utilizes these gradients to update the weights and biases to minimize the loss function and improve network performance. TensorFlow (Abadi, 2016) and PyTorch (Paszke, 2019) are mainstream software platforms for training neural networks. Backpropagation and optimization together form an iterative feedback loop of computing loss-function gradients and adjusting parameters, and this loop is fundamental to the neural network learning process. However, the vanishing gradient problem may arise during the training of deep neural networks, where gradients become extremely small as they propagate through layers, making learning slow or ineffective.

Each neuron in a fully connected (FC) layer, also known as dense layer, is connected to every neuron in the previous layer, and its output is passed to every neuron in the next layer (Figure 3B). A feedforward neural network (FNN) is a type of ANN where information flows in only one direction, from the input layer, through one or more hidden layers, to the output layer, without feedback loops or cycles (Figure 3B). Multilayer perceptrons (MLPs) are a class of FNNs identified by at least three layers: an input layer, one or more hidden layers, and an output layer, with neurons typically connected in a dense or fully connected (FC) manner (Figure 3B).
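As a minimal illustration of the concepts above (fully connected layers, a loss function, backpropagation, and an optimizer), the PyTorch sketch below defines a small MLP and runs one training step on random data; the layer sizes, class count, and optimizer choice are arbitrary assumptions made for the example.

```python
import torch
import torch.nn as nn

# A small MLP: input layer -> two hidden layers -> output layer
mlp = nn.Sequential(
    nn.Linear(16, 32), nn.ReLU(),   # hidden layer 1
    nn.Linear(32, 32), nn.ReLU(),   # hidden layer 2
    nn.Linear(32, 3),               # output layer, e.g., 3 classes
)

loss_fn = nn.CrossEntropyLoss()                 # loss function for classification
optimizer = torch.optim.Adam(mlp.parameters())  # optimizer updating weights and biases

x = torch.randn(8, 16)          # a batch of 8 random feature vectors
y = torch.randint(0, 3, (8,))   # random ground-truth class labels

optimizer.zero_grad()           # clear gradients from any previous step
loss = loss_fn(mlp(x), y)       # forward pass and loss computation
loss.backward()                 # backpropagation: gradients via the chain rule
optimizer.step()                # adjust weights and biases to reduce the loss
```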

The need to handle the unique characteristics of different data modalities has driven the development of specialized neural network architectures (Bronstein, 2021) (Table 1). For example, the need for efficient local feature extraction in image processing led to the development of convolutional neural networks (CNNs) (LeCun and Bengio, 1998). Complex neural network architectures often incorporate fully connected layers and MLPs as sub-components. Neural network architectures commonly used in automated model-building methods for cryo-EM are described below.


Table 1. Neural network architectures to handle specific data types.

3.3 Grid-structured data and convolutional neural networks (CNNs)

Grid-structured data can be naturally represented as a regular, multi-dimensional array or grid, where each grid element has a fixed spatial or temporal relationship to its neighbors, thereby enabling spatial or temporal locality in the structure. Common examples include images, represented as 2D grids of pixels containing intensity values, and volumetric data such as 3D cryo-EM density maps, represented as 3D grids of voxels containing density values. Convolutional neural networks (CNNs) are specifically designed to process grid-structured data by exploiting inherent spatial or temporal locality in the data to detect complex patterns and extract hierarchical features (LeCun and Bengio, 1998; Alzubaidi et al., 2021). A CNN comprises one or more convolutional layers, where the values in the next layer are computed by applying learnable convolutional filters across the input grid, producing another grid-like layer as output (Figure 3C). Each such output is known as a feature map, which contains specific features extracted from the input data (Figure 3C). The convolution operation enforces local connectivity, such that each neuron in the convolutional layer connects only to a small, localized region of the input such as a patch of pixels in an image or a patch of voxels in a volumetric input (Figure 3C). The extent of this local connectivity is referred to as the receptive field of a neuron in the convolutional layer. Dilated convolutions are convolution filters with spacing between their elements, which increases the receptive field. To extract a particular feature, the same filter is applied across the entire input grid. This parameter sharing reduces the number of trainable network parameters and allows CNNs to detect the same feature regardless of its location in the input. In cryo-EM, 3D CNNs are the go-to architecture for processing volumetric 3D density maps, as they can automatically and adaptively learn hierarchical patterns and features. 3D CNNs use 3D convolutional layers, where a 3D filter is applied across the volumetric input along all three spatial dimensions (x, y, z) producing a 3D output volume as a feature map that captures patterns across all dimensions (Figure 3C). Pooling layers are typically inserted between successive convolutional layers to reduce computational load and improve network robustness. They downsample the feature maps generated by convolutional layers by reducing their spatial dimensions. This dimensionality reduction makes detected features more robust to variations in the input (e.g., translations) and decreases computational complexity. Max pooling is commonly used, where a filter slides over the feature map and selects the maximum value from each local region, reducing its size and emphasizing important features. A CNN classifier, located at the end of the CNN architecture after several convolutional and pooling layers, uses fully connected (FC) layers to classify the high-level features extracted from the input grid into specific categories. In an FC layer, each neuron is connected to all neurons of the previous layer, like a conventional multilayer perceptron (MLP) (Figure 3B).
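The following sketch shows a minimal 3D CNN of the kind described above applied to a single-channel density sub-volume; the filter counts, kernel sizes, 64³ crop size, and four output classes are illustrative assumptions, not the configuration of any published tool.

```python
import torch
import torch.nn as nn

# Minimal 3D CNN: two convolution + max-pooling stages followed by an FC classifier
cnn3d = nn.Sequential(
    nn.Conv3d(1, 8, kernel_size=3, padding=1), nn.ReLU(),   # learnable 3D filters
    nn.MaxPool3d(2),                                        # downsample feature maps
    nn.Conv3d(8, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool3d(2),
    nn.Flatten(),
    nn.Linear(16 * 16 * 16 * 16, 4),   # classifier over a 64^3 input crop, 4 classes
)

density_crop = torch.randn(1, 1, 64, 64, 64)   # (batch, channel, x, y, z) voxel grid
print(cnn3d(density_crop).shape)               # -> torch.Size([1, 4])
```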

While the traditional CNN architecture is well suited for image classification, many tasks require pixel- or voxel-wise classification (such as semantic segmentation, where a label is assigned to every pixel or voxel) together with multi-scale feature extraction. The U-Net architecture, with its characteristic ‘U’ shape, was specifically designed for this purpose (Ronneberger et al., 2015). The architecture of U-Nets consists of an encoder–decoder structure with skip connections (Figure 3D). The encoder (contracting path), like a standard CNN, uses convolution and pooling (downsampling) layers to reduce the spatial dimensions, processing the grid-structured input to extract high-level feature maps and capture semantic information (Figure 3D). The encoder leads to the bottleneck layer at the bottom of the ‘U’-shaped architecture, which contains an abstract, compact feature representation called the latent representation (Figure 3D). The decoder (expanding path) then takes these features and uses deconvolution or transposed convolution (upsampling) layers to restore the spatial dimensions and reconstruct the output (Figure 3D). A key feature of the U-Net is its use of skip connections, where the input to each (deconvolutional) layer in the decoder combines the output from the previous layer with the corresponding output from an encoder (convolutional) layer, allowing information to bypass intermediate layers and preserve spatial details (Figure 3D). Like 3D CNNs, 3D U-Nets are designed for segmentation of 3D grid-structured data (Çiçek, 2016). 3D U-Nets excel at precise voxel-wise segmentation in cryo-EM density maps, accurately identifying backbone atoms, secondary structures, amino acid types, and nucleotides.
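To illustrate the encoder-bottleneck-decoder layout and the role of skip connections, here is a deliberately small, single-level 3D U-Net-style module for voxel-wise classification; real segmentation networks use several levels and many more channels, and the four output classes (e.g., background, helix, sheet, coil) are only an assumption for this example.

```python
import torch
import torch.nn as nn

class TinyUNet3D(nn.Module):
    """One-level 3D U-Net sketch: encoder, bottleneck, decoder with a skip connection."""
    def __init__(self, n_classes=4):
        super().__init__()
        self.enc = nn.Sequential(nn.Conv3d(1, 8, 3, padding=1), nn.ReLU())
        self.down = nn.MaxPool3d(2)                                   # contracting path
        self.bottleneck = nn.Sequential(nn.Conv3d(8, 16, 3, padding=1), nn.ReLU())
        self.up = nn.ConvTranspose3d(16, 8, 2, stride=2)              # expanding path
        self.dec = nn.Sequential(nn.Conv3d(16, 8, 3, padding=1), nn.ReLU())
        self.head = nn.Conv3d(8, n_classes, 1)                        # voxel-wise labels

    def forward(self, x):
        e = self.enc(x)                         # encoder features (kept for the skip)
        b = self.bottleneck(self.down(e))       # compact latent representation
        d = self.up(b)                          # upsample back to input resolution
        d = self.dec(torch.cat([d, e], dim=1))  # skip connection: reuse encoder features
        return self.head(d)                     # per-voxel class scores

print(TinyUNet3D()(torch.randn(1, 1, 32, 32, 32)).shape)  # -> torch.Size([1, 4, 32, 32, 32])
```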

The vanishing gradient problem arises when training deep CNNs or U-Nets. Residual neural networks (ResNets) use residual connections to address this issue, facilitating the training of much deeper networks (He, 2016). Residual connections are a specific type of skip connection in which the input of a layer is added to its output. Early in training, if some layers are unnecessary, ResNet can learn to ignore them via skip connections, effectively allowing the network to “skip” over these layers and mitigate the vanishing gradient issue. ResNets can serve as a backbone architecture for feature extraction from cryo-EM density maps.
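A minimal sketch of the residual connection described above: the block's input is added to its output, so if the convolutions contribute nothing useful the block can simply pass its input through. The channel count and input size are illustrative.

```python
import torch
import torch.nn as nn

class ResidualBlock3D(nn.Module):
    """Two 3D convolutions whose output is added back to the block input."""
    def __init__(self, channels=8):
        super().__init__()
        self.conv1 = nn.Conv3d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv3d(channels, channels, 3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        out = self.conv2(self.relu(self.conv1(x)))
        return self.relu(out + x)   # residual (skip) connection: add input to output

x = torch.randn(1, 8, 16, 16, 16)
print(ResidualBlock3D()(x).shape)   # same shape as the input
```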

Diffusion models are generative machine learning models that learn the underlying probability distribution of a dataset to create new data resembling the original (Ho, 2020). They operate by gradually adding noise to data (diffusion) and then learning to reverse this process to generate new data (denoising). The neural network architecture underlying the denoising step in diffusion models is often based on the U-Net architecture (Figure 3D). In cryo-EM, diffusion models have shown strong performance in denoising density maps, thereby enhancing model building.
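The core training idea behind denoising diffusion models can be written in a few lines: corrupt the data with a known amount of Gaussian noise and train a network to predict that noise. In the sketch below a single 3D convolution stands in for the U-Net denoiser, and the noise-schedule value is an arbitrary constant chosen for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

denoiser = nn.Conv3d(1, 1, 3, padding=1)    # stand-in for a U-Net noise predictor
opt = torch.optim.Adam(denoiser.parameters())

x0 = torch.randn(2, 1, 16, 16, 16)          # "clean" training volumes (random stand-ins)
alpha_bar = torch.tensor(0.7)               # cumulative noise-schedule term at a chosen step
noise = torch.randn_like(x0)

# Forward (diffusion) process: mix clean data with Gaussian noise
xt = alpha_bar.sqrt() * x0 + (1 - alpha_bar).sqrt() * noise

# Reverse (denoising) objective: the network learns to predict the added noise
loss = F.mse_loss(denoiser(xt), noise)
opt.zero_grad()
loss.backward()
opt.step()
```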

3.4 Graph-structured data and graph neural networks (GNNs)

A graph is a set of nodes (vertices) and edges (links), where each edge connects a pair of nodes (Figure 3E). An attributed graph is a graph in which each node and edge is associated with one or more features, called attributes (Figure 3E). Graph neural networks (GNNs) are neural network architectures specifically designed to operate on graph-structured data (Bronstein, 2021; Veličković, 2023). Attributed graphs are commonly used in GNNs and other applications where each component of the graph carries additional information beyond just connectivity (Figure 3E). In general, graphs do not have a canonical or fixed node ordering, and therefore graph operations are typically defined to be independent of node ordering. Accordingly, GNNs are designed to preserve the permutation symmetry of the input graph. A GNN accepts an attributed graph as input, performs learnable transformations on the node attributes while preserving permutation symmetry, and outputs a graph with updated node attributes but the same connectivity as the input graph (Figure 3E). GNNs progressively transform node representations through message passing operations, where each node iteratively updates its features by aggregating and transforming the features of its neighboring nodes using learnable functions (Figure 3E). Graph Convolutional Networks (GCNs) are neural networks designed for graph-structured data, performing convolution-like operations to aggregate and combine information from each node’s neighbors. Biomacromolecules such as proteins and nucleic acids are naturally represented as graph-structured data, where atoms or residues serve as nodes, and the bonds or interactions connecting them serve as edges (Bengio, 2023). In an attributed graph representing a biomacromolecule, nodes (atoms or residues) can carry features such as element or residue type, charge, hydrophobicity, secondary structure, or 3D coordinates (Bengio, 2023). Edges (bonds or interactions) may include attributes like bond type, bond order, interaction type (e.g., hydrogen bond), distance, or interaction energy (Bengio, 2023). These attributes encode chemical, structural, and functional information, enabling richer representations for GNN tasks such as property prediction, structural analysis, or molecular modeling (Bengio, 2023).
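As a hedged sketch of one message-passing round, the layer below sums neighbor features over the graph's edges and then applies a learnable update to each node; the feature size and the toy edge list are illustrative, and a production GNN would typically also use edge attributes, normalization, and several stacked layers.

```python
import torch
import torch.nn as nn

class MessagePassingLayer(nn.Module):
    """One round of message passing: aggregate neighbor features, then update each node."""
    def __init__(self, dim=8):
        super().__init__()
        self.update = nn.Sequential(nn.Linear(2 * dim, dim), nn.ReLU())

    def forward(self, node_feats, edges):
        # node_feats: (num_nodes, dim); edges: (2, num_edges) pairs of node indices
        src, dst = edges
        agg = torch.zeros_like(node_feats)
        agg.index_add_(0, dst, node_feats[src])   # sum incoming messages from neighbors
        # Update is applied identically to every node, preserving permutation symmetry
        return self.update(torch.cat([node_feats, agg], dim=-1))

# Toy attributed graph: 4 nodes (e.g., residues), undirected edges listed in both directions
feats = torch.randn(4, 8)
edges = torch.tensor([[0, 1, 1, 2, 2, 3], [1, 0, 2, 1, 3, 2]])
print(MessagePassingLayer()(feats, edges).shape)   # -> torch.Size([4, 8])
```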

3.5 Sequential data and recurrent neural networks (RNNs)

While grid and graph-structured input data are spatial in nature, many types of data operated on by machine learning models are sequential, such as text, speech, and time-series information. Recurrent neural networks (RNNs) are designed to process sequential data, where the order and context of information are important, and can be used to classify input sequences or predict sequence-dependent properties (Figure 3F) (Sutskever, 2014). Their primary applications are in natural language processing (NLP), including speech recognition, language translation, and text generation (Graves, 2013). Unlike feedforward neural networks, which contain no loops or cycles, neurons in RNNs have recurrent connections that form directed cycles. Recurrent connections enable RNNs to maintain an internal or hidden state that captures information about previously processed inputs, giving RNNs a notion of memory and allowing information to cycle within the network (Figure 3F). This mechanism enables RNNs to capture sequential and temporal dependencies in the input data. However, RNNs suffer from the vanishing gradient problem when trained on long sequences, which limits their ability to capture long-term dependencies. As a result, information from the early parts of a sequence may be lost by the time it reaches later steps. Gating mechanisms, implemented as special structures, were introduced in RNN architecture to address this problem. Standard RNNs update their hidden state at every step in the same way, causing early information to fade when sequences are long. Gates allow the network to decide what information to remember, forget, and output, enabling long-term memory. Long Short-Term Memory (LSTM) networks are gated RNNs consisting of a memory cell and three gates: input, forget, and output (Hochreiter and Schmidhuber, 1997). These gates control the flow of information into, out of, and within the memory cell, allowing it to retain important information over extended periods and enabling the network to effectively handle long-term dependencies in sequential data. LSTMs, often used in conjunction with other neural network architectures, have been applied to protein structure prediction and modeling (AlQuraishi, 2019).
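A minimal usage sketch of a gated RNN in PyTorch: an LSTM consumes a sequence of feature vectors while maintaining a hidden state and a memory cell across steps. The sequence length and feature dimensions are arbitrary choices for illustration.

```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=16, hidden_size=32, batch_first=True)

seq = torch.randn(1, 50, 16)          # one sequence of 50 steps, 16 features per step
outputs, (h_n, c_n) = lstm(seq)       # per-step outputs plus final hidden and cell states

print(outputs.shape)   # torch.Size([1, 50, 32]) - one output per sequence position
print(h_n.shape)       # torch.Size([1, 1, 32])  - final hidden state (the network's "memory")
```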

3.6 Attention mechanisms and transformer networks

The sequential nature of input processing in RNNs makes it difficult to access specific parts of the sequence and to capture long-range dependencies when generating outputs. Attention mechanisms alleviate this limitation by allowing RNNs to focus on different parts of the input sequence (Bahdanau, 2014). Transformer networks, which achieved state-of-the-art results in NLP tasks, introduced the self-attention mechanism, enabling them to dynamically access all parts of the input sequence simultaneously and effectively capture long-range dependencies (Vaswani, 2017). Unlike the static filters of CNNs, self-attention dynamically computes filters based on the input, allowing Transformers to naturally process irregular and non-grid structured data. Unlike the attention used in RNNs, self-attention in Transformers analyzes all sequence elements in parallel and weighs their importance relative to each other, thus capturing both local and distant dependencies. In contrast to RNNs and LSTMs, Transformers enable parallel processing of sequential data during both training and inference, leading to significantly faster training times and making them well-suited for handling large-scale protein sequence datasets. Transformers generally adopt an encoder–decoder architecture, where the encoder applies self-attention to learn relationships within the input sequence, and the decoder uses attention mechanisms to generate the output sequence while incorporating information from the encoder. In biological applications, Transformers model long-range interactions in protein sequences essential for protein structure prediction tasks (Ling, 2025). Originally designed for sequential data, Transformers have been adapted with domain-specific architectural modifications to process grid structured data (e.g., Vision Transformers (Dosovitskiy, 2020)) and graph structured data (e.g., Graph Transformers (Müller, 2023)), making them highly versatile across multiple domains.
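The snippet below demonstrates self-attention on a toy sequence using PyTorch's built-in multi-head attention module; queries, keys, and values all come from the same input, which is what lets every position attend to every other position in parallel. The embedding size, head count, and sequence length are illustrative.

```python
import torch
import torch.nn as nn

attn = nn.MultiheadAttention(embed_dim=32, num_heads=4, batch_first=True)

tokens = torch.randn(1, 10, 32)                 # a sequence of 10 embedded elements
out, weights = attn(tokens, tokens, tokens)     # self-attention: Q, K, V are the same sequence

print(out.shape)       # torch.Size([1, 10, 32]) - updated representations
print(weights.shape)   # torch.Size([1, 10, 10]) - how much each position attends to every other
```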

The Swin-Conv-UNet was developed to improve performance in semantic segmentation (Zhang et al., 2023). It is a hybrid neural network architecture combining CNNs and transformer-based attention mechanisms within a U-Net framework. Like U-Net, it uses an encoder–decoder structure with skip connections to capture multi-scale features and preserve spatial details (Figure 3D). Swin Transformer blocks are integrated to model long-range dependencies, allowing the network to capture both local and global contextual information. The convolutional components (Conv) learn fine-grained local features, the transformer components (Swin) model complex global relationships, and the U-Net framework (UNet) combines them across multiple scales, making Swin-Conv-UNet particularly effective for high-resolution, structured data tasks.

4 Deep learning-based automated model building in cryo-EM using a multi-modal approach

Deep learning methods for model building in cryo-EM density maps differ in the underlying neural network architecture, prediction task, target biomolecules, and the level of structural detail in the resulting models (Tables 2,3). However, most approaches follow a common multi-stage pipeline that transforms raw cryo-EM density maps into molecular models. Briefly, the representative steps are (Figure 4): (1) preprocessing the input density map; (2) applying deep neural networks to learn features from the preprocessed density map that identify different aspects of a biomolecular structure; and (3) model building from the learned features followed by refinement.


Table 2. Architecture of deep learning-based automated model building methods in cryo-EM.


Table 3. Available cryo-EM structure modeling tools across resolutions and molecules.


Figure 4. Schematic overview of the representative steps used by deep learning-based tools for automated model building in cryo-EM, using a density map and the model of ryanodine receptor 1 as input and output, respectively. Illustrations of the deep neural networks (DNNs) are generated using NN-SVG (LeNail, 2019).

4.1 Training datasets and preprocessing

Deep neural networks underlying automated model building methods in cryo-EM are trained and tested on large, labeled datasets comprising experimental and/or simulated density maps (Table 4). Since few experimental cryo-EM density maps were available in the EMDB (Consortium, 2023) during the early years (Figure 1), the first deep learning-based methods for cryo-EM model building relied on simulated or synthetic maps, generated from PDB structures, for training and testing (Table 4). Utilities such as pdb2mrc, pdb2vol, or molmap from the EMAN2 package (Bell et al., 2018), the Situs package (Wriggers, 2010), or UCSF Chimera (Pettersen et al., 2004) can be used to simulate cryo-EM density maps from PDB models at different resolutions. As the cryo-EM resolution revolution increased the number of experimental cryo-EM density maps in the EMDB (Figure 1), subsequent deep learning-based methods for cryo-EM model building began to utilize these maps for training and testing (Table 4). Simulated maps are still useful in cases where experimental density maps do not provide enough data to train deep learning models for a specific model-building task. Therefore, many cryo-EM model-building methods combine simulated and experimental density maps to train their deep neural networks (Table 4). Before training, the input cryo-EM density maps are first preprocessed using resampling and normalization. Resampling involves standardizing the varying voxel sizes of raw density maps to a uniform size. The resampled maps are then normalized to make density values comparable across different maps. Alternatively, publicly available datasets such as Cryo2Struct2Data (Giri et al., 2024) can be utilized. This dataset is a significant resource comprising 7,600 preprocessed cryo-EM density maps, in which voxels are labeled based on known atomic structures, making it suitable for training and testing deep learning-based model building methods in cryo-EM.
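A hedged sketch of the resampling and normalization steps described above, using the mrcfile and SciPy libraries; the 1.0 Å target voxel size and the z-score style normalization are common choices but are assumptions here rather than the recipe of any specific tool, and the file name is a placeholder.

```python
import numpy as np
import mrcfile
from scipy.ndimage import zoom

def preprocess_map(path, target_voxel=1.0):
    """Resample a cryo-EM density map to a uniform voxel size and normalize its values."""
    with mrcfile.open(path, permissive=True) as mrc:
        density = mrc.data.astype(np.float32)
        voxel = float(mrc.voxel_size.x)          # original voxel size in Angstroms

    # Resample the grid so that every map uses the same voxel spacing
    density = zoom(density, voxel / target_voxel, order=1)

    # Normalize so density values are comparable across different maps
    density = (density - density.mean()) / (density.std() + 1e-8)
    return density

# volume = preprocess_map("emd_XXXX.map")   # placeholder file name
```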


Table 4. Characteristics of training datasets used by deep learning-based automated model building methods in cryo-EM. (*the split of the dataset into training and test sets is not known; #the number of cryo-EM structures is not known).

4.2 Feature learning

During training, deep neural networks automatically learn hierarchical, increasingly abstract representations from preprocessed cryo-EM density maps, enabling them to identify key structural features for model building. Specifically, deep neural networks learn voxel-wise representations from preprocessed cryo-EM density maps that predict multiple structural features such as backbone atom positions, residue types, and secondary structures at each voxel, offering a preliminary, coarse-grained representation of the molecular structure that can guide subsequent model construction. Deep learning-based automated model building methods employ different neural network architectures for feature learning (Table 2), with specific details provided for each method in the following sections.

4.3 Model building

Since the voxel-wise representations learned by deep neural networks from preprocessed cryo-EM density maps provide only a coarse structural representation, they must be further processed to generate a complete atomic model. This broadly involves backbone tracing, sequence assignment, and full-atom reconstruction. The learned voxel representations serve as a basis for constructing an initial structural model. Many methods represent this as a graph where nodes correspond to residues, secondary structure elements, or small structural fragments, while edges denote spatial proximity or potential chemical bonds, which is then iteratively processed and refined. Backbone tracing generates chains and fragments by connecting predicted backbone atoms while incorporating stereochemical constraints. This is a complex and challenging step due to the factorial growth in the number of ways spatially distributed atoms can be connected, requiring the use of advanced optimization algorithms. Sequence assignment involves correctly threading the biomolecular sequence onto the traced backbone structures, followed by full-atom reconstruction including side chains. Details of the model building step for each method are summarized in Table 2 and described in the following sections.
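To illustrate the graph formulation described above, the sketch below connects predicted Cα positions whose pairwise distances fall near the ideal 3.8 Å Cα–Cα spacing and extracts simple chain fragments as connected components; the tolerance and the component-based extraction are simplifications compared with the combinatorial optimizers (e.g., TSP or VRP solvers) that real tools employ.

```python
import numpy as np
import networkx as nx

def trace_fragments(ca_coords, target=3.8, tol=0.7):
    """Build a graph of candidate Calpha-Calpha connections and return chain fragments."""
    g = nx.Graph()
    g.add_nodes_from(range(len(ca_coords)))
    # Edge between two predicted Calpha atoms if their distance is near the ideal spacing
    for i in range(len(ca_coords)):
        for j in range(i + 1, len(ca_coords)):
            d = np.linalg.norm(ca_coords[i] - ca_coords[j])
            if abs(d - target) <= tol:
                g.add_edge(i, j, weight=abs(d - target))
    # Each connected component is a candidate backbone fragment; real tools resolve
    # branching and chain direction with optimization (e.g., TSP/VRP solvers or
    # constraint programming) rather than taking components at face value.
    return [sorted(c) for c in nx.connected_components(g)]

# Toy example: five collinear points spaced 3.8 A apart form one fragment
coords = np.array([[3.8 * i, 0.0, 0.0] for i in range(5)])
print(trace_fragments(coords))   # -> [[0, 1, 2, 3, 4]]
```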

4.4 Refinement

Deep learning–based automated model-building methods often employ established structural biology tools, including phenix.real_space_refine (Afonine et al., 2018a), molecular mechanics force fields, molecular dynamics flexible fitting (MDFF) (McGreevy et al., 2016) and COOT (Emsley and Cowtan, 2004), to further refine reconstructed atomic models. These tools utilize energy minimization techniques, apply physical and stereochemical constraints, and incorporate empirical knowledge-based restraints to refine models, thereby ensuring that the biomacromolecule retains correct chemical, stereochemical, and geometric properties including realistic bond lengths and angles as well as minimal steric clashes. The application of phenix.real_space_refine (Afonine et al., 2018a) significantly improves side-chain conformations and map–model correlations. In some workflows, this step is followed by additional refinement using molecular dynamics (MD) simulations or energy minimization algorithms, which further optimize the atomic arrangement by simulating physical forces and interactions.

4.5 Deep learning-based automated model building methods

Biomacromolecules exhibit hierarchical levels of structural complexity and organization, ranging from the linear sequence of ordered residues to the complex three-dimensional spatial arrangement of atoms (Section 2). In this section, we group deep learning-based automated model-building methods by their ability to predict primary (Section 4.5.1), secondary (Section 4.5.2), and tertiary or quaternary (Section 4.5.3) structural aspects of biomacromolecules. Deep learning tools for building atomic models in cryo-EM density maps are further grouped as de novo (Section 4.5.3.1), where the model is predicted directly from features learned from the cryo-EM density, or hybrid (Section 4.5.3.2), where it is derived by integrating structural templates with these features. It should be noted that methods grouped under tertiary or quaternary structure also extract structural features at the secondary (such as α-helices and β-sheets) or primary structure level, but their ultimate prediction task is to determine the three-dimensional arrangement of residues and atoms. Methods grouped under secondary structure, in contrast, focus only on predicting secondary structure classes from the input cryo-EM density maps.

4.5.1 Primary structure prediction

The primary structure of proteins and nucleic acids is essentially their linear sequence of residues covalently linked to each other along the polymer chain. Neural networks can predict residue type probabilities from cryo-EM reconstructions, enabling the assignment and validation of protein or nucleic acid sequences. Tools specifically designed for this task are described below.

findMySequence is a computer program that identifies protein sequences from cryo-EM reconstructions and crystallographic data (Chojnowski et al., 2022). To achieve this, it uses machine-learning-predicted residue-type probabilities to query sequence databases with the HMMER suite (Eddy, 2011). To predict residue-type probabilities from the cryo-EM map and main-chain model, findMySequence utilizes a neural network with two fully connected hidden layers, offering improved performance over a previous support-vector-machine-based classifier. To identify a sequence that matches the predicted residue-type probabilities, the probabilities are first converted into a multiple sequence alignment (MSA) and then into a profile Hidden Markov Model (profile-HMM) using the HMMER suite. This profile-HMM is then used to query a sequence database to find plausible matches (Box 1: Glossary section). To build side chains for an input main-chain model fragment, findMySequence considers all possible alignments of the fragment to the target sequence and selects the most plausible one based on the predicted residue-type probabilities to assign residue types to the fragment.

Box 1 Glossary section.
SE(3): Special Euclidean Group of degree three is a group of all rigid motions in 3D space and combines rotations and translations. It describes transformations that preserve shape and size of objects.

Monte Carlo Tree Search (MCTS): MCTS is a powerful algorithmic framework used for decision making in sequential decision processes. MCTS builds a search tree by simulating many possible moves and selecting the most promising path.

Profile Hidden Markov Model (profile HMM): A profile hidden Markov model (profile HMM) is a probabilistic model constructed from a multiple sequence alignment (MSA) of related protein or nucleic acid sequences. It captures both conservation and variability in the alignment by defining match states for conserved positions, insert and delete states for regions where extra residues or gaps are likely, and probabilities that describe which residues are observed at each aligned column. The model is parameterized by two types of probabilities: transition probabilities, which specify the likelihood of moving between match, insert, and delete states as the sequence progresses along the profile, and emission probabilities, which define the likelihood of observing a specific residue from a match or insert state. To assess whether a new sequence belongs to the family, the model computes the probability that the sequence could be generated by traversing the profile’s states, emitting the observed residues along the way. The higher this probability, the more likely the new sequence is a member of that family.

Dynamic Programming: Algorithms designed for solving optimization problems by breaking them into simpler, smaller and overlapping subproblems.

Constraint Programming: Techniques developed for solving combinatorial problems where the solution must satisfy a set of constraints.

Mean-Shift Algorithm: Non-parametric clustering technique to locate the highest density points of a distribution or simply identify dense regions in a dataset.

Traveling Salesman Problem (TSP) Solver: The goal of the solver is to find the shortest possible route that visits a set of points exactly once and returns to the starting point, thereby solving this well-known combinatorial optimization problem.

Vehicle Routing Problem (VRP) Solver: The goal of the solver is to optimize routes for multiple vehicles visiting a set of locations or points while satisfying a set of constraints. The VRP is a generalization of the TSP.

checkMySequence is a new, automated tool designed to quickly and accurately detect register shifts in protein models built into cryo-EM density maps including large macromolecular complexes (Chojnowski, 2022). To detect register shifts, checkMySequence first identifies a reference sequence for each protein chain in the input model. This step uses a protocol from the findMySequence program (Chojnowski et al., 2022), employing a neural network classifier to generate residue-type probability profiles from the cryo-EM density map and input model. These probabilities are then used with the HMMER suite (Eddy, 2011) to query sequence databases, scoring plausible matches. Once reference sequences are established, checkMySequence assigns fragments of the input model to them, identifying areas where this new assignment conflicts with the original sequence assignment in the input model. If a discrepancy is found, the method suggests a more plausible sequence assignment.

doubleHelix is a computer program designed for the assignment, identification, and validation of nucleic acid sequences in structures determined using both cryo-EM and X-ray crystallography (Chojnowski, 2023). doubleHelix combines neural network classifiers to identify nucleobases with a sequence-independent secondary structure assignment approach for comprehensive nucleic acid analysis. The neural network classifiers estimate the likelihood of a nucleotide being a purine or a pyrimidine based on a provided backbone model and its corresponding density map. Each neural network classifier is built with two fully connected hidden layers. After estimating purine and pyrimidine probabilities using the neural network, doubleHelix then identifies base pairs in RNA and DNA models by aligning recurring nucleic acid structural motifs to the target model with base-pairing restraints derived from the backbone conformation. This process involves superimposing small search-fragments of known secondary structures onto the input model using an algorithm from the Brickworx program (Chojnowski et al., 2015). For sequence identification, doubleHelix leverages the INFERNAL suite (Nawrocki and Eddy, 2013) to find the most plausible sequence in a database, using the input model’s residue-type probabilities and secondary structure. Similarly, it uses these estimated base-type probabilities to assign RNA or DNA models to a target sequence.

EMSequenceFinder is a deep learning method that assigns amino acid sequences to backbone fragments traced in cryo-EM density maps (Mondal et al., 2025). The method requires a cryo-EM density map with a resolution better than 8.0 Å, the corresponding backbone traces fitted or traced into the map (specifying N, Cα, C, and O atom positions), and one or more protein sequences. EMSequenceFinder quantifies the density map fit of traces using a 3D Convolutional Neural Network (CNN), trained on a large dataset of cryo-EM density maps. The network determines the likelihood of each residue type by extracting features from the side chain voxel intensities. These extracted features are then combined with additional input, such as map resolution and the secondary structure type of the trace containing the residue, to output the probability for each amino acid. EMSequenceFinder then assigns protein sequences to backbone fragments by identifying the best-scoring threading among all possible alignments of a sequence to the backbone trace. It achieves this using a Bayesian scoring function that ranks the 20 standard amino acid types at each backbone position, considering the fit to the density map (quantified above using a 3D CNN), map resolution, and secondary structure propensity. The output is the best-scoring threading for each input backbone trace.
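The threading step can be sketched in a few lines: slide the fragment along the target sequence, score each offset by how well the sequence-dictated residue types agree with the predicted per-position probabilities, and keep the best offset. This sketch uses only log-probabilities; EMSequenceFinder's actual Bayesian score also folds in map resolution and secondary structure propensity, and the toy data below are assumptions for illustration.

```python
# Minimal sketch of threading a backbone fragment onto a target sequence:
# for every alignment offset, sum the log-probabilities that the fragment's
# positions adopt the residue types dictated by the sequence, and keep the
# best-scoring offset.
import math
import numpy as np

AA = "ACDEFGHIKLMNPQRSTVWY"
AA_INDEX = {a: i for i, a in enumerate(AA)}

def best_threading(frag_probs: np.ndarray, sequence: str) -> tuple[int, float]:
    """frag_probs: (L, 20) per-position residue-type probabilities."""
    L = frag_probs.shape[0]
    best = (-1, -math.inf)
    for offset in range(len(sequence) - L + 1):
        window = sequence[offset:offset + L]
        score = sum(math.log(frag_probs[i, AA_INDEX[aa]] + 1e-9)
                    for i, aa in enumerate(window))
        if score > best[1]:
            best = (offset, score)
    return best

# Toy data: a 4-residue fragment whose predictions clearly prefer "GAVL".
frag = np.full((4, 20), 0.01)
for i, aa in enumerate("GAVL"):
    frag[i, AA_INDEX[aa]] = 0.81           # rows sum to 1
offset, score = best_threading(frag, "MKTGAVLSDE")
print(offset, round(score, 2))             # expected offset 3
```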

4.5.2 Secondary structure prediction

Secondary structure refers to local, stable, and regular conformations of the polymer backbone, such as α-helices and β-sheets in proteins, and regular helical segments in nucleic acids. Deep learning tools specifically designed to predict voxel-wise probabilities of secondary structure types from the cryo-EM density maps are described below in an approximate chronological order.

CNN-classifier is a 3D convolutional neural network (3D CNN) designed to automatically detect secondary structures of proteins in cryo-EM density maps (Li et al., 2016). It predicts the probability of a voxel belonging to specific protein secondary structure elements (SSEs), such as α-helices, β-sheets, or background. By using 3D convolutions, the model effectively captures 3D spatial information within the protein structures, learning discriminative features automatically from cryo-EM density maps. To enhance efficiency, the CNN-classifier incorporates multiple deconvolution operations at various intermediate layers to generate feature maps of the same dimension, which are then summed to create a multi-scale representation. The CNN-classifier combines inception learning and residual learning with dilated convolutions. The inception component uses multiple convolutional layers with different filter sizes to create diverse paths between network hidden layers. This increases the number of trainable parameters at each step without increasing the overall computational complexity. Residual learning employs shortcut identity mappings to simulate nonlinear relationships between input and output layers, allowing the network to achieve accurate results without added computational cost. Dilated convolutions, integrated into the inception module, expand the network’s receptive field, allowing it to capture information across various scales without losing image resolution.

Emap2sec is a deep learning method designed to identify protein secondary structures (α-helices, β-sheets, and other structures) directly from cryo-EM density maps with resolutions between 5 and 10 Å (Maddhuri Venkata Subramaniya et al., 2019). It employs a 3D convolutional neural network (3D CNN) to assign a secondary structure to each grid point. The core of Emap2sec is a two-phase stacked neural network, in which the Phase 1 network analyzes the normalized density of a single voxel and outputs probability values for the three secondary structure classes (α-helix, β-sheet, and other structure, defined as structure that is neither α-helix nor β-sheet). The Phase 2 network then refines these initial predictions by incorporating contextual information from neighboring voxels. Ultimately, each voxel is assigned to the secondary structure class with the highest probability among the three types.
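The phase-1 idea, classifying a local density cube into a handful of secondary structure classes, can be sketched with a very small 3D CNN. The network below is illustrative only and is not Emap2sec's published architecture or hyperparameters; the cube size, layer widths, and class ordering are assumptions.

```python
# Minimal PyTorch sketch of voxel-wise secondary structure classification:
# a small 3D CNN takes a cube of normalized density around a voxel and
# outputs logits for three classes (helix, sheet, other).
import torch
import torch.nn as nn

class VoxelSSEClassifier(nn.Module):
    def __init__(self, n_classes: int = 3):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv3d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv3d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool3d(2),                      # 11^3 -> 5^3
            nn.Conv3d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1),              # global average pooling
        )
        self.classifier = nn.Linear(64, n_classes)

    def forward(self, cube: torch.Tensor) -> torch.Tensor:
        # cube: (batch, 1, D, H, W) normalized density values
        x = self.features(cube).flatten(1)
        return self.classifier(x)                 # raw class logits

model = VoxelSSEClassifier()
cube = torch.randn(8, 1, 11, 11, 11)              # batch of 11^3 density cubes
probs = torch.softmax(model(cube), dim=1)         # per-class probabilities
print(probs.shape)                                # torch.Size([8, 3])
```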

Emap2sec+, successor of Emap2sec, is designed to identify DNA or RNA in addition to protein secondary structures directly from cryo-EM maps at 5–10 Å resolution (Wang et al., 2021). It employs a deep residual convolutional neural network (ResNet) and frames the task as classification rather than segmentation, based on the prior success of Emap2sec and the expectation that classification performs better than segmentation at intermediate resolutions. Emap2sec+ classifies individual voxels from cryo-EM density maps into one of four structural categories: DNA/RNA or a protein secondary structure (α-helix, β-sheet, or other structure). This classification occurs through a two-phase neural network. In Phase 1, an input voxel undergoes five independent evaluations: four binary classifiers, each determining the probability of a specific structure at the voxel (DNA/RNA, α-helix, β-sheet, or other structure), and a fifth multi-class classifier providing probabilities for all four categories. The probability values from these Phase 1 classifiers are then fed into the Phase 2 network, which refines the final probability assignments of the four structure classes for the central voxel by considering the structural predictions of neighboring voxels, which effectively smooths the structural assignment across the entire cryo-EM density map.

Haruspex is a deep learning tool that leverages convolutional neural networks to automatically identify and annotate protein secondary structure elements and RNA/DNA within cryo-EM density maps at an average map resolution of 4.0 Å or better (Mostosi et al., 2020). Haruspex utilizes a U-Net-style convolutional neural network. It processes voxel segments through multiple convolutional and pooling layers and extracts features to identify different structural elements, which are later combined to restore spatial detail. The final output provides a probability for each voxel, indicating whether it belongs to an α-helix, β-strand, nucleotide, or is unassigned. This essentially annotates the cryo-EM density map with the locations of these key biomolecular structures.

EMNUSS utilizes a nested 3D U-Net architecture (UNet++) to annotate secondary structures in cryo-EM density maps, effective at both intermediate and high resolutions (He and Huang, 2021a). This design, which starts with an encoder subnetwork followed by a decoder subnetwork, concatenates the outputs of these subnetworks, after which a final convolution layer assigns secondary structure classifications. For each voxel in the annotated region, EMNUSS provides three channels containing probabilities that indicate whether the voxel is close to an α-helix residue, a β-strand residue, or a coil residue. Compared to the U-Net architecture, UNet++ incorporates dense skip connections on its skip pathways, which significantly enhances gradient flow.

DeepSSETracer employs a neural network architecture of end-to-end convolution operations, adapted from the 3D U-Net architecture (Çiçek, 2016), to detect secondary structures within cryo-EM component maps for individual chains at medium resolution, rather than analyzing an entire cryo-EM density map, which may contain multiple chains (Mu et al., 2021). This architecture allows DeepSSETracer to process density maps of varying sizes, predicting the probability of each voxel belonging to an α-helix, β-sheet, or other structure. The tool is distributed as a ChimeraX (Meng et al., 2023) bundle that combines secondary structure prediction with the visualization capabilities of ChimeraX.

Cryo-EM reconstruction can yield two equally consistent, but mirror-image, 3D density maps. Since proteins have a specific handedness, only one reconstruction is correct. Currently, biologists must manually determine this by inspecting the rotations of α-helices within the map, a task that becomes challenging at lower resolutions as helices lose their distinct handedness. HaPi (Handedness Pipeline) uses two 3D convolutional neural networks (CNNs) to automatically determine the handedness of the cryo-EM density map for resolutions up to 5.0 Å (Garcia Condado et al., 2022). The first network, AlphaVolNet, identifies the location of α-helices throughout the entire map by processing each non-background voxel. The second network, HandNet, then evaluates the overall handedness of the map using a consensus strategy that averages the individual handedness predictions.

CryoSSESeg is a convolutional neural network (CNN) framework designed to identify the organization of protein secondary structure elements within medium-resolution cryo-EM density maps (Sazzed, 2024). CryoSSESeg works by first isolating individual protein chains from an atomic model and then using their coordinates to extract and mask the corresponding regions in the cryo-EM density map. This chain-based cropping ensures that entire secondary structures are present within the image, which helps the model learn more effectively. The STRIDE secondary structure annotation tool (Heinig and Frishman, 2004) is then used to label individual voxels within the density map. The network is an adaptation of the 3D U-Net model, featuring a downsampling path (contracting path) that condenses spatial information, a bottleneck layer that further compresses data and enhances feature representation, and an upsampling path (expansive path) that recovers spatial detail. The final layer outputs three channels, each corresponding to one of three classes: α-helix, β-sheet, or background.

EMInfo is a deep learning method that uses a 3D UNet++ architecture to automatically detect protein secondary structures and nucleic acid locations in cryo-EM density maps (Cao et al., 2025). UNet++ is a supervised encoder-decoder network comprising downsampling, upsampling, and skip connections. This design allows UNet++ to implement multi-scale feature extraction from input maps without significantly increasing computational cost. EMInfo outputs a four-channel probability for each voxel, representing the likelihood of it belonging to different structural categories. For each voxel, the channel with the highest probability is selected as its predicted category, ultimately generating a structure annotation map that directly corresponds to the input density map. EMInfo restricts structural annotation to voxels above a specified contour level, treating lower-density voxels as background so that they do not receive structural labels.
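The final per-voxel assignment described above amounts to an argmax over class channels combined with a background mask. The sketch below shows this step on toy arrays; the channel ordering and contour value are illustrative assumptions, not EMInfo's actual conventions.

```python
# Sketch of turning a four-channel voxel probability map into an annotation
# map, masking voxels whose density falls below a contour level.
import numpy as np

CLASSES = ["helix", "sheet", "other", "nucleic_acid"]   # assumed channel order
BACKGROUND = -1

def annotate(prob_map: np.ndarray, density: np.ndarray,
             contour: float) -> np.ndarray:
    """prob_map: (4, X, Y, Z); density: (X, Y, Z). Returns integer labels."""
    labels = prob_map.argmax(axis=0)                 # highest-probability class
    labels[density < contour] = BACKGROUND           # mask low-density voxels
    return labels

# Toy example on a 2x2x2 map.
rng = np.random.default_rng(0)
probs = rng.random((4, 2, 2, 2))
probs /= probs.sum(axis=0, keepdims=True)            # normalize per voxel
density = rng.random((2, 2, 2))
print(annotate(probs, density, contour=0.5))
```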

4.5.3 Tertiary and quaternary structure prediction

Tertiary and quaternary structures refer to the three-dimensional arrangement of atoms in a single polymer chain or multiple chains (subunits), respectively. This structural level captures interactions between residues that may be far apart in the sequence. Deep learning tools specifically designed to build 3D atomic models in the input cryo-EM density map are discussed below. They are further grouped as de novo (4.5.3.1), where the model is predicted directly from features learned from the cryo-EM density, or hybrid (4.5.3.2), where it is derived by integrating structural templates with these features.

4.5.3.1 De novo model building

In this approach, deep learning tools perform model building directly from voxel-wise backbone (and sometimes sidechain) atom, secondary structure and residue type probabilities learned from the cryo-EM density map to generate 3D atomic models.

AAnchor (amino-acid anchor) is a deep learning method designed to identify and precisely locate high-confidence anchor amino acid residues in high-resolution cryo-EM density maps (at resolutions 3.1 Å or better) (Rozanov and Wolfson, 2018). AAnchor uses a detection algorithm to first generate candidate locations for amino acids, filtering out those likely to be inaccurate, and a classification convolutional neural network (CNN) to classify these candidate volume cubes into one of 21 labels (the 20 amino acids plus none). The detected anchor residues can be crucial for various local de novo modeling tasks, including accurately positioning secondary structures, modeling loops, and facilitating general fragment-based modeling.

Feature learning: A2-Net takes a cryo-EM density map and a protein sequence as input and determines the 3D structure of the protein (Xu et al., 2019). The method employs deep convolutional neural networks (CNNs) to detect amino acids by learning the conformational densities of individual amino acids within the density volume. To further enhance detection, prior knowledge of protein sequences is incorporated by designing a sequence-guided neighbor loss during training. A2-Net processes 3D density volumes to generate 3D feature volumes and uses a localization network (locNet) and a recognition network (recNet) to detect and classify amino acids in 3D space and estimate their poses, which determine the 3D coordinates of the atoms in each amino acid. In locNet, a 3D Region Proposal Network (RPN) uses 3D convolutional layers to propose amino acid locations. The RPN classifies valid proposals and estimates their coordinates. In recNet, amino acid proposals are then fed to a new Aspect-ratio Preserved Region of Interest (APRoI) layer (a specialized type of pooling layer), which extracts regions of interest (RoIs) into fixed-size cubes that are further processed by 3D convolutional layers to predict each proposal's amino acid category. For atomic-level detail, PoseNet, a 3D stacked hourglass network (a type of fully convolutional neural network designed for volumetric data, structured in an encoder–decoder style with skip connections), regresses the 3D coordinates of each atom in each amino acid, completing the pose estimation. Model building: To construct protein main chains and thus the full molecular structure, A2-Net uses a Monte Carlo Tree Search (MCTS) algorithm (Shen, 2018) (Box 1: Glossary section), which iteratively builds a search tree to effectively search and thread candidate amino acid proposals. To make the MCTS search computationally feasible with many proposals, a K-nearest neighbors (KNN) graph is created, which connects spatially close amino acid proposals, narrowing the search space for the MCTS. This search process is further optimized by implementing tree pruning, which incorporates a convolutional neural network (CNN)-based Peptide Bond Recognition Network (PBNet) to predict peptide bonds between proposals, reducing the number of edges in the KNN graph.
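The KNN-graph construction that narrows the tracing search space can be sketched in a few lines. The snippet below connects each proposal (reduced here to a 3D point) to its k nearest neighbors using a k-d tree; the point coordinates and k value are invented for illustration and this is not A2-Net's implementation.

```python
# Sketch of a k-nearest-neighbor graph over amino acid proposals: connecting
# each proposal to its k spatial neighbors restricts a downstream search
# (e.g., MCTS tracing) to geometrically plausible peptide-bond connections.
import numpy as np
from scipy.spatial import cKDTree

def knn_graph(coords: np.ndarray, k: int = 4) -> dict[int, list[int]]:
    """coords: (N, 3) proposal centers. Returns adjacency lists."""
    tree = cKDTree(coords)
    # Query k+1 neighbors because each point is its own nearest neighbor.
    _, idx = tree.query(coords, k=k + 1)
    return {i: [int(j) for j in neighbors[1:]] for i, neighbors in enumerate(idx)}

rng = np.random.default_rng(1)
proposals = rng.random((10, 3)) * 30.0     # toy proposal coordinates in Å
graph = knn_graph(proposals, k=3)
print(graph[0])                            # indices of proposal 0's neighbors
```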

Feature learning: Cascaded-CNN (C-CNN) is a model building tool that comprises a series of convolutional neural networks (CNNs), each predicting a specific component of protein structure from the input cryo-EM density maps (Si et al., 2020). The overall goal of C-CNN is to accurately predict Cα atom locations using information from the intermediate secondary structure elements (SSEs) and backbone predictions. C-CNN leverages a fully convolutional network design and dilated convolutions to classify a full 3D image in a single pass. Dilated convolutional layers increase the receptive field while preserving the size of the input image. C-CNN comprises three feedforward neural networks, taking cryo-EM density maps as input. The SSE CNN predicts α-helices, β-sheets, or loops/turns for each voxel and outputs a confidence map for each SSE. These three SSE maps, along with the input density map, serve as input to the Backbone CNN, which predicts whether each voxel is part of the protein backbone and outputs two backbone confidence maps. The Cα-Atom CNN then predicts Cα atom locations by taking all the previous maps (input density map, SSE and backbone confidence maps) and classifying whether a voxel is part of a Cα atom, producing two Cα atom confidence maps. Each voxel receives a confidence value, with its final classification determined by the highest confidence among the output maps. Model building: The confidence maps from the C-CNN are post-processed using a path-walking technique to generate the protein backbone structure with precise Cα atom locations. The path-walking algorithm navigates high-confidence backbone regions, connecting Cα atoms using a novel tabu-search algorithm that scores movements based on the location’s local density prediction confidence and distance, and incorporates backbone torsion angles and known geometrical parameters of secondary structures as weights. The algorithm continues tracing until no suitable Cα atoms can be found or previously processed areas are encountered. The output is a PDB file of disconnected Cα atom traces. The disconnected traces, representing partial protein backbones, are further refined through path combination and backbone refinement. First, the traces are converted into a graph where Cα atoms are nodes and connections are edges. The path combination step then merges these disjoint graphs into a single, fully connected representation of the protein’s backbone. After this, a backbone refinement step removes false-positive connections, leaving only accurate Cα node and edge connections. Further improvements to the α-helix SSEs of the backbone trace are made using a helix-refinement algorithm. Finally, a novel quality assessment-based combinatorial algorithm is used to map protein sequences onto the reconstructed Cα traces, generating full-atom protein structures. This algorithm also reconstructs side-chain atoms using PULCHRA (Rotkiewicz and Skolnick, 2008) and SCWRL4 (Krivov et al., 2009a), based on the Cα coordinates of each segment.

Feature learning: Structure Generator is a fully automated, template-free deep learning method for protein model building in cryo-EM density maps (Li, 2020). It uses RotamerNet, a 3D convolutional neural network (CNN) built on the ResNet architecture, to output the predicted amino acid and rotamer identity, along with the proposed coordinates for its Cα atom. RotamerNet analyzes the density profiles to propose a set of candidate amino acids and their 3D locations, unconstrained by the known protein sequence. Model building: Structure Generator uses a graph convolutional network (GCN) to create an embedding from initial rotamer-based amino acid identities and predicted candidate 3D Cα locations. Following this, a bidirectional long short-term memory (LSTM) module processes this embedding to order and label the candidate identities and atomic positions, ensuring consistency with the input protein sequence to ultimately generate a structural model. The graph convolutional network (GCN) efficiently encodes the output from RotamerNet as a graph, while a bidirectional Long Short-Term Memory (LSTM) module accurately decodes this information to generate a directed amino acid chain.

Feature learning: DeepTracer is a fully automated deep learning method designed for the rapid de novo structure determination of multi-chain proteins directly from high-resolution cryo-EM density maps (Pfab et al., 2021). Its core is a specialized convolutional neural network (CNN) made up of four connected U-Nets. From the preprocessed cryo-EM density maps, each U-Net predicts a distinct aspect of the protein structure (atoms, backbone, secondary structure elements, and amino acid types). The Atoms U-Net predicts if a voxel contains either a Cα atom, a nitrogen (N) atom, a carbon (C) atom, or no atom (four output channels). The Backbone U-Net determines if each voxel belongs to the protein backbone, side chains, or non-protein regions (three output channels). The Secondary Structure U-Net recognizes loops, α-helices, β-sheets and no structure (four output channels). The Amino Acid Type U-Net determines the specific type of amino acid at each voxel (21 output channels, 20 standard amino acids plus no amino acid). Model building: To create an initial model structure, DeepTracer first determines disconnected chains using the output of the Backbone U-Net by identifying connected areas of backbone voxels, with each disconnected area being designated as a separate chain. It then calculates the precise 3D coordinates of each Cα atom using output Cα channels of the Atoms U-Net. For connecting the Cα atoms into continuous chains, DeepTracer uses a modified traveling salesman problem (TSP) algorithm (Box 1: Glossary section). Instead of simply minimizing distance (as in a traditional TSP), DeepTracer uses a custom confidence function to determine the likelihood of two atoms being connected and the overall goal of the TSP algorithm becomes maximizing the sum of these confidence scores. DeepTracer then uses a custom dynamic programming (DP) algorithm (Box 1: Glossary section) for protein sequence alignment, aligning segments of the predicted amino acid sequence with the known amino acid sequence of the target protein. Based on the alignment, the initially predicted amino acid types are updated for greater accuracy. DeepTracer then builds the complete protein backbone by adding carbon (C) and nitrogen (N) atoms to the previously placed Cα atoms. For this task, it uses U-Net provided confidence maps for C and N atoms and applies molecular mechanics principles specific to peptide chains including planar peptide geometry, to ensure chemically accurate placement. At the final step, DeepTracer predicts sidechains, aiming to accurately position the side-chain atoms for each amino acid. This is achieved using SCWRL4 (Krivov et al., 2009b), an automated tool that takes the complete protein backbone and amino acid types as input and outputs a sterically plausible protein structure.
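The confidence-maximizing connection step can be illustrated with a much simpler greedy heuristic than DeepTracer's modified TSP. The sketch below uses a toy confidence function that peaks at the ideal ~3.8 Å Cα-Cα spacing and repeatedly extends the chain with the highest-confidence unused atom; the confidence formula, the greedy strategy, and the coordinates are illustrative assumptions rather than DeepTracer's published procedure.

```python
# Greedy sketch of confidence-guided Cα tracing: score candidate connections
# with a toy confidence that peaks at ~3.8 Å and extend the chain with the
# highest-confidence unused atom at each step.
import numpy as np

def connection_confidence(a: np.ndarray, b: np.ndarray) -> float:
    d = np.linalg.norm(a - b)
    return float(np.exp(-((d - 3.8) ** 2) / 2.0))    # peaks at 3.8 Å spacing

def trace_chain(ca: np.ndarray, start: int = 0) -> list[int]:
    """ca: (N, 3) predicted Cα coordinates. Returns a visiting order."""
    unused = set(range(len(ca))) - {start}
    order = [start]
    while unused:
        last = ca[order[-1]]
        nxt = max(unused, key=lambda i: connection_confidence(last, ca[i]))
        order.append(nxt)
        unused.remove(nxt)
    return order

# Toy chain: roughly collinear Cα atoms 3.8 Å apart, presented in shuffled order.
ideal = np.array([[3.8 * i, 0.0, 0.0] for i in range(6)])
shuffled = ideal[[2, 0, 5, 3, 1, 4]]
print(trace_chain(shuffled, start=1))   # index 1 holds the original first atom
```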

Feature learning: DeepTracer-2.0 enhances the capabilities of DeepTracer by incorporating the identification of nucleic acids alongside amino acids from cryo-EM density maps (Nakamura et al., 2023). DeepTracer-2.0 achieves this through an initial segmentation step that separates the cryo-EM map into distinct macromolecular densities. Following this, the pipeline employs two specialized U-Net architectures: an amino acid U-Net for protein backbone and Cα atom determination, and a newly integrated nucleotide U-Net for identifying phosphate (P) and sugar carbon atoms in nucleic acid regions. This nucleotide U-Net, distinct from the amino acid U-Net due to the differing molecular structures of proteins and nucleic acids, predicts the structural aspects of nucleic acids. The Atoms U-Net, with four output channels, identifies whether each input voxel contains a phosphate (P) atom, a sugar carbon atom (C1′ or C4′), or no atom. Concurrently, its Backbone U-Net, with three output channels, determines if a voxel belongs to the sugar-phosphate backbone, the nitrogenous base, or neither. Both U-Nets are optimized for defining the DNA/RNA phosphate backbone. Model building: DeepTracer-2.0's post-processing phase refines the predictions from the nucleotide U-Net, the predicted phosphate (P) and carbon atom (C1′, C4′) positions, to build an accurate sugar-phosphate backbone consistent with DNA/RNA biological principles. This involves reducing spurious phosphate predictions and connecting the remaining ones based on characteristic DNA/RNA geometry, considering the influence of sugar puckers, and utilizing pseudotorsion angles to simplify backbone construction. Finally, the refined phosphate atoms and cryo-EM density map data are fed to the Brickworx program (Chojnowski et al., 2015), which completes the nucleotide modeling by identifying matching double-stranded helical motifs for DNA or extending to recurrent RNA motifs, including single-stranded segments, ensuring the final structure adheres to known nucleic acid conformations. The nucleotide post-processing step allows DeepTracer-2.0 to model the complete nucleotide structure from the cryo-EM density map and sequence data. After independent post-processing to complete each structure, the predicted protein and DNA/RNA models are combined, ultimately yielding a comprehensive model of the entire macromolecular complex.

Feature learning: DeepMM uses a multi-task Densely Connected Convolutional Network (DenseNet) architecture to construct all-atom models from near-atomic resolution cryo-EM density maps (He and Huang, 2021b). Compared to CNNs, DenseNets, which connect each layer to all subsequent layers in a feed-forward fashion within each dense block, alleviate the vanishing-gradient problem, and encourage feature reuse while reducing the number of parameters. DeepMM features two embedded DenseNets. DenseNet A simultaneously predicts the probability of main-chain atoms (N, C and Cα) and Cα positions for each voxel, creating a 3D probability map. From this map, local dense points (LDPs) are then identified using the mean-shift algorithm (Carreira-Perpinan, 2006) and are used by a main-chain tracing algorithm, MAINMAST (Terashi and Kihara, 2018), to generate possible main-chain paths. MAINMAST connects LDPs to form a minimum spanning tree (MST), in which the total distance of connected points is minimized, and iteratively refines this tree structure using a tabu search method (Glover, 1986); the longest path within the refined tree is ultimately traced as the main-chain path. DenseNet B then predicts the amino acid and secondary structure types for each main-chain local dense point (LDP) on these main-chain paths. Model building: After DeepMM determines the Cα probability, amino acid type, and secondary structure for each main-chain point on the main-chain path, the protein’s target sequence is aligned to these main-chain paths using the Smith-Waterman dynamic programming (DP) algorithm (Smith and Waterman, 1981), which evaluates the match between the sequence and the main-chain path using scoring matrices for both amino acid and secondary structure types. The resulting Cα models are then ranked based on their alignment scores. Finally, the top-ranked Cα models are used to construct the complete all-atom protein structures with the ctrip program from the Jackal modeling package (Xiang and Honig, 2001; Petrey et al., 2003) and refined using the AMBER package (Case et al., 2025).
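The MST-based tracing idea can be sketched compactly: connect the local dense points by a Euclidean minimum spanning tree and take the longest path through the tree as a candidate main-chain trace. The tabu-search refinement used by MAINMAST/DeepMM is omitted here, and the toy LDP coordinates are invented; this is a simplified illustration, not the published algorithm.

```python
# Sketch of MST-based main-chain tracing over local dense points (LDPs):
# build a Euclidean MST, then extract the longest path through the tree.
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree, shortest_path
from scipy.spatial.distance import cdist

def longest_tree_path(points: np.ndarray) -> list[int]:
    dists = cdist(points, points)                     # pairwise distances
    mst = minimum_spanning_tree(dists)                # sparse (N, N) tree
    d = shortest_path(mst, directed=False)            # path lengths on the tree
    i, j = np.unravel_index(np.argmax(d), d.shape)    # farthest pair of nodes
    # Recover the i -> j path by walking the predecessor array.
    _, pred = shortest_path(mst, directed=False, return_predecessors=True,
                            indices=i)
    path, node = [], j
    while node != i:
        path.append(int(node))
        node = pred[node]
    path.append(int(i))
    return path[::-1]

rng = np.random.default_rng(2)
ldps = np.cumsum(rng.normal(scale=1.5, size=(8, 3)), axis=0)   # noisy chain
print(longest_tree_path(ldps))
```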

Feature learning: SEGEM is an automated method that quickly and accurately builds protein backbone structures from cryo-EM density maps (Chen et al., 2021). SEGEM employs 3D convolutional neural networks to predict both Cα locations and their amino acid types simultaneously from cryo-EM density maps. The CNN model employed for this task involves an initial image preprocessing step. The sampled sub-images are fed into three separate CNN models. Each model has a specific prediction task: one for Cα identification, which generates a predicted Cα probability density map, another for amino acid type prediction, and a third for secondary structure prediction. A non-maximum suppression (NMS) algorithm is applied to pick out local-maximum Cα probability density voxels as predicted Cα sites. These sites then have their amino acid types predicted, a vital step for accurately assigning them to the overall protein sequence. Model building: At the model construction step, SEGEM uses a highly parallel pipeline to efficiently match CNN-predicted sites to the native protein sequence using a score matrix. Specifically, Cα local tracing connects predicted Cαs with their neighbors to form continuous traces. Next, these traces are assigned to protein sequence segments by calculating a matching score, which leverages predicted amino acid types to create an amino acid scoring matrix. For any unassigned segments, protein threading using a breadth-first search with a pruning strategy is employed for faster processing, ultimately yielding a complete model of Cα coordinates aligned to the protein sequence.

Feature learning: SegmA is a novel deep neural method for cryo-EM density map visualization and protein modeling (Rozanov and Wolfson, 2023). It works by performing residue type segmentation, labeling and color-coding voxels in a cryo-EM density map based on whether they represent specific amino acids or nucleic acids. This color-coded visualization helps with both manual and automated modeling. Beyond visualization, SegmA can also predict amino acid centers of mass, score how well a protein template fits a map, and assist in de novo modeling of protein complexes. SegmA consists of a cascade of convolutional neural networks (CNNs) and group rotational equivariant CNNs (G-CNNs) that assign voxels in a cryo-EM density map to one of the following categories: 20 amino acids, nucleotide, background, or unconfident (uncertain). A G-CNN is rotation equivariant, unlike traditional CNNs which are only translation equivariant, meaning the feature maps transform accordingly when the input is rotated or translated. The Classification Net (CLF-NET), a G-CNN, performs an initial labeling of the processed volume. Its output is then passed to the Segmentation Net (SEG-NET), a U-Net CNN with a contraction path (encoder) and an expansion path (decoder), which performs the final labeling of the voxels. Lastly, the Confidence Net (CNF-NET), another G-CNN, evaluates the results from the SEG-NET, assigning a binary confidence label to each voxel and reporting only those labels deemed reliable.

Feature learning: ModelAngelo is a deep-learning tool that automates atomic model building in cryo-EM density maps at resolutions better than 4.0 Å (Jamali et al., 2024; Jamali, 2022). It generates protein models comparable to those built by human experts and creates highly accurate backbones for nucleic acid models. Additionally, ModelAngelo identifies protein chains in cryo-EM density maps, facilitating visual exploration of proteomes. ModelAngelo employs a modified feature-pyramid network (Lin, 2017), a convolutional neural network (CNN), to predict the approximate positions of protein and nucleic acid residues within the cryo-EM map. Specifically, it determines whether each voxel in the map contains a Cα atom of an amino acid, a phosphorus (P) atom of a nucleic acid residue, or neither. This process effectively initializes the graph representation, in which each residue is a node, and edges are formed between each residue and its nearest neighbors, by identifying potential residue positions. Model building: ModelAngelo employs a graph neural network (GNN) to optimize residue positions and orientations, predict amino/nucleic acid identity, and determine side chain/base torsion angles. This GNN comprises three modules (a cryo-EM module, a sequence module, and an invariant point attention (IPA) module), each of which refines a node-associated residue feature vector by integrating new information, progressively extracting more detail from the various inputs. The cryo-EM module within the GNN extracts and integrates information from the cryo-EM density map to refine residue representations. It allows the GNN, using convolutional neural networks (CNNs), to analyze the cryo-EM density around each Cα atom and the density connecting it to neighboring nodes, thereby updating its internal representation. It uses a cross-attention mechanism to allow each residue to exchange information with its 20 closest neighbors and is driven by how connected the cryo-EM density appears between them. Unlike self-attention, which dynamically accesses all parts of the same input sequence and effectively captures long-range dependencies, cross-attention enables information flow between different data modalities. Concurrently, the module extracts a cubic section of the cryo-EM density map around the current residue position, which is processed by another CNN. The features extracted from this CNN are then combined with the output from the cross-attention. This combined information is used to predict amino and nucleic acid identities and outputs the updated residue feature vector. The sequence module, implemented as a Transformer module within ModelAngelo, integrates sequence information into the GNN. It performs cross-attention for each residue with the user-provided amino acid sequences, which are embedded using the pre-trained ESM-1b protein language model (Rives et al., 2021). The output from this cross-attention is then used in two ways: a dedicated MLP (multi-layer perceptron) generates predictions for amino and nucleic acid identities, and a second MLP generates the updated residue feature vector of the sequence module. The IPA module in the GNN integrates geometric information from the nodes in the graph. It allows the model to learn the topology of neighboring residues, such as secondary structure, by assessing their spatial relationships. ModelAngelo generates the complete atomic model by post-processing residue feature vectors.
These vectors serve as input to two separate MLPs that predict position and orientation for each residue, along with torsion angles for amino acid side chains and nucleic acid bases. Predictions for amino/nucleic acid identities, derived from the cryo-EM and sequence modules, are averaged to create probability distributions. These probabilities form a Hidden Markov Model (HMM) profile (Box 1: Glossary section), which is then used with HMMER (Eddy, 2011) to search against input sequences. The parameters of the profile HMM are estimated from ModelAngelo predictions. Residues matching the sequence are updated, and separate chains are connected based on sequence alignment and proximity; chains shorter than four residues are removed. Finally, a complete atomic model is generated by integrating the predicted positions and orientations of each residue with their corresponding amino acid or nucleic base torsion angles, utilizing idealized geometries to ensure structural accuracy. The refined coordinates are then fed back into the GNN for three additional recycling iterations to further improve the accuracy of the model. ModelAngelo optimizes atomic positions using an L-BFGS optimizer (Liu and Nocedal, 1989). This final relaxation step removes unnatural side-chain distances and steric clashes.
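The cross-attention idea used in the sequence module, residue feature vectors attending to embedded sequence positions, can be illustrated with a few lines of PyTorch. The dimensions, the random tensors, and the use of nn.MultiheadAttention are illustrative assumptions; this is not ModelAngelo's code.

```python
# Minimal sketch of cross-attention between residue features and sequence
# embeddings: residue feature vectors act as queries against embedded
# sequence positions (keys and values), letting each residue pull in
# sequence-level information.
import torch
import torch.nn as nn

d_model = 64
cross_attn = nn.MultiheadAttention(embed_dim=d_model, num_heads=4,
                                    batch_first=True)

n_residues, seq_len = 120, 300
residue_features = torch.randn(1, n_residues, d_model)    # graph node features
sequence_embedding = torch.randn(1, seq_len, d_model)     # e.g., language-model embeddings

updated, attn_weights = cross_attn(query=residue_features,
                                   key=sequence_embedding,
                                   value=sequence_embedding)
print(updated.shape)        # torch.Size([1, 120, 64]), updated residue features
print(attn_weights.shape)   # torch.Size([1, 120, 300]), residue-to-sequence attention
```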

Feature learning: CryoREAD employs a two-stage deep neural network for reconstructing nucleic acid structures from cryo-EM density maps at resolutions from 2.0 to 5.0 Å (Wang et al., 2023). The Stage 1 network uses a cascaded, two-stage U-Net architecture, concatenating two 3D U-shape-based convolutional network (UNet) models with full-scale skip connections. The first of these U-Nets focuses on detection of sugar, phosphate, base, and protein, while the second U-Net specifically predicts individual base types (A, C, G, and T/U), leveraging information passed from the first U-Net’s encoder. The Stage 2 network then refines these initial probabilities (protein, phosphate, sugar, base, and the four base types) and generates more accurate outputs. CryoREAD applies the mean-shift algorithm (Carreira-Perpinan, 2006) to cluster grid points that exceed specific probability thresholds to identify representative sugar nodes, which are then connected into a graph. Edges are established between sugar nodes based on their probability and inter-node distance. Model building: CryoREAD uses a vehicle routing problem (VRP) solver (Psaraftis, 1988) (Box 1: Glossary section) to trace the nucleic acid backbone(s) from the predicted sugar graph. Unlike the traveling salesman problem (TSP) solver, VRP employs multiple “vehicles” to visit nodes, allowing for the identification of multiple, non-overlapping paths within the graph. This approach is well-suited for cryo-EM density maps that may contain multiple nucleic acid chains, as the VRP solver aims to maximize visited nodes while minimizing total route costs. CryoREAD assigns nucleic acid sequences to the sugar nodes within the traced paths using two sub-steps: assigning base sequence fragments to paths and assembling them. Initially, sugar backbone paths are segmented, and these segments are then aligned with the nucleic acid sequence using a dynamic programming (DP) algorithm (Box 1: Glossary section) to identify the top candidate sequence fragments. Subsequently, a constraint programming (CP) solver (Rossi, 2006) assembles these assigned sequence fragments. This solver aims to maximize the combined probability score of sugar nodes with their assigned bases while ensuring consistency across overlapping path segments and the nucleic acid sequences. At this point, the model comprises the sugar backbone and bases linked to representative sugar nodes. The final step involves incorporating nearby phosphate and base nodes that meet specific distance criteria into the sugar nodes. The output models are then refined using a two-step process: an initial refinement of predicted RNA/DNA regions using phenix.real_space_refine (Afonine et al., 2018a), followed by all-atom refinement in COOT (Emsley and Cowtan, 2004).
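The mean-shift clustering step (Box 1: Glossary section) that turns high-probability grid points into representative nodes can be sketched with scikit-learn's MeanShift as a stand-in for the method's own implementation. The probability threshold, bandwidth, and toy point cloud below are arbitrary assumptions chosen only to make the example run.

```python
# Sketch of mean-shift clustering of high-probability grid points: keep points
# above a probability threshold, cluster them, and use the cluster centers as
# representative nodes (e.g., sugar nodes) for graph construction.
import numpy as np
from sklearn.cluster import MeanShift

def representative_nodes(coords: np.ndarray, probs: np.ndarray,
                         threshold: float = 0.5,
                         bandwidth: float = 2.0) -> np.ndarray:
    """coords: (N, 3) grid-point coordinates; probs: (N,) predicted probabilities."""
    selected = coords[probs > threshold]
    ms = MeanShift(bandwidth=bandwidth).fit(selected)
    return ms.cluster_centers_                      # one center per dense region

# Toy data: two blobs of high-probability points plus low-probability noise.
rng = np.random.default_rng(3)
blob1 = rng.normal([0, 0, 0], 0.5, size=(30, 3))
blob2 = rng.normal([8, 0, 0], 0.5, size=(30, 3))
noise = rng.uniform(-10, 10, size=(20, 3))
coords = np.vstack([blob1, blob2, noise])
probs = np.concatenate([np.full(60, 0.9), np.full(20, 0.1)])
print(representative_nodes(coords, probs).round(1))   # ~[0,0,0] and ~[8,0,0]
```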

Feature learning: SMARTFold is a deep learning protein structure prediction model that integrates cryo-EM density map features with sequence alignment features to accurately predict protein folds and outputs a full atomic structure requiring no additional post-processing steps (Li et al., 2023). SMARTFold first feeds the protein sequence into the AlphaFold-Multimer (Evans et al., 2022) data pipeline to generate initial MSA and residue pair representations. Simultaneously, from the raw cryo-EM density map, a 3D U-Net extracts a representative point cloud, which captures a backbone confidence map from the sparsely populated 3D EM density map. From this map, support points are sampled along predicted high-confidence backbone areas. The geometric features of these support points are then extracted and embedded into a point-pair representation. To maintain the relationship between these support points and the protein residues, a point-residue pair representation is introduced. Finally, these geometric features are integrated with the sequence alignment features as input for the protein folding prediction. Model building: SMARTFold introduces EMformer, a novel module that integrates geometric features with sequence alignment features to predict protein structure. Within EMformer, MSA, residue pair, point-residue pair, and point pair representations all exchange and update their information. Finally, an AlphaFold2-inspired (Jumper et al., 2021) structure module predicts the atomic structure. As in AlphaFold2 (Jumper et al., 2021), these learned representations can be recycled to further enhance model performance.

Feature learning: EMRNA is a deep learning-based method designed to automatically and accurately determine full-length, all-atom RNA structures directly from cryo-EM density maps (resolutions ranging from 2.0 to 6.0 Å) and an RNA sequence as inputs (Li T. et al., 2025). EMRNA utilizes a Swin-Conv-UNet (SCUNet) deep learning architecture (Zhang et al., 2023) to predict the probability of RNA phosphate (P), C4′, and N1/N9 atom positions, along with their corresponding nucleotide types, for every voxel in the RNA cryo-EM density map. The SCUNet architecture is built with three encoder, one transition, and three decoder Swin-Conv (SC) blocks, linked by skip connections. The “Swin” component, a shifted-window transformer, excels at nonlocal modeling, while the “Conv” part, a convolutional network, provides efficient local modeling. This combination gives the SC block a significant advantage over traditional convolutional neural networks, as it can effectively capture both local and long-range structural information from the cryo-EM density maps. The local maxima within the predicted P and C4′ probability maps, identified using the mean-shift algorithm, are then used to define main-chain points (MCPs), which represent potential atom locations. Model building: EMRNA constructs the RNA backbone by threading SCUNet-derived MCPs into multiple backbone traces by solving a traveling salesman problem (TSP) (Helsgaun, 2000) (Box 1: Glossary section), with diverse trace types sampled from P, C4′, or combined positions. These traces are then scored by aligning them with the RNA sequence using a Smith–Waterman dynamic programming algorithm (Smith and Waterman, 1981) and incorporating predicted secondary structure information, C4′ probabilities and nucleotide type assignments to remove incorrect paths. The most probable C4′ trace is selected via sequence alignment, after which P atoms are placed along it and N1/N9 positions are located from their respective probability maps. Once the coarse-grained backbones are established, EMRNA constructs the full-atom RNA structure. It does this by rigidly aligning A, U, G, and C nucleotide coordinates, extracted from ideal A–U and G–C pairings, onto the backbone using Kabsch superposition (Kabsch, 1976), based on their P, C4′, and N1/N9 atoms. The process then detects possible base pairings using inter-C4′ distances, followed by detecting helices and further refinement of the base-pair conformations. The output model is energy minimized using the AMBER package (Case et al., 2025) and further refined using phenix.real_space_refine (Afonine et al., 2018a).
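The trace-to-sequence alignment step relies on Smith-Waterman local alignment. The sketch below is a plain textbook Smith-Waterman with simple match/mismatch/gap scores and returns only the best local score; EMRNA's actual scoring additionally folds in predicted secondary structure, C4′ probabilities, and nucleotide-type assignments, and the toy sequences are invented.

```python
# Plain Smith-Waterman local alignment (match/mismatch/gap scores only) as a
# stand-in for aligning a nucleotide trace to a target RNA sequence.
import numpy as np

def smith_waterman(a: str, b: str, match: int = 2, mismatch: int = -1,
                   gap: int = -2) -> int:
    """Return the best local alignment score between sequences a and b."""
    H = np.zeros((len(a) + 1, len(b) + 1), dtype=int)
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            diag = H[i - 1, j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            H[i, j] = max(0, diag, H[i - 1, j] + gap, H[i, j - 1] + gap)
    return int(H.max())

# Toy example: nucleotide types read off a backbone trace vs. a target RNA.
trace_sequence = "GGCAUCG"
target_rna = "AAGGCAUCGUU"
print(smith_waterman(trace_sequence, target_rna))   # 14 = 7 matches x 2
```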

Feature learning: Unlike EMRNA which is specific to RNA, EM2NA works with cryo-EM density maps of protein-DNA/RNA or multi-chain DNA/RNA complexes at < 5.0 Å resolutions and uses deep learning to automatically build all-atom nucleic acid structures including DNA (Li et al., 2024). EM2NA is built on a two-stage Swin-Conv-UNet (SCUNet) network architecture (Zhang et al., 2023). The SCUNet uniquely combines Swin Transformer blocks, which excel at non-local modeling, with Convolutional Network blocks, known for their efficient local modeling. This hybrid approach allows EM2NA to leverage both local and non-local learning capabilities, outperforming traditional CNNs. In stage-1 SCUNet, EM2NA processes raw cryo-EM maps to segment and detect DNA/RNA regions, distinguishing them from protein and background. The identified nucleic acid density is then fed into the stage-2 SCUNet. Here, the network predicts nucleotide information, including the precise positions of P, C4′, and N1 or N9 atoms, as well as their corresponding nucleotide types at each voxel. Finally, the backbone atom probabilities generated by the stage-2 SCUNet are converted into 3D points by detecting the local maxima using a mean-shift algorithm. Model building: Unlike EMRNA which uses a TSP solver, EM2NA employs a Vehicle Routing Problem (VRP) algorithm (Helsgaun, 2017) to trace SCUNet generated initial P and C4′ points into multiple backbone paths. Since VRP allows multiple paths for traveling, it is ideal for constructing multi-chain DNA/RNA structures. Determining the correct direction for each path is straightforward, leveraging the known nucleotide geometries. Once the backbone paths are established, a Smith-Waterman algorithm (Smith and Waterman, 1981) is used to assign sequences to each backbone. This assignment is further refined by considering base pairing in double helices and helical geometry. Finally, with the built DNA/RNA backbone paths and assigned nucleotide types, the full-atom DNA/RNA structure is built by aligning template nucleotide conformations onto the P-C4′-N1/N9 backbone using the Arena algorithm (Perry et al., 2023).

Feature learning: Cryo2Struct automatically generates atomic protein structures from medium and high-resolution cryo-EM density maps and corresponding amino acid sequences (Giri and Cheng, 2024). It initiates this process by using two distinct 3D transformer-based deep learning models to classify each voxel in the cryo-EM density map. One model identifies backbone atom types (Cα, C, N, or no atom), while the other predicts the amino acid type (the 20 standard amino acids, an unknown amino acid, or the absence of an amino acid). These models are trained as sequence-to-sequence predictors, leveraging a transformer-encoder to capture long-range voxel-voxel dependencies and a skip-connected decoder (like a U-Net) for feature integration and classification. The training was conducted on the extensive Cryo2StructData dataset (Giri et al., 2024), which contains 6,652 cryo-EM maps for training and 740 for validation, followed by blind testing on two separate datasets. After prediction, a clustering strategy is applied to group spatially close Cα voxel predictions within a 2.0 Å radius, selecting the centrally located voxel to represent the final Cα atom and eliminate redundancy. Model building: To connect the predicted Cα atoms into protein chains and accurately assign their amino acid types, Cryo2Struct employs an innovative Hidden Markov Model (HMM) (Box 1: Glossary section) in which each predicted Cα atom represents a hidden state. These hidden states are fully connected, with transition probabilities determined by the spatial distance between corresponding Cα atoms. The likelihood of each hidden state emitting a specific amino acid is based on the predicted probabilities for that Cα atom. Next, a customized Viterbi algorithm (Forney, 1973) aligns the sequence of the target protein (or the sequence of each chain for multi-chain proteins) to this HMM. This generates the most probable path of hidden states (Cα atoms), and the path for the aligned chain represents the connected Cα atoms and thus the backbone structure of the protein. For multi-chain proteins, these individual chain paths, combined with their aligned sequences, create the complete atomic backbone. The customized Viterbi algorithm ensures that each Cα position is used only once in the aligned path. This is crucial because each Cα corresponds to a single amino acid in the protein sequence. This HMM-based approach excels at assigning every amino acid of the protein to a Cα position, assuming enough predicted Cα atoms are present, enabling Cryo2Struct to build highly complete structural models from density maps.
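A generic Viterbi decoder makes the HMM-based assignment concrete: hidden states stand for predicted Cα atoms, transitions would favor spatially close pairs, and emissions are the predicted amino acid probabilities. The sketch below is a standard, unconstrained Viterbi on a tiny toy model; Cryo2Struct's customized version additionally forbids reusing a Cα, a constraint omitted here, and the numbers are invented.

```python
# Generic Viterbi decoding as a simplified stand-in for aligning a protein
# sequence (observations) to an HMM whose hidden states are predicted Cα atoms.
import numpy as np

def viterbi(emission: np.ndarray, transition: np.ndarray,
            start: np.ndarray, observations: list[int]) -> list[int]:
    """emission: (S, A); transition: (S, S); start: (S,). Returns a state path."""
    S = emission.shape[0]
    logE, logT, logP = (np.log(m + 1e-12) for m in (emission, transition, start))
    V = logP + logE[:, observations[0]]              # log-prob of each start state
    back = np.zeros((len(observations), S), dtype=int)
    for t, obs in enumerate(observations[1:], start=1):
        scores = V[:, None] + logT                   # (previous state, next state)
        back[t] = scores.argmax(axis=0)              # best predecessor per state
        V = scores.max(axis=0) + logE[:, obs]
    path = [int(V.argmax())]
    for t in range(len(observations) - 1, 0, -1):    # backtrack
        path.append(int(back[t, path[-1]]))
    return path[::-1]

# Toy setup: 3 Cα states, 2 amino acid symbols, observed sequence [0, 1, 0].
emission = np.array([[0.9, 0.1], [0.2, 0.8], [0.7, 0.3]])
transition = np.array([[0.1, 0.8, 0.1], [0.1, 0.1, 0.8], [0.8, 0.1, 0.1]])
start = np.array([0.8, 0.1, 0.1])
print(viterbi(emission, transition, start, [0, 1, 0]))   # -> [0, 1, 2]
```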

Feature learning: EModelX is a method that constructs protein complex structure models from cryo-EM density maps and protein sequences using cross-modal alignment (Chen S. et al., 2024). It normalizes the cryo-EM density map and feeds it into a multi-task 3D residual U-Net. This U-Net incorporates skip-connections, which help maintain resolution despite max-pooling and address the vanishing gradient issue common in deep networks. The network predicts the distributions of Cα atoms, backbone atoms, and amino acid types. Cα candidates are identified from the predicted Cα distribution using point-cloud clustering and non-maximum suppression (NMS). Model building: EModelX uses predicted distributions of Cα atoms and amino acid types to sample Cα traces and generate sequence profiles from cryo-EM density maps. A Cα-sequence aligning score matrix is created, and high-confidence alignments are used to build an initial model, incorporating connectivity and sequence registration. Unmodeled gaps are then filled using a sequence-guiding Cα threading algorithm to build the Cα backbone model of the protein complex, followed by full-atom construction using the PULCHRA tool (Rotkiewicz and Skolnick, 2008), which is further refined using phenix.real_space_refine (Afonine et al., 2018a). EModelX(+AF), which combines EModelX with AlphaFold2 (Jumper et al., 2021), can perform template-based modeling and refine AlphaFold’s incorrectly folded structures. Cα traces are sampled from both the EModelX predicted Cα atoms and the AlphaFold2 predicted structure. Comparing the structural similarity of these traces further improves the Cα-sequence alignment score and enhances sequence-guiding Cα threading.

Feature learning: CryFold (Su et al., 2025) introduces a novel approach for de novo model building for cryo-EM density maps, leveraging key advancements from AlphaFold2 (Jumper et al., 2021) and ModelAngelo (Jamali et al., 2024). CryFold accelerates automated protein model building and produces more complete models while reducing the map resolution requirement, using a two-step process. First, a 3D convolution-based U-Net takes the cryo-EM density map as input and outputs a probability map, with each voxel representing the likelihood of containing a Cα atom. The U-Net architecture features a bottleneck structure in its encoder to preserve spatial information during downsampling. Its decoder utilizes the Res2Net architecture to extract rich semantic information during upsampling. These two types of information are then combined to generate the final Cα atom probability map. The initial Cα atom coordinates are then refined using the mean-shift algorithm to obtain a precise set of Cα atom coordinates. Model building: CryFold then constructs the complete all-atom structure from the input density map, amino acid sequences, and predicted Cα atoms using an enhanced transformer network called Cry-Net. Cry-Net comprises two transformer-based modules. Cryformer (encoder) transforms the density map into initial node and edge representations guided by the predicted Cα atom positions. The Structure Module (decoder) generates all-atom positions from these refined representations. Cry-Net iteratively updates these key protein structure representations through a local attention mechanism leveraging spatial restraints from the density map. Cryformer processes initial node and edge representations along with ESM-2 sequence embeddings (Lin et al., 2023). Its core components include sequence attention, which integrates protein sequence information into node features via a cross-attention mechanism using sequence embeddings from ESM-2 (Lin et al., 2023). Node and edge attention update their respective representations through self-attention. A 3D Rotary Position Embedding (3D-RoPE) effectively encodes each node’s positional information into all attention calculations, leveraging the inherent spatial constraints from the density map. Cryformer then assigns each node to one of the 20 amino acids by using information from the original node representation (density map of the node), updated node representation (incorporating neighboring node information), and sequence data from the sequence attention layer. A separate multi-layer perceptron (MLP) generates amino acid probability vectors for all nodes. The Structure Module decodes the updated representations from Cryformer into an all-atom structure. It predicts backbone frames and torsion angles using Cryformer's updated node and edge representations. The module employs a self-attention mechanism on node features, restricted to a node and its nearest neighbors using constraints from the density map, with 3D-RoPE integrating positional information into attention scores. Finally, all-atom positions for each node are generated from the predicted backbone frame, backbone and side-chain torsion angles, and amino acid type. These positions undergo a post-processing step, similar to ModelAngelo (Jamali et al., 2024).

Feature learning: DeepCryoRNA is a novel deep learning-based method designed for automated reconstruction of RNA 3D structures from protein-free cryo-EM density maps at resolutions of 6 Å or better (Li and Chen, 2025). DeepCryoRNA employs a MultiResUNet neural network (Ibtehaz and Rahman, 2020), a variant of the U-Net architecture, to predict 18 types of RNA atoms (12 backbone, six base) from preprocessed cryo-EM density maps. The encoder-decoder structure of MultiResUNet employs a multi-resolution design, allowing it to integrate both local and global information for superior image segmentation. After prediction, atoms are clustered to remove redundancy from neighboring voxel predictions. Model building: DeepCryoRNA constructs nucleotides based on the clustered atoms predicted from MultiResUNet by factoring in atom classes and pairwise atomic distances. It identifies nucleotide types by analyzing base atom classes and their quantities. The process then links neighboring nucleotides into short chains, and these short chains are further connected to form complete long chains. Multiple complete chains can be derived, representing various connection pathways. The model uses a modified Gotoh algorithm (Gotoh, 1982) for global sequence alignments to match these complete long chains with native RNA sequences. DeepCryoRNA then selects the top 10 alignment results to assign native chain information to the corresponding complete long chains, yielding 10 all-atom RNA structures. These structures undergo post-processing, including energy minimization, ultimately generating refined RNA 3D structures. The all-atom RNA structures are then refined using the QRNAS software (Stasiewicz et al., 2019) to fix broken bonds and resolve steric clashes between atoms.

Feature learning: E3-CryoFold is an efficient, end-to-end deep learning method that takes a cryo-EM density map and corresponding protein sequence as input and provides a one-shot inference to output the complete atomic structure (Wang et al., 2025). E3-CryoFold concurrently uses 3D and sequence transformers to extract features from density maps and protein sequences, respectively. While self-attention captures long-range dependencies within each modality, cross-attention modules integrate information between them. In E3-CryoFold, cross-attention is used to integrate spatially contextualized information from density maps into the sequence representation, facilitating the integration of information from both modalities. To ensure this integration, E3-CryoFold embeds both modalities into a shared hidden space using 3D and sequence encoders. Model building: E3-CryoFold constructs the final 3D atomic models using an SE(3)-equivariant (Box 1: Glossary section) graph neural network, SE(3) GNN, which is conditioned on the extracted combined spatial-sequential features. An SE(3)-equivariant GNN is a graph neural network for 3D data that ensures predictions transform consistently when the input is rotated or translated in 3D space. E3-CryoFold reconstructs the protein backbone by first initializing random coordinates and building a k-nearest neighbors (kNN) graph to define local spatial relationships between residues. Node embeddings are derived from integrated spatial and sequence features, which are generated by combining spatial information from the cryo-EM density map with sequence information. These embeddings capture both local and global protein features, allowing the model to utilize the inherent relationship between a protein’s sequence and its 3D structure. Each residue’s local frame (orientation and position) is iteratively updated by an SE(3) GNN, which aggregates relative rotation and translation information from neighbors. This process ensures the backbone reconstruction respects geometric relationships and spatial transformations, ultimately allowing the recovery of 3D coordinates for all backbone atoms.

4.5.3.2 Hybrid model building

In this approach, deep learning tools generally integrate voxel-wise backbone (and sometimes sidechain) atom, secondary structure and residue type probabilities learned from the cryo-EM density map with template structures or fragments (such as those predicted by AlphaFold2 (Jumper et al., 2021) or derived from the PDB (Zardecki et al., 2016)) to accomplish model building.

Feature learning: SEGEM++, an enhanced version of the SEGEM method, integrates the AlphaFold2 (AF2) (Jumper et al., 2021) protein structure prediction algorithm with data from cryo-EM density maps (Chen et al., 2021). This allows SEGEM++ not only to identify accurately folded regions within AF2 structures, by utilizing SEGEM-predicted Cα probability densities from cryo-EM density maps, but also to correct incorrectly folded areas through protein threading on the cryo-EM map itself. Model building: SEGEM++ calculates a confidence score for each Cα in the AF2 structure, based on its agreement with the SEGEM-predicted Cα probability density map. This allows SEGEM++ to identify high-confidence, correctly folded AF2 structure fragments. These reliable fragments then serve as an improved base model, guiding subsequent protein threading on the predicted Cα probability map to build a more accurate final protein structure. In essence, SEGEM++ leverages strong AF2 predictions to refine its cryo-EM model, while simultaneously using cryo-EM data to validate and correct inaccuracies in the AF2 predictions.
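
The sketch below illustrates this kind of per-residue confidence scoring under simple assumptions: a voxel-wise Cα probability volume on a regular grid is interpolated at the AF2 Cα positions. The map origin, voxel size, and 0.5 cutoff are placeholders, not SEGEM++'s actual scoring function.

```python
# Minimal sketch: score each Cα of a predicted model against a voxel-wise Cα
# probability map by trilinear interpolation. Origin, voxel size and the 0.5
# confidence cutoff are illustrative values only.
import numpy as np
from scipy.ndimage import map_coordinates

prob_map = np.random.rand(64, 64, 64)      # predicted Cα probability volume (toy data)
origin = np.array([0.0, 0.0, 0.0])         # map origin in Å
voxel_size = 1.0                           # Å per voxel

ca_coords = np.random.rand(300, 3) * 60.0  # Cα coordinates of an AF2 model (Å)

# Convert Cartesian coordinates to fractional voxel indices matching the array layout.
voxel_idx = (ca_coords - origin) / voxel_size
scores = map_coordinates(prob_map, voxel_idx.T, order=1, mode="nearest")

confident = scores > 0.5                   # keep high-confidence residues as the base model
print(f"{confident.sum()} of {len(scores)} residues kept as a reliable base model")
```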

Feature learning: CR-I-TASSER (cryo-EM iterative threading assembly refinement) is a hybrid method that combines deep neural-network learning with I-TASSER assembly simulations to automate cryo-EM structure determination (Zhang et al., 2022). It uses multithreading algorithms to identify templates from the Protein Data Bank (PDB) (Zardecki et al., 2016), aiding structural assembly. CR-I-TASSER employs a 3D convolutional neural network (CNN) with a residual architecture to create sequence-order-independent Cα atom trace models from cryo-EM density maps, improving threading template quality. Model building: CR-I-TASSER employs deep learning-based template refinement and regeneration, and density map-guided structural reassembly simulations. Using the local meta-threading server (LOMETS) (Zheng et al., 2019), CR-I-TASSER derives threading templates from the PDB. The 3D CNN-predicted Cα conformation then refines the threading templates through multiple heuristic iterative algorithms that align query and template sequences with the Cα conformation for template reselection and Cα trace regeneration. Finally, guided by cryo-EM density map correlations and deep learning-derived template restraints, the iterative threading assembly refinement (I-TASSER) method (Yang et al., 2015) assembles full atomic structures, which are then refined using fragment-guided molecular dynamics (Zhang et al., 2011).

Model prediction: DEMO-EM (domain enhanced modeling using cryo-electron microscopy) is an automated method designed to assemble accurate full-length structural models of multi-domain proteins from cryo-EM density maps by integrating single-domain modeling and deep residual network learning techniques with a progressive domain assembly and refinement procedure (Zhou et al., 2022). DEMO-EM uses D-I-TASSER (Zheng et al., 2025), which incorporates deep learning-based spatial restraints (including inter-residue contact and hydrogen-bonding potentials) into its iterative threading assembly simulations to generate an initial structural model for each domain. Meanwhile, inter-domain distances are predicted by DomainDist, a deep convolutional neural-network architecture with ResNet basic blocks. DomainDist guides the assembly of domain orientations by providing inter-domain distance maps. Each individual domain model generated by D-I-TASSER is independently fitted to the density map using a quasi-Newton optimization algorithm, Limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) (Liu and Nocedal, 1989). Since L-BFGS is a local optimization method, simulations are initiated from multiple starting positions to identify the location and orientation of the domain with the highest correlation with the density map. The initial full-length models are then optimized through a two-step assembly and refinement process. Model-density correlations primarily guide the domain assembly and refinement simulations. Following a rigid-body Replica Exchange Monte Carlo (REMC) simulation, the top-scoring model from this stage is then refined further by flexible assembly, which incorporates atom-, segment-, and domain-level refinements using REMC simulation guided by the density correlation and inter-domain distance profiles. Finally, the lowest-energy model undergoes side-chain repacking with FASPR (Huang et al., 2020) to create the final model, which is refined by fragment-guided molecular dynamics (FG-MD) simulations (Zhang et al., 2011). DEMO-EM can also assemble domain structures generated by methods other than D-I-TASSER. DEMO-EM2, an improved version of DEMO-EM, is an automated method for constructing protein complex models from cryo-EM density maps (Zhang et al., 2024). Unlike DEMO-EM, which is designed for multi-domain proteins, DEMO-EM2 focuses specifically on assembling protein complexes through an iterative assembly procedure. Instead of using D-I-TASSER, DEMO-EM2 employs AlphaFold2 (Jumper et al., 2021), due to its outstanding performance in protein structure prediction, to derive a model of each individual chain. DEMO-EM2 incorporates several advancements over DEMO-EM, including preprocessing the density map to reduce interference from noise during chain or domain fitting and using a differential evolution (DE) algorithm (Storn and Price, 1997) in addition to the quasi-Newton optimization, preventing it from getting trapped in local optima. Further, it also masks out density map regions that have already been matched with chain models, ensuring that different chains do not align to the same regions.
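
The sketch below illustrates rigid-body fitting with L-BFGS restarted from multiple initial poses, the strategy described above for offsetting the local nature of the optimizer. The toy objective (mean interpolated density under the atoms) is only a stand-in for the model-map correlation that DEMO-EM actually optimizes, and all inputs are synthetic.

```python
# Minimal sketch: rigid-body fitting of a domain into a density map with L-BFGS
# restarted from several initial poses. Map, domain and objective are toy stand-ins.
import numpy as np
from scipy.optimize import minimize
from scipy.ndimage import map_coordinates
from scipy.spatial.transform import Rotation

density = np.random.rand(48, 48, 48)          # toy density map, 1 Å voxels
domain = np.random.rand(150, 3) * 10.0        # toy domain Cα coordinates (Å)
domain -= domain.mean(axis=0)                 # centre the domain

def score(pose):
    rotvec, shift = pose[:3], pose[3:]
    moved = Rotation.from_rotvec(rotvec).apply(domain) + shift
    vals = map_coordinates(density, moved.T, order=1, mode="constant")
    return -vals.mean()                       # minimise negative density overlap

best = None
for _ in range(8):                            # multiple random starting poses
    x0 = np.concatenate([np.random.uniform(-np.pi, np.pi, 3),
                         np.random.uniform(10, 38, 3)])
    res = minimize(score, x0, method="L-BFGS-B")
    if best is None or res.fun < best.fun:
        best = res

print("best pose (rotvec, translation):", best.x, "score:", -best.fun)
```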

Feature learning: DeepTracer-ID is a server-based, de novo protein identification method that uses high-resolution cryo-EM density maps (better than 4.2 Å resolution) to identify candidate proteins within a user-specified organism without requiring additional information (Chang et al., 2022). It achieves this by using DeepTracer (Pfab et al., 2021), a deep learning method, to automatically generate a protein backbone model from the input cryo-EM density map. Model building: The DeepTracer-generated protein backbone model is used by DeepTracer-ID to search against a library of AlphaFold2 (Jumper et al., 2021) predictions for all proteins in the given organism using three different alignment algorithms. PyMOL-align (DeLano and Lam, 2005) considers both sequence and structural similarities and is the default option. PyMOL-cealign (Shindyalov and Bourne, 1998) is ideal for proteins with low or no sequence similarity, or when side-chain densities in the cryo-EM map are not well resolved. FATCAT (Li et al., 2020) specializes in flexible protein structure comparison, simultaneously optimizing alignment and minimizing rigid-body movements. It can be particularly useful for mitigating errors in AlphaFold2 predictions, and for smaller proteins or those whose local environment dictates their 3D structure.

Feature learning: EMBuild is an automated, deep learning-based method designed to construct multi-chain protein complex models directly from intermediate-resolution cryo-EM density maps (He et al., 2022). EMBuild employs a nested U-Net (UNet++) architecture with dense skip connections to predict a precise main-chain probability map from the input cryo-EM density map, and AlphaFold2 (Jumper et al., 2021) to predict 3D structures of the input protein sequences. The main-chain probability map assigns a probability to each grid point, indicating the likelihood of a main-chain atom being present in that vicinity. Instead of directly fitting protein chains to the raw cryo-EM density, EMBuild uses the accurate main-chain probability map, with more precise location information for main-chain atoms, which significantly improves the precision of subsequent chain fitting. Model building: EMBuild aligns each AlphaFold2 (Jumper et al., 2021)-predicted structure of the individual protein chains to the main-chain probability map using a Fast Fourier Transform (FFT)-based global alignment (Wen et al., 2020). To account for potential deviations between the input protein chain model and the ground-truth structure, a semi-flexible domain refinement strategy is then employed: each chain is first rigidly fitted, and then its individual structural domains are locally refined. For each fitted protein chain, EMBuild calculates a main-chain match score to quantify its fit to the probability map. With all individual chain fitting results, the final protein complex structure is assembled by identifying the optimal combination of fitted chains. This is achieved through an iterative Bron–Kerbosch maximum clique algorithm, which selects combinations with the highest total main-chain match score while preventing severe atomic clashes between chains. Unassembled chains are then iteratively integrated into the complex through further fitting, and the complex is refined using phenix.real_space_refine (Afonine et al., 2018a). Structural category annotations from EMInfo (Cao et al., 2025) have been shown to improve the modeling accuracy of EMBuild. EMBuild treats all density voxels equally during fitting, which can lead to inaccuracies in fitting at protein regions containing a mixture of α-helices, β-sheets, and coils. By incorporating secondary structure details from EMInfo, EMBuild + EMInfo can accurately fit protein fragments by matching them to density voxels of the corresponding secondary structure type.
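
As a rough illustration of the clique-based assembly step, the sketch below builds a graph of candidate chain poses, connects mutually non-clashing pairs, enumerates maximal cliques with networkx (a Bron–Kerbosch implementation), and keeps the clique with the highest summed match score. The poses, clash criterion, and scores are all invented for illustration and simplify EMBuild's iterative procedure.

```python
# Minimal sketch: assemble a complex from per-chain fitting candidates by keeping only
# mutually non-clashing poses. Nodes are candidate poses, edges connect pairs whose
# centres are far enough apart (a crude clash proxy), and the maximal clique with the
# highest summed match score is selected. All numbers are illustrative.
import itertools
import numpy as np
import networkx as nx

rng = np.random.default_rng(0)
poses = [
    {"chain": f"chain{i}", "centre": rng.uniform(0, 100, 3), "score": rng.uniform(0.3, 0.9)}
    for i in range(12)
]

G = nx.Graph()
G.add_nodes_from(range(len(poses)))
for i, j in itertools.combinations(range(len(poses)), 2):
    if np.linalg.norm(poses[i]["centre"] - poses[j]["centre"]) > 15.0:  # no severe clash
        G.add_edge(i, j)

best_clique = max(
    nx.find_cliques(G),                     # Bron-Kerbosch enumeration of maximal cliques
    key=lambda clique: sum(poses[i]["score"] for i in clique),
)
print("selected poses:", [poses[i]["chain"] for i in best_clique])
```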

Feature learning: FFF (“Fragment-guided Flexible Fitting”) uses a deep-learning-based multi-level recognition network to capture diverse structural features from cryo-EM density maps (Chen et al., 2023). Inspired by RetinaNet (ResNet-based), but adapted with 3D convolutions, this network predicts not only a voxel-wise backbone probability map but also unifies four distinct coarse-grained tasks: Cα atom detection, Cα location prediction, pseudo-peptide vector (PPV) estimation, and amino acid (AA) classification. The network’s backbone (BB) component identifies whether each voxel is part of the protein backbone. The Cα detection module then estimates the likelihood of a grid cell containing a Cα atom. For cells likely to contain a Cα atom, the amino acid classification module predicts the specific type of amino acid. Finally, the pseudo-peptide vector (PPV) estimation module determines the vectors connecting each Cα atom to the subsequent Cα atom. Model building: FFF uses the extracted pseudo-peptide vectors (PPVs) to generate and recognize protein structural fragments. This involves connecting the Cα atoms of residues into fragments, a process guided by selecting neighboring atoms based on the estimated PPVs and the known protein sequence. Once the protein fragments and backbone maps are identified, targeted molecular dynamics (TMD) (Schlitter et al., 1994) is used to refine and update an initial structure, aligning it with the recognized fragments. Next, molecular dynamics flexible fitting (MDFF) (Trabuco et al., 2009) updates the entire backbone conformation of the initial structure to match the predicted backbone map, yielding the complete structure of the target protein. During this MDFF step, positional restraints are added to the atoms initially selected in the TMD phase, preventing significant deviations.

Feature learning: CrAI is an automatic deep learning method that detects and aligns antibodies (Fabs and VHHs) within cryo-EM density maps at resolutions up to 10.0 Å (Mallet et al., 2025). The core of CrAI is a customized 3D U-Net architecture, trained on a curated dataset. It uses a dedicated representation of antibody structures to facilitate the learning process, framing the task as a special instance of 3D object detection. To prevent redundant predictions of overlapping objects, CrAI employs a Non-Maximal Suppression (NMS) algorithm, a crucial post-processing technique to refine the output of object detection models. Model prediction: After detection, CrAI fits pre-classified Fab or VHH templates to the predicted locations and poses, providing structural models of the detected antibodies within the cryo-EM density map.
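
A minimal sketch of greedy non-maximal suppression over 3D detections is given below; each detection is a predicted center with a confidence score, and a simple distance threshold replaces the box-overlap (IoU) criterion of standard object-detection NMS. The values are illustrative and unrelated to CrAI's trained detector.

```python
# Minimal sketch: greedy non-maximal suppression over 3D detections. Detections closer
# than `min_dist` to an already accepted, higher-scoring detection are suppressed.
import numpy as np

def nms_3d(centres, scores, min_dist=20.0):
    order = np.argsort(scores)[::-1]          # best detections first
    kept = []
    for idx in order:
        if all(np.linalg.norm(centres[idx] - centres[k]) >= min_dist for k in kept):
            kept.append(idx)
    return kept

centres = np.random.rand(30, 3) * 100.0       # candidate antibody locations (Å)
scores = np.random.rand(30)                   # detection confidences
print("kept detections:", nms_3d(centres, scores))
```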

Feature learning: DeepMainmast is a method for protein structure modeling from cryo-EM density maps at resolutions between 2.5 and 5.0 Å (Terashi et al., 2024). It achieves this by combining protein main-chain tracing using deep learning with structure modeling from AlphaFold2 (Jumper et al., 2021). DeepMainmast utilizes Emap2sf (Emap to structural features), a deep-learning method with a U-shaped network (UNet) architecture with skip connections (Huang, 2020). The network, consisting of three encoder blocks and two decoder blocks built upon three-dimensional convolutional layers (Conv3d), outputs probability values for the 20 amino acid types and the backbone atoms (N, Cα, C) at each grid point in the density map, which are required for subsequent Cα tracing. Local dense points (LDPs) are created by clustering grid points with a high probability for Cα using the mean shift algorithm (Carreira-Perpinan, 2006). Model building: DeepMainmast reconstructs protein structures from cryo-EM density maps through a multi-stage process. It starts by connecting LDPs, identified from high Cα probability regions, into Cα paths using a VRP (Vehicle Routing Problem) solver, which efficiently finds optimal routes by minimizing costs based on distance and main-chain atom probabilities. Once Cα paths are established, they are aligned with the target protein sequence using the Smith-Waterman algorithm (Smith and Waterman, 1981), with matching scores determined by DAQ (AA) scores (Terashi et al., 2022) calculated from the Emap2sf output. This generates numerous Cα fragments, with the entire process repeated across various parameter combinations (Cα probability cutoff, number of VRP vehicles, and cost function parameters). A key innovation in DeepMainmast is the integration of AlphaFold2 (AF2) models, specifically the top-ranked one based on pLDDT scores. AF2 models contribute by providing additional Cα fragments to fill gaps in low-density regions and also serve as a global structure for fitting to the density map. Next, Cα protein models are assembled from these combined fragment libraries using a constraint programming (CP) solver (Perron and Lee, 2011). This solver optimizes fragment combinations to maximize the total DAQ score while preventing steric clashes, ensuring consistent amino acid positioning, and maintaining consistent Cα-Cα distances. In parallel, AF2 models are also directly superimposed onto the density map using the structure-fitting program VESPER (Han et al., 2021). Finally, these refined Cα models are converted into full-atom structures using PULCHRA (Rotkiewicz and Skolnick, 2008), with any missing regions subsequently filled and refined by RosettaCM (Song et al., 2013).
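
The sketch below illustrates how high-probability Cα grid points can be collapsed into local dense points with mean-shift clustering, assuming scikit-learn's MeanShift; the probability cutoff and bandwidth are placeholders, not DeepMainmast's actual parameters.

```python
# Minimal sketch: cluster grid points with a high predicted Cα probability into local
# dense points (LDPs) using mean-shift. Cutoff and bandwidth are illustrative values.
import numpy as np
from sklearn.cluster import MeanShift

ca_prob = np.random.rand(32, 32, 32)               # toy voxel-wise Cα probability
grid = np.argwhere(ca_prob > 0.98).astype(float)   # high-probability grid points (voxels)

ms = MeanShift(bandwidth=2.0)                      # bandwidth in voxel units
ms.fit(grid)

ldps = ms.cluster_centers_                         # one representative point per cluster
print(f"{len(grid)} grid points collapsed into {len(ldps)} LDPs")
```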

Model prediction: DeepTracer-Refine (Chen J. et al., 2024) improves protein structure prediction by combining DeepTracer (Pfab et al., 2021) (a map-to-model method) with AlphaFold2 (Jumper et al., 2021) (a sequence-to-model method). It splits AlphaFold structures into compact domains, identifying optimal separation points based on AlphaFold’s predicted Local Distance Difference Test (pLDDT) metric, which estimates the confidence level for each residue in its prediction. Each of these smaller domains then undergoes rigid-body alignment using a selection of algorithms, including PyMOL cealign (Shindyalov and Bourne, 1998), PyMOL align (DeLano and Lam, 2005), and Chimera MatchMaker (Meng et al., 2006), with the best fit chosen for maximum residue coverage. This iterative alignment process continuously updates AlphaFold’s residue locations as each domain is aligned to DeepTracer’s prediction, ultimately providing a more refined and accurate protein structure. Model prediction: DeepTracer-LowResEnhance (Ma and Si, 2025) is a computational method that enhances low-resolution cryo-EM density maps by integrating structural predictions from AlphaFold2 (Jumper et al., 2021) with a deep learning-based map refinement strategy. The input sequence is first processed by AlphaFold2 to generate an initial 3D structure, which is then converted into a simulated map using ChimeraX (Meng et al., 2023). Both the simulated map and the original cryo-EM map are then fed into the CryoFEM (Dai et al., 2023) module, which averages these maps, splits them into chunks, and uses a UNet-based deep neural network to reconstruct a refined map. This integration combines AlphaFold’s accurate sequence-based structural predictions with the cryo-EM data, significantly enhancing model quality in low-resolution cases. Finally, DeepTracer (Pfab et al., 2021) generates a high-accuracy 3D protein structure model from the refined cryo-EM map.

Feature learning: CryoJAM is a deep learning-based tool designed to automate and enhance the challenging process of fitting large protein complexes into medium-resolution cryo-EM density maps, thereby accelerating their structural modeling (Carrion et al., 2024). The 3D convolutional neural network (CNN) of CryoJAM leverages a U-Net architecture and a novel composite loss function that incorporates both Fourier-shell correlation (FSC) and Root Mean Squared Error (RMSE). FSC serves as a proxy for the quality of fit in Fourier space, while RMSE directly optimizes atomic accuracy in real space. The UNet-based architecture of CryoJAM handles both 3D volumetric cryo-EM densities and homolog structures, and its outputs represent the adjusted homolog coordinates. Model building: CryoJAM generates a volume representing predicted Cα atom locations within the cryo-EM density, highlighting backbone density. Since this output is a continuous volume, CryoJAM employs a post-processing workflow to derive discrete all-atom coordinates. This involves using a KD-tree (Skrodzki, 2019) to select the top Cα voxel activations, binarizing the volume, and outputting 3D coordinates. A greedy matching algorithm aligns these selected Cα atoms to their closest counterparts in the input structure for adjustment. Finally, these Cα traces are processed by PULCHRA (Rotkiewicz and Skolnick, 2008), which can construct physically realistic all-atom structures from only Cα coordinates.

Feature learning: DiffModeler is a fully automated method designed to model large protein complex structures, effectively fitting them into cryo-EM density maps with resolutions up to approximately 15.0 Å (Wang et al., 2024). It employs a diffusion model to trace protein backbones by capturing the local density patterns representing protein backbones in low-resolution cryo-EM density maps. It then integrates this diffusion model-enhanced map with AlphaFold2 (Jumper et al., 2021) predicted structures for accurate structure fitting, thereby enhancing the extraction of structural information from intermediate-resolution cryo-EM density maps. A diffusion model is a generative model within a probabilistic framework, trained to generate data samples that closely resemble the underlying data distribution. The conditional diffusion model of DiffModeler starts with random Gaussian noise and a cryo-EM density map as inputs. The model employs an encoder-decoder network architecture (based on U-Net). The encoder first processes a cryo-EM density box, computing and embedding its hidden features. The decoder then begins with random Gaussian noise and iteratively refines its density estimates, moving closer to the ground-truth traced backbone. The entire encoder-decoder network is optimized by comparing the predicted and ground-truth traced backbones. Once trained, the conditional diffusion model generates a refined, traced backbone based on the input cryo-EM density. Model building: DiffModeler fits AlphaFold2 (Jumper et al., 2021) predicted single-chain protein structures into the diffusion model-enhanced map using the VESPER algorithm (Han et al., 2021). VESPER aligns each subunit with the diffused map and generates the top 100 candidate poses. A subsequent assembly phase then uses a greedy algorithm to combine suitable poses from these subunits, thereby constructing the complete protein complex structure. Additionally, DiffModeler splits multi-domain AlphaFold2 structures into individual domains using SWORD2 (Cretin et al., 2022) and uses these domains in the fitting process to mitigate inaccuracies, particularly in AlphaFold2 models where domain orientations are incorrect despite accurate individual domains. DMcloud is a local structure-fitting tool for medium- to low-resolution cryo-EM density maps (Terashi et al., 2025). It fits structures by converting molecular models and cryo-EM density maps into point clouds for precise local alignments and iterative refinements, addressing erroneous AlphaFold2 models whose local structural details are accurate but whose global conformation is not.

Feature learning: Cryo2struct2 (Giri and Cheng, 2025) is a deep learning model that combines sequence-based features from a Protein Language Model (ESM) (Lin et al., 2023) with cryo-EM density maps to derive templates for AlphaFold3 (Abramson et al., 2024) structure predictions. This integration allows Cryo2Struct2 to generate more accurate atomic models, especially for large proteins with flexible or complex conformations and those with regions of low resolution or missing density. Unlike Cryo2Struct (Giri and Cheng, 2024), which uses two separate 3D transformer models to predict atom types and amino acid types respectively, its successor Cryo2Struct2 uses a unified deep learning model with a shared transformer encoder to extract features from cryo-EM density maps. This encoder feeds into two specialized decoders for atom-type and amino acid-type predictions. The architecture of the model is based on 3D SegFormer (Perera et al., 2024) and is designed to integrate ESM protein language model embeddings with map features. This is done by transforming the ESM embeddings via a multi-layer perceptron (MLP) and adding them to the multi-scale feature representations from the map, ensuring sequence-level information is incorporated. The transformer encoder, utilizing an efficient self-attention mechanism, captures hierarchical features from the density map, while each decoder predicts its specific labels (atom types or amino acid types) for every voxel. The atom-type decoder directly predicts labels (Cα, N, C, or no atom), while the amino acid-type decoder, benefiting from the atom-type features as an auxiliary input, predicts 21 different amino acid classes (20 standard amino acids plus the absence of an amino acid or an unknown amino acid). Cryo2Struct2 is also trained on the Cryo2StructData dataset (Giri et al., 2024) and uses two clustering thresholds (2 Å and 3 Å) for clustering predicted Cα voxels. Model building: Cryo2Struct2 uses a two-step process to generate accurate protein models. First, the predicted atom and amino acid type probabilities are used to build a Hidden Markov Model (HMM) (Box 1: Glossary section). This HMM, processed by a modified Viterbi algorithm (Forney, 1973) (similar to Cryo2Struct (Giri and Cheng, 2024)), aligns the protein amino acid sequence to the predicted and clustered Cα voxel coordinates to construct an initial 3D atomic protein backbone. These initial structures then serve as templates for the advanced structure prediction capabilities of AlphaFold3 (Abramson et al., 2024), guiding AlphaFold3 to generate structures that are consistent with the cryo-EM density. To obtain structurally meaningful templates, the query protein sequence is aligned to the template sequences derived from the Cryo2Struct2-generated structural predictions. The templates allow AlphaFold3 to refine the structure, incorporate prior structural information, and ultimately improve the accuracy of the final atomic model while maintaining consistency with the experimental cryo-EM density data.
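
For readers unfamiliar with Viterbi decoding, the sketch below implements a generic log-space Viterbi over a toy HMM; in a Cryo2Struct-style alignment the hidden states would correspond to clustered Cα positions and the emissions to amino acid types, but the matrices here are random placeholders and do not reproduce Cryo2Struct2's modified formulation.

```python
# Minimal sketch: generic log-space Viterbi decoding for an HMM with toy parameters.
import numpy as np

def viterbi(log_start, log_trans, log_emit, observations):
    n_states = log_start.shape[0]
    T = len(observations)
    dp = np.full((T, n_states), -np.inf)       # best log-probability ending in each state
    back = np.zeros((T, n_states), dtype=int)  # backpointers for path reconstruction

    dp[0] = log_start + log_emit[:, observations[0]]
    for t in range(1, T):
        cand = dp[t - 1][:, None] + log_trans  # shape (prev_state, state)
        back[t] = cand.argmax(axis=0)
        dp[t] = cand.max(axis=0) + log_emit[:, observations[t]]

    path = [int(dp[-1].argmax())]
    for t in range(T - 1, 0, -1):              # trace the best path backwards
        path.append(int(back[t, path[-1]]))
    return path[::-1]

# Toy 3-state HMM emitting 4 symbols, purely for illustration.
rng = np.random.default_rng(1)
start = np.log(np.array([0.5, 0.3, 0.2]))
trans = np.log(rng.dirichlet(np.ones(3), size=3))
emit = np.log(rng.dirichlet(np.ones(4), size=3))
print(viterbi(start, trans, emit, observations=[0, 2, 3, 1, 0]))
```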

Feature learning: DEMO-EMfit is a method for fitting atomic structures of protein and protein-nucleic acid complexes into cryo-EM and cryo-ET maps (Cai et al., 2025). It integrates deep learning-based backbone map extraction from the cryo-EM density map with a global-local structural pose search and optimization. Since DEMO-EMfit utilizes the correlation between the cryo-EM density map and the backbone atoms of the structure during the fitting procedure, it first extracts key structural features from the input density maps. For this, it leverages DiffModeler (Wang et al., 2024), a deep learning method based on a diffusion model, to generate a backbone density map from the input cryo-EM density map that exclusively contains backbone atom information. Model building: DEMO-EMfit employs Fast Fourier Transform (FFT) and Limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) (Liu and Nocedal, 1989) algorithms for global and local searches to determine the optimal structure pose. Initially, an FFT-based global search generates raw poses by exhaustively exploring possible structure orientations in Fourier space, evaluating them using the density correlation coefficient between the structure and the map. The top-scoring poses from this global search then undergo a local search using the L-BFGS algorithm for refinement. Finally, domain-level optimization is applied to further refine the fitted model, addressing potential domain-level biases.

Feature learning: DEMO-EMol is an improved server for accurately assembling protein-nucleic acid complex structures from cryo-EM density maps (Zhang Z. et al., 2025). It integrates deep learning-based map segmentation of protein and nucleic acid regions with an iterative structure fitting and assembly process guided by map constraints. DEMO-EMol begins by segmenting protein and nucleic acid regions from the input density map using the U-Net++ architecture from EMNUSS (He and Huang, 2021a) with the training dataset obtained from the first stage dataset of EM2NA (Li et al., 2024), another deep learning tool specific for modeling nucleic acids in cryo-EM maps. These separate protein and nucleic acid density maps are then used to iteratively fit and assemble their respective chain models. Model building: DEMO-EMol independently fits protein and nucleic acid chain models into their respective segmented maps by sequentially optimizing their poses using the L-BFGS algorithm (Liu and Nocedal, 1989). Since L-BFGS is a local optimization method, multiple initial poses are explored for each chain. To enhance accuracy and reduce the search space, a map masking strategy is employed, masking map regions already matched. The L-BFGS optimization uses a composite scoring function that integrates global and local model-to-map correlation coefficients (Cai et al., 2025) along with Fourier Shell Correlation (FSC). Once all chain models are fitted, DEMO-EMol constructs the final complex model by identifying the optimal combination of all chain poses using a differential evolution algorithm (Zhou et al., 2020), followed by a domain-level flexible refinement where the positions and orientations of all protein domains are simultaneously optimized.
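
The sketch below shows differential evolution used as a global search over a rigid-body pose (rotation vector plus translation), the role DE plays alongside local L-BFGS refinement in DEMO-EM2 and DEMO-EMol; the toy objective samples interpolated density rather than the correlation- and FSC-based scores these methods actually use, and all inputs are synthetic.

```python
# Minimal sketch: differential evolution as a global search over a rigid-body pose.
# Map, chain and objective are toy stand-ins for the real correlation-based scores.
import numpy as np
from scipy.optimize import differential_evolution
from scipy.ndimage import map_coordinates
from scipy.spatial.transform import Rotation

density = np.random.rand(48, 48, 48)           # toy segmented density map, 1 Å voxels
chain = np.random.rand(120, 3) * 12.0          # toy chain Cα coordinates (Å)
chain -= chain.mean(axis=0)

def score(pose):
    moved = Rotation.from_rotvec(pose[:3]).apply(chain) + pose[3:]
    return -map_coordinates(density, moved.T, order=1, mode="constant").mean()

bounds = [(-np.pi, np.pi)] * 3 + [(10.0, 38.0)] * 3   # rotation vector + translation (Å)
result = differential_evolution(score, bounds, maxiter=50, seed=0, polish=True)
print("global pose:", result.x, "score:", -result.fun)
```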

Feature learning: CryoDomain is a deep neural network that identifies protein domains from low-resolution cryo-EM density maps by leveraging a dual-tower network architecture: the DensityTower and the AtomTower (Dai et al., 2025). Each tower undergoes self-supervised pre-training on its respective modality (raw cryo-EM density maps for the DensityTower and atomic structures for the AtomTower) to extract modality-specific features. CryoDomain then simultaneously learns embeddings from both protein domain density maps and atomic structures within a shared, low-dimensional space. The network integrates these two modalities into a unified representation through an alignment process. The DensityTower network (a U-Net-like architecture) comprises a Residual U-Net, a Swin-Conv U-Net, a Conv module, and a Compress module. The Residual U-Net and Swin-Conv U-Net progressively learn both local and non-local spatial features from cryo-EM density maps. The Conv module reconstructs the map, and the Compress module projects it into a density map embedding. In the AtomTower network, an AtomEncoder, Structure Module, and Fusion Module collectively extract an atomic structure embedding from the input atomic structure. During training, each density map embedding is aligned with its corresponding atomic structure embedding. Model building: CryoDomain effectively transfers knowledge from rich atomic structure datasets to sparse density map datasets by integrating these two modalities through cross-modal alignment. This alignment maximizes the similarity between embeddings of the same domain types while minimizing the similarity between different ones, creating a unified low-dimensional representation. After alignment, CryoDomain constructs a Density-atom embedding Database (DateDB) of atomic structure embeddings, enabling protein domain identification from density maps through embedding retrieval.
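
A minimal sketch of the retrieval step is shown below, under the assumption that both modalities have already been embedded into a shared space: the query density-map embedding is compared against a database of atomic-structure embeddings by cosine similarity. The embeddings and labels are random placeholders, not CryoDomain's learned representations or its DateDB.

```python
# Minimal sketch: identify a protein domain by retrieving the nearest atomic-structure
# embedding for a query density-map embedding using cosine similarity.
import numpy as np

rng = np.random.default_rng(2)
db_embeddings = rng.normal(size=(1000, 128))      # embeddings of known domain structures
db_labels = [f"domain_{i}" for i in range(1000)]  # hypothetical domain identifiers
query = rng.normal(size=128)                      # embedding of a query density map

def l2_normalise(x, axis=-1):
    return x / np.linalg.norm(x, axis=axis, keepdims=True)

sims = l2_normalise(db_embeddings) @ l2_normalise(query)   # cosine similarities
top5 = np.argsort(sims)[::-1][:5]
for rank, idx in enumerate(top5, start=1):
    print(rank, db_labels[idx], f"{sims[idx]:.3f}")
```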

Feature learning: MICA (Gyawali et al., 2025) is a deep learning method that combines cryo-EM density maps with AlphaFold3 (Abramson et al., 2024) predicted structures to create more accurate protein models from cryo-EM density maps at 1.0–4.0 Å resolution. Unlike other methods that use predicted structures only at the end, in a post-processing step, MICA integrates AlphaFold3 predicted structures with cryo-EM density maps at both the input and output levels. This allows MICA to combine the strengths of both data types, the experimental accuracy of cryo-EM maps and the completeness of AlphaFold3 predictions, and to compensate for low-resolution areas in maps or inaccuracies in AlphaFold3 predictions of large protein complexes. At the input stage, MICA combines cryo-EM density maps and AlphaFold3 predicted structures through representation learning before passing them to a deep learning network to build protein structures. Its deep learning architecture processes the fused representation of cryo-EM density maps and AlphaFold3 structures using a multi-task encoder-decoder system with a feature pyramid network (FPN) (Lin, 2017) to predict the locations of backbone atoms, Cα atoms, and amino acid types. The predicted Cα candidates are refined using a DBSCAN clustering strategy (Ester, 1996) and a non-maximum suppression algorithm to output Cα atoms, which, along with their amino acid type predictions, are used as input for backbone tracing to build an initial protein backbone model. Model building: MICA uses the backbone tracing procedure of EModelX (+AF) (Chen S. et al., 2024) to build an initial backbone model from the predicted backbone atoms, Cα atoms and amino acid types. It starts by identifying high-confidence Cα atoms, linking them into protein chains and assigning their amino acid types, followed by filling in any gaps in the initial backbone model using information from AlphaFold3 (Abramson et al., 2024) structures. This Cα backbone model is then converted into a full-atom model using PULCHRA (Rotkiewicz and Skolnick, 2008) and refined using phenix.real_space_refine (Afonine et al., 2018a).
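
The sketch below shows how nearby Cα candidates can be merged with DBSCAN, keeping one representative point per cluster; the eps and min_samples values are illustrative, not MICA's actual settings.

```python
# Minimal sketch: merge nearby Cα candidate points with DBSCAN and keep the cluster
# mean as the representative Cα position. Parameters are illustrative only.
import numpy as np
from sklearn.cluster import DBSCAN

candidates = np.random.rand(500, 3) * 80.0        # predicted Cα candidate coordinates (Å)

labels = DBSCAN(eps=2.0, min_samples=2).fit(candidates).labels_
refined = np.array([
    candidates[labels == lbl].mean(axis=0)
    for lbl in set(labels) if lbl != -1           # label -1 marks unclustered noise points
])
print(f"{len(candidates)} candidates reduced to {len(refined)} Cα atoms")
```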

5 Assessment and validation

Assessing the accuracy of the predicted models is the critical final step in the model building pipeline (Zhu et al., 2025). The assessment methods can be broadly classified into three categories: predicted-target structure assessment, map-model assessment and model quality assessment. Evaluation metrics for assessing the accuracy of predicted protein models against target structures fundamentally rely on the alignment of Cα atoms, as these atoms represent the backbone of individual residues and thus their spatial arrangement. Evaluation metrics commonly used to quantify the percentage of correctly paired Cα atoms are recall, sequence recall, precision, F1-score, Cα match score, Cα sequence match score, Cα quality score and TM-score. Recall measures the percentage of residues where the predicted Cα atom is within 3.0 Å of the deposited model (Jamali, 2022). Sequence recall measures the percentage of residues where the predicted Cα atom is within 3 Å of the deposited model and the predicted amino acid type is correct (Jamali, 2022). Precision is the percentage of predicted Cα atoms that fall within 3 Å of a Cα atom in the deposited model (Jamali, 2022). The F1 score is the harmonic mean of precision and recall for Cα atoms (Jamali, 2022). This metric offers a balanced assessment, considering both the specificity and sensitivity of predicted Cα atom positions. The Cα match score is the percentage of Cα atoms (residues) in a predicted model that are within 3.0 Å of their corresponding residues in the true structure (Terwilliger et al., 2018). The sequence match score indicates the percentage of aligned residues that possess the identical amino acid type as their corresponding counterparts in the true structure (Terwilliger et al., 2018). The Cα quality score is calculated by multiplying the Cα match score by the ratio of the total predicted residues to the total residues in the experimental structure (Giri and Cheng, 2024). A standard TM-score (Zhang and Skolnick, 2004) quantifies the structural similarity between a predicted model and its corresponding known structure. The normalized TM-score (Giri and Cheng, 2024) is the TM-score of the atomic models, adjusted by the length of the known structure. For the deep learning-based model building methods discussed, performance metrics are available in their respective publications; note, however, that the test datasets used for evaluation often differ between these methods. The existing evaluation metrics have certain limitations because they often ignore chain-level correspondence and, in the case of TM-score, do not account for residue identity. An improved evaluation metric, the TMRR-score (Zhang C. et al., 2025), has been introduced that combines the TM-score with residue recall to measure both structural and residue-type similarity. A recent study (Zhang C. et al., 2025) comprehensively benchmarked state-of-the-art model-building approaches using the TMRR-score and 50 cryo-EM density maps across various resolutions. This assessment evaluated how well predicted models aligned from atomic to intermediate resolutions, their runtime efficiency, and the benefits of integrating structure prediction techniques.
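
A minimal sketch of the 3 Å-based recall, precision, and F1 described above is given below, using simple nearest-neighbor matching with a KD-tree; published benchmarks may additionally enforce one-to-one pairing and sequence or chain correspondence.

```python
# Minimal sketch: recall, precision and F1 for predicted Cα positions under a 3 Å
# criterion, using nearest-neighbour matching. Inputs here are synthetic.
import numpy as np
from scipy.spatial import cKDTree

def ca_metrics(predicted, deposited, cutoff=3.0):
    pred_tree = cKDTree(predicted)
    dep_tree = cKDTree(deposited)

    d_dep, _ = pred_tree.query(deposited)     # nearest predicted Cα for each deposited Cα
    d_pred, _ = dep_tree.query(predicted)     # nearest deposited Cα for each predicted Cα

    recall = np.mean(d_dep <= cutoff)         # fraction of deposited residues recovered
    precision = np.mean(d_pred <= cutoff)     # fraction of predictions near a true Cα
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return recall, precision, f1

deposited = np.random.rand(400, 3) * 100.0    # toy "deposited model" Cα coordinates (Å)
predicted = deposited + np.random.normal(scale=1.0, size=deposited.shape)

print("recall %.2f  precision %.2f  F1 %.2f" % ca_metrics(predicted, deposited))
```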

Model validation methods use validation criteria such as Ramachandran plot outliers, all-atom clash scores, deviations in bonding geometry, and rotamer preferences to assess the quality and accuracy of macromolecular structures. MolProbity (Davis et al., 2007), frequently used and incorporated into the PDB validation reports, is a comprehensive web server for validating the quality of 3D structures, including proteins, nucleic acids, and their complexes. It provides detailed all-atom contact analysis to identify steric clashes and offers updated diagnostics for dihedral angles, hydrogen bonds, and van der Waals contacts at molecular interfaces. MolProbity combines multiple geometric parameters into a single, overall MolProbity score, where lower values indicate higher model quality. Specifically, this score is a log-weighted combination of the clashscore, the percentage of Ramachandran plot outliers, and the percentage of side chain rotamer outliers. MolProbity is unique in offering all-atom contact analysis and its use of highly accurate, up-to-date Ramachandran and rotamer distributions. Another metric, CaBLAM (Prisant et al., 2020) (Calpha-Based Low-resolution Annotation Method), assesses the backbone geometry of proteins by detecting Cα-geometry outliers to identify areas of probable secondary structure. CaBLAM is designed for low-resolution structures, where complex errors or ambiguities can render highly sensitive conformational analyses, like Ramachandran analysis, difficult or impossible to interpret. Among the deep learning-based model validation methods, the DAQ (Terashi et al., 2022) (Deep-learning-based Amino-acid-wise Quality) score has been computed for all PDB entries derived from cryo-EM maps in the resolution range of 2.5–5.0 Å. The DAQ score assesses local model quality at the residue level. A key advantage of DAQ is its ability to identify regions with incorrect amino acid assignments (e.g., sequence shifts), even when the backbone is accurately modeled. The predicted local distance difference test (pLDDT) score is a valuable tool for assessing AlphaFold (Jumper et al., 2021) predicted models, especially since AlphaFold predicted models are now commonly used in cryo-EM model building. pLDDT is a per-residue measure of local confidence for predicted protein structures, scaled from 0 to 100. Higher scores indicate greater confidence and typically a more accurate prediction.

Map-model scores quantitatively assess how well a structural model fits the experimental cryo-EM density map; software packages such as TEMPy (Cragnolini et al., 2021), CCP-EM (Joseph et al., 2022) and Phenix (Afonine et al., 2018b) offer tools to calculate such cross-correlation scores. Recent advancements have introduced local map-model fitting scores like the Strudel score (Istrate et al., 2021), EMRinger (Barad et al., 2015), Q-scores (Pintilie et al., 2020), DeepQs (Feng et al., 2024), FSC-Q (Ramírez-Aportela et al., 2021), and MEDIC (Reggiano et al., 2023), which provide more granular insights into model quality. 3D-Strudel (Istrate et al., 2021) is a model-dependent tool for validating map features in cryo-EM structures ranging from 2 to 4 Å resolution. It calculates a Strudel score, which is the linear correlation coefficient between the experimental map values around a target residue and the values from a rotamer-specific map-motif obtained from the 3D-Strudel motif library. EMRinger (Barad et al., 2015) evaluates how well the side-chain conformation of a residue in a model aligns with the cryo-EM map compared to other common rotamers. The Q-score (Pintilie et al., 2020), now included in the PDB validation reports, quantitatively assesses the resolvability of individual atoms by comparing their density profiles to an ideal Gaussian reference profile of the atom. DeepQs (Feng et al., 2024) uses a 3D Vision Transformer (ViT) to estimate local Q-scores in cryo-EM maps up to 5.0 Å. FSC-Q (Ramírez-Aportela et al., 2021) quantifies local resolution differences between a cryo-EM map and an atomic model through a localized Fourier Shell Correlation (FSC) analysis. It compares the local resolutions derived from small sub-maps of both the experimental map and a model-derived map. An FSC-Q value near zero indicates strong support for atoms by the map. MEDIC (Model Error Detection in Cryo-EM) (Reggiano et al., 2023) is a robust statistical model that identifies local backbone errors in cryo-EM protein structures. It combines local fit-to-density with deep-learning-derived structural information, including energy metrics and a predicted error score from a machine learning model trained to distinguish native from decoy structures.
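
As a reference for the correlation measure underlying several of these scores, the sketch below computes a global Fourier Shell Correlation curve between two maps with NumPy; production implementations additionally handle masking, voxel size, and half-map conventions, and FSC-Q applies the analysis locally to sub-maps rather than globally as shown here.

```python
# Minimal sketch: Fourier Shell Correlation (FSC) between two equally sized maps,
# binned into spherical shells in Fourier space. Inputs are synthetic.
import numpy as np

def fsc(map1, map2, n_shells=20):
    F1, F2 = np.fft.fftn(map1), np.fft.fftn(map2)
    nz, ny, nx = map1.shape
    kz, ky, kx = np.meshgrid(np.fft.fftfreq(nz), np.fft.fftfreq(ny),
                             np.fft.fftfreq(nx), indexing="ij")
    radius = np.sqrt(kx**2 + ky**2 + kz**2)
    edges = np.linspace(0, 0.5, n_shells + 1)            # bin up to the Nyquist frequency

    curve = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        shell = (radius >= lo) & (radius < hi)
        num = np.sum(F1[shell] * np.conj(F2[shell])).real
        den = np.sqrt(np.sum(np.abs(F1[shell])**2) * np.sum(np.abs(F2[shell])**2))
        curve.append(num / den if den > 0 else 0.0)
    return edges[1:], np.array(curve)

vol = np.random.rand(32, 32, 32)
shells, curve = fsc(vol, vol + 0.05 * np.random.rand(32, 32, 32))
print(np.round(curve, 2))                                # near 1.0 where the maps agree
```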

6 Availability and applications

Most of the tools we have discussed are open source, with their source code publicly accessible on repositories like GitHub and Zenodo (European Organization for Nuclear Research, 2013) (links provided in Table 5). These platforms provide detailed documentation, from software and hardware dependencies to specific local installation instructions, ensuring users can successfully set up and run these programs on their own computing systems, provided they meet the necessary software and hardware requirements. Beyond local installations, many of these tools are also readily available on web-based or cloud-based computational platforms. These include services like Cosmic Cryo-EM (Cianfrocco, 2017), Code Ocean (Cheifet, 2021), and Google Colab, as well as dedicated servers (links provided in Table 5). Such platforms are particularly beneficial for users who may not possess the robust computational infrastructure required for local execution, such as high-performance GPUs. In these cloud-based environments, users can simply upload the necessary inputs, such as a density map or a sequence, and receive the processed output, often a built model, without needing to manage the underlying computational resources themselves. Deep learning-based automated model-building methods have become an integral part of the structural biologist’s toolkit, enabling applications that span from contaminant identification in heterogeneous cryo-EM datasets to the structure determination of physiologically important macromolecular complexes, as well as method development across diverse areas of structural biology. Cryo-EM studies where these tools have made notable contributions are discussed in the Supporting Information.


Table 5. Availability of deep learning-based automated model building methods in cryo-EM.

7 Limitations and future directions

In general, the performance of deep learning-based automated model-building methods declines as the resolution of cryo-EM density maps decreases, with accurate models typically obtained only within the resolution ranges represented in their training datasets. Cryo-EM density maps often exhibit a heterogeneous resolution distribution within a single reconstruction. Thus, even when the global resolution of the map falls within the preferred resolution range of a specific method, regions of lower local resolution in the map may lie outside this range, resulting in inaccurate or incomplete structural models in those areas. Moreover, modeling efforts focus on interpretable, high-resolution regions, while lower-resolution and uninterpretable areas remain poorly characterized and lack structural labels. As the map resolution declines, backbone tracing and sequence registration become challenging for de novo approaches and may require assistance from external structure prediction methods. For hybrid methods, the accuracy of the output models may be affected by the quality of the structural templates integrated with voxel-wise structural predictions during model generation. Cryo-EM density maps contain information about the conformational heterogeneity and dynamic regions of biomolecules. However, most deep learning-based automated model-building methods are limited in their ability to handle conformational heterogeneity and produce static models from inherently dynamic cryo-EM density map inputs. CryoBoltz (Raghu, 2025) uses global-local constraints derived from input density maps to guide the sampling process of Boltz-1 (Wohlwend, 2025), an open-source diffusion-based protein structure prediction model. This enables CryoBoltz to generate structural ensembles that capture the underlying conformational heterogeneity of the maps. Since simulated maps lack the complexity present in experimental cryo-EM maps, such as heterogeneous resolution, conformational heterogeneity, and complex noise, deep learning-based methods trained on these simulated maps may not perform well on experimental cryo-EM maps, even when the input map resolution is within the resolution range of such methods. The strengths and limitations of different deep learning architectures (Section 3) directly influence the performance of the automated model building tools that utilize them. This means that if a particular deep learning architecture has inherent limitations for a given task, those same limitations will be evident in the tools built upon it. Conversely, a neural network architecture with specific advantages for certain prediction tasks will lead to more effective tools.

findMySequence (Chojnowski et al., 2022), a tool to identify protein sequences in crystallography and cryo-EM maps, depends on the accuracy of the traced backbone; fragmented or mistraced models, often found in low-resolution maps, may therefore affect sequence identification. While the approach scales to multimeric assemblies, it requires manual selection of intermediate fragments in an iterative modeling process. Nevertheless, the application is very fast for the majority of cases. The performance of checkMySequence (Chojnowski, 2022), which automatically detects register-shift errors in protein models built into cryo-EM maps, depends on the quality of the input map and model, specifically on map preprocessing and local map resolution. Further, it uses relatively long test fragments at lower local map resolutions, which may cause short, local register-shift errors to be missed. Nevertheless, checkMySequence yields useful results where manual residue-by-residue validation is difficult. The accuracy of doubleHelix (Chojnowski, 2023) for nucleic-acid sequence identification, assignment, and validation in cryo-EM maps depends on the accuracy of the input nucleic-acid models. Further, the performance of its neural-network classifier is influenced by map quality, and since nucleic acid regions in cryo-EM maps are typically poorly resolved, this may affect accurate sequence assignment. Moreover, the signal in density maps is often limited to distinguishing only two nucleobase types, purines and pyrimidines, complicating sequence assignment. Nevertheless, doubleHelix can successfully assign sequences to models built in cryo-EM maps at local resolutions as low as 5.0 Å. The ability of EMsequenceFinder (Mondal et al., 2025) to accurately assign an amino acid sequence to backbone fragments depends on the accuracy of the backbone traced in the input cryo-EM density map. Reliable backbone traces can be generated using the methods listed in Table 2. Further, the program at present only considers fragments with α-helical and β-strand backbones; other secondary structure elements, such as coils and loops, will be considered in the future. Like other methods, its prediction accuracy declines as the map resolution deteriorates. Map preprocessing, such as denoising and density modification, could improve prediction accuracy by reducing noise.

CNN-classifier (Li et al., 2016), one of the early deep learning methods developed to detect protein secondary structures in medium-resolution cryo-EM density maps, is trained on a limited number of simulated cryo-EM maps (Table 4), which may limit its secondary structure detection performance on experimental cryo-EM density maps. Emap2sec (Maddhuri Venkata Subramaniya et al., 2019) can only analyze protein maps and, while it detects density regions of secondary structure, it does not actually place the α-helices and β-strands within these detected density regions. Its successor, Emap2sec+ (Wang et al., 2021), can detect both protein secondary structure elements and nucleic acids in cryo-EM density maps and also improves the accuracy of protein secondary structure detection. Haruspex (Mostosi et al., 2020), a method for identifying protein secondary structures and nucleic acids in cryo-EM density maps, may incorrectly label semi-helical structures, β-hairpin turns, and polyproline type II (PII) helices as α-helices, and loosely parallel structures that lack the typical hydrogen-bond pattern as β-strands. Future versions of Haruspex will predict additional classes, such as β-turns, polyproline helices, and membrane detergent regions, to reduce the number of misidentified secondary structure elements and improve the overall accuracy of the method. The performance of EMNUSS (He and Huang, 2021a) is sensitive to map resolution, especially at middle resolution (5.0–9.5 Å), as EMNUSS was trained on a small number of maps (120) in that resolution range. Further, incorrect predictions may occur on unusual density volumes, as the EMNUSS network is not trained on lower-resolution maps. Therefore, more maps are needed for training to improve the performance and robustness of EMNUSS. DeepSSETracer (Mu et al., 2021), a tool to identify protein secondary structures in medium-resolution maps, is designed to operate on component maps with a maximum size of 100 voxels in any dimension and requires prior segmentation of cryo-EM density maps containing multiple chains. As such, its detection performance depends on the quality of the segmented maps. HaPi (Garcia Condado et al., 2022) struggles to determine the handedness of structures lacking clear or sufficient α-helices. This difficulty increases with lower resolutions, particularly as the resolution approaches 5.0 Å, where less information regarding handedness is encoded. The minimum resolution for determining the hand of an α-helix is its pitch, which on average is 5.4 Å. At resolutions coarser than this, an α-helix appears as a cylinder, which possesses no apparent handedness. CryoSSESeg (Sazzed, 2024), a tool to detect protein secondary structures in medium-resolution cryo-EM maps, segments the entire density of a protein chain so that the model can learn from complete secondary structures, avoiding the artifacts of cutting secondary structures seen in patch-based segmentation. However, it is important to note that this approach may require a large amount of memory when working with large protein chains. EMInfo (Cao et al., 2025), an automated tool for predicting protein secondary structure elements and nucleic acids in cryo-EM density maps, may struggle to accurately predict structural categories in several cases, such as at the terminal ends of nucleic acids with weak density signals, in lower-resolution density regions, and at the interfaces between different macromolecule categories, and may confuse coils with strands.

The main factor limiting the detection accuracy of AAnchor (Rozanov and Wolfson, 2018) on high-resolution maps (2.2 Å) is the limited experimental cryo-EM data (3 maps in the 1.8–2.3 Å range) used for training the AAnchor algorithm. Given that the number of high-resolution experimental cryo-EM maps has increased since then (Figure 1), the current version of AAnchor may provide improved results. The large-scale dataset of A2-Net (Xu et al., 2019), a tool for amino acid determination in cryo-EM density maps, is derived from simulated cryo-EM densities and could potentially benefit from including experimental maps in its training. Trained on a larger set of simulated cryo-EM maps, Cascaded-CNN (Si et al., 2020), a tool for predicting protein backbone structures in high-resolution maps, could likewise improve its performance by including experimental cryo-EM maps in its training data. Further, the threshold value selected to normalize input maps before model building, a common step in many methods, may affect the final structure prediction. The authors of Cascaded-CNN have developed a method that automatically estimates the correct threshold value for density maps to alleviate this issue (Pfab and Si, 2020). Structure Generator (Li, 2020), a template-free method to build protein structures in cryo-EM maps, is also trained on simulated cryo-EM density profiles of proteins (Table 4), and its performance may vary on experimental cryo-EM maps. Due to the noise in experimental cryo-EM density maps, models created by DeepTracer (Pfab et al., 2021) may sometimes show a poor fit to the density map, including misplaced side chains and geometric and connectivity errors, thus requiring downstream model rebuilding and refinement using tools such as molecular dynamics flexible fitting (MDFF) (Trabuco et al., 2009) and phenix.real_space_refine (Afonine et al., 2018a). Furthermore, DeepTracer does not build structures for other macromolecules, such as nucleic acids, in cryo-EM density maps. DeepTracer-2.0 (Nakamura et al., 2023) extends DeepTracer’s capability to model protein-DNA/RNA macromolecular complexes from cryo-EM density maps. The performance of DeepTracer-2.0 in accurately modeling protein-DNA/RNA complexes depends upon accurate segmentation to extract the density maps of the separate macromolecules. As such, its performance decreases on low-resolution cryo-EM density maps, often observed for nucleic acids, due to the challenges in accurately segmenting such maps. As increasingly high-resolution density maps become available for protein-DNA/RNA complexes, its performance will improve. DeepTracer-2.0 also depends on Brickworx (Chojnowski et al., 2015) for post-processing and building DNA/RNA models from the predicted voxels, which can be time consuming. Future work will make DeepTracer-2.0 less reliant on third-party software for post-processing. DeepMM (He and Huang, 2021b) can introduce errors or uncertainties into models built in cryo-EM density maps that have low overall resolution or low-resolution regions. Because DeepMM is designed for single chains, it relies on Segger (Pintilie et al., 2010) to segment the original map into separate regions when modeling multi-chain complexes. As such, its performance on multi-chain complexes is dependent on the quality of the map segmentation.
SEGEM (Chen et al., 2021) showed lower amino acid prediction accuracy for the experimental test set than for the simulated set, due to the varying local resolution and noise in experimental maps, which make them difficult to normalize for convolutional neural network (CNN) training. SEGEM++ (Chen et al., 2021), which combines SEGEM and AlphaFold2, also struggles to build complete models in cryo-EM density maps with varying local resolution and noise, which impact its ability to identify native Cα sites. Nevertheless, the performance of SEGEM and SEGEM++ is expected to improve with advancements in 3D image semantic segmentation and cryo-EM image processing. SegmA (Rozanov and Wolfson, 2023), a tool designed for residue-type segmentation of cryo-EM density maps, may mislabel amino acid voxels as nucleotides at protein-nucleic acid interfaces and in regions with low resolution. Further, it may also struggle to accurately label amino acids with similar properties, as they appear similar in cryo-EM maps. Like other methods, its performance is sensitive to map resolution and degrades for amino acids as the resolution decreases. Nevertheless, SegmA is a powerful tool for distinguishing between protein residues and nucleotides. The resolution of input cryo-EM maps affects the model building performance of ModelAngelo (Jamali et al., 2024). The quality of the initial graph generated by the convolutional neural network (CNN) and the accuracy of amino acid classification, both crucial for mapping sequences to the main chain, are significantly affected by low-resolution cryo-EM data. Poor amino acid classifications can lead to errors in sequence assignments and subsequent incorrect chain assignments, especially in complexes with many similar sequences. Protein structure prediction methods, such as AlphaFold2, can be integrated with features derived from low-resolution cryo-EM maps to automate model building, as demonstrated by some of the methods listed in Table 3. For nucleic acids, assigning the correct sequence of nucleobases (the equivalent of side chains in RNA or DNA) to the predicted backbone becomes challenging, especially at resolutions around 3.5 Å, as it becomes difficult to differentiate between individual purines (adenine from guanine) or pyrimidines (cytosine from thymine/uracil). doubleHelix, a neural-network classifier that identifies nucleobases, can be combined with the ModelAngelo-predicted nucleic acid backbone to automate the model building of complete nucleic acid structures. While CryoREAD (Wang et al., 2023) accurately identifies nucleobase positions, base-type detection also remains less precise, especially at lower resolutions, affecting the accuracy of sequence matching. Nevertheless, CryoREAD provides accurate nucleic acid structures even from lower-resolution maps for manual sequence assignment. For large structures, where the final model from CryoREAD might have backbone gaps or incorrect base pairing, manual refinement with tools like Coot is required. CryoREAD aims to improve nucleic acid structure modeling by integrating secondary structure information for nucleic acids predicted with high accuracy. While SMARTFold (Li et al., 2023) creates more complete and accurate atomic structures compared with other state-of-the-art methods, its memory-intensive architecture limits its use to protein sequences with a maximum of 2,500 residues. In addition, the multiple sequence alignment (MSA) search is time consuming.
Future versions of SMARTFold will address these issues with a smaller model that uses smaller feature channels to reduce memory use and a protein language model to replace the time-consuming MSA search. The performance of EMRNA (Li T. et al., 2025) can be inconsistent on low-resolution maps (4.0–6.0 Å), as it is challenging to build RNA models at these resolutions. For large RNA molecules, EMRNA accurately places RNA backbone fragments but struggles to correctly order these fragments sequentially, often leading to models with a high root-mean-square deviation (RMSD). Since many of these models still retain the correct overall fold, minor manual adjustments can often fix them. However, third-party software is often required to build base atoms to achieve high accuracy. Unlike EM2NA (Li et al., 2024), which can automatically segment and identify DNA/RNA regions in raw cryo-EM maps, EMRNA requires users to properly mask the map around the RNA being modeled for optimal performance. It is challenging for EM2NA to automatically assign the correct nucleotide type for the built model, often requiring prior knowledge and expert intervention to complete the modeling. EM2NA may also struggle to recognize non-standard helical geometries of nucleic acids, including structures like bulges or flipped-out nucleotides that can sometimes occur in protein-nucleic acid complexes. Nevertheless, EM2NA accelerates model building significantly compared to manually starting from scratch. While Cryo2Struct (Giri and Cheng, 2024) can correctly identify most Cα atoms and build highly accurate atomic models, creating comprehensive and accurate models of large protein structures from density maps alone remains a significant challenge. This is primarily because resolution heterogeneity in cryo-EM maps limits the resolvability of every residue of a protein. Even prediction errors for a few Cα atoms in noisy cryo-EM maps can limit the accurate prediction of long, continuous stretches of polypeptide chains. Supplementing cryo-EM density maps with additional inputs, such as AlphaFold2 protein structures and the symmetry of multi-chain protein complexes, can produce more accurate and complete predictions. Cryo2Struct2 (Giri and Cheng, 2025) improves on Cryo2Struct by integrating cryo-EM data with the advanced structure prediction capabilities of AlphaFold3. EModelX (Chen S. et al., 2024), a method for building protein complex structures in cryo-EM maps, could be extended to include modeling of DNA/RNA-protein assemblies and small molecules. EModelX(+AF) (Chen S. et al., 2024), which combines EModelX and AlphaFold, effectively handles both low-resolution cryo-EM density maps and inaccurate AlphaFold predictions while modeling protein complexes in such density maps. Though CryFold (Su et al., 2025) is less reliant on map resolution for model building, modeling at low resolutions remains challenging due to the difficulty in accurately identifying protein side chains in these regions. DeepCryoRNA (Li and Chen, 2025), which only models RNA in cryo-EM density maps, may show variable performance for maps with resolutions below 4.5 Å. Future iterations of DeepCryoRNA are anticipated to extend to modeling DNA or DNA/RNA-protein complexes in cryo-EM maps, like CryoREAD, EM2NA and DeepTracer. One of the main limitations of E3-CryoFold (Wang et al., 2025) is that it currently only models the residue backbone without considering side chains.
Side-chain modeling of the E3-CryoFold-derived protein backbone can potentially be achieved with tools such as SCWRL4 (Krivov et al., 2009a), similar to the approach taken by DeepTracer. Further, E3-CryoFold can yield inconsistent RMSDs between predicted and target structures owing to the lack of constraints during atom coordinate generation. Incorporating atom coordinate information derived directly from the density map into the E3-CryoFold predictions can help mitigate this issue.
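The RMSD values referred to above are conventionally computed after optimal rigid-body superposition of matched predicted and reference coordinates. The short sketch below is purely illustrative and is not taken from any of the tools discussed; it assumes two equally sized NumPy arrays of matched Cα (or phosphate) coordinates and uses the Kabsch algorithm (Kabsch, 1976) to superpose them before reporting the RMSD.

```python
import numpy as np

def kabsch_rmsd(pred, ref):
    """RMSD (in the coordinate units, here Å) between two (N, 3) arrays of matched
    coordinates after optimal rigid-body superposition (Kabsch, 1976)."""
    P = pred - pred.mean(axis=0)              # center both coordinate sets
    Q = ref - ref.mean(axis=0)
    H = P.T @ Q                               # 3x3 covariance matrix
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))    # guard against an improper rotation (reflection)
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T   # optimal rotation matrix
    diff = P @ R.T - Q                        # rotate the prediction, compare to the reference
    return float(np.sqrt((diff ** 2).sum() / len(P)))

# Toy usage: a rotated and translated copy of a reference trace should give an RMSD near zero.
rng = np.random.default_rng(0)
ref = rng.normal(scale=10.0, size=(120, 3))                        # stand-in for a Cα/P trace
rot_z = np.array([[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
pred = ref @ rot_z.T + np.array([5.0, -3.0, 2.0])
print(f"RMSD = {kabsch_rmsd(pred, ref):.3f}")
```

Dedicated structure-comparison tools additionally handle sequence alignment, chain matching, and missing residues, all of which this sketch omits; it is meant only to make the reported metric concrete.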

The accuracy of the 3D convolutional neural network (3D-CNN) underlying CR-I-TASSER (Zhang et al., 2022), and therefore of its predicted Cα traces, decreases as resolution deteriorates. CR-I-TASSER also struggles to model target structures containing loops and disordered regions. Future iterations of CR-I-TASSER will combine the 3D-CNN with multiple sequence alignments (MSAs) to improve the accuracy of its Cα traces and model predictions. Although CR-I-TASSER is currently limited to modeling monomeric proteins and requires manual segmentation of cryo-EM maps, the pipeline will be extended to model larger protein–protein and protein–nucleic acid complexes. The DEMO-EM (Zhou et al., 2022) pipeline, which assembles multi-domain proteins, requires manual segmentation of maps and could be improved by including automatic segmentation techniques to reduce computational time. Another limitation of DEMO-EM is that the initial domain models, created by D-I-TASSER independently of the cryo-EM density data, can be inaccurate and can degrade the quality of the final models. Future versions of DEMO-EM will improve the accuracy of the final models by using cryo-EM density restraints to guide the initial domain model generation. DEMO-EM2 (Zhang et al., 2024), which constructs protein complex models, improves many aspects of DEMO-EM, such as preprocessing density maps to suppress the effects of noise on fitting protein models and using advanced strategies to prevent the algorithm from getting stuck in local optima. However, DEMO-EM2 could be further improved by using deep learning to identify distinct domain regions, to predict inter-chain and inter-domain distances and orientations directly from density maps, and to use these as constraints to guide the assembly of protein complexes. The DEMO-EMol (Zhang Z. et al., 2025) server, which assembles protein-nucleic acid complex structures from cryo-EM maps, aims to develop an end-to-end, deep learning-based model-to-map fitting method and to improve fitting efficiency by matching local and global geometric features. Additionally, it will add explicit energy terms that preserve base pairing in the output nucleic acid models, thereby improving their accuracy. DEMO-EMfit (Cai et al., 2025), which fits atomic structures into cryo-EM density maps, outlines several directions for improving its current version. To improve both the accuracy and efficiency of fitting, end-to-end deep learning approaches could be used to learn structural poses directly from point clouds extracted from cryo-EM density maps. For better evaluation of fitting quality, a combination of physics-based Gaussian mixture models and DOT scores from VESPER may be more reliable than traditional correlation coefficient metrics. DeepTracer-ID (Chang et al., 2022), which can identify component proteins directly in cryo-EM maps, may struggle with modeling small proteins (<100 amino acids) at resolutions of 4.2 Å or better. Small proteins often exist within larger complexes, yet they rarely reach near-atomic resolution in cryo-EM studies. This scarcity of high-quality experimental structural information for small proteins necessitates input from structure prediction methods such as AlphaFold. DeepTracer-Refine (Chen J. et al., 2024), which uses DeepTracer-predicted structures to refine AlphaFold predictions, is limited to docking domains from AlphaFold structures into cryo-EM density maps and at present cannot refine structures at the residue level.
Further, DeepTracer-Refine struggles with AlphaFold predictions in which domains are incorrectly folded, as it is designed only to correct inaccurate domain arrangements in AlphaFold predictions. Future development of DeepTracer-Refine will explore residue-level refinement of the backbone. DeepTracer-LowResEnhance (Ma and Si, 2025), which extends DeepTracer's ability to build models in low-resolution cryo-EM maps, is currently limited to proteins; future work will explore its application to modeling DNA/RNA structures from low-resolution cryo-EM maps. EMBuild (He et al., 2022) accurately builds protein complex models into cryo-EM maps when provided with accurate predictions of the individual chains at the fragment or domain level. Its performance suffers when regions of the input structures, such as those from AlphaFold2, are inaccurate, requiring removal of inaccurate or disordered parts of the predicted structures based on their pLDDT values (a minimal filtering sketch illustrating this step is shown below). Like other methods, FFF (Chen et al., 2023) is sensitive to map resolution: its performance degrades as resolution decreases, which can lead to errors in protein sequence alignment during the fragment recognition stage and ultimately affect the subsequent structure-fitting steps. Although CrAI (Mallet et al., 2025) is a state-of-the-art method for detecting antibodies in cryo-EM maps, it performs better on Fabs than on VHHs because VHHs are smaller, have non-canonical binding modes, and have less training data available. Despite these challenges, CrAI still outperforms other methods at detecting VHHs. Even when its predictions are successful, CrAI may be less precise because it relies on a template rather than a ground-truth structure. The performance of DeepMainmast (Terashi et al., 2024) decreases as the local map resolution becomes lower, requiring AlphaFold2 models in low-resolution regions of the map to correct inaccurate backbone tracing in the density. DeepMainmast also tends to model residues in helices more accurately than those in β-strands and loops. CryoJAM (Carrion et al., 2024), an automated tool for fitting protein homologs into cryo-EM maps, may struggle to identify small differences between very similar homolog structures. It also faces significant challenges with low-resolution maps and with maps containing membrane or detergent density. The diffusion model of DiffModeler (Wang et al., 2024) struggles to generate accurate backbone traces in low-resolution regions, limiting subsequent structure fitting to regions with higher local resolution. Further, the DiffModeler pipeline is currently limited to modeling protein complexes, but its capabilities will be extended to protein/DNA/RNA complexes in the future. The accuracy of DiffModeler is also influenced by the quality of the initial AlphaFold2 models it uses; fitting individual protein domains rather than entire proteins could help overcome inaccuracies in these initial models. DiffModeler can potentially be applied to cryo-electron tomography (cryo-ET) maps at resolutions of 15 Å or better. The performance of CryoDomain (Dai et al., 2025), a tool for identifying protein domains in low-resolution cryo-EM maps, may be influenced by the quality and accuracy of the initial segmentation of the map into its individual components. Although CryoDomain may sometimes retrieve incorrect domain types, resulting in false positives, it is more accurate and robust across a wide range of resolutions than other methods.
MICA (Gyawali et al., 2025) struggles with some large protein complexes when the cryo-EM data are noisy or have missing density, and when AlphaFold3 predictions do not align well with the experimental maps. Future efforts will focus on enhancing sequence and chain registration, accounting for the symmetry of protein complexes during sequence registration, and integrating advanced side-chain prediction algorithms directly into the deep learning framework of MICA.
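Several of the hybrid approaches above (e.g., EMBuild, DeepMainmast, DiffModeler, MICA) dock AlphaFold-predicted models whose low-confidence regions are best removed before fitting. The sketch below is a minimal illustration of that preprocessing step rather than the procedure of any specific tool: it assumes a hypothetical AlphaFold2 model file (af2_model.pdb) and a pLDDT cutoff of 70, and uses Biopython to discard residues whose per-residue pLDDT, which AlphaFold stores in the B-factor column, falls below the cutoff.

```python
from Bio.PDB import PDBParser, PDBIO, Select

class PLDDTSelect(Select):
    """Keep only residues whose pLDDT (stored in the B-factor column) meets the cutoff."""

    def __init__(self, cutoff=70.0):
        self.cutoff = cutoff

    def accept_residue(self, residue):
        # Every atom of a residue carries the same per-residue pLDDT in AlphaFold output,
        # so checking all atoms is equivalent to checking any one of them.
        return all(atom.get_bfactor() >= self.cutoff for atom in residue)

parser = PDBParser(QUIET=True)
structure = parser.get_structure("af2", "af2_model.pdb")            # hypothetical input model
io = PDBIO()
io.set_structure(structure)
io.save("af2_model_plddt70.pdb", select=PLDDTSelect(cutoff=70.0))   # trimmed copy for map fitting
```

The trimmed model can then be supplied to the fitting or assembly stage of the chosen pipeline; the cutoff is a tunable choice, with stricter values discarding more of the flexible, low-confidence loops.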

8 Conclusion

Deep learning has undeniably accelerated and automated every aspect of the cryo-EM structure determination pipeline. This review presented a comprehensive and up-to-date survey of the current landscape of deep learning-based methods (∼50) for automated model building into cryo-EM density maps. We outlined common conceptual strategies across diverse methods and summarized key aspects of these tools, including their training datasets, neural network architectures, prediction tasks, the types of biomolecules they build, and their availability as servers or publicly accessible code. By discussing the capabilities and limitations of the available methods, we hope this synthesis will serve as a valuable resource and stimulate future improvements in the field.

The resolution revolution in cryo-EM now increasingly enables high-resolution structure determination of biomolecular complexes with interpretable density for bound small molecules and drugs. However, most current deep learning-based model building methods focus primarily on building the biomacromolecule itself. This highlights a pressing need for robust deep learning tools that automatically identify and build bound small molecules (Karolczak et al., 2024), which would significantly accelerate structure-based drug discovery efforts. Addressing this requires overcoming challenges related to training data. While deep learning-based methods train on large datasets, their application is often limited by the availability of map-model pairs that faithfully capture experimental reality. Real cryo-EM maps exhibit conformational heterogeneity that the associated deposited models do not capture. This discrepancy hinders the development of deep learning-based methods capable of modeling true conformational ensembles (Chen, 2025; Matsumoto et al., 2021; Wankowicz et al., 2024) or accurately representing ligand binding states (Riley et al., 2021). Therefore, efforts are required to curate datasets of conformationally resolved experimental map-model pairs for training deep learning-based methods (Astore et al., 2025).

Looking ahead, as neural network architectures continue to evolve rapidly, multi-modal approaches that integrate different architectures, diverse data types, and geometric deep learning (Bronstein, 2021; Bengio, 2023) to learn from multi-dimensional data show significant promise for capturing diverse structural features of biomolecules. Deep learning-based automated model building for cryo-electron tomography (cryo-ET) datasets represents a crucial next step, holding the potential to identify and build macromolecules directly within their native cellular environment and ushering in a new era of in situ structural biology (Kyrilis et al., 2024).

Author contributions

HB: Conceptualization, Methodology, Investigation, Data curation, Formal analysis, Validation, Visualization, Resources, Writing – original draft, Writing – review and editing, Supervision, Project administration. AdG: Writing – review and editing, Funding acquisition, Project administration, Resources.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. NIH R35GM133598 and NYU startup grant to AdG and NYU postdoctoral research and professional development support grant to HB.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that Generative AI was used in the creation of this manuscript for grammatical checking of sentences.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2025.1613399/full#supplementary-material

References

Abadi, M. (2016). “TensorFlow: a system for large-scale machine learning,” in Proceedings of the 12th USENIX conference on operating systems design and implementation. Berkeley, CA: USENIX Association.

Google Scholar

Abramson, J., Adler, J., Dunger, J., Evans, R., Green, T., Pritzel, A., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500. doi:10.1038/s41586-024-07487-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Afonine, P. V., Poon, B. K., Read, R. J., Sobolev, O. V., Terwilliger, T. C., Urzhumtsev, A., et al. (2018a). Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. Sect. D. 74, 531–544. doi:10.1107/S2059798318006551

PubMed Abstract | CrossRef Full Text | Google Scholar

Afonine, P. V., Klaholz, B. P., Moriarty, N. W., Poon, B. K., Sobolev, O. V., Terwilliger, T. C., et al. (2018b). New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr. Sect. D. 74, 814–840. doi:10.1107/S2059798318009324

PubMed Abstract | CrossRef Full Text | Google Scholar

AlQuraishi, M. (2019). End-to-End differentiable learning of protein structure. Cell Syst. 8, 292–301.e3. doi:10.1016/j.cels.2019.03.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Alzubaidi, L., Zhang, J., Humaidi, A. J., Al-Dujaili, A., Duan, Y., Al-Shamma, O., et al. (2021). Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J. Big Data 8, 53. doi:10.1186/s40537-021-00444-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, A. C. (2003). The process of structure-based drug design. Chem. and Biol. 10, 787–797. doi:10.1016/j.chembiol.2003.09.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Astore, M. A., Pradhan, A. S., Thiede, E. H., and Hanson, S. M. (2024). Protein dynamics underlying allosteric regulation. Curr. Opin. Struct. Biol. 84, 102768. doi:10.1016/j.sbi.2023.102768

PubMed Abstract | CrossRef Full Text | Google Scholar

Astore, M. A., Woollard, G., Silva-Sánchez, D., Zhou, W., Kopylov, M., Dao Duc, K., et al. (2025). The inaugural flatiron institute Cryo-EM conformational heterogeneity challenge. bioRxiv 2025. doi:10.1101/2025.07.18.665582

PubMed Abstract | CrossRef Full Text | Google Scholar

Bahdanau, D. (2014). Neural machine translation by jointly learning to align and translate. Corr. abs/1409.0473. doi:10.48550/arXiv.1409.0473

CrossRef Full Text | Google Scholar

Baker, M. L., Abeysinghe, S. S., Schuh, S., Coleman, R. A., Abrams, A., Marsh, M. P., et al. (2011). Modeling protein structure at near atomic resolutions with gorgon. J. Struct. Biol. 174, 360–373. doi:10.1016/j.jsb.2011.01.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Bansia, H., Mahanta, P., Yennawar, N. H., and Ramakumar, S. (2021). Small glycols discover cryptic pockets on proteins for fragment-based approaches. J. Chem. Inf. Model. 61, 1322–1333. doi:10.1021/acs.jcim.0c01126

PubMed Abstract | CrossRef Full Text | Google Scholar

Barad, B. A., Echols, N., Wang, R. Y. R., Cheng, Y., DiMaio, F., Adams, P. D., et al. (2015). EMRinger: side chain–directed model and map validation for 3D cryo-electron microscopy. Nat. Methods 12, 943–946. doi:10.1038/nmeth.3541

PubMed Abstract | CrossRef Full Text | Google Scholar

Bell, J. M., Chen, M., Durmaz, T., Fluty, A. C., and Ludtke, S. J. (2018). New software tools in EMAN2 inspired by EMDatabank map challenge. J. Struct. Biol. 204, 283–290. doi:10.1016/j.jsb.2018.09.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Bengio, Y. (2023). A hitchhiker's guide to geometric GNNs for 3D atomic systems.

Google Scholar

Bronstein, M. M. (2021). Geometric deep learning: grids, groups, graphs, geodesics, and gauges.

Google Scholar

Cai, Y., Zhang, Z., Xu, X., Xu, L., Chen, Y., Zhang, G., et al. (2025). Fitting atomic structures into Cryo-EM maps by coupling deep learning-enhanced map processing with global-local optimization. J. Chem. Inf. Model. 65, 3800–3811. doi:10.1021/acs.jcim.5c00004

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, H., He, J., Li, T., and Huang, S. Y. (2025). Deciphering protein secondary structures and nucleic acids in Cryo-EM maps using deep learning. J. Chem. Inf. Model. 65, 1641–1652. doi:10.1021/acs.jcim.4c01971

PubMed Abstract | CrossRef Full Text | Google Scholar

Carreira-Perpinan, M. A. (2006). “Acceleration strategies for gaussian mean-shift image segmentation,” in 2006 IEEE computer Society conference on computer vision and pattern recognition (CVPR'06).

Google Scholar

Carrion, J., Manjrekar, M., and Mikulevica, A. (2024). CryoJAM: automating protein homolog fitting in medium resolution Cryo-EM density maps. bioRxiv 2024. doi:10.1101/2024.07.10.602952

CrossRef Full Text | Google Scholar

Carugo, O., and Djinović-Carugo, K. (2023). Structural biology: a golden era. PLOS Biol. 21, e3002187. doi:10.1371/journal.pbio.3002187

PubMed Abstract | CrossRef Full Text | Google Scholar

Case, D. A., Cerutti, D. S., Cruzeiro, V. W. D., Darden, T. A., Duke, R. E., Ghazimirsaeed, M., et al. (2025). Recent developments in amber biomolecular simulations. J. Chem. Inf. Model. 65, 7835–7843. doi:10.1021/acs.jcim.5c01063

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, W.-H., Huang, S. H., Lin, H. H., Chung, S. C., and Tu, I. P. (2021). Cryo-EM analyses permit visualization of structural polymorphism of biological macromolecules. Front. Bioinforma. 1, 788308. doi:10.3389/fbinf.2021.788308

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, L., Wang, F., Connolly, K., Meng, H., Su, Z., Cvirkaite-Krupovic, V., et al. (2022). DeepTracer-ID: de novo protein identification from cryo-EM maps. Biophysical J. 121, 2840–2848. doi:10.1016/j.bpj.2022.06.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheifet, B. (2021). Promoting reproducibility with Code Ocean. Genome Biol. 22, 65. doi:10.1186/s13059-021-02299-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, M. (2025). Building molecular model series from heterogeneous CryoEM structures using Gaussian mixture models and deep neural networks. Commun. Biol. 8, 798. doi:10.1038/s42003-025-08202-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, M., Baldwin, P. R., Ludtke, S. J., and Baker, M. L. (2016). De novo modeling in cryo-EM density maps with Pathwalking. J. Struct. Biol. 196, 289–298. doi:10.1016/j.jsb.2016.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, S., Zhang, S., Li, X., Liu, Y., and Yang, Y. (2021). SEGEM: a fast and accurate automated protein backbone structure modeling method for Cryo-EM. IEEE Int. Conf. Bioinforma. Biomed. (BIBM), 24–31. doi:10.1109/bibm52615.2021.9669647

CrossRef Full Text | Google Scholar

Chen, W., Wang, X., and Wang, Y. (2023). FFF: Fragment-Guided flexible fitting for building complete protein structures, 19776–19785. doi:10.1109/cvpr52729.2023.01894

CrossRef Full Text | Google Scholar

Chen, S., Zhang, S., Fang, X., Lin, L., Zhao, H., and Yang, Y. (2024a). Protein complex structure modeling by cross-modal alignment between cryo-EM maps and protein sequences. Nat. Commun. 15, 8808. doi:10.1038/s41467-024-53116-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J., Zia, A., Luo, A., Meng, H., Wang, F., Hou, J., et al. (2024b). Enhancing cryo-EM structure prediction with DeepTracer and AlphaFold2 integration. Briefings Bioinforma. 25, bbae118. doi:10.1093/bib/bbae118

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, Y. (2018). Single-particle cryo-EM—How did it get here and where will it go. Science 361, 876–880. doi:10.1126/science.aat4346

PubMed Abstract | CrossRef Full Text | Google Scholar

Chojnowski, G. (2022). Sequence-assignment validation in cryo-EM models with checkMySequence. Acta Crystallogr. Sect. D. 78, 806–816. doi:10.1107/S2059798322005009

PubMed Abstract | CrossRef Full Text | Google Scholar

Chojnowski, G. (2023). DoubleHelix: nucleic acid sequence identification, assignment and validation tool for cryo-EM and crystal structure models. Nucleic Acids Res. 51, 8255–8269. doi:10.1093/nar/gkad553

PubMed Abstract | CrossRef Full Text | Google Scholar

Chojnowski, G., Waleń, T., Piątkowski, P., Potrzebowski, W., and Bujnicki, J. M. (2015). Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps. Acta Crystallogr. Sect. D. 71, 697–705. doi:10.1107/S1399004715000383

PubMed Abstract | CrossRef Full Text | Google Scholar

Chojnowski, G., Simpkin, A. J., Leonardo, D. A., Seifert-Davila, W., Vivas-Ruiz, D. E., Keegan, R. M., et al. (2022). findMySequence: a neural-network-based approach for identification of unknown proteins in X-ray crystallography and cryo-EM. IUCrJ 9, 86–97. doi:10.1107/S2052252521011088

PubMed Abstract | CrossRef Full Text | Google Scholar

Chua, E. Y. D., Mendez, J. H., Rapp, M., Ilca, S. L., Tan, Y. Z., Maruthi, K., et al. (2022). Better, faster, cheaper: recent advances in cryo–electron microscopy. Annu. Rev. Biochem. 91, 1–32. doi:10.1146/annurev-biochem-032620-110705

PubMed Abstract | CrossRef Full Text | Google Scholar

Cianfrocco, M. A. (2017). “COSMIC2: a science gateway for cryo-electron microscopy structure determination,” in Practice and experience in advanced research computing 2017: sustainability, success and impact. New York, NY: Association for Computing Machinery.

Google Scholar

Cianfrocco, M. A., and Kellogg, E. H. (2020). What could Go wrong? A practical guide to single-particle Cryo-EM: from biochemistry to atomic models. J. Chem. Inf. Model. 60, 2458–2469. doi:10.1021/acs.jcim.9b01178

PubMed Abstract | CrossRef Full Text | Google Scholar

Çiçek, Ö. (2016). “3D U-Net: learning dense volumetric segmentation from sparse annotation,” in Medical image computing and computer-assisted intervention – MICCAI 2016: 19th international conference, Athens, Greece, October 17-21, 2016, proceedings, part II. Springer-Verlag.

Google Scholar

The wwPDB Consortium (2023). EMDB—The electron microscopy Data bank. Nucleic Acids Res. 52, D456–D465. doi:10.1093/nar/gkad1019

PubMed Abstract | CrossRef Full Text | Google Scholar

Cragnolini, T., Sahota, H., Joseph, A. P., Sweeney, A., Malhotra, S., Vasishtan, D., et al. (2021). TEMPy2: a Python library with improved 3D electron microscopy density-fitting and validation workflows. Acta Crystallogr. Sect. D. 77, 41–47. doi:10.1107/S2059798320014928

PubMed Abstract | CrossRef Full Text | Google Scholar

Cretin, G., Galochkina, T., Vander Meersche, Y., de Brevern, A. G., Postic, G., and Gelly, J. C. (2022). SWORD2: hierarchical analysis of protein 3D structures. Nucleic Acids Res. 50, W732–W738. doi:10.1093/nar/gkac370

PubMed Abstract | CrossRef Full Text | Google Scholar

Croll, T. I., Diederichs, K., Fischer, F., Fyfe, C. D., Gao, Y., Horrell, S., et al. (2021). Making the invisible enemy visible. Nat. Struct. and Mol. Biol. 28, 404–408. doi:10.1038/s41594-021-00593-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, X., Wu, L., Yoo, S., and Liu, Q. (2023). Integrating AlphaFold and deep learning for atomistic interpretation of cryo-EM maps. Briefings Bioinforma. 24, bbad405. doi:10.1093/bib/bbad405

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, M., Dong, Z., Fu, W., Xu, K., and Zhang, Q. C. (2025). CryoDomain: Sequence-free protein domain identification from low-resolution Cryo-EM density maps. Proc. AAAI Conf. Artif. Intell. 39, 119–127. doi:10.1609/aaai.v39i1.31987

CrossRef Full Text | Google Scholar

Davis, I. W., Leaver-Fay, A., Chen, V. B., Block, J. N., Kapral, G. J., Wang, X., et al. (2007). MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 35, W375–W383. doi:10.1093/nar/gkm216

PubMed Abstract | CrossRef Full Text | Google Scholar

de la Cruz, M. J., and Eng, E. T. (2023). Scaling up cryo-EM for biology and chemistry: the journey from niche technology to mainstream method. Structure 31, 1487–1498. doi:10.1016/j.str.2023.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

de Oliveira, T. M., van Beek, L., Shilliday, F., Debreczeni, J. É., and Phillips, C. (2021). Cryo-EM: the resolution revolution and drug discovery. SLAS Discov. 26, 17–31. doi:10.1177/2472555220960401

PubMed Abstract | CrossRef Full Text | Google Scholar

DeLano, W. L., and Lam, J. W. (2005). “PyMOL: a communications tool for computational models,” 1155. American Chemical Society.

Google Scholar

DiIorio, M. C., and Kulczyk, A. W. (2023). Novel artificial intelligence-based approaches for ab initio structure determination and atomic model building for cryo-electron microscopy. Micromachines 14, 1674. doi:10.3390/mi14091674

PubMed Abstract | CrossRef Full Text | Google Scholar

DiMaio, F., Leaver-Fay, A., Bradley, P., Baker, D., and André, I. (2011). Modeling symmetric macromolecular structures in Rosetta3. PLoS One 6, e20450. doi:10.1371/journal.pone.0020450

PubMed Abstract | CrossRef Full Text | Google Scholar

Dosovitskiy, A. (2020). An image is worth 16x16 words: transformers for image recognition at Scale.

Google Scholar

Eddy, S. R. (2011). Accelerated profile HMM searches. PLOS Comput. Biol. 7, e1002195. doi:10.1371/journal.pcbi.1002195

PubMed Abstract | CrossRef Full Text | Google Scholar

Emsley, P., and Cowtan, K. (2004). Coot: model-building tools for molecular graphics. Acta Crystallogr. Sect. D. 60, 2126–2132. doi:10.1107/S0907444904019158

PubMed Abstract | CrossRef Full Text | Google Scholar

Ester, M. (1996). “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Proceedings of the second international conference on knowledge discovery and data mining. Washington, DC: AAAI Press (Association for the Advancement of Artificial Intelligence).

Google Scholar

European Organization for Nuclear Research (2013). OpenAIRE. Zenodo.

Google Scholar

Evans, R., O’Neill, M., Pritzel, A., Antropova, N., Senior, A., Green, T., et al. (2022). Protein complex prediction with AlphaFold-Multimer. bioRxiv 2021. doi:10.1101/2021.10.04.463034

CrossRef Full Text | Google Scholar

Farheen, F., Terashi, G., Zhu, H., and Kihara, D. (2025). AI-based methods for biomolecular structure modeling for Cryo-EM. Curr. Opin. Struct. Biol. 90, 102989. doi:10.1016/j.sbi.2025.102989

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, M.-F., Chen, Y. X., and Shen, H. B. (2024). DeepQs: local quality assessment of cryo-EM density map by deep learning map-model fit score. J. Struct. Biol. 216, 108059. doi:10.1016/j.jsb.2023.108059

PubMed Abstract | CrossRef Full Text | Google Scholar

Forney, G. D. (1973). The viterbi algorithm. Proc. IEEE 61, 268–278. doi:10.1109/PROC.1973.9030

CrossRef Full Text | Google Scholar

Garcia Condado, J., Muñoz-Barrutia, A., and Sorzano, C. O. S. (2022). Automatic determination of the handedness of single-particle maps of macromolecules solved by CryoEM. J. Struct. Biol. 214, 107915. doi:10.1016/j.jsb.2022.107915

PubMed Abstract | CrossRef Full Text | Google Scholar

Giri, N., and Cheng, J. (2024). De novo atomic protein structure modeling for cryoEM density maps using 3D transformer and HMM. Nat. Commun. 15, 5511. doi:10.1038/s41467-024-49647-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Giri, N., and Cheng, J. (2025). Atomic protein structure modeling from Cryo-EM using multi-modal deep learning and AlphaFold3. bioRxiv 2025. doi:10.1101/2025.03.16.643561

CrossRef Full Text | Google Scholar

Giri, N., Roy, R. S., and Cheng, J. (2023). Deep learning for reconstructing protein structures from cryo-EM density maps: recent advances and future directions. Curr. Opin. Struct. Biol. 79, 102536. doi:10.1016/j.sbi.2023.102536

PubMed Abstract | CrossRef Full Text | Google Scholar

Giri, N., Wang, L., and Cheng, J. (2024). Cryo2StructData: a large labeled Cryo-EM density map dataset for AI-based modeling of protein structures. Sci. Data 11, 458. doi:10.1038/s41597-024-03299-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Glover, F. (1986). Future paths for integer programming and links to artificial intelligence. Comput. and Operations Res. 13, 533–549. doi:10.1016/0305-0548(86)90048-1

CrossRef Full Text | Google Scholar

Gotoh, O. (1982). An improved algorithm for matching biological sequences. J. Mol. Biol. 162, 705–708. doi:10.1016/0022-2836(82)90398-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Graves, A. (2013). Generating sequences with recurrent neural networks.

Google Scholar

Greener, J. G., Kandathil, S. M., Moffat, L., and Jones, D. T. (2022). A guide to machine learning for biologists. Nat. Rev. Mol. Cell Biol. 23, 40–55. doi:10.1038/s41580-021-00407-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Gyawali, R., Dhakal, A., and Cheng, J. (2025). Multimodal deep learning integration of cryo-EM and AlphaFold3 for high-accuracy protein structure determination. bioRxiv 2025, 2025.07.03.663071. doi:10.1101/2025.07.03.663071

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, X., Terashi, G., Christoffer, C., Chen, S., and Kihara, D. (2021). VESPER: global and local cryo-EM map alignment using local density vectors. Nat. Commun. 12, 2090. doi:10.1038/s41467-021-22401-y

PubMed Abstract | CrossRef Full Text | Google Scholar

He, K. (2016). “Deep residual learning for image recognition,” in 2016 IEEE conference on computer vision and pattern recognition (CVPR).

Google Scholar

He, J., and Huang, S.-Y. (2021a). EMNUSS: a deep learning framework for secondary structure annotation in cryo-EM maps. Briefings Bioinforma. 22, bbab156. doi:10.1093/bib/bbab156

PubMed Abstract | CrossRef Full Text | Google Scholar

He, J., and Huang, S.-Y. (2021b). Full-length de novo protein structure determination from cryo-EM maps using deep learning. Bioinformatics 37, 3480–3490. doi:10.1093/bioinformatics/btab357

PubMed Abstract | CrossRef Full Text | Google Scholar

He, J., Lin, P., Chen, J., Cao, H., and Huang, S. Y. (2022). Model building of protein complexes from intermediate-resolution cryo-EM maps with deep learning-guided automatic assembly. Nat. Commun. 13, 4066. doi:10.1038/s41467-022-31748-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Heinig, M., and Frishman, D. (2004). STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 32, W500–W502. doi:10.1093/nar/gkh429

PubMed Abstract | CrossRef Full Text | Google Scholar

Helsgaun, K. (2000). An effective implementation of the Lin–Kernighan traveling salesman heuristic. Eur. J. Operational Res. 126, 106–130. doi:10.1016/S0377-2217(99)00284-2

CrossRef Full Text | Google Scholar

Helsgaun, K. (2017). An extension of the lin-kernighan-helsgaun TSP solver for constrained traveling salesman and vehicle routing problems.

Google Scholar

Ho, J. (2020). “Denoising diffusion probabilistic models,” in Proceedings of the 34th international conference on neural information processing systems. Red Hook, NY: Curran Associates Inc.

Google Scholar

Hochreiter, S., and Schmidhuber, J. (1997). Long short-term memory. Neural Comput. 9, 1735–1780. doi:10.1162/neco.1997.9.8.1735

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, H. (2020). “UNet 3+: a full-scale connected UNet for medical image segmentation,” in Icassp 2020 - 2020 IEEE international conference on acoustics, speech and signal processing (ICASSP).

Google Scholar

Huang, X., Pearce, R., and Zhang, Y. (2020). FASPR: an open-source tool for fast and accurate protein side-chain packing. Bioinformatics 36, 3758–3765. doi:10.1093/bioinformatics/btaa234

PubMed Abstract | CrossRef Full Text | Google Scholar

Ibtehaz, N., and Rahman, M. S. (2020). MultiResUNet: rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw. 121, 74–87. doi:10.1016/j.neunet.2019.08.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Istrate, A., Wang, Z., Murshudov, G. N., Patwardhan, A., and Kleywegt, G. J. (2021). 3D-Strudel - a novel model-dependent map-feature validation method for high-resolution cryo-EM structures. bioRxiv 2021.12.16.472999. doi:10.1101/2021.12.16.472999

CrossRef Full Text | Google Scholar

Jamali, K. (2022). “A graph neural network approach to automated model building in Cryo-EM maps,” in International conference on learning representations.

Google Scholar

Jamali, K., Käll, L., Zhang, R., Brown, A., Kimanius, D., and Scheres, S. H. W. (2024). Automated model building and protein identification in cryo-EM maps. Nature 628, 450–457. doi:10.1038/s41586-024-07215-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Joseph, A. P., Olek, M., Malhotra, S., Zhang, P., Cowtan, K., Burnley, T., et al. (2022). Atomic model validation using the CCP-EM software suite. Acta Crystallogr. Sect. D. 78, 152–161. doi:10.1107/S205979832101278X

PubMed Abstract | CrossRef Full Text | Google Scholar

Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. doi:10.1038/s41586-021-03819-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Kabsch, W. (1976). A solution for the best rotation to relate two sets of vectors. Acta Crystallogr. Sect. A 32, 922–923. doi:10.1107/S0567739476001873

CrossRef Full Text | Google Scholar

Karolczak, J., Przybyłowska, A., Szewczyk, K., Taisner, W., Heumann, J. M., Stowell, M. H. B., et al. (2024). Ligand identification in CryoEM and X-ray maps using deep learning. Bioinformatics 41, btae749. doi:10.1093/bioinformatics/btae749

PubMed Abstract | CrossRef Full Text | Google Scholar

Krivov, G. G., Shapovalov, M. V., and Dunbrack, R. L., Jr (2009a). Improved prediction of protein side-chain conformations with SCWRL4. Proteins Struct. Funct. Bioinforma. 77, 778–795. doi:10.1002/prot.22488

PubMed Abstract | CrossRef Full Text | Google Scholar

Krivov, G. G., Shapovalov, M. V., and Dunbrack, R. L., Jr (2009b). Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77, 778–795. doi:10.1002/prot.22488

PubMed Abstract | CrossRef Full Text | Google Scholar

Kühlbrandt, W. (2014). Biochemistry. The resolution revolution. Science 343, 1443–1444. doi:10.1126/science.1251652

PubMed Abstract | CrossRef Full Text | Google Scholar

Kyrilis, F. L., Low, J. K. K., Mackay, J. P., and Kastritis, P. L. (2024). Structural biology in cellulo: minding the gap between conceptualization and realization. Curr. Opin. Struct. Biol. 87, 102843. doi:10.1016/j.sbi.2024.102843

PubMed Abstract | CrossRef Full Text | Google Scholar

Lawson, C. L., Patwardhan, A., Baker, M. L., Hryc, C., Garcia, E. S., Hudson, B. P., et al. (2016). EMDataBank unified data resource for 3DEM. Nucleic Acids Res. 44, D396–D403. doi:10.1093/nar/gkv1126

PubMed Abstract | CrossRef Full Text | Google Scholar

LeCun, Y., and Bengio, Y. (1998). “Convolutional networks for images, speech, and time series,” in The handbook of brain theory and neural networks (MIT Press), 255–258.

Google Scholar

LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learning. Nature 521, 436–444. doi:10.1038/nature14539

PubMed Abstract | CrossRef Full Text | Google Scholar

LeNail, A. (2019). NN-SVG: Publication-Ready neural network Architecture schematics. J. Open Source Softw. 4, 747. doi:10.21105/joss.00747

CrossRef Full Text | Google Scholar

Li, P.-N. (2020). “Sequence-guided protein structure determination using graph convolutional and recurrent networks,” in 2020 IEEE 20th international conference on bioinformatics and bioengineering (BIBE). IEEE Computer Society.

Google Scholar

Li, J., and Chen, S.-J. (2025). DeepCryoRNA: deep learning-based RNA structure reconstruction from cryo-EM maps. bioRxiv 2025.04.05.647396. doi:10.1101/2025.04.05.647396

CrossRef Full Text | Google Scholar

Li, R., Si, D., Zeng, T., Ji, S., and He, J. (2016). Deep convolutional neural networks for detecting secondary structures in protein density maps from cryo-electron microscopy. IEEE Int. Conf. Bioinforma. Biomed. (BIBM) 2016, 41–46. doi:10.1109/BIBM.2016.7822490

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Hu, J., Zhang, C., Yu, D. J., and Zhang, Y. (2019). ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks. Bioinformatics 35, 4647–4655. doi:10.1093/bioinformatics/btz291

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Z., Jaroszewski, L., Iyer, M., Sedova, M., and Godzik, A. (2020). FATCAT 2.0: towards a better understanding of the structural diversity of proteins. Nucleic Acids Res. 48, W60–W64. doi:10.1093/nar/gkaa443

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, P., Guo, L., Liu, H., Liu, B., Meng, F., Ni, X., et al. (2023). An end-to-end approach for protein folding by integrating Cryo-EM maps and sequence evolution. bioRxiv 2023, 565403. doi:10.1101/2023.11.02.565403

CrossRef Full Text | Google Scholar

Li, T., Cao, H., He, J., and Huang, S. Y. (2024). Automated detection and de novo structure modeling of nucleic acids from cryo-EM maps. Nat. Commun. 15, 9367. doi:10.1038/s41467-024-53721-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, S., Terashi, G., Zhang, Z., and Kihara, D. (2025a). Advancing structure modeling from cryo-EM maps with deep learning. Biochem. Soc. Trans. 53, 259–265. doi:10.1042/bst20240784

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, T., He, J., Cao, H., Zhang, Y., Chen, J., Xiao, Y., et al. (2025b). All-atom RNA structure determination from cryo-EM maps. Nat. Biotechnol. 43, 97–105. doi:10.1038/s41587-024-02149-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, T. Y. (2017). “Feature Pyramid networks for object detection,” in 2017 IEEE conference on computer vision and pattern recognition (CVPR).

Google Scholar

Lin, Z., Akin, H., Rao, R., Hie, B., Zhu, Z., Lu, W., et al. (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379, 1123–1130. doi:10.1126/science.ade2574

PubMed Abstract | CrossRef Full Text | Google Scholar

Lindert, S., Staritzbichler, R., Wötzel, N., Karakaş, M., Stewart, P. L., and Meiler, J. (2009). EM-Fold: de novo Folding of α-Helical Proteins Guided by Intermediate-Resolution Electron Microscopy Density Maps. Structure 17, 990–1003. doi:10.1016/j.str.2009.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Ling, X. (2025). Transformers in Protein: a Survey.

Google Scholar

Lingyu, M., Reisert, M., and Burkhardt, H. (2012). RENNSH: a novel α-helix identification approach for intermediate resolution electron density maps. IEEE/ACM Trans. Comput. Biol. Bioinforma. 9, 228–239. doi:10.1109/TCBB.2011.52

CrossRef Full Text | Google Scholar

Liu, D. C., and Nocedal, J. (1989). On the limited memory BFGS method for large scale optimization. Math. Program. 45, 503–528. doi:10.1007/BF01589116

CrossRef Full Text | Google Scholar

Ma, X., and Si, D. (2025). Beyond current boundaries: integrating deep learning and AlphaFold for enhanced protein structure prediction from low-resolution cryo-EM maps. Comput. Biol. Chem. 119, 108494. doi:10.1016/j.compbiolchem.2025.108494

PubMed Abstract | CrossRef Full Text | Google Scholar

Maddhuri Venkata Subramaniya, S. R., Terashi, G., and Kihara, D. (2019). Protein secondary structure detection in intermediate-resolution cryo-EM maps using deep learning. Nat. Methods 16, 911–917. doi:10.1038/s41592-019-0500-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Mallet, V., Rapisarda, C., Minoux, H., and Ovsjanikov, M. (2025). Finding antibodies in cryo-EM maps with CrAI. Bioinformatics 41, btaf157. doi:10.1093/bioinformatics/btaf157

PubMed Abstract | CrossRef Full Text | Google Scholar

Marques, M. A., Purdy, M. D., and Yeager, M. (2019). CryoEM maps are full of potential. Curr. Opin. Struct. Biol. 58, 214–223. doi:10.1016/j.sbi.2019.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Matsumoto, S., Ishida, S., Araki, M., Kato, T., Terayama, K., and Okuno, Y. (2021). Extraction of protein dynamics information from cryo-EM maps using deep learning. Nat. Mach. Intell. 3, 153–160. doi:10.1038/s42256-020-00290-y

CrossRef Full Text | Google Scholar

McGreevy, R., Teo, I., Singharoy, A., and Schulten, K. (2016). Advances in the molecular dynamics flexible fitting method for cryo-EM modeling. Methods 100, 50–60. doi:10.1016/j.ymeth.2016.01.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, E. C., Pettersen, E. F., Couch, G. S., Huang, C. C., and Ferrin, T. E. (2006). Tools for integrated sequence-structure analysis with UCSF Chimera. BMC Bioinforma. 7, 339. doi:10.1186/1471-2105-7-339

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, E. C., Goddard, T. D., Pettersen, E. F., Couch, G. S., Pearson, Z. J., Morris, J. H., et al. (2023). UCSF ChimeraX: tools for structure building and analysis. Protein Sci. 32, e4792. doi:10.1002/pro.4792

PubMed Abstract | CrossRef Full Text | Google Scholar

Mondal, D., Kumar, V., Satler, T., Ramachandran, R., Saltzberg, D., Chemmama, I., et al. (2025). Recognizing amino acid sidechains in a medium-resolution cryo-electron density map. Protein Sci. 34, e70217. doi:10.1002/pro.70217

PubMed Abstract | CrossRef Full Text | Google Scholar

Mostosi, P., Schindelin, H., Kollmannsberger, P., and Thorn, A. (2020). Haruspex: a neural network for the automatic identification of oligonucleotides and protein secondary structure in Cryo-Electron microscopy maps. Angew. Chem. Int. Ed. 59, 14788–14795. doi:10.1002/anie.202000421

PubMed Abstract | CrossRef Full Text | Google Scholar

Mu, Y., Sazzed, S., Alshammari, M., Sun, J., and He, J. (2021). A tool for segmentation of secondary structures in 3D Cryo-EM density map components using deep convolutional neural networks. Front. Bioinforma. 1, 710119–712021. doi:10.3389/fbinf.2021.710119

PubMed Abstract | CrossRef Full Text | Google Scholar

Müller, L. (2023). Attending to graph transformers.

Google Scholar

Nakamura, A., Meng, H., Zhao, M., Wang, F., Hou, J., Cao, R., et al. (2023). Fast and automated protein-DNA/RNA macromolecular complex modeling from cryo-EM maps. Briefings Bioinforma. 24, bbac632. doi:10.1093/bib/bbac632

PubMed Abstract | CrossRef Full Text | Google Scholar

Nawrocki, E. P., and Eddy, S. R. (2013). Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935. doi:10.1093/bioinformatics/btt509

PubMed Abstract | CrossRef Full Text | Google Scholar

Nogales, E. (2016). The development of cryo-EM into a mainstream structural biology technique. Nat. Methods 13, 24–27. doi:10.1038/nmeth.3694

PubMed Abstract | CrossRef Full Text | Google Scholar

Paszke, A. (2019). “PyTorch: an imperative style, high-performance deep learning library,” in Proceedings of the 33rd international conference on neural information processing systems (Red Hook, NY: Curran Associates Inc).

Google Scholar

Perera, S., Navard, P., and Yilmaz, A. (2024). SegFormer3D: an efficient transformer for 3D medical image segmentation, 4981–4988. doi:10.1109/cvprw63382.2024.00503

CrossRef Full Text | Google Scholar

Perron, L. (2011). “Operations research and constraint programming at Google,” in Principles and Practice of Constraint Programming – CP 2011. Editor J. Lee (Berlin Heidelberg: Springer).

CrossRef Full Text | Google Scholar

Perry, Z. R., Pyle, A. M., and Zhang, C. (2023). Arena: rapid and accurate reconstruction of full atomic RNA structures from coarse-grained models. J. Mol. Biol. 435, 168210. doi:10.1016/j.jmb.2023.168210

PubMed Abstract | CrossRef Full Text | Google Scholar

Petrey, D., Xiang, Z., Tang, C. L., Xie, L., Gimpelev, M., Mitros, T., et al. (2003). Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling. Proteins Struct. Funct. Bioinforma. 53, 430–435. doi:10.1002/prot.10550

PubMed Abstract | CrossRef Full Text | Google Scholar

Pettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C., et al. (2004). UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612. doi:10.1002/jcc.20084

PubMed Abstract | CrossRef Full Text | Google Scholar

Pfab, J., and Si, D. (2020). Automated threshold selection for Cryo-EM density maps. bioRxiv, 657395. doi:10.1101/657395

CrossRef Full Text | Google Scholar

Pfab, J., Phan, N. M., and Si, D. (2021). DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes. Proc. Natl. Acad. Sci. 118, e2017525118. doi:10.1073/pnas.2017525118

PubMed Abstract | CrossRef Full Text | Google Scholar

Pintilie, G. (2023). Diagnosing and treating issues in cryo-EM map-derived models. Structure 31, 759–761. doi:10.1016/j.str.2023.06.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Pintilie, G. D., Zhang, J., Goddard, T. D., Chiu, W., and Gossard, D. C. (2010). Quantitative analysis of cryo-EM density map segmentation by watershed and scale-space filtering, and fitting of structures by alignment to regions. J. Struct. Biol. 170, 427–438. doi:10.1016/j.jsb.2010.03.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Pintilie, G., Zhang, K., Su, Z., Li, S., Schmid, M. F., and Chiu, W. (2020). Measurement of atom resolvability in cryo-EM maps with Q-scores. Nat. Methods 17, 328–334. doi:10.1038/s41592-020-0731-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Postic, G., Ghouzam, Y., Chebrek, R., and Gelly, J. C. (2017). An ambiguity principle for assigning protein structural domains. Sci. Adv. 3, e1600552. doi:10.1126/sciadv.1600552

PubMed Abstract | CrossRef Full Text | Google Scholar

Prisant, M. G., Williams, C. J., Chen, V. B., Richardson, J. S., and Richardson, D. C. (2020). New tools in MolProbity validation: CaBLAM for CryoEM backbone, UnDowser to rethink “waters,” and NGL Viewer to recapture online 3D graphics. Protein Sci. 29, 315–329. doi:10.1002/pro.3786

PubMed Abstract | CrossRef Full Text | Google Scholar

Psaraftis, H. N. (1988). “Dynamic vehicle routing problems,” in Vehicle routing: methods and studies (North-Holland), 223–248.

Google Scholar

Rabiner, L., and Juang, B. (1986). An introduction to hidden Markov models. IEEE ASSP Mag. 3, 4–16. doi:10.1109/MASSP.1986.1165342

CrossRef Full Text | Google Scholar

Raghu, R. (2025). Multiscale guidance of AlphaFold3 with heterogeneous cryo-EM data. arXiv abs/2506.04490. doi:10.48550/arXiv.2506.04490

CrossRef Full Text | Google Scholar

Ramírez-Aportela, E., Maluenda, D., Fonseca, Y. C., Conesa, P., Marabini, R., Heymann, J. B., et al. (2021). FSC-Q: a CryoEM map-to-atomic model quality validation based on the local Fourier shell correlation. Nat. Commun. 12, 42. doi:10.1038/s41467-020-20295-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Reggiano, G., Lugmayr, W., Farrell, D., Marlovits, T. C., and DiMaio, F. (2023). Residue-level error detection in cryoelectron microscopy models. Structure 31, 860–869.e4. doi:10.1016/j.str.2023.05.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Riley, B. T., Wankowicz, S. A., de Oliveira, S. H. P., van Zundert, G. C. P., Hogan, D. W., Fraser, J. S., et al. (2021). qFit 3: protein and ligand multiconformer modeling for X-ray crystallographic and single-particle cryo-EM density maps. Protein Sci. 30, 270–285. doi:10.1002/pro.4001

PubMed Abstract | CrossRef Full Text | Google Scholar

Rives, A., Meier, J., Sercu, T., Goyal, S., Lin, Z., Liu, J., et al. (2021). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. U.S.A. 118, e2016239118. doi:10.1073/pnas.2016239118

PubMed Abstract | CrossRef Full Text | Google Scholar

Ronneberger, O. (2015). “U-Net: Convolutional networks for biomedical image segmentation,” in Medical image computing and computer-assisted intervention – MICCAI 2015. Editor N. Navab (Springer International Publishing).

CrossRef Full Text | Google Scholar

Rossi, F. (2006). Handbook of constraint programming. Elsevier Science Inc.

Google Scholar

Rotkiewicz, P., and Skolnick, J. (2008). Fast procedure for reconstruction of full-atom protein models from reduced representations. J. Comput. Chem. 29, 1460–1465. doi:10.1002/jcc.20906

PubMed Abstract | CrossRef Full Text | Google Scholar

Rozanov, M., and Wolfson, H. J. (2018). AAnchor: CNN guided detection of anchor amino acids in high resolution cryo-EM density maps. IEEE Int. Conf. Bioinforma. Biomed. (BIBM), 88–91. doi:10.1109/bibm.2018.8621288

CrossRef Full Text | Google Scholar

Rozanov, M., and Wolfson, H. J. (2023). SegmA: Residue Segmentation of cryo-EM density maps. IEEE Int. Conf. Bioinforma. Biomed. (BIBM), 2191–2196. doi:10.1109/bibm58861.2023.10385980

CrossRef Full Text | Google Scholar

Sazzed, S. (2024). Determining protein secondary structures in heterogeneous medium-resolution Cryo-EM images using CryoSSESeg. ACS Omega 9, 26409–26416. doi:10.1021/acsomega.4c02608

PubMed Abstract | CrossRef Full Text | Google Scholar

Schlitter, J., Engels, M., and Krüger, P. (1994). Targeted molecular dynamics: a new approach for searching pathways of conformational transitions. J. Mol. Graph. 12, 84–89. doi:10.1016/0263-7855(94)80072-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, Y. (2018). “M-Walk: learning to walk over graphs using Monte Carlo tree search,” in Proceedings of the 32nd international conference on neural information processing systems. Red Hook, NY: Curran Associates Inc.

Google Scholar

Shindyalov, I. N., and Bourne, P. E. (1998). Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11, 739–747. doi:10.1093/protein/11.9.739

PubMed Abstract | CrossRef Full Text | Google Scholar

Si, D., Ji, S., Nasr, K. A., and He, J. (2012). A machine learning approach for the identification of protein secondary structure elements from Electron cryo-microscopy density maps. Biopolymers 97, 698–708. doi:10.1002/bip.22063

PubMed Abstract | CrossRef Full Text | Google Scholar

Si, D., Moritz, S. A., Pfab, J., Hou, J., Cao, R., Wang, L., et al. (2020). Deep learning to predict protein backbone structure from high-resolution Cryo-EM density maps. Sci. Rep. 10, 4282. doi:10.1038/s41598-020-60598-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Si, D., Nakamura, A., Tang, R., Guan, H., Hou, J., Firozi, A., et al. (2022). Artificial intelligence advances for de novo molecular structure modeling in cryo-electron microscopy. WIREs Comput. Mol. Sci. 12, e1542. doi:10.1002/wcms.1542

CrossRef Full Text | Google Scholar

Si, D., Chen, J., Nakamura, A., Chang, L., and Guan, H. (2023). Smart de novo Macromolecular Structure Modeling from Cryo-EM Maps. J. Mol. Biol. 435, 167967. doi:10.1016/j.jmb.2023.167967

PubMed Abstract | CrossRef Full Text | Google Scholar

Skrodzki, M. (2019). The k-d tree data structure and a proof for neighborhood computation in expected logarithmic time. ArXiv abs/1903.04936. doi:10.48550/arXiv.1903.04936

CrossRef Full Text | Google Scholar

Smith, T. F., and Waterman, M. S. (1981). Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197. doi:10.1016/0022-2836(81)90087-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, Y., DiMaio, F., Wang, R. Y. R., Kim, D., Miles, C., Brunette, T., et al. (2013). High-Resolution comparative modeling with RosettaCM. Structure 21, 1735–1742. doi:10.1016/j.str.2013.08.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Stasiewicz, J., Mukherjee, S., Nithin, C., and Bujnicki, J. M. (2019). QRNAS: software tool for refinement of nucleic acid structures. BMC Struct. Biol. 19, 5. doi:10.1186/s12900-019-0103-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Storn, R., and Price, K. V. (1997). Differential evolution – a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 11, 341–359. doi:10.1023/a:1008202821328

CrossRef Full Text | Google Scholar

Su, B., Huang, K., Peng, Z., Amunts, A., and Yang, J. (2025). Improved model building for cryo-EM maps using local attention and 3D rotary position embedding. bioRxiv 2024.11.13.623164. doi:10.1101/2024.11.13.623164

CrossRef Full Text | Google Scholar

Sutskever, I. (2014). “Sequence to sequence learning with neural networks,” in Proceedings of the 28th international conference on neural information processing systems - volume 2. MIT Press.

Google Scholar

Terashi, G., and Kihara, D. (2018). De novo main-chain modeling for EM maps using MAINMAST. Nat. Commun. 9, 1618. doi:10.1038/s41467-018-04053-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Terashi, G., Wang, X., Maddhuri Venkata Subramaniya, S. R., Tesmer, J. J. G., and Kihara, D. (2022). Residue-wise local quality estimation for protein models from cryo-EM maps. Nat. Methods 19, 1116–1125. doi:10.1038/s41592-022-01574-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Terashi, G., Wang, X., Prasad, D., Nakamura, T., and Kihara, D. (2024). DeepMainmast: integrated protocol of protein structure modeling for cryo-EM with deep learning and structure prediction. Nat. Methods 21, 122–131. doi:10.1038/s41592-023-02099-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Terashi, G., Wang, X., Zhang, Y., Zhu, H., and Kihara, D. (2025). DMcloud: macromolecular structure modeling with local structure fitting for medium to low resolution Cryo-EM maps. Microsc. Microanal. 31, ozaf048.1083–1083. doi:10.1093/mam/ozaf048.1083

CrossRef Full Text | Google Scholar

Terwilliger, T. C., Adams, P. D., Afonine, P. V., and Sobolev, O. V. (2018). A fully automatic method yielding initial models from high-resolution cryo-electron microscopy maps. Nat. Methods 15, 905–908. doi:10.1038/s41592-018-0173-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Trabuco, L. G., Villa, E., Schreiner, E., Harrison, C. B., and Schulten, K. (2009). Molecular dynamics flexible fitting: a practical guide to combine cryo-electron microscopy and X-ray crystallography. Methods 49, 174–180. doi:10.1016/j.ymeth.2009.04.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Vaswani, A. (2017). “Attention is all you need,” in Proceedings of the 31st international conference on neural information processing systems. Red Hook, NY: Curran Associates Inc.

Google Scholar

Veličković, P. (2023). Everything is connected: graph neural networks. Curr. Opin. Struct. Biol. 79, 102538. doi:10.1016/j.sbi.2023.102538

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., and Moore, P. B. (2017). On the interpretation of electron microscopic maps of biological macromolecules. Protein Sci. 26, 122–129. doi:10.1002/pro.3060

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Alnabati, E., Aderinwale, T. W., Maddhuri Venkata Subramaniya, S. R., Terashi, G., and Kihara, D. (2021). Detecting protein and DNA/RNA structures in cryo-EM maps of intermediate resolution using deep learning. Nat. Commun. 12, 2302. doi:10.1038/s41467-021-22577-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Terashi, G., and Kihara, D. (2023). CryoREAD: de novo structure modeling for nucleic acids in cryo-EM maps using deep learning. Nat. Methods 20, 1739–1747. doi:10.1038/s41592-023-02032-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Zhu, H., Terashi, G., Taluja, M., and Kihara, D. (2024). DiffModeler: large macromolecular structure modeling for cryo-EM maps using a diffusion model. Nat. Methods 21, 2307–2317. doi:10.1038/s41592-024-02479-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Tan, C., Gao, Z., Zhang, G., Zhang, Y., and Li, S. Z. (2025). End-to-end cryo-EM complex structure determination with high accuracy and ultra-fast speed. Nat. Mach. Intell. 7, 1091–1103. doi:10.1038/s42256-025-01056-0

CrossRef Full Text | Google Scholar

Wankowicz, S. A., Ravikumar, A., Sharma, S., Riley, B., Raju, A., Hogan, D. W., et al. (2024). Automated multiconformer model building for X-ray crystallography and cryo-EM. eLife 12, RP90606. doi:10.7554/eLife.90606

Wen, Z., He, J., and Huang, S. Y. (2020). Topology-independent and global protein structure alignment through an FFT-based algorithm. Bioinformatics 36, 478–486. doi:10.1093/bioinformatics/btz609

Wohlwend, J. (2025). Boltz-1: democratizing biomolecular interaction modeling. Cold Spring Harbor, NY: bioRxiv, 2024.11.19.624167. doi:10.1101/2024.11.19.624167

Wriggers, W. (2010). Using Situs for the integration of multi-resolution structures. Biophys. Rev. 2, 21–27. doi:10.1007/s12551-009-0026-3

Xiang, Z., and Honig, B. (2001). Extending the accuracy limits of prediction for side-chain conformations. J. Mol. Biol. 311, 421–430. doi:10.1006/jmbi.2001.4865

Xu, K., Wang, Z., Shi, J., Li, H., and Zhang, Q. C. (2019). A2-Net: molecular structure estimation from Cryo-EM density volumes. Proc. AAAI Conf. Artif. Intell. 33, 1230–1237. doi:10.1609/aaai.v33i01.33011230

Yang, J., Yan, R., Roy, A., Xu, D., Poisson, J., and Zhang, Y. (2015). The I-TASSER Suite: protein structure and function prediction. Nat. Methods 12, 7–8. doi:10.1038/nmeth.3213

Zardecki, C., Dutta, S., Goodsell, D. S., Voigt, M., and Burley, S. K. (2016). RCSB protein data bank: a resource for chemical, biochemical, and structural explorations of large and small biomolecules. J. Chem. Educ. 93, 569–575. doi:10.1021/acs.jchemed.5b00404

Zhang, Y., and Skolnick, J. (2004). Scoring function for automated assessment of protein structure template quality. Proteins Struct. Funct. Bioinforma. 57, 702–710. doi:10.1002/prot.20264

Zhang, J., Liang, Y., and Zhang, Y. (2011). Atomic-Level protein structure refinement using fragment-guided molecular dynamics conformation sampling. Structure 19, 1784–1795. doi:10.1016/j.str.2011.09.022

Zhang, X., Zhang, B., Freddolino, L., and Zhang, Y. (2022). CR-I-TASSER: assemble protein structures from cryo-EM density maps using deep convolutional neural networks. Nat. Methods 19, 195–204. doi:10.1038/s41592-021-01389-9

Zhang, K., Li, Y., Liang, J., Cao, J., Zhang, Y., Tang, H., et al. (2023). Practical blind image denoising via Swin-Conv-UNet and data synthesis. Mach. Intell. Res. 20, 822–836. doi:10.1007/s11633-023-1466-0

Zhang, Z., Cai, Y., Zhang, B., Zheng, W., Freddolino, L., Zhang, G., et al. (2024). DEMO-EM2: assembling protein complex structures from cryo-EM maps through intertwined chain and domain fitting. Briefings Bioinforma. 25, bbae113. doi:10.1093/bib/bbae113

Zhang, C., Condon, A., and Dao Duc, K. (2025a). A comprehensive survey and benchmark of deep learning-based methods for atomic model building from cryo-electron microscopy density maps. Briefings Bioinforma. 26, bbaf322. doi:10.1093/bib/bbaf322

Zhang, Z., Xu, L., Zhang, S., Peng, C., Zhang, G., and Zhou, X. (2025b). DEMO-EMol: modeling protein-nucleic acid complex structures from cryo-EM maps by coupling chain assembly with map segmentation. Nucleic Acids Res. 53, W228–W237. doi:10.1093/nar/gkaf416

Zheng, W., Zhang, C., Wuyun, Q., Pearce, R., Li, Y., and Zhang, Y. (2019). LOMETS2: improved meta-threading server for fold-recognition and structure-based function annotation for distant-homology proteins. Nucleic Acids Res. 47, W429–W436. doi:10.1093/nar/gkz384

Zheng, W., Zhou, X., Wuyun, Q., Pearce, R., Li, Y., and Zhang, Y. (2020). FUpred: detecting protein domains through deep-learning-based contact map prediction. Bioinformatics 36, 3749–3757. doi:10.1093/bioinformatics/btaa217

Zheng, W., Wuyun, Q., Li, Y., Liu, Q., Zhou, X., Peng, C., et al. (2025). Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER. Nat. Biotechnol. doi:10.1038/s41587-025-02654-4

Zhou, X. G., Peng, C. X., Liu, J., Zhang, Y., and Zhang, G. J. (2020). Underestimation-Assisted global-local cooperative differential evolution and the application to protein structure prediction. IEEE Trans. Evol. Comput. 24, 536–550. doi:10.1109/tevc.2019.2938531

Zhou, X., Li, Y., Zhang, C., Zheng, W., Zhang, G., and Zhang, Y. (2022). Progressive assembly of multi-domain protein structures from cryo-EM density maps. Nat. Comput. Sci. 2, 265–275. doi:10.1038/s43588-022-00232-1

Zhu, H., Terashi, G., Farheen, F., Nakamura, T., and Kihara, D. (2025). AI-based quality assessment methods for protein structure models from cryo-EM. Curr. Res. Struct. Biol. 9, 100164. doi:10.1016/j.crstbi.2025.100164

Keywords: drug-discovery, model building, structural biology, cryo-EM atomic models, deep neural network

Citation: Bansia H and des Georges A (2026) Connecting the dots: deep learning-based automated model building methods in cryo-EM. Front. Mol. Biosci. 12:1613399. doi: 10.3389/fmolb.2025.1613399

Received: 17 April 2025; Accepted: 28 October 2025;
Published: 11 February 2026.

Edited by:

Edward T. Eng, New York Structural Biology Center, United States

Reviewed by:

Yao Zhang, Michigan State University, United States
Dominique Stephens, James Madison University, United States

Copyright © 2026 Bansia and des Georges. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Harsh Bansia, hb2880@nyu.edu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.