ORIGINAL RESEARCH article

Front. Bioinform.

Sec. Genomic Analysis

Volume 5 - 2025 | doi: 10.3389/fbinf.2025.1632189

Long-Read Microbial Genome Assembly, Gene Prediction and Functional Annotation: A Service of the MIRRI ERIC Italian Node

Provisionally accepted
Sandro  Gepiro Gepiro ContaldoSandro Gepiro Gepiro Contaldo1Antonio  d'AciernoAntonio d'Acierno2Lorenzo  BosioLorenzo Bosio1Francesco  VeniceFrancesco Venice1Elisa  Li PerottinoElisa Li Perottino1Janneth  Estefania Hoyos ReaJanneth Estefania Hoyos Rea1Giovanna  Cristina VareseGiovanna Cristina Varese1Francesca  CorderoFrancesca Cordero1Marco  BeccutiMarco Beccuti1*
  • 1University of Turin, Turin, Italy
  • 2National Research Council (CNR), Roma, Lazio, Italy

The final, formatted version of the article will be published soon.

Background Understanding the structure and function of microbial genomes is crucial for uncovering their ecological roles, evolutionary trajectories, and potential applications in health, biotechnology, agriculture, food production, and environmental science. However, genome reconstruction and annotation remain computationally demanding and technically complex.We introduce a bioinformatics platform designed explicitly for long-read microbial sequencing data to address these challenges. Developed as a service of the Italian MIRRI ERIC node, the platform provides a comprehensive solution for analyzing both prokaryotic and eukaryotic genomes, from assembly to functional protein annotation. It integrates state-of-the-art tools (e.g., Canu, Flye, BRAKER3, Prokka, InterProScan) within a reproducible, scalable workflow built on the Common Workflow Language and accelerated through high-performance computing infrastructure. A user-friendly web interface ensures accessibility, even for non-specialists.Through case studies involving three environmentally and clinically significant microorganisms, we demonstrate the ability of the platform to produce reliable, biologically meaningful insights, positioning it as a valuable tool for routine genome analysis and advanced microbial research.

Keywords: genome assembly, functional annotation, gene prediction, HPC and Cloud Service, Reproducible analysis

Received: 20 May 2025; Accepted: 13 Jun 2025.

Copyright: © 2025 Gepiro Contaldo, d'Acierno, Bosio, Venice, Perottino, Hoyos Rea, Varese, Cordero and Beccuti. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Marco Beccuti, University of Turin, Turin, Italy

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.