Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Bioinform.

Sec. Integrative Bioinformatics

Volume 5 - 2025 | doi: 10.3389/fbinf.2025.1645785

Extracting a COVID-19 Signature from a Multi-Omic Dataset

Provisionally accepted
Baptiste  BauvinBaptiste Bauvin1*Thibaud  GodonThibaud Godon1*Guillaume  BachelotGuillaume Bachelot1Claudia  CarpentierClaudia Carpentier1Riikka  HuusariRiikka Huusari2Maxime  DeraspeMaxime Deraspe1Juho  RousuJuho Rousu2Caroline  QuachCaroline Quach3Jacques  CorbeilJacques Corbeil1
  • 1Laval University, Quebec, Canada
  • 2Aalto-yliopisto, Aalto, Finland
  • 3Universite de Montreal, Montreal, Canada

The final, formatted version of the article will be published soon.

We present a multi-omic signature for COVID-19 developed through a comprehensive Quebec initiative that established an extensive dataset of COVID-19 positive and negative patient samples.Moving beyond traditional symptomatic studies that rely on limited descriptors, our research integrates clinical, proteomic, and metabolomic data to classify COVID-19 status using thousands of features. Our multi-view machine learning approach extracts distinctive COVID-19 signatures from multi-omic data with remarkable effectiveness. By applying ensemble methods, we developed accurate and interpretable models for high-dimensional data-containing significantly more features than samples-achieving 89% ± 5% balanced accuracy. Through our novel feature relevance methodology, we identified condensed 12-and 50-feature signatures that enhanced classification accuracy by at least 3% compared to the original feature set. This approach successfully extracted and interpreted a robust multi-omic signature characterizing COVID-19positive individuals from a large, complex dataset, representing a significant advancement in COVID-19 biomarker discovery.

Keywords: machine learning, multi-omics, biomarker, COVID-19, Metabolomics, Proteomics, Signature

Received: 13 Jun 2025; Accepted: 13 Aug 2025.

Copyright: © 2025 Bauvin, Godon, Bachelot, Carpentier, Huusari, Deraspe, Rousu, Quach and Corbeil. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Baptiste Bauvin, Laval University, Quebec, Canada
Thibaud Godon, Laval University, Quebec, Canada

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.