ORIGINAL RESEARCH article
Front. Microbiol.
Sec. Virology
Volume 16 - 2025 | doi: 10.3389/fmicb.2025.1595180
Detection of viral contamination in cell lines using ViralCellDetector
Provisionally accepted- Michigan State University, East Lansing, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Cell lines are widely used in biomedical research to study various biological events, including gene expression, cancer progression, and drug responses. However, crosscontaminations with bacteria, mycoplasma, and viruses remains a persistent issue in experiments. Detection of bacteria and mycoplasma contaminations in cell lines is relatively easy, whereas identifying viral contamination in cell lines is challenging. To address this challenge, we developed a tool called ViralCellDetector that detects viral contamination(s) through mapping RNA-seq data to a comprehensive viral genome library. Using this tool, we observed that approximately 10% (110 samples) of the RNAseq experiments involving MCF7 cells were likely contaminated with viruses. Furthermore, to facilitate the detection of samples with unknown sources of viral contamination, we identified differentially expressed genes associated with viral infection across two different cell lines and used these genes to train a machine learning model. This model successfully classified infected and non-infected samples based on host gene expression, achieving an AUC of 0.91 and an accuracy of 0.93. Overall, our mapping and marker-based approaches can detect viral contaminations in any sample based on their RNA-seq data, allowing researchers to avoid the use of unintentionally infected cell lines in their studies.
Keywords: cell lines, Viral contamination, Bacterial contamination, Differentially expressed genes, RNA-seq data, random forest, and machine learning
Received: 17 Mar 2025; Accepted: 28 Jul 2025.
Copyright: © 2025 Shankar, Paithankar, Gupta and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Rama Shankar, Michigan State University, East Lansing, United States
Bin Chen, Michigan State University, East Lansing, United States
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.