Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol.

Sec. Clinical and Diagnostic Microbiology and Immunology

Volume 15 - 2025 | doi: 10.3389/fcimb.2025.1641385

Whole-genome Sequencing and Machine Learning Reveal Key Drivers of Delayed Sputum Conversion in Rifampicin-Resistant Tuberculosis

Provisionally accepted
Xiangchen  LiXiangchen Li1Qing  FangQing Fang2Yewei  LuYewei Lu1Junshun  GaoJunshun Gao1Yvette  WuYvette Wu3Yi  ChenYi Chen4Yang  CheYang Che4*
  • 1Cosmos Wisdom Mass Spectrometry Center of Zhejiang University Medical School, Hangzhou, China
  • 2The First Affiliated Hospital of Ningbo University, Ningbo, China
  • 3Fountain Valley High, Fountain Valley, United States
  • 4Ningbo Municipal Center for Disease Control and Prevention, Ningbo, China

The final, formatted version of the article will be published soon.

Rifampicin-resistant tuberculosis (RR-TB) remains a major global health challenge, with delayed sputum culture conversion (SCC) predicting poor treatment outcomes. This study integrated whole-genome sequencing (WGS) and machine learning to identify clinical and genomic determinants of SCC failure in 150 RR-TB patients (2019-2023). Phenotypic and genotypic analysis revealed high rates of isoniazid resistance (74.0%) and rpoB mutations (97.3%, predominantly Ser450Leu), with 90% of strains belonging to Lineage 2 (Beijing family). While 64.7% achieved 2-month SCC, 18.0% remained culture-positive at 6 months. Univariate analysis linked 2-month SCC failure to smear positivity, resistance to isoniazid, amikacin, capreomycin, and levofloxacin, and pre-XDR-TB status, though only smear positivity (aOR=2.41, P=0.008) and levofloxacin resistance (aOR=2.83, P=0.003) persisted as independent predictors in multivariable analysis. A Random Forest model achieved robust prediction of SCC failure (AUC: 0.86±0.06 at 2 months; 0.76±0.10 at 6 months), identifying levofloxacin resistance (feature importance: 6.37), embB_p.Met306Ile (5.94), and smear positivity (5.12) as top 2-month predictors, while katG_p.Ser315Thr (4.85) and gyrA_p.Asp94Gly (3.43) dominated 6-month predictions. These findings underscore smear positivity, levofloxacin resistance, and specific resistance mutations as critical drivers of SCC failure, guiding targeted RR-TB treatment strategies.

Keywords: Rifampicin-resistant tuberculosis, Whole-genome sequencing, sputum culture conversion, machine learning, Drug resistance mutations

Received: 05 Jun 2025; Accepted: 22 Jul 2025.

Copyright: © 2025 Li, Fang, Lu, Gao, Wu, Chen and Che. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yang Che, Ningbo Municipal Center for Disease Control and Prevention, Ningbo, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.