Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol.

Sec. Microbes and Innate Immunity

Volume 15 - 2025 | doi: 10.3389/fcimb.2025.1591464

Machine Learning-Driven Discovery of NETs-Associated Diagnostic Biomarkers and Molecular Subtypes in Tuberculosis

Provisionally accepted
Shoupeng  DingShoupeng Ding1Yimei  YangYimei Yang2Chunxiao  HuangChunxiao Huang1Yuyang  ZhouYuyang Zhou3Zihan  CaiZihan Cai3*
  • 1Gutian County Hospital, Gutian, China
  • 2School of Basic Medical Sciences, Dali University, Dali, China
  • 3Siyang Hospital, Siyang, China

The final, formatted version of the article will be published soon.

Object: NETs constitute a pivotal mechanism in the pathogenesis and progression of TB. Despite their recognized importance, the genetic underpinnings of NETs in TB remain inadequately elucidated. Accordingly, the present study endeavors to delineate the molecular characteristics of NRGs in TB, with the objective of reliably identifying associated molecular clusters and biomarkers. Methods: Gene expression profiles were analyzed from integrated datasets retrieved from the GEO database. Differential analysis, WGCNA, and an ensemble of 113 machine learning algorithms were employed to identify the core NETs genes. Subsequently, TB patients were stratified into distinct subtypes based on the expression profiles of these core genes, and the differences in immune infiltration characteristics among the subtypes were systematically compared. Finally, RT-qPCR was utilized to validate the differential expression of the key NETs core genes. Results: Analysis of the integrated GSE83456 and GSE54992 datasets yielded 630 DEGs. WGCNA subsequently identified a module comprising 1,252 genes, from which 26 key NETs genes were extracted via intersection with known NRGs. Among the ensemble of 113 machine learning methods, the "StepgIm[both]+RF" algorithm demonstrated superior performance, ultimately identifying six core NETs genes. Consensus clustering based on the expression profiles of these core genes stratified patients into two distinct subtypes. Functional enrichment analysis further underscored the predominance of immune-related pathways in subtype B. Moreover, immune infiltration analysis revealed marked differences in immune cell composition between the subtypes, thereby confirming a close association between the core NETs genes and these immunological disparities. Conclusion: Core NETs genes are pivotal in the pathogenesis and progression of tuberculosis, and they hold significant promise as novel biomarkers for the early diagnosis and targeted treatment of TB.

Keywords: Tuberculosis, machine learning, neutrophil extracellular traps, Molecular subtypes, Mycobacterium tuberculosis

Received: 11 Mar 2025; Accepted: 16 Sep 2025.

Copyright: © 2025 Ding, Yang, Huang, Zhou and Cai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Zihan Cai, zihancai001@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.