Your new experience awaits. Try the new design now and help us make it even better

BRIEF RESEARCH REPORT article

Front. Cell. Infect. Microbiol., 11 September 2025

Sec. Clinical Infectious Diseases

Volume 15 - 2025 | https://doi.org/10.3389/fcimb.2025.1646194

This article is part of the Research TopicExploring Clinical Application Scenarios of Metagenomic Next-Generation Sequencing for Pathogen DiagnosisView all 7 articles

Optimization of decision thresholds for Mycobacterium tuberculosis can effectively improve the performance of mNGS in tuberculosis diagnosis

Yuecui Li*&#x;Yuecui Li1*†Lili Zhang&#x;Lili Zhang1†Guannan Ma&#x;Guannan Ma2†Chenghang LiChenghang Li1Weiyue HuWeiyue Hu1Ruotong RenRuotong Ren3Yinghui ZangYinghui Zang1Dandan YingDandan Ying1Shuai QiuShuai Qiu1Shuyan JinShuyan Jin1Chunjie QiuChunjie Qiu1Xuefang Cao*Xuefang Cao3*
  • 1Department of Infectious Disease, The First People’s Hospital of Yongkang, Affiliated to Hangzhou Medical College, Jinhua, China
  • 2Zhejiang Key Laboratory of Digital Technology in Medical Diagnostics, Hangzhou, China
  • 3MatriDx Biotechnology Co., Ltd, Hangzhou, China

Background: Pulmonary tuberculosis (TB) diagnosis remains challenging due to limitations in traditional methods. This study aimed to optimize the metagenomic next-generation sequencing (mNGS) threshold for Mycobacterium tuberculosis complex (MTBC) detection and evaluate its efficacy compared to standard diagnostic approaches.

Methods: A total of 264 bronchoalveolar lavage fluid (BALF) samples were collected from patients with suspected pulmonary TB at Yongkang First People’s Hospital between January 2022 and June 2023. After excluding patients with incomplete data, 59 clinically confirmed TB patients and 111 with non-tuberculous conditions were enrolled. mNGS data were analyzed to calculate reads per million (RPM) for MTBC, and thresholds of 0.02, 0.05, and 0.10 RPM were evaluated for diagnostic efficacy using clinical diagnosis as the gold standard.

Results: The area under the receiver operating characteristic (ROC) curve (AUC) for mNGS in diagnosing TB at RPM thresholds of ≥0.02, ≥0.05, and ≥0.10 were 0.881, 0.873, and 0.814, respectively. The optimal detection threshold was found at RPM ≥ 0.05. Comparative analysis showed mNGS (AUC = 0.873) outperformed routine culture (0.718), PCR (0.741), and Xpert (0.763). Combining mNGS with these methods improved AUC values to 0.837, 0.868, and 0.873, respectively.

Conclusion: Optimizing the mNGS threshold to ≥0.05 significantly enhances MTBC detection in pulmonary TB. Combining mNGS with traditional methods further improves diagnostic efficacy, suggesting a potential role for mNGS in clinical TB management.

Introduction

Tuberculosis (TB), caused by the Mycobacterium tuberculosis complex (MTBC), remains a significant global public health challenge, characterized by high morbidity and mortality rates in humans (Huang et al., 2022). Although the incidence of tuberculosis has gradually declined over the past decade, with mortality rates dropping by nearly one-third, this positive trend was abruptly disrupted by the COVID-19 pandemic. In many regions, the pandemic led to a substantial reduction in tuberculosis testing and case notifications, contributing to an increase in mortality and reversing a decade of progress in global tuberculosis control (Dheda et al., 2022). It is estimated that 10.6 million people will be infected with tuberculosis globally in 2021, equivalent to 134 cases of tuberculosis per 100,000 people. The diagnosis and treatment of tuberculosis in clinical practice presents significant challenges and controversies (Ding et al., 2021). Numerous studies have revealed co-infection with various strains of Mycobacterium or other pathogenic species in patients with pulmonary tuberculosis (Huang et al., 2010; Shi et al., 2020). Given that rapid and accurate diagnosis is essential for effective treatment of tuberculosis, there is an urgent need for the development and implementation of rapid screening and diagnostic methods to enhance the management of TB.

Recently, metagenomic next-generation sequencing (mNGS) has rapidly developed as a complementary method widely used for diagnosing various infectious diseases in clinical setting. With its Shorter turnaround time, unbiased detection, and semi-quantitative value, mNGS can theoretically identify all pathogens present in a clinical sample, making it particularly suitable for diagnosing rare, novel, and atypical etiologies of complicated infectious diseases. This capability enables more precise guidance for targeted antimicrobial therapy (Chiu and Miller, 2019; Han et al., 2019). Extensive studies have demonstrated that mNGS is becoming an important complement to etiologic diagnostic workflows for TB patients due to its superior performance (Sun et al., 2021; Liu et al., 2022). However, significant challenges persist in assessing the efficacy of mNGS for MTBC detection. Variability in the genomic copy number of MTBC in patient samples, coupled with difference in mNGS methodologies and data analysis approaches employed exclusively in previous studies, complicates this assessment. Many studies have either focused on unique MTBC reads instead of employing a semi-quantitative metric, such as Specific reads per million reads (RPM), as a fundamental parameter or failed to define a specific RPM threshold for MTBC detection. As a result, these factors hinder the development of a comparable semi-quantitative assay and obstruct validated investigations regarding the efficacy of mNGS for TB evaluation in a large cohort.

In this study, we conducted a retrospective cohort study to analyze and determine the optimal RPM detection threshold of mNGS for the detection of MTBC using mNGS data from bronchoalveolar lavage fluid (BALF) samples of patients with clinically diagnosed TB and non-TB disease. Furthermore, the adjunctive diagnostic efficacy of threshold-optimized mNGS for MTBC was comprehensively evaluated by comparing it with clinically routine MTBC detection methods including culture, PCR and X-pert.

Methods

Study design and patients

The present study retrospectively enrolled patients with suspected pulmonary tuberculosis from January 2022 to June 2023 at Yongkang First People’s Hospital. Demographic information. Clinical symptoms, laboratory test results, imaging examination results, diagnosis and treatment history, and outcomes were collected from electronic medical records. Patients excluded were those who have incomplete clinical data, or those who without anyone of microbial results, such as culture, PCR or X-pert. Suspected tuberculosis (TB) included met any of the following criteria: 1) Symptoms of TB poisoning such as cough, fever, night sweats or weight loss or 2) Imaging features of tuberculosis.

Definitions of TB cases

The final clinical diagnostic criteria for pulmonary tuberculosis cases was positive for microbial confirmed. For patients without any microbial evidence, the physician has diagnosed tuberculosis by combining the imaging and clinical manifestations of the patients, with excluding other diseases and histopathological confirmed TB, together with the patient’s confirmed responsiveness to anti-TB treatment after one month of follow-up.

Clinicians ultimately diagnosed TB cases according to the clinical guidelines of the Ministry of Health of the People’s Republic of China. Defined TB cases should comply with any of the following criteria: (1) The microbiological results (including acid-fast staining smear and culture, X-pert, PCR, mNGS) were positive. (2) The pathological result of lung biopsy tissue is consistent with the pathological features of tuberculosis. (3) The lung lesions reduced or disappeared after three months of antituberculosis treatment. Otherwise, they are classified as non-TB cases.

Clinical confirmed pulmonary TB was defined as a patient who presented with typical chest imaging suggestive of TB and met the criteria (typical symptoms of pulmonary TB, positive interferon-gamma release assay (IGRA), or positive purified protein derivative (PPD) test (or tuberculin skin test (TST)), positive culture results, nucleic acid test or acid-fast bacilli (AFB) smear for Mycobacterium tuberculosis (MTB) detection in sputum or BALF.

The study was approved by the Ethics Committee of Yongkang First People’s Hospital and all data were anonymized prior to analysis. The study was conducted in accordance with the Declaration of Helsinki and the study data were obtained from Department of Infectious Diseases, Yongkang First People’s Hospital. Informed consent was obtained from all participants or their legal guardians.

Specimen collection and laboratory procedures

BALF samples were collected by experienced bronchoscopists through bronchoscopy based on canonical operational procedures (Meyer et al., 2012). Saline (60 mL~100 mL) was injected into the bronchial lumen of the segment and withdrawn after washing briefly. The qualified BALF (at least 25 mL) was divided into 5 parts, 4 of which were subjected to traditional test (AFB staining and culture), MTB PCR, and X-pert MTB/RIF assay, and the rest were stored in a sterile container at -20 °C, and sent to MatriDx Biotechnology for mNGS testing. BALF samples for PCR and X-pert MTB/RIF assay were processed following the manufacturer’s instructions.

mNGS procedure and analysis

mNGS workflow comprised four sequential steps: sample processing, nucleic acid extraction, library preparation, and bioinformatics analysis. To maximize inter-laboratory reproducibility, all wet-lab and in-silico procedures followed a locked Standard Operating Procedure (SOP) with explicit QC metrics. Basically, according to the manufacturer’s instructions, DNA was extracted from the supernatant obtained after tissue pretreatment using a magnetic bead-based method (Genskey Co., Ltd., China). Following quantification and normalization of DNA concentration, a DNA library was prepared following the protocol of the NEBNext Ultra II DNA Library Prep Kit for Illumina (New England Biolabs Inc.). This process included enzymatic fragmentation, end repair, barcode ligation, library purification, and PCR amplification. The resulting DNA library quality was evaluated using the Qubit dsDNA High Sensitivity (HS) Assay Kit (Thermo Fisher Scientific, USA). Libraries that met the quality criteria were sequenced on the MGISEQ-200RS platform (MGI, China) using the MGISEQ-200RS High-throughput Sequencing Reagent Kit (FCL SE50). To ensure data quality, raw sequencing reads were filtered using the fastp software to remove low-quality reads, adaptor contamination, duplicates, and reads shorter than 50 base pairs. Reads aligning to the human reference genome (hg38) were excluded. The remaining reads were normalized to 20 million per sample and subjected to multi-sequence alignment. Microbial identification was performed by aligning the filtered reads against an in-house pathogenic microorganism genome database. It’s worth noting that, before data analysis, microbes detected in clinical samples were first compared with those detected in the parallel NTC (no template control). Mycobacterium tuberculosis was considered positive when at least 1 read is mapped to MTBC (strictly mapped to the number of sequences at the genus level).

Statistical analysis

All collected data were statistically analyzed using R packages. Cutoff value for TB diagnosis was evaluated by receiver operating characteristic (ROC) curves. The area under curve (AUC), accuracy (ACC), sensitivity (SEN), specificity (SPE), positive predictive value (PPV), negative predictive value (NPV), percentage of positive accordance (PPA) and percentage of negative accordance (NPA) were calculated for routine test, PCR, X-pert, mNGS and their combinations using clinical diagnosis as the reference standard. The comparison of ROC curves was performed using the method of Delong. P-value < 0.05 was considered statistically significant in this study. Pairwise comparisons among indicators (including SEN, SPE, PPA, ACC, PPV, NPV and NPA) were conducted using the McNemar’s exact test for paired data. All tests were two-sided, and the significance level was set at P < 0.05.

We conducted a decision curve analysis (DCA) to assess the clinical utility of the mNGS threshold (RPM ≥0.05) for diagnosing tuberculosis (TB). The analysis was based on a dataset of 170 BALF samples, including 59 confirmed TB cases and 111 non-TB cases. We calibrated the raw RPM values using logistic regression to convert them into continuous probabilities (ranging from 0 to 1). We evaluated the net benefit of using the mNGS probability across a range of threshold probabilities (0% to 60%), comparing it to two strategies: “Treat All” (treat all patients) and “Treat None” (treat no patients). The RPM ≥0.05 threshold was specifically marked on the decision curve to visualize its position relative to the net benefit and extreme strategies.

Results

Optimization of mNGS threshold for MTBC detection

A total of 264 BALF samples were collected from patients with suspected pulmonary tuberculosis at Yongkang First People’s Hospital from January 2022 to June 2023 (Figure 1). After excluding patients with incomplete clinical data or lacking microbiological results (e.g., culture, PCR, or X-pert), we enrolled 59 patients with clinically confirmed tuberculosis (TB, n=59) and 111 patients with clinically confirmed non-tuberculous tuberculosis (Non-TB, n=111) were finally enrolled. All BALF samples from these 170 patients were subjected to mNGS.

Figure 1
Flowchart of tuberculosis (TB) diagnosis process. Start: 264 suspected TB cases. Excluded: 94 (various reasons). Analyzed: 170 cases. Results: 59 confirmed TB, 111 non-TB. Notes include cut-off values, ROC curves, and performance assessment of diagnosis methods.

Figure 1. Flowchart of patients enrolled in this study.

The mNGS-detected MTBC reads were normalized using the total mNGS data volume of the corresponding samples to obtain Specific reads per million reads (RPM) as the semi-quantitative baseline data for MTBC. Based on the distribution characteristics of RPM values in MTBC-positive samples, we selected thresholds of 0.02, 0.05 and 0.10 RPM for optimization. We pooled the mNGS data from MTBC-positive samples according to the aggregated mNGS dataset, and evaluated the diagnostic efficacy of mNGS for tuberculosis at each threshold, using clinical diagnosis of TB/non-TB as the gold standard. The results demonstrated that the AUC values for mNGS in adjunctive diagnosis of TB at the RPM thresholds of ≥0.02, ≥0.05 and ≥0.10 were 0.881, 0.873 and 0.814, respectively (Figure 2). Furthermore, t-test analysis indicated that the AUC values for RPM≥0.02 (P = 0.003) and RPM≥0.05 (P = 0.005) were significantly higher than that of RPM≥0.10 (Figure 2). This suggests that the optimal threshold for mNGS detection of MTBC should be set at 0.05 or lower. Notably, the AUC values for RPM≥0.02 and RPM≥0.05 were not significantly different (P = 0.317) (Figure 2). Considering the challenges associated with detecting the MTBC genome as an intracellular bacterium and the inherent sensitivity limitations of mNGS, RPM≥0.05 was finally selected as the threshold value in the subsequent comparative study on the performance of multi-methods in this study.

Figure 2
ROC curves compare different threshold criteria for sensitivity versus 1-specificity. Three curves represent mNGS_0.02, mNGS_0.05, and mNGS_0.10 with respective AUC values of 0.881, 0.873, and 0.814. The coordinates (1.000) on each curve differ in sensitivity values. A table shows p-values for pair comparisons: 0.317, 0.003, and 0.005.

Figure 2. The ROC curves of different threshold criteria.

Efficacy evaluation of the threshold-optimized mNGS in MTBC detection

As described above, the positive determination of MTBC detected by mNGS was initially performed using the threshold of RPM≥0.05. Samples were classified into MTBC positive and negative group, and corresponding samples were pooled for further analysis based on results obtained from routine culture, PCR and X-pert methods. Ultimately, we evaluated the methodological efficacy of these different diagnostic approaches and their combinations for the adjunctive diagnosis of MTBC, using clinical diagnoses of TB or non-TB as the gold standard. The comparative analysis of the efficacy of multiple methods for TB diagnosis, based on ROC curves, revealed that the AUC values of mNGS, routine culture, PCR and X-pert were 0.873, 0.718, 0.741 and 0.763, respectively (Figure 3A). The AUC for mNGS was significantly higher than those of routine culture, PCR and X-pert, with P-values of 4.05e-07, 5.41e-06 and 5.15e-05, respectively (Figure 3A). These findings suggest that threshold-optimized mNGS demonstrates superior efficacy in detecting MTBC compared to the routinely used clinical assays (routine culture, PCR and X-pert).

Figure 3
Panel A and B show ROC curves comparing different methods with sensitivity and specificity axes. Panel C and D display bar charts of AUC, ACC, SEN, SPE, PPV, NPV, PPA, and NPA metrics for various methods like Routine, PCR, Xpert, and mNGS. The charts illustrate performance comparisons with different color bars representing each method.

Figure 3. Efficacy evaluation of the threshold-optimised mNGS for MTBC detection. (A) ROC curves for mNGS compared to various traditional methods (routine culture, PCR and X-pert). (B) ROC curves for combined approaches (mNGS+routine culture, mNGS+PCR and mNGS+X-pert) compared to individual traditional methods and overall performace (All). (C) Comparison of area under curve (AUC), accuracy (ACC), sensitivity (SEN), specificity (SPE), positive predictive value (PPV), negative predictive value (NPV), percentage of positive accordance (PPA) and percentage of negative accordance (NPA) between mNGS and traditional methods. (D) Comparison of AUC, ACC, SEN, SPE, PPV, NPV, PPA and NPA between combined approaches (mNGS+Routine culture, mNGS+PCR and mNGS+X-pert), traditional methods and overall performance.

Furthermore, we explored the combination of threshold-optimized mNGS with routinely used clinical MTBC assays. The results showed that when combined with routine culture (AUC = 0.718), PCR (AUC = 0.741), and X-pert (AUC = 0.763) individually, the AUC values for the combined approaches, mNGS+Routine culture, mNGS+PCR and mNGS+X-pert, were significantly higher, with AUC values of 0.837, 0.868 and 0.873, respectively (Figure 3B). The P-values for these comparisons were all less than 0.05 (2.16e-05, 8.72e-06 and 5.15e-05, respectively) (Figure 3B). The diagnostic strategy using mNGS+Xpert, the number of false-negative cases decreased from 28 to 15, indicating 13 additional cases were correctly identified, reflecting improved diagnostic sensitivity. The net correct reclassification was 13 cases, corresponding to a Net Reclassification Improvement (NRI) of 0.08%, with a Number Needed to Test (NNT) of 13.1—meaning one additional true case would be identified for every 13 individuals tested (Supplementary Table S1). These results indicate that combining threshold-optimized mNGS with standard clinical assays improves the diagnostic efficacy for detecting MTBC.

In addition to the AUC, we assessed the efficacy of various MTBC assays using ROC curves based on additional parameters related to diagnostic performance, including ACC, SEN, SPE, PPV, NPV, PPA and NPV (Figures 3C, D). The results showed that the threshold-optimized mNGS demonstrated superior performance compared to routinely used clinical MTBC assays (routine culture, PCR and X-pert), with the highest values for ACC (0.912), SEN (0.746), SPE (1.000), PPV (1.000), NPV (0.881), PPA (0.746) and NPA (1.000). Notably, four parameters (ACC, SEN, NPV and PPA) showed significant improvements relative to other assays (Figure 3C). Additionally, when compared to routine culture, PCR and X-pert alone, the various combinations of assays, including with mNGS with routine culture, mNGS with PCR, and mNGS with X-pert, also exhibited significant enhancements in AUC, ACC, SEN, NPV and PPA (Figure 3D).

The DCA curve shows that the orange curve (RPM ≥ 0.05) outperforms the other strategies (Treatment All and Treatment None) between approximately 5% and 35% threshold probabilities, indicating the highest clinical net benefit in this range. The red dashed line marks the RPM ≥ 0.05 threshold, which falls within this optimal net benefit region, confirming its validity. Below 5%, the orange curve aligns closely with the Treatment All strategy, suggesting a high false positive rate in this range. Above 35%, the orange curve maintains a higher net benefit than Treatment None, showing that RPM ≥ 0.05 is still more beneficial than no treatment. The orange curve remains stable and above the Treatment None line beyond 40%, indicating consistent clinical value even at higher threshold probabilities (Supplementary Figure S1).

Discussion

TB is a serious threat to human health, so precise diagnosis and treatment of TB has attracted much attention. Common diagnostic methods for TB include tuberculin skin test, TSPOT.TB assay, acid-resistant staining, mycobacterial culture, and molecular diagnostic techniques such as PCR and X-pert MTB/RIF (Acharya et al., 2020). However, challenges associated with culture techniques and the limitations in sensitivity and specificity of conventional methods often result in these tests failing to meet clinical needs in a timely manner. Therefore, there is an urgent need for advanced precision diagnostic methods and in vitro diagnostic products for TB that are rapid, convenient, accurate, and cost-effective.

As we know, mNGS is regarded as an important complementary diagnostic method for identifying pathogenic microorganisms (Gu et al., 2019) This approach enables the rapid screening of difficult-to-cultivate or rare microorganisms, as well as the prompt identification of known pathogens such as TB, and detection of drug resistance genes (Gautam et al., 2018; Guthrie et al., 2018; Borroni et al., 2019; Chiu and Miller, 2019). Indeed, mNGS has been widely utilized for the diagnosis of pathogenic microorganisms and for guiding clinical treatment in various infections, including central nervous system infections, lung infections, bloodstream infections, ocular diseases, and infections in immunocompromised patients. Several clinical application studies that have demonstrated that mNGS enhances the clinical detection of MTBC and provides adjunctive diagnostic value for TB compared to conventional diagnostic methods (Liu et al., 2021; Liu et al., 2022). Despite rapid advancements in metagenomic next-generation sequencing (mNGS) and its growing application in clinical diagnostics, several major challenges remain in data analysis and clinical interpretation. These include the absence of standardized workflows and quality control measures, interference from high host DNA background and limited microbial biomass, difficulties in interpreting large volumes of complex data, limitations in current databases and bioinformatics pipelines, and the need for effective integration of sequencing results with clinical context (Simner et al., 2018; Laudadio et al., 2019; Liu and Ma, 2024; Zhao et al., 2024). Key questions include how to ensure comparability of mNGS data for the same pathogenic microorganisms across different samples and how to establish an appropriate threshold for positive determination of mNGS results for identical pathogens. These issues present substantial challenges for clinicians interpreting mNGS data in patients with suspected TB and appear to have been inadequately addressed in existing studies.

To address the challenges associated with mNGS in detecting MTBC, this study analyzed the RPM of MTBC from patients’ BALF samples and its correlation with clinical TB diagnosis as gold standard. A retrospective cohort study was conducted comprising 59 patients clinically diagnosed with TB and 111 patients diagnosed with non-TB. Our study demonstrated a high diagnostic performance of mNGS for pulmonary TB using BALF specimens (AUC = 0.873), which is consistent with previous findings, such as study (Liu et al., 2021). This similarity across independent cohorts supports the reproducibility and generalizability of mNGS in diverse clinical settings. Both studies employed mNGS in TB-suspected patients using BALF samples, suggesting a stable diagnostic capability of this technique. While there are differences in study design-including sample size [170 in our study vs. 322 in (Liu et al., 2021)], reference standards, and patient inclusion criteria—the convergence of diagnostic accuracy suggests that mNGS offers robust performance irrespective of these contextual variations. Additionally, we also compared the performance of mNGS at the optimal threshold (MTBC RPM≥0.05) with that of conventional MTBC detection methods, including culture, PCR, and Xpert. The results suggest that optimizing the threshold can enhance the interpretability of mNGS data, enabling clinicians to utilize this information more confidently for efficient diagnosis of MTBC. The decision curve analysis showed that our model provided a higher net benefit compared to the “treat all” or “treat none” strategies across a range of clinically relevant threshold probabilities (0.1 to 0.6), indicating its practical value in guiding clinical decision-making (Supplementary Figure S2).

Furthermore, we also found that the combination of mNGS with routine culture, mNGS with PCR, and mNGS with X-pert exhibited significantly higher efficacy in determining TB than any of the conventional methods alone. This finding aligns with previous studies (Zhou et al., 2019), while our work provides more concrete data to support this conclusion.

Clinical practice and challenges

In translating the ≥0.05 RPM threshold into routine process, three real barriers must be weighed.

a. Turnaround time (TAT): Our in-house workflow delivers results within 24–48 h (median 30 h), exceeding the 2 h of Xpert MTB/RIF. Shortening TAT to <24 h is feasible with same-day batching and rapid library prep kits (e.g., Illumina DNA Prep Mx), but requires 24-hour sequencing core staffing. We therefore propose tiered algorithms: smear/Xpert for initial triage, reserving mNGS for smear-negative, clinically suspicious cases where delayed TAT is acceptable.

b. Cost and infrastructure: Reagent costs are currently ≈US$250 per mNGS test (library prep + sequencing) versus ≈US$10 for smear microscopy and ≈US$15–20 for Xpert (cartridge-based). However, the higher specificity (100%) of the 0.05 RPM threshold has the opportunity to offsets downstream costs by averting empiric multi-drug therapy in false-positive patients. In low-resource settings lacking NGS infrastructure, regional sequencing centers receiving couriered samples or cloud-based bioinformatics (upload FASTQ to secure servers) also can reduce capital outlay.

c. Balancing sensitivity and specificity to prevent overtreatment: The selected 0.05 RPM threshold yields sensitivity 74.6% and specificity 100% in our cohort. The zero false-positive rate at this cut-off minimizes the risk of unnecessary anti-TB therapy and its attendant toxicities, a critical advantage where second-line drug resources are scarce.

Limitations

While our study establishes a preliminary RPM threshold (≥0.05) for MTBC detection by mNGS, several limitations must be acknowledged. First, the retrospective, single-center design and moderate sample size (n = 170) restrict the generalizability of the threshold to populations with distinct MTBC strain diversity, HIV prevalence, or pediatric paucibacillary disease. Additionally, the sample composition may not fully represent patients with rare co-infections or atypical presentations, requiring further validation in larger, multi-center cohorts.

Second, the threshold’s applicability to other sample types, such as sputum or cerebrospinal fluid (CSF), remains uncertain. Different sample types may exhibit varying pathogen loads and levels of host DNA contamination, which could affect the optimal RPM threshold. For instance, sputum samples may have a higher bacterial load, while CSF samples might have lower pathogen concentrations. Therefore, further studies are necessary to determine if this threshold holds for other sample types commonly used in clinical practice.

Third, inter-laboratory reproducibility remains unexplored; variations in DNA extraction kits, sequencing depth (< 20M reads), or bioinformatic pipelines (e.g., k-mer vs. alignment-based) could necessitate recalibration of the threshold across different laboratories and sequencing platforms. Different sequencing platforms may yield varying results due to differences in depth, error rates, and sensitivity, making the threshold potentially platform-dependent.

We also did not evaluate the threshold in immunocompromised hosts (e.g., transplant recipients), where atypical bacterial loads and co-infections could alter RPM distributions. The absence of longitudinal data precludes assessment of threshold stability during anti-TB therapy or across specimen types (BALF vs. sputum vs. CSF). Finally, real-world implementation must consider infrastructure, turnaround time, cost-effectiveness, and regulatory validation—none of which were formally assessed here. Multicenter, prospective studies incorporating health-economic modeling in resource-constrained environments are therefore warranted.

Conclusions

In summary, the results of the present study suggest that optimizing thresholds for mNGS may enhance the efficiency of clinical detection of MTBC and improve the accurate diagnosis of TB. This optimization could lead to additional patient benefits, including increased diagnostic efficacy, timeliness, and reduced clinical costs. Ultimately, the implementation of such threshold adjustments provides a more efficient, convenient, and cost-effective technology for accurate diagnosis and treatment of TB. To address these limitations in this study, we have designed a multicenter validation roadmap we plan to carry out a phased multicenter validation roadmap in future. Phase 1 will engage TB consortium networks and pediatric TB centers across high-, intermediate-, and low-burden regions to recruit ≥1,000 prospectively enrolled patients (adults, children, HIV-positive, and immunocompromised). Standardized SOPs—including identical DNA extraction kits, spike-in controls (ERCC), and ≥20 M reads per sample-will be distributed to minimize inter-site variability. And, we will perform longitudinal sub-studies to examine whether the ≥0.05 RPM threshold remains valid. Phase 2 will test threshold portability on alternative platforms (Ion Torrent, Nanopore) and at reduced sequencing depths (5–10 M reads) typical of resource-limited laboratories. Phase 3 will integrate decision-analytic modeling to compare mNGS-guided versus standard diagnostic algorithms in terms of cost per accurate diagnosis, turnaround time, and patient outcomes. Meanwhile, RPM dynamics were incorporated into the health economics model to evaluate the cost-effectiveness of personalized treatment based on RPM in reducing toxic and side effects and QALY.

Data availability statement

Metagenomic sequencing data without reads of human genome was publicly available in NCBI with the SRA accession PRJNA1052360.

Ethics statement

This study was approved by the Ethics Committee of Yongkang First People’s Hospital (YKSDYRMYYEC2022-KT-HS-001-01) and conducted in accordance with Declaration of Helsinki principles and relevant ethical and legal requirements. Informed consent was obtained from all participants or their legal guardians.

Author contributions

YL: Writing – original draft. LZ: Writing – original draft. GM: Writing – review & editing, Writing – original draft. CL: Writing – original draft. WH: Writing – original draft. RR: Writing – original draft. YZ: Writing – original draft. DY: Writing – original draft. SQ: Writing – original draft. SJ: Writing – original draft. CQ: Writing – original draft. XC: Writing – review & editing, Writing – original draft.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by grants from Jinhua Key Science and Technology Program Projects (Grant No. 2022-3-050).

Acknowledgments

We gratefully acknowledge the contributions of all study participants.

Conflict of interest

Author GM is employed by Zhejiang Key Laboratory of Digital Technology in Medical Diagnostics. Authors RR and XC are employed by MatriDx Biotechnology Co., Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Correction note

This article has been corrected with minor changes. These changes do not impact the scientific content of the article.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2025.1646194/full#supplementary-material

References

Acharya, B., Acharya, A., Gautam, S., Ghimire, S. P., Mishra, G., Parajuli, N., et al. (2020). Advances in diagnosis of Tuberculosis: an update into molecular diagnosis of Mycobacterium tuberculosis. Mol. Biol. Rep. 47, 4065–4075. doi: 10.1007/s11033-020-05413-7

PubMed Abstract | Crossref Full Text | Google Scholar

Borroni, D., Romano, V., Kaye, S. B., Somerville, T., Napoli, L., Fasolo, A., et al. (2019). Metagenomics in ophthalmology: current findings and future prospectives. BMJ Open Ophthalmol. 4, e000248. doi: 10.1136/bmjophth-2018-000248

PubMed Abstract | Crossref Full Text | Google Scholar

Chiu, C. Y. and Miller, S. A. (2019). Clinical metagenomics. Nat. Rev. Genet. 20, 341–355. doi: 10.1038/s41576-019-0113-7

PubMed Abstract | Crossref Full Text | Google Scholar

Dheda, K., Perumal, T., Moultrie, H., Perumal, R., Esmail, A., Scott, A. J., et al. (2022). The intersecting pandemics of tuberculosis and COVID-19: population-level and patient-level impact, clinical presentation, and corrective interventions. Lancet Respir. Med. 10, 603–622. doi: 10.1016/S2213-2600(22)00092-3

PubMed Abstract | Crossref Full Text | Google Scholar

Ding, L., Liu, Y., Wu, X., Wu, M., Luo, X., Ouyang, H., et al. (2021). Pathogen metagenomics reveals distinct lung microbiota signatures between bacteriologically confirmed and negative tuberculosis patients. Front. Cell Infect. Microbiol. 11, 708827. doi: 10.3389/fcimb.2021.708827

PubMed Abstract | Crossref Full Text | Google Scholar

Gautam, S. S., Mac Aogáin, M., Cooley, L. A., Haug, G., Fyfe, J. A., Globan, M., et al. (2018). Molecular epidemiology of tuberculosis in Tasmania and genomic characterisation of its first known multi-drug resistant case. PloS One 13, e0192351. doi: 10.1371/journal.pone.0192351

PubMed Abstract | Crossref Full Text | Google Scholar

Gu, W., Miller, S., and Chiu, C. Y. (2019). Clinical metagenomic next-generation sequencing for pathogen detection. Annu. Rev. Pathol. 14, 319–338. doi: 10.1146/annurev-pathmechdis-012418-012751

PubMed Abstract | Crossref Full Text | Google Scholar

Guthrie, J. L., Delli Pizzi, A., Roth, D., Kong, C., Jorgensen, D., Rodrigues, M., et al. (2018). Genotyping and whole-genome sequencing to identify tuberculosis transmission to pediatric patients in british columbia, Canada, 2005-2014. J. Infect. Dis. 218, 1155–1163. doi: 10.1093/infdis/jiy278

PubMed Abstract | Crossref Full Text | Google Scholar

Han, D., Li, Z., Li, R., Tan, P., Zhang, R., and Li, J. (2019). mNGS in clinical microbiology laboratories: on the road to maturity. Crit. Rev. Microbiol. 45, 668–685. doi: 10.1080/1040841X.2019.1681933

PubMed Abstract | Crossref Full Text | Google Scholar

Huang, H.-Y., Tsai, Y.-S., Lee, J.-J., Chiang, M.-C., Chen, Y.-H., Chiang, C.-Y., et al. (2010). Mixed infection with Beijing and non-Beijing strains and drug resistance pattern of Mycobacterium tuberculosis. J. Clin. Microbiol. 48, 4474–4480. doi: 10.1128/JCM.00930-10

PubMed Abstract | Crossref Full Text | Google Scholar

Huang, Y., Ai, L., Wang, X., Sun, Z., and Wang, F. (2022). Review and updates on the diagnosis of tuberculosis. J. Clin. Med. 11, 5826. doi: 10.3390/jcm11195826

PubMed Abstract | Crossref Full Text | Google Scholar

Laudadio, I., Fulci, V., Stronati, L., and Carissimi, C. (2019). Next-generation metagenomics: methodological challenges and opportunities. OMICS. 23, 327–333. doi: 10.1089/omi.2019.0073

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, X., Chen, Y., Ouyang, H., Liu, J., Luo, X., Huang, Y., et al. (2021). Tuberculosis Diagnosis by Metagenomic Next-generation Sequencing on Bronchoalveolar Lavage Fluid: a cross-sectional analysis. Int. J. Infect. Dis. 104, 50–57. doi: 10.1016/j.ijid.2020.12.063

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, Y. and Ma, Y. (2024). Clinical applications of metagenomics next-generation sequencing in infectious diseases. J. Zhejiang Univ Sci. B. 25, 471–484. doi: 10.1631/jzus.B2300029

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, Y., Wang, H., Li, Y., and Yu, Z. (2022). Clinical application of metagenomic next-generation sequencing in tuberculosis diagnosis. Front. Cell Infect. Microbiol. 12, 984753. doi: 10.3389/fcimb.2022.984753

PubMed Abstract | Crossref Full Text | Google Scholar

Meyer, K. C., Raghu, G., Baughman, R. P., Brown, K. K., Costabel, U., du Bois, R. M., et al. (2012). An official American Thoracic Society clinical practice guideline: the clinical utility of bronchoalveolar lavage cellular analysis in interstitial lung disease. Am. J. Respir. Crit. Care Med. 185, 1004–1014. doi: 10.1164/rccm.201202-0320ST

PubMed Abstract | Crossref Full Text | Google Scholar

Shi, C.-L., Han, P., Tang, P.-J., Chen, M.-M., Ye, Z.-J., Wu, M.-Y., et al. (2020). Clinical metagenomic sequencing for diagnosis of pulmonary tuberculosis. J. Infect. 81, 567–574. doi: 10.1016/j.jinf.2020.08.004

PubMed Abstract | Crossref Full Text | Google Scholar

Simner, P. J., Miller, S., and Carroll, K. C. (2018). Understanding the promises and hurdles of metagenomic next-generation sequencing as a diagnostic tool for infectious diseases. Clin. Infect. Dis. 66, 778–788. doi: 10.1093/cid/cix881

PubMed Abstract | Crossref Full Text | Google Scholar

Sun, W., Lu, Z., and Yan, L. (2021). Clinical efficacy of metagenomic next-generation sequencing for rapid detection of Mycobacterium tuberculosis in smear-negative extrapulmonary specimens in a high tuberculosis burden area. Int. J. Infect. Dis. 103, 91–96. doi: 10.1016/j.ijid.2020.11.165

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, Y., Zhang, W., and Zhang, X. (2024). Application of metagenomic next-generation sequencing in the diagnosis of infectious diseases. Front. Cell Infect. Microbiol. 14, 1458316. doi: 10.3389/fcimb.2024.1458316

PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, X., Wu, H., Ruan, Q., Jiang, N., Chen, X., Shen, Y., et al. (2019). Clinical Evaluation of Diagnosis Efficacy of Active Mycobacterium tuberculosis Complex Infection via Metagenomic Next-Generation Sequencing of Direct Clinical Samples. Front. Cell Infect. Microbiol. 9, 351. doi: 10.3389/fcimb.2019.00351

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: tuberculosis, bronchoalveolar lavage fluid, metagenomic next-generation sequencing, threshold optimization, diagnostic efficacy

Citation: Li Y, Zhang L, Ma G, Li C, Hu W, Ren R, Zang Y, Ying D, Qiu S, Jin S, Qiu C and Cao X (2025) Optimization of decision thresholds for Mycobacterium tuberculosis can effectively improve the performance of mNGS in tuberculosis diagnosis. Front. Cell. Infect. Microbiol. 15:1646194. doi: 10.3389/fcimb.2025.1646194

Received: 13 June 2025; Accepted: 25 August 2025;
Published: 11 September 2025; Corrected: 18 September 2025.

Edited by:

Beiwen Zheng, Zhejiang University, China

Reviewed by:

Xiao Guo Hui, Zhejiang University, China
Meng Fu, University of Science and Technology of China (USTC), China

Copyright © 2025 Li, Zhang, Ma, Li, Hu, Ren, Zang, Ying, Qiu, Jin, Qiu and Cao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuecui Li, eWtseWN3aEAxMjYuY29t; Xuefang Cao, eGZjemp1QDE2My5jb20=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.