- 1Department of Radiology, Chongqing Public Health Medical Center, Chongqing, China
- 2Department of Research and Development, Shanghai United Imaging Intelligence, Shanghai, China
- 3Department of Radiology, Public Health Clinical Center of Chengdu, Chengdu, China
Objectives: Develop and evaluate a machine learning (ML) model based on CT radiomics for the identification of non-tuberculous mycobacterial lung disease (NTM-LD) and pulmonary tuberculosis (PTB).
Materials and methods: Retrospectively, chest CT images with NTM-LD and PTB patients confirmed at Medical Center 1 between January 2019 to December 2024 were collected. The dataset was divided into a training cohort and a validation cohort in a 7:3 ratio. Additionally, patients from medical center 2 were collected for external test. A radiomics model was constructed using five machine learning algorithms: Logistic Regression (LR), Random Forest (RF), Quadratic Discriminant Analysis (QDA), YeoJohnson_LR, and YeoJohnson_LR (LDA). Receiver operating characteristic (ROC) and area under the curve (AUC) were used to evaluate the diagnostic efficacy of the five models, and the optimal prediction model was obtained. The optimal model was compared with three radiologists in the testing cohort.
Results: A total of 1,512 cases were included, including 1,407 cases from Center 1 (NTM-LD: 547; PTB: 860) and 105 patients from Center 2 (NTM-LD: 32; PTB: 73). Patients in the NTM-LD group were significantly older than those in the PTB group (p < 0.001). There was a significant gender difference between the NTM-LD group and the PTB group (p = 0.005). By comparing the five models, it was found that the YeoJohnson_LR (LDA) model was the best-performing prediction model, with an accuracy of 0.8286 in the external test. The AUCs of the YeoJohnson_LR (LDA) model on the training, validation, and test cohort were 0.8421, 0.8037, and 0.8233, respectively. In comparison with radiologists, the YeoJohnson_LR (LDA) model demonstrated gains of 3.12 ~ 15.62% in sensitivity and 6.85 ~ 12.33% in specificity.
Conclusion: The YeoJohnson_LR (LDA) model can be used to distinguish NTM-LD from PTB, assisting in rapid clinical diagnosis and benefiting patients with NTM-LD.
Introduction
Pulmonary tuberculosis (PTB), an infectious disease caused by Mycobacterium tuberculosis, is to being the world’s leading killer from a single infectious disease with 1.25 million deaths in 2023 (1). The transmission of Mycobacterium tuberculosis infection among the population mainly occurs through inhalation of aerosols containing Mycobacterium tuberculosis. Exposure to Mycobacterium tuberculosis can lead to primary active pulmonary tuberculosis or asymptomatic latent tuberculosis infection (LTBI) (2).
Nontuberculous mycobacteria (NTM) refer to mycobacterial infectious diseases other than Mycobacterium tuberculosis complex and Mycobacterium leprae. NTM is a common opportunistic pathogen in the environment and is prone to occur in people with weakened immune systems (3). Unlike Mycobacterium tuberculosis, which has received widespread attention, NTM has received much less attention. However, an insufficient understanding of NTM may lead to an underestimation of its harmfulness. NTM are ubiquitous environmental bacteria, and people can get infected by inhaling or ingesting bacteria in soil and water (4). NTM can invade multiple organs and systems throughout the body, with the lungs being the most commonly affected and having a very high mortality (5, 6).
In recent years, the infection rate of NTM has been on the rise in some countries and regions, which has drawn widespread attention (7–10). And it is reported that the NTM separation rate in China has increased (11–13). An analysis of sputum samples from suspected cases collected at 72 national tuberculosis surveillance sites in 31 provinces of the Chinese mainland revealed that among 4,917 mycobacterial isolates, 6.4% were NTM (11). Environmental pollution, population aging, and the use of immunosuppressants may have contributed to the infection rate of NTM. Although technological development and medical progress have to some extent increased the detection rate of NTM, the diagnosis of NTM lung diseases (NTM-LD) remains challenging due to the similar clinical and imaging manifestations of NTM-LD and PTB. The existing diagnostic methods mainly rely on etiological detection and CT examination, but they have certain limitations (14, 15). To explore the potential characteristic differences between NTM-LD and PTB, it is urgent to develop a new, efficient, sensitive, and economical technical means to assist in differential diagnosis.
Recently, radiomics has been increasingly widely applied in the medical field. By mining data features, it has shown great potential in assisting the diagnosis (16, 17), differentiation (18, 19), treatment (20, 21) and prediction (22, 23) of diseases. This provides new inspiration for identifying NTM-LD from PTB.
This study aims to construct radiomics models of NTM-LD and PTB using five machine learning algorithms (Z_score_LR, Z_score _RF, Z_score _QDA, YeoJohnson_LR, YeoJohnson_LR (LDA)). To explore the value of computed tomography (CT) radiomics in the differentiation of NTM-LD and PTB.
Materials and methods
Patients and dataset
This retrospective study received ethical approval from the Ethics Committee of Chongqing Public Health Medical Center (Approval No. 2024-019-02-KY, Center 1) and was granted an informed consent waiver for the period spanning January 2019 to December 2024. Definitive diagnosis was established through microbiological confirmation according to established reference standards: NTM-LD infections were diagnosed following the Treatment of Nontuberculous Mycobacterial Pulmonary Disease: An Official ATS/ERS/ESCMID/IDSA Clinical Practice Guidelines (version 2020) (24), while PTB diagnoses adhered to the National Health Commission of the People’s Republic of China. In: Diagnostic criteria. For pulmonary tuberculosis (WS 288-2017) (25). Inclusion criteria comprised: (1) age ≥ 18 years; (2) microbiologically confirmed PTB or NTM-LD; and (3) availability of pretreatment CT images. Exclusion criteria encompassed: (1) co-infection with both TB and NTM-LD; (2) concomitant pulmonary pathologies, including lung cancer, fungal infections, or pneumoconiosis, etc.; (3) prior history of pulmonary surgery; and (4) missing or suboptimal CT images precluding reliable analysis. Identical ethical and diagnostic criteria were employed for data collection at Center 2. Baseline demographic characteristics (age and sex) were documented for all enrolled patients, with non-contrast CT scans acquired within 30 days before treatment initiation. Patient screening and enrollment was shown in Figure 1.
Figure 1. Flowchart of patient screening. PTB, pulmonary tuberculosis; NTM-LD, nontuberculous mycobacteria lung disease.
CT image acquisition
CT image acquisition was performed using two scanner models: the GE Optima CT680 Expert 64-slice CT scanner (Center 1) and the GE BrightSpeed CT scanner (Center 2). All patients were positioned supine and scanned during a single breath-hold at full inspiration, covering the entire lung volume from the apex to the costophrenic angles. Standard non-contrast scans were acquired with a tube voltage of 120 kV and automated tube current modulation. Images were reconstructed with a slice thickness and interval of 5 mm using a 512 × 512 matrix. For the Optima CT680 Expert scanner (Center 1), the in-plane pixel resolution ranged from 0.312 mm × 0.312 mm to 0.977 mm × 0.977 mm. For the BrightSpeed scanner (Center 2), spatial resolution was 20 lp/cm with a density resolution of 0.3%. To standardize images for subsequent radiomics analysis and minimize variations due to differences in slice thickness or other acquisition parameters between scans, the CT values (Hounsfield Units, HU) from each scan were interpolated to a voxel size of 1 mm × 1 mm × 5 mm using a lung window setting (window level/width: -600 HU / 1,500 HU).
Lungs segmentation and radiomics feature extraction
All images were processed using the uAI Research Portal (Version: 20250130sp1), an independently developed research platform by Shanghai United Imaging Intelligent Medical Technology Co., Ltd. Automatic segmentation of bilateral lung fields was performed using the VB-Net deep learning model. This model demonstrated high performance in our prior work, achieving a mean Dice similarity coefficient (DSC) of 0.989 ± 0.004 (mean ± SD), and has been validated in other pulmonary infection studies (26, 27). Following automated segmentation, all results underwent expert review by board-certified thoracic radiologists (>10 years’ experience), with any inaccuracies manually corrected.
Radiomics feature extraction from the segmented lung volumes was implemented using PyRadiomics (version 3.8.8). Image preprocessing included resampling to an isotropic in-plane resolution of 0.7 × 0.7 × 5 mm3 and gray-level discretization into 25 fixed bins using nearest-neighbor interpolation. A total of 2,264 radiomics features were extracted, including 450 first-order features, 14 shape features, 525 gray level co-occurrence matrix (GLCM) features, 350 gray level dependence matrix (GLDM) features, 400 gray level run length matrix (GLRLM) features, 400 gray level size zone matrix (GLSZM) features, and 125 neighboring gray tone difference matrix (NGTDM) features.
Radiomics feature selection and radiomics model construction
Radiomics feature selection was performed using a sequential approach: Pearson correlation analysis (p < 0.05) identified relevant features, followed by Least Absolute Shrinkage and Selection Operator (LASSO) regression with 10-fold cross-validation to refine the most predictive subset, retaining only features with non-zero coefficients. The optimal regularization penalty (λ) was determined based on the criterion of minimum binomial deviance plus one standard deviation. A patient-specific radscore was subsequently computed for each subject using the linear predictor derived from the LASSO model.
Using the selected features, predictive models were trained on the training cohort employing three distinct MLalgorithms: LR, RF, and QDA. We systematically evaluated the impact of feature normalization strategies—comparing Z-score normalization against Yeo-Johnson transformation—and examined the influence of dimensionality reduction by constructing models both with and without prior application of Linear Discriminant Analysis (LDA). Finally, we have constructed five models, including Z_score_LR, Z_score _RF, Z_score_QDA, YeoJohnson_LR and YeoJohnson_LR (LDA). Hyperparameter tuning and model selection were performed using the validation cohort. The final performance and generalizability of the optimal model were rigorously assessed on an independent testing cohort from Center 2. The step-by-step workflow for radiomics model development and validation is provided in Supplementary material 1. And the entire workflow of model development, validation, testing, and comparison was illustrated in Figure 2.
Statistical analysis
Statistical analyses were performed using R software (version 4.4.1; R Foundation for Statistical Computing, Vienna, Austria). Model discrimination was evaluated using receiver operating ROC and AUC. Calibration was assessed with the Hosmer-Lemeshow test and calibration curves, while clinical utility was quantified via decision curve analysis (DCA). To compare model performance, the DeLong test analyzed differences in AUC values, and the net reclassification index (NRI) and integrated discrimination improvement (IDI) measured the incremental predictive value of the optimal model over compared methods. Additionally, three radiologists independently reviewed CT images in the testing cohort under blinded conditions to differentiate NTM-LD from PTB, with the understanding that each patient had a single-pathogen infection. Performance metrics—including AUC, accuracy, sensitivity, specificity, precision, and F1 score—were calculated for both the machine learning models and radiologists. Confusion matrices provided visual comparisons of their diagnostic outcomes. Statistical significance was defined as p < 0.05, with p-values adjusted for multiple testing using Benjamini-Hochberg method where applicable.
Results
Baseline characteristics
This study included 1,407 cases from Center 1, comprising 547 patients with NTM-LD (mean age:50 ± 18 years; 293 male) and 860 patients with PTB (mean age:46 ± 17 years; 517 male). Data were randomly split into a training cohort (70%, n = 984) and a validation cohort (30%, n = 423) at a 7:3 ratio. Additionally, 105 patients from Center 2 were collected for external testing, including 32 NTM-LD (mean age:57 ± 15 years; 19 male) and 73 PTB (mean age:48 ± 20 years; 54 male) patients. Patients in the NTM-LD group were significantly older than those in the PTB group (p < 0.001). There was a significant gender difference between the NTM-LD group and the PTB group (p = 0.005). Baseline characteristics were detailed in Table 1 and Figure 3.
Figure 3. Pie charts of demographic analysis (top row: PTB, middle row: NTM-LD) and density plots of age distribution (bottom row) between NTM-LD and PTB patients across total, training, validation, and testing cohort. p < 0.05 were considered statistically significant.
Feature selection
A total of 2,264 radiomics features were extracted from non-contrast CT scans. Following selection of 1,414 features based on Pearson correlation (p < 0.05), LASSO regression with 10-fold cross-validation identified 85 robust features. Among these the three most influential features for the model were glszm_wavelet.LLL.ZonePercentage (ZP), gldm_log.sigma.2.0.mm.3D.DependenceVariance (DV), and glszm_log.sigma.1.0.mm.3D.GrayLevelVariance (GLV), with LASSO coefficients of 0.104, 0.059, and 0.058, respectively (see Supplementary material 2; Table 1). And the top 15 discriminative features differentiating NTM-LD from PTB groups detailed in Table 2 of Supplementary material 2. Notably, the Radscore derived from these features demonstrated significantly higher values in NTM-LD patients compared to PTB patients across both training and validation cohorts (p < 0.001), as visualized in Figure 4.
Figure 4. Radscore distributions between NTM-LD and PTB patients in the training cohort and the validation cohort. The top panel displays rank-ordered Radscore profiles for individual patients in the training (A) and validation (B) cohorts. The bottom panel presents violin plots of Radscore distributions, where the shape of the violins reflects data density in the training (C) and validation (D) cohorts. In both the training and validation cohorts, NTM-LD and PTB exhibit significantly different Radscore distributions (p < 0.001). PTB, pulmonary tuberculosis; NTM-LD, nontuberculous mycobacteria lung disease.
Performance of radiomics models in NTM-LD versus PTB classification
The machine learning models exhibited varying performance depending on normalization strategies and algorithms. Specifically, under Z-score normalization, Z_score_LR outperformed Z_score _RF and Z_score _QDA in the testing cohort, with AUC of 0.7526 vs. 0.7276 (RF, p = 0.5745) and 0.6211 (QDA, p = 0.0166), sensitivity of 0.6176 vs. 0.5882 (RF, p = 0.7539) and 0.4706 (QDA, p < 0.001), and accuracy of 0.6977 vs. 0.6899 (RF, p = 0.5114) and 0.6822 (QDA, p = 0.5032). Building on this baseline, YeoJohnson normalization further improved Z_score_LR performance, yielding a + 1.15% AUC gain (0.7641, p = 0.5966) and +1.57% accuracy improvement (0.7134, p = 0.0215). Crucially, the incorporation of LDA prior to the LR classifier significantly enhanced model performance, achieving a state-of-the-art AUC of 0.8233 (+7.07% vs. baseline Z_score_LR, p = 0.0061; +5.92% vs. YeoJohnson_LR, p = 0.0094). Beyond discriminative metrics, calibration analysis revealed YeoJohnson_LR (LDA) most closely approximated the ideal calibration line, with Brier scores of 0.182 (validation) and 0.140 (testing). And decision curve analysis further demonstrated YeoJohnson_LR (LDA)'s consistent superiority across the clinically relevant threshold spectrum (0.15–1.0), yielding higher net benefit than all comparator models. Significant reclassification improvement was confirmed by Delong test (p = 0.0094), NRI (0.1703, p = 0.0129) and IDI (0.0851, p = 0.0003) versus YeoJohnson_LR. Tables 2, 3 detail the individual model performance and provide a comparative analysis in training, validation, and testing cohorts. And Figure 5 provided various line charts to visualize the models’ evaluation performance and clinical net benefit.
Figure 5. Performance evaluation of the model across training, validation, and testing cohorts. The figure presents ROC curves (A,D,G), calibration curves (B,E,H), and clinical decision curves (C,F,I) for the training cohort (top row), validation cohort (middle row), and testing cohort (bottom row), respectively. Each row consistently displays the three evaluation metrics for its respective cohort, demonstrating the model’s diagnostic performance, prediction accuracy, and clinical utility across development and independent validation stages.
Comparison of performance between optimal model and radiologists
The YeoJohnson_LR (LDA) model demonstrated superior diagnostic performance compared to all three radiologists, regardless of experience level (Table 4). Its significant advantage is quantified across key metrics: achieving a substantially higher AUC (0.8233) than the radiologists (ranging from 0.6773 to 0.7391, representing improvements of 8.42 to 14.60%). Similarly, the model’s accuracy (0.8286) surpassed that of the junior and median-experience radiologists by 9.53% (both 0.7333) and the senior radiologist by 5.72% (0.7714). Notably, the model exhibited its largest performance gap in specificity (0.8904), outperforming the radiologists by 6.85 to 12.33%, highlighting a particular strength in reducing false positives. It also showed higher sensitivity (0.6875 vs. 0.5313 ~ 0.6563, +3.12% to +15.62%), significant gains in precision (+11.57% ~ 16.66%) and F1 score (+7.33% ~ 16.13%). These results collectively demonstrate the model’s consistent superiority in diagnostic accuracy, particularly its ability to minimize false-positive rates. The human-machine comparison was detailed in Table 4, Figure 6 represents the confusion matrix of YeoJohnson_LR (LDA) and three radiologists.
Figure 6. Diagnostic performance comparison. Confusion matrices for the optimal radiomics model (YeoJohnson_LR (LDA), A) and three radiologists: junior (C), median (B), and senior (D). Color intensity corresponds to prediction frequency (darker blue = higher counts).
Discussion
In this study, we included baseline characteristics such as gender and age. We found that patients in the NTM-LD group were significantly older than those in the PTB group (p < 0.001), which was consistent with the results of previous studies (15, 28–30). The elderly are prone to NTM-LD infection. This might be related to the fact that NTM is an opportunistic pathogen. The immunity of the elderly may decline as they get older, making them more prone to illness (12, 29). Our study revealed that NTM-LD is more prevalent in females (p = 0.005). It has been reported that NTM-LD is more common among older women; specifically, morbidity and disease risk increase markedly in women aged ≥65 years (8, 31). However, some other studies have found that the incidence of NTM-LD and PTB has no relation to gender (32–34). Males and females differ in pathogenic infections due to the sexes providing different genetic backgrounds, anatomic niches, immunological profiles, and hormonal environments that can directly affect pathogens as well as the development of diseases following infection (35). Thus, the baseline characteristics of the patients alone are not sufficient to distinguish between NTM-LD and PTB.
Regarding clinical symptoms, both NTM-LD and PTB may present with symptoms such as cough, fever, dyspnea, night sweats, fatigue, weight loss, etc. In the study by Kim C et al. (33), dyspnea was more common in patients with PTB, while cough was more prevalent in NTM-LD patients (p < 0.001). Contrary to the results of Kim et al., in the study by Wangle L et al. (15), cough was more common in PTB, while dyspnea was more common in NTM-LD. Giller D et al. (36) and Klann E et al. (37) found that weakness, dyspnea, fever, weight loss, and night sweats were more common in PTB, while hemoptysis and pulmonary hemorrhage were more common in NTM-LD (p < 0.05) (38). And Zhang W et al. (29) revealed symptoms such as cough, sputum production, fever, dyspnea, shortness of breath, hemoptysis, chest pain, fatigue, emaciation, and appetite have no significant difference between NTM-LD and PTB. Therefore, it is impossible to distinguish patients with NTM-LD from those with PTB solely based on clinical symptoms.
Bacterial culture and strain identification are the gold standard for diagnosing NTM-LD. However, these methods are difficult to be widely applied due to their time-consuming (2 ~ 6 weeks) nature (14, 39), economic factors, and the requirements of laboratory conditions. Several commonly used laboratory methods (such as AFB, Xpert, and T-SPOT) also have certain limitations. Acid-fast bacilli (AFB) sputum smear is widely used and the most efficient procedure for the initial screening of PTB. The presence of AFB in the stained sputum (AFB smear-positive) indicates a preliminary diagnosis of pulmonary mycobacterial infection (40). However, a positive AFB smear is not specific for PTB; both PTB and NTM-LD show positive results (41, 42). T-SPOT is an interferon-γ release assay (IGRA) used to detect tuberculosis. And NTM infection may also result in a positive T-SPOT result (14). Xpert is a molecular testing method used for detecting tuberculosis, but its results may have the possibility of false negatives (41). Notedly, Xpert and T-SPOT not only require professional equipment and technical support but also cannot distinguish between active tuberculosis and LTBI. In countries with a high burden of tuberculosis, a considerable proportion of the population suffers from LTBI (43), which may lead to positive results in Xpert or T-SPOT tests. So, laboratory tests have limitations in enabling early diagnosis. In many tuberculosis epidemic areas, it is not uncommon for patients to receive empirical anti-tuberculosis treatment while waiting for the results of the bacterial culture (15). This may lead to the burden of adverse drug reactions and economic costs.
CT examination has the advantages of being non-invasive, available, and efficient, and is widely used in the screening, diagnosis, and follow-up of lung diseases (44–46). Although the pulmonary manifestations of NTM-LD and PTB are highly similar (40, 47), there are still some differences. For instance, Yuan M et al. (40) discovered that the prevalence of pleural effusion, nodules < 10 mm in size, tree-in-bud pattern, and bronchiectasis in PTB patients was significantly higher than NTM-LD patients (p < 0.05). The Chu HQ et al. (42) compared the imaging examinations of two groups of patients and found that the NTM-LD patients more frequently had bronchiectasis, thin-walled cavity (D ≥ 3 cm) (p < 0.05), and the right middle lobe and left lingualar segment bronchiectasis were more prominent in NTM-LD (p < 0.001). Noted, bronchiectasis in the right middle lobe and left lingualar segment is regarded as a key characteristic of NTM-LD (42, 48). The thin-walled cavity is also considered to be more indicative of NTM-LD rather than PTB (33). Certain imaging differences between NTM-LD and PTB enable machine models based on imaging data to distinguish between them. So far, there are very few studies on radiomics in differentiating NTM-LD from PTB. As far as we know, only a few reports have been published (30, 34, 47, 49, 50).
In this study, we developed and evaluated a set of machine learning models based on radiomics for differentiating NTM-LD from PTB. The results showed that the YeoJohnson_LR (LDA) model had the best diagnostic efficacy. In this study, we selected LR, RF, and QDA classifiers. The results showed that the LR classifier was the best-performing predictive model. Similar to the reports in the literature, previous studies have also shown that the LR model has a good recognition effect in differentiating NTM-LD from PTB. For example, Li HL et al. [30]constructed a radiomics model for discriminating NTM-LD from PTB, and found that the LRmodel was the optimal model. Its AUC, sensitivity, and specificity in the external test were 0.766, 0.833, and 0.710, respectively. Yan Q et al. (49) used six classifiers (KNN, SVM, XGBoost, RF, LR, and DT) to distinguish NTM-LD from PTB, and the results showed that the LR classifier has the highest precision, recall, and F1-score, which were 0.92, 0.94, and 0.93. Its AUC, sensitivity, and specificity were 0.95, 0.94, 0.87 in the external test. Zhou L et al. (34) compared the discriminatory abilities of four classifiers (XGBoost, LR, SVM, and RF) for NTM-LD and PTB. The results showed that the AUCs of these four classifiers were ≥ 0.7759 in both the training and validation cohorts. However, only SVM demonstrated stable performance in effectively identifying these two diseases.
Unlike the previous studies, in order to enhance the performance and stability of the LR, we preprocessed the data with the YeoJohnson transformation, both alone and combined with LDA, before constructing the model. As a result, in testing cohort, compared with the LR without preprocessing, the AUC, sensitivity, and accuracy of the YeoJohnson_LR model were improved. Moreover, the AUC, sensitivity, specificity, accuracy, and F1 score of YeoJohnson_LR (LDA) were significantly enhanced. Compared with YeoJohnson_LR, YeoJohnson_LR (LDA) demonstrated significant improvements in AUC, specificity, accuracy, precision, and F1 score in the testing cohort. This proves that by applying the YeoJohnson and LDA preprocessing methods before LR modeling, the performance of the model can be further optimized and enhanced. Furthermore, significant reclassification improvement was confirmed between the YeoJohnson_LR and YeoJohnson_LR (LDA) by Delong test (p = 0.0094), NRI (0.1703, p = 0.0129) and IDI (0.0851, p = 0.0003). It indicates that the LDA algorithm holds significant value in improving the application of model reclassification. In addition, the results of the decision curve analysis indicated that the YeoJohnson_LR model yielded a superior net benefit across a wide range of threshold probabilities compared to other models. We also observed that, compared with the PTB group, the radscore values were higher in the NTM-LD group. This suggests that a higher radscore might be more indicative of NTM-LD rather than PTB. However, the radscore calculation may vary depending on the number and type of imaging features included. Therefore, whether there is a critical threshold for distinguishing between NTM-LD and PTB still requires further investigation. Finally, in this study, we compared the YeoJohnson_LR (LDA) model with radiologists. The results showed that our model had higher sensitivity (0.6875) and specificity (0.8904) compared to radiologists. Our model achieved a gain of 3.12–15.62% in sensitivity and 6.85–12.33% in specificity compared to radiologists. This indicates that our model has a lower false negative rate and false positive rate, and has good clinical applicability.
In our study, the three most influential features incorporated into the model, as indicated by their Lasso coefficients, were glszm_wavelet.LLL.ZP (0.104), gldm_log.sigma.2.0.mm.3D.DV (0.059), and glszm_log.sigma.1.0.mm.3D.GLV (0.058). The sign of the Lasso coefficient indicates the direction of its effect on identifying NTM-LD, while its magnitude reflects the strength of influence.
Specifically, ZP quantifies texture coarseness as the ratio of the number of zones to the number of voxels in the ROI; a higher value implies that the ROI is comprised of a greater number of small zones, which signifies a finer, more homogeneous texture. In our model, ZP showed a positive association with NTM-LD (Lasso coefficient = +0.104), suggesting that lesions exhibiting such a finer, more homogeneous texture patterns are more likely to be classified as NTM-LD.
Dependence Variance (DV) DV Measures the variance in dependence size in the image. A higher DV value indicates greater dispersion in dependency sizes, revealing a high degree of spatial irregularity and heterogeneity in local structures. Its positive coefficient (0.059) suggests that imaging patterns with highly disordered and heterogeneous local textures correlate with the diagnosis of NTM-LD.
Gray-Level Variance (GLV): GLV measures the variance in gray level intensities for the zones. A higher GLV value signifies substantial intra-regional density variation, characterized by pronounced fluctuations in pixel values ranging from very low to very high density, thereby indicating strong overall heterogeneity. Its positive coefficient (0.058) suggests that this signature of high overall textural heterogeneity serves as another important criterion for the model in identifying NTM-LD. In summary, our model identifies the feature combination of “high ZP, high DV, and high GLV” as a key indicator for differentiating NTM-LD. This combination collectively defines an imaging texture pattern characterized by “High fineness and homogeneity, high dependence size heterogeneity, and high density heterogeneity” at both local and global scales. While these key features quantify a complex textural phenotype, whether it reflects a single specific change or results from a confluence of multiple imaging findings in NTM-LD cannot be definitively concluded from our current dataset. This is an important question that our future research will aim to address.
According to the literature, there are also some studies that have achieved good results by using different methods. For instance, Hu Y et al. (47) developed A multi-lesion radiomic (MLR) model to distinguish NTM-LD from PTB. The AUC of this model on the internal testing cohort was 90.1% (95% CI, 87.7–92.4%). Li HL et al. (30) constructed a multimodal model incorporating age, IL-6, and the 2 radiomics features, and the optimal model was from the LightGBM algorithm. Its favorable performance was verified in the external test dataset, with an AUC of 0.858, an accuracy of 0.745, and a sensitivity of 0.900. Liu Q et al. (28) established a nomogram prediction model for differentiating NTM-PD from PTB, incorporating six features (TB-IGRA, bronchiectasis, lymph node calcification, pleural effusion, hilar and mediastinal lymph node enlargement) with significant diagnostic value. Its AUC, sensitivity, specificity, and Youden index obtained were 0.938, 0.835, 0.911, and 0.746, respectively. Similarly, Zhang W et al. (29) analyzed the critical differential characteristics between NTM-PD and PTB, and developed a nomogram prediction model based on age, BMI, bronchiectasis, and lung cavitation, which effectively assesses the risk of NTM-PD occurrence. Its AUC, sensitivity, specificity, and accuracy were 0.861, 0.880, 0.740, and 0.833, respectively. Wang L et al. (15) developed a deep learning framework (3D-ResNet) based on CT images to distinguish NTM-PD from PTB. As a result, the AUC in the external test set was 0.78, the sensitivity was 0.75, and the specificity was 0.63. Park M et al. (51) constructed an ensemble model (ResNet 50 + EfficientNet B4) based on chest X-ray images and also achieved good results. The AUC, precision, and accuracy of this model for diagnosing NTM-LD on the external test set were 0.80, 0.82, and 0.78, respectively. Ying C et al. (14) combined T-SPOT with a deep learning model and discovered that it can greatly improve the classification precision of NTM-PD and PTB when the two methods of prediction are consistent. Therefore, it can be seen that the machine model has certain value in distinguishing NTM-LD from PTB, but there are differences in diagnostic performance among different models. To assist clinical diagnosis, developing a robust model that can adapt to multiple centers and devices will be a key issue to be addressed in the future.
This study is meaningful. First, current diagnosis of NTM-LD and PTB mainly relies on bacterial culture and species identification. Our model provides a rapid and cost-effective auxiliary diagnostic approach. It operates using chest CT images, making it easy to adopt across healthcare institutions at all levels, especially in primary care settings with limited resources. It is expected to shorten the diagnostic cycle, reduce medical costs, and provide immediate reference for clinical decision-making. Second, traditional imaging diagnosis depends on radiologists’ subjective visual assessment, which is easily influenced by experience. Our model analyzes CT images by quantifying imaging indicators, offering objectivity. This reduces interference from human subjectivity and enhances the standardization and reproducibility of the diagnostic process. Third, the study involved 1,512 cases from two centers, which supplements the shortcomings of small sample size and single center in previous studies on radiomics machine learning (28, 29, 47, 50).
Finally, previous studies focused on comparing the diagnostic performance of various classifiers in machine learning (30, 34). Based on the previous research, we added a new data preprocessing method, which was an innovative attempt and achieved satisfactory results.
However, this study also has some limitations. Firstly, this study only distinguishes between NTM-LD and PTB. In addition, NTM-LD needs to be differentiated from various other diseases, such as viral or fungal infections, tumors, and so on, which will be even more challenging in actual clinical scenarios. Future studies will incorporate a broader range of diseases to explore the model’s ability to identify diseases across a wider spectrum. Secondly, the model is constructed based on radiomics features and does not incorporate clinical characteristics or laboratory results. Future studies will incorporate more types of data to build the model and explore more information with potential diagnostic value.
Conclusion
In conclusion, this study developed a radiomics machine learning YeoJohnson_LR (LDA) model based on CT images for differentiating NTM-LD from PTB. Compared with radiologists, this model has higher diagnostic efficacy and may potentially become a screening tool for patients with pulmonary mycobacteria. Compared with the time-consuming bacterial culture, the acquisition of CT images is instantaneous. Our model can assist in the early diagnosis when clinicians suspect that patients have NTM-LD or PTB, thereby benefiting the patients.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by Ethics Committee of Chongqing Public Health Medical Center (Approval No. 2024-019-02-KY). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
JZ: Writing – original draft, Data curation. FW: Writing – review & editing, Data curation. XZ: Writing – review & editing, Data curation. WS: Writing – review & editing, Data curation. ZH: Data curation, Writing – review & editing. YL: Data curation, Writing – review & editing. YG: Data curation, Writing – review & editing. SL: Writing – review & editing, Project administration. XL: Writing – review & editing, Project administration.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study has received funding from Chongqing medical scientific research project (Joint project of Chongqing Health Commission and Science and Technology Bureau) (No. 2024MSXM046).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2026.1721949/full#supplementary-material
References
1. World Health Organization. Global tuberculosis report 2024. Geneva:World Health Organization. (2024). Licence: CC BY-NC-SA 3.0 IGO.
2. Dheda, K, Barry, CE, and Maartens, G. Tuberculosis. Lancet. (2016) 387:1211–26. doi: 10.1016/s0140-6736(15)00151-8,
3. Mathewos, B, Kebede, N, Kassa, T, Mihret, A, and Getahun, M. Characterization of mycobacterium isolates from pulmomary tuberculosis suspected cases visiting tuberculosis reference Laboratory at Ethiopian Health and Nutrition Research Institute, Addis Ababa Ethiopia: a cross sectional study. Asian Pac J Trop Med. (2015) 8:35–40. doi: 10.1016/s1995-7645(14)60184-x,
4. Prevots, DR, Marshall, JE, Wagner, D, and Morimoto, K. Global epidemiology of nontuberculous mycobacterial pulmonary disease: a review. Clin Chest Med. (2023) 44:675–721. doi: 10.1016/j.ccm.2023.08.012,
5. Chai, J, Zhang, S, Ma, C, Mei, Q, Liu, T, Liu, J, et al. Clinical analysis and risk factors associated with poor prognosis in nontuberculous mycobacterial infection. Virulence. (2025) 16:2459313. doi: 10.1080/21505594.2025.2459313,
6. Wang, P, Marras, TK, Hassan, M, and Chatterjee, A. Incremental mortality associated with nontuberculous mycobacterial lung disease among US Medicare beneficiaries with chronic obstructive pulmonary disease. BMC Infect Dis. (2023) 23:749. doi: 10.1186/s12879-023-08689-9,
7. Furuuchi, K, Morimoto, K, Yoshiyama, T, Tanaka, Y, Fujiwara, K, Okumura, M, et al. Interrelational changes in the epidemiology and clinical features of nontuberculous mycobacterial pulmonary disease and tuberculosis in a referral hospital in Japan. Respir Med. (2019) 152:74–80. doi: 10.1016/j.rmed.2019.05.001,
8. Winthrop, KL, Marras, TK, Adjemian, J, Zhang, H, Wang, P, and Zhang, Q. Incidence and prevalence of nontuberculous mycobacterial lung disease in a large U.S. managed care health plan, 2008-2015. Ann Am Thorac Soc. (2020) 17:178–85. doi: 10.1513/AnnalsATS.201804-236OC,
9. Santin, M, Barrabeig, I, Malchair, P, Gonzalez-Luquero, L, Benitez, MA, Sabria, J, et al. Pulmonary infections with nontuberculous mycobacteria, Catalonia, Spain, 1994-2014. Emerg Infect Dis. (2018) 24:1091–4. doi: 10.3201/eid2406.172095,
10. Dahl, VN, Mølhave, M, Fløe, A, van Ingen, J, Schön, T, Lillebaek, T, et al. Global trends of pulmonary infections with nontuberculous mycobacteria: a systematic review. Int J Infect Dis. (2022) 125:120–31. doi: 10.1016/j.ijid.2022.10.013,
11. Liu, CF, Song, YM, He, WC, Liu, DX, He, P, Bao, JJ, et al. Nontuberculous mycobacteria in China: incidence and antimicrobial resistance spectrum from a nationwide survey. Infect Dis Poverty. (2021) 10:59. doi: 10.1186/s40249-021-00844-1,
12. Tan, Y, Su, B, Shu, W, Cai, X, Kuang, S, Kuang, H, et al. Epidemiology of pulmonary disease due to nontuberculous mycobacteria in southern China, 2013-2016. BMC Pulm Med. (2018) 18:168. doi: 10.1186/s12890-018-0728-z,
13. Zhu, Y, Hua, W, Liu, Z, Zhang, M, Wang, X, Wu, B, et al. Identification and characterization of nontuberculous mycobacteria isolated from suspected pulmonary tuberculosis patients in eastern China from 2009 to 2019 using an identification array system. Braz J Infect Dis. (2022) 26:102346. doi: 10.1016/j.bjid.2022.102346,
14. Ying, C, Li, X, Lv, S, Du, P, Chen, Y, Fu, H, et al. T-SPOT with CT image analysis based on deep learning for early differential diagnosis of nontuberculous mycobacteria pulmonary disease and pulmonary tuberculosis. Int J Infect Dis. (2022) 125:42–50. doi: 10.1016/j.ijid.2022.09.031,
15. Wang, L, Ding, W, Mo, Y, Shi, D, Zhang, S, Zhong, L, et al. Distinguishing nontuberculous mycobacteria from Mycobacterium tuberculosis lung disease from CT images using a deep learning framework. Eur J Nucl Med Mol Imaging. (2021) 48:4293–306. doi: 10.1007/s00259-021-05432-x,
16. Li, H, Li, T, Cai, Q, Wang, X, Liao, Y, Cheng, Y, et al. Development and validation of a radiomics nomogram for differentiating Mycoplasma pneumonia and bacterial pneumonia. Diagnostics (Basel). (2021) 11:1330. doi: 10.3390/diagnostics11081330,
17. Pan, Z, Hu, W, Dai, Z, Li, Y, Fang, Z, Shen, C, et al. Utilizing multimodal radiomics technology from cervical MRI for diagnosis of cervical spinal cord injury and spinal cord concussion. Sci Rep. (2024) 14:18686. doi: 10.1038/s41598-024-69784-8,
18. Zhao, W, Xiong, Z, Jiang, Y, Wang, K, Zhao, M, Lu, X, et al. Radiomics based on enhanced CT for differentiating between pulmonary tuberculosis and pulmonary adenocarcinoma presenting as solid nodules or masses. J Cancer Res Clin Oncol. (2023) 149:3395–408. doi: 10.1007/s00432-022-04256-y,
19. Xu, J, Guo, J, Yang, HQ, Ji, QL, Song, RJ, Hou, F, et al. Preoperative contrast-enhanced CT-based radiomics nomogram for differentiating benign and malignant primary retroperitoneal tumors. Eur Radiol. (2023) 33:6781–93. doi: 10.1007/s00330-023-09686-x,
20. Huang, ZN, Zhang, HX, Sun, YQ, Zhang, XQ, Lin, YF, Weng, CM, et al. Multi-cohort study in gastric cancer to develop CT-based radiomic models to predict pathological response to neoadjuvant immunotherapy. J Transl Med. (2025) 23:362. doi: 10.1186/s12967-025-06363-z,
21. Han, X, Guo, Y, Ye, H, Chen, Z, Hu, Q, Wei, X, et al. Development of a machine learning-based radiomics signature for estimating breast cancer TME phenotypes and predicting anti-PD-1/PD-L1 immunotherapy response. Breast Cancer Res. (2024) 26:18. doi: 10.1186/s13058-024-01776-y,
22. Li, Y, Xu, Z, Lv, X, Li, C, He, W, Lv, Y, et al. Radiomics analysis of lung CT for multidrug resistance prediction in active tuberculosis: a multicentre study. Eur Radiol. (2023) 33:6308–17. doi: 10.1007/s00330-023-09589-x,
23. Warkentin, MT, Al-Sawaihey, H, Lam, S, Liu, G, Diergaarde, B, Yuan, JM, et al. Radiomics analysis to predict pulmonary nodule malignancy using machine learning approaches. Thorax. (2024) 79:307–15. doi: 10.1136/thorax-2023-220226,
24. Daley, CL, Iaccarino, JM, Lange, C, Cambau, E, Wallace, RJ Jr, Andrejak, C, et al. Treatment of nontuberculous mycobacterial pulmonary disease: an official ATS/ERS/ESCMID/IDSA clinical practice guideline. Eur Respir J. (2020) 56:2000535. doi: 10.1183/13993003.00535-2020,
25. National Health Commission of the People’s Republic of China. Diagnostic criteria for pulmonary tuberculosis (WS 288-2017). Electron J Emerg Infect Dis. (2018) 3:59–61. doi: 10.19871/j.cnki.xfcrbzz.2018.01.017
26. Cheng, Z, Qin, L, Cao, Q, Dai, J, Pan, A, Yang, W, et al. Quantitative computed tomography of the coronavirus disease 2019 (COVID-19) pneumonia. Radiol Infect Dis. (2020) 7:55–61. doi: 10.1016/j.jrid.2020.04.004,
27. Ouyang, X, Huo, J, Xia, L, Shan, F, Liu, J, Mo, Z, et al. Dual-sampling attention network for diagnosis of COVID-19 from community acquired pneumonia. IEEE Trans Med Imaging. (2020) 39:2595–605. doi: 10.1109/tmi.2020.2995508,
28. Liu, Q, Pan, X, An, H, Du, J, Li, X, Sun, W, et al. Building a model for the differential diagnosis of non-tuberculous mycobacterial lung disease and pulmonary tuberculosis: a case-control study based on immunological and radiological features. Int Immunopharmacol. (2023) 124:111064. doi: 10.1016/j.intimp.2023.111064,
29. Zhang, W, Li, T, Liu, H, Cao, X, Yan, H, and Gao, Y. LASSO regression-based nomogram for distinguishing nontuberculous mycobacterial pulmonary disease from pulmonary tuberculosis: a clinical risk prediction model. Sci Rep. (2025) 15:21099. doi: 10.1038/s41598-025-08456-7,
30. Li, HL, Zhi, RZ, Liu, HS, Wang, M, and Yu, SJ. Multimodal machine learning-based model for differentiating nontuberculous mycobacteria from mycobacterium tuberculosis. Front Public Health. (2025) 13:1470072. doi: 10.3389/fpubh.2025.1470072,
31. Bents, SJ, Mercaldo, RA, Powell, C, Henkle, E, Marras, TK, and Prevots, DR. Nontuberculous mycobacterial pulmonary disease (NTM PD) incidence trends in the United States, 2010-2019. BMC Infect Dis. (2024) 24:1094. doi: 10.1186/s12879-024-09965-y,
32. Hoza, AS, Mfinanga, SG, Rodloff, AC, Moser, I, and König, B. Increased isolation of nontuberculous mycobacteria among TB suspects in northeastern, Tanzania: public health and diagnostic implications for control programmes. BMC Res Notes. (2016) 9:109. doi: 10.1186/s13104-016-1928-3,
33. Kim, C, Park, SH, Oh, SY, Kim, SS, Jo, KW, Shim, TS, et al. Comparison of chest CT findings in nontuberculous mycobacterial diseases vs. Mycobacterium tuberculosis lung disease in HIV-negative patients with cavities. PLoS One. (2017) 12:e0174240. doi: 10.1371/journal.pone.0174240,
34. Zhou, L, Wang, Y, Zhu, W, Zhao, Y, Yu, Y, Hu, Q, et al. A retrospective study differentiating nontuberculous mycobacterial pulmonary disease from pulmonary tuberculosis on computed tomography using radiomics and machine learning algorithms. Ann Med. (2024) 56:2401613. doi: 10.1080/07853890.2024.2401613,
35. vom Steeg, LG, and Klein, SL. SeXX matters in infectious disease pathogenesis. PLoS Pathog. (2016) 12:e1005374. doi: 10.1371/journal.ppat.1005374,
36. Giller, D, Scherbakova, G, Enilenis, I, Martel, I, Kesaev, O, Koroev, V, et al. A comparison of clinical and radiographic signs of nontuberculous mycobacterial pulmonary disease, destructive drug-resistant pulmonary tuberculosis and a combination of nontuberculous Mycobacterium pulmonary disease and pulmonary tuberculosis. Pathogens. (2023) 12:887. doi: 10.3390/pathogens12070887,
37. Klann, E, Beal, SG, and Tremblay, EE. Evaluating differences in tuberculosis and nontuberculous mycobacterial lung disease in Florida. Am J Infect Control. (2019) 47:1324–8. doi: 10.1016/j.ajic.2019.04.171
38. Xing, Z, Ding, W, Zhang, S, Zhong, L, Wang, L, Wang, J, et al. Machine learning-based differentiation of nontuberculous mycobacteria lung disease and pulmonary tuberculosis using CT images. Biomed Res Int. (2020) 2020:6287545. doi: 10.1155/2020/6287545,
39. Pfyffer, GE, and Wittwer, F. Incubation time of mycobacterial cultures: how long is long enough to issue a final negative report to the clinician? J Clin Microbiol. (2012) 50:4188–9. doi: 10.1128/jcm.02283-12
40. Yuan, MK, Chang, CY, Tsai, PH, Lee, YM, Huang, JW, and Chang, SC. Comparative chest computed tomography findings of non-tuberculous mycobacterial lung diseases and pulmonary tuberculosis in patients with acid fast bacilli smear-positive sputum. BMC Pulm Med. (2014) 14:65. doi: 10.1186/1471-2466-14-65,
41. Liang, Q, Jiang, X, Jia, J, Zhao, L, Li, Y, Wang, F, et al. An early and trustable indicator suggestive of non-tuberculosis mycobacteria isolation in a high tuberculosis burden setting. J Infect. (2024) 88:106149. doi: 10.1016/j.jinf.2024.106149
42. Chu, HQ, Li, B, Zhao, L, Huang, DD, Zhang, ZM, Xu, JF, et al. Chest imaging comparison between non-tuberculous and tuberculosis mycobacteria in sputum acid fast bacilli smear-positive patients. Eur Rev Med Pharmacol Sci. (2015) 19:2429–39. doi: 10.1183/13993003.congress-2015.pa2674
43. Gopalaswamy, R, Shanmugam, S, Mondal, R, and Subbian, S. Of tuberculosis and non-tuberculous mycobacterial infections - a comparative analysis of epidemiology, diagnosis and treatment. J Biomed Sci. (2020) 27:74. doi: 10.1186/s12929-020-00667-6,
44. Christensen, J, Prosper, AE, Wu, CC, Chung, J, Lee, E, Elicker, B, et al. ACR lung-RADS v2022: assessment categories and management recommendations. Chest. (2024) 165:738–53. doi: 10.1016/j.chest.2023.10.028,
45. Xu, YH, Dong, JH, An, WM, Lv, XY, Yin, XP, Zhang, JZ, et al. Clinical and computed tomographic imaging features of novel coronavirus pneumonia caused by SARS-CoV-2. J Infect. (2020) 80:394–400. doi: 10.1016/j.jinf.2020.02.017,
46. Guan, CS, Lv, ZB, Li, JJ, Du, YN, Chen, H, Cui, T, et al. CT appearances, patterns of progression, and follow-up of COVID-19: evaluation on thin-section CT. Insights Imaging. (2021) 12:73. doi: 10.1186/s13244-021-01019-0,
47. Hu, Y, Zhong, L, Liu, H, Ding, W, Wang, L, Xing, Z, et al. Lung CT-based multi-lesion radiomic model to differentiate between nontuberculous mycobacteria and Mycobacterium tuberculosis. Med Phys. (2025) 52:1086–95. doi: 10.1002/mp.17537,
48. Koh, WJ, Lee, KS, Kwon, OJ, Jeong, YJ, Kwak, SH, and Kim, TS. Bilateral bronchiectasis and bronchiolitis at thin-section CT: diagnostic implications in nontuberculous mycobacterial pulmonary infection. Radiology. (2005) 235:282–8. doi: 10.1148/radiol.2351040371,
49. Yan, Q, Wang, W, Zhao, W, Zuo, L, Wang, D, Chai, X, et al. Differentiating nontuberculous mycobacterium pulmonary disease from pulmonary tuberculosis through the analysis of the cavity features in CT images using radiomics. BMC Pulm Med. (2022) 22:4. doi: 10.1186/s12890-021-01766-2,
50. Yan, Q, Zhao, W, Kong, H, Chi, J, Dai, Z, Yu, D, et al. Ct-based radiomics analysis of consolidation characteristics in differentiating pulmonary disease of non-tuberculous mycobacterium from pulmonary tuberculosis. Exp Ther Med. (2024) 27:112. doi: 10.3892/etm.2024.12400,
51. Park, M, Lee, Y, Kim, S, Kim, YJ, Kim, SY, Kim, Y, et al. Distinguishing nontuberculous mycobacterial lung disease and Mycobacterium tuberculosis lung disease on X-ray images using deep transfer learning. BMC Infect Dis. (2023) 23:32. doi: 10.1186/s12879-023-07996-5,
Glossary
AFB - Acid-fast bacilli
AUC - area under the curve
BMI - body mass index
CI - confidence interval
CT - computed tomography
DSC - Dice similarity coefficient
DT - Decision tree
DV - DependenceVariance
GLCM - gray level co-occurrence matrix
GLDM - gray level dependence matrix
GLRLM - gray level run length matrix
GLSZM - gray level size zone matrix
GLV - GrayLevelVariance
IDI - integrated discrimination improvement
IGRA - interferon-γ release assay
KNN - k-Nearest Neighbor
LASSO - Least Absolute Shrinkage and Selection Operator
LR - Logistic Regression
LTBI - latent tuberculosis infection
ML - machine learning
NGTDM - neighboring gray tone difference matrix
NRI - net reclassification index
NTM - nontuberculous mycobacteria
NTM-LD - non-tuberculous mycobacterial lung disease
PTB - pulmonary tuberculosis
QDA - Quadratic Discriminant Analysis
RF - Random Forest
ROC - receiver operating characteristic
SVM - Support Vector Machin
XGBoost - eXtreme Gradient Boosting
ZP - ZonePercentage
Keywords: computed tomography, machine learning, non-tuberculous mycobacterial lung diseases, pulmonary tuberculosis, radiomics
Citation: Zheng J, Wang F, Zeng X, Shu W, He Z, Li Y, Gao Y, Lv S and Liu X (2026) Distinguishing nontuberculous mycobacterial lung disease from pulmonary tuberculosis using radiomics machine learning models from CT images. Front. Med. 13:1721949. doi: 10.3389/fmed.2026.1721949
Edited by:
Francisco Epelde, Parc Taulí Foundation, SpainReviewed by:
Junwei Liu, Guangzhou National Laboratory, ChinaRoslidar Roslidar, Syiah Kuala University, Indonesia
Copyright © 2026 Zheng, Wang, Zeng, Shu, He, Li, Gao, Lv and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xueyan Liu, ZnNrbHh5MjAyNUAxNjMuY29t; NjI5NDk3OTgwQHFxLmNvbQ==
Fang Wang2