Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 14 November 2025

Sec. Microbial Immunology

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1652747

This article is part of the Research TopicAdvancements in Combatting Nontuberculous Mycobacterial Infections: Mechanisms and TreatmentsView all 4 articles

A diagnostic model for NTM disease in HIV-positive patients: a machine learning-based analysis with novel inflammatory markers

Longfen Li,,&#x;Longfen Li1,2,3†Chunjing Shi,,&#x;Chunjing Shi1,2,3†Xing Liu&#x;Xing Liu4†Wenming Li,,Wenming Li1,2,3Yun Luo,,Yun Luo1,2,3Huajie Zhang,,Huajie Zhang1,2,3Ge Wang,,Ge Wang1,2,3Yanhong Zhao,,Yanhong Zhao1,2,3Yuanqing Huang,,Yuanqing Huang1,2,3Juan Yang,,Juan Yang1,2,3Jiao Yang,,Jiao Yang1,2,3Shengan Li*Shengan Li5*Lingjun Shen,,*Lingjun Shen1,2,3*
  • 1Tuberculosis Department 3, The Third People's Hospital of Kunming, Kunming, China
  • 2Tuberculosis Department 3, Yunnan Provincial Clinical Medical Center for Infectious Diseases, Kunming, China
  • 3Tuberculosis Department 3, Medical Technology Center for the Diagnosis and Treatment of Non-tuberculous Mycobacterial Disease, Kunming, China
  • 4Department of Pharmacy, The Third People's Hospital of Kunming, Kunming, China
  • 5Department of Pathogen Biology and Immunology, Faculty of Basic Medical Science, Kunming Medical University, Kunming, China

Background: Globally, the prevalence of nontuberculous mycobacterial (NTM) co-infection among HIV-positive patients is increasing. The diagnosis of HIV-positive patients co-infected with NTM relies on mycobacterial culture and identification, as well as molecular biology techniques. However, culture-based methods are technically challenging, time-consuming, and costly. Therefore, it is urgent to explore early diagnostic methods for HIV-positive patients co-infected with NTM. To address this issue, the present study had aimed to explore new approaches for the early diagnosis of NTM disease in HIV-positive patients. This study aimed to thoroughly investigate the potential value of novel inflammatory markers in the early diagnosis of nontuberculous mycobacterial (NTM) disease among HIV-positive patients using machine learning techniques, thereby providing a scientifically sound and clinically feasible diagnostic basis for the early identification of this condition in clinical practice.

Methods: A retrospective analysis was conducted on 4,496 HIV-infected individuals admitted to the Third People’s Hospital of Kunming, between August 1, 2021, to August 31, 2024.Based on the inclusion and exclusion criteria, a total of 78 HIV-infected individuals with NTM disease were finally included as the experimental group, and 187 HIV-positive patients without NTM disease were included as the control group. Clinical data of the participants were collected. For comparisons between groups, the chi-square (χ2) test or the nonparametric Mann-Whitney U test was used as appropriate. Indicators with p < 0.05 in the comparison between the two groups were subjected to LASSO regression for variable screening. Subsequently, Logistic regression(LR), Random Forest(RF), and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) were employed for further variable selection. To assess multicollinearity among variables, tolerance and variance inflation factor (VIF) were used as criteria. LR, RF, and SVM models were established. All the subjects included in the study were assigned to the training set, and 3/10 of the subjects were randomly selected as the validation set. The area under the receiver operating characteristic (ROC) curve (AUC) was used to evaluate the discrimination of the models, and the DeLong test was used for comparing AUCs between models. The Hosmer-Lemeshow test and calibration curves were used to evaluate the calibration of the models. Decision Curve Analysis (DCA) and Clinical Impact Curves(CIC) were employed to assess the clinical utility of the models. SHapley Additive exPlanations(SHAP) was used for models visualization and interpretation.

Results: Among the 265 participants included in the study, there were 52 males and 26 females in the experimental group of 78 patients, with an average age of 44.5 ± 10.34 years. In the control group of 187 patients, there were 108 males and 79 females, with an average age of 49.8 ± 12.20 years. After differential analysis and LASSO regression screening, WBC, SAA, NLR, MLR, PLR, CAR, and CALLY were selected as the 7 indicators, with no multicollinearity among them. Subsequently, using LR, RF, and SVM for further screening, we established three early diagnostic prediction models for NTM disease in HIV-positive patients. In the training set, The AUC values(AUCs) indicated that the predictive performance of the three models was as follows: Logistic regression model (AUC: 0.850, 95% CI: 0.797–0.903), Random Forest model (AUC: 0.849, 95% CI: 0.797–0.890), and SVM model (AUC: 0.813, 95% CI: 0.750–0.876). The sensitivities were 69.2%, 71.8%, and 76.9%. The specificity values were 89.8%, 85.0%, and 78.6%. The Youden index scores were recorded as 0.590, 0.568, and 0.555. The Positive Likelihood Ratios (LR+) were found to be 6.780, 4.787, and 3.590. The Negative Likelihood Ratios (LR-) were determined to be 0.343, 0.332, and 0.294. The comparison of the AUC values among the three models indicated that there were no statistically significant difference in the predictive efficacy between them. The calibration curves indicated that the predicted probabilities were generally aligned with the observed actual probabilities. With regard to quantitative evaluation, the results of the Hosmer-Lemeshow test were as follows: for the LR model, χ2 = 8.078, p = 0.426; for the RF model, χ2 = 13.081, p = 0.1091; and for the SVM model, χ2 = 0.620, p < 0.001.These findings indicate that both the LR and random forest models exhibit good calibration and accuracy, whereas the SVM model shows poor calibration. Consequently, the SVM model was excluded from further consideration. Consequently, the SVM model was deemed unsuitable and subsequently discarded. The clinical decision curve analysis showed that both the LR and RF models could provide benefits to patients, demonstrating comparable levels of advantage. In the validation set, the ROC curve indicated that the AUC of the LR model was 0.873 (95% CI: 0.782 - 0.961) and that of the RF model was 0.860 (95% CI: 0.768 - 0.952). The calibration curve showed that the predictions of these two models tend to be consistent with the actual values. Hosmer-Lemeshow test: LR model: χ2 = 5.111, p = 0.746; RF model: χ2 = 12.489, p = 0.131, indicating that both models have good calibration. The clinical decision curve shows that LR and RF can also bring clinical benefits to patients in the validation set. Both the LR and RF models demonstrated good predictive performance, calibration, and clinical applicability in both the training and validation sets, indicating that these two models have good stability. The Shapley Additive exPlanations (SHAP) were employed to illustrate the decision-making process of the models. The SHAP summary plot revealed that, in the LR model, the feature with the greatest contribution was WBC, while the feature with the least contribution was CAR. In the RF model, the feature with the greatest contribution was PLR, whereas the feature with the least contribution was CALLY.

Conclusion: We have found that WBC, NLR, PLR, CAR, and CALLY can assist in the early identification of HIV-positive patients with NTM disease. Based on established parameters, we have successfully developed two early diagnostic prediction models for HIV-positive patients coexisting with NTM disease. Both models demonstrate strong discrimination, calibration, clinical applicability and stability. In clinical practice, for HIV patients with suspected concurrent mycobacterial infection, after excluding TB, these two models can be used for screening and early identification of HIV concurrent NTM disease.

1 Introduction

According to the Joint United Nations Programme on HIV/AIDS, by the end of 2023, a total of 39.9 million people were infected with HIV. In 2023, 1.3 million people had become newly infected with HIV, and 630,000 people died from AIDS-related diseases (1). By the time of the latest reports, nontuberculous mycobacteria (NTM) were classified into approximately 200 species and 13 subspecies, and NTM disease were known to significantly impair patients’ quality of life (2). In clinical practice, for all cases where imaging and clinical manifestations indicated the need to “rule out mycobacterial infection,” our hospital routinely performed rapid tests such as QuantiFERON-TB, Xpert® MTB/RIF (or Xpert® MTB/RIF Ultra), TB-LAMP, and TB-SAT RNA simultaneously. These tests enable prompt confirmation or exclusion of tuberculosis. However, when all the above results were negative but the patient’s symptoms continued to worsen and the imaging showed progression, the most challenging next question for the clinical practice was whether there is an infection of NTM. In a management guideline for HIV-positive patients with NTM disease from South Africa, it was pointed out that the prevalence of NTM in HIV-positive patients were increasing globally, and species identification and drug susceptibility testing were crucial in the diagnosis and treatment of NTM infections (3). It was noted that NTM cultivation is highly demanding, difficult, and time-consuming, with different species requiring distinct culture media and varying incubation periods (4). Moreover, the positive culture rate for NTM disease patients is only about 6.3% (5). Therefore, this study focuses on how to identify NTM early in HIV-suspected mycobacterium-infected patients after TB has been ruled out, in response to the urgent need in real clinical scenarios.

New inflammatory markers are those beyond the traditional indicators such as C-Reactive Protein (CRP), Interleukin 6 (IL6), and Procalcitonin (PCT). They include composite indicators like the Systemic Immune-Inflammation Index (SII), Neutrophil-to-Lymphocyte Ratio (NLR), and Platelet-to-Lymphocyte Ratio (PLR). These markers can provide more information about the inflammatory state and are cost-effective, convenient, and rapid. They have been widely recognized and applied in a variety of diseases (6, 7). However, their value and application in HIV-associated NTM infections have not yet been fully explored. There is an urgent need for in-depth research to fully uncover their potential in the early diagnosis of HIV-associated NTM disease.

For its ability to make automated decisions, handle data efficiently, perform massive computations, and deliver excellent predictive power, machine learning is highly favored. Shapley Additive exPlanations (SHAP) is a method for interpreting the predictions of machine learning models. It is grounded in the concept of Shapley values from game theory, assigning importance values to each feature of the model. By quantifying the impact of each feature on the target variable, SHAP elucidates the prediction process of the model (8).

Based on the clinical necessity for early diagnosis of NTM infections in HIV-positive patients, and taking into account the advantages offered by novel inflammatory indicators—such as economic affordability, easy accessibility, and the high efficiency along with substantial computational capabilities of machine learning in data processing—this paper aims to leverage machine learning to thoroughly investigate the clinical significance of these new inflammatory markers in HIV-positive individuals diagnosed with NTM disease. We will develop an early prediction model for NTM disease specifically tailored for HIV-positive patients and visualize the interpretability of this diagnostic model using the SHAP method.

2 Materials and methods

2.1 Sample size estimation

Based on the current literature, the prevalence of NTM disease among HIV-positive patients ranges from 11.8% to 47% (9, 10).For this study, the prevalence of NTM disease among HIV-positive patients was determined to be 29.4%. Drawing on a meta-analysis of risk factors for NTM disease (11), an odds ratio (OR) of 4.46 was chosen for a rough estimation of the sample size.

2.2 Study participants

A retrospective analysis was conducted on 4,496 HIV-positive patients who were hospitalized at the Third People’s Hospital of Kunming from August 1, 2021, to August 31, 2024. Inclusion criteria: HIV infection was diagnosed according to the diagnostic criteria specified in the “Chinese HIV/AIDS Diagnosis and Treatment Guidelines (2024)” (12). NTM disease was diagnosed in line with the criteria detailed in the “Guidelines for the Diagnosis and Treatment of Non-tuberculous Mycobacterial Disease (2020)” (13). The exclusion criteria were as follows: (1) patients younger than 18 years old or older than 80 years old; (2) pregnant or breastfeeding women; (3) patients with a hospital stay of less than 24 hours or incomplete data; and (4) patients with other severe infections, severe cardiopulmonary diseases, or malignancies. Based on the inclusion and exclusion criteria, a total of 78 patients with HIV-associated NTM disease were included as the experimental group, and 187 patients with HIV but without NTM disease were included as the control group(See Figure 1).

Figure 1
Flowchart of AIDS-infected patients hospitalized from August 2021 to August 2024. Out of 4496 cases, 4231 were excluded due to exclusion criteria, including duplicate cases and concurrent infections. Ultimately, 265 cases were included. These were divided into two groups based on the presence of NTM disease: 78 in the experimental group and 187 in the control group.

Figure 1. Study participants.

From 2021 to 2024, a total of 4496 HIV patients were hospitalized in our hospital. After excluding 4,231 cases, 265 cases remained. These cases were further divided into an experimental group of 78 cases and a control group of 187 cases based on whether they were complicated with NTM.

2.3 Laboratory test

Routine blood tests were performed using the Sysmex XN400 analyzer (Sysmex Corporation, Japan), with 3–5 ml of whole blood collected within 24 hours of admission.

2.4 Data collection

We collected data on the gender, age, ethnicity, medical history, and laboratory test results of the study participants: (1) Laboratory data included the following test indicators: white blood cell count (WBC), monocyte count (M), lymphocyte count (L), neutrophil count (N), platelet count (PLT), albumin (ALB), creatinine (CREA), serum amyloid A (SAA), procalcitonin (PCT), C-reactive protein (CRP), and interleukin-6 (IL-6).

(2)Calculation of novel inflammatory markers: see Table 1 below.

Table 1
www.frontiersin.org

Table 1. Calculation of novel inflammatory markers.

2.5 Statistical analysis

Data were analyzed using SPSS 27.0, R 4.4.2, and Python 3.12. Variables with a missing data rate of ≥70% were discarded, and those with a missing rate of <30% were imputed using multiple imputation. Categorical data were expressed as rates and proportions, and intergroup comparisons were performed using the chi-square test. Continuous data that followed a normal distribution were expressed as mean ± standard deviation x¯±s and intergroup comparisons were conducted using the t-test. The non-normally distributed data were described as M(Q1, Q3) and compared using a non-parametric test for two independent samples. Variable selection was performed using Least Absolute Shrinkage and Selection Operator regression(LASSO), Logistic regression(LR), Random Forest(RF), and Support Vector Machine - Recursive Feature Elimination (SVM-RFE). LASSO was implemented using the “glmnet” package with the following parameters: alpha = 1, nfolds = 10, and standardize = TRUE. LR was carried out using the “stats” package with the “logit” link function. The RF was built using the “randomForest” package with ntree = 500 and importance = TRUE. SVM-RFE was performed using the “e1071” and “caret” packages with the parameter setting of method = “svmLinear”. To assess multicollinearity among variables, the tolerance and variance inflation factor (VIF) were used. LR, RF, and SVM models were constructed. All the subjects included in the study were assigned to the training set, and 3/10 of the subjects were randomly selected as the validation set. Model evaluation was applied to R packages such as pROC, caret, CalibrationCurves, ResourceSelection, rmda, ggplot2 and et al. The discrimination of the models was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC), and comparisons of AUC between models were conducted using the DeLong test. The calibration of the models was evaluated using the Hosmer-Lemeshow test and calibration curves, while the clinical utility of the models was assessed using decision curve analysis (DCA) and clinical impact curves(CIC). The model was visualized and interpreted using SHAP. The SHAP library was applied, and the summary_plot, dependence_plot, force_plot, decision_plot, and heatmap functions were called. A two-sided test was used, with a significance level of α = 0.05. A p-value of less than 0.05 was considered statistically significant.

3 Results

3.1 Comparison of baseline characteristics between the experimental group and the control group

Among the 265 study subjects, the analysis of baseline characteristics had shown that: as seen in Table 2, there was no statistically significant difference between the experimental and control group in terms of gender, ethnicity, route of HIV infection, marital and reproductive, smoking history, as well as the history of diabetes, hypertension, and alcohol consumption (p > 0.05). Only age had statistically significant differences between the two groups (p < 0.05). Among them, in the experimental group of 78 patients, there were 52 males and 26 females, with a mean age of 44.5 ± 10.337 years, and 46 patients (59.0%) were smokers; in the control group of 187 patients, there were 108 males and 79 females, with a mean age of 49.8 ± 12.203 years, and 84 patients (44.9%) were smokers.

Table 2
www.frontiersin.org

Table 2. Comparison of baseline characteristics of the study subjects.

The comparison of traditional and novel inflammatory markers between the two groups showed that: as presented in Table 3, the traditional inflammatory markers WBC, CRP, SAA, IL-6, and PCT had statistically significant differences between the two groups (p < 0.05); among the 15 novel inflammatory markers, SII, NLR, MLR, PLR, CAR, CLR, SIRI, CALLY, and ILI had statistically significant differences between the two groups (p < 0.05).

Table 3
www.frontiersin.org

Table 3. Comparison of traditional and novel inflammatory markers between the control and experimental group.

3.2 WBC, SAA, NLR, MLR, PLR, CAR, and CALLY were identified as significant factors

Excluding the indicators NEUT, NPR, PAR, ACR, NAR, PIV, and WBI due to their lack of statistical significance, Lasso regression screening was performed on the remaining 14 inflammatory markers of both new and old types. The methodology is illustrated in Figure 2A. Through a process of 10-fold cross-validation, seven indicators—WBC, SAA, NLR, MLR, PLR, CAR, and CALLY—were ultimately selected (as depicted in Figure 2B). Subsequently, multicollinearity tests were conducted on these seven indicators. The absolute values of the correlation coefficients (R) between each pair of indicators were all found to be less than 0.8 (as shown in Figure 3), and the Variance Inflation Factor (VIF) for each indicator was below 3. These results indicate that there is no multicollinearity present among the selected seven indicators.

Figure 2
Panel A shows a plot of coefficients against log lambda, with several colored lines representing different coefficients decreasing and converging as log lambda increases. Panel B displays a binomial deviance plot against log lambda, featuring a red curve with standard error bars, indicating deviance decreases then increases, creating a U-shape.

Figure 2. Variable selection by Lasso regression. (A) λ and Partial Regression Coefficient Plot. The horizontal axis represented log(λ), the upper axis corresponded to the number of non-zero coefficients; the vertical axis represented the partial regression coefficient values of each variable. It indicated the order in which variables enter the LASSO regression model and the coefficient trajectory as λ decreases. (B) Ten-fold Cross-validation Error Curve. The horizontal axis represented log(λ), the upper axis corresponded to the number of non-zero coefficients; the vertical axis represented the mean squared prediction error (MSE) of the model. The left dotted line indicated the log(λ) at the minimum error, retaining 7 variables; the right dotted line indicated the log(λ) under the “1-standard error” criterion, retaining 5 variables.

Figure 3
Correlation matrix chart with circular heatmap showing relationships among SAA, CAR, NLR, MLR, PLR, WBC, and CALLY. Values range from 1.00 (perfect correlation) to negative values. Red indicates positive correlations, and blue indicates negative correlations, with intensity reflecting strength.

Figure 3. Heatmap of correlations among the 7 selected indicators. The various inflammatory indicators of behavior were listed as corresponding indicators. The figure showed the Pearson correlation coefficient(r), with the color gradient ranging from blue (r = –1) to red (r = +1), and white indicating r=0.

3.3 Three diagnostic models, specifically LR, RF, and SVM, were developed to forecast the occurrence of HIV in conjunction with NTM

The above-mentioned 7 indicators were included in the binary logistic regression analysis, with the method selected as “Forward: LR.” Ultimately, the variables entered into the equation were shown to be WBC, NLR, PLR, and CAR (as shown in Table 4). This indicates that these 4 inflammatory indicators are independent influencing factors for HIV-infected patients with NTM coinfection. Using the 4 indicators as key factors, a LR model was established. The 7 indicators selected by Lasso regression were ranked by importance using random forests, and the top 4 indicators (PLR, CALLY, NLR, WBC) were chosen as key factors to establish a RF model. The 7 indicators were also ranked by importance using SVM-RFE, and the top 4 indicators (PLR, CALLY, NLR, CAR) were selected as key factors to establish an SVM model.

Table 4
www.frontiersin.org

Table 4. Analysis of independent influencing factors for HIV coinfected with NTM.

3.4 Comparison and evaluation of models

3.4.1 The three models—LR, RF, and SVM—demonstrate strong predictive performance in forecasting the occurrence of HIV in conjunction with NTM

The ROC curves were generated, and the areas under the curve (AUC) were compared to assess the predictive efficacy of the LR,RF and SVM models for the patients in the experimental group. The predictive efficacy in descending order was as follows (see Table 5, Figure 4): the LR model (AUC: 0.850, 95%CI: 0.797 - 0.903), the RF model (AUC: 0.849, 95%CI: 0.797 - 0.890), the SVM model (AUC: 0.813, 95%CI: 0.750 - 0.876). The sensitivity rates were recorded at 69.2%, 71.8% and 76.9%. The specificity rates were noted as 89.8%, 85.0% and 78.6%. The Youden index were 0.590, 0.568 and 0.555. The Positive Likelihood Ratio (LR+) were found to be 6.780, 4.787 and 3.590. The Negative Likelihood Ratio (LR-) were recorded as 0.343, 0.332 and 0.294. These findings indicate that the three models possess a certain predictive value for HIV complicated by NTM.

Table 5
www.frontiersin.org

Table 5. ROC curves analysis of the three models for predicting the risk of HIV complicated with NTM.

Figure 4
ROC curve comparing three models: LR (red, AUC 0.850), RF (blue, AUC 0.849), and SVM (green, AUC 0.813). Sensitivity is on the y-axis, and 1 - Specificity on the x-axis.

Figure 4. ROC curves of the three models for predicting the risk of HIV complicated with NTM. The x-axis represented 1-specificity, and the y-axis represented sensitivity. The red curve illustrated the ROC curve of the logistic regression model used for predicting HIV in conjunction with NTM disease. The blue curve depicted the ROC curve of the random forest prediction model for forecasting HIV concurrent with NTM disease. Lastly, the green curve represented the ROC curve of the support vector machine model aimed at predicting HIV alongside NTM disease.

When the AUCs of the three models were compared (see Table 6), it was found that the AUC of the LR model was higher than that of the other two models. However, the differences in AUC among the three models were not statistically significant (p> 0.05).

Table 6
www.frontiersin.org

Table 6. Comparison of ROC curves for the three models.

3.4.2 The LR and RF models demonstrated strong calibration in predicting the concurrent occurrence of HIV and NTM. However, the performance of the SVM model was found to be unsatisfactory

Calibration curves were generated for each of the three models, as illustrated in Figure 5. They were observed that the predicted and actual values of the LR and RF models exhibit a strong alignment, closely following the diagonal line. However, the calibration curve of the SVM model exhibited a deviation from the diagonal line.

Figure 5
Three calibration plots labeled A, B, and C show predicted versus observed probabilities. Each plot contains dashed lines for the ideal model, red lines for apparent predictions, and green lines for bias-corrected predictions, illustrating model performance. Axes are labeled from zero to one.

Figure 5. Calibration curves of the three models. The X-axis represented the predicted probability, and the Y-axis represented the actual observed occurrence probability. The closer the curve was to the 45° dotted line (Ideal line), the more consistent the predicted probability was with the actual incidence rate. (A) Calibration curve of the Logistic regression model; (B) Calibration curve of the Random Forest model; (C) Calibration curve of the SVM model.

The Hosmer-Lemeshow test was used to quantify the differences between the predicted and actual values. The test results were as follows: for LR model, χ2 = 8.078, p = 0.426, indicating that there was no statistically significant difference between the predicted and actual values of this model; for RF model, χ2 = 13.081, p = 0.1091, indicating that there was no statistically significant difference between the predicted and actual values of this model.

Considering both the calibration curves and the Hosmer-Lemeshow test, the LR and RF models demonstrated satisfactory calibration. These two models demonstrated a high level of accuracy in distinguishing the presence of NTM disease.

For the SVM model,χ2 = 0.620, p < 0.001, indicating that there was a statistically significant difference between the predicted and actual values of this model. This finding suggests that the SVM model exhibited poor accuracy in distinguishing whether NTM disease was present. Therefore, this model was discarded.

3.4.3 Both the LR and RF models demonstrated considerable potential for application in clinical settings

The clinical utility of the models was evaluated through the use of clinical decision curves, as illustrated in Figure 6. The decision curves of the two models were predominantly situated above the two extreme lines. With the optimal cutoff values of 0.630 and 0.653 for the two models as the threshold probabilities, the net benefit rates of the two models were greater than zero and higher than the net benefit rates under the two extreme scenarios. The net benefit rates of the two models were close to each other. This indicated that both models could benefit patients in a similar manner and had good clinical applicability.

Figure 6
A decision curve analysis graph showing net benefit versus high risk threshold for two models: LR model in red and RF model in blue. Both lines gradually decrease, indicating similar performance. Additional lines represent “All” in gray and “None” in black for comparison.

Figure 6. Clinical Decision Curve. The X-axis represented the high risk threshold, and the Y-axis represented the net benefit. The curve showed that at different risk thresholds, the clinical net benefit of the models were higher than that of the “intervene all” or “no intervention” strategies. The red curve represented the clinical decision curve of the logistic regression model; the blue curve represented the clinical decision curve of the random forest model. They suggested that the two models have good clinical practicability.

3.4.4 In the validation set, both the LR and RF models exhibited outstanding predictive performance, calibration accuracy, and clinical applicability

In the validation set, first, the AUCs were plotted, and the predictive performance of the models in sequence (Figure 7) was as follows: the LR model (AUC: 0.873, 95% CI: 0.782-0.961) and the RF model (AUC: 0.860, 95% CI: 0.768-0.952). The AUCs of both models were greater than 0.8, indicating that these two models also had good predictive performance in the validation set. Secondly, the calibration curves for the LR and RF models were plotted using the validation set, as illustrated in Figure 8, indicating that the predictions of these two models tend to be consistent with the actual values. The Hosmer-Lemeshow test showed that for the LR model: χ2 = 5.111, p = 0.746, and for the RF model: χ2 = 12.489, p = 0.131, indicating that there was no statistically significant difference between the predicted values and the actual observed values for these two models. The results of the comprehensive calibration curve and Hosmer-Lemeshow test indicated that the LR and RF models also have good calibration in the validation set. Finally, the clinical decision curves of the LR and RF models were plotted, as shown in Figure 9: The decision curves of both models were mostly above the two extreme lines, indicating that LR and RF also demonstrated the potential to benefit patients in the validation set. Both the LR and RF models demonstrated good predictive performance, calibration, and clinical utility in both the training and validation sets, indicating that these two models have good stability, as shown in Figure 10.

Figure 7
ROC curve comparing RF and LR models, showing sensitivity versus 1-specificity. The RF model has an AUC of 0.860, and the LR model has an AUC of 0.873. Both models are plotted with the RF model in red and the LR model in blue. The diagonal line represents random chance.

Figure 7. The ROC curves of the LR and RF models for predicting the risk of HIV combined with NTM in the validation set. The x-axis represented 1-specificity, and the y-axis represented sensitivity. The red curve illustrated the ROC curve of the random forest model for predicting HIV concurrent with NTM disease in the validation set, while the blue curve represented the ROC curve of the logistic regression model for predicting HIV concurrent with NTM disease in the same validation set.

Figure 8
Two calibration plots show observed versus predicted probabilities. Plot A and Plot B both have three lines: a dashed black line for the ideal model, a red line for the apparent model, and a green line for the bias-corrected model. Each plot features similar trends with lines closely aligning diagonally. Probability scales range from 0 to 1 on both axes. Vertical bars appear at the top of each plot, indicating data distribution.

Figure 8. Calibration curves of LR and RF models in the validation set. (A) The X-axis represented the predicted probability, and the Y-axis represented the actual observed occurrence probability. The closer the curve was to the 45° dotted line (Ideal line), the more consistent the predicted probability was with the actual incidence rate. Calibration curve of the logistic regression model in the validation set; (B) Calibration curve of the random forest model in the validation set.

Figure 9
Line chart showing net benefit versus high risk threshold for two models and control groups. The red line represents the LR model, blue for the RF model, grey for the “All” group, and black for “None.” The LR and RF models show higher net benefits compared to the “All” and “None” groups, with net benefit decreasing as the threshold increases.

Figure 9. Clinical decision curves of the LR and RF models in the validation set. The X-axis represented the high risk threshold, and the Y-axis represented the net benefit. The curve showed that at different risk thresholds, the clinical net benefit of the models were higher than that of the “intervene all” or “no intervention” strategies. The red curve represented the clinical decision curve of the logistic regression model in the validation set; the blue one was that of the random forest model in the validation set. This indicated that in the validation set, both models also have good clinical practicability.

Figure 10
Graphs A and B are SHAP value heatmaps showing the impact on model output for different features across instances. Each graph includes a line plot of the function f(x) above the heatmap. The features include PLR, WBC, NLR, and CAR or CALLY. The SHAP values range from negative (blue) to positive (red), indicating the influence on model predictions. The x-axis represents instances from 0 to 250, while the color bar to the right of each heatmap denotes the SHAP value scale.

Figure 10. SHAP heatmap. 1. (A) the Logistic regression model heatmap; (B) the Random Forest model heatmap. 2. The x-axis represented the number of instances, the y-axis represented the impact of features on instances, and the color indicated the extent and direction of the impact. (red for positive, blue for negative).

3.5 SHapley Additive exPlanations explains the models

3.5.1 Visual analysis and interpretation of multi-feature contribution based on SHAP heatmap

The important feature rankings were shown on the left y-axis, with features sorted by their influence from greatest to smallest. Their visualizations were presented on the right y-axis. In the image, the depth of color indicates the magnitude of the SHAP values, which reflect the impact of the feature values on the model. The deeper the color, the larger the absolute value of the SHAP value and greater the influence on the model. The top part of the image visualizes the model’s prediction results based on these values.

3.5.2 Feature contributions: WBC dominated LR, CAR contributed least; PLR Led RF, CALLY trailed

The SHAP summary plot of the LR model (as shown in Figure 11A) revealed that among the four indicators used for modeling, WBC was the most contributive to the model, with its different levels exerting significantly varying impacts on the model’s prediction outcomes. In contrast, CAR was the least contributive indicator. The SHAP summary plot of the RF model (as shown in Figure 11B) indicated that among the four indicators used for modeling, PLR was the most contributive to the model, with a predominantly negative impact on the model’s prediction outcomes across the vast majority of its levels. In contrast, CALLY was the least contributive. A comparison of the two plots revealed that the contributions of the same indicators varied across different models.

Figure 11
Dot plots display SHAP values for features: WBC, PLR, NLR, CAR in panel A; PLR, NLR, WBC, CALLY in panel B. Feature values range from low (purple) to high (yellow).

Figure 11. SHAP summary plot. 1. (A) Summary plot of the logistic regression model; (B) Summary plot of the random forest model. 2. The x-axis shown the SHAP values of each feature, indicating the impact of each feature on the prediction outcome. The y-axis shown the importance ranking of the features. The color indicated the direction of the feature’s impact on the model’s prediction, with orange-yellow representing positive impact and blue-purple representing negative impact.

3.5.3 The correlation between feature variables and target variables

The SHAP dependence plots have illustrated the relationships between the features and the target variable. Figure 12A have shown the relationships of the four features used to build the Logistic regression model with the target variable (whether NTM disease was present), while Figure 12B have depicted the relationships of the four features used to build the RF model with the target variable. For instance, PLR was positively correlated with concurrent NTM in both models.

Figure 12
Scatter plots depicting SHAP values versus four variables: PLR, WBC, NLR, and CAR/CALLY across two panels labeled A and B. Points are colored by NTM value, with orange indicating 1 and purple indicating 0. Panel A includes PLR, WBC, NLR, and CAR, while panel B features PLR, WBC, NLR, and CALLY. Each plot shows differing trends and distributions for the variables.

Figure 12. SHAP dependence plot. The x-axis represented the variable values, and the y-axis represented the SHAP values of the model. The color indicated the model output variable, with dark blue representing HIV-positive patients without concurrent NTM and yellow representing HIV-positive patients with concurrent NTM. (A) SHAP dependence plots of PLR, WBC, NLR, and CAR in the logistic-regression model with respect to “whether concurrent NTM”. (B) SHAP dependence plots of PLR, WBC, NLR, and CALLY in the random-forest model with respect to “whether concurrent NTM”.

3.5.4 The decision-making process of the research subjects in the model

The SHAP decision plot illustrated how each feature contributed to the model’s output results. Figure 13A shown the decision - making process of 265 subjects in the LR model, while Figure 13B shown the decision-making process of 265 patients in the RF model.

Figure 13
Parallel coordinate plots comparing model output values for two cases. In plot A, variables include PLR, NLR, CAR, and WBC, with values ranging from negative fifteen to fifteen. In plot B, variables are WBC, CALLY, NLR, and PLR, with values ranging from negative 0.4 to 1. Lines transition from blue to red, indicating the distribution of values across variables.

Figure 13. SHAP decision plot. (A) the SHAP decision plot for the Logistic regression model; (B) the SHAP decision plot for the Random Forest model. The x-axis represented the feature values, with the first horizontal line indicating the starting point of the prediction and the second horizontal line indicating the endpoint of the prediction. The color denoted the direction of the prediction. The lines between the two horizontal lines shown the prediction path. The y-axis represented the features involved in the model’s decision - making process.

3.5.5 The LR and RF models demonstrated accurate predictions when discriminating between random and external validation singletons

A sample was randomly selected, with the following feature values: PLR: 101, WBC: 6.22, NLR: 3.17, CAR: 1.33, CALLY: 1, NTM: 0 (indicating that the patient did not have NTM). SHAP single - sample force plots were drawn for the two models, as shown in Figure 14. The single sample force plot of the LR model, as shown in Figure 14A, indicates that the baseline value (E[f(x)]) of this sample in the Logistic regression model is 0.291, which is the starting point of this force plot. The most influential indicator is PLR, with CAR positively contributing 0.0422, PLR negatively contributing 0.0821, WBC negatively contributing 0.0523, and NLR negatively contributing 0.0248. The final predicted value f(x) is 0.174. The predicted risk is 1/(1 + exp(-f(x)), that is, the predicted risk is 0.524, which is less than the cutoff value of 0.630 (see Table 5), and is judged as no concurrent NTM, consistent with the actual result. In the RF model of this sample, as shown in Figure 14B, it is indicated that E[f(x)] = 0.302, f(x) = 0.098, and the measured risk is 0.525, which is less than the optimal cut-off value of 0.653 (see Table 5). Since f(x) is less than 0.653, it is judged as not concurrent with NTM, which is consistent with the actual result. The information of the only patient diagnosed with HIV concurrent with NTM disease in our hospital from September 1st to 30th, 2024 will be selected for further verification. The patient’s clinical information: PLR: 68.18, NLR: 14.05, CAR: 1.67, CALLY: 0.13, NTM: 1. The single sample force diagrams corresponding to the two models were drawn separately, as shown in Figure 15. The single sample force diagram of the LR model is shown in Figure 15A, with the final predicted value f(x) = 1.77, and the corresponding predicted risk is 0.855, which is greater than the optimal value of 0.630, indicating a concurrent NTM. In the RF prediction model, this sample is shown in Figure 15B, with the final predicted value f(x) = 1.77, and the corresponding predicted risk is 0.855, which is greater than the optimal value of 0.653, indicating HIV concurrent NTM disease. The prediction results of the two models are consistent with the actual situation.

Figure 14
Diagram showing two prediction models, labeled A and B, using SHAP values. Model A indicates a positive contribution of +0.0759 from CAR=1.82, with negative impacts from WBC=8.01 (-0.163), PLR=201 (-0.0166), and NLR=3.5 (-0.014), resulting in a prediction of f(x)=0.186. Model B displays positive contributions from PLR=201 (+0.106), CALLY=0.858 (+0.191), and NLR=3.5 (+0.207), with a negative impact from WBC=8.01 (-0.054), predicting f(x)=0.74. Both models show expected predictions, with E[f(x)]=0.304 for A and E[f(x)]=0.29 for B.

Figure 14. Single sample force plot. 1. Figures (A, B) were force plots of the same patient based on different models. A was based on the Logistic regression model, and B was based on the Random Forest model. 2. E[f(x)] represented the baseline value of the model, and f(x) represented the final prediction value. The length of each feature’s bar indicated its contribution to the prediction value, and the color indicated the direction of influence, with orange-yellow representing positive impact and dark red representing negative impact.

Figure 15
Two horizontal bar charts, labeled A and B, depict variable contributions to a model. Both charts have similar scales with ranges from -6 to 4. Positive factors like CAR, WBC, and NLR in red increase values, while blue, such as PLR, decrease them. A shows CAR = 1.67, WBC = 3.41, NLR = 14.05 affecting the outcome f(x) = 1.77. B shows CALLY = 0.13, WBC = 3.41, NLR = 14.05 leading to a similar outcome of f(x) = 1.77.

Figure 15. External validation single sample ROC curve. Figures (A, B) were force diagrams of the same patient based on different models (The patient’s clinical information: PLR: 68.18, NLR: 14.05, CAR: 1.67, CALLY: 0.13, NTM: 1). A was based on the logistic regression model, while Bwas based on the random forest model. The base value represented the model’s baseline value, and f(x) represented the final predicted value. The length of each feature bar indicates its contribution to the predicted value, and the color indicated the direction of the influence, with blue representing a positive influence and red representing a negative influence.

4 Discussion

In this study, the new and old inflammatory indicators had different values in the early diagnosis of HIV-associated NTM disease. WBC, NLR, PLR, CAR, and CALLY had assisted in identifying individuals with HIV-associated NTM. The LR models and RF models based on these indicators were established, which have shown good diagnostic value for the early identification of HIV-associated NTM.

In this study found that the WBC level in individuals with HIV-associated NTM disease was significantly lower than those without concurrent NTM disease (p < 0.001).This might be due to the direct impact of the HIV on WBC, especially the reduction of CD4+ T lymphocytes, which leads to the suppression or deficiency of the immune system and subsequently results in a decrease in WBC.

The reduction in WBC cannot effectively identify and eliminate infections, leading to the occurrence and progression of NTM infections (14, 15). NLR is considered to reflect the balance between innate and adaptive immune responses and is widely used in clinical practice to predict the severity, prognosis and risk of death in patients with various diseases, such as chronic obstructive pulmonary disease (COPD) (16), acute pancreatitis (17), malignant tumors (18), and HPV (19). In infectious diseases, similar value also exists. Shangguan X et al. discovered (20) that NLR levels are associated with the survival of sepsis patients and can be used to predict the 30-day mortality rate of sepsis patients and identify those at high risk of death. Some scholars have also found that NLR is associated with poor prognosis of tuberculosis, and high levels of NLR can significantly increase the adverse outcomes of tuberculosis treatment (21). The NLR has also been linked to mortality risk in people living with HIV (22), and it demonstrates strong discriminatory ability for identifying concurrent human papillomavirus infection in this population, achieving an AUC of 0.882 (19).In this study, the NLR level in HIV patients co-infected with NTM was significantly higher than that in the control group, and the difference was statistically significant (p < 0.05). In the multivariate analysis, NLR was an independent risk factor for NTM in HIV patients. Therefore, NLR can be used as an auxiliary indicator to help clinicians assess the inflammatory status of HIV-infected individuals and assist in the early prediction of NTM infection risk, thereby guiding treatment decisions.

PLR reflects the balance between platelets and lymphocytes in patients. Studies have shown that PLR can serve as an assessment indicator for disease progression in HIV patients and can also predict HIV-related complications. It is particularly valuable in predicting the risk of cardiovascular and cerebrovascular diseases such as HIV-associated hypertension (23) and stroke (24), as well as the risk of malignant tumors such as rectal cancer (25), liver cancer (26), and lymphoma (27). Furthermore, elevated PLR levels may inform treatment decisions, influencing whether to initiate or modify antiretroviral therapy (28, 29). In this study, the PLR level in the HIV patients with NTM disease was higher than that in the control group. Multivariate Logistic regression analysis showed that PLR was an independent risk factor for NTM disease in HIV patients. Moreover, the LR model and RF model established with PLR as a participant both had good diagnostic value. The increase in PLR may reflect the intensification of the inflammatory state in HIV-infected individuals, and is related to the severity of NTM infection and immune activation (30). Therefore, PLR not only facilitates the assessment of complications in HIV-infected individuals but also serves as a useful indicator for predicting the risk of HIV concurrent NTM.

The C-reactive protein to albumin ratio (CAR), serving as a comprehensive index for assessing inflammation and nutritional status, has demonstrated significant application value across various clinical domains. Elevated levels of CAR were associated with adverse outcomes in a range of diseases, including cancer (31), acute heart failure (32), chronic obstructive pulmonary disease (33), and sepsis, among others. Chenciner L et al. noted (34) that the CAR serves as a potential inflammatory marker for adult HIV infection, with CAR levels exhibiting an increase during the early stages of HIV infection. Research has demonstrated that C-reactive protein (CRP) levels are elevated (35), while albumin (ALB) levels are significantly reduced (36) in HIV patients who also have concurrent NTM disease. Theoretically, it follows that CAR levels should likewise show a significant increase.

This is consistent with our results. CAR was significantly elevated in the HIV co-infected NTM disease group compared to the control group. Multivariate analysis also showed that a high level of CAR was an independent risk factor for concurrent NTM disease. Our research shows that CAR can effectively distinguish whether HIV patients are complicated with NTM disease. Clinicians should enhance their understanding and clinical application of CAR.

The CALLY index, derived from CRP, ALB, and lymphocyte count, provides a comprehensive reflection of the body’s immune, nutritional, and inflammatory status. Studies have shown that the CALLY index is independently associated with gastrointestinal malignancies (37) and is linearly negatively correlated with the mortality rate of cancer patients (38). It has also been found that there was a significant negative correlation between the CALLY index and the all-cause mortality and cardiovascular mortality among the elderly (39). Moreover, a cross-sectional study showed that the CALLY index was correlated with the disease activity of rheumatoid arthritis (40). However, there is no relevant research on this index in HIV-associated NTM disease. In our study, the CALLY index was significantly lower in the HIV-associated NTM disease group than in the control group. In the SVM-RFE and random forest feature importance rankings, the CALLY index was among the top 4. Moreover, the random forest model established with its involvement could predict whether HIV-positive patients had NTM disease relatively well.

In recent years, machine learning has rapidly emerged in clinical research, particularly in the field of HIV, where significant progress was made in the construction and application of related diagnostic or predictive models. Saldana CS et al. (41) developed a machine learning model for predicting the incidence of HIV. Among the 85,224 patients included in the study, the model demonstrated high predictive performance for 2.37% of the newly diagnosed cases. Lin B et al. systematically compared seven machine learning algorithms and ultimately constructed an HIV infection risk prediction model for men who have sex with men (MSM) based on XGBoost, with an area under the curve (AUC) of 0.777 (95% CI: 0.639 - 0.915) (42). Machine learning models have also demonstrated excellent performance in predicting HIV complications and related aspects. Zhang H et al. discovered through machine learning that the levels of inflammatory markers such as IL-2, IL-4, IL-6, IL-10, TNF-α and IFN-γ were significantly elevated in HIV-positive patients with tuberculosis. Moreover, the support vector machine (SVM) model constructed based on these markers demonstrated good predictive potential (43). Zhang H et al. utilized machine learning techniques and found that among HIV-positive patients, those with tuberculosis exhibited significantly elevated levels of inflammatory markers including IL-2, IL-4, IL-6, IL-10, TNF-α, and IFN-γ. Furthermore, the support vector machine (SVM) model established based on these markers demonstrated promising predictive potential (44). In addition, machine learning technology is also applied in the treatment of NTM disease. A research study utilized deep learning techniques to predict the 3-year, 5-year, and 10-year mortality rates of patients diagnosed with Nontuberculous Mycobacteria Pulmonary Disease (NTM-PD) based on chest X-ray images. In the validation set, the area under the curve (AUC) for the model was recorded at 0.865, 0.942, and 0.865 for each respective time interval. Furthermore, when clinical information was incorporated into the model, its predictive performance demonstrated significant enhancement (45).This study established three prediction models for HIV-associated NTM disease based on LR model, RF, and SVM. After evaluating the discrimination, calibration, and clinical applicability of these three models, we successfully developed two prediction models for HIV-associated NTM disease using Logistic regression and random forest. The two models can provide diagnostic clues for the early identification of HIV-positive patients without NTM disease. Moreover, our study also employed SHAP to interpret the models. SHAP is a model interpretation method based on game theory, which aims to enhance the interpretability of models. By calculating the contribution of each feature to the model’s prediction (Shapley value), it provides both local and global explanations (46). In this study, SHAP was used to interpret the importance of each indicator and the decision-making process of the two models, visualize the prediction process, and facilitate understanding and application.

Our study also has certain limitations. First, In the comparison of baseline data, the age of patients in the HIV co-infected with NTM disease group (44.5±10.337years) was significantly younger than that in the control group (49.8±12.203 years), and the difference was statistically significant. However, the 2020 version of the “Diagnosis and Treatment Guidelines for Nontuberculous Mycobacterial Disease” has already pointed out that increasing age is a risk factor for concurrent NTM disease, so the two groups are also comparable. Secondly, due to the relatively small number of patients with HIV concurrent with NTM, in order to ensure the scientific nature of the sample size, we randomly selected 3/10 of the study subjects as the validation set. Although this can support the robustness of the model prediction and increase confidence in the model prediction, strictly speaking, its value is not as high as that of external validation. To address this deficiency, we are also actively seeking remedial measures. We eventually established two diagnostic models. One can be randomly selected for prediction, and the other for validation. Thirdly, in our research design, the control group excluded tuberculosis patients, which simplified the clinical practice diagnostic issues. The differential diagnosis of HIV concurrent NTM and tuberculosis will be the focus of our future research. Finally, our study a single-center retrospective study, and it remained to be validated by enlarging the sample size and conducting multicenter prospective cohort studies.

5 Conclusion

We found that WBC, NLR, PLR, CAR, and CALLY can assist in the early identification of HIV-positive patients with NTM disease. Moreover, utilizing these indicators, we have successfully developed two diagnostic models for the early diagnosis of HIV-related NTM disease, specifically LR and RF. Both models have good discrimination, calibration, clinical applicability and stability. and can be used for screening and early identification of HIV- positive patients with NTM disease. In clinical practice, for HIV patients with suspected concurrent mycobacterial infection, after excluding TB, these two models can be used for screening and early identification of HIV concurrent NTM disease.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

All the patients in this study had obtained informed consent. The study had been approved by the Ethics Committee of the Third People’s Hospital of Kunming (Kun San Lun), with the approval number: KSLL202405211014. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

LL: Methodology, Writing – original draft. CS: Methodology, Writing – original draft. XL: Methodology, Writing – original draft. WL: Data curation, Writing – original draft. YL: Writing – original draft, Data curation. HZ: Writing – original draft, Data curation. GW: Data curation, Writing – original draft. YZ: Writing – original draft, Data curation. YH: Writing – original draft, Investigation. JuY: Investigation, Writing – original draft. JiY: Investigation, Writing – original draft. SL: Supervision, Writing – review & editing, Funding acquisition. LS: Funding acquisition, Writing – review & editing, Supervision.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. National Natural Science Foundation of China (82460001); Scientific Research Project of Yunnan Provincial Clinical Medical Center (2024YNLCYXZX0207); Science and Technology Department of Yunnan Province Joint Special Fund for Local Universities (202401BA070001-063; 202401BA070001-069); Kunming Municipal Health Commission Health Science Research Project (2023-03-08-012; 2023-03-02-019).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Joint United Nations Programme on HIV/AIDS. Global AIDS update 2024. the urgency of now: AIDS at a crossroads. (Geneva: UNAIDS) (2024).

Google Scholar

2. Mehta M and Marras TK. Impaired health-related quality of life in pulmonary nontuberculous mycobacterial disease. Respir Med. (2011) 105:1718–25. doi: 10.1016/j.rmed.2011.08.004

PubMed Abstract | Crossref Full Text | Google Scholar

3. Dawood H, Richards L, Lutchminarain K, Parker A, Wattrus C, Sipambo N, et al. Southern African HIV Clinicians Society guideline on the management of non-tuberculous mycobacteria in people with HIV. South Afr J HIV Med. (2024) 25:1657. doi: 10.4102/sajhivmed.v25i1.1657

PubMed Abstract | Crossref Full Text | Google Scholar

4. Hoza AS, Mfinanga SG, Rodloff AC, Moser I, and Konig B. Increased isolation of nontuberculous mycobacteria among TB suspects in Northeastern, Tanzania: public health and diagnostic implications for control programmes. BMC Res Notes. (2016) 9:109. doi: 10.1186/s13104-016-1928-3

PubMed Abstract | Crossref Full Text | Google Scholar

5. Yu X, Liu P, Liu G, Zhao L, Hu Y, Wei G, et al. The prevalence of non-tuberculous mycobacterial infections in mainland China: Systematic review and meta-analysis. J Infect. (2016) 73:558–67. doi: 10.1016/j.jinf.2016.08.020

PubMed Abstract | Crossref Full Text | Google Scholar

6. Liang L and Su Q. Systemic immune-inflammation index and the short-term mortality of patients with sepsis: A meta-analysis. Biomol BioMed. (2024) 25:798–809. doi: 10.17305/bb.2024.11494

PubMed Abstract | Crossref Full Text | Google Scholar

7. Chen KL, Qiu YW, Yang M, Wang T, Yang Y, Qiu HZ, et al. Prognostic value of preoperative systemic immune-inflammation index/albumin for patients with hepatocellular carcinoma undergoing curative resection. World J Gastroenterol. (2024) 30:5130–51. doi: 10.3748/wjg.v30.i48.5130

PubMed Abstract | Crossref Full Text | Google Scholar

8. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. (2020) 2:56–67. doi: 10.1038/s42256-019-0138-9

PubMed Abstract | Crossref Full Text | Google Scholar

9. Ochayo A, Wamalwa R, Barasa E, Zablon J, Sowayi G, Were T, et al. Prevalence of non-tuberculosis mycobacterium pulmonary disease in HIV-1 patients with presumptive pulmonary tuberculosis in western Kenya. Ethiop J Health Sci. (2023) 33:735–42. doi: 10.4314/ejhs.v33i5.3

PubMed Abstract | Crossref Full Text | Google Scholar

10. Lan R, Yang C, Lan L, Ou J, Qiao K, Liu F, et al. Mycobacterium tuberculosis and non-tuberculous mycobacteria isolates from HIV-infected patients in Guangxi, China. Int J Tuberc Lung Dis. (2011) 15:1669–75. doi: 10.5588/ijtld.11.0036

PubMed Abstract | Crossref Full Text | Google Scholar

11. Loebinger MR, Quint JK, van der Laan R, Obradovic M, Chawla R, Kishore A, et al. Risk factors for nontuberculous mycobacterial pulmonary disease: A systematic literature review and meta-analysis. Chest. (2023) 164:1115–24. doi: 10.1016/j.chest.2023.06.014

PubMed Abstract | Crossref Full Text | Google Scholar

12. Acquired Immunodeficiency Syndrome Professional Group, Society of Infectious Diseases, Chinese Medical Association; Chinese Center for Disease Control and Prevention. Chinese guidelines for the diagnosis and treatment of human immunodeficiency virus infection/acquired immunodeficiency syndrome (2024 edition). Chin Med J (Engl). (2024) 137:2654–80. doi: 10.1097/CM9.00000000000003383

PubMed Abstract | Crossref Full Text | Google Scholar

13. Chinese Medical Association Tuberculosis Branch. Guidelines for the diagnosis and treatment of non-tuberculous mycobacterial disease (2020 edition). Chin J Tuberculosis Respir Dis. (2020) 43:918–946. doi: 10.3760/cma.j.cn112147-20200508-00570

Crossref Full Text | Google Scholar

14. Hunakova L, Sabaka P, Zvarik M, Mikolaskova I, Gidron Y, and Bucova M. Linear and non−linear indices of vagal nerve in relation to sex and inflammation in patients with Covid−19. BioMed Rep. (2023) 19:80. doi: 10.3892/br.2023.1662

PubMed Abstract | Crossref Full Text | Google Scholar

15. Coco-Bassey SB, Asemota EA, Okoroiwu HU, Etura JE, Efiong EE, Inyang IJ, et al. Glutathione, glutathione peroxidase and some hematological parameters of HIV-seropositive subjects attending clinic in University of Calabar teaching hospital, Calabar, Nigeria. BMC Infect Dis. (2019) 19:944. doi: 10.1186/s12879-019-4562-6

PubMed Abstract | Crossref Full Text | Google Scholar

16. Hassan M and Noor L. Neutrophil to lymphocyte ratio (NLR): a road to an inexpensive biomarker for acute exacerbation of COPD. J Pak Med Assoc. (2025) 75:1317. doi: 10.47391/JPMA

PubMed Abstract | Crossref Full Text | Google Scholar

17. Qi R, Wang H, Luo R, Li J, and Su L. Development of a clinical prediction model for intra-abdominal infection in severe acute pancreatitis using logistic regression and nomogram. Front Med (Lausanne). (2025) 12:1636733. doi: 10.3389/fmed.2025.1636733

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhou H, Jin W, Li L, Nie X, Wu W, Chen R, et al. Risk prediction model for overall survival in lung cancer based on inflammatory and nutritional markers. Sci Rep. (2025) 15:30840. doi: 10.1038/s41598-025-16443-1

PubMed Abstract | Crossref Full Text | Google Scholar

19. Fook KD, Araujo M, Muniz A, Carvalho MM, Morais ACCD, Araujo DR, et al. Neutrophil/lymphocyte and platelet/lymphocyte ratio in seropositive women for human immunodeficiency virus (HIV) and human papillomavirus (HPV) coinfection. Rev Inst Med Trop Sao Paulo. (2024) 66:e67. doi: 10.1590/S1678-9946202466067

PubMed Abstract | Crossref Full Text | Google Scholar

20. Shangguan X, Zhang Z, Shangguan X, Wang J, and Gong Y. Association between whole blood cell-derived inflammatory markers and all-cause mortality in patients with sepsis-associated acute kidney injury: A retrospective study based on the MIMIC-IV database. J Intensive Care Med. (2025) 25:1374692877. doi: 10.1177/08850666251363853

PubMed Abstract | Crossref Full Text | Google Scholar

21. Ji Y, Xie Q, Wei W, Huang Z, Liu X, Ye Q, et al. Association between blood inflammatory status and the survival of tuberculosis: a five-year cohort study. Front Immunol. (2025) 16:1556857. doi: 10.3389/fimmu.2025.1556857

PubMed Abstract | Crossref Full Text | Google Scholar

22. Raffetti E, Donato F, Casari S, Castelnuovo F, Sighinolfi L, Bandera A, et al. Systemic inflammation-based scores and mortality for all causes in HIV-infected patients: a MASTER cohort study. BMC Infect Dis. (2017) 17:193. doi: 10.1186/s12879-017-2280-5

PubMed Abstract | Crossref Full Text | Google Scholar

23. Ou-Yang H, Fu H, Luo Y, Xu ZY, Liu J, Gao R, et al. Inflammation markers and the risk of hypertension in people living with HIV. Front Immunol. (2023) 14:1133640. doi: 10.3389/fimmu.2023.1133640

PubMed Abstract | Crossref Full Text | Google Scholar

24. Wang A, Wang F, Huang Y, Cui Q, Xu Y, Zhang W, et al. Association between systemic inflammatory markers and all-cause mortality in patients with stroke: A prospective study using data from the UK Biobank. J Stroke Cerebrovasc Dis. (2024) 33:108076. doi: 10.1016/j.jstrokecerebrovasdis

PubMed Abstract | Crossref Full Text | Google Scholar

25. Quiros-Roldan E, Properzi M, Amadasi S, Raffetti E, Ferraresi A, Biasi L, et al. Prognostic role of inflammatory biomarkers in HIV-infected patients with a first diagnosis of hepatocellular carcinoma: A single-center study. J Med Virol. (2019) 91:241–8. doi: 10.1002/jmv.25317

PubMed Abstract | Crossref Full Text | Google Scholar

26. Stem J, Yang Q, Carchman E, Striker R, and Sanger CB. Do immune inflammatory markers correlate with anal dysplasia and anal cancer risk in patients living with HIV? Int J Colorectal Dis. (2022) 37:983–8. doi: 10.1007/s00384-022-04123-9

PubMed Abstract | Crossref Full Text | Google Scholar

27. Zeng J, Zhang X, Jia L, Wu Y, Tian Y, Zhang Y, et al. Pretreatment lymphocyte-to-monocyte ratios predict AIDS-related diffuse large B-cell lymphoma overall survival. J Med Virol. (2021) 93:3907–14. doi: 10.1002/jmv.26655

PubMed Abstract | Crossref Full Text | Google Scholar

28. Stem J, Hewitt AJ, Yang Q, and Sanger CB. Commonly drawn immunologic and inflammatory markers as risk predictors for anal cancer in veterans living with HIV. J Low Genit Tract Dis. (2024) 28:300–4. doi: 10.1097/LGT.000000000000000811

PubMed Abstract | Crossref Full Text | Google Scholar

29. Obeagu EI and Obeagu GU. Platelet index ratios in HIV: Emerging biomarkers for immune health and disease management. Med (Baltimore). (2024) 103:e37576. doi: 10.1097/MD.0000000000037576

PubMed Abstract | Crossref Full Text | Google Scholar

30. Ou-Yang H, Fu HY, Luo Y, Xu ZY, Liu J, Gao R, et al. Inflammation markers and the risk of hypertension in people living with HIV. Front Immunol. (2023) 14:1133640. doi: 10.3389/fimmu.2023.1133640

PubMed Abstract | Crossref Full Text | Google Scholar

31. Senturk A and Harmantepe AT. The role of tumor characteristics and biomarkers in predicting long-term survival rates of rectal cancer patients. Med (Baltimore). (2025) 104:e43799. doi: 10.1097/MD.0000000000043799

PubMed Abstract | Crossref Full Text | Google Scholar

32. Algin A, Ozkan A, Yusufoglu K, Ozdemir S, and Afacan MA. The role of C-reactive protein and albumin combined indexes in predicting the need for intubation in acute heart failure: A prospective observational study. North Clin Istanb. (2025) 12:314–20. doi: 10.14744/nci.2025.76032

PubMed Abstract | Crossref Full Text | Google Scholar

33. Ao T, Huang Y, Zhen P, and Hu M. Association between C-reactive protein to albumin ratio and chronic obstructive pulmonary disease: a cross-sectional study. BMC Pulm Med. (2025) 25:1. doi: 10.1186/s12890-024-03469-w

PubMed Abstract | Crossref Full Text | Google Scholar

34. Chenciner L, Symonds M, Dissanayake O, Hunter A, Burns F, and Miller RF. Lymphocyte-CRP ratio and CRP-albumin ratio as potential inflammation markers in adults with HIV. J Acquir Immune Defic Syndr. (2022) 91:e1–3. doi: 10.1097/QAI.00000000000003028

PubMed Abstract | Crossref Full Text | Google Scholar

35. Yanagihara T, Ogata H, Mori A, Kadowaki M, Moriuchi Y, Ishimatsu A, et al. Amikacin liposome inhalation suspension in the real-world management of refractory mycobacterium avium complex pulmonary disease. Cureus. (2024) 16:e56622. doi: 10.7759/cureus.56622

PubMed Abstract | Crossref Full Text | Google Scholar

36. Wetzstein N, Geil A, Kann G, Lehn A, Schuttfort G, Kessel J, et al. Disseminated disease due to non-tuberculous mycobacteria in HIV positive patients: A retrospective case control study. PloS One. (2021) 16:e254607. doi: 10.1371/journal.pone.0254607

PubMed Abstract | Crossref Full Text | Google Scholar

37. Li H, Mo Z, and Tong G. Value of CRP, albumin, and lymphocyte index in predicting survival of patients with gastrointestinal Malignancies: a systematic review and meta-analysis. Front Oncol. (2025) 15:1592794. doi: 10.3389/fonc.2025.1592794

PubMed Abstract | Crossref Full Text | Google Scholar

38. Zhu D, Lin YD, Yao YZ, Qi XJ, Qian K, and Lin LZ. Negative association of C-reactive protein-albumin-lymphocyte index (CALLY index) with all-cause and cause-specific mortality in patients with cancer: results from NHANES 1999-2018. BMC Cancer. (2024) 24:1499. doi: 10.1186/s12885-024-13261-y

PubMed Abstract | Crossref Full Text | Google Scholar

39. Luo L, Li M, Xi Y, Hu J, and Hu W. C-reactive protein-albumin-lymphocyte index as a feasible nutrition-immunity-inflammation marker of the outcome of all-cause and cardiovascular mortality in elderly. Clin Nutr ESPEN. (2024) 63:346–53. doi: 10.1016/j.clnesp.2024.06.054

PubMed Abstract | Crossref Full Text | Google Scholar

40. Kilic O, Tecer D, Canbas M, Kaya MN, Çınar M, and Yılmaz S. Immune nutrition indices are associated with disease activity in rheumatoid arthritis: a cross-sectional study. biomark Med. (2024) 18:1093–102. doi: 10.1080/17520363.2024.2430942

PubMed Abstract | Crossref Full Text | Google Scholar

41. Saldana CS, Burkhardt E, Pennisi A, Oliver K, Olmstead J, Holland DP, et al. Development of a machine learning modeling tool for predicting HIV incidence using public health data from a county in the southern United States. Clin Infect Dis. (2024) 79:717–26. doi: 10.1093/cid/ciae100

PubMed Abstract | Crossref Full Text | Google Scholar

42. Lin B, Liu J, Li K, and Zhong X. Predicting the risk of HIV infection and sexually transmitted diseases among men who have sex with men: cross-sectional study using multiple machine learning approaches. J Med Internet Res. (2025) 27:e59101. doi: 10.2196/59101

PubMed Abstract | Crossref Full Text | Google Scholar

43. Li X, Li Y, Zhu Z, Tan B, Zhou X, Shi H, et al. Development and spatial external validation of a predictive model of survival based on random survival forest analysis for people living with HIV and AIDS after highly active antiretroviral therapy in China: retrospective cohort study. J Med Internet Res. (2025) 27:e71257. doi: 10.2196/71257

PubMed Abstract | Crossref Full Text | Google Scholar

44. Zhang H, Li L, Liu Y, Xiao W, Xu R, Lu M, et al. Serum cytokine biosignatures for identification of tuberculosis among HIV-positive inpatients. Thorax. (2024) 79:465–71. doi: 10.1136/thorax-2023-220782corr1

PubMed Abstract | Crossref Full Text | Google Scholar

45. Lee S, Lee HW, Kim H, Kim DK, Yim JJ, Yoon SH, et al. Deep learning-based prediction model using radiography in nontuberculous mycobacterial pulmonary disease. Chest. (2022) 162:995–1005. doi: 10.1016/j.chest.2022.06.018

PubMed Abstract | Crossref Full Text | Google Scholar

46. Phan TG, Srikanth VK, Cadilhac DA, Nelson M, Kim J, Olaiya MT, et al. Framingham risk score prediction at 12 months in the STANDFIRM randomized control trial. J Am Heart Assoc. (2025) 14:e40254. doi: 10.1161/JAHA.124.040254

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: machine learning, HIV, non-tuberculous mycobacterial disease, new inflammatory biomarkers, diagnostic model, SHapley Additive exPlanations

Citation: Li L, Shi C, Liu X, Li W, Luo Y, Zhang H, Wang G, Zhao Y, Huang Y, Yang J, Yang J, Li S and Shen L (2025) A diagnostic model for NTM disease in HIV-positive patients: a machine learning-based analysis with novel inflammatory markers. Front. Immunol. 16:1652747. doi: 10.3389/fimmu.2025.1652747

Received: 24 June 2025; Accepted: 03 October 2025;
Published: 14 November 2025.

Edited by:

Lia Danelishvili, Oregon State University, United States

Reviewed by:

Zhaowei Xu, Fujian Medical University, China
Andre Pastor Da Silva, Icahn School of Medicine at Mount Sinai, United States

Copyright © 2025 Li, Shi, Liu, Li, Luo, Zhang, Wang, Zhao, Huang, Yang, Yang, Li and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shengan Li, bGlzaGVuZ2FuQDE2My5jb20=; Lingjun Shen, bTE4NjA4NzcwMjAyQDE2My5jb20=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.