- 1Department of Oncology, The Second Affiliated Hospital of Guizhou Medical University, Kaili, China
- 2Department of Oncology, Affiliated Hospital of Guizhou Medical University, Guiyang, China
- 3Department of Oncology, Affiliated Cancer Hospital of Guizhou Medical University, Guiyang, China
- 4Division of Oncology, School of Clinical Medicine, Guizhou Medical University, Guiyang, China
- 5Department of Oncology, Qiandongnan Prefecture People’s Hospital, Kaili, China
- 6Department of Oncology, Guiyang Pulmonary Hospital, Guiyang, China
- 7Department of Oncology, Dujiangyan Shoujia Hospital, Chengdu, China
- 8Department of Oncology, Xingyi People’s Hospital, Xingyi, China
- 9Department of Oncology, Panzhou People’s Hospital, Liupanshui, China
- 10Department of Oncology, The First People’s Hospital of Guiyang, Guiyang, China
- 11Department of Pathology, The Second Affiliated Hospital of Guizhou Medical University, Kaili, China
Background: This study aimed to develop and validate an interpretable machine learning model that harnesses circulating tumor DNA (ctDNA) to predict progression-free survival (PFS) in patients with non-small cell lung cancer (NSCLC) undergoing immunotherapy, thereby addressing the inherent limitations of conventional biomarkers such as PD-L1 expression and tumor mutational burden.
Methods: This multicenter study involved pretreatment ctDNA profiling of 441 patients with non-small cell lung cancer (NSCLC), stratified into three independent cohorts: a training set (n=303, OAK trial), a validation set (n=97, POPLAR trial), and a local test set (n=41, multicenter retrospective cohort, 2023–2024). Using 5-fold cross-validated LASSO-Cox (Least Absolute Shrinkage and Selection Operator-Cox Proportional Hazards) regression, 25 prognostic genomic features were identified for integration into an eXtreme Gradient Boosting (XGBoost) model. Model performance was systematically evaluated via three approaches: (1) discrimination metrics, including AUC with 95% confidence intervals, accuracy, sensitivity, and specificity; (2) Kaplan-Meier survival analysis complemented by log-rank testing; and (3) SHapley Additive exPlanations (SHAP) for interpreting feature importance.
Results: The model exhibited robust predictive performance, with AUCs of 0.82 (training cohort), 0.79 (validation cohort), and 0.77 (test cohort). Key genomic predictors included TP53 mutations, which were associated with shorter PFS, and BRCA2 mutations, which correlated with longer PFS. SHAP analysis identified NOTCH1 as a novel predictive biomarker, whose feature contribution profile suggests a role in immune modulation in lung squamous cell carcinoma. Risk stratification significantly distinguished PFS outcomes (log-rank P < 0.05). Decision curve analysis confirmed the model’s clinical utility, as it outperformed “treat-all” strategies.
Conclusion: This study establishes a robust, interpretable ctDNA-derived machine learning algorithm for predicting PFS in NSCLC patients receiving immune checkpoint inhibitors. The identification of TP53, BRCA2, and NOTCH1 as biologically plausible predictive biomarkers advances understanding of immunotherapy response mechanisms and enables clinically actionable risk stratification to guide therapeutic decision-making. These findings underscore the need for prospective multicenter validation to facilitate translation into precision oncology practice.
1 Introduction
According to the latest statistics, lung cancer remains the leading cause of cancer morbidity and mortality worldwide, topping the list of cancer-related deaths for ten consecutive years (1). Histopathologically, lung cancer is categorized into two main types: non-small cell lung cancer (NSCLC), which accounts for approximately 80% of cases and includes adenocarcinoma, squamous cell carcinoma, and other subtypes; and small cell lung cancer (2, 3). Recent advances in immunotherapy have conferred substantial clinical benefits to an expanding cohort of NSCLC patients (4). Immune checkpoint inhibitors targeting programmed cell death protein 1 (PD-1) and programmed death-ligand 1 (PD-L1) have emerged as a cornerstone of treatment for advanced NSCLC (5). By modulating immune inhibitory pathways, these agents enhance the immune system’s capacity to recognize and eliminate tumor cells, thereby controlling tumor progression and metastasis (6). Clinical evidence confirms that immune checkpoint inhibitors significantly improve survival outcomes in patients with NSCLC (7–10).
While ICIs have become a cornerstone treatment, they represent one facet of a broader movement towards precision oncology. This paradigm is further exemplified by the development of sophisticated nanomaterial-based therapies, which seek to achieve spatiotemporal control over treatment. Examples include light-activated nanopolyplexes for targeted gene silencing (11), multifunctional graphene derivatives for integrated diagnosis and therapy (12), and gold nanorod-based platforms that synergize photothermal ablation with immunotherapy (13). A fundamental challenge unifying all these advanced modalities, however, is the reliable identification of patients who will respond.
However, clinical responses to immunotherapy are highly heterogeneous, with only a subset of patients deriving meaningful benefit (14). Currently, biomarkers such as PD-L1 expression levels and tumor mutational burden (TMB) are limited by suboptimal sensitivity and detection accuracy, highlighting the need for more precise monitoring tools to optimize treatment decisions (15). Circulating tumor DNA (ctDNA), a core component of liquid biopsies, provides real-time insights into tumor genomic profiles and disease burden, facilitating more accurate prediction of immunotherapeutic efficacy (16). Progression-free survival (PFS) is a key metric for evaluating the clinical benefits of immunotherapy (17). Given that tumor patients often experience pseudoprogression following immunotherapy (18, 19), this study selected PFS as the primary endpoint rather than short-term efficacy measures (such as objective response rate). Moreover, dynamic changes in ctDNA have been shown to closely correlate with tumor treatment response and disease progression (20). Compared with traditional tissue biopsies, ctDNA testing is minimally invasive, readily repeatable, and capable of sensitively monitoring tumor clonal evolution and minimal residual disease. Multiple studies have demonstrated that machine learning algorithms can effectively predict the short-term efficacy of immunotherapy in cancer patients (21–25). Nevertheless, reliable ctDNA-based PFS prediction models remain elusive.
This study aimed to develop and validate a machine learning model for predicting PFS in NSCLC patients using pre-immunotherapy ctDNA data. After evaluating multiple machine learning algorithms—including random forest, logistic regression, support vector machines, and eXtreme Gradient Boosting (XGBoost)—we selected XGBoost for its superior performance in processing high-dimensional genomic data, robustness against overfitting, and model interpretability; these features are critical for clinical translation.
2 Methods
This study analyzed ctDNA data from two clinical trials: the OAK trial (a phase III, open-label, multicenter randomized controlled trial comparing atezolizumab with docetaxel in previously treated NSCLC patients) and the POPLAR trial (a phase II, open-label, multicenter randomized controlled trial evaluating atezolizumab versus docetaxel in advanced NSCLC). We included data from 425 patients in the OAK trial and 144 patients in the POPLAR trial who received immunotherapy. Additionally, we collected ctDNA data from 52 NSCLC patients treated at the Second Affiliated Hospital of Guizhou Medical University, People’s Hospital of Qiandongnan Region, and Guiyang Pulmonary Hospital between January 2023 and June 2024. The study was approved by the Ethics Committee of The Second Affiliated Hospital of Guizhou Medical University (Approval No. 2023-LS-02). As a retrospective study, this research strictly adhered to the Declaration of Helsinki (26), with full confidentiality of patient information ensured; a waiver of informed consent was granted by the institutional review board due to the retrospective nature of the analysis.
Figure 1 displays the flow chart. Inclusion criteria: (a) Histologically confirmed NSCLC; (b) Received immunotherapy as monotherapy; (c) Completed ctDNA testing prior to initiating immunotherapy; (d) Without other primary malignant tumors. Exclusion criteria: (a) Incomplete clinical data; (b) Incomplete hematological data; (c) Lost to follow-up; (d) Incomplete immunotherapy course. The primary endpoint was PFS within 2 years, defined as the time from treatment initiation to disease progression or death from any cause. Data from the OAK trial served as the training set (n=303), data from the POPLAR trial as the validation set (n=97), and data from the three local hospitals as the test set (n=41). The training (OAK) and validation (POPLAR) datasets were obtained as analysis-ready datasets from www.clinicalstudydatarequest.com. No missing values were present in the acquired datasets; hence, no imputation was performed. The genomic features were binary variables (mutated: 1, wild-type: 0), and as such, no normalization or feature transformation was applied. The same preprocessing logic was applied to the internal test set. The full analysis code is available at (https://github.com/Yali15207856138/improved-sn).
After cohort assignment, we analyzed baseline characteristics (age, gender, histological type, and TMB expression level) across the three groups and generated a table of baseline characteristics (Table 1) using SPSS software (v26.0). For the training set, univariate Cox regression analysis was performed on ctDNA data. Genes with P < 0.2 in the univariate analysis were included in multivariate Cox regression, with those exhibiting P < 0.05 retained (Supplementary Table 1). Following gene selection via 5-fold cross-validated LASSO-Cox regression (glmnet package in R v3.5.1), the selected features were incorporated into an XGBoost model (Python v3.0.1). Prior to final model building, the XGBoost hyperparameters were optimized through an iterative process guided by 5-fold cross-validation performance (AUC). We explored a broad parameter grid, including but not limited to: learning_rate [0.01, 0.05, 0.1, 0.15, 0.2], max_depth [3, 4, 5, 6, 7], subsample [0.7, 0.8, 0.9, 1.0], and reg_lambda [0.1, 0.5, 1.0, 1.5, 2.0]. The final parameter set was selected as it yielded the highest and most stable cross-validated AUC, configured as follows: booster = ‘gbtree’, objective = ‘binary:logistic’, n_estimators = 100, learning_rate = 0.10, max_depth = 6, subsample = 0.80, reg_lambda = 1.00, with gamma = 0.00 and reg_alpha = 0.00. The model output yielded prediction probabilities, which were used to calculate AUC, accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).
Kaplan-Meier survival analysis was performed using the survival package in R, generating PFS rate estimates and survival curves. Patients were stratified into risk groups based on the optimal cutoff value, which was determined using the Youden index. Survival differences between groups were assessed using the log-rank test. To evaluate model generalizability, calibration curves were generated and decision curve analysis (DCA) was conducted using R. To interpret model behavior, we applied SHapley Additive exPlanations (SHAP) analysis (R v3.5.1), which included SHAP feature importance analysis to identify key predictive features and SHAP force plots to visualize individual prediction mechanisms (27). This integrated approach reveals both global feature contributions and instance-level decision patterns. Statistical significance was defined as P < 0.05 for all hypothesis tests.
3 Results
A total of 441 patients were included in this study. Table 1 summarizes their clinical characteristics. No significant differences in the distribution of gender, age, and TMB expression level were observed in the training set, validation set, and local test set. However, histological type distributions differed: squamous cell carcinoma comprised 68.29% of the local test set (vs. 31.71% non-squamous), whereas the opposite trend was noted in the training and validation sets.
From the 192 initial gene features extracted from 441 patients, 5-fold cross-validated LASSO-Cox regression identified 25 significant gene features (Figure 2). As shown in Table 2 and Figure 3, the training set yielded an AUC of 0.82 (95% CI: 0.77–0.88), while the validation and local test sets demonstrated AUCs of 0.79 (95% CI: 0.64–0.94) and 0.77 (95% CI: 0.47–1.00), respectively. The confusion matrix (Figure 3) and performance metrics (Table 2) confirmed consistent model performance across the training set (n=303), validation set (n=97), and independent test set (n=41), with accuracies of 83% (95% CI: 78–87%), 75% (65–83%), and 95% (83–99%), respectively. Sensitivity values were 60%, 75%, and 67%, while specificity reached 91%, 75%, and 100% across the three datasets. This relatively stable performance indicates no significant overfitting, although the higher accuracy observed in the small test set necessitates further validation in larger cohorts. In addition, we developed an integrated model combining the 25 screened gene features with clinical data (gender, age, pathology type, and TMB expression level). However, the prediction performance did not show any significant improvement. The corresponding ROC curves for the validation and local test sets are presented in Supplementary Figure 1.
Figure 2. Feature selection by Lasso-Cox regression. (A) Coefficient curves. (B) Cross-validation curve.
Figure 3. Model performance evaluation across datasets. (A) Training set ROC curve. (B) Validation set ROC curve. (C) Test set ROC curve. (D) Training set confusion matrix. (E) Validation set confusion matrix. (F) Test set confusion matrix.
The optimal model threshold was determined by maximizing the Youden index (0.653). Risk stratification using the model revealed significant differences in tumor progression rates (log-rank test P < 0.001 for the training set, P = 0.043 for the validation set, and P < 0.001 for the local test set; Figure 4). Given the small sample sizes of the validation and local test sets, we further performed decision curve analysis (DCA) and generated calibration plots to assess the model’s clinical utility. As shown in the DCA (Figure 4) and calibration plot (Supplementary Figure 2), the model’s predictions strongly correlated with patients’ actual tumor progression status.
Figure 4. Validation of progression-free survival prediction and clinical utility. (A) Training set Kaplan-Meier curve for PFS. (B) Validation set Kaplan-Meier curve for PFS. (C) Test set Kaplan-Meier curve for PFS. (D) Training set decision curve analysis. (E) Validation set decision curve analysis. (F) Test set decision curve analysis.
In the training set, TP53, BRCA2, and PTCH1 exhibited the highest absolute average SHAP values (Figure 5), indicating that their mutation statuses contributed most to the model’s predictions of poor outcomes. Moreover, across all three datasets, TP53 mutations were consistently associated with shorter PFS (reflected by negative SHAP values, indicating higher risk), whereas BRCA2 mutations correlated with longer PFS (positive SHAP values, indicating a protective effect against disease progression). Figure 6 shows the SHAP values for a representative single sample in the training set (A), validation set (B), and test set (C), as well as CT images of four representative patients illustrating the two treatment outcomes in the test set (D). Consistent patterns were observed across all datasets: TP53 mutations (red bars) positively contributed to higher risk predictions, while BRCA2 mutations (blue bars) exerted protective effects. This consistency underscores the robustness of the model’s feature interpretation.
Figure 5. Consistent interpretation of feature importance via SHAP analysis across datasets. (A) Training set SHAP feature contribution plot. (B) Validation set SHAP feature contribution plot. (C) Test set SHAP feature contribution plot. (D) Training set SHAP importance plot. (E) Validation set SHAP importance plot. (F) Test set SHAP importance plot.
Figure 6. Model interpretability analysis with clinical exemplars. (A) Training set SHAP sample plot. (B) Validation set SHAP sample plot. (C) Test set SHAP sample plot. (D) Representative CT images of test set patients stratified by treatment response: two non-progressors (top) and two progressors (bottom).
4 Discussion
The model’s ability to maintain consistent performance across distinct datasets (training AUC: 0.82; validation AUC: 0.79; test AUC: 0.77) is particularly notable, considering the histological heterogeneity among cohorts. This robustness suggests that the identified genomic signatures reflect fundamental biological processes rather than dataset-specific artifacts. The significant inverse association between TP53 mutations and PFS aligns with accumulating evidence that TP53 dysfunction fosters an immunosuppressive tumor microenvironment (28–30). Recent studies demonstrate that TP53 mutations can downregulate antigen presentation machinery (e.g., MHC class I) and recruit myeloid-derived suppressor cells, collectively promoting an immune-evasive phenotype (31–33). This mechanistic plausibility strengthens the biological validity of our model’s predictions. Our findings contribute to a growing body of evidence that diverse molecular alterations converge on an immunosuppressive TME. Beyond the genetic mutations captured here, epigenetic dysregulation represents a parallel pathway to immune evasion. For instance, in mucosal melanoma, hypomethylation-driven overexpression of CNDP1 was recently linked to a ‘cold’ TME and inferior immunotherapy outcomes (34). More directly relevant to NSCLC, Yuan et al. demonstrated that the histone methyltransferase KMT5C promotes immune evasion by activating DNA damage repair to suppress the STING-IRF3-type I interferon pathway, thereby inhibiting CD8+ T cell recruitment and function. Crucially, they showed that high KMT5C expression predicts poor response to immune checkpoint blockade (35). These studies collectively underscore that both the genomic features identified by our model and broader epigenetic mechanisms can shape a TME refractory to immunotherapy.
The protective effect associated with BRCA2 mutations represents a complex yet compelling finding. Although initially counterintuitive, the correlation between BRCA2 mutations and longer PFS in our model is consistent with two potential mechanisms: (i) adaptive silencing of the spindle assembly checkpoint (via NSFL1C/AURKB downregulation) to mitigate genomic instability, as reported in BRCA2-deficient prostate cancer (36), and (ii) enhanced tumor immunogenicity stemming from accumulated DNA damage. These dual vulnerabilities may collectively increase the susceptibility of BRCA2-mutated tumors to immunotherapy. Our results complement recent work by Samstein et al., who reported improved outcomes in BRCA2-mutated tumors across multiple cancer types treated with immunotherapy (37). The consistency of this association across all three datasets not only reinforces its validity but also positions BRCA2 status as a candidate predictive biomarker for immunotherapy response, warranting prospective validation.
In the test set feature importance analysis, NOTCH1 emerged unexpectedly as a prominent predictor, warranting further discussion. While direct evidence linking NOTCH1 to PD-L1 regulation in squamous cell carcinoma remains limited – and the relationship is complex and context-dependent – research indicates that NOTCH1 plays a multifaceted role in this cancer type (38). Activation of NOTCH1 signaling pathway can function as an oncogenic driver, potentially promoting PD-L1 expression indirectly to establish an immunosuppressive microenvironment (39). Conversely, loss-of-function mutations in NOTCH1 (where it acts as a tumor suppressor) may lead to hyperactivation of the NF-κB signaling pathway, resulting in upregulated PD-L1 expression (40, 41). This represents an indirect regulatory mechanism mediated through the release of NF-κB inhibition, previously reported in head and neck squamous cell carcinoma. Furthermore, NOTCH1 signaling frequently cross talks with pathways such as RAS/MAPK and PI3K/AKT – established regulators of PD-L1 expression (42). Thus, it is plausible that NOTCH1 indirectly modulates PD-L1 expression in lung squamous cell carcinoma through these or related mechanisms, though this requires confirmation in future studies. Given the overrepresentation of squamous cell carcinoma in our test set (68.29%), the prominence of NOTCH1 in our analysis may reflect squamous cell carcinoma-specific biological interactions or dependencies involving this gene, which merit further investigation.
While PD-L1 expression remains the gold standard for immunotherapy selection, our model’s AUC (0.77–0.82) compares favorably with the predictive accuracy of PD-L1, which typically ranges from 0.65 to 0.70 in meta-analyses (15). This improvement likely stems from capturing multidimensional genomic information beyond a single immune checkpoint marker. Notably, our approach diverges from TMB-based methods by prioritizing functional mutations over total mutation burden. This distinction may explain why we detected predictive signals from BRCA2—a low-frequency mutation—often missed by TMB-centric approaches (37). The predictive power of our model, which integrates signals from multiple genes, finds strong independent validation in the recent work on KMT5C. The power of a multi-gene approach to capture such complex biology is further exemplified by recent research beyond NSCLC. For instance, a very recent study by Liu developed and validated a prognostic model based on circadian rhythm-related genes (CRGs) in skin cutaneous melanoma (43). Despite the different cancer type and gene set, their model similarly identified distinct immune subtypes and demonstrated a strong association between a high-risk CRG score and an immunosuppressive TME, characterized by upregulated immune checkpoints and reduced sensitivity to therapy. The convergence of findings—that multi-gene signatures derived from disparate biological contexts (somatic mutations in NSCLC vs. circadian genes in SKCM) consistently predict TME status and therapeutic outcome—considerably strengthens the validity and generalizability of the integrative genomics approach that underpins our model. The fact that a single epigenetic regulator like KMT5C can profoundly influence the TME and immunotherapy response underscores the biological rationale for why multi-gene models like ours are necessary to capture the complex determinants of treatment outcome (35). While recent studies have explored ctDNA for early relapse detection or resistance monitoring (44–47), our work uniquely focuses on baseline genomic predictors of PFS, providing a clinically actionable tool for risk stratification prior to treatment initiation.
Our work addresses two critical gaps in translational machine learning: reproducibility and interpretability. By leveraging SHAP values, we transcend “black box” predictions to identify biologically plausible drivers of model behavior. For instance, the directional consistency of TP53 and BRCA2 effects across the training and validation phases (Figures 5, 6) confirms these are genuine biological signals rather than overfitting artifacts. Decision curve analysis (Figure 4) further substantiates the model’s clinical utility. At a threshold probability of 30%—reflecting real-world clinical willingness to intervene—the model demonstrated a superior net benefit compared to both “treat-all” and “treat-none” strategies. This suggests potential cost savings by avoiding ineffective therapies in predicted non-responders, a critical consideration given the substantial economic burden of immunotherapy (48). From a translational standpoint, our findings carry several immediate implications (1): Risk stratification: The model can identify high-risk patients—particularly those with TP53 mutations—who may derive greater benefit from more aggressive or combination therapeutic strategies (2). Treatment selection: BRCA2-mutated patients might represent ideal candidates for immunotherapy monotherapy (3). Trial design: The model could function as an enrichment tool in future clinical trials. Notably, the excellent negative predictive value (95% in the test set) suggests particular utility in avoiding unnecessary treatment for low-risk patients. This addresses a critical current challenge in NSCLC immunotherapy: overtreatment, wherein a substantial proportion of patients experience toxicity without clinical benefit.
The model’s performance is particularly noteworthy given the challenges posed by the test set’s distinct histological composition (68.29% squamous cell carcinoma vs. 26.40% in the training set). This suggests the identified genomic signatures may transcend histological subtypes, potentially reflecting fundamental mechanisms of tumor-immune interaction. The strategic use of LASSO-Cox regression for feature selection prior to XGBoost modeling proved particularly effective, as evidenced by the model’s ability to identify clinically relevant genes such as NOTCH1—ones that might have been overlooked by conventional approaches.
We also observed that when ctDNA was combined with clinical data (gender, age, histologic type, TMB expression level), the validation set (AUC: 0.787 vs. 0.770) and the test set (AUC: 0.767 vs. 0.686) performed worse than the ctDNA-only model. This counterintuitive phenomenon may stem from three reasons: First, baseline characterization revealed significant histological distribution bias in the test set (68.29% squamous carcinoma), where clinical factors are highly sensitive to distributional variations, potentially reducing the generalizability of the combined model. Second, the 25 ctDNA features capture molecular heterogeneity of tumors, suggesting their biological information may overshadow the predictive value of clinical phenotypes (e.g., the NOTCH1 gene). Third, the limited sample size of the test set (n=41) combined with increased model complexity amplified overfitting risk, implying that pure ctDNA-based models may offer greater robustness than traditional clinical-ctDNA integration in precision medicine. This highlights the “more features ≠ better performance” paradox in biomarker research, emphasizing that biological relevance outweighs feature quantity (49).
While our results are promising, several limitations warrant consideration (1): The retrospective design introduces potential biases in patient selection (2). The relatively small test set (n=41) constrains the precision of performance estimates (3). The model does not currently incorporate imaging information, such as CT and MRI. Moving forward, we plan to conduct prospective validation in larger multicenter cohorts, further integrate radiological features and serial ctDNA measurements, and perform deeper mechanistic studies to clarify the biological basis underlying NOTCH1’s predictive role.
5 Conclusions
This study developed an interpretable ctDNA-based machine learning model for predicting PFS in NSCLC patients receiving immunotherapy. SHAP analysis identified TP53, BRCA2, and other genomic predictors, while elucidating their underlying biological mechanisms. The model’s consistent performance across diverse datasets highlights its clinical potential, although prospective validation is required to guide personalized therapeutic strategies.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://github.com/Yali15207856138/improved-sn, www.clinicalstudydatarequest.com.
Ethics statement
The studies involving humans were approved by The Ethics Committee of The Second Affiliated Hospital of Guizhou Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. As a retrospective study, this research strictly adhered to the Declaration of Helsinki, with full confidentiality of patient information ensured; a waiver of informed consent was granted by the institutional review board due to the retrospective nature of the analysis.
Author contributions
YL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. JX: Conceptualization, Data curation, Investigation, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing. TH: Conceptualization, Data curation, Investigation, Writing – review & editing. YH: Conceptualization, Data curation, Investigation, Writing – review & editing. DBZ: Conceptualization, Data curation, Investigation, Writing – review & editing. DZ: Conceptualization, Data curation, Investigation, Writing – review & editing. BL: Data curation, Formal analysis, Investigation, Writing – original draft, Writing – review & editing. MnZ: Data curation, Formal analysis, Investigation, Writing – review & editing. ZH: Data curation, Formal analysis, Investigation, Writing – review & editing. MZ: Data curation, Formal analysis, Investigation, Writing – review & editing. XL: Data curation, Formal analysis, Investigation, Writing – review & editing. MW: Data curation, Formal analysis, Investigation, Writing – review & editing. HL: Data curation, Formal analysis, Investigation, Writing – review & editing. FL: Formal analysis, Investigation, Project administration, Resources, Supervision, Writing – review & editing. CZ: Data curation, Investigation, Resources, Supervision, Writing – review & editing. XZ: Data curation, Investigation, Resources, Supervision, Writing – review & editing. SS: Conceptualization, Formal analysis, Methodology, Supervision, Writing – original draft, Writing – review & editing. JP: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declared financial support was received for this work and/or its publication. This work was supported by the Qian Dong Nan Science and Technology Program (No. qdnkhJz [2023] 14), Scientific Research Project of Guizhou Provincial Health and Wellness Commission (No. gzwkj2024-099, No. gzwkj2025-608), Guizhou Medical University National Natural Science Foundation Cultivation Project (No.25NSFCP35), Public Hospital High-Quality Development Research Public Welfare Project Fund(No.GL-A014), Cultivation of High-Level Innovative Talents in Guizhou Province (No.qian qian ceng ren cai[2024]202215), Spark Program (No. XHJH-0048), CHEN XIAO-PING FOUNDATION FOR THE DEVELOPMENT OF SCIENCE AND TECHNOLOGY OF HUBEI PROVINCE (No.CXPJJH125009-05) and WU JIEPING MEDICAL FOUNDATION (No.320. 6750.2025- 16-21).
Acknowledgments
We thank all members for their invaluable contributions.
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1686260/full#supplementary-material
References
1. Rebecca LS, Angela NG, and Ahmedin J. Cancer statistics, 2024. CA: Cancer J Clin. (2024). doi: 10.3322/caac.21820
2. Nicolas G. New strategies and novel combinations in EGFR TKI-resistant non-small cell lung cancer. Curr Treat options Oncol. (2022). doi: 10.1007/s11864-022-01022-7
3. Larissa AP, Varune Rohan R, Stephen L, and Wan LL. Genetic alterations defining NSCLC subtypes and their therapeutic implications. Lung Cancer (Amsterdam Netherlands). (2013). doi: 10.1016/j.lungcan.2013.07.025
4. Aritraa L, Avik M, Pravin DP, Navneet S, Purvish MP, Bharti B, et al. Lung cancer immunotherapy: progress, pitfalls, and promises. Mol Cancer. (2023). doi: 10.1186/s12943-023-01740-y
5. Michael JG, Roy SH, and Sarah BG. Selecting the optimal immunotherapy regimen in driver-negative metastatic NSCLC. Nat Rev Clin Oncol. (2021). doi: 10.1038/s41571-021-00520-1
6. Julian AMA, ErinMarie OK, and Yanyan L. Next generation of immune checkpoint inhibitors and beyond. J Hematol Oncol. (2021). doi: 10.1186/s13045-021-01056-8
7. Leena G, Delvys RA, Shirish MG, Emilio E, Enriqueta F, Flávia De A, et al. Pembrolizumab plus chemotherapy in metastatic non-small-cell lung cancer. New Engl J Med. (2018). doi: 10.1056/NEJMoa1801005
8. Silvia N, Dariusz MK, Alexander L, Mahmut G, David V, Julien M, et al. Pembrolizumab plus chemotherapy in squamous non-small-cell lung cancer: 5-year update of the phase III KEYNOTE-407 study. J Clin Oncol Off J Am Soc Clin Oncol. (2023). doi: 10.1200/JCO.22.01990
9. Heather AW, Moïshe L, Terufumi K, Masahiro T, Se-Hoon L, Shugeng G, et al. Perioperative pembrolizumab for early-stage non-small-cell lung cancer. New Engl J Med. (2023). doi: 10.1056/NEJMoa2302983
10. Marina Chiara G, Shirish MG, Giovanna S, Enriqueta F, Emilio E, Manuel D, et al. Pembrolizumab plus pemetrexed and platinum in nonsquamous non-small-cell lung cancer: 5-year outcomes from the phase 3 KEYNOTE-189 study. J Clin Oncol Off J Am Soc Clin Oncol. (2023). doi: 10.1200/JCO.22.01989
11. Jingchao J, Jing Y, Leimin Q, Biao Z, Xiaodong T, Shuanghai L, et al. Controlled siRNA release of nanopolyplex for effective targeted anticancer therapy in animal model. Int J nanomedicine. (2024). doi: 10.2147/IJN.S443636
12. Li Wen C, Lu Yao F, and Zhiyong S. Application research progress of nanomaterial graphene and its derivative complexes in tumor diagnosis and therapy. Curr medicinal Chem. (2024). doi: 10.2174/0109298673251648231106112354
13. Cao K, Zhou Y, Shen Y, Wang Y, Huang H, and Zhu H. Combined photothermal therapy and cancer immunotherapy by immunogenic hollow mesoporous silicon-shelled gold nanorods. J Pharm Sci. (2024) 113:2232–44. doi: 10.1016/j.xphs.2024.03.007
14. Xin Y, Chaonan H, and Chunxia S. Immunotherapy resistance of lung cancer. Cancer Drug resistance (Alhambra Calif). (2022). doi: 10.20517/cdr.2021.101
15. Steve L, Julie ES, David LR, Daphne WW, Michael AB, Douglas BJ, et al. Comparison of biomarker modalities for predicting response to PD-1/PD-L1 checkpoint blockade: A systematic review and meta-analysis. JAMA Oncol. (2019). doi: 10.1001/jamaoncol.2019.1549
16. Jiachen X, Rui W, Yiran C, Shangli C, Lin W, Baolan L, et al. Circulating tumor DNA-based stratification strategy for chemotherapy plus PD-1 inhibitor in advanced non-small-cell lung cancer. Cancer Cell. (2024). doi: 10.1016/j.ccell.2024.08.013
17. Yunfang Y, Dongqiang Z, Qiyun O, Shengbo L, Anlin L, Yongjian C, et al. Association of survival and immune-related biomarkers with immunotherapy in patients with non-small cell lung cancer: A meta-analysis and individual patient-level analysis. JAMA network Open. (2019). doi: 10.1001/jamanetworkopen.2019.6879
18. Marina AL, Denis SF, Pardabekova OA, Nosova MV, Veronika T, Vladimir L, et al. The efficacy of immune checkpoint inhibitors in patients with cancer with pseudoprogression. J Clin Oncol. (2024). doi: 10.1200/jco.2024.42.16_suppl.2620
19. Meng-Yu C and Yue-Can Z. Pseudoprogression in lung cancer patients treated with immunotherapy. Crit Rev oncology/hematology. (2021). doi: 10.1016/j.critrevonc.2021.103531
20. Xinghao A, Bo J, Zhiyi H, Junping Z, Minglei Z, Jun Z, et al. Noninvasive early identification of durable clinical benefit from immune checkpoint inhibition: a prospective multicenter study (NCT04566432). Signal transduction targeted Ther. (2024). doi: 10.1038/s41392-024-02060-3
21. Jie P, Baowen X, Honglian M, Rui W, Xiao H, and Zhongjun H. Deep learning based on computed tomography predicts response to chemoimmunotherapy in lung squamous cell carcinoma. Aging Dis. (2024). doi: 10.14336/AD.2024.0169
22. Jie P, Jing Z, Dan Z, Lushan X, Honglian M, Xudong Z, et al. Deep learning to estimate durable clinical benefit and prognosis from patients with non-small cell lung cancer treated with PD-1/PD-L1 blockade. Front Immunol. (2022) 13:960459. doi: 10.3389/fimmu.2022.960459
23. Zhichao L, Guo L, Zeping Y, Linduo L, Xingchen W, Jingrong S, et al. Predictive mutation signature of immunotherapy benefits in NSCLC based on machine learning algorithms. Front Immunol. (2022) 13:989275. doi: 10.3389/fimmu.2022.989275
24. Seong-Keun Y, Conall F, Byuri Angela C, Bailey GF, Catherine YH, Elizabeth SK, et al. Prediction of checkpoint inhibitor immunotherapy efficacy for cancer using routine blood tests and clinical data. Nat Med. (2025). doi: 10.1038/s41591-024-03398-5
25. Yawei L, Wu X, Deyu F, and Yuan L. Informing immunotherapy with multi-omics driven machine learning. NPJ digital Med. (2024) 333:71–4. doi: 10.1038/s41746-024-01043-6
26. World medical association declaration of helsinki: ethical principles for medical research involving human participants. Jama. (2024). doi: 10.1001/jama.2024.21972
27. Ramtin Zargari M. ExplaineR: an R package to explain machine learning models. Bioinf Adv. (2024). doi: 10.1093/bioadv/vbae049
28. Gang G, Miao Y, Wei X, Esteban C, and Yan C. Local activation of p53 in the tumor microenvironment overcomes immune suppression and enhances antitumor immunity. Cancer Res. (2017). doi: 10.1158/0008-5472.CAN-16-2832
29. Xiaonan Z, Simin M, Yifan Y, Dushan D, Qicai L, Saisai L, et al. A TP53 related immune prognostic model for the prediction of clinical outcomes and therapeutic responses in lung adenocarcinoma. Front Immunol. (2022) 13:876355. doi: 10.3389/fimmu.2022.876355
30. Xin C, Tianqi L, Jianqi W, Chen Z, Gefei G, Cunyi Z, et al. Molecular profiling identifies distinct subtypes across TP53 mutant tumors. JCI Insight. (2022). doi: 10.1172/jci.insight.156485
31. Max DW, Seth BC, Danique EMD, Martine HVM, Maarten S, Iris d, et al. Loss of p53 triggers WNT-dependent systemic inflammation to drive breast cancer metastasis. Nature. (2019). doi: 10.1038/s41586-019-1450-6
32. Kristen ES, Diane AF, Christine CA, Zoe LC, Amanda LB, Wentao Y, et al. T cell immune deficiency rather than chromosome instability predisposes patients with short telomere syndromes to squamous cancers. Cancer Cell. (2023). doi: 10.1016/j.ccell.2023.03.005
34. Jia J, Junjie G, Lina G, Kaihua L, Jie D, Fanshuang Z, et al. CNDP1 overexpression by promoter hypomethylation predicts poor prognosis and immunotherapy response in mucosal melanoma. Cancer Sci. (2025). doi: 10.1111/cas.70062
35. Yunfeng Y, Qianyu L, Guoquan Y, Yifei Q, Wenyun G, Songling L, et al. Targeting KMT5C suppresses lung cancer progression and enhances the efficacy of immunotherapy. Advanced Sci. (2025). doi: 10.1002/advs.202407575
36. Jian W, Yuke C, Shiwei L, Wanchang L, Xiao Albert Z, Yefei L, et al. PP2A inhibition causes synthetic lethality in BRCA2-mutated prostate cancer models via spindle assembly checkpoint reactivation. J Clin Invest. (2023). doi: 10.1172/JCI172137
37. Robert MS, Chung-Han L, Alexander NS, Matthew DH, Ronglai S, Yelena YJ, et al. Tumor mutational load predicts survival after immunotherapy across multiple cancer types. Nat Genet. (2019). doi: 10.1038/s41588-018-0312-8
38. Qingmiao S, Chen X, Yifan Z, Xin Y, Qingfei C, Shuwen J, et al. Notch signaling pathway in cancer: from mechanistic insights to targeted therapies. Signal transduction targeted Ther. (2024). doi: 10.1038/s41392-024-01828-x
39. Lu Z, Du W, Jiao X, Wang Y, Shi J, Shi Y, et al. NOTCH1 mutation and survival analysis of tislelizumab in advanced or metastatic esophageal squamous cell carcinoma: A biomarker analysis from the randomized, phase III, RATIONALE-302 trial. J Clin Oncol Off J Am Soc Clin Oncol. (2025) 43:1898–909. doi: 10.1200/JCO-24-01818
40. Nicholas JW, Zachary S, Kelly LA, Laura JB, Wilson L, Charlotte MP, et al. Loss-of-function mutations in Notch receptors in cutaneous and lung squamous cell carcinoma. Proc Natl Acad Sci United States America. (2011). doi: 10.1073/pnas.1114669108
41. Nicolas S, Ann Marie E, Aaron DT, Aleksandar DK, Kristian C, Andrey S, et al. The mutational landscape of head and neck squamous cell carcinoma. Science. (2011). doi: 10.1126/science.1208130
42. Matthew AC, Sophie de Carné T, Sareena R, Davide Z, Christopher M, Míriam MA, et al. Oncogenic RAS signaling promotes tumor immunoresistance by stabilizing PD-L1 mRNA. Immunity. (2017). doi: 10.1016/j.immuni.2017.11.016
43. Chengling L, Xingchen L, Pengjuan C, Haiming X, Xin L, and Sailing Z. Circadian rhythm related genes identified through tumorigenesis and immune infiltration-guided strategies as predictors of prognosis, immunotherapy response, and candidate drugs in skin cutaneous Malignant melanoma. Front Immunol. (2025). 16:1513750. doi: 10.3389/fimmu.2025.1513750
44. Liu Y, Jiahong J, Song Y, Yaping X, and Dongsheng H. Circulating tumor DNA analysis to detect minimal residual disease and predict recurrence in patients with resectable pancreatic cancer. J Clin Oncol. (2020). doi: 10.1200/jco.2020.38.15_suppl.e16799
45. Aaron CT, Gillianne L, Stephanie PLS, Kevin Lee Min C, Angela T, Boon-Hean O, et al. MA07.06 circulating tumor DNA for monitoring minimal residual disease and early detection of recurrence in early stage lung cancer. J Thorac Oncol. (2021). doi: 10.1016/j.jtho.2021.08.144
46. Hua C, Xinyi L, Yixin C, Pan-Chyr Y, Tanxiao H, Lele S, et al. Circulating tumor DNA is capable of monitoring the therapeutic response and resistance in advanced colorectal cancer patients undergoing combined target and chemotherapy. J Clin Oncol. (2020). doi: 10.1200/jco.2020.38.15_suppl.e15524
47. Lorenzo G, Carolina R, Andrew AD, Marko V, Katherine C, Whitney LH, et al. Defining resistance mechanisms to CDK4/6 inhibition in hormone receptor-positive HER2-negative metastatic breast cancer (MBC) through a machine learning approach applied to circulating tumor DNA (ctDNA). J Clin Oncol. (2022). doi: 10.1200/jco.2022.40.16_suppl.3055
48. Mingjun R, Yingcheng W, Yunfei L, and Zhengyang F. Immunotherapy guided by immunohistochemistry PD-L1 testing for patients with NSCLC: A microsimulation model-based effectiveness and cost-effectiveness analysis. BioDrugs: Clin immunotherapeutics biopharmaceuticals Gene Ther. (2023). doi: 10.1007/s40259-023-00628-z
Keywords: NSCLC, immunotherapy, CtDNA, PFS, XGBoost
Citation: Li Y, Xia J, He T, Hu Y, Zhou D, Zou D, Li B, Zhang M, Huang Z, Zhang M, Liu X, Wang M, Luo H, Lu F, Zhang C, Zhao X, Su S and Peng J (2025) Development and validation of an interpretable machine learning model for predicting progression-free survival after immunotherapy in patients with non-small cell lung cancer: a multicenter study. Front. Immunol. 16:1686260. doi: 10.3389/fimmu.2025.1686260
Received: 15 August 2025; Accepted: 03 December 2025; Revised: 28 October 2025;
Published: 19 December 2025.
Edited by:
Paulo Rodrigues-Santos, University of Coimbra, PortugalCopyright © 2025 Li, Xia, He, Hu, Zhou, Zou, Li, Zhang, Huang, Zhang, Liu, Wang, Luo, Lu, Zhang, Zhao, Su and Peng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jie Peng, c2FuazQ0QHNpbmEuY29t; Shengfa Su, c3VzaGVuZ2ZhMjAwNUAxNjMuY29t
†These authors have contributed equally to this work
Ya Li1,2,3,4†