Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 12 January 2026

Sec. Cancer Immunity and Immunotherapy

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1656375

This article is part of the Research TopicPredictive Biomarkers to Immune Checkpoint Inhibitors in Lung CancerView all 14 articles

Identification of a prognostic signature based on immunogenic adverse event-related genes to guide therapy for non-small cell lung cancer

Jun Zhu,&#x;Jun Zhu1,2†Qing Ye,&#x;Qing Ye1,3†Gang Li,Gang Li1,2Lidong Liu,Lidong Liu1,2Shushu Tian,Shushu Tian1,2Shuangyan Li,Shuangyan Li1,2Maoru Wen,Maoru Wen1,2Luying ShenLuying Shen2Jiang Wang,Jiang Wang1,2Xinmiao Song*Xinmiao Song4*Hong Chen,*Hong Chen1,2*Yi Li,*Yi Li1,2*
  • 1Kunming Medical University, Kunming, Yunnan, China
  • 2thDepartment of Oncology, 920 Hospital of Joint Logistics Support Force, Kunming, Yunnan, China
  • 3Department of Radiation Oncology, The First Affiliated Hospital of Kunming Medical University, Kunming, Yunnan, China
  • 4Department of Radiation Oncology, Eye, Ear, Nose, and Throat Hospital, Fudan University, Shanghai, China

Background: Immune checkpoint inhibitors (ICIs) improve outcomes in non−small cell lung cancer (NSCLC), yet reliable predictive biomarkers are still lacking. Given that immune−related adverse events (irAEs) often correlate with better ICI efficacy, this study aimed to develop and validate an irAE−related gene signature for risk stratification and treatment–response prediction in NSCLC.

Methods: Transcriptomic and clinical data were obtained from the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases, designated as training and validation cohorts, respectively. Least absolute shrinkage and selection operator (LASSO) regression and Cox proportional hazards models were applied to identify eight irAE-associated genes for constructing a risk score (RS). Multivariate analyses evaluated differences in overall survival (OS), immunotherapy response, irAE incidence, immune escape potential, and tumor microenvironment (TME) profiles between high- and low-risk groups. Prognostic performance was validated using Kaplan–Meier curves, Cox regression, and receiver operating characteristic (ROC) analysis. A nomogram integrating RS with clinical factors was developed to improve prediction stability.

Results: Via gene set variation analysis (GSVA) enrichment of 30 known immunologic gene sets in TCGA-LC, we stratified samples by immune score (cutoff = 0.23), identified 1,057 differentially expressed genes (DEGs) between groups, intersected with cancer-normal DEGs, and selected 132 irAE-related DEGs via Pearson correlation—genes functionally tied to T-cell activation and cytokine-mediated signaling pathways. LASSO–Cox regression derived an eight-gene prognostic signature demonstrating robust cross-cohort validation [area under the curve (AUC): TCGA = 0.770, GSE50081 = 0.767, GSE37745 = 0.758] and predictive accuracy for irAEs in GSE186143 (AUC = 0.807). In a validation NSCLC cohort (n = 41), irAE occurrence correlated with superior clinical outcomes: objective response rate (87.50% vs. 44.00%), disease control rate (93.75% vs. 76.00%), median progression-free survival (20.0 vs. 9.9 months), and overall survival (21.4 vs. 12.1 months). Three key risk genes (POU2AF1, ANKRD44, CRTAM) showed significantly elevated protein expression levels and positive rates in irAE patients. High RS cohorts exhibited immunosuppressive TME characteristics and increased immune escape potential. A clinically relevant nomogram integrating RS with clinical factors (age, stage, and tumor recurrence) improved prognostic stability.

Conclusions: Our irAE-associated gene signature robustly stratifies NSCLC patients for immunotherapy response and survival. Integrating RS with clinical parameters provides a practical tool to balance efficacy and safety.

1 Introduction

Lung cancer is the most common malignant tumor with the highest mortality globally, and over 87% of patients have non-small cell lung cancer (NSCLC) (1). In recent years, immunotherapy designed to induce a cellular immune response has become a mainstay of cancer therapy. Specifically, immune checkpoint inhibitors (ICIs), targeting the programmed cell death protein-1/programmed cell death ligand-1 (PD-1/PD-L1) axis, have been approved for first- or second-line treatments of NSCLC, regardless of the patient’s history or driver mutational status (2, 3). As compared with conventional chemotherapy, immunotherapy can achieve unprecedented and sustained responses in patients with advanced cancer (4, 5). However, the efficacy of ICIs is of remarkable diversity, with an objective response rate of less than 30% in patients with advanced NSCLC (6). Therefore, there is an urgent clinical need for novel tools to predict therapeutic efficacy, which could guide clinical treatment decisions more effectively.

The widespread use of ICIs is often accompanied by an increased risk of immune-related adverse events (irAEs) (7, 8). The incidence of irAEs in patients treated with ICIs reaches up to 70%, which can diminish quality of life, increase healthcare costs, and even lead to severe toxic effects (9, 10). However, due to the immune system activation and inflammatory response induced by ICIs, irAEs have been reported to correlate with improved survival in cancer patients treated with ICIs (11, 12). For example, cutaneous irAEs have been identified as an independent prognostic factor for progression-free survival (PFS) in melanoma patients treated with nivolumab (13). Similarly, patients with irAEs have shown increased PFS and higher objective response rate (ORR) compared to those without irAEs in NSCLC (14). Studies further revealed that the occurrence of irAEs, such as rash, was closely relevant to the efficacy of ICIs in hepatocellular carcinoma patients (15). Recently, advances in genome sequencing technologies have provided new approaches to elucidate the underlying cancer gene/pathway-associated molecular mechanisms, facilitating the identification of both tumor intrinsic and extrinsic factors as predictive biomarkers. Therefore, focusing on molecular characteristics of irAEs could offer important insights into precision medicine for NSCLC patients.

In the present study, we constructed a risk signature of eight genes crucially associated with irAEs. We evaluated the differences in overall survival (OS), immunotherapy response, irAEs, immune escape, and tumor microenvironment (TME) between high- and low-risk score (RS) groups. Finally, we developed a nomogram model to improve the utility of RS.

2 Materials and methods

2.1 Study design and data acquisition

Our study design is illustrated in Figure 1. The gene expression data and clinicopathological parameters of lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) were downloaded from The Cancer Genome Atlas (TCGA) database (https://gdc-portal.nci.nih.gov/), including 994 cancerous and 107 normal samples with prognostic information. Additionally, gene expression patterns of lung cancer with prognostic information were also obtained from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/), namely, GSE50081 (16), GSE37745 (17), and GSE248378 (18). Meanwhile, GSE186143 (19), an independent cohort of malignant melanoma receiving ICI treatment, was also downloaded for the subsequent study. All data were assessed using the Illumina HiSeq 2000 RNA Sequencing platform.

Figure 1
Flowchart illustrating a research process on NSCLC. Steps include sample collection, obtaining and analyzing differentially expressed genes (DEGs), irAE-related DEGs analysis, and constructing risk score and nomogram models. It features data collection, analysis methods, and model validation using both public databases and clinical samples, culminating in a nomogram model for patient validation.

Figure 1. Schematic diagram of this study.

2.2 Screening of irAE-related DEGs

Gene expression data for LUAD (n = 585) and LUSC (n = 550) were downloaded from TCGA, generated on the Illumina HiSeq 2000 platform as standardized log2(FPKM + 1)values. After integrating clinical information, a total of 994 NSCLC tumor samples and 107 normal control samples with complete survival data were included in the final analysis (Supplementary Tables 1, 2). To leverage transcriptomic data reflecting immune dysregulation in irAEs—a strategy supported by prior studies—we prioritized TCGA-LC for analysis. This public dataset, with comprehensive lung cancer transcriptomes and clinical annotations, allowed cost-effective validation of gene expression in large cohorts. Firstly, we curated an initial set of irAE-related genes from the literature (20, 21) and applied objective statistical criteria [FDR < 0.05, p < 1 × 10−4 and RS (Spearman’s correlation coefficients) ≥0.7] to select the 30 most significant immunologic gene sets for further analysis (Supplementary Table 3), which represent key immune pathways in irAE pathogenesis. Subsequently, using GSVA (R v1.48.0), we assessed enrichment scores (ESs) for these gene sets in tumors vs. normal lung tissues to quantify their tumor microenvironment activity. Then, the optimal ES cutoff was determined via the X-tile software (Yale University, version 3.6.1) by maximizing the separation of high-/low-ES samples in TCGA-LC, as defined by the cutoff that generated the maximum log-rank χ² statistic. This cutoff was further validated by comparing overall survival between high- and low-ES groups. Next, differentially expressed genes (DEGs) were identified using Student’s t-test on the log2(FPKM + 1) normalized expression values, with adjustment for multiple testing using the Benjamini–Hochberg false discovery rate (FDR) procedure, with significance thresholds set at FDR <0.05 and |log2FC| >1. This analysis was performed between high- and low-ES specimens, as well as between cancer and non-cancer samples. By intersection analysis of DEGs between the above different groups, the correlations between the overlapping DEGs and the 30 genes related to irAEs were analyzed by applying the “cor” function in R. The genes with threshold values of |Pearson R| >0.7 and p <0.05 were identified as irAE-related DEGs. Then, to explore the biological significance and pathway enrichment, Gene Ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis were performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (22).

2.3 RS model construction based on irAE-related DEGs

Based on TCGA-LC samples, we first identified prognosis-related genes from irAE-related DEGs using univariate Cox regression (R package survival) with a significance threshold of p <0.05. Next, we applied the least absolute shrinkage and selection operator (LASSO) regression (R package lars) to derive a parsimonious subset of prognostic genes, selecting the optimal regularization parameter (λ) via 10-fold cross-validation under the minimum-criteria rule. These candidates were then entered into a multivariate Cox regression to construct a multigene predictive model. The irAE-related prognostic risk score was calculated as follows:

RS=CoefGene1×Gene1 expression+CoefGene2×Gene2 expression+…+CoefGene n ×Gene n expression.

Where, CoefGene n is the regression coefficient of Gene n.

Using this expression, the RSs of samples in the TCGA lung cancer dataset and the GEO validation dataset were analyzed. According to the median of RSs, all specimens were divided into high- and low-risk groups in the training and validation datasets of lung cancer, and survival analysis was conducted to assess the prognosis between different risk groups via the “survival” package (p <0.05). Additionally, the differences of RS values between irAE and non-irAE groups in the GSE186143 dataset were analyzed via the Kruskal–Wallis rank sum test.

2.4 Immune microenvironment and immune escape assessment in different risk groups

Based on the expression levels in each sample of the TCGA-LC cohort, the proportion of immune cells in the tumor microenvironment was evaluated via the CIBERSORT algorithm, and the differences in immune cells (DICs) between high- and low-risk subtypes were analyzed via the “preprocessCore,” “e1071,” and “limma” packages. Additionally, the estimate package in R was applied to estimate the immune and stromal components via the ESIMATE scores, and the differences in stromal and immune scores between different risk groups were estimated using the “estimate” and “limma” packages. Furthermore, between the two risk groups, Hallmark signaling pathways were identified using gene set enrichment analysis (GSEA), which were analyzed by the “org.Hs.eg.db,” “clusterProfiler,” and “enrichplot” packages. Moreover, the expression of human leukocyte antigen (HLA) genes was analyzed via the “limma” package, with a threshold of p <0.05. The tumor immune dysfunction and exclusion (TIDE) scores of NSCLC samples were calculated utilizing the Tumor Immune Dysfunction and Exclusion (TIDE, http://tide.dfci.harvard.edu/login/) (23).

2.5 Nomogram model construction and evaluation

To study the independent prognostic factors, RSs and clinicopathological parameters in the TCGA-LC dataset were subjected to univariate and multivariate Cox regression analysis through the “survival” package, with a threshold value of p <0.05. Then, a nomogram model was constructed based on the independent parameters via the “rms” package. Concordance index (C-index) and area under the curve (AUC) for 1-, 3-, and 5-year survival probabilities were analyzed to evaluate the predictive accuracy of the nomogram model. Furthermore, to specifically assess the model’s stability and correct for overoptimism, we performed an internal validation using bootstrap resampling with 1,000 repetitions. The optimism-corrected C-index was calculated to evaluate the calibrated predictive performance.

2.6 Patients and treatments

A total of 41 patients with histologically confirmed NSCLC who were treated with ICIs were included from January 2020 to April 2023 at the 920th Hospital (Yunnan, China). All data, including patient history, laboratory results, and irAEs, were obtained retrospectively. In our study, ICIs for NSCLC included sintilimab, camrelizumab, and tislelizumab, one of which was administered at a fixed dose every 3 weeks, as there was no evidence which ICI is superior to the others. All the patients were followed up for more than 6 months, and computed tomography (CT) was performed to evaluate the patients’ therapeutic response according to the Response Evaluation Criteria in Solid Tumors (RECIST) V1.1 (24). Furthermore, their irAEs were evaluated at every visit according to the Common Terminology Criteria for Adverse Events (CTCAE) version 5.0 (25). Two independent oncologists jointly determined irAEs based on the patients’ clinical symptoms, laboratory tests (e.g., liver function indicators, thyroid function), and imaging reports. In cases of disagreement in grading, a third expert was consulted for arbitration. irAEs were classified into grades 1–5 based on severity, with events of grade ≥3 defined as severe toxic reactions. The Institutional Ethics Committee at the 920th Hospital approved this research, and informed consent based on the Declaration of Helsinki was obtained from all patients.

2.7 Immunohistochemical staining

The specimens from the patients were fixed in 10% formalin, dehydrated with a series of acetone, and embedded in paraffin. The specimens were sliced into 4-μm slices and analyzed by hematoxylin and eosin staining. Immunohistochemistry (IHC) was performed to assess the protein expression of CRTAM, ANKRD44, and POU2AF1 on formalin-fixed, paraffin-embedded (FFPE) tissue sections. The following primary antibodies were used: rabbit monoclonal anti-CRTAM (Cat. ab820487, Zen bioscience, Chengdu, China; dilution 1:100), rabbit polyclonal anti-ANKRD44 (Cat. AP50195PU-N, OriGene, Beijing, China; dilution 1:200), and rabbit recombinant monoclonal anti-POU2AF1 (Cat. ab76010, Hua’an Biotechnology, Hangzhou, China; dilution 1:200). Antigen retrieval was performed in citrate buffer (pH 6.0) at 95°C for 20 min. Sections were then incubated with primary antibodies overnight at 4°C. After washing, immunostaining was visualized using a DAB detection system and counterstained with hematoxylin. Two senior pathologists reviewed the slides independently. The intensity scoring criteria were as follows: 0, no staining; 1, <10% staining; 2, 10% to 50% staining; and 3, 50% to 100% staining. The final IHC score was the sum of both scores: 0–2 for negative and 3–6 for positive.

2.8 Statistical analysis

All data processing was performed using R version 3.6.1. The normality of continuous variables was assessed using the Shapiro–Wilk test. Based on the results of the normality test and the exploratory nature of the data, non-parametric methods were primarily employed for group comparisons. Continuous variables were analyzed using the Kruskal–Wallis test and the Wilcoxon rank-sum test, while categorical variables were compared using the chi-square (χ²) test. Spearman’s correlation test was applied to determine correlation coefficients. Independent predictive factors were identified through Cox proportional hazards regression analysis. Statistical significance was defined as p <0.05.

3 Results

3.1 Construction of RS for NSCLC

The screening strategy of irAE-related DEGs in our study is shown in Figure 2A. GSVA was conducted to assess the ESs of 30 immunologic gene sets reported in the literature based on the TCGA-LC dataset. Normal lung tissues exhibited significantly higher ES values than NSCLC samples (Supplementary Figure S1A, p < 0.05), confirming immune pathway suppression in tumors. According to the optimal cutoff value of 0.23, the lung cancer samples were divided into high-ES groups (n = 761) and low-ES groups (n = 233). Notably, the OS of the high-ES group was significantly longer than that of the low-ES group (p < 0.05) (Supplementary Figure S1B). Then, 1,057 DEGs between the high- and low-ES groups were identified. After intersection with 3,263 DEGs between cancer and normal groups, 565 overlapping DEGs were obtained (Figures 2B, C; Supplementary Table 4). By conducting further Pearson correlation analysis, 132 irAE-related DEGs were identified (Figure 2D), with a total of 607 pairs of connections (|Pearson R| > 0.7 and p < 0.05, Supplementary Table 5).

Figure 2
Composite image of data analysis results related to immunological gene sets and non-small cell lung cancer (NSCLC).   A: Flowchart showing the analysis workflow, highlighting gene selection and intersection processes.  B: Two volcano plots illustrating differentially expressed genes (DEGs) in NSCLC versus control and low versus high enrichment scores (ES).  C: Venn diagram showing the overlap of DEGs between two comparisons.  D: Network diagram displaying the relationship between DEGs.  E: Dot plot of gene ontology terms related to immune processes, ordered by relevance.  F: Forest plot and LASSO analysis graphs indicating gene significance and survival prediction.  G: Dot plot for KEGG pathway analysis related to cell signaling.  H: Kaplan-Meier survival plots for several genes, showing survival differences based on expression levels.

Figure 2. Construction of risk scores for NSCLC. (A) The screening process of irAE-related DEGs. (B) DEGs in the NSCLC vs. normal groups (left) and the low- vs. high-ES groups (right). (C) Overlapping DEGs between the low- vs. high-ES groups and the NSCLC vs. normal groups. (D) Correlation network of 30 genes reported by the literature and 132 irAE-related DEGs. (F) Univariate Cox analysis (left) and LASSO parameter plots (right) of 14 irAE-related genes. (E, G) GO enrichment analysis (E) and KEGG signaling pathway analysis associated with irAE-related DEGs (G). (H) KM plots of eight irAE-related genes. Blue and red curves represent low- and high-expression subgroups, respectively.

To elucidate the biological significance and pathway enrichment underlying the irAE-related DEG set, GO and KEGG pathway analyses were conducted, respectively. As shown by GO analysis, 50 biological processes, including cytokine activation, cell adhesion, and receptor recognition, were significantly enriched. Meanwhile, some immunologic regulatory pathways, such as immune response, inflammatory response, and positive regulation of T-cell proliferation, were identified via KEGG analysis (Figures 2F, G; Supplementary Table 6).

Moreover, according to the prognostic information in the TCGA-LC dataset, 14 prognosis-related genes were identified from 132 irAE-related DEGs. To avoid overfitting, LASSO regression analysis was further performed. A panel of eight feature genes was selected and used to generate the RS model as follows (Figure 2E):

RS = (−0.0443) × ANKRD44 + (−0.0134) × POU2AF1 + (−0.0056) × P2RY13 + (−0.0056) × CXCL13 + (−0.0036) × CD69 + (−0.0025) × HLA-DMB + (−0.0002) × ARHGAP25 + (−0.0160) × CRTAM

Furthermore, the TCGA-LC training dataset was clustered into different subtypes based on the medium expression of each feature gene. Kaplan–Meier (KM) survival analysis revealed that there was a significant difference in prognosis between the high and low subtypes (Figure 2H, p < 0.05).

3.2 Validation of the prognostic value of RS

The RS of each sample was analyzed in the TCGA lung cancer dataset. According to the median level of RS, patients were stratified into high- and low-risk groups. Compared to the TCGA lung cancer cases in the low-risk group, the OS rate in the high-risk group was shorter (Figure 3A). ROC analysis showed that the predictive value (AUC) of this RS for the 5-year survival rates of lung cancer patients in the TCGA dataset was 0.770 (Figure 3D). We also observed that the proportion of deaths in the high-risk group was higher than in the low-risk group (Figures 3B, C). Furthermore, the prognostic value of the RS was also validated in the GSE50081 and GSE37745 datasets. In these two cohorts, the OS rates of patients with high RSs were significantly lower than those with low RSs (Figures 3E, I), and the AUC values for predicting the 5-year survival rate were 0.767 and 0.758, respectively (Figures 3H, L). Meanwhile, a higher proportion of deaths was also identified in the high-risk group (Figures 3F, G, J, K).

Figure 3
Grouped graphs display survival analysis data. Graphs A, E, and I show survival curves comparing high and low-risk groups. Graphs B, F, and J plot risk scores against patient numbers. Graphs C, G, and K depict survival time relative to risk scores with status (alive or dead) indicated. Graphs D, H, and L are ROC curves with AUC values of 0.770, 0.767, and 0.758, respectively.

Figure 3. The prognostic value of the RS models. (A–D) KM curves, RS distribution, survival status, and ROC curve in the TCGA cohort, respectively. (E–H) KM curves, RS distribution, survival status, and ROC curve in GSE50081, respectively. (I–L) KM curves, RS distribution, survival status, and ROC curve in GSE37745, respectively. For KM curves, the blue and red curves represent the survival ratio in the low- and high-RS groups, respectively.

3.3 Predictive value of immunotherapy response

To investigate the ability of RS to predict immunotherapy efficacy, RS was applied to the GSE248378 cohort, in which all lung cancer patients had complete clinical information of ICI treatment. Firstly, the expression of eight signature genes of RS (ANKRD44, POU2AF1, P2RY13, CXCL13, CD69, HLA-DMB, ARHGAP25, and CRTAM) was investigated between the two RS groups. In the low-RS group, eight feature genes were upregulated (Figures 4A–H). When compared to the high-risk group, the patients in the low-risk group had significantly better OS (Figure 4I). Time-dependent ROC analysis showed that the AUC values at 1, 3, and 5 years for predicting OS associated with ICI treatment were 0.852, 0.784, and 0.763, respectively (Figures 4J). The proportion of patients with no benefits was lower in the low-risk group than in the high-risk group (Figure 4K). Moreover, by TIDE analysis, we found that the patients with low RSs had lower TIDE scores than those with high RSs (Figure 4L). These results indicated that patients in the high-risk group exhibited resistance to ICIs.

Figure 4
Boxplots A-H display gene expression levels of POU2AF1, ANKRD44, CRTAM, CD69, CXCL13, ARHGAP25, HLA-DMB, and P2RY13 for high and low-risk groups. Plot I shows overall survival probabilities for different risks. Plot J illustrates ROC curves for risk prediction at one, three, and five years. Plot K presents percentages of 'No Benefits' in high and low-risk groups. Plot L is a violin plot of TIDE scores comparing low and high risk, with significant differences indicated by p-values.

Figure 4. Predictive value of RS for immunotherapy response in GSE248378. (A–H) Differential expressions of eight signature genes in different risk groups. (I, J) Kaplan–Meier (I) and time-dependent ROC (J) curves in different risk groups. (K) The proportion of patients with no benefits in different risk groups. (L) The TIDE scores in different risk groups.

3.4 Predictive value of irAEs

Considering that the RS was conducted based on irAE-related genes, we evaluated the predictive value of irAEs. In the melanoma cohort GSE186143 with clinical information of irAEs, we found that the incidence of irAEs in low-risk group was significantly higher than in the high-risk group, and patients with irAEs had significantly lower RS than patients without irAEs (Figures 5A, B). Further analysis revealed that the AUC value of the RS to predict irAEs was 0.807 (Figure 5C). In addition, we investigated the potential of irAE prediction in our cohort with a total of 41 NSCLC patients with ICI treatment (Supplementary Table 7). Among them, 16 patients (39.02%) experienced at least one adverse event, and 25 patients (60.98%) had no irAEs. The most common irAEs were thyroiditis (n = 5, 31.25%), followed by reactive cutaneous capillary endothelial proliferation (RCCEP) (n = 4, 25%) and immune-related pneumonia (n = 2, 12.5%). Among the patients with irAEs, 4 patients (25%) presented with a complete response (CR), 10 patients (62.5%) developed a partial response (PR), 1 patient (6.25%) had stable disease (SD), and 1 patient (6.25%) had progressive disease (PD). Meanwhile, in the non-irAE group, the number of patients with CR, PR, SD, and PD was 2, 9, 8, and 6, respectively (Supplementary Table 8). The objective response rate (ORR, CR+PR) and the disease control rate (DCR, CR+PR+SD) were higher in the irAE group than in the non-irAE group (87.50% vs. 44.00%, 93.75% vs. 76.00%) (Figure 5D). Furthermore, patients with irAEs had a significantly longer PFS and OS than patients without irAEs (median PFS: 20.00 vs. 9.86 months; median OS: 21.37 vs. 12.08 months) (Figures 5E, F). Next, we evaluated the expressions of three risk genes with the highest weights (POU2AF1, ANKRD44, and CRTAM) in patients with irAEs versus those without irAEs. As compared to patients without irAEs, the protein level and positive rate of CRTAM, POU2AF1, and ANKRD44 were increased in patients with irAEs (Figures 5G, H). We further evaluated the correlation between survival period and protein expression and found that the OS in the ANKRD44-positive group was significantly improved than in the negative group (Figure 5I). The expression of POU2AF1, CRTAM, and ANKRD44 was significantly upregulated in lung tumor tissues with irAEs (10.5-fold, 1.97-fold, and 2.62-fold vs. no irAEs, p < 0.05) and showed a positive correlation with irAE occurrence (Supplementary Table 9). Immunohistochemical staining (10 × 20 magnification) further validated nuclear localization differences in the three risk gene expression patterns (Figure 5J).

Figure 5
The image contains multiple panels presenting data on risk groups and survival statistics:  A. Bar chart showing the percentage of status by risk group (p = 0.0202). B. Box plot of RS scores in different groups (p = 0.0036). C. ROC curve with AUC of 0.8076 for sensitivity and specificity. D. Bar chart indicating clinical efficacy distribution (p = 0.0075). E. Kaplan-Meier plot for progression-free survival (p = 0.041). F. Kaplan-Meier plot for overall survival (p = 0.035). G. Bar charts of POU2AF1, CRTAM, and ANKRD44 status (various p-values). H. Box plots for pathological score based on expression and adverse events. I. Box plots for overall survival concerning expression. J. Histological images of paracancerous tissue, lung cancer tissue with and without immune-related adverse events (irAEs), showing stains for HE, POU2AF1, CRTAM, and ANKRD44.

Figure 5. Predictive value of irAEs in GSE186143 and 41 NSCLC patients. (A–C) GSE186143. The incidence of irAEs in the low- and high-RS groups (A); RS differences between the irAE and no-irAE groups (B), with blue and red curves representing the no-irAE and irAE groups, respectively. (C) ROC curves predicting irAE occurrence, with numbers in parentheses indicating specificity and sensitivity. (D) Distribution of clinical efficacy (CR, PR, SD, and PD) after ICI treatment in patients with irAEs and no-irAEs. (E, F) The OS (E) and PFS (F) of the irAE group and the no-irAE group. (G–I) Expression of the three risk genes [POU2AF1 (G), CRTAM (H), ANKRD44 (I)] in patients with no-irAEs and irAEs (G), protein level (H), and OS (I) of negative and positive patients. (J) Expression differences of the three key model genes (POU2AF1, CRTAM, ANKRD44) in patients with lung cancer adjacent tissues, lung tumor tissues with irAEs, and lung tumor tissues with no-irAEs (image magnification of 10 × 20). Black arrows indicate areas of significant immunohistochemical positivity.

3.5 Immune analysis related to the RS

Because dysregulation of immune cell infiltration in the TME plays a crucial role in immunotherapy, we investigated the correlation between immune cell infiltration and RS based on the CIBERSORT algorithms. The proportion of 22 types of immune cells was quantified on the expression data in TCGA-LC samples. As compared with the high-risk group, the abundance of CD8+ T cells, CD4+ memory resting T cells, M1 macrophages, and activated mast cells was obviously elevated in the low-risk group (Figure 6A). Based on the estimate algorithm, we found that the stromal, immune, and ESTIMATE scores were significantly higher in the low-risk group, along with a dramatic decrease in tumor purity (Figure 6B). We further used GSEA to search for Hallmark terms across the TCGA-LC dataset and found that the significantly enriched Hallmark terms in the low-risk group were cytokine–cytokine receptor interaction, cell adhesion molecules, etc. (Figures 6D, E). Meanwhile, the differences of human leukocyte antigen (HLA) genes were assessed between the two risk groups, which were essential in antigen presentation. As compared with the high-risk group, most HLA genes, such as HLA-I and HLA-II, had higher expression levels in the low-risk group (Figure 6C). Our results indicated that risk score was closely related to TME in lung cancer patients, in which the low-risk group tended to have more immune cells with enhanced antitumor immune responses.

Figure 6
Panel of charts analyzing risk levels:   A. Box plot comparing cell content in low-risk and high-risk groups across various cell types with significance noted.  B. Four box plots show scores (Stromal, Immune, ESTIMATE, and Tumor Purity) for low-risk versus high-risk groups.  C. Box plot depicting expression levels of different genes for low-risk and high-risk groups with significance.  D. Line graph displaying enrichment scores for pathways enriched in the high-risk group.  E. Line graph displaying enrichment scores for pathways enriched in the low-risk group.

Figure 6. Immune analysis related to the RS. (A) Distribution of various immune cell types in different risk groups. (B) Stromal score, immune score, ESTIMATE score, and tumor purity in different risk groups. (C) Expression levels of HLA genes in different risk groups. (D, E) Hallmark signaling pathways significantly correlated with the low- (D) and high-risk (E) groups. * denotes p < 0.05, ** denotes p < 0.01, *** denotes p < 0.001.

3.6 A nomogram based on RS and clinicopathology parameters

To explore the independent prognostic factors for patients with lung cancer, we combined clinical features and the precalculated score value into an integrated analysis. As shown by the univariate Cox regression analysis, some factors, including age, pathological T, clinical stage, tumor recurrence, enrichment score of irAEs, and RS, were significantly related to OS (p < 0.05) (Figure 7A). The multivariate Cox regression analysis further revealed that age, clinical stage, tumor recurrence, and RS were independent prognostic factors (p < 0.05) (Figure 7B). The detailed statistical results are shown in Supplementary Table 10. The effect of age, clinical stage, and tumor recurrence on survival was shown by KM plot analysis (Figure 7C). To improve the clinical unity of RS, a nomogram was established based on the four independent factors (Figure 7D). The C-index values of our model at 1, 3, and 5 years were 0.796, 0.784, and 0.763, respectively (Figure 7E). As shown by the decision curve analysis (Figure 7F), the nomogram combined with four factors had a better clinical benefit net than that combined with a single factor. In addition, Figure 7G reflects the clinical benefit of the predictive model at different decision thresholds, indicating that the model has significant clinical value in identifying high-risk patients and guiding early intervention. Finally, bootstrap internal validation (n = 1,000) confirmed the model’s robustness. Visual comparison of the corrected calibration curves (Supplementary Figure S2) showed that our RS model’s performance surpassed all alternative predictors, highlighting its superior and stable predictive ability.

Figure 7
A series of graphs and charts analyzing survival data.   Graph A shows univariable regression Cox analysis with hazard ratios for factors like age, gender, and pathological stages.  Graph B displays multivariable regression Cox analysis highlighting similar factors.  Graph C contains survival curves for age, recurrence, and stage impacts on survival over time.  Graph D is a nomogram predicting survival probabilities based on total points from factors like age, stage, and recurrence.   Graph E plots actual versus predicted survival probabilities at 1, 3, and 5 years.  Graph F illustrates standardized net benefits against high-risk thresholds.  Graph G shows the number of high-risk individuals and those with events across cost-benefit ratios.

Figure 7. Construction of the nomogram model based on multi-omics. (A, B) Forest plots of univariate (A) and multivariate (B) Cox regression for clinical parameters and RS. (C) KM plots for different subgroups of age (left), recurrence (middle), and stage (right). (D) Nomogram plot for the survival prediction model based on independent prognostic factors. (E) Consistency between predicted and actual survival rates at 1, 3, and 5 years, with the horizontal axis indicating predicted survival rates and the vertical axis indicating actual survival rates. (F, G) Decision line analysis curve plot [decision analysis curve (F) and clinical impact curve (G)].

4 Discussion

Immunotherapy is one of the most important treatments for patients with advanced NSCLC, while only a fraction of patients have experienced long-term benefit from such treatments (26). Therefore, improved models urgently needed to stratify patients are a prerequisite for precision cancer medicine. Accumulated evidence has identified irAEs to be closely related to the therapeutic effects of ICIs (14, 2729). Although prognostic scoring systems have been used to predict the outcome of cancer patients, there have been no studies that focused on the irAE gene-related prognostic signature in NSCLC.

Current immunotherapeutic biomarkers, including PD-L1 expression (TPS ≥ 50%) and tumor mutational burden (TMB-high, ≥10 mut/Mb), exhibit considerable limitations that constrain their predictive reliability. While associated with improved responses to ICIs, their efficacy is substantially undermined by tumor spatiotemporal heterogeneity; single-biopsy response rates in heterogeneous tumors can be as low as 15%–20%, and the concordance between PD-L1 and TMB positivity is only 54.8% (30). Although tertiary lymphoid structure (TLS) signatures offer an alternative for PD-L1-negative cases, their predictive accuracy remains unstable. Combination strategies have been explored to improve predictive power, yet they introduce new complexities. For instance, the co-occurrence of high TMB and high PD-L1 expression (TPS ≥ 50%) was associated with an objective response rate of up to 57% in patients treated with anti-PD-1 therapy (31). Similarly, the combination of an interferon-gamma (IFN-γ) gene signature with TMB identified patient subgroups with pathological response rates (pRR) ranging from 39% to 100% (32). However, their clinical translation is hindered by technical and biological complexities, as well as the low prevalence of dual-positive patients, limiting broad applicability. Notably, established benchmarks such as TMB and TIDE show AUC values ranging from 0.62 to 0.75 in predicting sustained clinical benefits or responses to immunotherapy, underscoring an unmet need for more robust predictors (33, 34). Against this backdrop, we developed a novel RS based on eight irAE-related genes, which effectively stratified patients into distinct risk groups with significantly divergent OS. The prognostic robustness of the RS was consistently validated across external datasets (GSE50081 and GSE37745). Our RS model demonstrated comparatively higher predictive performance, with AUCs of 0.852, 0.784, and 0.763 for 1-, 3-, and 5-year ICI-related OS predictions. This suggests that our model may provide complementary value to existing benchmarks like TMB and TIDE. Low-RS patients derived greater benefit from ICIs, showing alignment with and potential superiority to TIDE analysis, a computational framework known to outperform single biomarkers (23). This was further supported by significantly lower TIDE scores (indicative of immune escape and poor response) in the low-RS group, confirming the model’s capacity to capture critical tumor immune microenvironment dynamics. To enhance clinical applicability, we integrated the RS with key clinical features into a nomogram, which demonstrated higher predictive accuracy (C-index) and stability than the RS alone. By integrating clinical and molecular data, our model overcomes key limitations of single biomarkers, providing a cost-effective and reliable tool for personalized NSCLC immunotherapy where traditional biomarkers fall short.

Moreover, our RS presented a promising prediction effect of irAE incidence in a melanoma cohort. To further test its value in NSCLC, we validated the expression of RS genes with the top 3 genes ranked by regression coefficient. POU2AF1, a B-cell-specific transcriptional co-activator, regulates immune response via interaction with the octamer-binding transcription factors Oct-1 and Oct-2 (35). POU2AF1 is involved in promoting CD4+ T-cell memory and prospectively identifies memory precursors, which could improve the durable antitumor immune response (36, 37). CRTAM, a class I-restricted T-cell-associated molecule, is a cell adhesion molecule expressed by activated cytotoxic lymphocytes (38, 39). CRTAM–Necl-2 interactions promote cytotoxicity of NK cells and interferon-gamma secretion of CD8+ T cells in lung cancer, thus exerting the immunosurveillance role to distinguish tumors from normal cells (40). CRTAM also promotes antitumor immune response by enhanced CD8+ T-cell infiltration in breast cancer, which may correlate with the increased expression of MHC class I molecules (41). The ANKRD44 gene encodes for ankyrin repeat domain proteins, a member of a putative regulatory ARS subunit of the protein phosphatase 6 (PP6) complex (42, 43). ANKRD44 gene silencing constitutively activates the NF-kb protein via the TAK/AKT pathway, which leads to trastuzumab resistance via cell proliferation and glycolysis in breast cancer cells (44). ANKRD44 is also involved in immune cell metabolism and inflammation regulation via the mTOR pathway (4547). In our NSCLC cohort, the presence of irAEs was mostly relevant to the efficacy of ICIs as well as improved survival. In line with the expression levels in the low-RS group, the increased expression of POU2AF1, CRTAM, and ANKRD44 was observed in the patients with irAEs, suggesting the potential of RS to predict irAEs in NSCLC. It is important to note that the mechanistic roles of POU2AF1, CRTAM, and ANKRD44 in irAEs are inferred from prior literature and clinical associations. While these inferences provide a rationale for the predictive capacity of our RS model, they remain hypothetical without direct functional validation in this study. The precise molecular pathways by which these genes influence irAE development thus require further investigation. Future studies using in vitro and in vivo models are essential to definitively establish their causal roles and underlying mechanisms in irAEs.

Due to the fact that immunological characteristics were associated with immune activity and immunotherapy efficacy, we explored the individual heterogeneity in the immune microenvironment. In our study, the abundance of cancer-suppressive immune cells was significantly decreased in patients with high RSs, accompanied by an increase in cancer-promoting immune cells. Moreover, we found that the risk score was highly negatively associated with immune and stromal scores while showing a positive association with tumor purity. In addition, the downregulated expression of most HLA genes was found in the high-risk group, which could prevent T cells from being activated to effectively kill tumors due to the lack of antigen presentation. Therefore, we speculated that the worse survival condition and therapeutic response may be due to the formation of an immune-suppressive microenvironment in patients with high RSs. However, this study has several limitations. First, our findings were obtained from public databases through retrospective analysis; thus, multicenter study cohorts still need to be used to validate the accuracy and stability of the RS. Second, we only validated the expression of the top 3 risk genes with high weight in our NSCLC cohort via IHC, and in vitro and in vivo assays designed to explore the functions and the underlying molecular mechanism need to be performed in a follow-up study. Third, while we integrated transcriptomics with clinical parameters to build the nomogram model, this transcriptomics-centric approach precluded the exploration of post-transcriptional, epigenetic, or proteomic mechanisms underlying irAE pathogenesis. Finally, the validation cohorts were primarily composed of Asian populations, which may limit the generalizability of our findings to other ethnic groups, such as those of African and European ancestry. Future work should incorporate multi-omics data to comprehensively dissect the molecular drivers of irAEs and refine the model’s mechanistic depth.

Overall, the risk signature constructed by eight key genes associated with irAEs had remarkable prognostic value, and the RS could accurately predict the immunotherapy response and immune-related adverse events, thus potentially guiding treatment choice for patients with NSCLC. Moreover, a nomogram model was developed to improve the prognostic stability in NSCLC. Our study may provide new ideas on the role of irAEs in the therapy of NSCLC.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the 920th Hospital of the Joint Logistics Support Force of the People’s Liberation Army. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study utilized data obtained from TCGA and GEO public databases, which contain de-identified patient information, as well as from a retrospective analysis of existing data in our center, with no direct patient intervention. All patient data were anonymized prior to analysis to protect patient privacy and confidentiality, in compliance with relevant data protection regulations.

Author contributions

JZ: Data curation, Methodology, Conceptualization, Formal Analysis, Writing – original draft. QY: Methodology, Software, Writing – original draft. GL: Writing – original draft. LL: Visualization, Writing – original draft. ST: Writing – original draft. SL: Writing – original draft, Visualization. MW: Writing – original draft, Visualization. LS: Visualization, Writing – original draft. JW: Writing – original draft, Visualization. XS: Writing – original draft. HC: Data curation, Methodology, Writing – review & editing, Validation, Supervision, Conceptualization. YL: Validation, Project administration, Supervision, Writing – review & editing, Investigation, Funding acquisition, Visualization, Resources.

Funding

The author(s) declared that financial support was received for this work and/or its publication. We acknowledge financial support from Platform Project(202305AS350028,202449CE340026 and YSZJGZZ2023120), Applied Basic Research Project(202201AV070009, 202301AY070001-158).

Acknowledgments

We acknowledge the 920th Hospital of the Joint Logistics Support Force Ethics Committee for approving the study protocols.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1656375/full#supplementary-material

Abbreviations

NSCLC, non-small cell lung cancer; LASSO, least absolute shrinkage and selection operator; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; GSEA, gene set enrichment analysis; ICIs, immune checkpoint inhibitors; DEGs, differentially expressed genes; IrAEs, immune-related adverse events; NLR, neutrophil-to-lymphocyte ratio; PD-L1, programmed death-ligand 1; TMB, tumor mutational burden; TCGA, The Cancer Genome Atlas; AUC, area under the curve; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; GEO, Gene Expression Omnibus; TCGA-LC, TCGA lung cancer; ES, enrichment score; GSVA, gene set variation analysis; IHC, immunohistochemistry; OS, overall survival; PFS, progression-free survival; RS, risk score; HLA, human leukocyte antigen; CR, complete response; PR, partial response; SD, stable disease; PD, progressive disease; TME, tumor microenvironment; CI, combination index; TIDE, tumor immune dysfunction and exclusion; CTCAE, Common Terminology Criteria for Adverse Events; KM, Kaplan–Meier

References

1. Schabath MB and Cote ML. Cancer progress and priorities: lung cancer. Cancer Epidemiol Biomarkers Prev. (2019) 28:1563–79. doi: 10.1158/1055-9965

PubMed Abstract | Crossref Full Text | Google Scholar

2. Garon EB, Hellmann MD, Rizvi NA, Carcereny E, Leighl NB, Ahn MJ, et al. Five-year overall survival for patients with advanced non-small-cell lung cancer treated with pembrolizumab: results from the phase I KEYNOTE-001 study. J Clin Oncol. (2019) 37:2518–27. doi: 10.1200/JCO.19.00934

PubMed Abstract | Crossref Full Text | Google Scholar

3. de Castro G Jr, Kudaba I, Wu YL, Lopes G, Kowalski DM, Turna HZ, et al. Five-year outcomes with pembrolizumab versus chemotherapy as first-line therapy in patients with non-small-cell lung cancer and programmed death ligand-1 tumor proportion score ≥1%: KEYNOTE-042 study. J Clin Oncol. (2024) 42:227–36. doi: 10.1200/JCO.23.01308

PubMed Abstract | Crossref Full Text | Google Scholar

4. Reck M, Ciuleanu TE, Cobo M, Schenker M, Zurawski B, Menezes J, et al. First-line nivolumab plus ipilimumab combined with two cycles of chemotherapy versus chemotherapy alone in metastatic non-small-cell lung cancer: 3-year update from CheckMate 9LA. N Engl J Med. (2023) 389:2041–55. doi: 10.1056/NEJMoa2301303

Crossref Full Text | Google Scholar

5. Bagchi S, Yuan R, and Engleman EG. Immune checkpoint inhibitors for the treatment of cancer: clinical impact and mechanisms of response and resistance. Annu Rev Pathol. (2021) 16:223–49. doi: 10.1146/annurev-pathol-042020-042741

PubMed Abstract | Crossref Full Text | Google Scholar

6. Tarhini AA, Kang N, Lee SJ, Hodi FS, Cohen GI, Hamid O, et al. Immune adverse events (irAEs) with adjuvant ipilimumab in melanoma, use of immunosuppressants and association with outcome: ECOG-ACRIN E1609 study analysis. J Immunother Cancer. (2021) 9:e002535. doi: 10.1136/jitc-2021-002535

PubMed Abstract | Crossref Full Text | Google Scholar

7. Yan Y, Shen S, Li J, Su L, Wang B, Zhang J, et al. Cross-omics strategies and personalised options for lung cancer immunotherapy. Front Immunol. (2024) :1471409. doi: 10.3389/fimmu.2024.1471409

PubMed Abstract | Crossref Full Text | Google Scholar

8. Princk MH, Pervan M, and Riedl J. Nebenwirkungsmanagement unter Immuntherapie [Management of immune-related adverse events]. Urologie. (2025) 64:281–7. doi: 10.1007/s00120-024-02517-x

PubMed Abstract | Crossref Full Text | Google Scholar

9. Wang DY, Salem JE, Cohen JV, Chandra S, Menzer C, Ye F, et al. Fatal toxic effects associated with immune checkpoint inhibitors: A systematic review and meta-analysis. JAMA Oncol. (2018) 4:1721–8. doi: 10.1001/jamaoncol.2018.3923

PubMed Abstract | Crossref Full Text | Google Scholar

10. Michailidou D, Khaki AR, Morelli MP, Diamantopoulos L, Singh N, and Grivas P. Association of blood biomarkers and autoimmunity with immune related adverse events in patients with cancer treated with immune checkpoint inhibitors. Sci Rep. (2021) 11:9029. doi: 10.1038/s41598-021-88307-3

PubMed Abstract | Crossref Full Text | Google Scholar

11. Han X, Chen Y, Xie H, Zhang Y, Cui Y, Guan Y, et al. Organ-specific immune-related adverse events and prognosis in cancer patients receiving immune checkpoint inhibitors. BMC Cancer. (2025) 25:139. doi: 10.1186/s12885-025-13566-6

PubMed Abstract | Crossref Full Text | Google Scholar

12. Patrinely JR Jr, Johnson R, Lawless AR, Bhave P, Sawyers A, Dimitrova M, et al. Chronic immune-related adverse events following adjuvant anti-PD-1 therapy for high-risk resected melanoma. JAMA Oncol. (2021) 7:744–8. doi: 10.1001/jamaoncol.2021.0051

PubMed Abstract | Crossref Full Text | Google Scholar

13. Schnell A, Bod L, Madi A, and Kuchroo VK. The yin and yang of co-inhibitory receptors: toward anti-tumor immunity without autoimmunity. Cell Res. (2020) 30:285–99. doi: 10.1038/s41422-020-0277-x

PubMed Abstract | Crossref Full Text | Google Scholar

14. Socinski MA, Jotte RM, Cappuzzo F, Nishio M, Mok TSK, Reck M, et al. Association of immune-related adverse events with efficacy of atezolizumab in patients with non-small cell lung cancer: pooled analyses of the phase 3 IMpower130, IMpower132, and IMpower150 randomized clinical trials. JAMA Oncol. (2023) 9:527–35. doi: 10.1001/jamaoncol.2022.7711

PubMed Abstract | Crossref Full Text | Google Scholar

15. Vaddepally R, Doddamani R, Sodavarapu S, Madam NR, Katkar R, Kutadi AP, et al. Review of immune-related adverse events (irAEs) in non-small-cell lung cancer (NSCLC)-their incidence, management, multiorgan irAEs, and rechallenge. Biomedicines. (2022) 10:790. doi: 10.3390/biomedicines10040790

PubMed Abstract | Crossref Full Text | Google Scholar

16. Der SD, Sykes J, Pintilie M, Zhu CQ, Strumpf D, Liu N, et al. Validation of a histology-independent prognostic gene signature for early-stage, non-small-cell lung cancer including stage IA patients. J Thorac Oncol. (2014) 9:59–64. doi: 10.1097/JTO.0000000000000042

PubMed Abstract | Crossref Full Text | Google Scholar

17. Botling J, Edlund K, Lohr M, Hellwig B, Holmberg L, Lambe M, et al. Biomarker discovery in non-small cell lung cancer: integrating gene expression profiling, meta-analysis, and tissue microarray validation. Clin Cancer Res. (2013) 19:194–204. doi: 10.1158/1078-0432.CCR-12-1139

PubMed Abstract | Crossref Full Text | Google Scholar

18. Altorki NK, Bhinder B, Borczuk AC, Elemento O, Mittal V, and McGraw TE. A signature of enhanced proliferation associated with response and survival to anti-PD-L1 therapy in early- stage non-small cell lung cancer. Cell Rep Med. (2024) 5:101438. doi: 10.1016/j.xcrm.2024.101438

PubMed Abstract | Crossref Full Text | Google Scholar

19. Lozano AX, Chaudhuri AA, Nene A, Bacchiocchi A, Earland N, Vesely MD, et al. T cell characteristics associated with toxicity to immune checkpoint blockade in patients with melanoma. Nat Med. (2022) 28:353–62. doi: 10.1038/s41591-021-01623-z

PubMed Abstract | Crossref Full Text | Google Scholar

20. Jing Y, Liu J, Ye Y, Pan L, Deng H, Wang Y, et al. Multi-omics prediction of immune-related adverse events during checkpoint immunotherapy. Nat Commun. (2020) 11:4946. doi: 10.1038/s41467-020-18742-9

PubMed Abstract | Crossref Full Text | Google Scholar

21. Harjunpää H, Blake SJ, Ahern E, Allen S, Liu J, Yan J, et al. Deficiency of host CD96 and PD-1 or TIGIT enhances tumor immunity without significantly compromising immune homeostasis. Oncoimmunology. (2018) 7:e1445949. doi: 10.1080/2162402X.2018.1445949

PubMed Abstract | Crossref Full Text | Google Scholar

22. Sherman BT, Hao M, Qiu J, Baseler MW, Lane HC, Imamichi T, et al. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. (2022) 50:W216–21. doi: 10.1093/nar/gkac194

PubMed Abstract | Crossref Full Text | Google Scholar

23. Lin H, Hargreaves KA, Li R, Reiter JL, Wang Y, Mort M, et al. RegSNPs-intron: a computational framework for predicting pathogenic impact of intronic single nucleotide variants. Genome Biol. (2019) 20:254. doi: 10.1186/s13059-019-1847-4

PubMed Abstract | Crossref Full Text | Google Scholar

24. Eisenhauer EA, Therasse P, Bogaerts J, Schwartz LH, Sargent D, Ford R, et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer. (2009) 45:228–47. doi: 10.1016/j.ejca.2008.10.026

PubMed Abstract | Crossref Full Text | Google Scholar

25. Freites-Martinez A, Santana N, Arias-Santiago S, and Viera A. Using the common terminology criteria for adverse events (CTCAE - version 5.0) to evaluate the severity of adverse events of anticancer therapies. Actas Dermosifiliogr (Engl Ed). (2021) 112:90–2. doi: 10.1016/j.ad.2019.05.009

PubMed Abstract | Crossref Full Text | Google Scholar

26. Reck M, Ciuleanu TE, Cobo M, Schenker M, Zurawski B, Menezes J, et al. First-line nivolumab plus ipilimumab with two cycles of chemotherapy versus chemotherapy alone (four cycles) in metastatic non-small cell lung cancer: CheckMate 9LA 2-year patient-reported outcomes. Eur J Cancer. (2023) 183:174–87. doi: 10.1016/j.ejca.2023.01.015

PubMed Abstract | Crossref Full Text | Google Scholar

27. Brunot A, Grob JJ, Jeudy G, Grange F, Guillot B, Kramkimel N, et al. Association of anti-programmed cell death 1 antibody treatment with risk of recurrence of toxic effects after immune- related adverse events of ipilimumab in patients with metastatic melanoma. JAMA Dermatol. (2020) 156:982–6. doi: 10.1001/jamadermatol.2020.2149

PubMed Abstract | Crossref Full Text | Google Scholar

28. Grangeon M, Tomasini P, Chaleat S, Jeanson A, Souquet-Bressand M, Khobta N, et al. Association between immune-related adverse events and efficacy of immune checkpoint inhibitors in non-small-cell lung cancer. Clin Lung Cancer. (2019) 20:201–7. doi: 10.1016/j.cllc.2018.10.002

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zhou Y, Guo T, Liang F, Wang Z, Zhang J, Ni J, et al. Cumulative incidence and risk factors of brain metastases in metastatic non-small cell lung cancer without baseline brain metastasis: Pooled analysis of individualized patient data from IMpower130, IMpower131, and IMpower150. Cancer. (2024) 130:2601–10. doi: 10.1002/cncr.35242

PubMed Abstract | Crossref Full Text | Google Scholar

30. Vanguri RS, Luo J, Aukerman AT, Egger JV, Fong CJ, Horvat N, et al. Multimodal integration of radiology, pathology and genomics for prediction of response to PD-(L)1 blockade in patients with non-small cell lung cancer. Nat Cancer. (2022) 3:1151–64. doi: 10.1038/s43018-022-00416-8

PubMed Abstract | Crossref Full Text | Google Scholar

31. Ricciuti B, Wang X, Alessi JV, Rizvi H, Mahadevan NR, Li YY, et al. Association of high tumor mutation burden in non-small cell lung cancers with increased immune infiltration and improved clinical outcomes of PD-L1 blockade across PD-L1 expression levels. JAMA Oncol. (2022) 8:1160–8. doi: 10.1001/jamaoncol.2022.1981

PubMed Abstract | Crossref Full Text | Google Scholar

32. Rozeman EA, Hoefsmit EP, Reijers ILM, Saw RPM, Versluis JM, Krijgsman O, et al. Survival and biomarker analyses from the OpACIN-neo and OpACIN neoadjuvant immunotherapy trials in stage III melanoma. Nat Med. (2021) 27:256–63. doi: 10.1038/s41591-020-01211-7

PubMed Abstract | Crossref Full Text | Google Scholar

33. Jiang P, Gu S, Pan D, Fu J, Sahu A, Hu X, et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat Med. (2018) 24:1550–8. doi: 10.1038/s41591-018-0136-1

PubMed Abstract | Crossref Full Text | Google Scholar

34. Hwang S, Kwon AY, Jeong JY, Kim S, Kang H, Park J, et al. Immune gene signatures for predicting durable clinical benefit of anti-PD-1 immunotherapy in patients with non-small cell lung cancer. Sci Rep. (2020) 10:643. doi: 10.1038/s41598-019-57218-9

PubMed Abstract | Crossref Full Text | Google Scholar

35. Song S, Cao C, Choukrallah MA, Tang F, Christofori G, Kohler H, et al. OBF1 and Oct factors control the germinal center transcriptional program. Blood. (2021) 137:2920–34. doi: 10.1182/blood.2020010175

PubMed Abstract | Crossref Full Text | Google Scholar

36. Kruse B, Buzzai AC, Shridhar N, Braun AD, Gellert S, Knauth K, et al. CD4+ T cell-induced inflammatory cell death controls immune-evasive tumours. Nature. (2023) 618:1033–40. doi: 10.1038/s41586-023-06199-x

PubMed Abstract | Crossref Full Text | Google Scholar

37. Lin J, Xia L, Oyang L, Liang J, Tan S, Wu N, et al. The POU2F1-ALDOA axis promotes the proliferation and chemoresistance of colon cancer cells by enhancing glycolysis and the pentose phosphate pathway activity. Oncogene. (2022) 41:1024–39. doi: 10.1038/s41388-021-02148-y

PubMed Abstract | Crossref Full Text | Google Scholar

38. Dessarthe B, Thedrez A, Latouche JB, Cabillic F, Drouet A, Daniel P, et al. CRTAM receptor engagement by Necl-2 on tumor cells triggers cell death of activated Vγ9Vδ2 T cells. J Immunol. (2013) 190:4868–76. doi: 10.4049/jimmunol.1202596

PubMed Abstract | Crossref Full Text | Google Scholar

39. García-Álvarez MÁ, González-Fernández C, Esteban MÁ, and Cuesta A. Molecular characterization of the cytotoxic and regulatory T cell coreceptor (CRTAM), and its ligand CADM1, in the European seabass and gilthead seabream. Fish Shellfish Immunol. (2023) 134:108569. doi: 10.1016/j.fsi.2023.108569

PubMed Abstract | Crossref Full Text | Google Scholar

40. Boles KS, Barchet W, Diacovo T, Cella M, and Colonna M. The tumor suppressor TSLC1/NECL-2 triggers NK-cell and CD8+ T-cell responses through the cell-surface receptor CRTAM. Blood. (2005) 106:779–86. doi: 10.1182/blood-2005-02-0817

PubMed Abstract | Crossref Full Text | Google Scholar

41. Zheng S, Yang B, Li L, Chen M, Zhang L, Chi W, et al. CRTAM promotes antitumor immune response in triple negative breast cancer by enhancing CD8+ T cell infiltration. Int Immunopharmacol. (2024) 129:111625. doi: 10.1016/j.intimp.2024.111625

PubMed Abstract | Crossref Full Text | Google Scholar

42. Li M, Shen YJ, Chai S, Bai YL, and Li ZH. miR-133a-3p inhibits the osteogenic differentiation of bone marrow mesenchymal stem cells by regulating ankyrin repeat domain 44. Gen Physiol Biophys. (2021) 40:329–39. doi: 10.4149/gpb_2020038

PubMed Abstract | Crossref Full Text | Google Scholar

43. Koo HK, Morrow J, Kachroo P, Tantisira K, Weiss ST, Hersh CP, et al. Sex-specific associations with DNA methylation in lung tissue demonstrate smoking interactions. Epigenetics. (2021) 16:692–703. doi: 10.1080/15592294.2020.1819662

PubMed Abstract | Crossref Full Text | Google Scholar

44. La Ferla M, Lessi F, Aretini P, Pellegrini D, Franceschi S, Tantillo E, et al. ANKRD44 gene silencing: A putative role in trastuzumab resistance in her2-like breast cancer. Front Oncol. (2019) 9:547. doi: 10.3389/fonc.2019.00547

PubMed Abstract | Crossref Full Text | Google Scholar

45. Xu X, Wang X, Li Y, Chen R, Wen H, Wang Y, et al. Research progress of ankyrin repeat domain 1 protein: an updated review. Cell Mol Biol Lett. (2024) 29:131. doi: 10.1186/s11658-024-00647-w

PubMed Abstract | Crossref Full Text | Google Scholar

46. Boiu S, Paschalidis N, Sentis G, Manolakou T, Nezos A, Gialitakis M, et al. Increased interferon I signaling, DNA damage response and evidence of T-cell exhaustion in a patient with combined interferonopathy (Aicardi-Goutières Syndrome, AGS) and cohesinopathy (Cornelia de Lange Syndrome, CdLS). Pediatr Rheumatol Online J. (2025) 23:11. doi: 10.1186/s12969-024-01050-7

PubMed Abstract | Crossref Full Text | Google Scholar

47. Boulter E, Estrach S, Tissot FS, Hennrich ML, Tosello L, Cailleteau L, et al. ell metabolism regulates integrin mechanosensing via an SLC3A2-dependent sphingolipid biosynthesis pathway. Nat Commun. (2018) 9:4862. doi: 10.1038/s41467-018-07268-w

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: immune-related adverse events, immunotherapy, non-small cell lung cancer, prognosis, risk score

Citation: Zhu J, Ye Q, Li G, Liu L, Tian S, Li S, Wen M, Shen L, Wang J, Song X, Chen H and Li Y (2026) Identification of a prognostic signature based on immunogenic adverse event-related genes to guide therapy for non-small cell lung cancer. Front. Immunol. 16:1656375. doi: 10.3389/fimmu.2025.1656375

Received: 30 June 2025; Accepted: 17 December 2025; Revised: 11 December 2025;
Published: 12 January 2026.

Edited by:

Mariacarmela Santarpia, University of Messina, Italy

Reviewed by:

Guoyan Qi, Hebei Medical University, China
Tingdong Yu, The Third Affiliated Hospital of Kunming Medical University, China

Copyright © 2026 Zhu, Ye, Li, Liu, Tian, Li, Wen, Shen, Wang, Song, Chen and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xinmiao Song, bXVxaW5nZXJAc2luYS5jb20=; Hong Chen, aG9uZ2NoZW4yMDAwQHRvbS5jb20=; Yi Li, bGl5aTZAa21tdS5lZHUuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.