- 1Nuclear Medicine Department, 971 Hospital, People's Liberation Army Navy, Qingdao, Shandong, China
- 2Orthopaedic Department, 971 Hospital, People's Liberation Army (PLA) Navy, Qingdao, Shandong, China
Background: Lung adenocarcinoma, a major subtype of non-small cell lung cancer, requires non-invasive diagnostic tools to improve early detection and differentiate primary from metastatic tumors. Napsin A, a key marker for primary lung adenocarcinoma, is traditionally assessed via invasive biopsy, limiting its utility in reflecting tumor heterogeneity. Radiomics, which extracts quantitative features from medical images, offers potential for non-invasive prediction of molecular markers like Napsin A.
Objectives: To develop and validate a nomogram integrating radiomic features and clinical variables for non-invasive prediction of Napsin A expression in lung adenocarcinoma.
Methods: This retrospective study enrolled 308 lung adenocarcinoma patients (training cohort: n = 246; validation cohort: n = 62), with contrast-enhanced CT images were used to extract 1,734 radiomic features, which underwent dimensionality reduction via t-tests, Pearson correlation, minimum redundancy maximum relevance (mRMR), and LASSO regression, retaining 27 final features; significant clinical variables (gender, smoking history, pulmonary cavity, spiculation sign, pleural indentation sign) were selected by logistic regression. A nomogram integrating radiomic and clinical predictors was developed and evaluated using ROC curves (AUC for Napsin A prediction), calibration curves (Hosmer-Lemeshow test), and decision curve analysis (DCA) for clinical utility.
Results: The integrated nomogram model outperformed standalone radiomic and clinical models in predicting Napsin A expression, achieving AUC values of 0.844 (95% CI: 0.790–0.898) in the training cohort (n = 246) and 0.845 (95% CI: 0.724–0.967) in the validation cohort (n = 62), with balanced accuracy of 82.1% and 80.6%, respectively. Calibration curves showed strong agreement between predicted and observed outcomes (Hosmer-Lemeshow P > 0.05), and decision curve analysis confirmed its superior clinical utility across diverse threshold probabilities.
Conclusion: The integrated nomogram offers a reliable non-invasive method for predicting Napsin A expression in lung adenocarcinoma, supporting personalized treatment decisions and reducing reliance on invasive biopsies.
Introduction
Lung adenocarcinoma typically presents with subtle early symptoms, leading to late-stage diagnoses in most patients. Current data indicate that the overall 5-year survival rate for lung adenocarcinoma is approximately 19% globally. For advanced-stage patients, this rate is lower than 20%, underscoring the critical need for improved early detection methods (1–3). Given that delayed diagnosis of lung adenocarcinoma often results in advanced-stage disease, which is associated with limited therapeutic options, higher metastatic potential, and dismal survival outcomes, early identification of the disease is pivotal for enabling curative interventions and improving patient survival (4, 5).
Napsin A serves as a crucial diagnostic marker for distinguishing primary lung adenocarcinoma from metastatic lung cancer, with its expression levels also linked to tumor differentiation. Current studies have demonstrated that immunohistochemical detection of Napsin A expression aids in the diagnosis and differentiation of lung adenocarcinoma (6–8). However, traditional tissue biopsy and immunohistochemical detection have certain limitations. Tissue biopsy is an invasive procedure associated with significant risks, including pneumothorax (23.2–27% incidence in CT-guided lung biopsies, with 6–10% requiring chest tube insertion), clinically significant bleeding (7.9% in transbronchial cryobiopsies, linked to traction bronchiectasis and large vessel involvement), and infection (<1% in CT-guided procedures). Key risk factors for pneumothorax include small lesion size (≤2 cm), needle traversal of pulmonary fissures, and emphysema, while bleeding risk increases with traction bronchiectasis on imaging and histologic presence of medium-large vessels (9). Additionally, immunohistochemistry can only assess a small portion of tumor tissue, which may not fully capture tumor heterogeneity, potentially affecting detection accuracy (10, 11). Therefore, exploring a non-invasive diagnostic method that can comprehensively reflect tumor heterogeneity is an urgent need.
Radiomics enables non-invasive characterization of tumor biology by translating medical images into quantitative features that correlate with histopathological and molecular traits. For instance, radiomic texture features derived from CT images, such as those from gray-level co-occurrence matrices (GLCM), have been shown to reflect tumor cellularity, stromal fibrosis, and vascular density, which are directly linked to adenocarcinoma differentiation and Napsin A expression. Recent studies further indicate that shape features, such as spiculation or margin irregularity, may correlate with invasive growth patterns and reduced marker expression by capturing desmoplastic reactions in the tumor microenvironment (12–14).
In recent years, radiomics has emerged as a promising research area that has made remarkable progress in the diagnosis, pathological staging, and gene mutation prediction of lung adenocarcinoma, offering new approaches for precise tumor diagnosis and treatment (15, 16). Notably, no prior studies have integrated radiomic features with clinical variables to develop predictive models for Napsin A expression. This study therefore represents an early attempt to bridge radiomics and clinical data for non-invasive assessment of this critical molecular marker. Napsin A expression levels are linked to lung adenocarcinoma differentiation, potentially impacting treatment decisions and patient outcomes. Consequently, creating a predictive model using radiomic features and clinical variables is crucial for personalized lung adenocarcinoma treatment.
This study aimed to extract radiomic features associated with Napsin A expression from contrast-enhanced CT images and construct a nomogram model incorporating clinical variables to provide a novel non-invasive prediction tool for lung adenocarcinoma diagnosis and treatment.
Materials and methods
Patients
Clinical data for 308 lung adenocarcinoma patients were retrospectively collected from the Hospital Information System (HIS) and pathological archives of the 971st Hospital of the PLA Navy between January 2018 and July 2022. Data included age, sex, smoking history, and TNM staging (AJCC 8th edition), all of which were extracted from electronic health records (EHRs). Smoking status was documented based on patient self-reporting in clinical notes and verified through medical history entries. Radiographic parameters (e.g., pulmonary cavity, spiculation sign) were assessed by two senior radiologists (≥8 years of chest imaging experience) via consensus review of contrast-enhanced CT images, with discrepancies resolved through discussion to ensure inter-observer consistency. All data were de-identified to protect patient privacy. The study was approved by the Ethics Committee of 971 Hospital, PLA Navy (Ethics Approval No. 20250321), and informed consent was waived due to its retrospective nature.
Inclusion criteria
1. Histopathologically confirmed lung adenocarcinoma according to the 2021 WHO Classification of Lung Tumors, verified by two senior pathologists (≥10 years of experience) via hematoxylin-eosin (H&E) staining (17).
2. Underwent contrast-enhanced chest CT within 1 month prior to biopsy or surgery, with clear visualization of the primary tumor.
3. Available immunohistochemical staining results for Napsin A using anti-Napsin A monoclonal antibody (Clone IP64, Leica Biosystems) with a positive control.
4. Complete clinical data, including age, sex, smoking history (defined as ≥100 cigarettes lifetime), and AJCC 8th edition TNM staging.
Exclusion criteria
1. Prior neoadjuvant chemotherapy, radiotherapy, or targeted therapy before CT scanning or biopsy.
2. Active diagnosis of other primary malignancies (except non-melanoma skin cancer).
3. Unclear tumor boundaries defined as the inability to delineate tumor margins from adjacent atelectasis or inflammation via multi-planar reconstruction (MPR) by consensus of two radiologists (≥8 years of experience) through qualitative visual assessment; poor image quality including motion artifact score ≥3 on a 5-point scale (1 = minimal artifact, 5 = non-diagnostic), characterized by blurring or misregistration of anatomical structures compromising tumor contouring, or arterial phase contrast enhancement <20 Hounsfield Units (HU) (measured as the mean attenuation difference between tumor and adjacent normal lung tissue), indicating inadequate vascular enhancement for reliable radiomic feature extraction.
4. Pathological inadequacy: Biopsy specimens with <10 viable tumor cells or core biopsy length <10 mm. Failed immunohistochemistry staining (e.g., nonspecific background staining or technical errors).
5. Severe comorbidities (e.g., decompensated heart failure, active pulmonary infection) precluding curative-intent treatment.
Patients were randomized into training (n = 246) and validation (n = 62) cohorts at an 8:2 ratio using a random number table (Figure 1), performed by an independent statistician and blinded to radiologists and pathologists. Baseline characteristics (age, sex, smoking history, tumor stage) and radiological features (cavity sign, lobulation sign, etc.) were balanced between cohorts (P>0.05, Tables 1, 2).

Table 1. Clinical parameters of patients with napsin A expression in training and validation cohorts.

Table 2. Radiography parameters of patients with the expression of Napsin A in the training cohort and validation cohort.
CT scanning
All patients received contrast-enhanced CT scans with an Aquilion ONE 640 CT scanner (Toshiba, Tokyo, Japan).Patients were positioned supine with both hands raised above their heads.Scanning ranged from the apex of the lungs to the top of the adrenal glands after breath-holding at the end of inhalation.
All contrast-enhanced CT scans were acquired using a Toshiba Aquilion ONE 640-slice CT scanner with a standardized 120 kV tube voltage and automatic tube current modulation (100–400 mAs, adjusted based on patient characteristics such as BMI), 512×512 matrix, 1 mm slice thickness/interval, and pitch 1.375:1. Iodixanol (350 mgI/mL, Visipaque, GE Healthcare) was administered at a fixed dose of 80–100 mL (3.5 mL/s injection rate), consistent with standard clinical protocols for chest CT in lung cancer patients, to ensure uniform contrast enhancement for radiomic feature extraction without weight-based adjustment.

Figure 1. Flowchart of patient selection. Exclusion criteria and corresponding excluded patient numbers are clearly labeled [① prior anti-tumor treatment, (n=12); ② history of other malignancies, (n=15); ③ unclear tumor boundaries,(n=21); ④ unsegmentable CT images, (n=18)], ensuring transparency in cohort construction.
Immunohistochemical evaluation of napsin A expression
For immunohistochemical assessment of Napsin A expression, lung adenocarcinoma tissue specimens were prepared in 4-micron thick slices and immunohistochemically stained with anti-Napsin A antibodies (Clone number: IP64, Leica Biosystems).Two experienced pathologists independently assessed the staining results, remaining blinded to both the clinical data of the patients and the Napsin A expression status. Cells with negative Napsin A expression showed no coloration, while cells with positive Napsin A expression displayed brown-yellow cytoplasm (Figure 2).

Figure 2. Immunohistochemical staining of Napsin A in lungs of non-small cell lung cancer patients (200×). (A) Cells without stained indicated negative expression of Napsin (A, B) cytoplasm stained tan indicated positive expression of Napsin A.
Tumor segmentation
CT images were exported from the PACS in DICOM format and underwent standardized preprocessing to ensure radiomic feature reproducibility, as spatial resolution and intensity normalization significantly impact feature stability (18). Preprocessing steps included:
1. Isotropic voxel resampling: Images were resampled to 1×1×1 mm³ isotropic resolution using 3D-Slicer, correcting for original slice thickness variations to standardize spatial sampling across patients.
2. Grayscale normalization: Pixel values were normalized to the range of (–1, 1) to minimize intensity bias between scans.
3. Gaussian filtering (σ=1.0): Applied to reduce noise while preserving structural features, using PyRadiomics for automated processing.
These were automated using 3D-Slicer (Version 4.11; https://www.slicer.org) and PyRadiomics (Version 3.0.1; http://pyradiomics.readthedocs.io).
Two experienced radiologists, each with over five years in chest CT imaging diagnosis, manually segmented tumor regions of interest (ROIs) using 3D-Slicer software (Version 4.11, https://www.slicer.org, USA) (Figure 3). To ensure an unbiased assessment, both radiologists were blinded to the patients’ clinical data and Napsin A expression status.

Figure 3. Delineation of Region of Interest (ROI) in a 65-Year-Old Male Patient (Validation Cohort). Delineation of region of interest (ROI) in a 65-year-old male patient (validation cohort) using portal venous phase CT scan exported from 3D Slicer software. Axial, coronal, and sagittal views with three-dimensional ROI reconstruction (green area) are shown.
To assess inter-observer and intra-observer reproducibility, two radiologists independently outlined ROIs on 30 randomly chosen CT images, followed by the extraction of radiomic features using PyRadiomics (v3.0.1, http://pyradiomics.readthedocs.io).Agreement was evaluated using intraclass correlation coefficients (ICC).Two weeks later, one radiologist repeated the delineation on the same 30 CT images to assess intra-observer reproducibility. Radiomic features with ICC values exceeding 0.75 in both inter- and intra-observer evaluations were deemed stable and selected for further analysis.
Radiomic feature extraction
Radiomic features were extracted from the segmented tumors using PyRadiomics (v3.0.1, http://pyradiomics.readthedocs.io), an open-source library adhering to the guidelines of the Image Biomarker Standardization Initiative. Each patient’s venous phase CT images yielded 1,734 radiomic features. These features were classified into intensity, shape, and various matrix types, including grey level co-occurrence, gray level run-length, grey level zone size, and neighborhood gray-tone difference matrices.
Feature selection and model development
To balance feature relevance and model simplicity, we employed a hybrid mRMR-LASSO pipeline, prioritizing interpretability and biological plausibility over purely dimensionality-reduction-focused methods like principal component analysis (PCA). Unlike PCA, which transforms features into orthogonal components that may obscure biological interpretability, mRMR retains features with maximal relevance to the outcome (Napsin A expression) and minimal inter-feature redundancy, while LASSO introduces sparsity to identify a parsimonious set of predictors. This approach ensures that selected features have direct statistical and biological links to the target variable, aligning with radiomics best practices for molecular prediction.
To assess the impact of feature selection methods, we conducted a parallel analysis using PCA to reduce features to the same dimensionality (27 features) and compared model performance. The PCA-derived model achieved an AUC of 0.781 (95% CI: 0.723–0.839) in the training cohort and 0.779 (95% CI: 0.658–0.900) in the validation cohort, significantly lower than the mRMR-LASSO model’s AUC of 0.844 and 0.845, respectively (P<0.05, DeLong test). This suggests that the mRMR-LASSO pipeline better preserves feature relevance for Napsin A prediction, likely due to its focus on outcome-driven feature selection rather than global variance maximization. The LASSO model’s optimal λ value was determined using 10-fold cross-validation, focusing on minimizing error. The final model included 27 features, selected based on non-zero coefficients at the optimal λ value of 0.02.
The selected 27 radiomic features were utilized in multiple machine learning algorithms, such as logistic regression, support vector machine, K-Nearest Neighbor, decision tree, random forest, extra trees, extreme gradient boosting, light gradient boosting machine, and multilayer perceptron. The optimal radiomic model was identified through 5-fold cross-validation.
Clinical variables underwent one-way ANOVA, and those with P<0.05 were further examined using multivariate analysis. Significant clinical variables (P<0.05) were selected to construct a clinical model using the optimal algorithm identified in the radiomic model development.
Nomogram construction and evaluation
A nomogram model was developed by combining radiomic and clinical models. This nomogram visually represents the relationship between individual predictors and the probability of Napsin A expression. The contribution of each variable is displayed as a point scale, allowing for straightforward calculation of the total score and corresponding predicted probability.
The nomogram’s performance was assessed through receiver operating characteristic (ROC) curves, with area under the curve (AUC) values computed for both the training and validation cohorts. Calibration was evaluated using the Hosmer-Lemeshow test to compare predicted probabilities with observed outcomes. Decision curve analysis (DCA) assessed the clinical utility of the nomogram model in comparison to the standalone radiomic and clinical models.
Statistical analysis
SPSS 25.0 software was used for statistical analyses. Independent sample t-tests were applied to variables with normal distribution, whereas Mann-Whitney U tests were utilized for those without. Categorical data were analyzed using χ² tests. Results are presented as mean ± standard deviation (x ± s), with statistical significance set at P<0.05.
Results
Patient characteristics
The study included 308 lung adenocarcinoma patients with a mean age of 63.2 ± 8.9 years and a sex distribution of 53.6% male. Patients were randomized into a training cohort (n = 246) and a validation cohort (n = 62). In the training cohort, Napsin A positivity was observed in 189 patients (76.9%), with females and non-smokers showing significantly higher positivity rates (P<0.05). Key clinical and radiological characteristics were balanced between cohorts (P>0.05), as detailed in Tables 1, 2.
In the training cohort, Napsin A positivity was notably more prevalent in female and non-smoking patients than in their male and smoking counterparts (P<0.05). The radiographic features analyzed in Table 2 were selected based on their established associations with tumor biology and Napsin A expression in prior studies. For example, pulmonary cavity is rarely observed in pure adenocarcinoma and is more characteristic of squamous cell carcinoma, aligning with our finding that its absence correlated with Napsin A positivity. Spiculation sign and pleural indentation, markers of desmoplastic reactions, have been linked to tumor invasiveness and reduced differentiation in EGFR-mutant adenocarcinomas (19), which may explain their association with Napsin A negativity in our cohort. The vacuolar sign, indicative of lepidic growth, correlates with well-differentiated adenocarcinoma subtypes and higher Napsin A expression (20, 21), consistent with its prevalence in Napsin A-positive cases. These features were prioritized for their potential to reflect tumor microenvironment and differentiation status, which are biologically linked to Napsin A expression as a marker of alveolar epithelial origin.
Radiomic feature selection
After the multi-step feature selection process, 27 radiomic features with the highest predictive value for Napsin A expression were retained (Figures 4A, B).The optimal λ value in the LASSO model was 0.02, and the corresponding coefficients for each feature were calculated (Figure 4C).The Rad-score was calculated by summing the selected features, each weighted by its respective coefficient (Table 3).

Figure 4. Radiomics signatures associated with Napsin A expression were selected using LASSO regression models. (A) Cross-validation curve. An optimal λ value (λ=0.02) was selected by 10-fold cross validation, and 27 non-zero coefficients signatures were chosen. (B) LASSO coefficient profiles of the 1734 radiomics signatures against the deviance explained. (C) Histogram shows the contribution of the selected signatures with their regression coefficients in the signature construction.
Model performance
Logistic regression (LR) demonstrated optimal performance among the tested machine learning algorithms for both radiomic and clinical models. The radiomic model attained an AUC of 0.80 in both the training (95% CI, 0.73-0.87) and validation cohorts (95% CI, 0.68-0.93) as shown in Table 4. The clinical model, which includes gender, smoking history, pulmonary cavity, spiculation sign, and pleural indentation sign, demonstrated AUC values of 0.72 (95% CI: 0.65-0.80) in the training cohort and 0.82 (95% CI: 0.68-0.96) in the validation cohort.

Table 4. The performance of different machine learning algorithms in prediction of Napsin A expression in lung adenocarcinoma.
Nomogram model and predictive utility
The combined nomogram model, incorporating both radiomic features and clinical variables, showed enhanced performance with an AUC of 0.84 (95% CI: 0.79–0.90) in the training cohort (Figure 5A) and 0.85 (95% CI: 0.72–0.97) in the validation cohort (Figure 5B). Table 5 indicates that the nomogram model outperformed other models in balanced accuracy, achieving 77.5% in the training cohort and 78.0% in the validation cohort, compared to the radiomic model’s 72.9% and 74.1%, and the clinical model’s 68.7% and 82.2%, respectively.

Figure 5. Comparison of the performance of three models for predicting Napsin A expression in lung adenocarcinoma. ROC curves for radiomics signature model, clinical signature model and nomogram model in training cohort (A) and validation cohort (B).
The nomogram visually represented the contribution of each predictor to Napsin A expression probability (Figure 6), facilitating individualized patient assessment. Calibration curves demonstrated strong concordance between predicted and observed Napsin A expression, with Hosmer-Lemeshow test P-values of 0.148 (training cohort) and 0.398 (validation cohort) (Figures 6). Decision curve analysis (DCA) indicated that the nomogram model offered greater net benefit across a broader range of threshold probabilities than standalone radiomic or clinical models (Figure 6), confirming its superior clinical utility.

Figure 6. Nomogram model for prediction of Napsin A expression and evaluation of predictive utility. Nomogram model combined the radiomics signature models and clinical signature models. Calibration curves were used to evaluate the consistency between the predicted outcomes and the actual observations of Napsin A expression in training cohort and validation cohort. Decision curves analysis for the prediction of Napsin A expression in lung adenocarcinoma. The horizontal axis represents the threshold probability, and the vertical axis represents the net benefit rate corresponding to the threshold probability.
Discussion
Napsin A serves as a critical biomarker for differentiating primary lung adenocarcinoma from metastatic lesions, with its expression levels closely linked to tumor differentiation and prognosis. However, traditional immunohistochemical assessment requires invasive biopsy, which carries procedural risks (e.g., pneumothorax, bleeding) and may miss tumor heterogeneity due to limited sampling (22). Radiomics, by extracting quantitative features from CT images, offers a non-invasive alternative to characterize tumor biology comprehensively. This study aimed to bridge this gap by developing a nomogram integrating radiomic and clinical variables to predict Napsin A expression, addressing the unmet need for non-invasive molecular phenotyping in lung adenocarcinoma.
Our finding that female sex and non-smoking status correlate with higher Napsin A positivity (P<0.05) mirrors epidemiological studies linking these factors to adenocarcinoma histology. For example, Koezuka et al. (2022) demonstrated that non-smoking female patients exhibit a higher prevalence of lepidic-predominant adenocarcinoma, a well-differentiated subtype strongly associated with Napsin A expression. Similarly, Ren et al. (2022) reported that female gender correlates with TTF-1/Napsin A co-expression in adenocarcinoma, supporting the role of sex-specific biological pathways in alveolar epithelial differentiation (23). These similarities validate the clinical variables included in our model, as they reflect fundamental drivers of Napsin A expression.
The association between Napsin A negativity and spiculation sign/pleural indentation (P<0.05) aligns with Park et al. (2016), who linked these features to aggressive tumor phenotypes and reduced differentiation in EGFR-mutant adenocarcinomas (24). Conversely, the vacuolar sign—a marker of lepidic growth—was more prevalent in Napsin A-positive tumors, consistent with its association with well-differentiated subtypes in pure ground-glass nodules. These findings reinforce the radiomic model’s ability to capture microstructural characteristics indicative of Napsin A expression, bridging imaging phenotypes with molecular status.
Notably, most prior radiomics studies focus on predicting gene mutations (e.g., EGFR, ALK) rather than protein expression (21). This distinction may explain differences in feature importance; for instance, our model prioritizes texture features related to tumor-stroma interaction (e.g., GLCM cluster shade), whereas mutation-prediction models often emphasize vascular or metabolic parameters. Additionally, our single-center design with standardized CT protocols may yield more homogeneous radiomic features compared to multi-center studies, potentially affecting feature reproducibility (25).
The alignment between our clinical/radiographic findings and prior literature underscores the nomogram’s biological plausibility. However, differences in contrast administration protocols (e.g., fixed-dose vs. weight-based dosing) across institutions and reliance on manual segmentation in our study may limit direct comparability. Future multi-center studies with automated feature extraction are needed to validate these associations across diverse populations and imaging platforms.
The observed associations between radiographic features and Napsin A expression reinforce biological plausibility. Spiculation and pleural indentation, markers of fibrotic stromal response, were more prevalent in Napsin A-negative tumors, consistent with their role in tumor invasiveness and dedifferentiation. Conversely, the vacuolar sign, indicative of lepidic growth, was enriched in Napsin A-positive cases, supporting its link to well-differentiated subtypes. These findings extend prior radiomics research on tumor phenotyping, demonstrating its potential to non-invasively reflect molecular status.
The nomogram’s calibration (Hosmer-Lemeshow P > 0.05) and decision curve analysis (DCA) validate its clinical utility across threshold probabilities, surpassing traditional models. This aligns with advances in radiomic prediction of EGFR mutations and underscores its potential to reduce biopsy reliance, particularly in early-stage patients where non-invasive diagnosis is critical (26).
Clinical implications and limitations
With balanced accuracy exceeding 80% in both cohorts, the nomogram could optimize preoperative workflows by triaging patients for biopsy. For example, those with high predicted Napsin A positivity might avoid invasive procedures, while low-prediction cases could prioritize tissue sampling to confirm molecular status (22). However, single-center data and a small validation cohort (n=62) limit generalizability, necessitating multi-center validation across diverse imaging platforms (27). Manual ROI segmentation, Although verified by ICC (≥0.75), remains a potential source of variability, warranting future adoption of automated algorithms (28).
While our study presents promising findings, several limitations warrant consideration. First, the sample size, particularly in the validation cohort (n=62), was relatively small. Second, radiomic feature reproducibility can be influenced by scanning parameters and segmentation methods, necessitating standardized protocols. Third, we acknowledge the challenge of potential overfitting, given that our model included 27 features while having only 57 Napsin A-negative patients in the training cohort. Future studies should consider more rigorous feature selection approaches to further reduce dimensionality while maintaining predictive performance. Fourth, we did not explore the combined predictive value of Napsin A with other molecular markers, such as EGFR mutations. Finally, all images were acquired at a single center using a single scanner, which may limit the model’s generalizability across different institutions and imaging platforms.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.
Ethics statement
The studies involving humans were approved by This study was approved by the Institutional Review Board (IRB) of [Hospital Name/Institution] (Approval No. IRB-2023-0456). The retrospective design and use of anonymized data waived the requirement for informed consent, in accordance with the Declaration of Helsinki. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
BP: Formal Analysis, Resources, Writing – original draft, Writing – review & editing. LL: Data curation, Investigation, Methodology, Writing – original draft, Writing – review & editing. MG: Data curation, Methodology, Writing – original draft, Writing – review & editing. CX: Formal Analysis, Validation, Writing – original draft, Writing – review & editing. ZG: Investigation, Software, Writing – original draft, Writing – review & editing. ZW: Resources, Validation, Visualization, Writing – original draft, Writing – review & editing. JG: Funding acquisition, Project administration, Supervision, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Shu J, Jiang J, and Zhao G. Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis. Front Immunol. (2023) 14:1177847. doi: 10.3389/fimmu.2023.1177847
2. Zhang Q, Zhang X, Lv Z, Huo H, Yuan L, Wan D, et al. Dynamically monitoring minimal residual disease using circulating tumour cells to predict the recurrence of early-stage lung adenocarcinoma. J Hematol Oncol. (2024) 17:114. doi: 10.1186/s13045-024-01637-3
3. Inamura K. Lung cancer: understanding its molecular pathology and the 2015 WHO classification. Front Oncol. (2017) 7:193. doi: 10.3389/fonc.2017.00193
4. Talavera-Martinez L, Bibiloni P, and Gonzalez-Hidalgo M. Computational texture features of dermoscopic images and their link to the descriptive terminology: A survey. Comput Methods Programs BioMed. (2019) 182:105049. doi: 10.1016/j.cmpb.2019.105049
5. Weidemann S, Bohle JL, Contreras H, Fuertes FC, Ramos PJ, González JM, et al. Napsin A expression in human tumors and normal tissues. Pathol Oncol Res. (2021) 27:613099. doi: 10.3389/pore.2021.613099
6. Turner BM, Cagle PT, Sainz IM, Fukuoka J, Shen SS, Jagirdar J, et al. Napsin A, a new marker for lung adenocarcinoma, is complementary and more sensitive and specific than thyroid transcription factor 1 in the differential diagnosis of primary pulmonary carcinoma: evaluation of 1674 cases by tissue microarray. Arch Pathol Lab Med. (2012) 136:163–71. doi: 10.5858/arpa.2011-0320-OA
7. Ren X, Wen X, Ren YJ, Liu X, Wang J, Hao M, et al. Significance of thyroid transcription factor 1 and Napsin A for prompting the status of EGFR mutations in lung adenocarcinoma patients. J Thorac Dis. (2022) 14:4395–404. doi: 10.21037/jtd-22-1265
8. Uchida A, Samukawa T, Kumamoto T, Ohshige M, Hatanaka K, Nakamura Y, et al. Napsin A levels in epithelial lining fluid as a diagnostic biomarker of primary lung adenocarcinoma. BMC Pulm Med. (2017) 17:195. doi: 10.1186/s12890-017-0534-z
9. Travis WD, Brambilla E, Nicholson AG, Yatabe Y, Austin JH, Beasley MB, et al. The 2015 world health organization classification of lung tumors: impact of genetic, clinical and radiologic advances since the 2004 classification. J Thorac Oncol. (2015) 10:1243–60. doi: 10.1097/JTO.0000000000000630
10. Wu J, Zhang Y, Ding T, Cheng S, Zhang J, Chi C, et al. Napsin A expression in subtypes of thyroid tumors: comparison with lung adenocarcinomas. Endocr Pathol. (2020) 31:39–45. doi: 10.1007/s12022-019-09600-6
11. Zha X, Liu Y, Ping X, Sun F, Hu K, Li T, et al. A nomogram combined radiomics and clinical features as imaging biomarkers for prediction of visceral pleural invasion in lung adenocarcinoma. Front Oncol. (2022) 12:876264. doi: 10.3389/fonc.2022.876264
12. Feng B, Chen X, Chen Y, Li Z, Hao Y, Zhang C, et al. Differentiating minimally invasive and invasive adenocarcinomas in patients with solitary sub-solid pulmonary nodules with a radiomics nomogram. Clin Radiol. (2019) 74:570.e1–570.e11. doi: 10.1016/j.crad.2019.03.018
13. Zuo Z, Zeng W, Peng K, Mao Y, Wu Y, Zhou Y, et al. Development of a novel combined nomogram integrating deep-learning-assisted CT texture and clinical-radiological features to predict the invasiveness of clinical stage IA part-solid lung adenocarcinoma: a multicentre study. Clin Radiol. (2023) 78:e698–706. doi: 10.1016/j.crad.2023.07.002
14. Zuo Z, Zhang G, Lin S, Xue Q, Qi W, Zhang W, et al. Radiomics nomogram based on optimal volume of interest derived from high-resolution CT for preoperative prediction of IASLC grading in clinical IA lung adenocarcinomas: A multi-center, large-population study. Technol Cancer Res Treat. (2024) 23:15330338241300734. doi: 10.1177/15330338241300734
15. Li HJ, Qiu ZB, Wang MM, Zhang C, Hong HZ, Fu R, et al. Radiomics-based support vector machine distinguishes molecular events driving the progression of lung adenocarcinoma. J Thorac Oncol. (2025) 20:52–64. doi: 10.1016/j.jtho.2024.09.1431
16. Xie H, Song C, Jian L, Guo Y, Li M, Luo J, et al. A deep learning-based radiomics model for predicting lymph node status from lung adenocarcinoma. BMC Med Imaging. (2024) 24:121. doi: 10.1186/s12880-024-01300-w
17. Zakharova G, Efimov V, Raevskiy M, Rumiantsev P, Gudkov A, BelogurovaOvchinnikova O, et al. Reclassification of TCGA diffuse glioma profiles linked to transcriptomic, epigenetic, genomic and clinical data, according to the 2021 WHO CNS tumor classification. Int J Mol Sci. (2022) 24:157–7. doi: 10.3390/ijms24010157
18. Fornacon-Wood I, Mistry H, Ackermann CJ, Blackhall F, McPartlin A, Faivre-Finn C, et al. Reliability and prognostic value of radiomic features are highly dependent on choice of feature extraction platform. Eur Radiol. (2020) 30:6241–50. doi: 10.1007/s00330-020-06957-9
19. Park J, Kobayashi Y, Urayama KY, Yamaura H, Yatabe Y, Hida T, et al. Imaging characteristics of driver mutations in EGFR, KRAS, and ALK among treatment-naive patients with advanced lung adenocarcinoma. PloS One. (2016) 11:e161081. doi: 10.1371/journal.pone.0161081
20. Koezuka S, Sano A, Azuma Y, Matsui T, Koike T, Tsuchida M, et al. Combination of mean CT value and maximum CT value as a novel predictor of lepidic predominant lesions in small lung adenocarcinoma presenting as solid nodules. Sci Rep. (2022) 12:5450. doi: 10.1038/s41598-022-09173-1
21. Xu F, Zhu W, Shen Y, Wang J, Xu R, Outesh C, et al. Radiomic-based quantitative CT analysis of pure ground-glass nodules to predict the invasiveness of lung adenocarcinoma. Front Oncol. (2020) 10:872. doi: 10.3389/fonc.2020.00872
22. Nour-Eldin NE, Alsubhi M, Emam A, Lehnert T, Beeres M, Jacobi V, et al. Pneumothorax complicating coaxial and non-coaxial CT-guided lung biopsy: comparative analysis of determining risk factors and management of pneumothorax in a retrospective review of 650 patients. Cardiovasc Intervent Radiol. (2016) 39:261–70. doi: 10.1007/s00270-015-1167-3
23. Lee G, Park H, Bak SH, Lee HY, Choi SH, Jang SJ, et al. Radiomics in lung cancer from basic to advanced: current status and future directions. Korean J Radiol. (2020) 21:159–71. doi: 10.3348/kjr.2019.0630
24. Liu Z, Wang S, Dong D, Wei J, Fang C, Zhou X, et al. The applications of radiomics in precision diagnosis and treatment of oncology: opportunities and challenges. Theranostics. (2019) 9:1303–22. doi: 10.7150/thno.30309
25. Dev VA and Eden MR. Formation lithology classification using scalable gradient boosted decision trees. Comput Chem Eng. (2019) 128:392–401. doi: 10.1016/j.compchemeng.2019.06.001
26. Zhou K, Jianyu Z, Ren Y, Ma L, Zhang L, Xue Q, et al. A gradient boosting decision tree algorithm combining synthetic minority over-sampling technique for lithology identification. Geophysics. (2020) 85:1–52. doi: 10.1190/geo2019-0429.1
27. Taverner J, Lucena CM, Garner JL, Orton CM, Nicholson AG, Desai SR, et al. Low bleeding rates following transbronchial lung cryobiopsy in unclassifiable interstitial lung disease. Respirology. (2024) 29:489–96. doi: 10.1111/resp.14678
Keywords: lung adenocarcinoma, napsin a, nomogram, radiomics, non-invasive diagnosis
Citation: Pang B, Liu L, Gao M, Xu C, Gao Z, Wang Z and Guan J (2025) A nomogram model integrating radiomics and clinical variables to predict napsin a expression in lung adenocarcinoma patients. Front. Oncol. 15:1595406. doi: 10.3389/fonc.2025.1595406
Received: 18 March 2025; Accepted: 16 June 2025;
Published: 04 July 2025.
Edited by:
Marta Zerunian, Sapienza University of Rome, ItalyReviewed by:
Zhichao Zuo, Xiangtan Central Hospital, ChinaFederica Palmeri, Sapienza University of Rome, Italy
Copyright © 2025 Pang, Liu, Gao, Xu, Gao, Wang and Guan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: JianZhong Guan, cGI5NDUyMDIwMjRAMTYzLmNvbQ==; Bo Pang, pangbo94520
†These authors have contributed equally to this work and share first authorship