The clinical validity of radiomics-based prediction of molecular subtypes in breast cancer from digital mammary tomosynthesis

Xue, Jing; Li, Yilun; Qu, Tianyun; Qin, Yidi; Wang, Haoqi; Rong, Xiaocui; Tian, Jingliu; Wang, Tao; Zhang, Jianhua; Li, Zhigang; Ping, Yong

doi:10.3389/fonc.2025.1661116

ORIGINAL RESEARCH article

Front. Oncol., 13 October 2025

Sec. Breast Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1661116


Jing Xue¹

Yilun Li²

Tianyun Qu¹

Yidi Qin³

Haoqi Wang²

Xiaocui Rong¹

Jingliu Tian⁴

Tao Wang³

Jianhua Zhang⁵

Zhigang Li¹*

Yong Ping¹*

¹Department of Radiology, The Fourth Hospital of Hebei Medical University, Shijiazhuang, Hebei, China
²Department of Breast Surgery, The Fourth Hospital of Hebei Medical University, Shijiazhuang, Hebei, China
³Department of Radiology, Shijiazhuang Hospital of Traditional Chinese Medicine, Shijiazhuang, Hebei, China
⁴Imaging Department, Baixiang County Central Hospital, Xingtai, Hebei, China
⁵Department of Radiology, Xiongxian Hospital, Baoding, Hebei, China

Objective: To explore the use of digital breast tomosynthesis (DBT) radiomics in developing breast cancer (BC) diagnostic models that identify the molecular subtype characteristics of BC.

Methods: A retrospective analysis was conducted on 433 DBT images. Candidate features were extracted, and a least absolute shrinkage and selection operator (LASSO) regression model was established. Machine learning (ML) models were constructed in the training set, and their predictive performance was evaluated in the test set using receiver operating characteristic (ROC) curves and confusion matrices to identify the best predictive classifier. Univariate and multivariate Cox regression analyses were conducted to obtain the key features for nomogram modeling, and calibration and decision curve analysis (DCA) were used to evaluate the clinical potential of the model.

Results: LASSO selected 14 features. Random Forest (RF) had the highest AUC, accuracy, sensitivity, recall and F1 score in both the training set and the test set, and was the best classifier. A nomogram model was established. The odds ratio (OR) of BC patients increased as the total score increased.

Conclusion: Radiomics and ML models revealed the key features of BC, and a nomogram model with diagnostic value was constructed.

1 Introduction

Breast cancer (BC) is the most common cancer diagnosed in women, and the second leading cause of all cancer-related deaths (1). Early diagnosis of BC, as well as predicting prognosis and treatment response, is a primary focus of clinical research. Based on specific receptor expression levels, BC subtypes include luminal, human epidermal growth factor receptor 2 (HER2)-enriched, and triple-negative (TN) (2, 3). TN BC is notably more aggressive and does not respond to endocrine therapy or trastuzumab; its distinct MRI patterns can be quantified via radiomics for precise subtype diagnosis (2, 4–6).

Unlike genomic/transcriptomic profiling, which often analyzes limited tumor samples, radiomics assesses whole-tumor heterogeneity (7–9). Mammography, ultrasonography, and MRI findings correlate with molecular subtypes (10–12), and recent efforts focus on predicting these subtypes with radiomics, which extracts high-dimensional, quantitative features from images that capture both tissue characteristics and gene expression profiles (13). Digital breast tomosynthesis (DBT) has become the breast imaging standard: adding DBT to digital mammography increases cancer detection rates versus mammography alone (14, 15). Although MRI excels in tissue characterization, its routine use remains limited (16, 17). Thus, non-invasive subtype prediction using widely available DBT has significant clinical value: it avoids invasive biopsies that cause patient discomfort, reduces the risk of complications from invasive procedures, and enables early, precise subtype-guided treatment.

Despite DBT’s role in BC diagnosis, challenges like increased reading workload and inconsistent mass segmentation (due to numerous image slices) limit radiomic application (18). Synthetic mammography offers a solution for integrating radiomics into clinical practice. However, prior radiomic studies focused on MRI (costly/non-routine) or conventional mammography, with scarce research on DBT-radiomics for subtype prediction, and radiomics-ML integration remains underdeveloped.

Some studies have explored the use of radiomic methods for analyzing molecular subtypes in DBT-derived synthetic mammography. In the study by Xiong et al., which focused on patients with invasive BC, radiomic features proved effective in predicting disease-free survival (DFS) and outperformed clinicopathological nomograms (19). Other investigations have shown that radiomic features derived from magnetic resonance imaging (MRI) correlate with the molecular subtypes of BC (20–22). While MRI combined with radiomics has greatly contributed to personalized BC treatment, research on using DBT radiomics to predict the molecular subtypes of BC is still lacking. Moreover, the combination of radiomics and machine learning to build diagnostic models for BC has not been studied in depth and merits further exploration.

In our study, a variety of machine learning algorithms were applied to establish a combined model of DBT radiomics and immunohistochemistry (IHC) data, and 5 high-risk features of BC were identified, which provide more valuable information to the clinic and support personalized clinical treatment.

2 Materials and methods

2.1 Patients

This retrospective study, approved by our Institutional Review Board (IRB) with waived written consent, identified 433 consecutive female patients who were diagnosed with invasive BC and had available preoperative mammography at our institution between February 2019 and June 2023. This study has been approved by the Ethics Committee of S&T Program of Hebei (20377783D), Medical Science Research Project of Hebei (20221342), and Medical Science Research Project of Hebei (20230863). All patients included in the study met the following inclusion criteria: (1) underwent DBT examination within one month prior to surgery; (2) were pathologically diagnosed with invasive breast carcinoma; (3) had no documented history of any other malignancy; and (4) did not undergo a biopsy or receive treatment for the breast tumor before the DBT examination. Patients were excluded from the study if they met any of the following exclusion criteria: (1) insufficient clinicopathological data or suboptimal image quality; (2) pathological diagnosis of non-invasive breast carcinoma, or concurrent presence of other malignancies; (3) multiple BC lesions or distant metastases; (4) tumors on DBT images that did not appear as masses but presented in other forms, such as pure calcification, asymmetry, or architectural distortion. The 433 cases included 111 patients with luminal A BC, 100 patients with luminal B BC, 107 patients with HER2-positive BC, and 112 patients with basal-like BC. These cases were used to delineate regions of interest (ROI) and extract radiomics features (Figure 1A).

Figure 1

Flowchart and diagram illustrating a study on patients with invasive breast cancer and tumor analysis. Part A shows a selection process involving 1,278 patients, excluding criteria, resulting in 433 patients divided into training and validation sets. Part B depicts tumor segmentation in mammogram views, followed by feature extraction with models such as RegLogistic and SVM. It proceeds to feature selection using Lasso regression and nomogram modeling, concluding with model validation including ROC, calibration, and decision curves.

Figure 1. The flowchart of the study. This flowchart outlines the key steps of the study. (A) Patient selection process. (B) Workflow for constructing and validating the radiomics nomogram.

2.2 DBT examination

In this study, the patients were scanned using a GE Senographe Essential (GE Healthcare) full-field digital mammography system. The typical imaging parameters were within the ranges of 27–32 kV and 28–68 mAs. Both craniocaudal and mediolateral oblique images were obtained for every patient. This imaging technique allowed a standard digital mammogram and a tomosynthesis scan to be acquired under the same breast compression (23, 24). With a single low-dose exposure, the X-ray tube was rotated through an angular range of 12.5 degrees, completing a total of nine rotations. Computerized reconstruction algorithms were then applied to the projections from each viewing angle, enabling three-dimensional visualization of breast tissue.

2.3 Image information

Two experienced radiologists, each with more than 5 years of professional expertise, performed an impartial evaluation of the anonymized DBT images. The three-dimensional ROI that encompassed the tumor on synthetic mammography was manually segmented (as shown in Figures 2, 3) by a resident radiologist with five years of experience (reader 1) using the “3D Slicer” software (https://www.slicer.org/). The delineated ROIs were then examined and verified by a breast radiologist with a decade of subspecialty experience (reader 2). Both readers were blinded to the associated histopathological information to ensure an unbiased judgment. Discrepancies regarding the ROI were resolved through consensus-based discussion; where discrepancies between the two radiologists remained, a third radiologist with more than 10 years of experience was consulted.

Figure 2

Mammogram images showing two different views labeled A and B. Each view consists of two images: one with a white arrow indicating a prominent nodule in the breast tissue, and the adjacent image with the nodule outlined in red.

Figure 2. Tumor segmentation example 1. Example of tumor segmentation on synthetic mammography. The synthetic mediolateral oblique (A) and craniocaudal (B) views of a 69-year-old female diagnosed with the luminal A subtype of breast cancer. The breast lesion appears as a circumscribed and round mass with high density (arrow).

Figure 3

Mammogram image showing two pairs of scans labeled A and B. Each pair includes one scan with an arrow pointing to a dense, circular region, and the other scan shows the same region outlined in red. The dense regions are roughly in the center of the breast area.

Figure 3. Tumor segmentation example 2. Example of tumor segmentation on synthetic mammography. The synthetic mediolateral oblique (A) and craniocaudal (B) views of a 79-year-old female diagnosed with the triple-negative subtype of breast cancer. The breast lesion appears as a spiculated mass (arrow).

2.4 Pathological and IHC analysis

Surgical resection of BC specimens was performed, and the diagnoses were confirmed by histopathological examination. IHC analyses were conducted to assess the expression levels of estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER-2), and the Ki-67 antigen (25, 26). Positive staining in 1% or fewer of the carcinoma nuclei indicated a negative status for ER and PR (26, 27). According to the IHC scoring system, HER-2 expression was categorized into four levels: 0, 1+, 2+, or 3+. A negative HER-2 status was confirmed by either an IHC score of 0 or 1+, or an IHC score of 2+ together with a negative fluorescence in situ hybridization (FISH) result. Conversely, a positive HER-2 status was confirmed by either a score of 3+ or a score of 2+ combined with a positive FISH result (27, 28). If a patient had an IHC score of 2+ but no FISH result, the HER-2 status was recorded as suspicious positive. A Ki-67 proliferation index below 14% was categorized as a low level of proliferation, while a value equal to or exceeding 14% was regarded as a high level of proliferation (28). The IHC antibodies used in this study were as follows: ER (Roche, Clone number SP1), PR (Roche, Clone number IE2), HER2 (Roche, Clone number 4B5), and Ki-67 (Maxin, Clone number MX006).
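The IHC decision rules above can be written as a short Python sketch. This is only an illustration of the stated thresholds, not the authors' software; the function names and the string encoding of the IHC scores are assumptions made for this example.

```python
from typing import Optional


def er_pr_status(percent_positive_nuclei: float) -> str:
    """ER or PR status: staining in 1% or fewer of carcinoma nuclei is negative."""
    return "negative" if percent_positive_nuclei <= 1.0 else "positive"


def her2_status(ihc_score: str, fish_positive: Optional[bool] = None) -> str:
    """HER-2 status from the IHC score and an optional FISH result.

    ihc_score is one of '0', '1+', '2+', '3+' (encoding assumed for this sketch).
    fish_positive is None when no FISH test was performed.
    """
    if ihc_score in ("0", "1+"):
        return "negative"
    if ihc_score == "3+":
        return "positive"
    if ihc_score == "2+":
        if fish_positive is None:
            # 2+ without a FISH result is recorded as suspicious positive.
            return "suspicious positive"
        return "positive" if fish_positive else "negative"
    raise ValueError(f"unknown IHC score: {ihc_score!r}")


def ki67_level(index_percent: float) -> str:
    """Ki-67 proliferation level: below 14% is low, 14% or more is high."""
    return "low" if index_percent < 14.0 else "high"
```

For example, `her2_status("2+", fish_positive=False)` gives a negative status, while `her2_status("2+")` without a FISH result gives "suspicious positive".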

2.5 Lesion segmentation and feature extraction

The ROI on the cranio-caudal (CC) and mediolateral oblique (MLO) views of DBT images was manually delineated, following the contours of the tumor’s maximum diameter area. The lesion segmentation task was conducted by two radiologists, referred to as Radiologist 1 and Radiologist 2, who possessed 10 and 7 years of experience in BC diagnosis, respectively. They employed 3D Slicer (version 4.11; available at http://www.slicer.org) to outline all ROIs while remaining unaware of the histopathological data during this process. Figure 1B presents a diagrammatic representation illustrating the segmentation of the ROI. Prior to commencing feature extraction, the images underwent resampling and grayscale discretization for normalization purposes, adhering to recommendations established in prior research studies (28, 29). Radiomic features were extracted from both CC and MLO images encompassing each patient’s ROI.

2.6 Selection of radiomics features and establishment of the radscore model

Radiomic feature extraction was performed using the PyRadiomics package (version 3.0.1, http://pyradiomics.readthedocs.io) in Python (version 3.7), from the original, wavelet, Laplacian of Gaussian (LoG), exponential, square, square root, logarithm and gradient image types. These radiomics features were processed by Z-score standardization and the Kruskal-Wallis rank sum test to screen out candidate features (p < 0.05).

To screen features for predicting BC, the dataset was randomly split into training and validation subsets at a 7:3 ratio. The radiomics features were normalized via Z-score standardization (28). To identify the most effective set of predictive features, least absolute shrinkage and selection operator (LASSO) regression was applied with five-fold cross-validation. The One-vs-Rest (OvR) strategy was used to predict the sample subtypes by building a binary classifier for each subtype. Based on the training set, the LASSO regression model was constructed with the R package ‘glmnet’ (Ver. 4.1-6) using the parameters ‘family’=‘binomial’ and ‘type.measure’=‘class’. The results of multivariate classification were determined by taking the category with the highest probability (HER2-positive) through 10-fold cross-validation. The LASSO features were those selected when the error rate of the model was lowest. Moreover, receiver operating characteristic (ROC) curves were plotted for the training and testing sets to evaluate the performance of the model in predicting the subtypes of BC (area under the ROC curve (AUC) > 0.70).
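The data handling around the LASSO step can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' R pipeline: the function names are hypothetical, the split is not stratified by subtype, and the Z-score statistics are taken from the training set with the sample standard deviation, none of which the text specifies. The LASSO fit itself (done with ‘glmnet’ in the study) is not reproduced here.

```python
import random
import statistics


def split_indices(n_samples, train_fraction=0.7, seed=0):
    """Shuffle sample indices and split them 7:3 into training and validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # seeded for reproducibility
    n_train = round(n_samples * train_fraction)
    return indices[:n_train], indices[n_train:]


def fit_zscore(train_values):
    """Mean and sample standard deviation of one feature in the training set."""
    return statistics.fmean(train_values), statistics.stdev(train_values)


def apply_zscore(values, mean, std):
    """Standardize values of one feature with previously fitted statistics."""
    return [(v - mean) / std for v in values]


def predict_ovr(class_probabilities):
    """One-vs-Rest decision: return the subtype with the highest probability.

    class_probabilities maps each subtype to the output of its binary classifier.
    """
    return max(class_probabilities, key=class_probabilities.get)


train_idx, valid_idx = split_indices(433)
```

With 433 cases this split gives 303 training and 130 validation samples. The Z-score parameters are estimated on the training values and then reused for the validation values, which is one common way to avoid leaking validation information into the model.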

To assess the correlations between LASSO features and BC subtypes, Dunn’s test was used to determine whether the LASSO features differed significantly between luminal A, luminal B, HER2-positive and basal-like BC. Violin plots were created with the R package ‘ggstatsplot’ (Ver. 0.12.0) to present the results.

2.7 Selection of the optimal classifier and nomogram modeling

To identify the optimal classifier for predicting BC subtypes from the radiomics LASSO features, 8 machine learning (ML) models were constructed with the R package ‘caret’ (Ver. 6.0-94). In the training set, Regularized Logistic Regression (regLogistic), support vector machine (SVM), Random Forest (RF), k-nearest neighbors (KNN), eXtreme Gradient Boosting (xgboost), Gradient Boosting Machine (GBM), Naive Bayes and Neural Network (NNET) models were built to predict the categories of samples, and the diagnostic efficacy of each model was calculated separately with 5-fold cross-validation. The predictive performance of the 8 models was evaluated with ROC curves and confusion matrices in the training and testing sets, and the AUCs of the 8 models in the two sets were compared. Accuracy, sensitivity, specificity, recall, precision and F1 score were also estimated for each model to select the classifier with the best performance. The optimal classifier was then used to rank the importance of each LASSO feature. Subsequently, univariate (p < 0.05) and multivariate Cox regression analyses (p < 0.05) were performed with the R package ‘rms’ (Ver. 6.5-0) to obtain the crucial features for nomogram modeling. The relationships between the crucial features and BC were assessed from the odds ratio (OR) of the patients in the nomogram model. Additionally, a calibration curve and Decision Curve Analysis (DCA) were used to evaluate the predictive power of the nomogram model.
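The confusion-matrix metrics listed above can be derived as in the following Python sketch, which treats each subtype one-vs-rest. It is an illustration only, not the ‘caret’ implementation used in the study; the function names are hypothetical and the example counts are invented. Note that recall is the same quantity as sensitivity.

```python
def per_class_metrics(confusion, labels):
    """One-vs-rest metrics per class from a square confusion matrix.

    confusion[i][j] is the number of samples of true class i predicted as class j.
    """
    total = sum(sum(row) for row in confusion)
    metrics = {}
    for k, label in enumerate(labels):
        tp = confusion[k][k]
        fn = sum(confusion[k]) - tp                  # true class k, predicted otherwise
        fp = sum(row[k] for row in confusion) - tp   # predicted k, true class differs
        tn = total - tp - fn - fp
        sensitivity = tp / (tp + fn) if tp + fn else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        precision = tp / (tp + fp) if tp + fp else 0.0
        denominator = precision + sensitivity
        f1 = 2 * precision * sensitivity / denominator if denominator else 0.0
        metrics[label] = {
            "sensitivity": sensitivity,  # identical to recall
            "specificity": specificity,
            "precision": precision,
            "f1": f1,
        }
    return metrics


def accuracy(confusion):
    """Fraction of all samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def macro_average(metrics, name):
    """Unweighted mean of one metric over all classes."""
    return sum(m[name] for m in metrics.values()) / len(metrics)


# Invented two-class counts, for illustration only.
example = [[8, 2], [1, 9]]
example_metrics = per_class_metrics(example, ["A", "B"])
```

The macro average weights every subtype equally regardless of how many samples it contains, which is why it is reported alongside the micro average for multi-class ROC analysis.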

2.8 Statistical analysis

Bioinformatics analyses were conducted using R software (Version 6.0-94). Significant differences among three or more groups were assessed by the Kruskal-Wallis rank sum test, followed by Dunn’s test for pairwise comparisons between multiple groups. A p-value or adjusted p-value (p.adj) of less than 0.05 was considered statistically significant.

3 Results

3.1 A total of 14 LASSO features were obtained

A total of 7 types of features, encompassing 306 first-order features, 14 shape features, and 1,241 texture features (glcm, gldm, glrlm, glszm and ngtdm), were extracted from the original, wavelet, LoG, exponential, square, squareRoot, logarithm and gradient images (Table 1, Figure 4A). A total of 1,175 candidate features across the 7 types were obtained after Z-score standardization and the Kruskal-Wallis rank sum test (Figure 4B). To screen features that were strongly associated with BC, a LASSO regression model was built in the training set, producing 14 LASSO features when the minimum Lambda value was 0.0481 (Figures 4C, D). The AUCs of the training and testing sets were 0.723 and 0.727, respectively (Figures 4E, F). The correlation analysis between LASSO features and BC subtypes revealed significant differences among the 14 LASSO features across different subtype comparisons. Specifically, between luminal B and HER2-positive subtypes, the following nine features showed significant differences: gradient glszm SmallAreaLowGrayLevelEmphasis, log sigma 1–0 mm 3D glcm Idmn, logarithm ngtdm Busyness, squareroot glcm Imc1, wavelet HHH firstorder RobustMeanAbsoluteDeviat, wavelet LHH ngtdm Busyness, wavelet LLH firstorder Entropy, wavelet LLL glcm Idm and wavelet LLL glszm SizeZoneNonUniformityNormalized. Additionally, between luminal A and luminal B subtypes, eight features exhibited significant differences: gradient glszm SmallAreaLowGrayLevelEmphasis, squareroot glcm Imc1, wavelet LHH firstorder Entropy, wavelet LHH ngtdm Busyness, wavelet LLH firstorder Entropy, wavelet LLH firstorder InterquartileRange, wavelet LLL glcm Idm and wavelet LLL glszm SizeZoneNonUniformityNormalized (Figure 5).

Table 1

Table 1. The number of features extracted.

Figure 4

A: Pie chart showing the distribution of feature types: glrm 17.4%, glszm 17.4%, gldm 15.2%, ngtdm 5.4%, shape 0.9%, firstorder 19.6%, glcm 24.0%. B: Violin plot displaying p-values for feature types, including firstorder, glcm, gldm, glrm, glszm, ngtdm, shape. C: Graph illustrating coefficients against log lambda, highlighting Lambda.min at 0.0181 and Lambda.1se at 0.0481. D: Binomial deviance plot with log lambda, showing data points and error bars. E: Training set Lasso ROC curve with an AUC of 0.723. F: Training set Lasso ROC curve with an AUC of 0.727.

Figure 4. (A) The proportion of extracted features; (B) P-value distribution of statistical tests of radiomics features; (C, D) Feature screening by LASSO logistic regression; (E, F) Model ROC curves for validation and test sets. (A) The percentage of each feature type among the extracted features; (B) The Kruskal-Wallis rank sum test was performed on all radiomics features, and only the features with a p value less than 0.05 were retained; (C) Variation of the feature coefficients with the penalty coefficient; (D) Cross-validation error plots. In the left plot, the horizontal coordinate is Lambda, and the vertical coordinate represents the cross-validation error. The position with the smallest cross-validation error is sought, and the left dotted line marks that position. The optimal Lambda value is read from the horizontal coordinate at the position lambda.min. The minimum Lambda value of 0.0481 produces 14 LASSO features. The right plot shows the size of the mean square error of the model; (E) ROC curve of the model in the validation set; (F) ROC curve of the model in the test set. The AUC of the model ROC curve is greater than 0.7 in both the validation set and the test set.

Figure 5

Violin plots display various features across different groups, including Luminal A, Luminal B, HER2-positive, and Basal-like. Each plot shows data distribution, median, and statistical comparisons among the groups.

Figure 5. Violin plots of the features strongly associated with the different breast cancer (BC) subtypes. The R package ggstatsplot (version 0.12.0) was used to plot the strongly associated features across BC subtypes, to evaluate the association between these features and the BC subtypes, and to show the results of Dunn’s test, the most common post-hoc test after a significant Kruskal-Wallis test. P<0.05 indicates significant differences in the strongly associated features between groups.

3.2 RF model was the optimal classifier

A total of 8 ML models were constructed in the training set to select the classifier that most accurately predicts the BC subtypes. Among the 8 ML models, RF had the highest AUC and accuracy in the training set (Figures 6A, B). In both the training and testing sets, the RF model had the highest AUC values for the four BC subtypes, as well as the highest macro-average and micro-average AUC values, demonstrating the excellent predictive capacity of RF (Figure 6C). In addition, the 8 ML models were assessed using confusion matrices, which showed that RF had the highest accuracy, sensitivity, recall and F1 score of the 8 ML models (Table 2, Figure 6D). The line graph of the AUC differences of the 8 ML models between the training and testing sets confirmed that RF had the highest AUCs in both sets (Figure 6E). In conclusion, RF was the optimal classifier for predicting BC subtypes from the radiomics LASSO features. Consequently, the 14 LASSO features were ranked by the Random Forest model according to their importance. Among these, logarithm ngtdm Busyness and original shape Surface Volume Ratio contributed the most to the model, having the highest importance values (Figure 6F).

Figure 6

Panel A presents a boxplot comparing accuracy, AUC, and Kappa for various models like random forest, xgboost, and knn. Panel B features a dot plot showing confidence intervals of AUC for these models. Panel C consists of multiple ROC curves for model performance across different tasks, annotated with legend details. Panel D displays confusion matrices for each model's testing phase, with color intensity indicating frequency. Panel E includes a line graph comparing AUC for test and train tasks for each model. Lastly, Panel F is a bar chart ranking feature importance, highlighted by color intensity.

Figure 6. (A, B) ROC curves and confusion matrices for the 8 machine learning algorithms; (C) ROC curves of each model in the test set; (D) Confusion matrix of each model in the test set; (E) Line chart of test set and validation set accuracy; (F) Feature importance ranking of Random Forest. (A, B) Random Forest has the highest area under the ROC curve (AUC); (C) The ROC curves use the one-vs-rest (OvR) multi-class strategy, also known as one-vs-all, in which each class is evaluated in turn. At each step, the given class is treated as the positive class, and the remaining classes are treated together as the negative class. ‘macro’: calculates the metric for each label and takes the unweighted average, which does not take label imbalance into account. ‘micro’: calculates the metric globally by counting the total true positives, false negatives, and false positives; (D) A confusion matrix is a table that summarizes the prediction results of a classification model in machine learning by tabulating the records in the data set according to their true category and the category predicted by the model; (E) AUC for different test sets and validation sets; (F) The feature importance of the Random Forest model corresponds to the scores shown on the color scale.

Table 2

Table 2. Model performance metrics.

3.3 Using the nomogram model to predict BC

To identify BC features for nomogram modeling, univariate Cox regression analysis was performed, followed by multivariate Cox analysis. Ten LASSO-selected features that were significant in the univariate analysis (p < 0.05) were incorporated into the multivariate model (Table 3, Figure 7A), yielding 5 crucial features, namely logarithm glrlm RunLengthNonUniformityNormalized, logarithm ngtdm Busyness, original shape SurfaceVolumeRatio, wavelet LLH firstorder InterquartileRange and wavelet LLL glcm Idm (Table 4, Figure 7B). A nomogram model comprising these 5 crucial features was then developed; the OR values of BC patients increased as the total points increased (Figure 7C). The validity and generalizability of the nomogram model were assessed with the calibration curve and the DCA curve. The calibration curve showed that the slope of the nomogram model was close to 1. In addition, the corrected c-index of 0.732 was close to the uncorrected c-index of 0.759 (Figure 7D). DCA further demonstrated that the nomogram model outperformed any single crucial feature by providing a superior net benefit (Figure 7E).
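The relationship between the nomogram's total points and the odds can be illustrated with a short Python sketch. It assumes a logistic model, as described in the caption of Figure 7 and the titles of Tables 3 and 4, in which the total points of a standard nomogram are a linear rescaling of the linear predictor. The intercept and coefficients below are hypothetical placeholders, not the fitted values from Table 4, and only two of the five features are shown for brevity.

```python
import math

# Hypothetical intercept and coefficients, for illustration only.
INTERCEPT = -1.0
COEFFICIENTS = {
    "logarithm_ngtdm_Busyness": 0.8,
    "original_shape_SurfaceVolumeRatio": -0.5,
}


def linear_predictor(features, intercept=INTERCEPT, coefficients=COEFFICIENTS):
    """Log-odds: intercept plus the weighted sum of standardized feature values."""
    return intercept + sum(beta * features[name] for name, beta in coefficients.items())


def odds(log_odds):
    """Odds corresponding to a linear predictor value."""
    return math.exp(log_odds)


def probability(log_odds):
    """Predicted probability corresponding to a linear predictor value."""
    return 1.0 / (1.0 + math.exp(-log_odds))


def odds_ratio_per_unit(beta):
    """Odds ratio for a one-unit increase in a feature with coefficient beta."""
    return math.exp(beta)


low = linear_predictor({"logarithm_ngtdm_Busyness": 0.0,
                        "original_shape_SurfaceVolumeRatio": 0.0})
high = linear_predictor({"logarithm_ngtdm_Busyness": 1.25,
                         "original_shape_SurfaceVolumeRatio": 0.0})
```

Because the exponential function is increasing, a higher linear predictor (and hence a higher total score) always corresponds to higher odds, which matches the trend reported for Figure 7C.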

Table 3

Table 3. Univariate logistic regression model.

Figure 7

Panel A and B display tables with variables, coefficients, odds ratios, confidence intervals, and p-values in red and black text, accompanied by forest plots. Panel C shows a logistic regression nomogram with variables and points, indicating total points and odds. Panel D presents a calibration plot comparing actual vs. predicted probability with a diagonal line, histogram, and calibration curves. Panel E features a decision curve analysis with lines for various predictors and net benefit against high-risk threshold.

Figure 7. (A) Forest plot of the univariate logistic regression model; (B) Forest plot of the multivariate logistic regression model; (C) The nomogram predicted the relative risk of patients; (D) Nomogram calibration curve; (E) Decision curve. (A) A univariate logistic regression model was constructed based on the 14 radiomics features of the training set, and the risk forest plot was drawn according to the model. The results of 10 features in the model were significant (p < 0.05); (B) A multivariate logistic regression model was constructed based on the 14 radiomics features of the training set, and the risk forest plot was drawn according to the model. The results of 5 features in the model were significant (p < 0.05); (C) Multivariate logistic regression analysis was performed to obtain five significant factors, and a nomogram was constructed. Each factor corresponds to a score. The scores of all factors were added to give the total score, and the relative risk (odds ratio) of the patient was predicted from the total score; (D) The calibration curve was drawn based on the above nomogram prediction model. The closer the slope is to 1, the more accurate the prediction is. In addition, the c-index of the model was 0.759, and the corrected c-index was 0.732, indicating that the nomogram model was well fitted and that the predictions of our logistic regression model were good enough to be used in clinical diagnosis; (E) The horizontal coordinate is the threshold probability: in the risk assessment tool, the probability that patient i is diagnosed with the disease is denoted Pi; when Pi reaches a certain threshold (denoted Pt), the case is defined as positive and treatment is administered. Treatment brings a benefit to patients who have the disease and a harm to those who do not, while patients who have the disease but are not treated suffer a loss. The ordinate is the net benefit (NB) after subtracting the harms from the benefits. In addition to the curves that represent different models of clinical diagnosis (identified by the legend), there are two lines that represent the two extremes. The horizontal line indicates that all samples are negative (Pi < Pt), none are treated, and the net benefit is 0. The slanted line indicates that all samples are positive, all are treated, and the net benefit is a line with a negative slope. As can be seen from the figure, within the Pt [0-1] interval, the benefits of the imaging features, clinical features and nomogram are all higher than those of the extreme curves, so the range of usable Pt values is relatively large and safe.
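The net benefit plotted in a decision curve can be computed as in the following Python sketch, which uses the standard definition NB = TP/N − (FP/N) × Pt/(1 − Pt). It is an illustration with invented outcomes and probabilities, not the authors' analysis code; the function names are hypothetical, and Pt must lie strictly between 0 and 1.

```python
def net_benefit(y_true, y_prob, threshold):
    """Net benefit of treating every patient whose predicted probability >= threshold.

    y_true holds 1 for patients with the disease and 0 otherwise.
    """
    n = len(y_true)
    true_pos = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    false_pos = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    weight = threshold / (1.0 - threshold)  # harm of a false positive relative to a true positive
    return true_pos / n - false_pos / n * weight


def net_benefit_treat_all(y_true, threshold):
    """Reference line in which every patient is treated."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


def net_benefit_treat_none():
    """Reference line in which no patient is treated."""
    return 0.0


# Invented outcomes and predicted probabilities, for illustration only.
outcomes = [1, 1, 0, 0]
predicted = [0.9, 0.6, 0.4, 0.1]
```

A model is useful at a given Pt when its net benefit lies above both reference lines, which is the comparison described for panel (E).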

Table 4

Table 4. Multivariate logistic regression model.

4 Discussion

BC has the highest incidence rate among all female cancers globally. Radiomics and machine learning have been widely applied to construct novel cancer diagnostic models, but there has been no complete report on their application in BC. In our study, we developed a model with improved predictive performance based on the specific molecular subtypes of BC to meet individualized treatment needs. We confirmed that radiomics features derived from DBT can predict the manifestations of different molecular types of BC, thus providing more information of value for personalized treatment. In this study, we integrated radiomics with machine learning to uncover five novel key features of BC. Based on these findings, we constructed a nomogram model capable of predicting the risk level in BC patients with acceptable accuracy. Furthermore, we devised a combined radiomic model that integrates the radiomic features derived from DBT with IHC results for personalized risk prediction. This approach underscores the necessity and clinical significance of establishing a robust BC risk prediction model. Compared with a clinical radiological nomogram, the combined radiomic nomogram has superior prognostic performance in patients with different molecular types of BC.

Recent studies have shown that radiomics holds promise for improving the prediction of tumor prognosis. Notably, radiomics-based nomograms built on pre-treatment magnetic resonance imaging can effectively predict the efficacy of neoadjuvant chemotherapy in BC patients (30, 31). In addition, a radiomics signature (Rad-score) was used to predict DFS in HER-2 positive invasive BC receiving neoadjuvant chemotherapy, which may be used to personalize treatment strategies (32). Exploration of tumor heterogeneity by radiomics can be an alternative to genomic and transcriptomic analysis (16, 33, 34). Radiomics of magnetic resonance imaging has shown high performance, and this remains valid for radiomics of mammography, a finding of great importance for studies related to DBT (17, 35). Studies by Ma et al. (17) and Zhang et al. (35) have demonstrated high accuracy in differentiating TN BC subtypes, with Ma’s approach showing optimal TN discrimination (alongside HER2 and luminal subtypes), while Zhang’s radiomics-based method achieved comparable performance in digital mammography. However, all analyses relied on DM imaging, and the replicability of these findings using DBT remains uncertain. Some studies (7) have proposed drawing the ROI on synthetic mammography rather than on the original DBT images in clinical practice, arguing that drawing the ROI on the original DBT images is impractical and has limited reproducibility. Although synthetic mammography may lose some tomographic data, a radiomic model was constructed in this study based on the current research status of DBT. A total of 1175 imaging features (candidate features) were extracted based on the fusion of 433 DBT images in 4 groups of BC subtypes (luminal A-subtype, luminal B-subtype, HER2-positive and basal-like BC).
We identified five novel key features of BC by integrating imagomics and machine learning. Our study presents several significant advantages over previous studies by constructing an ensemble learning model based on mammography and IHC through radiomic analysis of mammography to predict risk models based on molecular subtypes, thus providing enhanced value for personalized treatment. In contrast, prior studies solely relied on routine clinical and radiological features, lacked precise subtype analyses, or utilized only imaging omics methods.

In radiomics, machine-learning methods have the potential to attain greater precision while integrating diverse types of information for a broad array of applications, such as disease diagnosis and prognosis evaluation. Research (8) has indicated that machine-learning models have a marginal advantage over traditional risk factor-based models in predicting future BC risk. Furthermore, neural network-based BC risk prediction models that incorporate imaging features demonstrate outstanding performance. This implies that integrating imaging inputs into machine-learning models can provide more precise BC risk prediction. Prior BC risk assessments have already acknowledged the significance of imaging features in mammography (9, 36). Nevertheless, those models were based on the underlying pattern visually assessed by radiologists, with the whole image subjectively summarized as a mammographic density score that served as the model input (37). Some studies have developed a LASSO-logistic modeling approach that performs initial variable screening and eliminates relatively insignificant coefficients of independent variables in the model (38). Such penalized regression therefore effectively addresses variable collinearity, particularly in high-dimensional screening scenarios (39). Eight machine learning models were evaluated in this study, with the LASSO model used to identify features strongly correlated with BC (LASSO features). LASSO logistic regression was then applied to each BC subtype category, and the category with the highest probability, calculated through 10-fold cross-validation, was selected for classification. Following a comprehensive parameter analysis, the random forest algorithm was chosen as the best-performing machine learning method, and the radiomics score (Rad_score) was calculated. The features of the random forest model were ranked by importance, resulting in 14 significant radiomic features.
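To illustrate how an L1 (LASSO) penalty performs the kind of variable screening described above, the following is a minimal sketch and not the pipeline used in this study. It assumes a continuous response and a plain least-squares LASSO solved by cyclic coordinate descent, whereas the study applied LASSO logistic regression to subtype labels. The synthetic data, the penalty value `alpha`, and the function names `soft_threshold` and `lasso_coordinate_descent` are hypothetical.

```python
import random


def soft_threshold(value, penalty):
    """Shrink value toward zero by penalty; anything within the penalty becomes exactly 0."""
    if value > penalty:
        return value - penalty
    if value < -penalty:
        return value + penalty
    return 0.0


def lasso_coordinate_descent(X, y, alpha, n_iter=200):
    """Minimise (1/2n)*||y - Xw||^2 + alpha*||w||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    residual = list(y)  # residual = y - Xw, and w starts at zero
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # constant-zero column carries no information
            # Covariance of feature j with the residual after adding back its own contribution
            rho = sum(X[i][j] * (residual[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_w = soft_threshold(rho, alpha) / col_sq[j]
            delta = new_w - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                w[j] = new_w
    return w


# Synthetic data (assumption): 6 candidate features, only features 0 and 3 carry signal
rng = random.Random(0)
n_samples, n_features = 200, 6
X = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_samples)]
y = [2.0 * row[0] - 1.5 * row[3] + rng.gauss(0, 0.1) for row in X]

weights = lasso_coordinate_descent(X, y, alpha=0.3)
selected = [j for j, w in enumerate(weights) if w != 0.0]
print("coefficients:", [round(w, 3) for w in weights])
print("selected feature indices:", selected)
```

On this synthetic data the penalty drives the coefficients of the uninformative features to exactly zero, which is the property that makes LASSO useful for screening a large candidate pool such as the 1175 features extracted here. In practice, a library implementation with a cross-validated penalty would normally be preferred over a hand-written solver.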

In oncology research, nomogram models based on multivariate regression analysis (particularly logistic/Cox regression) are widely adopted for predicting clinical outcomes such as tumor recurrence, metastasis, and mortality (40–42). These tools transform identified risk factors into visual scoring systems, with multivariate regression serving as their computational foundation. Compared to conventional methods, nomograms provide enhanced predictive accuracy and interpretability (42, 43), as evidenced by their capacity to quantify variable contributions through graphical outputs. Our implementation aligns with established methodological frameworks in the field: we constructed predictive models and nomograms based on patients’ risk factors, verified their accuracy and validity for predicting patient risk, and evaluated the diagnostic accuracy and clinical value of the models using decision curve analysis. Although this model has some predictive capability, it is not yet suitable for standalone clinical decision-making, such as replacing invasive biopsy to confirm subtypes. Instead, its primary clinical value lies in providing complementary information to guide preliminary treatment planning until its accuracy is further improved.
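Decision curve analysis summarizes clinical usefulness as net benefit at a chosen threshold probability p_t, conventionally computed as TP/n - (FP/n) * p_t / (1 - p_t) and compared with a "treat all" strategy. The sketch below shows that calculation for a binary outcome. The labels, predicted probabilities, and function names are hypothetical and are not taken from this study's data or code.

```python
def net_benefit(y_true, y_prob, threshold):
    """Net benefit of acting on every case whose predicted probability is >= threshold."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    # False positives are weighted by the odds of the threshold probability
    return tp / n - fp / n * threshold / (1 - threshold)


def net_benefit_treat_all(y_true, threshold):
    """Net benefit of the reference strategy that treats every case as positive."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Hypothetical outcomes (1 = event) and model-predicted probabilities
y_true = [1, 1, 1, 0, 0, 0, 0, 1, 0, 0]
y_prob = [0.9, 0.8, 0.6, 0.4, 0.3, 0.2, 0.1, 0.35, 0.7, 0.05]

for pt in (0.2, 0.5):
    model_nb = net_benefit(y_true, y_prob, pt)
    all_nb = net_benefit_treat_all(y_true, pt)
    print(f"threshold={pt:.1f}  model={model_nb:.3f}  treat-all={all_nb:.3f}  treat-none=0.000")
```

A model is considered clinically useful over the range of thresholds where its net benefit exceeds both the treat-all and the treat-none (zero) references; plotting these values across thresholds yields the decision curve.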

This study has several limitations that warrant discussion. First, the constraints inherent in its single-center retrospective design must be acknowledged; these may limit the generalizability of our findings to other populations or institutions because of potential variations in patient demographics, imaging protocols, and pathological practices. Second, training and validating the five key features required a large amount of medical image data. During this process, machine learning algorithms may absorb biases present in the data, potentially leading to skewed predictions. Nevertheless, we remain committed to research on BC subtypes and their diagnosis. Third, we did not analyze the morphological characteristics of the four subtypes, leaving room for future studies to explore them. In addition, we did not address the potential class imbalance among the four BC subtypes during model training; strategies to handle it should be implemented to further improve the model’s robustness and generalizability across all subtypes. Fourth, radiomic features were extracted from manually drawn ROIs. To mitigate the resulting variability, features with poor inter-observer reproducibility were excluded from the analysis. Fifth, the study lacks external validation on an independent cohort, which would strengthen the generalizability of our findings; future studies should include multi-center external validation to confirm model robustness. Sixth, although the radiomic features identified are statistically significant for predicting BC subtypes, their specific pathophysiological implications remain unclear, resulting in limited clinical interpretability. Seventh, although this study constructed a risk model applicable to the clinical diagnosis of BC patients and screened five key features for constructing the nomogram through univariate and multivariate regression analysis, which has some clinical value, we will continue to expand the sample size in subsequent studies to further verify these key features.
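One common way to address the class imbalance noted among the limitations is to weight each class inversely to its frequency during training. The sketch below computes such weights with the heuristic n_samples / (n_classes * class_count), the same rule scikit-learn applies for `class_weight="balanced"`. The subtype counts are invented for illustration and do not reflect the distribution in this study; the function name is hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return weight_c = n_samples / (n_classes * count_c), so rarer classes weigh more."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical subtype labels (assumption, not the study's actual distribution)
labels = ["luminal A"] * 50 + ["luminal B"] * 30 + ["HER2-positive"] * 15 + ["TN"] * 5

weights = balanced_class_weights(labels)
for subtype, weight in weights.items():
    print(f"{subtype}: {weight:.3f}")
```

With these weights every class contributes the same total weight to the loss, so a classifier is no longer rewarded for ignoring the rarest subtype. Resampling approaches are an alternative, and either choice would need to be validated on held-out data.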

5 Conclusions

In summary, this study analyzed patients with Luminal A, Luminal B, HER2-positive and TN BC using radiomics analysis and a variety of machine learning methods. Based on our validation results, these models demonstrate high reproducibility in BC patients. Additionally, we identified potential prognostic variables in patients with BC, with the aim of identifying an optimal classification model and providing new insights for the diagnosis and clinical treatment of BC.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Ethics statement

All experimental protocols were approved by the Ethics Committee of The Fourth Hospital of Hebei Medical University. Informed consent was obtained from all the participants. All methods were carried out in accordance with the Declaration of Helsinki. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

JX: Conceptualization, Writing – original draft, Data curation, Formal Analysis. YL: Writing – review & editing, Data curation, Conceptualization, Methodology. TQ: Methodology, Writing – review & editing, Data curation. YQ: Project administration, Methodology, Investigation, Writing – review & editing. HW: Methodology, Writing – review & editing, Software, Investigation. XR: Writing – review & editing, Investigation, Visualization, Methodology. JT: Methodology, Investigation, Validation, Writing – review & editing. TW: Methodology, Investigation, Writing – review & editing. JZ: Writing – review & editing, Investigation, Methodology. ZL: Writing – review & editing, Funding acquisition, Methodology, Project administration, Supervision. YP: Investigation, Writing – review & editing, Supervision, Resources, Validation.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

BI-RADS, Breast Imaging Reporting and Data System; BC, breast cancer; DM, digital mammography; DBT, digital breast tomosynthesis; PR, progesterone receptor; ER, estrogen receptor; HER2, human epidermal growth factor receptor 2.

References

1. Siegel RL, Kratzer TB, Giaquinto AN, Sung H, and Jemal A. Cancer statistics, 2025. CA: Cancer J Clin. (2025) 75:10–45. doi: 10.3322/caac.21871

2. Harbeck N and Gnant M. Breast cancer. Lancet (London England). (2017) 389:1134–50. doi: 10.1016/S0140-6736(16)31891-8

3. Waks AG and Winer EP. Breast cancer treatment: A review. JAMA. (2019) 321:288–300. doi: 10.1001/jama.2018.19323

4. Xiong X, Zheng LW, Ding Y, Chen YF, Cai YW, Wang LP, et al. Breast cancer: pathogenesis and treatments. Signal transduction targeted Ther. (2025) 10:49. doi: 10.1038/s41392-024-02108-4

5. Prat A, Pineda E, Adamo B, Galván P, Fernández A, Gaba L, et al. Clinical implications of the intrinsic molecular subtypes of breast cancer. Breast (Edinburgh Scotland). (2015) 24 Suppl 2:S26–35. doi: 10.1016/j.breast.2015.07.008

6. Jiang L, You C, Xiao Y, Wang H, Su GH, Xia BQ, et al. Radiogenomic analysis reveals tumor heterogeneity of triple-negative breast cancer. Cell Rep Med. (2022) 3:100694. doi: 10.1016/j.xcrm.2022.100694

7. Gao Y, Li S, Jin Y, Zhou L, Sun S, Xu X, et al. An assessment of the predictive performance of current machine learning-based breast cancer risk prediction models: systematic review. JMIR Public Health Surveill. (2022) 8:e35750. doi: 10.2196/35750

8. Brentnall AR, Harkness EF, Astley SM, Donnelly LS, Stavrinos P, Sampson S, et al. Mammographic density adds accuracy to both the Tyrer-Cuzick and Gail breast cancer risk models in a prospective UK screening cohort. Breast Cancer Res. (2015) 17:147. doi: 10.1186/s13058-015-0653-5

9. Tice JA, Cummings SR, Smith-Bindman R, Ichikawa L, Barlow WE, and Kerlikowske K. Using clinical factors and mammographic breast density to estimate breast cancer risk: development and validation of a new predictive model. Ann Intern Med. (2008) 148:337–47. doi: 10.7326/0003-4819-148-5-200803040-00004

10. Wu M and Ma J. Association between imaging characteristics and different molecular subtypes of breast cancer. Acad Radiol. (2017) 24:426–34. doi: 10.1016/j.acra.2016.11.012

11. Çelebi F, Pilancı KN, Ordu Ç, Ağacayak F, Alço G, İlgün S, et al. The role of ultrasonographic findings to predict molecular subtype, histologic grade, and hormone receptor status of breast cancer. Diagn. Interv. Radiol. (2015) 21:448–53. doi: 10.5152/dir.2015.14515

12. Dogan BE and Turnbull LW. Imaging of triple-negative breast cancer. Ann Oncol. (2012) 23 Suppl 6:vi23–9. doi: 10.1093/annonc/mds191

13. Gillies RJ, Kinahan PE, and Hricak H. Radiomics: Images are more than pictures, they are data. Radiology. (2016) 278:563–77. doi: 10.1148/radiol.2015151169

14. Skaane P, Bandos AI, Gullien R, Eben EB, Ekseth U, Haakenaasen U, et al. Comparison of digital mammography alone and digital mammography plus tomosynthesis in a population-based screening program. Radiology. (2013) 267:47–56. doi: 10.1148/radiol.12121373

15. Bernardi D, Macaskill P, Pellegrini M, Valentini M, Fantò C, Ostillio L, et al. Breast cancer screening with tomosynthesis (3D mammography) with acquired or synthetic 2D mammography compared with 2D mammography alone (STORM-2): A population-based prospective study. Lancet Oncol. (2016) 17:1105–13. doi: 10.1016/S1470-2045(16)30101-2

16. Ma W, Zhao Y, Ji Y, Guo X, Jian X, Liu P, et al. Breast cancer molecular subtype prediction by mammographic radiomic features. Acad Radiol. (2019) 26:196–201. doi: 10.1016/j.acra.2018.01.023

17. Zhang HX, Sun ZQ, Cheng YG, and Mao GQ. A pilot study of radiomics technology based on X-ray mammography in patients with triple-negative breast cancer. J Xray Sci Technol. (2019) 27:485–92. doi: 10.3233/XST-180488

18. Tagliafico AS, Calabrese M, Bignotti B, Signori A, Fisci E, Rossi F, et al. Accuracy and reading time for six strategies using digital breast tomosynthesis in women with mammographically negative dense breasts. Eur Radiol. (2017) 27:5179–84. doi: 10.1007/s00330-017-4918-5

19. Xiong L, Chen H, Tang X, Chen B, Jiang X, Liu L, et al. Ultrasound-based radiomics analysis for predicting disease-free survival of invasive breast cancer. Front Oncol. (2021) 11:621993. doi: 10.3389/fonc.2021.621993

20. Sutton EJ, Dashevsky BZ, Oh JH, Veeraraghavan H, Apte AP, Thakur SB, et al. Breast cancer molecular subtype classifier that incorporates MRI features. J Magn Reson Imaging. (2016) 44:122–9. doi: 10.1002/jmri.25119

21. Li H, Zhu Y, Burnside ES, Huang E, Drukker K, Hoadley KA, et al. Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA data set. NPJ Breast Cancer. (2016) 2. doi: 10.1038/npjbcancer.2016.12

22. Wang J, Kato F, Oyama-Manabe N, Li R, Cui Y, Tha KK, et al. Identifying triple-negative breast cancer using background parenchymal enhancement heterogeneity on dynamic contrast-enhanced MRI: A pilot radiomics study. PloS One. (2015) 10:e0143308. doi: 10.1371/journal.pone.0143308

23. Cai S, Yan J, Cai D, Huang M, and Yan L. Comparison of the diagnostic efficiency between digital breast tomosynthesis and full-field digital mammography. Zhong Nan Da Xue Xue Bao Yi Xue Ban. (2016) 41:1075–81. doi: 10.11817/j.issn.1672-7347.2016.10.011

24. Yang TL, Liang HL, Chou CP, Huang JS, and Pan HB. The adjunctive digital breast tomosynthesis in diagnosis of breast cancer. BioMed Res Int. (2013) 2013:597253. doi: 10.1155/2013/597253

25. Xu M, Tang Q, Li M, Liu Y, and Li F. An analysis of Ki-67 expression in stage 1 invasive ductal breast carcinoma using apparent diffusion coefficient histograms. Quant Imaging Med Surg. (2021) 11:1518–31. doi: 10.21037/qims-20-615

26. Xu M, Li F, Yu S, Zeng S, Weng G, Teng P, et al. Value of histogram of Gray-Scale ultrasound image in differential diagnosis of small triple negative breast invasive ductal carcinoma and fibroadenoma. Cancer Manag Res. (2022) 14:1515–24. doi: 10.2147/CMAR.S359986

27. Zhang J, Wang G, Ren J, Yang Z, Li D, Cui Y, et al. Multiparametric MRI-based radiomics nomogram for preoperative prediction of lymphovascular invasion and clinical outcomes in patients with breast invasive ductal carcinoma. Eur Radiol. (2022) 32:4079–89. doi: 10.1007/s00330-021-08504-6

28. Xu ML, Zeng SE, Li F, Cui XW, and Liu GF. Preoperative prediction of lymphovascular invasion in patients with T1 breast invasive ductal carcinoma based on radiomics nomogram using grayscale ultrasound. Front Oncol. (2022) 12:1071677. doi: 10.3389/fonc.2022.1071677

29. Ji G, Zhu F, Xu Q, Wang K, Wu M, Tang W, et al. Radiomic features at contrast-enhanced CT predict recurrence in early stage hepatocellular carcinoma: a multi-institutional study. Radiology. (2020) 294:568–79. doi: 10.1148/radiol.2020191470

30. Chen S, Shu Z, Li Y, Chen B, Tang L, Mo W, et al. Machine learning based radiomics nomogram using magnetic resonance images for prediction of neoadjuvant chemotherapy efficacy in breast cancer patients. Front Oncol. (2020) 10:1410. doi: 10.3389/fonc.2020.01410

31. Li Q, Xiao Q, Li J, Duan S, Wang H, and Gu Y. MRI-based radiomic signature as a prognostic biomarker for her2-positive invasive breast cancer treated with NAC. Cancer Manag Res. (2020) 12:10603–13. doi: 10.2147/CMAR.S271876

32. Lee HW, Cho HH, Joung JG, Jeon HG, Jeong BC, Jeon SS, et al. Integrative radiogenomics approach for risk assessment of post-operative metastasis in pathological T1 renal cell carcinoma: A pilot retrospective cohort study. Cancers (Basel). (2020) 12:866. doi: 10.3390/cancers12040866

33. Fischer S, Tahoun M, Klaan B, Thierfelder KM, Weber MA, Krause BJ, et al. A radiogenomic approach for decoding molecular mechanisms underlying tumor progression in prostate cancer. Cancers. (2019) 11:1293. doi: 10.3390/cancers11091293

34. Peng C, Ma W, Xia W, and Zheng W. Integrated analysis of differentially expressed genes and pathways in triple-negative breast cancer. Mol Med Rep. (2017) 15:1087–94. doi: 10.3892/mmr.2017.6101

35. Son J, Lee SE, Kim EK, and Kim S. Prediction of breast cancer molecular subtypes using radiomics signatures of synthetic mammography from digital breast tomosynthesis. Sci Rep. (2020) 10:21566. doi: 10.1038/s41598-020-78681-9

36. Tan M, Zheng B, Leader JK, and Gur D. Association between changes in mammographic image features and risk for near-term breast cancer development. IEEE Trans Med Imaging. (2016) 35:1719–28. doi: 10.1109/TMI.2016.2527619

37. Tibshirani R. The lasso method for variable selection in the Cox model. Stat Med. (1997) 16:385–95. doi: 10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3

38. Bunea F, She Y, Ombao H, Gongvatana A, Devlin K, and Cohen R. Penalized least squares regression methods and applications to neuroimaging. Neuroimage. (2011) 55:1519–27. doi: 10.1016/j.neuroimage.2010.12.028

39. Dong D, Zhao D, Li S, Liu W, Du F, Xu X, et al. Nomogram to predict overall survival for patients with non-metastatic cervical esophageal cancer: a SEER-based population study. Ann Transl Med. (2020) 8:1588. doi: 10.21037/atm-20-2505

40. Zhao F, Lu RX, Liu JY, Fan J, Lin HR, Yang XY, et al. Development and validation of nomograms to intraoperatively predict metastatic patterns in regional lymph nodes in patients diagnosed with esophageal cancer. BMC Cancer. (2021) 21:22. doi: 10.1186/s12885-020-07738-9

41. Oh SE, Seo SW, Choi MG, Sohn TS, Bae JM, and Kim S. Prediction of overall survival and novel classification of patients with gastric cancer using the survival recurrent network. Ann Surg Oncol. (2018) 25:1153–9. doi: 10.1245/s10434-018-6343-7

42. Liu S, Yu X, Yang S, Hu P, Hu Y, Chen X, et al. Machine learning-based radiomics nomogram for detecting extramural venous invasion in rectal cancer. Front Oncol. (2021) 11:610338. doi: 10.3389/fonc.2021.610338

43. Raghav K, Hwang H, Jácome AA, Bhang E, Willett A, Huey RW, et al. Development and validation of a novel nomogram for individualized prediction of survival in cancer of unknown primary. Clin Cancer Res. (2021) 27:3414–21. doi: 10.1158/1078-0432.CCR-20-4117

Keywords: breast cancer, imagomics, diagnosis, nomogram model, molecular subtypes

Citation: Xue J, Li Y, Qu T, Qin Y, Wang H, Rong X, Tian J, Wang T, Zhang J, Li Z and Ping Y (2025) The clinical validity of radiomics-based prediction of molecular subtypes in breast cancer from digital mammary tomosynthesis. Front. Oncol. 15:1661116. doi: 10.3389/fonc.2025.1661116

Received: 07 July 2025; Accepted: 22 September 2025;
Published: 13 October 2025.

Edited by:

Salih Ibrahem, University of Kirkuk, Iraq

Reviewed by:

Mustafa Cem Algin, Kutahya Health Sciences University, Türkiye
Shilan Jabbar, University of Kirkuk, Iraq
Khaleel Mohson, University of Baghdad, Iraq
Rezvan Faisal AbdulJabbar, University of Duhok, Iraq

Copyright © 2025 Xue, Li, Qu, Qin, Wang, Rong, Tian, Wang, Zhang, Li and Ping. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yong Ping, ping6812@hebmu.edu.cn; Zhigang Li, zhigangli050011@163.com
