Skip to main content

SYSTEMATIC REVIEW article

Front. Oncol., 04 February 2022
Sec. Breast Cancer
This article is part of the Research Topic Quantitative Imaging and Artificial Intelligence in Breast Tumor Diagnosis View all 27 articles

The Diagnostic Performance of Machine Learning-Based Radiomics of DCE-MRI in Predicting Axillary Lymph Node Metastasis in Breast Cancer: A Meta-Analysis

Jing ZhangJing ZhangLongchao LiLongchao LiXia ZheXia ZheMin TangMin TangXiaoling ZhangXiaoling ZhangXiaoyan LeiXiaoyan LeiLi Zhang*Li Zhang*
  • Department of MRI, Shaanxi Provincial People’s Hospital, Xi’an, China

Objective: The aim of this study was to perform a meta‐analysis to evaluate the diagnostic performance of machine learning(ML)-based radiomics of dynamic contrast-enhanced (DCE) magnetic resonance imaging (MRI) DCE-MRI in predicting axillary lymph node metastasis (ALNM) and sentinel lymph node metastasis(SLNM) in breast cancer.

Methods: English and Chinese databases were searched for original studies. The Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) and Radiomics Quality Score (RQS) were used to assess the methodological quality of the included studies. The pooled sensitivity, specificity, diagnostic odds ratio (DOR), and area under the curve (AUC) were used to summarize the diagnostic accuracy. Spearman’s correlation coefficient and subgroup analysis were performed to investigate the cause of the heterogeneity.

Results: Thirteen studies (1618 participants) were included in this meta-analysis. The pooled sensitivity, specificity, DOR, and AUC with 95% confidence intervals were 0.82 (0.75, 0.87), 0.83 (0.74, 0.89), 21.56 (10.60, 43.85), and 0.89 (0.86, 0.91), respectively. The meta-analysis showed significant heterogeneity among the included studies. There was no threshold effect in the test. The result of subgroup analysis showed that ML, 3.0 T, area of interest comprising the ALN, being manually drawn, and including ALNs and combined sentinel lymph node (SLN)s and ALNs groups could slightly improve diagnostic performance compared to deep learning, 1.5 T, area of interest comprising the breast tumor, semiautomatic scanning, and the SLN, respectively.

Conclusions: ML-based radiomics of DCE-MRI has the potential to predict ALNM and SLNM accurately. The heterogeneity of the ALNM and SLNM diagnoses included between the studies is a major limitation.

Highlights

Overall pooled AUC was 0.89 with (95%CI: 0.86, 0.91)

ML, 3.0 T, being manually drawn, using biopsy as gold standard could improve diagnostic performance compared to deep learning, 1.5 T, semiautomatic scanning, pathology, respectively.

Introduction

*Axillary lymph node metastasis (ALNM) is common in breast cancer patients and determines the clinical stage, treatment plans, surgical procedure and patient outcome (1, 2). Currently, the axillary lymph node (ALN) status of patients with breast cancer is diagnosed by sentinel lymph node biopsy (SLNB) and axillary lymph node dissection (ALND). However, these procedures are not risk-free operations and can potentially lead to implantation metastasis (3). Therefore, it is essential to explore a noninvasive approach for assessing ALNM to reduce the incidence of surgical complications and improve the patient’s quality of life.

Dynamic contrast-enhanced (DCE) magnetic resonance imaging (MRI) has generally been well accepted and routinely used for breast cancer staging (4, 5). For predicting ALNM, previous studies of DCE-MRI have primarily focused on node size, cortical thickness, disappearance of lymph parenchyma, and enhancement patterns (6). Unfortunately, early diagnosis of ALNM through DCE-MRI is not yet ideal since it is limited by subjective factors, such as the radiologist’s experience and knowledge level. Additionally, subtle changes, such as cell density, morphology, and microtissue structure, in ALNM might not be apparent to the naked eye (7, 8).

In recent years, radiomics and machine learning (ML) models have become increasingly popular for analyzing diagnostic images (9, 10). The ability of radiomics analysis to maximize the number of features in quantitative images has excellent potential for evaluating ALNM in breast cancer patients (1115).

However, because of the small sample sizes of previous studies, statistical research has been limited, and research results have also varied from study to study. Thus, it is necessary to perform a meta‐analysis to further evaluate the diagnostic performance of ML-based radiomics of DCE-MRI in predicting ALNM and SLNM in breast cancer.

Materials and Methods

We conducted and reported this meta-analysis based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines (16).

Literature Search

The PubMed, Embase, Web of Science, and Cochrane Library databases and four Chinese databases [VIP, CNKI, Wanfang and Chinese BioMedical Literature Databases (CBM)] were searched by two observers independently to identify studies. The search was performed on June 23, 2021, without a start date limit. The study search was conducted using the following keywords: “magnetic resonance imaging”, “MRI”, “MRI scans”, “breast cancer”, “breast carcinoma”, “metastasis”, “machine learning”, “radiomics” and “lymph node”. MeSH terms and variations of each term were used. Moreover, we restricted the studies to those published in English or Chinese and performed a manual search of the related articles’ reference lists to identify other articles that might meet the inclusion criteria. Endnote software, version X9, was used to manage all records. Disagreements were discussed and resolved to reach a consensus.

Study Selection

The titles and abstracts of potentially relevant studies were screened for appropriateness by two reviewers(Z-J and Z-L). Inconsistencies were discussed by the reviewers, and consensus was reached.

All of the studies were selected according to the following criteria: (a) original research studies; (b) patients with breast cancer were enrolled who were confirmed to have ALNM or SLNM by biopsy or histopathology; (c) ML-based DCE-MRI applied to classify ALNM or SLNM using radiomics; and (d) data are sufficient to reconstruct the 2×2 contingency table to estimate the sensitivity and specificity of the diagnosis.

Studies were excluded if: (a) reviews, editorials, abstracts, animal studies, and conference presentations; and (b) multiple reports published for the same population (in this case, the publication with the most details was chosen to be included in this meta-analysis).

Data Extraction

Relevant data were extracted from each study, including the first author, publication year, sample size, magnetic field strength, information about radiomics and ML pipeline, data sources and reference standards, detailed information on lesion segmentation, contrast agents, and DCE phases. For each study, the true positive (TP), false-positive (FP), false negative (FN), and true negative (TN) values were extracted, and a pairwise (2×2) contingency table was created.

Data Quality Assessment

The Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) and Radiomics Quality Score (RQS) were used to assess the methodological quality of the included studies and the risk of bias at the study level, respectively (17, 18). RQS items comprise: (a) image acquisition; (b) radiomics feature extraction; (c) data modeling; (d) model validation; and (e) data sharing. Each of the 16 items (Table 1) of the RQS is rated, resulting in a total of points ranging from −8 to 36, with −8 defined as 0% and 36 defined as 100% (18).

TABLE 1
www.frontiersin.org

Table 1 Elements of the RQS and average rating achieved by the studies included in this meta-analysis.

The QUADAS-2 tool consists of: (a) patient selection; (b) index test; (c) reference standard; and (d) flow and timing. Two independent reviewers (L-LC and Z-L) conducted the quality assessment, and disagreements were discussed with a third reviewer (T-M) to reach a consensus.

Statistical Analysis

This meta-analysis was conducted via Stata software, version 16.0, Review Manager software, version 5.3, and the Open Meta-analyst software tool. The predictive accuracy was quantified using pooled sensitivity, specificity, diagnostic odds ratio (DOR), positive likelihood ratio (PLR) and negative likelihood ratio (NLR) with 95% confidence intervals (CIs). The summary receiver operating characteristic curve (SROC) and area under the curve (AUC) were used to summarize the diagnostic accuracy.

Q and I2 were calculated to estimate the heterogeneity among the studies included in this meta-analysis. I2 values of 0 to 25%, 25 to 50%, 50 to 75% and >75% represent very low, low, medium and high heterogeneity, respectively. Pooling studies and effect size were evaluated using a random-effects model, indicating that estimating the distribution of true effects between studies considers heterogeneity (19). If there was obvious heterogeneity, Spearman’s correlation coefficient was used to assess the threshold effect between the sensitivity logit and the specificity logit. Subgroup analysis was performed to further investigate the cause of the heterogeneity. The following covariates were used to explain factors that could contribute to heterogeneity: (a) 1.5 T MR vs. 3.0 T MR; (b) Pathology of SLNB or ALND vs. Pathology of ALND; (c) deep learning vs. ML; (d) ALN vs. SLN vs. ALN and SLN; (e) area of interest (ROI) including ALN vs. ROI including breast cancer; and (f) semiautomatic vs. manual drawing; (g) support vector machines(SVM) vs. logistic regression(LR); (h) Siemens MR equipment vs. GE MR equipment.

In addition, the sensitivity analysis was assessed by eliminating the included studies one by one. The effective sample size funnel plot described by Deek’s test was used to estimate publication bias (20).

Clinical Utility

A Fagan plot was used to assess the clinical utility, which provided the posttest probability (P post) of ALNM when pretest probabilities (P pre, suspicion of ALNM) were calculated (21).

Results

Literature Search

The complete literature search flowchart is presented in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1 Flow diagram of study selection for meta-analysis.

According to the search strategy described above, 450 potentially eligible citations were identified. After removing 95 duplicate records, 355 titles were considered. After the title and abstract evaluation, 268 citations were omitted because they did not meet the inclusion criteria. After revision, 22 articles were excluded, leaving 13 articles for inclusion in the meta-analysis (1115, 2229).

Data Quality Assessment

The 13 studies achieved an average RQS range of 11.38, a median of 13, and a range of 5 to 15. The mean RQS proportion was 13.9%, with a maximum of 41.7%. Table 1 summarizes the mean scores for each dimension, and Table S1 (Supplement Materials) shows the RQS for each study and the individual scores for each study. None of the included articles employed prospective validation, and only one study evaluated the cost-effectiveness of radiomics (25). No studies publicly shared segmentation, functionality, or code. Generally, the data quality was considered acceptable, and the details of the risk of bias and applicability concerns of the included studies are presented in Figure 2.

FIGURE 2
www.frontiersin.org

Figure 2 The risk of bias (left) and concerns for applicability (right) for each included study using QUADAS-2.

Characteristics of the Included Studies

The characteristics of the included studies are summarized in Tables 2 and 3. Detailed two-by-two contingency tables of every study are shown in Table S2 (Supplement Materials). The 13 studies included in this meta-analysis had 2253 patients, and 1618 participants in valid or test set. All of the studies used retrospectively collected data. The models in these studies included ML (n=11) and deep learning (n=2) for unsupervised learning. Of these models, the ML algorithm comprised different types of logistic regression models (12, 15, 22, 23, 28), convolutional neural network models (26, 27), multiple classifier systems (11, 13, 14, 24, 25), and support vector machine models (29).

TABLE 2
www.frontiersin.org

Table 2 Baseline characteristic of included studies (1).

TABLE 3
www.frontiersin.org

Table 3 Baseline characteristic of included studies (2).

In 13 studies, different phase analysis methods of DCE were used, including the strongest enhanced phase, the second postcontrast phase, the first postcontrast images, and two-phase images in 4 (12, 22, 23, 29), 3 (11, 13, 28), 3 (14, 25, 27), and 3 (15, 24, 26) studies, respectively. 3 T scanners were used in 8 studies (1113, 22, 24, 25, 28, 29), and 1.5 T MR was used in 5 studies (14, 15, 23, 26, 27). Seven studies (1315, 23, 24, 28, 29) employed SLNB or ALND to serve as the reference standard, while the remaining 3 studies (12, 25, 26) were based on ALND. Additionly, PET/CT (27) and ultrasound-guided fine-needle aspiration or ALND (11) of 1 study, respectively. Five studies (11, 12, 2527) focused specifically on ALN, whereas 3 studies (2224) focused on SLN, the remaining 5 studies (1315, 28, 29) focused on ALN and SLN. Only 4 studies (11, 13, 26, 27) used semiautomatic segmentation, and 9 studies (12, 14, 15, 2225, 28, 29) used manual ROIs. The ROIs of the breast tumor area and ALN area were employed in 10 studies (1115, 2224, 26, 29) and 3 studies (25, 27, 28), respectively.

Data Analysis

For all 13 studies, the mean values and 95% CIs of pooled sensitivity, specificity, PLR, NLR, and DOR for the radiomics signature based on DCE-MRI in assessing ALNM and SLNM in breast cancer were 0.82 (0.75, 0.87), 0.83 (0.74, 0.89), 4.70 (3.01, 7.35), 0.22 (0.15, 0.31), and 21.56 (10.60, 43.85), respectively (Table 4). The ML models for ALNM and SLNM in breast cancer showed an overall pooled AUC=0.89 (0.86, 0.91) (Figure 3).

TABLE 4
www.frontiersin.org

Table 4 The results of subgroup analysis.

FIGURE 3
www.frontiersin.org

Figure 3 Hierarchical summary receiver operating characteristic (SROC). curve of the diagnostic performance of ML-based radiomics of DCE-MRI in predicting ALNM in breast cancer.

Exploration of Heterogeneity

There was significant heterogeneity in sensitivity (I2 = 80.6%) and specificity (I2 = 89.57%). As shown in Figure 4, the results of the diagnostic threshold analysis showed that there is no threshold effect because Spearman’s correlation coefficient was 0.181, and the P value was 0.553.

FIGURE 4
www.frontiersin.org

Figure 4 Forest plots of the sensitivity and specificity of ML-based radiomics of DCE-MRI in predicting ALNM in breast cancer. I2>50% indicated substantial heterogeneity in the diagnostic parameters across studies.

Subgroup analysis was also performed by comparing studies with the different variables. Table 4 shows the results of the analysis for subgroups.

Studies (n =11) using ML had higher specificity (0.83 vs. 0.65) and an equivalent sensitivity (0.80 vs. 0.84) compared to studies (n=2) that used deep learning. The studies that used a 3.0 T MR had higher sensitivity (0.82 vs. 0.78) and specificity(0.83 vs. 0.76) than those that used 1.5 T MR. Five studies with SLNB or ALND as the gold standard had an equivalent sensitivity (0.82 vs. 0.80) and specificity(0.82 vs. 0.80) with studies(n=3) with ALND as reference standard. Studies (n=3) that only included SLNs had lowest sensitivity (0.71 vs. 0.81 vs.0.84) and an similar specificity (0.80 vs. 0.78 vs. 0.82) in among studies that only included ALNs and combined SLNs and ALNs groups. Eight manually drawn studies had higher specificity (0.84 vs. 0.74) and equivalent sensitivity (0.80 vs. 0.82) than studies (n=5) using semiautomatic segmentation. Studies (n =3) with LN as the ROI had higher sensitivity (0.85 vs. 0.79) and equivalent specificity (0.81 vs. 0.80) compared to studies (n =10) with breast cancer as the ROI. ML including the different algorithms in models, SVM algorithms had higher sensitivity (0.81 vs. 0.75) and lower specificity (0.75 vs. 0.88) compared to studies with LR algorithms. The studies that used Siemens MR equipment had higher sensitivity (0.88 vs. 0.77) and specificity (0.82 vs. 0.75) than studies used GE equipment. The corresponding forest plots are presented in Figures S1–8 (Supplement Materials).

Sensitivity Analyses

There were no significant changes when eliminating the included studies one by one. The results of sensitivity analyses for each study are shown in Table S3 (Supplement Materials).

Publication Bias

There was no publication bias based on the Deeks funnel plot (P=0.22) (Figure 5) (20).

FIGURE 5
www.frontiersin.org

Figure 5 Deeks funnel plot shows the likelihood of publication bias is low with a P value of 0.22. ESS, effective sample size.

Clinical Utility

Using an ML-based radiomics DCE-MRI model would increase the posttest probability to 54 from 20% with a PLR of 5 when the pretest was positive and would reduce the posttest probability to 5% with an NLR of 0.22 when the pretest was negative (Figure 6).

FIGURE 6
www.frontiersin.org

Figure 6 Fagan plot of ML-based radiomics models of DCE-MRI in predicting ALNM in breast cancer.

Discussion

In our meta-analysis, radiomics DCE-MRI showed promising results for ALNM characterization, with a pooled sensitivity, specificity, and AUC of 0.82, 0.83, and 0.89, respectively. This finding indicates that this approach could be considered an effective and accurate tool for ALNM and SLNM prediction.

In the present study, we found that there was obvious heterogeneity between the studies. Indeed, heterogeneity can be caused by many factors, e.g., threshold effect, different magnetic fields, segmentation, etc. In this meta-analysis, the threshold effect was not the source of heterogeneity because Spearman’s coefficient was not significant. Therefore, subgroup analysis was used to determine the source of heterogeneity. Our results demonstrated that studies using 3.0 T MR had better diagnostic performance than studies using 1.5 T MR. We are not surprised by this result. Since high magnetic fields can improve image resolution, they can help to improve diagnostic accuracy. Another subgroup analysis result showed that studies employing ML have slightly better value than those employing deep learning. Deep learning has greater potential for very large datasets with thousands or even millions of instances. In this setting, datasets usually consist of hundreds of patients at most, which is better than with deep learning in this case. Similar findings have been previously reported for ML in other applications (9, 10, 30). However, deep learning only included two studies. Future studies employing deep learning are needed to confirm this conclusion.

ROIs including the ALN area have good diagnostic performance compared with ROIs including the breast tumor area. While an ROI of the ALN is useful to evaluate ALN status, it suffers from some limitations, such as the ALN breast surface coil being mainly concentrated in the breast area; nevertheless, some positive lymph nodes might be located at the edge of the coil, and some might not even be in the imaging range (31). Studies have focused on breast tumors themselves, which could help to avoid the above limitations. Studies with SLNB or ALND as the gold standard had an equivalent sensitivity and specificity with ALND group. The reason may be that the patient with negative SLN, SLNB maybe an effective and accuracy approach. The sensitivity of predict SLNM is lower than that to predict ALNM and the two kinds of LNs. Therefore, for SLNM, the diagnostic performance of this imaging tool might not be satisfactory, as concluded in this meta-analysis. Further studies should investigate how to improve the sensitivity of SLNM. Although studies in which ROIs are manually drawn by radiologists might be more prone to error and user variability, the prediction is still good compared with the semiautomatic segmentation method. However, manual segmentation is time consuming, tedious, and prone to error. In the future, it would be ideal to develop a reliable and validated automatic method. Our results showed that LR algorithm had higher DOR than SVM. Generally, LR and SVM algorithms are all suitable for model construction with small sample sizes and binary variables. However, for ML-based DCE-MRI radiomics in predicting ALNM, the LR algorithm is more recommended for use with our meta-analysis result. We also found that studies using Siemens MR equipment had higher diagnostic performance than using GE equipment. It means different MR equipment maybe affect the diagnostic performance. Therefore, prospective studies compared the two MR equipment are necessary to explore the diagnostic performance of ML-based DCE-MRI radiomics in predicting ALNM and SLNM. In addition, different DCE phases and cross-validation of different multiples could lead to unknown biases. Moreover, other unmentioned differences between studies might contribute to the heterogeneity.

A previous meta-analysis (32) including 3 studies of DCE-MRI (n=187) reported that the mean sensitivity and specificity were 0.88 and 0.73, respectively. Another study (6) included 7 studies using DCE-MRI and reported that the median sensitivity was 0.60 (range 0.33.3–0.97) (31). Our findings showed higher sensitivity than studies that included DCE-MRI. Conventional DCE only included morphology and a few quantitative parameters. However, radiomics could provide many new quantitative imaging markers and help to characterize heterogeneous tumor lesions (33). This method could provide more valuable information to help radiologists to improve detection, diagnosis, staging, and prediction power.

Limitations

All of the methodological issues followed the Cochrane handbook (34), but there are still some limitations that must be discussed. First, a relatively small number of studies met the selection criteria. The second limitation was the significant heterogeneity, which is an issue similar to that in other meta-analyses of diagnostic accuracy using ML based on radiomics (9, 10, 30).

Furthermore, study characteristics, such as different ROIs, DCE phases, and reference standards, could lead to heterogeneity. Therefore, we employed subgroup analysis to reduce heterogeneity.

Third, while there were some uncertainties in the QUADAS-2 assessment, the overall quality of the study was sufficient for analysis. Thus, this uncertain risk might not have had a significant impact on the outcomes.

Fourth, 3 studies(3/13)showed an RQS score<20%. The mean RQS score obtained by analyzing the articles reviewed in this study was 11.1 (30.1%), indicating moderate overall quality. The most important points were the type of study, biological relevance tests and discussion, validation, comparison with the gold standard, potential clinical utility, economic analysis and open scientific data (Table 1 and Table S1). Fifth, in most studies, the lymph nodes assessed by MR have not been specifically associated with histological findings in a node-to-node manner, which is a difficult problem to solve in clinical practice. And it is inevitable that very small lesions may be missed through DCE-MRI. Sixth, some studies used the SLNB as reference standard, which may be caused some false negative rate. Finally, in this meta-analysis, the PLR, NLR and posttest probability were moderate, which would limit the recommendation of their integration into clinical practice.

Future

To improve the clinical applicability of future studies utilizing ML-based radiomics for ALNM, several factors must be followed.

First, external validation is usually not performed, which should be seen as a major limitation in the field of study. Therefore, it is advisable to verify the accuracy of these models further. When reporting ML-based radiomics, it is crucial to follow quality guidelines that include external validation. Second, future studies should also consider expanding datasets from multiple centers to overcome imbalances caused by oversampling small samples and to improve classifier performance. Third, the variation process might affect bias. There are significant variations in the number of features selected, the risk of overfitting and redundancy, and the preprocessing steps (such as manual segmentation), reducing reproducibility. In addition, the different DCE phases should be considered. Therefore, it is necessary to build better radiomics and ML paper standards to establish image acquisition, segmentation, feature engineering, statistical analysis and report format standardization to achieve reproducibility and facilitate the search for radiomics (35). Finally, the ALNM and SLNM prediction model was constructed with a combination of MR radiomics and DCE quantitative parameter and clinical characteristic data to further explore more precise predictions and to improve the clinical utility for ALNM and SLNM.

Conclusion

Our results indicated that ML-based DCE-MRI radiomics indicates good diagnostic performance in predicting ALNM and SLNM in breast cancer with high sensitivity and specificity. Nevertheless, due to the heterogeneity of the included studies, caution should be taken when applying the results.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author Contributions

JZ, LL, and LZ conceived and designed the study strategy. LL and MT worked for study search. JZ an LZ worked for study selection. LL and LZ extracted data from each included study and assessed the study quality. XL and JZ prepared the tables and all figures. XLZ and XZ worked as the supervisor and made arbitration for all possible disagreements. All authors have read and approved the content.

Funding

This work was supported by the Talent Support Program of Shaanxi Provincial People’s Hospital (2021JY-43).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.799209/full#supplementary-material

Abbreviations

ALNM, Axillary lymph node metastasis; SLNM, Sentinel lymph node metastasis; ALND, Axillary lymph node dissection; AUC, Area under the curve; CI, Confidence intervals; DCE, Dynamic contrast-enhanced; MRI, Magnetic resonance imaging; ML, Machine learning; PLR, Positive likelihood ratio; QUADAS-2, Quality Assessment of Diagnostic Accuracy Studies 2; RQS, Radiomics Quality Score; SROC, Summary receiver operating characteristic curve; SLNB, Sentinel lymph node biopsy; NLR, Negative likelihood ratio.

References

1. Van Roozendaal LM, Schipper RJ, Van de Vijver KK, Haekens CM, Lobbes MB, Tjan-Heijnen VC, et al. The Impact of the Pathological Lymph Node Status on Adjuvant Systemic Treatment Recommendations in Clinically Node Negative Breast Cancer Patients. Breast Cancer Res Treat (2014) 143:469–76. doi: 10.1007/s10549-013-2822-5

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Voutsadakis IA, Spadafora S. Axillary Lymph Node Management in Breast Cancer With Positive Sentinel Lymph Node Biopsy. World J Clin Oncol (2015) 6:1–6. doi: 10.5306/wjco.v6.i1.1

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Kootstra J, Hoekstra-Weebers JE, Rietman H, de Vries J, Baas P, Geertzen JH, et al. Quality of Life After Sentinel Lymph Node Biopsy or Axillary Lymph Node Dissection in Stage I/II Breast Cancer Patients: A Prospective Longitudinal Study. Ann Surg Oncol (2008) 15:2533–41. doi: 10.1245/s10434-008-9996-9

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Kuhl CK, Schild HH. Dynamic Image Interpretation of MRI of the Breast. J Magn Reson Imaging (2000) 12:965–74. doi: 10.1002/1522-2586(200012)12:6<965:aid-jmri23>3.0.co;2-1

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Karahaliou A, Vassiou K, Arikidis NS, Skiadopoulos S, Kanavou T, Costaridou L. Assessing Heterogeneity of Lesion Enhancement Kinetics in Dynamic Contrast-Enhanced MRI for Breast Cancer Diagnosis. Br J Radiol (2010) 83:296–309. doi: 10.1259/bjr/50743919

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Kuijs VJ, Moossdorff M, Schipper RJ, Beets-Tan RG, Heuts EM, Keymeulen KB, et al. The Role of MRI in Axillary Lymph Node Imaging in Breast Cancer Patients: A Systematic Review. Insights Imaging (2015) 6:203–15. doi: 10.1007/s13244-015-0404-2

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Dong YH, Feng QJ, Yang W, Lu ZX, Deng CY, Zhang L, et al. Preoperative Prediction of Sentinel Lymph Node Metastasis in Breast Cancer Based on Radiomics of T2-Weighted Fat-Suppression and Diffusion-Weighted MRI. Eur Radio (2018) 28:582–91. doi: 10.1007/s00330-017-5005-7

CrossRef Full Text | Google Scholar

8. Nowikiewicz T, Wnuk P, Małkowski B, Kurylcio A, Kowalewski J, Zegarski W. Application of Artificial Neural Networks for Predicting Presence of Non-Sentinel Lymph Node Metastases in Breast Cancer Patients With Positive Sentinel Lymph Node Biopsies. Arch Med Sci (2017) 13:1399–407. doi: 10.5114/aoms.2016.57677

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Cuocolo R, Cipullo MB, Stanzione A, Romeo V, Green R, Cantoni V, et al. Machine Learning for the Identification of Clinically Significant Prostate Cancer on MRI: A Meta-Analysis. Eur Radiol (2020) 30:6877–87. doi: 10.1007/s00330-020-07027-w

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Sohn CK, Bisdas S. Diagnostic Accuracy of Machine Learning-Based Radiomics in Grading Gliomas: Systematic Review and Meta-Analysis. Contrast Media Mol Imaging (2020) 2020:1–12:2127062. doi: 10.1155/2020/2127062

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Arefan D, Chai R, Sun M, Zuley ML, Wu SD. Machine Learning Prediction of Axillary Lymph Node Metastasis in Breast Cancer: 2D Versus 3D Radiomic Features. Med Phys (2020) 47:6334–42. doi: 10.1002/mp.14538

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Chen JM, Zhu HY, Gao J, Ge YQ, Wang MH, Li Y, et al. Radiomics Models Based Onclinical-Pathology and Conventional and Functional MRI for Predicting Lymph Node Metastases of Breast Cancer Axillary. Chin J Med Imaging Technol (2021) 37:885–90. doi: 10.13929/j.issn.1003-3289.2021.06.022

CrossRef Full Text | Google Scholar

13. Cui XY, Wang N, Zhao Y, Chen S, Li SB, Xu MJ, et al. Preoperative Prediction of Axillary Lymph Node Metastasis in Breast Cancer Using Radiomics Features of DCE-MRI. Sci Rep (2019) 9:2240. doi: 10.1038/s41598-019-38502-0

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Han L, Zhu YB, Liu ZY, Yu T, He CJ, Jiang WY, et al. Radiomic Nomogram for Prediction of Axillary Lymph Node Metastasis in Breast Cancer. Eur Radiol (2019) 29:3820–9. doi: 10.1007/s00330-018-5981-2

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Li L, Yu T, Sun JQ, Jiang SX, Liu DH, Wang XX, et al. Prediction of the Number of Metastatic Axillary Lymph Nodes in Breast Cancer by Radiomic Signature Based on Dynamic Contrast-Enhanced MRI. Acta Radiol (2021) 0:1–9. doi: 10.1177/02841851211025857

CrossRef Full Text | Google Scholar

16. Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. BMJ (2009) 339:b2535. doi: 10.1136/bmj.b2535

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies. Ann Intern Med (2011) 155:529–36. doi: 10.7326/0003-4819-155-8-201110180-00009

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Philippe L, Ralph TH, Timo MD, Jurgen P, Evelyn EC, Janita van T, et al. Radiomics: The Bridge Between Medical Imaging and Personalized Medicine. Nat Rev Clin Oncol (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | CrossRef Full Text | Google Scholar

19. DerSimonian R, Laird N. Meta-Analysis in Clinical Trials. Cont Clin Trials (1986) 7:177–88. doi: 10.1016/0197-2456(86)90046-2

CrossRef Full Text | Google Scholar

20. Deeks JJ, Macaskill PE, Irwig L. The Performance of Tests of Publication Bias and Other Sample Size Effects in Systematic Reviews of Diagnostic Test Accuracy Was Assessed. J Clin Epidemiol (2005) 58:882–93. doi: 10.1016/j.jclinepi.2005.01.016

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Hellmich M, Lehmacher W. A Ruler for Interpreting Diagnostic Test Results. Methods Inf Med (2005) 44:124–6. doi: 10.1055/s-0038-1633930

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Liu MJ, Mao N, Ma H, Shi YH, Dong JJ, Yang P, et al. Radiomics Based on DCE-MRI for the Preoperative Prediction of SLN Metastasis in Breast Cancer. Chin Imaging J Integr Tradit West Med (2020) 18:227–31. doi: 10.3969/j.issn.1672-0512.2020.03.003

CrossRef Full Text | Google Scholar

23. Liu CL, Ding J, Spuhler K, Gao Y, Serrano SM, Moriarty M, et al. Preoperative Prediction of Sentinel Lymph Node Metastasis in Breast Cancer by Radiomic Signatures From Dynamic Contrast-Enhanced MRI. J Magn Reson Imaging (2019) 49:131–40. doi: 10.1002/jmri.26224

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Liu J, Sun D, Chen LL, Fang Z, Song WX, Guo DJ, et al. Radiomics Analysis of Dynamic Contrast-Enhanced Magnetic Resonance Imaging for the Prediction of Sentinel Lymph Node Metastasis in Breast Cancer. Front Oncol (2019) 9:980. doi: 10.3389/fonc.2019.00980

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Luo HB, Liu YY, Qing HM, Wang CH, Chen XY, Meng RR, et al. Preoperative Diagnosis of Metastatic Axillary Lymph Nodes in Breast Cancer by Their Radiomic Features Based on Pharmacokinetic Modeling Dynamic Contrast-Enhanced MRI. J Clin Radiol (2021) 40:442–7. doi: 10.13437/j.cnki.jcr.2021.03.008

CrossRef Full Text | Google Scholar

26. Nguyen S, Polat D, Karbasi P, Moser D, Wang LQ, Hulsey K, et al. Preoperative Prediction of Lymph Node Metastasis From Clinical DCE MRI of the Primary Breast Tumor Using a 4D CNN. Med Image Comput Comput Assist Interv (2020) 12262:326–34. doi: 10.1007/978-3-030-59713-9_32

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ren T, Cattell R, Duanmu H, Huang P, Li HF, Vanguri R, et al. Convolutional Neural Network Detection of Axillary Lymph Node Metastasis Using Standard Clinical Breast MRI. Clin Breast Cancer (2020) 20:e301–8. doi: 10.1016/j.clbc.2019.11.009

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Shan YN, Gong XY, Ding ZX, Shen QJ, Xu W, Pang PP, et al. Dynamic Contrast-Enhanced MRI Radiomic Features Predict Axillary Lymph Node Metastasis of Breast Cancer. Chin J Radiol (2019) 53:742–7. doi: 10.3760/cma.j.issn.1005-1201.2019.09.006

CrossRef Full Text | Google Scholar

29. Zhan CA, Hu YQ, Wang XR, Liu H, Xia LM, Ai T, et al. Prediction of Axillary Lymph Node Metastasis in Breast Cancer Using Intra-Peritumoral Textural Transition Analysis Based on Dynamic Contrast-Enhanced Magnetic Resonance Imaging. Acad Radiol (2021) S1076-6332:00071–4. doi: 10.1016/j.acra.2021.02.008

CrossRef Full Text | Google Scholar

30. Van Kempen EJ, Post M, Mannil M, Witkam RL, Ter LM, Patel A, et al. Performance of Machine Learning Algorithms for Glioma Segmentation of Brain MRI: A Systematic Literature Review and Meta-Analysis. Eur Radiol (2021) 12:9638–53. doi: 10.1007/s00330-021-08035-0

CrossRef Full Text | Google Scholar

31. Yun SJ, Sohn YM, Seo M. Differentiation of Benign and Metastatic Axillary Lymph Nodes in Breast Cancer: Additive Value of MRI Computer-Aided Evaluation. Clin Radiol (2016) 71:403.e1–7. doi: 10.1016/j.crad.2016.01.008

CrossRef Full Text | Google Scholar

32. Harnan SE, Cooper KL, Meng Y, Ward SE, Fitzgerald P, Papaioannou D, et al. Magnetic Resonance for Assessment of Axillary Lymph Node Status in Early Breast Cancer: A Systematic Review and Meta-Analysis. Eur J Surg Oncol (2011) 37:928–36. doi: 10.1016/j.ejso.2011.07.007

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: Extracting More Information From Medical Images Using Advanced Feature Analysis. Eur J Cancer (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Higgins JPT, Green S. Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0. Chichester: The Cochrane Collaboration (2011). Available at: https://ci.nii.ac.jp/naid/20000796633.

Google Scholar

35. Huang SY, Seethamraju RT, Patel P, Hahn PF, Kirsch JE, Guimaraes AR. Body MR Imaging: Artifacts, K-Space, and Solutions. Radiographics (2015) 35:1439–60. doi: 10.1148/rg.2015140289

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: breast cancer, axillary lymph node metastasis, radiomics, machine learning, dynamic contrast-enhanced magnetic resonance imaging, meta-analysis

Citation: Zhang J, Li L, Zhe X, Tang M, Zhang X, Lei X and Zhang L (2022) The Diagnostic Performance of Machine Learning-Based Radiomics of DCE-MRI in Predicting Axillary Lymph Node Metastasis in Breast Cancer: A Meta-Analysis. Front. Oncol. 12:799209. doi: 10.3389/fonc.2022.799209

Received: 21 October 2021; Accepted: 14 January 2022;
Published: 04 February 2022.

Edited by:

Xiang Zhang, Sun Yat-sen University, China

Reviewed by:

Huan Liu, GE Healthcare, China
Guobin Hong, The Fifth Affiliated Hospital of Sun Yat-sen University, China
Xiaohui Zhang, Peking Union Medical College Hospital (CAMS), China

Copyright © 2022 Zhang, Li, Zhe, Tang, Zhang, Lei and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Li Zhang, 18202990355@139.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.