Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 05 January 2026

Sec. Cancer Imaging and Image-directed Interventions

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1673060

This article is part of the Research TopicRecent Advancements in AI-Assisted Gynecologic Cancer DetectionView all 10 articles

Artificial intelligence-based magnetic resonance imaging for preoperative staging of patients with endometrial cancer: a systematic review and meta-analysis

  • Department of Radiology, Ningbo Medical Center Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

Objective: To systematically collect all literature on the value of artificial intelligence (AI)-based magnetic resonance imaging (MRI) for preoperative prediction of myometrial invasion and cervical stroma invasion in patients with endometrial cancer and conduct a meta-analysis to provide the latest and most comprehensive synthesis of current research findings.

Methods: A systematic literature search was conducted in PubMed, Web of Science, Embase, and the Cochrane Library databases up to March 2025. The methodological quality of the included studies was assessed using the QUADAS-2 tool. Statistical analyses were primarily conducted using Stata15.0 and Review Manager 5.4.1 software. Outcomes included combined Sen, Spe, +LR, -LR, DOR, and their 95% CI. The SROC curve was plotted, and the AUC was calculated. The Deeks’ funnel plot was used to detect publication bias and assumed small-study effects.

Results: Finally, 8 studies (including 13 cohorts) were included. The overall performance of AI-based MRI for the prediction of deep myometrial invasion showed a combined Sen, Spe, +LR, -LR, DOR, and AUC value of 0.80 (95% CI: 0.75-0.85), 0.81 (95% CI: 0.64-0.91), 4.2 (95% CI: 2.0-8.5), 0.24 (95% CI: 0.17-0.34), 17 (95% CI: 6-47), and 0.83 (95% CI: 0.80-0.86), respectively. The overall performance of AI-based MRI for the prediction of cervical stroma invasion showed a combined Sen, Spe, +LR, -LR, DOR, and AUC value of 0.78 (95% CI: 0.55-0.91), 0.86 (95% CI: 0.79-0.91), 5.6 (95% CI: 4.3-7.4), 0.25 (95% CI: 0.12-0.55), 22 (95% CI: 11-44), and 0.90 (95% CI: 0.87-0.92) respectively.

Conclusion: AI-based MRI can improve the accuracy of preoperative staging of patients with endometrial cancer to a certain extent. However, considering the limitations of this article, additional large-scale, prospective, multicenter clinical trials are necessary to further investigate the utility of AI-based MRI in the preoperative staging of endometrial cancer.

1 Introduction

Endometrial cancer comprises a group of epithelial malignancies originating from the endometrium. Endometrioid carcinoma is the most common histologic type and is ranked among the three major malignant tumors of the female reproductive system. Its incidence continues to rise each year. In the United States and other countries, the incidence has exceeded that of cervical cancer and ovarian cancer (13). The global incidence rate of endometrial cancer was approximately 8.7 per 100,000 women in 2022, with nearly 420,000 new cases diagnosed annually worldwide (4). It is well known that the treatment of endometrial cancer includes surgical resection and adjuvant chemoradiotherapy. The choice of treatment and patient prognosis are influenced by multiple factors such as age, histological type, pathological grade, degree of cervical invasion, and lymph node metastasis (5). For younger patients, accurate staging is essential to guide clinicians in developing surgical plans to preserve fertility. Different types of cancer have different sensitivity to treatment and prognosis. Cancers with high pathological grades usually show invasive growth and poor prognosis, so more aggressive treatment strategies may be required. If cancer cells invade the cervical area or lymph nodes, treatment plans may differ and involve more extensive surgery or chemoradiotherapy. Therefore, it is particularly important to accurately diagnose the patient before treatment (6, 7).

Magnetic resonance imaging (MRI) can be used for both conventional sequence imaging and functional imaging. It has a high resolution for soft tissue and can distinguish between the uterine mucosa, submucosa, myometrium, and serosa (8, 9). MRI has a high clinical value in preoperatively evaluating the depth of myometrial invasion and pelvic iliac and abdominal lumbar lymph nodes metastasis in individuals diagnosed with endometrial cancer. To a certain extent, preoperative staging determines the choice of surgical scope for patients. When judging the degree of cancer invasion, the depth of myometrial invasion is a very important reference indicator because it is related to whether there is metastasis to the pelvic iliac and abdominal lumbar lymph nodes and will affect the patient’s 5-year survival rate (10). According to literature reports, when cancer cells do not infiltrate the myometrium or infiltrate less than 50% of the myometrium, the proportion of lymph nodes invaded by tumor cells is low, and the 5-year survival rate of such patients is high; when endometrial cancer infiltrates 50% or more of the myometrium, the lymph node metastasis rate of these patients will increase (11), and the 5-year survival rate will be significantly lower than that of patients who do not infiltrate the myometrium or only infiltrate less than 50% of the myometrium (12). From this, we can see that MRI can provide corresponding guidance for clinical staging treatment, but MRI examinations have certain error rates and misdiagnosis rates in terms of sensitivity and accuracy and need to be combined with other auxiliary examinations for joint evaluation.

Over the past few years, with the continuous development and application of artificial intelligence (AI), MRI-based deep learning models have shown great application prospects in the field of medical imaging diagnosis. AI reconstructs and analyzes image data in a structured manner, extracts key features of lesions, establishes computer vision training and recognition models, and then generates disease analysis reports and auxiliary diagnosis plans (1315). AI imaging diagnosis has achieved great results in endoscopic diagnosis of gastric cancer (16), MRI diagnosis of colorectal cancer (17), CT diagnosis of small lung nodules (18), and mammography diagnosis of breast cancer (19). Related studies have used AI technology to read fine features in images, classify images and predict diagnosis (20). In addition, preoperatively, the presence of lymphovascular space invasion, lymph node metastasis, etc., can be predicted based on the images, thereby guiding individualized treatment (21, 22). To date, many AI-based MRI technologies have been used for preoperative staging of endometrial cancer, but no relevant meta-analysis has been reported, and their exact sensitivity and specificity have not yet been determined. Therefore, this study systematically retrieved all relevant literature and performed a meta-analysis to compile the most current and extensive scientific data to assess the effectiveness of AI-based MRI for preoperative staging of endometrial cancer.

2 Methods

2.1 Protocol and registration

This meta-analysis and systematic review was conducted in accordance with the PRISMA 2020 guidelines (23) and was prospectively registered in the PROSPERO database (CRD420251059395).

2.2 Search strategy

We conducted a systematic search of literature published in PubMed, Web of Science, Embase, and Cochrane Library databases from their inception to March 2025, employing both MeSH terms and free-text keywords such as: “Artificial Intelligence”, “Magnetic Resonance Imaging”, and “Endometrial Neoplasms”. The search strategies for PubMed were as follows: (((“Artificial Intelligence”[Mesh]) OR ((((((((Computer Reasoning) OR (Machine Intelligence)) OR (Computational Intelligence)) OR (Computer Vision Systems)) OR (Computer Vision System)) OR (Knowledge Acquisition)) OR (Knowledge Representation)) OR (Knowledge Representations))) AND ((“Magnetic Resonance Imaging”[Mesh]) OR ((((MRI) OR (NMR Imaging)) OR (MR Tomography)) OR (NMR Tomography)))) AND ((“Endometrial Neoplasms”[Mesh]) OR (((((Endometrial Neoplasm) OR (Endometrial Carcinoma)) OR (Endometrium Cancer)) OR (Endometrium Carcinoma)) OR (Endometrial Cancer))). In addition, the reference lists of all selected studies were examined manually to identify any potentially relevant articles. Two independent reviewers carried out the study selection process. Discrepancies in study inclusion were resolved through discussion and consensus. All identified articles were organized and verified using EndNote X9.

2.3 Inclusion and exclusion criteria

Both prospective and retrospective studies investigating the role of AI-enhanced MRI in preoperative staging of endometrial cancer were considered in this analysis.

Inclusion criteria:

Study population: patients diagnosed with endometrial cancer.

Diagnostic criteria: the experiment to be evaluated is AI-based MRI, with no restrictions on the model or methodology. The gold standard is histopathology.

Outcomes: sensitivity (Sen), specificity (Spe), positive likelihood ratio (+LR), negative likelihood ratio (-LR), diagnostic odds ratio (DOR), pre-test probability, post-test probability, summary receiver operating characteristic (SROC) area under the curve (AUC), etc.

Study design: prospective and retrospective studies that focused on the value of AI-based MRI for preoperative staging of patients with endometrial cancer.

Exclusion criteria: (1) duplicate publications; (2) articles unrelated to this study; (3) reviews, meta-analyses, conferences, letters, comments, etc.; (4) studies involving animal models; (4) articles lacking sufficient or incomplete data; (5) non-English language publications. Two researchers (Zheng JJ and Lin XY) independently screened the literature. Any disagreements during the screening process were first resolved through discussion and negotiation between the two researchers. If no agreement could be reached after discussion, a third researcher (Li M) was invited to arbitrate, and the arbitrator’s opinion became the final decision.

2.4 Data extraction

Two reviewers independently collected relevant data from each eligible study, including details such as the first author’s name, publication year, study design, participants’ age and gender, AI models, reference standard, sample size, true positives (TP), false positives (FP), true negatives (TN), false negatives (FN), Sen, Spe, +LR, -LR, DOR, and AUC value, etc. Disagreements in data collection were resolved through consultation with a third reviewer. If specific information was lacking, attempts were made to contact the corresponding authors for clarification or additional data. All collected variables were organized using Microsoft Excel. Any disagreements during the data extraction process will first be resolved through discussion and negotiation between the two parties. If no agreement can be reached after discussion, a third researcher (Li M) will be invited to arbitrate, and the arbitrator’s opinion will be the final decision.

2.5 Quality assessment

In accordance with the methodological recommendations provided in the Cochrane Handbook for systematic reviews of diagnostic trials, we assessed the methodological quality of the included studies using the QUADAS-2 tool. Subsequently, the Review Manager 5.4.1 software was employed to present the final evaluation findings in a comprehensive manner (24). The quality of the included studies was independently evaluated by two reviewers, and any discrepancies were resolved through discussion or, if needed, consultation with a third reviewer. The QUADAS-2 evaluates study quality by examining two aspects: risk of bias and applicability to clinical practice. Risk of bias is assessed across four domains: patient selection, index test, reference standard, and flow and timing. Applicability concerns are judged based on three domains: patient selection, index test, and reference standard. Each domain related to risk of bias includes multiple signaling questions, with answer options of “yes/no/unclear”. If the answers to all signaling questions are “yes”, then that aspect is regarded as having a low probability of bias. If any response is marked as “no”, it may indicate a potential risk of bias. There are no signaling questions for clinical applicability, only an overall assessment, with answer options including “high risk/low risk/unclear”. It is worth noting that the “unclear” response should be chosen when the information provided in the literature is incomplete during the assessment process.

2.6 Data analysis

Data analysis was conducted using Stata version 15.0 and Review Manager 5.4.1. A bivariate random-effects model was employed to calculate pooled Sen, Spe, +LR, −LR, and DOR with 95% CIs. SROC curves were constructed, and AUC values were determined accordingly. The χ² test was applied to assess statistical heterogeneity across the included studies, and the significance level was set at α = 0.05. Heterogeneity was further quantified using the I² statistic. If I² ≤ 50%, heterogeneity was considered low; if I² > 50%, it was regarded as high. Deeks’ funnel plot was applied to evaluate publication bias and detect potential small-study effects, with a P value below 0.05 indicating the presence of bias in the analyzed studies.

3 Results

3.1 Literature screening process and results

According to the literature search strategy, 183 relevant publications were retrieved. Following the removal of duplicates using EndNote X9, 122 articles remained. After screening titles and abstracts, 106 publications were excluded, resulting in 16 articles for initial screening. Of the 16 articles assessed in full, 8 were excluded due to insufficient information or failure to meet inclusion criteria. The study ultimately included a total of 8 articles (2532). Figure 1 displays the screening procedure for the selected literature, and Table 1 summarizes the key characteristics of the studies included in the review. If there were multiple models or multiple cohorts in the same study, they were extracted separately and marked by adding the letters “a, b, c, or d, etc.” after the name of studies.

Figure 1
Flowchart depicting a PRISMA diagram for a meta-analysis. It shows the selection process: 183 records identified from databases (PubMed, Embase, Web of Science, Cochrane) and zero from other sources. After removing duplicates, 61 records remain. Of these, 122 are screened, with 106 excluded for reasons like irrelevance and replies. Sixteen full-text articles are assessed, eight excluded due to insufficient data. Eight studies are included in qualitative synthesis, and the same eight in quantitative synthesis.

Figure 1. PRISMA flow-chart for the systematic review and meta-analysis.

Table 1
www.frontiersin.org

Table 1. Characteristics of include studies.

3.2 Quality assessment of included studies

For the assessment of risk of bias, the QUADAS-2 tool was employed. Figure 2 presents the results of the methodological quality evaluation of the analyzed studies. In general, the publications analyzed were of high quality, with most exhibiting either a low or unclear risk of bias.

Figure 2
A table assessing risk of bias and applicability concerns for several studies. Columns: patient selection, index test, reference standard, and flow and timing. Rows list study authors and years. Symbols: green plus for low risk, yellow question mark for unclear, and red minus for high risk. Most studies show low risk, with some unclear areas.

Figure 2. Risk of bias and applicability concerns summary.

3.3 Meta-analysis of deep myometrial invasion

A total of six studies (including seven cohorts) reported the value of AI-based MRI for the detection of deep myometrial invasion for endometrial cancer. The overall performance of AI-based MRI for the prediction of deep myometrial invasion showed a combined Sen, Spe, +LR, -LR, and DOR of 0.80 (95% CI: 0.75-0.85), 0.81 (95% CI: 0.64-0.91), 4.2 (95% CI: 2.0-8.5), 0.24 (95% CI: 0.17-0.34), and 17 (95% CI: 6-47), respectively (Figure 3). Considerable heterogeneity in specificity was observed in the aggregated forest plot results (I2 = 88.27%; 95% CI: 81.01-95.52%) (Figure 3), whereas the heterogeneity in sensitivity was relatively lower (I²=25.09%; 95% CI: 0.00-86.94%) (Figure 3). Figure 4A represented the SROC with prediction and confidence contours, with an AUC of 0.83 (95% CI: 0.80-0.86).

Figure 3
Forest plot showing sensitivity and specificity with 95% confidence intervals for seven studies. Sensitivity ranges from 0.67 to 0.88; specificity from 0.45 to 1.00. Combined sensitivity is 0.80, and specificity is 0.81. Red dashed lines indicate average values.

Figure 3. Forest plots illustrate the pooled sensitivity and specificity of AI-based MRI prediction of deep myometrial invasion in endometrial cancer. A random-effects model was used for pooling. Estimates from each study are represented by squares, with size representing weight, and horizontal lines representing 95% confidence intervals. Pooled statistics are represented by diamonds. The I² statistic was used to quantify heterogeneity.

Figure 4
Two graphs labeled “a” and “b” display SROC curves with prediction and confidence contours. Both graphs plot sensitivity against specificity with observed data points marked as circles and summary operating points shown as diamonds. Graph “a” indicates SENS of 0.80 and SPEC of 0.81, while “b” shows SENS of 0.78 and SPEC of 0.86. Each graph includes curves for 95% confidence and prediction contours, with their respective AUC values of 0.83 and 0.90.

Figure 4. AI-based SROC curves for MRI prediction of (A) deep myometrial invasion and (B) cervical stromal invasion. The figure shows the pooled sensitivity and specificity (solid circles), SROC curves (solid lines), 95% confidence region (dashed lines), and 95% prediction region (dotted lines). AUC values ​​and their 95% CIs are given.

3.4 Meta-analysis of cervical stroma invasion

A total of two studies (including five cohorts) reported the value of AI-based MRI for the detection of cervical stroma invasion for endometrial cancer. The overall performance of AI-based MRI for the prediction of cervical stroma invasion showed a combined Sen, Spe, +LR, -LR, and DOR of 0.78 (95% CI: 0.55-0.91), 0.86 (95% CI: 0.79-0.91), 5.6 (95% CI: 4.3-7.4), 0.25 (95% CI: 0.12-0.55), and 22 (95% CI: 11-44), respectively (Figure 5). It is important to note that the 95% confidence interval of its sensitivity is relatively wide, suggesting that this estimate is subject to significant uncertainty and should be interpreted with extreme caution. Substantial heterogeneity in sensitivity was observed across the forest plot analysis (I2 = 92.73%; 95% CI: 87.97-97.49%) and specificity (I2 = 77.89%; 95% CI: 58.47-97.30%) (Figure 5). Figure 4B illustrates the SROC with prediction and confidence contours, with an AUC of 0.90 (95% CI: 0.87-0.92).

Figure 5
Forest plot displaying studies related to sensitivity and specificity. The left panel shows sensitivity values with a combined estimate of 0.78. The right panel shows specificity values with a combined estimate of 0.86. Each study is represented by black squares and lines indicating confidence intervals. Red dashed lines represent the mean summary estimates. Details include individual study estimates and combined statistical measures.

Figure 5. Forest plots illustrate the pooled sensitivity and specificity of AI-based MRI prediction of cervical stromal invasion in endometrial cancer. A random-effects model was used for pooling. Estimates from each study are represented by squares, with size representing weight, and horizontal lines representing 95% confidence intervals. Pooled statistics are represented by diamonds. The I² statistic was used to quantify heterogeneity.

3.5 Publication bias

Deeks’ test was employed to assess publication bias in the included studies, yielding a P value of 0.74 for deep myometrial invasion (Figure 6A) and 0.02 of cervical stromal invasion (Figure 6B), suggesting that the meta-analysis of cervical stroma invasion was affected by significant publication bias.

Figure 6
Panel a shows a Deeks' Funnel Plot Asymmetry Test with a p-value of 0.74. The plot has data points for studies and a regression line. Panel b displays a similar plot with a p-value of 0.02. Both plots have axes labeled as the inverse root of ESS and Diagnostic Odds Ratio.

Figure 6. Deeks’ funnel plots are used to assess publication bias in analyses of (A) deep myometrial invasion and (B) cervical stromal invasion. In the absence of significant publication bias, study points should be approximately symmetrically distributed around the regression line. The p-value is used to test the significance of asymmetry.

4 Discussion

In current clinical practice, preoperative assessment of local invasion of endometrial cancer typically involves a combination of transvaginal ultrasound and MRI (33). MRI is valuable in assessing the depth of myometrial invasion and cervical stromal invasion due to its superior soft tissue resolution. Furthermore, ultrasound also demonstrates high accuracy in detecting cervical invasion, and its accuracy is comparable to MRI when performed by an experienced operator (34). However, the diagnostic accuracy of both ultrasound and MRI depends largely on the experience of the operator or interpreter, and visual assessment can be subjective. Therefore, exploring artificial intelligence technology as an objective and reliable auxiliary tool to improve the accuracy and consistency of preoperative imaging assessment is of significant clinical significance.

This research conducted a comprehensive systematic review and meta-analysis of clinical data on the preoperative staging of endometrial cancer patients using AI-based MRI technology to obtain the combined diagnostic efficacy and comprehensively assess the diagnostic value of AI-based MRI for endometrial cancer patients. The results of this study indicated that the combined Sen, Spe, +LR, -LR and DOR of deep myometrial invasion detected by AI-based MRI were 0.80, 0.81, 4.2, 0.24 and 17, respectively. Additionally, the combined Sen, Spe, +LR, -LR and DOR of cervical stroma invasion detected by AI-based MRI were 0.78, 0.86, 5.6, 0.25 and 22, respectively, indicating that in clinical practice, AI-based MRI can be used as an additional tool to help improve the accuracy of preoperative evaluation in individuals diagnosed with endometrial cancer.

The results of this meta-analysis suggest that AI-based MRI demonstrates high diagnostic performance (pooled AUC = 0.90) in predicting cervical stromal invasion, a finding of significant clinical significance. Accurate assessment of cervical stromal invasion is directly related to the choice of surgical approach (whether to undergo radical hysterectomy), yet traditional MRI interpretation is challenging in this regard, relying on the subjective identification of subtle interruptions in the cervical fibrostromal hypointense band and subject to interobserver variability. The study by Gao et al. did not systematically evaluate cervical stromal invasion, while this study is the first to quantitatively synthesize it. Although the sample size is limited, it provides preliminary evidence for the application of AI in this specific field (35). Of particular note is the recent study by Lecointre et al. (2025) (26), which directly compared the diagnostic performance of an AI model with that of human radiologists. The study found that the AI system achieved comparable sensitivity to experienced radiologists in detecting cervical stromal invasion, while exhibiting superior specificity. This suggests that AI may serve as an effective adjunctive tool to reduce overdiagnosis of cervical invasion and avoid unnecessary extensive surgery. However, most current studies have limited sample sizes and are still in the exploratory stage, so we must also view these results with caution. Although the results of this study are encouraging, if AI-based MRI is to be truly integrated into the clinical workflow to assist in the diagnosis of cervical stromal invasion, prospective, large-sample studies are still needed to further verify its superiority over standard imaging interpretation and clarify its ultimate value in improving patient outcomes.

AI has many advantages in the recognition of endometrial cancer MRI images, which has led to its widespread use in clinical practice (36). First, AI can be trained on large amount of MRI data, thereby improving the accuracy and stability of endometrial cancer recognition. By using deep learning and machine learning algorithms to process these data, AI models can learn richer and more complex feature representations, thereby more accurately identifying potential lesions during the diagnosis process (37, 38). This data-driven approach not only improves diagnostic accuracy, but also enables the model to maintain stability and reliability when faced with different types of MRI data (39). Chen et al. (25) conducted a study using 530 MRI images of patients with endometrial cancer and developed a deep learning model. The model was developed to predict the depth of muscle invasion. The results showed that the accuracy of the deep learning model (84.4%) exceeded that of general radiologists (80.0%).

Secondly, AI can automatically extract features and patterns from magnetic resonance images of endometrial cancer, thereby identifying subtle differences that are difficult for doctors to detect. The application of deep learning and machine learning algorithms enables AI models to continuously learn and evolve, thereby continuously improving the accuracy and efficiency of their diagnosis (40). The model can analyze a large amount of case and imaging data to discover patterns and regularities hidden in the data, and apply this information to new imaging data to support doctors in making more accurate diagnoses and assessments (41). In addition, AI offers high efficiency and speed when processing large sets of medical imaging data. It can complete complex analysis tasks related to endometrial cancer in a relatively short time, greatly improving the efficiency and accuracy of diagnosis. Finally, AI can realize automated processes in magnetic resonance imaging recognition, reduce the errors and subjectivity of manual operations, and improve the consistency and repeatability of diagnosis (42). This automation not only improves the accuracy of diagnosis, but also saves doctors valuable time and energy, allowing them to focus more on clinical decision-making and treatment planning. Therefore, the use of AI in MRI interpretation provides important support for improving the efficiency and accuracy of endometrial cancer diagnosis and plays an increasingly important role in clinical practice. However, despite its technical advantages, AI may also have issues such as privacy protection of medical data and compliance with medical regulations. Effectively integrating AI into MRI-based diagnosis of endometrial cancer—while addressing related challenges—may provide clinicians with a valuable tool to improve early detection and treatment. The ultimate goal of this study is to promote the clinical application of AI-assisted decision-making. Although the results show that AI-MRI has good diagnostic performance, its true clinical value lies in whether it can be translated into improved patient management. For example, by more accurately identifying early-stage or low-risk patients, it can help clinicians avoid unnecessary extended surgery or lymph node dissection, thereby reducing surgical complications and improving patients’ quality of life. Future research should focus on comparative studies of “AI vs. doctors” and focus on evaluating the actual impact of AI-assisted decision-making on clinical diagnosis and treatment pathways and patients’ ultimate prognosis. Beyond simply pursuing high diagnostic accuracy, future research must give equal weight to the ‘translational readiness’ of AI models. This includes evaluating model interpretability (e.g., using saliency maps to highlight decision-relevant image regions), computational efficiency, and seamless integration into existing clinical workflows (PACS/RIS). Developing models that are not only accurate but also transparent, efficient, and user-friendly will be essential for gaining clinician trust and facilitating widespread adoption in resource-varying healthcare settings.

This study faces several potential threats to its validity. First, despite employing a comprehensive search strategy, including only English-language literature may lead to language bias, overlooking relevant studies published in other languages. Second, while standardized procedures and arbitration mechanisms exist for study screening and data extraction, researchers’ subjective judgment may still introduce selection and measurement bias. Third, many of the included original studies were retrospective, and some studies had “unclear” areas of risk of bias in the QUADAS-2 quality assessment, which could affect the reliability of the pooled results. Furthermore, clinical and methodological heterogeneity in patient population characteristics, AI model architecture, MRI scan parameters, and interpretation of the gold standard in pathology is not only a major source of statistical heterogeneity but may also threaten the generalizability of the study’s conclusions. Finally, publication bias was detected in the analysis of cervical stromal invasion, indicating that studies with negative results may not have been published, potentially leading to overly optimistic estimates of the AI’s diagnostic performance. These factors collectively threaten the validity of this study’s conclusions, requiring caution in interpreting the results.

This study is not without limitations. First, the majority of the analyzed research consists of retrospective studies, which may introduce inherent and uncontrollable biases. Second, most of the study populations are from Asia and Europe. The generalizability of AI-based MRI for preoperative staging of endometrial cancer to other countries or regions requires confirmation through additional research. Thirdly, the number of original studies included in this meta-analysis is limited, especially the cohorts used to analyze cervical stromal invasion are only 5, and the cohorts used to analyze deep myometrial invasion are 7. The relatively small sample size may affect the stability of the results. More studies need to be included in the future to increase statistical power and verify the current findings. Fourthly, Deeks’ funnel plot asymmetry test suggested potential publication bias in the analysis of cervical stromal invasion (P = 0.02), which may have led to overly optimistic estimates of the diagnostic performance of AI because studies with negative or null results may not have been published. Additionally, the AI ​​models included in this study exhibited significant differences in algorithmic architecture, feature extraction methods, and training processes, which is a major contributor to heterogeneity. This highlights the urgency of standardizing and normalizing AI models in future research. Improving model transparency and reproducibility is crucial to ensuring the comparability of research results and ultimately their translation into clinical practice. Due to limitations in the available data, subgroup analyses based on the model type could not be conducted to identify potential contributors to heterogeneity. In addition, the literature search for this study was closed to March 2025 and was unable to include the latest studies that may have been published thereafter. Future updated analyses need to include more recent results to further verify the conclusions of this study. Finally, some studies included in this meta-analysis used external validation datasets acquired using different imaging equipment and scanning protocols, which could influence both the validation outcomes and contribute to heterogeneity. Future research should involve larger, prospective, and multicenter clinical trials to better assess the utility of AI-based MRI for preoperative staging in endometrial cancer.

5 Conclusion

This study demonstrates that AI-based MRI, as an additional tool, can improve the accuracy of preoperative staging of individuals with endometrial cancer to a certain extent. However, due to certain limitations of this study—including small sample sizes, regional bias, retrospective design, and potential publication bias—there is a need for future large-scale, prospective, multicenter clinical trials to further assess the diagnostic utility of AI-based MRI in preoperative staging of endometrial cancer.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

JZ: Conceptualization, Data curation, Formal Analysis, Methodology, Project administration, Writing – original draft, Writing – review & editing. XL: Software, Supervision, Validation, Visualization, Writing – original draft. ML: Formal Analysis, Resources, Software, Visualization, Writing – original draft.

Funding

The author(s) declare that no financial support was received for the research, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1673060/full#supplementary-material

References

1. Emons G, Steiner E, Vordermark D, Uleer C, Paradies K, Tempfer C, et al. Endometrial cancer. Guideline of the DGGG, DKG and DKH (S3-level, AWMF registry number 032/034-OL, september 2022). Part 1 with recommendations on the epidemiology, screening, diagnosis and hereditary factors of endometrial cancer, geriatric assessment and supply structures. Geburtshilfe und Frauenheilkunde. (2023) 83:919–62. doi: 10.1055/a-2066-2051

PubMed Abstract | Crossref Full Text | Google Scholar

2. Loukovaara MJ, Pasanen AM, Lassus HJ, Luomaranta AL, Hellberg P, Vartiainen J, et al. Comparative analysis of European guideline-based clinicopathological risk groups and the International Federation of Gynecology and Obstetrics staging system for endometrial cancer. Eur J obstet gyneco Reprod Biol. (2025) 308:85–9. doi: 10.1016/j.ejogrb.2025.02.049

PubMed Abstract | Crossref Full Text | Google Scholar

3. Sage L, Dwyer E, Gray DA, Oluloro A, Williams PA, Katz R, et al. Endometrial cancer knowledge and guideline-concordant practice patterns among first-line providers. J women’s Health (2002). (2023) 32:1104–10. doi: 10.1089/jwh.2022.0534

PubMed Abstract | Crossref Full Text | Google Scholar

4. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

PubMed Abstract | Crossref Full Text | Google Scholar

5. Liu B, Liu Y, Liu W, Lin C, Lin L, Chen W, et al. Oncological outcomes of minimally invasive surgery in non-endometrioid endometrial Cancer patients with varying prognostic risks: a retrospective cohort study based on the ESGO/ESTRO/ESP 2020 guidelines. BMC surg. (2024) 24:254. doi: 10.1186/s12893-024-02550-9

PubMed Abstract | Crossref Full Text | Google Scholar

6. Loukovaara M, Pasanen A, Aro K, Haltia UM, and Bützow R. Clinicopathologic stratification demonstrates survival differences between endometrial carcinomas with mismatch repair deficiency and no specific molecular profile: a cohort study. Int J gynecol can: Off J Int Gynecol Cancer Soc. (2025) 35:100048. doi: 10.1016/j.ijgc.2024.100048

PubMed Abstract | Crossref Full Text | Google Scholar

7. Mevius A, Lutter J, Karl FM, Tuffy L, Schochter F, Fuchs A, et al. Epidemiology, real-world treatment patterns, and patient outcomes of primary advanced or recurrent endometrial cancer in Germany between 2015 and 2021. Oncol Res Treat. (2025) 48:92–101. doi: 10.1159/000542773

PubMed Abstract | Crossref Full Text | Google Scholar

8. Ma W, Meng W, Yin J, Liang J, Wang X, Liu J, et al. Predictive value of models based on MRI radiomics and clinical indicators for lymphovascular space invasion in endometrial cancer. BMC can. (2025) 25:796. doi: 10.1186/s12885-025-14217-6

PubMed Abstract | Crossref Full Text | Google Scholar

9. Sun C, Mao J, Xia Y, Li M, and Zhao Z. Analysis of the correlation between MRI imaging signs and lymphovascular space invasion in endometrial cancer. Curr Med Imaging. (2025). doi: 10.2174/0115734056348172250407115649

PubMed Abstract | Crossref Full Text | Google Scholar

10. Jin Y and Zhou C. Diagnostic accuracy of TVUS and MRI in the preoperative evaluation of myometrial infiltration in patients with endometrial cancer: A meta-analysis. Clin radiol. (2025) 85:106868. doi: 10.1016/j.crad.2025.106868

PubMed Abstract | Crossref Full Text | Google Scholar

11. Wang J, Xu P, Yang X, Yu Q, Xu X, Zou G, et al. Association of myometrial invasion with lymphovascular space invasion, lymph node metastasis, recurrence, and overall survival in endometrial cancer: A meta-analysis of 79 studies with 68,870 patients. Front Oncol. (2021) 11:762329. doi: 10.3389/fonc.2021.762329

PubMed Abstract | Crossref Full Text | Google Scholar

12. Angioli R, Plotti F, Capriglione S, Scaletta G, Dugo N, Aloisi A, et al. Preoperative local staging of endometrial cancer: the challenge of imaging techniques and serum biomarkers. Arch gyneco obstetr. (2016) 294:1291–8. doi: 10.1007/s00404-016-4181-z

PubMed Abstract | Crossref Full Text | Google Scholar

13. Jusman Y, Ng SC, and Abu Osman NA. Intelligent screening systems for cervical cancer. Sci World J. (2014) 2014:810368. doi: 10.1155/2014/810368

PubMed Abstract | Crossref Full Text | Google Scholar

14. Bora K, Chowdhury M, Mahanta LB, Kundu MK, and Das AK. Automated classification of Pap smear images to detect cervical dysplasia. Comput Methods prog biomed. (2017) 138:31–47. doi: 10.1016/j.cmpb.2016.10.001

PubMed Abstract | Crossref Full Text | Google Scholar

15. Huang S, Yang J, Fong S, and Zhao Q. Artificial intelligence in cancer diagnosis and prognosis: Opportunities and challenges. Cancer lett. (2020) 471:61–71. doi: 10.1016/j.canlet.2019.12.007

PubMed Abstract | Crossref Full Text | Google Scholar

16. Popovic D, Glisic T, Milosavljevic T, Panic N, Marjanovic-Haljilji M, Mijac D, et al. The importance of artificial intelligence in upper gastrointestinal endoscopy. Diagnostics. (2023) 13:2862. doi: 10.3390/diagnostics13182862

PubMed Abstract | Crossref Full Text | Google Scholar

17. Trebeschi S, van Griethuysen JJM, Lambregts DMJ, Lahaye MJ, Parmar C, Bakers FCH, et al. Deep learning for fully-automated localization and segmentation of rectal cancer on multiparametric MR. Sci Rep. (2017) 7:5301. doi: 10.1038/s41598-017-05728-9

PubMed Abstract | Crossref Full Text | Google Scholar

18. Setio AAA, Traverso A, De Bel T, Berens MSN, Van Den Bogaard C, Cerello P, et al. Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge. Med imag anal. (2017) 42:1–13. doi: 10.1016/j.media.2017.06.015

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ribli D, Horváth A, Unger Z, Pollner P, and Csabai I. Detecting and classifying lesions in mammograms with deep learning. Sci Rep. (2018) 8:4165. doi: 10.1038/s41598-018-22437-z

PubMed Abstract | Crossref Full Text | Google Scholar

20. Sato M, Horie K, Hara A, Miyamoto Y, Kurihara K, Tomio K, et al. Application of deep learning to the classification of images from colposcopy. Oncol lett. (2018) 15:3518–23. doi: 10.3892/ol.2018.7762

PubMed Abstract | Crossref Full Text | Google Scholar

21. Wang X, Deng C, Kong R, Gong Z, Dai H, Song Y, et al. Intratumoral and peritumoral habitat imaging based on multiparametric MRI to predict cervical stromal invasion in early-stage endometrial carcinoma. Acad radiol. (2025) 32:1476–87. doi: 10.1016/j.acra.2024.09.039

PubMed Abstract | Crossref Full Text | Google Scholar

22. Tao Y, Wei Y, Yu Y, Qin X, Huang Y, and Liao J. Development and validation of a nomogram based on multiparametric MRI for predicting lymph node metastasis in endometrial cancer: A retrospective cohort study. Acad radiol. (2025) 32:2751–62. doi: 10.1016/j.acra.2024.12.008

PubMed Abstract | Crossref Full Text | Google Scholar

23. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. (2021) 372:n71.

Google Scholar

24. YJQQ YJ, ZRYY ZR, FSS F, and SY SYZZ. Risk on bias assessment: (6) A Revised Tool for the Quality Assessment on Diagnostic Accuracy Studies (QUADAS-2). Zhonghua liu xing bing xue za zhi = Zhonghua liuxingbingxue zazhi. (2018), 524–31.

PubMed Abstract | Google Scholar

25. Chen X, Wang Y, Shen M, Yang B, Zhou Q, Yi Y, et al. Deep learning for the determination of myometrial invasion depth and automatic lesion identification in endometrial cancer MR imaging: a preliminary study in a single institution. Eur radiol. (2020) 30:4985–94. doi: 10.1007/s00330-020-06870-1

PubMed Abstract | Crossref Full Text | Google Scholar

26. Lecointre L, Alekseenko J, Pavone M, Karargyris A, Fanfani F, Fagotti A, et al. Artificial intelligence-enhanced magnetic resonance imaging-based pre-operative staging in patients with endometrial cancer. Int J gynecol can: Off J Int Gynecol Cancer Soc. (2025) 35:100017. doi: 10.1016/j.ijgc.2024.100017

PubMed Abstract | Crossref Full Text | Google Scholar

27. Lefebvre TL, Ciga O, Bhatnagar SR, Ueno Y, Saif S, Winter-Reinhold E, et al. Predicting histopathology markers of endometrial carcinoma with a quantitative image analysis approach based on spherical harmonics in multiparametric MRI. Diagn intervention imag. (2023) 104:142–52. doi: 10.1016/j.diii.2022.10.007

PubMed Abstract | Crossref Full Text | Google Scholar

28. Lefebvre TL, Ueno Y, Dohan A, Chatterjee A, Vallières M, Winter-Reinhold E, et al. Development and validation of multiparametric MRI-based radiomics models for preoperative risk stratification of endometrial cancer. Radiology. (2022) 305:375–86. doi: 10.1148/radiol.212873

PubMed Abstract | Crossref Full Text | Google Scholar

29. Rodríguez-Ortega A, Alegre A, Lago V, Carot-Sierra JM, Ten-Esteve A, Montoliu G, et al. Machine learning-based integration of prognostic magnetic resonance imaging biomarkers for myometrial invasion stratification in endometrial cancer. J magnet reson imag: JMRI. (2021) 54:987–95.

PubMed Abstract | Google Scholar

30. Tao J, Wang Y, Liang Y, and Zhang A. Evaluation and monitoring of endometrial cancer based on magnetic resonance imaging features of deep learning. Contrast med Mol imag. (2022) 2022:5198592. doi: 10.1155/2022/5198592

PubMed Abstract | Crossref Full Text | Google Scholar

31. Wang X, Bi Q, Deng C, Wang Y, Miao Y, Kong R, et al. Multiparametric MRI-based radiomics combined with 3D deep transfer learning to predict cervical stromal invasion in patients with endometrial carcinoma. Abdominal Radiol (New York). (2025) 50:1414–25. doi: 10.1007/s00261-024-04577-1

PubMed Abstract | Crossref Full Text | Google Scholar

32. Wang Y, Liu W, Lu Y, Ling R, Wang W, Li S, et al. Fully automated identification of lymph node metastases and lymphovascular invasion in endometrial cancer from multi-parametric MRI by deep learning. J magnet reson imag: JMRI. (2024) 60:2730–42. doi: 10.1002/jmri.29344

PubMed Abstract | Crossref Full Text | Google Scholar

33. Rei M, Bernardes JF, and Costa A. Ultrasound in endometrial cancer: evaluating the impact of pre-surgical staging. Oncol Rev. (2025) 19:1446850. doi: 10.3389/or.2025.1446850

PubMed Abstract | Crossref Full Text | Google Scholar

34. Spagnol G, Noventa M, Bonaldo G, Marchetti M, Vitagliano A, Laganà AS, et al. Three-dimensional transvaginal ultrasound vs magnetic resonance imaging for preoperative staging of deep myometrial and cervical invasion in patients with endometrial cancer: systematic review and meta-analysis. Ultrasound Obstetrics Gynecol. (2022) 60:604–11. doi: 10.1002/uog.24967

PubMed Abstract | Crossref Full Text | Google Scholar

35. Gao Y, Liang F, Tian X, Zhang G, and Zhang H. Preoperative risk assessment of invasive endometrial cancer using MRI-based radiomics: a systematic review and meta-analysis. Abdominal Radiol (New York). (2025). doi: 10.1007/s00261-025-05005-8

PubMed Abstract | Crossref Full Text | Google Scholar

36. Stanzione A, Cuocolo R, Del Grosso R, Nardiello A, Romeo V, Travaglino A, et al. Deep myometrial infiltration of endometrial cancer on MRI: A radiomics-powered machine learning pilot study. Acad radiol. (2021) 28:737–44. doi: 10.1016/j.acra.2020.02.028

PubMed Abstract | Crossref Full Text | Google Scholar

37. Qi X. Artificial intelligence-assisted magnetic resonance imaging technology in the differential diagnosis and prognosis prediction of endometrial cancer. Sci Rep. (2024) 14:26878. doi: 10.1038/s41598-024-78081-3

PubMed Abstract | Crossref Full Text | Google Scholar

38. Otani S, Himoto Y, Nishio M, Fujimoto K, Moribata Y, Yakami M, et al. Radiomic machine learning for pretreatment assessment of prognostic risk factors for endometrial cancer and its effects on radiologists’ decisions of deep myometrial invasion. Magnet reson imag. (2022) 85:161–7. doi: 10.1016/j.mri.2021.10.024

PubMed Abstract | Crossref Full Text | Google Scholar

39. Mainenti PP, Stanzione A, Cuocolo R, Del Grosso R, Danzi R, Romeo V, et al. MRI radiomics: A machine learning approach for the risk stratification of endometrial cancer patients. Eur J radiol. (2022) 149:110226. doi: 10.1016/j.ejrad.2022.110226

PubMed Abstract | Crossref Full Text | Google Scholar

40. Hoivik EA, Hodneland E, Dybvik JA, Wagner-Larsen KS, Fasmer KE, Berg HF, et al. A radiogenomics application for prognostic profiling of endometrial cancer. Commun Biol. (2021) 4:1363. doi: 10.1038/s42003-021-02894-5

PubMed Abstract | Crossref Full Text | Google Scholar

41. Gao J, Han F, Jin Y, Wang X, and Zhang J. A radiomics nomogram for the preoperative prediction of lymph node metastasis in pancreatic ductal adenocarcinoma. Front Oncol. (2020) 10:1654. doi: 10.3389/fonc.2020.01654

PubMed Abstract | Crossref Full Text | Google Scholar

42. Dong HC, Dong HK, Yu MH, Lin YH, and Chang CC. Using deep learning with convolutional neural network approach to identify the invasion depth of endometrial cancer in myometrium using MR images: A pilot study. . Int J Environ Res Public Health. (2020) 17. doi: 10.3390/ijerph17165993

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: artificial intelligence, magnetic resonance imaging, endometrial cancer, meta-analysis, systematic review

Citation: Zheng J, Lin X and Li M (2026) Artificial intelligence-based magnetic resonance imaging for preoperative staging of patients with endometrial cancer: a systematic review and meta-analysis. Front. Oncol. 15:1673060. doi: 10.3389/fonc.2025.1673060

Received: 31 July 2025; Accepted: 20 November 2025; Revised: 14 November 2025;
Published: 05 January 2026.

Edited by:

Imran Ashraf, Yeungnam University, Republic of Korea

Reviewed by:

Tao Zheng, First Hospital of Qinhuangdao, China
Roberto Tozzi, University of Padua, Italy

Copyright © 2026 Zheng, Lin and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jinjing Zheng, emhlbmdqaW5qaW5nMTIyM0AxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.