- 1Department of Radiology, Shaoxing People’s Hospital, Shaoxing, China
- 2School of Medicine, Graduate School, Zhejiang University, Hangzhou, China
- 3Department of Radiology, Shaoxing Maternity and Child Health Care Center, Shaoxing, China
Background: To develop a machine learning model integrates multi-parametric magnetic resonance imaging (MRI) radiomics features and clinicopathological features to predict the expression status of phosphatase and tension homolog (PTEN), phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA), and mammalian target of rapamycin (mTOR), which are frequently linked with targeted therapy for endometrial cancer (EC), in order to establish a dependable foundation for personalized adjuvant therapy for EC patients.
Methods: we retrospectively recruited 82 EC patients who underwent preoperative MRI and radical resection at two independent hospitals. 60 patients from Center 1 were utilized as the training set for constructing the machine learning model, while 22 patients from Center 2 served as an external validation set to assess the model’s performance. We evaluated the performance of models predicted three proteins’ expression using receiver operating characteristic (ROC) analysis, calibration curve analysis, and decision curve analysis (DCA).
Result: To construct machine learning models for predicting the expression of PTEN, PIK3CA, and mTOR, we respectively screened 5 radiomic and 7 clinicopathologic features, 4 radiomic and 9 clinicopathologic features, and 2 radiomic and 10 clinicopathologic features. The area under the curve (AUC) values of the radscore, clinicopathology, and combination models predicting PTEN expression were 0.875, 0.703, and 0.891 in the training set, and 0.750, 0.844, and 0.833 in the validation set, respectively. The AUC values for the models predicted PIK3CA expression in the training set were 0.856, 0.633, and 0.880, respectively, in the validation set, they were 0.842, 0.667, and 0.825. The AUC of each model for mTOR were 0.896, 0.831, and 0.912 in the training set, and 0.729, 0.847, and 0.829 in the validation set. Calibration curve analysis and DCA showed that the combination models were both well calibrated and clinically useful.
Conclusion: Machine learning models integrating multi-parametric MRI radiomics and clinicopathological features can be a potential tool for predicting PTEN, PIK3CA, and mTOR expression status in EC patients.
Introduction
EC ranks among the most prevalent malignant tumors affecting the female reproductive system, and its incidence continues to rise each year (1). Although the majority of endometrial cancer patients can be diagnosed early and have a good prognosis after surgery, approximately 10% to 15% of advanced endometrial cancer patients still face limited treatment options and suboptimal systemic chemotherapy efficacy. The 5-year survival rate for patients with distant metastases is only 17% (2). Identifying risk factors for EC metastasis or recurrence remains challenging, and a reliable basis for individualized adjuvant therapy is lacking. Consequently, systemic toxicity often leads to overtreatment of EC. However, given the advancements in research on tumorigenesis and molecular pathway targets associated with EC development, personalized targeted therapy still holds significant promise at this stage.
Mutated genes and aberrant signaling pathways that induce the development of endometrial cancer vary. Notably, the PIK3/AKT/mTOR pathway exhibits the highest rate of alterations among solid tumors, particularly in 92% of type I and 60% of type II endometrial cancers. This is primarily attributed to the prevalence of mutations in genes such as PTEN and PIK3CA. Such findings underscore the significant role played by the PIK3/AKT/mTOR signaling pathway in the pathogenesis of endometrial cancer (3–5). The hyperactivation of the PIK3/AKT/mTOR pathway generally results from the loss of function of PTEN, the amplification or mutation of PIK3CA, and the elevated expression of the PIK3R1, AKT genes, and mTOR (3, 6). Therefore, targeting drugs against this pathway has become an increasing focus of scholarly attention.
MRI is widely used for the diagnosis of endometrial cancer due to its good soft tissue contrast. The International Federation of Gynecology and Obstetrics (FIGO) recommends MRI findings as the preferred staging basis for endometrial cancer. However, MRI-based imaging findings are susceptible to subjective factors due to their high inter-observer variability and lack of quantitative, objective assessment markers. Radiomics, a non-invasive and rapid method, quantitatively analyzes medical imaging data and transforms it into high-dimensional, mineable data. This enables the assessment of tumor heterogeneity, providing valuable information for tumor typing and grading, gene localization, early treatment, and prognosis assessment that cannot be identified by gross observation. Combining radiomics data with clinical data holds promise as an approach to improve clinical management (7).
The aim of this study was to evaluate the effectiveness of a machine learning model combining clinicopathological features and multi-parametric MRI-based radiomics features in predicting the expression of common endometrial cancer targeted therapy-related proteins PTEN, PIK3CA and mTOR, and to assist in clinical decision-making to optimize personalized treatment for EC patients.
Methods
Patient
We conducted a retrospective analysis on 60 endometrial cancer patients diagnosed at Center A between 2013 and 2022 and 22 endometrial cancer patients diagnosed at Center B between 2017 and 2022. None of these patients received any preoperative treatment. All participating institutions’ institutional review boards approved the retrospective study, and all boards waived the requirement for written informed permission from patients.
The following were the inclusion criteria for this study (1): all patients had a surgical pathology diagnosis of endometrial cancer (2); all patients had a preoperative standardized pelvic MRI; and (3) all eligible postoperative pathology specimens were stored in the tissue bank.
Strict exclusion criteria were used, including (1): radiotherapy, preoperative neoadjuvant chemotherapy, or any other intervention directed at the lesion site prior to MRI or surgical treatment (2); more than two weeks between the preoperative MR examination and the surgery (3); poor image quality due to artifacts or an incomplete MRI sequence as assessed by the treating radiologist; and (4) the lesion site was less than 1 centimeter in diameter at maximum or the tumor border could not be clearly delineated.
MRI acquisition
Magnetic resonance images were acquired using a Siemens Verio 3.0 T MR scanner (Germany) in center A and a 1.5T MRI scanner (Siemens Avanto, Germany) in center B. All patients were required to breathe freely in the supine position during data acquisition. The following sequences were acquired: the sagittal T2 weighing imaging (T2WI), the sagittal contrast-enhanced T1-weighted images (CE-T1WI), and axial diffusion weighing imaging (DWI) with apparent diffusion coefficient (ADC) maps with b-values of 0 and 800 s/mm2. The primary scanning sequence for Center A includes T2WI (TR/TE: 3480 ms/103 ms, FOV: 320×320 mm); CE-T1WI (TR/TE: 6700 ms, FOV: 288×288 mm); DWI (TR/TE: 6700 ms/92 ms, FOV: 160×160 mm); and ADC (TR/TE: 6700 ms/92 ms, FOV: 160×160 mm). The main scanning sequence for Center B includes T2WI (TR/TE: 5210 ms/79 ms, FOV: 320×320 mm); CE-T1WI (TR/TE: 735 ms/11 ms, FOV: 320×320 mm); DWI (8900 ms/84 ms, 160 mm×136mm); ADC (TR/TE: 8900 ms/84 ms, FOV:160 mm×136 mm).
Image segmentation and feature extraction
A radiologist(reader 1)with 7 years of experience in gynecologic imaging manually delineated the tumor on four sequences (including T2WI, CE-T1WI, DWI, and ADC) using 3D slicer software (5.2.1) to obtain a volume of interest (VOI). The radiologist was not given any clinical or pathologic information about the patient. Following that, a VOI + 3 mm was created by increasing the volume of interest by 3 mm and manually deleting these spots (which included leiomyosarcomas, cysts, and effusions). The Artificial Intelligent Kit (AK) software (3.2.0.R, GE Healthcare, China) was used to extract imaging features from the VOI. (Figure 1).
A week later, reader 2 and reader 3 who have more than 5 years of experience with pelvic magnetic resonance imaging randomly selected 30 patients and performed the picture segmentation process. The intraclass correlation coefficient (ICC) was used to assess the consistency and reproducibility of radiomic features. Features with ICC greater than 0.75 were retained for further analysis.
Pathological examination
PTEN, PIK3CA and mTOR were assessed by immunohistochemistry (IHC) protocol using paraffin‐embedded tissue samples which were obtained from surgery. We used monoclonal mouse anti‐human PTEN antibody (RTU, RM265, GeneTech Co., Ltd., Shanghai, China), monoclonal rabbit antihuman PIK3CA antibody (1:100, SP139 Abcam, Shanghai, China), or monoclonal rabbit anti-human mTOR antibody (1:400, Y391, Abcam, Shanghai, China). Two professional pathologists with more than 8 years of diagnostic experience will perform the interpretation of the testing results. The immunostaining results of specific antibodies were measured semi-quantitatively by immunoreactive score (IRS) method. The specific calculation method of IRS is as follows: Staining intensity (SI) classification: 0, no staining; 1, light yellow; 2, brown yellow, 3, dark brown; percentage of stained cells (PP): 0, no staining; 1, staining in<10% of tumor cells; 2, staining in 10–50%; 3, staining in 50–80%; 4, staining in>80%. IRS=SI×PP. IRS > 3 was defined as positive immunoreactivity according to the literature criteria (8). An additional movie file shows this in more detail [see Additional file 1]. (Figure 2).
Figure 2. IHC images: Magnification: 10×40 (a, b): IHC images of PTEN; (a), IRS: 8; (b), IRS: 0; (c, d): IHC images of PIK3CA; (c), IRS: 6; (d), IRS: 1; (e, f): IHC images of mTOR; (e), IRS: 8; (f), IRS: 2.
Radiomics feature selection and modeling
First, radiomic features from the training set were standardized using the Z-score method to eliminate scale differences. To reduce feature redundancy, highly correlated features with an absolute Pearson correlation coefficient ≥ 0.9 were removed. Subsequently, dimensionality reduction and feature selection were performed using SelectPercentile (percentile = 80) and LASSO algorithm to further eliminate irrelevant features and retain radiomic features with strong predictive value. Meanwhile, for the collected 11 key clinicopathological features, the SelectKBest feature selection method was used to evaluate the relationship between each feature and the target variable, screening out features with stronger associations for the predictive model. The finally selected radiomic features and clinicopathological features were combined, and three different predictive models were constructed using logistic regression algorithm: a clinicopathological model, a radiomics score model, and a combined model. The discriminative performance of the three established models was evaluated through ROC analysis and quantified using corresponding AUC values with 95% confidence intervals (CI), sensitivity, specificity, and accuracy. Then, calibration curves were used to assess the consistency between the model-predicted probabilities and actual probabilities, and the clinical applicability of the models was evaluated through Decision Curve Analysis (DCA).
Meanwhile, in order to further explore whether the intra-tumor combined peri-tumor imaging model incorporating the 3mm region around the tumor is superior to the single intra-tumor or peri-tumor model, we constructed the intra-tumor combined peri-tumor imaging model based on the same method of feature screening and dimensionality reduction, retained the optimal features incorporated into the model, and built the single intra-tumor and single peri-tumor imaging models by single-factor and multifactorial logistic regression, respectively, and analyzed the predictive efficacy of the models by using the subjects’ ROCs and calculated AUCs, and the Delong test was used to compare the predictive performances of each model.
Statistics
Data were statistically analyzed by applying SPSS 27.0 and R language (v.4.3.1) software. Differences in categorical indicators between risk groups were compared using the chi-square test or Fishers’ exact test, and differences in continuous variables were compared using the independent samples t-test or Mann-Whitney U-test, depending on whether they were normally distributed or not. In all two-sided tests, a P value <0.05 was considered statistically significant.
Result
Patient clinical characteristics
This study initially comprised 76 patients from Center 1 and 55 patients from Center 2. Following the application of the exclusion criteria, a total of 82 patients were enrolled, with 60 from Center 1 and the remaining 22 from Center 2 (Figure 3). The mean age of the patients in the training group was 58.9 ± 2.47 (range: 34–77 years), and the mean age of the patients in the test group was 59.27 ± 4.26(range: 36–81 years). As shown in Table 1, no significant differences were found between the train group and the test group in terms of clinicopathologic indicators such as age, FIGO, myofibrillar infiltration, Ki-67 expression, and P53 expression (all P>0.05). However, in terms of Grade, a statistical difference was found between the train and test groups (P<0.05).
PTEN, PIK3CA, mTOR expression status and clinicopathological features are shown in Table 2. There were significant differences in terms of PTEN expression and Ki-67 expression, mTOR expression and EC Grade (p<0.05), and there were no significant differences in other clinicopathological features and protein expression status (p>0.05).
Radiomics and clinical pathological feature selection
A total of 10,525 features were extracted from seven categories of 12 image preprocessing, including first-order features, shape features, texture features, luminance features (GLSZM), grayscale judgment matrix (GLDM), neighboring grayscale levels (NGTDM), and grayscale stroke length texture features (GLRLM).
Among the 10525 radiological features extracted from the MRI images, 9304 features were considered stable (ICC ≥ 0.75). First, feature pre-selection was performed by removing features with a Pearson correlation coefficient ≥ 0.9. Next, univariate feature selection was performed using SelectPercentile (percentile = 80), and finally, the most valuable radiological features were selected from the training cohort using the Lasso algorithm.
This study incorporated 7 clinicopathologic features associated with PTEN (FIGO, Myometrial invasion, Ki-67, P53, Lymphatic node transfer, Vessel carcinoma embolus, Ovarian metastasis) and 5 MRI-based radiomic features. Additionally, 9 clinicopathologic features related to PIK3CA (Age, Ovarian metastasis, Myometrial invasion, Ki-67, P53, ER, PR, Lymphatic node transfer, Vessel carcinoma embolus) along with 4 MRI-based radiomics features were selected to establish the models. Moreover, 10 clinicopathologic features associated with mTOR (Age, Vessel carcinoma embolus, Ovarian metastasis, FIGO, Grade, Myometrial invasion, Ki-67, P53, ER, Lymphatic node transfer) and 2 MRI-based radiomic features were utilized in model construction.
Based on the extracted radiomics features, we calculated a Radscore value for each patient, and then constructed a logistic regression model. Table 3 shows the selected radiomics features. The Radscore calculation formulas related to the three protein expressions are shown as follows:
In addition, to evaluate the predictive performance of the intratumoral and peritumoral radiomics models separately, the intratumoral and peritumoral regions of each case were analyzed using the Artificial Intelligence Kit software to extract 5264 intratumoral radiomics features and 5261 peritumoral radiomics features, respectively. Through intraclass and interclass consistency tests, 4652 stable features (ICC ≥ 0.75) were selected from each region. The most valuable radiomics features were then chosen from the training cohort using the same selection method for model construction.
Construction and evaluation of the model
The ROC curves of the three models for each protein expression in the training and validation sets are shown in Figure 4. In the training set, the AUC values for PTEN expression prediction were 0.875, 0.703, and 0.891 for the Radscore, Clinicopathology, and Combination models, respectively. In the validation set, the corresponding AUC values were 0.750, 0.844, and 0.833 (Figure 5a, b). For predicting PIK3CA expression, the training set AUC values for the models were 0.856, 0.633, and 0.880, while in the validation set, they were 0.842, 0.667, and 0.825(Figure 5c, d). Lastly, in the training set, the AUC values for mTOR expression prediction were 0.896, 0.831, and 0.912 for the three models, respectively, with corresponding validation set AUC values of 0.729, 0.847, and 0.829 (Figure 5e, f). The AUC, sensitivity, specificity, and accuracy values are presented in Table 4. Calibration curve analysis (Figure 6) and decision curve analysis (Figure 7) showed that the joint model had good calibration and clinical utility in predicting the expression of three different proteins.
Figure 4. The ROC curves of the Radscore, Clinicopathology, and Combination model for each protein expression in the training and validation sets. (a) The ROC curves to predict PTEN expression status in the training set. (b) The ROC curves to predict PTEN expression status in the validation set. (c) The ROC curves to predict PIK3CA expression status in the training set. (d) The ROC curves to predict PIK3CA expression status in the validation set. (e) The ROC curves to predict mTOR expression status in the training set. (f) The ROC curves to predict mTOR expression status in the validation set.
Figure 5. The ROC curves of the GTV, PTV, and GPTV model for each protein expression in the training and validation sets. (a) The ROC curves to predict PTEN expression status in the training set. (b) The ROC curves to predict PTEN expression status in the validation set. (c) The ROC curves to predict PIK3CA expression status in the training set. (d) The ROC curves to predict PIK3CA expression status in the validation set. (e) The ROC curves to predict mTOR expression status in the training set. (f) The ROC curves to predict mTOR expression status in the validation set.
Figure 6. Calibration curves of the three models for each protein expression in the training. Calibration curves of the combination model that predict protein expression status in the training set (a) PTEN; (b) PIK3CA; (c) mTOR. The blue line is the ideal calibration curve when the predicted value is equal to the actual value. The closer the model calibration curve is to this line, the better the prediction ability of this model is.
Figure 7. DCA of the three models for each protein expression in the training. DCA of the combination model that predict protein expression status in the training set (A) PTEN; (B) PIK3CA; (C) mTOR. The abscissa represents the risk of disease occurrence, the ordinate represents the patient’s net income rate, and the blue curve represents that all patients have adverse outcomes. When the prediction model curve is above the blue curve, it represents that the corresponding patients can benefit.
In our study, we observed that the ratio of positive to negative expression of PIK3CA in the total sample was 65:17 (with a positive rate of 79.27%), and for mTOR, it was 67:15 (with a positive rate of 81.71%). This suggests that there may be class imbalance in the two types of samples, which could lead to a decrease in the specificity of the model. To avoid the accuracy paradox, this study further evaluated the recall, precision, F1-score (harmonic mean), and negative predictive value (NPV) of the models predicting the expression status of PIK3CA and mTOR, with a particular focus on the models’ ability to identify negative samples. As shown in Table 5, by comparing the performance metrics of the training set and the validation set, it was found that the combined model generally exhibited higher precision and F1-score in most scenarios, but its recall and negative predictive value showed some fluctuations.
Table 5a. The ability of the respective models to identify negative samples in predicting the expression of PIK3CA.
Table 5b. The ability of the respective models to identify negative samples in predicting the expression of mTOR.
Figure 5 shows the ROC curves for the prediction of each protein expression in the training and validation sets for the gross tumor volume (GTV), peritumoral tumor volume (PTV), and combined intratumor and peritumor, i.e., Gross peritumoral tumor volume (GPTV) radiomics models. While the AUC, sensitivity, specificity, and accuracy values are presented in Table 6. The results showed that the performance of the GPTV model was improved over both the GTV and PTV models in predicting PIK3CA and mTOR expression. Except that the GPTV model for predicting mTOR expression showed a statistically significant difference from the GTV model in the validation set (P<0.05), the other GPTV models exhibited no statistically significant differences from the GTV model and PTV model in both the training set and the validation set (P>0.05).
Discussion
Targeted therapy remains an important component of precision treatment for endometrial cancer (EC) patients. Accurate prediction of the expression of molecular pathway target proteins is helpful for the development of personalized treatment plans for EC patients. This study is committed to constructing a machine learning model based on multiparametric MRI radiomics features combined with clinical pathological features, aiming to evaluate its predictive ability and clinical application value in predicting the expression status of endometrial cancer target therapy-related proteins (PTEN, PIK3CA, and mTOR). This machine learning model is expected to become a new type of non-invasive assessment tool that can accurately identify EC patients who can benefit from targeted therapy, thereby optimizing treatment decisions and significantly improving patients’ clinical outcomes.
Currently, immunohistochemistry and Western blotting are the primary methods for detecting PTEN, PIK3CA, and m-TOR expression. However, due to tumor growth heterogeneity and the limitations of local specimens in reflecting overall tumor characteristics, results from these assays can vary significantly due to technical and subjective factors. Consequently, there is a pressing clinical need for a comprehensive method to evaluate lesions, considering tumor heterogeneity, to accurately assess tumor characteristics. In comparison to conventional pelvic MRI, which is reliant on observer experience, radiomics offers a more effective approach, leveraging quantitative information extraction to capture tumor morphology, intensity, and texture features that may not be discernible to the human eye. This methodology improves the scientific rigor, objectivity, and precision of clinical diagnosis. Luo et al. (9) performed a retrospective MRI-based study and developed a column-line diagram based on clinical features and radiomic scores to assess EC lymphocyte-vascular gap invasion; Yan et al. (10) published a retrospective, multicenter study using an MRI- and Clinical-Based Radiomics Nomogram to predict preoperative high-risk endometrial cancer. The integration of radiomics and clinical features holds promise for enhancing the diagnostic efficacy of EC across various aspects such as muscle infiltration, metastasis, and risk stratification. However, there is a paucity of studies in the field of radio-proteomics. Prior research has illustrated the potential of radiomics in predicting clinically relevant molecular features and protein expression in diverse cancers, encompassing breast, lung, glioblastoma, and pancreatic cancers (11–13). In this study, we introduce a novel machine learning analysis method that amalgamates the radiomics features and clinicopathological features derived from multiparameter MRI. This approach enables an objective and comprehensive assessment of the expression status of PTEN, PIK3CA, and mTOR, offering distinct advantages over conventional pelvic MRI, immunohistochemistry (IHC), or protein blotting techniques.
PTEN stands as one of the most frequently mutated, deleted, and inactivated oncogenes in human cancers, exerting a pivotal role in the pathogenesis of EC and holding promise as a novel marker for this disease (14–16). PTEN is a tumor suppressor that causes cell cycle arrest and inhibits cell proliferation (17). Poly ADP-ribose polymerase (PARP) inhibitors are a class of targeted antitumor agents, and several experiments have confirmed the efficacy of PARP inhibitors against EC (18, 19). Notably, Emerging evidence suggests that PTEN mutant cells exhibit heightened sensitivity to PARP inhibitors due to characteristics associated with increased cellular DNA double-strand breaks and defective homologous recombination mechanisms (20, 21). It can be seen that PTEN, as a potential predictor of PARP efficacy, can provide a basis for the precise stratification and treatment of endometrial cancer patients if its expression is predicted early, thus optimizing individualized treatment and avoiding exposure to ineffective drugs. The machine learning model constructed in this study, which combines radiomics and clinical pathological features, has shown good performance in predicting PTEN expression, with a certain degree of generalizability and robustness. Ki-67, a biomolecular protein, is increasingly recognized as a preferred marker for assessing cell proliferative activity, reflecting the extent of cell growth and proliferation. Its expression holds potential as a biomarker for endometrial carcinogenesis and tumor cell proliferation (22). Previous studies have consistently reported a negative correlation between PTEN and Ki-67 expression in various malignant tumors, including head and neck tumors and lymphomas (23, 24), and a study by Uegaki K. et al. (25) demonstrated that PTEN induced cells to show cell cycle arrest and a decrease in the Ki-67 labeling index. In this study, we also confirmed through case-control analysis that there is a significant difference in the expression levels of PTEN and Ki-67 (p < 0.05), providing new experimental evidence to further elucidate the mechanism of action of PTEN in endometrial cancer.
The PIK3/AKT/mTOR signaling pathway, a classic cell signaling cascade, is among the most commonly activated pathways in malignant tumors, including EC (26). The PIK3CA gene encodes the catalytic subunit of PIK3, p110a, and exhibits frequent mutations across all molecular subtypes of endometrial cancers. mTOR, a conserved serine/threonine protein kinase, governs cell growth, survival, and migration by phosphorylating the Akt and AGC family of kinases. Consequently, aberrant activation of this pathway significantly influences tumor cell growth, proliferation, metastasis, and infiltration (27–30). Similar results were observed in this study, with a significant correlation between mTOR expression and EC pathologic grade (p < 0.05). However, due to the imbalance in baseline characteristics observed in this study (p=0.002), the subsequent validation analysis may be subject to bias. Therefore, our conclusions require further verification in larger, independent cohorts with more balanced baseline characteristics. mTOR represents a more advanced anticancer target within this signaling pathway. Notably, the reported efficacy of Everolimus (an mTOR-targeted inhibitor) in combination with letrozole (an aromatase inhibitor) for recurrent endometrial cancers (31, 32)underscores the potential for enhanced antitumor efficacy through the identification of suitable candidates for treatment with mTOR inhibitors or other targeted agents in conjunction with hormonal agents. Consequently, there is an imperative need for comprehensive characterization to optimize treatment strategies and enhance patient prognosis. The machine learning model constructed in this study, which combines radiomics and clinical pathological features, has shown good performance and high precision in predicting the expression of PIK3CA and mTOR. It outperforms both the single radiomics model and the traditional clinical pathological model in terms of calibration characteristics and clinical translation value.
In our investigation, we constructed a predictive model for the expression of targeted therapy-related proteins in endometrial cancer utilizing four sequences of CE-T1WI, FS-T2WI, DWI, and ADC images. We postulated that multiparametric imaging could more comprehensively capture the microscopic characteristics of lesions from diverse perspectives, thereby offering a more comprehensive depiction of tumor characteristics. Furthermore, this study concurrently integrated peritumoral imaging features, encompassing tumor infiltration margins, potentially shedding light on the role of the tumor microenvironment in cancer biology and behavior. Past studies across various tumor types have consistently underscored the significance of peritumoral imaging analysis in elucidating the implications for tumor biology, prognosis, and treatment response (33–35). The features employed in constructing the PTEN and mTOR machine learning models in this study all encompassed peri-tumor data, with these features carrying substantial relative weight. Additionally, we established distinct intratumoral, 3-mm peritumoral area around the tumor, and merged intratumoral and peritumoral models. Upon comparing AUC values of these models, we observed that the combined model demonstrated comparable performance to the single intratumoral model in predicting PTEN expression, whereas the combined model notably outperformed the separate models in predicting PIK3CA and mTOR expression. Furthermore, it is similar to the results of Beig et al (36), where the efficacy of distinguishing lung adenocarcinoma from sarcoidosis by combining intranodal and perinodal CT imaging histology was superior to that of the intranodal model. These results emphasize that the important contribution of peri-tumor data cannot be ignored.
This study has certain limitations that need to be further improved in subsequent research. First, the samples were only sourced from two medical centers, with a relatively limited sample size, while radiomics studies generally rely on large-scale datasets to ensure the reliability of the results. Although this study has conducted preliminary validation on an external dataset, its clinical application value still needs to be further confirmed through multicenter, prospective studies to verify the robustness, generalizability, and reproducibility of the model. Second, due to incomplete information for some patients, important clinical parameters such as weight, body mass index (BMI), estrogen levels, and CA125 could not be included in the model. In addition, under the retrospective design, the temporal heterogeneity of historical data from each center (such as the iteration of imaging equipment and the update of diagnostic criteria) may introduce inevitable confounding biases, and the differences caused by the different models of MRI equipment in the two centers cannot be ignored. Future research needs to further improve the data collection mechanism, use techniques such as resampling to mitigate potential class imbalance as much as possible, and ensure the comprehensive integration of clinical and imaging features to enhance the overall performance of the model. Finally, this study mainly focused on postoperative endometrial cancer patients, but given that targeted therapy is mainly used for patients with advanced endometrial cancer, it is urgently necessary to verify the validity of the current model in the context of advanced disease in future research.
Conclusion
In summary, we developed a ML model based on clinicopathologic features and multi-parametric MRI-based radiomic features with strong predictive value for the expression status of PTEN, PIK3AC, mTOR. This innovative approach could be a potential tool to objectively and non-invasively provide clinical information on targeted therapy for identifying EC patients who are most likely to benefit from personalized targeted therapy.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by The Academic Ethics Committee of Shaoxing People’s Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because it was a retrospective study conducted with existing data without direct interaction with individuals. The data used in the study were anonymized and did not involve any new interventions or risks.
Author contributions
CS: Data curation, Writing – original draft, Investigation, Methodology. QL: Data curation, Methodology, Writing – original draft. YH: Data curation, Methodology, Writing – original draft. YX: Resources, Writing – original draft. ML: Resources, Writing – original draft. XZ: Data curation, Writing – original draft. JZ: Data curation, Writing – original draft. ZZ: Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review & editing.
Correction note
A correction has been made to this article. Details can be found at: 10.3389/fonc.2026.1790105.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study in part by grants from the Shaoxing Health and Technology Plan Project (2022SY008) and the institution from Key Laboratory of Functional Molecular Imaging of Tumor and Interventional Diagnosis and Treatment of Shaoxing City (Shaoxing People’s Hospital, Shaoxing, Zhejiang, China, grant number, 2020ZDSYSO1).
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Correction note
A correction has been made to this article. Details can be found at: 10.3389/fonc.2026.1790105.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Abbreviations
MRI, magnetic resonance imaging; PTEN, phosphatase and tension homolog; PIK3CA, phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha; mTOR, mammalian target of rapamycin; EC, endometrial cancer; ROC, receiver operating characteristic; DCA, decision curve analysis; AUC, area under the curve; FIGO, International Federation of Gynecology and Obstetrics; T2WI, T2 weighing imaging; CE-T1WI, contrast-enhanced T1-weighted images; DWI, diffusion weighing imaging; ADC, apparent diffusion coefficient; VOI, volume of interest; ICC, intraclass correlation coefficient; IHC, immunohistochemistry; IRS, immunoreactive score; SI, Staining intensity; LASSO, Least Absolute Shrinkage and Selection Operator; CI, confidence intervals; GTV, gross tumor volume; PTV, peritumoral tumor volume; GPTV, gross peritumoral tumor volume; PARP, Poly ADP-ribose polymerase.
References
1. Paleari L, Pesce S, Rutigliani M, Greppi M, Obino V, Gorlero F, et al. New insights into endometrial cancer. Cancers (Basel). (2021) 13:1496. doi: 10.3390/cancers13071496, PMID: 33804979
2. Makker V, Colombo N, Casado Herráez A, Santin AD, Colomba E, Miller DS, et al. Lenvatinib plus pembrolizumab for advanced endometrial cancer. N Engl J Med. (2022) 386:437–48. doi: 10.1056/NEJMoa2108330, PMID: 35045221
3. Brooks RA, Fleming GF, Lastra RR, Lee NK, Moroney JW, Son CH, et al. Current recommendations and recent progress in endometrial cancer. CA Cancer J Clin. (2019) 69:258–79. doi: 10.3322/caac.21561, PMID: 31074865
4. Slomovitz BM and Coleman RL. The PIK3/AKT/mTOR pathway as a therapeutic target in endometrial cancer. Clin Cancer Res. (2012) 18:5856–64. doi: 10.1158/1078-0432.CCR-12-0662, PMID: 23082003
5. Chen J, Zhao KN, Li R, Shao R, and Chen C. Activation of PIK3/Akt/mTOR pathway and dual inhibitors of PIK3 and mTOR in endometrial cancer. Curr Med Chem. (2014) 21:3070–80. doi: 10.2174/0929867321666140414095605, PMID: 24735369
6. Barra F, Evangelisti G, Ferro Desideri L, Di Domenico S, Ferraioli D, Vellone VG, et al. Investigational PIK3/AKT/mTOR inhibitors in development for endometrial cancer. Expert Opin Investig Drugs. (2019) 28:131–42. doi: 10.1080/13543784.2018.1558202, PMID: 30574817
7. Ren J, Li Y, Yang JJ, Zhao J, Xiang Y, Xia C, et al. MRI-based radiomics analysis improves preoperative diagnostic performance for the depth of stromal invasion in patients with early stage cervical cancer. Insights Imaging. (2022) 13:17. doi: 10.1186/s13244-022-01156-0, PMID: 35092505
8. Esteva FJ, Guo H, Zhang S, Santa-Maria C, Stone S, Lanchbury JS, et al. PTEN, PIK3CA, p-AKT, and p-p70S6K status: association with trastuzumab response and survival in patients with HER2-positive metastatic breast cancer. Am J Pathol. (2010) 177:1647–56. doi: 10.2353/ajpath.2010.090885, PMID: 20813970
9. Luo Y, Mei D, Gong J, Zuo M, and Guo X. Multiparametric MRI-based radiomics nomogram for predicting lymphovascular space invasion in endometrial carcinoma. J Magn Reson Imaging. (2020) 52:1257–62. doi: 10.1002/jmri.27142, PMID: 32315482
10. Yan BC, Li Y, Ma FH, Feng F, Sun MH, Lin GW, et al. Preoperative assessment for high-risk endometrial cancer by developing an MRI- and clinical-based radiomics nomogram: A multicenter study. J Magn Reson Imaging. (2020) 52:1872–82. doi: 10.1002/jmri.27289, PMID: 32681608
11. Liu Z, Duan T, Zhang Y, Weng S, Xu H, Ren Y, et al. Radiogenomics: a key component of precision cancer medicine. Br J Cancer. (2023) 129:741–53. doi: 10.1038/s41416-023-02317-8, PMID: 37414827
12. Zhou M, Leung A, Echegaray S, Gentles A, Shrager JB, Jensen KC, et al. Non-small cell lung cancer radiogenomics map identifies relationships between molecular and imaging phenotypes with prognostic implications. Radiology. (2018) 286:307–15. doi: 10.1148/radiol.2017161845, PMID: 28727543
13. Kayadibi Y, Kocak B, Ucar N, Akan YN, Akbas P, and Bektas S. Radioproteomics in breast cancer: prediction of ki-67 expression with MRI-based radiomic models. Acad Radiol. (2022) 29 Suppl 1:S116–s25. doi: 10.1016/j.acra.2021.02.001, PMID: 33744071
14. Raffone A, Travaglino A, Saccone G, Viggiani M, Giampaolino P, Insabato L, et al. PTEN expression in endometrial hyperplasia and risk of cancer: a systematic review and meta-analysis. Arch Gynecol Obstet. (2019) 299:1511–24. doi: 10.1007/s00404-019-05123-x, PMID: 30915635
15. Lee YR, Chen M, and Pandolfi PP. The functions and regulation of the PTEN tumor suppressor: new modes and prospects. Nat Rev Mol Cell Biol. (2018) 19:547–62. doi: 10.1038/s41580-018-0015-0, PMID: 29858604
16. Daix M, Gladieff L, Martinez A, Ferron G, and Angeles MA. Pocket memo based on the ESGO/ESTRO/ESP guidelines for the management of patients with endometrial carcinoma: definition of prognostic risk groups. Int J Gynecol Cancer. (2021) 31:1615–6. doi: 10.1136/ijgc-2021-003110, PMID: 34645685
17. Raffone A, Travaglino A, Saccone G, Campanino MR, Mollo A, De Placido G, et al. Loss of PTEN expression as diagnostic marker of endometrial precancer: A systematic review and meta-analysis. Acta Obstet Gynecol Scand. (2019) 98:275–86. doi: 10.1111/aogs.13513, PMID: 30511743
18. Shen K, Yang L, Li FY, Zhang F, Ding LL, Yang J, et al. Research progress of PARP inhibitor monotherapy and combination therapy for endometrial cancer. Curr Drug Targets. (2022) 23:145–55. doi: 10.2174/1389450122666210617111304, PMID: 34139979
19. Musacchio L, Caruso G, Pisano C, Cecere SC, Di Napoli M, Attademo L, et al. PARP inhibitors in endometrial cancer: current status and perspectives. Cancer Manag Res. (2020) 12:6123–35. doi: 10.2147/CMAR.S221001, PMID: 32801862
20. Tian T, Zhao Y, Zheng J, Jin S, Liu Z, and Wang T. Circular RNA: A potential diagnostic, prognostic, and therapeutic biomarker for human triple-negative breast cancer. Mol Ther Nucleic Acids. (2021) 26:63–80. doi: 10.1016/j.omtn.2021.06.017, PMID: 34513294
21. Bian X, Gao J, Luo F, Rui C, Zheng T, Wang D, et al. PTEN deficiency sensitizes endometrioid endometrial cancer to compound PARP-PIK3 inhibition but not PARP inhibition as monotherapy. Oncogene. (2018) 37:341–51. doi: 10.1038/onc.2017.326, PMID: 28945226
22. Ghalib Farhood R and Abd Ali Al-Humairi I. Immunohistochemical study of ki-67 in hyperplastic and endometrium carcinoma: A comparative study. Arch Razi Inst. (2022) 77:229–34. doi: 10.22092/ARI.2021.356540, PMID: 35891746
23. Ahmed MW, Kayani MA, Shabbir G, Ali SM, Shinwari WU, and Mahjabeen I. Expression of PTEN and its correlation with proliferation marker Ki-67 in head and neck cancer. Int J Biol Markers. (2016) 31:e193–203. doi: 10.5301/jbm.5000196, PMID: 26916897
24. Fu X, Zhang X, Gao J, Li X, Zhang L, Li L, et al. Phosphatase and tensin homolog (PTEN) is down-regulated in human NK/T-cell lymphoma and corrects with clinical outcomes. Med (Baltimore). (2017) 96:e7111. doi: 10.1097/MD.0000000000007111, PMID: 28723738
25. Uegaki K, Kanamori Y, Kigawa J, Kawaguchi W, Kaneko R, Naniwa J, et al. PTEN is involved in the signal transduction pathway of contact inhibition in endometrial cells. Cell Tissue Res. (2006) 323:523–8. doi: 10.1007/s00441-005-0082-3, PMID: 16283392
26. Vasjari L, Bresan S, Biskup C, Pai G, and Rubio I. Ras signals principally via Erk in G1 but cooperates with PIK3/Akt for Cyclin D induction and S-phase entry. Cell Cycle. (2019) 18:204–25. doi: 10.1080/15384101.2018.1560205, PMID: 30560710
27. Murugan AK. mTOR: Role in cancer, metastasis and drug resistance. Semin Cancer Biol. (2019) 59:92–111. doi: 10.1016/j.semcancer.2019.07.003, PMID: 31408724
28. Tao Y and Liang B. PTEN mutation: A potential prognostic factor associated with immune infiltration in endometrial carcinoma. Pathol Res Pract. (2020) 216:152943. doi: 10.1016/j.prp.2020.152943, PMID: 32279917
29. Kunitomi H, Banno K, Yanokura M, Takeda T, Iijima M, Nakamura K, et al. New use of microsatellite instability analysis in endometrial cancer. Oncol Lett. (2017) 14:3297–301. doi: 10.3892/ol.2017.6640, PMID: 28927079
30. Aoki M and Fujishita T. Oncogenic roles of the PIK3/AKT/mTOR axis. Curr Top Microbiol Immunol. (2017) 407:153–89. doi: 10.1007/82_2017_6, PMID: 28550454
31. Slomovitz BM, Jiang Y, Yates MS, Soliman PT, Johnston T, Nowakowski M, et al. Phase II study of everolimus and letrozole in patients with recurrent endometrial carcinoma. J Clin Oncol. (2015) 33:930–6. doi: 10.1200/JCO.2014.58.3401, PMID: 25624430
32. Soliman PT, Westin SN, Iglesias DA, Fellman BM, Yuan Y, Zhang Q, et al. Everolimus, letrozole, and metformin in women with advanced or recurrent endometrioid endometrial cancer: A multi-center, single arm, phase II study. Clin Cancer Res. (2020) 26:581–7. doi: 10.1158/1078-0432.CCR-19-0471, PMID: 31628143
33. Braman NM, Etesami M, Prasanna P, Dubchuk C, Gilmore H, Tiwari P, et al. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. (2017) 19:57. doi: 10.1186/s13058-017-0846-1, PMID: 28521821
34. Braman N, Prasanna P, Whitney J, Singh S, Beig N, Etesami M, et al. Association of peritumoral radiomics with tumor biology and pathologic response to preoperative targeted therapy for HER2 (ERBB2)-positive breast cancer. JAMA Netw Open. (2019) 2:e192561. doi: 10.1001/jamanetworkopen.2019.2561, PMID: 31002322
35. Prasanna P, Patel J, Partovi S, Madabhushi A, and Tiwari P. Radiomic features from the peritumoral brain parenchyma on treatment-naïve multi-parametric MR imaging predict long versus short-term survival in glioblastoma multiforme: Preliminary findings. Eur Radiol. (2017) 27:4188–97. doi: 10.1007/s00330-016-4637-3, PMID: 27778090
Keywords: endometrial carcinoma, machine learning, Pten, PIK3, mTOR, targeted therapy
Citation: Sun C, Li Q, Huang Y, Xia Y, Li M, Zhu X, Zhu J and Zhao Z (2026) Development of a machine learning model for predicting the expression of proteins associated with targeted therapy in endometrial cancer. Front. Oncol. 15:1502370. doi: 10.3389/fonc.2025.1502370
Received: 26 September 2024; Accepted: 16 December 2025; Revised: 20 November 2025;
Published: 12 January 2026; Corrected: 26 January 2026.
Edited by:
Robert Fruscio, University of Milano Bicocca, ItalyReviewed by:
Giorgio Bogani, National Cancer Institute Foundation (IRCCS), ItalyZhenyu Shu, Zhejiang Provincial People’s Hospital, China
Copyright © 2026 Sun, Li, Huang, Xia, Li, Zhu, Zhu and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhenhua Zhao, emhhbzIwNzVAMTYzLmNvbQ==
Yanan Huang1