Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 09 August 2022
Sec. Gynecological Oncology

Machine-learning-based contrast-enhanced computed tomography radiomic analysis for categorization of ovarian tumors

Jiaojiao Li,&#x;Jiaojiao Li1,2†Tianzhu Zhang&#x;Tianzhu Zhang3†Juanwei MaJuanwei Ma1Ningnannan ZhangNingnannan Zhang3Zhang Zhang*Zhang Zhang3*Zhaoxiang Ye*Zhaoxiang Ye1*
  • 1Department of Radiology, National Clinical Research Center for Cancer, Tianjin Key Laboratory of Cancer Prevention and Therapy, Tianjin’s Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China
  • 2Department of Radiology, First Affiliated Hospital of Hebei North University, Zhangjiakou, China
  • 3Department of Radiology, Tianjin Medical University General Hospital, Tianjin, China

Objectives: This study aims to evaluate the diagnostic performance of machine-learning-based contrast-enhanced CT radiomic analysis for categorizing benign and malignant ovarian tumors.

Methods: A total of 1,329 patients with ovarian tumors were randomly divided into a training cohort (N=930) and a validation cohort (N=399). All tumors were resected, and pathological findings were confirmed. Radiomic features were extracted from the portal venous phase images of contrast-enhanced CT. The clinical predictors included age, CA-125, HE-4, ascites, and margin of tumor. Both radiomics model (including selected radiomic features) and mixed model (incorporating selected radiomic features and clinical predictors) were constructed respectively. Six classifiers [k-nearest neighbor (KNN), support vector machines (SVM), random forest (RF), logistic regression (LR), multi-layer perceptron (MLP), and eXtreme Gradient Boosting (XGBoost)] were used for each model. The mean relative standard deviation (RSD) and area under the receiver operating characteristic curve (AUC) were applied to evaluate and select the best classifiers. Then, the performances of the two models with selected classifiers were assessed in the validation cohort.

Results: The MLP classifier with the least RSD (1.21 and 0.53, respectively) was selected as the best classifier in both radiomics and mixed models. The two models with MLP classifier performed well in the validation cohort, with the AUCs of 0.91 and 0.96 and with accuracies (ACCs) of 0.83 and 0.87, respectively. The Delong test showed that the AUC of mixed model was statistically different from that of radiomics model (p<0.001).

Conclusions: Machine-learning-based CT radiomic analysis could categorize ovarian tumors with good performance preoperatively. The mixed model with MLP classifier may be a potential tool in clinical applications.

Introduction

Ovarian cancer (OC) remains the third most common gynecological malignant tumor responsible for gynecological-cancer-related deaths (1). Patients with OC are often treated with cytoreductive surgery followed by chemotherapy or neoadjuvant chemotherapy followed by interval debulking (2). Conservative management can reduce unnecessary operative costs, decrease long-term surgical complications, and keep fertility for patients with asymptomatic benign ovarian masses (3, 4). Nevertheless, ovarian tumors were often detected incidentally, and most of them were addressed with surgery and proven to be benign (4). Therefore, characterizing the ovarian masses and assessing the possible malignant diseases would be critical for personalized treatment and follow-up plans.

Traditional imaging techniques have been widely applied for classifying ovarian tumors and monitoring treatment response in patients with OC. Specifically, ultrasound (US) is used as the first-line imaging modality for the assessment and characterization of ovarian tumors, and magnetic resonance imaging (MRI) was recommended as the problem-solving tool for the sonographically indeterminate adnexal masses (5). In addition, CT is regarded as the optimal selection for preoperative staging of OC according to the European Society of Urogenital Radiology guidelines (6); however, its role in the differentiation between benign and malignant ovarian masses is limited. In contrast to its suboptimal diagnostic performance, CT with its high spatial resolution and wide availability has been widely used in the incidental initial detection in routine clinical practice. The diagnosis of ovarian tumors mainly depended on these imaging modalities and subjective imaging assessment. However, it is limited in heterogeneity detection of ovarian masses. Therefore, it is crucial to develop a precise, objective, and non-invasive approach to preoperative categorization of ovarian tumors based on CT images.

Radiomics, allowing for high-dimensional quantitative data extraction from medical images, has shown significant power for tumor detection and classification along with objective support for clinical treatment strategy (7, 8). In brief, radiomics can offer additional diagnostic information that cannot be visible to the human eyes (9), and the information may remedy the limitation of lower soft tissue resolution in CT images. Recent studies have shown that radiomics images could distinguish benign from malignant masses with favorable performance using CT (1013). Furthermore, machine learning, an emerging data mining approach, has offered various useful methodologies to efficiently and effectively construct accurate models for prediction based on a large number of variables (1416). Combined with machine learning algorithms, radiomics techniques have been implemented for various cancer diagnoses (1618).

The study aims to apply 3D radiomic analysis to preoperatively categorize benign and malignant ovarian tumors using different machine learning classifiers to build the best radiomics model and to develop and validate the mixed model by combining the radiomic and clinical features.

Materials and methods

Patients

This retrospective study was approved by the review board of Tianjin Medical University Cancer Hospital (approval no. bc2022048), and informed consent was waived. All the clinical records have been desensitized. We included patients with ovarian tumors from January 2013 to July 2021. The inclusion criteria were as follows: (1) availability of pretreatment contrast-enhanced CT images and (2) definite histological diagnosis of ovarian tumors. The exclusion criteria were as follows: (1) patients with lesions smaller than 1 cm and (2) poor image quality or serious artifacts. Finally, 1,329 patients were selected and included in this study. In addition, all the patients were randomly divided into a training cohort (N=930) and a validation cohort (N=399) with a ratio of 7:3. A flowchart for the recruitment of patients is shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1 The patient selection workflow.

Clinical data

The clinical data of all patients were retrospectively analyzed from our picture archiving and communication system (PACS), including clinical baseline and CT-reported data. CA-125 and HE-4 are widely used to diagnose and monitor OC (19). The presence of ill-defined margin of the masses and a large number of ascites is an indication of ovarian malignant tumors (20, 21). Thus, the clinical baseline data including age, CA-125 level, and HE-4 level were recorded. In addition, CT-reported data including ascites and margin of tumor were recorded and evaluated by two radiologists (with 7 years of experience, JL; with 3 years of experience, TZ). The two radiologists were blinded to histological results and clinical information. Differences were resolved by consensus.

CT image acquisition

All patients underwent an abdominal-pelvic or pelvic contrast-enhanced CT scan before treatment. Contrast-enhanced CT scans were performed using seven different CT scanners: Discovery CT750 HD (GE medical systems), Revolution EVO (GE medical systems), Optima CT680 Expert (GE medical systems), LightSpeed 16 (GE medical systems), SOMATOM Definition AS+ (SIEMENS), SOMATOM Definition Drive (SIEMENS), and IQon Spectral CT (Philips). The acquisition parameters were as follows: matrix, 512×512; tube voltage, 120 kVp; auto tube current; and thickness, 1.0–2.5 mm. The volume of contrast medium was weight based (body weight in kg × 1.5 ml) with an upper limit of 150 ml, a concentration of 270–350 mg/ml iodine through an antecubital vein at the rate of 2.5–2.8 ml/s. The portal venous phase of contrast-enhanced CT images was retrieved at a 55–60 s delay after contrast medium injection.

Tumor segmentation

In the case of bilateral ovarian tumors, only the larger tumor was subjected to tumor segmentation and further radiomic analysis. For each patient, the volume of interest (VOI) for ovarian tumor was delineated manually slice by slice on the portal venous phase of contrast-enhanced CT using 3D-slicer software (www.slicer.org), as illustrated in Supplementary Figure S1, by a radiologist (with 7 years of experience, JL). The inter- and intra-observer consistency of tumor segmentation was analyzed with 50 randomly chosen patients after 3 weeks. The same radiologist (with 7 years of experience, JL) performed a region of interest (ROI) drawing with the same method for intra-observer agreement assessment, and another radiologist (with 3 years of experience, TZ) performed ROI segmentation with the same method independently to assess inter-observer reliability.

Extraction and selection of radiomic features

The radiomic features were extracted by PyRadiomics, and additional details are in Supplementary Materials 1.1. A total of 1,316 radiomic features were extracted from original and filtered images (five Laplace of Gaussian filter and eight wavelet transform). The radiomic features included 14 shape-based features, 18 first-order statistic features, 24 gray-level co-occurrence matrix (GLCM) features, 16 gray-level run-length matrix (GLRLM) features, 16 gray-level size zone matrix (GLSZM) features, 14 gray-level dependence matrix (GLDM) features, 5 neighboring gray-tone difference matrix (NGTDM) features, 465 Laplacian of Gaussian-filtered (LoG) features, and 744 wavelet features. The intra-class correlation coefficients (ICCs) were calculated to evaluate inter- and intra-observer repeatability of radiomic features extraction. The radiomic features with the values of ICCs >0.8 were reserved.

Extracted radiomic features were standardized by a z-score normalization, and additional details are in Supplementary Materials 1.2. We selected radiomic features based on the following steps in the training cohorts. First, the top 10% of best features calculated by univariate analysis using the SelectPercentile were selected. Then, Pearson or Spearman correlation matrices were utilized to assess the correlation between the radiomic features where a correlation coefficient >0.8 was considered redundant. Finally, a wrapper feature selection method based on RF classifier was used to choose the best predictive features.

Construction of radiomics and mixed model

Radiomics model based on selected radiomic features was established. Univariate analysis and multivariate logistic regression analysis were conducted to select independent predictors from clinical variables. Then, wrapper feature selection based on RF classifier was further used to select the significant features from the selected radiomic features and independent clinical predictors, based on which a mixed model was built.

Model validation and evaluation

Six supervised machine learning classifiers were used for the radiomics and mixed model: k-nearest neighbor (KNN), support vector machines (SVMs), random forest (RF), logistic regression (LR), multi-layer perceptron (MLP), and eXtreme Gradient Boosting (XGBoost). The fivefold cross-validation for each classifier was applied to the training cohort, and a StratifiedKFold iterator in scikit-learn was used. To select the best machine learning classifier, relative standard deviation (RSD) was employed to quantify the stability of the six classifiers. RSD (22) was defined as the ratio between the standard deviation and mean of the fivefold cross-validation AUC values in the training cohort:

RSD=sdAUCmeanAUC100

The lower the RSD value, the higher the stability of the classifier. Similar to a previous study (23), we used the median values of area under the receiver-operating characteristic curve (AUC) and RSD as thresholds to assess and select the classifiers among six classifiers in the training cohort. The classifiers with RSD ≤1.64 and AUC ≥0.91, and with RSD ≤0.94 and AUC ≥0.94 were considered as highly reliable and accurate in the radiomics and mixed models, respectively. Moreover, the classifier with the highest AUC in the validation cohort was considered as the best classifier. In addition, confusion matrix-derived metrics, including accuracy (ACC), sensitivity (Sens), specificity (Spec), positive predictive value (PPV), and negative predictive value (NPV) of two models were calculated to further assess the two models with the best classifier. The difference in the AUC values between the two models with the best classifier was tested using the p-value of DeLong (D) test. The one with the higher AUC was identified as the best model.

Human readout

All CT images from the validation cohort were evaluated by a senior (ZZ, with 12 years of working experience) in random order in the radiology department. The senior was blinded to the research design, clinical information, and background.

Statistical analysis

In this study, differences in patients’ clinical predictors between training and validation cohorts were compared by using the Mann–Whitney U test for continuous variables and the chi-square test for categorical variables. The Mann–Whitney U test or chi-square test was utilized to identify independent clinical predictors for the univariate analysis. Binary logistic regression analysis was used for multivariate logistic regression analysis. A two-tailed p<0.05 was considered statistically significant. Statistical analysis was performed using MedCalc 18.2.1. The feature selection and model building were performed using Python 3.7. The boxplot of ICCs of radiomics features and heatmap were drawn by R software (version 4.1.2).

Results

Patients’ demographics and clinical characteristics

Histological characteristics of ovarian tumors after surgery are summarized in Table 1. The study consisted of 719 patients with benign ovarian tumors and 610 patients with malignant ovarian tumors. The clinical baseline and CT-reported data of patients are presented in Table 2. There were no significant differences in patient age, level of CA-125, level of HE-4, ascites, and the margin of tumor between the training and validation cohorts (all p>0.05).

TABLE 1
www.frontiersin.org

Table 1 Histopathological characteristics of ovarian tumors included in the study.

TABLE 2
www.frontiersin.org

Table 2 Patients characteristics.

The results of the univariate analysis and multivariate logistic regression analysis are shown in Table 3. Patients with benign ovarian tumors were younger than those with malignant ovarian tumors (p<0.0001). The HE-4 and CA-125 levels in patients with malignant ovarian tumors were higher than those in patients with benign ovarian tumors (both p<0.0001). The women with benign ovarian tumors had fewer ascites (p<0.0001), and the margin of tumor was better defined in those (p<0.0001). Multivariate regression logistic analysis showed no significant differences in age and level of CA-125 (both p>0.05). Finally, the level of HE-4, ascites, and margin were selected as independent clinical predictors related to benign and malignant ovarian tumors.

TABLE 3
www.frontiersin.org

Table 3 Results of univariate analysis and multivariate logistic regression analysis for clinical predictors in the training cohort.

Radiomic features analysis and feature selection

The ICCs were calculated to assess the robustness and repeatability of radiomic features (Figure 2). The shape-based features, first-order statistic features, GLCM features, GLRM features, NGTDM features, LoG features, and wavelet features held high robustness and repeatability not only in the inter-observer measurement but also in the intra-observer measurement. The mean ICC values of these features were all >0.8. However, the GLSZM features (0.916 ± 0.127 vs. 0.783 ± 0.345) and GLDM features (0.883 ± 0.196 vs. 0.764 ± 0.388) had high robustness in intra-observer measurement, whereas there was less repeatability in inter-observer measurement.

FIGURE 2
www.frontiersin.org

Figure 2 Boxplot of ICCs of radiomic features extracted from nine feature groups. (A) Intra-observer ICCs. (B) Inter-observer ICCs. ICCs, intra-class correlation coefficients.

Totally, 1,229 radiomic features with ICCs>0.8 including 14 shape-based features, 17 first-order statistic features, 21 features of GLCM, 12 features of GLRLM, 11 features of GLSZM, 10 features of GLDM, 5 features of NGTDM, 429 LoG features, and 710 wavelet features were retained and used for further feature selection. Then, after feature selection including univariate analysis and rapper feature selection method, nine radiomic features including one shape feature, four LoG features, and four wavelet features associated with differentiation in ovarian tumors were identified and further applied to establish models. Detailed information about feature selection and these selected radiomic features can be found in Supplementary Material 1.3 and Supplementary Table S1.

Model construction and comparison

A radiomics model based on the nine selected radiomic features was developed. HE-4 level, margin, and six radiomic features were determined by the wrapper method based on RF classifier and selected as independent predictors to construct the mixed model. These predictors have been detailed in Supplementary Material 1.4 and Supplementary Table S2. The six classifiers including KNN, SVM, RF, LG, MLP, and XGBoost were evaluated in the radiomics and mixed model. The RSDs of the six classifiers in the radiomics and mixed model are shown in Table 4. The heatmap of the mean AUC values for the six classifiers in the radiomics and mixed model is presented in Figure 3. According to the criteria for RSD ≤1.64 and AUC ≥0.91, and RSD ≤0.94 and AUC ≥0.94 in the radiomics and mixed models, respectively, the classifiers (XGBoost, MLP, and SVM) in the radiomics model and the classifiers (LR, MLP, and SVM) in the mixed model were selected. The selected classifiers were then applied to the validation cohort. Among these selected classifiers, the MLP had the highest AUC in both the radiomics and mixed models, which was chosen as the best classifier. Therefore, MLP was chosen as the machine learning classifier for constructing the radiomics and mixed models. The performances (AUC, ACC, Sens, Spec, PPV, and NPV) of radiomics and mixed model with the six classifiers are listed in Table 5 and Figure 3. Figure 4 illustrates the confusion matrix of the optimal classifier of MLP in the radiomics and mixed model. The ACCs were 0.83 and 0.87 for the radiomics and mixed model with the optimal classifier. The Sens, Spec, PPV, and NPV of MLP classifier were 0.84, 0.82, 0.86, and 0.80 in the radiomics model, and 0.84, 0.84, 0.87, and 0.88 in the mixed model, respectively. Moreover, we compared the diagnostic performance of the aforementioned radiomics and mixed models with that derived from the visual inspection of senior radiologist. As shown in Table 5, the AUC was 0.78 for the senior radiologist, and it was inferior to the AUCs of radiomics and mixed models. DeLong test showed statistically significant differences between senior radiologist’s evaluation and radiomics model (D=5.87, p<0.001) and mixed model (D=9.03, p<0.001) separately. The illustration of the fivefold cross-validation receiver-operating characteristic (ROC) curve for the training cohort and ROC curve for the validation cohort in the radiomics and mixed model with the optimal classifier is presented in Figure 5. The values of the AUC of the two models with the optimal classifier on the validation cohort were compared by the Delong test, and the result showed a significant difference (D=−4.36, p<0.001). The mixed model with MLP classifier was the best.

TABLE 4
www.frontiersin.org

Table 4 The RSD of classifiers for radiomics and mixed model in the training cohort.

FIGURE 3
www.frontiersin.org

Figure 3 The heatmap illustrating the predictive performance (AUC) for the six classifiers. AUC, area under the receiver-operating characteristic curve; R_model_train, radiomics model in the training cohort; M_model_train, mixed model in the training cohort; R_model_validation, radiomics model in the validation cohort; M_model_validation, mixed model in the validation cohort; XGBoost, eXtreme Gradient Boosting; LR, logistic regression; RF, random forest; MLP, multi-layer perceptron; KNN, k-nearest neighbor; SVM, support vector machines.

TABLE 5
www.frontiersin.org

Table 5 Diagnostic performances of the classifiers for radiomics and mixed model and the senior radiologist in the validation cohort.

FIGURE 4
www.frontiersin.org

Figure 4 Confusion matrix with MLP classifier in the validation cohort. (A) The radiomics model with MLP classifier. (B) The mixed model with MLP classifier. MLP, multi-layer perceptron.

FIGURE 5
www.frontiersin.org

Figure 5 ROC curve of the optimal classifier (MLP). (A) The fivefold cross-validation ROC curve of radiomics model with MLP classifier in the training cohort. (B) The ROC curve of radiomics model with MLP classifier in the validation cohort. (C) The fivefold cross-validation ROC curve of mixed model with MLP classifier in the training cohort. (D) The ROC curve of mixed model with MLP classifier in the validation cohort. ROC, receiver-operating characteristic; MLP, multi-layer perceptron.

Discussion

In this study, we developed and compared CT 3D radiomic analysis to categorize benign and malignant ovarian tumors preoperatively using six machine learning classifiers, and the MLP classifier exhibited the best performance. Our results showed that both radiomics and mixed models with MLP classifiers can be used for OC differentiation. Additionally, the mixed model with MLP classifier demonstrated significantly improved discrimination performance for ovarian tumors.

Traditionally, the differentiation of benign and malignant ovarian tumors was mainly dependent on the subjective analysis of US and/or MRI imaging (24). Although CT may play a useful role in differentiating ovarian masses, its performance based on subjective analysis is often imperfect. In contrast, the mixed model with MLP classifier showed superior diagnostic performance in the validation cohort (AUC=0.96) in this study. Encouragingly, the mixed model with MLP classifier could afford good classification in patients with borderline tumors regarded as low potential malignancies and further demonstrated its superiority with high PPV (0.87) and NPV (0.88) in the validation cohort. The radiomics model with MLP classifier also showed relatively high diagnostic performance in the validation cohort (AUC=0.91). Therefore, the combination of machine learning classifier and radiomic features maybe the main reason for better outperformed diagnostic performance using CT images for differentiation of ovarian tumors, which may make up for the deficiency of CT clinical application to a certain extent.

Machine learning classifiers have been extensively applied in the field of radiomic analysis and further enhanced diagnostic performance (15, 23, 25, 26). Different classifiers in machine learning have different algorithms and diagnostic efficiencies. To data, a few studies have explored radiomic analysis based on machine learning classifiers for distinguishing ovarian tumors; however, the diagnostic efficiencies of different machine learning classifiers were not compared (27, 28). In our study, we explored and compared the diagnostic values of six classifiers, namely, RF, KNN, LG, SVM, MLP, and XGBoost. KNN is a relatively simple classifier that directly calculates images to images distances. XGBoost and RF are constructed by a multitude of decision trees, and their advantages are the ease of use and ability to calculate the importance of features (29). LR is currently the most widely used machine learning classifier because of its simplicity. However, it is necessary to pay attention to its inherent limitation, such as the independence assumption to features. SVM iteratively forms a hyperplane or a set of hyperplanes in high-dimensional feature space that separates the clinical problems (30). As deep-learning is now widely used for the diagnosis of various diseases, the deep-learning or convolutional neural network (CNN) method would assist in the construction of favorable classification models. The MLP is the simplest form of an artificial neural network; it simulates the nervous system properties and biological learning functions through an adaptive process (31, 32). The present study implemented the MLP classifier and conducted the preliminary study on CNN-based classification models. Although the diagnostic value of RF was slightly higher than that of the MLP, the latter was found to be more stable than the former in the training cohort. The stability of the machine learning classifier is also very important for its model construction and clinical application (22). Therefore, the MLP was chosen to develop the radiomics and mixed model. Moreover, the two models with the MLP classifier showed the highest diagnostic efficiency than the other conventional classifiers in the validation cohort, which suggested that MLP was the preferred classifier when differentiating benign and malignant ovarian tumors.

A prior study constructed a model to classify ovarian tumors using radiomic analysis based on plain CT (33). The radiomics nomogram demonstrated the best performance in both the training (AUC=0.95) and test (AUC=0.96) cohort. External validation also showed high diagnostic performance (AUC=0.95) in the previous study. In our study, a large number of patients were used for radiomic analysis based on CT images, and borderline tumors were included. Pan et al. demonstrated that the nomogram model, combining CT radiomic and semantic features, showed the best diagnostic value to differentiate serous and mucinous ovarian cystadenomas than the radiomics-based and semantics-based models (34). The CT semantic feature of the margin, as an independent predictor, was included in the mixed model in our study. Consistent with the previous study, the mixed model possessed a higher diagnostic value than the radiomics model in which only radiomic features were involved in our study. This result further validated complementarity in the radiomic and CT semantic features. Moreover, the margin of ovarian masses was divided into two categories, namely, well-defined margin and ill-defined margin in this study, which was easily evaluated by clinicians even for inexperienced clinicians in this study. This indirectly mirrored the operability and feasibility of the mixed model in our study. Deep learning, especially CNN, has been attracting attention within radiology. For instance, Wang et al. applied CNN on routine MRI to assess the nature of ovarian tumors and found that CNN could assist radiologists in improving their diagnostic performance (35). However, manual segmentation was used for ovarian lesions segmentation in the study. Nowadays, automatic segmentation based on deep learning has been successfully implemented; nevertheless, it is still challenging for ovarian lesion on CT or MRI because the bladder or uterus are frequently confused for the ovary or ovarian lesion. Furthermore, because of the inherent limitation of the sample size for medical images, training a CNN model from scratch for one specific clinical question often does not yield satisfactory results (36). In addition, the current study also showed that HE-4 level was an independent predictor for distinguishing benign and malignant ovarian tumors. Unlike previous studies (33, 37), the level of CA-125 and HE4 were evaluated simultaneously in our study. This study showed that only the HE-4 level could be used as an independent predictor. This may be because the level of CA-125 is slightly elevated in some patients with benign ovarian tumors (38), which decreases its specificity.

The previous studies used 2D radiomic features to distinguish ovarian tumors (28, 33). 3D segmentation method was utilized to segment lesions in this study. 3D segmentation is better in capturing full information about the whole tumor and more realistically to reflect the high heterogeneity of ovarian tumors. Liu et al. have shown that 3D segmentation has better performance than 2D segmentation in distinguishing ovarian borderline tumors and epithelial cancers (39). Several previous studies have also demonstrated that radiomic features based on 3D segmentation are preferably repeatable (40) and more insensitive to manual segmentation variability (41). Furthermore, our study chose the portal venous phase CT images to extract radiomic features and demonstrated that they had favorable predictive performance. An et al. have determined the histological subtype classification in epithelial ovarian carcinoma using CT texture features extracted from the portal venous phase of contrast-enhanced CT images (42). Yu et al. compared the diagnostic performance of radiomic features extracted from different phases for discriminating between serous borderline malignant ovarian tumors, and the result indicated that the portal venous phase model showed a good predictive performance with a relatively higher AUC (43). These findings and our results may provide an idea for conducting extraction of radiomic features in future studies. In addition, this study presented an excellent diagnostic performance for categorizing ovarian tumors on seven different CT scanners, which indicated that such a heterogeneous dataset may compensate for the limitation of no external validation.

This study has some limitations. First, it was a retrospective study in a single center, which may cause a selection bias. External multi-center validation in a larger cohort is needed to verify the generalization of our model in the future. Second, our study was limited in sample size for some subgroups of ovarian tumors because of the rarity of diseases. In the future study, we will increase the sample size to validate our predictive model. Third, different feature selection methods were not compared. Finally, the VOIs were drawn manually, and the influence of subjective factors cannot be avoided completely. In the future study, semi-automatic segmentation or automatic segmentation will be used.

Conclusion

In conclusion, this study indicated that machine learning based on CT radiomic analysis could be applied to classify benign and malignant ovarian tumors. The mixed model with MLP classifier held the best diagnostic performance, which may be used as a convenient and accurate tool in clinical settings to identify and distinguish benign and malignant ovarian tumors.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving human participants were reviewed and approved by Tianjin Medical University Cancer Institute and Hospital. Written informed consent from the participants’ legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

All the authors were involved in this study and approved the final manuscript. The individual contributions are listed below: JL: participated in data collection and analysis and drafted the manuscript. TZ: participated in data analysis and model building. JM: collected the data and performed statistical analysis. NZ: performed statistical analysis and model building. ZZ and ZY: designed the research and edited the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Chinese National Key Research and Development Project (Grant Nos. 2021YFC2500400 and 2021YFC2500402), the National Natural Science Foundation of China (Grant Nos. 82071907, 81301217, and 81301202), Tianjin Health Science and Technology Project (Grant No. MS20022), Natural Science Foundation of Tianjin (Grant Nos. 18JCYBJC25100 and 18JCQNJC80200), and Wu Jieping Medical Foundation—Special Fund for Clinical Research (320.6750.2022-3-5).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.934735/full#supplementary-material

References

1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA Cancer J Clin (2021) 71:7–33. doi: 10.3322/caac.21654

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Lheureux S, Braunstein M, Oza AM. Epithelial ovarian cancer: Evolution of management in the era of precision medicine. CA Cancer J Clin (2019) 69:280–304. doi: 10.3322/caac.21559

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Wolfman W, Thurston J, Yeung G, Glanc P. Guideline no. 404: initial investigation and management of benign ovarian masses. J Obstet Gynaecol Can (2020) 42:1040–50.e1041. doi: 10.1016/j.jogc.2020.01.014

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Froyman W, Landolfo C, De Cock B, Wynants L, Sladkevicius P, Testa AC, et al. Risk of complications in patients with conservatively managed ovarian tumours (IOTA5): a 2-year interim analysis of a multicentre, prospective, cohort study. Lancet Oncol (2019) 20:448–58. doi: 10.1016/s1470-2045(18)30837-4

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Forstner R, Thomassin-Naggara I, Cunha TM, Kinkel K, Masselli G, Kubik-Huch R, et al. ESUR recommendations for MR imaging of the sonographically indeterminate adnexal mass: an update. Eur Radiol (2017) 27:2248–57. doi: 10.1007/s00330-016-4600-3

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Forstner R, Sala E, Kinkel K, Spencer JA. ESUR guidelines: ovarian cancer staging and follow-up. Eur Radiol (2010) 20:2773–80. doi: 10.1007/s00330-010-1886-4

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology (2016) 278:563–77. doi: 10.1148/radiol.2015151169

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Liu Z, Wang S, Dong D, Wei J, Fang C, Zhou X, et al. The applications of radiomics in precision diagnosis and treatment of oncology: Opportunities and challenges. Theranostics (2019) 9:1303–22. doi: 10.7150/thno.30309

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Hinzpeter R, Baumann L, Guggenberger R, Huellner M, Alkadhi H, Baessler B. Radiomics for detecting prostate cancer bone metastases invisible in CT: a proof-of-concept study. Eur Radiol (2022) 32:1823–32. doi: 10.1007/s00330-021-08245-6

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Tobaly D, Santinha J, Sartoris R, Dioguardi Burgio M, Matos C, Cros J, et al. CT-based radiomics analysis to predict malignancy in patients with intraductal papillary mucinous neoplasm (ipmn) of the pancreas. Cancers (Basel) (2020) 12:3089. doi: 10.3390/cancers12113089

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Yap FY, Varghese BA, Cen SY, Hwang DH, Lei X, Desai B, et al. Shape and texture-based radiomics signature on CT effectively discriminates benign from malignant renal masses. Eur Radiol (2021) 31:1011–21. doi: 10.1007/s00330-020-07158-0

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Zheng YM, Xu WJ, Hao DP, Liu XJ, Gao CP, Tang GZ, et al. A CT-based radiomics nomogram for differentiation of lympho-associated benign and malignant lesions of the parotid gland. Eur Radiol (2021) 31:2886–95. doi: 10.1007/s00330-020-07421-4

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Sun W, Liu S, Guo J, Liu S, Hao D, Hou F, et al. A CT-based radiomics nomogram for distinguishing between benign and malignant bone tumours. Cancer Imaging (2021) 21:20. doi: 10.1186/s40644-021-00387-6

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Miao R, Badger TC, Groesch K, Diaz-Sylvester PL, Wilson T, Ghareeb A, et al. Assessment of peritoneal microbial features and tumor marker levels as potential diagnostic tools for ovarian cancer. PloS One (2020) 15:e0227707. doi: 10.1371/journal.pone.0227707

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Zhang Y, Zhang B, Liang F, Liang S, Zhang Y, Yan P, et al. Radiomics features on non-contrast-enhanced CT scan can precisely classify AVM-related hematomas from other spontaneous intraparenchymal hematoma types. Eur Radiol (2019) 29:2157–65. doi: 10.1007/s00330-018-5747-x

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Qian Z, Li Y, Wang Y, Li L, Li R, Wang K, et al. Differentiation of glioblastoma from solitary brain metastases using radiomic machine-learning classifiers. Cancer Lett (2019) 451:128–35. doi: 10.1016/j.canlet.2019.02.054

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Xie T, Wang Z, Zhao Q, Bai Q, Zhou X, Gu Y, et al. Machine learning-based analysis of mr multiparametric radiomics for the subtype classification of breast cancer. Front Oncol (2019) 9:505. doi: 10.3389/fonc.2019.00505

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Liu X, Khalvati F, Namdar K, Fischer S, Lewis S, Taouli B, et al. Can machine learning radiomics provide pre-operative differentiation of combined hepatocellular cholangiocarcinoma from hepatocellular carcinoma and cholangiocarcinoma to inform optimal treatment planning? Eur Radiol (2021) 31:244–55. doi: 10.1007/s00330-020-07119-7

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Kim B, Park Y, Kim B, Ahn HJ, Lee KA, Chung JE, et al. Diagnostic performance of CA 125, HE4, and risk of ovarian malignancy algorithm for ovarian cancer. J Clin Lab Anal (2019) 33:e22624. doi: 10.1002/jcla.22624

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Janssen CL, Littooij AS, Fiocco M, Huige JCB, de Krijger RR, Hulsker CCC, et al. The diagnostic value of magnetic resonance imaging in differentiating benign and malignant pediatric ovarian tumors. Pediatr Radiol (2021) 51:427–34. doi: 10.1007/s00247-020-04871-2

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Huang H, Li YJ, Lan CY, Huang QD, Feng YL, Huang YW, et al. Clinical significance of ascites in epithelial ovarian cancer. Neoplasma (2013) 60:546–52. doi: 10.4149/neo_2013_071

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Shu Z, Mao D, Song Q, Xu Y, Pang P, Zhang Y. Multiparameter MRI-based radiomics for preoperative prediction of extramural venous invasion in rectal cancer. Eur Radiol (2022) 32:1002–13. doi: 10.1007/s00330-021-08242-9

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Parmar C, Grossmann P, Rietveld D, Rietbergen MM, Lambin P, Aerts HJ. Radiomic machine-learning classifiers for prognostic biomarkers of head and neck cancer. Front Oncol (2015) 5:272. doi: 10.3389/fonc.2015.00272

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Wong VK, Kundra V. Performance of o-rads mri score for classifying indeterminate adnexal masses at us. Radiol Imaging Cancer (2021) 3:e219008. doi: 10.1148/rycan.2021219008

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Nazari M, Shiri I, Zaidi H. Radiomics-based machine learning model to predict risk of death within 5-years in clear cell renal cell carcinoma patients. Comput Biol Med (2021) 129:104135. doi: 10.1016/j.compbiomed.2020.104135

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Yin P, Mao N, Zhao C, Wu J, Sun C, Chen L, et al. Comparison of radiomics machine-learning classifiers and feature selection for differentiation of sacral chordoma and sacral giant cell tumour based on 3D computed tomography features. Eur Radiol (2019) 29:1841–7. doi: 10.1007/s00330-018-5730-6

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Chiappa V, Bogani G, Interlenghi M, Salvatore C, Bertolina F, Sarpietro G, et al. The adoption of radiomics and machine learning improves the diagnostic processes of women with ovarian masses (the aroma pilot study). J Ultrasound (2021) 24:429–37. doi: 10.1007/s40477-020-00503-5

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Zhang H, Mao Y, Chen X, Wu G, Liu X, Zhang P, et al. Magnetic resonance imaging radiomics in categorizing ovarian masses and predicting clinical outcome: a preliminary study. Eur Radiol (2019) 29:3358–71. doi: 10.1007/s00330-019-06124-9

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Nakagawa M, Nakaura T, Namimoto T, Kitajima M, Uetani H, Tateishi M, et al. Machine learning based on multi-parametric magnetic resonance imaging to differentiate glioblastoma multiforme from primary cerebral nervous system lymphoma. Eur J Radiol (2018) 108:147–54. doi: 10.1016/j.ejrad.2018.09.017

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Hamerla G, Meyer HJ, Schob S, Ginat DT, Altman A, Lim T, et al. Comparison of machine learning classifiers for differentiation of grade 1 from higher gradings in meningioma: A multicenter radiomics study. Magn Reson Imaging (2019) 63:244–9. doi: 10.1016/j.mri.2019.08.011

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Fleury E, Marcomini K. Performance of machine learning software to classify breast lesions using BI-RADS radiomic features on ultrasound images. Eur Radiol Exp (2019) 3:34. doi: 10.1186/s41747-019-0112-7

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Nagawa K, Suzuki M, Yamamoto Y, Inoue K, Kozawa E, Mimura T, et al. Texture analysis of muscle MRI: machine learning-based classifications in idiopathic inflammatory myopathies. Sci Rep (2021) 11:9821. doi: 10.1038/s41598-021-89311-3

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Li S, Liu J, Xiong Y, Pang P, Lei P, Zou H, et al. A radiomics approach for automated diagnosis of ovarian neoplasm malignancy in computed tomography. Sci Rep (2021) 11:8730. doi: 10.1038/s41598-021-87775-x

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Pan S, Ding Z, Zhang L, Ruan M, Shan Y, Deng M, et al. A nomogram combined radiomic and semantic features as imaging biomarker for classification of ovarian cystadenomas. Front Oncol (2020) 10:895. doi: 10.3389/fonc.2020.00895

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Wang R, Cai Y, Lee IK, Hu R, Purkayastha S, Pan I, et al. Evaluation of a convolutional neural network for ovarian tumor differentiation based on magnetic resonance imaging. Eur Radiol (2021) 31:4960–71. doi: 10.1007/s00330-020-07266-x

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hu Y, Xie C, Yang H, Ho JWK, Wen J, Han L, et al. Computed tomography-based deep-learning prediction of neoadjuvant chemoradiotherapy treatment response in esophageal squamous cell carcinoma. Radiother Oncol (2021) 154:6–13. doi: 10.1016/j.radonc.2020.09.014

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Qi L, Chen D, Li C, Li J, Wang J, Zhang C, et al. Diagnosis of ovarian neoplasms using nomogram in combination with ultrasound image-based radiomics signature and clinical factors. Front Genet (2021) 12:753948. doi: 10.3389/fgene.2021.753948

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Aslan K, Onan MA, Yilmaz C, Bukan N, Erdem M. Comparison of HE 4, CA 125, ROMA score and ultrasound score in the differential diagnosis of ovarian masses. J Gynecol Obstet Hum Reprod (2020) 49:101713. doi: 10.1016/j.jogoh.2020.101713

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Liu X, Wang T, Zhang G, Hua K, Jiang H, Duan S, et al. The two-dimensional and three-dimensional T2 weighted imaging-based radiomic signatures selected for the preoperative discrimination of ovarian borderline tumors and epithelial cancer. J Ovarian Res (2022) 15:22. doi: 10.1186/s13048-022-00943-z

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Xie XJ, Liu SY, Chen JY, Zhao Y, Jiang J, Wu L, et al. Development of unenhanced CT-based imaging signature for BAP1 mutation status prediction in malignant pleural mesothelioma: Consideration of 2D and 3D segmentation. Lung Cancer (2021) 157:30–9. doi: 10.1016/j.lungcan.2021.04.023

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Kocak B, Durmaz ES, Kaya OK, Ates E, Kilickesmez O. Reliability of single-slice-based 2d ct texture analysis of renal masses: influence of intra- and interobserver manual segmentation variability on radiomic feature reproducibility. AJR Am J Roentgenol (2019) 213:377–83. doi: 10.2214/ajr.19.21212

PubMed Abstract | CrossRef Full Text | Google Scholar

42. An H, Wang Y, Wong EMF, Lyu S, Han L, Perucho JAU, et al. CT texture analysis in histological classification of epithelial ovarian carcinoma. Eur Radiol (2021) 31:5050–8. doi: 10.1007/s00330-020-07565-3

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Yu XP, Wang L, Yu HY, Zou YW, Wang C, Jiao JW, et al. MDCT-based radiomics features for the differentiation of serous borderline ovarian tumors and serous malignant ovarian tumors. Cancer Manag Res (2021) 13:329–36. doi: 10.2147/cmar.S284220

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: radiomics, ovarian neoplasms, computed tomography, machine learning, classification

Citation: Li J, Zhang T, Ma J, Zhang N, Zhang Z and Ye Z (2022) Machine-learning-based contrast-enhanced computed tomography radiomic analysis for categorization of ovarian tumors. Front. Oncol. 12:934735. doi: 10.3389/fonc.2022.934735

Received: 03 May 2022; Accepted: 16 June 2022;
Published: 09 August 2022.

Edited by:

Le Lu, Alibaba DAMO Academy US, United States

Reviewed by:

Jingjing Lu, Beijing United Family Hospital, China
Fakai Wang, University of Maryland, College Park, United States

Copyright © 2022 Li, Zhang, Ma, Zhang, Zhang and Ye. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhaoxiang Ye, zye@tmu.edu.cn; Zhang Zhang, filea1249@sina.com

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.