WOAENet: a whale optimization-guided ensemble deep learning with soft voting for uterine cancer diagnosis based on MRI images

Altal, Omar F.; Sindiani, Amer Mahmoud; Mhanna, Hamad Yahia Abu; Alhatamleh, Salem; Amin, Mohammad; Akhdar, Hanan Fawaz; Madain, Rola; Alqasem, Noor; Zayed, Faheem; Alanazi, Sitah; Sandougah, Kholoud J.

doi:10.3389/frai.2025.1664201

ORIGINAL RESEARCH article

Front. Artif. Intell., 20 October 2025

Sec. Medicine and Public Health

Volume 8 - 2025 | https://doi.org/10.3389/frai.2025.1664201

WOAENet: a whale optimization-guided ensemble deep learning with soft voting for uterine cancer diagnosis based on MRI images

Omar F. Altal¹

Amer Mahmoud Sindiani¹

Hamad Yahia Abu Mhanna²

Salem Alhatamleh³

Mohammad Amin³

Hanan Fawaz Akhdar⁴

Rola Madain¹

Noor Alqasem⁵

Faheem Zayed⁶

Sitah Alanazi⁴

Kholoud J. Sandougah⁷^*

¹Department of Obstetrics and Gynecology, Faculty of Medicine, Jordan University of Science and Technology, Irbid, Jordan
²Department of Medical Imaging, Faculty of Allied Medical Sciences, Isra University, Amman, Jordan
³Computer Science Department, Faculty of Information Technology and Computer Sciences, Yarmouk University, Irbid, Jordan
⁴Physics Department, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
⁵Department of Biomedical Systems and Informatics Engineering, Hijjawi Faculty for Engineering Technology, Yarmouk University, Irbid, Jordan
⁶Department of Obstetrics and Gynecology, Specialty Hospital, Irbid, Jordan
⁷Department of Radiology, College of Medicine, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia

Objectives: Uterine cancer originates from the cells lining the uterus and can develop through abnormal cell growth, potentially leading to damage in surrounding tissues and the formation of precancerous cells. Early detection significantly improves prognosis. Despite advancements in deep learning-based diagnostic methods, challenges remain, including the dependence on expert input and the need for more accurate classification models. This study aims to address these limitations by proposing a novel and efficient methodology for diagnosing uterine cancer using an integrated deep learning pipeline optimized through a nature-inspired algorithm.

Methods: This study introduces the Whale Optimization Algorithm-based Ensemble Network (WOAENet), a deep learning pipeline that classifies uterine MRI into three classes: malignant, benign, and normal. The Whale Optimization Algorithm (WOA) is used to fine-tune the hyperparameters of three deep learning models: MobileNetV2, DenseNet121, and a lightweight vision model (LVM). Each model is trained with its optimized settings, and its outputs are combined using a Soft Voting Ensemble method that calculates the average of the predicted probabilities to arrive at the final classification.

Results: The WOAENet framework was evaluated using a uterine cancer MRI dataset obtained from King Abdullah University Hospital. Our proposed model outperformed standard pre-trained models across several performance metrics. It achieved an accuracy of 88.57%, a specificity of 94.29%, and an F1 score of 88.54%, indicating superior performance in diagnosing uterine cancer.

Conclusion: WOAENet demonstrates a high level of accuracy and reliability in classifying uterine MRI images, marking a significant advancement by utilizing a novel dataset. The findings support the potential of AI-driven approaches in enhancing the diagnosis and treatment of gynecological conditions, paving the way for more accessible and accurate clinical tools.

1 Introduction

One of the most common tumors of the female reproductive system is uterine cancer. It is brought on by the uterine lining’s abnormal cell proliferation, which damages the surrounding tissue as the cells divide (Esmaeilzadeh and Nasirzadeh, 2023). While uterine cancer is less common in Africa and Asia, it is more common in the Americas and Europe. Experts attribute this to environmental risk factors and obesity. Women between the ages of 40 and 60 are more likely to have the illness (Felix and Brinton, 2018). It is the fifteenth most common type of cancer in the general population (Hamoud et al., 2023). The most studied symptom of uterine cancer is abnormal uterine bleeding in premenopausal, postmenopausal, and perimenopausal women (Boeckstaens et al., 2020). Obesity is a significant risk factor for uterine cancer, with women who are overweight or obese being two to four times more likely to develop endometrial cancer compared to women with a lower body mass index (Somasegar et al., 2023).

The advancement of uterine cancer may make diagnosis and treatment more difficult, which could result in a poor prognosis. Tumor staging is therefore essential. There are three stages of this disease: low, intermediate, and high risk. Like other cancers, uterine cancer must be detected early (Dong et al., 2020). Based on their morphological and functional characteristics, analysis that depends on diagnostic accuracy can help classify tissues as either malignant or non-malignant (Keall et al., 2022). In modern clinical practice, magnetic resonance imaging (MRI) is frequently used for several purposes, such as the clinical staging of malignant tumors and the differentiation of benign from malignant gynecological problems (Lu and Broaddus, 2020). The primary method for determining the anatomical origin of uterine cancer is magnetic resonance imaging. MRI is necessary to differentiate between endometrial and cervical sources of uterine tumors when clinical and histological tests are not feasible (Gui et al., 2022). Ultrasonography is increasingly used to evaluate tissue elasticity to diagnose and treat clinical uterine cancers and other issues (Wang et al., 2022).

In recent years, artificial intelligence (AI) has been increasingly applied in medicine, particularly for diagnosis (Akazawa and Hashimoto, 2021). Deep learning and image processing techniques are utilized to improve the early detection of uterine cancer and determine if it is benign, malignant, or subclassified. Eventually, this will lead to stronger treatments that save lives (Maheswari et al., 2024). Convolutional neural network (CNN)-based deep learning approaches, also known as deep CNNs (DCNN), have recently produced impressive results in picture pattern recognition (Lundervold and Lundervold, 2019). Deep learning has been applied to a variety of computer vision applications, including segmentation (Soffer et al., 2019), classification (Fujioka et al., 2020), and lesion detection (Shin et al., 2016).

The use of MRI-based AI in gynecology for uterine cancer detection has not been adequately documented. Furthermore, some studies use a limited approach to tumor diagnosis. Therefore, this study presents an effective method that combines deep learning models and uses the WOA to optimize them. By automatically selecting the optimal set of hyperparameters, including learning rate, batch size, number of units in dense layers, and dropout rate, this method improves the performance of deep learning models. The main contributions of this paper are summarized as follows: This paper introduces a new integrated deep learning pipeline called WOAENet, which leverages the WOA for uterine cancer diagnosis.

• The outputs of the optimized models are combined using a Soft Voting Ensemble strategy, which increases classification robustness and accuracy by averaging the predicted probabilities.

• WOAENet was trained and evaluated on a new uterine MRI dataset from King Abdullah University Hospital, achieving superior performance (88.57% accuracy, 94.29% specificity, and 88.54% F1 score) compared to previously trained models.

• This study pioneers the use of deep learning to classify uterine tumors using a new dataset, highlighting the potential of artificial intelligence to improve the diagnosis and treatment of gynecological diseases.

The techniques employed, a thorough explanation of the dataset, the suggested research approach, and training regimens are all covered in Section 2. The data are analyzed, and the effectiveness of the suggested model in uterine cancer diagnostic tests is assessed in Section 3. The most significant studies in the diagnosis of uterine cancer are covered in Section 4, and Section 5 ends with crucial conclusions and recommendations for further study.

2 Materials and methods

The study follows a complete deep learning pipeline-based framework, WOAENet (Whale Optimization Algorithm-based Ensemble Network), optimized by the WOA algorithm for uterus image classification. As shown in Figure 1, three candidate models were built: MobileNetV2, DenseNet121, and a custom lightweight vision model (LVM). Each model contained tunable hyperparameters, such as learning rate, dropout rate, dense units, weight decay, activation function, and optimizer type. Each model contained tunable hyperparameters such as learning rate, dropout rate, dense units, weight decay, activation function, and optimizer type. All these components were controlled and fine-tuned via hyperparameters optimized using the Whale Optimization Algorithm (WOA), which was employed to minimize the validation loss by conducting an iterative population-based search over an 11-dimensional normalized hyperparameter space. The search was stopped at a small number of iterations and whales to maintain a balance between efficiency and performance. All the models were trained on a uterus MRI dataset. For performance evaluation, during WOA optimization, the candidate models were trained for only a few epochs, followed by further training of all candidates based on the very best hyperparameters found. After completing training, WOAENet applied a soft voting ensemble scheme to average class probabilities predicted by all individual models to improve robustness and classification accuracy.

Figure 1

Flowchart depicting a machine learning pipeline for medical imaging analysis. It follows four stages: Dataset acquisition and preprocessing, Model selection and modeling, Ensemble learning model construction, and Model testing and evaluation. Workflow includes dataset splitting, preprocessing, hyperparameter optimization using Whale Optimization Algorithm, model training with MobileNetV2, DenseNet121, and LVM, leading to a voting classifier and evaluation. MRI images are displayed representing the dataset.

Figure 1. Overview of the WOAENet framework for uterine cancer MRI image classification using optimized ensemble deep learning.

2.1 Data acquisition

This study was approved by the Institutional Review Board (IRB) at King Abdullah University Hospital, Jordan University of Science and Technology (JUST). Radiologists retrospectively diagnosed patients using MRI data collected over 4 years, from early 2020 to early November 2024. Image extraction and dataset assembly were finalized during the data collection window between December 2024 and March 2025, during which anonymized and pre-evaluated images were organized into a structured dataset. The dataset comprises 1,814 MRI images collected from 450 female patients, aged between 18 and 85 years. The cases were classified into three diagnostic categories: normal, benign, and malignant, with each case represented by three imaging planes—sagittal, coronal, and axial. All images were acquired using the Ingenia Ambition 1.5 T Sand MRI scanner and exported in JPG format at a standard resolution of 720 × 720 pixels. To ensure the accuracy of the classification, KAUH obstetrics and gynecology physicians independently reviewed the imaging data. Table 1 shows the distribution of the KAUH-UCM dataset, with a representative sample from each group shown in Figure 2.

Table 1

Table 1. The quantity and distribution of images in each KAUH-UCM category.

Figure 2

White rectangular blank image with no visible content or elements.

Figure 2. An example from the KAUH-UCM image dataset.

2.2 Preprocessing

In medical image analysis, and specifically in uterine cancer diagnosis via MRI, pre-processing holds paramount importance in establishing a strong foundation that will directly influence the accuracy and robustness of the classification models. This phase involves working on the input image resolution, encoding class labels, and stratified data splits to preserve class balance. To achieve class balance (699 images per class), we applied targeted data augmentation techniques—including shear, zoom, and horizontal flip—only to the training set. This was done to synthetically increase samples in underrepresented classes (normal and malignant) without duplicating existing images, avoiding potential overfitting, with 699 for each class: normal, benign, and malignant. To prevent data leakage and ensure generalizability, the data splitting was performed at the patient level. MRI images from different scanners and clinical sites are variable in size, resolution, and intensity profile (Moradmand et al., 2020). To standardize the inputs of a CNN, all images are resized to 224 × 224 fixed pixels compatible with popular pre-trained architectures such as MobileNetV2 and DenseNet121. The resizing operation is expressed in the following, as shown in Equation 1:

\begin{array}{l} \forall x \in X, x \to Resize (x, 224 \times 224) & (1) \end{array}

Beyond resizing, pixel intensity values are normalized to the range $[0, 1]$ by scaling all pixel values with a factor of $\frac{1}{255}$ , this normalization ensures numerical stability during training and helps the optimization algorithms converge more efficiently (Li et al., 2021), as shown in Equation 2:

\begin{array}{l} x_{normalized} = \frac{x_{raw}}{255} & (2) \end{array}

Overfitting reduction and increased model robustness through major data augmentation are situated within real-time training-oriented interventions considered under Keras’ Image Data Generator (Um et al., 2019). The arguments applied to the augmentations include a Shearing Transformation with shear intensity limited to 0.2 to mimic minor affine distortions, slight irregularities or imperfections in shape, Random Zooming while zooming on random areas of the image to help the model identify localized tumor features at varying scales, Horizontal Flipping: allowing random horizontal flips, whereby the model learns spatially invariant features, i.e., recognizing patterns that are symmetric to one another. Essentially, these augmentations generate great diversity and variability for the training data and enhance the generalization of the models to unseen cases. With the three categories for each MRI scan class being clinically relevant, normal, benign tumor, or malignant tumor, the categorical labels are transformed into integer indices for the model, as shown in Equation 3:

Label_Index (y) = {\begin{array}{l} 0, & if y = Normal \\ 1, & if y = Benign \\ 2, & if y = Malignant \end{array} (3)

Such encoding permits the network to consider labels as numerical tensors during training and evaluation. To avoid systematic biases and guarantee balance of representations across classes, the entire dataset is divided into train, validation, and test subsets while keeping the same class distributions (stratified splitting). The training set, $D_{train}$ , is formed from 80% of the full data. The validation set, $D_{val}$ comprises 10%. Test set, $D_{test}$ comprises 10%. If $N_{k}$ is the number of samples in class $k$ , the split obeys the, as shown in Equation 4, 5:

\begin{array}{l} D = {(x_{i}, y_{i})}_{i = 1}^{N} & (4) \end{array}

\begin{array}{l} ∣ D_{train}^{k} ∣ = 0.8 N_{k}, ∣ D_{val}^{k} ∣ = 0.1 N_{k}, ∣ D_{test}^{k} ∣ = 0.1 N_{k} & (5) \end{array}

Each subset is therefore a true representation of the entire dataset, ensuring no bias is created for the majority classes and thus trustworthy evaluation metrics. Preprocessing for uterine tumor MRI images consists of uniform resizing, pixel normalization, advanced data augmentation, error-free label encoding, and balanced data splitting. Together, these steps enrich model stability, improve generalization, and build a strong footing toward downstream classification tasks.

2.3 Whale optimization algorithm (WOA) for hyperparameter tuning

The Whale Optimization Algorithm (WOA) is used as a nature-inspired metaheuristic optimization tool within this study for the automated hyperparameter tuning of deep networks classifying uterine tumors from MRI images (Mirjalili and Lewis, 2016). Accurate classification depends not only on very high-capacity models, but also, more importantly, on the hyperparameter choices such as learning rates, batch sizes, regularization strengths, dropout rates, and architectural parameters such as the number of convolutional filters or dense units (Brodzicki et al., 2021). These hyperparameters greatly influence the model’s generalization ability, especially when competing with complicated, high-dimensional medical imaging data, such as MRI scans.

Traditional tools such as grid-search methods or manual examinations have become almost impossible in this modern context, simply due to prohibitive computational costs plaguing them (Brodzicki et al., 2021). WOA, thus, solves the problem by conjecturing therein an intelligent exploration of the high-dimensional parameter space, with the behavior of humpback whales searching for food in the natural world being the inspiration.

2.3.1 Motivation for metaheuristic-based optimization

In MRI tumor classification, a series of challenges are faced: high variability in anatomical structures, a limited dataset, and a need to generalize models strongly. In the application concerned, the hyperparameter shows a non-linear correlation with performance, interdependence, which renders brute force methods practically helpless (Nadimi-Shahraki et al., 2021). Thus, metaheuristic algorithms like the WOA are fitted with respect to ensure the escape from local minima and to perform efficient global search without the need to keep track of gradient information or convex assumptions.

2.3.2 Mathematical modeling of WOA

The Whale Optimization Algorithm (WOA) simulates the bubble-net hunting strategy of humpback whales and involves three primary mechanisms: to encircle the prey (Rana et al., 2020), to bubble-net attack (exploit), and to search for prey (explore). Here, the solution space is defined by $R^{d}$ , where d is the number of hyperparameters needing optimization. Each whale in the population stands for a possible solution vector $\vec{X} \in R^{d}$ .

Encircling Prey (Exploitation), the whales regard the current best solution as the prey and update their positions accordingly (Nadimi-Shahraki et al., 2023). Where ${\vec{X}}^{*}$ is the position of the best solution obtained so far. $\vec{X} (t)$ is the current location of the whale. $\vec{A} = 2 a \cdot {\vec{r}}_{1} - a$ , $\vec{C} = 2 \cdot {\vec{r}}_{2}$ are coefficient vectors, $a$ is a linearly decreased factor from 2 to 0 throughout iterations, and ${\vec{r}}_{1}, {\vec{r}}_{2} \sim U (0, 1)$ are random vectors, as shown in Equations 6, 7. The mechanism traverses toward intensification (local search) as the whales try to move toward the best solution.

\begin{array}{l} \vec{D} = ∣ \vec{C} \cdot {\vec{X}}^{*} - \vec{X} (t) ∣ & (6) \end{array}

\begin{array}{l} \vec{X} (t + 1) = {\vec{X}}^{*} - \vec{A} \cdot \vec{D} & (7) \end{array}

Bubble-Net Attacking Strategy (Exploitation) this simulates the spiral-shaped bubble-net behavior. Where $b$ is a constant defining the logarithmic spiral shape (commonly set to 1), $l \sim U [- 1, 1]$ is a random number. The algorithm stochastically chooses between spiral update and encircling with probability $p \in [0, 1]$ , as shown in Equation 8. This probabilistic behavior enhances the search diversity and mimics the natural behavior of whales, varying between exploration and exploitation.

\begin{array}{l} \vec{X} (t + 1) = \vec{D} \cdot e^{bl} \cdot cos (2 πl) + {\vec{X}}^{*} & (8) \end{array}

Searching for Prey (Exploration) if $∣ \vec{A} ∣ \geq 1$ , the whale randomly chooses another whale and updates its position. Here, ${\vec{X}}_{rand}$ is a randomly selected whale (Abualigah et al., 2024). This mechanism ensures the exploration of the global search space to avoid premature convergence, as shown in Equations 9, 10:

\begin{array}{l} \vec{D} = ∣ \vec{C} \cdot {\vec{X}}_{rand} - \vec{X} (t) ∣ & (9) \end{array}

\begin{array}{l} \vec{X} (t + 1) = {\vec{X}}_{rand} - \vec{A} \cdot \vec{D} & (10) \end{array}

2.3.3 Fitness function for hyperparameter optimization

The fitness of each whale (candidate hyperparameter set) is evaluated using a partial training strategy (Pham et al., 2020), where a deep learning model (MobileNetV2, DenseNet121, or LVM) is trained for a limited number of epochs 10 epochs, and the validation loss is recorded as the objective function (Mohammed et al., 2019). Where $\vec{X} \in R^{11}$ is the hyperparameter vector, $L_{val}$ is the validation loss, $θ$ are the model parameters, as shown in Equation 11. This formulation allows WOA to identify hyperparameter configurations that minimize validation loss and hence maximize generalization on unseen MRI scans.

\begin{array}{l} Fitness (\vec{X}) = L_{val} (θ; \vec{X}) & (11) \end{array}

2.3.4 Parameter encoding and normalization

Each dimension in the whale’s position vector $\vec{X}$ corresponds to a hyperparameter. To ensure scalability and uniformity in the search, all hyperparameters are normalized to $[0, 1]$ and decoded during evaluation. For example Nadimi-Shahraki et al. (2023), such normalization allows WOA to operate uniformly across parameters with different physical scales and types, as shown in Equations 12–14:

Learning rate:

\begin{array}{l} lr = 10^{- 5 + 3 x_{0}} & (12) \end{array}

Dropout:

\begin{array}{l} dr = 0.1 + 0.4 x_{2} & (13) \end{array}

Dense units:

\begin{array}{l} du = 64 + 448 x_{3} & (14) \end{array}

To appropriately tune the deep learning architectures for uterine tumor classification from MRI images, the WOA was used to perform a search on a multidimensional hyperparameter space. Table 2 summarizes the hyperparameters that were optimized with WOA, along with their corresponding search ranges and encoding strategies. This set includes learning rate, batch size, dropout rate, and number of dense units, along with convolutional filters (specific to the LVM model), as well as categorical variables such as optimizer and activation function (Aljarah et al., 2018). The normalized search spaces were mapped onto [0,1] and decoded accordingly during every fitness evaluation to allow thorough investigation of the configuration landscape.

Table 2

Table 2. Hyperparameter search space for WOA-based deep learning optimization.

2.3.5 Computational efficiency and convergence

Since deep learning algorithms have more computational requirements, the number of whales (2–5) and iterations (5–10) were selected based on preliminary trials that aimed to minimize computational overhead while maintaining classification performance. These values proved sufficient for stable convergence due to the limited feature dimensionality and the pre-trained nature of the backbone network (Nadimi-Shahraki et al., 2023). The optimization process terminated either when the maximum number of iterations was reached or when no improvement in validation loss was observed for several consecutive iterations (early stopping). WOA stays efficient due to its good exploration and exploitation balance and its applicability to non-differentiable and noisy objective spaces, which generally characterize deep learning hyperparameter landscapes. Convergence behavior is monitored through a fitness curve. Here, $T$ represents the total number of iterations, as shown in Equation 15. Such a curve provides insights into the sequence of optimization steps taken and how stable the search process has been.

\begin{array}{l} Convergence Curve = {[min_{i} Fitness ({\vec{X}}_{i}^{(t)})]}_{t = 1}^{T} & (15) \end{array}

Integrating WOA into the hyperparameter tuning pipeline turned out to be a more scalable and flexible system capable of autogenerating optimized deep learning models for the classification of uterine tumors from MRI images (Liu and Zhang, 2022). By way of contrast with grid search, which is exhaustive, and manual trial-and-error methods, WOA substantially improves model performance while improving generalization and minimizing training costs. That is particularly important in medical imaging, where diagnosis may favor or disfavor a clinician depending on how brain power is expended. Figure 3 represents the basic steps of the Whale Optimization Algorithm (WOA).

Figure 3

Flowchart showing whale census preparation and optimization process. It starts with generating variables $p$ and $\vec{A}$, checking their values, then updating by specific equations based on conditions. It calculates whale fitness, selects the best whale, checks iteration loop completion, and ends if maximum loops are reached.

Figure 3. Flowchart of the whale optimization algorithm (WOA) for iterative solution refinement.

2.4 Model architectures

Three CNN architectures, MobileNetV2, DenseNet121, and an in-house Lightweight Vision Model (LVM), were adopted to classify uterine cancer based on MRI images. Each of the architectures is parameterized and dynamically instantiated based on hyperparameters tuned with the WOA, including the learning rate, dropout rate, dense units, activation functions, and regularization strength. Hence, the design allows for flexibility, scalability, and adaptability of the model to domain-specific data such as MRI scans, which require careful treatment of spatial and structural features.

2.4.1 MobileNetV2 model

MobileNetV2 is a lightweight deep convolutional neural network architecture specifically meant for efficiency on mobile and embedded platforms (Sandler et al., 2018), while maintaining a good level of performance on image classification problems. Considering uterine tumor classification with MRI images, MobileNetV2 thus stands as a strong backbone, for it seems to be the only one that balances the computational efficiency and representational power that are crucial in medical image analysis with constrained annotated data. MobileNetV2 improves upon its predecessor by introducing two key innovations: inverted residuals with linear bottlenecks and depth-wise separable convolutions. Each block in MobileNetV2 is defined by an inverted residual structure wherein the input and output are thin bottleneck layers, and the intermediate expansion layer is of high dimensionality. Hence, features are preserved at a low computational cost.

Let $x \in R^{H \times W \times C}$ be the input tensor, where $H$ , $W$ , and $C$ are the height, width, and number of channels, respectively. Each MobileNetV2 bottleneck block applies the following transformations: expansion (Pointwise Convolution) where $t$ is the expansion factor (typically $t = 6$ ) as Equation 16, depthwise convolution as in Equation 17, projection (Linear Pointwise Convolution) as Equation 18, and residual connection as Equation 19. This inverted residual block allows the network to maintain gradient flow, preserve spatial features, and reduce the number of parameters and operations (Dong et al., 2020).

\begin{array}{l} x_{1} = ReLU 6 (x * W_{1}), W_{1} \in R^{1 \times 1 \times C \times tC} & (16) \end{array}

\begin{array}{l} x_{2} = ReLU 6 (DWConv (x_{1})), \\ DWConv : separate kernel per channel \end{array} (17)

\begin{array}{l} x_{3} = x_{2} * W_{3}, W_{3} \in R^{1 \times 1 \times tC \times C'} & (18) \end{array}

\begin{array}{l} y = x + x_{3}, if stride = 1 and C = C' & (19) \end{array}

Model customization with WOA in this study, MobileNetV2 is used as a feature extractor by setting include_top = False and freezing the pretrained layers (weights initialized on ImageNet). The extracted features are passed through.

Global average pooling. This reduces each channel to a single value, lowering overfitting risk and model complexity, as shown in Equations 20, 21. Dense layer (WOA-optimized units $u \in [64, 512]$ ) where $ϕ$ is an activation function (ReLU, LeakyReLU, or ELU) selected by WOA. Dropout layer (WOA-optimized rate $d \in [0.1, 0.5]$ ) to reduce overfitting (Xiang et al., 2019), as shown in Equations 22, 23. Softmax classification layer where $K = 3$ is the number of tumor classes (Benign, Malignant, Normal).

\begin{array}{l} z = \frac{1}{H \cdot W} \sum_{i = 1}^{H} \sum_{j = 1}^{W} x_{i, j} & (20) \end{array}

\begin{array}{l} h = ϕ (Wz + b) & (21) \end{array}

\begin{array}{l} h' = Dropout (h, rate = d) & (22) \end{array}

\begin{array}{l} {\hat{y}}_{k} = \frac{e^{o_{k}}}{\sum_{j = 1}^{K} e^{o_{j}}}, for k = 1, 2, \dots, K & (23) \end{array}

Training configuration of the network is compiled using the Adam optimizer or alternatives (SGD, RMSprop) as determined by WOA. The loss function used is sparse categorical cross-entropy, where $y$ is the true class index. The learning rate, batch size, regularization weight decay, and other hyperparameters are dynamically chosen by the WOA metaheuristic, ensuring model robustness and optimal convergence during training. Figure 4 illustrates the working architecture of the model, as shown in Equation 24:

\begin{array}{l} L = - log ({\hat{y}}_{y}) & (24) \end{array}

Figure 4

Diagram of a neural network architecture for processing medical images. It starts with a dataset of MRI scans, proceeds through a Conv2D layer, followed by a series of BR-Blocks. The sequence continues with multiple convolutional layers, batch normalization, ReLU activation, and pooling, ending with a final output layer.

Figure 4. MobileNetV2 model structure.

2.4.2 DenseNet121 model

DenseNet121 is a Dense Connected Convolutional Network deep structural architecture designed for maximum feature reuse and rampant gradient propagation in disjoint data communities like uterine MRI (Swaminathan et al., 2021). In this work, DenseNet121 is used as the backbone network to extract high-level features that discriminate between benign, malignant, and normal uterine tumors for classification.

Contrary to other conventional CNN architectures, where each layer takes input only from its preceding layer, DenseNet connects each layer to all its preceding layers in a feed-forward fashion. This means the actual input to any layer $l$ consists of the feature maps of all preceding layers $x_{0}, x_{1}, \dots, x_{l - 1}$ from previous layers Hewlett-Packard (Liu et al., 2021). $H_{l} (\cdot)$ represents some composite function of operations of the form Batch Normalization → ReLU → Convolution. $[\cdot]$ indicates concatenation, rather than summation, as shown in Equation 25. By using dense connectivity, this strengthens the gradient flow through the network. Moreover, it encourages feature reuse, thereby cutting down on the total number of parameters. Thus, this also alleviates the problem of gradient vanishing, especially for very deep networks such as DenseNet121. It starts with a convolution and pooling layer. Then, four dense blocks with transition layers (1 × 1 conv + 2 × 2 average pooling) after each one. Finally, it uses global average pooling and a fully connected softmax layer. They are distributed among the four dense blocks as 6, 12, 24, and 16 layers, respectively.

\begin{array}{l} x_{l} = H_{l} ([x_{0}, x_{1}, \dots, x_{l - 1}]) & (25) \end{array}

For feature extraction and customization for this classification task, we utilize DenseNet121 pretrained on ImageNet as a frozen feature extractor (include_top = False). The last convolutional block output is passed through a GlobalAveragePooling2D layer, where $z_{c}$ is the pooled feature for channel $c$ , $x_{i, j, c}$ is the activation at spatial location $(i, j)$ in channel $c$ , $H$ and $W$ are height and width of the feature map (Uemura et al., 2020). This operation reduces spatial dimensions, producing a vector of size equal to the number of channels, improving generalization and reducing overfitting. This operation reduces spatial dimensions, producing a vector of size equal to the number of channels, improving generalization and reducing overfitting, as shown in Equation 26:

\begin{array}{l} z_{c} = \frac{1}{H \cdot W} \sum_{i = 1}^{H} \sum_{j = 1}^{W} x_{i, j, c} & (26) \end{array}

WOA-optimized classification heads the extracted features are passed through a classification head that is parameterized dynamically using WOA Dense Layer $ϕ$ activation function (ReLU, LeakyReLU, or ELU), $H_{1} \in R^{u \times d}$ : weight matrix (with $u \in [64, 512]$ ), $e$ pooled features from DenseNet backbone, as shown in Equations 27, 28. Dropout $p \in [0.1, 0.5]$ dropout rate optimized by WOA. Softmax Output Layer $K = 3$ number of uterine tumor classes and $o_{k}$ logit corresponding to class $k$ . Loss Function $y_{k}$ is the one-hot encoded true label (Zhang et al., 2019), as shown in Equations 29, 30. Optimizer chosen among {Adam, SGD, RMSprop} as per WOA-optimized index. Regularization L2 weight decay (search range $10^{- 6}$ to $10^{- 3}$ ) applied on trainable dense weights, as shown in Equation 31:

\begin{array}{l} h = ϕ (H_{1} e + b_{1}) & (27) \end{array}

\begin{array}{l} h' = Dropout (h, rate = p) & (28) \end{array}

\begin{array}{l} {\hat{y}}_{k} = \frac{exp (o_{k})}{\sum_{j = 1}^{K} exp (o_{j})} & (29) \end{array}

\begin{array}{l} L_{CE} = - \sum_{k = 1}^{K} y_{k} log ({\hat{y}}_{k}) & (30) \end{array}

\begin{array}{l} L_{total} = L_{CE} + λ ∥ W_{1} ∥_{2}^{2} & (31) \end{array}

DenseNet121 has many advantages when used in the classification of uterine tumors from MRI. Increased feature propagation enables improved encoding of tissue textures and the lesion border. Fewer parameters mean better training with the given medical data since it is fewer. WOA-based parametrization helps adapt the architecture, so it generalizes best for the dataset at hand. Figure 5 illustrates the working architecture of the model.

Figure 5

Diagram depicting a deep learning architecture for medical imaging. It starts with input images labeled

Figure 5. DenseNet121 model structure.

2.4.3 Lightweight vision model (LVM)

For scenarios with limited computing machines or smaller datasets that are mostly found in medical imaging facilities, this study introduces a custom-built Lightweight Vision Model (LVM) (Jiang et al., 2025). The LVM is a modular, parameterizable convolutional neural network designed for uterine tumor classification from MRI images. While it could have used large-scale pretrained models, training the LVM from scratch allows it to fine-tune itself directly on the texture and contrast patterns inherent to uterine MRI images.

The LVM architecture follows a typical hierarchical paradigm of feature extraction, having three convolution + pooling blocks arranged in series, followed by a fully connected classifier. This allows for the extraction of low-level features such as edges and textures, along with high-level features that involve shapes and boundaries important for tumor detection. Parameters in each layer, such as the number of filters, activation function, and dropout rate, are all subject to optimization based on the Whale Optimization Algorithm (WOA) so that the best validation results may be achieved (Zhang et al., 2023). This ensures a trade-off between simplicity and robustness in classification, especially when imbalanced or small datasets in the medical domain are considered.

Network architecture and equations let the input image $X \in R^{224 \times 224 \times 3}$ be an RGB MRI slice. The model consists of convolutional block 1, $F_{1} \in [16, 96]$ : number of filters (WOA-tuned) $ϕ$ : Activation function (ReLU / LeakyReLU /ELU), as shown in Equations 32, 33. Output shape $112 \times 112 \times F_{1}$ . Convolutional block 2 $F_{2} \in [32, 192]$ WOA-optimized, as shown in Equations 34, 35. Convolutional block 3 $F_{3} \in [64, 384]$ : WOA-optimized, output from $A^{(3)} \in R^{28 \times 28 \times F_{3}}$ , as shown in Equations 36, 37:

\begin{array}{l} Z^{(1)} = ϕ (X * W^{(1)} + b^{(1)}), W^{(1)} \in R^{3 \times 3 \times 3 \times F_{1}} & (32) \end{array}

\begin{array}{l} A^{(1)} = MaxPool (Z^{(1)}, 2 \times 2) & (33) \end{array}

\begin{array}{l} Z^{(2)} = ϕ (A^{(1)} * W^{(2)} + b^{(2)}), W^{(2)} \in R^{3 \times 3 \times F_{1} \times F_{2}} & (34) \end{array}

\begin{array}{l} A^{(2)} = MaxPool (Z^{(2)}, 2 \times 2) & (35) \end{array}

\begin{array}{l} Z^{(3)} = ϕ (A^{(2)} * W^{(3)} + b^{(3)}), W^{(3)} \in R^{3 \times 3 \times F_{2} \times F_{3}} & (36) \end{array}

\begin{array}{l} A^{(3)} = MaxPool (Z^{(3)}, 2 \times 2) & (37) \end{array}

Following the final convolutional block of the LVM, the output feature maps are flattened into a one-dimensional vector for classification purposes (Fan et al., 2023). This flattened feature vector then passes through one fully connected dense layer whose number of output units is treated as a hyperparameter, ranging from 64 to 512. The activation function used here is selected through WOA; under different scenarios, it can be ReLU, Leaky ReLU, or ELU. The dropout layer has been included after dense transformations to avoid overfitting. The dropout rate is set in the range of 0.1–0.5. The final classification is done through a softmax layer to produce probability scores on the three classes of uterine tumors: benign, malignant, and normal. The predicted class will be the one having the highest softmax score.

The training of the model is achieved by minimizing the sparse categorical cross-entropy loss function between the predicted probability distribution and the true class label (Nie et al., 2023). Also, L2 regularization (weight decay) is applied to every trainable layer of the network, with the coefficient λ being optimized by the WOA as well. Regularization prevents over-fitting by penalizing large magnitudes of weights, thereby encouraging models to behave more generally. LVM for Medical Imaging is fully customizable. Your WOA will allow you to adapt filters, activation functions, dropout rate, or dense units. The model is lightweight with a minimal memory footprint, making it suitable for developing real-time diagnostics and mobile applications in clinical environments. It also learns directly from MRI data without bias induced by pretrained natural image datasets. LVM is a flexible and interpretable alternative to deep pretrained models while facilitating domain-specific tunings for optimizing accuracy and resource usage in uterine tumor classification. Figure 6 illustrates the working architecture of the model.

Figure 6

Flowchart illustrating a convolutional neural network (CNN) architecture. Starting with a dataset of medical images, the data passes through multiple ConvBlocks consisting of convolution, ReLU activation, and max pooling. The output is then flattened, followed by a dense layer, softmax activation, and ultimately produces the final output.

Figure 6. LVM model structure.

2.5 Ensemble via soft voting

The proposed ensemble learning strategy is based on soft voting and aims to enhance the classification performance, improve stability, and increase diagnostic robustness in the automatic detection of uterine tumors from MRI images (Salur and Aydın, 2022). Ensemble models use the strength that comes from diversity among different classifiers to achieve higher generalizations and accuracies than any individual classifier. Within the framework, three diverse CNNs-MobileNetV2, DenseNet121, and a custom-built Lightweight Vision Model (LVM)-are individually trained and optimized with Whale Optimization Algorithm and then combined, through soft voting, to give the final classification output.

Each model outputs the probability distribution of classes for an input image. Let there be M models in an ensemble; then the model m is expected to produce the predicted probabilities vector $p^{(m)} = [p_{1}^{(m)}, p_{2}^{(m)}, \dots, p_{C}^{(m)}]$ with $C$ denoting the total number of classes: in this case $C = 3$ (Benign, Malignant, Normal), and $\sum_{k = 1}^{C} p_{k}^{(m)} = 1$ . The soft voting mechanism determines the average predicted probability for each class from all models, and the final predicted class $\hat{y}$ is given by the index max of the average probability (Jaradat et al., 2024). This ensures that output from each model has a say in the final decision and that class probabilities correspond to the ensemble’s head count confidence, as shown in Equations 38, 39:

\begin{array}{l} {\overset{ˉ}{p}}_{k} = \frac{1}{M} \sum_{m = 1}^{M} p_{k}^{(m)}, for k = 1, 2, \dots, C & (38) \end{array}

\begin{array}{l} \hat{y} = arg max_{k} {\overset{ˉ}{p}}_{k} & (39) \end{array}

Generalization is enhanced by building an ensemble out of different architectures, thus reducing variance and model-specific overfitting. Greater Diagnostic Confidence soft voting preserves probability information, while double-checking adds a safety layer to mimic expert consensus. Real-World Robustness tackles noise, different tumor morphology, and subtle contrast differences commonly observed in MRI scans of uterine tissues. This ensemble system gives a clinical-quality trade-off between precision and reliability for uterine cancer classification with WOA-tuned models and double verification.

The Soft Voting Ensemble combines the predicted class probabilities from MobileNetV2, DenseNet121, and LVM. For each input MRI scan, the three models independently generate probability distributions over the classes (normal, benign, malignant). These probabilities are then averaged across models with equal weights, producing a consensus probability distribution. The final classification is assigned to the class with the highest average probability. This strategy leverages the complementary strengths of the individual models, reduces bias toward any single model, and significantly improves the robustness and accuracy of the overall system, as demonstrated in Figure 7.

Figure 7

Flowchart depicting a medical image classification process. The dataset is analyzed using three models: MobileNetV2, DenseNet121, and LVM. MobileNetV2 results: 10% Normal, 30% Benign, 60% Malignant. DenseNet121 results: 7% Normal, 25% Benign, 68% Malignant. LVM results: 15% Normal, 33% Benign, 52% Malignant. Soft Voting Average final results: 10.66% Normal, 29.33% Benign, 60% Malignant, leading to a classification of Malignant.

Figure 7. Soft voting-based ensemble classification of uterine MRI images using MobileNetV2, DenseNet121, and LVM.

3 Results analysis

3.1 Experimental setup and measurement

To evaluate and validate the proposed methodology, the dataset was divided into three groups: 10% for testing, 10% for validation, and 80% for training. Tests were conducted using images as input. Several statistical indicators, such as true negative (TN), true positive (TP), false negative (FN), and false positive (FP), can be used to evaluate the effectiveness of the proposed technique. This section presents several metrics for evaluating the effectiveness of the proposed model and pre-trained models for detecting uterine cancer using MRI images. The mathematical calculations for the various evaluation metrics are presented in the following Equations 40–44:

\begin{array}{l} Accuracy = \frac{TP + TN}{TP + TN + FP + FN} & (40) \end{array}

\begin{array}{l} Precision = \frac{TP}{TP + FP} & (41) \end{array}

\begin{array}{l} Sensitivity = \frac{TP}{TP + FN} & (42) \end{array}

\begin{array}{l} Specificity = \frac{TN}{TN + FP} & (43) \end{array}

\begin{array}{l} F 1 Score = 2 * \frac{Precision * Sensitivity}{Precision + Sensitivity} & (44) \end{array}

For a more in-depth probe into model accuracy, the Confidence Interval (CI) is derived by this formula, as shown in Equation 45:

\begin{array}{l} CI = [\hat{μ} - z . σ, \hat{μ} + z . σ], & (45) \end{array}

Given the accuracy mean, the value of critical 95% confidence, along with the standard deviations from measurement error, as shown in Equation 46:

\begin{array}{l} σ = \sqrt{\frac{\hat{μ} (1 - \hat{μ})}{n}} & (46) \end{array}

By incorporating such assessments, the proposed model provides reliable image classification across benign, malignant, and normal categories, thereby significantly contributing to medical diagnosis.

3.2 The hyperparameter configuration

Hyperparameters for different neural networks are compared in Table 3. Dropout rates, input layers, optimization techniques, and other pertinent variables are all included in the analysis. These were the best hyperparameters that produced the best performance, and they were chosen after several trials until the best outcome was attained.

Table 3

Table 3. Optimized hyper parameters for deep learning models tuned via whale optimization algorithm (WOA).

WOAENet is a soft voting ensemble made of three deep learning architectures: MobileNetV2, DenseNet121, and a lightweight vision model (LVM), custom-designed. These sub-models were optimally tuned independently using the Whale Optimization Algorithm (WOA). Such an algorithm is an application of metaheuristic optimization for finding an approximate or near-optimum hyperparameter configuration by simultaneously exploiting and exploring the search space.

The ensemble models used the same preprocessing dimension of 224 × 224 × 3 to have a standard input image size and to enable compatibility among the architectures. Both MobileNetV2 and DenseNet121 underwent the ReLU activation; they were trained with a batch size of 45 and 65 and learning rates of 1.95 × 10⁻⁴ and 7.26 × 10⁻⁴, respectively.

The LVM aimed for computational efficiency with LeakyReLU activation, 133 dense units, and convolution blocks with 31, 55, and 93 filters, respectively, across layers. VGG16 and VGG19 were also assessed independently and trained with dropout rates of 0.25 and 0.30, with learning rates of 1.2 × 10⁻⁴ and 1.5 × 10⁻⁴. Adam optimizer was used for LVM and VGG16, whereas SGD was considered for MobileNetV2 and VGG19 due to its momentum-based updates. The weight decay regularize was applied to all the models for improving generalization. Inside WOAENet, the optimized model connotes the kind of strength metaheuristic-based hyperparameter tuning can provide for leading to an enhancement of the classification performance and robustness on the multi-class uterine MRI image dataset.

3.3 Model performance evaluation and analysis

This study aims to develop an effective model for uterine cancer diagnosis utilizing advanced deep learning techniques. It introduces an ensemble model known as the WOAENet, which relies on the WOA algorithm to fine-tune model parameters. This framework comprises a set of deep neural network models, including MobileNetV2, DenseNet121, and a custom CNN model (LVM), whose results are combined using Soft Voting to provide a final, high-accuracy prediction. The proposed WOAENet approach is compared to pre-trained deep learning models such as MobileNetV2, DenseNet121, LVM, VGG16, and VGG19, using the KAUH-UCM dataset.

All experiments in this study were conducted on a Python-based laptop equipped with an i7-12700k processor, an NVIDIA GeForce RTX 4060Ti graphics card, 8GB of RAM, 48GB of storage, and a 2 TB SSD. Table 4 shows the performance of all models and the WOAENet network on the real KAUH-UCM dataset, which was first collected from King Abdullah University Hospital for uterine cancer diagnosis. The results showed that WOAENet outperformed the pre-trained models with an accuracy of 88.57%, a specificity of 94.29%, and an F1 score of 88.54%, while MobileNetV2 achieved an accuracy of 75.24%. The DenseNet121 model achieved an accuracy of 79.76%, while the LVM model achieved an accuracy of 74.76%. This indicates that the proposed approach, WOAENet, provides high accuracy and significant improvements in uterine cancer detection compared to MobileNetV2, DenseNet121, and LVM. The Whale Optimization Algorithm (WOA) improves the performance of deep learning models by intelligently searching for the best combination of hyperparameters, such as learning rate, batch size, number of units in dense layers, and dropout rate. Additionally, the VGG16 and VGG19 models were tested, with the latter achieving the lowest accuracy of 70.95%, while the VGG16 model performed relatively well at 77.14%. Figure 8 illustrates the model’s effectiveness.

Table 4

Table 4. Performance and analysis models.

Figure 8

Bar chart comparing model performance metrics for VGG16, VGG19, MobileNetV2, DenseNet121, LVM, and WOAENet. Metrics include accuracy, precision, sensitivity, specificity, and F1 score, coded by colors. WOAENet exhibits the highest scores across most metrics.

Figure 8. Comparison of performance metrics of different models.

Compared to individual models such as MobileNetV2, DenseNet121, and LVM, WOAENet offers clear advantages by combining their complementary strengths through WOA-guided hyperparameter tuning and soft voting, achieving higher sensitivity and specificity, both of which are critical for cancer diagnosis. Unlike standalone lightweight models, which sacrifice accuracy for efficiency, or heavier models like DenseNet121, which increase computational costs, WOAENet strikes a balance between diagnostic reliability and scalability. This makes it more suitable for real-world use as a clinical decision support tool, capable of assisting radiologists in accurate second-opinion classifications while maintaining feasible computational requirements.

Figure 9 shows confusion matrices for six distinct models used to visualize the models’ performance in classifying uterine images into three categories: benign, malignant, and normal. The numbers within each cell indicate the number of samples belonging to the actual category (rows) and predicted as the corresponding category (columns). For example, in the VGG16 matrix, the top-left value of 57 indicates that 57 benign cases were accurately classified as benign. Values along the main diagonal (shaded in light red) represent correct predictions, while off-diagonal values (other numbers in red and blue) indicate misclassifications. Together, these matrices demonstrate the effectiveness of each model in distinguishing between different cases, with higher diagonal values reflecting superior accuracy in correctly classifying each category.

Figure 9

Six confusion matrices for models VGG16, VGG19, MobileNetV2, DenseNet121, LVM, and WOAENet show classification performance. Rows represent actual labels and columns predicted labels for benign, malignant, and normal categories. Each matrix displays true positives, false positives, false negatives, and true negatives using a color gradient from blue to red, indicating varying accuracies across models.

Figure 9. Confusion matrices for models in uterine tumors classification.

Comparing the models, the proposed WOAENet model demonstrates significantly superior performance. When compared to the VGG16, VGG19, MobileNetV2, DenseNet121, and LVM models, WOAENet demonstrates an exceptional ability to accurately classify malignant cases, achieving 64 accurate predictions for this class. This number outperforms all other models (e.g., 56 for VGG16, 47 for VGG19, 45 for MobileNetV2, 55 for DenseNet121, and 60 for LVM). This indicates that the WOAENet framework, enhanced by the WOA optimization algorithm, has successfully extracted more effective and specific features for cancer image classification, which is crucial in medical diagnosis. Furthermore, WOAENet maintains strong performance in classifying both benign and normal cases, making it a more comprehensive and accurate model for this uterine image classification.

3.4 Performance analysis of the WOAENet by category

The Soft Voting Ensemble-based WOAENet model demonstrated strong and balanced performance in classifying uterine tumors and detecting cancer using MRI across three categories: benign, malignant, and normal. The model performed well across all categories, as shown in Table 5. The highest sensitivity was in malignant classification at 91.43%, indicating the model’s high ability to detect malignant cases. It also achieved the highest accuracy in the same category at 90.14%. Furthermore, the model demonstrated a good balance in classifying normal and benign cases, with an accuracy ranging from 87.5 to 88.06%, and a sensitivity of 90% for normal cases and 84.29% for benign cases. Overall, the accuracy, sensitivity, specificity, and F1 coefficient indicators reflect the advanced performance of the model, making it a promising and reliable tool for classifying cases with uterine diseases.

Table 5

Table 5. Class-wise performance metrics of the WOAENet model on the KAUH-UCM dataset.

3.5 Statistical analysis

The accuracy of several deep learning models, including the suggested WOAENet model, for the uterine cancer image identification task is compared in Table 6. The performance of a particular model is shown in each row, along with the 95% confidence interval, the accuracy difference from WOAENet, and the total accuracy percentage. The accuracy values show what proportion of each model’s predictions were accurate. DenseNet121, for instance, achieved an accuracy of 79.05%, whereas WOAENet achieved 88.57%. A clear indicator of performance disparity is provided by the “Difference from WOAENet column, which shows how much lower each model’s accuracy was when compared to WOAENet. The model’s actual accuracy is likely to lie within the range provided by the 95% confidence interval, which shows the statistical performance of a particular model in each row, along with the 95% confidence interval, the accuracy difference from WOAENet, and the total accuracy percentage. The accuracy values show what proportion of each model’s predictions were accurate.

Table 6

Table 6. Model accuracy evaluation, with 95% confidence interval and deviations from the WOAENet.

This demonstrates that the proposed WOAENet model significantly outperforms all other evaluated models in terms of accuracy. With an accuracy of 88.57%, WOAENet shows a significant improvement, achieving 9.52% higher accuracy than the best model, DenseNet121 (79.05%). The accuracy gap is even more evident when compared to models like VGG19, which lags by a significant 17.62%. WOAENet consistently has high accuracy and a narrow confidence interval (84.26, 92.87), indicating that the Whale Optimization Algorithm (WOA)-optimized baseline framework is highly effective in optimizing the deep learning pipeline for uterine image classification. This superior performance confirms WOAENet’s potential as a more reliable and robust solution for this critical medical diagnostic task compared to established frameworks like VGG16, MobileNetV2, DenseNet121, and LVM.

3.6 Evaluating the model in clinical environments

To evaluate the real-world clinical applicability of the WOAENet model, we conducted a prospective validation on a cohort of 30 anonymized uterine cancer cases from King Abdullah University Hospital. These cases were not part of the training or validation datasets. The model’s predictions were compared against the final clinical diagnoses made by expert radiologists. WOAENet correctly classified 23 out of 30 cases (76.7%), aligning with the radiologists’ final diagnoses. The remaining seven cases (23.3%) showed discrepancies, which we analyzed in detail: three cases were false positives, where the model flagged malignant patterns in images that were ultimately diagnosed as benign. These cases often involved atypical fibroids or inflammatory tissue that mimicked malignancy features on MRI. Four cases were false negatives, where the model failed to detect malignancy. Most of these involved small lesion sizes, diffuse tumor margins, or overlapping intensity features with benign conditions, highlighting challenges in early-stage or non-mass-forming malignancies. These error patterns provide critical insight into the model’s current limitations, especially in handling ambiguous or subtle findings, and will inform targeted improvements in future model iterations. Additionally, to assess the model’s practical impact on clinical workflow, we conducted a preliminary time-efficiency study involving two experienced radiologists. Each radiologist reviewed 15 cases with and without the WOAENet system, using a randomized and blinded setup. The results showed: Average interpretation time without WOAENet: 9.4 min per case. Average interpretation time with WOAENet assistance: 5.7 min per case. Time reduction: Approximately 39.4%, equating to an average savings of 3.7 min per case.

This demonstrates that WOAENet not only enhances diagnostic confidence but also provides substantial time-saving benefits, which can scale meaningfully across high-volume clinical settings. Clinician feedback emphasized that WOAENet was especially helpful in identifying regions of interest quickly and offering a second-look validation in equivocal cases. The system was particularly valued in time-sensitive contexts such as pre-surgical assessments and emergency diagnostics.

4 Discussion

The results of this study demonstrate the effectiveness of the WOAENet framework in diagnosing uterine tumors and detecting cancer from MRI images. The proposed methodology achieved accuracy and specificity, outperforming single models such as MobileNetV2 (75.24%), DenseNet121 (79.76%), and LVM (74.76%). This result highlights the known limitations of single-model classifiers, especially when they are not precision-optimized. The ensemble’s seamless voting mechanism further enhanced decision reliability by leveraging the complementary strengths of the constituent models, ultimately achieving an accuracy of 88.57%, a specificity of 94.29%, and an F1 score of 88.54%.

These results align with previous work that emphasizes the importance of domain adaptation and model customization in uterine imaging. For instance, Mulliez et al. (2023) demonstrated that even well-established CNN architectures like VGG16 and VGG11 require careful tuning and adaptation to the specific challenges of uterine MRIs, including anatomical variability and contrast ambiguity. Interestingly, despite using a similar CNN backbone (VGG16), their fully automated uterus measurement tool achieved high agreement with manual readings (OKS = 0.96), reinforcing the idea that model success in uterine imaging hinges on task-specific optimization.

Further support comes from Davarpanah et al. (2016), who evaluated diffusion-weighted MRI to distinguish benign from malignant uterine masses. Their results showed that while DWI provides qualitative diagnostic value, quantitative ADC metrics alone are not sufficient for reliable uterine malignancy classification due to significant feature overlap. This underscores the necessity of ensemble approaches like WOAENet that combine structural image features with optimized learning mechanisms.

Recent efforts have also explored integrating clinical, radiomic, and conventional MRI features to distinguish uterine leiomyosarcoma (LMS) from leiomyoma (LM). Roller et al. (2024) found that models combining radiomics with clinical and imaging features outperformed those based on imaging alone, achieving an AUC of 0.989. Although WOAENet currently focuses on image-based classification, this suggests future extensions could further benefit from incorporating structured clinical variables to enhance predictive power.

From an imaging quality perspective, Hausmann et al. (2025) demonstrated that deep learning-accelerated MRI sequences such as DL-VIBE significantly improve lesion delineation and diagnostic confidence in uterine MRI compared to traditional sequences. As image quality directly influences model input fidelity, incorporating DL-enhanced sequences into preprocessing could further boost WOAENet’s robustness.

Additionally, the work by Hodneland et al. (2024) draws attention to the variability of radiomic features due to differences in MRI protocols and highlights the need for normalization strategies. Their comparative analysis of z-score and linear regression model (LRM) normalization revealed that normalization has a strong impact on radiomic clustering and downstream prognostic modeling. This finding is particularly relevant as WOAENet may benefit from radiomic integration in future iterations, where normalization becomes critical for model generalizability across centers.

4.1 Generalization across different data sets

While the primary evaluation was conducted on the uterine MRI dataset, we also validated the WOAENet network on another dataset, the KAUH-OCM ovarian cancer MRI dataset (Amin et al., 2025). This dataset contained 478 images for each class (normal, benign, malignant) after processing, and the same preprocessing and classification methodology was applied.

These results demonstrate that WOAENet maintains strong performance when applied to an external dataset, as shown in Table 7, especially with high accuracy and F1 scores. This confirms the generalizability of the proposed framework to various gynecological MRI datasets, supporting its broader clinical applicability. Figure 10 shows the confusion matrix of the proposed WOAENet model.

Table 7

Table 7. Performance evaluation of the proposed WOAENet model on the KAUH-OCM.

Figure 10

Figure 10. Confusion matrix of the proposed model for ovarian tumor classification (MRI images).

4.2 Limitations of the study

Our study has several limitations that should be acknowledged. First, obtaining a balanced dataset for classification was a significant challenge. The dataset, collected exclusively from King Abdullah University Hospital in Jordan, was inherently imbalanced because it reflected the distribution of real cases, with some tumor types being much more common than others. To mitigate this, we applied data augmentation techniques to enhance the representation of underrepresented classes. However, such strategies cannot fully replace the value of a larger, more balanced, and diverse dataset. Second, while WOAENet showed promising results, its robustness against noisy or incomplete MRI data has not been extensively evaluated. Real-world clinical environments often face issues such as imaging artifacts, variability in acquisition protocols, and missing data, which could impact model reliability. Furthermore, although WOAENet has demonstrated efficiency in a controlled research setting, its scalability and interoperability with clinical imaging systems require further validation to ensure seamless integration into hospital workflows.

Future work should address these challenges by incorporating larger, multi-center datasets, integrating clinical and demographic data to enrich decision-making, and exploring transfer learning strategies to improve generalization across populations. Additionally, enhancing model interpretability through explainable AI techniques will be essential for building trust among clinicians and supporting its adoption in clinical practice.

The computational complexity of the methodology was another issue with this effort. Although the computational workload was managed using cloud-based technologies, the free version had limitations regarding runtime and processing power. Although it enabled us to finish the study within the limitations of our resources, these limitations posed significant difficulties. Access to a more powerful local computing setup or a professional version of these programs would have resolved these issues and expedited the process. To enhance the performance and application of the proposed methodology, future efforts should focus on obtaining more balanced and diverse datasets, as well as access to sophisticated computational resources.

Overall, the superior performance of WOAENet, achieved without prolonged training or extensive pre- or post-processing, positions it as a clinically viable tool. Its efficiency and accuracy make it suitable for real-world settings with limited computational resources. Furthermore, its ensemble architecture and optimization via WOA offer a flexible foundation for future enhancements, such as multimodal data fusion, radiomic incorporation, or transfer learning from DL-accelerated MRI.

5 Conclusion and future work

This study presents a comprehensive method for uterine cancer detection using MRI data. The proposed approach is based on an integrated deep learning pipeline framework, WOAENet (Whale Optimization Algorithm-based Ensemble Network), which is optimized using the WOA algorithm to classify uterine images into malignant, benign, and normal categories. Furthermore, we propose a WOA algorithm for fine-tuning the hyperparameters of deep learning models, including MobileNetV2, DenseNet121, and a custom CNN (LVM), by minimizing the validation loss. Each model is trained using its optimized parameters, and their outputs are combined using a smooth voting set, which calculates the average predicted probabilities across all models to arrive at a final prediction. We use the KAUH-UCM dataset of uterine MRI images from King Abdullah University Hospital to evaluate the proposed WOAENet model. The WOAENet model demonstrates the highest classification accuracy. Tests indicate that the proposed model is a successful tool for classifying uterine tumors, achieving an accuracy of 88.57%, outperforming all pre-trained models.

Beyond its experimental performance, WOAENet holds promise for clinical integration. Its lightweight architecture makes it feasible for deployment in hospital imaging systems, where it could assist radiologists by providing second-opinion classifications in real time. Nevertheless, certain challenges remain, including the need for large-scale validation across diverse populations, ensuring interoperability with existing medical imaging infrastructure, and addressing regulatory and ethical considerations before clinical adoption.

Our goal in future work is to evaluate the effectiveness of the WOAENet model using diverse hybrid datasets. Furthermore, future research will focus on expanding datasets, incorporating data from other sources, improving model interpretability, and cross-validating it in a broader clinical setting. Finally, we will examine the effectiveness of the proposed model in other diagnostic tasks.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

This study was conducted according to the guidelines and with the approval of the Institutional Review Board (IRB No. 21/171/2024) at King Abdullah University Hospital, Jordan University of Science and Technology, Jordan. Institutional Review Board approval has been granted. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

OA: Data curation, Formal analysis, Methodology, Validation, Writing – original draft. AS: Data curation, Formal analysis, Investigation, Resources, Writing – original draft. HM: Data curation, Resources, Validation, Visualization, Writing – original draft. SaA: Investigation, Methodology, Software, Writing – review & editing. MA: Conceptualization, Methodology, Software, Validation, Writing – original draft. HA: Project administration, Resources, Visualization, Writing – review & editing. RM: Data curation, Formal analysis, Validation, Visualization, Writing – original draft. NA: Formal analysis, Resources, Validation, Visualization, Writing – review & editing. FZ: Conceptualization, Project administration, Resources, Writing – review & editing. SiA: Formal analysis, Investigation, Validation, Writing – review & editing. KS: Conceptualization, Funding acquisition, Project administration, Resources, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2501).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abualigah, L., et al. (2024). “Whale optimization algorithm: analysis and full survey” in Laith A, editors. Metaheuristic optimization algorithms (Amsterdam: Elsevier), 105–115.

Google Scholar

Akazawa, M., and Hashimoto, K. (2021). Artificial intelligence in gynecologic cancers: current status and future challenges–a systematic review. Artif. Intell. Med. 120:102164. doi: 10.1016/j.artmed.2021.102164

PubMed Abstract | Crossref Full Text | Google Scholar

Aljarah, I., Faris, H., and Mirjalili, S. (2018). Optimizing connection weights in neural networks using the whale optimization algorithm. Soft. Comput. 22, 1–15.

Google Scholar

Amin, M., Alhatamleh, S., Sindiani, A. M., Mhanna, H. Y., Madain, R., Anakreh, D., et al. (2025). GAM-attention enhanced DenseNet121 with Bayesian optimization for accurate ovarian Cancer diagnosis in MRI images. New York, NY: IEEE Access.

Google Scholar

Boeckstaens, S., Dewalheyns, S., Heremans, R., Vikram, R., Timmerman, D., den Van Bosch, T., et al. (2020). Signs and symptoms associated with uterine cancer in pre-and postmenopausal women. Heliyon 6:e05372. doi: 10.1016/j.heliyon.2020.e05372

PubMed Abstract | Crossref Full Text | Google Scholar

Brodzicki, A., Piekarski, M., and Jaworek-Korjakowska, J. (2021). The whale optimization algorithm approach for deep neural networks. Sensors 21:8003. doi: 10.3390/s21238003

PubMed Abstract | Crossref Full Text | Google Scholar

Davarpanah, A. H., Kambadakone, A., Holalkere, N. S., Guimaraes, A. R., Hahn, P. F., and Lee, S. I. (2016). Diffusion MRI of uterine and ovarian masses: identifying the benign lesions. Abdom. Radiol. 41, 2466–2475. doi: 10.1007/s00261-016-0909-2

PubMed Abstract | Crossref Full Text | Google Scholar

Dong, H.-C., Dong, H.-K., Yu, M.-H., Lin, Y.-H., and Chang, C.-C. (2020). Using deep learning with convolutional neural network approach to identify the invasion depth of endometrial cancer in myometrium using MR images: a pilot study. Int. J. Environ. Res. Public Health 17:5993. doi: 10.3390/ijerph17165993

PubMed Abstract | Crossref Full Text | Google Scholar

Dong, K., Zhou, C., Ruan, Y., and Li, Y. (2020). “MobileNetV2 model for image classification” in In 2020 2nd International Conference on Information Technology and Computer Application (ITCA) (New York, NY: IEEE), 476–480.

Google Scholar

Esmaeilzadeh, A. A., and Nasirzadeh, F. (2023). Uterus Cancer. Eurasian J. Chem. Med. Pet. Res. 2, 63–83.

Google Scholar

Fan, Q., Huang, H., Zhou, X., and He, R. (2023). Lightweight vision transformer with bidirectional interaction. Adv. Neural Inf. Process Syst. 36, 15234–15251.

Google Scholar

Felix, A. S., and Brinton, L. A. (2018). Cancer progress and priorities: uterine cancer. Cancer Epidemiol. Biomarkers Prev. 27, 985–994. doi: 10.1158/1055-9965.EPI-18-0264

PubMed Abstract | Crossref Full Text | Google Scholar

Fujioka, T., Katsuta, L., Kubota, K., Mori, M., Kikuchi, Y., Kato, A., et al. (2020). Classification of breast masses on ultrasound shear wave elastography using convolutional neural networks. Ultrason. Imaging 42, 213–220. doi: 10.1177/0161734620932609

PubMed Abstract | Crossref Full Text | Google Scholar

Gui, B., Lupinelli, M., Russo, L., Miccò, M., Avesani, G., Panico, C., et al. (2022). MRI in uterine cancers with uncertain origin: endometrial or cervical? Radiological point of view with review of the literature. Eur. J. Radiol. 153:110357. doi: 10.1016/j.ejrad.2022.110357

PubMed Abstract | Crossref Full Text | Google Scholar

Hamoud, B. H., Sima, R. M., Vacaroiu, I. A., Georgescu, M. T., Bobirca, A., Gaube, A., et al. (2023). The evolving landscape of immunotherapy in uterine cancer: a comprehensive review. Life 13:1502. doi: 10.3390/life13071502

PubMed Abstract | Crossref Full Text | Google Scholar

Hausmann, D., Marketin, A., Rotzinger, R., Heimer, J., Nickel, D., Weiland, E., et al. (2025). Improved image quality through deep learning acceleration of gradient-Echo acquisitions in uterine MRI: first application with the female pelvis. Acad. Radiol. 32, 2776–2786. doi: 10.1016/j.acra.2024.12.021

PubMed Abstract | Crossref Full Text | Google Scholar

Hodneland, E., Andersen, E., Wagner-Larsen, K. S., Dybvik, J. A., Lura, N., Fasmer, K. E., et al. (2024). Impact of MRI radiomic feature normalization for prognostic modelling in uterine endometrial and cervical cancers. Sci. Rep. 14:16826. doi: 10.1038/s41598-024-66659-w

PubMed Abstract | Crossref Full Text | Google Scholar

Jaradat, A., Alhatamleh, S., Nasayreh, A., and Gharaibeh, H., “Enhanced intrusion detection in wireless sensor networks: a voting and particle swarm optimization approach,” In 2024 IEEE 21st International Conference on Smart Communities: Improving Quality of Life using AI, Robotics and IoT (HONET). New York, NY: IEEE, (2024), pp. 223–228.

Google Scholar

Jiang, F., Tu, S., Dong, L., Wang, K., Yang, K., Liu, R., et al. (2025). Lightweight vision model-based multi-user semantic communication systems. arXiv.

Google Scholar

Keall, P. J., Brighi, C., Glide-Hurst, C., Liney, G., Liu, P. Z. Y., Lydiard, S., et al. (2022). Integrated MRI-guided radiotherapy—opportunities and challenges. Nat. Rev. Clin. Oncol. 19, 458–470. doi: 10.1038/s41571-022-00631-3

PubMed Abstract | Crossref Full Text | Google Scholar

Li, Y., Ammari, S., Balleyguier, C., Lassau, N., and Chouzenoux, E. (2021). Impact of preprocessing and harmonization methods on the removal of scanner effects in brain MRI radiomic features. Cancers 13:3000. doi: 10.3390/cancers13123000

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, T., Chen, T., Niu, R., and Plaza, A. (2021). Landslide detection mapping employing CNN, ResNet, and DenseNet in the three gorges reservoir, China. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 14, 11417–11428.

Google Scholar

Liu, L., and Zhang, R. (2022). Multistrategy improved whale optimization algorithm and its application. Comput. Intell. Neurosci. 2022:3418269.

Google Scholar

Lu, K. H., and Broaddus, R. R. (2020). Endometrial cancer. N. Engl. J. Med. 383, 2053–2064. doi: 10.1056/NEJMra1514010

PubMed Abstract | Crossref Full Text | Google Scholar

Lundervold, A. S., and Lundervold, A. (2019). An overview of deep learning in medical imaging focusing on MRI. Z. Med. Phys. 29, 102–127. doi: 10.1016/j.zemedi.2018.11.002

PubMed Abstract | Crossref Full Text | Google Scholar

Maheswari, S., Ponnibala, M., Govindaraj, S., Priyadharshini, S. N., Shobika, T., and Swetha, D. (2024). “AI based system to detect uterine cancer using ultrasound images” in 2024 International Conference on Knowledge Engineering and Communication Systems (ICKECS) (New York, NY: IEEE), 1–9.

Google Scholar

Mirjalili, S., and Lewis, A. (2016). The whale optimization algorithm. Adv. Eng. Softw. 95, 51–67. doi: 10.1016/j.advengsoft.2016.01.008

Crossref Full Text | Google Scholar

Mohammed, H. M., Umar, S. U., and Rashid, T. A. (2019). A systematic and meta-analysis survey of whale optimization algorithm. Comput. Intell. Neurosci. 2019:8718571.

Google Scholar

Moradmand, H., Aghamiri, S. M. R., and Ghaderi, R. (2020). Impact of image preprocessing methods on reproducibility of radiomic features in multimodal magnetic resonance imaging in glioblastoma. J. Appl. Clin. Med. Phys. 21, 179–190. doi: 10.1002/acm2.12795

PubMed Abstract | Crossref Full Text | Google Scholar

Mulliez, D., Poncelet, E., Ferret, L., Hoeffel, C., Hamet, B., Dang, L. A., et al. (2023). Three-dimensional measurement of the uterus on magnetic resonance images: development and performance analysis of an automated deep-learning tool. Diagnostics 13:2662. doi: 10.3390/diagnostics13162662

PubMed Abstract | Crossref Full Text | Google Scholar

Nadimi-Shahraki, M. H., Taghian, S., Mirjalili, S., Abualigah, L., Abd Elaziz, M., and Oliva, D. (2021). EWOA-OPF: effective whale optimization algorithm to solve optimal power flow problem. Electronics 10:2975.

Google Scholar

Nadimi-Shahraki, M. H., Zamani, H., Asghari Varzaneh, Z., and Mirjalili, S. (2023). A systematic review of the whale optimization algorithm: theoretical foundation, improvements, and hybridizations. Arch. Comput. Methods Eng. 30, 4113–4159. doi: 10.1007/s11831-023-09928-7

PubMed Abstract | Crossref Full Text | Google Scholar

Nie, Y, He, W, Han, K, Tang, Y, Guo, T, Du, F, et al. (2023) “Lightclip: Learning multi-level interaction for lightweight vision-language models,” arXiv

Google Scholar

Pham, Q.-V., Mirjalili, S., Kumar, N., Alazab, M., and Hwang, W.-J. (2020). Whale optimization algorithm with applications to resource allocation in wireless networks. IEEE Trans. Veh. Technol. 69, 4285–4297. doi: 10.1109/TVT.2020.2973294

Crossref Full Text | Google Scholar

Rana, N., Latiff, S. A., and Abdulhamid, S. M. (2020). “A metaheuristic based virtual machine allocation technique using whale optimization algorithm in cloud” in The International Conference on Emerging Applications and Technologies for Industry 4.0 (Berlin: Springer), 22–38.

Google Scholar

Roller, L. A., Wan, Q., Liu, X., Qin, L., Chapel, D., Burk, K. S., et al. (2024). MRI, clinical, and radiomic models for differentiation of uterine leiomyosarcoma and leiomyoma. Abdom. Radiol. 49, 1522–1533. doi: 10.1007/s00261-024-04198-8

PubMed Abstract | Crossref Full Text | Google Scholar

Salur, M. U., and Aydın, İ. (2022). A soft voting ensemble learning-based approach for multimodal sentiment analysis. Neural Comput. Appl. 34, 18391–18406. doi: 10.1007/s00521-022-07451-7

Crossref Full Text | Google Scholar

Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.-C., “Mobilenetv2: inverted residuals and linear bottlenecks,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2018), pp. 4510–4520.

Google Scholar

Shin, H.-C., Roth, H. R., Gao, M., Lu, L., Xu, Z., Nogues, I., et al. (2016). Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 35, 1285–1298. doi: 10.1109/TMI.2016.2528162

PubMed Abstract | Crossref Full Text | Google Scholar

Soffer, S., Ben-Cohen, A., Shimon, O., Amitai, M. M., Greenspan, H., and Klang, E. (2019). Convolutional neural networks for radiologic images: a radiologist’s guide. Radiology 290, 590–606. doi: 10.1148/radiol.2018180547

PubMed Abstract | Crossref Full Text | Google Scholar

Somasegar, S., Bashi, A., Lang, S. M., Liao, C. I., Johnson, C., Darcy, K. M., et al. (2023). Trends in uterine cancer mortality in the United States: a 50-year population-based analysis. Obstet. Gynecol. 142, 978–986. doi: 10.1097/AOG.0000000000005321

PubMed Abstract | Crossref Full Text | Google Scholar

Swaminathan, A., Varun, C., and Kalaivani, S. (2021). Multiple plant leaf disease classification using densenet-121 architecture. Int. J. Electr. Eng. Technol 12, 38–57.

Google Scholar

Uemura, T., Näppi, J. J., Hironaka, T., Kim, H., and Yoshida, H. (2020). “Comparative performance of 3D-DenseNet, 3D-ResNet, and 3D-VGG models in polyp detection for CT colonography” in Medical Imaging 2020: computer-aided diagnosis (New York, NY: SPIE), 736–741.

Google Scholar

Um, H., Tixier, F., Bermudez, D., Deasy, J. O., Young, R. J., and Veeraraghavan, H. (2019). Impact of image preprocessing on the scanner dependence of multi-parametric MRI radiomic features and covariate shift in multi-institutional glioblastoma datasets. Phys. Med. Biol. 64:165011. doi: 10.1088/1361-6560/ab2f44

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, X., Lin, S., and Lyu, G. (2022). Advances in the clinical application of ultrasound elastography in uterine imaging. Insights Imaging 13:141. doi: 10.1186/s13244-022-01274-9

PubMed Abstract | Crossref Full Text | Google Scholar

Xiang, Q., Wang, X., Li, R., Zhang, G., Lai, J., and Hu, Q., “Fruit image classification based on Mobilenetv2 with transfer learning technique,” in Proceedings of the 3rd International Conference on Computer Science and Application Engineering, (2019), pp. 1–7.

Google Scholar

Zhang, K., Guo, Y., Wang, X., Yuan, J., and Ding, Q. (2019). Multiple feature reweight DenseNet for image classification. IEEE Access 7, 9872–9880.

Google Scholar

Zhang, J., Qian, S., and Tan, C. (2023). Automated bridge crack detection method based on lightweight vision models. Complex Intell. Syst. 9, 1639–1652. doi: 10.1007/s40747-022-00876-6

Crossref Full Text | Google Scholar

Keywords: obstetrics and gynecology, uterine cancer, soft voting, deep learning, diagnoses, MRI

Citation: Altal OF, Sindiani AM, Mhanna HYA, Alhatamleh S, Amin M, Akhdar HF, Madain R, Alqasem N, Zayed F, Alanazi S and Sandougah KJ (2025) WOAENet: a whale optimization-guided ensemble deep learning with soft voting for uterine cancer diagnosis based on MRI images. Front. Artif. Intell. 8:1664201. doi: 10.3389/frai.2025.1664201

Received: 11 July 2025; Accepted: 06 October 2025;
Published: 20 October 2025.

Edited by:

Anderson Rodrigues dos Santos, Federal University of Uberlandia, Brazil

Reviewed by:

Ateeq Ur Rehman Butt, National Textile University, Pakistan
M. Roshni Thanka, Karunya Institute of Technology and Sciences, India

Copyright © 2025 Altal, Sindiani, Mhanna, Alhatamleh, Amin, Akhdar, Madain, Alqasem, Zayed, Alanazi and Sandougah. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kholoud J. Sandougah, S3NhbmRvdWdhaEBpbWFtdS5lZHUuc2E=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.