Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell., 08 December 2025

Sec. Medicine and Public Health

Volume 8 - 2025 | https://doi.org/10.3389/frai.2025.1717913

This article is part of the Research TopicArtificial Intelligence and Medical Image ProcessingView all 5 articles

Gender-based Alzheimer's detection using ResNet-50 and binary dragonfly algorithm on neuroimaging

  • 1Department of Software Engineering, Superior University, Lahore, Pakistan
  • 2Department of Information Technology, University of Gujrat, Gujrat, Pakistan
  • 3Department of Computer Science, Kohsar University Murree, Murree, Pakistan
  • 4King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
  • 5King Saud Bin Abdulaziz University for Health Science, Riyadh, Saudi Arabia
  • 6Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
  • 7School of Computer Science and Technology, Beijing Institute of Technology, Beijing, Haidian, China
  • 8College of Computer Science and Engineering, University of Ha'il, Ha'il, Saudi Arabia

Alzheimer's disease (AD) is an incurable, progressive neurodegenerative disorder. It is characterized by a gradual decline in memory, cognition, and behavior, which ultimately results in severe dementia and functional dependence. AD begins to develop in the brain at an early stage, while its symptoms appear gradually over time. Early diagnosis and classification of Alzheimer's is a critical research focus due to its silent progression. The current literature highlights a gap in gender-based studies, revealing that the risk of AD varies by gender, age, race, and ethnicity. The nature of the association between AD and these factors requires further exploration to better understand their impact on disease risk and progression. Effectively employing multiple algorithms is essential for accurate diagnosis of Alzheimer's development. This study proposed the GRDN model, which explored a critical aspect of gender-based Alzheimer's detection. To detect subtle changes in the brain, functional magnetic resonance imaging (fMRI) scans have been acquired from the ADNI dataset. In order to balance class distribution and enhance classifier performance on underrepresented groups, a generative adversarial network (GAN) is applied. A balanced dataset is provided to the ResNet-50 architecture for feature extraction, resulting in feature matrices set with a range of 100, 250, and 450. These feature set matrices were then fed to a swarm intelligence-based approach, the binary dragonfly algorithm (BDA), for feature selection, which identified the most informative features. After feature engineering, the resultant matrices of feature selection were provided to the five machine learning (ML) classification algorithms for data classification. The results show that as the size of the features set increases and the accuracy of the classification improves. The simulation results demonstrated that the fineKNN achieved strong performance, with an accuracy of 94.8% on the male group on a feature set of 450, and consistently outperformed other models across all study groups.

1 Introduction

Alzheimer's disease (AD) is a widespread neurodegenerative disorder affecting the lives of more than 55 million people across the globe, and its prevalence is projected to triple by 2050 (Chen Y. et al., 2024). Individuals with AD face a gradual loss of mental ability, including impaired memory, reduced reasoning, language disorder, and neuropsychiatric disorders such as apathy and anxiety. The most significant risk factor that affects individuals with AD is age, with risk increasing from age 50 onwards (Calderón-Garcidueñas, 2022).

Current research depicts that AD starts with minor alterations in the brain anatomy. One of the key characteristics of AD is the generation of abnormal proteins, mainly β-amyloid plaques and tau tangles, in a region of the brain called the hippocampus. Through the progression of this disease, these protein clumps spread to adjacent brain regions, causing shrinkage of the brain tissues, breaking down connections between cells, and leading to memory fade/loss problems (Rao et al., 2024). It is essential to note that these changes can typically be detected in the later stages of the disease, although this is not particularly helpful for early-stage diagnosis. Different neuroimaging modalities, such as Computed Tomography (CT) scans, Positron Emission Tomography (PET) scans, Magnetic Resonance Imaging (MRI), and Functional Magnetic Resonance Imaging (fMRI), are commonly used for AD detection. MRI provides clear structural images of brain tissues, revealing shrinkage and volume loss in the temporal lobe, which are key indicators of neurodegenerative disease. fMRI captures detailed functional information about the brain by measuring changes in blood flow and oxygen level, revealing which regions of the brain are active. fMRI utilizes blood oxygen level-dependent (BOLD) contrast to capture changes in oxygenation, distinguishing between oxygen-rich and oxygen-poor blood. PET scans work by visualizing and measuring metabolic activities in the brain by detecting radioactive signals emitted by radioactive tracers injected into the body. PET demonstrates how the body's cells are performing on a biochemical level, providing insight into metabolic processes. CT scans use X-ray methodology to create cross-sectional images of the brain (van Dyck et al., 2023; Chen Z. et al., 2024).

Researchers employed several publicly available datasets to further the scientific research, including Alzheimer's Disease Neuroimaging Initiative (ADNI), Open Access Series of Imaging Studies (OASIS), and Australian Imaging, Biomarker & Lifestyle Study (AIBL). The ADNI is the most used dataset that provides multimodal neuroimaging, biomarker, genetic, and clinical data of multiple modalities (Grigas et al., 2024). In order to categorize the different stages of AD, including cognitive normal (CN), mild cognitive impairment (MCI), and Alzheimer's disease (AD), ADNI provides a comprehensive neuroimaging and clinical dataset. Four different variants of the ADNI dataset progress with the timeline: ADNI-1 (2004–2009), ADNI-GO (2009–2011), an extension of ADNI-1, ADNI-2 (2011–2016), ADNI-3 (2016–2022), and ADNI-4 (2022–present) (Goyal et al., 2024).

This study uses fMRI scans from the ADNI-4 dataset, including novel remote cohorts that integrate screening and monitoring participants outside of a clinical setting. ADNI-4 includes three diagnostic groups—MCI, CN, and AD—comprising both brain images and clinical details. The dataset tracks how biomarkers change over time and examines factors such as age, gender, education, race, genetics, and marital status to understand their relationship to early signs of Alzheimer's. Collectively, all of these research elements aim to detect the disease at an earlier stage by making more accurate predictions and can better understand how Alzheimer's begins and progresses, with the goal of including a diversified population. The inclusion of remote cohorts provides greater opportunities to study how genotype influences AD risk across gender, age, race, and ethnicity (Beckett et al., 2024). This approach deepens the understanding of how this disease develops and progresses, and it supports tailored interventions for diverse groups.

When AD develops, the brain begins to undergo a progressive process called atrophy, characterized by the loss of neurons and the shrinking of brain tissue. Pathological changes in an AD brain and a normal brain are shown in Figure 1. In AD, amyloid-β builds up outside the neurons and neurofibrillary tangles inside due to the tau protein. As a result, synapses are lost and communication between neurons becomes weakened. This synaptic loss contributes to the cognitive decline seen in Alzheimer's patients.

Figure 1
Diagram comparing normal brain cells and those affected by Alzheimer's disease. The left side shows a neuron, astrocyte, and microglia under normal conditions. The right side illustrates Alzheimer's features: neurofibrillary tangles, amyloid-beta plaque, synapse loss, cytokine overproduction, microglial activation, and astrogliosis, highlighting brain atrophy.

Figure 1. Pathological hallmarks of Alzheimer's disease, including brain atrophy, amyloid-β deposits, tau tangles, and synaptic loss (Xia et al., 2021).

Machine learning (ML) has emerged as one of the best approaches in health informatics, especially for the detection of Alzheimer's. There are many famous ML algorithms, such as Support Vector Machine (SVM), Decision Tree (DT), and Artificial Neural Networks (ANN), which are used to detect different factors such as age, gender, biomarkers, education, marital status, and genetic profiles for the detection of AD at an early stage (Jiang et al., 2024; Mishra et al., 2024). Alongside, deep learning (DL) has produced remarkably better results for Alzheimer's diagnosis. To find the complicated pattern in neuroimaging data, there exist robust architectures such as CNNs, VGG, ResNet, and MobileNet, which have shown a strong ability to classify AD (Ftoutou et al., 2023; Mohsen, 2025).

Incorporating chronological changes in biomarkers along with diverse remote cohorts in ADNI-4 enhances biomarker reliability and participant diversity (Dubois et al., 2023). The binary classification of AD remains unsuccessful in capturing the disease's severity and progression. Therefore, multi-classification is necessary to capture temporal changes in biomarkers and improve predictive accuracy. Abuhantash et al. (2025) emphasized that many ML models achieve comparatively good accuracy while classifying AD multi-stages, but predicting the true severity and progression remains challenging due to often ignored clinical and demographic factors such as gender, age, education, race, genotype, and marital status, which are crucial for personalization. Existing studies incorporated imbalanced datasets, leading to biased classification results, especially for underrepresented stages. Additionally, there is a lack of comparison between established and new datasets that include additional features affecting model performance (Danter, 2024).

Artificial intelligence relies on a massive amount of data to learn generalized patterns and make precise predictions on unknown data instances. In the medical domain, datasets are usually smaller and often contain class imbalance across different classes of disease (Alayba et al., 2024). These limitations reduce the accuracy of classification models. To address these problems, various data augmentation techniques are used to expand the dataset and improve model performance. Traditional techniques include rotation, scaling, flipping, and noise addition, while deep learning-based methods, such as GANs and variational autoencoders (VAEs), generate realistic synthetic images (Siddhartha, 2020).

A significant gap that has been identified in Schwartz and Weintraub (2021) is the absence of gender-specific analysis, which limits the understanding of how men and women may respond differently to AD treatments. It has also identified the limited use of biomarkers to distinguish AD, which further reduced diagnostic accuracy. Addressing these shortcomings emphasized the need for more robust approaches to improve the reliability of predictive models for AD research. Some studies show uneven performance of recent DL models with respect to gender and ethnicity for neurodegenerative disease. In AD classification, these studies show inconsistent and heterogeneous results, indicating that there are several factors that have not been fully explored (Hammonds et al., 2024). The aforementioned discussion highlights the key research gaps and outlines the following objectives to further this research study.

• To identify an effective multi-stage classification model for early diagnosis of Alzheimer's disease.

• To find an effective gender-based Alzheimer's disease prediction model.

• To compare the results of the proposed model with state-of-the-art approaches.

The research article is distributed in the following sections, where related work is discussed in Section 2, while research methodology is elaborated in Section 3. Results and discussions are presented in Section 4, while the conclusion and future directions are covered in Section 5.

2 Related work

Previously, multiple studies had been carried out for the detection of Alzheimer's based on various factors like age, gender, and biomarkers. For better understanding, potential research has been collected and discussed in the following section.

Different DL and ML algorithms typically require huge data to train models effectively, to achieve accurate results. A 3D CNN model was trained on a comparatively large dataset, ABIDE, in order to address the challenge of limited AD-specific fMRI data. Furthermore, this model was fine-tuned on a small ADNI subset, which achieved an accuracy of 76.56%. This output was 30% higher than without TL. It shows the effectiveness of the proposed model on limited AD data (Warren and Moustafa, 2025).

Bolla et al. (2023) compared multiple ML algorithms for detecting MCI. The rs-fMRI scans from a regional Hungarian cohort and ADNI are utilized by the researchers. They had studied three different brain activity measures from these scans: intrinsic connectivity to evaluate how brain areas connect and communicate, local correlation to evaluate how nearby brain areas show similar activity, and fractional amplitude of low-frequency fluctuations to assess the strength of natural low-speed brain signals. Furthermore, five feature selection algorithms were utilized to select important features from these three measures for their analysis. SVM achieved an accuracy of 87% on the local dataset while attaining 90% accuracy on the ADNI dataset. Random forest, on the other hand, produced more consistent results by producing 84% and 82% accuracy on the local and ADNI datasets, respectively.

Feature selection plays a crucial role in managing the high-dimensional and complex data linked with AD. The BDA acts as an efficient feature selection strategy, inspired by dragonfly swarming dynamics to identify the most informative biomarkers. By optimizing feature subsets, BDA improves classification accuracy, minimizes redundancy, and enhances the performance and reliability of DL based AD diagnosis models (Zhang et al., 2021).

This has become increasingly evident that a combination of different classifiers with multiple feature extractors produces improved performance. In a study, conversion from MCI to AD by combining sMRI and rs-fMRI neuroimages from the ADNI repository is analyzed. The researchers compared different ML methods for analyzing data based on graph theory with different classifiers, including random forest, KNN, and AdaBoost, to evaluate which classifier performs better. The integration of SVM with graph theory achieved the best accuracies: 84.71% for MCIc (converted) vs. MCInc (non-converted) and 89.8% from MCIc vs. AD (Sun et al., 2022).

The researchers utilized the benefits of advancement in ML methods in studying the early diagnosis of AD and to predict if MCI will progress to AD or remain stable. Zhou et al. (2024) adopted fMRI images from the ADNI dataset and integrated them with a brain atlas, Automated Anatomical Labeling (AAL), to construct functional connectivity. GAN had been employed to create more training data by generating new, realistic images. To analyze temporal changes in data, researchers used multi-layer long short-term memory (LSTM), a variant of NN that is best fit for temporal space analyses. The model achieved an accuracy of 93.5% for the classification of AD patients apart from healthy people, and 75.5% for distinguishing sMCI (stable) from pMCI (progressed).

In Chattopadhyay et al. (2024), the researchers evaluated whether combining genetic data with fMRI scans could improve the results. Multiple DL models used in earlier researches are explored. The rs-fMRI scans from the ADNI dataset were utilized for this research. The goal was to test whether integrating genetic data could enhance the predictive accuracy of the model. The evaluation, however, demonstrated that the inclusion of genetic data did not show any improvement. The model achieved an accuracy of 92.8%, indicating that the genetic information did not contribute to better performance compared to using imaging features alone.

ResNet-50 has become a crucial DL model for AD classification due to its powerful residual learning and deep feature extraction capabilities. Its ability to capture subtle and fine-grained structural brain changes enhances diagnostic accuracy and reliability. When integrated with advanced frameworks, ResNet-50 significantly improves accuracy across AD, MCI, and CN classification tasks, which demonstrates its clinical potential for early diagnosis, disease monitoring, and personalized treatment in AD research (Zhou et al., 2025).

The researchers aimed to predict amyloid positivity (Aβ+) in AD, MCI, and CN (Turrisi et al., 2024). They compared several ML models, including logistic regression, XGBoost, and a shallow artificial neural network. They also evaluated several DL approaches, such as 2D CNNs, 3D CNNs, CNN-based transfer learning, and 3D vision transformer, to evaluate their effectiveness in AD prediction. The study utilized two neuroimaging modalities, MRI and PET, from the ADNI repository. A total of 1847 subjects were included. The result depicted that the DL models trained on MRI data achieved promising performance with a balanced accuracy of 77.11%.

ResNet-50 has emerged as a widely adopted learning architecture in AD classification. In a study (Han, 2025), ResNet-50 has been effectively integrated with vision transformers to enhance both local and global feature representations from brain MRI scans. The research shows remarkable accuracy in classifying AD stages, underscoring ResNet-50 significance in improving early diagnosis and aiding clinical decision-making.

In this study, a new hybrid model called 3D-CNN-VSwinFormer was created to detect AD at an early stage. This model used a 3D Convolutional Neural Network with an attention module to focus on important brain features. MRI scans from the ADNI dataset are utilized to test this model. The proposed model demonstrated remarkable performance, achieving an accuracy of 92.92% and an area under the curve of 0.9660 in distinguishing AD patients from CN individuals (Wang et al., 2023).

To make the diagnosis better, the study (Zamani et al., 2022) tested five different 3D-CNN models that vary in the number of convolution layers they have. The researchers also used three types of augmentation: zooming, shifting, and rotating to create more training data to improve the model performance. The researcher utilized MRI scans from the ADNI dataset. The model with 8 convolution layers achieved an accuracy of 87.21%, showing excellent performance of the proposed model.

Toga et al. (2024) utilized a multimodal Mamba classifier to fuse PET scans with MRI by using a Pixel-Level Bi-Cross Attention mechanism to make a prediction. To overcome the high cost and limited access to PET scans, the researchers synthesized PET scans from MRI. An accuracy of 90.78% of AD prediction is achieved in this study by utilizing two datasets, OASIS and ADNI.

Deep neural networks are top-notch at separating Alzheimer's from healthy brain MRIs, but their complex design makes them hard to interpret. To fix that, Rivera Mindt et al. (2024) compared three heatmap methods: Layer-wise Relevance Propagation, Integrated Gradients, and Guided Grad-CAM by measuring how well each overlapped with a trusted meta-analysis map. All three outperformed traditional SVM-based brain maps, and Integrated Gradients showed the closest match.

Selecting the most effective parameters is a major challenge in neuroimaging research. This study proposes an optimization method using five evolutionary algorithms. To measure functional connectivity, rs-fMRI scans from the ADNI dataset are used in this study. Functional connectivity was analyzed using graph measures, resulting in 1,155 parameters. The evolutionary algorithms selected a subset of parameters for classification. A two-layer ANN was used for classification into healthy control and early mild cognitive impairment. The model achieved an accuracy of 94.55%, which shows the strength of rs-fMRI and evolutionary optimization methods (Toga et al., 2024).

Collectively, these recent studies from 2022 onward illustrate a dynamic and rapidly evolving landscape of ML techniques applied to ADNI fMRI data, leveraging hybrid DL, transfer learning, evolutionary optimization, multimodal fusion, sophisticated feature selection, and novel architectures such as transformers. These approaches significantly push forward the early and accurate detection of Alzheimer's disease and its prodromal stages, offering promising routes toward clinical translation.

3 Methodology

The most common dataset repositories available for the detection of Alzheimer's are ADNI, Kaggle, ABIDE, and OASIS. The ADNI study was initiated in 2004, and its multiple variants of the dataset are available (Turrisi et al., 2024). The different variants of the ADNI dataset are based on differences in the group of people and updates in the data collected over time. These changes occur due to new scientific findings and improvements in technology.

3.1 Dataset

This study utilized the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, which by mid-2024 had contributed to over 6,000 peer-reviewed studies worldwide. Over its two decades of operation, ADNI has created more than 47,000 user accounts, with about 26,000 of them currently active, which reflects its global research and ongoing scientific relevance (Alatrany et al., 2024). The ADNI has progressed through several phases to better understand how Alzheimer's disease develops, using brain imaging and data from various groups of people. The first phase, ADNI-1 (2004–2009), gathered baseline data from people with Alzheimer's, healthy individuals, and those with MCI, using scans like DTI, MRI, and PET. ADNI-GO (2009–2011) built on this by focusing on people with early MCI and added functional MRI (fMRI) scans. In ADNI-2 (2011–2016), the study expanded to include more groups, such as late MCI and people with subjective memory complaints, while continuing to collect different types of brain scans to better track Alzheimer's progression. ADNI-3 (2016–2022) kept the same participant groups and scans but improved image quality by using more advanced 3T MRI machines. The latest phase, ADNI-4 (2022 to now), aims to include the underrepresented population through remote cohorts and digital follow-ups. It continues to collect detailed imaging data, enabling researchers to gain deeper insight into Alzheimer's disease across different populations. The ADNI-4 dataset contains three classes: MCI, CN, and AD (Sánchez et al., 2025; Borawar and Kaur, 2023). Representative samples of fMRI scans from the ADNI-4 dataset are represented in Figure 2.

Figure 2
MRI brain scans in three panels labeled (a), (b), and (c). Each panel contains nine axial view images arranged in a three-by-three grid. The scans show variations in brain structures, including clear delineations of the cerebral cortex and ventricles, on a black background.

Figure 2. fMRI scans from the ADNI-4 dataset. (a) Cognitive normal. (b) Mild cognitive impairment. (c) Alzheimer's disease (AD).

The distribution of participants in this study by their diagnostic categories and gender depicts that the total population of this study is 430, downloaded from https://ida.loni.usc.edu on 21 March 2025, where 175 are male participants (40%) and 255 are female participants (59%). There are 264 CN participants, 97 are male participants (22% of the total sample), and 167 are female participants (38%), making up 61% of all participants. Among the 131 MCI, 60 are male participants (13%) and 71 are female participants (16%), representing 30% of the total population. The AD group has 35 participants, with 18 male subjects (4%) and female subjects (3%), accounting for 8% of all subjects. In the dataset, female participants constitute a larger portion of the sample across all three categories. Among these categories, CN is the biggest group. Figure 3 depicts the graphical distribution of fMRI scans from the ADNI-4 dataset. An overview of the sample composition used in this research is shown in Table 1.

Figure 3
Bar chart showing the distribution of participants by category and gender from the ADNI-4 dataset. Categories include CN, MCI, and AD. CN: 97 males, 167 females. MCI: 60 males, 71 females. AD: 18 males, 17 females. Blue bars represent males, orange bars represent females.

Figure 3. Graphical distribution of fMRI scans from the ADNI-4 dataset.

Table 1
www.frontiersin.org

Table 1. Distribution of categories and gender.

3.2 Data balancing

Dataset size and class imbalance remain significant challenges, hindering model generalizability and the development of personalized treatment plans. These limitations highlight the need for better model design and richer data integration to improve prediction performance in AD (Raeisi et al., 2025; Wang, 2025). So, limited dataset size and class imbalance while working with the ADNI-4 dataset were the core challenges. To address both challenges, a deep learning-based augmentation technique, DCGAN, to generate novel fMRI scans has been implemented. It was conducted to address gender-based stratification with the ResNet-50 training process. Initially, the dataset was imbalanced between genders within each class. After applying DCGAN-based augmentation, AD [50.0% male individuals, 50% female individuals, CN (49.9% male individuals, 50.1% female individuals), and MCI (50.0% male individuals, 50.0% female individuals)] achieved balanced gender distribution, ensuring fair representation and reducing bias in ResNet-50 model training. The dataset originally contained 35 AD, 264 CN, and 250 AD. Utilizing DCGAN, 229 AD, 96 CN, and 119 MCI scans were generated, yielding a balanced dataset. The dataset was split at the subject level to avoid overlap between sets, with 70% of subjects used for training, 15% for validation, and 15% for testing. All image slices from the same subject were kept in the same set. The generator learned the feature distribution of training data through transposed convolutional layers and expanded them into latent vectors into high-dimensional volumes. During training, the model demonstrates significant convergence, effectively capturing the characteristics of fMRI while maintaining anatomical plausibility in generated images. The parameter settings of the GCGAN and their corresponding values are provided in Table 2.

Table 2
www.frontiersin.org

Table 2. Hyperparameter settings for DCGAN.

3.3 Proposed DRGN model

The framework consists of four successive phases; preprocessing involves a comprehensive evaluation of fMRI sans form ADNI-4 dataset and the generation of synthetic images using an augmentation framework. Feature extraction is performed using ResNet-50, followed by BDA, generating feature sets of 100, 250, and 450. Finally, five ML classifiers were applied to ensure systematic evaluation and progression. The preprocessing phase involves removing non-brain tissue, commonly referred to as skull stripping, correcting head motion artifacts, standardizing anatomical alignment, and applying spatial smoothing to produce uniform, high-quality input data for analysis. As the fMRI scans in the ADNI-4 dataset exhibit class imbalance, a GAN was employed to generate additional synthetic fMRI data for the underrepresented class. The GAN develops a balance among classes of AD and reduces predication bias (Alshinwan et al., 2021). The diminution process protects important neurological patterns for categorization while eliminating lazy data. The proposed model uses multiple ML classifiers to process features extracted by ResNet-50 and features selected by BDA. The model established an affiliation between AD, MCI, and CN. ML algorithms are very well-suited for handling high-dimensional data distribution, such that found in fMRI neuroimaging scans. The ability of ML algorithms to process complex data to find patterns and relationships that might be difficult to distinguish otherwise. The proposed methodology of AD detection is illustrated in Figure 4.

Figure 4
Flowchart illustrating a machine learning pipeline for brain image classification. The process begins with preprocessing using fMRI scans labeled as CN, MCI, and AD, followed by augmentation using GANs. In feature extraction, a complex neural network architecture processes images. Features are selected using the Binary Dragonfly Algorithm, yielding 100, 250, and 450 features. The classification uses various machine learning classifiers, highlighting the Fine KNN. The output shows a clustered data visualization. Sections are labeled as (a) Preprocessing, (b) Feature Extraction, and (c) Feature Selection & Classification.

Figure 4. Proposed methodology for the detection of Alzheimer's disease. (a) Preprocessing. (b) Feature extraction. (c) Feature selection & classification.

ResNet-50: A 50-layer deep convolutional neural network architecture called ResNet-50 was developed by Microsoft Research in 2015. One of the key features of ResNet-50 is a residual connection, which is its ability to effectively train very deep networks by addressing the vanishing gradient problem (Cui et al., 2020). The residual architecture has been empirically proven to significantly improve generalization and feature representation, resulting in improved diagnostic accuracy and robustness in neuroimaging. The architecture consists of an initial convolutional layer followed by four residual blocks. Each block has a bottleneck design with three convolutional layers: a 1 × 1 convolution for dimensionality reduction, a 3 × 3 convolutional layer for spatial feature extraction, and another 1 × 1 convolutional layer to restore dimensions. After these, average pooling and a fully connected layer produce the final classification (Chantar et al., 2021). The capability of ResNet50 to handle spatial and temporal aspects of the data enhances the framework's performance in identifying AD-related changes in brain function (Ali et al., 2024).

The residual block, which is the main building unit of ResNet, simply adds the original input to the learned residual, making training easier. The mathematical expression of the residual block is shown in Equation 1, whereas after applying the activation function to the residual output, the next layer's input obtained is shown in Equation 2.

yl=h(xl)+f(xl,Wl)    (1)
xl+1=f(yl)    (2)

where xl denote the input feature, Wl weights and biases, h(xl) represents identity mapping, f(xl, Wl) represents the residual function, and f is ReLU function.

By stacking multiple residual blocks, the final feature representation at the layer l is shown in Equation 3.

xl=xl+i=lL-1F(xi,wi)    (3)

where xl indicates a series of matrix-vector products, and L represents the summation of the output of residual functions.

In ResNet, the connections allow gradients to flow directly during backpropagation. This prevents them from vanishing and makes training deep networks more effective. The feature xl can be described as a series of matrix-vector products, where L represents the summation of the outputs of all preceding residual functions. This means that rather than learning an unreferenced function, each residual block learns a residual function to refine the input features collectively. The loss equation functions as E from the chain rule of backpropagation, as in Equation 4.

δδEδxl =δEδ xL δ xLδxl=Eδ xL (1+δδxl i=lL-1(xi, wi)) (4)    (4)

where δEδxl allows gradient to flow back without touching weights, δEδxl (δδxl i=lL-1(xi, wi)) progress through weight layers, and δδxl ensure the information is propagated back to any layer. Equation 4 also depicts that it is less likely for the gradient δEδxl  to be canceled, it is so the term δδxl i=lL-1(xi, wi), cannot constantly be equal to −1. It shows that the gradient does not vanish when the weights are small.

The ResNet-50 is used to extract deep discriminative features from ADNI-4 fMRI data, enhancing Alzheimer's classification accuracy. Figure 5 provides a schematic representation of the ResNet-50 architecture. Its hierarchical convolutional layers capture complex spatial patterns linked to disease progression. The model was trained using stochastic gradient descent with momentum (SGDM) optimizer with a learning rate of 0.01, batch size of 128, momentum of 0.9, L2 regularization of 1 × 10−4, and 30 epochs. These parameters ensured stable convergence and improved classification performance across AD, MCI, and CN groups. To help the model adapt to fMRI data, balance learning speed, enhance feature extraction, and improve generalization, the parameter setting and their values are summarized in Table 3.

Figure 5
Diagram of a neural network architecture including layers with specific configurations: input, convolutional, ReLU, batch normalization, and fully connected layers. It shows residual blocks (res2, res3, res4, res5) in shades of blue, orange, pink, and green. Each block details convolutional operations, batch normalization, ReLU activation, and connections. The tables at the bottom provide category data and corresponding fused features for General, Male, and Female categories.

Figure 5. ResNet-50 architecture.

Table 3
www.frontiersin.org

Table 3. Hyperparameter settings for ResNet-50.

Binary Dragonfly Algorithm: A nature-inspired optimizer, Binary Dragonfly Algorithm (BDA) is a variant of the dragonfly algorithm, a swarm optimization technique introduced by Mirjalili in 2015. BDA addresses multiple optimization problems, including discrete, single-objective, and multi-objective problems (Sadiq et al., 2021). It is a nature-inspired algorithm based on the swarming behavior of dragonflies. The key concept is that each dragonfly represents a possible solution, which is based upon its movement, which is guided by behaviors like alignment, separation, cohesion, attraction to food, and avoidance of enemies. Each solution is evaluated through a fitness function, e.g., classification accuracy in feature selection (Bhutta et al., 2024).

Alignment describes associating the velocity of an individual dragonfly with respect to its neighboring dragonflies. The mathematical equation of alignment is written as Equation 5.

Ai=  N j=1 VjN    (5)

where the velocity vector is referred to as Vj.

Separation refers to the way dragonflies avoid collision. Mathematically, it can be written as in Equation 6.

Si=-j=1N X-Xj    (6)

where X denotes the current search agent, while Xj refers to j-the neighbor of X. N is the number of neighbors.

Cohesion is sticking together with the average position of its neighbors. Mathematically, it is referred to as in Equation 7.

Ci=  j=1NXjN-X    (7)

Attraction to food is a natural pull toward the direction of the food location. This drift in the swarm toward food is shown in Equation 8.

Fi=Flocation -X    (8)

where Flocationis the food source location, while X is the current member.

Distraction from the enemy is the desire of the dragonfly to flee from the enemy. Mathematically, it is defined as in Equation 9.

Ei=Elocation +X.    (9)

In this equation Elocationis the position of the enemy, and X is the position of the current member (Zhu et al., 2022).

Several variants of ML classifiers, including Quadratic SVM, Cubic SVM, Wide NN, Weighted KNN, and Fine KNN, have been employed for neuroimaging classification tasks, as shown in Table 4.

Table 4
www.frontiersin.org

Table 4. Variants of classifiers employed in different AD detection studies.

Fine K-Nearest Neighbor (Fine KNN) is an optimized variant of KNN designed to get improved predictive accuracy. This model incorporates fine-tuned hyperparameters, particularly the number of neighbors (k) and fine distance weighting strategies. In this study, Fine-KNN is applied to the selected feature set to distinguish between different classes of AD, providing a strong baseline for comparison.

Weighed KNN is a refined variant of the conventional KNN algorithm, designed to enhance classification performance by considering greater importance of nearby neighbors. This weighting strategy reduces the influence of noise and outliers. The studies depict that weighted KNN effectively captures variations in the brain, which are critical in distinguishing between AD, MCI, and CN when utilizing fMRI scans.

Wide-Neural Network (Wide-NN) employs a large number of hidden units in fewer layers, enabling the model to learn diverse feature interactions without deep stacking. In the case of fMRI images, this architecture is well-suited, as wide representations can capture complex spatial variation in brain activity. Moreover, it helps reduce overfitting and ensures better generalization across different stages of AD.

Cubic SCM: it uses a cubic kernel to map input data into higher dimensions, which makes it effective for handling complex, non-linear patterns. Cubic SVM can capture complicated relationships in neuroimaging features, making it well-suited for separating overlapping different classes where linear separation is insufficient.

Quadratic SVM: it offers a balance between linear and highly non-linear classifiers by applying a polynomial kernel of degree two. It works effectively for moderately complex decision boundaries, making it useful for distinguishing among various classes.

4 Results and discussion

In this study, a robust framework containing DCGAN + ResNet-50 + BDA + SVM evaluated information from the 430 preprocessed fMRI scans distributed across ADNI-4 global sites. While conventional 3D-CNN models demonstrated strong performance, the proposed GRDN model achieved an overall accuracy of 88.77%. When evaluated by gender, the model attained accuracies of 94.8% for male individuals and 93.0% for female individuals. By combining ResNet-50 with BDA and SVM, the model effectively extracted efficient spatiotemporal features and established a robust decision boundary, mitigating the typical challenges of analyzing neuroimaging data independently. Performance metrics such as precision, F1-score, recall, specificity, and accuracy confirmed the reliability of this proposed framework. Furthermore, residual learning and skip connections resolved the vanishing gradient problem, which means neural networks can now be trained much deeper. This helps the networks to learn and understand the data in a better and more detailed way, improving overall performance. By passing information directly from earlier layers to later, these techniques help the network keep important signals intact during training. This makes learning easier, improves overall performance, and allows the network to capture the data more efficiently. The framework faces challenges because brain scans are not always uniform. This difference depends on the hospital, the scanner model, or the chosen sequence settings. These inconsistencies in acquisition methods, scanner settings, resolution, and quality made it harder to build a framework that generalizes well across neuroimages. Additionally, the types and numbers of patient samples vary among various classes. These differences made it hard for the system to perform consistently and accurately every time because it struggled to adjust and work well with all the different types of data.

4.1 Experimental setup

The experimental setup comprises 430 fMRI scans and is conducted on a system running Windows11 Enterprise Edition, equipped with a Ryzen7 7700 processor, 32 GB RAM, 1 TB NVMe, and an NVIDIA RTX-3070ti GPU. MATLAB R2024a, a full-featured programming environment, was used for simulation and experimentation. This study employed ResNet50 for feature extraction, BDA for feature selection, and 5 classification algorithms, namely SVM, KNN, and NN, were used to predict AD. The extracted features for the general group, male group, and female group consisted of 100, 250, and 450 features, and are presented in Table 5. Architectural details of ResNet50 and parameters of BDA, used during the optimization process, are shown in Tables 6, 7, respectively. Among five classification algorithms, the highest accuracy of 94.8% was obtained using the Fine KNN classifier, as shown in Table 10.

Table 5
www.frontiersin.org

Table 5. Extracted features using ResNet-50.

Table 6
www.frontiersin.org

Table 6. Architectural details of ResNet-50.

Table 7
www.frontiersin.org

Table 7. Parameters of the binary dragonfly algorithm (BDA) and their dynamic value ranges used during the optimization process.

The BDA was tuned with a population size of 100 and 100 iterations, where the inertia weight (w) decreased from 0.9 to 0.4, the rate from 0.01 to 0, and behavioral weights were randomized as s, a, c ε [0, 2 × rate], f ε [0. 2], and e = rate. The stopping criterion was reaching the maximum iterations or observing no improvement in the fitness function. Parameters of BDA, used during the optimization process, are shown in Table 7.

4.2 Performance metrics

The proposed GRDN model accessed the ADNI-4 dataset using a confusion matrix, widely regarded as the benchmark for evaluation. A confusion matrix contains true positives or true negatives for correct classification and false positives and false negatives for incorrect classification. Key performance indicators used to assess the model are presented in Table 8.

Table 8
www.frontiersin.org

Table 8. The key performance indicators (KPIs) used to assess the performance of the model.

4.3 Classification results

In this study, a comprehensive framework for AD classification, focusing on both early diagnosis and gender-specific prediction, is evaluated. The proposed approach integrates DCGAN to address class imbalance, ResNet-50 for feature extraction, and BDA for feature selection. Extracted features were then classified using five ML classifiers, and the performance of the proposed model is compared with the state-of-the-art methods to assess its effectiveness.

4.3.1 General analysis of the ADNI-4 dataset

The proposed GRDN model uses ResNet-50 for feature extraction, BDA for feature selection, and then applies different machine learning classifiers for multi-class classification of AD, MCI, and CN. The results demonstrate how classifier choice impacts the general classification performance of these three classes. With 100 selected features, performance was moderate, but as the feature set increased to 250 and 450 features, both accuracy and robustness improved. Results in Table 9 show Fine KNN consistently outperformed others, reaching the highest accuracy of 88.8% (PER = 88.49, REC = 89.42, SPEC = 94.39) with 450 features. This shows not only a strong ability to detect true cases (high recall, fewer false negatives) but also an excellent ability to avoid mislabeling healthy subjects as diseased (high specificity, fewer false positives). Cubic SVM followed closely, achieving 88.4% accuracy with balanced precision and recall, meaning it maintained a stable trade-off between false positives and false negatives. On the other hand, Weighted KNN and the Wide Neural Network achieved accuracies of 84.9% and 83.4%, respectively, but their lower recall and specificity values indicate higher rates of both false positives (wrongly identifying CN as diseased) and false negatives (failing to detect AD or MCI). The Quadratic SVM performed the weakest (81.7% accuracy), producing the most frequent misclassifications. Overall, the analysis highlights that false positives are more harmful clinically because they may cause unnecessary anxiety or treatment for healthy subjects, while false negatives risk missing actual patients. Our results suggest that Fine KNN and Cubic SVM strike the best balance, with Fine KNN slightly ahead in accuracy and error control, making them more reliable for multi-class Alzheimer's classification. The confusion matrices generated using 100, 250, and 500 selected features are shown in Figures 68, respectively.

Table 9
www.frontiersin.org

Table 9. Classification performance for Alzheimer's disease detection using ResNet-50 combined with BDA and different ML algorithms.

Figure 6
Five confusion matrices labeled (a) to (e) compare true and predicted classes: AD, CN, and MCI. Each matrix shows different numerical distributions for predicted outcomes against true classes, highlighting classification performance.

Figure 6. Confusion matrices using 100 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Figure 7
Five confusion matrices labeled (a) to (e) display predicted vs. true classes for AD, CN, and MCI. Each matrix has colored squares representing different numeric values for each class intersection. The matrices illustrate classification performance for these conditions.

Figure 7. Confusion matrices using 250 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Figure 8
Five confusion matrix charts labeled (a) to (e), comparing true classes (AD, CN, MCI) against predicted classes. Each matrix contains numerical data with shaded cells indicating correct predictions. The matrices show variation in prediction accuracy across different classes in different scenarios or models.

Figure 8. Confusion matrices using 450 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

4.3.2 Gender-based analysis of ADNI-4 dataset

The dataset was further analyzed by dividing subjects into male and female groups to study gender-based classification performance.

4.3.3 Male group

For the male group, the multi-class classification results using different feature dimensions (100, 250, and 450 features) show consistent improvements as the number of selected features increases. Table 10 shows that, with 100 selected features, Fine KNN achieved the best performance with an accuracy of 92.4%, precision of 92.34%, and recall of 92.74%, closely followed by Cubic SVM with an accuracy of 89.1%. Weighted KNN, Wide Neural Network, and Quadratic SVM showed moderate performance, with accuracies ranging from 80.2% to 88.1%. When the feature dimension was increased to 250, the Cubic SVM model emerged as the top performer with 94% accuracy, 93.96% precision, and 94.28% recall, indicating its strong ability to balance false positives and false negatives. Fine KNN also remained competitive with 92% accuracy, while Wide NN and Weighted KNN achieved around 89–90% accuracy. Finally, with 450 selected features, the best performance was observed from Fine KNN, which reached an impressive 94.8% accuracy, 94.71% precision, and 95.03% recall, followed closely by Cubic SVM with 93.5% accuracy. Weighted KNN and Wide NN also performed well, crossing 89–90% accuracy, while Quadratic SVM achieved 88.4% accuracy. Figures 911 represent the confusion matrices obtained using 100, 250, and 500 selected features, respectively.

Table 10
www.frontiersin.org

Table 10. Classification performance for Alzheimer's disease detection in the Male group using ResNet-50 combined with BDA and different ML algorithms.

Figure 9
Five confusion matrices labeled (a) to (e), showing classifications of three classes: AD, CN, and MCI. Each grid shows different numbers representing correctly and incorrectly classified instances. Matrix (a) displays high correct classifications in AD and CN. Matrices (b) and (c) have similar patterns with slight variations. Matrix (d) shows increased misclassifications for CN in AD and MCI. Matrix (e) has higher misclassifications for CN and MCI.

Figure 9. Confusion matrices for gender-wise classification (Male group) using 100 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Figure 10
Five confusion matrices labeled (a) to (e) show classification results for categories: AD, CN, and MCI. Each matrix compares the true class on the y-axis against the predicted class on the x-axis. Numbers within the matrices indicate the count of predictions for each category, with varying shading to highlight different areas. Each matrix shows different distributions of accuracy and misclassification rates across categories.

Figure 10. Confusion matrices for gender-wise classification (Male group) using 250 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Figure 11
Five confusion matrices labeled (a) to (e) for a classification task involving classes AD, CN, and MCI. Each matrix shows true classes versus predicted classes, with numerical values indicating the frequency of predictions. The matrices include color coding to highlight differences, with blue and orange shades.

Figure 11. Confusion matrices for gender-wise classification (Male group) using 450 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Overall, these results suggest that increasing the number of selected features leads to more robust and stable classification performance in the male group, with Fine KNN and Cubic SVM consistently outperforming other models. High precision and recall values indicate that these models not only detect true positives effectively but also minimize false positives, making them reliable for distinguishing among AD, MCI, and CN subjects in male participants.

4.3.3.1 Female group

The classification results for the female group, in a multi-class framework using the proposed model of ResNet50 for feature extraction, BDA for feature selection, and multiple ML classifiers show strong performance across different feature set sizes (100, 250, and 450). These results highlight how increasing the number of selected features improves the overall accuracy and robustness of the models for distinguishing AD, MCI, and CN. With 100 selected features, Fine KNN achieved the best performance, with an accuracy of 88.7%, precision of 88.62%, and recall of 89.43%, showing its reliability in female-based classification as shown in Table 11. Cubic SVM also performed strongly (ACC = 87.2%), while Weighted KNN and Wide NN had moderate results (ACC = 84.1% and 82.5%). Quadratic SVM showed the lowest accuracy (79.4%), reflecting challenges in handling limited feature dimensions. When the feature set was expanded to 250 features, the performance improved notably. Cubic SVM emerged as the top-performing model with an accuracy of 90.7%, precision of 90.84%, and recall of 90.89%, showing balanced generalization across classes. Fine KNN also performed comparably well (ACC = 90.2%), while Wide NN and Weighted KNN achieved reasonable accuracies (86.8% and 85.9%, respectively). Quadratic SVM followed closely with an accuracy of 85.4%. These results suggest that 250 selected features provide a richer and more discriminative representation for gender-based classification. With 450 features, the classification performance reached its peak. Cubic SVM achieved the highest accuracy of 92.9% along with strong precision (93.03%) and recall (93.06%). Fine KNN also performed very well with an accuracy of 91.5%, followed by Wide NN (ACC = 89.1%) and Quadratic SVM (ACC = 88.5%). Weighted KNN, while slightly lower, still achieved a solid 87.4% accuracy. These results confirm that larger feature sets enhance the discriminative ability of the proposed ResNet50-BDA pipeline in female subjects. In terms of false positives and false negatives, models with higher recall, such as Fine KNN and Cubic SVM, minimize false negatives, ensuring fewer AD or MCI cases are missed. High specificity, especially above 95% in Cubic SVM and Fine KNN (with 250–450 features), reduces false positives, preventing CN subjects from being misclassified as diseased. Overall, the results demonstrate that the proposed ResNet50-BDA framework performs robustly in gender-based classification. In the female group, the models achieved up to 92.9% accuracy, confirming the framework's effectiveness for distinguishing AD, MCI, and CN with different feature selection scales. Increasing the number of features significantly enhances classification performance, with Cubic SVM emerging as the most reliable classifier. The performance visualization through confusion matrices, based on 100, 250, and 500 selected features, is illustrated in Figures 1214.

Table 11
www.frontiersin.org

Table 11. Classification performance for Alzheimer's disease detection in the Female group using the proposed ResNet-50 combined with BDA and different ML algorithms.

Figure 12
Five confusion matrices labeled (a) to (e) display classification comparisons between true and predicted classes for AD, CN, and MCI. Each matrix shows numerical values in a grid format, with rows representing true classes and columns representing predicted classes. The matrices are color-coded to differentiate between correctly and incorrectly classified instances.

Figure 12. Confusion matrices for gender-wise classification (Female group) using 100 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Figure 13
Five confusion matrix diagrams labeled (a) to (e), comparing true classes (AD, CN, MCI) with predicted classes. Each matrix has nine cells with numbers representing classification counts, depicting variations in prediction accuracy and errors across models.

Figure 13. Confusion matrices for gender-wise classification (Female group) using 250 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

Figure 14
Five confusion matrices labeled (a) to (e) show comparisons of true and predicted classes for AD, CN, and MCI. Each matrix displays numbers in sections: (a) AD has 6592 correctly predicted, (b) 6688, (c) 6441, (d) 6676, (e) 6432. CN and MCI sections have similar layouts, showing correct predictions and misclassifications with varying figures.

Figure 14. Confusion matrices for gender-wise classification (Female group) using 450 selected features from the ADNI-4 dataset: (a) Fine KNN, (b) Cubic SVM, (c) Weighted KNN, (d) Wide NN, and (e) Quadratic SVM.

4.4 Comparison with state-of-the-art algorithm

Table 12 presents a comparative summary of recent ML classifiers applied to AD using neuroimaging data from prominent datasets such as ADNI, ABIDE, and OASIS. Notably, 3D-CNN has revealed strong performance, with accuracy ranging from 87.21% on MRI data to 92.92% on the ADNI MRI dataset and up to 92.80% when trained on fMRI data. ANN applied to MRI exhibited relatively lower accuracy of 77.10%, indicating variability in model effectiveness across architectures. More complex hybrid models such as the combination of GIG-ICA with 3D CNN attained accuracy of 76.56% on a combined ADNI and ABIDE fMRI dataset. This reflects challenges related to data heterogeneity and limited sample sizes. Multimodal approaches like the Mamba classifier integrating MRI and PET data from ADNI and OASIS reached an accuracy of 90.78%, highlighting the advantages of combining imaging modalities. The model trained on the male group (450 features) achieved superior performance and is evaluated against state-of-the-art models. The classification approach focusing on the male subgroup with rs-fMRI data from the ADNI-4 dataset, employing a ResNet-50 backbone combined with the BDA and fine-tuned K-Nearest Neighbors (KNN), achieving the highest accuracy of 94.80%. This shows that a gender-specific model with DL can accurately predict using fMRI neuroimaging.

Table 12
www.frontiersin.org

Table 12. Comparison with state-of-the-art algorithm.

5 Conclusion

This study proposed a GRDN framework to address a critical research gap by exploring gender-based analysis in AD classification using fMRI data taken from the ADNI-4 dataset. Furthermore, the silent progression of AD by emphasizing accurate early diagnosis is examined, which is crucial for timely intervention and improved patient care. The dragonNet50 framework integrates ResNet-50 for deep feature extraction, the BDA for optimal feature selection, and multiple ML classifiers for final classification. The results demonstrated that this approach is effective in distinguishing between AD, MCI, and CN subjects. Subsequently, GAN to generate synthetic images, which helped balance the dataset and improve stability in classification performance, was employed. For general classification across the three classes AD, MCI, and CN, our framework showed strong accuracy, precision, recall, and F1-scores, highlighting its robustness and generalization ability. When analyzing gender-based classification, some differences were observed between female and male groups. For the male group, the FineKNN classifier achieved the best performance, with accuracy values above 94% when using 250 and 450 selected features. Cubic SVM and Weighted KNN also performed competitively, while Wide NN and Quadratic SVM showed moderate results. In the female group, the performance was slightly lower than the male group but still showed promising outcomes. For instance, Cubic SVM reached an accuracy of 92.9% with 450 selected features and 90.7% with 250 features. Weighted KNN and Wide NN performed reasonably well, though not at the level of Fine KNN and Cubic SVM. These findings suggest that gender-specific variations may play a role in classification accuracy and should be explored further in future studies. These results indicate that the model not only distinguishing between the classes effectively but also maintains stability across varying sample distributions. The strong performance across multiple metrics highlights the robustness, reliability, and generalization capability of the proposed framework in handling complex neuroimaging data. The results from this study suggested that our hybrid ResNet-50 + BDA + ML classifier framework provides an efficient and reliable approach for AD, MCI, and CN classification. The integration of GAN for synthetic data generation further strengthened the results by tackling the imbalance issue. The gender-based analysis revealed that male subjects showed slightly better classification accuracy compared to female subjects, indicating the importance of considering biological and demographic factors in neuroimaging studies.

Future research should focus on validating this framework on larger and multi-site datasets, incorporating longitudinal data to track disease progression, and applying explainable AI methods to enhance transparency and clinical trust. Moreover, to improve patient-specific diagnostic accuracy, future healthcare models should be trained on edge devices using federated learning, incorporating neuroimaging data, biomarkers, and region-specific demographic information. This approach ensures data privacy, minimize latency, and enables precise, personalized diagnostics tailored to individual patient profiles.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found at: https://adni.loni.usc.edu/data-samples/adni-data/(ADNI).

Author contributions

MH: Data curation, Formal analysis, Investigation, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. WB: Conceptualization, Formal analysis, Supervision, Writing – review & editing. AJ: Formal analysis, Resources, Visualization, Writing – review & editing. AAA: Validation, Visualization, Writing – review & editing. AA: Data curation, Formal analysis, Project administration, Software, Validation, Visualization, Writing – review & editing, Methodology. MA: Resources, Validation, Visualization, Writing – review & editing. MH: Resources, Software, Visualization, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abuhantash, F., Welsch, R., Finkelstein, S., and AlShehhi, A. (2025). Alzheimer's disease risk prediction using machine learning for survival analysis with a comorbidity-based approach. Sci. Rep. 15:28723. doi: 10.1038/s41598-025-14406-0

PubMed Abstract | Crossref Full Text | Google Scholar

Alatrany, A. S., Khan, W., Hussain, A., Kolivand, H., and Al-Jumeily, D. (2024). An explainable machine learning approach for Alzheimer's disease classification. Sci. Rep. 14:2637. doi: 10.1038/s41598-024-51985-w

PubMed Abstract | Crossref Full Text | Google Scholar

Alayba, A. M., Senan, E. M., and Alshudukhi, J. S. (2024). Enhancing early detection of Alzheimer's disease through hybrid models based on feature fusion of multi-CNN and handcrafted features. Sci. Rep. 14:31203. doi: 10.1038/s41598-024-82544-y

PubMed Abstract | Crossref Full Text | Google Scholar

Ali, R., Arshid, N., Haq, M. I., Nisa, Z. U., Naqvi, S. A. A., Iqbal, M. W., et al. (2024). Diagnosis of pulmonary tuberculosis by posterior-anterior lung X-ray. J. Comput. Biomed. Informatics 7, 184–192.

Google Scholar

Alshinwan, M., Abualigah, L., Khasawneh, A. M., Hamad, H. A., Shehab, M., Elaziz, M. A., et al. (2021). Dragonfly algorithm: a comprehensive survey of its results, variants, and applications. Multimed. Tools Appl. 80, 14979–15016. doi: 10.1007/s11042-020-10255-3

Crossref Full Text | Google Scholar

Beckett, L. A., Saito, N., Donohue, M. C., and Harvey, D. J. (2024). Contributions of the ADNI Biostatistics Core. Alzheimer's Dementia 20, 7331–7339. doi: 10.1002/alz.14159

PubMed Abstract | Crossref Full Text | Google Scholar

Bhutta, M. R., Ali, M. U., Zafar, A., Kim, K. S., Byun, J. H., and Lee, S. W. (2024). Artificial neural network models: implementation of functional near-infrared spectroscopy-based spontaneous lie detection in an interactive scenario. Front. Comput. Neurosci. 17:1286664. doi: 10.3389/fncom.2023.1286664

PubMed Abstract | Crossref Full Text | Google Scholar

Bolla, G., Berente, D. B., Andrássy, A., Zsuffa, J. A., Hidasi, Z., Csibri, E., et al. (2023). Comparison of the diagnostic accuracy of resting-state fMRI driven machine learning algorithms in the detection of mild cognitive impairment. Sci. Rep. 13:22285. doi: 10.1038/s41598-023-49461-y

PubMed Abstract | Crossref Full Text | Google Scholar

Borawar, L., and Kaur, R. (2023). ‘ResNet: Solving vanishing gradient in deep networks,” Proceedings of International Conference on Recent Trends in Computing. Lecture Notes in Networks and Systems, Vol. 600, eds. R. P. Mahapatra, S. K. Peddoju, S. Roy, and P. Parwekar (Singapore: Springers). doi: 10.1007/978-981-19-8825-7_21

Crossref Full Text | Google Scholar

Calderón-Garcidueñas, L. (2022). Common fatal neurodegenerative diseases revisited: beyond age, comorbidities, and devastating terminal neuropathology there is hope with prevention. Front. Neurol. 13:901447. doi: 10.3389/fneur.2022.901447

PubMed Abstract | Crossref Full Text | Google Scholar

Chantar, H., Tubishat, M., Essgaer, M., and Mirjalili, S. (2021). Hybrid binary dragonfly algorithm with simulated annealing for feature selection. SN Comput. Sci. 2:295. doi: 10.1007/s42979-021-00687-5

PubMed Abstract | Crossref Full Text | Google Scholar

Chattopadhyay, T., Ozarkar, S. S., Buwa, K., Joshy, N. A., Komandur, D., Naik, J., et al. (2024). Comparison of deep learning architectures for predicting amyloid positivity in Alzheimer's disease, mild cognitive impairment, and healthy aging, from T1-weighted brain structural MRI. Front. Neurosci. 18:1387196. doi: 10.3389/fnins.2024.1387196

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, Y., Lai, M., and Tao, M. (2024). Evaluating the efficacy and safety of Alzheimer's disease drugs: a meta-analysis and systematic review. Medicine 103:e37799. doi: 10.1097/MD.0000000000037799

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, Z., Chen, K., Li, Y., Geng, D., Li, X., Liang, X., et al. (2024). Structural, static, and dynamic functional MRI predictors for conversion from mild cognitive impairment to Alzheimer's disease: Inter-cohort validation of Shanghai Memory Study and ADNI. Hum. Brain Mapp. 45:e26529. doi: 10.1002/hbm.26529

PubMed Abstract | Crossref Full Text | Google Scholar

Cui, X., Li, Y., Fan, J., Wang, T., and Zheng, Y. (2020). A hybrid improved dragonfly algorithm for feature selection. IEEE Access 8, 155619–155629. doi: 10.1109/ACCESS.2020.3012838

Crossref Full Text | Google Scholar

Danter, W. (2024). Tracing Alzheimer's genetic footprints: a pioneering longitudinal study using artificial intelligence to unravel mutation- driven risks and progression in virtual patients; part 1 – the APOE genotypes. medRxiv. doi: 10.1101/2024.04.02.24305206

Crossref Full Text | Google Scholar

Dubois, B., von Arnim, C. A. F., Burnie, N., Bozeat, S., and Cummings, J. (2023). Biomarkers in Alzheimer's disease: role in early and differential diagnosis and recognition of atypical variants. Alzheimer. Res. Ther. 15:175. doi: 10.1186/s13195-023-01314-6

PubMed Abstract | Crossref Full Text | Google Scholar

Ftoutou, A., Majdoub, N., and Ladhari, T. (2023). “Alzheimer's disease classification using deep learning,” in 2023 IEEE International Conference on Artificial Intelligence and Green Energy (ICAIGE) (Sousse: IEEE), 1–6. doi: 10.1109/ICAIGE58321.2023.10346502

Crossref Full Text | Google Scholar

Goyal, P., Rani, R., and Singh, K. (2024). A multilayered framework for diagnosis and classification of Alzheimer's disease using transfer learned Alexnet and LSTM. Neural Comput. Appl. 36, 3777–3801. doi: 10.1007/s00521-023-09301-6

Crossref Full Text | Google Scholar

Grigas, O., Maskeliunas, R., and Damaševičius, R. (2024). Early detection of dementia using artificial intelligence and multimodal features with a focus on neuroimaging: a systematic literature review. Health Technol. 14, 201–237. doi: 10.1007/s12553-024-00823-0

Crossref Full Text | Google Scholar

Hammonds, S. K., Eftestol, T., Oppedal, K., and Fernandez-Quilez, A. (2024). “Unraveling gender fairness analysis in deep learning prediction of Alzheimer's disease,” in 2024 4th International Conference on Applied Artificial Intelligence, ICAPAI 2024 (Halden: Institute of Electrical and Electronics Engineers Inc), 10541140. doi: 10.1109/ICAPAI61893.2024.10541140

Crossref Full Text | Google Scholar

Han, L. (2025). AD-Diff: enhancing Alzheimer's disease prediction accuracy through multimodal fusion. Front. Comput. Neurosci. 19:1484540. doi: 10.3389/fncom.2025.1484540

PubMed Abstract | Crossref Full Text | Google Scholar

Jiang, S., Yang, S., Deng, K., Jiang, R., and Xue, Y. (2024). Machine learning models for diagnosing Alzheimer's disease using brain cortical complexity. Front. Aging Neurosci. 16:1434589. doi: 10.3389/fnagi.2024.1434589

PubMed Abstract | Crossref Full Text | Google Scholar

Mishra, S., Sharma, V., and Ramya, G. (2024). “Alzheimer's disease prediction using machine learning,” in 2024 7th International Conference on Circuit Power and Computing Technologies (ICCPCT) (Kerala: IEEE), 366–370. doi: 10.1109/ICCPCT61902.2024.10673117

Crossref Full Text | Google Scholar

Mohsen, S. (2025). Alzheimer's disease detection using deep learning and machine learning: a review. Artif. Intell. Rev. 58:262. doi: 10.1007/s10462-025-11258-y

Crossref Full Text | Google Scholar

Raeisi, Z., Mehrnia, M., Ahmadi Lashaki, R., and Abedi Lomer, F. (2025). Enhancing schizophrenia diagnosis through deep learning: a resting-state fMRI approach. Neural Comput. Appl. 37, 15277–15309. doi: 10.1007/s00521-025-11184-8

Crossref Full Text | Google Scholar

Rao, Y. L., Ganaraja, B., Suresh, P. K., Joy, T., Ullal, S. D., Manjrekar, P. A., et al. (2024). Outcome of resveratrol and resveratrol with donepezil combination on the β-amyloid plaques and neurofibrillary tangles in Alzheimer's disease. 3 Biotech 14:190. doi: 10.1007/s13205-024-04034-2

PubMed Abstract | Crossref Full Text | Google Scholar

Rivera Mindt, M., Arentoft, A., Calcetas, A. T., Guzman, V. A., Amaza, H., Ajayi, A., et al. (2024). The Alzheimer's Disease Neuroimaging Initiative-4 (ADNI-4) engagement core: a culturally informed, community-engaged research (CI-CER) model to advance brain health equity. Alzheimer's Dementia 20, 8279–8293. doi: 10.1002/alz.14242

PubMed Abstract | Crossref Full Text | Google Scholar

Sadiq, A., Yahya, N., and Tang, T. B. (2021). “Classification of Alzheimer's disease using low frequency fluctuation of rs-fMRI signals,” in 2021 International Conference on Intelligent Cybernetics Technology and Applications, ICICyTA 2021 (Bandung: Institute of Electrical and Electronics Engineers Inc.), 40–45. doi: 10.1109/ICICyTA53712.2021.9689209

Crossref Full Text | Google Scholar

Sánchez, V., Güven, Ç., Nápoles, G., and Postma, M. Š. (2025). Data augmentation techniques for fMRI data: a technical survey. IEEE Access 13, 66529–66556. doi: 10.1109/ACCESS.2025.3560395

Crossref Full Text | Google Scholar

Schwartz, J. B., and Weintraub, S. (2021). Treatment for Alzheimer disease—sex and gender effects need to be explicitly analyzed and reported in clinical trials. JAMA Netw. Open 4:e2124386. doi: 10.1001/jamanetworkopen.2021.24386

PubMed Abstract | Crossref Full Text | Google Scholar

Siddhartha, B. (2020). Proceedings, 2020 Fifth International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN): November 26-27, 2020, CHRIST (Deemed to be University), Kengeri Campus, Bangalore, India. Bangalore: The Institute of Electrical and Electronics Engineers, Inc.

Google Scholar

Sun, H., Wang, A., and He, S. (2022). Temporal and spatial analysis of Alzheimer's disease based on an improved convolutional neural network and a resting-state fMRI brain functional network. Int. J. Environ. Res. Public Health 19:4508. doi: 10.3390/ijerph19084508

PubMed Abstract | Crossref Full Text | Google Scholar

Toga, A. W., Neu, S., Sheehan, S. T., and Crawford, K. (2024). The informatics of ADNI. Alzheimer's Dementia 20, 7320–7330. doi: 10.1002/alz.14099

Crossref Full Text | Google Scholar

Turrisi, R., Verri, A., and Barla, A. (2024). Deep learning-based Alzheimer's disease detection: reproducibility and the effect of modeling choices. Front. Comput. Neurosci. 18:1360095. doi: 10.3389/fncom.2024.1360095

PubMed Abstract | Crossref Full Text | Google Scholar

van Dyck, C. H., Swanson, C. J., Aisen, P., Bateman, R. J., Chen, C., Gee, M., et al. (2023). Lecanemab in early Alzheimer's disease. New Engl. J. Med. 388, 9–21. doi: 10.1056/NEJMoa2212948

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, D., Honnorat, N., Fox, P. T., Ritter, K., Eickhoff, S. B., Seshadri, S., et al. (2023). Deep neural network heatmaps capture Alzheimer's disease patterns reported in a large meta-analysis of neuroimaging studies. Neuroimage 269:119929. doi: 10.1016/j.neuroimage.2023.119929

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, X. (2025). The analysis of sculpture image classification in utilization of 3D reconstruction under K-means++. Sci. Rep. 15:18127. doi: 10.1038/s41598-025-01949-5

PubMed Abstract | Crossref Full Text | Google Scholar

Warren, S. L., and Moustafa, A. A. (2025). Towards clinical diagnoses: classifying Alzheimer's disease using single fMRI, small datasets, and transfer learning. Brain Behav. 15:70427. doi: 10.1002/brb3.70427

PubMed Abstract | Crossref Full Text | Google Scholar

Xia, X., Wang, Y., and Zheng, J. (2021). COVID-19 and Alzheimer's disease: how one crisis worsens the other. Transl. Neurodegener. 10:15. doi: 10.1186/s40035-021-00237-2

PubMed Abstract | Crossref Full Text | Google Scholar

Zamani, J., Sadr, A., and Javadi, A. H. (2022). Classification of early-MCI patients from healthy controls using evolutionary optimization of graph measures of resting-state fMRI, for the Alzheimer's disease neuroimaging initiative. PLoS ONE 17:e267608. doi: 10.1371/journal.pone.0267608

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, T., Liao, Q., Zhang, D., Zhang, C., Yan, J., Ngetich, R., et al. (2021). Predicting MCI to AD conversation using integrated sMRI and rs-fMRI: machine learning and graph theory approach. Front. Aging Neurosci. 13:688926. doi: 10.3389/fnagi.2021.688926

PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, J., Wei, Y., Li, X., Zhou, W., Tao, R., Hua, Y., et al. (2025). A deep learning model for early diagnosis of alzheimer's disease combined with 3D CNN and video Swin transformer. Sci. Rep. 15:23311. doi: 10.1038/s41598-025-05568-y

PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, X., Kedia, S., Meng, R., and Gerstein, M. (2024). Deep learning analysis of fMRI data for predicting Alzheimer's Disease: a focus on convolutional neural networks and model interpretability. PLoS ONE 19:e312848. doi: 10.1371/journal.pone.0312848

PubMed Abstract | Crossref Full Text | Google Scholar

Zhu, J. D., Huang, C. W., Chang, H. I., Tsai, S. J., Huang, S. H., Hsu, S. W., et al. (2022). Functional MRI and ApoE4 genotype for predicting cognitive decline in amyloid-positive individuals. Ther. Adv. Neurol. Disord. 15. doi: 10.1177/17562864221138154

Crossref Full Text | Google Scholar

Keywords: Alzheimer's disease, fMRI, ADNI, pretrained, gender

Citation: Haq MIu, Bangyal WH, Jaffar A, Alfayez AA, Ashraf A, Alazmi M and Hussain M (2025) Gender-based Alzheimer's detection using ResNet-50 and binary dragonfly algorithm on neuroimaging. Front. Artif. Intell. 8:1717913. doi: 10.3389/frai.2025.1717913

Received: 02 October 2025; Revised: 04 November 2025;
Accepted: 10 November 2025; Published: 08 December 2025.

Edited by:

Azhar Imran, Air University, Pakistan

Reviewed by:

Zheng Yuan, China Academy of Chinese Medical Sciences, China
Hugo Vega-Huerta, National University of San Marcos, Peru

Copyright © 2025 Haq, Bangyal, Jaffar, Alfayez, Ashraf, Alazmi and Hussain. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Waqas Haider Bangyal, d2FxYXMuYmFuZ3lhbEBrdW0uZWR1LnBr; Asma Abdullah Alfayez, YXNtYWFhLmFsZmF5ZXpAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.