Machine learning based algorithms for virtual early detection and screening of neurodegenerative and neurocognitive disorders: a systematic-review

Yousefi, Milad; Akhbari, Matin; Mohamadi, Zhina; Karami, Shaghayegh; Dasoomi, Hediyeh; Atabi, Alireza; Sarkeshikian, Seyed Amirali; Abdoullahi Dehaki, Mahdi; Bayati, Hesam; Mashayekhi, Negin; Varmazyar, Shirin; Rahimian, Zahra; Asadi Anar, Mahsa; Shafiei, Daniel; Mohebbi, Alireza

doi:10.3389/fneur.2024.1413071

SYSTEMATIC REVIEW article

Front. Neurol., 09 December 2024

Sec. Artificial Intelligence in Neurology

Volume 15 - 2024 | https://doi.org/10.3389/fneur.2024.1413071

This article is part of the Research Topic: Exploring the Future of Neurology: How AI is Revolutionizing Diagnoses, Treatments, and Beyond


Milad Yousefi¹^†

Matin Akhbari²^‡^†

Zhina Mohamadi³^†

Shaghayegh Karami⁴

Hediyeh Dasoomi⁵

Alireza Atabi⁶

Seyed Amirali Sarkeshikian⁷

Mahdi Abdoullahi Dehaki⁸

Hesam Bayati⁹

Negin Mashayekhi¹⁰

Shirin Varmazyar¹¹

Zahra Rahimian¹²

Mahsa Asadi Anar¹³^*^‡

Daniel Shafiei¹⁴^*^‡

Alireza Mohebbi¹⁵

¹Institute for Cognitive and Brain Sciences, Shahid Beheshti University, Tehran, Iran
²Faculty of Medicine, Istanbul Yeni Yuzyil University, Istanbul, Türkiye
³School of Medicine, Kermanshah University of Medical Sciences, Kermanshah, Iran
⁴School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
⁵Student Research Committee, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
⁶School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
⁷School of Medicine, Shahid Beheshti University of Medical Science, Tehran, Iran
⁸Master’s of AI Engineering, Islamic Azad University Tehran Science and Research Branch, Tehran, Iran
⁹Department of Radiology, Shahid Beheshti University of Medical Sciences, Tehran, Iran
¹⁰Department of Neuroscience, Bahçeşehir University, Istanbul, Türkiye
¹¹School of Medicine, Shahroud University of Medical Sciences, Shahrud, Iran
¹²School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
¹³Student Research Committee, Shahid Beheshti University of Medical Sciences, Tehran, Iran
¹⁴School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
¹⁵Students Research Committee, Ardabil University of Medical Sciences, Ardabil, Iran

Background and aim: Neurodegenerative disorders (e.g., Alzheimer’s, Parkinson’s) lead to neuronal loss; neurocognitive disorders (e.g., delirium, dementia) show cognitive decline. Early detection is crucial for effective management. Machine learning (ML) aids in more precise disease identification, potentially transforming healthcare. This comprehensive systematic review discusses how ML can enhance early detection of these disorders and overcome the constraints of traditional diagnostics.

Methods: In this review, databases were examined up to August 15th, 2023, for ML data on neurodegenerative and neurocognitive diseases using PubMed, Scopus, Google Scholar, and Web of Science. Two investigators used the RAYYAN intelligence tool for systematic reviews to conduct the screening. Six blinded reviewers reviewed titles/abstracts. Cochrane risk of bias tool was used for quality assessment.

Results: Our search found 7,069 studies, of which 1,365 were duplicates and were removed. A total of 4,334 studies were screened, and 108 articles met the criteria for inclusion after preprocessing. Twelve ML algorithms were observed for dementia, showing promise in early detection. Eighteen ML algorithms were identified for Parkinson’s, each effective in detection and diagnosis. Studies emphasized the importance of ML algorithms for the successful detection of Alzheimer’s disease. Fourteen ML algorithms were identified for mild cognitive impairment, with LASSO logistic regression being the only one with unpromising results.

Conclusion: This review emphasizes the pressing necessity of integrating verified digital health resources into conventional medical practice. This integration may signify a new era in the early detection of neurodegenerative and neurocognitive illnesses, potentially changing the course of these conditions for millions globally. This study showcases specific and statistically significant findings to illustrate the progress in the area and the prospective influence of these advancements on the global management of neurocognitive and neurodegenerative illnesses.

Introduction

Machine learning (ML) describes circumstances in which machines can mimic human minds in learning and analysis and thus be used to solve problems (1). Recent advances in ML have produced a computational framework that integrates a multitude of patient data and provides unique risk assessments and recommendations for each patient, which has the potential to fundamentally revolutionize clinical decision-making (2).

Helping with diagnosis is one of the most significant uses of machine learning in this field. The promise of machine learning-based disease diagnosis (MLBDD), which is affordable and time-effective, has been demonstrated by numerous researchers and practitioners (2). To identify chronic kidney disease, Ma et al. (2020) suggested a heterogeneous modified artificial neural network (HMANN) model that obtained an accuracy of 87–99% (3). To improve the diagnosis of COVID-19, Apostolopoulos and Mpesiana (2020) used a CNN-based Xception model on an imbalanced dataset of 284 COVID-19 and 967 non-COVID-19 patient chest X-ray images and achieved 89.6% accuracy in diagnosis (4). Regarding the diagnosis of diabetes, Yahyaoui et al. (2019) showed that the random forest (RF) technique achieves an accuracy of 83.67% (5). These examples demonstrate how machine learning algorithms can provide more accurate and reliable disease diagnosis than other diagnostic techniques.

Neurodegenerative disorders are characterized by a gradual loss of neurons, often leading to death. The term covers a wide range of clinical diseases and progressive dementing conditions, including Alzheimer’s disease (AD), Parkinson’s disease (PD), and a number of other neurological disorders (6). Neurocognitive disorders, including delirium, mild cognitive impairment and dementia, are characterized by a decrease in cognitive functioning from a previously attained level (7). Many of these diseases are incurable and sometimes fatal, but early detection can significantly improve the ability to control them.

AD is the most prevalent form of dementia. Patients with AD have trouble remembering things, which limits their ability to learn. Due to the slow progression of AD and the difficulty of current diagnostic techniques in identifying it in its early stages, early diagnosis of the disease is crucial.

PD is a progressive and chronic neurodegenerative disease. The overall validity of PD’s clinical diagnosis, particularly in the early stages of the disease, is unsatisfactory (8).

Delirium is acute brain dysfunction that causes cognitive impairment and shifting attention. Numerous symptoms, such as significant psychomotor agitation, a low level of consciousness, or both, may be present. Traditionally, one or more physicians’ evaluations have been used to diagnose delirium clinically. However, this method of diagnosis might contain flaws because of the disease’s unstable nature (9).

As is evident, standard clinical diagnostic techniques for neurodegenerative and neurocognitive diseases have flaws, which make it difficult and occasionally impossible to diagnose the disease, especially in its early stages. On the other hand, machine learning algorithms can be highly accurate in diagnosing a variety of diseases. Recently, many studies have been conducted on the efficacy of ML algorithms as a quick and reliable alternative diagnostic method. Therefore, in this article, we aimed to systematically assess different uses of ML algorithms in the early detection of neurodegenerative and neurocognitive disorders.

Methods

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA 2020) guidelines (10). The review has been registered on the Open Science Framework (OSF) (registration DOI https://osf.io/rtsyk/).

Information sources, search strategy

A comprehensive search of several databases was conducted from inception to August 15th, 2023. The databases included PubMed/MEDLINE, Scopus, Google Scholar, and Web of Science. The search for AI algorithms used for detecting and screening neurodegenerative and neurocognitive diseases involved a controlled vocabulary supplemented with keywords in each database. Table 1 shows the specific search syntax used for each database.


Table 1. Search strategies and databases used in the study.

Data screening and eligibility criteria

We used the RAYYAN intelligent tool for systematic reviews to screen the search results (11). Titles and abstracts from 7,069 articles obtained from our search strategy were independently and blindly screened by six reviewers (Zh.M., Sh.K., H.D., A.A., H.B., M.Y.). The duplicate records were removed using the same tool. The conflicts were resolved by a seventh reviewer (Sh.K.) using RAYYAN’s compute rating feature.

Inclusion criteria

The study was conducted on this specified list of neurodegenerative and neurocognitive diseases, and the search keywords included items below:

• Huntington

• Tauopathies and the subclassifications

• Neurofibrillary tangles

• Myelitis

• Paraneoplastic polyneuropathy

• Paraneoplastic cerebellar degeneration

• Tourette syndrome

• Neurofibromatoses

• Encephalopathy

• Neuropathy

• ALS

• Alzheimer’s disease (AD)

• Mild cognitive impairment (MCI)

• Parkinson’s disease (PD)

• Frontotemporal dementia (FTD)

• Lewy body disease (LBD)

• Progressive supranuclear palsy (PSP)

• Corticobasal degeneration (CBD)

• Wernicke-Korsakoff syndrome

• Normal pressure hydrocephalus (NPH)

• Prion diseases, such as Creutzfeldt-Jakob disease

• Vascular dementia

Studies that were not available as open access, were in languages other than English, were conducted on animals, or were published as book chapters or conference papers were excluded.

Quality assessment of included studies

Two assessors (MY and HD) independently evaluated all included studies using the Cochrane risk of bias tool (12). The Cochrane risk of bias tool is a widely used, standard instrument that contains the questions needed to evaluate methodological quality and risk of bias, with a focus on six domains: sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, and selective reporting. The two assessors settled other biases and disagreements through discussion and consensus.

Results

Study selection

Our search strategies in four databases yielded 7,069 studies, of which 1,365 were eliminated as duplicates. At least two individuals screened each of the 4,334 remaining studies by title and abstract. Studies that were unrelated, whose full text was unavailable, that did not meet our inclusion criteria, or that were not in English were excluded. In the end, 108 studies were included for interpretation. Figure 1 depicts the study selection procedure.


Figure 1. Flow diagram of the study selection procedure.
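The counts reported for the selection procedure can be tallied in a few lines. The sketch below is only bookkeeping on the four numbers given in the text; the variable names are illustrative, and the text does not itemize what happened to the records that fall between deduplication and title/abstract screening, so that difference is simply reported rather than explained.

```python
# Counts taken from the text of the study selection paragraph.
identified = 7_069   # records returned by the four database searches
duplicates = 1_365   # records removed as duplicates
screened = 4_334     # records screened by title and abstract
included = 108       # studies included for interpretation

# Records left once duplicates are removed.
after_dedup = identified - duplicates

# Difference between deduplicated and screened records.
# The text does not break this figure down (assumption: other pre-screening removals).
not_itemized = after_dedup - screened

# Records excluded during or after title/abstract screening.
excluded_after_screening = screened - included

print(f"after deduplication:      {after_dedup}")
print(f"not itemized in the text: {not_itemized}")
print(f"excluded after screening: {excluded_after_screening}")
print(f"share of screened records included: {included / screened:.1%}")
```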

Study characteristics

The included studies were published between 2015 and 2023. One study was carried out in Africa, one in Australia, 17 in Europe, 29 in America, and the remainder in Asia.

Findings

In the included studies, 3,723,329 participants were examined. Thirty-four studies addressed AD, 14 PD, 13 MCI, 10 dementia, and 7 multiple sclerosis (MS); the remaining studies were carried out on other neurodegenerative and neurocognitive disorders.

Dementia

In the 10 studies conducted on dementia, 12 ML algorithms were used: XGBoost classification, binary logistic regression (LR), a logistic model tree classifier combined with information gain feature selection, 3D convolutional neural networks (3D CNN), k-nearest neighbor (kNN), support vector machine (SVM), random forest (RF), parallel recurrent convolutional neural network (PRCNN), support vector machine classifiers (SVC), support vector regression (SVR), partial least squares regression (PLSR), and deep neural network (DNN), all of which showed promising results in early detection and screening of the disease. Table 2 summarizes our included studies.


Table 2. Summary of included studies on the machine learning algorithms for early detection of NCDs and NDDs.

SVM and XGBoost are prominent models for early dementia detection, each with distinct advantages and disadvantages regarding sensitivity and specificity. SVM excels in handling unbalanced datasets, achieving high sensitivity and specificity (over 90% in some studies), which makes it practical for identifying subtle early-stage symptoms. However, it can struggle with scalability and requires significant computational resources. In contrast, XGBoost offers flexibility and speed and handles varied input features well, with sensitivities between 80 and 85%. Yet it may match SVM in specificity only if carefully tuned, which demands advanced cross-validation methods and more computational power. Both models are effective: SVM offers higher specificity, which is vital for precise diagnosis, whereas XGBoost excels in sensitivity but needs meticulous tuning to achieve optimal performance.
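Because the comparison above rests on sensitivity and specificity, a minimal sketch of how these two measures are derived from a binary classifier's confusion counts may help. The counts below are hypothetical and chosen only to illustrate an unbalanced cohort; they are not taken from any included study, and the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Outcome counts for a binary screening model (positive = disease present)."""
    tp: int  # diseased participants flagged by the model
    fn: int  # diseased participants missed by the model
    tn: int  # healthy participants correctly cleared
    fp: int  # healthy participants wrongly flagged


def sensitivity(c: ConfusionCounts) -> float:
    """Share of diseased participants the model detects: TP / (TP + FN)."""
    return c.tp / (c.tp + c.fn)


def specificity(c: ConfusionCounts) -> float:
    """Share of healthy participants the model clears: TN / (TN + FP)."""
    return c.tn / (c.tn + c.fp)


def accuracy(c: ConfusionCounts) -> float:
    """Share of all participants classified correctly."""
    return (c.tp + c.tn) / (c.tp + c.fn + c.tn + c.fp)


# Hypothetical unbalanced cohort: 20 patients and 180 healthy controls.
counts = ConfusionCounts(tp=17, fn=3, tn=171, fp=9)

print(f"sensitivity = {sensitivity(counts):.2f}")
print(f"specificity = {specificity(counts):.2f}")
print(f"accuracy    = {accuracy(counts):.2f}")
```

With unbalanced data, overall accuracy is dominated by the larger class, which is one reason the two measures are usually reported separately.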

Parkinson’s disease

The ML algorithms used for PD are as follows: Center of Pressure, Load Distribution, Random forest algorithm, Neural Network (NN), Support vector machine, and affine registration using the FSL library developed by the Oxford Centre for Functional MRI of the Brain (FMRIB), Multi-Layer Perceptron (MLP), Vertical Ground Reaction Force (VGRF), logistic regression (LR), linear discriminant analysis (LDA), kNN, classification and regression tree (CART), Naive Bayes (NB), bagged decision tree (BDT), extra tree classifier (ETC), AdaBoost classifier (AC), gradient boosting classifier (GBC), Extremely Randomized Trees (ERT), Discriminant Analysis (DIS), Deep Learning (DEEP). All mentioned algorithms showed significant early PD detection, diagnosis and screening capabilities, and most had considerable sensitivity and specificity.

According to the review of studies, various algorithmic models have been employed for the early diagnosis of Parkinson’s disease, with deep learning models demonstrating exceptional effectiveness. These models achieve nearly 100% accuracy, along with high sensitivity and specificity. Their advantages include remarkable accuracy, non-invasive techniques utilizing medical imaging data, and automated feature extraction, which minimizes the need for manual data handling. However, deep learning models necessitate substantial computational resources and large volumes of labeled data, and their “black box” nature poses challenges for interpretability.

Alzheimer’s disease

In the included studies, much attention was paid to using ML algorithms for diagnosing Alzheimer’s and tracking its progression. The following algorithms were used for detection, screening, and progression of AD, all of which were successful for their purposes: Sequential minimal optimization (SMO), Naive Bayes (NB), tree augmented Naive Bayes (TAN), K2, MATLAB PatternRecognition toolbox, TF-IDF, CountVectorizer (CV), Word2Vec, FastText, VGG16 with XGB, stacked fusion models/hybrid stacked fusion model, PRS, AAO, KNN, decision tree, random forest, ANN, 3D-CNN model, Boruta FS algorithm, Gradient, Information Gain (IG), Multi-view Separable Pyramid Network (MiSePyNet), PyWinEA using Mono-objective and Multi-objective Genetic Algorithms (NSGAII), Elastic Net (EN), Gaussian Processes (GP), kNN, LR, Linear Discriminant, Support Vector Machine, Voting classifiers, Multi-Classifier Network (MCN), Gradient Boosted Trees (GBTs), basic three-layer Neural Network architecture using the OASIS, Sparse K-means w/Resampling, a deep neural network architecture, Adaboost, graph convolutional and recurrent neural network (graph-CNN-RNN), Single hidden layer neural network, Single-layer bidirectional, LSTM, Three-layer CNN, Deep Belief Network (DBN), stacked auto-encoder (SAE), SVR, SVC, PLSR, Shallow Models, Feature Pyramid Network (FPN) and temporally structured SVM (TS-SVM).

Studies in our review suggested different algorithms for achieving the best accuracy in early detection of Alzheimer’s disease, but deep learning models, especially CNNs, and SVM were reported to be more effective than others. SVM and CNN each offer distinct advantages and limitations. SVM is advantageous due to its reliable classification accuracy and specificity, reaching about 93% accuracy and 87% sensitivity in some studies, making it efficient for handling smaller datasets with feature selection methods. However, it can struggle with high-dimensional data unless combined with dimensionality reduction techniques. In contrast, CNN models have shown high sensitivity and specificity in early detection of Alzheimer’s disease, with some studies reporting accuracies above 95%. Their advantages include high accuracy, automated feature extraction, and the non-invasive nature of using medical imaging data. However, these models require significant computational resources and large amounts of labeled data, and they are often considered “black boxes” due to their lack of interpretability.

Mild cognitive impairment (MCI)

ML algorithms used for MCI included 3D-CNN, support vector machines, Gaussian Naive Bayes (GNB), EMCI identification framework, LASSO logistic regression, Naïve Bayes, Decision Tree, RF, Gaussian, Polynomial-kernel Support Vector Machines, kNN, LR, Adaboost and TS-SVM model. Except for LASSO logistic regression, all showed remarkable performance in early detection of the disease and of its progression to AD and dementia.

For the early detection of MCI, 3D-CNNs have proven to be highly effective, with studies demonstrating over 95% accuracy, high sensitivity and specificity. The advantages of CNNs include their ability to automatically extract relevant features from complex datasets and their non-invasive application of medical imaging. However, they require substantial computational resources and large amounts of labeled data, and their decision-making processes are often not easily interpretable, rendering them “black boxes.” SVMs are another viable option, offering moderate sensitivity and good interpretability. However, they necessitate extensive parameter tuning and may overlook fine spatial features critical for accurate diagnosis. RF and Decision Trees provide high interpretability and effectively manage non-linear data, though they typically exhibit lower sensitivity than CNN models. Ensemble methods, such as combinations of LASSO logistic regression and Naïve Bayes, offer a balanced approach in terms of sensitivity and specificity. These methods can serve as cost-effective options for initial screenings, particularly when integrated with clinical biomarkers. Overall, the choice of model should consider the trade-offs between performance, interpretability, and resource requirements to optimize early detection of MCI.

Random forest, LR, support vector machine, LightGBM, kNN, Decision tree, Gaussian Naïve Bayes (gNB), Auto-sklearn, Gaussian Processes (GP) regression, Gaussian Process regression (GPR), CNN, Adaboost, NN and LDR algorithms were also successfully used for detection and progression of Huntington’s disease, Multiple Sclerosis, Amyotrophic lateral sclerosis (ALS), Corticobasal Syndrome (CS), Neurofibromatosis type 1, Amyloid and Delirium.

Discussion

In this study, we investigated 108 studies evaluating patients with neurological diseases for early detection using ML algorithms. This study showcases specific and statistically significant findings to illustrate the progress in the area and the prospective influence of these advancements on the global management of neurocognitive and neurodegenerative illnesses.

AI technologies can retrieve data from medical texts and generate diagnostic and prediction models using this data. An extensive collection of electronic medical records amassed over a considerable period can serve as the fundamental data for this form of research (13). Traditional methods for PD diagnosis may lead to misdiagnosis because they evaluate small movements that are hard to classify. Early non-motor symptoms of PD may be minor and may also be caused by other illnesses. Thus, these symptoms are typically missed, making early PD diagnosis difficult. ML algorithms have been used to distinguish PD patients from healthy controls or from patients with comparable clinical presentations to overcome these issues and improve PD diagnosis and evaluation. Multiple ML-based computer-aided diagnosis and detection (CADD) systems have shown promise in distinguishing PD patients from healthy controls (14). Using preclinical indicators of non-motor symptoms, including rapid eye movement sleep behavior disorder (RBD) and olfactory loss, CSF measures, and dopaminergic imaging to classify early PD and healthy controls, Prashanth et al. found SVM classification to be near-perfect (15). Balaji et al. proposed a multi-class learning technique that differs from earlier machine learning approaches, which often focus on binary classification to identify the existence of PD. In contrast, the proposed approach can not only classify but also quantify the stages of PD (16).

Individuals afflicted with MCI typically experience a deterioration in cognitive abilities, which significantly affects their general health. Importantly, if medical professionals fail to identify this illness promptly, it can readily progress to dementia. Using artificial intelligence, a dimensional assessment technique may seamlessly combine classical neuropsychological measurements and facilitate the diagnosis of AD (17). Similarly, Raees et al. present an initial automated deep learning system that utilizes a large MRI dataset of normal and 111 patients to predict AD. By evaluating the effectiveness of SVM and DNN models, they demonstrate that deep learning has a significant level of accuracy, ranging from 80 to 90%, in predicting AD (18). Goenka and Tiwari introduced a three-class CNN that utilizes three computational approaches for neuroimaging to classify AD. Their suggested model was empirically validated, demonstrating classification accuracies of 97.48, 96.62, and 86.49% for large, medium, and small patches, respectively (19). Artificial neural networks revealed an intricate correlation between cognitive state and auditory function that cannot be easily anticipated by considering only the cognitive differences between individuals with and without AD (20).

Zhao et al. reported that support vector machines (SVM) integrating short-term clinical and brain MRI data show potential in predicting the course of MS and identifying individuals who would benefit from more aggressive treatment approaches (21). Similarly, Law et al. found that the possibility of disability in MS was most accurately predicted using non-parametric machine learning techniques, which can also select those with the highest and lowest progression risk for inclusion in secondary progressive MS (22). Concordantly, Zhang et al. found that computational approaches (Lesion Segmentation Toolbox) provide more accurate predictions of conversion from clinically isolated syndrome (CIS) to MS than human visual analysis (23). Goyal et al. utilized a machine learning technique to predict MS by analyzing serum cytokines. Their findings indicate that the RF model achieved an accuracy of 91%, suggesting its potential for predicting MS using serum cytokine levels. Moreover, the RF model demonstrated a 70% accuracy in classifying MS patients into remitting and non-remitting categories (24). These findings were similar to those of other studies (25–27).

This comprehensive systematic review of 108 articles includes papers that illustrate significant patterns in using artificial intelligence for early detection of neurological illnesses. The authors confine their focus to presenting factual information on the use of AI techniques in various tasks without evaluating the quality of these investigations. Further studies are required to evaluate additional aspects, such as new neuroimaging measurements and blood and genetic biomarkers. The utilization of predictive algorithms, as detailed in this study, may enhance the development of collaborative visualization and decision-making tools for physicians and patients, as previously explored in other research. Future research could focus on decreasing the number of attributes without compromising accuracy. The proposed strategies can also be extended to tackle other chronic disorders. When creating AI models for medical issues, it is advisable to use simple computational methods with the available datasets, which makes the predictive tool easier to implement in healthcare settings and helps address economic constraints.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

Author contributions

MY: Writing – original draft, Writing – review & editing. MA: Writing – original draft, Writing – review & editing. ZM: Writing – original draft, Writing – review & editing. SK: Writing – original draft, Writing – review & editing. HD: Writing – original draft, Writing – review & editing. AA: Writing – original draft, Writing – review & editing. SS: Writing – original draft, Writing – review & editing. MAD: Writing – original draft, Writing – review & editing. HB: Writing – original draft, Writing – review & editing. NM: Writing – original draft, Writing – review & editing. SV: Writing – original draft, Writing – review & editing. ZR: Writing – original draft, Writing – review & editing. MAA: Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing. DS: Writing – original draft, Writing – review & editing, Validation, Conceptualization. AM: Writing – original draft, Validation, Writing – review & editing, Conceptualization.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Huang, G, Huang, G-B, Song, S, and You, K. Trends in extreme learning machines: a review. Neural Netw. (2015) 61:32–48. doi: 10.1016/j.neunet.2014.10.001