- 1Qixia District Hospital, Nanjing, Jiangsu, China
- 2Nanjing Lishui People's Hospital, Zhongda Hospital, Southeast University, Nanjing, China
Objective: To explore the risk factors of cognitive dysfunction in patients with leukoaraiosis (LA) and to construct a predictive model using machine learning.
Methods: A total of 273 patients with LA were included. Univariate analysis and multivariate logistic regression were performed to identify independent risk factors for cognitive dysfunction. The patients were divided into a training set (191 cases) and a validation set (82 cases) in a 7:3 ratio. Seven machine learning algorithms (Decision Tree, GBDT, Logistic Regression, Random Forest, SVM, KNN, XGBoost) were employed to construct predictive models. Evaluation metrics included accuracy, recall, F1 score, MCC, AUROC, and SHAP was used for model interpretation.
Results: Univariate analysis revealed that age, LDL-C, uric acid, CRP, Fazekas score, and IASA score were associated with cognitive dysfunction (p < 0.05). Multivariate logistic regression analysis showed that age, LDL-C, uric acid, Fazekas score, and IASA score were independent risk factors (p < 0.05). Among the machine learning algorithms, the Random Forest model performed the best, with an AUROC of 0.8373 for the validation set. SHAP analysis indicated that age, LDL-C, IASA score, and Fazekas score were the most important predictors.
Conclusion: The Random Forest model can be used to predict the risk of cognitive dysfunction in patients with LA, providing a reference for early warning.
Background
Leukoaraiosis (LA), also known as White Matter Hyperintensities (WMH), is a common form of cerebral small vessel disease that is widely present in the elderly population (Tang et al., 2021; Hentabli et al., 2022). The clinical symptoms of LA are subtle and non-specific in the early stages, and as the condition progresses, a series of clinical issues gradually emerge, including cognitive impairment, dementia, gait abnormalities, and emotional confusion (Kongbunkiat et al., 2017). Multiple studies have shown that LA has become an important risk factor for frailty, decline in quality of life, and even death in the elderly (Chae et al., 2020; Xu et al., 2023). Cognitive impairment caused by LA is often overlooked in its early stages, and by the time it is diagnosed, most patients already have moderate to severe cognitive impairment or even dementia (Schmidt et al., 2007). This limits subsequent treatment options and reduces their effectiveness, placing a heavy mental and economic burden on the families of patients and even exacerbating social conflicts (Chojdak-Łukasiewicz et al., 2021). The mechanisms underlying the development of LA are not fully understood, and no single theoretical explanation can comprehensively account for its formation (Sun et al., 2022). Under the premise of managing traditional risk factors, regular monitoring of biological indicators and imaging features, along with adjustments in lifestyle and medical treatments to control abnormal indicators (Feng and Yan, 2022), can help delay the progression of LA and reduce the rate of conversion from mild cognitive impairment to dementia in LA patients (Rastogi et al., 2022).
In recent years, the complexity of clinical data has increased, and traditional statistical methods face certain limitations in exploring and mining large sample, multi-variable data (Huo et al., 2021). The mechanisms underlying cognitive dysfunction caused by LA are complex, with diverse influencing factors, necessitating new approaches. Machine learning (ML), a powerful data analysis tool in artificial intelligence (AI), has rapidly developed in the medical field (Karvelas and Elahi, 2023). Initially applied to commercial data analysis, machine learning's ability to process complex and dynamic data has unlocked great potential for understanding disease patterns, diagnosis, and treatment (Verhaaren et al., 2015; Lee et al., 2021). The interactions between clinical variables are intricate, and using machine learning to build predictive models can better understand and predict clinical events. Machine learning-based models enhance the ability to handle common variables faced by traditional statistical methods and can fully account for non-linear relationships, thus improving the applicability and accuracy of clinical predictions. Supervised learning algorithms and unsupervised learning are the main components of machine learning (Tubi et al., 2020). Common supervised learning algorithms include Random Forest, Support Vector Machine (SVM), K-nearest neighbors, neural networks, and Naive Bayes (Dickie et al., 2020; Desmarais et al., 2021). Common unsupervised learning algorithms include clustering algorithms, dimensionality reduction methods, and association rule learning (Hase et al., 2018).
Early-stage LA patients do not exhibit obvious clinical symptoms, and early diagnosis of cognitive impairment in LA patients is crucial for assisting clinicians in diagnosis and improving patient prevention and treatment plans (Filley and Fields, 2016). Unfortunately, most previous studies have focused on a single type of risk factor, which significantly limits predictive effectiveness (Zeng et al., 2020). Although some studies have performed multivariable analyses, the results often appear too complex, or their accuracy is difficult to compare and evaluate intuitively (Morley, 2015; Bender and Raz, 2015). Establishing a simple, sensitive, and efficient predictive model is of great clinical value for the early identification and prevention of high-risk groups of cognitive impairment in LA patients (van Gijn, 1998). Therefore, constructing and selecting an efficient predictive model based on machine learning technology is of significant importance for accurately predicting cognitive impairment in LA patients, and it is also a primary task in current primary prevention efforts (Carnevale et al., 2018; Gómez-Soria et al., 2023). This study uses machine learning algorithms to construct a predictive model aimed at finding a high-precision, multidimensional, high-sensitivity, and specificity machine learning model for early prediction of cognitive dysfunction in LA patients, enabling classification and identification of high-risk populations (Yuan et al., 2021; Sakurai et al., 2015; Fu et al., 2020). This can reduce the risk of LA patients further developing into dementia, decrease medical costs, improve patient prognosis, and ultimately enhance patients' quality of life (Zhang et al., 2022; Choi et al., 2020).
Methods
Study subjects
This study used data from the Cerebral Small Vessel Disease (CSVD) research project (NQH-CSVDrp) at Qixia District Hospital, Nanjing, China, which began in June 2020. The project aims to identify biomarkers for early cognitive impairment detection in CSVD, predict cognitive decline progression, and offer personalized interventions to prevent vascular dementia. By April 2023, 390 CSVD patients were included, all providing informed consent, and the study was approved by the hospital's ethics committee. Demographic data and medical history were recorded, including age, gender, marital status, education level, BMI, anxiety, socioeconomic status, and income level. Personal and past medical history: smoking history, alcohol consumption history, hypertension, type 2 diabetes, coronary heart disease, etc.
Inclusion criteria
(1) Imaging characteristics consistent with LA: high signal lesions on T2 FLAIR, low signal on T1, and rare infarctions in the brainstem, thalamus, and basal ganglia. (2) Complete medical records and cooperation with the study. (3) Age between 50 and 85 years.
Exclusion criteria
(1) History of moderate to severe stroke (NIHSS ≥5). (2) White matter degeneration due to hereditary or inflammatory diseases. (3) White matter changes from toxic, immunological, or other causes. (4) Severe heart, lung, or liver diseases. (5) Cognitive impairment due to metabolic or infectious diseases. (6) Death during hospitalization. (7) Missing data >20%. (8) Co-existing neurodegenerative diseases, such as Parkinson's or Alzheimer's disease.
Laboratory tests
Serum blood markers were recorded within 24 hours of admission after an overnight fast, including triglycerides, total cholesterol, low-density lipoprotein, uric acid, plasma lipoprotein-associated phospholipase A2, fasting blood glucose, urinary albumin/creatinine ratio, thyroid hormone, serum creatinine, C-reactive protein, total bilirubin, urea, and homocysteine.
Neurological scale assessment
Cognitive function was assessed by neurologists trained in standardized protocols. Cognitive assessment: The Mini-Mental State Examination (MMSE) was used as a basis to evaluate cognitive function in enrolled participants. Detailed scores for each item were recorded, with a total score of 30 points. Participants with a score ≤ 24 were classified as having cognitive impairment. For participants with zero years of education (illiterate), 6 points were added to the total score to correct for educational bias. For participants with ≤ 6 years of education, 4 points were added to correct for educational bias. Based on the MMSE score, participants were divided into the cognitive impairment (CI) group and non-cognitive impairment (NCI) group.
MMSE, a widely used tool for assessing cognitive function, covers five domains with a total score of 30. Scores reflect mental status: 26–30 indicates normal cognition with no significant impairment; 21–25 suggests mild dementia with possible slight cognitive difficulties in daily life; 10–20 indicates moderate dementia with noticeable cognitive impairment in multiple areas; and 0–9 signifies severe dementia, with significant cognitive impairment across multiple domains, requiring substantial care and support (Folstein et al., 1975).
NIHSS is a standardized tool for assessing neurological deficits in stroke patients. It includes 15 evaluation items, with scores reflecting neurological function—lower scores indicate better status (Brott et al., 1989).
Imaging assessment
LA severity grading and location (Fazekas score)
MRI high-signal manifestations of periventricular and deep white matter on T2 images were assessed using the Fazekas scale. The scores from both sections were summed to determine the severity level. Periventricular: No abnormalities (0 points); Spot-like changes (1 point); Fused lesions (2 points); Large fused lesions (3 points). Deep white matter: No abnormalities (0 points); Pencil-like or cap-like changes (1 point); Halo-like changes (2 points); Irregular changes extending into the deep white matter (3 points). The total sum of both scores determines the severity grading of LA (Fazekas et al., 1987).
Intracranial artery stenosis assessment (IASA)
MRA scanning was performed using the Neusoft NSM-S15P 1.5T, TOF_3D_MRA sequence, with a 16-channel brain coil. The scanning parameters were set to TR 22ms, TE 6.9ms, flip angle < 20.1°, and matrix size 256 × 256. A total of 161 cross-sectional images were collected, with a scanning average time of 3.34 minutes. The collected images were processed for three-dimensional vascular reconstruction. The major intracranial and extracranial vessels, including the bilateral internal carotid arteries, anterior cerebral artery, middle cerebral artery, posterior cerebral artery, basilar artery, and vertebral arteries, were primarily assessed. The degree of vascular stenosis was calculated as: Where DstenosisD_Dstenosis is the diameter of the vessel at its most narrowed point, and DnormalD is the diameter of the normal proximal vessel. If multiple stenoses are present, the most severe stenosis is used for calculation. The severity of vascular stenosis was classified into 3 levels: (1) Any vessel with stenosis < 20% is assigned 2 points. (2)Two or more vessels with stenosis > 40% are assigned 4 points. (3) Stenosis between these two levels, indicating mild stenosis, is assigned 3 points. (4) No stenosis is assigned 1 point.
Machine learning model construction
Missing data were assessed before analysis. Variables with less than 10% missing values were imputed using the mean or mode, while those with more than 20% missing data were excluded. Multiple imputation was applied for variables with 10–20% missingness.
Logistic regression model
Logistic regression is commonly used in binary and multiclass classification tasks, especially in medical statistics. The Sigmoid function is used to map a linear combination of input features to probabilities. The goal is to model the probability of the dependent variable, not the variable itself.
Random forest model
Random Forest is an ensemble learning method that combines multiple weak models into a strong one. Key steps include: Bootstrap aggregation: Random sampling with replacement to create independent training datasets. Decision trees: Trees are built on different subsets, selecting features randomly for splits. Overfitting reduction: Different training datasets reduce overfitting risk.
SVM model
SVM uses nonlinear mapping to project data into higher-dimensional feature spaces for better classification. It finds the optimal decision boundary that maximizes the margin between classes. SVM is effective for small sample tasks and nonlinear problems. Kernel functions avoid the curse of dimensionality, enabling efficient learning in high-dimensional spaces.
BP neural network model
The BP neural network is a multi-layer model that simulates the human brain's processing. It adjusts weights and thresholds through gradient descent to minimize errors. It consists of an input layer, hidden layers, and an output layer. BP networks are highly adaptable and effective for large-scale, complex data analysis.
Statistical analysis methods
Statistical analysis was performed using SPSS 22.0. Continuous variables were described as mean ± standard deviation or median (interquartile range), and group comparisons were made using t-tests or Mann-Whitney U tests. Categorical variables were described as counts (percentages), and group comparisons were performed using chi-square tests. Variables with P < 0.05 in univariate analysis were included in the multivariate logistic regression model to calculate the odds ratio (OR) and 95% confidence interval (CI). A p value < 0.05 was considered statistically significant. Machine learning was implemented using Python 3.7 and R 4.0.1, including decision trees, GBDT, logistic regression, random forests, SVM, KNN, and XGBoost. Evaluation metrics included accuracy, recall, F1 score, MCC, and AUROC. SHAP was used for model interpretation.
Results
Baseline characteristics of the training cohort
A total of 273 patients with LA were included in this study, which were divided into a training set (191 cases) and a validation set (82 cases) at a 7:3 ratio. Table 1 shows the baseline characteristics of the training set population. Among the 191 patients, 88 were male (46.07%) and 103 were female (53.93%). The results indicated that there were statistically significant differences in age, low-density lipoprotein cholesterol (LDL-C), uric acid (UA), C-reactive protein (CRP), Fazekas score, and IASA score (P < 0.05). These differences suggest that clinical and biochemical indicators in the patient population exhibit certain characteristic variations, which may influence the occurrence and development of LA.
Multivariate analysis of cognitive dysfunction in patients with white matter leukoaraiosis
The variables with p < 0.05 in the univariate analysis were included in the binary logistic regression model, and the forward conditional method was used to select the variables. The results showed that age, LDL-C, UA, Fazekas score, and IASA score are independent risk factors for LA (p < 0.05). See Table 2 for details. To visually display the relationship between each risk factor and the risk of white matter leukoaraiosis, a nomogram based on the logistic regression coefficients was created (Figure 1).
Table 2. The results of the Logistic regression analysis for cognitive dysfunction in patients with LA.
Figure 1. A nomogram of risk factors for cognitive dysfunction in patients with LA. When using the nomogram to predict the risk of cognitive impairment, the specific value of each variable was first found on the corresponding coordinate axis, and then a vertical line was drawn up to the Points axis to determine the score of the variable. The predicted Risk of leukoaraiosis can be obtained by summing the scores of all variables to obtain the total score, and finally finding the corresponding total score on the risk axis at the bottom of the figure.
Building a predictive model based on machine learning
To construct a more accurate predictive model for cognitive dysfunction in LA, this study attempted to use machine learning algorithms. We employed seven common classification algorithms, including Decision Tree, Gradient Boosting Decision Tree (GBDT), Logistic Regression, Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Extreme Gradient Boosting (XGB). Tables 3, 4 show the performance metrics of each model on the training and validation sets, respectively. On the training set, GBDT and XGB achieved the best performance, with accuracy, recall, and other metrics reaching 100%, followed by Random Forest (88.48%). However, on the validation set, the Random Forest model demonstrated the most robust performance, with an accuracy of 78.05% and an AUROC of 0.8373, significantly outperforming the other models. Therefore, we selected Random Forest as the final predictive model. The DCA curve, ROC curve, and decision curve for the training and validation sets in Figure 2 visually demonstrate the superiority of the Random Forest model.
Figure 2. Performance evaluation of machine learning models. (a) ROC curves for the training set; (b) Calibration plot for the training set; (c) DCA curve for the training set; (d) ROC curves for the validation set; (e) Calibration plot for the validation set; (f) DCA curve for the validation set.
Random forest SHAP interpretation
To explain the predictive mechanism of the random forest model, we performed SHAP analysis. The SHAP swarm plot of Figure 3a reveals the direction (positive and negative) and magnitude of the influence of each feature on the model output. Figure 3b, sorted by mean absolute SHAP value magnitude, shows that age, LDC-C, IASA score, and Fazekas score are the most important predictors. Figure 3c is the SHAP chart of a typical patient, which intuitively shows the direction and contribution of each feature, among which age and Fazekas score have the greatest negative effect.
Figure 3. SHAP analysis of the Random Forest model. (a) SHAP bees plot showing the distribution and impact of each feature on model predictions; (b) SHAP importance plot showing mean absolute SHAP values for each feature; (c) SHAP force plot for a representative patient demonstrating individual feature contributions.
Discussion
Risk factors for cognitive dysfunction in LA
Previous studies have shown that age is one of the significant independent risk factors for LA (Heo et al., 2019; Fan, 2022). As age increases, there is a positive correlation with the quantity and density of WMH, as well as a parallel relationship with the severity of cognitive impairment (Chojdak-Łukasiewicz et al., 2021; Aspray and Hill, 2019). The potential mechanism by which age contributes to LA is primarily due to the natural aging of brain vessels, leading to a reduction in cerebral blood flow, ischemic and hypoxic changes in brain tissue, and the loss of the integrity of the white matter (Chen et al., 2019). In addition, the negative impact of cellular metabolism and oxidative stress associated with aging on the white matter should also be considered. These multiple factors lead to structural damage to the white matter, ultimately resulting in cognitive dysfunction in LA patients.
Intracranial arterial stenosis is an independent risk factor for LA-related cognitive impairment, and the degree of stenosis in intracranial arteries is proportional to the severity of cognitive dysfunction. Both domestic and international studies (Wang et al., 2018)suggest that patients with intracranial arterial stenosis have more severe LA, as the stenosis reduces the regulation ability of cerebral vessels, affecting hemodynamics and causing prolonged low blood perfusion in the brain. Under low perfusion conditions, the supply of oxygen and nutrients to the white matter is reduced, leading to damage of white matter fibers and a decline in the efficiency of brain network system connectivity, resulting in cognitive impairment. Intracranial arterial stenosis can also lead to the formation of microemboli, significantly increasing the overall incidence of stroke events. Acute stroke events trigger systemic changes in the brain, including the loss of white matter signal connections, neuroinflammatory reactions, and blood-brain barrier disruption, leading to damage to the brain's network system. Furthermore, during the chronic phase of a stroke, brain tissue may undergo secondary degeneration of white matter tracts and cortical atrophy, causing disruptions or even interruptions in the information connectivity of the frontal and temporal lobes, ultimately leading to cognitive damage. Liu et al. found that microvascular occlusion due to intracranial artery stenosis occurs in the white matter region, where this damage is more sensitive. Additionally, in a long-term observational study, Vincent Mok et al. found that patients with intracranial artery stenosis had a faster progression of WMH and a more significant decline in cognitive function compared to a control group. These studies collectively reveal that intracranial artery stenosis is a risk factor for cognitive impairment in LA, providing important theoretical support for clinicians in diagnosis and treatment.
The Fazekas scoring system was first developed in the 1990s by Fazekas et al., who used MRI imaging to score the periventricular and deep white matter lesions (Cedres et al., 2020). The traditional Van Swieten scale is limited to using CT imaging to assess the severity of LA, but MRI is far more sensitive than CT. Therefore, using the Fazekas score for assessing the severity of LA is considered more rigorous. The Fazekas score grades the appearance of WMH on MRI and separately scores periventricular and deep lesions before calculating the total score. This study found that the Fazekas score is positively correlated with age and shows a parallel relationship with the severity of cognitive dysfunction. A cross-sectional study by E. Labos et al. on MIC patients over the age of 65 who underwent head MRI found that the progression of structural damage to the white matter affected the correlation with overall cognitive and executive function. Moreover, Sandeepa Sur et al. also identified a significant association between the increase in the Fazekas score and the onset of mild Alzheimer's disease.
Therefore, the Fazekas score, as an effective tool, has significant value not only in clinical diagnosis but also in predicting the development of WMH and its associated cognitive impairment. As the score increases, the severity of WMH also increases, leading to further cognitive decline. The application of the Fazekas score deepens our understanding of age-related brain lesions and their impact on cognition, providing a foundation for targeted research and interventions.
Construction of machine learning models
This study constructs four predictive models: BP neural network, Random Forest, Logistic Regression, and SVM models. Since LA is a disease caused by the combined effect of multiple factors, its impact on cognitive function involves a comprehensive range of biological markers and clinical indicators. The ability of machine learning models to handle high-dimensional data and simulate complex biological processes is significant, making them suitable for accurately predicting cognitive dysfunction in LA patients (Arfat et al., 2022). The ROC curve and calibration curve are used to evaluate and identify the ability to predict cognitive dysfunction, especially the area under the ROC curve (AUC), which is an important indicator of the model's diagnostic capability, reflecting the overall effect in terms of false positive and true positive rates (Wang et al., 2018). The calibration curve reflects the accuracy of the model's predicted probability, with a curve approaching the 45-degree diagonal indicating that the predicted probability is consistent with the actual occurrence probability. Based on the accuracy of different models in predicting the risk of cognitive dysfunction in LA patients, clinical decisions can adjust clinical decision thresholds to optimize the diagnostic and treatment pathways for patients (Sadatsafavi et al., 2022). The ROC curve provides a pure accuracy indicator by showing the limits of the test's ability to distinguish between different health states across the entire range of working conditions (Sadatsafavi et al., 2022). Moreover, the ROC curve plays a central or unified role in the evaluation and use of diagnostic tools, performing quantitative ROC analysis and testing comparisons, modifying individual subject disease probabilities using likelihood ratios, selecting decision thresholds, using logistic regression analysis, performing discriminant function analysis, or incorporating the tool into clinical strategies through decision analysis (Zweig and Campbell, 1993). In a cross-sectional study, the least absolute shrinkage and selection operator (LASSO) regression model and multivariate logistic regression analysis were used for variable selection to establish predictive models, and the predictive nomogram was developed and validated. The model was evaluated through the area under the ROC curve and a good calibration curve, and similar results were observed in 10-fold cross-validation, demonstrating the model's excellent discrimination ability (Huang et al., 2022).
Comparison of predictive models
We used seven common classification algorithms, including Decision Tree, Gradient Boosting Decision Tree (GBDT), Logistic Regression, Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Extreme Gradient Boosting (XGB). Random Forest, as the final predictive model, demonstrated the strongest stability in both the training and validation sets, significantly outperforming the other models. Ma et al. (2020) utilized MRI voxel-based morphometric measurements, deformation-based morphometric measurements, and surface-based morphometric measurements to extract morphological indicators such as gray matter volume, Jacobian determinant values, cortical thickness, gyrification index, sulcal depth, and fractal dimension (Duncan et al., 2020). They constructed a Random Forest predictive model and validated its performance through 10-fold cross-validation. Additionally, through the importance feature ranking in Random Forest, it was found that gyrification index and cortical thickness outperformed other features in identifying MCI, suggesting that they are key morphological biomarkers for early MCI diagnosis. Matthew Velazquez et al. used a balanced Random Forest model with clinical data to provide personalized predictions for the conversion from MCI to Alzheimer's Disease (AD) (Velazquez and Lee, 2021). They selected nine clinical features, including demographics, brain volume, and a mix of cognitive testing variables. The Random Forest model proved effective in predicting the conversion of EMCI patients to AD based on these clinical features (93.6% accuracy), serving as a tool for predicting the transition from the prodromal phase to AD or identifying ideal candidates for clinical trials. Furthermore, by evaluating the importance of each clinical feature through Random Forest, they excluded and focused on features that enhance interpretability.
The Random Forest model can be applied in clinical practice to assist physicians in early risk stratification and individualized treatment planning for patients with leukoaraiosis-related cognitive impairment. In addition, assessing its cost-effectiveness, implementation barriers, and integration into electronic health record systems will help determine the feasibility of incorporating this model into routine clinical workflows.
Limitations
Limitations of sample size and data sources
The sample size and data sources used in this study may be limited, and the research primarily relies on specific populations (such as patients from particular regions or age groups). This could affect the generalizability of the findings to different populations. Therefore, future studies should include larger and more diverse samples to enhance the universality and accuracy of the model. Interpretability and Operability of the Model: Although the machine learning models constructed in this study demonstrate high accuracy and stability in predicting cognitive dysfunction, their “black-box” nature makes them less interpretable. Clinicians may find it difficult to fully understand and operate these models, especially when dealing with complex biomarkers and clinical indicators. Future research should focus on improving the interpretability of the models, providing more actionable guidance that can be integrated with clinical practices.
Conclusion
In summary, a horizontal comparison of the above predictive models provides a deeper understanding of the application of machine learning in predicting cognitive dysfunction in LA patients. There is potential for further exploration of the clinical applications of these models and a comprehensive analysis of the underlying mechanisms of cognitive dysfunction diagnosis in LA patients. This study, through exploring the risk factors of cognitive dysfunction in LA patients and establishing a predictive model, aims to early identify high-risk groups for cognitive dysfunction in LA patients, guiding clinical interventions for early risk factors and reducing the disability rate caused by cognitive dysfunction.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Nanjing Qixia Distict Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
WX: Writing – review & editing, Writing – original draft. CH: Supervision, Investigation, Conceptualization, Software, Methodology, Funding acquisition, Resources, Project administration, Formal analysis, Validation, Writing – original draft, Visualization, Data curation. JL: Writing – review & editing. YL: Writing – review & editing, Writing – original draft. LY: Writing – original draft. GZ: Writing – review & editing, Conceptualization.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by the Nanjing City Health Science and Technology Development Special Fund Project (Grant Numbers: YKK24233, YKK23218).
Acknowledgments
The authors would like to thank all participants in the Cerebral Small Vessel Disease research project (NQH-CSVDrp) for their cooperation and contribution to this study. We acknowledge the dedicated staff at Qixia District Hospital, Nanjing, for their assistance with patient recruitment, data collection, and neurological assessments. Special thanks to the radiology department for their expertise in MRI and MRA imaging assessments and the laboratory staff for their meticulous work in biomarker analysis. We also appreciate the support from the Department of Geriatrics at Qixia District Hospital for facilitating this research.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Abbreviations
LA, Leukoaraiosis; WMH, White Matter Hyperintensities; CSVD, Cerebral Small Vessel Disease; ML, Machine Learning; AI, Artificial Intelligence; MMSE, Mini-Mental State Examination; CI, Cognitive Impairment; NCI, Non-Cognitive Impairment; MRI, Magnetic Resonance Imaging; MRA, Magnetic Resonance Angiography; IASA, Intracranial Artery Stenosis Assessment; LDL-C, Low-Density Lipoprotein Cholesterol; UA, Uric Acid; CRP, C-Reactive Protein; RF, Random Forest; GBDT, Gradient Boosting Decision Tree; SVM, Support Vector Machine; KNN, K-Nearest Neighbors; XGB, Extreme Gradient Boosting; AUROC, Area Under the Receiver Operating Characteristic Curve; SHAP, SHapley Additive exPlanations; MCC, Matthews Correlation Coefficient; ROC, Receiver Operating Characteristic; DCA, Decision Curve Analysis; OR, Odds Ratio; CI, Confidence Interval; BMI, Body Mass Index; T2DM, Type 2 Diabetes Mellitus; TSH, Thyroid Stimulating Hormone; ACR, Albumin-to-Creatinine Ratio; TBIL, Total Bilirubin; NIHSS, National Institutes of Health Stroke Scale; TOF, Time of Flight; TR, Repetition Time; TE, Echo Time.
References
Arfat, Y., Mittone, G., Esposito, R., Cantalupo, B., DE Ferrari, G. M., and Aldinucci, M. (2022). Machine learning for cardiology. Minerva Cardiol Angiol. 70, 75–91. doi: 10.23736/S2724-5683.21.05709-4
Aspray, T. J., and Hill, T. R. (2019). Osteoporosis and the ageing skeleton. Subcell Biochem. 91, 453–76. doi: 10.1007/978-981-13-3681-2_16
Bender, A. R., and Raz, N. (2015). Normal-appearing cerebral white matter in healthy adults: mean change over 2 years and individual differences in change. Neurobiol Aging. 36, 1834–48. doi: 10.1016/j.neurobiolaging.2015.02.001
Brott, T., Adams Jr, H. P., Olinger, C. P., Marler, J. R., Barsan, W. G., Biller, J., et al. (1989). Measurements of acute cerebral infarction: a clinical examination scale. Stroke. 20, 864–70. doi: 10.1161/01.STR.20.7.864
Carnevale, L., D'Angelosante, V., Landolfi, A., Grillea, G., Selvetella, G., Storto, M., et al. (2018). Brain MRI fiber-tracking reveals white matter alterations in hypertensive patients without damage at conventional neuroimaging. Cardiovasc Res. 114, 1536–46. doi: 10.1093/cvr/cvy104
Cedres, N., Ferreira, D., Machado, A., Shams, S., Sacuiu, S., Waern, M., et al. (2020). Predicting Fazekas scores from automatic segmentations of white matter signal abnormalities. Aging. 12, 894–901. doi: 10.18632/aging.102662
Chae, S. H., Kim, Y., Lee, K. S., and Park, H. S. (2020). Development and clinical evaluation of a web-based upper limb home rehabilitation system using a smartwatch and machine learning model for chronic stroke survivors: prospective comparative study. JMIR Mhealth Uhealth. 8:e17216. doi: 10.2196/17216
Chen, X., Huang, L., Ye, Q., Yang, D., Qin, R., Luo, C., et al. (2019). Disrupted functional and structural connectivity within default mode network contribute to WMH-related cognitive impairment. Neuroimage Clin. 24, 102088. doi: 10.1016/j.nicl.2019.102088
Choi, R. Y., Coyner, A. S., Kalpathy-Cramer, J., Chiang, M. F., and Campbell, J. P. (2020). Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol. 9, 14.
Chojdak-Łukasiewicz, J., Dziadkowiak, E., Zimny, A., and Paradowski, B. (2021). Cerebral small vessel disease: a review. Adv Clin Exp Med. 30, 349–56. doi: 10.17219/acem/131216
Desmarais, P., Gao, A. F., Lanctôt, K., Rogaeva, E., Ramirez, J., Herrmann, N., et al. (2021). White matter hyperintensities in autopsy-confirmed frontotemporal lobar degeneration and Alzheimer's disease. Alzheimers Res Ther. 13, 129. doi: 10.1186/s13195-021-00869-6
Dickie, D. A., Gardner, K., Wagener, A., Wyss, A., Arba, F., Wardlaw, J. M., et al. (2020). Cortical thickness, white matter hyperintensities, and cognition after stroke. Int J Stroke. 15, 46–54. doi: 10.1177/1747493019851291
Duncan, M. J., Dobell, A., Noon, M., Clark, C., Roscoe, C., Faghy, M., et al. (2020). Calibration and cross-validation of accelerometery for estimating movement skills in children aged 8-12 years. Sensors. 20:2776. doi: 10.3390/s20102776
Fan, P. (2022). Random forest algorithm based on speech for early identification of Parkinson's disease. Comput Intell Neurosci. 2022, 3287068. doi: 10.1155/2022/3287068
Fazekas, F., Chawluk, J. B., Alavi, A., and Hurtig, H. I. (1987). MR signal abnormalities at 1.5 T in Alzheimer's dementia and normal aging. Am. J. Roentgenol. 149, 351–356. doi: 10.2214/ajr.149.2.351
Feng, Y., and Yan, X. (2022). Support vector machine based lane-changing behavior recognition and lateral trajectory prediction. Comput Intell Neurosci. 2022, 3632333. doi: 10.1155/2022/3632333
Filley, C. M., and Fields, R. D. (2016). White matter and cognition: making the connection. J Neurophysiol. 116, 2093–104. doi: 10.1152/jn.00221.2016
Folstein, M. F., Folstein, S. E., and McHugh, P. R. (1975). “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 12, 189–98. doi: 10.1016/0022-3956(75)90026-6
Fu, S. S., Li, H. R., Shi, B. Y., Zhang, H. T., and Zhang, S. L. (2020). Characteristics of single subcortical small infarction in the middle cerebral artery territory. Zhonghua Yi Xue Za Zhi. 100, 202–6.
Gómez-Soria, I., Iguacel, I., Aguilar-Latorre, A., Peralta-Marrupe, P., Latorre, E., Zaldívar, J. N. C., et al. (2023). Cognitive stimulation and cognitive results in older adults: a systematic review and meta-analysis. Arch Gerontol Geriatr. 104, 104807. doi: 10.1016/j.archger.2022.104807
Hase, Y., Horsburgh, K., Ihara, M., and Kalaria, R. N. (2018). White matter degeneration in vascular and other ageing-related dementias. J Neurochem. 144, 617–33. doi: 10.1111/jnc.14271
Hentabli, H., Bengherbia, B., Saeed, F., Salim, N., Nafea, I., Toubal, A., et al. (2022). Convolutional neural network model based on 2D fingerprint for bioactivity prediction. Int J Mol Sci. 23:13230 doi: 10.3390/ijms232113230
Heo, J., Yoon, J. G., Park, H., Kim, Y. D., Nam, H. S., Heo, J. H., et al. (2019). Machine learning-based model for prediction of outcomes in acute stroke. Stroke. 50, 1263–5. doi: 10.1161/STROKEAHA.118.024293
Huang, M., Gao, X., Zhao, R., Dong, C., Gu, Z., Gao, J., et al. (2022). Development and validation of a nomogram for predicting mild cognitive impairment in middle-aged and elderly people. Asian J Psychiatr. 75, 103224. doi: 10.1016/j.ajp.2022.103224
Huo, L., Chen, P., Wang, Z., Li, X., Zhou, J., Wang, C., et al. (2021). Impact of leukoaraiosis severity on the association of outcomes of mechanical thrombectomy for acute ischemic stroke: a systematic review and a meta-analysis. J Neurol. 268, 4108–16. doi: 10.1007/s00415-020-10167-0
Karvelas, N., and Elahi, F. M. (2023). White matter hyperintensities: complex predictor of complex outcomes. J Am Heart Assoc. 12, e030351. doi: 10.1161/JAHA.123.030351
Kongbunkiat, K., Wilson, D., Kasemsap, N., et al. (2017). Leukoaraiosis, intracerebral hemorrhage, and functional outcome after acute stroke thrombolysis. Neurology. 88, 638–45. doi: 10.1212/WNL.0000000000003605
Lee, Y. W., Choi, J. W., and Shin, E. H. (2021). Machine learning model for predicting malaria using clinical information. Comput Biol Med. 129, 104151. doi: 10.1016/j.compbiomed.2020.104151
Ma, Z., Jing, B., Li, Y., Yan, H., Li, Z., Ma, X., et al. (2020). Identifying mild cognitive impairment with random forest by integrating multiple MRI morphological metrics. J Alzheimers Dis. 73, 991–1002. doi: 10.3233/JAD-190715
Morley, J. E. (2015). White matter lesions (leukoaraiosis): a major cause of falls. J Am Med Dir Assoc. 16, 441–3. doi: 10.1016/j.jamda.2015.03.023
Rastogi, A., Weissert, R., and Bhaskar, S. M. M. (2022). Leukoaraiosis severity and post-reperfusion outcomes in acute ischaemic stroke: a meta-analysis. Acta Neurol Scand. 145, 171–84. doi: 10.1111/ane.13519
Sadatsafavi, M., Saha-Chaudhuri, P., and Petkau, J. (2022). Model-based ROC curve: examining the effect of case mix and model calibration on the ROC plot. Med Decis Making. 42, 487–99. doi: 10.1177/0272989X211050909
Sakurai, T., Tomimoto, H., and Pantoni, L. (2015). A. new horizon of cerebral white matter hyperintensity in geriatric medicine. Geriatr Gerontol Int. 15 Suppl 1, 1–2. doi: 10.1111/ggi.12714
Schmidt, R., Petrovic, K., Ropele, S., Enzinger, C., and Fazekas, F. (2007). Progression of leukoaraiosis and cognition. Stroke. 38, 2619–25. doi: 10.1161/STROKEAHA.107.489112
Sun, L., Hui, L., Li, Y., Chen, X., Liu, R., and Ma, J. (2022). Pathogenesis and research progress in leukoaraiosis. Front Hum Neurosci. 16, 902731. doi: 10.3389/fnhum.2022.902731
Tang, X., Jiang, L., Luo, Y., Fan, H., Song, L., Liu, P., et al. (2021). Leukoaraiosis and acute ischemic stroke. Eur J Neurosci. 54, 6202–13. doi: 10.1111/ejn.15406
Tubi, M. A., Feingold, F. W., Kothapalli, D., Hare, E. T., King, K. S., Thompson, P. M., et al. (2020). White matter hyperintensities and their relationship to cognition: effects of segmentation algorithm. Neuroimage. 206, 116327. doi: 10.1016/j.neuroimage.2019.116327
van Gijn, J. (1998). Leukoaraiosis and vascular dementia. Neurology 51, S3–8. doi: 10.1212/WNL.51.3_Suppl_3.S3
Velazquez, M., and Lee, Y. (2021). Random forest model for feature-based Alzheimer's disease conversion prediction from early mild cognitive impairment subjects. PLoS ONE. 16, e0244773. doi: 10.1371/journal.pone.0244773
Verhaaren, B. F., Debette, S., Bis, J. C., Smith, J. A., Ikram, M. K., Adams, H. H., et al. (2015). Multiethnic genome-wide association study of cerebral white matter hyperintensities on MRI. Circ Cardiovasc Genet. 8, 398–409. doi: 10.1161/CIRCGENETICS.114.000858
Wang, J., Li, W., Yue, L., Hong, B., An, N., Li, G., et al. (2018). The study of White Matter Hyperintensity (WMH) and factors related to geriatric late-onset depression. Shanghai Arch Psychiatry. 30, 12–9.
Xu, Y., Sun, X., Liu, Y., Huang, Y., Liang, M., Sun, R., et al. (2023). Prediction of subjective cognitive decline after corpus callosum infarction by an interpretable machine learning-derived early warning strategy. Front Neurol. 14, 1123607. doi: 10.3389/fneur.2023.1123607
Yuan, C. L. Y. i. R., Dong, Q., Yao, L., and Liu, B. (2021). The relationship between diabetes-related cognitive dysfunction and leukoaraiosis. Acta Neurol Belg. 121, 1101–10. doi: 10.1007/s13760-021-01676-4
Zeng, W., Chen, Y., Zhu, Z., Gao, S., Xia, J., Chen, X., et al. (2020). Severity of white matter hyperintensities: Lesion patterns, cognition, and microstructural changes. J Cereb Blood Flow Metab. 40, 2454–63. doi: 10.1177/0271678X19893600
Zhang, Y., Duan, Y., Wang, X., Zhuo, Z., Haller, S., Barkhof, F., et al. (2022). A deep learning algorithm for white matter hyperintensity lesion detection and segmentation. Neuroradiology. 64, 727–34. doi: 10.1007/s00234-021-02820-w
Keywords: leukoaraiosis, cognitive dysfunction, machine learning, predictive model, risk factors
Citation: Xu W, Li J, Huang C, Lu Y, Yin L and Zhang G (2025) A comparative study of risk factors in predictive models for cognitive dysfunction in patients with leukoaraiosis based on machine learning algorithms. Front. Neurosci. 19:1654584. doi: 10.3389/fnins.2025.1654584
Received: 27 June 2025; Accepted: 15 October 2025;
Published: 10 December 2025.
Edited by:
Sukhjit Singh Sehra, Wilfrid Laurier University, CanadaReviewed by:
Ju Gao, Suzhou Guangji Hospital, ChinaOkan Imre, Karamanoglu Mehmetbey University, Türkiye
Copyright © 2025 Xu, Li, Huang, Lu, Yin and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Guoxin Zhang, emhhbmdndW94aW4ubmpxeHl5QGZveG1haWwuY29t
Jianning Li1