Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Neurol., 19 December 2025

Sec. Neurorehabilitation

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1709386

Predicting cognitive impairment in Parkinson’s disease: a machine learning approach based on clinical and neuropsychological data

Meili Yang,&#x;Meili Yang1,2Chuxin Wang&#x;Chuxin Wang1Jinying Zhang&#x;Jinying Zhang1Yao XiaoYao Xiao1Yafang ChenYafang Chen1Zeming GuoZeming Guo1Jiayin Wang
Jiayin Wang3*Jinzhong Huang
Jinzhong Huang3*
  • 1Department of Geriatrics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
  • 2Fujian Medical University Union Hospital, Fuzhou, China
  • 3Department of Neurosurgery, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China

Background: Cognitive impairment is a common and disabling non-motor symptom of Parkinson’s disease, markedly diminishing quality of life and elevating caregiver burden. Although considerable research has been conducted, the early prediction of cognitive impairment remains challenging owing to heterogeneous clinical presentations, variations in treatment adherence, and the inherent limitations in sensitivity of conventional biomarkers and cognitive assessment tools.

Methods and materials: A retrospective cohort study involving 514 Parkinson’s disease patients who had complete baseline data and a minimum of 6 months of follow-up. Participants were randomly allocated into a training cohort (n = 359) and a test cohort (n = 155). Demographic, clinical, biochemical, and neuropsychological variables were obtained at baseline. Cognitive impairment was defined based on Mini-Mental State Examination scores falling below education-adjusted thresholds and further validated using the Montreal Cognitive Assessment. Multiple machine learning models—including Random Forest, Logistic Regression, Gradient Boosting, CatBoost, and Support Vector Machine—were developed and evaluated using the area under the receiver operating characteristic curve, accuracy, recall, F1-score, calibration, and decision curve analysis. Feature importance analysis was performed to identify key predictive variables.

Results: During follow-up, patients who developed cognitive impairment were significantly older and had longer disease duration, lower levels of albumin, hematocrit, and blood lipids, as well as a higher prevalence of hypertension. Feature selection identified: Age, Platelet count, Time from diagnosis to baseline visit, Apolipoprotein B, and Hematocrit as the predictors. The Random Forest model demonstrated the best overall performance, with the area under the receiver operating characteristic curve = 0.846, accuracy = 0.75, and an F1-score = 0.775, followed by CatBoost and Logistic Regression. Calibration and decision curve analyses confirmed stable probability estimation and superior clinical utility of Random Forest compared with “treat all” or “treat none” strategies. Further use the Montreal Cognitive Assessment score to verify the stability of the model.

Conclusion: Machine learning models integrating multimodal clinical and neuropsychological data demonstrate high accuracy in predicting cognitive impairment in Parkinson’s disease, with Random Forest emerging as the most reliable approach. This framework provides a practical tool for early risk stratification, potentially enabling timely interventions and individualized management to reduce the burden of cognitive decline in Parkinson’s disease.

Introduction

Parkinson’s disease (PD) is a common neurodegenerative disorder marked not only by classical motor symptoms but also by a wide spectrum of non-motor manifestations, among which cognitive impairment and dementia are particularly disabling. Epidemiological studies have shown that cognitive decline occurs in a substantial proportion of PD patients, with prevalence estimates ranging from 20% to over 40% depending on the diagnostic criteria applied, and disease severity identified as a key predictor of dementia risk (1). Longitudinal cohort investigations further revealed that distinct cognitive syndromes exist in PD, and that genetic and clinical markers, such as the MAPT H1/H1 haplotype and specific baseline cognitive deficits, are strong predictors of progression to dementia (2). Importantly, cognitive decline and neuropsychiatric symptoms such as depression, anxiety, and apathy significantly worsen patients’ quality of life and are closely associated with caregiver burden, often surpassing the impact of motor disability (3). Indeed, longitudinal analyses demonstrate that deterioration in quality of life is most strongly related to worsening cognitive and psychosocial domains, rather than motor symptoms alone (4). Parallel studies confirm that both motor and non-motor features, including stigma and emotional well-being, directly influence caregivers’ stress levels and overall burden (5). In advanced PD, caregiver burden is further intensified, being linked to patient quality of life, speech difficulties, and cognitive deterioration, with therapeutic strategies such as continuous dopaminergic delivery systems offering partial relief (6). Taken together, these findings underscore that cognitive impairment represents not only a common and predictable outcome of PD but also a critical factor diminishing patients’ quality of life and increasing caregiver burden—highlighting the pressing need for early detection and the development of robust predictive models.

Although PD patients often require long-term management, the majority are treated in outpatient settings, where the variability in visit timing, location, and examination protocols contributes to heterogeneous clinical data and inconsistent follow-up. As a result, treatment adherence varies widely among patients, leading to significant differences in disease progression and outcomes. Early identification of patients at risk for cognitive impairment (CI) remains a major challenge. Against this backdrop, there is a clear demand for predictive models that not only incorporate multimodal clinical information but also effectively capture the heterogeneity of real-world outpatient care. Many prior investigations have hinged on the use of specialized, resource-intensive biomarkers, such as neuroimaging and cerebrospinal fluid measures (7). Similarly, cognitive assessment tools such as the Mini Mental Parkinson and SCOPA-COG scales demonstrate utility in rating deterioration, yet their sensitivity to early changes is insufficient for robust prognostication (8). Large cohort studies have further emphasized that baseline demographic and clinical parameters explain only part of the variability in cognitive outcomes, underscoring the need for more reliable predictive measures (9). In addition, although uric acid has been suggested as a potential neuroprotective factor, longitudinal data show inconsistent associations with cognitive decline, reflecting the complexity of disease mechanisms (10). These tools, however, may not be routinely available or scalable in real-world, heterogeneous clinical settings where data can often be fragmented or lacking, representing a large translational gap between the research environment and actual clinical practice. In sharp contrast to these approaches, the present study is uniquely designed to develop and validate a comprehensive and yet operationally practical predictive framework based on widely accessible clinical and neuropsychological parameters. By effectively harnessing the power of machine learning (ML) to integrate heterogeneous data sources, that can potentially overcome the challenges posed by fragmented outpatient data and improve prognostic accuracy without depending on specialized or resource-intensive biomarkers. Ultimately, this work holds the promise to enable earlier interventions, optimize individualized treatment strategies, and alleviate the overall burden of cognitive decline on both patients and caregivers.

The principal contributions of this work are summarized as follows:

1. Development of a practical predictive framework: The study developed and validated multiple ML models to predict cognitive impairment in Parkinson’s disease using exclusively routine clinical and neuropsychological data, thereby creating a tool readily applicable in heterogeneous outpatient settings.

2. Identification of accessible predictors: Through robust feature importance analysis, the study identified key predictive variables such as Age, Platelet count, Time from diagnosis to baseline visit, Apolipoprotein B, and Hematocrit, demonstrating the significant prognostic value embedded in commonly available clinical data.

3. Determination of an optimal model: Comprehensive evaluation of the research established the Random Forest algorithm as the most robust and clinically useful model, highlighting its suitability for handling complex, non-linear relationships in real-world patient data.

4. Rigorous and clinically relevant validation: The current analysis ensured the robustness and generalizability of the findings by validating the model’s performance not only on an independent test set but also using an alternative cognitive assessment tool (MoCA), and by rigorously assessing clinical utility through decision curve analysis.

5. Bridging the gap between research and clinical practice: This study addresses the critical gap between research-focused biomarkers and everyday clinical needs by providing a data-driven framework that leverages existing outpatient data to facilitate early risk stratification and personalized management strategies.

Methods and materials

Patient enrollment

Between January 1, 2020, and December 31, 2024, a total of 656 patients with PD attending their first visit at the Second Affiliated Hospital of Fujian Medical University were screened. After excluding 40 patients due to mortality and 102 lost to follow-up, 514 patients with complete datasets and at least 6 months of follow-up were included in the analysis. These participants were randomly allocated into a training set (n = 359, CI 198 vs. non-CI 161) and a test set (n = 155, CI 85 vs. Non-CI 70). Prediction models were developed using the training cohort and validated in the independent test cohort. Model performance was further verified using MoCA scores, and the results were visualized to enhance interpretability (Figure 1).

Figure 1
Flowchart illustrating a study on PD patients from January 1, 2020, to December 31, 2024. Out of 656 initial patients, 514 completed a 6-month follow-up. Losses included 40 deaths and 102 patients lost to follow-up. The dataset was split into a train set of 359 (198 CI, 161 non-CI) and a test set of 155 (85 CI, 70 non-CI). Models were established and validated, using MoCA scores to validate models, followed by model visualization.

Figure 1. Flowchart of patient selection, model development, and validation.

The definition of CI

CI was primarily defined by the Mini-Mental State Examination (MMSE), with a score below the education-adjusted cutoff (typically <24) indicating CI. As an external validation, the MoCA was also applied, with a cutoff of <26 points used to define CI.

Data collection

Demographic, clinical, and laboratory variables were collected at baseline through electronic medical records, including age, sex, time from diagnosis to baseline visit, comorbidities, hematological and biochemical indices (e.g., hematocrit, albumin, lipid profiles), and lifestyle factors. Cognitive function was assessed using standardized neuropsychological scales. All patients who included in the study met the inclusion criteria: (1) Age >18 years. (2) Patients diagnosed with PD at their first visit to the hospital. (3) Availability of complete demographic, clinical, and laboratory data at baseline. (4) No cognitive impairment at baseline, as determined by standardized neuropsychological assessment (MMSE and MoCA). (5) Patients with at least 6 months of follow-up after baseline evaluation. And other patients excluded from the study by the criteria: (1) Patients who died during the follow-up period. (2) Patients lost to follow-up or with incomplete clinical or neuropsychological data. (3) Patients with pre-existing CI at baseline.

Follow-up

All patients were followed for a period of 6 months after the baseline assessment. Cognitive status was re-evaluated at the end of the follow-up period, and clinical outcomes were confirmed by trained neurologists who were blinded to the model predictions. The start of follow-up for each patient was defined as the time of their first visit to the hospital, and the end of follow-up was set at 6 months thereafter. The occurrence of CI was recorded as a positive case.

Statistical analysis

Statistical analyses were performed using R software (version 4.3.2). Continuous variables were expressed as mean ± standard deviation and compared between groups using Student’s t-test or Mann–Whitney U test, depending on data distribution. Categorical variables were presented as frequencies and percentages and compared using the Chi-square test or Fisher’s exact test as appropriate. To construct prediction models for CI, multiple ML algorithms were applied, including Random Forest (RF), Logistic Regression (LR), Gradient Boosting, Support Vector Machine, and others. Model training was conducted on the training cohort, with hyper-parameter tuning performed via cross-validation. The independent test set was used for external validation. Model performance was evaluated using multiple metrics, including area under the receiver operating characteristic curve (AUROC), accuracy, recall, specificity, F1-score, and Matthews correlation coefficient (MCC). Calibration curves and decision curve analysis (DCA) were employed to assess calibration and clinical utility. Additionally, feature importance analysis was conducted to identify the most influential predictors. The primary predictive outcome was CI defined by the MMSE, while validation was further performed using CI defined by the MoCA scale. A two-sided p-value <0.05 was considered statistically significant.

Results

Patient demography

Baseline demographic and clinical characteristics are summarized in Table 1. Patients who developed CI were significantly older compared to those without CI in both the training and test cohorts (p < 0.001). Time from diagnosis to baseline visit was also longer in the CI group, highlighting “Time” as an important factor (p = 0.002 in the training set; p < 0.001 in the test set). Several laboratory indicators showed significant associations. Lower albumin levels were consistently observed in the CI group (p < 0.001 and p = 0.001, respectively), while the albumin-to-globulin ratio was reduced only in the test set (p = 0.007). Hematological parameters demonstrated differences as well, with reduced lymphocyte counts (p = 0.018), lower hematocrit (p < 0.001 and p = 0.022), and elevated hemoglobin A1c (HbA1c) in the test cohort (p = 0.032). Lipid metabolism markers revealed consistent trends, as patients with CI had lower total cholesterol (p = 0.005 in both sets), lower low-density lipoprotein (LDL-C) (p = 0.002 and p < 0.001), and reduced triglyceride, apolipoprotein A, and apolipoprotein B (APOB) levels (all p < 0.05). In addition, free triiodothyronine (FT3) was significantly lower in the CI group of the training set (p < 0.001). Regarding comorbidities, hypertension was more prevalent among CI patients in both cohorts (p = 0.047 and p < 0.001), and coronary artery disease was significantly enriched in the CI subgroup of the training set (p = 0.004). Interestingly, surgery history showed a difference only in the test set (p = 0.003), while other factors such as gender, diabetes, smoking, and drinking were not statistically different.

Table 1
www.frontiersin.org

Table 1. Demographic characteristics of the training set and test set.

Construct the predicted model by ML

The Table 2 showed that the decision tree-based feature selection method effectively identified the most predictive variables, with Age demonstrating the highest importance (0.23 ± 0.02), followed by platelet (PLT), Time (between diagnosis and initial presentation to the hospital), APOB and Hematocrit (HCT). This approach highlights the model’s ability to prioritize clinically relevant features while filtering out non-informative predictors, enhancing interpretability and reducing overfitting.

Table 2
www.frontiersin.org

Table 2. Clinical prediction of optimized machine learning framework.

In the optimized ML framework, feature importance analysis revealed that age was the most influential predictor of cognitive impairment in patients with PD, contributing substantially more than any other variable (mean importance = 0.23 ± 0.02). Other features with relatively modest but notable predictive value included PLT (0.03 ± 0.02), Time from diagnosis to baseline visit (Time, 0.03 ± 0.02), APOB (0.02 ± 0.01), and HCT (0.01 ± 0.01). By contrast, a large number of clinical and biochemical parameters—such as homocysteine, D-dimer, lipid profiles, thyroid hormones, and comorbidity indicators—showed negligible or no contribution, with mean importance values approaching zero. These findings suggest that the predictive performance of the optimized framework was primarily driven by a limited set of variables, with age emerging as the dominant determinant of model output, while most other clinical and laboratory features offered minimal incremental value.

Model discrimination and calibration

As shown in Figure 2, receiver operating characteristic (ROC) analysis demonstrated that most ML models exhibited favorable discrimination for predicting cognitive impairment in PD. The RF model achieved the highest area under the curve (AUC: 0.846), followed closely by CatBoost (AUC: 0.834) and Logistic Regression (LR) (AUC: 0.827), whereas AdaBoost and Naïve Bayes performed less effectively. Calibration plots further indicated that RF, LR, and Gradient Boosting Machine maintained relatively good alignment between predicted and observed outcomes, suggesting stable probability estimation.

Figure 2
Panel A shows ROC curves for multiple models, displaying different sensitivities and specificities. Panel B features calibration plots with observed versus predicted probabilities for each model, labeled with AUC values. Panel C presents DCA curves, illustrating net benefit across risk thresholds for the models, comparing them to standard strategies. Each model has distinct lines and colors.

Figure 2. Performance evaluation of machine learning models for predicting cognitive impairment after Parkinson’s disease. (A) ROC curve plot showing the discriminative ability of different machine learning models in the test cohort. The Random Forest (RF) model achieved the highest predictive performance (AUC = 0.846), followed by CatBoost (AUC = 0.834) and Logistic Regression (AUC = 0.827), while AdaBoost and naïve bayes yielded relatively lower AUC values. (B) Calibration plots demonstrating the agreement between predicted probabilities and observed outcomes across models. The Random Forest, Logistic Regression, and Gradient Boosting Machine (GBM) models showed better calibration, indicating reliable probability estimation compared with other algorithms. (C) Decision curve analysis (DCA) illustrating the clinical utility of each model across a range of threshold probabilities. Most machine learning models, particularly Random Forest, Logistic Regression, and Support Vector Machine (SVM), provided greater net benefit than the “treat all” or “treat none” strategies, highlighting their potential clinical applicability.

SHAP analyses were performed to improve the interpretability of the optimal RF model. The SHAP summary plot (Figure 3) demonstrates the effects of the top features for the model. It can be seen that the older Age and longer Time from diagnosis to presentation were the strongest predictors for higher risk of cognitive impairment. The higher values of APOB and HCT, on the other hand, were associated with the lower risk of CI, which was consistent with their protective effect. This individual-level explainability provides insights that the model’s decision is clinically intuitive.

Figure 3
SHAP importance plot displaying horizontal bars for features' contributions.

Figure 3. Global feature importance based on SHAP values. This bar plot depicts the mean absolute SHAP value for each of the top five predictive features in the final Random Forest model. The SHAP value represents the average magnitude of a feature’s contribution to the model’s output across all instances in the test cohort. The ranking confirms that age is the most influential predictor of cognitive impairment, followed by Time, between diagnosis and initial presentation to the hospital; PLT, platelet count; APOB, apolipoprotein B; HCT, hematocrit.

The model clinical utility

DCA revealed that several models, particularly RF, LR, and Support Vector Machine (SVM), consistently offered greater net clinical benefit across a wide range of threshold probabilities compared with “treat all” or “treat none” strategies. These findings underscore the clinical applicability of the optimized ML framework, with RF emerging as the most robust and reliable model (Figure 2).

Performance metrics of the ML models

As presented in Table 3, most ML algorithms demonstrated satisfactory predictive performance, with mean scores of 0.709 for accuracy, 0.778 for recall, 0.737 for F1-score, and an average AUROC of 0.765. Among the models, RF achieved the best overall balance, with high AUROC (0.846), accuracy (0.75), and F1-score (0.775), indicating strong discriminative power and robustness. LR and CatBoost also performed well, yielding AUROCs of 0.827 and 0.834, respectively, coupled with recall values exceeding 0.80, suggesting reliable sensitivity for detecting cognitive impairment. Decision Tree and XGBoost provided acceptable results, though slightly lower than the top-performing models. In contrast, Naïve Bayes showed the weakest performance, with the lowest accuracy (0.59), F1-score (0.583), and MCC (0.196), reflecting limited predictive value. Overall, RF, LR, and CatBoost consistently emerged as the most reliable algorithms for cognitive impairment prediction, outperforming other models across multiple evaluation metrics (Table 3).

Table 3
www.frontiersin.org

Table 3. Performance metrics of the machine learning model.

Performance of RF models

As shown in Figure 4, the RF algorithm demonstrated consistently strong predictive ability across ten iterations in the test cohort. The area under the ROC curve (AUC) values ranged from 0.779 to 0.929, with the RF_3 model achieving the highest discriminative power (AUC = 0.929). These results highlight both the robustness and stability of RF in identifying patients at risk of cognitive impairment. Calibration analysis revealed that several RF models showed close alignment between predicted and observed probabilities, supporting the reliability of probability estimation. DCA further confirmed the clinical usefulness of these models, as most RF variants provided greater net benefit compared with “treat all” or “treat none” strategies across a wide range of threshold probabilities. Collectively, these findings establish RF as the most effective model for cognitive impairment prediction in this study.

Figure 4
Panel A displays a ROC curve plot showing sensitivity versus one minus specificity for various tests with AUC values. Panel B illustrates a calibration plot with observed versus predicted probability, indicating the accuracy of predictions. Panel C presents a DCA curve plot depicting net benefit against risk threshold for different tests, comparing strategies of treating all versus treating none.

Figure 4. Performance evaluation of Random Forest models in predicting cognitive impairment after Parkinson’s disease. (A) ROC curves of ten Random Forest models in the test cohort. The majority of models achieved strong discriminative ability, with AUC values ranging from 0.779 to 0.929, and the RF_3 model showing the best overall performance (AUC = 0.929). (B) Calibration plots illustrating the agreement between predicted and observed probabilities. Several Random Forest models demonstrated good calibration, with curves aligning closely to the ideal diagonal, indicating reliable probability estimation across different thresholds. (C) DCA showing the net clinical benefit of the Random Forest models. Most variants consistently outperformed both the “treat all” and “treat none” strategies, demonstrating meaningful clinical utility across a wide range of risk thresholds.

Validated model by the MoCA score using LR

As illustrated in Figure 5, the Logistic Regression model demonstrated strong discriminative ability for predicting cognitive impairment in Parkinson’s disease. The ROC curve (Panel A) showed an area under the curve (AUC) of 0.86, indicating reliable classification performance between patients with and without cognitive impairment. Calibration analysis (Panel B) revealed close agreement between predicted and observed probabilities, with the bias-corrected line closely aligned with the ideal diagonal, and a mean absolute error of only 0.019 across 300 bootstrap resampling’s. These findings suggest that the model provided accurate probability estimation and was not overfitted. Furthermore, decision curve analysis (Panel C) confirmed the clinical utility of the Logistic Regression model. Across a wide range of threshold probabilities, the model consistently offered greater net clinical benefit compared to “treat all” or “treat none” strategies. Collectively, these results demonstrate that Logistic Regression is a robust and interpretable model with both high predictive accuracy and practical applicability for risk stratification in PD.

Figure 5
Panel A shows a logistic regression ROC curve with an AUC of 0.86, indicating model performance. Panel B presents a calibration curve, comparing apparent, bias-corrected, and ideal probabilities. Panel C displays a decision curve analysis plot, illustrating net benefit against risk thresholds, with lines for the model, treat all, and treat none scenarios.

Figure 5. Logistic Regression model performance in predicting CI after PD by the MoCA score. (A) ROC curve showing the discriminative performance of the Logistic Regression model in the test cohort, with an AUC of 0.86, indicating strong predictive accuracy. (B) Calibration plot demonstrating close agreement between predicted probabilities and observed outcomes. The bias-corrected curve aligned well with the ideal diagonal, and the mean absolute error was low (0.019), supporting the reliability of the model’s probability estimates. (C) DCA illustrating the clinical utility of the Logistic Regression model. Across a wide range of threshold probabilities, the model consistently provided greater net benefit compared with both “treat all” and “treat none” strategies, highlighting its potential value for guiding clinical decision-making.

Discussion

The current analysis developed and validated multiple ML models to predict CI in newly diagnosed patients with PD using baseline clinical and neuropsychological data. Among the algorithms tested, the RF model demonstrated the highest overall performance, achieving an AUROC of 0.846, with robust accuracy and F1-score across both training and test cohorts. Key predictive features included Age, PLT, Time from diagnosis to baseline visit, APOB, and HCT, suggesting that both demographic and routine laboratory variables carry significant prognostic value. The model’s reliability was further supported by calibration and decision curve analyses, which confirmed its stable probability estimates and clinical usefulness. These findings indicate that ML approaches—particularly tree-based models like RF—can effectively integrate heterogeneous outpatient data to enable early identification of PD patients at risk for CI, thereby offering a valuable tool for individualized care planning. The superior performance of RF may be attributed to its ability to handle non-linear relationships and interactions among variables without overfitting, which is particularly suitable for heterogeneous clinical datasets.

To provide more transparency of our best-performing RF model, the present research performed SHAP analysis and ‘opened the black box’ of the model. The resulting plots, displayed in Figure 3, offer global as well as individual level explainability. They allow for not only assessing the overall importance of the predicted model features and their direction (i.e., age and disease duration are positive risk factors), but also for importantly seeing how individual values of APOB and HCT drive the prediction toward or away from a CI diagnosis on an individual patient level. For example, one can see that a low HCT was the main driver for a high risk prediction for a certain patient. By thus moving away from simply providing accurate predictions to also offering explanations of these predictions.

The findings align with and extend previous research demonstrating that clinical and neuropsychological features are strong predictors of cognitive decline in PD. Similar to Schrag et al. who identified age, APOE status, and non-motor symptoms such as hyposmia and REM sleep behavior disorder as predictors of CI using a multimodal approach, the RF model also highlighted age and APOB as important features, supporting the role of both demographic and lipid-related factors in early cognitive prognosis (11). Unlike that study, however, the current analysis did not incorporate CSF or imaging biomarkers, and instead demonstrated that widely available clinical and laboratory data can achieve comparable predictive accuracy. The results also resonate with Savica et al. who found that gait parameters such as stride length and gait variability were associated with domain-specific cognitive decline; although the research did not explicitly include gait variables, the overlap in clinical markers like motor assessment scores suggests converging evidence on physical function as a cognitive marker (12). In contrast to studies like Pirogovsky-Turk et al. and Hu et al. which relied heavily on neuroimaging or task-specific neuropsychological assessments for prediction, our model emphasizes the value of a multi-domain, non-invasive approach using routine data to achieve clinically meaningful prediction in early PD cohorts (13, 14). This underscores a key contribution of the work: demonstrating that clinically accessible variables, when analyzed through ML frameworks, can rival more resource-intensive biomarker strategies for early risk stratification.

Multiple risk factors—including advanced age, longer disease duration, reduced HCT, APOB, and abnormal PLT activity—converge on several pathophysiological pathways to drive cognitive decline in PD. A key unifying mechanism is chronic neuroinflammation, wherein microglial activation and pro-inflammatory cytokine release are promoted by APOB-driven atherosclerosis, PLT-derived mediators, age-related immunosenescence, and hypoxia secondary to low HCT (1518). Concurrently, these factors exacerbate cerebrovascular dysfunction and blood–brain barrier (BBB) disruption. APOB and platelet hyperactivity contribute to microvascular damage and thrombotic risk, while anemia (low HCT) reduces cerebral oxygen delivery, impairing neuronal metabolism (19). Age and disease duration further weaken vascular integrity. A compromised BBB facilitates the entry of peripheral inflammatory agents and impedes the clearance of toxic proteins. Additionally, mitochondrial dysfunction and oxidative stress are amplified by impaired oxygen utilization (HCT), metabolic dysregulation (APOB), and inflammatory signals. Finally, these processes create a permissive environment for pathological protein accumulation and spread—particularly α-synuclein aggregation—which propagates through vulnerable neural networks, ultimately leading to synaptic failure and cognitive impairment (20). Thus, these risk factors collectively accelerate PD-related cognitive decline by interacting through neuroinflammatory, vascular, metabolic, and proteinopathic mechanisms. Targeting these shared pathways may offer novel therapeutic strategies to preserve cognitive function in PD.

To further validate the robustness of the RF model, the stdy incorporated cognitive outcomes based on the MoCA in addition to the MMSE. While MMSE is widely used in clinical settings, it may lack sensitivity to detect early executive and visuospatial deficits common in PD. MoCA, on the other hand, is more sensitive to mild CI. By leveraging both tools, the RF model benefits from complementary diagnostic strengths, enabling more accurate identification of CI across different clinical profiles in PD patients.

Limitation

This study has several limitations. First, while the sample size in this study was large enough to create the initial model, the study did not have external validation on a multi-center or prospective cohort. This could lead to limited generalizability to larger and more diverse populations of PD patients. External validation on an independent, multi-center dataset should be a future direction. Second, although our model demonstrated strong performance using clinical and neuropsychological features, the current analysis did not include imaging or CSF biomarkers, which may further enhance predictive accuracy. Third, the use of MMSE and MoCA as outcome measures, while clinically relevant, may be subject to educational and cultural bias. Lastly, the retrospective design may introduce unmeasured confounders, and external validation in prospective, multicenter cohorts is still needed. Future studies integrating multimodal biomarkers and real-world follow-up data will be critical to further improving the model’s clinical utility and robustness. Fourth, the 6-month follow-up duration is a limitation as it is relatively short for outcome prediction models. The natural history of cognitive decline in PD is protracted, with changes in cognition often occurring over several years. The purpose of this study was to identify early signals of risk and to enable earlier intervention, for which a 6-month time horizon is reasonable and can capture clinically meaningful shifts, particularly among those with more rapid progression or greater vulnerability at baseline. However, it is certainly too short to represent the long-term trajectory of cognitive decline, or to accurately predict conversion to Parkinson’s disease dementia. On the other hand, all the predictors in the final model have well-documented associations as longer-term risk factors (especially age, disease duration), which provides some face validity that these would apply over longer timescales as well. The model’s predictive accuracy for CI at 3–5 years is currently unknown, and should be explicitly tested in future prospective work with longer follow-up. Finally, potential effects of variants with well-established associations with PD (e.g., GBA or LRRK2) on cognitive end points were not investigated, because genetic testing was not systematically performed. Genetic data in future studies could be used to enhance the accuracy of the model and might reveal genotype-specific patterns of prediction.

Conclusion

This study demonstrates that ML models using clinical and neuropsychological data can effectively predict early cognitive impairment in Parkinson’s disease. The Random Forest model showed strong performance, and validation with both MMSE and MoCA enhanced robustness. Th findings demonstrate that data-driven approaches can enhance early identification of cognitive decline in PD and support personalized clinical decision-making. Future research should focus on prospective validation and the integration of imaging and molecular biomarkers to further improve predictive precision.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Second Affiliated Hospital of Fujian Medical University ethics committee ([2024] ethical review approval no. 338). The studies were conducted in accordance with the Declaration of Helsinki, local legislation and institutional requirements. Written informed consent was obtained from all participants or their legal guardians prior to enrollment.

Author contributions

MY: Conceptualization, Methodology, Writing – original draft. CW: Data curation, Software, Writing – review & editing. JZ: Formal analysis, Methodology, Writing – review & editing. YX: Investigation, Resources, Writing – review & editing. YC: Resources, Supervision, Writing – review & editing. ZG: Methodology, Resources, Writing – review & editing. JW: Supervision, Validation, Visualization, Writing – review & editing. JH: Conceptualization, Funding acquisition, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the Natural Science Foundation of Fujian Province, China (grant no.: 2023J01750), the Joint Funds for the Innovation of Science and Technology, Fujian Province (grant no.: 2024Y9361; 2024Y9396), and the Quanzhou High-Level Talents Innovation and Entrepreneurship Project (no. 2024QZC010YR).

Acknowledgments

This study was supported by the Second Affiliated Hospital of Fujian Medical University. We thank all of the participants in the study for their cooperation in this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2025.1709386/full#supplementary-material

References

1. Riedel, O, Klotsche, J, Spottke, A, Deuschl, G, Förstl, H, Henn, F, et al. Cognitive impairment in 873 patients with idiopathic Parkinson’s disease: results from the german study on epidemiology of Parkinson’s disease with dementia (GEPAD). J Neurol. (2008) 255:255–64. doi: 10.1007/s00415-008-0720-2,

PubMed Abstract | Crossref Full Text | Google Scholar

2. Williams-Gray, CH, Evans, JR, Goris, A, Foltynie, T, Ban, M, Robbins, TW, et al. The distinct cognitive syndromes of Parkinson’s disease: 5 year follow-up of the CamPaIGN cohort. Brain. (2009) 132:2958–69. doi: 10.1093/brain/awp245,

PubMed Abstract | Crossref Full Text | Google Scholar

3. Juneja, A, Anand, K, Chandra, M, Deshpande, S, Dhamija, R, Kathuria, P, et al. Neuropsychiatric symptoms and caregiver burden in Parkinson’s disease. Ann Indian Acad Neurol. (2020) 23:656–60. doi: 10.4103/aian.AIAN_91_20,

PubMed Abstract | Crossref Full Text | Google Scholar

4. Visser, M, Verbaan, D, Van Rooden, S, Marinus, J, Van Hilten, J, and Stiggelbout, A. A longitudinal evaluation of health-related quality of life of patients with Parkinson’s disease. Value Health. (2009) 12:392–6. doi: 10.1111/j.1524-4733.2008.00430.x,

PubMed Abstract | Crossref Full Text | Google Scholar

5. Rajiah, K, Maharajan, MK, Yeen, SJ, and Lew, S. Quality of life and caregivers’ burden of Parkinson’s disease. Neuroepidemiology. (2017) 48:131–7. doi: 10.1159/000479031,

PubMed Abstract | Crossref Full Text | Google Scholar

6. Tessitore, A, Marano, P, Modugno, N, Pontieri, FE, Tambasco, N, Canesi, M, et al. Caregiver burden and its related factors in advanced Parkinson’s disease: data from the PREDICT study. J Neurol. (2018) 265:1124–37. doi: 10.1007/s00415-018-8816-9,

PubMed Abstract | Crossref Full Text | Google Scholar

7. Shi, M, Huber, BR, and Zhang, J. Biomarkers for cognitive impairment in Parkinson disease. Brain Pathol. (2010) 20:660–71. doi: 10.1111/j.1750-3639.2009.00370.x,

PubMed Abstract | Crossref Full Text | Google Scholar

8. Serrano-Dueñas, M, Calero, B, Serrano, S, Serrano, M, and Coronel, P. Metric properties of the mini-mental Parkinson and SCOPA-COG scales for rating cognitive deterioration in Parkinson’s disease. Mov Disord. (2010) 25:2555–62. doi: 10.1002/mds.23322,

PubMed Abstract | Crossref Full Text | Google Scholar

9. Schneider, JS, Elm, JJ, Parashos, SA, Ravina, BM, and Galpern, WR. Predictors of cognitive outcomes in early Parkinson disease patients: the national institutes of health exploratory trials in Parkinson disease (NET-PD) experience. Parkinsonism Relat Disord. (2010) 16:507–12. doi: 10.1016/j.parkreldis.2010.06.001,

PubMed Abstract | Crossref Full Text | Google Scholar

10. Annanmaki, T, Pohja, M, Parviainen, T, Hakkinen, P, and Murros, K. Uric acid and cognition in Parkinson’s disease: a follow-up study. Parkinsonism Relat Disord. (2011) 17:333–7. doi: 10.1016/j.parkreldis.2011.01.013,

PubMed Abstract | Crossref Full Text | Google Scholar

11. Schrag, A, Siddiqui, UF, Anastasiou, Z, Weintraub, D, and Schott, JM. Clinical variables and biomarkers in prediction of cognitive impairment in patients with newly diagnosed Parkinson’s disease: a cohort study. Lancet Neurol. (2017) 16:66–75. doi: 10.1016/S1474-4422(16)30328-3,

PubMed Abstract | Crossref Full Text | Google Scholar

12. Savica, R, Wennberg, AMV, Hagen, C, Edwards, K, Roberts, RO, Hollman, JH, et al. Comparison of gait parameters for predicting cognitive decline: the mayo clinic study of aging. J Alzheimer's Dis. (2016) 55:559–67. doi: 10.3233/JAD-160697,

PubMed Abstract | Crossref Full Text | Google Scholar

13. Hu, W-D, Chen, J, Mao, C-J, Feng, P, Yang, Y-P, Luo, W-F, et al. Elevated cystatin C levels are associated with cognitive impairment and progression of Parkinson disease. Cogn Behav Neurol. (2016) 29:144–9. doi: 10.1097/WNN.0000000000000100,

PubMed Abstract | Crossref Full Text | Google Scholar

14. Pirogovsky-Turk, E, Moore, RC, Filoteo, JV, Litvan, I, Song, DD, Lessig, SL, et al. Neuropsychiatric predictors of cognitive decline in Parkinson disease: a longitudinal study. Am J Geriatr Psychiatry. (2017) 25:279–89. doi: 10.1016/j.jagp.2016.10.004,

PubMed Abstract | Crossref Full Text | Google Scholar

15. Pizza, V, Agresta, A, D’Acunto, CW, Festa, M, and Capasso, A. Neuroinflamm-aging and neurodegenerative diseases: an overview. CNS Neurol Disord Drug Targets. (2011) 10:621–34. doi: 10.2174/187152711796235014

Crossref Full Text | Google Scholar

16. de Araújo Boleti, AP, Oliveira Flores, TM, Moreno, SE, Anjos, LD, Mortari, MR, and Migliolo, L. Neuroinflammation: an overview of neurodegenerative and metabolic diseases and of biotechnological studies. Neurochem Int. (2020) 136:104714. doi: 10.1016/j.neuint.2020.104714,

PubMed Abstract | Crossref Full Text | Google Scholar

17. Ghadery, C, Koshimori, Y, Christopher, L, Kim, J, Rusjan, P, Lang, AE, et al. The interaction between neuroinflammation and β-amyloid in cognitive decline in Parkinson’s disease. Mol Neurobiol. (2020) 57:492–501. doi: 10.1007/s12035-019-01714-6,

PubMed Abstract | Crossref Full Text | Google Scholar

18. Adamantidi, T, Grabrucker, AM, and Tsoupras, A. Targeting platelet activating factor signaling for therapeutic benefits in neurodegenerative disorders. Front Biosci (Landmark Ed). (2025) 30:38300. doi: 10.31083/FBL38300,

PubMed Abstract | Crossref Full Text | Google Scholar

19. Tang, H, Gao, Y, Zhang, Q, Nie, K, Zhu, R, Gao, L, et al. Chronic cerebral hypoperfusion independently exacerbates cognitive impairment within the pathopoiesis of Parkinson’s disease via microvascular pathologys. Behav Brain Res. (2017) 333:286–94. doi: 10.1016/j.bbr.2017.05.061,

PubMed Abstract | Crossref Full Text | Google Scholar

20. Kramer, ML, and Schulz-Schaeffer, WJ. Presynaptic alpha-synuclein aggregates, not Lewy bodies, cause neurodegeneration in dementia with Lewy bodies. J Neurosci. (2007) 27:1405–10. doi: 10.1523/JNEUROSCI.4564-06.2007,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Parkinson’s disease, cognitive impairment, prediction, machine learning, Random Forest, outpatient management

Citation: Yang M, Wang C, Zhang J, Xiao Y, Chen Y, Guo Z, Wang J and Huang J (2025) Predicting cognitive impairment in Parkinson’s disease: a machine learning approach based on clinical and neuropsychological data. Front. Neurol. 16:1709386. doi: 10.3389/fneur.2025.1709386

Received: 24 September 2025; Revised: 17 November 2025; Accepted: 25 November 2025;
Published: 19 December 2025.

Edited by:

Shruti Verma, Symbiosis International University, India

Reviewed by:

Miodrag Zivkovic, Singidunum University, Serbia
Ali Amouzandeh, Ernst von Bergmann Clinic, Germany

Copyright © 2025 Yang, Wang, Zhang, Xiao, Chen, Guo, Wang and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jiayin Wang, d2p5X3lzcUAxNjMuY29t; Jinzhong Huang, aHVhbmdqaW56aG9uZzIwMjVAMTYzLmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.