ORIGINAL RESEARCH article
Front. Neurol.
Sec. Neurorehabilitation
Volume 16 - 2025 | doi: 10.3389/fneur.2025.1612222
This article is part of the Research TopicNew methods in neurorehabilitationView all 3 articles
Development and validation of an early predictive model for Hemiplegic Shoulder Pain: A Comparative Study of Logistic Regression, Support Vector Machine, and Random Forest
Provisionally accepted- 1Department of Rehabilitation Medicine, Shaoxing People's Hospital, Shaoxing, China
- 2Department of Integrated Chinese and Western Medicine, Jiashan First People's Hospital, Jiaxing, China
- 3Department of Gastroenterology, Third Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, Zhejiang Province, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objective In this study, we aim to identify the predictive variables for hemiplegic shoulder pain(HSP) through machine learning algorithms, select the optimal model and predict the occurrence of HSP. Methods Data of 332 stroke patients admitted to a tertiary hospital in Zhejiang Province from January 2022 to January 2023 were collected. After screening predictive variables by LASSO regression, three predictive models selected using the LazyPredict package, namely Logistic Regression(LR), Support Vector Machine(SVM) and Random Forest(RF), were established respectively. The performance parameters (accuracy, precision, recall, F1 score) of the models were calculated, the receiver operating characteristic curve(ROC) and the Decision Curve Analysis(DCA) were plotted to compare the performance of the three models. An explainability analysis (SHAP) was conducted on the optimal model. Results The RF model performed the best, with accuracy: 0.90, precision: 0.89, recall: 0.88, F1 score: 0.86, AUC-ROC: 0.94, and the range of the threshold probability in DCA: 7% -99%. Based on the SHAP analysis of the explainability of the RF model, the contribution degrees of the early HSP predictive variables from high to low are as follows: multiple injuries, shoulder joint flexion (p), biceps tendon effusion, sensory disorder, supraspinatus tendinopathy, subluxation, diabetes, and age.The RF prediction model has a good predictive effect on HSP and has good clinical explainability. It can provide objective references for the early warning and stratified management of HSP.
Keywords: Hemiplegic shoulder pain, Prediction model, random forest, Support vector machine, Shap
Received: 15 Apr 2025; Accepted: 02 Jun 2025.
Copyright: © 2025 Wu, Zhang, Fei, Sima, Gong, Tong, Jiao, Wu and Gong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Jianqiu Gong, Department of Rehabilitation Medicine, Shaoxing People's Hospital, Shaoxing, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.