- 1Center for Movement Disorders, Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- 2Department of Neurology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
- 3GYENNO Science Co., Ltd., Shenzhen, China
- 4HUST-GYENNO CNS Intelligent Digital Medicine Technology Center, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, China
- 5China National Clinical Research Center for Neurological Diseases, Beijing, China
Objectives: To develop and validate machine learning models to predict levodopa responsiveness of tremor in Parkinson’s disease (PD) patients.
Methods: A total of 197 PD tremor patients underwent Levodopa Challenge Tests and were classified as having levodopa-responsive or levodopa-resistant tremor. Clinical and electromyogram (EMG) tremor analysis variables were recorded. The dataset was randomly divided into a training set (80%) and a test set (20%). To distinguish between the two groups, Support vector machine (SVM), random forest (RF), and logistic regression (LR) models were developed using training data. The optimal model was validated on test data. Calibration and decision curve analyses assessed model reliability and clinical utility.
Results: Among 197 patients, 95 had levodopa-responsive tremor, and 102 had levodopa-resistant tremor. The SVM model showed the best performance, achieving an accuracy of 81.5% in five-fold cross-validation, with a Kappa score of 0.624, sensitivity of 84.3%, specificity of 77.9%, and an area under the curve (AUC) of 0.850. Performance remained consistent on test data, with 82.5% accuracy, 0.653 Kappa, 93.8% sensitivity, 75% specificity, and 0.896 AUC. The best model incorporated 6 predictors: resting tremor score, rigidity/tremor ratio, postural and kinetic tremor score, disease duration, the Movement Disorder Society’s Unified Parkinson’s Disease Rating Scale III (MDS-UPDRS III) /disease duration, and supine diastolic blood pressure (DBP).
Conclusion: The SVM model, incorporating six key indicators, holds significant potential for predicting levodopa responsiveness in PD tremor, offering a valuable tool for the precise treatment of tremor in PD patients.
1 Introduction
Parkinson’s disease (PD) is characterized by dopamine depletion, resulting in symptoms such as akinesia, rigidity, and tremor. Tremor, which affects approximately 75% of PD patients, is one of the most important motor symptoms of the disease (Helmich et al., 2012). Levodopa remains the most effective treatment for PD, but its effect on tremor varies widely among patients. Compared with akinesia and rigidity, tremor tends to respond less effectively to dopaminergic therapy (Fishman, 2008). While some patients achieve near-complete tremor suppression, approximately 20–30% exhibit dopamine-resistant tremor even with optimized high-dose therapy (Solida et al., 2002). This therapeutic unpredictability leads to prolonged “trial-and-error” dosing, delaying effective symptom control by months or years. The reasons for this variability in tremor response to levodopa remain unclear. Given the significant interindividual variability in tremor response to levodopa, there is an urgent need to identify biomarkers that can predict levodopa efficacy and allow for personalized PD tremor treatment. Traditionally, the levodopa challenge test (LCT) has been used to assess the efficacy of levodopa. However, significant inconsistencies exist in its implementation and interpretation across different medical centers (Pieterman et al., 2018). LCT requires a lengthy evaluation period and can cause side effects such as dizziness, nausea, vomiting, and hypotension. Consequently, this methodology presents notable challenges, including patient discomfort, substantial time investment, and significant resource allocation for clinical facilities, while also posing non-negligible procedural risks. These limitations underscore the critical need for developing efficient predictive models, including valid predictors to evaluate levodopa responsiveness to tremor that mitigate these inherent drawbacks while maintaining diagnostic accuracy.
While numerous clinical factors have been identified as being associated with levodopa responsiveness, the factors influencing the responsiveness of PD tremor to levodopa remain inconsistent in the literature. A previous study showed that patients with dopamine-responsive tremor tended to have more severe disease, longer disease duration, and a higher incidence of dyskinesia (Zach et al., 2020). In another study, a comparison between dopamine-resistant and dopamine-responsive groups revealed statistical differences in gender and levodopa equivalent daily dose (LEDD), but no significant differences in disease duration, Hoehn and Yahr scale (H-Y stage), or Mini-Mental State Examination (MMSE) scores (Dirkx et al., 2019). A separate study focused on predicting clinical factors for dopaminergic treatment response in PD patients with resting tremor. It found that age was significantly different between the responsive and non-responsive groups, while no differences were observed in gender, disease duration, or levodopa dosage. The dopamine-responsive group had significantly higher scores for tremor, rigidity, akinesia, and postural instability, as well as a higher total Movement Disorder Society’s Unified Parkinson’s Disease Rating Scale III (MDS-UPDRS III) score and H-Y stage, compared to the non-responsive group (Sung et al., 2008). More critically, these studies universally neglect autonomic biomarkers—despite some of the PD patients exhibiting orthostatic hypotension and established correlations between sympathetic denervation and tremor-dominant subtypes. Previous findings demonstrate that supine diastolic blood pressure (DBP) reflects central autonomic regulation, potentially modulating tremor circuits through noradrenergic pathways, providing a novel physiological rationale for feature selection. In addition to clinical factors, electromyogram (EMG) tremor analysis has been widely used to measure the difference in dominant frequency and rhythm pattern of tremor between different types of PD tremor. Previous studies have shown that kinetic tremor, which tends to have a higher frequency, is less responsive to dopaminergic therapy than classic resting tremor (Dirkx et al., 2018; Pirker et al., 2023). However, features of EMG tremor analysis have not been investigated as predictors before. Because there are currently inconsistent univariate and multivariate results, further screening and confirmation of predictors is required. New predictive models are therefore urgently needed.
In contrast to traditional predictive models, machine learning approaches consider interregional correlations, which improves the accuracy of the model. Moreover, newer machine learning algorithms have outperformed traditional linear models, such as logistic regression or Cox regression, in handling non-linear and high-dimensional data. This highlights the importance of applying innovative machine learning methods that can effectively capture non-linearity and complex data in clinical settings. Currently, there are no machine learning predictive models for the responsiveness of PD tremor to levodopa.
In this study, we developed machine learning models using both clinical data and features extracted from EMG analysis to predict the levodopa response in PD tremor. Furthermore, in order to verify the validity of the model, independent test data were employed in the study to validate the final model.
2 Materials and methods
2.1 Subjects
A total of 197 patients diagnosed with idiopathic PD were enrolled from the Movement Disorder Centre, Department of Neurology, Beijing Tiantan Hospital. Participants were included based on the following criteria: (1) The diagnosis of PD was based on the 2015 MDS diagnosis criteria (Postuma et al., 2015); (2) at least one of the following MDS-UPDRS III items had a score of ≥1: Postural tremor of limbs, Kinetic tremor of limbs, resting tremor amplitude, and Constancy of resting tremor. The exclusion criteria were as follows: (1) a history of any brain surgery or neuromodulation therapy (i.e., DBS); (2) extreme physical disability that impairs mobility assessment; and (3) impaired cognitive function or mental problems leading to a failure to complete all the procedures. Written informed consent was obtained from all participants. This study was approved by the Institutional Review Board of Beijing Tiantan Hospital, Capital Medical University (KY2024-037-02).
2.2 Data collection
A comprehensive set of demographic data, clinical data, motor symptoms, non-motor symptoms and EMG tremor analysis was assessed under the guidance of movement disorder specialists. Only complete cases (i.e., data without missing values) were included in the analysis.
2.2.1 Demographic and clinical features
Sex, age, onset age, disease duration, and medication history were recorded by inquiring about the case history. Disease staging was assessed by the H-Y stage.
2.2.2 Motor symptoms features
Objective motor symptoms were assessed using the MDS-UPDRS III. In order to reflect different aspects of MDS-UPDRS III in OFF state, the following sub-symptoms and sub-scores were calculated: tremor score (sum of scores on items 3.15–3.18), rigidity score (sum of scores on item 3.3), postural instability score (sum of scores on items 3.9–3.13), hypokinesia score (sum of scores on items 3.4–3.8 and 3.14), midline function score (sum of scores on items 3.1, 3.2, 3.9–3.14), resting tremor score (sum of scores on items 3.17 and 3.18), postural and kinetic tremors score (sum of scores on items 3.15 and 3.16), bradykinesia upper extremity score on more affected side (MAS) [sum of scores on items 3.4 (MAS), 3.5 (MAS), and 3.6 (MAS)], and lower limb bradykinesia score (sum of scores on items 3.7 and 3.8). These definitions are summarized in Table 1 (Goetz et al., 2008). The MAS was determined based on the sum of item scores of the MDS-UPDRS III (3.15; 3.16; 3.17 RUE, LUE, RLE, LLE). If the sum of the above item scores was the same for the left and right side, then the MAS was determined based on the sum of the scores of all the left sided and right sided item on MDS-UPDRS III (3.3 RUE, LUE, RLE, LLE; 3.4; 3.5; 3.6; 3.7; 3.8; 3.15; 3.16; 3.17 RUE, LUE, RLE, LLE). In order to reflect disease progress, several features, including MDS-UPDRS III/disease duration, postural instability/disease duration, hypokinesia/disease duration, tremor /disease duration, and rigidity /disease duration (Table 1), were calculated.
2.2.3 Non-motor symptom features
We measured blood pressure (BP), including supine and standing systolic blood pressure (SBP) and diastolic blood pressure (DBP). Orthostatic hypotension (OH) was also recorded (Freeman et al., 2011). MMSE was used to assess global cognitive impairment.
2.2.4 EMG tremor analysis features
A six-channel Nicolet EDX system (Nicolet company, US) with 4 pairs of EMG surface electrodes and 2 accelerometers was used in this study. During the analysis of the upper limbs, the data were recorded in four states: (1) at rest: patients sat in an armchair with their forearms resting on the armrests, leaving the hands free in the air to measure hand tremors; (2) in a posture: patients raised their arms in front of their body with their wrists parallel with the ground; (3) in a kinetic: patients raised their arms in front of their body with their finger pointing to the nose; and (4) holding 1,000 g: patients held their arms in front of them while holding 1,000 g sandbags. Each state was recorded for 30 s. During the analysis of the lower limbs, the data were recorded under two states: (1) at rest: patients sat calmly in a chair with their feet flat on the ground and relaxed completely; and (2) in a posture: patients sat in a chair with their toes touching the ground and heels suspended. Each state was recorded for 30 s. The dominant frequency and rhythm pattern of tremor were measured under four conditions: at rest, posture, kinetic, and holding 1 kg conditions. These features were constructed as categorical variables to represent clinically relevant distinctions in tremor characteristics based on EMG data, as summarized in Table 2. Tremor frequency features were quantitatively obtained from the acquisition system, which automatically computed the dominant frequency of EMG signals. The continuous frequency values were then discretized into clinically meaningful ranges following established criteria from Helmich’s research (Helmich et al., 2012). 0: Not seen—No resting tremor was detected (clinically observed by a movement disorder specialist). 1: Irregular jerks—Inconsistent or sporadic movements, not classifiable as rhythmic tremor (clinically observed by a movement disorder specialist). 2: <4 Hz—Slow-frequency for resting tremor. 3: [4–6] Hz—Typical frequency range for resting tremor. 4: >6 Hz—Higher frequency for resting tremor. Thus, the raw frequency values were directly computed by the instrument, while the categorical thresholds (<4 Hz, 4–6 Hz, >6 Hz) were defined according to published clinical literature to facilitate consistent interpretation and modeling. Rhythm pattern features were determined through a movement disorder specialist’s interpretation of EMG recordings. Both the agonist and antagonist muscles were analyzed simultaneously to assess their activation relationship. This feature reflects the rhythmic relationship between muscle groups during tremor episodes and was encoded into five clinically relevant categories, as shown below: 0: Not seen—No tremor or rhythmic activity detected. 1: Irregular jerks—Inconsistent, arrhythmic muscle activations not forming a clear pattern. 2: Alternation—Alternating contractions between muscle groups (e.g., agonist–antagonist), often characteristic of Parkinsonian tremor. 3: Synchronization—Simultaneous activations across muscle groups, indicating a different tremor mechanism. 4: Alternation + Synchronization—A mixed pattern, where both alternating and synchronous bursts are observed within the same tremor episode. This combined approach ensures that the EMG-derived features incorporate both objective signal metrics and clinically validated classifications, supporting reproducibility and interpretability in subsequent analyses.
2.3 Outcome
All participants underwent a levodopa challenge test (LCT) in accordance with published guidelines (Albanese et al., 2001; Feng et al., 2009). The test was conducted in the morning, with withdrawal from anti-Parkinsonian drugs, to ensure a sufficient washout period. The dose for LCT was set gradually until a dose equivalent to 150% of the usual morning levodopa equivalent dose (LED). The drug type and LED were recorded. The MDS-UPDRS III was administered initially in the OFF condition and then repeated after administration. The analysis focused on the highest levodopa responsiveness recorded for each patient. ON and OFF tremor scores were evaluated for each participant, and levodopa tremor responsiveness was calculated as the difference ([OFF–ON]; ΔLR tremor score) and percentage change ([OFF–ON]/OFF*100%; %LR tremor score).
Cluster analysis was performed to establish a classification criterion that categorized PD patients into either levodopa-responsive tremor or levodopa-resistant tremor groups (see Online Resource). Based on the results of the cluster analysis, the following criterion was defined: if a participant had a ΔLR tremor score greater than 3, they were classified as having levodopa-responsive tremor; otherwise, they were classified as having levodopa-resistant tremor.
2.4 Model deployment and validation
2.4.1 Training and independent test split
The dataset was randomly divided into a training set (80%) and an independent test set (20%). The proportion of patients with levodopa-responsive tremor did not differ significantly between the training and test sets. All features were derived from assessments conducted in the OFF-medication state, including all ‘Before LCT’ features listed in Table 1 and EMG features collected prior to levodopa administration. Feature analysis and selection were conducted using the training data. Five-fold cross-validation was employed to tune hyperparameters and assess model performance within the training set. The independent test set was then used to evaluate the final model developed from the training data.
2.4.2 Model building and cross-validation
Two steps were performed for feature selection before model building. Step 1: Pairwise Pearson correlation of the features was calculated to remove features with high correlation (threshold 0.6) with another (only one feature was kept for each pair that has a high correlation with each other), following the High-correlation Features Removal Rule mentioned in the previous study (Lin et al., 2023). Step 2: Feature permutation importance was determined using the RF technique (Breiman, 2001), ranking the features retained from Step 1 based on their importance scores, measured by the mean decrease in Gini (MD Gini). A higher MD Gini value indicates greater feature importance, and features were ranked from highest to lowest accordingly.
All classifiers were evaluated using the same ordered feature list derived from the RF permutation-importance ranking, ensuring comparability across models. Classification models were developed using Random Forest (RF), support vector machine (SVM), and logistic regression (LR) techniques to predict whether a patient would be classified as levodopa-responsive tremor or levodopa-resistant tremor. The models were evaluated using the top K features, starting with the top five features (K = 5). The decision to begin with five features, rather than fewer, was made to prevent potential underperformance associated with using only a very limited set of features (e.g., 1, 2, 3, or 4 features), which may not sufficiently capture the complexity of the dataset. Additional features were then incrementally added one at a time until all features retained from Step 2 were included in the model, to determine the optimal set of features for the best predictive model performance (Liu et al., 2024; Zhu et al., 2024). At each feature configuration (each value of K), hyperparameter tuning was conducted via grid search within the training set during cross-validation. For the SVM with a Radial Basis Function (RBF) kernel, the hyperparameter grid included penalty parameters (C) ranging from 0.2 to 0.6 in steps of 0.05, and sigma values ranging from 0.02 to 0.06 in steps of 0.01. These ranges were selected to balance model flexibility and generalization capacity, ensuring a comprehensive exploration of decision boundary smoothness and margin settings. For the RF model, the parameter grid consisted of ntree values of 200 and 250, node_size values of 2, 3, 4, and 5 (minimum terminal node size), and mtry values corresponding to floor ( ) and ceiling ( ), where p denotes the total number of features considered. These ranges were chosen to explore the balance between model complexity and overfitting risk while maintaining computational efficiency. Each algorithm underwent independent feature selection and hyperparameter tuning.
Performance metrics—accuracy, Cohen’s Kappa, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and receiver operating characteristic area under the curve (AUC_ROC) were calculated for each fold and averaged across five folds. The combination of feature subset and hyperparameters that achieved the highest performance during cross-validation was selected as the optimal configuration for that algorithm. Model performances were then compared across algorithms within cross-validation. The feature, hyperparameter, and model techniques combination corresponding to the model with the highest five-fold cross-validation metrics was selected. This optimal model was then trained on the entire training set using the chosen feature and hyperparameter combination. In our study, we set levodopa-responsive tremor as the positive case, while levodopa-resistant tremor was the negative case. The threshold probability was set as 0.5, which was a cut-off value to define when a patient is positive (the patient was regarded as positive when the model predicted probability for that patient was ≥ ).
2.4.3 Independent test data validation
The independent test set was applied to evaluate the final model that was built on the entire training data. The model performance evaluation metrics mentioned above were also applied to check the predictive model performance with the independent test data.
2.4.4 Calibration plot and decision curve analysis
Calibration plots and decision curve analysis (DCA) were conducted on the best-performing model using both the training and test datasets to evaluate its clinical utility and predictive reliability. Calibration refers to how well the predicted outcomes align with the observed values (Van Calster et al., 2019). Good calibration indicates that the model accurately estimates risks without overestimating or underestimating them. DCA aimed to address the limitations of traditional statistical metrics (e.g., discrimination and calibration) by trying to answer whether the model should be used in clinical practice.
2.4.5 Additional stability and variability analysis
To further assess the robustness and variability of model performance, an additional repeated 5-fold cross-validation procedure (20 repetitions; 100 resamples) was performed. This analysis was conducted only for performance stability evaluation and did not contribute to model selection, feature selection, or parameter tuning, which were performed exclusively within the primary 5-fold cross-validation on the training set. For each resample, Accuracy, Kappa, Sensitivity, Specificity, PPV, NPV, and AUC were computed. The distributions of these metrics were compared across SVM, RF, and LR using paired t-tests and Wilcoxon signed-rank tests to examine whether differences in mean accuracy were statistically significant.
2.5 Statistical analysis
Statistical analyses were conducted using R software version 4.0.2. The statistical significance level was set as 0.05 for all tests. To assess the adequacy of the sample size, a stratified power analysis was performed. The calculation was based on a two-sample t-test with the actual group sizes (95 levodopa-responsive vs. 102 levodopa-resistant tremor), a moderate effect size (Cohen’s d ≈ 0.5), a significance level of α = 0.05, and a two-sided test. The analysis yielded a statistical power of over 80%, confirming that the sample size was sufficient to detect a moderate between-group difference. Statistical tests were selected according to the data type and distribution of each variable. The Shapiro–Wilk test was performed to check if a continuous variable followed a normal distribution. A continuous variable was presented as mean (standard deviation) if it was normally distributed, and an unpaired two-tailed t test was employed for comparison between the levodopa-responsive tremor and levodopa-resistant tremor groups. If a continuous variable was not normally distributed, it was presented as the median [Q1, Q3], where Q1 and Q3 correspond with the 25th percentile and 75th percentile, respectively, and the Wilcoxon rank sum test was employed. Chi-square test was employed to compare categorical variables between levodopa-responsive tremor and levodopa-resistant tremor groups, and categorical variable was presented as numbers (percentages).
Univariate odds ratios (ORs) and their corresponding 95% confidence intervals (CIs) were used to quantify the relationship between our tremor levodopa response outcome associated with each variable. The ORs represented the change in the odds of levodopa-responsive tremor for each one-unit increase in the variable for continuous variables, while they reflected the change in odds of levodopa-responsive tremor relative to the selected reference group for categorical variables.
3 Results
3.1 Participants characteristics
A total of 197 participants were included in this study. The main demographic and clinical characteristics of the participants are presented in Table 3. There was no statistically significant difference in the main demographic and clinical characteristics between the training data (157 participants) and the test data (40 participants). In the training data, 79 participants were classified as having levodopa-responsive tremor, while 78 participants were classified as levodopa-resistant tremor.
3.2 Feature analysis between levodopa-responsive tremor and levodopa-resistant tremor
All features analyzed in this comparison were collected during the OFF-medication state, including the ‘Before LCT’ variables listed in Table 1 and EMG-derived features obtained prior to levodopa administration. In bivariate analysis, 14 features were significantly different (p < 0.05, highlighted in bold) between levodopa-responsive tremor and levodopa-resistant tremor groups (Table 4). For example, the Rigidity/tremor ratio of participants of levodopa-responsive tremor was 0.99 (p < 0.001, Figure 1A) lower than that of the participants of levodopa-resistant tremor on average. The participants of levodopa-responsive tremor had 1.08 years (p = 0.017, Figure 1B) longer disease duration than that of the participants of levodopa-resistant tremor on average. The MDS-UPDRS III (OFF) of participants of levodopa-responsive tremor was 11 (p < 0.001, Figure 1C) higher than that of the participants of levodopa-resistant tremor on average. Univariate ORs analysis was also shown in Table 4. For example, regarding disease duration (Univariate OR = 1.09, 95% CI: 1.01–1.20), the increase in odds of levodopa-responsive tremor occurrence was 9.0% for each unit (year) increase in disease duration without considering other factors. Regarding lower limb resting tremor frequency (MAS), compared to ‘0: Not seen,’ ‘1: Irregular jerks’ had an OR of 1.14 (95% CI: 0.40–3.20), indicating a non-significant increase in the odds of levodopa-responsive tremor occurrence. ‘2: <4’ had an OR of 1.42 (95% CI: 0.50–4.03), which was also not statistically significant. ‘4: >6’ had an OR of 2.85 (95% CI: 0.26–62.79), which was also not statistically significant. However, ‘3: [4, 6]’ had a statistically significant increase in the odds of levodopa-responsive tremor occurrence, with an OR of 3.50 (95% CI: 1.56–8.28), indicating that participants have lower limb resting tremor frequency (MAS) between 4 and 6 Hz have 3.5 times the odd of levodopa-responsive tremor occurrence compared to those whose lower limb resting tremor frequency (MAS) was not seen without considering other factors.
Table 4. Comparison of features between the levodopa-resistant tremor and levodopa-responsive tremor group on training data.
Figure 1. Boxplots of features: (A) Rigidity-to-tremor ratio, (B) Disease duration, and (C) MDS-UPDRS III score in the OFF-medication state. Each boxplot summarizes the data distribution as follows: The horizontal line within each box denotes the median. The top and bottom edges represent the 75th percentile (Q3) and 25th percentile (Q1). Whiskers extend to the maximum (Q3 + 1.5 × IQR) and minimum (Q1 − 1.5 × IQR) values, where IQR is the interquartile range. Outliers, defined as values beyond these whiskers, are shown as individual dots.
3.3 Comparison of model performance
The model performance comparisons based on five-fold cross-validation on training data are shown in Table 5. SVM [ACC: 81.5%, Kappa 0.624, Sen: 84.3%, Spe: 77.9%, PPV: 80.2%, NPV: 84.4%, AUC: 0.850] outperformed the other two models, which shows good discrimination ability. Calibration plot shows good alignment between observed proportion of levodopa-responsive tremor cases and predicted probability of levodopa-responsive tremor cases obtained from the best model (SVM) on training data as the points were around the 45-degree line, which indicates a good calibration ability (Figure 2A). Decision curve shows the decision curve of the best model (SVM) on training data (Figure 2B). We can see that, except for a small range of high threshold probability, intervening on patients based on the best prediction model (SVM) of our study leads to higher net benefit than intervention for all or intervention for none. Therefore, we chose this model as the final model in our study. To further evaluate model robustness and determine whether the observed performance differences among the three classifiers were statistically meaningful, we conducted an additional repeated 5-fold cross-validation (20 repetitions; 100 resamples). Across the resamples, the mean accuracies were comparable among SVM (0.785 ± 0.069; 95% CI: 0.650–0.923), RF (0.770 ± 0.062; 95% CI: 0.646–0.875), and LR (0.784 ± 0.059; 95% CI: 0.677–0.875). Paired t-tests and Wilcoxon signed-rank tests revealed no statistically significant differences among the three models (all p > 0.05). Despite the lack of statistically significant differences, the SVM consistently showed the highest average accuracy across resamples and required fewer features compared with RF and LR. These findings further support the choice of SVM as the final predictive model, in line with the primary cross-validation results.
Figure 2. Calibration plot and decision curve of the final model. (A) Calibration plot on train data, (B) decision curve on train data, (C) calibration plot on test data, (D) decision curve on test data.
3.4 Independent test data validation
The final prediction model (SVM) was validated on the independent test set. The model performance on the independent test data [ACC: 82.5%, Kappa 0.653, Sen:93.8%, Spe: 75%, PPV: 71.4%, NPV: 94.7%, AUC: 0.896 (95% CI: 0.799, 0.992)] was as good as the five-fold cross validation model performance on training data, which indicated a good reproducibility of our model. In our study, this final model was able to correctly classify 82.5% patients as levodopa-responsive tremor and levodopa-resistant tremor, correctly classify 93.8% patients as levodopa-responsive tremor out of all levodopa-responsive tremor, and correctly classify 75% patients as levodopa-resistant tremor out of all levodopa-resistant tremor. The probability that a patient was predicted as levodopa-responsive tremor actually was 71.4%, while the probability that a participant predicted as levodopa-resistant tremor truly was levodopa-resistant tremor was 94.7%. Calibration plot demonstrates good alignment between observed proportion of levodopa-responsive tremor cases and predicted probability of levodopa-responsive tremor cases obtained from this final model on the test data, as indicated by the points closely aligning with the 45-degree line, reflecting good calibration (Figure 2C). The decision curve presents for this final model on the test data (Figure 2D). It shows that, except for a small range of high threshold probabilities, making intervention decisions based on the final prediction model yields a higher net benefit compared to intervening on all patients or none, which concludes that using this prediction model to determine whether patients should receive intervention would lead to good clinical outcome.
3.5 Predictors for optimal predictive model
The MD Gini for the predictors included in our optimal predictive model (SVM) were presented in Figure 3. A total of 6 predictors were incorporated into the final model with hyperparameters C as 0.55 and sigma as 0.06. Resting tremor score demonstrated the highest MD Gini, indicating it as the most important predictor in the construction of our predictive model. The remaining 5 predictors included rigidity/tremor ratio, postural and kinetic tremors score, disease duration, MDS-UPDRS III/disease duration, and supine DBP.
4 Discussion
Our study is the first to predict the responsiveness of PD tremor to levodopa treatment using machine learning models. Among the models evaluated, the SVM achieved the highest predictive performance and successfully passed validation, followed by RF and LR. The final SVM model incorporated 6 key predictors and demonstrated robust performance, offering a potential foundation for personalized treatment of tremor in PD patients. The selected predictors included: resting tremor score, rigidity/tremor ratio, postural and kinetic tremors score, disease duration, MDS-UPDRS III/disease duration, and supine DBP. The SVM model was trained with optimized hyperparameters, with the regularization parameter C set to 0.55 and sigma (kernel width) set to 0.06. Although the independent test accuracy of our final SVM model was 0.825, this level of performance is consistent with previous machine-learning studies addressing similar clinical prediction tasks in PD. For example, a machine-learning model predicting acute orthostatic hypotension after levodopa administration achieved a leave-one-out cross-validation (LOOCV) accuracy of 73.6% and an independent test accuracy of 72% (Liu et al., 2024). Likewise, wearable sensor–based gait-classification models distinguishing early PD from essential tremor reported LOOCV accuracy of 84% and independent test accuracy of 75.8% (Lin et al., 2023). These findings indicate that predictive accuracies around 0.75–0.80 are typical and considered acceptable for PD-related machine-learning models that aim to support rather than replace clinical decision-making. Therefore, the performance achieved by our model (test accuracy 0.825) falls within the expected and clinically meaningful range.
Recently, machine learning techniques have been increasingly utilized to improve the prediction of complex and hard-to-predict outcomes. SVM, RF, and LR were utilized to predict the responsiveness of PD tremor in our study. Our results show that SVM and RF performed better than the traditional machine learning method, LR. Traditional machine learning models were typically suitable for linearly separable data, under the assumption that the features exhibit linear relationships, independence, and specific distribution characteristics, which may not hold in real-world datasets (Peng et al., 2002). However, real-world medical data often present with high non-linearity and complex feature interactions, which limit the expressive power of LR in such settings. Compared with LR, other machine learning algorithms, such as SVM and RF, possess greater flexibility and modeling capacity, which do not rely on predefined functional forms and are capable of capturing complex non-linear relationships and feature interactions (Deo, 2015). Therefore, machine learning offers a powerful and flexible framework for developing predictive models, especially when dealing with complex, noisy, or high-dimensional data that may violate traditional modeling assumptions.
Although neural-network–based models have gained increasing attention in biomedical classification tasks (Huang et al., 2021; Martinez-Murcia et al., 2021), their application typically requires substantially larger datasets and offers limited interpretability, which may hinder their clinical usefulness. Given the modest sample size of the present cohort and the importance of transparent predictive features, classical machine-learning algorithms were prioritized. To further examine whether a more complex non-linear model could provide additional benefit, we conducted an exploratory analysis using a regularized multilayer perceptron neural-network classifier, applying the same feature-selection strategy as in the main analysis. The neural-network model achieved cross-validated performance [ACC: 73.8%, Kappa 0.472, Sen: 83.4%, Spe: 63.9%, PPV: 71.5%, NPV: 80.7%, AUC: 0.777] that was comparable to, but not superior to, that of the SVM, RF, and LR models. These findings suggest that deep-learning approaches did not offer measurable advantages under the current study conditions and support the appropriateness of selecting classical machine-learning algorithms for the final model development.
In this study, the SVM model was selected as our final model as it outperformed both the RF and LR models. SVM constructs a maximum-margin hyperplane and leverages kernel functions (such as the Radial Basis Function, RBF) to map low-dimensional non-linear problems into high-dimensional space, thereby enabling efficient classification of complex data structures (Noble, 2006). This characteristic makes SVM particularly suitable for our dataset, which is characterized by non-linear and partially overlapping clinical and EMG feature distributions within a moderate sample size. In our study, the SVM model was configured with two key hyperparameters: C = 0.55 and sigma = 0.06. The parameter C is the regularization parameter in SVM, which controls the trade-off between maximizing the margin (generalization) and minimizing classification errors (especially for noisy or overlapping data). A smaller C (e.g., 0.55) allows the model to tolerate some degree of misclassification, thus helping to avoid overfitting and improve generalization. The parameter sigma (the bandwidth of the RBF kernel) defines the spread of the kernel, influencing how far the influence of a single training example reaches. A sigma = 0.06 means the kernel is narrow, making the model highly sensitive to nearby points. This can capture complex, non-linear patterns but risks overfitting if the data is noisy. The combined impact of C = 0.55 and sigma = 0.06 corresponded to a conservative model: The small C suggests a preference for simplicity (larger margin), while the small sigma allows flexibility in non-linear decisions. In conclusion, SVM offers theoretical advantages in boundary control and non-linear modeling over LR and RF. With appropriately tuned hyperparameters, it further improves its performance in predicting clinical tremor responsiveness.
In addition to identifying the optimal model, we further examined whether the six predictors selected by the SVM reflect a generalizable subset of informative features across different classifiers. All three models (SVM, RF, and LR) were trained using feature subsets derived from the same Random Forest permutation-importance ranking, ensuring full comparability. Although the optimal number of predictors differed slightly among models (SVM: K = 6; RF and LR: K = 8), the selected feature sets showed substantial overlap, indicating that the most influential predictors were largely shared across classifiers rather than being specific to a single method. To evaluate how sensitive model performance was to the number of predictors, we visualized the cross-validated accuracy of each classifier across different values of K using a new summary Figure 4. These curves were generated directly from the results of our original modeling pipeline, in which each classifier had already been trained and fully hyperparameter-tuned at every value of K (as described in the Methods section). For each K, the plotted accuracy represents the best-performing hyperparameter configuration—that is, the highest cross-validated accuracy obtained among all tested hyperparameter combinations. As shown in Figure 4, SVM achieved its peak accuracy at K = 6 and remained stable across K = 5–10; RF performed best at K = 8 with stable performance across K = 7–10; and LR also peaked at K = 8, followed by a gradual decline thereafter. These smoothly varying performance patterns indicate that none of the classifiers are overly sensitive to small changes in the number of predictors.
Figure 4. Cross-validated accuracy of SVM, RF, and LR across different numbers of selected features (K).
Having established the performance and configuration of the SVM model, we then explored the clinical relevance of the selected predictors. In this study, the key discriminative indices of tremor responsiveness to levodopa based on SVM were discussed while considering demographic factors, clinical data, evaluations of motor symptoms, and evaluations of non-motor symptoms. The prominent predictors of our final prediction model included the resting tremor score, postural and kinetic tremor score, and disease duration. They were also the most important features for discriminating levodopa-responsive tremor from the levodopa-resistant tremor group based on univariate analysis. We found that the levodopa-responsive tremor group had higher tremor severity and longer disease duration, which were similar to previous studies. Imaging studies demonstrate that resting tremor is usually associated with a dopaminergic deficit (Brooks et al., 1992). However, tremor severity does not correlate with striatum dopamine depletion in other studies (Louis et al., 2001; Pirker, 2003). This suggests that tremor in PD can be understood merely as an expression of dopaminergic denervation of the basal ganglia, but that other neurotransmitter systems and brain areas are involved (Dirkx and Bologna, 2022). Thus, tremors in PD may respond differently to dopaminergic replacement therapy. Our results further supported the conclusion that a significant response to levodopa was still observed in patients with more pronounced resting tremor as well as postural and kinetic tremor, and those with longer disease duration (Swinnen et al., 2025). However, given that the mean disease duration of the patients in this study was 5.98 years and the mean tremor score was 7 points, it remains to be investigated whether PD patients with a longer disease duration and a more severe tremor score than the average in our study have a different levodopa response. In our study, we found that tremor score and disease duration had a significant correlation in the responsiveness of PD tremor to levodopa. Therefore, the resting tremor score, postural and kinetic tremor score, and disease duration were found to be the key predictors rationally.
Although tremors were more pronounced in the levodopa-responsive tremor group, their progression patterns were discordant. Our study also found a faster rate of tremor progression in the levodopa-responsive tremor group, with no difference in the progression of the other motor symptoms and overall rate of motor progression between the two groups. The levodopa-resistant tremor group had a higher rigidity/tremor ratio than the levodopa-responsive tremor group through multivariate analysis. Tremors progressed at their own pace, and tremor severity did not correlate with the severity of bradykinesia or rigidity (Louis et al., 2001). This reflects that there may be a unique mechanism for tremor compared to other PD motor symptoms, such as rigidity. Except for multifactorial interactions, the rigidity/tremor ratio and MDS-UPDRS III/disease duration ratio were the contributions and valid predictors for our model construction.
In this study, supine DBP before medication was retained for our SVM model construction, providing new insights into the potential involvement of autonomic dysfunction in PD tremor. Comparison between the levodopa-resistant tremor and levodopa-responsive tremor groups revealed a statistical difference in ΔSBP; whereas standing SBP, supine SBP, standing DBP, supine DBP, ΔDBP, and OH show no significant differences. Although supine DBP did not differ significantly between groups, its retention in the model suggests potential relevance when combined with other predictors. This finding should nevertheless be interpreted with caution, as supine DBP was one of six potential predictors in our final model and was not consistently significant in univariate analyses. This suggests that its contribution should therefore be interpreted in conjunction with the other predictors in the model. Previous studies have shown that levodopa is well known for its hypotensive effects, which can lead to OH in PD patients through both peripheral and central mechanisms (Kujawa et al., 2000; Palma et al., 2015; Liu et al., 2024). Recent evidence suggested that PD patients who had a better response to levodopa were likely to have a greater impact of the hypotensive effect of levodopa. Additionally, PD is a primary cause of autonomic failure, with OH often occurring even in the prodromal stage and in drug-naive patients (Postuma et al., 2013; Dommershuijsen et al., 2021). Given these observations, the presence of supine DBP in our final model may reflect an underlying autonomic component influencing the dopaminergic responsiveness of tremor. This finding highlights the importance of incorporating autonomic parameters into future predictive and mechanistic studies on PD tremor.
By comparing groups, we found that EMG features may reflect differences in tremor subtypes in PD, with a higher proportion of irregular jerks in the levodopa-resistant tremor group and more 4–6 Hz classical tremor in the levodopa-responsive group. One possible explanation for these findings is that voluntary movement increases neural excitability within the cerebello-thalamo-cortical motor circuit, resulting in faster (higher frequency) synaptic transmission and making it less dependent on dopaminergic influences from the basal ganglia (reduced dopamine response) (Dirkx et al., 2016). Although these findings suggest physiological heterogeneity between subtypes, none of the EMG-derived features were retained in the final SVM model. This indicates that their discriminative information likely overlapped with other clinical predictors, and their incremental contribution to the model’s performance was limited. Therefore, EMG characteristics may not provide additional predictive value beyond the core clinical features, which serve as more stable and clinically practical indicators.
Furthermore, our findings suggest that relatively simple and routinely available clinical features, such as resting tremor score, rigidity/tremor ratio, disease duration, and supine DBP, provided stronger predictive performance than complex EMG-derived features. This highlights the potential for developing clinically practical models that rely primarily on accessible clinical assessments rather than specialized electrophysiological data. The development of these pragmatic tools is therefore crucial for broadening access to precision medicine. This includes access to clinical decision workflows that help personalize therapeutic strategies, particularly in resource-limited environments.
Our findings indicate that several clinical features assessed during the OFF-medication state, including resting tremor score, rigidity/tremor ratio, postural and kinetic tremors score, disease duration, MDS-UPDRS III/disease duration, supine DBP, demonstrated significant discriminative power in predicting tremor responsiveness to the levodopa challenge. These results underscore the potential of these OFF-state markers as accessible, non-invasive predictors for stratifying patients based on their likelihood of benefiting from dopaminergic therapy. Emerging evidence suggests that the distinct disease trajectories, often categorized as tremor-dominant (TD) and postural instability/gait difficulty (PIGD) subtypes. These clinical findings may have their underpinnings in the recently proposed body-first and brain-first hypotheses of disease propagation (Dirkx et al., 2018). It has also been shown that data-driven subtyping, rather than clinical or pathological subtyping, provides a promising strategy to inform disease modification trials in PD (Negida et al., 2025). To operationalize these predictors, we developed an SVM classification model, which showed robust performance in distinguishing between levodopa-responsive and levodopa-resistant tremor subtypes based on the selected OFF-state features. Integrating such a machine learning model into clinical workflows could enhance decision-making by providing objective, data-driven support for individualizing treatment strategies, particularly in cases where tremor responsiveness remains uncertain. Further research is warranted to validate both the predictors and the model in larger, prospective cohorts and to explore their implementation in practical clinical tools such as clinical decision support systems or digital health platforms.
This study has several limitations. First, although clinical and demographic characteristics were thoroughly analyzed, the EMG characteristics were excluded from the predictive model as their contribution was not statistically significant. Incorporation of machine learning models based on more comprehensive EMG characteristics, MRI, and other biomarkers could potentially improve classification accuracy. Future studies with multidimensional data are needed. Second, machine learning has the advantage of leveraging training set resources to their fullest through intelligent algorithms, ensuring the efficiency of the resulting model. However, a major challenge is the potential for overfitting. Therefore, it is essential to construct an efficient and robust machine learning model using multi-center, multi-dimensional, and large datasets in the future. Third, there is no consistent definition of levodopa responsiveness for tremor in PD across various studies currently. Some studies have classified patients based on the median tremor change rate score (Wu et al., 2022; Zhang et al., 2022). Levodopa tremor responsiveness was calculated as the difference in tremor score in another study (Sung et al., 2008). Using cluster analysis, we optimized the cutoff value by maximizing the AUC to ensure the best classification performance. Fourth, both the training and test sets were derived from the same population, and additional validation in new patients from a different but related population is essential. Fifth, although the dataset in this study was relatively balanced (95 levodopa-responsive tremor vs. 102 levodopa-resistant tremor patients), minor class imbalance could still have a subtle effect on model learning. Because both the training and test sets maintained comparable proportions of each class, resampling or class-weighting methods were not applied. Future studies with larger and more heterogeneous datasets should further validate model stability under different class distributions. Sixth, certain subgroups had relatively small sample counts, such as the lower-limb resting tremor frequency categories (e.g., “>6 Hz”). This limited the stability and precision of the corresponding odds ratio estimates, as reflected by wide confidence intervals (e.g., OR = 2.85, 95% CI: 0.26–62.79). These sparse data increase the uncertainty of statistical inference and raise potential risks of both Type I (false-positive) and Type II (false-negative) errors. Therefore, these findings should be interpreted with caution, and validation in larger samples is warranted. Seventh, we did not investigate the subtypes of PD patients or tremor types based on levodopa response. Future studies with larger sample sizes are needed to investigate the prognosis of different subtypes. Finally, the study did not evaluate the long-term levodopa responsiveness of tremor in PD patients, suggesting the need for further longitudinal research. Despite these limitations, our study offers novel insights and a practical framework for integrating predictive modeling into tremor management in Parkinson’s disease.
5 Conclusion
In conclusion, the construction of machine learning models from different perspectives based on demographic and clinical indicators has high predictive efficiency and clinical net benefit for levodopa response of tremor in PD. The features used to construct our final predictive model are expected to serve as potential predictive markers for the responsiveness of PD tremor to levodopa, further deepening our understanding of its pathogenesis. Our SVM model may become an effective and clinically applicable new method to provide a basis for the precise treatment of tremor in PD patients.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors without undue reservation.
Ethics statement
The studies involving humans were approved by Institutional Review Board of Beijing Tiantan Hospital, Capital Medical University (KY2024-037-02). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
FL: Conceptualization, Data curation, Investigation, Methodology, Validation, Writing – original draft. SL: Conceptualization, Formal analysis, Methodology, Validation, Visualization, Writing– original draft. RY: Data curation, Investigation, Resources, Writing– review & editing. YC: Data curation, Investigation, Resources, Writing – review & editing. KR: Writing– review & editing. ZC: Project administration, Supervision, Writing– review & editing. LM: Project administration, Supervision, Writing– review & editing. TF: Funding acquisition, Project administration, Supervision, Writing– review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This research was supported by Natural Science Foundation of China (No. 82271459) and Beijing Natural Science Foundation (No. 7232048).
Acknowledgments
We thank all the patients who participated in this study.
Conflict of interest
SL, KR, and ZC were employed by GYENNO Science Co., Ltd.
The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2025.1690155/full#supplementary-material
References
Albanese, A., Bonuccelli, U., Brefel, C., Chaudhuri, K. R., Colosimo, C., Eichhorn, T., et al. (2001). Consensus statement on the role of acute dopaminergic challenge in Parkinson's disease. Mov. Disord. 16, 197–201. doi: 10.1002/mds.1069,
Brooks, D. J., Playford, E. D., Ibanez, V., Sawle, G. V., Thompson, P. D., Findley, L. J., et al. (1992). Isolated tremor and disruption of the nigrostriatal dopaminergic system: an 18F-dopa PET study. Neurology 42:1554. doi: 10.1212/wnl.42.8.1554,
Deo, R. C. (2015). Machine learning in medicine. Circulation 132, 1920–1930. doi: 10.1161/CIRCULATIONAHA.115.001593
Dirkx, M. F., and Bologna, M. (2022). The pathophysiology of Parkinson's disease tremor. J. Neurol. Sci. 435:120196. doi: 10.1016/j.jns.2022.120196,
Dirkx, M. F., den Ouden, H., Aarts, E., Timmer, M., Bloem, B. R., Toni, I., et al. (2016). The cerebral network of Parkinson's tremor: an effective connectivity fMRI study. J. Neurosci. 36, 5362–5372. doi: 10.1523/jneurosci.3634-15.2016,
Dirkx, M. F., Zach, H., Bloem, B. R., Hallett, M., and Helmich, R. C. (2018). The nature of postural tremor in Parkinson disease. Neurology 90, e1095–e1103. doi: 10.1212/wnl.0000000000005215,
Dirkx, M. F., Zach, H., van Nuland, A., Bloem, B. R., Toni, I., and Helmich, R. C. (2019). Cerebral differences between dopamine-resistant and dopamine-responsive Parkinson's tremor. Brain 142, 3144–3157. doi: 10.1093/brain/awz261,
Dommershuijsen, L. J., Heshmatollah, A., Mattace Raso, F. U. S., Koudstaal, P. J., Ikram, M. A., and Ikram, M. K. (2021). Orthostatic hypotension: a prodromal marker of Parkinson's disease? Mov. Disord. 36, 164–170. doi: 10.1002/mds.28303,
Feng, T., Li, W., Lu, L., Wang, Y., Shi, W., Zhang, J., et al. (2009). Acute stepwise challenge test with levodopa in treated patients with Parkinsonism. Parkinsonism Relat. Disord. 15, 354–358. doi: 10.1016/j.parkreldis.2008.08.010,
Fishman, P. S. (2008). Paradoxical aspects of parkinsonian tremor. Mov. Disord. 23, 168–173. doi: 10.1002/mds.21736,
Freeman, R., Wieling, W., Axelrod, F. B., Benditt, D. G., Benarroch, E., Biaggioni, I., et al. (2011). Consensus statement on the definition of orthostatic hypotension, neurally mediated syncope and the postural tachycardia syndrome. Clin. Auton. Res. 21, 69–72. doi: 10.1007/s10286-011-0119-5,
Goetz, C. G., Tilley, B. C., Shaftman, S. R., Stebbins, G. T., Fahn, S., Martinez-Martin, P., et al. (2008). Movement Disorder Society-sponsored revision of the Unified Parkinson's Disease Rating Scale (MDS-UPDRS): scale presentation and clinimetric testing results. Mov. Disord. 23, 2129–2170. doi: 10.1002/mds.22340,
Helmich, R. C., Hallett, M., Deuschl, G., Toni, I., and Bloem, B. R. (2012). Cerebral causes and consequences of parkinsonian resting tremor: a tale of two circuits? Brain 135, 3206–3226. doi: 10.1093/brain/aws023,
Huang, Z., Sun, M., and Guo, C. (2021). Automatic diagnosis of Alzheimer’s disease and mild cognitive impairment based on CNN+ SVM networks with end-to-end training. Comput. Intell. Neurosci. 2021:9121770. doi: 10.1155/2021/9121770,
Kujawa, K., Leurgans, S., Raman, R., Blasucci, L., and Goetz, C. G. (2000). Acute orthostatic hypotension when starting dopamine agonists in Parkinson's disease. Arch. Neurol. 57, 1461–1463. doi: 10.1001/archneur.57.10.1461,
Lin, S., Gao, C., Li, H., Huang, P., Ling, Y., Chen, Z., et al. (2023). Wearable sensor-based gait analysis to discriminate early Parkinson’s disease from essential tremor. J. Neurol. 270, 2283–2301. doi: 10.1007/s00415-023-11577-6,
Liu, Z., Lin, S., Zhou, J., Wang, X., Wang, Z., Yang, Y., et al. (2024). Machine-learning model for the prediction of acute orthostatic hypotension after levodopa administration. CNS Neurosci. Ther. 30:e14575. doi: 10.1111/cns.14575,
Louis, E. D., Levy, G., Côte, L. J., Mejia, H., Fahn, S., and Marder, K. (2001). Clinical correlates of action tremor in Parkinson disease. Arch. Neurol. 58, 1630–1634. doi: 10.1001/archneur.58.10.1630,
Martinez-Murcia, F. J., Ortiz, A., Ramírez, J., Górriz, J. M., and Cruz, R. (2021). Deep residual transfer learning for automatic diagnosis and grading of diabetic retinopathy. Neurocomputing 452, 424–434. doi: 10.1016/j.neucom.2020.04.148
Negida, A., Mukhopadhyay, N., Berman, B. D., and Barrett, M. J. (2025). Comparative analysis of progression milestones across Parkinson’s disease clinical, pathological, and data-driven subtypes: a 10-year follow-up. Res. Sq. 23:rs.3.rs-6574563. doi: 10.21203/rs.3.rs-6574563/v1
Noble, W. S. (2006). What is a support vector machine? Nat. Biotechnol. 24, 1565–1567. doi: 10.1038/nbt1206-1565
Palma, J. A., Gomez-Esteban, J. C., Norcliffe-Kaufmann, L., Martinez, J., Tijero, B., Berganzo, K., et al. (2015). Orthostatic hypotension in Parkinson disease: how much you fall or how low you go? Mov. Disord. 30, 639–645. doi: 10.1002/mds.26079,
Peng, C.-Y. J., Lee, K. L., and Ingersoll, G. M. (2002). An introduction to logistic regression analysis and reporting. J. Educ. Res. 96, 3–14. doi: 10.1080/00220670209598786
Pieterman, M., Adams, S., and Jog, M. (2018). Method of levodopa response calculation determines strength of association with clinical factors in Parkinson disease. Front. Neurol. 9:260. doi: 10.3389/fneur.2018.00260,
Pirker, W. (2003). Correlation of dopamine transporter imaging with parkinsonian motor handicap: how close is it? Mov. Disord. 18, S43–S51. doi: 10.1002/mds.10579,
Pirker, W., Katzenschlager, R., Hallett, M., and Poewe, W. (2023). Pharmacological treatment of tremor in Parkinson's disease revisited. J. Parkinsons Dis. 13, 127–144. doi: 10.3233/jpd-225060,
Postuma, R. B., Berg, D., Stern, M., Poewe, W., Olanow, C. W., Oertel, W., et al. (2015). MDS clinical diagnostic criteria for Parkinson's disease. Mov. Disord. 30, 1591–1601. doi: 10.1002/mds.26424,
Postuma, R. B., Gagnon, J. F., Pelletier, A., and Montplaisir, J. (2013). Prodromal autonomic symptoms and signs in Parkinson's disease and dementia with Lewy bodies. Mov. Disord. 28, 597–604. doi: 10.1002/mds.25445,
Solida, A., Ghika, J., and Vingerhoets, F. (2002). Acute dopaminergic challenge tests to assess postural/kinetic tremor of different origin: a case report. J. Neurol. Neurosurg. Psychiatry 73, 206–207. doi: 10.1136/jnnp.73.2.206,
Sung, Y. H., Chung, S. J., Kim, S. R., and Lee, M. C. (2008). Factors predicting response to dopaminergic treatment for resting tremor of Parkinson's disease. Mov. Disord. 23, 137–140. doi: 10.1002/mds.21793,
Swinnen, B., Frequin, H. L., Wiggerts, Y., Espay, A. J., Beudel, M., and de Bie, R. M. A. (2025). Tremor is highly responsive to levodopa in advanced Parkinson's disease. Mov. Disord. Clin. Pract. 12, 76–81. doi: 10.1002/mdc3.14262,
Van Calster, B., McLernon, D. J., van Smeden, M., Wynants, L., and Steyerberg, E. W. (2019). Calibration: the Achilles heel of predictive analytics. BMC Med. 17:230. doi: 10.1186/s12916-019-1466-7,
Wu, J., Zhou, C., Guo, T., Guan, X., Gao, T., Bai, X., et al. (2022). Cholinergic relevant functional reactivity is associated with dopamine responsiveness of tremor in Parkinson's disease. Brain Imaging Behav. 16, 1234–1245. doi: 10.1007/s11682-021-00610-9,
Zach, H., Dirkx, M. F., Roth, D., Pasman, J. W., Bloem, B. R., and Helmich, R. C. (2020). Dopamine-responsive and dopamine-resistant resting tremor in Parkinson disease. Neurology 95, e1461–e1470. doi: 10.1212/wnl.0000000000010316,
Zhang, Y., He, X., Mo, C., Liu, X., Li, J., Yan, Z., et al. (2022). Association between microbial tyrosine decarboxylase gene and Levodopa responsiveness in patients with Parkinson disease. Neurology 99, e2443–e2453. doi: 10.1212/wnl.0000000000201204,
Keywords: Parkinson’s disease, machine learning, levodopa responsiveness, tremor, support vector machine, validation
Citation: Li F, Lin S, Yan R, Cui Y, Ren K, Chen Z, Ma L and Feng T (2026) Machine learning models for the prediction of levodopa response to tremor in Parkinson’s disease. Front. Aging Neurosci. 17:1690155. doi: 10.3389/fnagi.2025.1690155
Edited by:
Dai Yanagihara, The University of Tokyo, JapanReviewed by:
Tetsuro Funato, The University of Electro-Communications, JapanFrancesco Asci, Azienda Ospedaliera San Giovanni Addolorata, Italy
Copyright © 2026 Li, Lin, Yan, Cui, Ren, Chen, Ma and Feng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhonglue Chen, Y2hlbnpob25nbHVlQGd5ZW5uby5jb20=; Lingyan Ma, amVubmlmZXJfbWx5QDE2My5jb20=; Tao Feng, Ynhia3lqc0BzaW5hLmNvbQ==
†These authors have contributed equally to this work and share first authorship