AUTHOR=Yu Peil , Zhang Xinxin , Sun Guoxuan , Zeng Ping , Zheng Chu , Wang Ke TITLE=Sarcopenia prediction model based on machine learning and SHAP values for community-based older adults with cardiovascular disease in China JOURNAL=Frontiers in Public Health VOLUME=Volume 13 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2025.1527304 DOI=10.3389/fpubh.2025.1527304 ISSN=2296-2565 ABSTRACT=BackgroundSarcopenia (SP), is recognized as a complication of cardiovascular disease (CVD), but few relevant diagnostic models have been developed. This study aims to establish an interpretable diagnostic model for the occurrence of SP in older adult CVD patients living in Chinese community-dwelling (CD).MethodsWe randomly selected participants with CVD recruited from CHARLS from 2011 to 2015 and divided them into a training set and a test set. In the training set, we processed and screened the predictor variables and addressed the data imbalance by the synthetic minority oversampling technique (SMOTE). Subsequently, we built four machine learning (ML) models to predict SP. After 100 iterations, we selected the best performing model for risk stratification by comparing model discrimination and calibration. Then, we analyzed the relationship between ML risk and SP using scatterplots and logistic regression (LR). Finally, the Shapley’s Additive Explanatory Plot (SHAP) illustrates how each feature level affects the predicted probability of SP.ResultsWe ultimately included 1,088 CD older adults, 18.61% of whom reported SP. The optimal model, XGBoost, was selected for prediction and risk stratification. After both univariate (odds ratio [OR]: 12.45, p = 4.74 × 10−10) and multivariate analyses (OR: 6.98, p = 3.96 × 10−10), participants with higher ML scores had a higher risk of SP. In sex-specific subanalyses, BMI, height, age, DBP, HDL, etc. were all significant predictors.ConclusionThis study develops a novel clinically-integrated tool that can be used to easily predict SP in the older adults population with CVD, providing a basis for the development of personalized therapeutic measures.