- 1Department of Rehabilitation, Shenzhen Second People’s Hospital, First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong, China
- 2Department of Nursing, Shenzhen Dapeng New District Nan’ao People’s Hospital, Shenzhen, Guangdong, China
Background: The Barthel Index (BI) is a standard, widely used measure of dependence in activities of daily living (ADL), particularly in stroke care. The Longshi Scale (LS) offers a simpler, more user-friendly alternative; however, it lacks a validated, age-stratified mapping to BI scores. This gap limits consistent outcome interpretation and application.
Objective: This study aims to establish and validate a standardized, age-stratified concordance between BI scores and LS grades, thereby providing a practical conversion tool for clinical and research settings.
Method: In a multi-center study of 16,412 stroke inpatients (3 months post-stroke), BI scores and LS grades were analyzed across age groups: <60 years (n = 12,662), 60–79 years (n = 2,596), and ≥80 years (n = 1,154). Sensitivity (correct identification) and specificity (correct exclusion), along with receiver operating characteristic (ROC) curves were used to determine optimal BI cutoff points for each LS grade. Spearman correlation and the Kruskal–Wallis test were applied across age groups.
Results: Key BI cutoffs were identified for LS grades: ≥75 for LS ≥ 2, ≥45 for LS ≥ 4, and <5 for LS = 6. These cutoff values were consistent across age groups. The BI scores were negatively correlated with LS disability level (e.g., r = −0.879 in patients aged ≥80 years, p < 0.001). Correlations remained strongest at severe disability levels (LS grades 5–6) across age groups (r = −0.60 to −0.65). AUC analysis demonstrated excellent discriminative ability, particularly for the mildest (LS 1) and most severe (LS 6) disability levels (AUC >0.95).
Conclusion: The study provides age-stratified BI cutoff values to guide resource allocation, emphasizing the need to prioritize care for individuals aged ≥80 years with BI scores below 5.
Introduction
Stroke is a leading global cause of long-term disability, affecting approximately 15 million people annually, with one-third (5 million) left permanently disabled, which results in dependency (GBD 2021 Stroke Risk Factor Collaborators, 2024; Zandvliet et al., 2019; Hotter et al., 2018; Shin et al., 2022). Assessment of functional independence is critical for measuring post-stroke outcomes (Littooij et al., 2022). The Barthel Index (BI), widely adopted globally, evaluates rehabilitation across diverse populations but faces limitations in its classification, which inadequately addresses age-related functional heterogeneity in complex clinical scenarios (Zhang et al., 2021). Alternatively, the Longshi Scale (LS), a clinically based tool ranging from complete independence (LS grade 1) to complete dependence (LS grade 6), provides granular disability assessment, particularly for sensory/communication domains, although it remains underutilized outside China, particularly in the long-term care of disabled patients (Chamberlain et al., 2016; Wang et al., 2020). Collectively, these metrics constitute critical indicators for assessing post-stroke dependence in activities of daily living (ADL). BI provides a fine-grained score useful for detailed tracking and research, while the LS offers a rapid, categorical assessment advantageous in busy clinical settings or for triage. Although both BI and LS assess post-stroke disability, discrepancies in their scoring frameworks cause interpretive confusion (Liu et al., 2022). Crucially, aging populations exhibit distinct functional trajectories, yet the interplay between age, BI scores, and LS grades remains unquantified.
At present, translated BI versions in multiple translations provide a reliable measure of basic ADL for evaluating the effectiveness of rehabilitation. Aging populations demand increasingly precise long-term care, yet BI’s broad categories may obscure age-specific functional declines (e.g., slower recovery in elderly patients). In comparison, the LS is a clinically based evaluation tool for disability that comprises six grades (Lübke et al., 2004; Pan et al., 2021). LS’s contextual sensitivity might better capture age-dependent care needs (e.g., domestic vs. bedridden states in older adults), particularly in emotional communication such as visual, auditory, and speech functions. Although it has been used frequently in large-scale outcome studies, it remains underutilized outside China (Zhang et al., 2024). Current studies lack consensus on the correspondence between BI–LS and age stratification, impeding personalized rehabilitation and cross-study comparisons (Liu et al., 2022; Zhou et al., 2021; Mueller-Schotte et al., 2020). This gap is critical given global demographic shifts toward older stroke populations.
Clinically defining LS grades and BI scores in stroke patients is an important issue not only for evaluating individual patient outcomes but also for evaluating the classification of disability levels among people of different age groups (Gao et al., 2021). It has been argued that LS can be utilized in conjunction with the BI for a comprehensive assessment to reflect the degree of disabilities comprehensively (Zhou et al., 2023). Disability measured by the LS and ADL measured by the BI often reveal discrepancies in explaining the functional outcomes of stroke survivors (Liu et al., 2023). Thus, this study aims to establish optimal age-stratified BI cutoffs for LS grades and support the design of clinical trials that utilize both measures. Understanding the BI–LS relationship across LS grades was intended to differentiate clinically distinct disability grades, thereby revealing the correspondence between the LS grades and BI scores and enabling precise translation between scales for tailored, age-informed clinical decision-making in stroke survivors.
Materials and methods
Study design and participants
This prospective multi-center project investigates residual disabilities, activity limitations, and the degree of disability in patients after stroke. Patients were identified and assessed during the post-acute, stable phase after stroke onset. All eligible patients reaching this assessment window were recruited between August 2022 and May 2023. Informed consent was obtained online from all participants prior to enrollment. If a patient was unable to provide informed consent due to incapacity, consent was obtained from their legally authorized representative.
Inclusion and exclusion criteria
Inclusion criteria
The inclusion criteria are as follows:
• Aged ≥18 years
• Diagnosed with ischemic or hemorrhagic stroke according to WHO criteria
• Stroke onset approximately 3 months (i.e., 12–16 weeks) prior to the assessment
• Voluntarily provided signed informed consent.
Exclusion criteria
The exclusion criteria are as follows:
• Comorbidities affecting functional status (e.g., fractures or amputations)
• Occurrence of serious adverse events, poor compliance, or inability to complete assessments during the study
• Critical illness precluding participation in rehabilitation interventions
Evaluation tool
The Barthel Index is a validated 100-point scale comprising 10 ADL items, including feeding, bathing, grooming, dressing, bowel and bladder control, toilet use, transfers (e.g., bed to chair), mobility, and stair climbing. The scores for items related to lower and upper extremity function, such as mobility and feeding, respectively, are particularly indicative of motor recovery (Yan et al., 2024). Each item is graded on a three-tiered system reflecting the degree of physical assistance required. This internationally recognized instrument, including its multilingual adaptations, demonstrates high reliability in assessing rehabilitation outcomes and has been extensively implemented in clinical trials and observational studies (Collin et al., 1988; Winstein et al., 2016).
The Longshi Scale is a scientifically validated tool that categorizes individuals into bedridden, domestic, and community groups based on their mobility (Liu et al., 2020). It further classifies patients into six disability grades, from Grade 1 (complete independence) to Grade 6 (complete dependence). Recommended as a national standard in 2018, this scale assesses disability using three key groups and six grades (Bahlman-van Ooijen et al., 2025), as shown in Figure 1. The assessment process for disability grades is shown in Figure 2. LS grades 5 and 6 fall within the bedridden group, with grade 6 representing the most severe level, in which patients are unable to get out of bed independently and can only move within the bed. LS grades 3 and 4 belong to the domestic group, in which patients can get out of bed and move independently within the home but are limited to indoor activities. LS grades 1 and 2 belong to the community group, in which patients are able to transfer outdoors and engage in outdoor activities (including using a wheelchair) (Xiong et al., 2023).
Figure 1. Longshi Scale for assessing the activities of daily living. (Reproduced from Liu et al., 2022, under the terms of the Creative Commons Attribution 4.0 International License).
Figure 2. Assessment process for disability grades. LS: Grade 1, complete independence; Grade 2, slight dependence; Grade 3, most self-care; Grade 4, partial self-care; Grade 5, severe dependence; Grade 6, complete dependence.
Data collection
The BI scores and LS grades were assessed using the Barthel Index and Longshi Scale through face-to-face interviews conducted by evaluators with the patients. To ensure optimal validity and interrater reliability, all data collection via questionnaires and patient evaluations, including the LS and BI, were performed by healthcare professionals, including physicians, nurses, and therapists, who received online training and understood the purpose of the study. Demographic variables and clinical characteristics were collected through structured electronic questionnaires administered via a secure WeChat-based platform (Mini Program). Several trained professional assessors independently evaluated participants using these standardized electronic questionnaires. Assessments were conducted during the stable post-stroke phase (approximately 3 months), with research coordinators at each center completing data collection within this time window.
Analysis methods
Statistical analyses were performed using the software package SPSS 24.0 (SPSS Inc., Chicago, IL, United States). The normality of continuous data distribution was assessed using the Shapiro–Wilk test. A significance level of p < 0.05 was set to reject the null hypothesis of normality (Benjamin et al., 2018). Because the data did not satisfy the assumption of normal distribution, all statistical tests applied were nonparametric. Based on the normality assessment, continuous variables are presented as the mean ± standard deviation (if normally distributed) or the median and interquartile range (if non-normally distributed). Categorical variables are presented as counts and percentages. Descriptive statistics were used to examine the distribution of the BI scores according to the LS grade. Sensitivity refers to the proportion of patients correctly identified by the BI cutoff for a given LS grade, while specificity refers to the proportion correctly excluded. To determine appropriate BI cutoff values corresponding to each LS grade, we calculated the sensitivity and specificity of multiple BI scores for discriminating between adjacent LS grades (e.g., grade 2 vs. grade 3). The score that maximized the Youden index (J = sensitivity + specificity − 1) was selected as the optimal cutoff. This method balances the correct identification of patients within a target LS grade (sensitivity) and the correct exclusion of those not in that grade (specificity).
Positive predictive value (PPV) represents the probability that a patient with a BI score above the cutoff truly belongs to the higher LS grade. Negative predictive value (NPV) represents the probability that a patient with a BI score below the cutoff truly belongs to the lower LS grade. The strength of the correlation was interpreted as follows: |r| <0.3 indicated a weak correlation; 0.3 ≤ |r| <0.7 indicated a moderate correlation; and |r| ≥0.7 indicated a strong correlation (Cohen, 2013). Spearman correlation analysis was used to examine the relationship between LS grades and BI scores, and the Kruskal–Wallis H test was used to compare BI scores across different LS grades. Where a significant difference was found, post hoc pairwise comparisons were conducted using Dunn’s test, applying a Bonferroni correction for multiple comparisons. The effect size for the Kruskal–Wallis test was calculated as eta-squared (η2) based on the H statistic, with values interpreted as follows: 0.01, small; 0.06, medium; and 0.14, large.
Receiver operating characteristic (ROC) curve analysis was employed to determine the optimal BI cutoff scores for discriminating adjacent disability levels defined by the LS. Given the ordinal nature of the LS, a threshold-based approach was utilized. The optimal cutoff point on the ROC curve for each comparison was selected by maximizing the Youden index (J = sensitivity + specificity − 1), which identifies the value that best balances the trade-off between sensitivity and specificity.
Results
Population characteristics
The patients’ baseline characteristics are shown in Table 1. Of the 16,412 patients, 6,387 (38.9%) were male and 10,025 (61.1%) were female. In addition, 12,662 (77.2%) were younger than 60 years, 2,596 (15.8%) were between the ages of 60 and 79, and 1,154 (7.0%) were ≥80 years. Figure 3 illustrates the BI frequency distribution in each LS grade and demonstrates the median, inter-quartile range, minimum, and maximum of the BI scores.
The distribution of BI total scores varied across the different LS grades, as visually summarized in Figure 3. Lower LS grades (indicating greater dependence) were associated with a clustering of higher BI scores, while higher LS grades showed a wider spread and lower median BI scores.
Optimal BI cutoff scores for LS grades
The optimal cutoff score of the BI was determined to distinguish different disability grades of the LS. The sensitivity, specificity, PPV, and NPV for each age group and the total population are detailed in Table 2.
The BI score effectively distinguishes between different LS grades, with optimal cutoff values varying by both age and the specific LS grade being distinguished. Table 2 presents the optimal BI cutoff scores for distinguishing different levels of disability severity. Furthermore, when examining different age groups, certain differences in the BI cutoff values were observed. Each cutoff value corresponds to the level at which reaching or exceeding this score classifies the patient into this LS grade or a lower numerical grade (indicating better functionality). The Youden index values of LS Grade 1 were 0.903, 0.858, and 0.850, respectively, across age groups (<60 years, 60–79 years, and ≥80 years). The results for LS Grade 1 and LS Grade 6 confirm that BI ≥75 and BI < 5 are reliable criteria for identifying patients with complete independence and complete dependence, respectively. However, the model’s low PPV for LS Grade ≥3 suggests that BI cutoff scores alone lack the precision to cleanly differentiate between adjacent levels of moderate dependency as they result in a high number of false positives.
Correlation between LS grades and BI scores
The correlations between LS grades and BI scores were stratified by three age groups and are presented in Table 3. Further analysis of the BI's discriminatory power across the three overarching LS categories (Bedridden, Domestic, and Community) is provided in the Supplementary Material (Supplementary Figures 1–4).
AUC for the BI cutoff score
The area under the curve (AUC) and the corresponding optimal BI cutoff scores derived from ROC analysis for each age group are summarized in Table 4.
All AUC values were statistically significant (greater than 0.5, p < 0.05). Notably, AUC values exceeding 0.9 for several key distinctions indicate excellent diagnostic accuracy. These results are further illustrated in the ROC curves shown in Figure 4.
As shown in Figure 4, AUC values were higher than 0.90 for LS levels 1 and 6, with the highest for LS grade 6 (AUC = 0.98), indicating that the BI has excellent discriminative ability for these grades.
Discussion
Optimal cutoff scores and diagnostic performance of BI for LS grades
In this study, we analyzed the statistically significant association between LS and BI by calculating the optimal BI cutoff scores for each LS grade. Based on our results, the optimal BI cutoffs for the total cohort and the <60-year age subgroup were identified as follows: ≥75 distinguishing LS Grade 1 from LS Grade ≥2), ≥50 distinguishing LS Grade ≤2 from LS Grade ≥3, ≥45 distinguishing LS Grade ≤3 from LS Grade ≥4, ≥25 distinguishing LS Grade ≤4 from LS Grade ≥5, and <5 identifying completely dependent patients (LS Grade 6). Notably, the BI cutoffs of ≥75 and <5 demonstrate exceptionally high discriminatory power by clearly separating the clinically distinct states of “complete independence” (LS Grade 1) and “complete dependence” (LS Grade 6). This moderate performance for intermediate grades (e.g., 2 and 4) aligns with the fact that the functional transitions between these adjacent LS levels (e.g., from moderate to mild dependence) are behaviorally more subtle and complex, making them inherently more challenging for a global functional measure like the BI to delineate sharply (Gittler and Davis, 2018). To our knowledge, this is the first study to establish BI cutoff scores corresponding to LS grades across age groups in stroke patients. The BI provides a multidimensional assessment of basic ADL and is increasingly recognized as a primary foundation for designing informal care interventions for individuals with disabilities (Bahlman-van Ooijen et al., 2025). The findings of this study may contribute to the evaluation of LS grades. For example, changes in LS grades could help determine clinically meaningful improvements in ADL function among stroke patients. Although there is currently no unified dichotomization for the LS grades, this study aims to establish and validate data-driven cutoffs to address this gap.
When evaluating the LS grade, it is clear that a BI score of 75 or above is classified as LS 1 (complete independence/slight dependence), and a BI score of less than 5 is classified as LS 6 (severe dependence/complete dependence). For patients with complete disability, the BI cutoff is highly consistent with other studies and is relatively accurate. The BI, with its finer granularity (0–100 scale), is more sensitive to small, incremental changes in function, making it superior for detailed progress tracking in rehabilitation research or intensive therapy settings and for predicting specific outcomes such as length of stay. Conversely, the LS, with its categorical grades, excels in rapid triage in busy clinical environments (e.g., emergency departments and outpatient clinics), facilitating quick decision-making about care pathways and in large-scale population screening or registry work, where speed and simplicity are paramount. The BI provides depth, while the LS provides efficiency. The combination of LS and BI can address specific application problems in scenarios, particularly in determining LS 6 grade when BI <5. For patients with extremely low BI scores (ADL ability grade 0), the BI may have difficulty distinguishing subtle differences in functional ability. However, the BI cutoff score addresses the “floor effect” of BI by providing a crucial stratification within the conventionally defined “severe disability” band (BI <40), thus may help distinguish the differences in daily living activities between completely disabled patients and those with other LS grades in care scenarios (Sahin and Acaröz, 2025; Balu, 2009). Although BI has been considered sufficient for measuring physical function in elderly rehabilitation patients, it may lack sensitivity for detecting finer, clinically meaningful gradations of disability, particularly in stroke populations (Quinn et al., 2011). Given that BI originates from and is strongly correlated with ADL, the cutoff values in LS 1–6 can help caregivers identify patients in LS grade 6 who are completely dependent, thereby further formulating care plans. For instance, a patient classified as LS grade 3 (moderate dependence) with a BI score of 60 may have specific deficits in stair climbing and bathing. This detailed BI profile, interpreted within the broader functional category provided by the LS, allows the clinical team to formulate a targeted care plan focusing on lower limb strength training and adaptive equipment for bathing while also setting a realistic goal of progressing toward LS grade 4 (mild dependence).
Differences in BI cutoff scores across LS grades among different age groups
The analysis revealed that several critical BI cutoffs remained consistent across age groups, specifically ≥75, ≥45, and <5. The optimal BI cutoff scores for distinguishing between adjacent LS grades did not show significant variation when analyses were conducted across different age groups in our study. This suggests that these cutoffs are robust and potentially generalizable across these subpopulations, reinforcing their utility as stable benchmarks for functional classification. In terms of age-stratified patterns in LS grades, while the overall inverse relationship between BI scores and LS grades remained consistent across age groups, significant intergroup variations emerged, necessitating age-specific interpretation. Critically, the cutoffs for the ≥80-year-old individuals were found to be higher than those of other age groups. From the study results, an age-specific pattern is evident: the proportion of patients with mild dependence (Grade 1) in the ≥80-year-old group sharply decreases, while over 55.6% fall into the severe dependence categories (grades 5–6). This aligns with real-world observations of a high degree of disability among the elderly. Therefore, this study suggests a dual-focused resource allocation strategy. Substantial resources must be directed to support and care for those with severe dependence (LS grades 5–6). Concurrently, the significant proportion of patients in the mild-to-moderate dependence categories (LS grades 1–4) presents a critical opportunity for secondary prevention and early intervention. Community-based initiatives such as tailored exercise programs, fall prevention strategies, and self-management education aimed at these individuals could help maintain functional capacity, reduce the risk of transitioning to higher dependence categories, and ultimately lessen the long-term societal care burden. This approach advocates for a more comprehensive and stratified allocation of healthcare resources (Doi et al., 2020).
The optimal BI cutoffs demonstrated marked age dependence, reflected in the composite sum score (sensitivity + specificity). The sum score for determining cutoff values reflects the optimal balance between maximizing the ability to correctly identify LS grades. Although BI has high discrimination efficacy for LS Grade 1 (0.898), we observe a slight downward trend in this index in the elderly group (from 0.903 in the <60-year-old group to 0.850 in the ≥80-year-old group). This may be due to comorbidities or age-related frailty, which makes the BI score pattern of elderly patients more heterogeneous, even though they are clinically assessed at the ‘community activity’ level (Bouwstra et al., 2019). It is notable that BI exhibited the weakest correlation coefficient for LS Grades 2–4 in the ≥80 group, indicating that the correlation shifts with age. In LS grades 5 and 6, the correlation between LS and BI is strongest in age group 3 (≥80 years old) compared with groups 1 and 2 (<60 years old and 60–79 years old, respectively) among different age groups. This indicates that, for the elderly, combining BI cutoff scores may be necessary to most accurately define their functional status.”
An interesting finding of this study is that sequential BI declines across LS grades 1–6 in the age group 1 (<60 years). For the age group 2 (60–79 years), there is substantial BI overlap in moderate grades (LS 2–5). In addition, for the age group 3 (≥80 years), there is pronounced merging of BI ranges between LS grades 3–4. The distribution range of BI cutoff values varies among LS grades representing different age groups. BI provides a limited ability to identify disability grades for elderly patients, especially in the moderate LS grades, and the cutoff values need to be combined for determination.
Interplay between the LS and BI in stroke disability assessment
The differential performance of the LS and BI stems from their inherent design philosophies. The LS prioritizes dysfunction, particularly mobility, often leading to a classification of “bedridden” (LS Grade 6) for patients with severe movement impairment. In contrast, the BI is a multi-item ADL score that credits preserved function in non-mobility tasks. Consequently, a patient with stroke might be classified as LS Grade 6 yet achieve a moderate or even high BI score by retaining the ability to perform self-care activities (e.g., eating and grooming) in bed with minimal assistance. This fundamental discrepancy explains seemingly paradoxical clinical observations and underscores why the BI may have limited distinguishing power for functional status in patients with moderate, mobility-defined disabilities. Therefore, we recommend a combined or sequential use of both scales in clinical practice and research. The LS can serve as an efficient tool for initial screening and gross categorization, while the BI can be deployed for detailed assessment, progress monitoring, and nuanced outcome measurement in selected patients or research protocols.
For older adults, particularly those aged ≥80 years, BI scores should be interpreted within an age-specific context and integrated with complementary assessments to minimize the risk of misclassification. Accordingly, resource allocation strategies should reflect this functional gradient, with particular emphasis on the ≥80-year-old age group, among whom more than 55% exhibited severe functional dependence. This interpretation is supported by previous studies, which have proposed a BI threshold of ≤60 at discharge to define new-onset functional impairment. Furthermore, studies combining the BI with discriminant function analysis have demonstrated concordance with standard expert judgment in over 86% of nursing status classifications. Importantly, the remaining discrepancies were predominantly observed in borderline cases between adjacent levels of nursing care (Masuda et al., 2024; Maidhof et al., 1999). For stroke patients with mild to severe disability, the LS has demonstrated higher responsiveness compared to the BI, potentially due to the ceiling and floor effects of the BI (Gao et al., 2023). Similarly, the BI is less precise than the LS in capturing overall changes in disability status, particularly in aspects such as emotional communication and recreational activities. The LS includes items that assess emotional wellbeing and social interaction, areas that are often overlooked in functional status evaluations using the BI. These domains are vital to functional recovery, especially in elderly patients with moderate disability, where traditional measures such as the BI may fail to detect significant declines in social and emotional functioning (Wang et al., 2019). Importantly, a maximum BI score of 100 (indicating independence in all 10 items) does not necessarily imply independence in more detailed ADLs (Bahlman-van Ooijen et al., 2025).
These findings collectively suggest that the BI alone may be insufficient for detecting subtle changes in functional impairment, particularly among patients with mild stroke. Nevertheless, the AUC values in this study were generally above 0.8 (except for LS Grade 2), confirming that the BI provides theoretical support for the scientific assessment of LS grades as a quantitative measure of functional independence in ADLs.
Implications of assessment for disabled individuals
Currently, most clinical applications use the BI scale. Our team independently developed the national standard LS earlier, which aligns with BI and demonstrates high consistency, reliability, and validity in assessing disability levels (Wang et al., 2019; Zhou et al., 2024). The precise correspondence between BI cutoffs and LS grades provides a quantitative tool for clinical practice, reducing reliance on imaging exams while enabling quick disability assessments.
The LS possesses distinct Chinese characteristics and offers a clinically efficient alternative to conventional disability assessments, with its pictorial-based design enabling rapid evaluation without specialized training (Wang et al., 2019; Liu et al., 2023). This research provides critical benchmarks for clinical practice: a BI score ≥75 serves as a reliable indicator of functional independence, while a score <5 flags complete dependency, necessitating prioritized care. Importantly, assessments for adults aged ≥80 must account for their higher disability threshold, where a seemingly moderate BI score may still conceal significant care needs. These findings support the combined use of BI scores and LS grades to provide a nuanced and comprehensive assessment of functional status. For instance, to provide a comprehensive assessment for older adults requiring care across different age groups, rather than relying solely on the BI (Ohura et al., 2014).
Strengths and limitations
The LS offers practical advantages due to its simplicity and minimal resource requirements, enhancing its feasibility for large-scale studies (Xue et al., 2024). Our validation of precise BI–LS cutoffs provides statistically robust criteria for disability grading, which is particularly valuable in resource-limited settings. However, this study has several limitations. First, as participants were recruited from inpatient rehabilitation units, the sample may over-represent survivors with rehabilitation potential and under-represent those with the most severe strokes, introducing selection bias. Nonetheless, this setting was necessary to obtain the large volume of paired BI and LS assessments required for robust cutoff determination. Second, the validity of the identified cutoff scores needs to be confirmed in more diverse populations, including non-Chinese ethnic groups and patients with disabilities from other etiologies. Finally, readers should note that the cutoff for Grade 2 in the 60–79 age group was based on a limited subsample (n = 6). While presented for completeness, this value should be considered preliminary and interpreted cautiously until validated in larger cohorts.
Conclusion
This study establishes and validates a critical bridge between the Barthel Index and the Longshi Scale by defining optimal, age-stratified cutoff scores that map BI scores to distinct LS disability grades. Our findings demonstrate that the BI excellently discriminates the extremes of functional independence (LS Grade 1) and complete dependence (LS Grade 6), while its performance for intermediate grades is less pronounced.
The BI cutoffs provide a standardized, quantitative method to enhance the precision of disability assessment in clinical practice, particularly within the conventionally poorly differentiated “severe disability” band (BI <40). This enables more tailored care planning and efficient resource allocation, especially for vulnerable elderly populations. Future research should focus on validating these cutoffs in diverse populations and healthcare settings to foster broader clinical adoption and cross-cultural consensus.
Summary statement of implications for practice
What does the research add to existing knowledge in gerontology?
• Establishes, for the first time, a quantitative conversion system between the Chinese scale (LS) and the international standard (BI).
• Reveals the effect of age stratification, showing that the corresponding BI scores for LS grades vary among different age groups.
• Demonstrates that picture-based LS grades, integrated with the BI cutoffs, enable precise classification, advancing gerontology from a universal single-scale approach to multiple evaluation models.
What are the implications of this new knowledge for nursing care for older people?
• Universal screening: A BI score ≥75 can serve as a reliable indicator to identify individuals with good functional ability and a lower risk of disability.
• Prioritized resource allocation: A BI score <5 is a clear marker for identifying complete dependency, signaling the need for highest-priority care interventions.
• Advanced age assessment: When assessing individuals aged ≥80 years, it is crucial to recognize their higher disability threshold to avoid overlooking potential care needs based on a deceptively moderate BI score.
How could the findings be used to influence policy, practice, research, or education?
• Long-term care insurance: The BI cutoff score (BI < 5 equivalent to LS Grade 6) serves as a decision trigger for long-term care, enabling the implementation of appropriate care measures for older adults.
• Primary healthcare funding: Additional subsidies should be allocated to community centers based on the coordinated BI-LS assessment care framework.
• Senior care facility protocols: Multiple healthcare settings, including community and home-based care, should be standardized based on the BI cutoff value framework.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Ethics Committee of Shenzhen Second People’s Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin. Written informed consent was obtained from the minor(s)’ legal guardian/next of kin for the publication of any potentially identifiable images or data included in this article.
Author contributions
JZ: Conceptualization, Data curation, Formal analysis, Methodology, Supervision, Writing – original draft. MZ: Formal analysis, Validation, Writing – review and editing. AL: Data curation, Validation, Writing – review and editing. RY: Conceptualization, Formal analysis, Writing – review and editing. YW: Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Writing – review and editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study was supported by the Sanming Project of Medicine in Shenzhen (No. SZSM202111010), the Shenzhen Key Medical Discipline Construction Fund (No. SZXK048), and the Shenzhen High-level Hospital Construction Fund.
Acknowledgements
The authors would like to express their gratitude to the participants and experts and caregivers for assisting us to perform the cross-sectional survey and Delphi method.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fragi.2026.1701910/full#supplementary-material
References
Bahlman-van Ooijen, W., Giesen, J., Bakker-Jacobs, A., Vermeulen, H., and Huisman-de Waal, G. (2025). Low-value and high-value care recommendations in nursing: a systematic assessment of clinical practice guidelines. J. Nurs. Scholarsh. 57 (3), 486–499. doi:10.36740/WLek20230710210.1111/jnu.13029
Balu, S. (2009). Differences in psychometric properties, cut-off scores, and outcomes between the barthel index and modified rankin scale in pharmacotherapy-based stroke trials: systematic literature review. Acta Neurol. Belg 25 (6), 1329–1341. doi:10.1007/s13760-023-02343-610.1185/03007990902875877
Benjamin, D. J., Berger, J. O., Johannesson, M., Nosek, B. A., Wagenmakers, E. J., Berk, R., et al. (2018). Redefine statistical significance. Nat. Hum. Behav. 2 (1), 6–10. doi:10.1038/s41562-017-0189-z
Bouwstra, H., Smit, E. B., Wattel, E. M., van der Wouden, J. C., Hertogh, C., Terluin, B., et al. (2019). Measurement properties of the barthel index in geriatric rehabilitation. J. Am. Med. Dir. Assoc. 20 (4), 420–425.e421. doi:10.1016/j.jamda.2018.09.033
Chamberlain, A. M., Finney Rutten, L. J., Manemann, S. M., Yawn, B. P., Jacobson, D. J., Fan, C., et al. (2016). Frailty trajectories in an elderly population-based cohort. J. Am. Geriatr. Soc. 64 (2), 285–292. doi:10.1111/jgs.13944
Collin, C., Wade, D. T., Davies, S., and Horne, V. (1988). The barthel ADL index: a reliability study. Int. Disabil. Stud. 10 (2), 61–63. doi:10.3109/09638288809164103
Doi, T., Nakakubo, S., Tsutsumimoto, K., Kim, M. J., Kurita, S., Ishii, H., et al. (2020). Spatio-temporal gait variables predicted incident disability. J. Neuroeng Rehabil. 17 (1), 11. doi:10.1186/s12984-020-0643-4
Gao, Y., Wang, Y., Li, D., Zhao, J., Dong, Z., Zhou, J., et al. (2021). Disability assessment in stroke: relationship among the pictorial-based longshi scale, the barthel index, and the modified rankin scale. Clin. Rehabil. 35 (4), 606–613. doi:10.1177/0269215520975922
Gao, Y., Zhao, J., Liu, X., Xie, X., and Wang, Y. (2023). Content comparison of long-term care instruments based on the international classification of functioning, disability and health (ICF). BMC Geriatr. 23 (1), 146. doi:10.1186/s12877-023-03824-2
GBD 2021 Stroke Risk Factor Collaborators (2024). Global, regional, and national burden of stroke and its risk factors, 1990-2021: a systematic analysis for the global burden of disease study 2021. Lancet Neurol. 23 (10), 973–1003. doi:10.1016/s1474-4422(24)00369-7
Gittler, M., and Davis, A. M. (2018). Guidelines for adult stroke rehabilitation and recovery. Jama 319 (8), 820–821. doi:10.1001/jama.2017.22036
Hotter, B., Padberg, I., Liebenau, A., Knispel, P., Heel, S., Steube, D., et al. (2018). Identifying unmet needs in long-term stroke care using in-depth assessment and the post-stroke checklist - the managing aftercare for stroke (MAS-I) study. Eur. Stroke J. 3 (3), 237–245. doi:10.1177/2396987318771174
Littooij, E., Doodeman, S., Holla, J., Ouwerkerk, M., Post, L., Satink, T., et al. (2022). Setting meaningful goals in rehabilitation: a qualitative study on the experiences of clients and Clinicians in working with a practical tool. Clin. Rehabil. 36 (3), 415–428. doi:10.3389/fneur.2024.142044310.1177/02692155211046463
Liu, F., Tsang, R. C., Zhou, J., Zhou, M., Zha, F., Long, J., et al. (2020). Relationship of barthel index and its short form with the modified rankin scale in acute stroke patients. J. Stroke Cerebrovasc. Dis. 29 (9), 105033. doi:10.1016/j.jstrokecerebrovasdis.2020.105033
Liu, X., Zhou, M., Zhao, J., Gao, Y., Wang, Y., Zhou, J., et al. (2022). Functional Independence and disability evaluation in stroke patients: optimal cutoff scores for a pictorial-based longshi scale, barthel index, and modified rankin scale. Front. Neurol. 13, 710852. doi:10.3389/fneur.2022.710852
Liu, X., Gao, Y., Zhao, J., Zhou, M., Wan, L., Nie, G., et al. (2023). Reliability of pictorial longshi scale for informal caregivers to evaluate the functional Independence and disability. Nurs. Open 10 (3), 1852–1862. doi:10.1002/nop2.1448
Lübke, N., Meinck, M., and Von Renteln-Kruse, W. (2004). The barthel index in geriatrics. A context analysis for the Hamburg classification manual. Z Gerontol. Geriatr. 37 (4), 316–326. doi:10.1007/s00391-004-0233-2
Maidhof, R., Pirk, O., Winkel, M., and Hartung, J. (1999). A different approach--a new instrument for expert assessment of disability stages? Gesundheitswesen 61 (8-9), 380–384.
Masuda, S., Fukasawa, T., Inokuchi, S., Otsuki, B., Murata, K., Shimizu, T., et al. (2024). Early prediction of functional impairment at hospital discharge in patients with osteoporotic vertebral fracture: a machine learning approach. Sci. Rep. 14 (1), 31139. doi:10.1038/s41598-024-82359-x
Mueller-Schotte, S., Zuithoff, N. P. A., Van der Schouw, Y. T., Schuurmans, M. J., and Bleijenberg, N. (2020). Trends in risk of limitations in instrumental activities of daily living over age in older persons with and without multiple chronic conditions. J. Gerontol. A Biol. Sci. Med. Sci. 75 (1), 197–203. doi:10.1093/gerona/glz049
Ohura, T., Higashi, T., Ishizaki, T., and Nakayama, T. (2014). Assessment of the validity and internal consistency of a performance evaluation tool based on the Japanese version of the modified barthel index for elderly people living at home. J. Phys. Ther. Sci. 26 (12), 1971–1974. doi:10.1589/jpts.26.1971
Pan, H., Zhao, Y., Wang, H., Li, X., Leung, E., Chen, F., et al. (2021). Influencing factors of barthel index scores among the community-dwelling elderly in Hong Kong: a random intercept model. BMC Geriatr. 21 (1), 484. doi:10.1186/s12877-021-02422-4
Quinn, T. J., Langhorne, P., and Stott, D. J. (2011). Barthel index for stroke trials: development, properties, and application. Stroke 42 (4), 1146–1151. doi:10.1161/STROKEAHA.110.598540
Sahin, U. K., and Acaröz, S. (2025). Predictors of the disability in activities of daily living in nursing home residents: a descriptive study. Exp. Aging Res. 51 (3), 257–270. doi:10.1016/j.jfma.2021.05.02910.1080/0361073x.2024.2421686
Shin, S., Lee, Y., Chang, W. H., Sohn, M. K., Lee, J., Kim, D. Y., et al. (2022). Multifaceted assessment of functional outcomes in survivors of first-time stroke. JAMA Netw. Open 5 (9), e2233094. doi:10.1001/jamanetworkopen.2022.33094
Wang, Y., Guo, S., Zheng, J., Wang, Q. M., Zhang, Y., Liang, Z., et al. (2019). User testing of the psychometric properties of pictorial-based disability assessment longshi scale by healthcare professionals and non-professionals: a Chinese study in shenzhen. Clin. Rehabil. 33 (9), 1479–1491. doi:10.1177/0269215519846543
Wang, Y., Li, S., Pan, W., Xiao, P., Zhang, J., Wang, Q. M., et al. (2020). Evaluation of the disability assessment longshi scale: a multicenter study. J. Int. Med. Res. 48 (7), 300060520934299. doi:10.1177/0300060520934299
Winstein, C. J., Stein, J., Arena, R., Bates, B., Cherney, L. R., Cramer, S. C., et al. (2016). Guidelines for adult stroke rehabilitation and recovery: a guideline for healthcare professionals from the American heart association/american stroke association. Stroke 47 (6), e98–e169. doi:10.1161/str.0000000000000098
Xiong, F., Liu, X., Zhou, M., Wang, C., Liu, Q., Li, J., et al. (2023). Validation of the longshi scale in assessment of function in patients from different economic. Educational Regional Backgrounds China 37 (11), 1552–1558. doi:10.1177/02692155231174689
Xue, K., Li, W., Liu, F., Liu, X., Wong, J., Zhou, M., et al. (2024). Evaluation of activities of daily living using an electronic version of the longshi scale in patients with stroke: reliability, consistency, and preference. BMC Med. Inf. Decis. Mak. 24 (1), 125. doi:10.1186/s12911-024-02508-0
Yan, J., Zhang, Q., Zhou, J., Zha, F., Gao, Y., Li, D., et al. (2024). Inverted U-shaped relationship between barthel index score and falls in Chinese non-bedridden patients: a cross-sectional study. Top. Stroke Rehabil. 31 (7), 692–702. doi:10.1038/s41598-024-57671-110.1080/10749357.2024.2318089
Zandvliet, S. B., Meskers, C. G., Nijland, R. H., Daffertshofer, A., Kwakkel, G., and van Wegen, E. E. (2019). The effect of cerebellar transcranial direct current stimulation to improve standing balance performance early post-stroke. Study Protocol a Randomized Controlled Trial 14 (6), 650–657. doi:10.1177/1747493019830312
Zhang, Y., Xiong, Y., Yu, Q., Shen, S., Chen, L., and Lei, X. (2021). The activity of daily living (ADL) subgroups and health impairment among Chinese elderly: a latent profile analysis. BMC Geriatr. 21 (1), 30. doi:10.1186/s12877-020-01986-x
Zhang, Q., Yan, J., Long, J., Wang, Y., Li, D., Zhou, M., et al. (2024). Exploring the association between activities of daily living ability and injurious falls in older stroke patients with different activity ranges. Sci. Rep. 14 (1), 19731. doi:10.1038/s41598-024-70413-7
Zhou, M., Liu, X., Zha, F., Liu, F., Zhou, J., Huang, M., et al. (2021). Stroke outcome assessment: optimizing cutoff scores for the longshi scale, modified rankin scale and barthel index. PLoS One 16 (5), e0251103. doi:10.1371/journal.pone.0251103
Zhou, M., Zha, F., Liu, F., Zhou, J., Liu, X., Li, J., et al. (2023). Long-term care status for the elderly with different levels of physical ability: a cross-sectional survey in first-tier cities of China. Top. Stroke Rehabil. 23 (1), 953. doi:10.1080/10749357.2024.231808910.1186/s12913-023-09987-3
Keywords: activities of daily living, Barthel index, cutoff score, disability, receiver operating characteristic curve
Citation: Zhang J, Zhou M, Liu A, Ye R and Wang Y (2026) Determining optimal Barthel Index cutoff scores for predicting Longshi Scale grades across age groups in stroke patients. Front. Aging 7:1701910. doi: 10.3389/fragi.2026.1701910
Received: 09 September 2025; Accepted: 16 January 2026;
Published: 09 February 2026.
Edited by:
Lawrence D Hayes, Lancaster University, United KingdomReviewed by:
Michał Błaż, Krakowski Szpital Specjalistyczny im Jana Pawla II, PolandDaniel Tinnion, Manchester Metropolitan University, United Kingdom
Copyright © 2026 Zhang, Zhou, Liu, Ye and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yulong Wang, eWx3YW5nNjY4QDE2My5jb20=