Your new experience awaits. Try the new design now and help us make it even better

PERSPECTIVE article

Front. Rehabil. Sci., 14 August 2025

Sec. Disability, Rehabilitation, and Inclusion

Volume 6 - 2025 | https://doi.org/10.3389/fresc.2025.1657105

This article is part of the Research TopicBridging the Gap: Integrating Performance-Based Measures and Person-Reported Outcomes in Disability EvaluationView all 6 articles

Aligning metrics with meaning: considerations for measurement selection in disability evaluation

  • 1Department of Community Health, Tufts University, Medford, MA, United States
  • 2Rehabilitation Medicine Department, NIH Clinical Center, Bethesda, MD, United States

This article explores the role of patient-reported outcome measures (PROMs) in disability evaluation, a measurement domain traditionally dominated by clinical performance-based assessments. While performance tests are valued for their perceived objectivity, PROMs have gained prominence in research for their efficiency, patient-centered orientation, and capacity to capture subjective experiences relevant to functional decline related to potentially disabling conditions. The manuscript underscores the importance of aligning measurement tools with the specific purpose of evaluation—whether clinical, policy-driven, or programmatic. Using the Work Disability Functional Assessment Battery (WD-FAB) as an illustrative case, it compares the strengths and limitations of PROMs and performance-based tools in evaluating mental and physical function in the context of disability assessment. PROMs such as the WD-FAB can systematically and efficiently generate scores that represent function across multiple domains of function (e.g., mood and emotion, mobility, cognition) and are particularly well-suited for detecting change over time in large-scale applications. In contrast, performance-based assessments, while useful in certain clinical scenarios, are often resource-intensive and may not accurately reflect real-world functioning. The paper argues that although PROMs should not replace performance measures entirely, they represent a valuable and often preferable alternative or complement in many disability evaluation contexts. Ultimately, the choice of assessment tool should consider the intended use, resource constraints, and the need for comprehensive, patient-centered data.

Introduction

Historically, disability evaluation, whether for benefit determination, programmatic evaluation, or clinical purposes, has focused on performance-based testing due to the general underlying belief that it offers greater value than other forms of measurement such as patient-reported outcome measures (PROMs) (1, 2). However, for research purposes, there has been an overwhelming uptake of PROs with considerable investment in their development, validation, and interpretation in the last decade (3, 4). A substantial body of research has demonstrated the correspondence between performance tests and self-report as well as the conditions under which this correspondence breaks down (58). In addition, the advent of symptom-validity tests has helped to address concerns about the intentional misreporting of function for personal gain by offering a way to flag cases of concern (9). This raises questions about why there has not been a better uptake of PROMs for the purpose of disability evaluation. Thus, the goal of this perspective is to elucidate the issues relevant to the use of PROMs in disability evaluation.

Measurement considerations

The evaluation of disability varies widely based on the purpose of the assessment (10, 11). The selection of the best measurement tool to address a specific scientific question starts with clearly identifying the purpose of measurement. Yet, both clinicians and researchers often gravitate toward discussion of tool availability and features (including psychometric properties) without adequate consideration of what they are trying to accomplish. For instance, if you want to know the number of people who could potentially benefit from a new walker, you need to measure aspects of function among adults who are potential users of the device. Not only would you need to know the number of people with walking difficulties, but if arm use was required for this device, then you would also need to know about arm function and potentially the ability to grip the device. You may need to know about balance and the cognitive ability to operate features of the device. Thus, you need very specific information about a broad range of functioning in the population of interest. In contrast, if you wish to know the population of people who could potentially benefit from civil rights legislation to reduce barriers for people with disabilities, such as the Americans with Disabilities Act, you would want to capture the broadest group of people possible who could be affected by such legislation, including those with difficulties hearing, seeing, thinking, moving, doing everyday activities, and participating in everyday tasks and social activities.

As part of a collaboration with the Social Security Administration, we were tasked with identifying efficient ways to assess work disability. Since contemporary concepts of disability characterize it as the outcome of the interaction between the capabilities of individuals and the environment in which they are operating, the evaluation of work disability requires the examination of all aspects of functioning of the person (at the person level as opposed to the cellular, organ, or body system level) in relationship to the demands of various jobs (12). Thus, by clearly identifying the specific purpose for disability evaluation, the choice of measure, be it self-reported or performance-based, becomes clearer, which simplifies the task of measure selection.

Once measurement options have been narrowed by clarifying its purpose, then other aspects of measurement selection come into play such as content coverage, administration mode (paper/pencil, electronic, interview, telephone), other aspects of administration (observed performance vs. self-report, use of proxy respondents), administration time, cost of administration, the need for trained administrators, and psychometric properties of the instruments. Depending on the situational constraints, more than one type of measure may be selected to best address measurement needs.

One crucial aspect of measure selection that is often overlooked is whether the measure has been tested in real-world situations and if score interpretation has been developed and supported. What good is it to measure the work-related functional capabilities of individuals if the level of ability needed for each functional domain relative to a given job is not known? Real-world studies that provide a foundation for clinical and population-based score interpretation are crucial to selection and use of measures for disability evaluation. Examination of whether normative values have been established, the method for establishing the normative values, the basis for establishing cut-points or thresholds, and comparability of scores over time and across population, are essential considerations of score interpretability.

In the context of measuring functional abilities for the purposes of disability evaluation, assessment, and treatment planning, comprehensive measurement relies on obtaining both the subjective experience of individuals, typically collected via self-report measures, and objective indicators of functioning, which usually rely on performance-based assessments (12). Each measurement approach offers distinct advantages and limitations, which must be considered when interpreting outcomes in the context of disability assessment (13, 14). Performance based tests are especially valuable in clinical settings where decisions about rehabilitation or functional limitations must be grounded in demonstrable performance, however the degree to which that translates into real world functioning in daily activities such as work is limited based on previous research findings (15). Furthermore, when measuring the same underlying construct, self-reported measures and performance-based assessments scores have been found to be often moderate to highly correlated (16). A primary advantage of self-reported measures is their capacity to capture the subjective experience of disability. These self-reported instruments provide insights into how individuals perceive their limitations and how these limitations affect their quality of life and daily activities (17).

To illustrate these advantages and opportunities for improvement in the context of disability assessment, we will draw upon previous research examining the reliability and validity of the Work Disability Functional Assessment Battery (WD-FAB) (1822). This body of work provides illustrative examples of the advantages and disadvantages of using a robust self-report-based tool to characterize multiple dimensions of physical and mental health function relevant to disability vs. performance-based assessments of disability among populations with underlying physical and/or mental health conditions. The WD-FAB is a standardized self-report measurement tool using item response theory and computer adaptive testing to characterizes physical and mental functional relevant to work disability along eight key dimensions: Basic Mobility; Upper Body Function; Fine Motor Function; Community Mobility; Mood & Emotions; Resilience & Sociability; Self-Regulation; and Communication & Cognition. The WD-FAB also includes an opt-in scale for individuals who use wheelchairs to capture mobility-related tasks. The WD-FAB draws upon a pool of over 300 items to generate scores for all 8 scales in 15–25 min. WD-FAB scores allow direct comparisons between individuals with self-reported activity limitations with the general population of working age adults and are sensitive to detecting change in function over time. See Table 1 for a description of the WD-FAB measurement domains and descriptions of each scale.

Table 1
www.frontiersin.org

Table 1. Work disability functional assessment battery (WD-FAB) overview.

Advantages and disadvantages of PROMs and in vivo performance based assessment: illustrative examples using the Work-Disability Functional Assessment Battery (WD-FAB)

Mental health functioning

One of the key challenges in measurement related to mental and behavioral health is the extreme variation in condition profiles as well as the fluctuating nature of many limitations that may be related to a given mental or behavioral health condition (23). Clinical providers or family members with deep knowledge of a patient often have greater insight into the patient's overall functional ability as compared to the patient themselves. This brings into question how to best capture true functional ability. Almost all mental and behavioral outcomes are derived from some form of self-reported symptoms, impairments, or functional limitations as reported by the patient to the clinical provider. There are few opportunities to observe functional limitations in simulated clinical settings and even fewer opportunities for measurement in real world settings. The area of health outcome measurement for mental health functioning is unique in this way (23).

To examine strengths and weaknesses of performance-based measures vs. self-report measures among individuals with mental or behavioral health conditions, we have use the Vocational Situational Assessment (VSA) is a 35-item assessment used to guide observations and ratings by trained employment providers to measure work performance among individuals with mental and behavioral health disorders (24). To complete the VSA, sites need to simulate a work environment, and employment-specialist raters must be trained to complete the VSA. For VSA implementation, individuals are observed in a natural work setting for at least 1 h per day for a 3–5-day period. Trained raters then keep notes about their observations and provide final scores and ratings at the conclusion of the observation period.

Our previous work examining work-related disability using the WD-FAB vs. clinician reports found that clinical providers and patients have low to fair levels of overall agreement and systematic differences emerge when you examine various dimensions of mental and behavioral health function (25). This was true for both clinician-rated reports of the WD-FAB vs. the patient reported responses on the WD-FAB as well as the concordance with the performance based measure used. For example, in areas that measured aspects of Mood & Emotions, patients’ scores were generally lower than the matched clinical provider scores (26). This indicates that individuals reported that their functional abilities in the area of Mood & Emotions were more severely impaired than how the clinician rated them for the same construct of measurement. In contrast, for areas such as Self-Regulation, the clinician rating was lower than the patient rating. Follow-up qualitative inquiry with the clinical providers suggested this discrepancy related to potential self-awareness or reporting bias in the patient underreporting with impairment in the particular domains that may have more of a socially negative connotation than other domains of mental and behavioral health.

The discordance between provider and patient reporting indicates both perspectives provide valuable insights into underlying functional limitations. In terms of comparing self-report to observational assessments, we found that performance-based assessments were only moderately correlated to self-reported measures provided by both provider and patients. This discrepancy may reflect the underlying limitations of performance-based measures in terms of ecological validity given that these assessments are typically conducted in controlled environments that may not reflect the individual's day-to-day reality. In an ideal setting, both self-report and performance-based assessments would be used to take advantage of the strengths both modalities provide in characterizing a person's mental and behavioral health functioning. However, because many performance-based measures are resource intensive, the feasibility of implementing both is unlikely. Given that self-reported mental health functioning provides a patient-focused description of experiences and functional limitations, this type of reporting may be preferred in many contexts offering significant advantages of cost, time, and accessibility.

Physical functioning

Using a similar study structure, we examined the utility of using the self-reported WD-FAB as compared to clinician ratings of function and a performance-based functional capacity evaluation called the Physical Work Performance Evaluation (PWPE) among a sample of individuals experiencing physical limitations due to existing musculoskeletal conditions (27). The PWPE is a performance-based assessment that is used in clinical settings to systematically collect information on a person's ability to perform work-relevant tasks. The PWPE involves a range of performance tests consisting of 36 physical tasks that vary in duration. The PWPE must be completed by a licensed clinician and can take approximately 3.5–5 h to complete including preparation, rest periods, and transition/preparation of materials (28).

In contrast to the mental and behavioral health domains, we found that self-reported measures of physical function were more highly correlated with performance-based measures. This finding is consistent with a larger body of research indicating that self-reported measures of physical function are often moderately to highly correlated with performance-based measures of similar constructs of physical function. When thinking about the complexity of disability and work, the WD-FAB offers significant advantages in measurement scope compared to the PWPE. Although the PWPE represents similar tasks and activities to those included in the WD-FAB, the PWPE has a narrower range of tasks given that the assessment must be implemented in a laboratory/clinical setting. This finding again speaks to the limitation of ecological validity of many performance-based assessments. The extent to which these measures translate into real world representation of disability is limited as compared to the subjective experiences of activity limitations and participation restrictions patients describe in navigating their day-to-day lives. This subjective experience is important when looking at overall participation outcomes in the context of disability and is often best captured using self-reported measurement modalities.

In the context of disability evaluation using multiple sources of data, self-reported functioning should be compared to information from other sources to evaluate consistency between the current self-report and behavioral observations, collateral reports, previous self-reports or other records, type and degree of injury/illness, typical symptom presentation, course, response to treatment, etc. (29). Even without these additional sources of data, there are ways to assess the veracity of self-report without any modifications to the test or with minimal modifications. PROMS developed using IRT/CAT based methods such as the WD-FAB instrument can empirically identify discrepancies between obtained scores and expected scores. Mathematical algorithms can help analyze the internal consistency of responses (i.e., answering similar items, similarly, reporting less impairment on easier tasks, more impairment on harder tasks within the same domain). Other approaches to detecting intentional or unintentional misreporting are classified as symptom validity test (SVYs) approaches. Embedded or stand-alone SVTs are measures to assess whether a respondent is providing an accurate and consistent report of his or her symptoms and functioning. Thus, IRT/CAT-based PROMs, such as the WD-FAB, offer a combination of methodological approaches to examine potential misreporting in the form of measurement error by incorporating unscored items, increasing the randomization of item administration, and incorporating symptom validity measures throughout the test.

Clinical utility and feasibility of integrating performance based and patient reported outcomes in disability evaluation

Recent evidence has emerged to support the collection of both performance-based and patient-reported outcomes measures (PROMs) to evaluate disability across domains at the clinical level. It has been shown that complementary rather than redundant information about a person's functional abilities is captured by implementing both performance-based and PROMs (30, 31). For example many PROMs have been validated with minimal clinically detectable change scores affording standardized and objective longitudinal tracking in the clinical setting (32, 33). The opportunity for improved clinical outcomes by employing PROMs arises from shared decision-making, furthering specificity of care needs and thereby patient engagement in the current era of value-based care. Incorporating PROMs in parallel with performance measures can serve the current triple aim of healthcare if carefully and conscientiously implemented to assess both the individual and population levels. PROMs have been able to capture information affecting overall function previously undisclosed by examining performance-based measures alone (34, 35). Patient perception of performance includes longitudinal retrospection of activity whereas clinical performance measured in a structured, point-in-time environment is limited in its wider applicability to daily function (36). Integrating the two measurement approaches potentially provides insight into whole person function and which domains of function require treatment for return to activity that is meaningful to the patient and measurable by the clinician.

Data from PROMs and evidence-based performance measures is ideally captured at the outset of care to achieve mutually defined goals and expectations for care. PROMs integrated for clinical care have also demonstrated responsiveness over time to further provide guidance for ongoing treatment management (37, 38). Furthermore, PROMS are generally easier and cheaper to obtain, thus more likely to support longitudinal measurement. Given the current emphasis on patient centered care, collecting both relevant patient centered PROMs and performance-based measures in an efficient and timely manner presents challenges in the clinical setting. PROMs can be captured prior to the clinical visit in the EMR portal, via telephone interview, or at the point of care. Considerable resources, including previously implemented software, electronic literacy, health literacy, labor, and proxy availability from the health care setting, provider, and patient are required for PROM integration. Capturing domain information in computer adaptive testing (CAT) tools such as the WD-FAB can ultimately reduce provider and patient burden while improving more specific and individualized data for patient education, shared decision making, goal setting and treatment planning (38). Analysis and interpretation of both PROMs and performance measures at the point of care requires significant clinician focus in busy clinical environments (39). Limited provider time for interpretation and integration of findings from both the PROMs and performance measures is common but necessary to facilitate patient engagement and potentially facilitates self-management of function. Furthermore, when findings from PROMs and performance-based measures are discordant, further analysis may be necessary to specify an efficient and effective treatment plan (40).

Overall, self-report tools such as the WD-FAB provide a way to efficiently collect systematic and comprehensive information about individuals' experiences of their functioning. Self-report measures such as the WD-FAB are not designed to replace medical evidence but are intended to be used to complement and strengthen clinical performance-based measures of health and disability by providing valuable insights in better understanding disability not only as a biomedical issue but as a lived experience. Furthermore, discrepancies between the two types of assessments could prove to be diagnostically meaningful and indicative that further psychological evaluation for complex co-occurring physical and mental health conditions is required. Ultimately selecting between performance-based measures and self-reported measures of health, function and disability requires careful attention to the patient's individual needs, the clinical or research setting, and the practical resources available. Table 2 summarizes the overall strengths and limitations of each measure to consider when thinking about which to use in practice.

Table 2
www.frontiersin.org

Table 2. Summary of strengths and limitations of patient reported outcomes and performance measures.

Conclusion

Different measures offer different perspectives on health, function, and disability, each with their own benefits and tradeoffs. Performance-based testing supposedly provides a more objective perspective, but it is limited by the environment in which the testing is conducted, which does not often reflect the complexity or variability of real-world situations. Patient-reported outcome measures (PROMs) can be more comprehensive and more closely reflect respondents lived experiences, although there are sometimes concerns around reporting bias, which can be at least partially addressed through measure validation. In the context of disability evaluation, one key consideration is patient burden. PROMs can take less time to complete and can be completed in patients' own environments. This reduces the time, effort, and resources a patient would be required to put forth to make it to a testing site. Ultimately, it is important to use an approach that provides the most relevant measure to the disability evaluation purpose.

Author contributions

EM: Writing – review & editing, Writing – original draft. ER: Writing – review & editing, Writing – original draft. KC: Writing – review & editing, Writing – original draft. JP: Writing – review & editing, Writing – original draft. LC: Resources, Writing – original draft, Funding acquisition, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by the Intramural Research Program of the National Institutes of Health (NIH). The contributions of the NIH author(s) were made as part of their official duties as NIH federal employees, are in compliance with agency policy requirements, and are considered Works of the United States Government. However, the findings and conclusions presented in this paper are those of the author(s) and do not necessarily reflect the views of the NIH or the U.S. Department of Health and Human Services.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Gross DP, Asante AK, Miciak M, Battié MC, Carroll LJ, Sun A, et al. Are performance-based functional assessments superior to semistructured interviews for enhancing return-to-work outcomes? Arch Phys Med Rehabil. (2014) 95(5):807–15.e1. doi: 10.1016/j.apmr.2014.01.017

PubMed Abstract | Crossref Full Text | Google Scholar

2. Wind H, Gouttebarge V, Kuijer PPF, Sluiter JK, Frings-Dresen MH. The utility of functional capacity evaluation: the opinion of physicians and other experts in the field of return to work and disability claims. Int Arch Occup Environ Health. (2006) 79(6):528–34. doi: 10.1007/s00420-005-0081-4

PubMed Abstract | Crossref Full Text | Google Scholar

3. Rivera SC, Kyte DG, Aiyegbusi OL, Slade AL, McMullan C, Calvert MJ. The impact of patient-reported outcome (PRO) data from clinical trials: a systematic review and critical analysis. Health Qual Life Outcomes. (2019) 17:1–19. doi: 10.1186/s12955-019-1220-z

PubMed Abstract | Crossref Full Text | Google Scholar

4. Vodicka E, Kim K, Devine EB, Gnanasakthy A, Scoggins JF, Patrick DL. Inclusion of patient-reported outcome measures in registered clinical trials: evidence from ClinicalTrials.gov (2007–2013). Contemp Clin Trials. (2015) 43:1–9. doi: 10.1016/j.cct.2015.04.004

PubMed Abstract | Crossref Full Text | Google Scholar

5. Latham NK, Mehta V, Nguyen AM, Jette AM, Olarsch S, Papanicolaou D, et al. Performance-based or self-report measures of physical function: which should be used in clinical trials of hip fracture patients? Arch Phys Med Rehabil. (2008) 89(11):2146–55. doi: 10.1016/j.apmr.2008.04.016

PubMed Abstract | Crossref Full Text | Google Scholar

6. Stuifbergen AK, Morris M, Becker H, Chen L, Lee HY. Self-report versus performance measure in gauging level of function with multiple sclerosis. Disabil Health J. (2014) 7(4):413–8. doi: 10.1016/j.dhjo.2014.03.003

PubMed Abstract | Crossref Full Text | Google Scholar

7. Young Y, Boyd CM, Guralnik JM, Fried LP. Does self-reported function correspond to objective measures of functional impairment? J Am Med Dir Assoc. (2010) 11(9):645–53. doi: 10.1016/j.jamda.2009.12.084

PubMed Abstract | Crossref Full Text | Google Scholar

8. Rogers ES, Millner UC, Brandt D, Chan L, Jette A, Marfeo E, et al. Concordance of assessments of clients’ mental and behavioral health with in vivo assessment of work performance. Work. (2018) 61(1):11–20. doi: 10.3233/WOR-182776

PubMed Abstract | Crossref Full Text | Google Scholar

9. Bianchini KJ, Aguerrevere LE, Guise BJ, Ord JS, Etherton JL, Meyers JE, et al. Accuracy of the modified somatic perception questionnaire and pain disability Index in the detection of malingered pain-related disability in chronic pain. Clin Neuropsychol. (2014) 28(8):1376–94. doi: 10.1080/13854046.2014.986199

PubMed Abstract | Crossref Full Text | Google Scholar

10. Madans JH, Loeb ME, Altman BM. Measuring disability and monitoring the UN convention on the rights of persons with disabilities: the work of the Washington group on disability statistics. BMC Public Health. (2011) 11(Suppl 4):S4. doi: 10.1186/1471-2458-11-S4-S4

PubMed Abstract | Crossref Full Text | Google Scholar

11. Altman BM, Rasch EK. Purpose of an international comparable census disability measure. Int Meas Disabil. (2016) 61:55–68. doi: 10.1007/978-3-319-28498-9

Crossref Full Text | Google Scholar

12. Brandt DE, Houtenville AJ, Huynh MT, Chan L, Rasch EK. Connecting contemporary paradigms to the social security administration’s disability evaluation process. J Disabil Policy Stud. (2011) 22(2):116–28. doi: 10.1177/1044207310396509

Crossref Full Text | Google Scholar

13. Snyder CF, Aaronson NK, Choucair AK, Elliott TE, Greenhalgh J, Halyard MY, et al. Implementing patient-reported outcomes assessment in clinical practice: a review of the options and considerations. Qual Life Res. (2011) 21(8):1305–14. doi: 10.1007/s11136-011-0054-x

PubMed Abstract | Crossref Full Text | Google Scholar

14. Campbell R, Ju A, King MT, Rutherford C, Campbell R, Ju A, et al. Perceived benefits and limitations of using patient-reported outcome measures in clinical practice with individual patients: a systematic review of qualitative studies. Qual Life Res. (2021) 31(6):1597–620. doi: 10.1007/s11136-021-03003-z

PubMed Abstract | Crossref Full Text | Google Scholar

15. Kasper JD, Chan KS, Freedman VA. Measuring physical capacity. J Aging Health. (2017) 29(2):289–309. doi: 10.1177/0898264316635566

PubMed Abstract | Crossref Full Text | Google Scholar

16. Coman L, Richardson J. Relationship between self-report and performance measures of function: a systematic review. Can J Aging. (2006) 25(3):253–70. doi: 10.1353/cja.2007.0001

PubMed Abstract | Crossref Full Text | Google Scholar

17. Weil J, Hutchinson SR, Traxler K. Exploring the relationships among performance-based functional ability, self-rated disability, perceived instrumental support, and depression. Res Aging. (2014) 36(6):683–706. doi: 10.1177/0164027513517121

PubMed Abstract | Crossref Full Text | Google Scholar

18. Marfeo EE, Haley SM, Jette AM, Eisen SV, Ni P, Bogusz K, et al. Conceptual foundation for measures of physical function and behavioral health function for social security work disability evaluation. Arch Phys Med Rehabil. (2013) 94(9):1645–52. doi: 10.1016/j.apmr.2013.03.015

PubMed Abstract | Crossref Full Text | Google Scholar

19. Marfeo EE, McDonough C, Ni P, Peterik K, Porcino J, Meterko M, et al. Measuring work related physical and mental health function: updating the work disability functional assessment battery (WD-FAB) using item response theory. J Occup Environ Med. (2019) 61(3):219–24. doi: 10.1097/JOM.0000000000001521

PubMed Abstract | Crossref Full Text | Google Scholar

20. Marfeo EE, Ni P, McDonough C, Peterik K, Marino M, Meterko M, et al. Improving assessment of work related mental health function using the work disability functional assessment battery (WD-FAB). J Occup Rehabil. (2018) 28:190–9. doi: 10.1007/s10926-017-9710-5

PubMed Abstract | Crossref Full Text | Google Scholar

21. Marino ME, Meterko M, Marfeo EE, McDonough CM, Jette AM, Ni P, et al. Work-related measures of physical and behavioral health function: test-retest reliability. Disabil Health J. (2015) 8(4):652–7. doi: 10.1016/j.dhjo.2015.04.001

PubMed Abstract | Crossref Full Text | Google Scholar

22. Meterko M, Marino M, Ni P, Marfeo E, McDonough CM, Jette A, et al. Psychometric evaluation of the improved work-disability functional assessment battery. Arch Phys Med Rehabil. (2019) 100(8):1442–9. doi: 10.1016/j.apmr.2018.09.125

PubMed Abstract | Crossref Full Text | Google Scholar

23. MacDonald-Wilson K, Rogers ES, Anthony WA, MacDonald-Wilson K, Rogers ES, Anthony WA. Unique issues in assessing work function among individuals with psychiatric disabilities. J Occup Rehabil. (2001) 11(3):217–32. doi: 10.1023/A:1013078628514

PubMed Abstract | Crossref Full Text | Google Scholar

24. Rogers ES, Sciarappa K, Anthony WA. Development and evaluation of situational assessment instruments and procedures for persons with psychiatric disability. Voc Eval Work Adj Bull. (1991) 24(2):61–7.

Google Scholar

25. Millner UC, Brandt D, Chan L, Jette A, Marfeo E, Ni P, et al. Exploring counselor-client agreement on Clients’ work capacity in established and consultative dyads. J Employ Couns. (2020) 57(3):98–114. doi: 10.1002/joec.12148

Crossref Full Text | Google Scholar

26. Marfeo EE, Eisen S, Ni P, Rasch EK, Rogers ES, Jette A, et al. Do claimants over-report behavioral health dysfunction when filing for work disability benefits? Work. (2015) 51(2):187–94.24594538

PubMed Abstract | Google Scholar

27. McDonough CM, Ni P, Peterik K, Hershberg JD, Bell LR, Chan L, et al. Validation of the work-disability physical functional assessment battery. Arch Phys Med Rehabil. (2018) 99(9):1798–804. doi: 10.1016/j.apmr.2018.04.014

PubMed Abstract | Crossref Full Text | Google Scholar

28. Lechner DE, Page JJ, Sheffield G. Predictive validity of a functional capacity evaluation: the physical work performance evaluation. Work. (2008) 31(1):21–25. doi: 10.3233/WOR-2009-0835

PubMed Abstract | Crossref Full Text | Google Scholar

29. Heilbronner RL, Sweet JJ, Morgan JE, Larrabee GJ, Millis SR. American Academy of clinical neuropsychology consensus conference statement on the neuropsychological assessment of effort, response bias, and malingering. Clin Neuropsychol. (2009) 23(7):1093–129. doi: 10.1080/13854040903155063

PubMed Abstract | Crossref Full Text | Google Scholar

30. Heinemann AW, Fatone S, LaVela SL, Deutsch A, Peterson M, Slater BCS, et al. Performance-based and patient-reported outcome measures for custom ankle-foot orthosis users: reliability, validity, and sensitivity evidence. Disabil Rehabil. (2025) Jan20:1–12. doi: 10.1080/09638288.2025.2453100

Crossref Full Text | Google Scholar

31. Sasseville M. Electronic implementation of patient-reported outcome measures in primary health care: mixed methods systematic review. J Med Internet Res. (2025) 27(1):e63639. doi: 10.2196/63639

PubMed Abstract | Crossref Full Text | Google Scholar

32. Bloom DA, Kaplan DJ, Mojica E, Strauss EJ, Gonzalez-Lomas G, Campbell KA, et al. The minimal clinically important difference: a review of clinical significance. Am J Sports Med. (2023) 51(2):520–4. doi: 10.1177/03635465211053869

PubMed Abstract | Crossref Full Text | Google Scholar

33. Molino J, Harrington J, Racine-Avila J, Aaron R. Deconstructing the minimum clinically important difference (MCID). Orthop Res Rev. (2022) 14:35–42. doi: 10.2147/ORR.S349268

PubMed Abstract | Crossref Full Text | Google Scholar

34. Boyce MB, Browne JP, Boyce MB, Browne JP. Does providing feedback on patient-reported outcomes to healthcare professionals result in better outcomes for patients? A systematic review. Qual Life Res. (2013) 22(9):2265–78. doi: 10.1007/s11136-013-0390-0

PubMed Abstract | Crossref Full Text | Google Scholar

35. Greenhalgh J, Gooding K, Gibbons E, Dalkin S, Wright J, Valderas J, et al. How do patient reported outcome measures (PROMs) support clinician-patient communication and patient care? A realist synthesis. J Patient Rep Outcomes. (2018) 2(42):1–28. doi: 10.1186/s41687-018-0061-6

PubMed Abstract | Crossref Full Text | Google Scholar

36. Brouwer S, Dijkstra P, Stewart R, Göeken L, Groothoff J, Geertzen J. Comparing self-report, clinical examination and functional testing in the assessment of work-related limitations in patients with chronic low back pain. Disabil Rehabil. (2005) 27(17):999–1005. doi: 10.1080/09638280500052823

PubMed Abstract | Crossref Full Text | Google Scholar

37. Brodke DJ, Zhang C, Shaw JD, Cizik AM, Saltzman CL, Brodke DS. How do PROMIS scores correspond to common physical abilities? Clin Orthop Relat Res. (2022) 480(5):996–1007. doi: 10.1097/CORR.0000000000002046

PubMed Abstract | Crossref Full Text | Google Scholar

38. Shaw JD, McEntarfer R, Ferrel J, Greene N, Presson AP, Zhang C, et al. What does your PROMIS score mean? Improving the utility of patient-reported outcomes at the point of care. Global Spine J. (2022) 12(4):588–97. doi: 10.1177/2192568220958670

PubMed Abstract | Crossref Full Text | Google Scholar

39. Noonan VK, Lyddiatt A, Ware P, Jaglal SB, Riopelle RJ, Bingham CO, et al. Montreal accord on patient-reported outcomes (PROs) use series—paper 3: patient-reported outcomes can facilitate shared decision-making and guide self-management. J Clin Epidemiol. (2017) 89:125–35. doi: 10.1016/j.jclinepi.2017.04.017

PubMed Abstract | Crossref Full Text | Google Scholar

40. Katzan IL, Li Y, McCune M, Lapin B. Relationship between objective performance and patient-reported outcomes measurement information system physical function in patients with stroke. J Am Heart Assoc. (2025) 14(10):1–9. doi: 10.1161/JAHA.124.039366

Crossref Full Text | Google Scholar

Keywords: disability evaluation (MeSH), patient report outcome measure (PRO), work disability, physical function ability, mental health functioning

Citation: Marfeo E, Rasch EK, Coale K, Porcino J and Chan L (2025) Aligning metrics with meaning: considerations for measurement selection in disability evaluation. Front. Rehabil. Sci. 6:1657105. doi: 10.3389/fresc.2025.1657105

Received: 30 June 2025; Accepted: 25 July 2025;
Published: 14 August 2025.

Edited by:

Reuben Escorpizo, University of Vermont, United States

Reviewed by:

Şeyda Özal, Ankara Medipol University, Türkiye

Copyright: © 2025 Marfeo, Rasch, Coale, Porcino and Chan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Elizabeth Marfeo, ZWxpemFiZXRoLm1hcmZlb0B0dWZ0cy5lZHU=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.