- 1The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
- 2School of Nursing and Health, Zhengzhou University, Zhengzhou, Henan, China
- 3The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
- 4School of Nursing, Fudan University, Shanghai, China
Background: The Patient-Reported Outcomes Measurement Information System 29-item (PROMIS-29) profile is commonly used to measure patients’ self-reported health status. This study examined the psychometric properties of the Chinese version of the PROMIS-29 (v2.1) in older individuals.
Methods: Cognitive interviews and psychometric evaluations were conducted between January and August 2023. Cognitive interviews were conducted in accordance with the Cognitive Interviewing–Reporting Framework. The Chinese version of the PROMIS-29 was revised based on feedback from respondents and experts. The structural validity of the PROMIS-29 was evaluated using confirmatory factor analysis (CFA). The convergent and discriminant validities were assessed by calculating Spearman’s rank correlation and comparing known group differences. Reliability was assessed with Cronbach’s α, and item response theory (IRT)-based psychometric assessment was performed using Rasch models for unidimensionality, local independence, item characteristic curve (ICC) matrices, and model fit. Differential item functioning (DIF) was used to examine demographic bias.
Results: A total of 606 cases with a mean age of (72.88 ± 7.2) were included. The CFA showed acceptable convergence. PROMIS-29 was significantly correlated with comparable domains in the legacy questionnaires. Cronbach’s α of the instrument ranged from 0.82 to 0.95. The Rasch models explained 44.5 and 73.2% of the variance, respectively. Local independence analysis showed that the maximum standardized residual correlation coefficients between items within the short form ranged from 0 to 0.69 in absolute value. All items demonstrated excellent discriminatory power. A good Rasch model fit was revealed in terms of the outfit MNSQ, infit MNSQ, and overall outfit MNSQ. Most items showed acceptable item characteristic curve matrices and did not have statistically significant bias (DIF).
Conclusion: The Chinese version of the PROMIS-29 showed acceptable psychometric properties in community-dwelling older adults with multimorbidities. These findings suggest that this questionnaire can be beneficial in the assessment of symptoms and function in the older population.
1 Introduction
With the global population aging, multimorbidity among older adults is an increasing concern worldwide (1). The American Geriatrics Society Expert Panel developed a guiding principle for the care of older adults with multimorbidity, which suggests the importance of patient perspective for clinical decision making, health research, and policy making (2, 3). Information collected directly from patients could provide authentic and reliable support for primary care workers to make medical decisions and manage disease, as well as provide evidence for mitigating adverse outcomes, such as disability and debilitation, in community-dwelling older adults with multimorbidities (4). Patient-reported outcomes (PROs) are assessments of patients’ health status reported directly by patients, without interpretation by healthcare professionals or others (5). PROs are also associated with physical and psychological health-related quality of life (HRQoL) (6). Community-dwelling older adults with chronic diseases often experience several psychological, physical, and social dysfunctions (7). Accurate measurements of PROs can provide important information to assist in the education and self-management of community-dwelling older adults with multimorbidities, which remain to be investigated.
Recently, the Patient Reported Outcomes Measurement Information System (PROMIS) has attracted attention as a new and efficient instrument for evaluating person-centered health. The PROMIS 29-item profile (PROMIS-29 v2.1) was developed using an item response theory, calibrated, and scored based on contemporary samples. The PROMIS-29 has an advantage over the HRQOL tools, as it measures a more comprehensive range of symptoms and functions (8). It covers commonly reported health and functional problems, such as physical function, pain interference, anxiety, depression, fatigue, sleep disturbances, and the ability to participate in social roles and activities (3). Owing to its brevity and breadth, the PROMIS-29 has been used widely in community and clinical samples, such as in those with chronic pulmonary diseases, hemophilia, chronic lower back pain, and multiple chronic conditions (8–12). Robust psychometric properties have been set up in these contexts. PROMIS-29 was translated from English into simplified Chinese by the members of PROMIS National Center China (PNC-China) strictly following the Functional Assessment of Chronic Illness Therapy translation method (13, 14).
However, research on the use of PROMIS tools in community-dwelling older adults with multimorbidities is limited. Further, PROMIS-29 v2.1 has not been validated or applied in Chinese community-dwelling older adults with multimorbidities. The measurement domains of the PROMIS-29 include the most commonly reported health and functional issues among older adults with chronic conditions, which may offer a relatively comprehensive self-assessment of health status (15). From this perspective, the PROMIS-29 may be a promising tool for symptom management in primary healthcare and clinical research in community-dwelling older adults with multimorbidities. The purpose of this study was to conduct cognitive interviews using a simplified Chinese version of the PROMIS-29 and perform a psychometric evaluation on a sample of community-dwelling older adults with multimorbidities in mainland China.
2 Materials and methods
2.1 Study design and settings
This was a cross-sectional study. Two phases were conducted: cognitive interviews and psychometric evaluation of the Chinese version of the PROMIS-29 v2.1 scale.
2.2 Phase I: cognitive interviews
Cognitive interviews were conducted in strict accordance with the cognitive interviewing reporting framework (CIRF) before the psychometric evaluation (16). There were 25 community older adults with multimorbidities from Zhengzhou City (including 13 men and 12 women aged 62–90 years) who were interviewed. The primary technique used was verbal probing and thinking out loud. The interview materials were coded using the Question Appraisal System (QAS-99) evaluation system (17). Twenty respondents were included in the first round of cognitive interviews. Most respondents found that the content of the scale, including guidelines, number of items, and fonts, was well designed and easy to understand. Some respondents raised doubts about 12 of the items. A total of six items focused on “clarification,” which were related to grammar and word usage. For example, responses to four items in the domain of the ability to participate in social roles and activities were difficult for respondents. The aged tended to ignore the negative meaning of “难以完成” of the questions in their answers, showing significant logical errors. Therefore, we changed the original negative sentence to an affirmative sentence while reversing the scores of the options in which the semantic context of the options matched, keeping the logical relationship between the scores and concepts of the domain unchanged. In addition, three items related to “knowledge/memory,” including words, and two items that were difficult to understand were selected due to different socio-cultural backgrounds. The Chinese version of the PROMIS-29 was revised based on feedback from respondents and experts. Most items were easy to understand and conformed to the culture. Five respondents in the second round did not raise any questions, and they completed the questionnaire easily within 10 min or less. Thus, the scale was ready for psychometric testing.
2.3 Phase II: psychometric evaluation
2.3.1 Sampling
Eligible community-dwelling older adults were selected for this study using convenience sampling. Patients were recruited for the study if they met the following criteria: (1) The presence of two or more of the 18 common geriatric chronic diseases included in the functional comorbidity index (FCI) scale (18), (2) age ≥ 60 years, (3) permanent residents of the community who have lived in the community for more than 6 months, and (4) those who provided voluntary informed consent. The exclusion criteria included the following: (1) severe disease and inability to cooperate with the study and (2) cognitive dysfunction or severe mental disorder.
2.3.2 Measures
2.3.2.1 Sociodemographic information questionnaire
A sociodemographic information questionnaire was developed to collect sociodemographic and clinical data regarding gender, age, marital status, education level, pre-retirement/current employment, co-residence, health insurance, smoking, alcohol consumption, physical activity, and number of chronic diseases. The number of chronic diseases was determined using the FCI index. The FCI scale was developed by Groll et al. (18) in 2005 to assess comorbidity status in the aged. These included degenerative disc disease, arthritis, asthma, osteoporosis, diabetes, angina pectoris, chronic obstructive pulmonary disease/respiratory distress syndrome/emphysema, heart attack/myocardial infarction, congestive heart failure, heart attack, stroke/transient ischemic attack, neurological diseases, peripheral vascular diseases, upper digestive tract diseases, depression, anxiety/panic disorder, hearing impairment, visual impairment, and obesity (BMI > 30 kg/m2). Each type of disease scored 1 point; no disease scored 0 points; the scores were added to the FCI index. The FCI checklist also includes an additional item for manually adding chronic diseases that patients may have other than the 18 mentioned above, which will be included in the final score. A higher score indicates several comorbidities. Sociodemographic data were obtained from electronic health records by trained nursing researchers. Clinical data were collected, in part, from electronic health records and supplemented with self-reported patient data.
2.3.2.2 The patient-reported outcomes measurement information system 29-item profile (PROMIS-29)
PROMIS-29 v2.1 comprises 29 items in seven domains: physical function, anxiety, depression, fatigue, sleep disturbance, ability to participate in social roles and activities, pain interference, and pain intensity. We used a 5-point Likert scale (range: 1–5) to measure the severity or frequency of symptoms. The single pain intensity items were scored separately with a response scale ranging from 0 (no pain) to 10 (most severe pain imaginable) (9). Item scores in each domain were summarized and converted into a T-score metric, with values of 50 (SD = 10) representing a mean of the U. S. general population1. A higher score implied greater amplitude of the measured concept (19).
To evaluate the PROMIS-29 tool as a HRQOL instrument that can be used to assess the quality of life and as a symptom instrument for comprehensive assessment of health status, the convergent validity of the PROMIS-29 was tested using the Short Form-12 Health Survey (SF-12), the eight-item Somatic Symptom Scale (SSS-8), Patient Health Questionnaire-2 (PHQ-2), and the Generalized Anxiety Disorder-2 (GAD-2) (20–26).
2.3.3 Data collection
The survey was conducted between January and August 2023. Recruitment of the study population was done during a visit to the family doctor, and informed consent was obtained. We also designed a WeChat mini program suitable for the aging population who can use smartphones. Age-friendly design has been applied to this mini-program. For example, the font of the page was enlarged, and each question and option of the scale was set up with a voice assistant to minimize difficulties for the aged in filling in the questionnaire. We tested the feasibility of the WeChat applet based on the ISO 9241-11:2018 usability framework (27). Older adults could choose paper or electronic questionnaires according to their preferences. For older adults who were unable to complete the questionnaire by themselves (such as those with visual impairments), the researcher relayed the questionnaire content verbally without further explanation. Patients were informed confidentiality of the research.
2.3.4 Statistical analysis
Descriptive statistics were calculated for sample characteristics and study variables. Continuous variables are presented by means and standard deviations, while categorical variables are represented by counts and percentages. Floor or ceiling effects were considered noteworthy, which were defined as greater than 20% of the response for the proportion of minimum or maximum, respectively (28).
Confirmatory factor analysis (CFA) was performed for the structural validity of the PROMIS-29. The fitness of the proposed seven-factor model to the data was evaluated using the comparative fit index (CFI), χ2/degree of freedom (χ2/df), standardized root-mean-squared residual (SRMR), root mean square error of approximation (RMSEA), and Akaike’s information criterion (AIC). A CFI > 0.90, χ2/df < 3, and SRMR < 0.05 indicated a good fit (29, 30). RMSEA < 0.05 indicated a good fit, and < 0.08 was acceptable (31). The smaller the AIC value, the better the model fit. Items with factor loadings equal to or higher than the standard of 0.4 were retained (32).
Regarding structural validity, known-group validity was evaluated for groups with different expected scores: gender, age group (compared with three age groups), and the number of chronic illnesses (FCI). These groups were picked in view of a literature review and the authors’ clinical judgment (10). The Mann–Whitney U test was used for comparisons between two groups, and the Kruskal–Wallis H rank sum test was used for comparisons between multiple groups.
Due to the non-normality of the sample, Spearman’s rank correlations between scores of the PROMIS-29 and their corresponding legacy PRO measures were calculated to evaluate convergent validity. Correlation coefficients of 0.9–1.0 indicate very strong, 0.7–0.89 strong, 0.5–0.69 moderate, and 0.3–0.49 weak relationships (11). The correlations between the PROMIS-29 domain scores and dissimilar constructs of legacy measures supported discriminant validity; these correlations were expected to be less than 0.60 (11). The reliability of the measures was evaluated by Cronbach’s α coefficient and split-half reliability. Cronbach’α of 0.6–0.7 indicated acceptable levels, and ≥ 0.8 indicated very good reliability levels (33).
IRT-based psychometric assessment was done using Rasch models for unidimensionality, local independence, ICC matrices, and model fit. The absolute values of the maximum standardized residual correlation coefficients < 0.7 were considered to have good local independence (34). Generally, more than 40% of the variance explained by the Rasch model indicated unidimensionality of the domain (35). The fit of each item to the Rasch model was evaluated using infit and outfit mean-square statistics (MNSQ). Similarly, the fit of each domain was evaluated based on the overall outfit MNSQ. Av value of 0.5 to 1.5 is generally regarded as indicative of a satisfactory fit for outfit MNSQ, infit MNSQ, and overall outfit MNSQ values (36). Point-measure correlation (PTMEA Corr.) is an indicator of item discrimination, with values greater than 0.40 considered acceptable, indicating that the item positively correlates with the underlying latent trait and effectively discriminates between respondents of different ability levels (36).
To check whether the measures delivered biased results across various populations, the differential item functioning (DIF) of each item was checked on the PROMIS-29 scale. DIF analyses were performed to test measurement invariance and determine whether the probability of patients with the same characteristics from different groups responding to certain items is different (37). Items with significant DIF values indicated measurement bias; differences of ≥ 0.5 logits in item difficulties were considered meaningful (38). No DIF was expected for any of the variables since the universal usability of the PROMIS-29. Accordingly, measurement invariance was evaluated by pondering DIF of the PROMIS social function short forms in view of age, gender, and education.
Statistical significance was set at p value < 0.05. Data were analyzed using SPSS27.0, Amos Graphics24.0, and Winsteps3.72.0.
3 Results
3.1 Descriptive statistics
A total of 628 patients were surveyed in this study; 22 questionnaires with missing information were excluded and 606 valid questionnaires were analyzed. The mean age of the participants was (72.88 ± 7.2) years. In the cohort, 25.6% of the patients had attained high school and equivalent education. Most of the participants were married (74.8%), and most patients were employed as workers pre-retirement (33.7%) and staff of public institutions (29.4%). The mean FCI score of this multiple chronic disease group was 3.25. Most patients had healthy lifestyles and did not smoke or drink alcohol (Table 1).
Descriptive statistics of the patients are presented in Table 2. There were significant ceiling effects for physical function (44.4%) and floor effects for pain interference (41.3%). Except for these, no ceiling or floor effects were observed for any of the other short forms. The mean T-scores for each PROMIS-29 domain are shown in Table 2.
3.2 Structural validity
The goodness-of-fit indices and the main model fit results of the CFA are presented in Table 3 and Figure 1. The original seven-factor model structure within the PROMIS-29 was confirmed based on the following statistics (CFI = 0.95, SRMR = 0.05, RMSEA = 0.05, χ2/df = 2.85). The standardized regression coefficients were 0.438–0.954, indicating acceptable structural validity of the short form for each domain.
Figure 1. Confirmatory factor analysis for the seven-factor model of the PROMIS-29. F1–F7: physical function, anxiety, depression, fatigue, sleep disturbances, ability to participate in social roles and activities, and pain interference, respectively.
3.3 Convergent validity and discriminant validity
As shown in Table 4, the correlations between most PROMIS-29 domains and comparable PRO measures were significantly strong (r > 0.60). The sleep disturbance domain was strongly correlated with the SSS-8 domain of sleep trouble, with a Spearman coefficient of 0.824. Thus, convergent validity was achieved.
However, correlations between domains of PROMIS-29 and its conceptually different legacy PRO measures or domains of PROMIS-29 were not significantly correlated (r < 0.5). For example, the PROMIS-29 physical function score correlated with GAD-2 and PHQ-2 to a low degree (r < 0.4), and between PROMIS-29 anxiety and depression and SF-12 PF. Inter-factor correlations were weaker for most PROMIS-29 domains. A weak correlation was seen between the scores for PROMIS-29 domains related to “ability to participate in social roles and activities” and “sleep disturbance” (r = 0.230, p < 0.01), which supported satisfactory discriminant validity (Table 4).
To examine the known-group validity of PROMIS-29, the scores were compared across patients of different genders, ages, and number of chronic illness (FCI) statuses. Compared to men, women had higher T-scores for anxiety, depression, fatigue, sleep disturbance, and pain interference, and lower T-scores for their ability to participate in social roles and activities. With increasing age, the T-score for physical function and the ability to play social roles and participate in social activities decreased. The FCI scores were significantly different in all seven domains of PROMIS-29, with populations with FCI scores ≥ 4 tending to have more terrible health conditions than those with FCI scores ≤ 3. This indicates that the Chinese version of PROMIS-29 can generally distinguish between different groups and has acceptable known-group validity (Additional File 1).
3.4 Reliability
For the reliability analysis, the internal consistency coefficients and Guttman split-half coefficients were calculated. The Cronbach’s α values and split-half coefficients of the PROMIS-29 were above the standard of 0.70 (Table 3).
3.5 Item-level psychometric properties (Rasch analysis)
The item-level statistics from the Rasch analysis are presented in Table 5. All items demonstrated excellent discriminatory power, with Point-Measure Corr. values ranging from 0.61 to 0.94, well above the 0.40 threshold. All the items except Item Numbers “B1” and “G1” showed fit statistics ranging from 0.5 to 1.5, which was an acceptable fit. In addition, the items were located across a wide range of difficulties, ranging from −1.63 to +1.76 logits.
3.6 Unidimensionality
As shown in Table 6, the Rasch model explained more than 40% of the variance (ranging from 47.8 to 71.5%). The number of characteristic roots in the first control was less than 3 (ranging from 1.5 to 2.5). Thus, the Rasch model analysis illustrated the unidimensionality of the domains from a side perspective.
3.7 Local Independence and monotone increasing hypothesis
The absolute values of the maximum standardized residual correlation coefficients between the items within each short form ranged from 0 to 0.69. All seven short forms were considered to have good local independence. The item characteristic curves satisfy the assumption of monotonic progressivity (Figure 2).
3.8 Measurement invariance
Measurement invariance was evaluated by examining the DIF. The results showed that item 2 “my sleep was refreshing” and item 4 “I had difficulty falling asleep” of sleep disturbance had functional differences across age (<60 years old and 70–79 years old), indicating that there is cross-age measurement inequivalence in these items. Except for these two items, no significant DIF existed in the other PROMIS-29 items in patients with different backgrounds (gender, age, and education), suggesting that the instrument provided unbiased results in this population overall (Tables 7–9).
4 Discussion
The current study aimed to validate the PROMIS-29 profile in community-dwelling older adults with multimorbidities in mainland China. Unlike Huang’s study (24), which applied the Chinese version of PROMIS-29 to a postoperative population, this study examined the psychometric properties of the Chinese version of PROMIS-29 at a deeper micro level of a psychological measurement characteristic test. According to the PROMIS-29 guidelines and CIRF, a rigorous method was used to conduct cognitive interviews with the Chinese version of the PROMIS-29 questionnaire, which improved the accuracy and practicality of its application in community-dwelling older adults with multimorbidities.
No significant sensitivity bias was reported, such as “assumptions” and “sensitivity/bias” (13). However, the Chinese version of the PROMIS-29 would have comprehension bias when applied to the older population with chronic diseases due to the limitations of gender and age distribution. The Chinese version of the PROMIS-29 was designed for conceptual and semantic equivalence with the original version through cognitive interviews. Psychometric testing demonstrated that the measures reduced patient burden, along with sufficient reliability and construct validity in patients with multimorbidities.
Descriptive statistics showed that patients had lower scores for physical function, anxiety, depression, and fatigue, and higher scores for the ability to participate in social roles and activities, compared to the general population in the USA. The participants’ ability to participate in social roles and activities scored the highest of the seven health domains, similar to the results of studies by Kang and Ellen (9, 10). These results could be because the participants in the studies were outpatients or community residents. In addition, this score was higher than that of postoperative patients included in the research by Huang, and patients with breast cancer included in the study by Cai (13, 24), suggesting that community-dwelling older adults with multimorbidities might have better social function. However, the fatigue score of the participants was the lowest of the seven health domains, which was consistent with the situation of the complete sample in a Dutch study (10). The anxiety score was similar to that of a chronic population in a Dutch study (10), but the depression score was higher.
The present study found significant ceiling effects for physical function (44.4%) and floor effects for pain interference (41.3%). Similar results have been presented in previous studies (39, 40). Generally, the interference of the ceiling and floor effects should be considered according to the purpose of the study. If the short form is aimed at recognizing people with low levels of physical function and checking the relationship between low physical function and other long-term outcomes (such as frailty), then a lack of differences at higher levels of physical functioning is acceptable (39). The performance in these domains by patients who do not exhibit symptoms of depression, anxiety, or fatigue resembles that in the general population (40).
The initial seven-factor structure of the PROMIS-29 was supported by the CFA results. The model fit for sleep disturbances was not robust, which has also been found in prior studies (24). In this form, questions regarding “sleep quality” and “refreshment of sleep” could have some measurement error. Similar findings were reported by Huang et al. and Kang et al. (9, 24). Respondent bias in the interpretation of items or an overlap between these two items might have led to these errors. During the cognitive interview stage, we found that patients have difficulty understanding sleep issues. Sleep issues were also reflected in DIF. More research is needed to understand these effects better.
Moderate or strong correlations between the PROMIS-29 and the measures of similar constructs supported convergent validity. Additionally, discriminant validity was confirmed by negligible correlations between the PROMIS-29 and measures of dissimilar constructs. The five-symptom domains of the PROMIS-29 showed a strong correlation with similar concept legacy PRO measures (> 0.6), supporting the use of the PROMIS-29 questionnaire as a symptom assessment scale. Existing researches on PROs or QOL in older adults with chronic diseases frequently use the SF-36 and EQ-5D as measurement tools. The PROMIS-29 profile directly measures important symptoms, for instance, sleep problems, fatigue, anxiety, and depression, significantly reducing the measurement burden on patients (8, 24) compared to traditional PROs. Thus, the PROMIS-29 questionnaire may be more suitable for this patient population.
The results obtained from the questionnaire are reliable as determined by Cronbach’s α coefficient ≥ 0.80. These results have found a high Cronbach’s α coefficient for PROMIS profile domains, consistent with other studies (10, 24).
IRT-based analysis could provide more comprehensive item-level psychometric properties than the CTT. An analysis of the Rasch model showed that the domains satisfied the monotonic increasing assumption, confirming the unidimensional local independence of the seven domains with acceptable levels of reliability.
The PROMIS-29 profile is not reported to have measurement bias across individuals who differ in gender, age, and education (41). The study found that prominent DIF existed in different age groups for two items from the sleep disturbance domain, which was not reported in the study in the Dutch population with chronic diseases (10). Due to evolving sleep patterns and expectations with advancing age (42), there might be an item and concept understanding difference between community-dwelling older adults with multimorbidities of under and over age of 70 years. A previous study that analyzed the PROMIS-29 profile in older people with multiple chronic diseases found that only a small number of item pairs demonstrated statistically significant DIF, and all had negligible effect sizes (8). It is crucial to emphasize that the mere statistical significance of DIF does not necessarily equate to practical or clinical significance in the overall score interpretation. Two DIFs could be deemed acceptable. Consequently, we recommend retaining both items in the Chinese PROMIS-29. The instrument, as a whole, provided unbiased results across gender, age, and education groups for the vast majority of its items, supporting its overall fairness for use in our target population. Future studies with larger sample sizes could further monitor the performance of these items.
4.1 Limitations
This study has some limitations. First, our study findings are not generalizable because it was limited to convenience sampling and a population of community-dwelling older adults with multimorbidities from Central China only. Secondly, due to the cross-sectional design, the researchers did not assess the reactivity and interpretability of different clinical states. Additionally, due to the burden on the respondents, we did not compare PROMIS-29 with more commonly used PRO tools, such as EQ-5D or SF-36. Despite these limitations, the results of this study support the validity of the Chinese version of the PROMIS-29 (v2.1) profile for use in community-dwelling older adults with multimorbidities.
5 Conclusion
This study applied PROMIS-29 to the Chinese community-dwelling older individuals with multimorbidities earlier, providing an internationally standardized measurement tool for the health assessment of older comorbidities in Chinese communities, which is helpful for future research comparisons between populations in different regions of the world. As the use of the PROMIS-29 questionnaire increases, its use in primary care for HRQoL measurement in older adults with multimorbidities would serve as a valid and reliable tool. More applications, such as the development of computerized adaptive testing of the measures and electronic aging-friendly designs, need to be developed for use in community-dwelling older adults with chronic diseases. Future research may include direct comparisons with widely used measurement tools to further determine the relative utility of the Chinese version of the PROMIS-29 profile. In addition, with the PROMIS-29 scale incorporated into community chronic disease management, collecting multimodal data from physiological, psychological, and social health dimensions of patients can report their health changes to the system as soon as possible, achieving accurate evaluation and diagnosis of follow-up needs.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by the Ethics Committee of Zhengzhou University (Registration No. ZZUIRB2023-036). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
TZ: Data curation, Investigation, Project administration, Software, Validation, Writing – original draft, Writing – review & editing. YZ: Conceptualization, Resources, Supervision, Writing – review & editing. QC: Investigation, Writing – review & editing. MZ: Conceptualization, Resources, Writing – review & editing. XH: Investigation, Resources, Writing – review & editing. JC: Project administration, Writing – review & editing. HX: Methodology, Formal analysis, Funding acquisition, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by the Nursing Special Project of the First Affiliated Hospital of Zhengzhou University (HLKY2023009) and Research Topic of the Henan Provincial Federation of Social Sciences in 2023 (GKL-2023-569).
Acknowledgments
We express our great appreciation to the aged in the community who participated in the survey for this study and the community health workers who offered help.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1631442/full#supplementary-material
Footnotes
References
1. Chen, X, Giles, J, Yao, Y, Yip, W, Meng, Q, Berkman, L, et al. The path to healthy ageing in China: a Peking University-lancet commission. Lancet. (2022) 400:1967–2006. doi: 10.1016/S0140-6736(22)01546-X
2. Guiding principles for the care of older adults with multimorbidity: an approach for clinicians: American Geriatrics Society expert panel on the Care of Older Adults with multimorbidity. J Am Geriatr Soc. (2012) 60:E1–E25. doi: 10.1111/j.1532-5415.2012.04188.x
3. Black, N, Burke, L, Forrest, CB, Sieberer, UH, Ahmed, S, Valderas, JM, et al. Patient-reported outcomes: pathways to better health, better services, and better societies. Qual Life Res. (2016) 25:1103–12. doi: 10.1007/s11136-015-1168-3
4. Porter, I, Davey, A, Gangannagaripalli, J, Evans, J, Bramwell, C, Evans, P, et al. Integrating patient reported outcome measures (PROMs) into routine nurse-led primary care for patients with multimorbidity: a feasibility and acceptability study. Health Qual Life Outcomes. (2021) 19:133. doi: 10.1186/s12955-021-01748-2
5. U.S. Department of Health and Human Services FDA Center for Drug Evaluation and Research; U.S. Department of Health and Human Services FDA Center for Biologics Evaluation and Research; U.S. Department of Health and Human Services FDA Center for Devices and Radiological Health. Guidance for industry: patient-reported outcome measures: use in medical product development to support labeling claims: draft guidance. Health Qual Life Outcomes. (2006) 4:79. doi: 10.1186/1477-7525-4-79
6. Bao, XY, Xie, YX, Zhang, XX, Peng, X, Huang, JX, Du, QF, et al. The association between multimorbidity and health-related quality of life: a cross-sectional survey among community middle-aged and elderly residents in southern China. Health Qual Life Outcomes. (2019) 17:107. doi: 10.1186/s12955-019-1175-0
7. Eckerblad, J, Waldréus, N, Stark, ÅJ, and Jacobsson, LR. Symptom management strategies used by older community-dwelling people with multimorbidity and a high symptom burden - a qualitative study. BMC Geriatr. (2020) 20:210. doi: 10.1186/s12877-020-01602-y
8. Rose, AJ, Bayliss, E, Huang, W, Baseman, L, Butcher, E, García, RE, et al. Evaluating the PROMIS-29 v2.0 for use among older adults with multiple chronic conditions. Qual Life Res. (2018) 27:2935–44. doi: 10.1007/s11136-018-1958-5
9. Kang, D, Lim, J, Kim, BG, Nam, H, Kim, Y, Kang, E, et al. Psychometric validation of the Korean patient-reported outcome measurement information system (PROMIS)-29 profile V2.1 among patients with chronic pulmonary diseases. J Thorac Dis. (2021) 13:5752–64. doi: 10.21037/jtd-21-591
10. Elsman, EBM, Roorda, LD, Smidt, N, de Vet, HCW, and Terwee, CB. Measurement properties of the Dutch PROMIS-29 v2.1 profile in people with and without chronic conditions. Qual Life Res. (2022) 31:3447–58. doi: 10.1007/s11136-022-03171-6
11. van Balen, EC, Haverman, L, Hassan, S, Taal, EM, Smit, C, Driessens, MH, et al. Validation of PROMIS Profile-29 in adults with hemophilia in the Netherlands. J Thromb Haemost. (2021) 19:2687–701. doi: 10.1111/jth.15454
12. Khutok, K, Janwantanakul, P, Jensen, MP, and Kanlayanaphotporn, R. Responsiveness of the PROMIS-29 scales in individuals with chronic low back pain. Spine (Phila Pa 1976). (2021) 46:107–13. doi: 10.1097/BRS.0000000000003724
13. Cai, T, Wu, F, Huang, Q, Yu, C, Yang, Y, Ni, F, et al. Validity and reliability of the Chinese version of the patient-reported outcomes measurement information system adult profile-57 (PROMIS-57). Health Qual Life Outcomes. (2022) 20:95. doi: 10.1186/s12955-022-01997-9
14. Eremenco, SL, Cella, D, and Arnold, BJ. A comprehensive method for the translation and cross-cultural validation of health status questionnaires. Eval Health Prof. (2005) 28:212–32. doi: 10.1177/0163278705275342
15. Ploeg, J, Canesi, M, K, DF, C, MA, Kaasalainen, S, Markle-Reid, M, et al. Experiences of community-dwelling older adults living with multiple chronic conditions: a qualitative study. BMJ Open. (2019) 9:e23345. doi: 10.1136/bmjopen-2018-023345
16. Boeije, H, and Willis, G. The cognitive interviewing reporting framework (CIRF) towards the harmonization of cognitive testing reports. Methodol Eur J Res Methods Behav Soc Sci. (2013) 9:87–95. doi: 10.1027/1614-2241/a000075
17. Jia, M, Gu, Y, Chen, YH, Tu, JY, Liu, Y, Lu, HQ, et al. A methodological study on the combination of qualitative and quantitative methods in cognitive interviewing for cross-cultural adaptation. Nurs Open. (2022) 9:705–13. doi: 10.1002/nop2.1121
18. Groll, DL, To, T, Bombardier, C, and Wright, JG. The development of a comorbidity index with physical function as the outcome. J Clin Epidemiol. (2005) 58:595–602. doi: 10.1016/j.jclinepi.2004.10.018
19. Hanmer, J, Cella, D, Feeny, D, Fischhoff, B, Hays, RD, Hess, R, et al. Selection of key health domains from PROMIS(®) for a generic preference-based scoring system. Qual Life Res. (2017) 26:3377–85. doi: 10.1007/s11136-017-1686-2
20. Ware, JJ, Kosinski, M, and Keller, SD. A 12-item short-form health survey: construction of scales and preliminary tests of reliability and validity. Med Care. (1996) 34:220–33. doi: 10.1097/00005650-199603000-00003
21. Windsor, TD, Rodgers, B, Butterworth, P, Anstey, KJ, and Jorm, AF. Measuring physical and mental health using the SF-12: implications for community surveys of mental health. Aust N Z J Psychiatry. (2006) 40:797–803. doi: 10.1080/j.1440-1614.2006.01886.x
22. Petrelis, M, and Domeyer, PR. Translation and validation of the Greek version of the somatic symptom Scale-8 (SSS-8) in patients with chronic low back pain. Disabil Rehabil. (2022) 44:4467–73. doi: 10.1080/09638288.2021.1900415
23. Gierk, B, Kohlmann, S, Kroenke, K, Spangenberg, L, Zenger, M, Brähler, E, et al. The somatic symptom scale-8 (SSS-8): a brief measure of somatic symptom burden. JAMA Intern Med. (2014) 174:399–407. doi: 10.1001/jamainternmed.2013.12179
24. Huang, W, Wu, Q, Zhang, Y, Tian, C, Huang, H, Huang, S, et al. Preliminary evaluation of the Chinese version of the patient-reported outcomes measurement information system 29-item profile in patients with aortic dissection. Health Qual Life Outcomes. (2022) 20:94. doi: 10.1186/s12955-022-02000-1
25. Kroenke, K, Spitzer, RL, Williams, JB, and Löwe, B. The patient health questionnaire somatic, anxiety, and depressive symptom scales: a systematic review. Gen Hosp Psychiatry. (2010) 32:345–59. doi: 10.1016/j.genhosppsych.2010.03.006
26. Zhang, YL, Liang, W, Chen, ZM, Zhang, HM, Zhang, JH, Weng, XQ, et al. Validity and reliability of patient health Questionnaire-9 and patient health Questionnaire-2 to screen for depression among college students in China. Asia Pac Psychiatry. (2013) 5:268–75. doi: 10.1111/appy.12103
27. ISO. (2024) ISO 9241-11:2018 ergonomics of human-system interaction — Part 11: Usability: Definitions and concepts. Available online at: http://www.iso.org/cms/render/live/en/sites/isoorg/contents/data/standard/06/35/63500.html.
28. van der Putten, JJ, Hobart, JC, Freeman, JA, and Thompson, AJ. Measuring change in disability after inpatient rehabilitation: comparison of the responsiveness of the Barthel index and the functional independence measure. J Neurol Neurosurg Psychiatry. (1999) 66:480–4. doi: 10.1136/jnnp.66.4.480
29. Cai, T, Huang, Q, Wu, F, and Yuan, C. Psychometric evaluation of the PROMIS social function short forms in Chinese patients with breast cancer. Health Qual Life Outcomes. (2021) 19:149. doi: 10.1186/s12955-021-01788-8
30. Wolf, MG, and McNeish, D. dynamic: An R package for deriving dynamic fit index cutoffs for factor analysis. Multivariate Behav Res. (2023) 58:189–94. doi: 10.1080/00273171.2022.2163476
31. Zhirong, W. Application of structural equation model and AMOS software. Appl Mech Mater. (2014) 168:1577–9. doi: 10.4028/www.scientific.net/AMM.687-691.1577
32. Carle, AC, Riley, W, Hays, RD, and Cella, D. Confirmatory factor analysis of the patient reported outcomes measurement information system (PROMIS) adult domain framework using item response theory scores. Med Care. (2015) 53:894–900. doi: 10.1097/MLR.0000000000000413
33. Ursachi, G, Horodnic, IA, and Zait, A. How reliable are measurement scales? External factors with indirect influence on reliability estimators. Procedia Econ Financ. (2015) 20:679–86. doi: 10.1016/S2212-5671(15)00123-9
34. Hung, M, Baumhauer, JF, Brodsky, JW, Cheng, C, Ellis, SJ, Franklin, JD, et al. Psychometric comparison of the PROMIS physical function CAT with the FAAM and FFI for measuring patient-reported outcomes. Foot Ankle Int. (2014) 35:592–9. doi: 10.1177/1071100714528492
35. Huang, Y, Zhang, X, Yang, Y, Zhang, W, Cai, T, Zhu, D, et al. Validation of the psychometric properties of the simplified Chinese version of PROMIS physical function short form 6b. J Nurs Sci. (2021) 36:29–32. doi: 10.3870/j.issn.1001-4152.2021.20.029
36. Tennant, A, and Conaghan, PG. The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Care Res. (2007) 57:1358–62. doi: 10.1002/art.23108
37. Crins, M, Terwee, CB, Ogreden, O, Schuller, W, Dekker, P, Flens, G, et al. Differential item functioning of the PROMIS physical function, pain interference, and pain behavior item banks across patients with different musculoskeletal disorders and persons from the general population. Qual Life Res. (2019) 28:1231–43. doi: 10.1007/s11136-018-2087-x
38. Rouquette, A, Hardouin, JB, Vanhaesebrouck, A, Sébille, V, and Coste, J. Differential item functioning (DIF) in composite health measurement scale: recommendations for characterizing DIF with meaningful consequences within the Rasch model framework. PLoS One. (2019) 14:e215073. doi: 10.1371/journal.pone.0215073
39. McMullen, K, Bamer, A, Ryan, CM, Schneider, JC, Gibran, N, Stewart, BT, et al. Validation of PROMIS-29 domain scores among adult burn survivors: a National Institute on Disability, Independent Living, and Rehabilitation Research burn model system study. J Trauma Acute Care Surg. (2022) 92:213–22. doi: 10.1097/TA.0000000000003365
40. Tang, E, Ekundayo, O, Peipert, JD, Edwards, N, Bansal, A, Richardson, C, et al. Validation of the patient-reported outcomes measurement information system (PROMIS)-57 and −29 item short forms among kidney transplant recipients. Qual Life Res. (2019) 28:815–27. doi: 10.1007/s11136-018-2058-2
41. Cook, KF, Jensen, SE, Schalet, BD, Beaumont, JL, Amtmann, D, Czajkowski, S, et al. PROMIS measures of pain, fatigue, negative affect, physical function, and social function demonstrated clinical validity across a range of chronic conditions. J Clin Epidemiol. (2016) 73:89–102. doi: 10.1016/j.jclinepi.2015.08.038
42. Djonlagic, I, Mariani, S, Fitzpatrick, AL, Van Der Klei, V, Johnson, DA, Wood, AC, et al. Macro and micro sleep architecture and cognitive performance in older adults. Nat Hum Behav. (2021) 5:123–45. doi: 10.1038/s41562-020-00964-y
Glossary
PROMIS-29 - The patient-reported outcomes measurement information system 29-item profile
CIRF - The cognitive interviewing reporting framework
HRQoL - Health-related quality of life
SF-12 - Short form-12 health survey
SSS-8 - 8-Item somatic symptom scale
GAD-2 - Generalized anxiety disorder-2
PHQ-2 - Patient health questionnaire-2
CFA - Confirmatory factor analysis
PROs - Patient reported outcomes
PF - Physical functioning
SF - Social functioning
PCS - Physical component summary
MCS - Mental component summary
CFI - Comparative fit index
SRMR - Standardized root-mean-squared residual
RMSEA - Root mean square error of approximation
AIC - Akaike’s information criterion
IRT - Item response theory
ICC - Item characteristic curve
FCI - Functional comorbidity index
DIF - Differential item functioning
Keywords: PROMIS-29, psychometric validation, multimorbidity, older adults, item response theory
Citation: Zhao T, Zhang Y, Cui Q, Zhang M, Han X, Chen J and Xu H (2025) Psychometric validation of the Chinese version of the PROMIS-29 profile in community-dwelling older adults with multimorbidities. Front. Public Health. 13:1631442. doi: 10.3389/fpubh.2025.1631442
Edited by:
Anna Panzeri, University of Padua, ItalyReviewed by:
Xin Chen, Tongji University, ChinaKagiso Seakamela, University of Limpopo, South Africa
Copyright © 2025 Zhao, Zhang, Cui, Zhang, Han, Chen and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Huiping Xu, eGh1aXBpbmcxMDMwQDE2My5jb20=
Qinghua Cui2