- 1Division of Rheumatology, Guang’anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
- 2School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai, China
Objective: We developed a consensus-based core outcome set (COS) for Traditional Chinese Medicine clinical trials in rheumatoid arthritis (COS-TCM-RA) that to addresses heterogeneity in outcome measurement and reporting across studies.
Methods: We performed a comprehensive systematic review to establish a repository of clinical outcome set for TCM interventions in RA management. Then, a semi-structured interview was conducted to identify additional important outcomes from the patient’s perspective. This was followed by an online three-round Delphi survey conducted with professionals to prioritize and refine clinically relevant outcomes from a previously developed longlist. Core entries were finalized based on three Delphi surveys and one expert consensus meeting.
Results: The systematic review screened 2,959 records and included 69 eligible studies. From these, a comprehensive list of reported outcomes was extracted and standardized, which after standardization, yielded a final list of 52 unique outcomes for the preliminary consensus process. After semi-structured interviews, three Delphi survey rounds and one consensus meeting, the most important outcomes were determined for COS of Traditional Chinese Medicine for RA (COS-TCM-RA), including physician assessment of global status, Clinical laboratory tests, radiographic assessment, Signs and symptoms, quality of life and security incident.
Conclusion: This study developed the first COS-TCM-RA, comprising 11 indicators across five domains: overall disease evaluation, physical and chemical indicators, quality of life, TCM syndromes, and adverse events. By integrating internationally recognized RA assessment tools with patient-reported outcomes and TCM-specific measures, the COS-TCM-RA provides a standardized, multi-dimensional framework to enhance trial quality, improve comparability, and support the integration and internationalization of TCM in evidence-based medicine.
1 Introduction
Rheumatoid arthritis (RA) is a chronic, systemic autoimmune inflammatory disease that primarily affects the joints and periarticular soft tissues (1), causing progressive cartilage and bone damage and disability (2), with a global prevalence of 0.5 to 1%. Traditional Chinese Medicine (TCM), grounded in a unique theoretical framework and practiced for over 2,000 years, is widely applied in China for the clinical management of RA (3), and clinical evidence demonstrates its therapeutic potential (4). It adopts a holistic perspective and individualized therapeutic approaches, including herbal medicine, acupuncture, and moxibustion (4). Currently, integrated treatment strategies combining TCM with Western medicine are commonly used in clinical practice, with many patients selecting TCM as their primary or adjunctive treatment modality. An increasing body of clinical research suggests that these interventions may exert therapeutic effects through mechanisms such as modulation of the immune network and inhibition of pro-inflammatory cytokines (5, 6). However, standardized methods specifically designed to evaluate the efficacy of TCM for RA are currently lacking. Moreover, significant heterogeneity in outcome measures across clinical trials impedes consistent efficacy assessment, limiting evidence synthesis and comparability among studies.
COS represent the minimum outcomes that should be measured and reported in clinical trials in a specific area of healthcare (7). At present, COS are increasingly being developed to identify outcomes important to decision makers, improve outcome reporting, and standardize definitions and measures. The internationally recognised RA-OMERACT provides a foundation for standardised assessment of disease activity. However, existing international standards fail to encompass patients health-related quality of life, including subjective experiences across physical, psychological, cognitive and social dimensions, nor do they sufficiently integrate safety monitoring into efficacy assessments (8). At the patient-reported outcome level, there is a lack of assessment tools suitable for specific cultural contexts. By systematically incorporating TCM syndromes and safety indicators, and employing localised reporting tools suitable for Chinese patient populations. Consequently, this study aims to develop a core set of outcomes specifically for TCM treatment of RA (COS-TCM-RA). The establishment of a COS for RA in TCM clinical research (COS-TCM-RA) offers a promising solution to address the heterogeneity of outcome measures in current TCM trials in RA. Based on the COS-TCM-RA technical specifications, our research team convened a panel of experts from the Rheumatology Branch of the Chinese Society of Traditional Chinese Medicine to develop consensus on standardized endpoints for TCM clinical trials targeting RA. This standardized COSwill enhance comparability across studies, improve clinical trial quality and efficiency, and facilitate objective evaluation of TCM therapeutic effects. Furthermore, implementing COS in TCM research will promote standardization and internationalization, ultimately advancing TCM’s global recognition in evidence-based medicine.
2 Methods
This study was part of the RA Chinese Medicine Clinical Research Core Indicator Set project, which was registered in the COMET database.1 Permission for this study was obtained from the Ethics Committee of Guang’anmen Hospital, China Academy of Traditional Chinese Medicine (2022-170-KY).
2.1 Design: the protocol involved five steps
Step 1. We conducted a comprehensive systematic literature review to identify, evaluate, and synthesize the reported outcomes in clinical trials of Traditional Chinese Medicine (TCM) for RA.
Step 2. A semi-structured interview was conducted to explore the impact of the disease on patients’ daily living before and after treatment, as well as the improvements resulting from Traditional Chinese Medicine.
Step 3. The first two rounds Delphi survey were conducted to prioritize and condense the outcomes.
Step 4. A consensus meeting was held to further deliberate on the RA-TCM-COS.
Step 5. The final round of Delphi surveys to develop the RA-TCM-COS.
The study was conducted according to the Core Outcome Measures in Effectiveness Trials (COMET) Handbook: Version 1.0, a guideline for COS development (9). The methodological design of this study aligns with the consensus standards established by the COS initiative. It adopts the minimum standards for development proposed by COS-STAD as the primary framework for programme construction, clearly defining the scope of research (10), stakeholder composition, and consensus-building process (Figure 1).
2.2 The scope of the COS
The COS-TCM-RA is developed to be applied in clinical studies evaluating Traditional Chinese Medicine interventions for adults with rheumatoid arthritis. Its specific scope is defined as follows:
(1) Health Condition Rheumatoid arthritis, as defined by internationally accepted classification criteria (ACR/EULAR 2010 criteria).
(2) Target Population: Adult patients (≥18 years) diagnosed with RA, who had either gone through Traditional Chinese Medicine treatment previously or were currently in the course of receiving it.
(3) Interventions: All therapeutic modalities of Traditional Chinese Medicine used in the management of RA, including but not limited to Chinese herbal medicine, acupuncture, moxibustion, and Tuina. This COS is also intended for use in studies evaluating integrated TCM and Western medicine treatment strategies.
(4) Settings: The primary setting for application is clinical trials. It may also be used to standardize outcome assessment in routine clinical practice to evaluate the effectiveness of TCM for RA.
2.3 Step 1: systematic literature review
We systematically reviewed recent (2019–2024) clinical studies on TCM for RA to identify current evidence on treatment outcomes. We searched the following Chinese and English databases using MeSH terms and keywords: We searched three English databases (PubMed, The Cochrane Library, and Embase) and four Chinese databases (China National Knowledge Infrastructure [CNKI], SinoMed, China Science and Technology Journal Database and Wanfang Database). In order to collect comprehensive outcomes, we also searched two clinical trial registries2 to retrieve any outcomes used in clinical trials.
2.3.1 Inclusion criteria
(1) Study Types: randomized controlled trials, non-randomized controlled trials, case series, case–control, cohort studies, and systematic reviews evaluating TCM for RA.
(2) Study population: Adult patients (≥18 years) with a confirmed diagnosis of rheumatoid arthritis based on internationally accepted ACR/EULAR 2010 criteria.
(3) Interventions: Any TCM-related monotherapy or combination therapy, including but not limited to: Chinese herbal medicine, acupuncture, moxibustion, tuina, and medicated baths.
(4) Outcome measures: Studies that reported at least one clinical outcome measure.
2.3.2 Exclusion criteria
(1) Non-clinical studies (basic science, in vitro experiments).
(2) Review articles, commentaries, or editorials without original data.
(3) Degree theses and conference papers.
(4) Duplicate publications.
2.3.3 Data extraction
Two reviewers (SMG and YBB) independently extracted the data using a pre-designed data extraction form. Disagreements were resolved by discussion or consulting a third researcher (ZXQ). The extracted information included:
(1) Study characteristics: First author, publication year, journal, country, study design, sample size, funding sources.
(2) Participant characteristics: Age, gender, disease duration, RA diagnostic criteria used, TCM syndrome pattern.
(3) Intervention details: Type of TCM therapy, dosage, frequency, treatment duration, and details of any control interventions.
(4) Outcome measures: For every outcome reported in a study, we extracted the following:
Outcome domain and specific name (Disease Activity: DAS28; Pain: Visual Analogue Scale score; Physical Function: Health Assessment Questionnaire).
Measurement instrument or tool (DAS28-CRP, VAS, HAQ-Disability Index, Szkudlarek M Standard for ultrasound synovitis grading).
Specific metric used (change from baseline, endpoint value, proportion of patients achieving remission).
Time points of assessment (baseline, week 4, week 12, week 24) (see Figure 2).
2.4 Step 2: semi-structured interview
2.4.1 Participant recruitment and sample size
Outcome selection was guided by a multi-stakeholder consensus process. Open-ended interviews served as the foundational step for subsequent quantitative prioritization. The sample size was determined by integrating purposive stratified sampling with the principle of thematic saturation. Participants were recruited through purposive stratified sampling based on the following variables: (1) gender (male, female); (2) age group (18–40, 41–65, >65 years); (3) disease stage, defined as early (<2 years), mid-term (2–5 years), or late (>5 years); (4) disease activity, classified by DAS28-CRP score as low (<3.2), moderate (3.2–5.1), or high (>5.1); (5) educational level (primary school or below, junior high school, senior high school, or university and above). (Eligible patients were those who had experienced RA-related symptoms and had either previously undergone or were currently receiving TCM treatment, thereby possessing firsthand knowledge of treatment outcomes). Participants rated their experiences on a 9-point Likert scale, with higher scores representing greater disease-related distress at baseline or more substantial improvement following treatment. This study systematically gathered and analyzed patient-reported concerns and priorities regarding treatment efficacy. During qualitative data analysis, we employed Braun and Clarke’s reflexive thematic analysis (RTA) to construct a coding framework: two researchers independently conducted open coding of interview transcripts, refining initial concepts into consensus-based thematic domains through multiple rounds of comparative discussion. Finally, outcomes were ranked based on their scores to facilitate the analysis and comparison of perspectives across different patient subgroups.
2.4.2 Data collection
The questions asked during the interview revolved around Chinese medicine therapies for RA. After receiving clarification on the study purpose and the definition of outcomes, the patients were asked to suggest relevant outcomes based on their experience with RA, potentially important outcomes following Chinese medicine treatment and their reasoning based on these choices judgments. The outline items are as follows:
(1) When was your RA diagnosed?
(2) What are the three most troubling symptoms before treatment?
(3) What are the three conditions that show the most significant degree of improvement after treatment?
(4) Score these in turn.
Narrative data were indexed and charted to produce a thematic framework. These patient centered outcomes were reviewed and added to the candidate outcome list. Table 1 presents the percentage of patients indicating the degree of distress prior to treatment and the degree of improvement following treatment, along with the corresponding mean scores (rating 7 or higher on the 9-point Likert scale).
2.5 Step 3: two-round Delphi survey
2.5.1 First round of the Delphi survey
(1) Delphi survey panel assembly
Following principles of representativeness and clinical expertise, we recruited rheumatology specialists practicing Traditional Chinese Medicine (TCM), integrated TCM-Western medicine and evidence-based medicine methodologists. The expert panel comprised rheumatologists from 86 clinical centers across all seven geographic regions of China (North, Northeast, East, Central, South, Southwest, and Northwest), representing 29 provincial-level administrative divisions (provinces, municipalities, and autonomous regions). All experts were selected based on their high professional qualifications, with 5 ~ 40 years of clinical experience in the field of rheumatology. Their extensive expertise and experience further confirm their authority in the field, ensuring the credibility and relevance of the Delphi method in establishing the COS indicators. Detailed demographic characteristics of the participants can be found in Appendix 2.
(2) Rating and refining outcomes
For the first Delphi round, participants were provided with an electronic questionnaire containing outcome measures identified through systematic literature review and Semi-structured interview.
To evaluate the outcomes’ importance, we used a 9-point Likert scale, where 1, 2, and 3 meant “unimportant” 4, 5, and 6 meant “important, but not essential”; and 7, 8, and 9 meant “essential” (11). If participants were uncertain about the importance of an outcome indicator, they could select “Not sure” as a response option.
Outcomes that received consensus in the first round, defined as being rated as critical’ by at least 70% of participants and not important’ by fewer than 15%, were retained as potential core outcomes. Conversely, outcomes rated as “not important” by 70% of participants and “critical” by fewer than 15% were considered for exclusion from the COS. To adhere to the COS principle of a minimal set and to keep the subsequent questionnaire manageable, the core steering group agreed to retain no more than four outcomes from each domain for the second round. When a domain yielded more than four eligible outcomes, the core experts re-evaluated their clinical relevance and consensus ratings, and selected four representative indicators to advance to the second round.
In the first round, the questionnaire was sent to experts, who were asked to complete the survey within 2 weeks. A reminder email was sent at the end of the first week. After 2 weeks, we performed a statistical analysis of the first-round results (12). These results included: (a) general content of establishment of the COS for clinical studies: number of questionnaires sent, number of questionnaires recovered, number of participants completed, and proportion of questionnaires completed; (b) scoring content: the scoring results and distribution of each outcome (including the mean, minimum, and maximum values of scores and the consensus of different scores). Outcomes with an average score of 7 or above were ranked from high to low. After the first round, we provided all participants with aggregated statistics. Each expert could view the summary results via the electronic questionnaire interface. The full set of scores from the 117 experts is available in Appendix 1.
2.5.2 Second round of the Delphi survey
(1) Delphi survey panel assembly
The expert panel for this second round of the Delphi survey comprised 40 specialists, all holding the title of Associate Chief Physician or above (the vast majority being Chief Physicians). They were drawn from 31 high-calibre hospitals across 19 provinces, municipalities, and autonomous regions nationwide. With the goal of further refining and validating the core indicator set for TCM-COS-RA. Detailed demographic characteristics of the participants can be found in Appendix 2.
(2) The outcomes were further refined through the second Delphi round
All indicators that met the pre-specified consensus criteria for importance were carried forward to the consensus meeting for comprehensive deliberation, as the survey results alone were insufficient to reduce the list to a minimal core set.
2.6 Step 4: consensus meeting
The consensus panel consisted of 10 nationally recognized rheumatology experts with all holding leadership positions as standing committee members or above. Drawn from 7 provinces/municipalities across China, all members held chief physician titles with ≥25 years of clinical experience in either TCM or integrated TCM-Western Medicine practice, and had completed all Delphi survey rounds prior to the consensus meeting.
The expert panel conducted a hybrid (on-site/web-based) consensus meeting to: (1) clarify the COS framework, and (2) review the two-round Delphi survey results prior to final voting.
2.7 Step 5: final round of Delphi survey
The final round was conducted with a panel of 38 experts to finalize the outcome list. To ensure continuity and build upon prior consensus, the panel included 31 experts (81.6%) who had participated in all three Delphi rounds. A consensus exceeding 70% was reached for the final 11 item COS. The scoring data for the final round are detailed in Appendix 1.
2.8 Statistical analysis
Statistical analyses were primarily descriptive and followed pre-specified consensus rules. Continuous variables were summarized as median using medians or means ± standard deviations, and categorical variables were presented as counts and percentages. The Delphi survey used a 1–9 importance scale (1–3 = not important, 4–6 = important but not critical, 7–9 = critical). For each item, summary statistics (median, IQR, and the proportions of ratings in the 7–9 and 1–3 bands) were reported overall and by stakeholder strata (patients, clinicians, methodologists). Between rounds, participants received feedback of the prior round’s summaries to facilitate convergence. Consensus thresholds were pre-specified: items were included if ≥70% of ratings were in the 7–9 band and <15% were in the 1–3 band; excluded if the converse applied; all others were classified as undetermined and carried forward to the next round or discussed at the consensus meeting (this threshold aligns with conventions used in OMERACT and related consensus programs). For each round we recorded and reported the effective sample size, composition, and attrition. Sensitivity analyses were performed for borderline items (varying thresholds by ±5% or re-checking using overall-only summaries) to examine robustness. “Not sure” responses were excluded from denominators and presented descriptively only. At the consensus meeting, undetermined items were discussed under a structured agenda and decided by anonymous voting, applying the same inclusion threshold as in the Delphi.
2.8.1 Consensus criteria
The first round of the Delphi survey employed a unified consensus criterion: indicators receiving scores of 7–9 from ≥70% of experts and scores of 1–3 from <15% were retained as high-importance indicators; those receiving scores of 1–3 from ≥70% of experts and scores of 7–9 from <15% were excluded by consensus; all others were classified as “undetermined” and advanced to the subsequent round of deliberation. The second Delphi survey and consensus meeting also adhered to this standard.
3 Results
The systematic review identified 2,959 records, from which 69 studies met the eligibility criteria. After extracting all reported outcome measures and undergoing a standardization process to merge synonyms and remove duplicates, a final list of 52 unique outcome items was generated. These outcomes were categorized into seven outcome domains: overall disease evaluation; signs and symptoms; clinical laboratory; radiographic assessment; Traditional Chinese Medicine syndrome; quality of life and security incidents. Subsequently, Seventy-two consecutively recruited RA patients completed semi-structured interviews. Quantitative analysis revealed the following symptom severity hierarchy: (1) joint pain, (2) disease diagnostic criteria, (3) disease global assessment, (4) fatigue, (5) joint swelling, (6) elevated inflammatory markers and; (7) Health-related quality of life (HRQoL). Post-treatment clinical assessments demonstrated improvement sequencing: (1) disease global assessment; (2) joint pain; (3) joint swelling followed by disease diagnostic criteria, laboratory values, fatigue, and muscle pain. Consequently, the patient group identified joint pain, disease global assessment, and joint swelling as highly important outcomes. These patient-prioritized outcomes were incorporated into the candidate list for the subsequent Delphi process.
Then, in the Delphi questionnaire, these indicators derived from patient interviews were specifically annotated with their patient origin and high priority ranking. This provided crucial contextual information for expert scoring, guiding them to fully consider the value of patient experience when weighing the importance of indicators. In the first Delphi round, all 117 invited experts (100% response rate) completed the survey. Following the pre-defined consensus criteria, 22 indicators were retained for the second round. Forty-two experts were invited to the second round, with 40 responses received (95.2% response rate). However, the results still contained a substantial number of high-priority outcomes, preventing the derivation of a minimal core set. Therefore, a consensus meeting with 10 experts was held, after which a third Delphi round was initiated. This final round employed a continuous sampling approach, involving 38 experts, 31 (81.6%) of whom had participated in all three rounds (Table 2).
Table 2. Patients’ distress after illness and improvement after treatment outcomes by semi-structured interviews.
4 Discussion
This study developed the first COS for clinical trials of TCM in rheumatoid arthritis (RA). The development process has been registered with the COMET database. The COS-TCM-RA developed in this study integrates TCM-specific characteristics, internationally validated metrics and patient-reported outcomes, directly addressing the heterogeneity, and limited comparability of outcomes in existing TCM-RA clinical research. This provides a unified and authoritative outcome indicator framework for future TCM-RA clinical trials (13).
Conducted by a team of 117 authoritative experts from 86 hospitals nationwide, the research adopted rigorous methodologies including systematic review, patient interviews, three-round Delphi surveys, and expert consensus meetings, and ultimately established five indicator domains: overall disease evaluation, physical and chemical indicators, quality of life, TCM syndromes, and adverse events. The Disease Activity Score in 28 joints (DAS28), endorsed by the ACR and EULAR (14), remains the most widely used measure of RA activity. When paired with the Chinese-developed CPRI-RA (15), it allows comprehensive evaluation of joint function and patient experience. Notably, early detection and quantitative monitoring of both inflammation and structural damage are critical in RA management and in evaluating the effectiveness of interventions (16, 17). Laboratory markers such as the erythrocyte sedimentation rate and C-reactive protein are well-established indicators of systemic inflammation. Musculoskeletal ultrasound (MSUS) (18, 19) provides a standardized way to detect synovial inflammation, joint effusion, power Doppler signals and bone erosions using the Szkudlarek 0–3 semi-quantitative grading system (20). Beyond inflammation and joint destruction, RA is a chronic disabling disease that imposes long-term limitations on dressing, walking, hygiene and independent living (21). TCM-COS-RA includes the Health Assessment Questionnaire–Disability Index (HAQ-DI) as the primary tool for assessing physical function (21). Currently, the evaluation of TCM efficacy is constrained by the lack of quantitative methods for TCM-specific syndrome indicators, hindering comprehensive and objective assessments. Syndrome patterns, as the core indicators for TCM efficacy evaluation, are of particular importance for standardized assessment. Safety monitoring is a mandatory component in all clinical trials (22), including those involving Traditional Chinese Medicine. To ensure consistent evaluation, the TCM-COS-RA includes adverse events as a core outcome domain (23).
The internationally recognized evaluation metrics for RA, such as the DAS28 endorsed by ACR/EULAR and traditional disease activity assessment systems, generally have the limitation of single-dimensional evaluation (24). Their core frameworks mainly rely on objective Western medicine parameters such as joint mobility and inflammatory indicators, without integrating patient-reported outcomes and TCM syndrome differentiation into the standardized assessment system. Take the clinical decision tree proposed by the French Society of Rheumatology in 2004 as an example (25), its assessment system includes three objective indicators: disease activity (DAS28), rheumatoid factor status, and structural damage. By contrast, the RA-TCM-COS retains the quantitative rigour of evidence-based measures while adding patient-reported outcome scales to capture subjective experience. It adopts an integrated model of Chinese and Western medicine that prioritises TCM syndrome evaluation. This represents a shift from single-pathology indicators to a multidimensional assessment.
The RA-COS multidimensional assessment employs an integrated framework combining holistic and localised approaches alongside objective and subjective measures. It encompasses five key domains: comprehensive disease evaluation, physicochemical indicators, quality of life, Traditional Chinese Medicine syndromes, and adverse events. This standardised multidimensional assessment system not only comprehensively addresses clinical requirements for RA disease evaluation but also holds significant value for clinical implementation and application.
Research on cell membrane-mimetic nanoparticles for targeted anti-inflammatory drug delivery centres its core evaluation on inhibiting inflammatory pathways by enhancing delivery efficacy and safety (26). Jo et al. (27) systematic review of 415 randomised controlled trials demonstrated that East Asian herbal medicine with conventional medicine combination therapy (EACM) not only significantly reduced inflammation levels but specifically monitored gastrointestinal damage and hepatic burden. Liu et al.’s (28) research demonstrated that Wutou decoction combined with Western medicine significantly reduced inflammatory markers with a lower incidence of adverse reactions than controls, while proposing combination strategies to mitigate Aconitine risks. A self-assembled clematine-glycyrrhizic acid hydrogel modulated inflammatory pathways and protected osteochondral tissue while monitoring hepatic and renal function (29). These findings correspond to the chemical indicator and safety monitoring domains of the COS. Concurrently, the EACM meta-analysis adopted DAS28 as a secondary key outcome. Studies of aconite decoction reported improvements in both DAS28 and pain scores, aligning with the COS’s emphasis on disease activity and patient-reported outcomes. This provides a unified benchmark for diverse intervention pathways. Moreover, nano-bionic research explicitly targets improved treatment outcomes and quality of life (QoL). This emphasis on patient-reported outcomes and quality of life aligns with RA-TCM-COS’s philosophy of incorporating patient quality of life. Collectively, this evidence underscores the core requirement for comprehensive RA outcome assessment to balance efficacy and safety, alongside objective and subjective measures–precisely the core value of COS-TCM-RA. Current RA-related research often focuses on single dimensions or localised indicators; the COS indicator set holds promise for achieving comprehensive and precise clinical evaluation of RA.
In this study, the TCM syndrome was assessed using a specialized “Syndrome Effectiveness Evaluation Scale” for RA. It is important to note that this scale was developed and preliminarily validated by our research team in a prior methodological study (31) based on the eight core TCM syndrome patterns defined in the Chinese guideline for RA (30). In that study, the final scale demonstrated a reliability coefficient of 0.788 and a structural validity coefficient of 0.735. While this provides a solid foundation for its use, we acknowledge that further validation in broader, independent cohorts will be valuable to fully establish it as a standardized tool for TCM outcome assessment in clinical trials. On this basis, future research can carry out multi-centre and large-sample validation studies to further improve TCM-COS-RA. Moreover, it can explore the effects of different TCM treatment protocols, such as TCM compounding, acupuncture, tuina, on the core indexes, so as to provide a richer theoretical basis and practical guidance for the treatment of RA by TCM. Furthermore, strengthen the integration with Western medicine research, and establish a core index set for RA that combines Western and Chinese medicine to comprehensively improve the diagnosis and treatment of the disease.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by the Ethics Committee of Guang’anmen Hospital, China Academy of Traditional Chinese Medicine (2022-170-KY). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
BY: Writing – original draft. MS: Writing – original draft. JY: Investigation, Writing – original draft. XT: Methodology, Writing – original draft. CX: Data curation, Writing – original draft. JZ: Formal analysis, Writing – original draft. XZ: Validation, Writing – original draft. QJ: Supervision, Writing – review & editing. MH: Supervision, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was funded by the Central High-level Hospital of Traditional Chinese Medicine (no. HLCMHPP2023087), the CACMS Innovation Fund (Grant no. CI2021A01503), the Key R&D and Transformation Plan Project of Qinghai Province (no. 2025-QY-212), 2023 Kunlun High-end Innovation and Entrepreneurship Talents Project of Qinghai Province–Training of leading talents (no. QHKLYC-GDCXCY-2023-313), Young Clinical Talent of Traditional Chinese Medicine of the Eaglet Plan Supported by the China Association of Chinese Medicine (no. CYJH2024015). This study did not receive any specific grants from public, commercial, or non-profit funding agencies.
Acknowledgments
We also extend our sincere appreciation to all the experts and RA patients who participated in the Delphi surveys, without whose valuable contributions this research would not have been possible. The authors would like to thank reviewers and editors for their constructive comments and suggestions that helped improve the quality of this manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1690963/full#supplementary-material
Footnotes
References
1. Akram, M, Daniyal, M, Sultana, S, Owais, A, Akhtar, N, Zahid, R, et al. Traditional and modern management strategies for rheumatoid arthritis. Clin Chim Acta. (2021) 512:142–55. doi: 10.1016/j.cca.2020.11.003,
2. McInnes, IB, and Schett, G. The pathogenesis of rheumatoid arthritis. N Engl J Med. (2011) 365:2205–19. doi: 10.1056/NEJMra1004965,
3. Zhang, P, Li, J, Han, Y, Yu, XW, and Qin, L. Traditional Chinese medicine in the treatment of rheumatoid arthritis: a general review. Rheumatol Int. (2010) 30:713–8. doi: 10.1007/s00296-010-1370-0,
4. Guo, B, Zhao, C, Zhang, C, Xiao, Y, Yan, G, Liu, L, et al. Elucidation of the anti-inflammatory mechanism of Er Miao san by integrative approach of network pharmacology and experimental verification. Pharmacol Res. (2022) 175:106000. doi: 10.1016/j.phrs.2021.106000,
5. Liang, Y, Liu, M, Cheng, Y, Wang, X, and Wang, W. Prevention and treatment of rheumatoid arthritis through traditional Chinese medicine: role of the gut microbiota. Front Immunol. (2023) 14:1233994. Published 2023 Sep 14. doi: 10.3389/fimmu.2023.1233994,
6. Li, W, Yu, L, Li, W, Ge, G, Ma, Y, Xiao, L, et al. Prevention and treatment of inflammatory arthritis with traditional Chinese medicine: underlying mechanisms based on cell and molecular targets. Ageing Res Rev. (2023) 89:101981. doi: 10.1016/j.arr.2023.101981,
7. Verhoeven, AC, Boers, M, and van Der Linden, S. Responsiveness of the core set, response criteria, and utilities in early rheumatoid arthritis. Ann Rheum Dis. (2000) 59:966–74. doi: 10.1136/ard.59.12.966,
8. Bartlett, SJ, Hewlett, S, Bingham, CO 3rd, Woodworth, TG, Alten, R, Pohl, C, et al. Identifying core domains to assess flare in rheumatoid arthritis: an OMERACT international patient and provider combined Delphi consensus. Ann Rheum Dis. (2012) 71:1855–60. doi: 10.1136/annrheumdis-2011-201201,
9. Williamson, PR, Altman, DG, Bagley, H, Barnes, KL, Blazeby, JM, Brookes, ST, et al. The COMET handbook: version 1.0. Trials. (2017) 18:280. doi: 10.1186/s13063-017-1978-4,
10. Kirkham, JJ, Gorst, S, Altman, DG, Blazeby, JM, Clarke, M, Devane, D, et al. Core outcome set-standards for reporting: the COS-STAR statement. PLoS Med. (2016) 13:e1002148. doi: 10.1371/journal.pmed.1002148
11. Guyatt, GH, Oxman, AD, Kunz, R, Atkins, D, Brozek, J, Vist, G, et al. GRADE guidelines: 2. Framing the question and deciding on important outcomes. J Clin Epidemiol. (2011) 64:395–400. doi: 10.1016/j.jclinepi.2010.09.012,
12. Jiao, J, Chen, L, Peng, Y, Jia, Q, He, Y, Zhang, Y, et al. Development of a core outcome set for cardiovascular diabetology: a methodological framework. Front Endocrinol (Lausanne). (2023) 14:1271891. Published 2023 Dec 6. doi: 10.3389/fendo.2023.1271891,
13. Meissner, Y, Fischer-Betz, R, Andreoli, L, Costedoat-Chalumeau, N, de Cock, D, Dolhain, RJEM, et al. EULAR recommendations for a core data set for pregnancy registries in rheumatology. Ann Rheum Dis. (2021) 80:49–56. doi: 10.1136/annrheumdis-2020-218356,
14. Felson, D, Lacaille, D, LaValley, MP, and Aletaha, D. Re-examining remission definitions in rheumatoid arthritis: considering the 28-joint disease activity score, C-reactive protein level and patient global assessment. Ann Rheum Dis. (2022) 81:4–7. doi: 10.1136/annrheumdis-2021-221653,
15. Wang, F, Liu, J, Fang, Y, Wen, J, He, M, Han, Q, et al. Hypercoagulability in rheumatoid arthritis: a bibliometric analysis and retrospective data mining study. ACS Omega. (2023) 8:48522–34. doi: 10.1021/acsomega.3c08460
16. Zhang, F, Jonsson, AH, Nathan, A, Millard, N, Curtis, M, Xiao, Q, et al. Deconstruction of rheumatoid arthritis synovium defines inflammatory subtypes. Nature. (2023) 623:616–24. doi: 10.1038/s41586-023-06708-y,
17. Scott, DL, Wolfe, F, and Huizinga, TW. Rheumatoid arthritis. Lancet. (2010) 376:1094–108. doi: 10.1016/S0140-6736(10)60826-4,
18. Meznerics, FA, Kemény, LV, Gunther, E, Bakó, E, Dembrovszky, F, Szabó, B, et al. Multibiomarker disease activity score: an objective tool for monitoring rheumatoid arthritis? A systematic review and meta-analysis. Rheumatology (Oxford). (2023) 62:2048–59. doi: 10.1093/rheumatology/keac715
19. Sun, X, Deng, X, Xie, W, Wang, L, Wang, Y, and Zhang, Z. The agreement between ultrasound-determined joint inflammation and clinical signs in patients with rheumatoid arthritis. Arthritis Res Ther. (2019) 21:100. doi: 10.1186/s13075-019-1892-0,
20. D'Agostino, MA, Terslev, L, Aegerter, P, Backhaus, M, Balint, P, Bruyn, GA, et al. Scoring ultrasound synovitis in rheumatoid arthritis: a EULAR-OMERACT ultrasound taskforce-part 1: definition and development of a standardised, consensus-based scoring system. RMD Open. (2017) 3:e000428. doi: 10.1136/rmdopen-2016-000428
21. Rubbert-Roth, A, Aletaha, D, Devenport, J, Sidiropoulos, PN, Luder, Y, Edwardes, MD, et al. Effect of disease duration and other characteristics on efficacy outcomes in clinical trials of tocilizumab for rheumatoid arthritis. Rheumatology (Oxford). (2021) 60:682–91. doi: 10.1093/rheumatology/keaa259,
22. Zhao, Y, Gao, HH, Yan, Y, and He, H-j. Current status of research on treatment of osteonecrosis of femoral head with traditional Chinese medicine from the perspective of security-based Chinese medicine. World J Integr Trad West Med. (2023) 9:49–58. doi: 10.70976/j.2096-0964.2023.1.49-58
23. Aletaha, D, Neogi, T, Silman, AJ, Funovits, J, Felson, DT, Bingham, CO 3rd, et al. 2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European league against rheumatism collaborative initiative. Ann Rheum Dis. (2010) 69:1580–8. doi: 10.1136/ard.2010.138461
24. Benhamou, M, Rincheval, N, Roy, C, Foltz, V, Rozenberg, S, Sibilia, J, et al. The gap between practice and guidelines in the choice of first-line disease modifying antirheumatic drug in early rheumatoid arthritis: results from the ESPOIR cohort. J Rheumatol. (2009) 36:934–42. doi: 10.3899/jrheum.080762,
25. Felson, DT, Smolen, JS, Wells, G, Zhang, B, van Tuyl, LHD, Funovits, J, et al. American College of Rheumatology/European league against rheumatism provisional definition of remission in rheumatoid arthritis for clinical trials. Ann Rheum Dis. (2011) 70:404–13. doi: 10.1136/ard.2011.149765,
26. Ma, L, Jiang, X, and Gao, J. Revolutionizing rheumatoid arthritis therapy: harnessing cytomembrane biomimetic nanoparticles for novel treatment strategies. Drug Deliv Transl Res. (2025) 15:66–83. doi: 10.1007/s13346-024-01605-x,
27. Jo, HG, Seo, J, Baek, E, and Lee, D. Exploring the benefits and prescribing informations of combining east Asian herbal medicine with conventional medicine in the treatment of rheumatoid arthritis: a systematic review and multifaceted analysis of 415 randomized controlled trials. Pharmacol Res. (2025) 212:107616. doi: 10.1016/j.phrs.2025.107616,
28. Liu, P, Chen, Z, Huang, J, Wang, X, Chen, Y, He, X, et al. Wutou decoction: a latest review on molecular mechanisms, clinical studies, quality control, pharmacokinetic studies, pharmacological effects, toxicity, and target prediction. J Ethnopharmacol. (2025) 341:119307. doi: 10.1016/j.jep.2024.119307,
29. Jiang, H, Lu, Q, Huang, X, Zhang, H, Zeng, J, Wang, M, et al. Sinomenine-glycyrrhizic acid self-assembly enhanced the anti-inflammatory effect of sinomenine in the treatment of rheumatoid arthritis. J Control Release. (2025) 382:113718. doi: 10.1016/j.jconrel.2025.113718
30. Project Team of the “Guideline for Integrated Traditional Chinese Medicine Syndrome and Disease Diagnosis and Treatment of Rheumatoid Arthritis”, China Association of Chinese Medicine. Guideline for integrated traditional Chinese medicine syndrome and disease diagnosis and treatment of rheumatoid arthritis (2025). J Tradit Chin Med. (2025) 66:1842–56.
Keywords: rheumatoid arthritis, core outcome set, traditional Chinese medicine, Delphi survey, semi-structured interview
Citation: Yu B, Song M, Yang J, Tang X, Xia C, Zhang J, Zhao Xq, Jiang Q and Han M (2025) Developing a core outcome set for clinical trials of traditional Chinese medicine for rheumatoid arthritis. Front. Med. 12:1690963. doi: 10.3389/fmed.2025.1690963
Edited by:
Ryusuke Yoshimi, Yokohama City University, JapanReviewed by:
Xinchang Wang, Zhejiang Chinese Medical University, ChinaGawel Solowski, Bingöl University, Türkiye
Copyright © 2025 Yu, Song, Yang, Tang, Xia, Zhang, Zhao, Jiang and Han. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Quan Jiang, ZG9jdG9yanFAMTI2LmNvbQ==Man Han, aGFubWFuMTExQDEyNi5jb20=
†These authors have contributed equally to this work and share first authorship
Mengge Song1†