Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Med., 18 June 2025

Sec. Healthcare Professions Education

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1601116

Is the use of standardized patients more effective than role-playing in medical education? A meta-analysis


Jingyuan XiaoJingyuan XiaoXinjian Fu
Xinjian Fu
  • Guangzhou Xinhua University, Guangzhou, China

Aim: This study aimed to compare the effectiveness of using Standardized patients (SPs) and Role-playing (RP) in medical education. It is crucial to understand the differences in the effects of SPs and RP. However, the existing measurement results are varied, and the findings lack robustness.

Methods: We collected the results of various experiments and conducted a meta-analysis. In total, 10 articles and 27 effect sizes were included in the analysis, involving 721 students.

Results: The meta-analysis results showed that compared to the RP method, using SPs significantly improved students’ self-confidence (effect size = 0.415). However, in other aspects, the two methods showed similar outcomes. We observed that the effectiveness of SPs teaching methods increased over time.

Conclusion: SPs effectively enhance students’ self-confidence by simulating diverse roles, situations, and real-world work scenarios. This study provides a comprehensive comparative perspective on RP and SPs.

1 Introduction

The rapid development of economic globalization has driven the transformation of the medical field (1). As a result, the demands on medical workers have risen significantly, and their tasks have become increasingly complex (2). The traditional teaching method keeps medical students away from frontline work and fails to mobilize students’ enthusiasm (3). Therefore, it is essential to provide students with scenario-based and experiential teaching methods.

Standardized Patients (SPs) and Role-Playing (RP) are commonly used teaching methods in medicine and education (4). SPs are trained individuals who simulate various medical cases and clinical scenarios (5). Through interactions with SPs, students can practice physical examination, diagnosis, communication, patient management, and medical decision-making skills. RP allows learners to assume roles and engage with different situations to enhance their skills and cognition (6). As a situational simulation teaching method, RP has been widely applied across various disciplines and fields (7). In medical education, both RP and SP methods represent a shift from a teacher-centered to student-centered approach. These methods simulate clinical settings, helping students apply theory and build skills like observation and communication.

At present, there have been many studies related to SPs and RP, such as the situational strategy and the reshaping strategy of learning interest (8). Different research environments, student groups, and implementation details may lead to differences in the effectiveness of the strategies in different contexts (9). Although existing studies have examined SPs and RP separately, there is still no clear consensus on which approach is more effective. This study aims to evaluate the relative effectiveness of SPs and RP in medical education. Specifically, it examines whether SPs are more effective than RP overall, and whether significant differences exist between the two methods across various dimensions of learning outcomes.

2 Literature review

Many scholars have explored the impact of medical education using SPs. For instance, Ross et al. (10) found that combining SPs with lectures effectively educated nursing students about elder abuse. Similarly, Kim and Kim (11) reported that an 8-h psychiatric nursing simulation significantly improved students’ self-directed learning and self-efficacy. Webster (12) also confirmed the effectiveness of SPs in enhancing therapeutic communication skills. Ok et al. (13) showed that SPs reduced anxiety and improved communication in mental health training, while Ha (14) highlighted improvements in students’ self-confidence, nursing abilities, and interest in learning.

While the benefits of SPs are well-documented, some scholars have focused on RP as an alternative method that can also improve student learning. RP provides simulated scenarios that aim to improve students’ empathy and emotional intelligence. However, some studies have questioned the effectiveness of RP. Bayne (15) argued that regulatory factors might hinder learning outcomes, while Delnavaz et al. (16) reported no significant improvements in students’ skills or theoretical knowledge. Lee and Kim (17) similarly found no notable gains in emotional intelligence.

Despite the independent use of either SPs or RP in many studies, both methods share similarities. Several studies have compared the effectiveness of these two methods. Mounsey et al. (18) observed no significant differences between RP and SPs in post-teaching videos. Yeung (19) found that RP and SPs differed in their effects on self-efficacy but not on performance in adverse event disclosure training. Taylor et al. (20) and Schlegel et al. (21) both reported no significant differences between RP and SP in terms of communication skills and self-efficacy, respectively, suggesting comparable effectiveness in these domains.

Scholars have also conducted a literature review of SPs and RP. For example, Ma et al. (22) found that SP simulations significantly enhanced nursing students’ communication, self-efficacy, problem-solving, and learning satisfaction. Gelis et al. (23) showed that RP effectively improved communication skills and was more cost-effective than SPs. Dalwood et al. (24) found that peer simulation, where healthcare students role-play as patients, was more effective in enhancing empathy. Chua et al. (25) further supported this by demonstrating that simulation-based interventions effectively enhanced medical students’ empathy.

Although some systematic reviews have examined RP or SPs individually, few have compared the effectiveness of the two methods. Meta-analysis enables the systematic synthesis of empirical studies to obtain comprehensive effect estimates. Accordingly, this study employs a meta-analytic approach to examine the differences in the effectiveness of RP and SPs in medical education. The results can inform educators’ decisions in selecting appropriate instructional methods to enhance teaching outcomes.

3 Research design

3.1 Research methods and tools

A meta-analysis is a statistical method used to synthesize and integrate results from multiple independent studies (26). By combining data from various sources, it enables researchers to draw more comprehensive and reliable conclusions (27), while reducing the instability caused by small sample sizes or random errors in individual studies. This approach also incorporates a broader range of samples, leading to more robust and generalizable findings. Given these advantages, this study adopts meta-analysis as the primary research method. Comprehensive Meta-Analysis (CMA) software, a specialized tool for conducting meta-analyses, provides an all-in-one solution for data analysis (28). In addition to its analytical functions, CMA offers tools for testing heterogeneity and assessing bias, making it well-suited for this study’s data processing and analysis.

3.2 Search strategy

The procedure for selecting studies followed the guidelines of the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (29). The literature analyzed in this study was primarily sourced from databases such as EBSCO, ProQuest, ScienceDirect, SpringerLink, Web of Science, and PubMed. The literature search utilized two main groups of keywords: (1) RP-related keywords, including “role-play,” “role-playing,” “role playing,” and “RP”; (2) SP-related keywords, including “standardized patient,” “SP,” and “SPs”; and (3) keywords related to outcomes, such as “performance,” “effectiveness,” “effect,” “achievement,” and “outcome.” Boolean operators, “AND” and “OR,” were used to combine keywords both within and between groups, respectively.

3.3 Eligibility criteria

The literature was screened according to the following criteria: (1) The study employed an experimental or quasi-experimental design; (2) Both an experimental group and a control group were included, with the intervention involving the use of RP and SPs in teaching; (3) Sufficient statistical data were provided, including sample sizes, means, variances, and other relevant metrics for both groups; (4) The participants were medical personnel. Based on these criteria, 2, 137 papers were retrieved and screened, with the process shown in Figure 1. After thoroughly reviewing the titles, abstracts and full texts, the two researchers collaborated to identify 10 studies that would be included in the meta-analysis.

FIGURE 1
www.frontiersin.org

Figure 1. Flowchart of literature inclusion.

3.4 Quality appraisal

Among the included studies, five were quasi-experiments and five were randomized controlled trials (RCTs). Each study was independently reviewed by two investigators. The quasi-experimental studies were evaluated using the JBI Checklist for Quasi-Experimental Studies, which includes nine items with four response options: “Yes,” “No,” “Unclear,” and “Not Applicable.” For the RCTs, the Risk of Bias 2 (RoB 2) tool was employed. This tool assesses the risk of bias across five domains: selection bias, performance bias, detection bias, attrition bias, and reporting bias. Each domain was rated as having a “low risk,” “high risk,” or “unclear risk” of bias.

3.5 Data encoding and effect size

After the literature retrieval, two independent reviewers coded the characteristics of the collected original literature (30). Two independent reviewers coded the included studies based on a predefined coding framework. Any discrepancies between the reviewers were resolved through joint consultation. Inter-rater reliability was assessed using Cohen’s Kappa, with a value of 0.81 indicating substantial agreement. The coding categories of this study included author, publication year, experiment type, sample size, publication type, country, and learning effectiveness type. The experiments were classified into two types: RCTs (Randomized Controlled Trials) and quasi-experiments. Sample sizes were categorized into three groups: small (≤ 50 participants), medium (51 The experiments were classifi(> 100 participants). The publication types were categorized into two groups: journal articles and theses. The studies were conducted in Germany, Indonesia, Korea, China, and the United States. The learning outcomes examined in this study include communication, emotional response, overall performance, knowledge acquisition, self-efficacy, professional competence, and self-confidence. Detailed coding information is presented in Table 1.

TABLE 1
www.frontiersin.org

Table 1. Summary of included studies.

Effect sizes are statistical indicators that quantify the magnitude of differences or associations between variables or treatments of interest (31). Different disciplines often employ different effect size metrics (32). In educational research, commonly used measures include the standardized mean difference (SMD), as well as Cohen’s d and Hedges’ g. While Cohen’s d may overestimate effect sizes in small samples (33), Hedges’ g includes a correction for small sample bias, making it more appropriate for studies with limited sample sizes (19). Given the relatively small sample sizes in the included studies, this meta-analysis adopted Hedges’ g as the effect size metric.

4 Results

4.1 Risk of bias in studies

Using the JBI checklist for assessment, most studies were found to have a low risk of bias, as shown in Table 2. The RoB2 assessment identified four studies with a low risk in the randomization process. However, many studies lacked allocation concealment and random assignment, largely due to the nature of topic selection. Regarding deviations from the intended interventions, one study had a low risk, three raised some concerns, and one was classified as high risk. This was primarily because both participants and intervention providers were generally aware of the intervention. For missing outcome data, five studies demonstrated a low risk, with no major issues identified. All studies showed a low risk in outcome measurement, as they employed rigorous analytical methods. Finally, four studies were rated as having a low risk in the selection of reported results, as illustrated in Figure 2.

TABLE 2
www.frontiersin.org

Table 2. Quality assessment of the included studies of quasi-experimental study.

FIGURE 2
www.frontiersin.org

Figure 2. Assessment of the risk of bias in the included articles.

4.2 Test for bias

Publication bias refers to the tendency for studies with statistically significant findings to be more likely published (43). In contrast, non-significant results may remain unpublished due to editorial decisions or authors’ self-selection, leading to incomplete or skewed representations of the research landscape. To ensure the scientific validity of meta-analytic results, it is therefore essential to assess and account for publication bias in the primary studies included in the analysis.

This study employed funnel plots as well as Begg’s tests to assess publication bias (44). Funnel plots visually help detect publication bias. As shown in Figure 3, the funnel plot for the effect size of understanding level is symmetrically distributed around the axis, with g = 0.09. Additionally, Begg and Mazumdar’s rank correlation test (Kendall’s τ = –0.179, p = 0.18) and Egger’s regression test for the intercept [intercept = –3.189, 95% CI (–6.631, 0.251), t = 1.90, df = 25, p = 0.067] yielded p-values greater than 0.05. These results suggest that there is no significant publication bias in the sample, supporting its use for further analysis.

FIGURE 3
www.frontiersin.org

Figure 3. The funnel plot shows publication bias among outcome type the included studies. Each point represents a study, distributed around the center line.

4.3 Heterogeneity test

Heterogeneity tests assess the extent to which effect sizes vary across independent studies (45). These tests determine whether significant differences exist between studies’ effect sizes and whether it is appropriate to pool their results. The Q test and I2 statistic are commonly used to evaluate heterogeneity (46). If the Q value exceeds K−1 (where K is the number of effect sizes), and the p-value is ≤ 0.05, significant heterogeneity is indicated, and a random-effects model is recommended. An I2 value greater than 50% also suggests substantial heterogeneity (47). In the present study, the Q value was 100.963 (p < 0.05), and the I2 value was 74.248%. Given that the number of effect sizes was 27, the Q value exceeds the threshold of 26, and the I2 result confirms high heterogeneity. Therefore, a random-effects model was employed for analysis.

4.4 Overall effect test results

Based on the effect size interpretation framework proposed by Cohen (48), values above 0.2 are regarded as small, those exceeding 0.5 as medium, and values greater than 0.8 as large. Table 3 shows that the pooled effect size is 0.073, which does not reach statistical significance (P < 0.001), indicating no overall significant difference between using SPs and RP.

TABLE 3
www.frontiersin.org

Table 3. Overall effect size.

4.5 Subgroup analysis of learning outcomes

Subgroup analyses related to learning effects are shown in Table 4. The effect sizes (ES) for the influence of the virtual patient teaching method on learners’ communication, emotions, general performance, knowledge, professional competence, self-confidence, and self-efficacy were as follows: 0.200 (p > 0.05), –0.115 (p > 0.05), –0.089 (p > 0.05), 0.221 (p > 0.05), –0.068 (p > 0.05), 0.415 (p < 0.001), and –0.087 (p > 0.05). In this analysis, communication refers to learners’ ability to engage effectively with patients, including active listening and clear verbal expression (49). Knowledge denotes the learners’ understanding and retention of medical information (50). Confidence captures learners’ self-perceived assurance in carrying out clinical tasks (51). Among these, only the effect on self-confidence reached statistical significance, albeit with a small effect size. The heterogeneity for self-confidence was 0, indicating no significant variability. The heterogeneity test for the subgroup comparisons of different types of learning effects showed no impact on variance (QB = 8.111, p > 0.05).

TABLE 4
www.frontiersin.org

Table 4. Subgroup analyses related to learning effects.

4.6 Meta-analysis related to publication type

Subgroup analyses related to publication type are presented in Table 5. The results of the group effect test showed that QB = 4.531, p < 0.05, indicating a significant difference in the impact across different publication types. Within the publication types, the ES for articles and theses were –0.065 (p > 0.05) and 0.274 (p < 0.001), respectively. Although the effect size for articles is negative, it is not statistically significant. However, the effect size for theses reaches statistical significance, with a small positive effect. The degree of heterogeneity for this comparison was 0, indicating no variability.

TABLE 5
www.frontiersin.org

Table 5. Subgroup analysis with respect to publication type.

4.7 Subgroup analysis related to sample size

Subgroup analysis concerning sample size is shown in Table 6. The results of the group effect test indicated that QB = 4.165, p > 0.05. The effect sizes for large, medium, and small sample sizes were 0.264 (p < 0.001), 0.155 (p > 0.05), and –0.228 (p > 0.05), respectively. Only the large sample size reached statistical significance and had a positive effect. Regarding heterogeneity, the degree of heterogeneity for the large sample size was 0, indicating no significant variability.

TABLE 6
www.frontiersin.org

Table 6. Subgroup analysis with respect to sample size.

4.8 Analysis of subgroups related to countries

The subgroup analyses related to sample regions are shown in Table 7. The results of the group effect test indicated that QB = 12.101, p < 0.05, suggesting that the influence of different sample regions is significantly different. For the different sample regions, the ES for America, China, Germany, Indonesia, and Korea were 0.171 (p > 0.05), –0.102 (p > 0.05), 0.011 (p > 0.05), 0.906 (p < 0.001), and 0.209 (p < 0.05), respectively. Only the effect sizes for Indonesia and Korea reached statistical significance, with Indonesia showing a significant influence and Korea showing a small effect.

TABLE 7
www.frontiersin.org

Table 7. Subgroup analysis related to the sample area.

4.9 Cumulative analysis

Cumulative meta-analysis refers to a distinctive approach in which each original study is sequentially included in a meta-analysis, with the analysis updating as new data from individual studies are added. The most common method for accumulating studies is chronological, where the results demonstrate how the evidence has evolved over time. In this study, we assess the impact of the studies cumulatively based on the year of publication. The closer the value is to 1, the weaker the effect becomes. As shown in Figure 4, the effect size has exhibited an increasing trend over time.

FIGURE 4
www.frontiersin.org

Figure 4. Forest plot of outcomes type for SPs compared with RP. Favors A, SPs; Favors B, RP.

5 Discussion

There was no significant difference between SPs and RP in terms of their overall impact on students. Both teaching methods had a similar effect. When considering the specific dimensions of learning outcomes, no significant differences were found between the two methods regarding students’ communication, emotions, general performance, knowledge, professional competence, and self-efficacy. However, SPs had a more positive effect on improving students’ self-confidence compared to the RP method, which aligns with the findings of Cahyono et al. (36). This may be attributed to the fact that SPs expose students to realistic clinical scenarios that simulate emotional exchanges between patients and healthcare providers. The realistic interactions involved in SPs help students develop greater confidence in their ability to navigate real-world clinical scenarios (52). Emotional agitation, anxiety, and tension are common in clinical environments (53). The use of SPs enables students to develop emotion regulation and coping strategies within simulated clinical environments. Such training helps learners remain composed during emergencies and respond more effectively to patients’ emotional needs. For medical students who have not yet experienced real clinical settings, interacting with actual patients can cause anxiety and uncertainty (54). SPs training helps students gradually adapt to clinical roles, gain experience, and face real patients with more confidence during this transition (55). Through repeated interactions with SPs, students begin to perceive themselves as future healthcare professionals. This internalization of their professional identity plays a critical role in strengthening their self-confidence in clinical practice (56). These experiences have been shown to significantly enhance students’ self-confidence in real-world clinical practice. However, it is noteworthy that this improvement was domain-specific. SPs did not yield significantly greater benefits than RP in other learning domains, possibly because these outcomes demand broader or more sustained forms of engagement beyond confidence-building alone.

Some studies were excluded from the analysis due to concerns about potential contributions to heterogeneity. To further explore this issue, publication types were categorized as either “articles” or “theses,” and a subgroup analysis was conducted. The results indicated that SPs had a significantly greater positive effect in theses compared to other methods. In contrast, studies published as articles showed no significant difference between SPs and RP, possibly due to the limited number of studies in this subgroup. Regarding teaching implementation, sample size did not appear to significantly affect instructional outcomes overall. However, larger sample sizes were associated with more favorable results, particularly for SPs. This may be due to the inherent challenges of maintaining classroom discipline and engagement during RP activities in large groups, where instructors may struggle to effectively manage the learning environment, thereby reducing RP’s instructional effectiveness compared to SPs.

Significant differences were observed across different sample areas. Our analysis included studies conducted in five countries: the United States, China, Germany, Indonesia, and Korea. Among these, Indonesia showed the most significant effect, with an effect size of 0.906, which was statistically significant at the 0.1% level. However, this finding should be interpreted cautiously due to the small number of Indonesian studies. Overgeneralizing from such limited data may lead to inaccurate conclusions. Cumulative analysis showed that the effect size was initially small and highly variable but gradually increased over time, indicating an overall upward trend. This pattern may reflect improvements in research design and methodology over the years. As studies adopted more rigorous controls, the reported effect sizes became more stable and pronounced. In earlier periods, studies with small or non-significant effects were less likely to be published—a phenomenon known as the “file drawer effect.” However, growing emphasis on study registration and research transparency in recent years has helped reduce publication bias and expanded the range of studies available for cumulative analysis.

In order to optimize the use of standardized patients, clear learning objectives should be defined prior to implementation (57). These objectives should be aligned with course goals to ensure appropriate scenario selection and meaningful performance evaluation. Our findings suggest that SPs is particularly effective in enhancing students’ self-confidence. Therefore, when confidence-building is a central instructional goal, SPs should be prioritized when feasible. During course sessions, educators should guide the activity through timely, constructive feedback (58), helping students reflect and improve without disrupting their immersion.

The duration of simulation-based teaching should be carefully planned and adjusted based on the complexity of each scenario (59). More complex situations may require longer sessions and extended discussion time to ensure students can fully comprehend and apply relevant knowledge and skills (60). Each phase should be appropriately timed to avoid the negative effects of excessively long or short sessions on learning outcomes. Flexibility in time management is essential (61). For instance, if students are actively engaged, extending the session may enhance reflection and learning. Conversely, if students appear fatigued or time is limited, the session can be shortened. Clearly defined learning objectives help instructors select suitable scenarios and roles and assess whether student performance meets expectations. Teachers should also be mindful of when to intervene, avoiding interruptions during moments of deep engagement, as this may disrupt the learning process.

Given the variability in students’ learning preferences and responsiveness, educators should consider different instructional approaches (62). Some students may benefit more from RP, while others respond better to SPs. Thus, teaching methods should be selected and adapted based on students’ needs and the complexity of the course content.

6 Limitations

This study has the following advantages: first, the experimental data were retrieved from multiple major databases from the establishment of the study up to 2024. Secondly, the included studies are of high quality. We also conducted a subgroup analysis, which partially explains the sources of heterogeneity in this study.

We have to admit that the study has some limitations. The limitation is related to the study sample, which is a common issue in almost all meta-analyses. For instance, in our subgroup analysis based on sample region, only two studies from Indonesia were included. The small number of studies in this subgroup may lead to less precise estimates. In the future, some unpublished literature needs to be included, and as the number of studies increases, this problem can be solved more effectively. The second limitation is that this study considered only a limited set of variables that may affect the effectiveness of the intervention. Important factors such as gender differences and prior frontline work experience were not included, although they may significantly influence student outcomes. This omission was primarily due to the fact that many of the included studies did not report these variables in sufficient detail. Future research should aim to incorporate such factors to provide a more comprehensive understanding of their potential impact on intervention effectiveness. Finally, language bias may be present, as most studies were conducted in English. Future research could include studies in other languages to enhance the robustness of the overall findings.

7 Conclusion

This meta-analysis synthesized findings from studies comparing SP and RP methods in medical education. Our findings indicate that, in terms of overall impact, the effects of both methods on students are similar. However, SPs are more effective in improving self-confidence. Although this study offers a new comparative perspective, cultural and sample differences may limit the generalizability of the findings. While limitations remain, this study contributes meaningful evidence to the ongoing debate on the relative effectiveness of RP and SPs.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

JX: Writing – original draft, Conceptualization, Writing – review & editing, Funding acquisition, Formal Analysis. XF: Validation, Software, Writing – review & editing, Methodology, Writing – original draft.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Bastiaens I, Beesley C. Economic hardship and welfare policy preferences: What can the COVID-19 pandemic tell us? Polit Stud Rev. (2024) 23:313–32. doi: 10.1177/14789299241252386

Crossref Full Text | Google Scholar

2. Lewinson T, Washington T, Allen S, Murphey J, Onsando W. “We’re kind of on the back burner”: Psychological distress and coping among medical social workers during the COVID-19 pandemic. Soc Work Health Care. (2023) 62:243–62. doi: 10.1080/00981389.2023.2221718

PubMed Abstract | Crossref Full Text | Google Scholar

3. Grønkjær M, Voldbjerg S, Jørgensen L, Kusk K, Laugesen B. Establishing and leading a cross-institutional partnership to integrate fundamentals of care into clinical practice, nursing education and research. J Adv Nurs. (2023) 79:951–60. doi: 10.1111/jan.15366

PubMed Abstract | Crossref Full Text | Google Scholar

4. Bokken L, van Dalen J, Rethans J. The case of “Miss Jacobs”: Adolescent simulated patients and the quality of their role playing, feedback, and personal impact. Simul Healthc. (2010) 5:315–9. doi: 10.1097/SIH.0b013e3181ddcd71

PubMed Abstract | Crossref Full Text | Google Scholar

5. Becker K, Rose L, Berg J, Park H, Shatzer J. The teaching effectiveness of standardized patients. J Nurs Educ. (2006) 45:103–11. doi: 10.3928/01484834-20060401-03

PubMed Abstract | Crossref Full Text | Google Scholar

6. Yu H, Isahakyan A, van Merrienboer J, Köhler S, Asoodar M. Second language learning role-play: Effects of patient and doctor roles on motivation and competence. Front Med. (2023) 10:1163267. doi: 10.3389/fmed.2023.1163267

PubMed Abstract | Crossref Full Text | Google Scholar

7. Fulton A, Dimitropoulos G, Ayala J. Role-playing: A strategy for practicum preparation for foundation year MSW students. J Teach Soc Work. (2019) 39:163–80. doi: 10.1080/08841233.2019.1576573

Crossref Full Text | Google Scholar

8. Cortés-Rodríguez A, Roman P, López-Rodríguez M, Fernández-Medina I, Fernández-Sola C, Hernández-Padilla J. Role-play versus standardised patient simulation for teaching interprofessional communication in care of the elderly for nursing students. Healthcare. (2021) 10:46. doi: 10.3390/healthcare10010046

PubMed Abstract | Crossref Full Text | Google Scholar

9. Zhang W, Wei J, Guo W, Wang Z, Chen S. Comparing the effects of team-based and problem-based learning strategies in medical education: A systematic review. BMC Med Educ. (2024) 24:172. doi: 10.1186/s12909-024-05107-9

PubMed Abstract | Crossref Full Text | Google Scholar

10. Ross M, Bryan J, Thomas K, Asghar-Ali A, Pickens S. Elder abuse education using standardized patient simulation in an undergraduate nursing program. J Nurs Educ. (2020) 59:331–5. doi: 10.3928/01484834-20200520-06

PubMed Abstract | Crossref Full Text | Google Scholar

11. Kim S, Kim H. Effects of psychiatric nursing practicum using simulated patients on self-directed learning readiness, learning self-efficacy, satisfaction of learning. J Korea Contents Assoc. (2014) 14:396–408. doi: 10.5392/JKCA.2014.14.02.396

Crossref Full Text | Google Scholar

12. Webster D. Using standardized patients to teach therapeutic communication in psychiatric nursing. Clin Simul Nurs. (2014) 10:e81–6. doi: 10.1016/j.ecns.2013.08.005

Crossref Full Text | Google Scholar

13. Ok E, Kutlu F, Ates E. The effect of standardized patient simulation prior to mental health rotation on nursing students’ anxiety and communication skills. Issues Ment Health Nurs. (2020) 41:251–5. doi: 10.1080/01612840.2019.1642427

PubMed Abstract | Crossref Full Text | Google Scholar

14. Ha E. Experience of nursing students with standardized patients in simulation-based learning: Q-methodology study. Nurs Educ Today. (2018) 66:123–9. doi: 10.1016/j.nedt.2018.04.023

PubMed Abstract | Crossref Full Text | Google Scholar

15. Bayne H. Training medical students in empathic communication. J Spec Group Work. (2011) 36:316–29. doi: 10.1080/01933922.2011.613899

Crossref Full Text | Google Scholar

16. Delnavaz S, Hassankhani H, Roshangar F, Dadashzadeh A, Sarbakhsh P, Ghafourifard M, et al. Comparison of scenario based triage education by lecture and role playing on knowledge and practice of nursing students. Nurs Educ Today. (2018) 70:54–9. doi: 10.1016/j.nedt.2018.08.006

PubMed Abstract | Crossref Full Text | Google Scholar

17. Lee B, Kim S. Development and effects of a high-risk pregnancy emotive role-play program for nursing students: A quasi-experimental study. Korean J Women Health Nurs. (2022) 28:317–28. doi: 10.4069/kjwhn.2022.12.06

PubMed Abstract | Crossref Full Text | Google Scholar

18. Mounsey A, Bovbjerg V, White L, Gazewood J. Do students develop better motivational interviewing skills through role-play with standardised patients or with student colleagues? Med Educ. (2006) 40:775–80. doi: 10.1111/j.1365-2929.2006.02533.x

PubMed Abstract | Crossref Full Text | Google Scholar

19. Yeung J. Adverse event disclosure training for nursing students: Peer role-play and simulated patients. Int J Nurs Educ Scholarsh. (2019) 16:1–9. doi: 10.1515/ijnes-2019-0094

PubMed Abstract | Crossref Full Text | Google Scholar

20. Taylor S, Bobba S, Roome S, Ahmadzai M, Tran D, Vickers D, et al. Simulated patient and role play methodologies for communication skills training in an undergraduate medical program: Randomized, crossover trial. Educ Health. (2018) 31:10–6. doi: 10.4103/1357-6283.239040

PubMed Abstract | Crossref Full Text | Google Scholar

21. Schlegel C, Woermann U, Shaha M, Rethans J, van der Vleuten C. Effects of communication training on real practice performance: A role-play module versus a standardized patient module. J Nurs Educ. (2012) 51:16–22. doi: 10.3928/01484834-20111116-02

PubMed Abstract | Crossref Full Text | Google Scholar

22. Ma J, Lee Y, Kang J. Standardized patient simulation for more effective undergraduate nursing education: A systematic review and meta-analysis. Clin Simul Nurs. (2023) 74:19–37. doi: 10.1016/j.ecns.2022.10.002

Crossref Full Text | Google Scholar

23. Gelis A, Cervello S, Rey R, Llorca G, Lambert P, Franck N, et al. Peer role-play for training communication skills in medical students: A systematic review. Simul Healthc. (2020) 15:106–11. doi: 10.1097/SIH.0000000000000412

PubMed Abstract | Crossref Full Text | Google Scholar

24. Dalwood N, Bowles K, Williams C, Morgan P, Pritchard S, Blackstock F. Students as patients: A systematic review of peer simulation in health care professional education. Med Educ. (2020) 54:387–99. doi: 10.1111/medu.14058

PubMed Abstract | Crossref Full Text | Google Scholar

25. Chua J, Ang E, Lau S, Shorey S. Effectiveness of simulation-based interventions at improving empathy among healthcare students: A systematic review and meta-analysis. Nurs Educ Today. (2021) 104:105000. doi: 10.1016/j.nedt.2021.105000

PubMed Abstract | Crossref Full Text | Google Scholar

26. Balduzzi S, Rücker G, Schwarzer G. How to perform a meta-analysis with R: A practical tutorial. BMJ Ment Health. (2019) 22:153–60. doi: 10.1136/ebmental-2019-300117

PubMed Abstract | Crossref Full Text | Google Scholar

27. Paul J, Barari M. Meta-analysis and traditional systematic literature reviews—What, why, when, where, and how? Psychol Mark. (2022) 39:1099–115. doi: 10.1002/mar.21657

Crossref Full Text | Google Scholar

28. Borenstein M. Comprehensive meta-analysis software. In: M Egger, JPT Higgins, G Davey Smith editors. Systematic Reviews in Health Research: Meta-Analysis in Context. Hoboken, NJ: Wiley (2022). p. 535–48. doi: 10.1002/9781119099369.ch27

Crossref Full Text | Google Scholar

29. Moher D. Corrigendum to: Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. Int J Surg. (2010) 8:658. doi: 10.1016/j.ijsu.2010.07.299

Crossref Full Text | Google Scholar

30. Du L, Zhao L, Xu T, Wang Y, Zu W, Huang X, et al. Blended learning vs traditional teaching: The potential of a novel teaching strategy in nursing education-a systematic review and meta-analysis. Nurs Educ Pract. (2022) 63:103354. doi: 10.1016/j.nepr.2022.103354

PubMed Abstract | Crossref Full Text | Google Scholar

31. Gao X, Wang L, Deng J, Wan C, Mu D. The effect of the problem based learning teaching model combined with mind mapping on nursing teaching: A meta-analysis. Nurs Educ Today. (2022) 111:105306. doi: 10.1016/j.nedt.2022.105306

PubMed Abstract | Crossref Full Text | Google Scholar

32. Marufu T, Bower R, Hendron E, Manning J. Nursing interventions to reduce medication errors in paediatrics and neonates: Systematic review and meta-analysis. J Pediatr Nurs. (2022) 62:e139–47. doi: 10.1016/j.pedn.2021.08.024

PubMed Abstract | Crossref Full Text | Google Scholar

33. Gucciardi D, Lines R, Ntoumanis N. Handling effect size dependency in meta-analysis. Int Rev Sport Exerc Psychol. (2022) 15:152–78. doi: 10.1080/1750984X.2021.1946835

Crossref Full Text | Google Scholar

34. Bosse HM, Nickel M, Huwendiek S, Jünger J, Schultz JH, Nikendei C. Peer role-play and standardised patients in communication training: a comparative study on the student perspective on acceptability, realism, and perceived effect. BMC Med Educ. (2010) 10:1–9. doi: 10.1186/1472-6920-10-27

PubMed Abstract | Crossref Full Text | Google Scholar

35. Bosse HM, Schultz JH, Nickel M, Lutz T, Möltner A, Jünger J, et al. The effect of using standardized patients or peer role play on ratings of undergraduate communication training: a randomized controlled trial. Patient Educ Couns. (2012) 87:300–6. doi: 10.1016/j.pec.2011.10.007

PubMed Abstract | Crossref Full Text | Google Scholar

36. Cahyono B, Zuhroidah I, Sujarwadi M. Effectiveness of standardized patients and role-play simulation methods in improving self-confidence and therapeutic communication skills of nursing students. Nurs Health. (2020) 9:9–15. doi: 10.36720/nhjk.v9i1.128

Crossref Full Text | Google Scholar

37. Kim E, Kim SS, Kim S. Effects of infection control education for nursing students using standardized patients vs. peer role-play. Int J Environ Res Public Health. (2021) 18:107. doi: 10.3390/ijerph18010107

PubMed Abstract | Crossref Full Text | Google Scholar

38. Park KY, Park HK, Hwang HS. Group randomized trial of teaching tobacco-cessation counseling to senior medical students: a peer role-play module versus a standardized patient module. BMC Med Educ. (2019) 19:1–9. doi: 10.1186/s12909-019-1668-x

PubMed Abstract | Crossref Full Text | Google Scholar

39. Yeung WYJ. Using simulation with simulated patients and peer role-play for adverse event disclosure training in an undergraduate nursing program: a comparative study. Doctoral dissertation. University of Bristol (2020).

Google Scholar

40. Li H, Gu H, Chi C, Jiang H, Zhou Y, Jia W, et al. End-of-life communication skills training for undergraduate nursing students to address cultural obstacles in China. J Hosp Palliat Nurs. (2023) 25:E14–23. doi: 10.1097/NJH.0000000000000920

PubMed Abstract | Crossref Full Text | Google Scholar

41. Bradford JL. Effect of Simulated Patients Versus Peer Role-Play on Physical Therapist Student Clinical Reasoning and Confidence. Doctoral dissertation. The University of Memphis (2018).

Google Scholar

42. Lupiani C. A Comparison of Standardized Patients With Role Play for Teaching Therapeutic Communication. Doctoral dissertation. Teachers College, Columbia University (2023).

Google Scholar

43. Nakagawa S, Lagisz M, Jennions M, Koricheva J, Noble D, Parker T, et al. Methods for testing publication bias in ecological and evolutionary meta-analyses. Methods Ecol Evol. (2022) 13:4–21. doi: 10.1111/2041-210X.13724

Crossref Full Text | Google Scholar

44. Begg C, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics. (1994) 50:1088–101. doi: 10.2307/2533446

Crossref Full Text | Google Scholar

45. Tipton E, Bryan C, Murray J, McDaniel M, Schneider B, Yeager D. Why meta-analyses of growth mindset and other interventions should follow best practices for examining heterogeneity: Commentary on Macnamara and Burgoyne (2023) and Burnette et al. (2023). Psychol Bull. (2023) 149:229–41. doi: 10.1037/bul0000384

PubMed Abstract | Crossref Full Text | Google Scholar

46. Macaskill P, Takwoingi Y, Deeks J, Gatsonis C. Understanding meta-analysis. Cochrane Handb Syst Rev Diagn Test Accur. (2023):203–47. doi: 10.1002/9781119756194.ch9

Crossref Full Text | Google Scholar

47. Johnson-Gentile K, Clements D, Battista M. Effects of computer and noncomputer environments on students’ conceptualizations of geometric motions. J Educ Comput Res. (1994) 11:121–40. doi: 10.2190/49EE-8PXL-YY8C-A923

PubMed Abstract | Crossref Full Text | Google Scholar

48. Cohen J. Statistical Power Analysis for the Behavioral Sciences. Milton Park: Routledge (2013).

Google Scholar

49. Wright H, O’Shea M, Sekula J, Mitchell L. Assessment of communication skills using telehealth: Considerations for educators. Front Med. (2022) 9:841309. doi: 10.3389/fmed.2022.841309

PubMed Abstract | Crossref Full Text | Google Scholar

50. Jaramillo-Aguilar D, Simbaña-Rivera K. Genetic knowledge and attitudes towards genetic testing among final-year medical students at a public university in Ecuador. Front Med. (2024) 11:1363552. doi: 10.3389/fmed.2024.1363552

PubMed Abstract | Crossref Full Text | Google Scholar

51. Andre N, Muallem L, Yahav L, Golan-Tripto I, Shmuel A, Horev A. Israeli pediatricians’ confidence level in diagnosing and treating children with skin disorders: A cross-sectional questionnaire pilot study. Front Med. (2023) 10:1–7. doi: 10.3389/fmed.2023.1250271

PubMed Abstract | Crossref Full Text | Google Scholar

52. Perez A, Gaehle K, Sobczak B, Stein K. Virtual simulation as a learning tool for teaching graduate nursing students to manage difficult conversations. Clin Simul Nurs. (2022) 62:66–72. doi: 10.1016/j.ecns.2021.10.003

Crossref Full Text | Google Scholar

53. Ulrich B, Cassidy L, Barden C, Varn-Davis N, Delgado S. National nurse work environments-October 2021: A status report. Crit Care Nurs. (2022) 42:58–70. doi: 10.4037/ccn2022798

PubMed Abstract | Crossref Full Text | Google Scholar

54. Quek T, Tam W, Tran B, Zhang M, Zhang Z, Ho C, et al. The global prevalence of anxiety among medical students: A meta-analysis. Int J Environ Res Public Health. (2019) 16:2735. doi: 10.3390/ijerph16152735

PubMed Abstract | Crossref Full Text | Google Scholar

55. Johnson K, Scott A, Franks L. Impact of standardized patients on first semester nursing students self-confidence, satisfaction, and communication in a simulated clinical case. SAGE Open Nurs. (2020) 6:2377960820930153. doi: 10.1177/2377960820930153

PubMed Abstract | Crossref Full Text | Google Scholar

56. Swift L, Henderson A, Wu C. Self-confidence in clinical skill: A descriptive study of the perspective of first-year nursing students. Nurse Educ Pract. (2022) 58:103270. doi: 10.1016/j.nepr.2021.103270

PubMed Abstract | Crossref Full Text | Google Scholar

57. Alrehaili E, Al Osman H. A virtual reality role-playing serious game for experiential learning. Interact Learn Environ. (2022) 30:922–35. doi: 10.1080/10494820.2019.1703008

Crossref Full Text | Google Scholar

58. Tiedemann, E, Simmenroth A. Teaching alcohol and smoking counselling in times of COVID-19 to 6th-semester medical students: Experiences with a digital-only and a blended learning teaching approach using role-play and feedback. GMS J Med Educ. (2021) 38: doi: 10.3205/zma001513

PubMed Abstract | Crossref Full Text | Google Scholar

59. Li S, Wang S, Du J, Pei Y, Shen X. MOOC learners’ time-investment patterns and temporal-learning characteristics. J Comput Assist Learn. (2022) 38:152–66. doi: 10.1111/jcal.12597

Crossref Full Text | Google Scholar

60. Crisianita S, Mandasari B. The use of small-group discussion to improve students’ speaking skill. J Engl Lang Teach Learn. (2022) 3:61–6. doi: 10.33365/jeltl.v3i1.1680

Crossref Full Text | Google Scholar

61. Burgess S, Rawal S, Taylor E. Teachers’ use of class time and student achievement. Econ Educ Rev. (2023) 94:102405. doi: 10.1016/j.econedurev.2023.102405

Crossref Full Text | Google Scholar

62. Komalawardhana N, Panjaburee P. Trends and development of technology-enhanced personalized learning in science education: A systematic review of publications from 2010 to 2022. J. Comput. Educ. (2024) 11:721–42. doi: 10.1007/s40692-023-00276-w

Crossref Full Text | Google Scholar

Keywords: role-playing, standardized patients, meta-analysis, medical education, clinical simulation

Citation: Xiao J and Fu X (2025) Is the use of standardized patients more effective than role-playing in medical education? A meta-analysis. Front. Med. 12:1601116. doi: 10.3389/fmed.2025.1601116

Received: 28 March 2025; Accepted: 26 May 2025;
Published: 18 June 2025.

Edited by:

Md Anwarul Azim Majumder, The University of the West Indies, Cave Hill, Barbados

Reviewed by:

Addisu Melake, University of Gondar, Ethiopia
Mariela Bellotti, National University of Río Negro, Argentina

Copyright © 2025 Xiao and Fu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xinjian Fu, amlsaWFuZ2Z4akAxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.