Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 05 February 2026

Sec. Healthcare Professions Education

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1745975

This article is part of the Research TopicAdvancing Medical Education through Integrated Artificial Intelligence and Immersive Virtual TechnologiesView all 6 articles

Integrating large language models into medical undergraduate laboratory course to enhance bioethical competence: a quasi-experimental study

  • Department of Cell Biology and Genetics, The School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian, China

Objective: This study investigates the integration of different large language models (LLMs) into the Medical Cell Biology Laboratory Course (MCBLC) to enhance bioethics training for undergraduate medical students in China. It further compares the effectiveness of these LLMs in improving teaching outcomes and student learning performances. Key challenges encountered during implementation were identified, and potential strategies to address them were also explored.

Methods: First-year undergraduate medical students from three medical majors were assigned to five groups. The study involved three phases: instructor-led course introduction, LLM-assisted experimental practice addressing procedural, conceptual, and psychological challenges, and post-training evaluation via questionnaires and blind-graded laboratory reports. Four domestic robust LLMs (DeepSeek, Doubao, KIMI, ChatGLM) were compared to assess their impact on bioethics integration, instructional effectiveness, and student learning outcomes, while documenting students' perceptions and concerns regarding LLM use.

Results: The study demonstrated that all four LLMs supported first-year undergraduate medical students in consolidating foundational knowledge, enhancing bioethics proficiency during laboratory practice, and developing critical competencies for future physicians. Questionnaires from 86 students across three majors indicated generally high satisfaction. For Medical Imaging Technology students, DeepSeek (mean 4.3, SD 0.7) and KIMI (mean 4.3, SD 0.8) were rated significantly higher than Doubao (mean 3.9, SD 0.7) and ChatGLM (mean 3.3, SD 0.6). KIMI was also preferred among Health Surveillance and Quarantine (mean 4.4, SD 0.5) and Medical Prevention (mean 4.5, SD 0.5) students. Nevertheless, students expressed concerns regarding potential academic inaccuracies, bias, and possible impact on independent thinking.

Conclusions: This study suggested that recent LLMs, particularly KIMI and DeepSeek, may support integrating bioethics into undergraduate medical laboratory courses in a university in China. By assisting students in accessing information, reflecting on ethical issues, and navigating practical challenges, these tools can facilitate learning and foster ethical awareness, competent future physicians. These findings, as an initial exploration and context-specific, indicate that LLMs may support bioethics learning in undergraduate medical laboratory courses and help foster ethically aware, competent future physicians.

Introduction

Bioethics-oriented basic medical education has emerged as an influential paradigm in undergraduate medical training, exerting a significant impact on students during the early stages of their medical studies (1, 2). By integrating ethical, humanistic, existential and inclusive perspectives into the foundational medical curricula, it plays a crucial role in shaping students' professional identity and moral outlook (3). By fostering bioethical sensitivity among medical undergraduates in both clinical practice and bioethical research (4), it lays a foundation for further engagement. Students are then immersed in authentic bioethical cases, gaining the ability to manage psychological stress and navigate ethical dilemmas in academic and clinical environments (5). This ultimately equips future medical professionals to shoulder the responsibility of defining and upholding bioethical principles for the benefit of humanity and the global community (6, 7).

As biomedical science advances rapidly, the ethical challenges accompanying technological innovation have grown increasingly complex. A key issue in medical education is how to integrate bioethical principles effectively into foundational medical sciences (8). Genetic testing and result disclosure exemplify this need: while screening can prevent the transmission of sex-linked diseases, misuse for non-medical purposes such as sex selection may distort population balance and undermine societal ethics (9, 10). In biomedical research, the boundaries of genetic intervention remain debated. Gene therapy has achieved significant progress, as evidenced by FDA-approved treatments using lentiviral vectors and CRISPR/Cas9 for transfusion-dependent β-thalassemia (11, 12). Nevertheless, unethical practices such as human germline editing demonstrate how scientific advances can outpace moral oversight (13). Beyond the laboratory, healthcare professionals encounter complex ethical dilemmas in both study and practice, including asymmetries in informed consent, moral stress in emergencies, and conflicts in treatment decision-making (14). These challenges underscore an urgent educational responsibility, that is, medical undergraduates must be prepared early with solid bioethical knowledge, reflective skills, and psychological resilience. Such preparation enables them to navigate ethical complexities competently and to participate responsibly in the development and implementation of evolving standards that protect human dignity and promote societal welfare.

The growing complexity of ethical dilemmas in modern medicine has prompted educators to seek innovative strategies for cultivating bioethical awareness among undergraduates. Among existing strategies, institutional workshops or specialized training provide in-depth instruction on bioethical issues. However, they depend heavily on organizational support and lack flexibility (15). Similarly, integrating ethics cases or current affairs into courses leverages individual teaching expertise but often offers a single-dimensional perspective, insufficient for students' multifaced ethical reflection (16). In the context of LLM-driven innovation, advanced computational power and vast knowledge repositories allow LLMs to provide efficient, holistic, and readily accessible solutions (17). These capabilities have been widely applied across education, medicine, engineering, and the social sciences, tackling complex challenges with remarkable breadth and precision (1821). In this study, LLMs were explored as a dynamic platform within a medical cell biology course, employing a student-teacher collaborative approach. By promoting active engagement with bioethical principles, this method aims to develop undergraduates who combine competence with ethical awareness, while providing a scalable and adaptable model to enhance medical ethics education and support responsible, ethically informed practice.

Methods

Study overview

This study adopted a quasi-experimental design with student self-selection to explore whether large language models (LLMs) could promote medical undergraduates' understanding of bioethics-oriented values in a cell biology laboratory course. The instructional intervention focused on a laboratory task involving the examination of sperms to observe chromosomes in germ cells, a topic selected for its biological relevance as well as its ethical implications in medical education.

The study was conducted in three sequential phases. In the first phase, participating students received a standardized introduction covering the course structure, learning objectives, background concepts, experimental procedures, and report writing requirements from the instructor. Students were informed of multiple learning options, including the use of one of four LLM tools (DeepSeek, Doubao, KIMI, or ChatGLM) or the option of not using any LLM during the laboratory course.

During in-class laboratory activities, students who chose to use LLM tools were permitted and encouraged to access them using personal mobile phones, tablets or laptop computers to support their learning and ethical reflection. Based on their personal preferences, students voluntarily selected their preferred learning approach and were subsequently categorized into five groups: four LLM-assisted learning groups (one for each LLM tool) and one non-LLM group. To avoid extremely large or small group sizes that could affect instructional feasibility and data analysis, the instructor provided limited informational guidance regarding approximate group sizes during the selection process. Final group decisions remained entirely voluntary, and no student was reassigned by the instructor. No random allocation or compulsory reassignment was applied.

In the final phase, all students completed a laboratory report and a post-course questionnaire reflecting on their learning experience, ethical engagement, and perceptions of LLM-assisted learning. Laboratory reports were collected and graded by the instructors, who also provided qualitative evaluations. Students' learning outcomes and feedback were then compared across the five groups to examine whether and how the use of different LLM tools influenced ethical awareness, concerns related to LLM platforms, and perceived learning effectiveness in the context of integrating bioethical principles into basic medical education (Figure 1).

Figure 1
Flowchart outlining a medical education process for undergraduates in Medical Imaging Technology, Health Surveillance and Quarantine, and Medical Prevention. Phase I involves teacher-led instruction covering themes, educational goals, and preparation. Phase II focuses on student-centered, bioethics-oriented practice, including exercises, data handling, and reflection, with LLM-assisted or standalone approach. Phase III comprises learner and teacher evaluations.

Figure 1. A schematic overview of the study on LLM-assisted integration of bioethical principles into the undergraduate cell biology laboratory course and its impact on educational outcomes.

Participants

A total of 86 medical undergraduates participated in this study. All participants were enrolled in the cell biology laboratory course during the 2025 academic year and were drawn from three majors: Medical Imaging Technology (n = 31), Preventive Medicine (n = 28), and Health Surveillance and Quarantine (n = 27). These majors were selected because each represents a distinct interdisciplinary orientation within medical education. General demographic information of the participants is summarized in Tables 1, 2.

Table 1
www.frontiersin.org

Table 1. General information of all participants from the three majors.

Table 2
www.frontiersin.org

Table 2. Overview of academic background of the students across three majors.

Medical Imaging Technology integrates foundational medical knowledge with engineering principles, emphasizing cross-disciplinary competencies required in modern diagnostic practice (22). Preventive Medicine combines medical science with social science perspectives, reflecting the integration of theoretical knowledge and applied approaches in public health and disease prevention (23). Health Surveillance and Quarantine draws on medicine and natural sciences, requiring a comprehensive understanding of both theoretical frameworks and practical applications within broader biological and medical contexts (24).

Selected LLMs and modes of interaction

Large language models (LLMs) are defined as advanced text-based machine learning models. They have been trained on extensive multilingual corpora to generate and analyze natural language. In the context of this educational reform study, LLMs were used as interactive tools that assist undergraduate students in retrieving, understanding, and reflecting on bioethical knowledge within the cell biology laboratory course. Four prominent LLMs in China were evaluated in this study. DeepSeek, developed as a free, open-source platform, excels in generating clinical materials and has demonstrated strong performance in national medical examinations (25). Doubao, created by ByteDance, is trained on extensive Chinese social media datasets and performs well in addressing routine queries (26). KIMI, developed by Moonshot AI, has been applied in clinical contexts for generating case descriptions due to its scientific rigor and reliability (27). ChatGLM, developed by Tsinghua University and collaborators, is suitable for both daily use and academic research (28). We assessed how these LLMs can facilitate the integration of bioethical education into medical laboratory courses using student surveys and teacher evaluations of laboratory reports.

The versions of the four LLMs used in this study were DeepSeek-R1, Doubao-1.5 Pro, KIMI K2 series, and ChatGLM/GLM series. During the laboratory course, students accessed these platforms on personal devices, such as cell phones, tablets, or laptops via web-based interfaces. Interactions were conducted in Chinese, with typical factual or explanatory prompts, such as “What is the process of sample collection?”, simulating realistic laboratory queries. All LLMs adhered to the same local internet security and data protection regulations. System messages or guardrails were not standardized beyond the publicly available interfaces, reflecting natural usage in educational settings.

Evaluation metrics

Both the student questionnaire and instructor evaluations used a standardized five-point Likert scale for assessment. Each item required respondents to select one of five options, with no alternative or intermediate values allowed. The scale was defined as follows: 1 = poor or very dissatisfied, 2 = limited or slightly dissatisfied, 3 = fair or basically satisfied, 4 = good or satisfied, and 5 = excellent or very satisfied. Respondents were required to choose a single score for each item, ensuring that all ratings were constrained to this 1–5 scale. This approach allowed for consistent quantification of student performance, engagement, and instructor evaluations while maintaining clarity and standardization in the assessment process.

Instructor rating of laboratory reports

Laboratory reports were completed following a standardized format that included five predefined sections: experimental objectives, materials and reagents, procedures, observations, and key precautions. Lab reports were evaluated using a 5-point scale (1 = poor to 5 = excellent), and the scores were combined to produce an overall rating reflecting the quality and completeness of the report. All reports were assessed by a single course instructor who was blinded to students' group assignments. The evaluation followed a structured approach based on the predefined report components and aligned with the instructional objectives of the cell biology laboratory course. Although inter-rater reliability could not be assessed due to the use of a single rater, this approach was intended to maintain consistency in scoring across all student groups.

Statistical analysis

All statistical analyses were conducted using GraphPad Prism 7.0 and SPSS 26.0. Continuous variables were summarized using means and standard deviations, and categorical variables were summarized using frequencies and percentages. Internal consistency of the measurement instruments was assessed using Cronbach's alpha. Differences among the student groups were analyzed using one-way analysis of variance (ANOVA). The assumptions of normality and homogeneity of variances were checked before analysis. When ANOVA indicated a significant group effect, post-hoc pairwise comparisons were performed using Tukey's test. For all comparisons, F values, p-values, adjusted p-values, and 95% confidence intervals (CI) were reported. The significance level was set at α = 0.05. Effect sizes were calculated to complement p-values, providing an estimate of the magnitude of observed differences. Graphs and figures were generated using GraphPad Prism 7.0.

Results

Evaluation of the effectiveness of LLMs in integrating bioethical principles into medical education, using both student self-assessments and instructor grading

The study included medical undergraduate students enrolled in courses during the 2025 academic year from three majors: Medical Imaging Technology, Medical Prevention, and Health Surveillance and Quarantine. Students' interest in, experience with, and willingness to use LLMs were assessed via the course questionnaire using a 1–5 Likert scale, and no significant differences were observed among students from the three majors (Figure 2).

Figure 2
Three bar charts show survey results on the use of Large Language Models (LLMs) among students in medical imaging technology, medical prevention, and health surveillance and quarantine fields. Chart one indicates interest in using LLMs for recommending resources, with most agreeing. Chart two shows usage extent for learning latest developments, with moderate to considerable use. Chart three illustrates willingness to use LLMs for self-directed learning, with majority agreeing. All sections denote similar interest and usage levels across fields, marked as not significant (n.s.).

Figure 2. Evaluation of students' interest in, experience with and willingness to use LLMs (one-way ANOVA with post-hoc Tukey's test, *p < 0.05, n.s. = no significant difference).

In the laboratory course, students are expected to learn experimental design, principles, techniques, operations, and data analysis, and to summarize their findings in a laboratory report. These reports provide an objective measure of learning outcomes and help students consolidate practical knowledge, reflect actively, and organize theoretical and operational skills systematically. Preparing the report completes a full learning cycle and lays a foundation for future scientific inquiry. In this study, course effectiveness was evaluated from both instructor and student perspectives through instructor grading based on predefined criteria and student self-assessments, including experiences with or without LLM-assisted support. We collected overall assessments of using LLMs in bioethics-oriented education in medical cell biology laboratory course from 86 students across the three majors, using a five-point Likert scale ranging from 1 (not satisfied) to 5 (very satisfied) (Figure 3, the central lane). The self-evaluation data were integrated with instructors' grading of students' laboratory reports (assessed on a five-point scale ranging from 1 = poor to 5 = excellent). To clearly demonstrate the association among platform selection, students' perceived effectiveness, and their academic performance, a comprehensive Sankey diagram was generated, linking students' evaluations with instructors' assessments. The results clearly show that, among the four LLMs, both DeepSeek and KIMI received the highest levels of overall satisfaction from students. Meanwhile, Doubao and ChatGLM received moderate evaluations. Students reporting higher satisfaction with LLMs generally achieved higher grades, suggesting alignment between self-reported evaluations and objective performance. Conversely, students who did not use LLM tools gave lower evaluations and received lower lab grades, indicating that LLM use may positively influence both learning outcomes and student satisfaction (Figure 3).

Figure 3
Sankey diagram illustrating the distribution of reviews from five sources: DeepSeek, Doubao, KIMI, ChatGLM, and NA, to two categories: Self and Instructor, with scores ranging from 2 to 5. Flow thickness represents the volume of reviews in each category.

Figure 3. Overall assessment of LLM use in integrating bioethical principles into undergraduate medical education. Sankey diagram with three vertical columns: selected platforms (Left), students' self-evaluation of platform effectiveness (Middle), and teachers' assessment scores of the students' lab reports (Right).

In addition to students' self-assessments of their own report performance and instructor grading of laboratory reports, we conducted one-way ANOVA with Tukey's post-hoc tests to analyze two overall indicators: students' overall evaluation of the cell biology laboratory course and their overall assessment of the effectiveness of LLM-assisted bioethical medical education. The statistical results and relevant metrics are presented in below (Figure 4 and Tables 35).

Figure 4
Two bar charts compare evaluations. The left chart titled “Evaluation of medical laboratory course” shows NA, DeepSeek, Doubao, KIMI, and ChatGLM, with ratings from 2.5 to 3.5 and statistical significance markers. The right chart titled “Evaluation of the effectiveness of LLM-Assisted bioethical medical education” shows DeepSeek, Doubao, KIMI, and ChatGLM, with ratings from 3.0 to 4.0 and significance markers. Each bar includes a number indicating the sample size.

Figure 4. Pooled evaluation of medical laboratory course and effectiveness of LLM-assisted bioethical medical education among all participants (one-way ANOVA with post-hoc Tukey's test, *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001).

Table 3
www.frontiersin.org

Table 3. Summary of the statistical metrics.

Table 4
www.frontiersin.org

Table 4. Pooled evaluation of the medical laboratory course among students from three majors.

Table 5
www.frontiersin.org

Table 5. Pooled evaluation of the effectiveness of LLM-assisted bioethical medical education among students from three majors.

To further determine whether students from different majors hold differing overall views regarding the effectiveness of LLMs in facilitating the integration of bioethical principles into the cell biology laboratory course, we analyzed and compared the evaluations from students in the three majors separately. The detailed assessment scores are presented in Table 6 and are consistent with the overall evaluation patterns shown in the Sankey diagram (Figure 3).

Table 6
www.frontiersin.org

Table 6. Assessment of the effectiveness of the four LLMs by undergraduate medical students from three different majors in incorporating bioethics education into the cell biology laboratory course.

Students' evaluation of their mastery of bioethical skills in medical learning process

In this study, the medical cell biology laboratory session focused on “examining the chromosome of the mouse sperms.” During Phase I, the instructor introduced the course objectives, background knowledge, bioethical principles, and the experimental protocol. In Phase II, students performed the experiment and have the option to choose whether to use LLMs. Those using LLMs could search protocols, clarify critical steps, review relevant bioethical principles, and address concerns related to animal handling, including mouse anesthesia and tissue dissection. All students who agree to participate in this study were required to complete data collection, a lab report, and a questionnaire evaluating the learning approach (LLMs-assisted vs. non-assisted). The questionnaire was designed around five dimensions of bioethics-centered laboratory education: (1) integrating humanistic values into experimental design, (2) applying ethical principles to guide strategy selection, (3) collecting samples responsibly, (4) minimizing potential harm, and (5) balancing scientific rigor with respect for life. Students self-rated the extent to which using (or not using) LLMs supported each dimension while studying instructional materials on bioethical policies, such as the “Replacement, Reduction, Refinement (3Rs)” animal-use principles, and related ethical guidelines. According to the questionnaire results, KIMI received the highest overall rating from students. With strong academic text comprehension and effective handling of both Chinese and English scholarly sources, KIMI offered well-structured and detailed information across all five core topics. Its ability to display reference sources alongside the generated content further facilitated students' use of relevant information. In comparison, DeepSeek, Doubao and ChatGLM provided more limited content, likely due to less extensive or less multilingual training data. Overall, all four LLM-assisted groups scored higher than the non-LLM group. A summary of the results and statistic metrics are presented in Tables 7, 8.

Table 7
www.frontiersin.org

Table 7. Structure and internal consistency of the bioethical competency self-assessment.

Table 8
www.frontiersin.org

Table 8. Students' self-assessment of efficacy of LLMs in improving ethical competencies in experimental medicine.

Participants' perceived attitudes and concerns about LLMs in bioethics-oriented medical education

Students were then asked to evaluate the content quality and overall effectiveness of the LLMs-assisted bioethics-centered approach in the biomedical laboratory course. Results showed that students generally favored this instructional method and provided positive feedback on the LLM-assisted sessions. At the same time, students expressed concerns about the use of LLMs in bioethics-related medical education, centering on three issues: uncertainty when addressing complex biomedical questions, bias in interpreting bioethical principles, and the potential reduction of independent thinking. Specifically, students reported that Doubao often generated responses without proper citation, and ChatGLM relied heavily on non-peer-reviewed sources such as Baidu (a widely-used Chinese search engine) and ZhiHu (a well-known Chinese forum), which heightened concerns about accuracy and objectivity. In contrast, KIMI distinguished itself by providing responses supported by academic journals, scholarly forums, and knowledge platforms, which increased students' confidence in the platform's reliability and academic value in integrating bioethical principles into undergraduate medical training (Table 9).

Table 9
www.frontiersin.org

Table 9. Students' perspective on LLMs in ethics-oriented undergraduate medical education.

Comparing LLM performance on students' representative questions about bioethical principles in medical education

To compare the generated content and the feature of each LLMs in the context of medical cell biology laboratory course, we selected several representative questions from students and asked each LLM to provide informed responses. The first question addressed bioethical policies, the second examined the balance between scientific objectives and respect for life, the third explored professional values in a medical career, the fourth investigated emotional reflection in clinical challenges, and the fifth focused on educational innovation. All four LLMs provided basic coverage of these topics. Notably, KIMI, trained on a multilingual corpus with diverse literature and scientific resources, consistently produced detailed, structure, and contextually solid responses, often accompanied by references. These features enhance transparency, traceability, and academic rigor, highlighting KIMI's potential to support advanced bioethics-centered medical education. Detailed responses from each LLM are presented below (Table 10).

Table 10
www.frontiersin.org

Table 10. Typical questions posted by students in the medical laboratory course and corresponding responses from each LLM.

Discussion

Compared with existing studies, our results indicate that among the most robust and widely used LLMs in China, KIMI and DeepSeek demonstrate favorable performance in integrating bioethical principles into the cell biology curriculum. These findings highlight the potential of these platforms not only to enhance bioethics-centered learning in this context but also to inform broader applications and adoption across undergraduate medical education (see Figure 4 and Table 6).

On one hand, LLMs rely on the breadth and diversity of their training data, and their ability to generate accurate and contextually relevant outputs depdns on the quality of these datasets. Current studies have mainly focused on one of the most powerful LLMs, ChatGPT; however, educational studies on LLMs trained on Chinese-language databases remain limited, and their suitability for routine use in China is not well-established (29). On the other hand, due to the restricted accessibility of ChatGPT in China, there is an urgent need to develop and provide LLM alternatives that are better suited for localized use in China (30). Specifically, identifying the most effective LLMs model for undergraduate medical training in China could prominently enhance learning quality in multiple realms.

Although prior studies have mainly focused on LLMs-assisted enhancement of medical knowledge, relatively little attention has been given to integrating medical humanities, bioethical principles, and life care into foundational curricula (31, 32). Early exposure to medical ethics benefits first-year students by providing conceptual grounding and emotional preparedness for future challenges (33). Given their reliance on instructor guidance, we implemented an approach combining traditional teaching with LLM-assisted bioethics learning, allowing students to deepen understanding and explore case-specific solutions during experiments, thereby enhancing personalized learning and outcomes. Systematic investigation of China-developed LLMs in integrating bioethical awareness remains scarce. This study provides empirical evidence and practical insights on the potential of Chinese LLM platforms to embed bioethical principles into foundational medical education. Trained on extensive Chinese and English datasets related to bioethics, the four LLMs evaluated in this study demonstrated strong performance in providing effective references that helped undergraduate students develop bioethical sensitivity during medical training.

Beyond the integration of bioethics into the curriculum, students face ethically and emotionally demanding experiences in laboratory and clinical settings. Earlier research on bioethical education has mainly focused on the ethical treatment of experimental subjects. However, students themselves may experience moral distress, ethical stress, and psychological burden in ethical decision-making, manifesting as anxiety, guilt, fear, or internal conflict (34). In this study, many students exhibited such responses when handling experimental animals for the first time, particularly when required to sacrifice them for sample collection. These reactions mirror challenges they may later face in high-stakes clinical settings, such as critical care or end-of-life situations. We therefore explored whether LLMs could guide students through these ethically and psychologically challenging scenarios, alleviate moral and ethical distress, and foster resilience. Trained on extensive Chinese and English datasets related to bioethics, the four LLMs evaluated in this study demonstrated strong performance in providing effective references that helped students develop bioethical sensitivity, offering valuable empirical and practical support for integrating ethics into foundational medical education.

Previous studies have found it challenging to address the potential drawbacks of applying LLMs in the context of medical education in China (35, 36). To fill this gap, we collected students' opinions on the limitations they encountered while using LLMs in laboratory coursework and proposed several strategies to mitigate these issues. The main concerns reported were related to academic uncertainty, academic bias, and impairment of independent thinking. To address these concerns, we suggest the following approaches. First, students should verify the information provided by LLMs by cross-checking it with original academic sources to ensure accuracy and reliability. For example, there is broad international consensus that germline editing of human embryos remains ethically unacceptable under current conditions (37). By contrast, somatic gene-editing therapies targeting adult patients, such as base-editing therapies for transfusion-dependent β-thalassemia in China (38) and CRISPR-based therapies approved by the U.S. Food and Drug Administration (39), have different ethical, regulatory, and clinical considerations depending on the country. Students must therefore interpret LLM-generated information in the context of local policies and regulations rather than adopt it uncritically. Second, students should be encouraged to participated in discussions and exchange viewpoints on specific scientific questions, thereby fostering their independent thinking skills. Prior research has demonstrated that proactively engaging in research activities enhances both students' interest in the subject matter and their critical thinking skills (40, 41). Therefore, through this approach, LLMs can stimulate creative thinking rather than hinder it, helping students learn how to apply bioethical principles in the medical field, and providing conceptual and psychological preparation for the students' future role in developing these principles.

Building on these findings, and as one of the most distinctive contribution of this study, we implemented a closed-loop, teacher-student-LLM research model integrating instruction, active student engagement, and bilateral evaluation. This approach not only provides a comprehensive understanding of the students' learning process but also allows bioethics-centered education. Building on the class-based data and student feedback obtained through this model, teachers are thus better positioned to further refine both the methodology and practical implementation of the educational reform project. Specifically, LLMs-assisted instruction can be designed to help students navigate ethical issues that arise during their study of basic medical courses, offering timely, accurate, inclusive, and personalized support. This strategy not only strengthens the overall effectiveness of the closed-loop educational system but also aligns with recent perspectives in the field of LLMs-assisted education, which highlight the importance of integrating continuous feedback mechanisms and promoting equitable and high-quality learning experiences (42).

Strength and limitations

This study has several notable strengths. First, it rigorously investigates how mainstream LLMs can support bioethics-oriented education in undergraduate medical curricula. Second, it involved students from three representative medical majors—Medical Imaging Technology, Medical Prevention, and Health Surveillance and Quarantine—broadly reflecting the wider medical student population. Third, given medicine's specialization and rapid evolution, the study provides a foundation for applying LLMs in ethics-focused biomedical education and clinical training, potentially fostering well-rounded professionals with advanced competencies.

However, several limitations of the study should be acknowledged. The selected majors had relatively small class sizes, which restricted the number of participating students. Only four LLM platforms—DeepSeek, Doubao, KIMI, and ChatGLM—were evaluated, reflecting limitations in availability and cohort size. In addition, the rapid iteration of LLMs means that the findings should be considered preliminary, providing initial insights into incorporating life-centered perspectives in Chinese undergraduate medical education. On one side, this study is limited by its quasi-experimental design with student self-selection. Individual characteristics, such as motivation, prior experience with digital tools, or attitudes toward LLM-assisted learning, may have influenced outcomes. Baseline data indicate that participants across the three majors were broadly comparable in academic background, course exposure, demographics, and pre-course interest and experience with LLMs, which may partially mitigate this concern. Nevertheless, unmeasured confounding cannot be fully excluded, and future randomized or matched designs would strengthen causal inference. On the other side, selection bias and limited generalizability remain concerns, as all participants were drawn from a single institution and a single cell biology laboratory course. In addition, all students shared the same learning environment, making informal interactions across groups possible, which could have influenced engagement. Future studies could explore multi-institutional samples or separate course sections to address these issues. Furthermore, performance bias may have occurred because students were aware of their group assignments and access to LLMs, potentially affecting motivation or expectations. Moreover, measurement bias is possible, as learning outcomes and ethical understanding were assessed through self-reported questionnaires, which are susceptible to social desirability and response expectancy effects. Incorporating objective or performance-based assessments would enhance robustness in future research. In addition, while some differences were observed among groups using different LLMs, the study was not designed for definitive comparisons. These observations should be interpreted cautiously as preliminary insights into how structured LLM use may support engagement with bioethical topics in laboratory-based medical education.

These findings suggest that LLM-assisted approaches have the potential to support bioethical learning in undergraduate medical courses, while remaining exploratory and context-specific. Despite limitations in sample size, platform scope, and the rapid evolution of LLMs, the results offer preliminary guidance for integrating life-centered perspectives into medical education. Further studies with larger cohorts and broader disciplinary coverage are needed to validate and extend these insights.

Conclusion

LLM-assisted technologies may offer a supportive role in undergraduate medical education by complementing traditional instruction and engaging students with complex ethical issues. This study explored the use of four widely used LLM platforms in China (DeepSeek, Doubao, KIMI, and ChatGLM) within a basic medical laboratory course, focusing on their potential contribution to bioethical understanding, ethical skill development, and students' perceptions of LLMs in medical learning contexts. The findings suggest that LLM-assisted learning can be integrated into laboratory-based instruction and may help facilitate students' engagement with bioethical topics. While some platforms, such as KIMI and DeepSeek, were associated with relatively more positive outcomes in this specific educational setting, these observations should be interpreted cautiously within the exploratory scope of the study. The results are not intended to indicate the superiority of any particular tool, but rather to provide preliminary insights into how structured use of LLMs might support ethical reflection and learning processes in medical education. Overall, this exploratory educational study provides initial evidence that LLM-assisted approaches can enrich bioethics-related learning experiences in basic medical training. The findings are context-specific and have limited generalizability beyond the institution and the four Chinese LLM platforms examined. Further research using larger samples, more rigorous designs, and diverse educational contexts is needed to better understand the potential benefits and long-term value of LLM-supported instruction. Such efforts may inform the responsible and pedagogically sound integration of LLM technologies into future medical curricula.

Data availability statement

The datasets generated or analyzed in this study are available from the corresponding author on reasonable request.

Ethics statement

The animal study was approved by Biomedical Research Ethics Committee of Fujian Medical University. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

YW: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by grants from the National Natural Science Foundation of China (Grant No. 82101482 to YW), Natural Science Foundation of Fujian Province, China (Grant No. 2024J01488 to YW), Fujian Medical University Graduate education and teaching innovation program (Grant No. Y23014 to YW), and Fujian Medical University high-level talents scientific research start-up funding project (Grant No. XRCZX2018010 to YW).

Acknowledgments

The author would like to acknowledge the participants who took part in this study and express sincere thanks to the members of the neuroscience research group at Fujian Medical University.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Moorlock G, Deans Z, Trimble M. Assessing ethics and law in medical schools: there is no single best answer. J Med Ethics. (2025) 51:684–8. doi: 10.1136/jme-2024-110298

PubMed Abstract | Crossref Full Text | Google Scholar

2. Allana AA, Ali SK, Ghias K. Bioethics curriculum for undergraduate medical students: an evaluation study utilizing mixed methods approach. BMC Med Educ. (2024) 24:385. doi: 10.1186/s12909-024-05376-4

PubMed Abstract | Crossref Full Text | Google Scholar

3. Dowie A. Ethical sense, medical ethics education, and maieutics. Med Teach. (2023) 45:838–44. doi: 10.1080/0142159X.2023.2178885

PubMed Abstract | Crossref Full Text | Google Scholar

4. Ko YK, Cho C, Sun S, Ngan OM, Chan HY. Moral sensitivity and academic ethical awareness of nursing and medical students: a cross-sectional survey. Nurs Ethics. (2024) 31:1499–512. doi: 10.1177/09697330241226604

PubMed Abstract | Crossref Full Text | Google Scholar

5. Hlaing PH, Hasswan A, Salmanpour V, Shorbagi S, AlMahmoud T, Jirjees FJ, et al. Health professions students' approaches towards practice-driven ethical dilemmas; a case-based qualitative study. BMC Med Educ. (2023) 23:307. doi: 10.1186/s12909-023-04089-4

PubMed Abstract | Crossref Full Text | Google Scholar

6. DeFoor MT, Chung Y, Zadinsky JK, Dowling J, Sams RW II. An interprofessional cohort analysis of student interest in medical ethics education: a survey-based quantitative study. BMC Med Ethics. (2020) 21:26. doi: 10.1186/s12910-020-00468-4

PubMed Abstract | Crossref Full Text | Google Scholar

7. Noda Y. The new ethics of neuromodulation with transcranial magnetic stimulation: a critical appraisal. J Integr Neurosci. (2024) 23:112. doi: 10.31083/j.jin2306112

PubMed Abstract | Crossref Full Text | Google Scholar

8. González-Blázquez FJ, Ruiz-Hontangas A, López-Mora C. Bioethical knowledge in students and health professionals: a systematic review. Front Med (Lausanne). (2024) 11:1252386. doi: 10.3389/fmed.2024.1252386

PubMed Abstract | Crossref Full Text | Google Scholar

9. Bowman-Smart H, Savulescu J, Gyngell C, Mand C, Delatycki MB. Sex selection and non-invasive prenatal testing: a review of current practices, evidence, and ethical issues. Prenat Diagn. (2020) 40:398–407. doi: 10.1002/pd.5555

PubMed Abstract | Crossref Full Text | Google Scholar

10. Gallois H, Ravitsky V, Roy MC, Laberge AM. Defining ethical criteria to guide the expanded use of Noninvasive Prenatal Screening (NIPS): lessons about severity from preimplantation genetic testing. Eur J Hum Genet. (2025) 33:167–75. doi: 10.1038/s41431-024-01714-8

PubMed Abstract | Crossref Full Text | Google Scholar

11. Locatelli F, Thompson AA, Kwiatkowski JL, Porter JB, Thrasher AJ, Hongeng S, et al. Betibeglogene autotemcel gene therapy for non-β0/β0 genotype β-thalassemia. N Engl J Med. (2022) 386:415–27. doi: 10.1056/NEJMoa2113206

PubMed Abstract | Crossref Full Text | Google Scholar

12. Frangoul H, Altshuler D, Cappellini MD, Chen YS, Domm J, Eustace BK, et al. CRISPR-Cas9 gene editing for sickle cell disease and β-thalassemia. N Engl J Med. (2021) 384:252–60. doi: 10.1056/NEJMoa2031054

PubMed Abstract | Crossref Full Text | Google Scholar

13. Chen Q, Ma Y, Labude M, Schaefer GO, Xafis V, Mills P. Making sense of it all: ethical reflections on the conditions surrounding the first genome-edited babies. Wellcome Open Res. (2021) 5:216. doi: 10.12688/wellcomeopenres.16295.2

PubMed Abstract | Crossref Full Text | Google Scholar

14. Biller-Andorno N, Ferrario A, Joebges S, Krones T, Massini F, Barth P, et al. support for ethical decision-making around resuscitation: proceed with care. J Med Ethics. (2022) 48:175–83. doi: 10.1136/medethics-2020-106786

Crossref Full Text | Google Scholar

15. Willard J. Structuring bioethics education: the question, the disciplines, and the integrative challenge. Ethics Soc Welf. (2015) 9:280–96. doi: 10.1080/17496535.2015.1005555

Crossref Full Text | Google Scholar

16. AlMahmoud T, Hashim MJ, Elzubeir MA, Branicki F. Ethics teaching in a medical education environment: preferences for diversity of learning and assessment methods. Med Educ Online. (2017) 22:1328257. doi: 10.1080/10872981.2017.1328257

PubMed Abstract | Crossref Full Text | Google Scholar

17. Khakpaki A. Advancements in artificial intelligence transforming medical education: a comprehensive overview. Med Educ Online. (2025) 30:2542807. doi: 10.1080/10872981.2025.2542807

PubMed Abstract | Crossref Full Text | Google Scholar

18. Janumpally R, Nanua S, Ngo A, Youens K. Generative artificial intelligence in graduate medical education. Front Med (Lausanne). (2025) 11:1525604. doi: 10.3389/fmed.2024.1525604

PubMed Abstract | Crossref Full Text | Google Scholar

19. Zhang H, Lee I, Ali S, DiPaola D, Cheng Y, Breazeal C. Integrating ethics and career futures with technical learning to promote ai literacy for middle school students: an exploratory study. Int J Artif Intell Educ. (2022) 9:1–35. doi: 10.1007/s40593-022-00293-3

PubMed Abstract | Crossref Full Text | Google Scholar

20. Sanri E. Beyond metropolises: artificial intelligence awareness and educational needs among medical students in a developing country. Front Med (Lausanne). (2025) 12:1645484. doi: 10.3389/fmed.2025.1645484

PubMed Abstract | Crossref Full Text | Google Scholar

21. Chen Y. Evaluation of the impact of AI-driven personalized learning platform on medical students' learning performance. Front Med (Lausanne). (2025) 12:1610012. doi: 10.3389/fmed.2025.1610012

PubMed Abstract | Crossref Full Text | Google Scholar

22. Crotty E, Singh A, Neligan N, Chamunyonga C, Edwards C. Artificial intelligence in medical imaging education: recommendations for undergraduate curriculum development. Radiography (Lond). (2024) 30(Suppl. 2):67–73. doi: 10.1016/j.radi.2024.10.008

PubMed Abstract | Crossref Full Text | Google Scholar

23. Nawaz H, Petraro PV, Via C, Ullah S, Lim L, Wild D, et al. Lifestyle medicine curriculum for a preventive medicine residency program: implementation and outcomes. Med Educ Online. (2016) 21:29339. doi: 10.3402/meo.v21.29339

PubMed Abstract | Crossref Full Text | Google Scholar

24. Zhai J, Zhang H. Critical thinking disposition of medical students in Anhui Province, China: a cross-sectional investigation. BMC Med Educ. (2023) 23:652. doi: 10.1186/s12909-023-04646-x

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wu J, Wang Z, Qin Y. Performance of DeepSeek-R1 and ChatGPT-4o on the Chinese national medical licensing examination: a comparative study. J Med Syst. (2025) 49:74. doi: 10.1007/s10916-025-02213-z

PubMed Abstract | Crossref Full Text | Google Scholar

26. Li L, Du P, Huang X, Zhao H, Ni M, Yan M, et al. Comparative analysis of generative artificial intelligence systems in solving clinical pharmacy problems: mixed methods study. JMIR Med Inform. (2025) 13:e76128. doi: 10.2196/76128

PubMed Abstract | Crossref Full Text | Google Scholar

27. Team K, Du A, Gao B, Xing B, Jiang C, Chen C, et al. Kimi k1.5: scaling reinforcement learning with LLMs. arXiv preprint arXiv:2501.12599. (2025). doi: 10.48550/arXiv.2501.12599

Crossref Full Text | Google Scholar

28. Du Z, Qian Y, Liu X, Ding M, Qiu J, Yang Z, et al. GLM: general language model pretraining with autoregressive blank infilling. arXiv preprint arXiv:2103.10360. (2021). doi: 10.48550/arXiv.2103.10360

Crossref Full Text | Google Scholar

29. Hung J, Chen J. The benefits, risks and regulation of using ChatGPT in Chinese academia: a content analysis. Soc Sci (Basel). (2023) 12:380. doi: 10.3390/socsci12070380

Crossref Full Text | Google Scholar

30. Xie Q, Li M, Cheng F. Between regulation and accessibility: how Chinese university students navigate global and domestic generative AI. arXiv preprint arXiv:2506.14377. (2025). doi: 10.1080/14767724.2025.2598275

Crossref Full Text | Google Scholar

31. Li Q, Qin Y. AI in medical education: medical student perception, curriculum recommendations and design suggestions. BMC Med Educ. (2023) 23:852. doi: 10.1186/s12909-023-04700-8

PubMed Abstract | Crossref Full Text | Google Scholar

32. Okamoto S, Kataoka M, Itano M, Sawai T. AI-based medical ethics education: examining the potential of large language models as a tool for virtue cultivation. BMC Med Educ. (2025) 25:185. doi: 10.1186/s12909-025-06801-y

PubMed Abstract | Crossref Full Text | Google Scholar

33. Saito Y, Kudo Y, Shibuya A, Satoh T, Higashihara M, Aizawa Y. Building medical ethics education to improve Japanese medical students' attitudes toward respecting patients' rights. Tohoku J Exp Med. (2011) 224:307–15. doi: 10.1620/tjem.224.307

PubMed Abstract | Crossref Full Text | Google Scholar

34. Neufeld-Kroszynski G, Michael K, Karnieli-Miller O. Associations between medical students' stress, academic burnout and moral courage efficacy. BMC Psychol. (2024) 12:296. doi: 10.1186/s40359-024-01787-6

PubMed Abstract | Crossref Full Text | Google Scholar

35. Yao Z, Duan L, Xu S, Chi L, Sheng D. Performance of large language models in the non-English context: qualitative study of models trained on different languages in chinese medical examinations. JMIR Med Inform. (2025) 13:e69485. doi: 10.2196/69485

PubMed Abstract | Crossref Full Text | Google Scholar

36. Leng L. Challenge, integration, and change: ChatGPT and future anatomical education. Med Educ Online. (2024) 29:2304973. doi: 10.1080/10872981.2024.2304973

PubMed Abstract | Crossref Full Text | Google Scholar

37. Wiley L, Cheek M, LaFar E, Ma X, Sekowski J, Tanguturi N, et al. The ethics of human embryo editing via CRISPR-Cas9 technology: a systematic review of ethical arguments, reasons, and concerns. HEC Forum. (2025) 37:267–303. doi: 10.1007/s10730-024-09538-1

PubMed Abstract | Crossref Full Text | Google Scholar

38. Wang L, Li P, Wang Y, Lan K, Zheng H, Zhu D, et al. Development of best-in-class gene editing therapy for β-hemoglobinopathies using innovative transformer Base Editor (tBE). Blood. (2023) 142:5243. doi: 10.1182/blood-2023-178440

Crossref Full Text | Google Scholar

39. Parums DV. Editorial: First regulatory approvals for CRISPR-Cas9 therapeutic gene editing for sickle cell disease and transfusion-dependent β-thalassemia. Med Sci Monit. (2024) 30:e944204. doi: 10.12659/MSM.944204

PubMed Abstract | Crossref Full Text | Google Scholar

40. Cheng X, Ka Ho Lee K, Chang EY, Yang X. The “flipped classroom” approach: stimulating positive learning attitudes and improving mastery of histology among medical students. Anat Sci Educ. (2017) 10:317–27. doi: 10.1002/ase.1664

PubMed Abstract | Crossref Full Text | Google Scholar

41. Zhou K, Cao C, Liu X, Sun M, Wu Z, Zheng W, et al. The impact of exposure to scientific research and inclusive mentoring style on medical undergraduates' perceptions of critical thinking, communication, and passion. Med Educ Online. (2025) 30:2535406. doi: 10.1080/10872981.2025.2535406

PubMed Abstract | Crossref Full Text | Google Scholar

42. Magzoub ME, Zafar I, Munshi F, Shersad F. Ten tips to harnessing generative AI for high-quality MCQS in medical education assessment. Med Educ Online. (2025) 30:2532682. doi: 10.1080/10872981.2025.2532682

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: artificial intelligence, bioethics, large language model, medical cell biology laboratory course, undergraduate medical education

Citation: Wang Y (2026) Integrating large language models into medical undergraduate laboratory course to enhance bioethical competence: a quasi-experimental study. Front. Med. 12:1745975. doi: 10.3389/fmed.2025.1745975

Received: 13 November 2025; Revised: 17 December 2025;
Accepted: 22 December 2025; Published: 05 February 2026.

Edited by:

Tong Wu, Huazhong University of Science and Technology, China

Reviewed by:

Cristian Vacacela Gomez, National Laboratory of Frascati (INFN), Italy
Judith Francisco-Pérez, Pontificia Universidad Católica del Ecuador, Ecuador

Copyright © 2026 Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yue Wang, eXVld2FuZ0Bmam11LmVkdS5jbg==; d2FuZ3l1ZWNobkAxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.