Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Gastroenterology

This article is part of the Research TopicThe Emerging Role of Large Language Model Chatbots in Gastroenterology and Digestive EndoscopyView all 7 articles

Integrating Large Language Models into Clinical Pharmacy Education: Applications in Perioperative Medication Management for Gastric Cancer

Provisionally accepted
  • 1First Hospital of Shanxi Medical University, Taiyuan, China
  • 2Shanxi Bethune Hospital, Shanxi Medical University, Taiyuan, China
  • 3Shanxi Bethune Hospital, Taiyuan, China

The final, formatted version of the article will be published soon.

Objective: This study aims to evaluate the performance of ChatGPT-4o and DeepSeek-R1 in perioperative medication therapy management for gastric cancer, assessing their reliability and practicality as auxiliary tools in clinical pharmacy education. Methods: This study utilized a retrospective design to collate issues pertaining to perioperative medication management in gastric cancer, from which a standardized question set was developed. The set was concurrently submitted to both ChatGPT-4o and DeepSeek-R1 to generate model responses. Two independent assessors, blinded to the model sources, evaluated the outputs according to a predefined framework covering three core domains: (1) Clinical applicability, assessed via a 7-point Likert scale; (2) Information quality, evaluated using the DISCERN instrument for evidence reliability and content completeness; and (3) Readability, measured through the Flesch Reading Ease Score (FRES) and the SMOG Index. Results: In the 24-item evaluation of perioperative drug therapy for gastric cancer, both models exhibited high inter-rater reliability, with Cronbach's α values of 0.880 for DeepSeek-R1 and 0.852 for ChatGPT-4o. DeepSeek-R1 demonstrated superior performance in clinical applicability (Likert score: 5.63 ± 0.94 vs. 5.10 ± 0.78, p < 0.001) and information quality (DISCERN score: 54.50 ± 6.71 vs. 50.56 ± 6.08), although neither model reached the excellence threshold (≥65 points). Readability assessment revealed moderately complex text difficulty, with Flesch Reading Ease scores below 30 and SMOG indices indicating a reading level of ≥17 years, which remains appropriate for undergraduate clinical pharmacy education. Conclusion: Both ChatGPT-4o and DeepSeek-R1 have demonstrated potential in addressing issues related to perioperative medication management for gastric cancer, with their generated responses showing good practical Applicability and readability suitable for the clinical pharmacy professional community. However, it should be noted that the quality of information provided by both models does not currently meet professional standards for drug therapy management. Therefore, they can be utilized as auxiliary tools for training the analytical skills of undergraduate students in clinical pharmacy, but their use should be guided by mentors.

Keywords: Large language models, gastric cancer, Perioperative, Medication Therapy Management, Clinical Pharmacy Education

Received: 22 Sep 2025; Accepted: 30 Nov 2025.

Copyright: © 2025 Wang, Luan, Shang, Zhang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Qingqing Li

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.