BRIEF RESEARCH REPORT article
Front. Vet. Sci.
Sec. Veterinary Humanities and Social Sciences
This article is part of the Research TopicTransforming Veterinary Medicine: Digital Tools and AI as Path to Sustainable Animal CareView all 6 articles
SPEECH RECOGNITION TOOLS FOR VETERINARY CASE LEARNING Enhancing Veterinary Education with Smartphone-Based Transcription and AI Summarization: A Comparative Study of Workflow and Usability
Provisionally accepted- 1Laboratory of Veterinary Physiology, Laboratory of Veterinary Physiology, School of Veterinary Medicine, Nippon Veterinary and Life Science University, Tokyo, Japan
- 2Nihon Jui Seimei Kagaku Daigaku, Musashino, Japan
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract Background: Accurate documentation of clinical teaching sessions is critical, particularly in multilingual contexts. Recent advances in smartphone-based speech recognition and large language models (LLMs) may enhance transcription accuracy, streamline case summarization, and improve usability. However, their comparative performance in veterinary settings remains underexplored. Objectives: This study evaluated the quality, usability, and educational value of smartphone-native transcription compared with Whisper-based transcription and AI-assisted summarization in veterinary ophthalmology education. Methods: Clinical case discussions (n = 5) were recorded and transcribed using (1) iPhone-native speech recognition and (2) the Whisper automatic speech recognition system. Transcripts were further processed into SOAP-format summaries with and without LLM-based summarization. Final-year veterinary students (n = 4) and clinicians (n = 3) evaluated transcripts and summaries using a 5-point Likert scale across readability, accuracy, clinical clarity, and educational utility. Statistical comparisons were performed using Wilcoxon signed-rank tests. SPEECH RECOGNITION TOOLS FOR VETERINARY CASE LEARNING Results: iPhone-native transcription outperformed Whisper in readability, technical accuracy, and clinical flow (p < 0.05). AI-assisted SOAP-format summarization improved clarity and perceived learning value but occasionally introduced minor semantic distortions. Clinicians rated AI-enhanced summaries as more concise and educationally useful than raw transcripts. Both students and clinicians reported reduced cognitive load and usability with smartphone-based transcription workflows. Conclusions: Smartphone-native transcription combined with AI summarization provides a practical and effective workflow for veterinary education. While Whisper offers cross-device flexibility, its current accuracy in multilingual contexts is limited. Integration of smartphone transcription and LLM summarization may improve documentation, comprehension, and student engagement in clinical teaching.
Keywords: Veterinary education, transcription, artificial intelligence, Multilingual learning, usability, Clinical documentation
Received: 21 Aug 2025; Accepted: 04 Nov 2025.
Copyright: © 2025 Yogo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Takuya Yogo, yogo3@nvlu.ac.jp
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.