Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Infectious Diseases: Pathogenesis and Therapy

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1679153

Evaluating AI Performance in Infectious Disease Education: A Comparative Analysis of ChatGPT, Google Bard, Perplexity AI, Microsoft Copilot, and Meta AI

Provisionally accepted
  • 1Department of Clinical Pharmacy, College of Pharmacy, Jouf University, sakaka, Saudi Arabia
  • 2Department of Clinical Pharmacy, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
  • 3Eastern Health Cluster, Dammam, Saudi Arabia
  • 4Department of family and community medicine, college of medicine, Jouf University, Sakaka, Saudi Arabia
  • 5School of Pharmacy, Faculty of Health and Medical sciences, Taylors University, Selangor, Malaysia
  • 6Department of Pharmacy Practice, Faculty of Pharmacy, Sindh University, Jamshoro, Pakistan
  • 7Iqra University - North Campus, Karachi, Pakistan
  • 8Clinical Laboratory Science, Medical Applied College. Jouf University, Sakaka, Saudi Arabia
  • 9Department of Pharmacology, College of Pharmacy, Jouf University, Sakaka, Saudi Arabia
  • 10Department of Pharmaceutics, college of pharmacy, Jouf University, Sakaka, Saudi Arabia

The final, formatted version of the article will be published soon.

Background This study systematically evaluates and compares the performance of ChatGPT 3.5, Google Bard (Gemini), Perplexity AI, Microsoft Copilot, and Meta AI in responding to infectious disease-related multiple-choice questions (MCQs). Methods A systematic comparative study was conducted using 20 infectious disease case studies sourced from Infectious Diseases: A Case Study Approach by Jonathan C. Cho. Each case study included 7–10 MCQs, resulting in a total of 160 questions. AI platforms were provided with standardized prompts containing the case study text and MCQs without additional context. Their responses were evaluated against a reference answer key from the textbook. Accuracy was measured by the percentage of correct responses, and consistency was assessed by submitting identical prompts 24 hours apart. Results ChatGPT 3.5 achieved the highest numerical accuracy (65.6%), followed by Perplexity AI (63.2%), Microsoft Copilot (60.9%), Meta AI (60.8%), and Google Bard (58.8%). AI models performed best in symptom identification (76.5%) and worst in therapy-related questions (57.1%). ChatGPT 3.5 demonstrated strong diagnostic accuracy (79.1%) but had a significant drop in antimicrobial treatment recommendations (56.6%). Google Bard performed inconsistently in microorganism identification (61.9%) and preventive therapy (62.5%). Microsoft Copilot exhibited the most stable responses across repeated testing, while ChatGPT 3.5 showed a 7.5% accuracy decline. Perplexity AI and Meta AI struggled with individualized treatment recommendations, showing variability in drug selection and dosing adjustments. AI-generated responses were found to change over time, with some models giving different antimicrobial recommendations for the same case scenario upon repeated testing. Conclusion AI platforms offer potential in infectious disease education but demonstrate limitations in pharmacotherapy decision-making, particularly in antimicrobial selection and dosing accuracy. ChatGPT 3.5 performed best but lacked response stability, while Microsoft Copilot showed greater consistency but lacked nuanced therapeutic reasoning. Further research is needed to improve AI-driven decision support systems for medical education and clinical applications through clinical trials, evaluation of real-world patient data, and assessment of long-term stability.

Keywords: Infectious Disease, artificial intelligence, ChatGPT, Google Bard, Perplexity AI

Received: 04 Aug 2025; Accepted: 22 Sep 2025.

Copyright: © 2025 Alzarea, Ishaqui, Maqsood, Alanazi, Alsaidan, Mallhi, Kumar, Imran, Alshahrani, Alhassan, Alzarea and Alsaidan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Azfar Ishaqui, azfar.hd@hotmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.