Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Medicine and Public Health

This article is part of the Research TopicGenerative AI for Enhanced Predictive Models: From Disease Diagnosis to Diverse ApplicationsView all 4 articles

Development and Validation of a Multi-Agent AI Pipeline for Automated Credibility Assessment of Tobacco Misinformation: A Proof-of-Concept Study

Provisionally accepted
  • University of Bath, Bath, United Kingdom

The final, formatted version of the article will be published soon.

Abstract Background The proliferation of tobacco-related misinformation poses significant public health risks, requiring scalable solutions for credibility assessment. Traditional manual fact-checking approaches are resource-intensive and cannot match the pace of misinformation spread. Objective To develop and validate a proof-of-concept multi-agent AI pipeline for automated credibility assessment of tobacco misinformation claims, evaluating its performance against expert human reviewers. Methods We constructed a three-agent pipeline using OpenAI GPT-4.1 and the Crewai framework. The Serper API provided real-time evidence retrieval. The Content Analyzer classifies claims into four types: health impact, scientific assertion, policy, or statistical. The Scientific Fact Verifier queries authoritative sources (WHO, CDC, PubMed Central, Cochrane). The Health Evidence Assessor applies weighted scoring across five dimensions to assign 0–100 credibility scores on a five-level scale. Results The framework achieved an MAE of 6.25 points against expert scores, a weighted Cohen's κ of 0.68 (95% CI: 0.52–0.84) indicating substantial agreement, 70% exact category agreement, 95% adjacent-level agreement, and processed each claim in under seven seconds—over 1,000× faster than manual review. Limitations We validated our approach using 20 diverse tobacco claims through intensive expert review (2-4 hours per claim). The system exhibited a conservative bias (+3.25 points, p = 0.03) and did not classify any claims as "Highly Unlikely" despite expert assignment of two claims to this category. This proof-of-concept demonstrates technical feasibility and substantial inter-rater agreement while identifying areas for calibration in future large-scale implementations. Conclusions Our proof-of-concept agentic AI pipeline demonstrates substantial agreement with expert assessments of tobacco-related claims while providing dramatic speed improvements. By combining zero-shot LLM reasoning, retrieval-grounded evidence verification, and a transparent five-level scoring schema, the system offers a practical tool for real-time misinformation monitoring in public health. This proof-of-concept establishes technical feasibility for automated tobacco misinformation assessment, with validation results supporting further development and larger-scale testing before operational deployment.

Keywords: tobacco misinformation, multi-agent AI pipeline, Large language models, Automated fact-checking, Credibility assessment, expert validation, Public Health Informatics, Retrieval-Augmented Generation

Received: 04 Jul 2025; Accepted: 24 Nov 2025.

Copyright: © 2025 Elmitwalli, Mehegan, Braznell and Gallagher. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Sherif Elmitwalli

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.