BRIEF RESEARCH REPORT article

Front. Educ.

Sec. Digital Education

Volume 10 - 2025 | doi: 10.3389/feduc.2025.1601789

This article is part of the Research TopicTeaching and Assessing with AI: Teaching Ideas, Research, and ReflectionsView all 10 articles

RESEARCH-AI: AIccuracy-AI-Teacher Agreement in Evaluating Learning Diaries

Provisionally accepted
Lhea  ReinholdLhea Reinhold1*Marion  HändelMarion Händel2Nick  Naujoks-SchoberNick Naujoks-Schober2
  • 1University of Erlangen Nuremberg, Erlangen, Germany
  • 2Ansbach University of Applied Sciences, Ansbach, Bavaria, Germany

The final, formatted version of the article will be published soon.

Learning diaries are reflective tools, often used as formative assessments in adult education with the aim to promote cognitive and metacognitive learning strategies. As grading of and feedback on learning diaries is effortful for teachers, artificial intelligence (AI) may assist teachers in evaluating learning diaries. A prerequisite is that AI's ratings show high accordance with the teachers' ratings. AI accuracy, measured via absolute accuracy and bias, is the focus of the current study with N = 540 learning diary entries focusing on learning strategies, seven teachers, and ChatGPT-4o. Findings revealed that AI evaluations align closely with teacher assessments, indicated by high overall accuracy and low bias. Interestingly, the accuracy varied based on the types of learning strategies assessed in the diaries. Additionally, individual teacher assessments influenced the alignment between human and AI evaluations, suggesting that teachers applied their profession-specific expertise to the assessment process while AI produced somewhat generic evaluations. Overall, the study results indicate that AI can enhance the efficiency of formative assessments while providing timely feedback to learners.

Keywords: Learning diary, AI assessment, AI accuracy, AI-teacher agreement, Adult Education

Received: 28 Mar 2025; Accepted: 09 Jun 2025.

Copyright: © 2025 Reinhold, Händel and Naujoks-Schober. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Lhea Reinhold, University of Erlangen Nuremberg, Erlangen, Germany

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.