Beyond Speech: Leveraging Mouse Movements for Information Adaptation in Voice Interfaces

Kontogiorgos, Dimosthenis; Schlangen, David

doi:10.3389/fcomp.2025.1634228

ORIGINAL RESEARCH article

Front. Comput. Sci.

Sec. Human-Media Interaction

Beyond Speech: Leveraging Mouse Movements for Information Adaptation in Voice Interfaces

Provisionally accepted

Dimosthenis Kontogiorgos^1*

David Schlangen²

¹Massachusetts Institute of Technology, Cambridge, United States
²Universitat Potsdam, Potsdam, Germany

The final, formatted version of the article will be published soon.

As human speakers naturally adapt their linguistic styles to one another, voice user interfaces that prompt similar linguistic adaptations can augment human-like interaction. In this study, we leverage a corpus of human instructions to model the effectiveness of incremental instruction generation in artificial agents. Participants interacted with agents that guided them in selecting virtual puzzle pieces, varying the amount of information provided in each instruction. Through an empirical examination of the Gricean maxims in utterance construction, our initial perception study highlighted the significance of adaptive instruction generation. By employing mouse movements as a proxy for user understanding, we developed computational models that enabled agents to detect user uncertainty and refine instructions incrementally. Comparing speaker-based and listener-based models, we found that agents encouraging linguistic adaptations were preferred by users. Our findings offer new insights into the value of mouse movements as indicators of user comprehension and introduce a methodological framework for developing adaptive interactive systems that generate instructions dynamically.

Keywords: Instructions, Language production, mouse tracking, common ground, adaptive systems, incremental instruction, voice userinterfaces, conversational agents

Received: 23 May 2025; Accepted: 03 Nov 2025.

Copyright: © 2025 Kontogiorgos and Schlangen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Dimosthenis Kontogiorgos, kontogiorgos@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.