MINI REVIEW article

Front. Neural Circuits

Social learning and exploration–exploitation dilemma in decision-making

  • 1. Hitotsubashi University, Kunitachi, Japan

  • 2. The University of Melbourne, Parkville, Australia

Article metrics

View details

1

Views

The final, formatted version of the article will be published soon.

Abstract

This mini review examines the neurocomputational principles of social learning through the lens of the exploration–exploitation dilemma. While the neural mechanisms of learning from others—mediated by distinct signals in the ventromedial and lateral prefrontal cortices—are well established, less is known about how these mechanisms interact with the fundamental trade-off between gathering information ("exploration") and maximizing rewards ("exploitation"). We discuss how social environments shape this trade-off, leading to strategic behaviors such as informational free-riding or conformity. A central focus of this review is the issue of source selection: how agents decide whom to observe. We present recent evidence suggesting that, contrary to the predictions of optimal information-seeking theories, humans often exhibit a "reliability-seeking" bias, preferring to learn from consistent, exploitation-oriented partners rather than highly exploratory ones. We conclude by discussing the limitations of current paradigms, specifically the inherent confounding of social cues such as competence and predictability, and outline a computational framework for isolating the specific drivers of adaptive social decision-making.

Summary

Keywords

computational model, decision-making, Imitation, observational learning, reinforcement learning, Reward, social cognition

Received

06 January 2026

Accepted

20 February 2026

Copyright

© 2026 Suzuki and Morishita. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shinsuke Suzuki; Gota Morishita

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Share article

Article metrics