Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis

Zahedi, Keyan; Martius, Georg; Ay, Nihat

doi:10.3389/fpsyg.2013.00801

ORIGINAL RESEARCH article

Front. Psychol., 04 November 2013

Sec. Cognitive Science

Volume 4 - 2013 | https://doi.org/10.3389/fpsyg.2013.00801

This article is part of the Research Topic "Intrinsic motivations and open-ended development in animals, humans, and robots"
