Your new experience awaits. Try the new design now and help us make it even better

METHODS article

Front. Robot. AI

Sec. Robotic Control Systems

Volume 12 - 2025 | doi: 10.3389/frobt.2025.1589025

This article is part of the Research TopicModel-Free Adaptive Control of Uncertain Autonomous SystemsView all articles

Exploiting the Kumaraswamy Distribution in a Reinforcement Learning Context

Provisionally accepted
Davide  PicchiDavide Picchi1,2*Sigrid  Brell-CokcanSigrid Brell-Cokcan1
  • 1Chair for Individualized Production, Aachen, Germany
  • 2RWTH Aachen University, Aachen, North Rhine-Westphalia, Germany

The final, formatted version of the article will be published soon.

Mini cranes play a pivotal role in construction due to their versatility across numerous scenarios. Recent advancements in Reinforcement Learning (RL) have enabled agents to operate cranes in virtual environments for predetermined tasks, paving the way for future real-world deployment. Traditionally, most RL agents use a squashed Gaussian distribution to select actions. In this study, we investigate a mini-crane scenario that could potentially be fully automated by AI and explore replacing the Gaussian distribution with the Kumaraswamy distribution, a close relative of the Beta distribution, for action stochastic selection. Our results indicate that the Kumaraswamy distribution offers computational advantages while maintaining robust performance, making it an attractive alternative for RL applications in continuous control applications.

Keywords: machine learning, crane, Construction, reinforcement learning, Kumaraswamy distribution

Received: 06 Mar 2025; Accepted: 22 Sep 2025.

Copyright: © 2025 Picchi and Brell-Cokcan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Davide Picchi, picchi@ip.rwth-aachen.de

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.