Active-sensing-based decentralized control of autonomous mobile agents for quick and smooth collision avoidance

Kano, Takeshi; Kanno, Takeru; Mikami, Taishi; Ishiguro, Akio

doi:10.3389/frobt.2022.992716

ORIGINAL RESEARCH article

Front. Robot. AI, 11 November 2022

Sec. Multi-Robot Systems

Volume 9 - 2022 | https://doi.org/10.3389/frobt.2022.992716

This article is part of the Research TopicUnderstanding and Engineering Cyber-Physical CollectivesView all 8 articles

Active-sensing-based decentralized control of autonomous mobile agents for quick and smooth collision avoidance

Takeshi Kano¹*

Takeru Kanno^1,2

Taishi Mikami^1,2

Akio Ishiguro¹

¹Research Institute of Electrical Communication, Tohoku University, Sendai, Japan
²Graduate School of Engineering, Tohoku University, Sendai, Japan

There is an increasing demand for multi-agent systems in which each mobile agent, such as a robot in a warehouse or a flying drone, moves toward its destination while avoiding other agents. Although several control schemes for collision avoidance have been proposed, they cannot achieve quick and safe movement with minimal acceleration and deceleration. To address this, we developed a decentralized control scheme that involves modifying the social force model, a model of pedestrian dynamics, and successfully realized quick, smooth, and safe movement. However, each agent had to observe many nearby agents and predict their future motion; that is, unnecessary sensing and calculations were required for each agent. In this study, we addressed this issue by introducing active sensing. In this control scheme, an index referred to as the “collision risk level” is defined, and the observation range of each agent is actively controlled on this basis. Through simulations, we demonstrated that the proposed control scheme works reasonably while reducing unnecessary sensing and calculations.

1 Introduction

Recently, labor shortages in the transportation industry have become a serious problem because of declining birth rates and an aging population. Although the transportation industry is an important economic sector, workers experience physical fatigue caused by long working hours. Moreover, recent developments in information and communication technology have increased the workload. Therefore, the development of control schemes for autonomous mobile systems such as automated transport robots in logistics warehouses and flying drones is necessary. In these systems, each mobile agent must reach its destination while avoiding other agents. Thus, safe collision avoidance must be considered in the design of control schemes. In addition to safety, other factors can further improve the utility of these systems. Specifically, if the speed of each agent is maintained at a maximum with minimal acceleration and deceleration, that is, if quick and smooth movement is achieved, agents can reach their destinations quickly with low energy consumption.

Multi-robot systems have been studied from various aspects, such as formation control (Ali et al., 2018), collective transport (Jurt et al, 2022), and task allocation (Valentini et al., 2022). Collision avoidance in multi-robot systems has also been extensively studied, and several methods such as path planning using centralized (Solovey et al., 2016; Yu and LaValle, 2016; Tang et al., 2018) and decentralized (Bekris et al., 2012; Zhou et al., 2017) control, learning (Arai et al., 1997; Godoy et al., 2016; Chen et al., 2017; Long et al., 2018; Fan et al., 2020; Liang et al., 2020), and approaches based on velocity obstacles (Fiorini and Shiller, 1998; Snape et al., 2009, Snape et al., 2010; van der Berg et al., 2011a; van der Berg et al., 2011b; Hennes et al., 2012; Snape et al., 2014; Alonso-Mora et al., 2013; Claes and Tuyls, 2018) have been proposed. However, in these studies, the focus was primarily on avoiding collisions, and the quickness of the agents’ movement was not considered. Conversely, several studies have been conducted to achieve multi-objective tasks. For example, genetic algorithms (Nazarahari et al., 2019; Yang et al., 2019), swarm intelligence algorithms (Hidalgo-Paniagua et al., 2015, Hidalgo-Paniagua et al., 2017), the electrostatic field approach (Bayat et al., 2018), and path planning (Ahmed and Deb, 2011; Ravankar et al., 2016) have been used to achieve multiple objectives, such as reducing the path length and increasing the safety and smoothness of trajectories. Zhu et al. (Zhu et al., 2020) developed a control scheme for multi-robot collision avoidance using a vertical-ellipse-based velocity obstacle method integrated with a dynamic window approach (Fox et al., 1997). Using mobile robots, they demonstrated that multiple objectives (i.e., reducing time consumption and path length and increasing safety) can be achieved with suitable parameters. However, their study aimed to achieve quick movement and a smooth trajectory, and reducing the acceleration and deceleration of the agents was beyond its scope.

To address this issue, we previously developed a simple decentralized control scheme for multiple mobile agents by modifying the social force model (Helbing and Molnár, 1995), which is a model of pedestrian dynamics, such that the prediction of collisions between mobile agents can be considered (Kano et al., 2021). Through simulations, we demonstrated that this control scheme enables safe collision avoidance while maintaining speed and reducing acceleration and deceleration. However, each agent still had to predict the motion of all agents within the observation range, regardless of the risk of collision (Figure 1A). Thus, unnecessary sensing and calculation were required, and this scheme was unreasonable considering hardware implementation.

FIGURE 1

FIGURE 1. Basic concept of (A) our previous control scheme and (B) the control scheme proposed in this study. (A) In our previous scheme, future motions of agents within a circular range of vision are predicted regardless of the risk of collision (both red and green agents). (B) In the control scheme proposed in this study, the range of vision is fan-shaped, and its direction is actively changed. Future motions of agents with a high risk of collision (red agent) are predicted, whereas those of the other agents (transparent green agents) are ignored.

This paper proposes a control scheme for mobile agents capable of enabling quick, smooth, and safe movement with limited sensing capability for each agent. For this purpose, we drew inspiration from the active sensing of living organisms, such as the echo location of bats (Yamada et al., 2020). Active sensing is an effective way to reduce the waste associated with sensing costs because important sensory information is actively brought into focus and useless information is neglected. We developed a simple control scheme for the active sensing and motion of each agent (Figure 1B). Subsequently, we investigated whether the proposed control scheme worked effectively through simulations. Moreover, we quantitatively evaluated the speed, smoothness, and safety and compared the proposed control scheme with our previous scheme (Kano et al., 2021).

The remainder of this paper is organized as follows. In Section 2, we briefly review our previous model (Kano et al., 2021) and propose a mathematical model incorporating active sensing. In Section 3, we present several simulation results and demonstrate that the proposed control scheme works satisfactorily. Finally, the conclusions and future work are presented in Section 4.

2 Problem statement

We assume that N agents are on a two-dimensional (2D) plane with a periodic boundary condition. The position of the agent i is denoted by r_i. Although the unit vector e_i, which represents the direction in which agent i intends to move, can be varied temporally in a real scenario, it was kept constant in this study for simplicity. The direction of e_i is determined randomly, that is, each agent aims to move in a different direction. The target speed v₀, that is, the speed in the absence of other agents, is common to all agents. Here, we do not consider the cases in which obstacles exist in the plane.

Our objective was to realize collision avoidance while maintaining speed and smoothness with low sensing and calculation costs. Specifically, we propose a decentralized control scheme for each mobile agent in which the following indices are kept as small as possible while reducing sensing and calculation costs:

E_{1} = 1 - \frac{\int_{0}^{T} \sum_{i = 1}^{N} ({\dot{r}}_{i} \cdot e_{i})}{v_{0} N T}, (1)

E_{2} = \frac{\int_{0}^{T} \sum_{i = 1}^{N} | {\ddot{r}}_{i} |^{2}}{n N T}, (2)

E_{3} = \frac{\int_{0}^{T} \sum_{i = 1}^{N} | f_{i j}^{phys} |}{n N T}, (3)

where T denotes the total time of the simulation experiment and $f_{i j}^{phys}$ denotes the physical force vector when agents i and j collide (Eq. 15). Indices E₁, E₂, and E₃ are small when the agents move quickly, smoothly, and safely, respectively.

In our previous study (Kano et al., 2021), we succeeded in realizing quick, smooth, and safe movement. In other words, we succeeded in reducing the aforementioned indices relative to those obtained with previously proposed models (Helbing and Molnár, 1995; Zanlungo et al., 2011). However, unnecessary sensing and calculations were required in our previous scheme. Herein, we propose a decentralized control scheme that uses active sensing, which is an effective way to reduce the waste associated with sensing costs. Because the sensing cost is reduced, performance deterioration (i.e.,, an increase in E₁, E₂, and E₃), is inevitable. However, we attempted to reduce this deterioration to the maximum possible extent.

3 Model

In this section, we briefly review our previous model (Kano et al., 2021) (Section 3.1) and propose a mathematical model that incorporates active sensing (Section 3.2).

3.1 Outline of our previous model (Kano et al., 2021)

We previously modified the social force model (Helbing and Molnár, 1995), a model of pedestrian dynamics, and proposed a mathematical model of mobile agents, in which each agent was controlled such that it could move quickly and smoothly while avoiding other agents (Kano et al., 2021). In the proposed model, the social force term in the social force model, which originates from psychological effects, was changed to the control input to mobile agents, and it was designed according to the following steps:

Step (i): Each agent detects the relative position and velocity of agents within a distance R from itself at every constant time interval.

Step (ii): Each agent predicts the future motion of nearby agents for duration T by numerically solving a differential equation in which the social force term is omitted from the social force model, with the initial position and velocity of the agents set to be the same as those obtained in Step (i) (Figure 2).

Step (iii): Based on the numerical calculation performed in Step (ii), the expected time until the distance between agents i and j has a minimum value or until they collide, τ_ij, is derived (Figure 2). Furthermore, the expected distance between agents i and j after the time interval τ_ij, $\hat{d} (τ_{i j})$ , is derived.

Step (iv): The control input is calculated using the values of τ_ij and $\hat{d} (τ_{i j})$ obtained in Step (iii). The equation that determines the control input is derived by modifying the social force term in the social force model (Helbing and Molnár, 1995). It consists of the exclusive volume effect and the avoidance force based on the prediction of future motion, the latter of which increases as τ_ij and $\hat{d} (τ_{i j})$ decrease.

FIGURE 2

FIGURE 2. Schematic of the previous model (Kano et al., 2021). (A) Based on the present position and velocity of the cars (blue filled circles and arrows, respectively), their future trajectories are estimated by numerically solving a differential equation [Step (ii)] (blue, red, and green dashed circles). (B) Schematic of the time evolution of the distance between cars i and j for the situation shown in (A).

In our previous study (Kano et al., 2021), we demonstrated through simulations that agents could move quickly and smoothly while avoiding each other. However, the calculation cost for each agent is high; each agent must calculate the future motion of all agents within a distance R from itself by solving a differential equation [Step (ii)]. A reduction in the calculation cost is required to consider the hardware implementation.

3.2 Proposed model

Herein, we propose a mathematical model that incorporates a decentralized control scheme for autonomous mobile agents to achieve quick, smooth, and safe movement with low sensing and calculation costs. For this purpose, an index referred to as the “collision risk level” is introduced for each pair of agents, and each agent is controlled such that it ignores agents whose collision risk level is low. The observation range of each agent is constrained to a fan shape, and its direction is actively controlled to focus on agents whose collision risk is high (Figure 1). Active sensing was used to identify high-risk agents. Furthermore, instead of solving the differential equation in Step (ii) of the previous model, the focused agents were predicted to move at a constant velocity; thus, each agent does not need to know the directions in which other nearby agents intend to move.

Each agent has a circular shape with radius r and can move omnidirectionally on a 2D plane. Agents have their own destinations, and their task is to reach these destinations without losing velocity or smoothness, while avoiding other agents. It is assumed that each agent can detect the relative position and velocity of other agents within its fan-shaped observation range.

The control algorithm for each agent consists of the following four steps (Figure 3).

Step 1: An agent detects other agents within its observation range and calculates the collision risk level.

Step 2: The agent controls the direction of the fan-shaped observation range to focus on agents whose collision risk is high, that is, it performs active sensing.

Step 3: The agent predicts the future motion of agents whose collision risk is high and calculates the avoidance force.

Step 4: The agent moves based on the calculated avoidance force.

FIGURE 3

FIGURE 3. Flow of the algorithm for the proposed control.

In the following section, each step is described in detail.

3.2.1 Calculation of collision risk level (Step 1)

For every time interval, agent i calculates the collision risk level between agents i and j, defined as K_ij, according to the following equation:

K_{i j} (t) = \{\begin{cases} α {\dot{φ}}_{i j} (t_{i j}^{obs}) & (t = t_{i j}^{obs}), \\ β^{t - t_{i j}^{obs}} K_{i j} (t_{i j}^{obs}) & (t > t_{i j}^{obs}), \end{cases} (4)

φ_{i j} (t_{i j}^{obs}) = 2 \arctan \frac{r}{d_{i j} (t_{i j}^{obs})}, (5)

where α is a positive constant, β is a constant that satisfies 0 < β < 1, $t_{i j}^{obs}$ is the latest time when agent i observes agent j, and φ_ij is the range of vision of agent j as viewed from agent i (Figure 4). The meaning of Eq. 4 is as follows: when agent j is within the observation range of agent i, the collision risk level K_ij is defined as the change rate of the range of vision ${\dot{φ}}_{i j}$ . Thus, K_ij increases when agent j approaches agent i. When agent j is outside the observation range of agent i, it is difficult to determine the extent to which agent j approaches agent i. Therefore, we assume that K_ij decreases exponentially (Figure 5). When K_ij is above the threshold K_th, agent i recognizes that agent j has collision risk.

FIGURE 4

FIGURE 4. Definitions of $d_{i j} (t_{i j}^{obs})$ and $φ_{i j} (t_{i j}^{obs})$ . Here, $t_{i j}^{obs}$ is the time when agent j is detected by agent i for the latest time.

FIGURE 5

FIGURE 5. Time evolution of collision risk level K_ij.

3.2.2 Determination of the observation range (Step 2)

The direction of the fan-shaped observation range with respect to the direction of the movement of agent i is defined as θ_i ( − π ≤ θ_i < π) (Figure 6). Angle θ_i is updated at every time interval Δt_view based on a probabilistic distribution function p(θ_i, t), which is given by.

f (θ_{i}, t) = ε - γ | θ_{i} - θ_{i, g o a l} | + \sum_{j \in K_{i j} \geq K_{t h}} K_{i j} (t) \max \{(w_{1} - w_{2} | θ_{i} - {\tilde{θ}}_{i j} (t) |), 0\}, (6)

p (θ_{i}, t) = \frac{f (θ_{i}, t)}{\int_{- π}^{π} f (θ_{i}, t) d θ_{i}}, (7)

where ɛ, γ, w₁, and w₂ are positive constants and θ_i,goal denotes the direction of the destination with respect to the moving direction of agent i. ${\tilde{θ}}_{i j} (t)$ denotes the estimated direction of the location of agent j with respect to the direction of movement of agent i, which is described as

{\tilde{θ}}_{i j} (t) = \arccos \frac{{\dot{r}}_{i} (t) \cdot {\tilde{r}}_{i j} (t)}{| {\dot{r}}_{i} (t) ‖ {\tilde{r}}_{i j} (t) |}, (8)

where r_i denotes the position of agent i and ${\tilde{r}}_{i j} (t)$ is the estimated relative position vector of agent j with respect to agent i at time t. The derivation of ${\tilde{r}}_{i j} (t)$ is given by Eq. 9.

FIGURE 6

FIGURE 6. Observation range and definitions of R, θ_view, and θ_i.

Here, p(θ_i, t) is the normalized function of f(θ_i, t) and f(θ_i, t) consists of three terms (Figure 7). The first term on the right side of Eq. 6 indicates that agent i pays equal attention in all directions. The second term indicates that agent i pays more attention to the destination direction. The third term indicates that agent i focuses on agents with collision risk levels. Here, the max function is introduced such that the probability distribution is selectively increased in the direction of agents whose collision risk levels are high. Thus, active sensing focuses on agents expected to collide in the near future.

FIGURE 7

FIGURE 7. Probability distribution of the direction of the observation range.

3.2.3 Calculation of avoidance force (Step 3)

The avoidance force of the agent is calculated for agents whose collision risk level is above threshold K_th. We explain how agent i generates an avoidance force from agent j when K_ij ≥ K_th. The basic idea is that the avoidance force vector of agent i is generated in the direction opposite to that of agent j when agent j approaches the closest point (Figure 8A), and that its magnitude is proportional to K_ij. The closest point was estimated by assuming that agent j moved at a constant velocity after it was observed by agent i. However, when agent j has already passed through the closest point, it is expected to move away from agent i in the near future. Therefore, in this case, the avoidance force vector of agent i is generated in the direction opposite to the current estimated position of agent j (Figure 8B).

FIGURE 8

FIGURE 8. (A) Prediction of the future position of mobile agents that have a risk of collision in the case where t ≤ τ_ij.(B) Prediction of the future position of mobile agents that have a risk of collision in the case where t > τ_ij.

This concept is mathematically described as follows:

{\tilde{r}}_{i j} (t) = {\dot{r}}_{i j} (t_{i j}^{obs}) (t - t_{i j}^{obs}) + r_{i j} (t_{i j}^{obs}), (9)

{\tilde{r}}_{i j} (τ_{i j}) \cdot {\dot{r}}_{i j} (t_{i j}^{obs}) = 0, (10)

{\tilde{r}}_{i j} (τ_{i j}) = {\dot{r}}_{i j} (t_{i j}^{obs}) (τ_{i j} - t_{i j}^{obs}) + r_{i j} (t_{i j}^{obs}), (11)

where $r_{i j} (t_{i j}^{obs})$ is the relative position vector of agent j with respect to agent i when agent i observes agent j and ${\tilde{r}}_{i j} (t)$ is the estimated relative position vector of agent j with respect to agent i at time t. Time τ_ij denotes the time at which agent j is expected to reach the closest point.

The avoidance force vector $f_{i j}^{avo} (t)$ is given by

f_{i j}^{avo} (t) = - C K_{i j} (t) {\hat{n}}_{i j} (t), (12)

where C is a positive constant and the unit vector ${\hat{n}}_{i j} (t)$ is defined as

{\hat{n}}_{i j} (t) = \{\begin{cases} \frac{{\tilde{r}}_{i j} (τ_{i j})}{| {\tilde{r}}_{i j} (τ_{i j}) |} & (t \leq τ_{i j}), \\ \frac{{\tilde{r}}_{i j} (t)}{| {\tilde{r}}_{i j} (t) |} & (t > τ_{i j}) . \end{cases} (13)

Here, agent j approaches agent i when t ≤ τ_ij (Figure 8A) while agent j has already passed the closest point and moves away from agent i in the case of t > τ_ij (Figure 8B).

3.2.4 Equation of motion (Step 4)

By modifying the social force model (Helbing and Molnár, 1995) and using the avoidance force derived in Step 3, the equation of motion for agent i can be expressed as

m {\ddot{r}}_{i} = a (v_{0} e_{i} - {\dot{r}}_{i}) + \sum_{j} f_{i j}^{phy} + \sum_{j \in K_{i j} \geq K_{t h}} f_{i j}^{soc}, (14)

where m denotes the mass of the agent; e_i denotes a unit vector representing the direction in which agent i intends to move; v₀ denotes the target speed; and a denotes a positive constant. The first term on the right side indicates that the velocity of the agent approaches the desired velocity v₀e_i. The second term denotes the physical interaction between the agents when they make contact with each other, and is described in the same manner as the social force model (Helbing and Molnár, 1995) as

f_{i j}^{phys} = - p θ (2 r - d_{i j}) n_{i j} + q θ (2 r - d_{i j}) \{({\dot{x}}_{j} - {\dot{x}}_{i}) \cdot t_{i j}\} t_{i j}, (15)

where p and q are positive constants, n_ij = (r_j − r_i)/|r_j − r_i|, t_ij is a unit vector perpendicular to n_ij, d_ij = |r_j − r_i|, and θ(x) = x for x > 0 and θ(x) = 0 for x ≤ 0.

The third term on the right side of Eq. 14 denotes the control input to the agent and $f_{i j}^{soc}$ is expressed as

f_{i j}^{soc} = - A \exp (- \frac{{\tilde{d}}_{i j} - 2 r}{B}) {\tilde{n}}_{i j} + f_{i j}^{avo}, (16)

where A and B are positive constants, ${\tilde{d}}_{i j} = | {\tilde{r}}_{j} - r_{i} |$ , and ${\tilde{n}}_{i j} = ({\tilde{r}}_{j} - r_{i}) / | {\tilde{r}}_{j} - r_{i} |$ . The first term on the right side of Eq. 16 originates from the social force term in the social force model (Helbing and Molnár, 1995), and represents the repulsive force exerted by agent j. However, when agent j was not within the observation range, a repulsive force was generated using the estimated relative position (Eq. 9). The second term on the right-hand side denotes the avoidance force derived in Step 3.

It should be noted that the third term on the right side of Eq. 14 uses only the sensory information of agents whose collision risk level is above threshold K_th. Thus, agents with a collision risk below K_th are ignored. Moreover, the avoidance force $f_{i j}^{avo}$ is derived by simply assuming that agent j is expected to move at a constant velocity after it is finally observed by agent i (Section 3.2.3). The future motion of a nearby agent is calculated by solving the differential equation in our previous model [Step (ii) in Section 3.1]. Thus, the calculation cost for each agent in the proposed model is significantly lower than that in the previous model (Kano et al., 2021).

4 Simulation

Simulations were conducted using the settings described in Section 2 to evaluate the performance of the proposed control scheme. The differential equations are discretized with time step δt. Because it was difficult to evaluate all the parameters, we first explored the parameters that enabled quick, smooth, and safe movement by hand tuning. We then changed several parameters that would affect the performance, whereas the other parameters were fixed. Unless otherwise noted, the parameter values listed in Table 1 are used.

TABLE 1

TABLE 1. Parameter values used in the simulation.

The performance indices E₁, E₂, and E₃ were calculated by discretizing Eqs. 1–3:

E_{1} = 1 - \frac{\sum_{k = 1}^{n} \sum_{i = 1}^{N} (v_{i} (k) \cdot e_{i})}{v_{0} n N}, (17)

E_{2} = \frac{\sum_{k = 1}^{n} \sum_{i = 1}^{N} | a_{i} {(k)}^{2} |}{n N}, (18)

E_{3} = \frac{\sum_{k = 1}^{n} \sum_{i = 1}^{N} | f_{i j}^{phys} (k) |}{n N}, (19)

where k denotes the time step, n the maximum time step, and N the number of agents. The velocities v_i(k) and accelerations a_i(k) are given by v_i(k) = (r_i(k + 1) − r_i(k))/δt and a_i(k) = (v_i(k + 1) − v_i(k))/δt, respectively.

In Section 4.1, we demonstrate that the proposed model performs reasonably well when the parameter values are appropriate. In Section 4.2, we change several key parameters for active sensing and quantitatively evaluate their dependencies to understand the properties of the proposed model. In Section 4.3, the performance of the proposed model is quantitatively compared with that of the previous model.

4.1 Representative results

First, we perform a simulation with N = 2. The video and snapshots are shown in Supplementary Movie S1 and Figure 9, respectively. In this section, we focus on one of the agents, and explain how they move. First, the agent in focus did not observe any other agents (Figure 9A). When the other agent entered the observation range, the collision risk level increased, and the agent was recognized as having a high risk of collision. Subsequently, the probability distribution of the center angle of the fan-shaped observation range changed, and the agent tended to focus on the other agent (Figure 9B). Consequently, the agent succeeded in avoiding the other agent, and the collision risk level decreased (Figures 9C, D). This result suggests that the proposed control algorithm performed well.

FIGURE 9

FIGURE 9. Snapshot of the simulation when N = 2. The fan-shaped observation range is shown for one of the agents. Two agents are connected by a line when K_ij is above the threshold K_th. The probability distribution of the observation direction (Eqs. 6,7) is indicated by the color of each agent. Red and green denote the directions where the probability distributions are high and low, respectively.

Next, we perform a simulation using N = 20 (Supplementary Movie S2). In this case, each agent found other agents with a high collision risk level and tended to focus on them. Subsequently, each agent successfully avoided the other agents.

4.2 Parameter dependence

To understand the properties of the proposed model, we performed simulations by changing the radius R and central angle θ_view of the fan-shaped observation range, time interval for updating the fan-shaped observation range Δt_view, and threshold of the collision risk level K_th. For each simulation, the speed, smoothness, and safety were evaluated quantitatively using E₁, E₂, and E₃, respectively.

Figure 10 shows the results when the radius of the observation range R is changed. Supplementary Movies S3, S4 show the cases with R = 10 and 80 m, respectively. We find that E₁ increases, whereas E₂ and E₃ decrease as R increases. For a small R, agents cannot find or avoid other agents until they are in proximity to them. Thus, each agent tends to move straight at a high speed until it finds the other agents. When it finds nearby agents with a high risk of collision, it abruptly attempts to avoid other agents but sometimes collides with them.

FIGURE 10

FIGURE 10. Result when the radius of the observation range R is varied. The average over five trials is shown. The error bars denote the standard deviation.

Figure 11 shows the results when the central angle θ_view of the observation range was changed. Supplementary Movies S5, S6 show the cases where θ_view = π/6 and 3π/2, respectively. We found that E₁ increased, whereas E₂ and E₃ decreased as θ_view increased. It is difficult to identify and avoid agents with a high risk of collision for a small θ_view. Thus, the dependence of θ_view was similar to that of R.

FIGURE 11

FIGURE 11. Result when the central angle of the observation range θ_view is varied. The average over five trials is shown. The error bars denote the standard deviation.

Figure 12 shows the results when the time interval for updating the fan-shaped observation range, Δt_view was changed. Supplementary Movies S7, S8 show the cases where Δt_view = 0.225 s and 0.675 s, respectively. We found that E₁ did not depend considerably on Δt_view, whereas E₂ and E₃ increased as Δt_view increased. For a large Δt_view, it is difficult to find and avoid other agents; thus, each agent sometimes needs to avoid other agents abruptly but fails to avoid them, and they collide with each other.

FIGURE 12

FIGURE 12. Result when the time interval for the update of the fan-shaped observation range Δt_view is varied. The average over five trials is shown. The error bars denote the standard deviation.

Figure 13 shows the results when the threshold of the collision risk level K_th is changed. Supplementary Movies S9, S10 show the cases in which K_th = 0.0001 and 0.001, respectively. Although E₂ does not depend considerably on K_th, E₁ decreases, and E₃ increases as K_th increases. For a small K_th (Supplementary Movie S9), the agents tend to focus on many nearby agents; thus, they can avoid collisions when speed is lost. In contrast, for a large K_th (Supplementary Movie S10), the agents focus only on a few nearby agents; thus, they can move at high speeds, although collisions cannot be avoided. Moreover, as K_th increases, the sensing and calculation costs decrease because many nearby agents are ignored.

FIGURE 13

FIGURE 13. Result when the threshold of collision risk K_th is varied. The average over five trials is shown. The error bars denote the standard deviation.

4.3 Comparison with the previous model (Kano et al., 2021)

The performance of the proposed model was compared with that of a previous model (Kano et al., 2021). In this experiment, N = 50, θ_view = π, Δt_view = 0.375 s, and the parameters A and C, which determine the magnitude of the repulsive force from other agents and the avoidance force based on the prediction of the future motion of other agents, respectively, were varied. The results of the previous and proposed models are shown in Figure 14 and Figure 15, respectively. Because the sensing and calculation costs of the proposed model are considerably lower than those of the previous model (Kano et al., 2021), the performance of the proposed model is inevitably worse than that of the previous model. However, when A = 4.00 × 10³[m kg s⁻²] and C = 5.00 × 10⁵[m kg s⁻²], E₁ and E₂ of the proposed model are approximately 1.9 times and E₃ of the proposed model is 2.3 times larger than that of the previous model. Thus, when A and C are properly chosen, the deterioration in performance is not severe, even though the sensing and calculation costs are considerably reduced.

FIGURE 14

FIGURE 14. Color map of (A) E₁, (B) E₂, and (C) E₃ in the previous study when N = 50. The color indicates the values of the indices (E₁, E₂ and E₃) when the parameters A and C, which determine the magnitude of the repulsive force from other agents and the avoidance force based on the prediction of the future motion of other agents, respectively [see Eq. 5 in (Kano et al., 2021)], were varied. The white areas indicate that the value exceeded the range shown in the color map.

FIGURE 15

FIGURE 15. Color map of (A) E₁, (B) E₂, and (C) E₃ for the proposed model when N = 50. The color indicates the values of the indices (E₁, E₂ and E₃) when the parameters A and C, which determine the magnitude of the repulsive force from other agents and the avoidance force based on the prediction of the future motion of other agents, respectively (Eqs. 12, 16) were varied. White areas indicate that the value exceeded the range shown in the color map.

5 Discussion

We proposed a decentralized control scheme for mobile agents that enables quick, smooth, and safe movement with limited sensing capability of each agent by introducing active sensing. To the best of our knowledge, this is the first study in which a control scheme based on active sensing was designed to achieve smooth collision avoidance in multi-robot systems. Through simulations, we confirmed that the proposed model performed reasonably well. Moreover, when the parameters were selected appropriately, the performance deterioration compared with that observed in a previous study (Kano et al., 2021) was not as severe, even though the amount of sensing was considerably reduced. Because the calculation cost for each agent is low, we believe that the proposed model can be used for various applications, such as those involving robots in warehouses and drones.

An advantage of the proposed control scheme is that its calculation cost is considerably lower than that of our previous scheme (Kano et al., 2021). Although deriving the decrease in the calculation cost is difficult, we can approximately estimate that the number of agents on which each agent focuses is approximately or less than θ_view/(2π). Furthermore, because each agent focuses only on agents whose collision risk is above threshold K_th, the calculation cost decreases as K_th increases. Another advantage of the proposed control scheme is that each agent does not need to know the directions in which nearby agents intend to move, (i.e., the unit vector e_i of the nearby agent i). In our previous control scheme, however, it was necessary for each agent to know this information.

However, several problems remain unresolved.

• Limitation of the control scheme: We found that the proposed control algorithm did not perform well when the density of the agents was large. This is likely because the prediction of the agents’ motions was oversimplified. We intend to further improve the control scheme such that its performance is improved without increasing the calculation cost.

• Technical problem: It is technically difficult to instantly change the direction of the fan-shaped observation range because mobile robots usually have a lidar sensor that is statically linked to the robot. Thus, the proposed model remains somewhat unrealistic. These problems should be addressed in future hardware realizations.

• Further extensions: We intend to extend the proposed control scheme to the three-dimensional case and the case where obstacles exist.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Author contributions

TK (1st author) designed the study. All authors proposed the mathematical model. TK (2nd author) performed simulation experiments. TK (1st author) wrote the initial draft of the manuscript. All authors contributed to the manuscript revision.

Funding

This work was supported in part by a research grant from the Futaba Foundation and the WISE Program for AI Electronics, Tohoku University.

Acknowledgments

The authors thank Daishin Ueyama of Musashino University, Mayuko Iwamoto of Doshisha University, and Yasufumi Yamada of Hiroshima University for their helpful suggestions. The authors would like to thank Masahiro Shimizu of Osaka University for providing a part of the source code.

Conflict of interest

The authors declare that this research was conducted in the absence of any commercial or financial relationships that could be construed as potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frobt.2022.992716/full#supplementary-material

Supplementary Movie S1 | Simulation when N = 2.

Supplementary Movie S2 | Simulation when N = 20

Supplementary Movie S3 | Simulation when R = 10.

Supplementary Movie S4 | Simulation when R = 80

Supplementary Movie S5 | Simulation when θ_view = π/6.

Supplementary Movie S6 | Simulation when θ_view = 3π/2.

Supplementary Movie S7 | Simulation when Δt_view = 0.225 s.

Supplementary Movie S8 | Simulation when Δt_view = 0.675 s.

Supplementary Movie S9 | Simulation when K_th = 0.0001.

Supplementary Movie S10 | Simulation when K_th = 0.001.

References

Ahmed, F., and Deb, K. (2011). “Multi-objective path planning using spline representation,” in Proc. IEEE Int. Conf. Rob. Biomim, 1047–1052.

CrossRef Full Text | Google Scholar

Alonso-Mora, J., Breitenmoser, A., Rufli, M., Beardsley, P., and Siegwart, R. (2013). “Smooth coordination and navigation for multiple differential-drive robots,” in Distributed autonomous robotic systems. Springer tracts in advanced Robotics (Berlin, Heidelberg: Springer), 83.

Google Scholar

Arai, Y., Fujii, T., Asama, H., Kataoka, Y., Kaetsu, H., Matsumoto, A., et al. (1997). “Adaptive behavior acquisition of collision avoidance among multiple autonomous mobile robots,” in Proc. IEEE/RSJ Int. Conf. Intel. Rob. Sys., 1762–1767.

Google Scholar

Bayat, F., Najafinia, S., and Aliyari, M. (2018). Mobile robots path planning: Electrostatic potential field approach. Expert Syst. Appl. 100, 68–78. doi:10.1016/j.eswa.2018.01.050

CrossRef Full Text | Google Scholar

Bekris, K. E., Grady, D. K., Moll, M., and Kavraki, L. E. (2012). Safe distributed motion coordination for second-order systems with different planning cycles. Int. J. Rob. Res. 31, 129–150. doi:10.1177/0278364911430420

CrossRef Full Text | Google Scholar

Chen, Y., Liu, M., Everett, M., and How, J. P. (2017). “Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning,” in Proc. IEEE Int. Conf. Rob. Autom, 285–292.

CrossRef Full Text | Google Scholar

Claes, D., and Tuyls, K. (2018). Multi robot collision avoidance in a shared workspace. Auton. Robots 42, 1749–1770. doi:10.1007/s10514-018-9726-5

CrossRef Full Text | Google Scholar

Fan, T., Long, P., Liu, W., and Pan, J. (2020). Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios. Int. J. Rob. Res. 39, 856–892. doi:10.1177/0278364920916531

CrossRef Full Text | Google Scholar

Fiorini, P., and Shiller, Z. (1998). Motion planning in dynamic environments using velocity obstacles. Int. J. Rob. Res. 17, 760–772. doi:10.1177/027836499801700706

CrossRef Full Text | Google Scholar

Fox, D., Burgard, W., and Thrun, S. (1997). The dynamic window approach to collision avoidance. IEEE Robot. Autom. Mag. 4, 23–33. doi:10.1109/100.580977

CrossRef Full Text | Google Scholar

Godoy, J., Karamouzas, I., Guy, S. J., and Gini, M. (2016). “Implicit coordination in crowded multi-agent navigation,” in Proc. AAAI Conf. Artif. Intel, 2487–2493.

CrossRef Full Text | Google Scholar

Helbing, D., and Molnár, P. (1995). Social force model for pedestrian dynamics. Phys. Rev. E 51, 4282–4286. doi:10.1103/physreve.51.4282

PubMed Abstract | CrossRef Full Text | Google Scholar

Hennes, D., Claes, D., Meeussen, W., and Tuyls, K. (2012). “Multi-robot collision avoidance with localization uncertainty,” in Proc. Int. Conf. Auton. Agent. Multi. Sys., 147–154.

Google Scholar

Hidalgo-Paniagua, A., Vega-Rodríguez, M. A., Ferruz, J., and Pavón, N. (2015). Mosfla-mrpp: Multi-objective shuffled frog-leaping algorithm applied to mobile robot path planning. Eng. Appl. Artif. Intell. 44, 123–136. doi:10.1016/j.engappai.2015.05.011

CrossRef Full Text | Google Scholar

Hidalgo-Paniagua, A., Vega-Rodríguez, M. A., Ferruz, J., and Pavón, N. (2017). Solving the multi-objective path planning problem in mobile robotics with a firefly-based approach. Soft Comput. 21, 949–964. doi:10.1007/s00500-015-1825-z

CrossRef Full Text | Google Scholar

Kano, T., Iwamoto, M., and Ueyama, D. (2021). Decentralised control of multiple mobile agents for quick, smooth, and safe movement. Phys. A Stat. Mech. its Appl. 572, 125898. doi:10.1016/j.physa.2021.125898

CrossRef Full Text | Google Scholar

Liang, J., Patel, U., Sathyamoorthy, A. J., and Manocha, D. (2020). “Realtime collision avoidance for mobile robots in dense crowds using implicit multi-sensor fusion and deep reinforcement learning,” in Proc. Int. Conf. Auton. Agent. Multi. Sys, 03089. arXiv:2004.

Google Scholar

Long, P., Fan, T., Liao, X., Liu, W., Zhang, H., and Pan, J. (2018). “Towards optimally decentralized multi-robot collision avoidance via deep reinforcement learning,” in Proc. IEEE Int. Conf. Rob. Autom, 6252–6259.

CrossRef Full Text | Google Scholar

Nazarahari, M., Khanmirza, E., and Doostie, S. (2019). Multi-objective multi-robot path planning in continuous environment using an enhanced genetic algorithm. Expert Syst. Appl. 115, 106–120. doi:10.1016/j.eswa.2018.08.008

CrossRef Full Text | Google Scholar

Ravankar, A., Ravankar, A. A., Kobayashi, Y., and Emaru, T. (2016). Shp: Smooth hypocycloidal paths with collision-free and decoupled multi-robot path planning. Int. J. Adv. Robot. Syst. 13, 133. doi:10.5772/63458

CrossRef Full Text | Google Scholar

Snape, J., Guy, S. J., van der Berg, J., and Manocha, D. (2014). “Smooth coordination and navigation for multiple differential-drive robots,” in Experimental Robotics. Springer tracts in advanced Robotics (Berlin, Heiderberg: Springer), 79.

CrossRef Full Text | Google Scholar

Snape, J., van der Berg, J., Guy, S. J., and Manocha, D. (2009). “Independent navigation of multiple mobile robots with hybrid reciprocal velocity obstacles,” In Proc. IEEE/RSJ Int. Conf. Intel. Rob. Sys, 5917–5922.

CrossRef Full Text | Google Scholar

Snape, J., van der Berg, J., Guy, S. J., and Manocha, D. (2010). “Smooth and collision-free navigation for multiple robots under differential-drive constraints,” In Proc. IEEE/RSJ Int. Conf. Intel. Rob. Sys, 4584–4589.

CrossRef Full Text | Google Scholar

Solovey, K., Salzman, O., and Halperin, D. (2016). Finding a needle in an exponential haystack: Discrete rrt for exploration of implicit roadmaps in multi-robot motion planning. Int. J. Robotics Res. 35, 501–513. doi:10.1177/0278364915615688

CrossRef Full Text | Google Scholar

Tang, S., Thomas, J., and Kumar, V. (2018). Hold or take optimal plan (hoop): A quadratic programming approach to multi-robot trajectory generation. Int. J. Robotics Res. 37, 1062–1084. doi:10.1177/0278364917741532

CrossRef Full Text | Google Scholar

van der Berg, J., Guy, S. J., Lin, M., and Manocha, D. (2011a). “Reciprocal n-body collision avoidance,” in Robotics research. Springer tracts in advanced Robotics (Berlin, Heidelberg: Springer), 70.

Google Scholar

van der Berg, J., Snape, J., Guy, S. J., and Manocha, D. (2011b). “Reciprocal collision avoidance with acceleration-velocity obstacles,” in Proc. IEEE Int. Conf. Rob. Autom, 3475–3482.

Google Scholar

Yamada, Y., Mibe, Y., Yamamoto, Y., Ito, K., Heim, O., and Hiryu, S. (2020). Modulation of acoustic navigation behaviour by spatial learning in the echolocating bat rhinolophus ferrumequinum nippon. Sci. Rep. 10, 10751. doi:10.1038/s41598-020-67470-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, C., Zhang, T., Pan, X., and Hu, M. (2019). “Multi-objective mobile robot path planning algorithm based on adaptive genetic algorithm,” in Proc. Chin. Cont. Conf, 4460–4466.

CrossRef Full Text | Google Scholar

Yu, J., and LaValle, S. M. (2016). Optimal multirobot path planning on graphs: Complete algorithms and effective heuristics. IEEE Trans. Robot. 32, 1163–1177. doi:10.1109/tro.2016.2593448

CrossRef Full Text | Google Scholar

Zhou, D., Wang, Z., Bandyopadhyay, S., and Schwager, M. (2017). Fast, on-line collision avoidance for dynamic vehicles using buffered voronoi cells. IEEE Robot. Autom. Lett. 2, 1047–1054. doi:10.1109/lra.2017.2656241

CrossRef Full Text | Google Scholar

Zhu, X., Yi, J., Ding, H., and He, L. (2020). Velocity obstacle based on vertical ellipse for multi-robot collision avoidance. J. Intell. Robot. Syst. 99, 183–208. doi:10.1007/s10846-019-01127-6

CrossRef Full Text | Google Scholar

Keywords: collision avoidance, active sensing, decentralized control, mobile agents, autonomous systems

Citation: Kano T, Kanno T, Mikami T and Ishiguro A (2022) Active-sensing-based decentralized control of autonomous mobile agents for quick and smooth collision avoidance. Front. Robot. AI 9:992716. doi: 10.3389/frobt.2022.992716

Received: 12 July 2022; Accepted: 24 October 2022;
Published: 11 November 2022.

Edited by:

Zain Anwar Ali, Sir Syed University of Engineering and Technology, Pakistan

Reviewed by:

Yuqing Chen, Dalian Maritime University, China
Muhammad Shafiq, Sir Syed University of Engineering and Technology, Pakistan

Copyright © 2022 Kano, Kanno, Mikami and Ishiguro. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Takeshi Kano, dGthbm9AcmllYy50b2hva3UuYWMuanA=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.