Active fault-tolerant anti-input saturation control of a cross-domain robot based on a human decision search algorithm and RBFNN

Wang, Ke; Liu, Yong; Huang, Chengwei

doi:10.3389/fnbot.2023.1219170

ORIGINAL RESEARCH article

Front. Neurorobot., 14 July 2023

Volume 17 - 2023 | https://doi.org/10.3389/fnbot.2023.1219170

This article is part of the Research TopicDynamic Neural Networks for Robot Systems: Data-Driven and Model-Based ApplicationsView all 22 articles

Active fault-tolerant anti-input saturation control of a cross-domain robot based on a human decision search algorithm and RBFNN

Ke Wang

Yong Liu^*

Chengwei Huang

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China

This article presents a cross-domain robot (CDR) that experiences drive efficiency degradation when operating on water surfaces, similar to drive faults. Moreover, the CDR mathematical model has uncertain parameters and non-negligible water resistance. To solve these problems, a radial basis function neural network (RBFNN)-based active fault-tolerant control (AFTC) algorithm is proposed for the robot both on land and water surfaces. The proposed algorithm consists of a fast non-singular terminal sliding mode controller (NTSMC) and an RBFNN. The RBFNN is used to estimate the impact of drive faults, water resistance, and model parameter uncertainty on the robot and the output value compensates the controller. Additionally, an anti-input saturation control algorithm is designed to prevent driver saturation. To optimize the controller parameters, a human decision search algorithm (HDSA) is proposed, which mimics the decision-making process of a crowd. Simulation results demonstrate the effectiveness of the proposed control methods.

1. Introduction

In recent years, there has been a growing interest in multi-environment robots as single-environment robots are no longer sufficient to meet various practical needs (Cohen and Zarrouk, 2020). Researchers have proposed different designs to achieve this, such as bionic robots (Chen et al., 2021) and the legged amphibious robot (Xing et al., 2021). Furthermore, with the advancements in rotorcraft unmanned aerial vehicle (UAV) technology, researchers have started exploring the potential of integrating rotorcraft UAVs with wheeled mobile robots (WMRs) (Wang et al., 2019a). To enhance the capabilities of robots, cross-domain robots (CDRs) have been designed, which are capable of operating in multiple environments, including water, land, and air (Guo et al., 2019; Zhong et al., 2021). The robot presented in this paper is a CDR that combines a quadrotor UAV with a WMR equipped with webbed plates. These webbed plates on the wheels enable the robot to generate power at the water surface through their interaction with the water (Wang et al., 2022a,b).

The CDR presented in this study employs the same drive motors for ground and water surface operations. Assuming proper functionality during ground motion, a driver fault is considered to have occurred during the robot's operation on the water surface. Fault-tolerant controls (FTCs) are control algorithms that effectively deal with system faults (Najafi et al., 2022; Nan et al., 2022). Sliding-mode controllers (SMCs) are commonly employed in passive fault-tolerant algorithms due to their robustness in maintaining control performance when the maximum system fault is known. However, the use of non-singular terminal sliding mode control (NTSMC) and SMC results in jitter problems, and this robust control approach is considered too conservative (Ali et al., 2020; Hou and Ding, 2021; Guo et al., 2022). To address these issues, FTCs frequently employ adaptive sliding mode control (Wu et al., 2020) and integral sliding mode control (Yu et al., 2022). Additionally, observers are commonly used to detect drive faults. In Wang F. et al. (2022), a disturbance observer (DO) is used to quickly compensate and correct unknown actuator faults of unmanned surface vehicles (USVs). In the context of autonomous underwater vehicles (AUVs), a sliding mode observer-based fault-tolerant control algorithm has been proposed in the literature (Liu et al., 2018). However, the design of higher-order observers requires complex mathematical proofs and the adjustment of many parameters. Neural networks (NNs) are often used to estimate system model parameters and uncertainty terms due to their ability to approximate arbitrary non-linear functions. In Zhang et al. (2022), NNs are used to rectify the model parameters of a USV, and an NN-based adaptive observer is developed to estimate errors caused by drive faults. As demonstrated in Gao et al. (2022), NNs can directly estimate system faults by approximating the uncertainty terms in the system. Event-triggered fault-tolerant control is a type of AFTC algorithm that has the potential to reduce system hardware requirements. However, it requires the development of trigger thresholds and corresponding fault control algorithms, which increase the difficulty and complexity of controller design (Huang et al., 2019; Wu et al., 2021; Zhang et al., 2021). Another important consideration in the FTC algorithm is the control of input saturation. One efficient approach for solving this issue is to introduce virtual states in the controller. These virtual states regulate the input error of the controller, thereby suppressing control input saturation (Wang and Deng, 2019). Additionally, designing adaptive laws is an effective way to address control input saturation. In this approach, the adaptive control input decreases as the actual control input approaches the maximum physical constraint (Shen et al., 2018).

The controller design presented above does not involve any optimization of the controller parameters. To address this limitation, reinforcement learning techniques have been developed to optimize control parameters. In Gheisarnejad and Khooban (2020), a reinforcement learning algorithm is employed to optimize the PID controller parameters. Another study (Zhao et al., 2020) trains the optimal trajectory following controller using deep reinforcement learning. However, reinforcement learning algorithms typically require a significant amount of data and multiple iterations to achieve optimal results. Swarm intelligence (SI) optimization algorithms are a promising approach in practical applications, including data classification, path planning, and controller optimization (Xue and Shen, 2020, 2022). Among the various SI optimization algorithms, particle swarm optimization (PSO) is a classical algorithm known for fast convergence and few parameters (Song and Gu, 2004). However, traditional PSO algorithms tend to fall into local optima. Ant colony optimization (ACO) is another common SI optimization algorithm. ACO can jump out of local optima but has slower convergence (Dorigo et al., 1996). In addition, the gray wolf optimizer (GWO) simulates the predation process of wolves (Mirjalili et al., 2014) and the Harris hawk optimizer (HHO) simulates the predation process of hawks (Heidari et al., 2019). These algorithms have shown improvements in convergence speed and accuracy compared with other animal predation simulation algorithms. Other popular SI optimization algorithms include the firefly algorithm (Fister et al., 2013) and the sine/cosine search algorithm (Mirjalili, 2016). Each SI optimization algorithm has its own strengths and weaknesses and no single algorithm can effectively handle all optimization problems. The goal is to achieve satisfactory results in terms of convergence speed, accuracy, and robustness for a specific optimization problem.

Based on the previous discussion, an AFTC is proposed for the CDR on the ground and on the water surface. This control algorithm consists of three main parts:

a. To enhance the robustness of the robot control system, a fast NTSMC is designed based on the concept of passive FTC. Compared with traditional NTSMC and SMC, the proposed NTSMC has reduced control input chatter. Additionally, to reduce controller conservatism, an RBFNN is designed to detect and compensate for drive faults. The adaptive weight control law of the RBFNN is based on the Lyapunov function.

b. To prevent drive saturation, an anti-input saturation control algorithm based on the hyperbolic tangent (tanh) function is employed. An adaptive rate is designed to prevent singularities in this algorithm. This method does not require complex mathematical proofs and requires fewer tuning parameters.

c. A new SI optimization algorithm named HDSA is proposed for the optimization of the weight update rate parameter of RBFNNs. The proposed algorithm is compared with other SI optimization algorithms, and the test results demonstrate its faster convergence rate and higher accuracy.

2. Related work and mathematical models

2.1. HDSA's related work

To demonstrate the advantages of the proposed HDSA optimization algorithm, the results of the HDSA tests are shown in this section. The theory of HDSA is discussed in detail in the section entitled “RBFNN-Based Active Fault-Tolerant Control Algorithm”. The effectiveness of the proposed optimization algorithm was evaluated by comparing the test results of HDSA with other popular optimization algorithms, such as particle swarm optimization (PSO) (Song and Gu, 2004), the sine/cosine algorithm (SCA) (Mirjalili, 2016), the gray wolf optimizer (GWO) (Mirjalili et al., 2014), the firefly algorithm (FA) (Fister et al., 2013), and the Harris hawk optimizer (HHO) (Heidari et al., 2019). Twenty standard test functions were used for evaluation, which are presented in Tables 5–7 (included in the Simulation Results section).

The number of populations was pop = 100 and the maximum number of iterations was M = 100. The average fitness over 30 independent runs was considered as the optimization result. The convergence characteristics of the six algorithms in the single-peak function test are depicted in Figure 1, while Figure 2 illustrates the convergence characteristics in the multi-peak function test. Furthermore, Figure 3 demonstrates the convergence characteristics of the six algorithms on fixed-dimensional multi-peak functions. The test results of the six algorithms, based on 30 independent runs, are summarized in Tables 1, 2. In Tables 1, 2, purple indicates the optimal value of the test functions, pink indicates the mean value of the test functions, and white indicates the mean squared deviation of the test functions.

FIGURE 1

Figure 1. Single peak function test results. (A–G) represent the test results of the six algorithms in functions F1 to F7.

FIGURE 2

Figure 2. Multi-peak function test results. (A–F) represent the test results of the six algorithms in functions F8 to F13.

FIGURE 3

Figure 3. Fixed dimensional multi-peak function results. (A–G) represent the test results of the six algorithms in functions F14 to F20.

TABLE 1

Table 1. Test results of HDSA SCA and PSO algorithms run independently 30 times.

TABLE 2

Table 2. Test results of GWO FA and HHO algorithms run independently 30 times.

The results of the single-peak functions F1–F7 test results are presented in Tables 1, 2. In these tests, the mean and optimal values obtained by HDSA in F1–F5 are both 0, indicating that HDSA achieves the highest accuracy among the six algorithms. Although the accuracy of HDSA is slightly inferior to HHO in the F6–F7 test functions, it still outshines SCA, PSO, GWO, and FA. HDSA has a standard deviation of 0 in tests F1–F5, suggesting that HDSA is the most stable algorithm. Although its stability is slightly lower than HHO in tests F6–F7, it still outperforms the other four methods. Convergence speed is depicted in Figure 2. HDSA has a significantly faster convergence speed compared with the other five algorithms, but its convergence accuracy in the F6–F7 tests is lower than that of HHO.

The test results for the multi-peak functions F8–F13 are presented in Tables 1, 2. In the tests from F9 to F13, HDSA exhibits significantly better stability and convergence accuracy compared with the other five algorithms. It achieves higher accuracy and the smallest standard deviation. As depicted in Figure 3, except for the F8 test function, HDSA showcases the fastest convergence speed and highest convergence accuracy among the algorithms.

The results of the fixed dimensional multi-peak functions F14–F20 test results are shown in Tables 1, 2. In the F14 test, SCA has the best optimal and average accuracy, while HDSA exhibits slightly lower average accuracy and stability compared with SCA, PSO, and HHO. However, HDSA still manages to find the optimal solution in 30 runs. In the F15–F18 test results, HDSA, SCA, GWO, and HHO perform closely, with good stability and accuracy. In the F19–F20 tests, HDSA outperforms the other five algorithms significantly in terms of accuracy and stability. As shown in Figure 3, HDSA exhibits the fastest convergence speed among the other test functions, except for F15, F17, and F18. In the F15 test, HDSA is only slightly slower than HHO, while in the F17 and F18 tests, HDSA converges slightly slower than FA.

2.2. Mathematical model of the CDR

Before discussing the mathematical model of the CDR, the following assumptions are made: Assumption 1: The center of gravity and the geometric center of the robot body coincide. Assumption 2: The motor output torque meets the actual performance requirements of the robot during ground and water motion. Assumption 3: The robot's vertical swing, horizontal rocking, and longitudinal rocking during its movement on the water surface are ignored. Assumption 4: The motion of the robot on the ground is purely rolling, without any sliding motion.

The CDR designed in this study can be seen as a combination of a quadrotor UAV and a WMR. Figure 4A shows the robot moves on the ground. Figure 4B shows the robot moves on the water surface by webbed plates. Figure 4C shows the robot moves on the water surface by propllers. The robot moves in the air in a similar way to the quadrotor UAV as shown in Figures 4D, E. Figure 4F shows the structure of the robot, where webbed plates are mounted on the wheels. These webbed plates generate traction and rotational torque on the water surface by interacting with the water. However, as this paper focuses primarily on the FTC algorithm of the robot on the ground and on the water surface, the discussion does not explore the robot's aerial motion in detail.

FIGURE 4

Figure 4. (A) The robot moves on the ground. (B) The robot moves on the water surface by webbed plates. (C) The robot moves on the water surface by propllers. (D) The robot takes off from water surface. (E) The robot flying in the air. (F) The structure of robot.

The robot in the inertial frame and in the body frame is shown in Figure 5.

FIGURE 5

Figure 5. Robot in the inertial frame and the body frame.

In Figure 5, d is the distance from the geometric center of the robot O_b to the mass center of the robot. b is the axis radius and r is the wheel radius. ω_l, ω_r are the angular velocities of the left and right wheels. ψ is the angle between the robot body coordinate system b and the inertial coordinate system A, and ψ is the yaw angle of the robot. The kinematic model of the robot on the ground and water surface can be represented as (Liu et al., 2020):

\begin{array}{l} \dot{q} = R η & (1) \end{array}

where $q = [\begin{matrix} x & y & ψ \end{matrix}]$ represents the position and orientation of the robot in the inertial frame, while $η = [\begin{matrix} u & v & r \end{matrix}]$ is used to denote the longitudinal velocity, lateral velocity, and yaw angular velocity in the body frame. The coordinate conversion matrix is denoted by R, where $R = [\begin{matrix} cos ψ & sin ψ & 0 \\ - sin ψ & cos ψ & 0 \\ 0 & 0 & 1 \end{matrix}]$ . The dynamics model of the robot's motion on the ground can be expressed as

\begin{array}{l} M (q) \ddot{q} + C_{m} (q, \dot{q}) q + F (\dot{q}) + τ_{d} = B (q) τ & (2) \end{array}

The matrices M are symmetric positive definite inertia matrices, while C_m represents the centripetal and Coriolis matrix. The term $F (\dot{q})$ denotes mechanical friction, while τ_d is used to represent external disturbances. The input transformation matrices are denoted as B(q). Furthermore, the robot drive motors in the left and right wheel output torque are represented by $τ = {[\begin{matrix} τ_{l} & τ_{r} \end{matrix}]}^{T}$ .

M (q) = [\begin{matrix} m & 0 & m d sin ψ \\ 0 & m & - m d cos ψ \\ m d sin ψ & - m d cos ψ & I \end{matrix}],

B (q) = \frac{1}{r} [\begin{matrix} cos ψ & cos ψ \\ sin ψ & sin ψ \\ L & - L \end{matrix}],

C_{m} (q, \dot{q}) = {[\begin{matrix} m d {\dot{ψ}}^{2} cos ψ & m d {\dot{ψ}}^{2} sin ψ & 0 \end{matrix}]}^{T}

The mass of the robot is represented by m. The I is a scalar quantity and represents the rotational inertia of the robot as it rotates in the X-Y plane. The angular velocity of the robot is assumed to vary smoothly, so that $\dot{ψ} \approx 0$ . According to assumption 1, the Coriolis matrix can be assumed to be negligible, resulting in C_m ≈ 0. According to assumption 1, d = 0, so the matrix $M (q) = d i a g [\begin{matrix} m & m & I \end{matrix}]$ . Based on these assumptions, the dynamics model of the robot on the ground can be rewritten as follows:

\begin{array}{l} \bar{M} \ddot{q} + \bar{C} q + + \bar{F} (\dot{q}) + {\bar{τ}}_{d} = \bar{B} τ & (3) \end{array}

where $\bar{C} = R^{- 1} C_{m} Ṙ$ , $\bar{M} = R^{- 1} M R$ , $\bar{B} = R^{- 1} B$ . $\bar{F} (\dot{q}) = {[\begin{matrix} f_{u} & f_{v} & f_{r} \end{matrix}]}^{T}$ is the mechanical friction and ${\bar{τ}}_{d} = {[\begin{matrix} d_{u} & d_{v} & d_{r} \end{matrix}]}^{T}$ is the external disturbance. Rewriting 3 into algebraic form can be expressed as:

\begin{array}{l} {\begin{array}{l} \dot{u} = (F_{u} - f_{u} - d_{u}) / m + v ω \\ \dot{v} = - u ω - (f_{v} + d_{v}) / m \\ \dot{r} = (T_{r} - f_{r} - d_{r}) / I \end{array} & (4) \end{array}

The traction force is represented by F_u, while T_r represents the torque. To model the dynamics of the robot on the water surface, we can refer to the USV dynamics model (Chen et al., 2019), which can be expressed as follows

\begin{array}{l} M_{w} (q) \dot{η} + C_{w} (q, η) + D_{w} (η) η + F_{w} (η) + τ_{d w} = τ_{w} & (5) \end{array}

M_w is the inertia matrix. The traction force and torque of the robot at the water surface are $τ_{w} = {[\begin{matrix} F_{u} & 0 & T_{r} \end{matrix}]}^{T}$ . $τ_{d w} = {[\begin{matrix} d_{u w} & d_{v w} & d_{r w} \end{matrix}]}^{T}$ is the lumped disturbance and $F_{w} (η) = [\begin{matrix} f_{u w} & f_{v w} & f_{r w} \end{matrix}]$ is the water resistance.

M_{w} = [\begin{matrix} m_{11} & 0 & 0 \\ 0 & m_{22} & m_{23} \\ 0 & m_{32} & m_{33} \end{matrix}],

C_{w} (q, η) = [\begin{matrix} 0 & 0 & C_{13} (η) \\ 0 & 0 & C_{23} (η) \\ - C_{13} (η) & - C_{23} (η) & 0 \end{matrix}],

D_{w} (η) = [\begin{matrix} d_{11} & 0 & 0 \\ 0 & d_{22} & d_{23} \\ 0 & d_{32} & d_{33} \end{matrix}] .

The disturbances are represented by τ_dw. On the other hand, D_w(η) represents the water resistance. The Coriolis force matrix can also be neglected according to Assumption 1 and Assumption 3, so C_w(q, η) ≈ 0. The elements of the non-diagonal matrix in matrix D_w(η) and matrix M_w are small and can be neglected. This model simplification approach is also more common (Liao et al., 2016; Wang et al., 2019b; Deng et al., 2020), where $m_{11} = m - X_{\dot{u}}$ , $m_{22} = m - Y_{\dot{v}}$ , and m₃₃ = I_z−N_ṙ are the inertia parameters of the three axes and $X_{\dot{u}}$ , $Y_{\dot{v}}$ , and N_ṙ are the additional inertia parameters due to the wet water of the robot shell and the viscosity of the water. The dynamics model of the robot on the water surface can be expressed as:

\begin{array}{l} {\begin{array}{l} \dot{u} = \frac{m_{22}}{m_{11}} v ω - \frac{X_{u}}{m_{11}} u - \frac{X_{| u | u}}{m_{11}} | u | u + \frac{F_{u}}{m_{11}} + \frac{d_{u}}{m_{11}} \\ \dot{v} = - \frac{m_{11}}{m_{22}} u ω - \frac{Y_{u}}{m_{22}} v - \frac{Y_{| v | v}}{m_{22}} | v | v + \frac{d_{v}}{m_{11}} \\ \dot{ω} = \frac{m_{11} - m_{22}}{m_{33}} u υ - \frac{N_{ω}}{m_{33}} ω - \frac{N_{| ω | ω}}{m_{33}} | ω | ω + \frac{T_{r}}{m_{33}} + \frac{d_{r}}{m_{33}} \end{array} & (6) \end{array}

X_u, X_|u|u, Y_u, Y_|v|v, and N_ω, N_|ω|ω are the resistance coefficients. The resistance of the robot moving on the water surface can be approximated as a quadratic function of the velocity and angular velocity.

The mathematical model should be rewritten into a form that better suits the needs of the subsequent controller design. The dynamics model of the robot's motion on the ground is rewritten according to 4 as

\begin{array}{l} {\begin{array}{l} \dot{u} = F_{u} / m - \underset{d_{u g}}{\underset{︸}{(f_{u} + d_{u}) / m + v ω}} \\ \dot{r} = T_{r} / I - \underset{d_{r g}}{\underset{︸}{(f_{r} + d_{r}) / I}} \end{array} & (7) \end{array}

Where d_ug is the lumped disturbance and $d_{u g} \leq {\bar{d}}_{u g}$ , ${\bar{d}}_{u g}$ is the upper limit of the total disturbances. d_rg is the lumped disturbance and $d_{r g} \leq {\bar{d}}_{r g}$ , ${\bar{d}}_{r g}$ is the upper limit of the total disturbances. The dynamics model of the robot on the water surface is

\begin{array}{l} {\begin{array}{l} \dot{u} = \frac{F_{u c}}{m} - \underset{- D_{u w}}{\underset{︸}{\frac{ξ_{u} F_{u c}}{m_{11}} - F_{u a} - \frac{X_{u}}{m_{11}} u - \frac{X_{| u | u}}{m_{11}} | u | u + Δ_{F}}} \\ + \underset{d_{u w}}{\underset{︸}{\frac{m_{22}}{m_{11}} v ω + \frac{d_{u}}{m_{11}}}} \\ \dot{r} = \frac{T_{r c}}{I} - \underset{- D_{r w}}{\underset{︸}{\frac{ξ_{r} T_{r c}}{m_{33}} - T_{r a} - \frac{N_{ω}}{m_{33}} ω - \frac{N_{| ω | ω}}{m_{33}} | ω | ω + Δ_{T}}} \\ + \underset{d r w}{\underset{︸}{\frac{m_{11} - m_{22}}{m_{33}} u v + \frac{d_{r}}{m_{33}}}} \end{array} & (8) \end{array}

where F_uc is the desired tractive force and F_uc = F_u represents no force loss. $ξ_{u} \in [\begin{matrix} 0 & 1 \end{matrix})$ is the force loss parameter. Δ_F is the force disturbance due to mass change. d_uw is a lumped disturbance, $d_{u w} \leq {\bar{d}}_{u w}$ . ${\bar{d}}_{u w}$ is the upper bound of d_uw. D_uw is the uncertainty term when the robot moves on the water surface due to changes in system parameters, water resistance, and driver faults. T_rc is the desired torque and T_rc = T_r represents no force loss. $ξ_{r} \in [\begin{matrix} 0 & 1 \end{matrix})$ is the power loss parameter. Δ_T is the torque disturbance due to the change of inertia parameter. d_rw is a lumped disturbance, $d_{r w} \leq {\bar{d}}_{r w}$ . ${\bar{d}}_{r w}$ is the upper bound of d_rw. D_rw is the uncertainty term due to changes in system parameters, water resistance, and driver faults during robot rotation on the water surface.

3. Active fault tolerance control algorithm and human decision search algorithm

3.1. RBFNN-based active fault-tolerant control algorithm

Both the yaw control and the linear velocity control of the robot are essentially single-input single-output (SISO) second-order non-linear affine systems. Without loss of generality, a second-order non-linear affine SISO system with drive faults can be expressed as:

\begin{array}{l} {\begin{array}{l} {\dot{x}}_{1} = x_{2} \\ {\dot{x}}_{2} = f (x) + g (x) u_{c} + D \\ y = x_{1} \end{array} + d & (9) \end{array}

u_c is unconstrained control input, u_a is the drive bias, ξ is the power loss parameter, $ξ \in [\begin{matrix} 0 & 1 \end{matrix})$ , 0 represents no power loss, and 1 represents a complete loss of efficiency. D = −g(x)ξu_c + u_a is the uncertainty term due to the driver fault. The disturbance d has a well-defined upper limit and $| d | \leq \bar{d}$ . x₁, x₂ are system states. f(x) is the system function and g(x) is the input function. Owing to the physical constraints of the controlled object, the control input is subject to saturation:

\begin{array}{l} u_{c o n} = {\begin{array}{l} \begin{matrix} u_{\max}, & | u_{c} | > u_{\max} \end{matrix} \\ \begin{matrix} u_{c}, & u_{c} \leq u_{\max} \end{matrix} \end{array} & (10) \end{array}

u_max is the physical constraint. To make the control input smoother, the cutoff function is usually replaced by a saturation function, such as tanh.

\begin{array}{l} u_{c o n} = u_{max} tanh (u_{f} / u_{max}) & (11) \end{array}

where u_con is the constrained control input and u_f is a function of u_c. Thus, the control objective is to design the constrained control law u_con so that it satisfies the control requirements even in the presence of drive faults and external disturbances in the controlled object. The steps for designing an AFT controller are the following:

Step 1: Define the state error e₁ = x_1d − x₁. Establish the Lyapunov function $V_{1} = \frac{1}{2} e_{1}^{2}$ . Taking the derivative of V₁ with respect to the time t gives

\begin{array}{l} {\dot{V}}_{1} = e_{1} {\dot{e}}_{1} = e_{1} ({\dot{x}}_{1 d} - x_{2}) & (12) \end{array}

Define the virtual state α_x = k₁e₁ + ẋ_1d as the desired input of the next step. If x₂ can follow α_x, ${\dot{V}}_{1} = - k_{1} e_{1}^{2}$ . So, the next step of the control law must ensure that α_x − x₂ = 0. α_x is the next desired state x_2d.

Step 2: Define the state error e₂ = x_2d − x₂, and the fast NTSMC is designed as

\begin{array}{l} S = e_{2} + α e_{1} + β e_{1}^{λ} & (13) \end{array}

where α and β are positive adjustable parameters and λ is a positive odd number. The sliding mode convergence law is

\begin{array}{l} \dot{S} = - k_{2} S - k_{3} {| S |}^{γ_{1}} sgn (S) & (14) \end{array}

where k₁, k₂, and γ₁ are positive adjustable parameters. sgn is the symbolic function. The derivation of 13 yields:

\begin{array}{l} \dot{S} = {\dot{e}}_{2} + α {\dot{e}}_{1} + λ β e_{1}^{λ- 1} {\dot{e}}_{1} = - k_{2} S - k_{3} {| S |}^{γ_{1}} sgn (S) & (15) \end{array}

where

\begin{array}{l} \begin{array}{l} {\dot{e}}_{2} = {\dot{x}}_{2 d} - {\dot{x}}_{2} \\ = {\dot{α}}_{x} - f (x) - g (x) u_{c} - d - D \\ = - k_{2} S - k_{3} {| S |}^{γ_{1}} sgn (S) \end{array} & (16) \end{array}

The controller law can be designed as follows:

\begin{array}{l} u_{c} = \frac{1}{g (x)} ({\dot{α}}_{x} - f (x) - D + k_{2} S + k_{3} {| S |}^{γ_{1}} sgn (S) + α ė_{1} + λ β {e_{1}}^{λ - 1} ė_{1}) & (17) \end{array}

In 17, the uncertain term due to drive faults D is known. Establishing the Lyapunov function $V_{2} = \frac{1}{2} S^{2}$ , the derivative of V₂ yields

\begin{array}{l} \begin{array}{l} {\dot{V}}_{2} = S \dot{S} \\ \begin{matrix} = S ({\dot{e}}_{2} + α {\dot{e}}_{1} + λ β e_{1}^{λ- 1} {\dot{e}}_{1}) \end{matrix} \\ = S ({\dot{α}}_{x} - f (x) - g (x) u_{c} - d - D + α {\dot{e}}_{1} + λ_{1} β e_{1}^{λ_{1} - 1} {\dot{e}}_{1}) \end{array} & (18) \end{array}

Bringing 17 into 18 yields

\begin{array}{l} \begin{array}{l} {\dot{V}}_{2} = S \dot{S} \\ = S (- d - k_{2} S - k_{3} {| S |}^{γ_{1}} sgn (S)) \\ = - k_{2} S^{2} - k_{3} {| S |}^{γ_{1} + 1} - S d \\ \leq - k_{2} S^{2} - k_{3} {| S |}^{γ_{1} + 1} + | S | \bar{d} \\ = - k_{2} S^{2} - k_{3} {| S |}^{γ_{1} + 1} + | S | \bar{d} \\ = - k_{2} S^{2} - | S | (k_{3} {| S |}^{γ_{1}} - \bar{d}) \end{array} & (19) \end{array}

When $k_{3} > \bar{d} / | S |^{γ_{1}}$ , $k_{3} | S |^{γ_{1}} - \bar{d} = ε$ , ε > 0, thus:

\begin{array}{l} {\dot{V}}_{2} \leq - 2 k_{2} V_{2} - ε | S | \leq - 2 k_{2} V_{2} - \sqrt{2} ε V_{2}^{1 / 2} < - α_{1} V_{2}^{1 / 2} - β_{1} V_{2} & (20) \end{array}

where α₁ = 2k₂, $0 < β_{1} < \sqrt{2} ε$ .

LEMMA 1 [44] (Jiang and Lin, 2020): Consider a smooth positive definite V(x), x ∈ R_n. Suppose that real numbers p₁ ∈ (0, 1), α > 0, and β > 0 exist such that $V (x) < - α V {(x)}^{p_{1}} - β V (x)$ . Then, an area U₀ ∈ R_n exists, such that any V(x) starting from U₀ can reach V(x) = 0 in finite time T_v, which is expressed as $T_{v} \leq \frac{1}{β (1 - p_{1})} ln (\frac{V^{1 - p_{1}} (x_{0}) + α}{α})$ .

According to lemma 1, V₂ can converge to 0 in finite time. In the above discussion, the uncertainty term D is assumed to be known, but the actual uncertain term D is unknown. As RBFNN can approximate arbitrary uncertain non-linear functions and does not depend on a mathematical model, it is more suitable for estimating stochastic uncertain terms. Therefore, optimal neural network weights w^* must exist such that $D = ε_{0} + {w^{*}}^{T} h$ , ε₀ is the estimated residual and h is the neuron. $\tilde{w} = ŵ - w^{*}$ , ŵ is an estimate of w^* and w^* is a constant, so $\dot{\tilde{w}} = \dot{ŵ}$ . Rewrite 9 as:

\begin{array}{l} {\begin{array}{l} {\dot{x}}_{1} = x_{2} \\ {\dot{x}}_{2} = f (x) + g (x) u_{c} + d + ε_{0} + w^{*}^{T} h \\ y = x_{1} \end{array} & (21) \end{array}

Step 3: Establish the Lyapunov function V₃ as

\begin{array}{l} V_{3} = \frac{1}{2} S^{2} + \frac{1}{2} t r ({\tilde{w}}^{T} Γ^{- 1} \tilde{w}) & (22) \end{array}

The derivation of formula 22 yields

\begin{array}{l} V_{3} = S Ṡ + {\tilde{w}}^{T} Γ^{- 1} \dot{ŵ} \\ = S ({\dot{α}}_{x} - f (x) - g (x) u_{c} - d - ε_{0} - {w^{*}}^{T} h + α ė_{1} + λ_{1} β e_{1}^{λ_{1} - 1} ė_{1}) \\ + {\tilde{w}}^{T} Γ^{- 1} \dot{ŵ} & (23) \end{array}

The control law is designed to

\begin{array}{l} u_{c} = \frac{1}{g (x)} ({\dot{α}}_{x} - f (x) - ŵ^{T} h + k_{2} S + k_{3} {| S |}^{γ_{1}} sgn (S)) & (24) \end{array}

Bringing formula 24 into 23 yields

\begin{array}{l} {\dot{V}}_{3} = - k_{2} S^{2} - k_{3} | S |^{γ_{1} + 1} - S ε_{1} + {\tilde{w}}^{T} (S h + Γ^{- 1} \dot{ŵ}) & (25) \end{array}

where ε₁ = d + ε₀, the upper limit of the estimation error of the neural network is ${\bar{ε}}_{0}$ . ${\bar{ε}}_{0} \geq ε_{0}$ , $\bar{d} \geq d$ , so that $ε_{1} \leq \bar{d} + {\bar{ε}}_{0} = {\bar{ε}}_{1}$ . The update law of the RBFNN weights is designed as

\begin{array}{l} \dot{ŵ} = - Γ S h & (26) \end{array}

Bringing 26 into 25 yields

\begin{array}{l} {\dot{V}}_{3} = - k_{2} S^{2} - k_{3} | S |^{γ_{1} + 1} - S ε_{1} \\ \leq - k_{2} S^{2} - k_{3} | S |^{γ_{1} + 1} + | S | {\bar{ε}}_{1} \\ = - k_{2} S^{2} - | S | (k_{3} {| S |}^{γ 1} - {\bar{ε}}_{1}) & (27) \end{array}

when $k_{3} > \bar{ε} / | S |^{γ_{1}}$ , $k_{3} | S |^{γ_{1}} - \bar{ε} = ε_{2}$ , where ε₂ > 0, thus:

\begin{array}{l} {\dot{V}}_{3} \leq - 2 k_{2} V_{2} - ε_{2} | S | \leq - 2 k_{2} V_{2} - \sqrt{2} ε_{2} V_{2}^{1 / 2} \\ < - α_{1} V_{2}^{1 / 2} - β_{1} V_{2} < 0 & (28) \end{array}

According to lemma 1, V₂ can converge to 0 in finite time.

The control input u_c in formula 24 is the unconstrained, to prevent the control input saturation, define u_d = u_c, where u_d is the desired value in the next step, and the state error e₃ = u_d−u_con. u_con satisfies the constrained control input of the saturation function tanh; therefore, parameter u_f must exist, such that u_con = u_maxtanh(u_f/u_max), where u_max is the maximum input.

\begin{array}{l} {\dot{u}}_{c o n} = (1 - {tanh}^{2} (u_{f} / u_{max})) {\dot{u}}_{f} & (29) \end{array}

Step 4: Establish the Lyapunov function $V_{4} = \frac{1}{2} e_{3}^{2}$ and derive V₃ and bring it into 29 to obtain:

\begin{array}{l} {\dot{V}}_{4} = e_{3} ė_{3} \\ = e_{3} ({\dot{u}}_{d} - {\dot{u}}_{c o n}) \\ = e_{3} ({\dot{u}}_{d} - (1 - {tanh}^{2} (u_{f} / u_{max})) {\dot{u}}_{f}) & (30) \end{array}

${\dot{u}}_{f}$ is designed as

\begin{array}{l} {\dot{u}}_{f} = {\begin{array}{l} \begin{matrix} (k_{4} e_{3} + {| e_{3} |}^{γ_{2}} s g n (e_{3}) + {\dot{u}}_{d}) / (1 - \tanh^{2} (u_{f} / u_{\max})) ​​ & , δ \geq Δ \end{matrix} \\ \begin{matrix} {| δ e_{3} |}^{γ_{2}} s g n (e_{3}) + {\dot{u}}_{d} / (1 - \tanh^{2} (u_{f} / u_{\max})) ​​ & , δ < Δ \end{matrix} \end{array} & (31) \end{array}

where δ = |u_f|−2u_max, Δ is a smaller normal value. γ₂ ∈ (0, 1). The convergence of the controller is discussed in the following cases. When δ ≥ Δ, substituting 31 into 30 yields

\begin{array}{l} {\dot{V}}_{4} = - k_{4} e_{3}^{2} - {| e}_{3} |^{γ_{2} + 1} = - 2 k_{4} V_{3} - 2^{(γ_{2} + 1) / 2} V_{3}^{(γ_{2} + 1) / 2} \\ < - α_{2} V_{4}^{(γ_{2} + 1) / 2} - β_{2} V_{4} & (32) \end{array}

where $0 < α_{2} < 2^{(γ_{2} + 1) / 2}$ , 2k₃ = β₂. According to Lemma 1, V₄ can converge to 0 in finite time. When δ < Δ, substituting 31 into 30 yields

\begin{array}{l} {\dot{V}}_{4} = - ({| δ |}^{γ_{2}} {| e_{3} |}^{γ_{2} + 1}) / (1 - {tanh}^{2} (u_{c} / u_{max})) \\ = - ({| δ |}^{γ_{2}} 2^{(γ_{2} + 1) / 2} / (1 - {tanh}^{2} (u_{c} / u_{max}))) V_{4}^{α_{3}} \\ = - c V_{4}^{α_{3}} & (33) \end{array}

where α₃ = (γ₂ + 1)/2, $c = | δ |^{γ_{2}} 2^{(γ_{2} + 1) / 2} / (1 - {tanh}^{2} (u_{c} / u_{max}))$ , and tanh(u_c/u_max) < 1, so c > 0. According to Lemma 2, V₄ can converge in finite time.

LEMMA 2: Chu et al. (2022) Suppose that there is a positive definite continuous Lyapunov function V(x, t) defined on $U_{1} \times R^{+}$ , where U₁ ⊆ U ⊆ R_n. R_n is a neighborhood of the origin, and $V (x, t) \leq - c V^{α} (x, t), \forall x \in U_{1} \ {0}$ , where c > 0, 0 < α < 1. Then, the origin of the system is locally finite time stable. The settling time $T \leq V^{1 - α} (x (t_{0}), t_{0}) / c (1 - α)$ satisfies for a given initial condition x(t₀) ∈ U₁.

3.2. Human decision search algorithm

The human decision search algorithm (HDSA) is a swarm optimization technique that mimics the decision-making process of a human crowd. In many post-apocalyptic survival games or films, the strong group consciousness of humans is often portrayed, but the importance of individual consciousness is also emphasized. In human groups, a small group of individuals called decision-makers make the final decisions based on their experience and personal status. However, the decision of the decision-maker is not necessarily optimal. When the number of individuals in the group is small, it is important to involve more people in the decision-making process to guide the development of the group and to avoid the excessive impact of individual decisions on the group. However, when the number of individuals in the group is large, the proportion of decision-makers should be reduced and only a few elite individuals should be selected to determine the development of the group. This is because too many people involved in the decision-making process may take more time, and the experience of ordinary people may not be as good as that of elite individuals. Because people have emotions, they can think both rationally and emotionally when dealing with problems, and these two opposing ways of thinking must coexist.

Apart from the decision-makers, the rest of the human population is referred to as the executors, consisting of individuals who have no or less ability to make decisions. They carry out the optimal decisions made by the decision-makers. However, individuals among the executors who have some decision-making ability should be encouraged to seek more humane decisions based on the optimal decisions. These decisions should become more adapted to the current environment over time. The number of decision-makers is fixed, and elite individuals in the human population will always be selected as decision-makers. Over time, any individual has the potential to become a decision-maker, and the current decision-maker may become an executor.

In a human population, there are always individuals who question the current decision or believe they have a better one, including the decision-makers themselves. These individuals are known as adventurers, and their numbers and identities are random, making them a source of uncertainty within the population. Although adventurers can lead people to a better life, they can also lead them to disaster. Adventurers, on the other hand, inherit the current optimal choices of the human population and take them into account when making decisions. However, more adventurous individuals will also seek out possible optimal decisions based on their own state. To avoid harming the human population, adventurers must consider whether the decisions they make are more beneficial to their own survival. Additionally, there is a chance that an adventurer will become a decision-maker if they come up with a better or suboptimal decision. Based on the above analysis, the proposed algorithm for optimizing the human decision population consists of three main components: decision updating for decision-makers, decision updating for executors, and decision updating for adventurers.

3.2.1. Decision updates for decision makers

The number of decision-makers is fixed in proportion to the total number of people, and the number of decision-makers is 20–50% of the total number of people. The decision-makers make their decisions based on individual experience as well as individual characteristics. The sine and cosine functions are used to distinguish between rational and emotional decisions by people, and the individuals are randomly updated due to the random adoption of rational and emotional decisions by people.

\begin{array}{l} x_{i}^{t + 1} = {\begin{array}{l} r_{1} x_{i}^{t} \sin (r_{2} | \begin{matrix} r_{3} x_{i b e s t}^{t} - x_{i}^{t} \end{matrix} |), R < 0.5 \\ r_{1} x_{i}^{t} \cos (r_{2} | \begin{matrix} r_{3} x_{i b e s t}^{t} - x_{i}^{t} \end{matrix} |), R \geq 0.5 \end{array} & (34) \end{array}

where $x_{i}^{t}$ denotes the t_th iteration of the i_th human individual. r₁ is a non-linear term, r₁ = 2*(1 − i/(α₁*d_num)). d_num is the number of decision-makers. α₁ is a random number between (0, 1). r₂ = α₂2π and α₂ is the random number between (0, 1). r₃ = 2α₃, α₃ is a random number between (0, 1). r is the random number between (0, 1). $x_{i b e s t}^{t}$ is the individual optimal solution for 1 to t iterations.

3.2.2. Decision updates for executors

Except for the decision-maker, the rest of the individuals are the executors. Among the executors, individuals with a fitness that is higher than the intermediate fitness are ordinary executors that must follow the optimal decision of the decision-maker. Individuals with a fitness below the intermediate fitness are considered as executors with some decision-making ability, and this group can continue to explore the next optimal decision that may exist based on the current optimal decision.

\begin{array}{l} x_{i}^{t + 1} = {\begin{array}{l} x_{b e s t}^{t} + β_{1} | \begin{matrix} (x_{i}^{t} - x_{m}^{t}) / (f_{i}^{t} - f_{m}^{t}) \end{matrix} |, f_{i}^{t} > f_{m}^{t} \\ s g n (x_{e}^{t}) e x p (| \begin{matrix} x_{b e s t}^{t} - x_{i}^{t} \end{matrix} | / β_{2}), f_{i}^{t} \leq f_{m}^{t} \end{array} & (35) \end{array}

where $x_{b e s t}^{t}$ is the current global best individual and $x_{w o r s t}^{t}$ is the current global worst individual. $x_{e}^{t} = x_{b e s t}^{t} - x_{w o r s t}^{t}$ . $f_{i}^{t}$ is the fitness of the i_th individual, $f_{m}^{t} = (f_{b e s t}^{t} + f_{w o r s t}^{t}) / 2$ , $f_{b e s t}^{t}$ is the current best fitness, and $f_{w o r s t}^{t}$ is the current worst fitness. β₁ is the random number of normal distribution with mean 0 and variance 1. The sgn function determines the direction of exploration of individuals. $β_{2} = t^{2} / f_{b e s t}^{t}$ indicates that a more favorable decision result can be obtained over time.

3.2.3. Decision updates for adventurers

The adventurers are random individuals and the number of adventurers is also random. If the adventurer's fitness is less than the average fitness, the adventurer randomly explores based on the current optimal solution. If the adventurer's fitness is higher than the average fitness, the adventurer will continue to explore in the optimal direction according to the current state of the individual.

\begin{array}{l} x_{i}^{t + 1} = {\begin{array}{l} x_{b e s t}^{t} + c_{1} | \begin{matrix} x_{b e s t}^{t} - x_{i}^{t} \end{matrix} |, f_{i}^{t} > f_{a v r}^{t} \\ x_{i}^{t} + (2 c_{2} - 1) {| \begin{matrix} | \begin{matrix} x_{e}^{t} \end{matrix} | \end{matrix} |}_{2} s g n (x_{e}^{t}), f_{i}^{t} \leq f_{a v r}^{t} \end{array} & (36) \end{array}

where c₁ is a normally distributed random number with mean 0. c₂ is a random number between (0, 1) with variance 1. $|| x_{e}^{t} ||_{2}$ is the Euclidean norm of $x_{e}^{t}$ and $f_{a v r}^{t}$ is the current mean fitness.

Based on the above discussion, the proposed HDSA has three steps. The first step performs a global random search using the formula 34. In the second step, a local search is performed based on the first step using the formula 35. The third step performs a second global random search using the formula 36 on the basis of the first and second steps. HDSA framework as Algorithm 1.

ALGORITHM 1

Algorithm 1. HDSA.

3.3. Yaw controller and linear velocity controller

According to the control algorithm in the “RBFNN-Based Active Fault-Tolerant Control Algorithm” section, the AFTC is used to design controllers in this section to follow the desired yaw angle ψ_d and desired linear velocity v_d. The robot linear velocity sliding mode surface is: $S_{v} = α_{v} e_{v} + β_{v} {e_{v}}^{λ_{v}}$ , where e_v = v_d−v. The sliding mode convergence law is $Ṡ_{v} = - k_{2 v} S_{v} - k_{3 v} | S |^{γ_{1 v}} sgn (S_{v})$ .

The proof of convergence for the velocity controller is similar to that for the general-purpose controller in the “RBFNN-Based Active Fault-Tolerant Control Algorithm" section. The unconstrained control law is designed as

\begin{array}{l} F_{u c} = m ({\dot{v}}_{d} - {ŵ_{v}}^{T} h_{v} + k_{2 v} S_{v} + k_{3 v} {| S_{v} |}^{γ_{1 v}} sgn (S_{v})) & (37) \end{array}

The anti-input saturation controller of linear velocity is designed as

\begin{array}{l} {\begin{array}{l} F_{u f} = {\begin{array}{l} \begin{array}{l} \int (k_{4 v} e_{F} + {| e_{F} |}^{γ_{2 v}} s g n (e_{F}) + {\dot{F}}_{u c}) \\ / (1 - \tanh^{2} (F_{u f} / F_{\max})) d t, δ_{v} \geq Δ_{v} \end{array} \\ \begin{matrix} \int {| δ_{v} e_{F} |}^{γ_{2 v}} s g n (e_{F}) + {\dot{F}}_{u c} / (1 - \tanh^{2} (F_{u f} / F_{\max})) d t \\ , δ_{v} < Δ_{v} \end{matrix} \end{array} \\ F_{u c o n} = F_{\max} \tanh (F_{u f} / F_{\max}) \end{array} & (38) \end{array}

Where e_F = F_uc−F_ucon.

The yaw angle controller is $ω_{d} = k_{ψ} e_{ψ} + {\dot{ψ}}_{d}$ , where e_ψ = ψ_d − ψ. The yaw angle sliding mode surface is $S_{ω} = e_{ω} + α_{ψ} e_{ψ} + β_{ψ} {e_{ψ}}^{λ_{ψ}}$ . The sliding mode convergence law is $Ṡ_{ω} = - k_{2 ω} S_{ω} - k_{3 ω} | S_{ω} |^{γ_{1 ω}} sgn (S_{ω})$ .

The unconstrained control law is designed as

\begin{array}{l} T_{r c} = I ({\dot{ω}}_{d} - {ŵ_{ω}}^{T} h_{ω} + k_{2 ω} S_{ω} + k_{3 ω} {| S_{ω} |}^{γ_{1 ω}} sgn (S_{ω})) & (39) \end{array}

The anti-input saturation controller of the yaw angle is designed as

\begin{array}{l} {\begin{array}{l} T_{r f} = {\begin{array}{l} \begin{array}{l} \int (k_{4 ω} e_{T} + {| e_{T} |}^{γ_{2 ω}} s g n (e_{T}) + {\dot{T}}_{r c}) \\ / (1 - \tanh^{2} (T_{r f} / T_{\max})) d t, δ_{ω} \geq Δ_{ω} \end{array} \\ \begin{array}{l} \int {| δ_{ω} e_{T} |}^{γ_{2 ω}} s g n (e_{T}) + {\dot{T}}_{r c} / (1 - \tanh^{2} (T_{r f} / T_{\max})) d t, \\ δ_{ω} < Δ_{ω} \end{array} \end{array} \\ T_{r c o n} = T_{\max} \tanh (T_{r f} / T_{\max}) \end{array} & (40) \end{array}

where e_T = T_rc−T_rcon. The controller parameters are not described in this section as they have been discussed in the “RBFNN-Based Active Fault-Tolerant Control Algorithm" section.

The input to the angular velocity neural network is both the yaw error and the angular velocity error, and the output is the uncertainty term in the angular velocity control. The coordinate vector matrix of the centroids of the Gaussian basis function neurons in the angular velocity neural network is

$c_{ψ} = {[\begin{matrix} - 1.6 & - 0.8 & - 0.4 & - 0.2 & - 0.1 & 0 & 0.1 & 0.2 & 0.4 & 0.8 & 1.6 \\ - 1.6 & - 0.8 & - 0.4 & - 0.2 & - 0.1 & 0 & 0.1 & 0.2 & 0.4 & 0.8 & 1.6 \end{matrix}]}_{_{2 * 11}}$

The width of the Gaussian basis function b_ψ = 0.1, i = 1⋯11.

The input to the linear velocity neural network is the velocity error and the output is the linear velocity control uncertainty term. The coordinate vector matrix of the centroids of the Gaussian basis function of the neurons in the linear velocity neural network is

$c_{v} = {[\begin{matrix} - 1.6 & - 0.8 & - 0.4 & - 0.2 & - 0.1 & 0 & 0.1 & 0.2 & 0.4 & 0.8 & 1.6 \end{matrix}]}_{1 * 11}$ . The width of the Gaussian basis function b_v = 0.1, i = 1⋯11.

Based on the above discussion, the proposed framework for the AFTC is shown in Figure 6.

FIGURE 6

Figure 6. AFTC framework.

4. Simulation results

In the section entitled “HDSA's Related Work”, we have demonstrated the advantages of the proposed HDSA; therefore, in this section, the HDSA is used to optimize the sliding mode surface parameters of the yaw controller and the linear velocity controller. As the weight update parameters of the RBFNNs are related to the sliding mode parameters, this also indirectly optimizes the RBFNNs.

The parameters to be optimized for yaw angle control are the sliding mode surface coefficients α_ω, β_ω and the neural network update coefficient Γ_ω. According to the idea of AFTC, the presence of −3N.m of disturbance torque in the robot model simulates the worst case. The initialized optimization algorithm parameters are as follows: dimension is 3, the number of populations is 20, the number of max iterations is 10, and the upper limit of parameters is 20 and the lower limit is −20.

The evaluation function of the yaw controller is designed as f_obj = 0.8*|e_ψ| + 0.1*|e_ω| + 0.01*|T_rc|. For yaw control, we want to reduce both the yaw error and the yaw velocity error with the smallest control input. As the control objective is to eliminate the yaw error, the yaw error is given the largest weight in the evaluation function. To keep the control input and yaw error in the same order, the control input weight is reduced. The optimization parameters for the yaw controller are shown in Figure 7.

FIGURE 7

Figure 7. Yaw control parameter optimization and fitness of the yaw controller objective function. (A) The optimized parameters of yaw controller. (B) The objective function output value.

As shown in Figure 7, the optimized parameters converge after eight iterations. The values of Γ_ω = 20, α_ω = 7.4407, and β_ω = 2.9369 are obtained through the optimization process.

The optimized parameters are substituted into the AFTC and the control results are compared with the unoptimized AFTC, NTSMC, and SMC. Before 10 s, the yaw angle is influenced by a torque with a mean value of −1N.m and a mean square error of 0.1. After 10 s, the yaw angle is influenced by a torque with a mean value of −3N.m and a mean square error of 0.1. The control parameters are given in Table 3.

TABLE 3

Table 3. Parameters of yaw angle controllers.

The results of the yaw angle controller are shown in Figure 8.

FIGURE 8

Figure 8. (A) The yaw angle control results. (B) Control input torque. (C) Yaw angle RBFNN output value. (D) Yaw angle RBFNN weight.

In Figure 8A, the optimized AFTC has a significantly faster response speed (pink line). Despite being influenced by a −1 N.m torque disturbance in the range of 0–10 s, the AFTC, NTSMC (green line), and SMC (red line) maintain their robustness and are not affected by the disturbance. After 10 s, the yaw angle is subjected to a torque of −3N.m, in which case reliance on the robustness of the controller can no longer guarantee yaw angle control performance, as shown in the 10–11 s enlargement in Figure 8A. The SMC is unable to follow the desired yaw angle with a static error of ~0.05 rad, and the NTSMC also has a small static difference.

As shown in Figure 8B, the proposed AFTC (pink line) and the optimized AFTC (orange line) do not enter the driver saturation state. The NTSMC (purple line) and the SMC (green line) enter the driver saturation state. Compared with the conventional SMC (green line) and NTSMC (purple line) control inputs, which have high-frequency input chatter, the control input of the proposed AFTC is more stable. This suggests that the robustness achieved by the conventional SMC comes at the expense of control input performance. In Figure 8C, the output of the radial basis function neural network (RBFNN) is displayed, showing a value of 1 before 10 s and 3 after 10 s. The RBFNN can estimate the unknown yaw disturbances online. The RBFNN weights are updated accordingly, as shown in Figure 8.

The parameters to be optimized for the velocity controller are the sliding mode surface coefficients α_v and β_v and the neural network update coefficients Γ_v. The presence of −5N force in the robot model simulates the worst case. The initialized optimization algorithm parameters are as follows: the dimension is 3, the number of populations is 20, the number of maximum iterations is 10, and the upper limit of parameters 20 and the lower limit is 20.

The evaluation function is designed as f_obj = 0.8*|e_v| + 0.02*|F_uc|. When controlling the linear velocity, we want to minimize the linear velocity error with the smallest control input. Therefore, the linear velocity error has the largest weight in the evaluation function. The weight of the control input is reduced to keep the control input and the linear velocity error at the same level. The linear velocity controller optimization parameters are shown in Figure 9.

FIGURE 9

Figure 9. Velocity control parameter optimization and fitness of the velocity controller objective function. (A) The optimized parameters of velocity controller. (B) The objective function output value.

As shown in Figure 9, the optimization parameters converge after two iterations. The optimized parameters are Γ_v = 15.6467, α_v = 16.1866, and β_v = 20.

These parameters are used in the proposed AFTC, and the control results are compared and analyzed with the unoptimized AFTC, NTSMC, and SMC controllers. Before 10 s, the linear velocity is affected by a force with a mean value of −2N and a mean square error of 0.1. After 10 s, the velocity is influenced by a force with a mean value of −5N and a mean square error of 0.1. The velocity controller parameters are given in Table 4.

TABLE 4

Table 4. The parameters of velocity controllers.

The control results of linear velocity controllers are shown in Figure 10.

FIGURE 10

Figure 10. Linear velocity control results. (A) Velocity control results. (B) Control input force. (C) Velocity RBFNN output value. (D) Velocity RBFNN weight.

Similar to the performance of the yaw control, in Figure 10A, the optimized AFTC (pink line) responds faster compared with the proposed AFTC (purple line) and SMC (red line). Between 0 and 10 s, when the line speed is subjected to -2N force, AFTC (purple line), NTSMC (green line), and SMC (red line) are not affected by the disturbances. After 10 s, the linear velocity is subjected to a force of −5N and the velocity control performance cannot be guaranteed by the NTSMC and SMC. There is a static error of ~0.05m/s for the NTSMC and ~0.6m/s for the SMC, as shown in the 9–12 s enlargement in Figure 10A. Both the proposed AFTC and the optimized AFTC can follow the desired linear velocity, and the velocity controller is almost unaffected by the −5N force using the optimized parameters. The proposed AFTC and the optimized AFTC can effectively track the desired linear velocity, with minimal impact from the −5N force disturbance. The velocity controller of the AFTC is almost unaffected by the disturbance, indicating its robustness and ability to maintain precise control performance.

The previous discussion has highlighted the improved responsiveness and robustness of the optimized AFTC. To further emphasize the advantages of the optimized AFTC, the output value of the evaluation function is used as a criterion to evaluate the performance of the four controllers. A smaller output value of the evaluation function indicates better controller performance. The output values of the evaluation functions for the four controllers are depicted in Figure 11.

FIGURE 11

Figure 11. Four control evaluation function outputs. (A) Yaw angle evaluation function outputs. (B) Velocity evaluation function outputs.

As shown by the green lines in Figures 12A, B, the optimized AFTC controller exhibits the smallest value of the evaluation function. This signifies that the optimized AFTC achieves the best performance among the four controllers. As the linear velocity and yaw angle are consistently subjected to external disturbances, the output value of the evaluation function continually increases. This is because of the fact that the control inputs are not equal to zero. In the case of large external disturbances, the NTSMC and SMC controllers can no longer eliminate the yaw angle error and the linear velocity error. Consequently, the output value of the evaluation function rapidly increases, as indicated by the red and blue lines.

FIGURE 12

Figure 12. The robot tracks the desired trajectory. (A) Tracking the circle desired trajectory. (B) X-position control. (C) Yaw angle control. (D) Y-position control.

To further verify the effectiveness of the proposed algorithm, the AFTC is used to design the yaw angle controller and the velocity controller. The desired yaw angle and the desired linear velocity is planned by the LOS algorithm. The optimized parameters are selected as the controller's parameters. The LOS algorithm and the improved LOS algorithm can be found in the author's previous work (Wang et al., 2022b). The desired trajectory is a circular trajectory with radius R = 1m, angular velocity ω_r = 0.5rad/s, and linear velocity v_r = 0.5m/s. The initial position and pose of the robot is [0m, 0.5m, 0rad]. A drag force of −2N and a torque of −1N.m are applied to the robot. The LOS algorithm is

\begin{array}{l} {\begin{array}{l} ψ_{L} = ψ_{r} - α \\ α = \arctan (e_{y} / Δ) \\ v_{L} = v_{r} + k e_{x} \end{array} & (41) \end{array}

where ψ_L, v_L are the desired yaw angle and desired linear velocity planned by the LOS algorithm. e_x, e_y is the position error in Frenet-Serret (F-S) frame. Δ and k are the positive adjustable parameters.

The control results of the robot tracking the desired circle trajectory are shown as Figures 12–14. The robot position control and yaw angle control are shown in Figure 12.

The robot can track the desired trajectory. The actual position pose of the robot is consistent with the desired position pose. The linear velocity control and angular velocity control are shown in Figure 13.

FIGURE 13

Figure 13. The control results of linear velocity and yaw angular velocity. (A) Linear velocity control. (B) Yaw angle velocity control.

In Figure 13A, the linear velocity can track the desired linear velocity of 0.5m/s. In Figure 13B, the angular velocity can track the desired angular velocity of −0.5rad/s. Figure 14 shows the linear velocity control input and yaw angle velocity control input.

FIGURE 14

Figure 14. The linear velocity control input and yaw angular velocity control input. (A) Control input force. (B) Control input torque.

In Figures 14A, B, the −2N force and −1N.m torque are applied to the robot. So the control inputs are 2N and 1N.m to counteract the effect of the external force and torque on the robot.

The test functions for swarm intelligence optimization algorithms are shown in Tables 5–7.

TABLE 5

Table 5. The single-peak test functions.

TABLE 6

Table 6. The multi-peak test functions.

TABLE 7

Table 7. The fixed-dimensional multi-peak test functions.

5. Conclusion

This paper proposes an RBFNN-based anti-input saturation AFTC to solve the problem of degraded control performance of the CDR during movement on the water surface caused by drive faults, uncertain water resistance, and uncertain model parameters. The AFTC incorporates a fast NTSMC, which ensures the robustness of the robot against external disturbances and the effects of uncertain model parameters. The RBFNN is used to estimate drive faults and compensate for the controller output. Additionally, an anti-input saturation control algorithm is introduced to prevent controller input saturation. Furthermore, the traditional approach of manually tuning controller parameters based on the designer's experience and iterative debugging is replaced with an optimization method called HDSA. The HDSA algorithm optimizes the controller parameters to ensure the optimal control performance of the robot.

In further work, adaptive algorithms are necessary for the adjustment of the upper limit of the maximum control input to the robot on the ground and on the water surface.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

KW implementation and execution of the theory research and experiment and writing of the manuscript. YL theoretical support on the idea and helped write the manuscript. CH preliminary work and revising the manuscript. All authors actively contributed to the preparation of the content of this paper.

Funding

This work was supported in part by the Sharing Technology Project (41412040102), the China National Science Foundation (61473155), the Jiangsu Technology Department under Modern Agriculture (BE2017301), and the Six Talent Peaks Project in Jiangsu Province (GDZB-039).

Acknowledgments

We thank the editors and reviewers of the journal.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ali, N., Tawiah, I., and Zhang, W. (2020). Finite-time extended state observer based nonsingular fast terminal sliding mode control of autonomous underwater vehicles. Ocean Eng. 218, 108179. doi: 10.1016/j.oceaneng.2020.108179

CrossRef Full Text | Google Scholar

Chen, G., Tu, J., Ti, X., Wang, Z., and Hu, H. (2021). Hydrodynamic model of the beaver-like bendable webbed foot and paddling characteristics under different flow velocities. Ocean Eng. 234, 109179. doi: 10.1016/j.oceaneng.2021.109179

CrossRef Full Text | Google Scholar

Chen, L., Cui, R., Yang, C., and Yan, W. (2019). Adaptive neural network control of underactuated surface vessels with guaranteed transient performance: theory and experimental results. IEEE Transact. Ind. Electron. 67, 4024–4035. doi: 10.1109/TIE.2019.2914631

CrossRef Full Text | Google Scholar

Chu, R., Liu, Z., and Chu, Z. (2022). Improved super-twisting sliding mode control for ship heading with sideslip angle compensation. Ocean Eng. 260, 111996. doi: 10.1016/j.oceaneng.2022.111996

CrossRef Full Text | Google Scholar

Cohen, A., and Zarrouk, D. (2020). “The amphistar high speed amphibious sprawl tuned robot: design and experiments,” in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (Las Vegas, NV: IEEE), 6411–6418.

Google Scholar

Deng, Y., Zhang, X., Im, N., Zhang, G., and Zhang, Q. (2020). Adaptive fuzzy tracking control for underactuated surface vessels with unmodeled dynamics and input saturation. ISA Trans. 103, 52–62. doi: 10.1016/j.isatra.2020.04.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Dorigo, M., Maniezzo, V., and Colorni, A. (1996). Ant system: optimization by a colony of cooperating agents. IEEE Transact. Syst. Man Cybernet. Part B 26, 29–41. doi: 10.1109/3477.484436

PubMed Abstract | CrossRef Full Text | Google Scholar

Fister, I., Fister Jr, I., Yang, X.-S., and Brest, J. (2013). A comprehensive review of firefly algorithms. Swarm Evol. Comput. 13, 34–46. doi: 10.1016/j.swevo.2013.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, B., Liu, Y.-J., and Liu, L. (2022). Adaptive neural fault-tolerant control of a quadrotor uav via fast terminal sliding mode. Aerospace Sci. Technol. 129, 107818. doi: 10.1016/j.ast.2022.107818

CrossRef Full Text | Google Scholar

Gheisarnejad, M., and Khooban, M. H. (2020). An intelligent non-integer pid controller-based deep reinforcement learning: Implementation and experimental results. IEEE Transact. Ind. Electron. 68, 3609–3618. doi: 10.1109/TIE.2020.2979561

CrossRef Full Text | Google Scholar

Guo, J., Zhang, K., Guo, S., Li, C., and Yang, X. (2019). “Design of a new type of tri-habitat robot,” in 2019 IEEE International Conference on Mechatronics and Automation (ICMA) (Tianjin: IEEE), 1508–1513.

Google Scholar

Guo, X., Huang, S., Lu, K., Peng, Y., Wang, H., and Yang, J. (2022). A fast sliding mode speed controller for PMSM based on new compound reaching law with improved sliding mode observer. IEEE Trans. Transp. Elect. 9, 2955–2968.

Google Scholar

Heidari, A. A., Mirjalili, S., Faris, H., Aljarah, I., Mafarja, M., and Chen, H. (2019). Harris hawks optimization: Algorithm and applications. Fut. Gen. Comp. Syst. 97, 849–872. doi: 10.1016/j.future.2019.02.028

CrossRef Full Text | Google Scholar

Hou, Q., and Ding, S. (2021). Finite-time extended state observer-based super-twisting sliding mode controller for pmsm drives with inertia identification. IEEE Transact. Transport. Electrif. 8, 1918–1929. doi: 10.1109/TTE.2021.3123646

CrossRef Full Text | Google Scholar

Huang, J., Wang, W., Wen, C., and Li, G. (2019). Adaptive event-triggered control of nonlinear systems with controller and parameter estimator triggering. IEEE Trans. Automat. Contr. 65, 318–324. doi: 10.1109/TAC.2019.2912517

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, T., and Lin, D. (2020). Fast finite-time backstepping for helicopters under input constraints and perturbations. Int. J. Syst. Sci. 51, 2868–2882. doi: 10.1080/00207721.2020.1803438

CrossRef Full Text | Google Scholar

Liao, Y.-,l., Zhang, M.-,j., Wan, L., and Li, Y. (2016). Trajectory tracking control for underactuated unmanned surface vehicles with dynamic uncertainties. J. Cent. South Univ. 23, 370–378. doi: 10.1007/s11771-016-3082-4

CrossRef Full Text | Google Scholar

Liu, K., Gao, H., Ji, H., and Hao, Z. (2020). Adaptive sliding mode based disturbance attenuation tracking control for wheeled mobile robots. Int. J. Control Automat. Syst. 18, 1288–1298. doi: 10.1007/s12555-019-0262-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, X., Zhang, M., and Yao, F. (2018). Adaptive fault tolerant control and thruster fault reconstruction for autonomous underwater vehicle. Ocean Eng. 155, 10–23. doi: 10.1016/j.oceaneng.2018.02.007

CrossRef Full Text | Google Scholar

Mirjalili, S. (2016). Sca: a sine cosine algorithm for solving optimization problems. Knowl. Based Syst. 96, 120–133. doi: 10.1016/j.knosys.2015.12.022

CrossRef Full Text | Google Scholar

Mirjalili, S., Mirjalili, S. M., and Lewis, A. (2014). Grey wolf optimizer. Adv. Eng. Softw. 69, 46–61. doi: 10.1016/j.advengsoft.2013.12.007

CrossRef Full Text | Google Scholar

Najafi, A., Vu, M. T., Mobayen, S., Asad, J. H., and Fekih, A. (2022). Adaptive barrier fast terminal sliding mode actuator fault tolerant control approach for quadrotor uavs. Mathematics 10, 3009. doi: 10.3390/math10163009

CrossRef Full Text | Google Scholar

Nan, F., Sun, S., Foehn, P., and Scaramuzza, D. (2022). Nonlinear mpc for quadrotor fault-tolerant control. IEEE Robot. Automat. Lett. 7, 5047–5054. doi: 10.1109/LRA.2022.3154033

CrossRef Full Text | Google Scholar

Shen, Q., Yue, C., Goh, C. H., and Wang, D. (2018). Active fault-tolerant control system design for spacecraft attitude maneuvers with actuator saturation and faults. IEEE Transact. Ind. Electron. 66, 3763–3772. doi: 10.1109/TIE.2018.2854602

CrossRef Full Text | Google Scholar

Song, M.-P., and Gu, G.-C. (2004). “Research on particle swarm optimization: a review,” in Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No. 04EX826), (Shanghai: IEEE), 2236–2241.

Google Scholar

Wang, F., Ma, Z., Gao, H., Zhou, C., and Hua, C. (2022). Disturbance observer-based nonsingular fast terminal sliding mode fault tolerant control of a quadrotor UAV with external disturbances and actuator faults. Int. J. Cont. Autom. Syst. 20, 1122–1130. doi: 10.1007/s12555-020-0773-2

CrossRef Full Text | Google Scholar

Wang, H., Shi, J., Wang, J., Wang, H., Feng, Y., and You, Y. (2019a). Design and modeling of a novel transformable land/air robot. Int. J. Aero. Eng. doi: 10.1155/2019/2064131

CrossRef Full Text | Google Scholar

Wang, K., Liu, Y., Huang, C., and Bao, W. (2022a). Water surface flight control of a cross domain robot based on an adaptive and robust sliding mode barrier control algorithm. Aerospace 9, 332. doi: 10.3390/aerospace9070332

CrossRef Full Text | Google Scholar

Wang, K., Liu, Y., Huang, C., and Cheng, P. (2022b). Water surface and ground control of a small cross-domain robot based on fast line-of-sight algorithm and adaptive sliding mode integral barrier control. Appl. Sci. 12, 5935. doi: 10.3390/app12125935

CrossRef Full Text | Google Scholar

Wang, N., and Deng, Z. (2019). Finite-time fault estimator based fault-tolerance control for a surface vehicle with input saturations. IEEE Trans. Ind. Informat. 16, 1172–1181. doi: 10.1109/TII.2019.2930471

CrossRef Full Text | Google Scholar

Wang, N., Xie, G., Pan, X., and Su, S.-F. (2019b). Full-state regulation control of asymmetric underactuated surface vehicles. IEEE Trans. Ind. Informat. 66, 8741–8750. doi: 10.1109/TIE.2018.2890500

CrossRef Full Text | Google Scholar

Wu, G., Chen, G., Zhang, H., and Huang, C. (2021). Fully distributed event-triggered vehicular platooning with actuator uncertainties. IEEE Transact. Vehic. Technol. 70, 6601–6612. doi: 10.1109/TVT.2021.3086824

CrossRef Full Text | Google Scholar

Wu, L.-B., Park, J. H., Xie, X.-P., Gao, C., and Zhao, N.-N. (2020). Fuzzy adaptive event-triggered control for a class of uncertain nonaffine nonlinear systems with full state constraints. IEEE Transact. Fuzzy Syst. 29, 904–916. doi: 10.1109/TFUZZ.2020.2966185

CrossRef Full Text | Google Scholar

Xing, H., Shi, L., Hou, X., Liu, Y., Hu, Y., Xia, D., et al. (2021). Design, modeling and control of a miniature bio-inspired amphibious spherical robot. Mechatronics 77, 102574. doi: 10.1016/j.mechatronics.2021.102574

CrossRef Full Text | Google Scholar

Xue, J., and Shen, B. (2020). A novel swarm intelligence optimization approach: sparrow search algorithm. Syst. Sci. Control Eng. 8, 22–34. doi: 10.1080/21642583.2019.1708830

CrossRef Full Text | Google Scholar

Xue, J., and Shen, B. (2022). Dung beetle optimizer: a new meta-heuristic algorithm for global optimization. J. Supercomput. 1–32. doi: 10.1007/s11227-022-04959-6

CrossRef Full Text | Google Scholar

Yu, X.-N., Hao, L.-Y., and Wang, X.-L. (2022). Fault tolerant control for an unmanned surface vessel based on integral sliding mode state feedback control. Int. J. Control Automat. Syst. 20, 2514–2522. doi: 10.1007/s12555-021-0526-x

CrossRef Full Text | Google Scholar

Zhang, G., Chu, S., Zhang, W., and Liu, C. (2022). Adaptive neural fault-tolerant control for usv with the output-based triggering approach. IEEE Transact. Vehic. Technol. 71, 6948–6957. doi: 10.1109/TVT.2022.3167038

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, H., Xi, R., Wang, Y., Sun, S., and Sun, J. (2021). Event-triggered adaptive tracking control for random systems with coexisting parametric uncertainties and severe nonlinearities. IEEE Trans. Automat. Contr. 67, 2011–2018. doi: 10.1109/TAC.2021.3079279

CrossRef Full Text | Google Scholar

Zhao, Y., Qi, X., Ma, Y., Li, Z., Malekian, R., and Sotelo, M. A. (2020). Path following optimization for an underactuated usv using smoothly-convergent deep reinforcement learning. IEEE Transact. Intell. Transport. Syst. 22, 6208–6220. doi: 10.1109/TITS.2020.2989352

CrossRef Full Text | Google Scholar

Zhong, G., Cao, J., Chai, X., and Bai, Y. (2021). Design and performance analysis of a triphibious robot with tilting-rotor structure. IEEE Access 9, 10871–10879. doi: 10.1109/ACCESS.2021.3050182

CrossRef Full Text | Google Scholar

Keywords: cross-domain robot (CDR), radial basis function neural network (RBFNN), active fault-tolerant control (AFTC), anti-input saturation, human decision search algorithm (HDSA)

Citation: Wang K, Liu Y and Huang C (2023) Active fault-tolerant anti-input saturation control of a cross-domain robot based on a human decision search algorithm and RBFNN. Front. Neurorobot. 17:1219170. doi: 10.3389/fnbot.2023.1219170

Received: 08 May 2023; Accepted: 26 June 2023;
Published: 14 July 2023.

Edited by:

Long Jin, Lanzhou University, China

Reviewed by:

Ruoxi Qin, Henan Key Laboratory of Imaging and Intelligent Processing, China
Hao Xu, Anhui University of Technology, China

Copyright © 2023 Wang, Liu and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yong Liu, bGl1eTE2MDJAbmp1c3QuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.