An Improved Fuzzy Brain Emotional Learning Model Network Controller for Humanoid Robots

Fang, Wubing; Chao, Fei; Lin, Chih-Min; Yang, Longzhi; Shang, Changjing; Zhou, Changle

doi:10.3389/fnbot.2019.00002

ORIGINAL RESEARCH article

Front. Neurorobot., 04 February 2019

Volume 13 - 2019 | https://doi.org/10.3389/fnbot.2019.00002

This article is part of the Research TopicMulti-Modal Information Fusion for Brain-Inspired RobotsView all 11 articles

An Improved Fuzzy Brain Emotional Learning Model Network Controller for Humanoid Robots

Wubing Fang¹

Fei Chao^1,2^*

Chih-Min Lin³

Longzhi Yang⁴

Changjing Shang²

Changle Zhou¹

¹Cognitive Science Department, School of Information Science and Engineering, Xiamen University, Xiamen, China
²Institute of Mathematics, Physics and Computer Science, Aberystwyth University, Aberystwyth, United Kingdom
³Department of Electrical Engineering, Yuan Ze University, Tao-Yuan, Taiwan
⁴Department of Computer and Information Sciences, Northumbria University, Newcastle upon Tyne, United Kingdom

The brain emotional learning (BEL) system was inspired by the biological amygdala-orbitofrontal model to mimic the high speed of the emotional learning mechanism in the mammalian brain, which has been successfully applied in many real-world applications. Despite of its success, such system often suffers from slow convergence for online humanoid robotic control. This paper presents an improved fuzzy BEL model (iFBEL) neural network by integrating a fuzzy neural network (FNN) to a conventional BEL, in an effort to better support humanoid robots. In particular, the system inputs are passed into a sensory and emotional channels that jointly produce the final outputs of the network. The non-linear approximation ability of the iFBEL is achieved by taking the BEL network as the emotional channel. The proposed iFBEL works with a robust controller in generating the hand and gait motion of a humanoid robot. The updating rules of the iFBEL-based controller are composed of two parts, including a sensory channel followed by the updating rules of the conventional BEL model, and the updating rules of the FNN and the robust controller which are derived from the “Lyapunov” function. The experiments on a three-joint robot manipulator and a six-joint biped robot demonstrated the superiority of the proposed system in reference to a conventional proportional-integral-derivative controller and a fuzzy cerebellar model articulation controller, based on the more accurate and faster control performance of the proposed iFBEL.

1. Introduction

The control of uncertain nonlinear systems with multiple inputs and outputs often presents a great challenge, and the robotic motion control is such a typical case. Robots, especially humanoid robots, are widely used in domestic, medical and other industrial areas (Liu et al., 2015; Li et al., 2017; Wu et al., 2018; Zhou et al., 2018). A humanoid robot must accurately control its two manipulators and two legs, in order to generate hand reaching/grasping motions and biped-leg walking gaits. Such crucial motion abilities allow humanoid robots to work in complicated, dangerous, and even poisonous environments with reduced labor costs, health implication, and other associated complications.

The Sliding Mode Control (SMC) proves to be an effective control method for uncertain nonlinear systems, especially for humanoid motion control. Once the state of the system reaches a sliding surface, the state will remain on that surface regardless of system uncertainties and external disturbances (Lin and Hsu, 2015). Yet, control input chattering, usually led by a combination of uncertainties from multiple pathways, is often not expected in humanoid robot systems when SMC is applied. It has been found in several studies that the collaboration of an artificial neural network with a SMC controller can enhance non-linear approximation ability in reducing the chattering effect (Boldbaatar and Lin, 2015).

A neural network with good non-linear learning abilities is therefore of great appeal to the SMC model. Note that an association between a stimulus and its emotional consequence in the amygdala of the mammalian brain was discovered by LeDoux (1992). The inspiration from the emotional consequence then led to the development of the brain emotional learning network (BEL) controller, which has a good nonlinear approximation capability. Such a neural network is comprised of a sensory neural network in simulating the orbitofrontal cortex of the brain, and an emotional neural network representing the amygdala cortex (LeDoux, 1992; Lotfi and Akbarzadeht, 2013). The sensory neural network is responsible for the major output of the controller, while the emotional neural network has an indirect impact on the sensory neural network. Despite of the effectiveness in uncertain non-linear control, most BEL networks face the dilemma of slow learning convergence leading to difficulty in on-line control of the multiple joints of a humanoid robot.

Fuzzy neural networks (FNN) are another popular choice for uncertain nonlinear control systems with reasonable non-linear approximation ability, due to their rapid learning convergence and simple structure which is particularly favorable for on-line humanoid robotic control (Rubio, 2012, 2018; Aguilar-Iban et al., 2018; Rubio et al., 2018). A typical FNN integrates a fuzzy inference system and a neural network (Pan et al., 2016; Meda-Campana et al., 2018). The weights of the network are usually updated by taking only the output errors of the FNN as the learning assessment. To achieve better performance for uncertain nonlinear systems, the FNN must also consider the overall performance of uncertain nonlinear systems when adjusting the control parameters, as reported in Zhao and Lin (2017). Therefore, the combination of the rapid convergence of FNN and the nonlinear mapping capability of BEL seems to be a good idea for controlling humanoid robots.

We believe that the chattering effect of the SMC model is a very challenging issue. Although, many existing algorithms had been developed to deal with the chatting; the artificial neural network still plays an important role in the control of uncertain nonlinear system with multiple inputs and outputs. In addition, FNN is good at rapid convergence and BEL can ideally increase the network's nonlinear mapping capability. Therefore, we focused on a combined neural network to deal with the chattering problem. Based on these considerations, this paper proposes an improved brain emotional learning model network (iFBEL) for a humanoid robot controller, in an effort to achieve better human-like control performance with the support of more nonlinear approximation capabilities. The proposed iFBEL is comprised of two components, with one built from a conventional BEL and the other created by an FNN; and the resulted iFBEL thus enjoys the advantages of both sub-systems. The iFBEL works with a robust controller to replace the ideal sliding mode controller for better system performance. To ensure the convergence and robustness, the adaptive laws of the FNN and the robust controller are derived from the Lyapunov function. The iFBEL was validated and evaluated on a robot with a three-joint manipulator and a biped-leg system, although applications in other control fields can be readily identified. The experimental results demonstrate competitive performance of the proposed systems in dynamic humanoid robotic control.

The reminder of this paper is organized as follows: section 2 introduces a group of uncertain nonlinear systems controlled by a sliding mode controller. Section 3 reports the proposed improved fuzzy brain emotional learning model neural network. Section 4 describes the implementations of the network controller and the updating rules. Section 5 shows the experimental results and compares the performances with the conventional proportional-integral-derivative (PID) controller and the fuzzy cerebellar model articulation controller (FCMAC). Section 7 concludes the paper and points out future work.

2. Humanoid Robot Control by Sliding Mode Controller

In order to understand the proposed network-based control system and realize the importance of the proposed neural network, this section introduces a typical uncertain nonlinear system controlled by a sliding mode controller as the work's background.

A humanoid robot needs to control multi-joints. Without loss of generality, consider a class of nth-order uncertain nonlinear systems with mth-order input and output states expressed in the following form:

\begin{matrix} x^{(n)} (t) = f (\underline{x} (t)) + G (\underline{x} (t)) u (t) + d (t), & (1) \end{matrix}

where x(t) = [x⁽ⁿ⁻¹⁾(t) … ẋ(t) x(t)] ∈ ℜ^m×n is the system state vector, $u (t) = {[u_{1} (t), u_{2} (t), \dots, u_{m} (t)]}^{T} \in ℜ^{m}$ is the control input vector, f(x(t)) ∈ ℜ^m is an unknown, but bounded, smooth nonlinear function, G(x(t)) ∈ ℜ^m×m is an unknown, but bounded, gain matrix, and $d (t) = {[d_{1} (t), d_{2} (t), \dots, d_{m} (t)]}^{T} \in ℜ^{m}$ is an external bounded disturbance.

The nominal model of such a nonlinear system can be defined as

\begin{array}{rcl} x^{(n)} (t) = f_{n} (\underline{x} (t)) + G_{n} u (t), & (2) \end{array}

where f_n(x(t)) is the nominal function of f(x(t)), and $G_{n} = d i a g [g_{n_{1}} \dots g_{n_{m}}] \in ℜ^{m \times m}$ is the nominal function of G(x(t)), with g_{n_i} being nominal gain constants, for i = 1, 2, …, m. Assume that g_{n_i} > 0 for the existence of $G_{n}^{- 1}$ , Equation 1 can be represented as:

\begin{array}{l} \begin{array}{l} x^{(n)} (t) = f_{n} (\underline{x} (t)) + △ f (\underline{x} (t)) + G_{n} u (t) + △ G (\underline{x} (t)) u (t) + d (t) \\ = f_{n} (\underline{x} (t)) + G_{n} u (t) + l (\underline{x} (t), t), \end{array} & (3) \end{array}

where l(x(t), t) = △f(x(t)) + △G(x(t))u(t) + d(t) is the lumped uncertainties and external disturbances. Let ${\underline{x}}_{d} (t) = {[x_{d}^{(n - 1) T} (t), \dots, ẋ_{d}^{T} (t), x_{d}^{T} (t)]}^{T} \in ℜ^{m \times n}$ be a desired trajectory in which the state of the system is tracked. The tracking error vector is defined as:

\begin{array}{l} \underline{e} (t) = [\begin{matrix} e^{(n - 1)} (t) & e^{(n - 2)} (t) & \dots & ė (t) & e (t) \end{matrix}] \in ℜ^{m n}, \end{array}

where e(t) = x_d(t) − x(t) is the tracking error.

An ideal sliding surface can be defined as

\begin{array}{l} s (\underline{e} (t)) = (\begin{matrix} s_{1} \\ s_{2} \\ ⋮ \\ s_{m} \end{matrix}) \\ = [\begin{matrix} e_{1}^{(n - 1)} (t) + & λ_{11} e_{1}^{(n - 2)} (t) + \dots + & λ_{n 1} \int_{0}^{T} e_{1} (t) d t \\ e_{2}^{(n - 1)} (t) + & λ_{12} e_{2}^{(n - 2)} (t) + \dots + & λ_{n 2} \int_{0}^{T} e_{2} (t) d t \\ ⋮ & ⋮ & ⋮ \\ e_{m}^{(n - 1)} (t) + & λ_{1 m} e_{m}^{(n - 2)} (t) + \dots + & λ_{n m} \int_{0}^{T} e_{m} (t) d t \end{matrix}] \\ = [\begin{matrix} 1 & λ_{11} & λ_{n 1} \\ ⋱ & ⋱ & ⋱ \\ 1 & λ_{1 m} & λ_{n m} \end{matrix}] [\begin{matrix} \underline{e} (t) \\ \int_{0}^{T} e (t) d t \end{matrix}] \end{array}

\begin{array}{rcl} = \bar{K} [\begin{matrix} \underline{e} (t) \\ \int_{0}^{T} e (t) d t \end{matrix}], & (4) \end{array}

where $\bar{K} = [I, K] = [\begin{matrix} I & λ_{1} I & \dots & λ_{n} I \end{matrix}] \in ℜ^{m \times (m + 1) n}$ . All $λ_{j} = {[λ_{1 j} \dots λ_{n j}]}^{T} \in ℜ^{n}$ are roots of the equation: $q^{n} + λ_{1} q^{n - 1} + \dots + λ_{n - 1} q + λ_{n} = 0$ in which q is the Laplace operator and is in the open left half-plane. The time derivative of Equation 4 leads to the following:

\begin{array}{l} \begin{array}{l} ṡ (\underline{e} (t)) = \bar{K} [\begin{matrix} \underline{ė} (t) \\ e (t) \end{matrix}] = \bar{K} [\begin{matrix} e^{(n)} (t) \\ \underline{e} (t) \end{matrix}] \\ = e^{(n)} (t) + K \underline{e} (t) = x_{d}^{(n)} (t) - x^{(n)} (t) + K \underline{e} (t) \\ = x_{d}^{(n)} (t) - f_{n} (\underline{x} (t)) - G_{n} u (t) - l (\underline{x} (t), t) + K \underline{e} (t), \end{array} & (5) \end{array}

where ė(t) = [e⁽ⁿ⁾(t)e⁽ⁿ⁻¹⁾(t) … ė(t)].

For the existence and reachability of this sliding surface, the control law of system is satisfied by the following inequation:

\begin{array}{rcl} \frac{1}{2} \frac{d}{d t} (s_{i}^{2}) \leq - \sum_{i = 1}^{m} σ_{i} | s_{i} | & (6) \end{array}

for σ_i > 0, i = 1, 2, …, m.

Taking 5 into 6, yields

\begin{array}{l} s^{T} (\underline{e} (t)) \dot{s} (\underline{e} (t)) = s^{T} (\underline{e} (t)) [x_{d}^{(n)} (t) - f_{n} (\underline{x} (t)) - G_{n} u (t) - l (\underline{x} (t), t) \\ + K \underline{e} (t)] \leq - \sum_{i = 1}^{m} σ_{i} | s_{i} | & (7) \end{array}

If the dynamic and the lumped uncertainty of the system are known exactly, the ideal sliding mode controller is designed as:

\begin{array}{rcl} u_{I S M C} = G_{n}^{- 1} [x_{d}^{(n)} (t) - f_{n} (\underline{x} (t)) - l (\underline{x} (t), t) + K \underline{e} (t) + σ s g n (s (\underline{e} (t)))], & (8) \end{array}

where sgn is a sign function and G_n is a positive define matrix. However, it is difficult to obtain the dynamical functions of most nonlinear systems, and the lumped uncertainty is always unmeasurable. Therefore, the ideal sliding mode controller is unobtainable.

3. The Proposed iFBEL Network

The configuration of the proposed iFBEL is depicted in Figure 1, consisting of an BEL and the FNN in addition to the input and output spaces. The outputs of this network are u_i = b_i − g_i for i = 1, 2, …, m, in which, b_i are the outputs of the the BEL and g_i are the outputs of the FNN. The BEL network is comprised of the input space I, the association memory space M₁, the weight memory space V, and the sub-output space B. The FNN shares the same input space with the BEL, and it also includes the association memory space M₂, the receptive-field space R, the weight memory space W, and the sub-output space G. In particular, the FNN channel of iFBEL also contains a set of fuzzy reference rules (Lee, 1990) as represented as follows:

\begin{array}{l} \begin{array}{l} R^{λ} : If p_{1} is ϕ_{1 j k} and p_{2} is ϕ_{2 j k}, \dots, p_{m} is ϕ_{m j k} then \\ g_{j k} = ω_{j k} for j = 1, 2, \dots, n_{f} . k = 1, 2, \dots, n_{k} . λ = 1, 2, \dots, n_{l}, \end{array} & (9) \end{array}

where n_f is the number of layers for each m input dimensions with each layer including n_k blocks and n_l = n_fn_k referring to the number of fuzzy rules, and ϕ_ijk represents the fuzzy set for ith input, jth layer and kth block; each fuzzy set's member function is implemented by the Gaussian function; ω_jk is the output weight in the consequent part; and g_jk is the rule's output. Note that: each fuzzy set's member function can be defined as rectangular, triangular or any continuously bounded function e.g., Gaussian or B-spline; in order to easily implement the iFBEL with the better non-linear approximation ability, the Gaussian function is adopted.

FIGURE 1

Figure 1. The configuration of iFBEL.

The aforementioned “spaces” are detailed as follows:

1. Input Space I: $p = {[p_{1}, p_{2}, \dots, p_{m}]}^{T} \in ℜ^{m}$ is an input vector which is quantized into discrete regions (elements), where m is the number of input state variables. The number of elements, n_e, is termed as a resolution. p is delivered to the BEL and the FNN simultaneously as their inputs.

2. Association Memory Spaces M₁ and M₂: Several elements are combined as a block; the number of blocks, n_b and n_f for the BEL and the FNN respectively, must be equal or greater than two. The association memory space of the BEL has n_a(= m × n_b) components, while that of the FNN has n_c(= m × n_f) components. Every component is represented as a Gaussian basis function; let φ denote a component for the BEL and f for the FNN:

\begin{array}{rcl} φ_{i j} = e x p [- \frac{{(p_{i} - y_{i j})}^{2}}{z_{i j}^{2}}] & (10) \end{array}

where i = 1, 2, …, m, j = 1, 2, …, n_b, and y_ij and z_ij are the means and variances, respectively; and

\begin{array}{rcl} f_{i j k} = e x p [- \frac{{(p_{i} - c_{i j k})}^{2}}{v_{i j k}^{2}}] & (11) \end{array}

where i = 1, 2, …, m, j = 1, 2, …, n_f, k = 1, 2, …, n_k, and c_ijk and v_ijk are the means and variances, respectively.

The block matrix of the BEL is defined as:

\begin{array}{rcl} Γ = {[\begin{matrix} φ_{11} & \dots & φ_{1 n_{b}} & φ_{21} & \dots & φ_{2 n_{b}} & \dots & φ_{m 1} & \dots & φ_{m n_{b}} \end{matrix}]}^{T} \in ℜ^{m n_{b}} . & (12) \end{array}

3. Receptive-field Space R for FNN: Every cell in this space is the product of the corresponding components of the association memory space M₂, which is defined as:

\begin{array}{rcl} ϕ_{j k} = \prod_{i = 1}^{m} f_{i j k} = \prod_{i = 1}^{m} e x p [- \frac{{(p_{i} - c_{i j k})}^{2}}{v_{i j k}^{2}}] = e x p [- \sum_{i = 1}^{m} \frac{{(p_{i} - c_{i j k})}^{2}}{v_{i j k}^{2}}], & (13) \end{array}

where j = 1, 2, …, n_f, and k = 1, 2, …, n_k. An example of the FNN with two input variables is shown in Figure 2, which has 4 layers (n_f = 4) for every input variable and 2 blocks (n_k = 2) for each layer. And n_l = n_fn_k is the number of receptive fields, such as Aa, Bb, …; ϕ_jk is associated with the jth layer and the kth block in the fuzzy rule as expressed in Equation 9. The block matrix of the FNN is defined as:

\begin{array}{rcl} Φ = {[\begin{matrix} ϕ_{11} & \dots & ϕ_{1 n_{k}} & ϕ_{21} & \dots & ϕ_{2 n_{k}} & \dots & ϕ_{n_{f} 1} & \dots & ϕ_{n_{f} n_{k}} \end{matrix}]}^{T} \in ℜ^{n_{f} n_{k}} . & (14) \end{array}

4. Weight Memory Spaces V and W : ν_ijk is the weight of the ith output, jth input, and kth block of the BEL; and ω_ijk is the weight of the ith output, jth layer, kth block of the FNN:

\begin{array}{l} V = [\begin{matrix} ν_{1 j k} & ν_{2 j k} & \dots & ν_{m j k} \end{matrix}] \\ = [\begin{matrix} ν_{111} & ν_{211} & \dots & ν_{m 11} \\ ⋮ & ⋮ & ⋮ \\ ν_{11 n_{b}} & ν_{21 n_{b}} & \dots & ν_{m 1 n_{b}} \\ ν_{121} & ν_{221} & \dots & ν_{m 21} \\ ⋮ & ⋮ & ⋮ \\ ν_{12 n_{b}} & ν_{22 n_{b}} & \dots & ν_{m 2 n_{b}} \\ ⋮ & ⋮ & ⋮ \\ ν_{1 m 1} & ν_{2 m 1} & \dots & ν_{m m 1} \\ ⋮ & ⋮ & ⋮ \\ ν_{1 m n_{b}} & ν_{2 m n_{b}} & \dots & ν_{m m n_{b}} \end{matrix}] \in ℜ^{m n_{b} \times m} \\ W = [\begin{matrix} ω_{1 j k} & ω_{2 j k} & \dots & ω_{m j k} \end{matrix}] \end{array}

\begin{array}{l} = [\begin{matrix} ω_{111} & ω_{211} & \dots & ω_{m 11} \\ ⋮ & ⋮ & ⋮ \\ ω_{11 n_{f}} & ω_{21 n_{f}} & \dots & ω_{m 1 n_{f}} \\ ω_{121} & ω_{221} & \dots & ω_{m 21} \\ ⋮ & ⋮ & ⋮ \\ ω_{12 n_{f}} & ω_{22 n_{f}} & \dots & ω_{m 2 n_{f}} \\ ⋮ & ⋮ & ⋮ \\ ω_{1 n_{k} 1} & ω_{2 n_{k} 1} & \dots & ω_{m n_{k} 1} \\ ⋮ & ⋮ & ⋮ \\ ω_{1 n_{k} n_{f}} & ω_{2 n_{k} n_{f}} & \dots & ω_{m n_{k} n_{f}} \end{matrix}] \in ℜ^{n_{f} n_{k} \times m} . & (15) \end{array}

5. Sub-output Space B and G: The ith output (b_i) and the output vector (b) of the BEL, and the ith output (g_i) and the output vector (g) of the FNN are represented as follows:

\begin{array}{l} b_{i} = \sum_{j = 1}^{m} \sum_{k = 1}^{n_{b}} ν_{i j k} φ_{j k}, & (16) \end{array}

\begin{array}{l} b = {[\begin{matrix} b_{1} & b_{2} & \dots & b_{m} \end{matrix}]}^{T} = V^{T} \cdot Γ, & (17) \end{array}

\begin{array}{l} g_{i} = \sum_{j = 1}^{n_{f}} \sum_{k = 1}^{n_{k}} ω_{i j k} ϕ_{j k}, & (18) \end{array}

\begin{array}{l} g = {[\begin{matrix} g_{1} & g_{2} & \dots & g_{m} \end{matrix}]}^{T} = W^{T} \cdot Φ . & (19) \end{array}

6. Output Space U: The output of the proposed iFBEL is the combination of the outputs of the BEL and the FNN, in which the BEL works as a primary controller and the FNN as an emotion controller:

\begin{array}{l} u_{i} = b_{i} - g_{i} = \sum_{j = 1}^{m} \sum_{k = 1}^{n_{b}} ν_{i j k} φ_{j k} - \sum_{j = 1}^{n_{f}} \sum_{k = 1}^{n_{k}} ω_{i j k} ϕ_{j k}, & (20) \end{array}

\begin{array}{l} u = b - g = V^{T} \cdot Γ - W^{T} \cdot Φ . & (21) \end{array}

FIGURE 2

Figure 2. Organization of an example 2-D FNN.

4. iFBEL-based Controller

The proposed intelligent controller, consisting of a sliding surface, an iFBEL network, and a robust controller, is shown in Figure 3. The iFBEL network and robust controller collaborate to imitate an ideal sliding mode controller. The updating rules of the BEL mechanism of the iFBEL network are followed by the brain emotional learning algorithm (Chung and Lin, 2015; Lin and Chung, 2015); and the adaptive laws of the FNN mechanism and robust controller are derived from the Lyapunov function. Besides, to ensure robust tracking performance.

FIGURE 3

Figure 3. Design of control system.

The updating rules are detailed as follows. Subtracting 8 into 5, yields:

\begin{array}{l} ṡ (\underline{e} (t)) = G_{n} [u_{I S M C} - u] - σ s g n [s (\underline{e} (t))] . & (22) \end{array}

Assume that an optimal iFBEL $u_{B F C}^{*}$ exists in the ideal sliding model controller, u_ISMC, and that ϵ is a minimum error vector; thus, the weight matrixes of $u_{B F C}^{*}$ are represented as V* and W* for the BEL and the FNN, respectively. Then, the output of the optimal sliding model controller is:

\begin{array}{l} \begin{array}{l} u_{I S M C} = u_{B F C}^{*} + ϵ = {(u_{B E L} - u_{F N N})}^{*} + ϵ \\ = {(V^{T} Γ - W^{T} Φ)}^{*} + ϵ = V^{* T} \hat{Γ} - W^{* T} Φ^{*} + ϵ, \end{array} & (23) \end{array}

where u_BEL and u_FNN are the outputs of the BEL and the FNN respectively, and Φ* and $\hat{Γ}$ are the optimal matrix and estimated matrix of Φ and Γ respectively. The output of the proposed iFBEL controller is defined by:

\begin{array}{l} u = u_{B F C} + u_{R C} = {\hat{V}}^{T} \hat{Γ} - Ŵ^{T} \hat{Φ} + u_{R C}, & (24) \end{array}

where u_RC is the output of the robust controller, and $\hat{V}, Ŵ, \hat{Φ}$ are the estimated matrices of V*, W*, Φ* respectively.

Taking 23 and 24 into 22, the following can be obtained:

\begin{array}{l} ṡ (\underline{e} (t)) & = & G_{n} [V^{* T} \hat{Γ} - W^{* T} Φ^{*} + ϵ - {\hat{V}}^{T} \hat{Γ} + Ŵ^{T} \hat{Φ} - u_{R C}] \end{array}

\begin{array}{l} - σ s g n [s (\underline{e} (t))] & (25) \end{array}

\begin{array}{l} = & G_{n} [Ṽ^{T} \hat{Γ} - {\tilde{W}}^{T} Φ^{*} - Ŵ^{T} \tilde{Φ} + ϵ - u_{R C}] - σ s g n [s (\underline{e} (t))], \end{array}

where $\tilde{Φ} = Φ^{*} - \hat{Φ}$ , and $Ṽ = V^{*} - \hat{V}$ . A partially linear form of the receptive-field basis function vector $\tilde{Φ}$ in the Taylor series is:

\begin{array}{l} \tilde{Φ} & = & (\begin{matrix} \tilde{ϕ_{1}} \\ ⋮ \\ \tilde{ϕ_{n_{d}}} \end{matrix}) = (\begin{matrix} {(\frac{\partial ϕ_{1}}{\partial c})}^{T} \\ ⋮ \\ {(\frac{\partial ϕ_{n_{d}}}{\partial c})}^{T} \end{matrix}) |_{c = ĉ} (c^{*} - ĉ) + (\begin{matrix} {(\frac{\partial ϕ_{1}}{\partial v})}^{T} \\ ⋮ \\ {(\frac{\partial ϕ_{n_{d}}}{\partial v})}^{T} \end{matrix}) |_{v = \hat{v}} (v^{*} - \hat{v}) + β \end{array}

\begin{array}{l} = & Φ_{c} \tilde{c} + Φ_{v} ṽ + β, & (26) \end{array}

where Φ_c and Φ_v are defined by:

\begin{array}{l} Φ_{c} = {[\frac{\partial ϕ_{1}}{\partial c}, \dots, \frac{\partial ϕ_{n_{d}}}{\partial c}]}^{T} |_{c = ĉ} \in ℜ^{n_{d} \times n_{f} n_{d}} \\ Φ_{v} = {[\frac{\partial ϕ_{1}}{\partial v}, \dots, \frac{\partial ϕ_{n_{d}}}{\partial v}]}^{T} |_{v = \hat{v}} \in ℜ^{n_{d} \times n_{f} n_{d}}, \end{array}

where $\tilde{c} = c^{*} - ĉ, ṽ = v^{*} - \hat{v}$ , β is a higher-order vector.

Rewriting 26 with $\tilde{Φ} = Φ^{*} - \hat{Φ}$ , yields:

\begin{array}{l} Φ^{*} = \hat{Φ} + \tilde{Φ} = \hat{Φ} + Φ_{c} \tilde{c} + Φ_{v} ṽ + β . & (27) \end{array}

Substituting 27 to 25, yields:

\begin{array}{l} \dot{s} (\underline{e} (t)) = G_{n} [{\tilde{V}}^{T} \hat{Γ} - {\tilde{W}}^{T} (\hat{Φ} + Φ_{c} \tilde{c} + Φ_{v} \tilde{v} + β) - {\hat{W}}^{T} \\ (Φ_{c} \tilde{c} + Φ_{v} \tilde{v} + β) + ϵ - u_{R C}] - σ s g n [s (\underline{e} (t))] \\ = G_{n} [{\tilde{V}}^{T} \hat{Γ} - {\tilde{W}}^{T} \hat{Φ} - {\hat{W}}^{T} (Φ_{c} \tilde{c} + Φ_{v} \tilde{v}) - u_{R C} + ω] \\ - σ s g n [s (\underline{e} (t))], & (28) \end{array}

where $ω = W^{* T} β + \tilde{W} (Φ_{c} \tilde{c} + Φ_{v} ṽ) + ϵ$ is a combined error of the FNN, and $Ṽ = V^{*} - \hat{V} = {[\tilde{ν_{1}}, \tilde{ν_{2}}, \dots, \tilde{ν_{m}}]}^{T} \in ℜ^{m \times m n_{b}}$ is an approximation error weight matrix of the BEL. Consider a H_∞ tracking performance (Chen et al., 2015) for the existence of ω and Ṽ as:

\begin{array}{l} \sum_{i = 1}^{m} \int_{0}^{T} s_{i}^{2} (t) d t \leq s^{T} (0) G_{n}^{- 1} s (0) + t r [{\tilde{W}}^{T} (0) η_{W}^{- 1} \tilde{W} (0)] \\ + {\tilde{c}}^{T} (0) η_{c}^{- 1} \tilde{c} (0) + ṽ^{T} (0) η_{v}^{- 1} ṽ (0) + \sum_{i = 1}^{m} λ_{i}^{2} \end{array}

\begin{array}{l} \int_{0}^{T} ω_{i}^{2} (t) d t + \sum_{i = 1}^{m} \int_{0}^{T} {\tilde{ν}}_{i}^{2} (t) d t, & (29) \end{array}

where η_W, η_c, η_v are diagonal positive constant learning-rate matrices, and λ_i is an attenuation constant. Set the initial conditions of the system as $s (0) = 0, \tilde{W} (0) = 0, \tilde{c} (0) = 0, ṽ (0) = 0$ ; then Equation 29 can be re-expressed as:

\begin{array}{l} \sum_{i = 1}^{m} \int_{0}^{T} s_{i}^{2} (t) d t \leq \sum_{i = 1}^{m} λ_{i}^{2} \int_{0}^{T} ω_{i}^{2} (t) d t + \sum_{i = 1}^{m} \int_{0}^{T} {\tilde{ν}}_{i}^{2} (t) d t . & (30) \end{array}

To approximate an ideal sliding mode controller, assume that the approximation error between the proposed iFBEL and an ideal controller are bounded; in other words, ω ∈ L₂[0, T₁] and Ṽ ∈ L₂[0, T₂] with ∀T₁, T₂ ∈ [0, ∞]. Therefore $\int_{0}^{T} ω_{i}^{2} (t) d t \leq N_{1}$ and $\int_{0}^{T} {\tilde{ν}}_{i}^{2} (t) d t \leq N_{2}$ , where N₁ and N₂ are two big positive constants. If λ = ∞, the minimum error cannot achieve approximation attenuation. If λ < ∞, the system is stable as shown by:

\begin{array}{l} \sum_{i = 1}^{m} \int_{0}^{T} s_{i}^{2} (t) d t \leq | | λ_{i} | |^{2} N_{1} + N_{2} < \infty . & (31) \end{array}

THEOREM 1. For the nonlinear system with Multiple Inputs and Multiple Outputs as represented by Equation 1, the proposed iFBEL can be described by Equation 24, in which the updating rule of the BEL is designed as expressed in Equation 32, and the adaptive laws of the FNN and robust controller are designed as stated in Equations (34-36).

\begin{array}{l} △ V = & α [Γ \times m a x (0, d - b)], & (32) \end{array}

\begin{array}{l} d = & γ \times p + τ \times u_{B F C}, & (33) \end{array}

where α is a learning-rate constant, and d consists of the input vector p and the output vector u_BFC with the learning constants γ and τ.

\begin{array}{l} \overset{\cdot}{Ŵ} = - η_{W} \hat{Φ} s^{T} (\underline{e} (t)), & (34) \end{array}

\begin{array}{l} \overset{\cdot}{ĉ} = - η_{c} Φ_{c}^{T} Ŵ s^{T} (\underline{e} (t)), & (35) \end{array}

\begin{array}{l} \overset{\cdot}{\hat{v}} = - η_{v} Φ_{v}^{T} Ŵ s^{T} (\underline{e} (t)), & (36) \end{array}

\begin{array}{l} u_{R C} = {(2 R^{2})}^{- 1} [(I + Γ^{2}) R^{2} + I] s^{T} (\underline{e} (t)), & (37) \end{array}

where $R = d i a g [\begin{matrix} λ_{1} & λ_{2} & \dots & λ_{m} \end{matrix}] \in ℜ^{m \times m}$ is a diagonal matrix of a robust controller to converge the proposed system with the update rules $\overset{\cdot}{Ŵ}, \overset{\cdot}{ĉ}$ and $\overset{\cdot}{\hat{v}}$ , and λ_i > 0, where i = 1, 2, …, m; thus, R is a positive definite matrix.

PROOF. The Lyapunov function is given by:

\begin{array}{l} V (s (\underline{e} (t)), \tilde{W}, \tilde{V}, \tilde{c}, \tilde{v}) = \frac{1}{2} [s^{T} (\underline{e} (t)) G_{n}^{- 1} s (\underline{e} (t)) + t r [{\tilde{W}}^{T} η_{W}^{- 1} \tilde{W}] \\ + {\tilde{c}}^{T} η_{c}^{- 1} \tilde{c} + {\tilde{v}}^{T} η_{v}^{- 1} \tilde{v} + t r [{\tilde{V}}^{T} α^{- 1} \tilde{V}]] . & (38) \end{array}

Taking the derivative of the Lyapunov function and using 28, yields

\begin{array}{l} \overset{\cdot}{V} (s (\underline{e} (t)), \tilde{W}, Ṽ, \tilde{c}, ṽ) \\ = s^{T} (\underline{e} (t)) G_{n}^{- 1} ṡ (\underline{e} (t)) + t r [{\tilde{W}}^{T} η_{W}^{- 1} \overset{\cdot}{\tilde{W}}] + {\tilde{c}}^{T} η_{c}^{- 1} \overset{\cdot}{\tilde{c}} + ṽ^{T} η_{v}^{- 1} \overset{\cdot}{ṽ} \\ + t r [Ṽ^{T} α^{- 1} \overset{\cdot}{Ṽ}] \\ = s^{T} (\underline{e} (t)) G_{n}^{- 1} ṡ (\underline{e} (t)) - t r [{\tilde{W}}^{T} η_{W}^{- 1} \overset{\cdot}{Ŵ}] - {\tilde{c}}^{T} η_{c}^{- 1} \overset{\cdot}{ĉ} - ṽ^{T} η_{v}^{- 1} \overset{\cdot}{\hat{v}} \\ - t r [Ṽ^{T} α^{- 1} \overset{\cdot}{\hat{V}}] \\ = s^{T} (\underline{e} (t)) Ṽ \hat{Γ} - s^{T} (\underline{e} (t)) \tilde{W} \hat{Φ} - s^{T} (\underline{e} (t)) Ŵ (Φ_{c} \tilde{c} + Φ_{v} ṽ) \\ + s^{T} (\underline{e} (t)) (ω - u_{R C}) - s^{T} (\underline{e} (t)) G_{n}^{- 1} σ s g n [s (\underline{e} (t))] \\ - t r [{\tilde{W}}^{T} η_{W}^{- 1} \overset{\cdot}{Ŵ}] - {\tilde{c}}^{T} η_{c}^{- 1} \overset{\cdot}{ĉ} - ṽ^{T} η_{v}^{- 1} \overset{\cdot}{\hat{v}} - t r [Ṽ^{T} α^{- 1} \overset{\cdot}{\hat{V}}] \\ \leq - t r [\tilde{W} (s (\underline{e} (t)) \hat{Φ} + η_{W}^{- 1} \overset{\cdot}{Ŵ})] - \tilde{c} [s^{T} (\underline{e} (t)) Ŵ Φ_{c} + η_{c}^{- 1} \overset{\cdot}{ĉ}] \\ - ṽ [s^{T} (\underline{e} (t)) Ŵ Φ_{v} + η_{v}^{- 1} \overset{\cdot}{\hat{v}}] + s^{T} (\underline{e} (t)) Ṽ \hat{Γ} \end{array}

\begin{array}{l} + s^{T} (\underline{e} (t)) (ω - u_{R C}) . & (39) \end{array}

Since $\overset{\cdot}{\hat{V}} = 0$ when d_i − b ≤ 0 and $\overset{\cdot}{\hat{V}} = α \cdot Γ \cdot [d_{i} - b] > 0$ if d_i − a > 0, consider Ṽ ∈ L₂[0, T₂] leading to $- t r [Ṽ^{T} α^{- 1} \overset{\cdot}{\hat{V}}] \leq 0$ . Substituting 34–37 into 39, yields:

\begin{array}{l} \overset{\cdot}{V} (s (\underline{e} (t)), \tilde{W}, Ṽ, \tilde{c}, ṽ) \\ \leq s^{T} (\underline{e} (t)) Ṽ \hat{Γ} + s^{T} (\underline{e} (t)) (ω - u_{R C}) \\ = s^{T} (\underline{e} (t)) Ṽ \hat{Γ} + s^{T} (\underline{e} (t)) ω - \frac{1}{2} s^{T} (\underline{e} (t)) s (\underline{e} (t)) - \frac{1}{2} \frac{s^{T} (\underline{e} (t)) s (\underline{e} (t))}{λ^{2}} \\ - \frac{1}{2} s^{T} (\underline{e} (t)) s (\underline{e} (t)) \hat{Γ} {\hat{Γ}}^{T} \\ = - \frac{1}{2} s^{T} (\underline{e} (t)) s (\underline{e} (t)) - \frac{1}{2} {[\frac{s (\underline{e} (t))}{λ} - λ ω]}^{2} - \frac{1}{2} {[s {(\underline{e} (t))}^{T} \hat{Γ} - Ṽ]}^{2} \\ + \frac{1}{2} λ^{2} ω^{2} + \frac{1}{2} Ṽ^{T} Ṽ \end{array}

\begin{array}{l} \leq & - \frac{1}{2} s^{T} (\underline{e} (t)) s (\underline{e} (t)) + \frac{1}{2} λ^{2} ω^{2} + \frac{1}{2} Ṽ^{T} Ṽ . & (40) \end{array}

Integrating 40 from t = 0 to t = T, yields:

\begin{array}{l} V (T) - V (0) \leq - \frac{1}{2} \sum_{i = 1}^{m} \int_{0}^{T} s_{i}^{2} (t) d t + \frac{1}{2} \sum_{i = 1}^{m} λ_{i}^{2} \int_{0}^{T} ω_{i}^{2} (t) d t \end{array}

\begin{array}{l} + \frac{1}{2} \sum_{i = 1}^{m} \int_{0}^{T} {\tilde{ν}}_{i}^{2} (t) d t . & (41) \end{array}

Since V(T) > 0 and V(0) > 0, Equations 30 and 31 lead to $\sum_{i = 1}^{m} \int_{0}^{T} s_{i}^{2} (t) d t < \infty$ .

5. Experimentation

To verify the effectiveness and efficacy of the proposed controller with the new iFBEL, it was applied to two typical humanoid robotic systems, including a three-joint robot manipulator and a six-joint biped robot. A comparative study is also included in this section to evaluate the performance of the proposed controller in reference to two important control approaches including a PID controller and an SMC with fuzzy cerebellar model articulation controller network (FCMAC) (Lin et al., 2009).

PID control is a classic control method, which is linearly combined by proportional control, integral control and differential control. The FCMAC network has the characteristics of rapid convergence, which enable the work to be suitable for the robotic control. The effectiveness of the FCMAC-based network controller has been demonstrated in many recent studies, such as Lin et al. (2016) and Zhao and Lin (2017). The experiments of both three-joint robot manipulator and six-joint biped robot are simulated in MATLAB R2016a. The configuration of the algorithm computer is set as follows: The CPU and the operating system of the development computer are Intel Core i5-4200U CPU@2.30GHz and Windows 10 professional. The source code of the algorithm can be found in this link¹.

The parameters for the robust controller and the iFBEL's Gaussian functions and weights are tuned by using Equations from 32 to 37. The learning rate parameters and iFBEL's network structure are set empirically.

5.1. Three-Joint Robot Manipulator

The first experiment was carried out using a relatively simple three-joint robot manipulator, to mainly practically evaluate the validity of the proposed system. The three-joint robot manipulator used in this experiment is illustrated in Figure 4; and the dynamic equation of such system is expressed as follows:

\begin{array}{l} M (q) \ddot{q} + C (q, \overset{\cdot}{q}) \overset{\cdot}{q} + g (q) = u + τ_{d}, & (42) \end{array}

where q ∈ ℜ³ is the joint angle state vector, $\overset{\cdot}{q} \in ℜ^{3}$ is the velocity vector, $\ddot{q} \in ℜ^{3}$ is the acceleration vector, M(q) ∈ ℜ^3×3 is the inertia matrix, $C (q, \overset{\cdot}{q}) \in ℜ^{3 \times 3}$ is the Coriolis/Centripetal matrix, g(q) ∈ ℜ³ is the gravity vector, and q = [−0.2, 0.5, −0.3]^T, $\overset{\cdot}{q} = 0$ , $\ddot{q} = 0$ are designated as the original state, u ∈ ℜ³ is the output torque. The detailed expression of $M (q), C (q, \overset{\cdot}{q}), g (q)$ and the nominal parameters of the manipulator are provided in Appendix 1.1.

FIGURE 4

Figure 4. The three-links robot manipulator.

The reference trajectories were given as $q_{d 1} = \frac{1}{2} {[\frac{1}{2} (\sin (t + 2.5) + 0.7 \cos (2 t + 1.5)], \sin (t) + \sin (2 t), 0.13 - (\sin (t) + \sin (2 t))]}^{T}$ , $\overset{\cdot}{q_{d 1}} = 0$ , $\ddot{q_{d 1}} = 0$ . To evaluate the robustness of the proposed control system, the reference trajectories were modified as $q_{d 2} = \frac{1}{2} {[\frac{1}{2} (sin (2 t) + cos (t + 1)), sin (2 t) + cos (t + 1), cos (2 t) - sin (t)]}^{T}$ , $\overset{\cdot}{q_{d 2}} = 0$ , $\ddot{q_{d 2}} = 0$ at t = 15s, with the external disturbance of $τ_{d} = ρ_{1} \times {[0.2 sin (2 t), 0.1 cos (2 t), 0.1 sin (t)]}^{T}$ , where ρ₁ = 1 is the amplification coefficient. In order to evaluate the proposed network's performances in various disturbance situations, two coefficients (ρ₁ = 1.5 and ρ₁ = 2) were also used in the experiments. The BEL and the FNN were characterized as follows:

• the number of elements for each state variable: n_E = 5 (elements);

• generalization: n_C = 4 (elements/block);

• the number of blocks for each state variable for both the BEL and FNN: n_b = n_f = 2 (blocks/layer) × 4 (layer) = 8 (blocks);

• the number of receptive fields: n_E = 2 (receptive fields/layer) × 4 (layer) = 8 (receptive fields).

The initial means of the Gaussian functions in the Association Memory Spaces were divided equally and set as [−1, 1] for the BEL, and [−2, 2] for the FNN. The initial variances were set as σ_ij = 0.1 for the BEL, and σ_pq = 0.1 for the FNN, where i = p = 1, 2, 3, and j = q = 1, 2…, 8. The weights of both the BEL and the FNN were initialized as zero and then automatically adjusted during the online training process. In addition, the learning rates were set as follows: η_ω = 20, η_m = 0.001, η_v = 0.001, α = 0.01, b = 0.1, c = 0.1.

The parameters of PID controller in the comparison experiments were set as: κ_P = 15, κ_I = 0.2, κ_D = 0.5, where κ_P, κ_I and κ_D are the coefficients of the proportional controller, integral controller and differential controller. FCMAC controller in the comparison experiments has the same parameters as FNN does.

The simulated position responses and the tracking errors at ρ₁ = 1 are shown in Figure 5. To better distinguish these values for the three controllers, Figures 6, 7 show the amplified trajectory responses and the tracking errors at t = 0 and t = 15. In Figure 6, the PID controller required 1.4s, 1.3s, and 0.05s for Joints 1, 2, and 3 to converge, respectively, while the FCMAC required 1.2s, 1.3s, and 0.05s for these joints respectively; however, the proposed iFBEL controller just needed 1.1s, 1.3s, 0.03s for these joints, respectively. In addition, the iFBEL performed the best when t = 15s.

FIGURE 5

Figure 5. Trajectory responses (in the left) and tracking errors (in the right) of Joints 1, 2, and 3 at ρ₁ = 1. The solid line indicates the performance of the iFBEL; the dotted line represents that of the PID controller; and the dot dash line implies that of the FCMAC controller. (A) Trajectory response and tracking error of Joint 1. (B) Trajectory response and tracking error of Joint 2. (C) Trajectory response and tracking error of Joint 3.

FIGURE 6

Figure 6. Amplified trajectory responses (in the left) and tracking errors (in the right) at t = 0 of Joints 1, 2, and 3. The solid line indicates the performance of the iFBEL; the dotted line represents that of the PID controller; and the dot dash line implies that of the FCMAC controller. (A) Amplified trajectory response and tracking error of Joint 1. (B) Amplified trajectory response and tracking error of Joint 2. (C) Amplified trajectory response and tracking error of Joint 3.

FIGURE 7

Figure 7. Amplified trajectory responses (in the left) and tracking errors (in the right) at t = 15 of Joints 1, 2, and 3. The solid line indicates the performance of the iFBEL; the dotted line represents that of the PID controller; and the dot dash line implies that of the FCMAC controller. (A) Amplified trajectory response and tracking error of Joint 1. (B) Amplified trajectory response and tracking error of Joint 2. (C) Amplified trajectory response and tracking error of Joint 3.

The accumulated RMSE values at ρ₁ = 1 during the entire experiment are listed in Table 1, which also proved that the proposed iFBEL controller outperformed others. However, the difference among the three controllers is insignificant. The FCMAC and the PID controllers also generated good control performances in this experiment, because the three-joint manipulator system is not very complicated. The accumulated RMSE values under ρ₁ = 1.5 and ρ₁ = 2 are listed in Tables 2, 3, respectively. With the increase of disturbance, the errors of the three controllers also increased. However, the iFBEL also achieved the best performance under the two disturbance situations. This proves that the proposed iFBEL can well handle larger disturbances.

TABLE 1

Table 1. The accumulated RMSE values of each joint at ρ₁ = 1.

TABLE 2

Table 2. The accumulated RMSE values of each joint at ρ₁ = 1.5.

TABLE 3

Table 3. The accumulated RMSE values of each joint at ρ₁ = 2.0.

5.2. The Biped Robot

The configuration of the six-link biped robot used in this second experiment is illustrated in Figure 8. The experiment reported in the last sub-section was mainly used to validate the proposed system, but the experiment reported in this sub-section was primarily used to evaluate the efficiency and efficacy of the proposed control system. The dynamic equation of the robot is given as follow:

\begin{array}{l} M (q) \ddot{q} + C (q, \overset{\cdot}{q}) \overset{\cdot}{q} + g (q) = u + τ_{d}, & (43) \end{array}

where q ∈ ℜ⁶, $\overset{\cdot}{q} \in ℜ^{6}$ , $\ddot{q} \in ℜ^{6}$ are the joint angle state vector, velocity vector and acceleration vector respectively, and M(q) ∈ ℜ^6×6, $C (q, \overset{\cdot}{q}) \in ℜ^{6 \times 6}$ , g(q) ∈ ℜ⁶ are the inertia matrix, the Coriolis/Centripetal matrix and the gravity vector respectively, u ∈ ℜ⁶ is the output torque. More details for $M (q), C (q, \overset{\cdot}{q}), g (q)$ and the nominal parameters of the biped robot can be found in Appendix 1.2.

FIGURE 8

Figure 8. The six-links biped robot used in the experiment.

This experiment also considered the phases of signal support of a gait cycle. The analysis planning and walking pattern generation are detailed in Appendix 1.3. The generated gait trajectory $q_{d} = {[θ_{1}, θ_{2}, \dots, θ_{6}]}^{T}$ , $\overset{\cdot}{q_{d}} = 0$ , $\ddot{q_{d}} = 0$ were set as the reference trajectories of the biped robot. The initial angles of each joint were given as q = [0.37, 0.5, 0.75, −0.15, −0.56, 0.85]^t, $\overset{\cdot}{q} = 0$ , $\ddot{q} = 0$ . τ_d = ρ₂ × exp(−0.1t)_6×1 was used in this experiment as the external disturbance, where ρ₂ = 1 is the amplification coefficient.

The BEL and the FNN are characterized as the same with that used in the first experiment as reported in section 5.1, but with different initializations. In particular, the initial means of the Gaussian functions in the Association Memory Spaces in this experiment were divided equally and set as [−1.4, 1.4] for the BEL, and [−1.6, 1.6] for the FNN. The initial variances were set as σ_ij = 0.01 for the BEL and σ_pq = 0.5 for the FNN, where i = p = 1, 2…, 6, and j = q = 1, 2…, 8. The weights of both sub-systems were initialized from zero and then automatically adjusted during the online training stage. In this experiment, the learning rates were chosen as η_ω = 0.01, η_m = 0.001, η_v = 0.001, α = 0.01, b = 0.05, and c = 0.01.

The parameters of PID controller in the second experiment were set as: κ_P = 8, κ_I = 0.5, κ_D = 1.3. FCMAC controller in the second experiment also has the same parameters as FNN does.

The simulated position responses and the tracking errors at ρ₂ = 1 led by the three controllers are illustrated in Figures 9, 10; with the performances of Joints 1, 2 and 3 illustrated in Figure 9 and those of Joints 4, 5, and 6 in Figure 10. The PID controller had a significant convergence delay, which therefore represented the worst performance within the three controllers. It is difficult from these figures to distinguish the performances led by the FCMAC and the iFBEL controllers, and thus the trajectories resulted from all the controllers in the range of [−1.4s, 1.4s] are magnified as displayed in Figures 11, 12 for better visualization and thus easier investigation.

FIGURE 9

Figure 9. Trajectory responses (in the left) and tracking errors (in the right) of Joints 1, 2, and 3 at ρ₂ = 1. The solid line indicates the performance of the iFBEL; the dotted line represents that of the PID controller; and the dot dash line implies that of the FCMAC controller. (A) Trajectory response and tracking error of Joint 1. (B) Trajectory response and tracking error of Joint 2. (C) Trajectory response and tracking error of Joint 3.

FIGURE 10

Figure 10. Trajectory responses (in the left) and tracking errors (in the right) of Joints 4, 5, and 6 at ρ₂ = 1. The solid line indicates the performance of the iFBEL; the dotted line represents that of the PID controller; and the dot dash line implies that of the FCMAC controller. (A) Trajectory response and tracking error of Joint 4. (B) Trajectory response and tracking error of Joint 5. (C) Trajectory response and tracking error of Joint 6.

FIGURE 11

Figure 11. Amplified trajectory responses (in the left) and tracking errors (in the right) of Joints 1, 2, and 3. The solid line indicates the performance of the iFBEL; the dotted line represents that of the PID controller; and the dot dash line implies that of the FCMAC controller. (A) Amplified trajectory response and tracking error of Joint 1. (B) Amplified trajectory response and tracking error of Joint 2. (C) Amplified trajectory response and tracking error of Joint 3.

FIGURE 12

Figure 12. Amplified Trajectory responses (in the left) and tracking errors (in the right) of Joints 4, 5, and 6. Solid line indicate iFBEL with the dotted one point PID controller and the dot dash one imply FCMAC controller. (A) Amplified trajectory response and tracking error of Joint 4. (B) Amplified trajectory response and tracking error of Joint 5. (C) Amplified trajectory response and tracking error of Joint 6.

From Figures 11, 12, it is clear that the PID controller could not converge rapidly in all the joints of the biped robot. The performances of the FCMAC and the iFBEL regarding all of the joints were very similar; both controllers rapidly converged the tracking errors. The tracking error amplitudes of the FCMAC controller in Joints 1, 2, 3, and 6 were larger than those of the iFBEL controller, which indicates the superiority of the proposed iFBEL controller.

The accumulated RMSE values are listed in Table 4. It is clear from this table that the convergence time of the iFBEL controller was shorter than those of the PID and the FCMAC for each joint. In this case, the RMSE values also proved that the proposed iFBEL controller achieved the best control performance within the three compared controllers used in this comparative study. The accumulated RMSE values at ρ₂ = 1.5 and ρ₂ = 2 are also given in Tables 5, 6, respectively. The iFBEL also achieved the best performance under the two disturbance situations.

TABLE 4

Table 4. The accumulated RMSE value of each joint of the biped robot at ρ₂ = 1.

TABLE 5

Table 5. The accumulated RMSE value of each joint of the biped robot at ρ₂ = 1.5.

TABLE 6

Table 6. The accumulated RMSE value of each joint of the biped robot at ρ₂ = 2.

6. Discussion

A humanoid robot usually consists of multiple joints and suffers many unexpected disturbances; therefore, the controller of humanoid robot must own the powerful non-linear approximation ability to handle these complex situations. Based on the results of the two simulations, the proposed iFBEL network successfully demonstrated a rapid convergence ability and a nonlinear mapping capability. In the two simulations, the iFBEL controller can always achieve the fastest reaction speed to reduce errors; in addition, the iFBEL controller still achieved the best performance in different disturbance patterns. Therefore, the proposed network is suitable for the control of humanoid robots.

Although the performance of iFBEL-based controller was better than those of the FCMAC and PID controllers, the iFBEL network's structure is more complicated than that of the FCMAC. To address this issue, we believe that a recurrent mechanism usually uses a simple network structure to achieve good dynamic performance. Therefore, in the future work, we will improve our method by embedding a recurrent network inside the iFBELC controller.

7. Conclusion

This paper proposed a novel humanoid robot controller, which integrates some components from a fuzzy neural network and a brain emotional learning model into a sliding mode controller for dynamic non-linear control. It has been theoretically proven that the proposed system is asymptotically stable, thus guaranteeing the convergence. Experimental results and comparative studies further verified this, and demonstrated precise position tracking, more favorable stability, and better performance in reference to the results generated from the recently-developed network controllers of PID and FCMAC.

This research can be further improved in several directions. The current iFBEL network does not include any recurrent mechanism, but such a mechanism can generally improve the dynamic performance of a network. Therefore, a future investigation will focus on the development of the recurrent feature to better support the iFBEL controller. In addition, the undesired chattering situation existing in the sliding surface has not been fully investigated; more efforts will focus on this issue. Furthermore, the proposed approach was only practically applied to the dynamic humanoid robot control in this work. It is worthwhile to apply the approach to a wider range of applications to fully discover its potential.

Author Contributions

WF contributed to this work by developing the proposed method and preparing the experiments. FC contributed the implementation of the proposed method and writing the manuscript. C-ML conducted the statistical analysis of the experimental results. LY contributed the planning and analysis of the experiments and writing of the manuscript. CS contributed to the design of the proposed method. CZ contributed to the writing of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Nos. 61673322, 61673326, and 91746103), the Fundamental Research Funds for the Central Universities (No. 20720160126), Natural Science Foundation of Fujian Province of China (Nos. 2017J01128 and 2017J01129), and the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No. 663830.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We are very grateful to the reviewers for their constructive comments which have helped significantly in revising this work.

Supplementary Material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fnbot.2019.00002/full#supplementary-material

Abbreviations

BEL, Brain Emotional Learning; BFC, BEL and FNN Controller; FCMAC, Fuzzy Cerebellar Model Articulation Controller; FNN, Fuzzy Neural Network; iFBEL, improved Fuzzy Brain Emotional Learning; ISMC, Ideal Sliding Model Controller; PID, Proportional Integral Derivative; RC, Robust Controller; RMSE, Root Mean Square Error; SMC, Sliding Mode Control.

Footnotes

1. ^ https://github.com/Xiaomu-Shan/Experiments-of-iFBEL

References

Aguilar-Ibanez, C., Suarez-Castanon, M. S., Mendoza-Mendoza, J., de Jesus Rubio, J., and Martnez-Garca, J. C. (2018). Output-feedback stabilization of the pvtol aircraft system based on an exact differentiator. J. Intell. Robot. Sys. 90, 443–454. doi: 10.1007/s10846-017-0660-0

CrossRef Full Text | Google Scholar

Boldbaatar, E. A., and Lin, C. M. (2015). Self-learning fuzzy sliding-mode control for a water bath temperature control system. Int. J. Fuzzy Syst. 17, 31–38. doi: 10.1007/s40815-015-0015-6

CrossRef Full Text | Google Scholar

Chen, C. H., Chung, C. C., Chao, F., Lin, C. M., and Rudas, I. J. (2015). Intelligent robust control for uncertain nonlinear multivariable systems using recurrent cerebellar model neural networks. Acta Polytech. Hung. 12, 7–33. doi: 10.12700/APH.12.5.2015.5.1

CrossRef Full Text | Google Scholar

Chung, C. C., and Lin, C. M. (2015). Fuzzy brain emotional cerebellar model articulation control system design for multi-input multi-output nonlinear. Acta Polytech. Hung. 12, 39–58. doi: 10.12700/APH.12.4.2015.4.3

CrossRef Full Text | Google Scholar

Huang, Q., Yokoi, K., Kajita, S., Kaneko, K., Arai, H., Koyachi, N., et al. (2002). Planning walking patterns for a biped robot. IEEE Trans. Robot. Automat. 17, 280–289. doi: 10.1109/70.938385

CrossRef Full Text | Google Scholar

LeDoux, J. E. (1992). “Chapter the amygdala: neurobiological aspects of emotion, memory, and mental dysfunction,” in Emotion and the Amygdala, ed J. P. Aggleton (New York, NY: Wiley-Liss), 339–351.

Google Scholar

Lee, C. C. (1990). Fuzzy logic in control systems: fuzzy logic controller. I. IEEE Trans. Syst. Man Cyber. 20, 404–418. doi: 10.1109/21.52551

CrossRef Full Text | Google Scholar

Li, H., Liu, M., and Zhang, F. (2017). Geomagnetic navigation of autonomous underwater vehicle based on multi-objective evolutionary algorithm. Front. Neurorobot. 11:34. doi: 10.3389/fnbot.2017.00034

CrossRef Full Text | Google Scholar

Lin, C. M., and Chung, C. C. (2015). Fuzzy brain emotional learning control system design for nonlinear systems. Int. J. Fuzzy Syst. 17, 117–128. doi: 10.1007/s40815-015-0020-9

CrossRef Full Text | Google Scholar

Lin, C. M., Chung, C. M., and Hsu, C. F. (2009). “Adaptive cmac control system design for a class of nonlinear systems,” in IEEE International Conference on Systems, Man and Cybernetics (San Antonio, TX), 4508–4513.

Google Scholar

Lin, C. M., and Hsu, C. F. (2015). Hybrid fuzzy sliding-mode control of an aeroelastic system. J. Guid. Cont. Dyn. 25, 829–832. doi: 10.2514/2.4955

CrossRef Full Text | Google Scholar

Lin, F. J., Sun, I. F., Yang, K. J., and Chang, J. K. (2016). Recurrent fuzzy neural cerebellar model articulation network fault-tolerant control of six-phase permanent magnet synchronous motor position servo drive. IEEE Trans. Fuzzy Syst. 24, 153–167. doi: 10.1109/TFUZZ.2015.2446535

CrossRef Full Text | Google Scholar

Liu, Z., Chen, C., Zhang, Y., and Chen, C. L. P. (2015). Adaptive neural control for dual-arm coordination of humanoid robot with unknown nonlinearities in output mechanism. IEEE Trans. Cyber. 45, 507–518. doi: 10.1109/TCYB.2014.2329931

PubMed Abstract | CrossRef Full Text | Google Scholar

Lotfi, E., and Akbarzadeht, M. R. (2013). “Emotional brain-inspired adaptive fuzzy decayed learning for online prediction problems,” in IEEE International Conference on Fuzzy Systems (Hyderabad), 1–7.

Google Scholar

Meda-Campana, J. A., Araceli, G. M., Rubio, J. D. J., Tapia-Herrera, R., Hernandez-Cortes, T., Curtidor-Lopez, A. V., et al. (2018). Design of stabilizers and observers for a class of multivariable ts fuzzy models on the basis of new interpolation functions. IEEE Trans. Fuzzy Syst. 26, 2649–2662. doi: 10.1109/TFUZZ.2017.2786244

CrossRef Full Text | Google Scholar

Pan, Y., Liu, Y., Xu, B., and Yu, H. (2016). Hybrid feedback feedforward: an efficient design of adaptive neural network control. Neural Netw. 76, 122–134. doi: 10.1016/j.neunet.2015.12

PubMed Abstract | CrossRef Full Text | Google Scholar

Rubio, J. D. J. (2018). Discrete time control based in neural networks for pendulums. Appl. Soft Comput. 68, 821–832. doi: 10.1016/j.asoc.2017.04.056

CrossRef Full Text | Google Scholar

Rubio, J. D. J., Garcia, E., Aquino, G., Aguilar-Ibanez, C., Pacheco, J., and Zacarias, A. (2018). Learning of operator hand movements via least angle regression to be teached in a manipulator. Evol. Syst. doi: 10.1007/s12530-018-9224-1. [Epubh ahead of print].

CrossRef Full Text | Google Scholar

Rubio, J. J. (2012). Modified optimal control with a backpropagation network for robotic arms. IET Control Theory Appl. 6, 2216–2225. doi: 10.1049/iet-cta.2011.0322

CrossRef Full Text | Google Scholar

Shih, C. L., Li, Y. Z., Churng, S., and Lee, T. T. (1990). “Trajectory synthesis and physical admissibility for a biped robot during the single-support phase,” in Proceedings of IEEE International Conference on Robotics and Automation, 1990, Vol. 3 (Cincinnati, OH), 1646–1652.

Google Scholar

Wu, Q., Lin, C.-M., Fang, W., Chao, F., Yang, L., Shang, C., and Zhou, C. (2018). Self-organizing brain emotional learning controller network for intelligent control system of mobile robots. IEEE ACCESS 6, 59096–59108. doi: 10.1109/ACCESS.2018.2874426

CrossRef Full Text | Google Scholar

Zhao, J., and Lin, C. M. (2017). An interval-valued fuzzy cerebellar model neural network based on intuitionistic fuzzy sets. Int. J. Fuzzy Syst. 19, 1–14. doi: 10.1007/s40815-017-0321-2

CrossRef Full Text | Google Scholar

Zhou, D., Shi, M., Chao, F., Lin, C.-M., Yang, L., Shang, C., et al. (2018). Use of human gestures for controlling a mobile robot via adaptive cmac network and fuzzy logic controller. Neurocomputing 282, 218–231. doi: 10.1016/j.neucom.2017.12.016

CrossRef Full Text | Google Scholar

Keywords: brain emotional learning network, humanoid robot control, Sliding mode control, neural network control, fuzzy neural network

Citation: Fang W, Chao F, Lin C-M, Yang L, Shang C and Zhou C (2019) An Improved Fuzzy Brain Emotional Learning Model Network Controller for Humanoid Robots. Front. Neurorobot. 13:2. doi: 10.3389/fnbot.2019.00002

Received: 18 June 2018; Accepted: 11 January 2019;
Published: 04 February 2019.

Edited by:

Feihu Zhang, Northwestern Polytechnical University, China

Reviewed by:

Tianhua Chen, University of Huddersfield, United Kingdom
Muye Pang, Wuhan University of Technology, China
Jose De Jesus Rubio, Instituto Politcnico Nacional, Mexico

Copyright © 2019 Fang, Chao, Lin, Yang, Shang and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fei Chao, ZmNoYW9AeG11LmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.