Emotional neural networks with universal approximation property for stable direct adaptive nonlinear control systems

Engineering Applications of Artificial Intelligence 89 (2020) 103447 Contents lists available at ScienceDirect Engineering Applications of Artificia...

Download PDF

1MB Sizes 0 Downloads 56 Views

Report

PDF Reader
Full Text

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Contents lists available at ScienceDirect

Engineering Applications of Artificial Intelligence journal homepage: www.elsevier.com/locate/engappai

Emotional neural networks with universal approximation property for stable direct adaptive nonlinear control systems✩ F. Baghbani, M.-R. Akbarzadeh-T ∗, M.-B. Naghibi-Sistani, Alireza Akbarzadeh Department of Electrical Engineering, Center of Excellence on Soft Computing and Intelligent Information Processing, Ferdowsi University of Mashhad, Mashhad, Iran

ARTICLE

INFO

Keywords: Brain emotional learning Direct adaptive control Nonlinear control Lyapunov stability theory Neural networks

ABSTRACT Universal approximation, continuity, and differentiability are desirable properties of any computational framework, including those that rise from human cognition and/or are inspired by nature. Emotional machines constitute one such framework, but few studies have addressed their mathematical properties. Here, we propose a Continuous Radial Basis Emotional Neural Network (CRBENN) that benefits from the universal approximation property, continuous output, and simple structure of RBF; while keeping the fast response properties of emotion-based approaches. As such, CRBENN is amenable to a wide array of challenging problems in systems engineering and artificial intelligence. Here, we propose a CRBENN-based direct adaptive robust emotional neuro-control approach (DARENC) for a class of uncertain nonlinear systems. Stability is theoretically established using Lyapunov analysis of the closed-loop system. DARENC is then applied to control an inverted pendulum system, and the performance of the controller is numerically compared with several competing fuzzy, neural, and emotional controllers. The simulation results indicate improved tracking performance, better disturbance rejection, and less control effort. Finally, DARENC is implemented on a realworld 3-PSP (spherical–prismatic–spherical) parallel robot in our laboratory. The experimental results show the satisfactory performance of the robot in tracking the desired trajectory with low control effort.

1. Introduction In the design of controllers for nonlinear and complex systems, we often like to begin with a computational framework that has good approximation properties, continuity, and differentiability. The realms of neural networks and fuzzy logic, for instance, have made great strides by this way of solving problems. For emotion-based computational models, however, this has not been the case. The current emotional models are motivated by the way the emotional stimuli are evaluated in the relevant parts of the human brain that are responsible for emotional processing. They have been employed in various decision making and control engineering problems and have shown desirable numerical properties such as fast response, simple structure, learning ability, and robustness to uncertainties. And yet, most of them are problem specific; and in the realm of control engineering, few works have investigated important mathematical results such as stability. However, the stable emotional controllers often assume that the emotional model has the approximation property of the ordinary neural networks without necessarily offering any proof. Accordingly, we would like to

propose a general emotion-based computational model that is consistent with the basic laws of the emotional brain and yet is amenable to mathematical rigor and analysis. Such an emotional framework should illustrate mathematical properties such as function approximation property, continuity, differentiability, and above all, stability, along with the established capabilities of the emotional models. Most of the current emotion-based architectures are based on a simple computational model originally introduced by Moren and Balkenius (2000). Moren’s model of brain emotional learning (BEL) (Moren and Balkenius, 2000) consists of the Amygdala, which is known to be the main part where emotional learning occurs, the Orbitofrontal Cortex (OFC), the sensory cortex, and the Thalamus. The input data first enters the Thalamus, which is considered a simple identity function. The sensory cortex then receives the output of the Thalamus and distributes it to the Amygdala and the OFC parts. The overall output of the model is computed as the subtraction of the OFC’s output from the Amygdala’s output. The weights of the Amygdala nodes can only increase, but the weights of the OFC nodes can either decrease or increase, which inhibit the inappropriate responses of the Amygdala. The Amygdala

✩ No author associated with this paper has disclosed any potential or pertinent conflicts which may be perceived to have impending conflict with this work. For full disclosure statements refer to https://doi.org/10.1016/j.engappai.2019.103447. ∗ Corresponding author. E-mail addresses: [email protected] (F. Baghbani), [email protected] (M.-R. Akbarzadeh-T), [email protected] (M.-B. Naghibi-Sistani), [email protected] (A. Akbarzadeh).

https://doi.org/10.1016/j.engappai.2019.103447 Received 1 June 2019; Received in revised form 13 December 2019; Accepted 22 December 2019 Available online xxxx 0952-1976/© 2020 Elsevier Ltd. All rights reserved.

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

here, the theoretical results are general from a modeling perspective and present a general emotion-based computational framework. The rest of this paper is organized as follows. In Section 2, the emotion-based models are reviewed. In Section 3, the proposed CRBENN is described, and its universal approximation property is proved. Then, problem formulation for a direct adaptive control structure is presented in Section 4. The proposed adaptive BEL-based control methodology is explained in Section 5. Next, the simulation results of the proposed controller are presented in Section 6. Finally, conclusions are drawn in Section 7. For better readability, we provide some of the theoretical preliminaries on the universal approximation property of the RBF networks in Appendix.

also receives an input from the Thalamus. This input connects the Thalamus directly to the Amygdala, resulting in a fast response and fault tolerance (Moren, 2002). In the first version of the model (Moren and Balkenius, 2000), this input is the maximum over all the inputs. While, in the second version (Moren, 2002), Moren argues that this type of connection is too coarse to model the exact functionality of this input. Accordingly, due to the harsh results in the simulation and interferences with normal learning, this input is omitted in his further investigations (Moren, 2002). Here, we begin with designing a new continuous radial-basis emotional neural network (CRBENN). The CRBENN has basis functions in the nodes of Thalamus, but there is no direct connection from the Thalamus to the Amygdala. In this way, the CRBENN with simple manipulations is shown to be equivalent to the RBF networks, but with the added properties of the emotional models because of the Amygdala component and its non-decreasing weights. Consequently, its universal approximation property is simply proved based on the similar property of the RBF networks. CRBENN thus benefits from the features of the RBF networks such as universal approximation, continuity, and differentiability with respect to weights. It is also shown that the proof of the universal approximation property for the CRBENN is general and any symmetric radial basis kernel function can be considered as the nodes of the Thalamus. CRBENN is then employed in a direct adaptive control structure to approximate the control input directly. The important aspect of the proposed controller is that we determine the overall stability of an emotional-based controller based on the Lyapunov stability theory. Such theoretical result has been reported in few emotion-based papers that are generally with specific considerations and simplifications. Another point is that the update laws are consistent with basic models of the emotional mind, i.e., they meet the requirement of the non-decreasing Amygdala weights. In short, the proposed method in comparison with previous approaches has the following novel aspects. First, CRBENN offers a simpler and continuous mapping with universal approximation property. Hence, as a general computational framework, it can be applied to various control engineering problems. This is in comparison with our earlier work WTAENN in Lotfi and Akbarzadeh-T (2016) that requires 𝑚 BEL modules to prove the universal approximation property and leads to a discontinuous output with a higher computational burden. In addition, this is in comparison with the previously published emotional controllers that generally assume that the emotional model has approximation property of the neural networks without mathematical proof. We should mention that the universal approximation property is an important and basic mathematical property that puts the proposed computational framework in the same class of approaches as polynomials and Fourier series. For a similar level of contribution, one may refer to the seminal works of Hornik and his colleagues in 1989 on neural networks (Hornik, 1989) and Castro in 1995 on fuzzy systems (Castro, 1995). Second, CRBENN is employed in a direct adaptive control framework for a class of uncertain affine nonlinear systems, and the stability of the overall structure is proved using the Lyapunov stability theory without deviating from the basic laws of the emotional brain. To validate its capabilities, the proposed control method is applied to an inverted pendulum system and the Duffing–Holmes chaotic system under different operational case studies, i.e., without disturbance, with external disturbance, and with measurement noise. The results are compared with several other competing RBFNN, fuzzy, and emotional controllers, which lead to the superiority of the proposed method in better tracking performance, lower computational time, and less control effort. Finally, the real-world applicability of the proposed controller is experimentally confirmed by implementing it on a 3-PSP parallel robot in our robotics laboratory at the Ferdowsi University of Mashhad, compared with our previous work in Baghbani et al. (2018) that was based on simulation results. We should emphasize that, even though we have applied the proposed approach to adaptive control systems

2. Literature review on emotional models There are a number of recent works that have gainfully used Moren’s original BEL model (Moren and Balkenius, 2000). Some of them are with decision-making and some with control backgrounds. From the decision-making perspective, a limbic-based artificial emotional neural network (LiAENN) is designed in Lotfi and Akbarzadeh-T (2014) based on Moren’s original model, and is applied it to facial detection and emotion recognition. Bias and activation function are added to the Amygdala and the OFC to program the LiAENN based on the artificial perceptron model. Later in Lotfi and Akbarzadeh-T (2016), a brain-inspired winner-take-all emotional neural network (WTAENN) structure is presented with proved universal approximation property. The WTAENN has 𝑚 sensory cortex modules. Each of these modules alone is a BEL structure to prove the approximation property according to multi-layered artificial neural networks (MLANN). In each time step, according to a competitive learning mechanism, only one sensory cortex module wins and produces the final output, which leads to a discontinuous output. The WTAENN is then used in several decisionmaking problems such as time series prediction, curve fitting, pattern recognition, and classification. In a recent paper, Parsapoor (2019) comprehensively reviews the brain emotional learning-inspired models (BELiMs) in historical, theoretical, structural, and functional aspects. She then validates the BELiMs in time-series prediction problems. Moren’s model of emotion was also a source of inspiration in the design of control systems. The brain emotional learning-based intelligent controller (BELBIC) by Lucas and his colleagues in 2004 (Lucas et al., 2004) was a pioneering work in this regard. BELBIC was employed as the only control block of the system, and the sensory and reward functions were designed in such a way to reach the control goal. The controller showed excellent control action and robustness to disturbances and system parameter variations for some single-input single-output (SISO) and multi-input multi-output (MIMO) linear and nonlinear systems. BELBIC has since been applied to a number of realworld and simulation control problems (Daryabeigi et al., 2019, 2014; Dehkordi et al., 2011; El-Garhy and El-Shimy, 2015; El-saify et al., 2017; Garmsiri and Sepehri, 2014; Gunapriya and Sabrigiriraj, 2017; Khalghani et al., 2016; Khalghani and Khooban, 2014; Khooban and Javidan, 2016; Markadeh et al., 2011; Mehrabian and Lucas, 2008; Nahian et al., 2014; Rouhani et al., 2007; Senthilkumar and Vijayan, 2014; Sharbafi et al., 2010; Soreshjani et al., 2015). It has indicated remarkable results such as fast response, learning capability, simple structure, good tracking performance, and robustness to uncertainties such as noise, disturbance, and parameter variation. However, the above control algorithms are application-specific, and theoretical stability analysis remains to be investigated. Moreover, BELBIC is often used as the only control block of the system that lacks other robust or adaptive control concepts. Few of the works that involve the stability of BELBIC center on linear systems or system identification. For instance, the stability of a known linear control system is derived by a numerical technique called ‘‘cell to cell’’ in Shahmirzadi and Langari (2005), where BELBIC has specific reward and sensory input. In Jafarzadeh et al. (2008), 2

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

(2000) or lack a proof for the approximation property of their emotionbased models. Only two control frameworks, the G-BELBIC in Lotfi (2018) and the ARBENC in Baghbani et al. (2018), are based on modeling paradigms that have the universal approximation property of the WTAENN in Lotfi and Akbarzadeh-T (2016). Furthermore, the papers with non-decreasing update rules of the Amygdala usually consider update laws as a function of parameters such as the output of the emotional model and input to the model, see for example the works of Lin and coauthors (Chung and Lin, 2015; Fang et al., 2019; Lin and Chung, 2015; Zhao et al., 2019). In this paper, we first confirm the approximation property of the proposed emotional neural network. Then, for evaluating the capabilities of the proposed emotional neural network, we proceed to use it in a stable adaptive control structure, while staying true to the nondecreasing adaptive laws of the Amygdala, similar to the basic model by Moren in Moren and Balkenius (2000). In the next section, the proposed emotional neural network is introduced. Then in Sections 4–5, it is employed in a direct adaptive control structure.

the stability of first- and second-order linear BELBIC-based control systems is studied for regulating problems. Specific reward and sensory functions are considered, but the update rules of the Amygdala do not have the ‘max’ operator for a straightforward mathematical proof of stability. More recently, a generalized BELBIC is presented by Lotfi (2018) in which a competitive BEL model is applied for identification of the system under control. Using some restrictions on the learning weights of the Amygdala and the OFC, the convergence of the weights of the BEL-based model is verified. In Jafari and Xu (2019) the BELBIC controller is applied to the flocking control of the multi-agent systems with linear double integrator dynamics, and the overall stability of the system is proved. The stability of nonlinear control systems by emotion-based frameworks is addressed in few studies and generally center on the Lyapunov stability theory. In this regard, the pioneering works of Lin and Chung in Lin and Chung (2015) and Chung and Lin (2015) investigate the stability of adaptive fuzzy BEL-based controllers for nonlinear systems. However, their approach operates based on the assumption that the direct adaptive fuzzy brain emotional controller approximates the ideal controller. The update laws of the Amygdala and the OFC are not investigated in the proof of stability. The third study in Hsu and Lee (2017) shows that the parameter learning of the OFC component in the BEL model is equivalent to the update laws derived from a gradient descent algorithm. The authors then conclude the stability of the proposed control structure, assuming the gradient descent update laws follow those of the Amygdala. The fourth study is a radial basis emotional neural network (RBENN), which was earlier proposed by the authors in Baghbani et al. (2018). RBENN has radial basis functions in the nodes of the Thalamus, which makes it a more transparent and general model in comparison with previous application-specific emotional models. In this first work by the authors on the stability of closedloop emotional control systems, RBENN is employed in an indirect adaptive control structure. Suitable adaptive laws consistent with the basic update rules of the basic model in Moren and Balkenius (2000) are designed for the RBENN. The stability of the overall control structure is also derived according to the Lyapunov stability analysis. However, the approximation property of the RBENN is based on the WTAENN, which needs 𝑚 sensory cortex modules to prove the approximation property. There are other recent studies that are also based on stable emotional models. In Le et al. (2018) self-evolving interval type-2 fuzzy brain emotional learning control is designed for chaotic systems. The update rules of the weights of the Amygdala and the OFC are attained based on the gradient descent method, and the overall stability of the closed-loop system is assured using the Lyapunov stability theory. In Wu et al. (2018), self-organizing brain emotional learning controller is designed for the control of mobile robots. The update rules of the Amygdala are attained using the Lyapunov stability theory rather than the original non-decreasing rules of the BEL model to obtain more robust performance. In Zhao et al. (2019) wavelet fuzzy brain emotional controller is proposed for a class of MIMO nonlinear systems. The update laws of the emotional model are not assessed in the proof of the stability, similar to their previous works in Lin and Chung (2015) and Chung and Lin (2015). In Fang et al. (2019) an improved fuzzy BEL model (iFBEL) neural network is introduced and applied for the stable direct adaptive control of MIMO nonlinear systems. In Akhormeh et al. (2019) normalized brain emotional learning model (NBELM) is presented, and the convergence of the weights of the model is proved. The NBEL is then used to estimate the parameters of nonlinear systems. In Khorashadizadeh et al. (2019), BELBIC is used to approximate the unknown dynamics of a class of affine nonlinear SISO systems. However, the minimum approximation error and upper bound of the ideal weights of the Amygdala and the OFC are needed for the stability analysis. The main point about the above control approaches is that they either are inconsistent with the non-decreasing update rule of the Amygdala as explained in the seminal work of Moren and Balkenius

3. The proposed CRBENN and its universal approximation property In this section, the structure of the proposed CRBENN is presented, and its universal approximation property is derived using the universal approximation property of RBF networks. 3.1. The structure of CRBENN The configuration of the proposed CRBENN is depicted in Fig. 1. As this figure shows, the overall structure is similar to the previous BELbased models such as BELBIC (Lucas et al., 2004). But, CRBENN has three main characteristics. Firstly, there is no direct connection from the Thalamus to the Amygdala in CRBENN. This is in following Moren’s work in Moren (2002) that expressed that the direct connection from the Thalamus to the Amygdala interferes with normal learning and produces harsh results in the simulation. Once this direct connection is omitted, the output of the CRBENN becomes smooth and continuous. Secondly, each node in the Thalamus is a radial basis function. This is similar to our previous work in Baghbani et al. (2018), but different from those of Moren and Balkenius (2000) and BELBIC (Lucas et al., 2004), where the Thalamus is a simple identity function. A combined effect of these two characteristics leads to the third, and arguably the most important, property of CRBENN, that is its universal approximation property. Note that the proof of the universal approximation property for the CRBENN is general and therefore, any symmetric radial basis kernel function can be considered at the nodes of the Thalamus, as is discussed in the next section. But here Gaussian functions are considered as a popular example of RBFs. The input first enters the Thalamus, and the radial basis functions are constructed as follows, ( [( )𝑇 ( ) ]) 𝑧 − 𝜇𝑗 𝑧 − 𝜇𝑗 𝜑𝑗 = exp − , 𝑗 = 1, … , 𝑚 (1) 𝜎𝑗2 [ ]𝑇 where 𝑧 = 𝑧1 , … , 𝑧𝑛 ∈ R𝑛 is the sensory input vector in general, 𝑛 𝜇𝑗 ∈ R and 𝜎𝑗 > 0 are the corresponding mean and smoothing factor, respectively, and 𝑚 ∈ N is the total number of nodes. As is discussed in Section 3.2, the CRBENN has the universal approximation property and could approximate any nonlinear function with sufficiently large 𝑚. The output of the Thalamus is, therefore, the radial basis function vector of the sensory input. This vector then enters the Amygdala and the OFC through the sensory cortex path to calculate their outputs as (2) and (3), respectively, 𝐸𝑎 =

𝑚 ∑ 𝑗=1

3

𝑉𝑗 𝜑𝑗 = 𝑉 𝑇 𝜑,

(2)

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

𝐸𝑜 =

𝑚 ∑

𝑊𝑗 𝜑𝑗 = 𝑊 𝑇 𝜑,

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Corollary 1. The proposed CRBENN has universal approximation property. That is for any smooth function 𝑓 (𝑥) ∶ R𝑛 → R on a compact set 𝛺 ∈ R𝑛 , and a given 𝜀𝐶𝑅𝐵𝐸 > 0, and for a sufficiently large number 𝑚, there exist the ideal weight vectors 𝑊 ∗ ∈ R𝑚 and 𝑉 ∗ ∈ R𝑚 such that, ( )𝑇 𝑓 (𝑥) = 𝑉 ∗ − 𝑊 ∗ 𝜑 + 𝜀𝐶𝑅𝐵𝐸 , (8)

(3)

𝑗=1

where 𝑉𝑗 and 𝑊𝑗 (𝑗 = 1, … , 𝑚) are the corresponding weights of the Amygdala and the OFC nodes, respectively. The weight vectors are 𝑉 = [ ]𝑇 [ ]𝑇 𝑉1 , 𝑉2 , ⋯ 𝑉𝑚 ∈ R𝑚 and 𝑊 = 𝑊1 , 𝑊2 , ⋯ 𝑊𝑚 ∈ R𝑚 , [ ]𝑇 and 𝜑 = 𝜑1 , 𝜑2 , … , 𝜑𝑚 ∈ R𝑚 is the vector of radial basis functions. As can be seen from (2)–(3), the number of nodes in the Thalamus is equal to the number of nodes in the Amygdala and the OFC. As Fig. 1 shows, the output of the model is computed as the subtraction of the OFC output from the Amygdala output,

Proof. This property is a straightforward conclusion of 𝐸 defined by (7) to be dense in 𝐿𝑝 (R𝑝 ). ■ Note that, the proof of universal approximation property of the RBF network in Girosi and Poggio (1990) and Hartman et al. (1990) could also be extended to prove the universal approximation property for the proposed CRBENN. As is discussed in Section 4, we present new update laws fairly similar to the basic adaptive laws of the basic BEL model and prepare the stability proof based on Lyapunov stability theory.

(4)

𝐸 = 𝐸𝑎 − 𝐸𝑜 .

By substituting (2) and (3) into (4), the overall output of the CRBENN is computed and simplified as follows, 𝐸=

𝑚 ∑

𝑉𝑗 𝜑𝑗 −

𝑗=1

𝑚 ∑

𝑊𝑗 𝜑𝑗 =

𝑗=1

𝑚 ∑ (𝑉𝑗 − 𝑊𝑗 )𝜑𝑗 = (𝑉 − 𝑊 )𝑇 𝜑.

(5) 4. Problem formulation

𝑗=1

In this section, the overall structure of the CRBENN is introduced, and in Section 3.2 the proof of its universal approximation property is provided. In the adaptive control structure of Section 5, appropriate update laws are designed for the weights of the corresponding nodes in the Amygdala and the OFC that are consistent with the Moren model (Moren and Balkenius, 2000). Furthermore, in the control structure of Section 5, the parameters of the RBFs in the Thalamus, i.e. 𝜇𝑗 and 𝜎𝑗 , are fixed for simplicity and to reduce the number of adaptive parameters. In related simulations, the RBF centers are equally spaced in the range of the input to the network, and the smoothing factor is the same for all of the nodes.

To verify the capabilities of the proposed CRBENN such as universal approximation property, simple structure, and learning ability, we employ it in a direct adaptive control problem for a class of uncertain 𝑛th-order nonlinear system as follows, ( ) ( ) ( ) (9) 𝑥(𝑛) = 𝑓 𝑥 + 𝑔 𝑥 𝑢 + 𝑑 𝑥, 𝑡 , ̇ … , 𝑥(𝑛−1) ]𝑇 ∈ R𝑛 is the state vector, 𝑢 is the control where 𝑥 = [𝑥, 𝑥, ( ) input, 𝑑 𝑥, 𝑡 denotes external disturbance that has the upper bound ( ) ‖ ( )‖ as ‖𝑑 𝑥, 𝑡 ‖ ≤ 𝜀𝑑 , 𝑓 𝑥 is an unknown smooth function that satisfies ‖( ) ‖ ‖ ‖ ‖𝑓 𝑥 ‖ ≤ 𝑓1 < ∞ for all 𝑥 in the controllable region 𝑈 ⊂ R𝑛 , in ‖ ‖ ( ) which 𝑓1 is unknown positive constant, and 𝑔 𝑥 is considered a known ( ) smooth function that satisfies 0 < 𝑔 𝑥 ≤ 𝑔1 < ∞, with 𝑔1 > 0 as its upper bound. Note that the main focus of this paper is to examine the ( ) capabilities of the proposed CRBENN, therefore, 𝑔 𝑥 is assumed to be known for better illustrating the main concern of the paper. Some ( ) studies have designed direct adaptive controllers with unknown 𝑔 𝑥 . For example, see Hsueh et al. (2010), Hsueh and Su (2012) and Pan et al. (2014). The tracking error is defined as,

3.2. The proof of universal approximation property for CRBENN Here, the universal approximation property is confirmed for CRBENN based on the universal approximation property of the RBF networks. First, we restate the output of the CRBENN (5) in the general form of radial basis emotional (RBE) network (6) with 𝜑𝑗 as any symmetric radial basis kernel. Consider the family 𝐸 of RBE networks as follows, ) ( 𝑚 ∑ 𝑧 − 𝜇𝑗 , (6) (𝑉𝑗 − 𝑊𝑗 ) 𝜑𝑗 𝐸 (𝑧) = 𝜎𝑗 𝑗=1

𝑒 = 𝑥𝑑 − 𝑥,

where 𝑚 ∈ N is the number of the kernel nodes, 𝑧 ∈ R𝑛 is an input vector, 𝜑𝑗 is a radially symmetric kernel function, which the Gaussian type in (1) is an example of it, 𝜎𝑗 > 0 is the smoothing factor, and 𝜇𝑗 ∈ R𝑛 is the centroid. In the following theorem, it is proved that 𝐸𝜑 is dense in 𝐿𝑝 (R𝑛 ):

where 𝑥𝑑 is the desired trajectory. It is assumed that 𝑥𝑑 and its derivatives up to derivative of 𝑛th order are bounded. The function of error s is defined as follows, 𝑠 = 𝑒(𝑛−1) + 𝛬𝑛−1 𝑒(𝑛−2) + ⋯ + 𝛬1 𝑒,

where 𝑞𝑎 = 𝑥𝑑 (𝑛) + 𝛬𝑛−1 𝑒(𝑛−1) + ⋯ + 𝛬1 𝑒.̇ The goal of this section is to design a direct adaptive control structure while the tracking error converges to zero, and all the signals remain bounded, and while the effect of approximation error and external disturbances are kept below the desired attenuation level. ( ) ( ) ( ) If 𝑓 𝑥 and 𝑔 𝑥 are known and 𝑑 𝑥, 𝑡 = 0, using the feedback linearization method (Slotine and Li, 1991), the following ideal controller is attained, ( )( ( ) ) 𝑢∗ = 𝑔 −1 𝑥 −𝑓 𝑥 + 𝑞𝑎 + 𝐾𝑠 , (13) ( ) ( ) where 𝐾 > 0 is a real constant. The assumptions on 𝑓 𝑥 , 𝑔 𝑥 , and 𝑥𝑑 below (9) ensure the boundedness of 𝑢∗ . Substituting (13) into (12) and after simple manipulations 𝑠̇ + 𝐾𝑠 = 0 is attained, which is a Hurwitz equation and makes the error function converge to zero. However, practically speaking, the dynamics of the system are not always available, and external disturbances deteriorate the system performance.

Proof. By defining 𝑊𝑡𝑜𝑡𝑎𝑙𝑗 = 𝑉𝑗 − 𝑊𝑗 , (6) can be presented as (7), 𝑚 ∑ 𝑗=1

( 𝑊𝑡𝑜𝑡𝑎𝑙𝑗 𝜑𝑗

𝑧 − 𝜇𝑗 𝜎𝑗

) ,

(11)

where 𝛬𝑘 (𝑘 = 1, … , 𝑛 − 1) is a positive constant. By differentiating (11) with respect to time, the following is attained, ( ) ( ) ( ) 𝑠̇ = −𝑓 𝑥 − 𝑔 𝑥 𝑢 + 𝑞𝑎 − 𝑑 𝑥, 𝑡 , (12)

Theorem 1. Consider 𝜑𝑗 ∶ R𝑛 → R in (6), which is an integrable function | |𝑝 such that ∫R𝑛 𝜑𝑗 (𝑥) 𝑑𝑥 ≠ 0 and ∫R𝑛 |𝜑𝑗 (𝑥)| 𝑑𝑥 < ∞, with 𝑝 ∈ [1, ∞), then | | 𝐸, defined by (6), is dense in 𝐿𝑝 (R𝑛 ).

𝐸 (𝑧) =

(10)

(7)

where 𝑊𝑡𝑜𝑡𝑎𝑙𝑗 ∈ R is the weight corresponding to the 𝑗th node. The definition of the RBF networks and the related theorems of Park and Sandberg (1993, 1991) are provided in Appendix. Comparing the output of the CRBE network in (7), with the output of the RBF network in (A.2), both are the weighted linear summation of the radial basis functions of the input variables. The weight vector in the CRBE network is defined by 𝑊𝑡𝑜𝑡𝑎𝑙𝑗 = 𝑉𝑗 − 𝑊𝑗 , and in RBF network by 𝑤𝑗 . Therefore, it can be concluded that the CRBE network is equivalent to the RBF network. Hence, the properties of Theorems A.1 and A.2 in Appendix are valid for the CRBENN, which means that 𝐸, defined by (6), is dense in 𝐿𝑝 (R𝑛 ). ■ 4

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Fig. 1. Scheme of proposed CRBENN structure.

As the ideal parameters 𝑉 ∗ and 𝑊 ∗ are unknown, the minimum approximation error is defined as, ( )( ( ) ) 𝜔𝑐 = −𝑔 𝑥 𝑢̂ 𝑥|[𝑉 ∗ 𝑊 ∗ ] − 𝑢∗ , (20)

Subsequently, the ideal controller (13) is not applicable. The actual intelligent controller based on CRBENN is introduced in the next section ( ) that considers the effect of 𝑑 𝑥, 𝑡 and uncertainties. Furthermore, a robust control term is added to the control design to reduce the effect ( ) of the external disturbance, 𝑑 𝑥, 𝑡 , and the approximation error.

where 𝜔𝑐 ∈ 𝐿∞ . According to (18) and (20), (19) can be written as, ( ) ( ) 𝑇 ̃ 𝜑 − 𝐾𝑠 + 𝑢𝑟 + 𝜔𝑐 − 𝑑, 𝑠̇ = 𝑔 𝑥 𝑉̃ 𝑇 𝜑 − 𝑔 𝑥 𝑊

5. The proposed control structure

(21)

Here, the proposed CRBENN is employed in a direct adaptive control structure to approximate the ideal control law 𝑢∗ in (13) as 𝑢. ̂ The overall direct adaptive radial basis emotional neuro controller (DARENC) is designed as follows, 𝑢 𝑢 = 𝑢̂ − (𝑟 ) , (14) 𝑔 𝑥

̃ = 𝑊 ∗ − 𝑊 , and 𝑉̃ = 𝑉 ∗ − 𝑉 . where 𝑊 As is described in Moren and Balkenius (2000), the weights of the Amygdala can only increase. Therefore, the following update laws are considered for the weights of the Amygdala and the OFC,

where 𝑢𝑟 is a robust compensator term and is defined as,

𝑊̇ = −𝛽𝑃 𝜑𝑔(𝑥)𝑠,

1 𝑢𝑟 = − 𝑃 𝑠, 𝑟

𝑉̇ = 𝛼𝑃 𝜑𝑔(𝑥) max (𝑠, 0) ,

(15)

Theorem 2. Consider the nonlinear system (9) with the control law (14), where 𝑢̂ is attained from (15), and 𝑃 is the solution of Riccati-like equation (16). Also, the weights of the CRBENN are updated by the adaptive laws (22)–(23). Then, the following 𝐻∞ tracking performance criterion is fulfilled for a pre-given attenuation level 𝜌, all the variables remain bounded, and the error asymptotically converges to zero.

where 2𝜌2 ≥ 𝑟. The emotional-based control law 𝑢̂ is designed as follows, 𝑚 ∑ 𝑗=1

𝑉𝑗 𝜑𝑗 −

𝑚 ∑

𝑊𝑗 𝜑𝑗 = (𝑉 − 𝑊 )𝑇 𝜑,

(17)

𝑗=1

It is assumed that 𝑥 and the adaptive parameters 𝑉 and 𝑊 belong { } ‖ to the following compact sets, respectively: 𝛺𝑥 = 𝑥||‖ ‖𝑥‖ ≤ 𝑀𝑥 , 𝛺𝑓 𝑣 = { } { } 𝑉 | ‖𝑉 ‖ ≤ 𝑀𝑣 , 𝛺𝑓 𝑤 = 𝑊 | ‖𝑊 ‖ ≤ 𝑀𝑤 ; where 𝑀𝑥 , 𝑀𝑣 , and 𝑀𝑤 are positive constants. We define the ideal parameters 𝑉 ∗ and 𝑊 ∗ as follows, [ ] [ ∗ ] ) ‖ ( ‖ (18) sup ‖𝑢̂ 𝑥|[𝑉 𝑊 ] − 𝑢∗ ‖ . 𝑉 𝑊 ∗ = arg min ‖ ‖ 𝑉 ∈𝛺 &𝑊 ∈𝛺 𝑣

(23)

where 𝛼 > 0 and 𝛽 > 0 are learning rates for weights of the Amygdala and the OFC, respectively. The max operator in (22) makes the update law of the Amygdala consistent with the basic non-decreasing learning rules. The following theorem shows the main contribution of this section.

where 𝑟 is a positive constant, and 𝑃 = 𝑃 𝑇 is a semi-positive definite matrix that is the unique solution of the following Riccati-like equation for any given 𝑄 = 𝑄𝑇 > 0, ( ) 1 2 −2𝐾𝑃 + 𝑄 + 𝑃 − 𝑃 = 0, (16) 2 𝑟 𝜌

𝑢̂ =

(22)

𝑇

∫0

𝑠𝑇 𝑄𝑠𝑑𝑡 ≤ 𝑠𝑇 (0) 𝑃 𝑠𝑇 (0) +

1 ̃𝑇 𝑉 (0) 𝑉̃ (0) 𝛼 𝑇

+

1 ̃ 𝑇 ̃ (0) + 𝜌2 𝑊 (0) 𝑊 𝜔𝑇 𝜔𝑑𝑡. ∫0 𝛽

(24)

where 𝜔 is the worst-case uncertainty that is defined below in (31).

𝑤

Proof. Consider the following Lyapunov function,

By substituting (14) in (12) and after some manipulations, the below equation is obtained for derivative of 𝑠, ( ) 𝑠̇ = −𝑔 𝑥 (𝑢̂ − 𝑢∗ ) − 𝐾𝑠 + 𝑢𝑟 − 𝑑 ( )( ( ) ( ) ) = −𝑔 𝑥 𝑢̂ − 𝑢̂ 𝑥|[𝑉 ∗ 𝑊 ∗ ] + 𝑢̂ 𝑥|[𝑉 ∗ 𝑊 ∗ ] − 𝑢∗ − 𝐾𝑠 + 𝑢𝑟 − 𝑑,

𝑉𝐿 =

1 ̃ 𝑇 ̃ 1 ̃𝑇 ̃ 1 𝑇 𝑠 𝑃𝑠 + 𝑉 𝑉 + 𝑊 𝑊. 2 2𝛼 2𝛽

(25)

The derivative of (25) with respect to time is computed as below, 1 1 ̃ . 𝑉̇ 𝐿 = 𝑃 𝑠𝑠̇ − 𝑉̇ 𝑇 𝑉̃ − 𝑊̇ 𝑇 𝑊 𝛼 𝛽

(19) 5

(26)

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Remark 3. To assure the boundedness of the adaptive parameters, the update laws (22)–(23) are modified based on the projection algorithm in Wang (1997) as follows,

By substituting 𝑠̇ from (21) in (26) and after simple manipulations the following is obtained, ( ) 𝑇 ) ( ( ) ̃ 𝜑 − 𝐾𝑠 + 𝑢𝑟 + 𝜔′ − 𝑑 𝑉̇ 𝐿 = 𝑃 𝑠 𝑔 𝑥 𝑉̃ 𝑇 𝜑 − 𝑔 𝑥 𝑊 1 1 ̃ − 𝑉̇ 𝑇 𝑉̃ − 𝑊̇ 𝑇 𝑊 𝛼 𝛽 ) ( ( ) ( ) 1 = −𝑃 𝐾𝑠2 + 𝑃 𝑠 𝜔𝑐 − 𝑑 + 𝑃 𝑠𝑢𝑟 + 𝑉̃ 𝑇 𝑃 𝑠𝑔 𝑥 𝜑 − 𝑉̇ 𝛼 ( ) ( ) 1 ̇ 𝑇 ̃ −𝑊 𝑃 𝑠𝑔 𝑥 𝜑 + 𝑊 . (27) 𝛽 Using (16) and substituting (15) in (27) gives (28), ) ( 1 𝑃 𝑉̇ 𝐿 = − 𝑄 + 𝑃 𝑠2 + (𝜔𝑐 − 𝑑)𝑃 𝑠 2 𝜌2 ( ) ( ) ( ) ( ) 1 ̃ 𝑇 𝑃 𝑠𝑔 𝑥 𝜑 + 1 𝑊̇ . + 𝑉̃ 𝑇 𝑃 𝑠𝑔 𝑥 𝜑 − 𝑉̇ − 𝑊 𝛼 𝛽

) ( ⎧ if ‖𝑉 ‖ < 𝑀𝑣 or ‖𝑉 ‖ = 𝑀𝑣 and 𝛼𝑃 𝑉 𝑇 𝜑 max (𝑠, 0) ≤ 0 ⎪𝛼𝑃 𝜑𝑔(𝑥) max (𝑠, 0) 𝑇 𝑉̇ = ⎨ 𝛼𝑃 𝑉 𝜑𝑔(𝑥) max (𝑠, 0) ⎪𝛼𝑃 𝜑𝑔(𝑥) max (𝑠, 0) − 𝑉 if ‖𝑉 ‖ = 𝑀𝑣 and 𝛼𝑃 𝑉 𝑇 𝜑 max (𝑠, 0) > 0, ⎩ ‖𝑉 ‖2

(34) ( ) ⎧ if ‖𝑊 ‖ < 𝑀𝑤 or ‖𝑊 ‖ = 𝑀𝑤 and 𝛽𝑃 𝑊 𝑇 𝜑𝑔(𝑥)𝑠 ≥ 0 ⎪−𝛽𝑃 𝜑𝑔(𝑥)𝑠 𝑊̇ = ⎨ 𝛽𝑃 𝑊 𝑇 𝜑𝑠𝑔(𝑥) ⎪−𝛽𝑃 𝜑𝑔(𝑥)𝑠 + 𝑊 if ‖𝑊 ‖ = 𝑀𝑤 and 𝛽𝑃 𝑊 𝑇 𝜑𝑔(𝑥)𝑠 < 0. ⎩ ‖𝑊 ‖2

(35)

The above update laws force the adaptive parameters to remain bounded. If the adaptive parameters are less than their maximum bounds (i.e. 𝑀𝑣 and 𝑀𝑤 for V and W, respectively), or if they are at their maximum but are moving toward smaller values, (34)–(35) turn to the ordinary update laws (22)–(23). Hence, the second lines of (34)– (35) are true only if the adaptive parameters are at their maximum bounds and move toward larger values. With the above adaptive laws, the proof of Theorem 1 is still satisfied. If the second line of (34) is true, 𝛼𝑃 𝑉 𝑇 𝜑𝑔(𝑥) max(𝑠,0) then the added term to (29) is 𝑉̃ 𝑇 𝑉 . As ‖𝑉 ‖ = 𝑀𝑣 , then

(28)

By substituting the adaptive laws (22)–(23) in (28), the following is then achieved. ( ) ( ) 𝑃 1 𝑄 + 𝑃 𝑠2 + (𝜔𝑐 − 𝑑)𝑃 𝑠 + 𝑉̃ 𝑇 (𝑠 − max(𝑠, 0)) 𝑃 𝜑𝑔 𝑥 . (29) 𝑉̇ 𝐿 = − 2 2 𝜌

‖𝑉 ‖2

Using 𝑠 − max (𝑠, 0) ≤ 𝑠, the following is obtained: ( ) 1 𝑃 𝑉̇ 𝐿 ≤ − 𝑄 + 𝑃 𝑠2 + (𝜔𝑐 − 𝑑)𝑃 𝑠 + 𝛥 (𝑠 − max(𝑠, 0)) 𝑃 . (30) 2 𝜌2 ( 𝑇 ) where 𝛥 = 𝑔1 sup𝑉 ∈𝛺𝑣 𝑉̃ 𝜑 . The new worst-case perturbation is introduced as below, ( ) 𝜔 = 𝜔𝑐 − 𝑑 + 𝛥sign 𝜔𝑐 − 𝑑 . (31) { ( ) 1 𝜔𝑐 − 𝑑 ≥ 0 . The new perturbation 𝜔 is where sign 𝜔𝑐 − 𝑑 = −1 𝜔𝑐 − 𝑑 < 0 bounded because 𝜔𝑐 and 𝑑 are bounded. Note that this sign function is only for theoretical stability analysis, and it does not appear in the structure of the controller. Then (30) turns to, ( ) 𝑃 1 𝑄 + 𝑃 𝑠2 + 𝜔𝑃 𝑠. (32) 𝑉̇ 𝐿 ≤ − 2 𝜌2

𝑉̃ 𝑇 𝑉 ≤ 0, and since 𝛼𝑃 𝑉 𝑇 𝜑 max (𝑠, 0) > 0, then 𝑉̃ 𝑇

𝛼𝑃 𝑉 𝑇 𝜑𝑔(𝑥) max(𝑠,0) ‖𝑉 ‖2

𝑉 ≤

0. Therefore, the added negative term does not conflict with the proof of the theorem. In a similar analysis, (if the second ) line of (35) is true, 𝑇 ̃ 𝑇 𝑃 𝑊 𝜑𝑠 𝑊 ≤ 0, which does not then the added term to (29) is −𝑊 2 ‖𝑊 ‖ affect the proof of Theorem 1. Remark 4. The computational complexity of the CRBENN is of the order 𝑂(𝑚 × 𝑛), where 𝑚 is the number of the basis functions in the Thalamus, and 𝑛 is the number of inputs to the network. The computational complexity of the CRBENN is similar to that of RBFNN. As the computational complexities of the adaptive laws and the control input are of the order 𝑂(𝑚), the computational complexity ( of) the proposed 1 controller is of the order 𝑂(𝑚 × 𝑛 × 𝑇𝑡 ), where 𝑇𝑡 = 𝑑𝑡 × 𝑇 , T is the total run time, and 𝑑𝑡 is the time step.

By adding and subtracting 12 𝜌2 𝜔2 in (32), one can conclude the following, ( )2 [ ] 1 1 1 1 1 1 𝑃 𝑠 − 𝜌𝜔 + 𝜌2 𝜔2 ≤ − 𝑄𝑠2 + 𝜌2 𝜔2 . (33) 𝑉̇ 𝐿 ≤ − 𝑄𝑠2 − 2 2 𝜌 2 2 2

6. Simulation and experimental results This section presents simulation studies on an inverted pendulum system (Case I–IV), the Duffing–Holmes chaotic system (Case V–VI), and the real-world experimental results on a three spherical–prismatic– spherical (3-PSP) robot.

By integrating (33) from 𝑡 = 0 to 𝑡 = 𝑇 , 𝐻∞ tracking performance criterion (24) is achieved, which means that the effects of uncertainties are kept below the desired level 𝜌. As 𝜔 ∈ 𝐿2 , by the aid of Barbalat’s Lemma in Khalil (1996), it could be proved that error function 𝑠 asymptotically converges to zero. ■ The overall scheme of the proposed direct adaptive controller is shown in Fig. 2.

6.1. Simulation results on the inverted pendulum system Here, the proposed controller (DARENC) is applied to an inverted pendulum system. The dynamics of this system is presented as follows, 𝑥̇ 1 = 𝑥2 ,

Remark 1. The focus of the current work is on the design of stable adaptive controllers based on the CRBENN. Accordingly, the weights of the Amygdala and the OFC are adapted by the Lyapunov derived laws of adaptation, while the RBF parameters (mean and smoothing factor) are fixed for simplicity and to reduce the number of adaptive parameters. Fewer parameters in turn speed up the adaptation process. In simulation results, the RBF centers are equally spaced in the range of the input to the network, and the smoothing factor for all of the Gaussian nodes are chosen the same. This consideration (fixed parameters and partitioning the centers in the range of the input to the network) is common in fuzzy or NN-based adaptive controllers for the reasons mentioned above. See for example Lin et al. (2009) and Pan et al. (2017, 2014).

𝑥̇ 2 = 𝑓 (𝑥) + 𝑔 (𝑥) 𝑢 + 𝑑 = +

( ) 𝑎 cos 𝑥1 4𝑙 3

− 𝑎𝑚𝑝

𝑙 cos2 (𝑥

( ) 𝑔𝑝 sin 𝑥1 − 4𝑙 3

2

− 𝑎𝑚𝑝 𝑙 cos2 (𝑥1 )

𝑢 + 𝑑, 𝑎 = 1)

𝑎𝑚𝑝 𝑙𝑥22 sin(2𝑥1 )

1 , 𝑚𝑝 + 𝑀

(36)

𝑦 = 𝑥1 + 𝑣, where 𝑑 is the disturbance, 𝑣 is the measurement noise, 𝑥1 and 𝑥2 are the states of the system, 𝑥1 is the angle of the pendulum from the vertical axis, and 𝑥2 is the angular velocity. Also, 𝑚𝑝 = 0.1 kg is the mass of the pendulum, 𝑀 = 1 kg is the mass of card, 𝑔𝑝 = 9.8 sm2 is the gravity constant, and 2𝑙 = 0.5 m is the length of the pendulum. The initial value of the states are [𝑥1 (0), 𝑥2 (0)] = [0.52, 0]. The goal is to design a controller such that the system states follow the desired reference trajectory 𝑥1𝑑 = sin(𝑡). The control parameters are determined by trial and error to achieve reasonable tracking error and control

Remark 2. The specific adaptive control-based approach of this work is an example of employing the CRBENN in the design of a controller. The CRBENN could be employed in other control or decision-making problems such as system identification, prediction, and classification 6

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Fig. 2. The overall scheme of the proposed method (DARENC). Table 1 The average, maximum, and minimum simulation computational time per control cycle (in milliseconds) required per sampling time of the controllers for the inverted pendulum system with sinusoidal disturbance. The bold numbers indicate better results.

energy consumption and are set at 𝑄 = 12, 𝛼 = 0.5, 𝛽 = 30, 𝑟 = 0.08, 𝛬1 = 3, and 𝜌 = 0.02; also from Riccati equation (16) 𝑃 = 6 is obtained. The initial values for the weights of the Amygdala are in the interval [0.1, 1] and the weights of the OFC nodes are set at zero. Also 𝑚 = 40, 𝜎𝑗 = 0.5 {𝑗 = 1, … , 40}, and 𝜇 ∈ [−2, 2]𝑇 . The input to the basis function 𝜑𝑗 , defined by (1), is defined to be the function of error 𝑠 in (11). The system performance is evaluated under various test conditions, i.e., no disturbance, sinusoidal disturbance, pulse disturbance, and noise with SNR = 35 dB. The MATLAB software is used for the simulation on a computer powered by Intel Core i5-3337U, 1.6 GHz. The sampling time is 0.01 s, and the total time of simulation is 30 s. The measurement criterion for consumed control energy is considered as follows, 𝐽=

𝑇 1 ∑ |𝑢 (𝑖)| . 𝑇 𝑖=1

Runtime

Mean Min Max

RBFNN-based controller

T1F controller (1996)

DFS (2014)

DARENC (proposed method)

0.009945 0.0098 0.0104

0.017285 0.0169 0.0179

0.47481 0.468 0.4971

0.014105 0.0138 0.0144

can be seen that the structure of the RBFNN is the same as having the structure of CRBENN with only the OFC part and with a negative sign. The parameters of the RBFNN-based controller are the same as the proposed controller. Also, the parameters in (39)–(40) are set at 𝜇𝑅𝑗 ∈ [−2, 2]𝑇 , 𝜎𝑅𝑗 = 1 {𝑗 = 1, … , 40}, and 𝛾𝑅𝑢 = 30. The parameters of the DFS are provided in Hsueh et al. (2014), and the parameters of the fuzzy-based controller are provided in Chen et al. (1996), but with learning rate 30 for fuzzy rules. The following subsections present simulation case studies of the final control system. As was mentioned before, BEL-based models have shown fast response with low computational time. Hence, it is worthwhile to compare the computational time of the proposed model with other methods. In this way, we examine the computational time of the proposed controller with the RBFNN-based controller, the direct adaptive type-1 fuzzy controller (T1F) in Chen et al. (1996), and the DFS in Hsueh et al. (2014). Table 1 shows the average, maximum, and minimum computational time required per sampling time for all of the controllers in Case II: simulations with sinusoidal disturbance. All of the simulations are repeated 20 times and under the same conditions. As this table shows, the DFS controller has the maximum computational time, as is reported in Hsueh et al. (2014). The authors of Hsueh et al. (2014) also designed a simplified decomposed fuzzy system (SDFS) that with a small rise in error, the computational time is reduced. Furthermore, Table 1 shows that the RBFNN-based controller has the smallest computational time. The computational time of the proposed controller is 42% higher than the computational time of the RBF-based controller and 19% lower than the computational time of the Type-1 fuzzy controller (1996). The higher computational time in comparison with the RBFNN-based controller is because of two sets of adaptive parameters, the Amygdala and the OFC weights.

(37)

where 𝑇 is the total simulation time. The maximum absolute value of the control input (max |𝑢|) is also measured and presented in Table 2. For comparative purposes and showing the benefits of the CRBENN structure, the proposed method is compared with other methods. The comparative controllers are an RBFNN-based controller designed with the same structure and parameters as the proposed DARENC, a seminal work in direct adaptive fuzzy control in Chen et al. (1996), and a direct fuzzy controller based on decomposed fuzzy system (DFS) in Hsueh et al. (2014). The fuzzy controllers are chosen because of similar structures and assumptions. The RBFNN-based controller is designed with the same procedure of the proposed DARENC as (14), except that 𝑢̂ in (14) is replaced by, ( ) 𝑇 𝑢̂ 𝑅𝐵𝐹 𝑥 = 𝑊𝑅𝑢 𝜙𝑅𝑢 , (38) [ ]𝑇 where 𝑊𝑅𝑢 = 𝑊𝑅𝑢1 , … , 𝑊𝑅𝑢𝑚 is the weight vector of the RBFNN, 𝑚 is the total number of nodes. Also, 𝜙𝑅𝑢 = [𝜙𝑅𝑢1 , 𝜙𝑅𝑢2 , … , 𝜙𝑅𝑢𝑚 ]𝑇 is the RBF vector, 𝜙𝑅𝑢𝑗 (𝑗 = 1, … , 𝑚) is defined as the commonly Gaussian functions, ( [( )𝑇 ( ) ]) 𝑥 − 𝜇𝑅𝑗 𝑥 − 𝜇𝑅𝑗 𝜙𝑅𝑢𝑗 = exp − , 𝑗 = 1, … , 𝑚, (39) 2 𝜎𝑅𝑗 where 𝑥 ∈ R𝑛 , 𝜇𝑅𝑗 ∈ R𝑛 and 𝜎𝑅𝑗 > 0. With a similar procedure, the adaptive law for the weights of the RBFNN-based controller is attained as follows, 𝑊̇ 𝑅𝑢 = 𝛾𝑅𝑢 𝜙𝑅𝑢 𝑃 𝑠,

Method

(40)

6.1.1. Case I : Simulation results on the inverted pendulum system with no disturbances In this case, the external disturbance and the measurement noise are zero (𝑑 = 0, 𝑣 = 0). Fig. 3 shows the trajectories of the states of the

where 𝛾𝑅𝑢 > 0, and 𝑠 is defined similar to the error dynamics (21), but with appropriate substitution of the RBFNN output instead of the CRBENN output. Comparing the RBFNN output (38) with the CRBENN output (17); and the adaptive law (40) with the adaptive law (23); it 7

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

system (𝑥1 and 𝑥2 ). As is shown, all of the controllers have satisfactory performance. Fig. 4 shows the error and the control input trajectory. According to this figure, the proposed method (DARENC) reaches the smallest tracking error, and the fuzzy controller (1996) depicts the largest error. The evolution of some weights of the Amygdala and OFC are illustrated in Fig. 5 with respect to time. This figure shows that the Amygdala weights are non-decreasing, but the OFC weights can both decrease and increase. Also, Table 2 shows the average control energy consumption (𝐽 ), the maximum absolute value of the control input (max |𝑢|), and the mean square error (MSE) for all of the controllers. According to the data in Table 2, the DARENC has the smallest MSE with slightly lower 𝐽 , while the fuzzy controller (1996) attains the largest 𝐽 and MSE, but lowest max |𝑢| among all. The DARENC has reached 7% higher max |𝑢| in comparison with the fuzzy controller (1996). Furthermore, in comparison with the RBFNN-based controller, the proposed controller has reached a lower MSE (by 18%), max |𝑢| (by 10%), and 𝐽 (by 2%). Having two subsystems, the Amygdala and the OFC, and interaction among them has resulted in better performance of CRBENN in comparison with the RBFNN-based controller that is constructed in the same way. 6.1.2. Case II: Simulation results on the inverted pendulum system with disturbance 𝑑(𝑡) = 4 sin(3𝜋𝑡) In this case, the system is exposed to the external sinusoidal disturbance 𝑑(𝑡) = 4 sin(3𝜋𝑡), but the measurement noise is zero (𝑣 = 0). Fig. 6 shows the trajectories of 𝑥1 and 𝑥2 . According to this figure, all of the controllers have satisfactory performance in the presence of the external disturbance. Also, Fig. 7 shows the error and the control input. According to this figure, the proposed approach reaches the smallest error, and the fuzzy controller (1996) attains the largest error, but the smallest max |𝑢|. In addition, according to Table 2, the proposed approach (DARENC) has reached the smallest MSE with slightly lower consumed control energy𝐽 . Again, the fuzzy controller (1996) shows the largest MSE and 𝐽 . In comparison with the RBFNN-based controller with the same structure, the proposed method again has shown lower MSE (by 18%), max |𝑢| (10%) and 𝐽 (by 2%).

Fig. 3. Case I. Trajectories of 𝑥1 and 𝑥2 for the inverted pendulum system without disturbance.

6.1.3. Case III: Simulation results on the inverted pendulum system with pulse disturbance In this case, the measurement noise is zero, but the system is exposed to the pulse disturbance 𝑑 (𝑡) = 5𝑢 (𝑡 − 15)−5𝑢(𝑡−17). Fig. 8 shows the trajectories of the states of the system. As the zoomed view shows, the proposed method has the quickest response in reducing the effect of the sudden disturbance. This is consistent with the fast response properties of the emotional models. Also, Fig. 9 presents the error and the control input for all of the controllers. As this figure shows, the proposed method has reached the lowest error in comparison with the other three controllers. The controllers have used similar control inputs (Fig. 9 (below)). According to the control input in Fig. 9 and the value of 𝐽 in Table 2, the proposed controller has used slightly lower control energy in comparison with the other three controllers. The fuzzy controller (1996) has reached the smallest max |𝑢|, while the DFS (2014) has the highest value. In comparison with RBFNN-based controller, the proposed method has reached a lower MSE (by 19%) by consuming 2% lower control energy (𝐽 ), and 10% lower max |𝑢|.

Fig. 4. Case I. The error (𝑒) (above) and control input (𝑢), (below) for the inverted pendulum system without disturbance.

all of the controllers. The RBFNN-based controller reaches lower, but comparable, MSE and 𝐽 in comparison with DFS (2014). In comparison with the RBFNN-based controller, the DARENC has reached a lower MSE (by 19%), lower 𝐽 (by 2%), and higher max |𝑢| (by 4%). 6.2. Simulation results on Duffing–Holmes chaotic system

6.1.4. Case IV: Simulation results on the inverted pendulum system with measurement noise Here, Gaussian white noise with SNR = 35 is added to the output (𝑥1 ) and the simulations are repeated 20 times for reliable results due to the random nature of noise. The results of MSE, 𝐽 , and max |𝑢| are averaged and gathered in Table 2. As the data in Table 2 shows, the proposed method reaches the smallest MSE, consumed control energy (𝐽 ), and max |𝑢| in comparison with three other controllers. The fuzzy controller (1996) has used the largest control energy to reduce the tracking error, but it also shows the largest MSE among

Here, the proposed method is applied to the Duffing–Holmes chaotic system in Lin and Chung (2015) with the following dynamics, √ 𝑥̈ = −0.25𝑥̇ + 𝑥 − 𝑥3 + 0.1 𝑥2 + 𝑥̇ 2 sin (𝑡) + 0.3 cos (𝑡) + 𝑢 (𝑡) + 𝑑 (𝑡) , 𝑦 = 𝑥 + 𝜐.

(41)

where 𝑑(𝑡) = 0.1 sin(𝑡) is the external disturbance. The total simulation time is 20 s with step time 0.01 s. The desired trajectory is 𝑥𝑑 (𝑡) = sin (1.1𝑡), and the initial values for the states of the system are set at 𝑥(0) = [0.25, 0.25]𝑇 . The proposed method is compared with two 8

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Table 2 Consumed control energy (𝐽 ), (Top), and Mean Square Error (1000*MSE), (Bottom) for the inverted pendulum system. Bold numbers indicate better results. Case

Criteria RBFNN-based controller

Case I: No disturbance Case II: Sinusoidal disturbance Case III: Pulse disturbance Case IV: Noise with SNR = 35(over 20 runs)

Mean Min Max

Case

DFS (2014)

DARENC (proposed)

max |𝑢|

𝐽

max |𝑢|

𝐽

max |𝑢|

𝐽

max |𝑢|

10.1551 10.3062 10.1051

36.6362 36.8693 36.6362

11.0602 11.1987 10.9928

30.5302 30.7245 30.5302

10.6215 10.8747 10.5805

42.8333 42.9131 42.8333

9.9197 10.0644 9.8592

32.9242 33.1615 32.9242

10.3308 9.950019 10.64889

43.88209 35.82268 57.46508

11.268 10.4578 12.1194

56.75474 41.78593 79.61289

10.99539 10.6581 11.5487

47.76601 41.44641 88.39882

10.08978 9.807832 10.54291

45.6858 39.23721 56.89615

MSE

Case I: No disturbance Case II: Sinusoidal disturbance Case III: Pulse disturbance Case IV: Noise with SNR = 35(over 20 runs)

T1F controller (1996)

𝐽

Mean Min Max

RBFNN-based controller

T1F controller (1996)

DFS (2014)

DARENC (proposed)

1.9717 2.0012 1.9976

4.0205 4.0782 4.0514

2.7259 3.0968 2.7983

1.6251 1.6332 1.6251

5.184665 4.091947 6.66381

7.14445 4.9986 10.6358

5.501335 4.0215 7.8099

4.218028 3.445608 5.611231

Fig. 5. Case I. The weights of the Amygdala and the OFC in CRBENN for the inverted pendulum system without disturbance.

Fig. 6. Case II. Trajectories of 𝑥1 and 𝑥2 for the inverted pendulum system with sinusoidal disturbance.

other emotional controllers named FBELC (Lin and Chung, 2015) and iFBEL (Fang et al., 2019). The parameters of FBELC and iFBEL are the same and provided in Lin and Chung (2015) and Zhao et al. (2019). Similarly, the parameters of the proposed method √ are set at 𝑄 =4, 0.005; also from 𝐾 = 2, 𝛬1 = 2, 𝛼 = 0.01, 𝛽 = 60, 𝑟 = 0.1, 𝜌 = Riccati equation (16) 𝑃 = 1 is obtained. The parameters of radial basis function in (1) are considered as: 𝑧 = 𝑠, 𝑚 = 31, 𝜇𝑗 ∈ [−0.5, 0.5] and 𝜎𝑗 = 0.25, (𝑗 = 1, … , 𝑚). The simulation results are compared in two cases: Case V: with disturbance, and Case VI: with measurement noise.

iFBEL (2019) and 30% lower MSE in comparison with FBELC (2015). The total control energy (𝐽 ) for all three controllers is approximately the same (but it is slightly lower for the proposed method). According to Table 3, the maximum absolute value of the control input (max |𝑢|) is considerably lower for the proposed method in comparison with the other two controllers. 6.2.2. Case VI: Simulation results on the chaotic system with measurement noise In this case, the external disturbance 𝑑(𝑡) = 0, but the Gaussian white noise with SNR=35 is considered as the measurement noise. Due to the random nature of noise and for reliable results, the simulations are repeated 20 times. The results of 𝐽 and MSE are averaged and presented in Table 3. As Table 3 shows, the proposed method has reached the lowest MSE in comparison with iFBEL (2019) (by 23%) and FBELC (2015) (by 42%). Additionally, comparing the values of 𝐽 in Table 3, the proposed method has used lower control energy in comparison with the other two controllers (13% lower than iFBEL and 10% lower than FBELC). Finally, the max |𝑢| in Table 3 is considerably lower for the proposed method (by 70%).

6.2.1. Case V: Simulation results on the chaotic system with sinusoidal disturbance In this case, 𝑣 = 0, but the system is exposed to the external sinusoidal disturbance. Fig. 10 shows the states of the system, and, Fig. 11 shows the control input and the error for the three controllers. As can be seen, all of the controllers track the desired trajectory. But according to Fig. 11, the proposed controller has reached a lower tracking error. Also, the evolution of some weights of the Amygdala and the OFC with respect to time are depicted in Fig. 12. The figure shows that the weights of the Amygdala can only increase, but the weights of the OFC can increase and decrease. As Table 3 shows the proposed controller has reached 26% lower MSE in comparison with 9

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Table 3 Consumed control energy (𝐽 ) and Mean Square Error (MSE), for the chaotic system. Bold numbers indicate better results. Case

Criteria max |𝑢|

𝐽

Case V: Sinusoidal disturbance Case VI: Noise with SNR = 35 (over 20 runs)

Mean Min Max

MSE

FBELC (2015)

iFBEL (2019)

Proposed

FBELC (2015)

iFBEL(2019)

Proposed

FBELC (2015)

iFBEL(2019)

Proposed

1.2217

1.2254

1.2073

49.9964

51.0504

5.9392

0.0020

0.0019

0.0014

2.462157 2.351954 2.565007

2.533367 2.356793 2.661897

2.216555 1.893844 2.964815

49.688 48.2597 52.20302

51.35641 49.21987 53.49309

15.40623 12.24214 22.01709

0.007791 0.004911 0.01188

0.005889 0.00427 0.008455

0.00454 0.003149 0.006025

full control cycle should not exceed 1 ms, from which a considerable portion is spent on sensing and actuating. Thus, it is challenging for a controller to meet low computational cost and fast update cycle. Fortunately, the proposed approach offers such a low computational burden and simple structure. This experimental benchmark is presented in Fig. 13. As the figure shows, the 3-PSP robot is fully symmetric. It has two fixed bases and one star-shaped platform. The angle between three branches of the starshaped platform is 120◦ . Three PSP legs connect the moving star to the fixed bases. Each leg has a linear actuated prismatic joint, a passive spherical joint, and a passive prismatic joint. Each leg is actuated by a motor, a gearbox, and a ball screw assembly. There are three active joints and twelve passive joints, where only the three active joints are controlled. The control systems should appropriately control the motors’ torques so that the end-effector tracks the desired trajectory (Rezaei et al., 2013). The dynamics of the robot is presented as follows, ( ) (42) 𝜏 = 𝑀𝑟 (𝛩) 𝛩̈ + 𝑉𝑟 𝛩, 𝛩̈ + 𝐺𝑟 (𝛩) 𝛩, where 𝛩 = [𝜃0 𝜃1 𝜃2 ] is the vector of angles of three motors in Radians. The motors are numbered from 0 to 2 as M0, M1, and M2. Also, ( ) 𝜏 is the torque vector, 𝑀𝑟 (𝛩) is the mass matrix, 𝑉𝑟 𝛩, 𝛩̈ contains centrifugal and Coriolis terms, and 𝐺𝑟 (𝛩) is the gravity vector. The detailed description of the robot and its parameters can be found in Rezaei et al. (2013). The overall control structure is depicted in Fig. 14. The error between the desired angle of motors 𝛩𝑑𝑒𝑠𝑖𝑟𝑒𝑑 and the actual angle 𝛩𝑎𝑐𝑡𝑢𝑎𝑙 is calculated as 𝑒 = 𝛩𝑑𝑒𝑠𝑖𝑟𝑒𝑑 − 𝛩𝑎𝑐𝑡𝑢𝑎𝑙 and fed to the controller. Then the control input (𝜏) is fed to the manipulator block. The manipulator output is the vector of linear coordinators 𝑞 = [𝑞1 𝑞2 𝑞3 ] related to the ball screws in meters. The angle of the 𝑗th motor and the linear coordinates are related by 𝜃𝑗 = 100𝜋𝑞𝑗 (𝑗 = 1, 2, 3). Here, the helix trajectory is considered as the desired trajectory for the end-effector

Fig. 7. Case II. The error (𝑒) (above) and control input 𝑢 (below) for the inverted pendulum system with sinusoidal disturbance.

6.3. Experimental results on the 3-PSP robot Because of the simple structure and low computations, the proposed controller is suitable for real-world implementation. In this way, the proposed controller is applied to a 3-PSP (Prismatic–Spherical– Prismatic) parallel robot at the robotic laboratory of Ferdowsi University of Mashhad. The robot has complexities and uncertainties such as complicated structure, kinematics, and dynamics, nonlinearities of the actuators, noise from sensor measurements, and friction. Also, the

Fig. 8. Case III. Trajectories of 𝑥1 and 𝑥2 for the inverted pendulum system with pulse disturbance. 10

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Fig. 9. Case III. The error (𝑒) and control input 𝑢 for the inverted pendulum system with pulse disturbance.

Fig. 11. Case V. The control input (above) and the error (below) for the chaotic system with external disturbance.

Fig. 10. Case V. Trajectories of 𝑥1 and 𝑥2 for the chaotic system with external disturbance.

Fig. 12. Case V. The weights of the Amygdala and the OFC in CRBENN for the chaotic system with external disturbance.

of the robot. Using the inverse Kinematics of the robot, the desired trajectory is converted to the desired angles of the motors. The overall computation per cycle has to be below 0.001 s. This is confirmed by computing the simulation computational time of the 3-PSP robot in the MATLAB software. All of the codes for the real robot are written in C++ software, which has a faster language environment than the MATLAB software, on a computer with a 3.0 GHz Pentium 4 processor. In addition, the maximum voltage limit that the drivers fed to the motors is ±10 V (Volts). As the experimental results show (voltage of motor in Fig. 16), the voltages of the motors are below ±1 V, which is well below the ±10 V. The function of the error 𝑠 is the input to the CRBENN. The control parameters are determined by trial and error to achieve reasonable tracking error and control energy consumption and are set at 𝑃 = 0.001, 𝛼 = 5, 𝛽 = 50, 𝑟 = 0.5, 𝛬1 = 7, and 𝜌 = 0.5. The initial values for the weights of the Amygdala and the OFC nodes are set at real values between [−2, 2]. Also 𝑚 = 5, 𝜎𝑗 = 1 {𝑗 = 1, … , 5}, and 𝜇 = [−3, −1.55, 0, 1.5, 3]𝑇 .

The average control energy 𝐽𝑗 (𝑗 = 1, 2, 3) and MSE (mean square error) are considered as measurement criteria. 𝐽𝑗 is defined as, 𝑇

𝐽𝑗 =

1 | | |𝑢 (𝑡)| 𝑑𝑡, 𝑗 (𝑚𝑜𝑡𝑜𝑟 𝑖𝑛𝑑𝑒𝑥) = 1, 2, 3, 𝑇 ∫0 | 𝑗 |

(43)

where 𝑢𝑗 the voltage of 𝑗th motor. Fig. 15 shows the desired and actual trajectories of motors angles for the proposed controller (DARENC). This figure and the zoomed view of the dotted area confirm that the controller makes the angles of motors to follow the desired angle trajectory. The voltage of motors is depicted in Fig. 16 and 𝐽𝑗 (𝑗 = 1, 2, 3) is computed and presented in Table 4. As Fig. 16 shows, almost over all the time length of the experiment, the voltages of the motors remain below ±1𝑉 . Also, the error between the desired angle of the motors and the actual ones is shown in Fig. 17, and MSE is provided in Table 4. The error has its largest value, which 11

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Fig. 13. The experimental 3-PSP robot at the Ferdowsi University of Mashhad.

Fig. 14. The closed-loop control structure for the 3-PSP robot (Baghbani et al., 2016). Table 4 Average motor voltage (𝐽𝑗 ) and 𝑀𝑆𝐸𝑗 for experimental motor voltage. Motor #

Criteria 𝐽

MSE

Motor 1 Motor 2 Motor 3

0.3525 0.2672 0.2563

0.0028 0.0018 0.0013

We have also experimentally applied the DARENC to an actual 3PSP robotic benchmark. The results show the successful performance of the proposed controller in making the robot to track the helix trajectory, as the desired trajectory of the robot end-effector, with a low control energy consumption. Additionally, DARENC is suitable to be applied in real-world experiments because of its simple structure and lower computations compared with fuzzy-based controllers and comparable computational time with RBFNN as was discussed in Section 6.1 according to Table 1.

is still reasonably small, at times with sudden changes of the desired trajectory. The MSE for the three motors is well below 0.01.

7. Conclusion The proposed CRBENN benefits from the radial basis structure in the nodes of the Thalamus, which makes it a transparent and general structure. It also avoids a direct connection from the Thalamus to the Amygdala, which leads to its continuous output mapping. From these two basic properties, the CRBENN becomes mathematically equivalent to the RBFNN, and therefore its universal approximation property is straightforwardly proved based on the universal approximation property of the RBF networks. This is while the CRBENN remains consistent with the basic laws of the emotional brain, i.e., the increasing update laws of the Amygdala and the dual critical analysis by the OFC and the Amygdala. Accordingly, CRBENN is a modeling paradigm that has universal approximation property, simple structure, continuous and differentiable output with respect to the weights, and the established capabilities of the emotional structures. As shown in this work, CRBENN is also amenable to theoretical analysis. To illustrate, it is employed to approximate the control law directly in a direct adaptive control structure DARENC; the stability of the closed-loop system with DARENC is proved using the Lyapunov stability theory, and suitable update laws that are consistent with the

6.4. Discussion

In this section, the proposed method is applied to an inverted pendulum system and the Duffing–Holmes chaotic system, and its performance is compared with other fuzzy, neuro, and emotional approaches. Simulation studies on the inverted pendulum system confirm that the proposed method consistently reaches better tracking performance and lower control energy consumption while achieving lower computational time in comparison with type-1 and type-2 fuzzy controllers. DARENC has more degrees of freedom in comparison with an RBF-based controller due to its two sets of adaptive parameters (the Amygdala and the OFC weights), which could lead to a slightly higher computational time. The simulation results on Duffing–Holmes chaotic system also confirm the superiority of the proposed controller in lower tracking error and control energy consumption in comparison with two other emotional controllers. 12

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

Fig. 15. The trajectories of motor angles (𝜃0 𝜃1 𝜃2 ) for the 3-PSP robot (Left). The zoomed view of the dotted area (Right).

Fig. 17. The error 𝑒 = 𝛩𝑑𝑒𝑠𝑖𝑟𝑒𝑑 − 𝛩𝑎𝑐𝑡𝑢𝑎𝑙 for three motors of the 3-PSP robot. Fig. 16. The voltage of three motors (M0, M1, and M2) of the 3-PSP robot.

Appendix. Preliminaries on the approximation property of RBF networks

basic BEL model are designed for the Amygdala and the OFC weights. In the future, we hope to present new controllers, where the parameters of the radial basis functions, i.e., the mean, smoothing factor, and the number of nodes, could also be adaptively updated. In addition, we aim to investigate further the emotion-based control structures that employ emotional neural networks in different nonlinear control structures such as predictive or optimal control, also combined with other robust terms such as sliding mode or supervisory control.

Several studies have addressed the universal approximation property of the RBF networks. Hartman and his colleagues were some of the pioneering works in this regard (Hartman et al., 1990). They used Stone–Weierstrass Theorem (Stone, 1948) to show the universal approximation property of neural networks with a single hidden layer of Gaussian type. Similarly, it is shown in Girosi and Poggio (1990) that Stone–Weierstrass Theorem holds for Gaussian RBF networks with different smoothing factors. The following is an adaptation from the theoretical development in Park and Sandberg (1991, 1993) on the universal approximation property of radial basis functions (RBF) networks.

CRediT authorship contribution statement F. Baghbani: Conceptualization, Methodology, Software. M.-R. Akbarzadeh-T: Supervision, Conceptualization, Methodology. M.-B. Naghibi-Sistani: Supervision. Alireza Akbarzadeh: Investigation, Validation, Resources.

Consider a family of the RBF networks as follows, 𝑞 (𝑧) =

𝑚 ∑ 𝑗=1

13

𝑤𝑗 𝜙

(𝑧 − 𝜇) 𝜎

,

(A.1)

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447

where 𝑚 ∈ N is the number of the kernel nodes, in which N denotes the set of natural numbers, 𝑧 ∈ R𝑛 is an input vector, 𝜙 is a radially symmetric kernel function, 𝑤𝑗 ∈ R is the weight corresponding to the 𝑗th node, 𝜎 > 0 is the smoothing factor, and 𝜇 ∈ R𝑛 is the centroid. In Park and Sandberg (1991), this family is called 𝑆𝐾 . Park and Sandberg (1991) proved that based on some mild conditions on 𝜙, the RBF network described by (A.1) can approximate any function in 𝐿𝑝 (R𝑛 ) arbitrarily well. The term 𝐿𝑝 (R𝑛 ) denotes the usual spaces of R-valued maps 𝑓 defined on R𝑛 such that 𝑓 is 𝑝th power integrable, essentially bounded, and continuous with a compact support.

Fang, W., Chao, F., Lin, C., Yang, L., Shang, C., Zhou, C., 2019. An improved fuzzy brain emotional learning model network controller for humanoid robots. Front. Neurorobot. 13, 1–16. http://dx.doi.org/10.3389/fnbot.2019.00002. Garmsiri, N., Sepehri, N., 2014. Emotional learning based position control of pneumatic actuators. Intell. Autom. Soft Comput. 20, 433–450. Girosi, F., Poggio, T., 1990. Networks and the best approximation property. Biol. Cybernet. 63, 169–176. Gunapriya, B., Sabrigiriraj, M., 2017. Real-time implementation and performance evaluation of brain emotional learning developed for FPGA-based PMBLDC motor drives dynamic model of PMBLDC motor. J. Test. Eval. 45, 987–1004. http: //dx.doi.org/10.1520/JTE20150269. Hartman, E.J., Keeler, J.D., Kowalski, J.M., 1990. Layered neural networks with gaussian hidden units as universal approximations. Neural Comput. 2, 210–215. Hornik, K., 1989. Multilayer feedforward networks are universal approximators. Neural Netw. 2, 359–366. Hsu, C., Lee, T., 2017. Emotional fuzzy sliding-mode control for unknown nonlinear systems. Int. J. Fuzzy Syst. 19, 942–953. http://dx.doi.org/10.1007/s40815-0160216-7. Hsueh, Y., Su, S., 2012. Learning error feedback design of direct adaptive fuzzy control systems. IEEE Trans. Fuzzy Syst. 20, 536–545. Hsueh, Y., Su, S., Chen, M., 2014. Decomposed fuzzy systems and their application in direct adaptive fuzzy control. IEEE Trans. Cybern. 44, 1772–1783. Hsueh, Y., Su, S., Tao, C.W., Hsiao, C., 2010. Robust L2 -gain compensative control for direct-adaptive fuzzy-control-system design. IEEE Trans. Fuzzy Syst. 18, 661–673. Jafari, M., Xu, H., 2019. A biologically-inspired distributed resilient flocking control for multi-agent system with uncertain dynamics and unknown disturbances. Eng. Appl. Artif. Intell. 79, 1–12. http://dx.doi.org/10.1016/j.engappai.2018.12.001. Jafarzadeh, S., Motlagh, M.R.J., Barkhordari, M., Mirheidari, R., 2008. A new Lyapunov based algorithm for tuning BELBIC controllers for a group of linear systems. 16th Mediterranean Conference on Control and Automation 593–595. Khalghani, M.R., Khooban, M.H., 2014. A novel self-tuning control method based on regulated bi-objective emotional learning controller’s structure with TLBO algorithm to control DVR compensator. Appl. Soft Comput. J. 24, 912–922. http: //dx.doi.org/10.1016/j.asoc.2014.08.051. Khalghani, M.R., Khooban, M.H., Mahboubi-moghaddam, E., Vafamand, N., Goodarzi, M., 2016. A self-tuning load frequency control strategy for microgrids: Human brain emotional learning. Int. J. Electr. Power Energy Syst. 75, 311–319. Khalil, H.K., 1996. Nonlinear Systems, second ed. Prentice Hall, Upper Saddle River. Khooban, M., Javidan, R., 2016. A novel control strategy for DVR : Optimal bi-objective structure emotional learning. Int. J. Electr. Power Energy Syst. 83, 259–269. Khorashadizadeh, S., Hashem Zadeh, S.M., Koohestani, M.R., Shekofteh, S., Erkaya, S., 2019. Robust model-free control of a class of uncertain nonlinear systems using BELBIC: stability analysis and experimental validation. J. Braz. Soc. Mech. Sci. Eng. 41, 1–12. http://dx.doi.org/10.1007/s40430-019-1824-6. Le, T., Lin, C., Huynh, T., 2018. Self-evolving type-2 fuzzy brain emotional learning control design for chaotic systems using PSO. Appl. Soft Comput. J. 73, 418–433. http://dx.doi.org/10.1016/j.asoc.2018.08.022. Lin, C., Chung, C., 2015. Fuzzy brain emotional learning control system design for nonlinear systems. Int. J. Fuzzy Syst. 17, 117–128. Lin, T.-C., Liu, H.-L., Kuo, M.-J., 2009. Direct adaptive interval type-2 fuzzy control of multivariable nonlinear systems. Eng. Appl. Artif. Intell. 22, 420–430. Lotfi, E., 2018. Generalized BELBIC. Neural Comput. Appl. 1, 1–17. http://dx.doi.org/ 10.1007/s00521-018-3352-1. Lotfi, E., Akbarzadeh-T, M.-R., 2014. Practical emotional neural networks. Neural Netw. 59, 61–72. Lotfi, E., Akbarzadeh-T, M.-R., 2016. A winner-take-all approach to emotional neural networks with universal approximation property. Inf. Sci. (Ny) 346, 369–388. Lucas, C., Shahmirzadi, D., Sheikholeslami, N., 2004. Introducing BELBIC : Brain emotional learning based intelligent controller. Intell. Autom. Soft Comput. 10, 11–22. Markadeh, G.R.A., Daryabeigi, E., Lucas, C., Rahman, M.A., 2011. Speed and flux control of induction motors using emotional intelligent controller. IEEE Trans. Ind. Appl. 47, 1126–1135. Mehrabian, A.R., Lucas, C., 2008. Emotional learning based intelligent robust adaptive controller for stable uncertain nonlinear systems. Int. Sch. Sci. Res. Innov. 2, 892–898. Moren, J., 2002. Emotion and Learning- A Computational Model of the Amygdala. Lund University, Lund, Sweden. Moren, J., Balkenius, C., 2000. A computational model of emotional learning in the amygdala. In: From Animals to Animats 6 Proceedings of the Sixth International Conference on Simulation of Adaptive Behavior. MIT Press, Cambridge, MA. Nahian, S.A., Truong, D.Q., Ahn, K.K., 2014. A self-tuning brain emotional learning– based intelligent controller for trajectory tracking of electrohydraulic actuator. Proc. Inst. Mech. Eng. I 228, 461–475. http://dx.doi.org/10.1177/0959651814530275. Pan, Y., Er, M.J., Sun, T., Xu, B., Yu, H., 2017. Adaptive fuzzy PD control with stable H∞ tracking guarantee. Neurocomputing 237, 71–78. http://dx.doi.org/10.1016/j. neucom.2016.08.091. Pan, Y., Yu, H., Er, M.J., 2014. Adaptive neural PD control with semiglobal asymptotic stabilization guarantee. IEEE Trans. Neural Netw. Learn. Syst. 25, 2264–2274. Park, J., Sandberg, I.W., 1991. Universal approximation using radial-basis-function networks. Neural Comput. 3, 246–257.

Theorem A.1 (Park and Sandberg, 1991). Let 𝜙 ∶ R𝑛 → R be an integrable bounded function such that 𝜙 is continuous almost everywhere and ∫R𝑛 𝜙(𝑥) 𝑑𝑥 ≠ 0. Then the family of 𝑆𝐾 is dense in 𝐿𝑝 (R𝑛 ) for every 𝑝 ∈ [1, ∞). Proof. Refer to Park and Sandberg (1991), proof of Theorem 1.

■

In (A.1), the smoothing factor 𝜎 for all the kernels is the same. In Park and Sandberg (1993), the universal approximation property of the RBF networks is proved when 𝜎 has different values for each kernel node, as follows, ( ) 𝑚 ∑ 𝑧−𝜇 𝑞1 (𝑧) = 𝑤𝑗 𝜙 𝑗 , (A.2) 𝜎𝑗 𝑗=1 where the parameters are defined the same as in (A.1), but with different smoothing factors 𝜎𝑗 > 0 for each kernel node. In Park and Sandberg (1993), this family is called 𝑆1 . Theorem A.2 (Park and Sandberg, 1993).With 𝑝 ∈ [1, ∞), let 𝜙𝑗 ∶ R𝑛 → |𝑝 | R be an integrable function such that ∫R𝑛 𝜙𝑗 (𝑥) 𝑑𝑥 ≠ 0 and ∫R𝑛 |𝜙𝑗 (𝑥)| 𝑑𝑥 < | | 𝑝 𝑛 ∞, then 𝑆1 , defined by (A.2), is dense in 𝐿 (R ). Proof. See proof of Proposition 1 in Park and Sandberg (1993).

■

References Akhormeh, A.N., Roshanian, J., MoradiMaryamnegari, H., Khoshnood, A.M., 2019. Online and stable parameter estimation based on normalized brain emotional learning model (NBELM). Int. J. Adapt. Control Signal Process. 33, 1047–1065. http://dx.doi.org/10.1002/acs.3009. Baghbani, F., Akbarzadeh-T, M.-R., Akbarzadeh, A., Ghaemi, M., 2016. Robust adaptive mixed H 2/H∞ interval type-2 fuzzy control of nonlinear uncertain systems with minimal control effort. Eng. Appl. Artif. Intell. 49, 1–26. http://dx.doi.org/10.1016/ j.engappai.2015.12.003. Baghbani, F., Akbarzadeh-T, M.-R., Sistani, M.-B.N., 2018. Stable robust adaptive radial basis emotional neurocontrol for a class of uncertain nonlinear systems. Neurocomputing 309, 11–26. http://dx.doi.org/10.1016/j.neucom.2018.03.051. Castro, J.L., 1995. Fuzzy logic controllers are universal approximators. IEEE Trans. Syst. Man Cybern. 25, 629–635. Chen, B., Lee, C., Chang, Y., 1996. H∞ tracking design of uncertain nonlinear SISO systems: Adaptive fuzzy approach. IEEE Trans. Fuzzy Syst. 4, 32–43. Chung, C., Lin, C., 2015. Fuzzy brain emotional cerebellar model articulation control system design for multi- input multi-output nonlinear. Acta Polytech. Hungar. 12, 39–58. Daryabeigi, E., Abjadi, N.R., Markadeh, G.R.A., 2014. Automatic speed control of an asymmetrical six-phase induction motor using emotional controller (BELBIC). J. Intell. Fuzzy Syst. 26, 1879–1892. Daryabeigi, E., Mirzaei, A., Abootorabizarchi, H., Vaez-Zadeh, S., 2019. Enhanced emotional and speed deviation control of synchronous reluctance motor drives. IEEE Trans. Energy Convers. 34, 604–612. http://dx.doi.org/10.1109/TEC.2018. 2874486. Dehkordi, B.M., Kiyoumarsi, A., Hamedani, P., Lucas, C., 2011. A comparative study of various intelligent based controllers for speed control of IPMSM drives in the field-weakening region. Expert Syst. Appl. 38, 12643–12653. El-Garhy, A.M., El-Shimy, Mohamed E., 2015. BELBIC for MRAS with highly non-linear process. Alex. Eng. J. 54, 7–16. El-saify, M.H., El-sheikh, G.A., El-Garhy, A.M., 2017. Novel multi-input multi-output brain emotional learning based intelligent controller for PUMA 560 robotic arm. Adv. Intell. Syst. Comput. 639, 81–93. http://dx.doi.org/10.1007/978-3-31964861-3. 14

F. Baghbani, M.-R. Akbarzadeh-T, M.-B. Naghibi-Sistani et al.

Engineering Applications of Artificial Intelligence 89 (2020) 103447 Slotine, J.-J.E., Li, W., 1991. Applied Nonlinear Control. Prentice Hall. Soreshjani, M.H., Markadeh, G.A., Daryabeigi, E., Abjadi, N.R., Kargar, A., 2015. Application of brain emotional learning-based intelligent controller to power flow control with thyristor-controlled series capacitance. IET Gener. Transm. Distrib. 9, 1964–1976. http://dx.doi.org/10.1049/iet-gtd.2014.0986. Stone, M.H., 1948. The generalized weierstrass approximation theorem. Math. Mag. 21, 167–184. Wang, L.-X., 1997. A Course in Fuzzy Systems and Control. Prentice-Hall International, Inc.. Wu, Q., Lin, C., Fang, W., Chao, F., Yang, L., Shang, C., Zhou, C., 2018. Self-organizing brain emotional learning controller network for intelligent control system of mobile robots. IEEE Access 6, 59096–59108. http://dx.doi.org/10.1109/ACCESS. 2018.2874426. Zhao, J., Lin, C., Chao, F., Lin, C., 2019. Wavelet fuzzy brain emotional learning control system design for MIMO uncertain nonlinear systems. Front. Neurosci. 12, 1–14. http://dx.doi.org/10.3389/fnins.2018.00918.

Park, J., Sandberg, I.W., 1993. Approximation and radial-basis-function networks. Neural Comput. 5, 305–316. Parsapoor, M., 2019. An introduction to brain emotional learning inspired models (BELiMs) with an example of BELiMs ’ applications. Artif. Intell. Rev. 52, 409–439. http://dx.doi.org/10.1007/s10462-018-9638-y. Rezaei, A., Akbarzadeh, A., Mahmoodi Nia, P., Akbarzadeh-T, M.-R., 2013. Position, Jacobian and workspace analysis of a 3-PSP spatial parallel manipulator. Robot. Comput. Integr. Manuf. 29, 158–173. Rouhani, H., Jalili, M., Araabi, B.N., Eppler, W., Lucas, C., 2007. Brain emotional learning based intelligent controller applied to neurofuzzy model of micro-heat exchanger. Expert Syst. Appl. 32, 911–918. Senthilkumar, S., Vijayan, S., 2014. Impact of using a novel emotional intelligent controller for induction motor speed control. J. Test. Eval. 42, 1–15. Shahmirzadi, D., Langari, R., 2005. Stability of Amygdala Learning System Using CellTo-Cell Mapping Algorithm. In: Proceeding of IASTED International Conference on Intelligent Systems and Control, October 31-November 2, Cambridge, MA. Sharbafi, M.A., Lucas, C., Daneshvar, R., 2010. Motion control of omni-directional threewheel robots by brain-emotional-learning-based intelligent controller. IEEE Trans. Syst. Man Cybern. C 40, 630–638.

15

Emotional neural networks with universal approximation property for stable direct adaptive nonlinear control systems

Emotional neural networks with universal approximation property for stable direct adaptive nonlinear control systems

Recommend Documents