A novel online affine model identification of multivariable processes using adaptive neuro-fuzzy networks

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169 Contents lists available at ScienceDirect Chemical Engineering Research and Design ...

Download PDF

630KB Sizes 0 Downloads 23 Views

Report

PDF Reader
Full Text

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

Contents lists available at ScienceDirect

Chemical Engineering Research and Design journal homepage: www.elsevier.com/locate/cherd

A novel online afﬁne model identiﬁcation of multivariable processes using adaptive neuro-fuzzy networks Karim Salahshoor ∗ , Morteza Hamzehnejad Department of Automation and Instrumentation, Petroleum University of Technology, Satarkhan st., Tehran, Iran

a b s t r a c t This paper presents a novel systematic identiﬁcation methodology for online afﬁne modeling of multivariable processes using adaptive neuro-fuzzy networks. The proposed approach introduces an integrated procedure to simultaneously estimate a number of adaptive neuro-fuzzy networks with simple and compact dynamic structures to realize a multivariable afﬁne model identiﬁcation in real-time. A new fuzzy rule signiﬁcance concept, based on a generic time-weighted rule activation record (WRAR), together with a measure of time-weighted root mean square (WRMS) error are incorporated to maintain efﬁcient structural and parametric mechanisms for proper adaptation of the resulting neuro-fuzzy networks. An extended Kalman ﬁlter (EKF) algorithm is developed to adaptively adjust the neuro-fuzzy free parameters corresponding to the nearest created fuzzy rules. Extensive simulation test studies will be conducted to explore the capabilities of the proposed identiﬁcation approach to adaptively develop online multivariable afﬁne dynamic models for a highly nonlinear and time-varying continues stirred tank reactor (CSTR) and a highly nonlinear binary distillation column as two challenging benchmark problems. © 2009 The Institution of Chemical Engineers. Published by Elsevier B.V. All rights reserved. Keywords: Online identiﬁcation; MIMO afﬁne model; Adaptive neuro-fuzzy model; EKF

1.

Introduction

The need to designing robust and simple control systems has motivated the control community to develop satisfactory model-based control design methodologies for nonlinear processes. Feedback linearization presents a typical control design techniques (Norgaard et al., 2003) which has drawn much attention in recent years. Application of this control design technique, however, is restricted to a certain class of processes whose dynamic mathematical models can be represented by the following “afﬁne structural form”: y(k) = fk (Xk ) + gk (Xk )u(k)

(1)

in which a linear relationship exists between the process output (y) and the process input (u). Where Xk denotes a vector of regressors containing past inputs and outputs, conﬁgured at the kth time instant. fk (·) and indicate general nonlinear functions whose models are time-varying as the nonlinear process evolves over time. Despite the model structural limitation, it has been illustrated that this class of dynamic representa-

∗

tion is still very broad for a wide range of physical nonlinear processes (Chen and Khalil, 1995; Narendra, 1992; Henson and Seborg, 1997; Sanner and Slotine, 1995). This puts more emphasizes to develop efﬁcient algorithms to ﬁnd a solution for exact afﬁne modeling of any nonlinear afﬁne or nonafﬁne process. Most of industrial processes, however, have inherently non-afﬁne dynamic model structures. Therefore, developing expedient models for the time-varying nonlinear functions fk (·) and gk (·) is not a trivial task. One solution for afﬁne modeling of a non-afﬁne process is based on using the approximate Taylor series expansion of process dynamic model around its input, yielding to NARMR-L2 model proposed by Narendra and Mukhopodhyay (1997). Many industrial processes have multi-input, multi-output (MIMO) dynamic model structures with time-varying characteristics. This makes the development of an exact afﬁne-type model for such practical processes much more complicated and challenging subject. In this work, a new online identiﬁcation algorithm is proposed to derive an adaptive afﬁne model for MIMO nonlinear and time-varying processes based on a self-organizing neuro-fuzzy approach. The proposed algo-

Corresponding author. Tel.: +98 21 44236062. E-mail address: [email protected] (K. Salahshoor). Received 24 February 2009; Received in revised form 12 July 2009; Accepted 15 July 2009 0263-8762/$ – see front matter © 2009 The Institution of Chemical Engineers. Published by Elsevier B.V. All rights reserved. doi:10.1016/j.cherd.2009.07.009

156

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

the required model accuracy via appropriate rule addition, modiﬁcation and pruning mechanisms in a real-time manner. An EKF parameter learning algorithm is developed to adaptively adjust the free network coordinates, corresponding to the already created fuzzy rules which are nearest to the incoming process input pattern. This scheme implements the required rule modiﬁcation strategy by capturing the most recent dynamic process data in the identiﬁed set of f ik (·) and gijk (·) functions given by Eq. (2). In case of need for any network structural improvement, two model learning schemes are incorporated to realize appropriate online rule addition or replacement and rule pruning adaptations for multivariable processes. However, to avoid from any inaccurate model structural decision, a complementary measure is utilized in terms of a time-weighted root mean square (WRMS) error measure to assess the model inadequacy at each sampling time instant. The rest of this paper is organized as follows: a new online identiﬁcation algorithm is presented in Section two to develop an afﬁne-type model for multivariable processes. The EKF parameter learning algorithm is presented in Section three. A set of simulation studies are carried out in Section four to evaluate the performance of the proposed online afﬁne model identiﬁcation algorithm on a CSTR and a distillation column as two highly nonlinear multivariable benchmark problems. Section ﬁve summarizes the resulting concluding remarks.

Nomenclature h (t) C (t) CA qc q CAf Tf Tcf Li and Vi xi and yi

fouling coefﬁcient deactivation coefﬁcient efﬂuent concentration, the controlled variable coolant ﬂow rate, the manipulated variable feed ﬂow rate, disturbance feed concentration feed temperatures coolant inlet temperature liquid and vapor ﬂow from stage i [kmol/min] liquid and vapor composition of light component on stage i Mi liquid holdup on stage i [kmol] D and B distillate (top) and bottoms product ﬂow rate [kmol/min] L = LT and V = VB reﬂux ﬂow and boilup ﬂow [kmol/min] F and zF Feed rate [kmol/min] and feed composition [mole fraction] qF fraction of liquid in feed ˛ (in distillation column) relative volatility between light and heavy component taul time constant [min] for liquid ﬂow dynamics on each stage (in distillation column) constant for effect of vapor ﬂow on liquid ﬂow (“K2-effect”)

2. A new online multivariable afﬁne model identiﬁcation approach based on an adaptive neuro-fuzzy network

rithm presents a novel procedure to integrate simultaneous identiﬁcation of a number of adaptive neuro-fuzzy models to estimate the unknown functions f ik (·) and gijk (·), characterizing the afﬁne-type dynamic model structure between the jth input uj ∈ R and the ith output yi ∈ R in a real-time manner at the kth sampling time, leading to

yi (k) = f ik (Xk ) + gijk (Xk ) uj (k),

i = 1, . . . , b, j = 1, . . . , a.

(2)

This approach has several interesting advantages. It enables a multivariable afﬁne-type model to be identiﬁed adaptively for a nonlinear MIMO process without a priori dynamic knowledge requirement. Thus, the dimensions of neuro-fuzzy networks are not determined a priori to bet ter model the unknown nonlinear functions f ik (·) and gijk (·) in Eq. (2). The neuro-fuzzy networks have adaptive structural and parametric capabilities so as to recursively follow up the process time-varying and nonlinear dynamic characteristics with a small set of time-updated fuzzy rules. This is a major advantage over other conventional neuro-fuzzy modeling approaches in which their networks should be conservatively predetermined based on the highest dimension requirements to cover all the possible process working conditions. Hence, the proposed approach possesses enough ﬂexibility to efﬁciently deal with nonlinear and time-varying process dynamics in a wider range of operating conditions, leading to a compact network model structure with lower realtime computational burden. The proposed online multivariable identiﬁcation algorithm makes use of a new fuzzy rule signiﬁcance concept, expressed in terms of a weighted rule activation record (WRAR) measure, to provide a generic time-based contribution record with forgetting factor feature for online performance evaluation of each created fuzzy rule during its life-time. This incorporates an adaptive rule management strategy to maintain

A wide class of nonlinear dynamic processes can be represented by the following nonlinear auto-regressive with exogenous input (NARX) model description (Ljung and Soderstrom, 1983): Y(k) = N(X(k)),

(3)

where X(k) ∈ Rr indicates the input model regression vector, deﬁned as X(k)=[Y T (k − 1), . . . , Y T (k − n), UT (k − d − 1), . . . , UT (k − d − m)], r = nb + ma,

(4)

where N(·) denotes a nonlinear static transition function vector with a dimension similar to the process output vector Y, given by b × 1. U shows the process inputs, conﬁgured into a column vector which has the dimension of a × 1. d is an input time delay while n and m represent the process model orders. The idea of deriving afﬁne-type model has already been introduced by Narendra and Mukhopodhyay (1997). They developed a scalar afﬁne model, called as NARMA-L2, using an approximate Taylor series expansion of a general nonlinear process dynamic model around its scalar input u(k). The present work proposes an online multivariable afﬁne model identiﬁcation approach which is more general and practically oriented, leading to the following distinctive features: (i) the identiﬁcation procedure does not need to have an already available nonlinear process dynamic model, (ii) the procedure considers the problem as a black-box identiﬁcation case with no need to a priori process dynamic knowledge, (iii) the adaptive neuro-fuzzy network structures do not require to be set initially and start their learning from scratch without a priori assumed fuzzy rule, (iv) the neuro-fuzzy networks are recur-

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

sively updated via either structural or parametric adaptation mechanism based on the novelty of the incoming process operational data, realizing a more compact and signiﬁcant set of fuzzy rules, and (v) the adaptive afﬁne model identiﬁcation approach has been proposed for practical industrial processes having MIMO dynamic structures with nonlinear and timevarying characteristics.

2.1.

Multivariable afﬁne model structure

Consider a MIMO process with a inputs U ∈ Ra and b outputs denoted by Y ∈ Rb . The general afﬁne model identiﬁcation for such a multivariable process can be represented by the following discrete-time afﬁne-type expression at the kth time instant:

Y(k) = Fk (Xk ) + Gk (Xk )U(k),

157

where f ik (·) and gijk (·) indicate nonlinear functions, estimated by the adaptive neuro-fuzzy networks. Thus, Fk (·) vector is implemented using b estimated neuro-fuzzy networks whereas b × a neuro-fuzzy networks should be identiﬁed to model matrix. This implies that each Gk (·) estimated neuro-fuzzy network models only a dynamic portion of the multivariable process, leading to reduction of the requisite size of the neuro-fuzzy networks. Accordingly, the online learning procedure of the networks becomes easier and, hence, the estimated afﬁne model becomes more robust to errors. This implies that b(a + 1) nonlinear functions should be estimated using the adaptive neuro-fuzzy networks to realize the proposed multivariable afﬁne model structure. For this purpose, each nonlinear function relating to the corresponding afﬁne model structure can be represented by the following NARX equation:

(5) yafj (k) = hj (X(k)),

j = 1, . . . , b(a + 1).

(10)

where

Y(k) = [y1 (k), . . . , yb (k)]

(estimated output vector),

(6)

T

(actual input vector),

(7)

U(k) = [u1 (k), . . . , ua (k)]

2.2.

T

T

Fk (Xk ) = [f 1k (Xk ), . . . , f bk (Xk )]b×1 ,

⎡

⎢ ⎢ ⎣

g11k (Xk ) .. .

Gk (Xk ) = ⎢

gb1k (Xk )

· · · g1ak (Xk ) ..

.. .

.

(8)

⎤ ⎥ ⎥ ⎥ ⎦

· · · gbak (Xk )

b×a

,

(9)

Adaptive neuro-fuzzy network

Fig. 1 demonstrates the basic conﬁguration of each adaptive neuro-fuzzy model in which fuzzy if-then rules are represented in a layered network structure, composing of ﬁve layers. The ﬁrst layer transmits the received input variables directly to the next layer whose number of nodes is equal to the input vector X dimension. The second layer performs fuzziﬁcation. For this purpose, the following Gaussian functions are employed as the membership functions, because they are differentiable, ensure the greatest possible generalization for each fuzzy rule (Angelov and Buswell, 2002) and yet yield a compact formula for the degree of rule fulﬁllment

Fig. 1 – Neuro-fuzzy model structure.

158

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

in the third layer:

Aqpj (xp ) = exp

−

(xp − qpj )

2

,

qj 2

q = 1, . . . , Hj , j = 1, . . . , b(a + 1),

(11)

where xp denotes the pth input to the second layer of all the networks, apj and qj show the center and width of the qth Gaussian function for the pth input and jth network output, respectively. Hj indicates the generated number of adaptive fuzzy rules for the jth separate network output, available at the kth time instant where the general form of the qth rule is as follows:

1. Compute the overall network estimation for each output using Eq. (14). 2. Find the nearest created fuzzy rule to the new input observation data X(k), deﬁned on the basis of Euclidean distance measure determining by ||X(k) − nr,j || = Mini ||X(k) − ij ||, i = 1, . . ., Hj . 3. Calculate the following measures corresponding to the fuzzy rule addition/replacement criteria as

WRARnr,j (k) =

= 1j

aqj is the constant parameter corresponding to the qth rule consequence, deﬁned for the jth estimated network output. In the third layer, each node corresponds to a rule and hence the node output is equal to the ﬁring strength of each rule. Thus, the antecedent connecting part of the if-then fuzzy rules will be determined as follows on the basis of sum–product composition:

Rqj (X) =

Aqpj (xp ) = exp

p=1

= exp

−

||X − qj qj 2

||2

−

p=1

,

yaf (k) =

qj 2

q = 1, . . . , Hj , (13)

j

Hj Rq (X)aqj q=1 , j = 1, . . . , b(a + 1). Hj

(14)

R (X) q=1 qj

k−i−1 k−k−1 1j nr,j (i) + 1j nr,j (k)

Therefore, b(a + 1) separate neuro-fuzzy networks are developed simultaneously to make an exact MIMO afﬁne model. Eq. (14) represents all the individual neuro-fuzzy model outputs at each time instant and yaf shows the jth j

network output. The network output vector (Y) evolves the proposed b(a + 1) estimated nonlinear function outputs (i.e., [f 1 , . . . , f b , g11 , . . . , g1b , . . . , gab ]).

2.3. Online identiﬁcation scheme for adaptive neuro-fuzzy model The identiﬁcation algorithm starts with no a priori fuzzy rule. Following the reception of a new online observation data (X(k) ∈ Rr , yafj (k) ∈ R, j = 1, . . . , b(a + 1)), the model identiﬁcation procedure starts autonomously to sequentially adapt the structural and parametric free adjustments of the neuro-fuzzy networks as follow corresponding to the multi-input, multioutput (MIMO) afﬁne model, summarized in Eq. (5):

(15)

where WRARnr,j refers to a new global time-weighted rule activation record of the nearest created fuzzy rule based on the most recent sequence of past and present input–output process data organized in X for the jth network output. nr,j (k) indicates the instantaneous measure for evaluating the contribution of the nearest fuzzy rule for the estimation of the jth network output at the kth time instant:

nr,j (k) =

where r denotes the summation of model orders (i.e., n + m). The normalized contribution of each rule will then be computed in the normalization nodes of the fourth layer. The ﬁfth layer gives the estimated network model outputs by

i=1

= 1j WRARnr,j (k − 1) + nr,j (k),

j = 1, . . . , b(a + 1),

k−i−1 1j nr,j (i)

−1 = 1j [WRARnr,j (k − 1) + 1j nr,j (k)]

j

r 2 (xp − qpj )

k−1

(12)

where xi indicates the ith variable of the model regression vector X. Accordingly, Aq1j , . . . , Aqrj show the membership values dealing with the fuzzy sets of the corresponding terms con sidered in X. yaf denotes the jth estimated model output and

k

i=1

j

r

k−i 1j nr,j (i) = 1j

i=1

If x1 is Aq1j and x2 is Aq2j , . . . , and xr is Aqrj Then yaf is aqj , q = 1, . . . , Hj , j = 1, . . . , b(a + 1),

k

R

Hnr,j j

(X)

R (X) q=1 qj

(16)

.

Thus, evaluating the inﬂuence of the nearest rule on the basis of this instantaneous measure can make the neurofuzzy model severely sensitive to measurement noises, leading to unacceptable perturbation in the resulting estimated model. To overcome this problem, WRAR offers an efﬁcient new global measure which incorporates the past fuzzy rule contributions in a time interval whose width can be determined by assigning a forgetting factor indicated by 0 < 1j ≤ 1 in Eq. (15). Another measure which can assess the adequacy of the estimated neuro-fuzzy structure corresponds to the following time-weighted root-mean square (WRMS) error:

k ||ej (i)||2 EWRMS,j (k) = 2j i−k , j = 1, . . . , b(a + 1), Nj

i=k−(Nj −1)

(17)

where ej (i) = yafj (i) − yaf (i) indicates the jth network outj

put estimation error, manifesting the instantaneous quality of the estimated network. Nj shows the predeﬁned sliding data window for the jth network output and 2j denotes the corresponding time-weighted forgetting factor. This measure can, hence, provide a complementary tradeoff measure to maintain the balance between local and global model estimation requirements. 4. Check the rule addition/replacement criteria:

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

If (||X(k) − nr,j ||> εj (k) and EWRMS,j (k) > Eth,j ) if WRARnr,j (k) > WRARth,j Allocate a new rule (Hj + 1) with the following coordinates: aHj +1,j = ej Hj +1,j = X Hj +1,j = j ||X − nr,j || where j is an overlap factor determining the extent of the rules responses overlap in the input space. Else Replace the nearest rule with the new data as anr,j = ej nr,j = X nr,j = j ||X − nr,j || End if

this time-varying proﬁle. As a consequence, the identiﬁcation procedure employs a conservative tight condition in the ﬁrst initial commissioning stage when there is no a priori dynamic knowledge. This causes the neuro-fuzzy model to be adapted smoothly under such a severe uncertainty whereas the timevarying proﬁle lets the resolution threshold to approach a lower value so as to enable the estimated model to be ﬁne tuned as more information is available. The other thresholds, i.e., WRARth,j and Eth,j are assumed constant values, indicating the lowest acceptable rule contribution and the network adequacy measures, respectively, to evaluate the nearest rule activation record and the weighted output error for the jth model output estimation. WRARnr,j denotes the lowest level of weighted rule activation record of the nearest rule to be considered for pruning the closest rule to the most recent input process data.

2.3.1.

Else Adjust the nearest rule free parameters (anr,j , nr,j , nr,j ) using an EKF learning algorithm. End if 5. Check the rule pruning criterion as If WRARnr,j (k) < WRARp,j Remove the nearest rule. Reduce the dimensionality of the model parameter vector. End if where εj (k), WRARth,j , and Eth,j represent decision thresholds for the jth MISO estimated network. εj (k) indicates the minimum input data space resolution required for the nearest fuzzy rule to have signiﬁcant inﬂuence on the most recent input data X(k). The following time-varying scheme is employed to realize different resolution in the course of online identiﬁcation implementation:

εj (k) = εmax,j − (εmax,j − εmin,j )

1 − exp

−(k − 2)

j

,

(18)

where εmax,j and εmin,j determines the largest and smallest distance resolution of interest, while j indicates the speed of

159

Online adaptation of fuzzy rule-based structure

Eq. (12) can be viewed in terms of a computational model in a fuzzy inference system (FIS) employing traditional fuzzy if-then rules. When such a fuzzy rule is created, the coordinates of its membership functions are adjusted on the basis of the new online observation data (X), as described in step 4 of the presented identiﬁcation scheme. This implies that the membership functions exemplify a characteristic input–output behavior of the process to be to be modeled. Although the number of rules (or membership functions) is automatically determined by the proposed online identiﬁcation algorithm, it should be noted that the user speciﬁed parameter j (i.e., the overlap factor characterizing the extent of the rules responses overlap in the input space for the jth separate network output) affects the number of rules that will be generated. This parameter is utilized to initialize the width of the newly added rule and hence can be set based on different objectives. An appropriate option is suggested to be chosen in the range 1 ≤ j ≤ 2. A large j will result in few rules and hence a coarser model, while a small j can produce excessive number of rules and hence a model that does not generalize well. Therefore, j can be regarded as an approximate speciﬁcation of the desired resolution of the model, which can be adjusted based on a tradeoff between the resultant local and global requirements of the model. In this paper, j = 1.75 has been observed as a proper tuning in all the simulation case studies.

Fig. 2 – Block-diagram representation of the proposed afﬁne model identiﬁcation approach.

160

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

2.3.2.

Online parameter learning of fuzzy rules

The learning process of adaptive neuro-fuzzy model involves either a simple addition of a new fuzzy rule or replacement of an already created fuzzy rule based on novelty of the received observation input–output data, as speciﬁed in step 4 of the identiﬁcation algorithm. However, when the observation data do not pass the novelty criteria, the neurofuzzy network parameters, which are characterized by the rules premises (i.e., centers and widths of the membership functions) and consequents, can be updated using the EKF parameter learning algorithm. For multiple inputs, however, the input dimensions grow and this may arise numerical difﬁculties in parameter learning process. That is, when the number of inputs gets large and hence the number of rules grows, the size of the covariance matrix in the EKF algorithm becomes large and this results in a large learning time and storage requirement, causing a computational overload for online applications. To alleviate this problem, the EKF algorithm is utilized to update the parameters of the rule selected by the new input observation data on the basis of the winner rule strategy. That is, as described in step 4 of the identiﬁcation algorithm, only the neuro-fuzzy network parameters associated with the nearest rule to the most recent observation data is adapted. This enhances the robustness of the learning algorithm and yet decreases the required computational burden, leading to a fast identiﬁcation procedure.

2.4.

EKF parameter estimation algorithm

The extended Kalman ﬁlter (EKF) is an expedient learning algorithm for nonlinear parameter estimation. This algorithm is utilized for the parameter estimation to adjust the free coordinates of the nearest fuzzy rule, i.e., nr,j , nr,j and anr,j . For this purpose, the identiﬁed nonlinear neuro-fuzzy network, estimating the NARX model represented by Eq. (10), is ﬁrst linearized around the free adjustable network parameters as follows:

∂yaf (k) j

a˙ nr,j =

˙ nr,j =

∂yaf (k) ∂nr,j

Hj

=

∂anr,j

j

Rnr,j

,

R q=1 qj

⎡

=

(19)

∂yaf (k) ∂Rnr,j j ∂Rnr,j ∂nr,j

⎤

Hj ⎢ q=1 Rqj − Rnr,j ⎥ ∂Rnr,j = anr,j ⎣ 2 ⎦ ∂ , Hj nr,j R q=1 qj

(20)

⎡

∂yaf (k) j

˙ nr,j =

∂nr,j

=

∂yaf (k) ∂Rnr,j j ∂Rnr,j ∂nr,j

⎢

= anr,j ⎣

⎤

Hj

R q=1 qj

− Rnr,j ⎥ ∂Rnr,j

Hj R q=1 qj

2 ⎦ ∂ , nr,j (21)

where ∂Rnr,j ∂nr,j ∂Rnr,j ∂nr,j

= 2Rnr,j

= 2Rnr,j

X(k) − nr,j nr,j 2

,

||X(k) − nr,j nr,j 3

(22)

||2

.

(23)

Then, the gradient of unknown parameter vector, rep˙ nr,j , ˙ nr,j ]T , can be utilized in the resented by ˙ nr,j = [a˙ nr,j , following EKF algorithmic steps to estimate the free coordi-

Fig. 3 – Schematic diagram of a non-isothermal CSTR. nates of the nearest fuzzy rule: −1 T Kj (k) = Pnr,j (k − 1) ˙ nr,j (k)[Rj (k) + ˙ nr,j (k)Pnr,j (k − 1) ˙ nr,j (k)] ,

(24)

nr,j (k) = nr,j (k − 1) + Kj (k)ej (k),

(25)

T Pnr,j (k) = [I − Kj (k) ˙ nr,j (k)]Pnr,j (k − 1),

(26)

where Kj indicates the Kalman gain, Rj denotes the modeling error covariance, and Pnr,j deﬁnes the parameter error covariance matrix, all referring to the nearest fuzzy rule of the jth estimated MISO model incorporated in Eq. (5).

2.5. A new online multivariable identiﬁcation algorithm for an adaptive afﬁne neuro-fuzzy model A new online multivariable identiﬁcation algorithm is presented in this section to develop an adaptive afﬁne-type model for a MIMO process expressed by Eq. (5). For this purpose, Fk (·) includes b adaptive neuro-fuzzy networks while b × a adaptive neuro-fuzzy networks are utilized to estimate Gk (·) model. As illustrated in the block-diagram representation shown in Fig. 2, the proposed multivariable afﬁne model identiﬁcation scheme can be described as follows to simultaneously estimate the unknown nonlinear function vectors Fk (·) and Gk (·): 1. Initialization: The algorithm is initialized by setting G0 (X1 ) = ˛Y(0) and F0 (X1 ) = (1 − ˛)Y(0) in which 0 < ˛ < 1 is selected randomly. 2. Set k = 1 to indicate the ﬁrst sampling time instant.

Table 1 – Nominal CSTR operating condition. q = 100 l/min CAf = 1 mol/l Tf = 350 K Tcf = 350 K V = 100 l hA = 7 × 105 cal/(min K) k0 = 7.2 × 1010 min−1

E/R = 9.95 × 103 K − H = 2 × 105 cal/mol , C = 1000 g/l Cp , Cpc = 1 cal/(g K) qc = 103.41 l/min T = 440.2 K CA = 8.36 × 10−2 mol/l

161

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

3.

3. Calculate Fk (Xk ) using

Simulation test studies

Fk (Xk ) = Y(k) − Gk−1 (Xk )U(k). 4. Perform the online neuro-fuzzy identiﬁcation algorithm to estimate Fk (Xk ) as the set of adaptive neuro-fuzzy networks to model Fk (Xk ). 5. Calculate Gk (Xk ) as

Gki (Xk ) = (Y(k) − Fk−1 (Xk ) −

i−1

Gkp (Xk )up (k)

p=1

−

a Gkp (Xk )up (k) p=i+1

ui (k)

,

i = 1, . . . , a.

Assuming ui (k) = / 0 which is a valid proposition at normal process operating condition. 6. Perform the online neuro-fuzzy identiﬁcation algorithm to estimate Gk (Xk ) as the set of required neuro-fuzzy networks to model Gk (Xk ). 7. Calculate the process output estimate as

A set of simulation test studies are carried out in this section to explore the performance evaluations of the proposed online multivariable afﬁne model identiﬁcation approach. For this purpose, two challenging multivariable benchmark processes are employed.

3.1.

Simulation case study I

A non-isothermal CSTR process, possessing highly nonlinear and time-varying dynamic characteristics with a non-afﬁne dynamic model, will be investigated as a benchmark problem in this case study.

3.1.1.

CSTR process description

Fig. 3 demonstrates the non-isothermal CSTR process in which an irreversible reaction (A → B) takes place. The dynamic behavior of the process is described by the following nonlinear equations in a non-afﬁne model structure (Nikravesh et al., 2000):

Y(k) = Fk (Xk ) + Gk (Xk )U(k). 8. Set k = k + 1 to denote for the next sampling time instant. 9. Go to step 3.

dCA E q = (CAf − CA ) − k0 CA exp − dt v RT

ϕC (t),

(27)

Fig. 4 – open-loop CSTR responses to +5% step change deviations: (a) and (b) depict the output responses to qc ; (c) and (d) show the output response to q.

162

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

(− H)k0 CA E q dT exp − = (Tf − T) + dt v Cp RT

× 1 − exp −

hA qC CPC

ϕC (t) +

ϕh (t) (TCF − T).

C CPC qc CP V (28)

The CSTR nominal operating condition has been summarized in Table 1. Fouling and catalyst deactivation phenomena present two main sources to produce challenging time-varying characteristics in the CSTR process dynamic model during its normal operation. These time-varying dynamic characteristics are represented by the following equations (Nikravesh et al., 2000): ϕh (t) = 1 − 0.01t,

ϕc (t) = exp −0.00067

(29)

ER t , T

(30)

where the fouling effect is caused by deposing material on the heat transfer surface whereas catalyst is deactivated due to poisoning.

3.1.2. Online identiﬁcation of a multivariable afﬁne model under the inﬂuence of noise and time-varying characteristics The performance of the proposed online multivariable identiﬁcation approach will be assessed by considering the non-afﬁne MIMO CSTR dynamic. For this purpose, the CSTR input and output vectors are deﬁned as U = [qc , q]T and Y = [CA , T]T , respectively. Two sequences of white noises with variance of 0.01 are added as the process noises to the CSTR process dynamics, given in Eqs. (27) and (28), to exercise the identiﬁcation simulation tests under real challenging circumstances. Measurement noises present other harsh environmental condition which can inﬂuence the performance of the proposed identiﬁcation approach. Thus, two white noise sequences with variance of 0.02 will be added to the efﬂuent concentration (CA ) and temperature (T) to mimic as measurement noises. Furthermore, the fouling and catalyst deactivation phenomena will be included to carry out the simulation studies under more severe practical conditions. Fig. 4 demonstrates the CSTR open-loop responses to +5% step change deviation in the individual coolant ﬂow rate (qc ) and feed ﬂow rate (q), respectively. The obtained responses illustrate typical efﬂuent concentration (CA ) and temperature (T) behaviors under noise-free conditions for the observation purposes. A set of multi-level variations in the coolant ﬂow rate (qc ) and a one step change variation in the feed ﬂow rate (q) being superimposed by respective white noise sequences of variances 0.01 will be introduced, as shown in Fig. 5(a) and (b), respectively, to excite the CSTR benchmark process over a wide range of operating conditions starting from its initial normal operating point. Consequent output responses of the highly nonlinear CSTR are depicted in Fig. 6(a) and (b). To estimate an adaptive multivariable afﬁne-type model, the following simple input regressor vector is adopted: X = [CA (k − 1), T(k − 1), qc (k), q(k)].

(31)

Therefore, the following two multi-input, single-output (MISO) afﬁne-type models will be estimated using adaptive

Fig. 5 – Input signal sequences exciting the CSTR process: (a) coolant ﬂow rate (qc ); (b) feed ﬂow rate (q).

neuro-fuzzy networks in a real-time manner:

CA (k) = f Ca k (X) + gCa qc k (X)qC (k) + gCa qk (X)q(k),

T(k) = f Tk (X) + gTqc k (X)qC (k) + gTqk (X)q(k).

(32) (33)

Examining theses equations implies that six adaptive neuro-fuzzy networks are required to simultaneously be esti mated so as to model for f ik (X) and gijk (X) where i and j indicate the input and output variables, respectively. To implement the online multivariable identiﬁcation algorithm, the free parameters are set as follows during the whole excitation experiment to adaptively estimate f ik (X) and gijk (X) models: εmin,j = 0.27, εmax,j = 0.98, j = 4000, WRARth,j = 0.32, WRARP,j = 0.15, j = 1.75 and Eth,j = 0.07, j = 1, . . . , 6. The identiﬁed results corresponding to the estimated multivariable afﬁne-type model have been illustrated in Figs. 7–9. The comparative results, illustrated in Fig. 7, clearly demonstrate the high accuracy of the identiﬁed multivariable afﬁne model to efﬁciently approximate the actual non-afﬁne CSTR dynamic model at different operating points. This simulation observation can statistically be veriﬁed based on the overall mean square error (MSE) measure. The calculated

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

163

Fig. 6 – CSTR output responses to input excitation signals: (a) efﬂuent concentration (CA ); (b) temperature (T).

Fig. 7 – Estimated CSTR multivariable afﬁne model outputs: (a) efﬂuent concentration (CA ); (b) temperature (T).

MSE measures for the estimated efﬂuent concentration and temperature behaviors are equal to 0.0005 and 0.0870, respectively. This indicates the high quality of the identiﬁed multivariable afﬁne model to reproduce the actual CSTR process output responses in the face of process and measurement noises which impose more challenging conditions. The time history evaluations of the created fuzzy rules in Fig. 8(a)–(f) correspond to the identiﬁed neuro-fuzzy networks represented by the two sets of f Ca k , gCa qc k , and gCa qk and f Tk , gTqc k , and gTqk model networks, given in Eqs. (32) and (33). The results indicate that the identiﬁed adaptive neuro-fuzzy models have compact structures whose most complex one is recursively ended up ﬁnally to three adaptive fuzzy rules as illustrated in Fig. 8(f). Fig. 9(a) and (b) illustrate the time updating evolutions of the estimated network parameters, corresponding to the adaptive gCa qc k and gTqc k neuro-fuzzy models, respectively, for typical observation purposes.

major control objective. The column shows signiﬁcant nonlinear behavior over its operating region (Sriniwas et al., 1995). Thus, this highly nonlinear process whose model has not been formulated in an afﬁne-type structure will present a proper benchmark case study to explore the performance evaluation of the proposed identiﬁcation approach.

3.2.

Simulation case study II

Distillation columns are industrially important process operations whose dynamics tend to become more nonlinear as the product purities increase. Skogestad (1997, website) presented a well-developed nonlinear model for a high-purity binary distillation column as a real benchmark problem. This distillation column, operated in the LV-conﬁguration, poses a

3.2.1.

Binary distillation column description

The process represents a ﬁrst principle model of high-purity binary distillation column, depicted schematically in Fig. 10. It is usually called as “column A” in the literature which consists of 39 trays, a reboiler and a condenser. The model is a 2 × 2“open-loop” column with reﬂux ﬂow rate, L, and boil-up ﬂow rate, V, as the two manipulated variables whereas the controlled variables are the top and bottom product composition, denoted by XD and XB , respectively. The proposed model has been provided by some assumptions as follows: Binary mixture, constant pressure, constant relative volatility, constant molar ﬂows, no vapor holdup, linear liquid dynamics, equilibrium on all stages, total condenser. Note that we do not assume constant holdup on the stages, that is, we include liquid ﬂow dynamics. This means that it takes some time (about (NT − 2) × taul ) from we change the liquid in the top of the column until the liquid ﬂow into the reboiler changes. The distillation column mathematical model equations have been described as follows:

164

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

Fig. 8 – Time history evolutions of the created fuzzy rules corresponding to: (a) f CA K ; (b) gCA qc K ; (c) gCA qK ; (d) f TK ; (e) gTqc K ; (f) gTqK .

- Material balance for light component on each stage i:

- Total material balance on stage i: dMi = Li+1 − Li + Vi−1 − Vi . dt

(34)

d(Mi xi ) = Li+1 xi+1 + Vi−1 yi−1 − Li xi − Vi yi , dt

(35)

165

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

Table 2 – Nominal operating conditions for distillation column. NT NF Feed rate, F Feed composition, zF Feed liquid fraction, qF Reﬂux ﬂow, LT Boilup, V Liquid holdup on all 41 stages, M0i taul

D yD=XNT B xB = x1 a

41 stages including reboiler and total condenser Feed at stage 21 counted from the bottom 1 [kmol/min] 0.5 [mole fraction units] 1 (i.e., saturated liquid) 2.706 [kmol/min] 3.206 [kmol/min] 0.5 [kmol]a 0.063 min 0 (assuming the vapor ﬂow doesn’t affect the liquid holdup) 0.5 [kmol/min] 0.99 [mole fraction units] 0.5 [kmol/min] 0.01 [mole fraction units]

Including the reboiler and condenser; for more realistic studies you may use M01 = 10 [kmol] (reboiler) and M0NT = 32.1 [kmol] (condenser).

equilibrium: yi =

˛xi , 1 − (˛ − 1)xi

Fig. 9 – Time updating evolution of network parameters: (a) gCA qc K ; (b) gTqc K .

Fig. 10 – The complete schematic diagram of distillation column. which gives the following expression for the derivative of the liquid mole fraction: dxi (d(Mi xi )/dt) − (xi (dMi /dt)) . = dt Mi

(36)

- Algebraic equations: The vapor composition yi is related to the liquid composition xi on the same stage through the algebraic vapor–liquid

Fig. 11 – Column input excitation signals in: (a) L; (b) V.

(37)

166

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

Total condenser: i = NT(MNT = MD , LNT = LT ) dMi = Vi−1 − Li − D, dt

(42)

d(Mi xi ) = Vi−1 yi−1 − Li xi − Dxi . dt

(43)

Reboiler: i = 1 (M1 = MB , V1 = VB = V) dMi = Li+1 − Vi − B, dt

(44)

d(Mi xi ) = Li+1 xi+1 − Vi yi − Bxi . dt

(45)

We have written the model such that the states are xi (i = 1, NT) and Mi (i = 1, NT) – a total of 2 × NT states. Also nominal conditions have been presented in Table 2. Further details of the simulated model are described in Skogestad (1997).

3.2.2.

Online identiﬁcation of a multivariable afﬁne model

Two normal distributed random sequences of variances 0.001 and 0.007 are superimposed on the column input changes in L and V, respectively, to excite the column benchmark around its normal steady-state operating condition (Skogestad, website). Fig. 11(a) and (b) represent the corresponding input exci-

Fig. 12 – Column output responses: (a) XD ; (b) XB .

where ˛ is the relative volatility. Based on the assumption of constant molar ﬂows and no vapor dynamics, we have the following expression for the vapor ﬂows (except at the feed stage if the feed is partly vaporized, where VNF = VNF−1 + (1 − qF )F): Vi = Vi−1 .

(38)

The liquid ﬂows depend on the liquid holdup on the stage above and the vapor ﬂow as follows (this is a linearized relationship; we may alternatively use Francis’ Weir formula, etc.): Li = L0i +

(Mi − M0i ) + (V − V0)i−1 × , taul

(39)

where L0i [kmol/min] and M0i [kmol] are the nominal values for the liquid ﬂow and holdup on stage i. The vapor ﬂow into the stage may also affect the holdup. For packed columns is usually close to zero. The above equations apply at all stages except in the top (condenser), feed stage and bottom (reboiler).Feed stage: i = NF (we assume the feed is mixed directly into the liquid at the feed stage): dMi = Li+1 − Li + Vi−1 − Vi + F, dt

(40)

d(Mi xi ) = Li+1 xi+1 + Vi−1 yi−1 − Li xi − Vi yi + FzF . dt

(41)

Fig. 13 – Estimated multivariable afﬁne model for column outputs: (a) top product composition (X D ); (b) bottom product composition (X B ).

167

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

Fig. 14 – Time history evolutions of the created fuzzy rules corresponding to: (a) f XD k ; (b) gXD Lk ; (c) gXD Vk ; (d) f XB k ; (e) gXB Lk ; (f) gXB Vk .

tation signals, resulting into the output column responses in XD and XB , depicted in Fig. 12(a) and (b), respectively. The following input regressor vector is utilized to implement the proposed online multivariable afﬁne identiﬁcation approach:

XD (k) = f XD k (X) + gXD Lk (X)L(k) + gXD Vk (X)V(k),

X = [XD (k − 1), XB (k − 1), XD (k − 2), XB (k − 2), L(k), V(k)].

XB (k) = f XB k (X) + gXB Lk (X)L(k) + gXB Vk (X)V(k).

(46)

This leads to an online estimation of the following two MISO afﬁne-type model structures using six adaptive neurofuzzy networks:

(47) (48)

168

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

Fig. 15 – Time updating evolutions of network parameters: (a) f XD k ; (b) gXD Lk ; (c) gXD Vk ; (d) f XB k ; (e) gXB Lk ; (f) gXB Vk . For this purpose, the free parameters are set as follows to estimate f ik (X) and gijk (X) networks: εmin,j = 0.39, εmax,j = 0.89, j = 1200, WRARth,j = 0.22, WRARP,j = 0.01, j = 1.75 and Eth,j = 0.01, j = 1, . . . , 6. The identiﬁed multivariable afﬁne model results have been illustrated in Figs. 13–15. Fig. 13 depicts the comparative results in terms of the estimated afﬁne model outputs. The results clearly demonstrate the high accuracy of the identiﬁed

multivariable afﬁne model in approximating the highly nonlinear binary distillation benchmark whose dynamic model has been developed in a non-afﬁne structure. The corresponding MSE measures, calculated as 1.617 × 10−5 and 2.767 × 10−5 , for the top and bottom product compositions, respectively, verify the adequacy of the identiﬁed neuro-fuzzy networks to adaptively reproduce the actual column outputs. Figs. 14 and 15 show the real-time adjustments of the identiﬁed neuro-fuzzy networks to adaptively tune their structures and parameters, respectively. Fig. 14(a)–(f) depict the time history evolutions of the created fuzzy rules to realize the

chemical engineering research and design 8 8 ( 2 0 1 0 ) 155–169

individual structural adaptations corresponding to f ik (X) and gijk (X) models expressed in Eqs. (47) and (48). The obtained results indicate the simplicity of the estimated neuro-fuzzy model structures in which the identiﬁed afﬁne-type model has at most three adaptive fuzzy rules for each structure. Fig. 15(a)–(f) show the real-time updating of the estimated net work parameters, corresponding to the six adaptive f ik (X) and gijk (X) neuro-fuzzy models.

4.

Conclusions

Development of an adaptive afﬁne-type dynamic model for practical multivariable processes with nonlinear and timevarying characteristics provides a signiﬁcant contribution, enabling the utilization of a variety of nonlinear model-based control design techniques. A new online multivariable identiﬁcation scheme has been proposed in this paper to accomplish a practically demanding modeling objective. The proposed scheme presents a new procedure to simultaneously identify a set of complementary adaptive neuro-fuzzy networks to realize an online multivariable afﬁne model structure. The procedure introduces several functional evaluating metrics, such as WRAR, to facilitate the implementation of automatic model development with structural and parametric capabilities. This naturally leads to efﬁcient and compact neuro-fuzzy networks, contributing a tradeoff between local and global model estimation requirements over the whole process operating points spectrum. The extensive simulation test studies carried out on a CSTR and a binary distillation column, as two well-known benchmarks with highly nonlinear and timevarying characteristics, demonstrated the capability of the proposed identiﬁcation approach to automatically estimate a set of simultaneous adaptive neuro-fuzzy networks with simple and compact dynamic structures so as to develop an accurate afﬁne-type model framework.

169

References Angelov, P. and Buswell, R., 2002, Identiﬁcation of evolving fuzzy rule-based models. IEEE Trans Fuzzy Syst, 10(5): 667–677. Chen, F.C. and Khalil, H.K., 1995, Adaptive control of a class of nonlinear discrete-time systems using neural networks. IEEE Trans Autom Control, 40(5): 791–801. Henson, M.A. and Seborg, O.E., (1997). Nonlinear Process Control. (Prentice-Hall). Ljung, L. and Soderstrom, T., (1983). Theory and Practice of Recursive Identiﬁcation. (MIT Press, London). Narendra, K.S., 1992, Adaptive control of dynamical systems using neural networks, in Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, White, D.A. and Sofge, D.A., Sofge, D.A. (eds) (Van Nostrand-Reinhold, New York), pp. 141–183. Narendra, K.S. and Mukhopodhyay, S., 1997, Adaptive control using neural network and approximation models. IEEE Trans Neural Networks, 8(3): 475–485. Nikravesh, M., Farell, A.E. and stanford, T.G., 2000, Control of nonisothermal CSTR with time varying parameters via dynamic neural network control (DNNC). Chem Eng J, 76: 1–16. Norgaard, M., Ravn, O., Poulsen, N.K. and Hansen, L.K., (2003). Neural Networks for Modeling and Control of Dynamic Systems. (Springer-Verlag, London), pp. 142–147 Sanner, R. and Slotine, J.J., 1995, Stable adaptive control of robot manipulators using neural networks. Neural Comput, 7(4): 753–790. Skogestad, S., 1997, Dynamics and control of distilation columns. Chem Eng Res Des (Trans IChemE), 7(5): 539–562. Skogestad, S., MATLAB Distillation Column Model (“Column A”), available at: http://www.nt.ntnu.no/users/skoge/book/ matlab m/cola/cola.html. Sriniwas, G.R., Arkun, Y., chien, I.L. and Ogunnaike, B.A., 1995, Nonlinear identiﬁcation and control of high-purity distillation column: a case study. J Process Control, 3: 149–162.

A novel online affine model identification of multivariable processes using adaptive neuro-fuzzy networks

A novel online affine model identification of multivariable processes using adaptive neuro-fuzzy networks

Recommend Documents