event systems

event systems

Mathl. Comput. Modelling Vol. 23, No. H/12, pp. 161-173, 1996 Copyright@1996 Elsevier Science Ltd Printed in Great Britain. All rights reserved 089571...

1MB Sizes 7 Downloads 98 Views

Mathl. Comput. Modelling Vol. 23, No. H/12, pp. 161-173, 1996 Copyright@1996 Elsevier Science Ltd Printed in Great Britain. All rights reserved 08957177/96 $15.00 t 0.00

Pergamon

SO8957177(96)00070-2

State Feedback of Condition/Event B. Department

of Electrical

H. KROGH

and Computer Pittsburgh,

Control Systems

Engineering,

Carnegie

Mellon

University

PA 15213-3890, U.S.A. [email protected]

S. KOWALEWSKI Department of Chemical Engineering, University of Dortmund Process Control Group (AST), D-44221 Dortmund, Germany stefanQast.chemietechnik.uni-dortmund.de Abstract-we consider the problem of synthesizing state feedback control policies for condition/event (C/E) systems where the control signals can be both conditions, which enable and inhibit state transitions, and events, which force state transitions. To represent the causal structure of condition and event feedback, a new feedback composition is defined. An algorithm is presented for synthesizing state feedback policies that keep the state of the C/E system out of a given set of forbidden states. The problem formulation and solution is illustrated for a process control example. Keywords-Supervisory Untimed models.

control, State feedback policies, Continuous-time

discrete event systems,

1. INTRODUCTION The most common approach to feedback control of untimed (logical) discrete event systems is the supervisory control theory of Ramadge and Wonham (RW) [l]. In the RW formulation, events in the controlled system (plant) are enabled or disabled by the controller (szlpervisor) to achieve a desired sequential behavior. The plant and supervisor are both modeled as finite-state automata with complementary input-output behavior: the plant automaton generates events from spontaneous state transitions, while the supervisor automaton accepts events from the plant which force state transitions in the supervisor and change the control input to the plant. ‘Thus, plant automata and supervisor automata in the RW model are different classes of discrete event systems with respect to their input-output behavior. Sreenivas and Krogh introduced condition/event (C/E) sys t ems to provide a single, uniform formalism for modular modeling of untimed discrete event systems [2]. C/E systems are continuous-time discrete event systems with two classes of symbolic-valued input and output signals: piecewise continuous condition signals and pointwise nonnull event signals. In general, both types of signals can be input signals and output signals for C/E systems, as illustrated in Figure 1. State transitions in C/E systems can be enabled or disabled by the condition input signal, and they can be forced by the event input signal. Plants and supervisors in the RW theory correspond to restricted classes of C/E systems as illustrated in Figure 2. Event signals from Research of the first author was supported in part by a grant from Rockwell International. The second author gratefully acknowledges the support of the German Academic Exchange Service (DAAD) which made it possible to perform most of the research described in this paper during a three-month visiting research appointment in 1994 with the Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, U.S.A. Typeset 161

by A&@-TBX

B. EI. KROGH AND S. KOWALEWSKI

162

40

Y(t)

v(t)

z(t)

Conditionsignal flow lines e Event signal flow lines Figure 1. C/E system with continuoustime condition and event input-output signals.

Figure 3. General setting of C/E tern feedback control.

sys-

Figure 2. C/E system model of Ramadge and Wonham supervisory control.

Figure 4. State feedback policy and observer as subsystems of the controller.

the plant force transitions in the supervisor and condition signals from the supervisor enable or disable state transitions in the plant. This leads naturally to the question of what consequences will arise if both types of signals are allowed to flow between the plant to the controller and vice versa, as illustrated in Figure 3. In this paper, we consider the synthesis of state feedback control policies for C/E systems when the control objective is to avoid a given set of forbidden states in the plant. In the setting of Figure 3, we regard the control as consisting of two subsystems: a state observer, which identifies the current plant state from the plant output signals; and a state feedback policy, which maps the state trajectory into the two input signals of the plant. This controller structure is illustrated in Figure 4. We assume in this paper that a state observer exists (or the state can be measured directly) and focus on the synthesis of the state feedback policy. In the following section, we define C/E systems and state feedback policies, and characterize the closed-loop dynamics for systems with the controller structure in Figure 4. To model the causal sequence of event feedback based on state transitions in the plant, we define a new feedback composition operator for C/E systems called the sequential event feedback composition (SEFC). In Section 3, we formulate the state feedback problem for forbidden state specifications and present an algorithm that generates all state feedback policies that solve the problem when a solution exists. In Section 4, we present a process control example to illustrate the problem formulation and solution. The concluding section summarizes the results presented in this paper and discusses directions for future research. 2.

C/E STATE TRANSITION AND STATE FEEDBACK

STRUCTURES POLICIES

C/E systems provide a formalism for modeling precisely the behaviors of interconnected discrete-event systems in terms of continuous-time signals. Although the dynamics of C/E systems are untimed in the sense that there is no specification regarding the timing of events, the introduction of continuous-time input-output signals allows one to construct models of physical systems that reflect causality and spontaneity more intuitively than is possible with automatabased models. Moreover, interfaces between C/E systems and continuous dynamic systems can be more easily defined because C/E systems are already defined in a continuous-time setting [3].

State Feedback Control

163

In this section, we define C/E systems and the particular feedback control behaviors considered in this paper. General properties of interconnected C/E systems are discussed in [2]. DEFINITION 2.1. A C/E system is a nine-tuple C = (U, V, X, Y, 2, f, g, h, x0) where U. V, X, Y, Z are finite, .mutually disjoint sets of elements not including 0 (the null or zero event), defined, and output events; respectivel_y, as: input conditions, input events, states, output conditions, f, g, h are functions

function satisfying defined as: f : X x U x V” --+ 2xJ the state transition x E f (x, u, 0) for all x E X, with Vo = V U (0): g : X x U + Y, the condition output function, and h : X :KX x Vo + 20 the event output function satisfying h(x, x, 0) = 0 for all x E X, with Z,, E 2 u (0); and x0 E X is the initial state. Given a C/E system C as defined above and input signals U(O), U(O) defined on [0, co), the set of valid state trajectories and output signals contains all triples of signals (x(e)! Y(a). Z(O)) defined

on [0, co) satisfying 5 (o-)

= x0 E x,

x(t) E f (x (t--) ,u (t-)

7(t))

3

y(t) = g(x(t), u(t)), Z(t) = h (x (t-)

>x(t)> u(t)) 1

where x(o) and y(o) are condition signals and ~(0) is an event signal. In the state transition relation above, t- symbolizes the limit from the left. Condition signals (including the state trajectory) are defined to be right continuous on [0, co). It is also assumed condition and event signals are nonexplosive; that is, they have a finite number of discontinuities on any finite interval. The state feedback policy in Figure 4 maps the state trajectory into condition and event input signals for the plant. Inserting this controller structure into Figure 3 and combining the state observer dynamics with the plant dynamics yields the feedback structure shown in Figure 5 in which the state feedback policy is now seen to play the role of the output functions in the overall C/E system model. To formulate the state feedback control problem, we define the dynamics generating the state trajectory as a C/E state transition structure, and define the state feedback input policy by the functions g(o) and h(o,o), where g(*) maps the state into the condition signal and h(*,*) maps the state transitions into the event input signal. These notions are defined formally in the following paragraphs.

Figure 5. State feedback policy as output functions of an overall C/E system 2.2. A C/E state transition structure (C/E STS) is a five-tuple G = (U V, X, f, x0) with the components defined as in Definition 2.1 for a complete C/E system.

DEFINITION

2.3. Given a C/ESTSG = (U, V, X, f, XO), a state feedback policy is a pair F = (g, h) with g : X -+ U being the condition signal feedback function, and h : X x X - V the event signal feedback function satisfying:

DEFINITION

(i) h(x, x) = 0 for all x E X; and (ii) h(x’,x) = h( x”, x) for all distinct

x, x’, x” E X.

B. H. KROGH AND S. KOWALEWSKI

164

Given a state feedback policy F = (g, h), the inputs to the given C/E STS are determined from the state trajectory as u(t) = g(z(t)) and v(t) = h(z(t-), z(t)). Condition (i) on h in the definition of a state feedback policy guarantees nonzero event inputs occur only when state transitions occur. Condition (ii) indicates the event feedback depends only on the state at the end of the state transition; that is, h(x’, x) depends only on the “current” state 2, and is independent of the state x’ from which the transition began. Since the event signal h in a state feedback policy depends only on the second argument, we will denote the value of the event feedback policy due to a transition to a state 2 E X by h(*, x). Given a C/E STS G and a state feedback policy F, we denote the closed-loop system by G/F. We are interested in characterizing the admissible state trajectories for G/F and the set of all states reached along all possible state trajectories. At this juncture, we discover that if one applies the feedback composition for C/E systems defined by Sreenivas and Krogh [2], the resulting set of state trajectories for G/F would not represent the most natural interpretation of the behavior of the closed-loop system in Figure 4. In particular, since the event feedback policy is intended to force state transitions in the plant based on the detection of state transitions, one finds that the most natural modeling paradigm is to “freeze” time at the instants when spontaneous state transitions occur to allow for the event feedback to force a sequence of instantaneous state transitions. For example, if a spontaneous state transition occurring in the plant, say ~1 -+ x2, leads through the feedback policy h(xl,xz) to the forcing of another state transition 22 + 23 (with no new transition forced by h(x2, x3)), 1 ‘t IS . most natural in an untimed model to ignore the finite time for this transition-control-transition sequence to occur and allow for the forced state transition to occur instantaneously. Thus, in the continuous-time setting of C/E systems, if the state transition in this example occurs at time t 1, the state trajectory (which by definition is a right-continuous condition signal) would assume the values I = zi and I = 23. The system passes through the intermediate state 22 instantaneously, so the zero residence-time in state 52 is not reflected in the continuous-time state trajectory. (We define such states below as transient states.) Under the Sreenivas and Krogh feedback composition for C/E systems, the event feedback signal in this example would not force the transition x2 ---) ~3 since the state transition causing the event input would not be completed when the event input is received; that is, the state trajectory for the Sreenivas and Krogh feedback composition at time tr would assume the values I = ~1 and x(tr) = 52, To define a new feedback composition for state feedback policies which allows for the type of instantaneous sequential behavior described above, we first define the sequences of states that satisfy the state transition relation under the given state feedback policy. DEFINITION 2.4. Given a C/E STS G = (U, V, X, f, x0) and state feedback policy F, a finite or infinite sequence of states xk E X, k = 0, 1,2,. . . is an admissible state sequence for G/F if for all k = 0, 1,2, . . . , xk # Xkfl and either:

(i) (ii)

xk

E f(xk,g(zk),

h(*,

zk))

and

xk+l

E

xk

$

h( 0, xk))

and

zk+l

E f(xk,g(xk),

Moreover,

f(xkr

g(xk),

if the sequence

defined only for k = 0, 1, .

f(Zk,g(xk),

of states is finite length, . . , K,

then

XK

E f(xK,

0); h(*,

01. xk)).

that is, if there is some K > 0 such that xk is g(xK),

h(@, XK)).

The admissible state sequences for a closed-loop system G/F characterize the possible state transitions that can occur at any instant along a state trajectory. Condition (i) in Definition 2.4 pertains to the case when the system remains in a state xk for a finite residence time. This occurs when the event input signal does not force the system to leave state xk. Condition (ii) pertains to the case when the event input signal from the state transition xk-1 --t xk forces another state transition xk + %k+l. In this case, the latter state transition occurs instantaneously and the state 21, is called a transzent state. We note that if condition (ii) is satisfied at x0, the initial

State FeedbackControl

165

state is a transient state and the system makes an instantaneous transition to a nontransient state when the state feedback is applied at time t = 0. Since event inputs force state transitions only when the system cannot remain in the current state, we make the following assumption concerning the state transition function. ASSUMPTION 1. For any given C/E STS G = (U, V! X, f, Q),

fies for each u E U, ‘u E V, z E X, either (i) f(z,u,

the state transition function satis-

w) = f(z,u,O),

or (ii) z $ f(~,u,~).

Conditions (i) and (ii) in Assumption 1 correspond to (i) and (ii), respectively, in Definition 2.4. This assumption introduces no loss of generality since if it is not true for a given C/E STS, ,a new state transition function can be defined satisfying (i) in Assumption 1 whenever u and v are such that 5 E ~(z,u,v)

in the original state transition function. This modification will have no effect

on the closed-loop behavior of the system under state feedback policies due to condition (i) in Definition 2.4. Assumption 1 simplifies the statements of the computational procedures below. The last statement in Definition 2.4 regarding finite-length admissible state sequences indicates that the system must be able to remain in the last state in the sequence indefinitely; that is, the final state must be a nontransient state. We note that the definition above retains the nondeterministic behaviors allowed in general C/E system models. For example, even under forcing of state transitions, the system may make a transition to one of several possible states. To guarantee the state trajectories are well defined for the closed-loop system, we must preclude situations where the state feedback policy leads to the possibility of an infinite-length sequence of transient states. If this could occur, there would be a “burst” of an infinite number of events and time would not advance. Therefore, we define admissible state feedback policies as follows. Given a C/E STS G = (U, V, X, f, zo), a state feedback policy F is ad,missible provided there is no infinite-length admissible state sequence xk E X, k = 0,l. 2, and some K > 0 such that the states xk are transient states for all k > K. DEFINITION 2.5.

Since the set of states is finite, the admissibility of a state feedback policy can be tested by checking whether the event feedback policy can lead to a finite cycle of transient states, in which case this cycle could be repeated indefinitely and the feedback policy would not be admissible. The state trajectories for closed-loop C/E systems under admissible state feedback policies are defined by the following composition operation. DEFINITION 2.6.

SEQUENTIAL EVENT FEEDBACK COMPOSITION (SEFC).

Given a C/13 STS

0) and an admissible state feedback policy F, a condition signal x : [0, co) ---f X G = (U,V,X,f,x with discontinuities at 0 < tl < tz < ... is a state trajectory for the closed-loop system G/F if there exists an admissible state sequence xk E X, k = 0, 1,2,. . . for G/F, such that the subsequence xkO)xkl ) xk2) . . of all nontransient states in the admissible state sequence sa.tisfies z(tl) = xkl, for 1 = 0,1,2,. . . , where to = 0. Given a state trajectory x(o) for a closed-ioop system G/F, an admissible state sequence satisfying the condition in Definition 2.6 for x(o) is said to be an admissible state sequence assoczated with z(o). Since C/E systems are untimed models, a C/E system identifies only the sequence of states along a state trajectory, not, the times at which the state transitions (occur. Therefore there are an infinite number of state trajectories that have the same set of associated admissible state sequences. The following proposition demonstrates that the admissible state sequences completely characterize the state trajectories for a closed-loop system G/F. PROPOSITION 1. Given a C/E STS G = (U, V, X, f, x0), an admissible state feedback policy F,

and an admissible state sequence xk E X, k = 0, 1,2,. . . for G/F, there exists a state tirajectory x(o) ibr G/F such that xk E X, Ic = 0,1,2,. . . i’s associated with x(o). PROOF. Let Xk,, Xk2,. . . be the subsequence of nontransient states in the given admissible state sequence. Define L as the last subindex in the sequence xkl, xk2,. . . if the sequence has finite length; otherwise define L = 00. Note that since F is an admissible state feedback policy, there

B. H. KROGHAND S. KOWALEWSKI

166

is a finite number of nontransient states if and only if the given admissible state sequence has finite length. Moreover, if the admissible state sequence has finite length, the final state in the sequence is nontransient. Define the infinite-length sequence ~0, ~1,. . . by Zl =

xh

forOil
xkr,

for L < 1.

is the sequence of nontransient states extended to an infinite-length sequence Thus, zo,zl,... with a constant value equal to the last nontransient state when the admissible state sequence has finite length. Consider the condition signal x : [0, co) + X defined for I = 0, 1, . . . by z(t) = zl for t E [l, 1 + 1). We claim this condition signal x(o) is a state trajectory for G/F. This follows by observing that the discontinuity points of x(o) are at t = 1,. . . , L, which implies the given admissible state sequence is associated with x(m) as defined in Definition 2.6.

I

Our objective is to design state feedback policies that satisfy specifications on the states that are reached along state trajectories in the closed-loop system, including transient states. Thus, we define the set of reachable states for a closed-loop system G/F, denoted by R(G/F), as follows. DEFINITION 2.7.

Given C/E STS G = (U, V, X, f, x0 ) an d an admissible state feedback policy F, the set of reachable states R(G/F) for G/F is the union of states in all admissible state sequences

for G/F. We conclude this section by showing the set of reachable states R(G/F) the following iterative procedure.

can be computed by

REACH: Given C/E STS G = (U, V, X, f, 20) and state feedback policy F: x0 := (x0) Ic := 0 repeat X”+l := X” u Ic:=k+l until Xk = X”-l R(G/F) := X” end-REACH

a C/E STS terminates.

PROPOSITION 2. Given

procedure

REACH

(UzEXkf(x, g(x),h(., ~1))

G = (U, V, X, f, x0) and a state

feedback

policy

F,

the

PROOF. The proposition follows from the observations that the sequence Xk is monotone increasing with respect to set containment and the set of states X is finite. I PROPOSITION 3. Given a C/E STS G = (U, V, X, f, x0) and an admissible state feedback policy F, the set of states R(G/F) generated by the procedure REACH is the set of reachable states for G/F. PROOF. This proposition follows immediately from Definitions 2.4 and 2.7.

3. FORBIDDEN

STATES

AND

ADMISSIBLE

I

CONTROLS

Our objective is to synthesize state feedback policies that guarantee the plant state for the closed-loop system does not enter a given set of forbidden states, denoted by Xf c X. The problem to be solved is stated formally as the following. DEFINITION 3.1.

forbidden

states Xf

Given a C/E STS G and a set of state feedback policy F such that R(G/F) nXf = 0.

FORBIDDEN STATES PROBLEM (FSP). c X, find an admissible

In general, if there is a solution to a given FSP, the solution is not unique. In this section, we present an algorithm that generates a characterization of all possible state feedback policies that

State Feedback Control solve an FSP in terms of the sets of control

input

167

alternatives that can be applied at each state

of the given C/E STS. Given a C/E STS G = (U,V,X,f,zo), a function Q : X - 2”& defines control alternatives for each state. An admissible state feedback policy F = (g, h) is said to be

DEFINITION

3.2.

in Q (denoted F E @p) if for all x E X, (g(x), h(z))

E Q(x).

Given a C/E STS and a set of forbidden states, the following algorithm generates the control alternatives that characterize all solutions to the FSP (provided the initial state satisfies the condition stated in Theorem 1, below). ADMISSIBLE_CONTROLS: xj

(1 x: x;

Given a C/E STS G = (U. V, X, f, 20) and a set of forbidden states

:= Xf

for each z E Xj Q(x) := u x v, end-for-each k:=O

repeat X’ := 0 for each 5 E X - Xf” @p’“(Z) := ((‘11,u) E u x vo if Q’(x)

1f(z, 21,u) n xf” = 0)

= 0 X’:=X’U{x} Q(x) := u x v,

end-if end-for-each x:+1 := xf” u X’ k:=k+l until X’ = 0 X; ::: Xf” for each x E X - X$ (a(x) := @k(x) end-for-each end-ADMISSIBLE-CONTROLS PROPOSITION 4. Given C/E STS G = (U, V, X, f, x0 ) an d a set of forbidden states Xf c X, the procedure ADMISSIBLE_CONTROLS terminates. PROOF. The proposition follows from the observations that the sequence of sets Xf” is monotone increasing with respect to set containment and the set of states X is finite. I The set of states X; generated by the procedure ADMISSIBLE-CONTROLS is the set of states from which it is impossible to avoid reaching the set of forbidden states Xf under any control action. This fact is stated and proved as the following lemma. x0 ) an d a set of forbidden states Xf c X, Jet Xi > ADMISSIBLE_CONTROLS. If x’ E Xi - Xf . then there exists some K > 0 such that for any control sequence Uk E U, vk E VO, k = 0, 1, . , J: - i, there is a sequence of states xl,. ..,XK, such that xl E f(x',uOrvO),Xk 6 f(~k-lj~k-l~~k-I) for k = 2,. . , K, and xK E Xf.. LEMMA

Xf

1. GivenaC/ESTSG=(U,V,X,f,

be the set of states generated

PROOF.

by procedure

We prove the lemma by proving the following assertion:

if state x’ E Xi - Xf was added

to Xi on iteration k > 0 of the algorithm ADMISSIBLE-CONTROLS, i.e., if x’ E Xs - Xr-lY then the lemma is true for K = k. We prove this assertion by induction on k. Suppose x’ E Xi - Xfo (i.e., k = 1). This implies for all u E U, v E VO, ~(x’,~L,v) n Xf” # 8. Therefore,

168

B. H. KROGH AND S. KOWALEWSKI

for any given uo E U, vo E VO, there exists a state 21 E ~(z’,uo, WO)n Xf”, which implies the lemma is true for K = 1 since Xf” = Xf.

Now, suppose the assertion is true for k = k’ > 1, and - Xfk’, this implies for all u E U, 21E VO,

consider the assertion for k = k’ + 1. If 2’ E X:‘+’ f(z’,

IL,ZI)n Xf”’ # 0. Moreover, since 2’ $ X, ” , it must be the case that for some u E U, u E VO,

f(z’,

21,u) n x?‘-l

= 0, which implies for all u E U, w E VO, f(d,

u,v) n (Xy’ - XF’-‘)

# 8.

Therefore, for any given uo E U, ZIOE VO, there exists a state x1 E f(z’, ~0, ~0) n (Xj’ - X,“‘-l). Since the assertion is assumed to be true for k = k’, for any control sequence uk E U, ‘uk E VO, k = l,..., k’, there exists a sequence of states x2,. . . ,xp+l such that xk E f(xk_1,~k__1,~k_-l) for k = 2,. . . , k’ + 1, and xkf+l E Xf This proves the lemma holds for K = k’ + 1, which proves the induction hypothesis and the lemma.

I

By Lemma 1, if the initial state is in the set X;,

it will not be possible to guarantee a state

trajectory will not enter the set of forbidden states. On the other hand, if the initial state is not in the set X(f, FSP has a solution and all possible solutions are completely characterized by the set of control alternatives generated by the procedure ADMISSIBLE-CONTROLS. These facts are established by the following theorems. THEOREM 1. Given C/E ST’S G = (U, V, X, f, x0) and a set of forbidden states Xf c X, let X; be the set of states and @ the control alternatives generated by procedure ADMISSIBLE-CONTROLS. If x0 E X - X;, any admissible state feedback policy F E @ solves the FSP. Furthermore, if an admissible state feedback policy F E Q, solves the FSP, then x0 E X - Xi. PROOF. Regarding the last statement of the theorem, if F E Cpsolves the FSP, it follows immediately from Lemma 1 that x0 E X - X(f. Now, suppose x0 E X - Xi and that F E @ is an admissible state feedback policy. It must be shown that R(G/F) nXf = 0. We show by induction that for each iteration of the procedure REACH, Xk n X; = 0, which implies the desired result,

since R(G/F)

= X” and Xf C X(f. k = 0:

X0 n X; = {x0} n Xi = 0

(by assumption).

Induction hypothesis: X” n X; = 0 k + 1:

From REACH: Xk+l := X” U

u

f(x,

s(x), f4.7 x))

i XEX”

Since XknX;

= 0 and (g(x), h(z)) E Q(x), for any x E Xk, if x’ E f(x,g(x),

from procedure ADMISSIBLE-CONTROLS

that 2’ $! X;.

. )

h(o, x)), it follows

Therefore, Xk+’ n X; = 0.

I

2. Given C/E STS G = (U, V, X, f, x0) and a set of forbidden states Xf c X, let Xi be the set of states and Q the control alternatives generated by procedure ADMISSIBLE-CONTROLS. The sets of control alternatives Cpgenerated by procedure ADMISSIBLE_ CONTROLS for each state are the maximal sets of controls with respect to set containment which guarantee any admissible state feedback policy F E Q is a solution to the FSP for any x0 EX-XX;. THEOREM

PROOF. Suppose that for some x0 E X - Xi and admissible feedback policy F, there exists a 4 Q. Th is implies from the procedure reachable state x E R(G/F) such that F = (g(a),h(x)) ADMISSIBLE-CONTROLS that f(x, g(x), h(o, x)) n Xi # 0. Therefore, by Lemma 1, F is not a solution to the FSP. I

Thus, in the sense of set containment, the control alternatives generated by procedure ADMISSIBLE_CONTROLS is the maximal solution to a FSP. In contrast to the supervisory control problems formulated in the standard RW framework, there is no notion of a unique maximal control to be applied at each state. Rather, the structure of the C/E STS with the possibility of forcing controls creates a situation analogous to the RW formation with arbitrary control sets considered by Golaszewski and Ramadge [4].

State Feedback Control

169

4. EXAMPLE Consider processes

a tank for liquid material shown in Figure

as it is used for storage or dosing purposes

valve. Both valves can be in only one of two states, open or closed. a heating coil because the stored material can gelatinize at ambient coil has two states,

on (energized)

and 08.

The heating

it is on when the liquid level is below the heating example

in chemical

6. The tank can be filled via an inlet valve and drained

The tank is equipped with temperature. The heating

coil can be damaged

coil. This example

batch

via an outlet

by overheating

is a modified

version

if

of the

in [5,6]. Feed

Valve 1 full

J-5-G

I



I

/

medium

Valve 2

D&n Figure 6. Tank example

Figure 7. State graph for tank example.

Figure 7 shows the state graph of the tank model. For simplicity, we assume only one .valve can be operated at a time and control actions (opening/closing of valves or starting/stopping the heating) never happen simultaneously with a change in level range (see Figure 6 for definition of the level ranges). It is possible, however, that one of the valves and the heating coil change states at the same time. The state spill stands for overflow, gel represents the case when the material has gelatinized, oo is the situation when both valves are opened, and burn represents

170

B. H. KROGH

AND

S. KOWALEWSKI

the damaged state of the heating. All other states are named in the following fashion: the first letter describes the state of valve 1: o = open, c = closed; the second does the same for valve 2; the third letter represents the level-range: f = full, m = medium, 1 = low, e = empty; and a fourth letter h shows that the heating is on. At state ~0, the tank is empty, the heater is off, and the valves are closed. Concerning the control of the tank, we assume the following. The valves can be opened and closed by a human operator, but the tank is equipped with an automatic controller which can enable and disable these operations as well as operate the valves automatically. The heating is switched on and off by a supervisory control system so that the tank controller has no influence on it. To represent these assumptions in the C/E model of the tank, the condition input consists of two values: en = enable opening and closing of both valves, and dis = disable opening and closing of both valves. The event inputs are: opvl = open valve 1, clvl = close valve 1; opv2 and clv2 are similar for valve 2. The specific problem is to design a controller for the system described above such that, no matter how and when the operator opens and closes the valves, the process always shows a desired behavior described by the following four specifications: (1) The tank shall not overflow. (2) Both valves shall never be open at the same time. (3) The material shall not gelatinize in the tank. (This failure is possible if the material rests in the tank (both valves closed) and the heating is off.) (4) The heating coil shall not be damaged by overheating. (This can happen if the heating is on for some time while the tank level is low or empty.) It is easy to see that each specification listed above corresponds exactly to one state of the system model: (1) spill, (2) 00, (3) gel, and (4) burn. So, we can formulate the controller synthesis as an FSP and apply the approach introduced in the previous sections. The example nicely motivates a control policy combining condition and event signal feedback and illustrates the resulting phenomena including transient states. First, consider the state ocm in which the tank is being filled and has a medium level. For the controller, it is sufficient to know the state and map it back into a condition signal, g(ocm) = dis, disabling the operator from opening valve 2 and by this preventing a transition to the forbidden state 00. Event signal feedback as a response to a transition to ocm is not necessary: h(*, ocm) = 0. This is different from the situation when the level just reaches the range full. In the model for the open loop system, this corresponds to the spontaneous transition from ocm to ocf (Figure 8 shows a portion of the complete state transition graph). A reasonably designed controller would close valve 1 to prevent overflow in this situation. This means the controller would use the state transition information to force the system into state ccf (illustrated by the dotted arrow in Figure 8) and by this action preempt a spontaneous transition to spill. Forcing is only possible when event signals are fed back. In terms of C/E state feedback policies, the described control behavior corresponds to g(ocf) = en, h(e, ocf) = clwl. In state ccf, however, it is possible that the material will gelatinize (transition to gel). The controller therefore has to react on the transition ocf to ccf again and force the system to cof: g(ccf) = en, h(*, ccf) = opw2. In this case, no preemption by forcing is necessary and it is sufficient to disable the opening of valve 1: g(cof) = dis, h(o,cof) = 0. Figure 9 shows the logical sequence of states in the closed loop system resulting from the control policy described above. It is an admissible state sequence according to Definition 2.4. The transitions to the forbidden states are either disabled (to 00) or preempted (to spill and gel). There are two transient states in this sequence: ocf and ccf. In both cases, the transition leaving the states is forced as a reaction to the transition leading to the state. Figure 10 shows the state trajectory of the closed loop system composed by SEFC. All transient states are eliminated which results in just a spontaneous transition from ocm to cof. The time consumed by the sequence (ocm, ocf, ccf, cof) collapses to an instant.

State

Figure 8. Illustration transitions.

of cause-effect

Feedback

Control

relationship

between

Figure 9. Admissible state sequence ineluding transient states in the closed loop system.

An alternative of h(o,ccf)

to the state

= 0~~2.

This

feedback means

In this case, the spontaneous

feedback

policy

states

could not be applied

ocf and ccf.

this transition

trajectory

resulting

policy above would be to choose h(o, ccf)

because

= opal

in.stead

will open valve 1 as soon as the state

transition

to gel would be preempted, it causes

As soon as the tank becomes

the controller

forcing and forced state

Figure 10. State from SEFC.

the controller

reached.

171

a transient

as before,

cycle consisting

ccf

is

but the

of the two

full, valve 1 will be closed, and as a reaction

will force the opening

of valve 1 which again will result

to

in closing

valve 1, and so on. We now describe feedback

the first iteration the condition spill.

how the procedure

policies for the C/E STS

(k = 0) of the main repeat-untzl

generates

states

Xf

all admissible

= {spill,

loop, the algorithm

that for all states

next state.

This is represented

in which the tank is empty

will find for every state all by Q’(z).

and the heating

gel,

burn, oceh,

Consequently,

cceh, coeh}.

Now, the procedure

are forbidden,

the states

because

is no value of (u,u)

there

leads t,o Xf” = {spill,

represented

which can prevent

the heating

not being in Xf” with Q’(z)

with X;

The first line is for the states those

will determine

that

that a transition

with X; if oceh,

= {spill,

= Xf” and Q(z)

Table

X’ remains

too,

on. This will find

empty

1 shows the resulting

and a@(z).

in X;.

has to be implemented, by Q(z).

= Q’(z).

states,

The third iteration

= 0. Therefore,

00,

cceh, and coeh

from being switched

00, gel, burn, oceh, cceh, coeh, ace, ccc, toe}.

terminates

If a controller

will be started

it

is on, i.e., oceh, cceh,

ace, ccc, and toe have to be added to the set of forbidden

that there are no more states the iteration

the next iteration

s#tates

By doing this,

and coeh, @O(x) = 0 which means there is no value of (u, v) which will guarantee to burn does not occur.

state

00, gel, bzlrm}. In

and event input signal values (u, w) which will not allow any of the forbidden

oo, gel, or burn as a possible

will discover

ADMISSIBLE-CONTROLS

and the set of forbidden

This

we have to choose one unique state feedback

choice is restricted

by the constraint

that

policy from

the state

feedback

B. H. KROGH AND S. KOWALEWSKI

172

Table 1. @a(z) for the example resulting from ADMISSIBLE_CONTROLS. @(spill) = +(oo) = @(gel) = @(bum) = @(oceh) =

*(cceh)

= Q(coeh)

= Q(coe) = *(ax)

= @(toe) =

u x vi @(ocfh) = {(en, clwl), (dis, clvl)} Q(ccfh) = u x vo @(cofh) = {(en, cZw2), (dis, 0), (dis, clwl), (dis, opv2), (dis, cZv2)) @(ocf)

= {(en, clwl), (dis, @(ccf) = {(en, opal), (dis, +(cof) = {(en, cZw2), (dis, +(ocm) = {(en, clwl), (dis,

clwl)} opwl), (en, opw2), (dis, opw2)) 0), (dis, clwl), (dis, opw2), (dis, cZV2)) 0), (dis, opwl), (dis, clwl), (dis, CZw2))

@(ccm) = {(en, opwl), (dis, opwl), (en, opw2), (dis, opw2)) @(corn) = {(en, cZw2), (dis, 0), (dis, clwl), (dis, opw2), (dis, cZw2)) +(ocmh) = {(en, clwl), (dis, 0), (dis, opwl), (dis, clwl), (dis, cZw2)) @(cc&z) = {(en, 0), (en, opwl), (en, clwl), (en, opw2), (en, clw2), (dis, 0), (dis, opwl), (dis, clwl), (dis, opw2), (dis, cZv2)) @(CEO)= {(en,

0), (en, opwl), (en, clwl), (en, opv2), (en, cZ7~2), (dis, 0), (dis, opwl), (dis, clwl), (dis, opw2),

(dis, cZv2)) Q(comh)

= {(en,

cZw2), (dis, 0), (dis, clwl), (dis, opw2), (dis, clw2))

+(ocZ) = {(en, clwl), (dis, 0), (dis, opwl), (dis, clvl), (dis, clw2)) @(ccl) = ((en,

opwl), (dis, opwl), (en, opw2), (dis, 0~2))

+(coZ) = {(en,

cZw2), (dis, cZw2))

O(ocZh) = {(en,

clwl), (dis, 0), (dis, opwl), (dis, clwl), (dis, cZw2))

@(ccZh) = {(en, opwl), (dis, opwl)) +(coZh) = {(en, cZw2), (dis, cZw2)) Table 2. A cycle-free state feedback policy. F(spiZZ)

= F(o0)

= F(geZ) = F(bum)

= F( oceh) = F(cceh)

= F(coeh)

= F(coe)

= F(cce)

=

F(cm) =

{Cd& 0)) F(ocfh)

= {(en, clwl)}

F(ccfh) = {(en, 0)) F(cofh) = {(dis, 0)) F(ocf) = {(en, clwl)} F(4) F(cof)

= {(en, opv2)) = {(dis, 0))

F(ocm)

= {(dis, 0))

F(ccm) = {(en, opvl)) F(com) = {(dis, 0)) F(ocmh) = {(dis, 0)) F(ccrrzh) = {(en, 0)) F(z0) = {(en, 0)) F(comh) = {(dis, 0)) F(ocZ) = {(dis, 0)) F(ccZ) = {(en, opwl)} F(coZ) = {(en, cZw2)) F(ocZh)

= {(dis,

0))

F(ccZh)

= {(en,

opwl)}

F(coZh)

= {(en,

cZw2))

policy has to be free of transient cycles. Table 2 shows an example for a cycle-free state feedback policy. Here, the additional criterion was applied that event feedback shall be omitted whenever disabling of transitions by condition signal feedback is sufficient. We note that the state feedback policy in Table 2 leads to a cycle in which valve 1 is open until the tank is full, then valve 2 is open until the tank is emptied, and so on. The operator is not allowed to intervene once this

State Feedback

cycle is reached. assume

This is due to the simplifying

the controller

immediately

is not able to disable

in case it is turned the controller

the heater.

upon reaching

made in the model.

cannot

allow the operator

Eliminating

might

the empty

to avoid damaging

to interrupt

assumptions

for a more ‘complex system

we

This implies the tank must be emptied state

be turned

or enabled

the emptying

off; and the

together,

the heater

which means

or filling process

the other valve would also have to be enabled, these

In particular,

since the heater

on. We also assume the valves are disabled

the open valve because 00 st,ate.

173

assumptions

once it is full to avoid gelatitiizing,

tank must be refilled immediately

Control

by closing

allowing a transit,ion

would lead to a more realistic

feedback

to the

cont,rol policy

model.

5. CONCLUSIONS This

paper

C/E

systems.

tion

(SEFC)

presents

the formulation

A new feedback is introduced

set of state

trajectories

for closed-loop The difference

sequence responses

composition

The

state

servability framework, specifications

systems

and events,

C/E systems

of the SEFC feedback and observer

in practical

policies

can be measured.

is synthesis

the complete

of supervisory

for the closed-loop

solut,ion are suggested

controllers

for C/E systems

for general

of transition:;

and

above, whereas the feedback

The

constraints

feedback

for the example

developed

The

an instantaneous

loops as instantaneous

the state

composi-

systems.

and Krogh

loops in continuous-state are illustrated

for

from the trajecto-

reflects

of the sequence

system as described

design have not been completely

but steps toward

of Sreenivas

applications

assume

is different

problem

feedback

in practical

is that, the SEFC

views event feedback

control

event

control

under the SEFC

as an idealization

similar to algebraic

feedback

the sequential

using the algorithm

that would occur in a physical

for general

future research

composition,

between the approaches

of transitions

the set of valid trajectories, implications

systems

of a state

the effect of forcing

for closed-loop

rics obtained

control

system

to represent

C/E systems. causal

and solution

The

in Section

4.

problems

for the general in [7]. Another

on

systems. of stat{:

ob-

C/E system direction

to satisfy sequential

for

control

system.

REFERENCES I. P.J. Ramadge and W.M. Wonham, Supervisory control of a class of discrete event processes, SIAM .J. Control Optim. 25, 206-230 (1987). 2. R.S. Sreenivas and B.H. Krogh, On condition/event systems with discrete state realizations, Dzscrete Event Dynamic Systems: Theory and Applications 1, 209-236 (1991). 3. B.H. Krogh, Condition/event signal interfaces for modeling and analysis of hybrid systems, In proc. 8’” International Symposium on Intelligent Control Systems, Chicago, IL, June 1993, pp. 180-185. 4. C.H. Golaszewski and P.J. Ramadge, Control of discrete event processes with forced events, In Proc. 2eh Con& on Decision and Control, Los Angeles, CA, December 1987, pp. 247-251. 5. S. Kowalewski and H.-M. Hanisch, Permissive control of Boolean condition/event systems: Synthesis and limits, In Preprints gth International Symposium on Intelligent Control, Columbus. OH. August 1994, pp. 118--123. G. H.-M. Hanisch and S. Kowalewski, Algebraic synthesis and verification of discrete supervtsory controllers for forbidden states specifications, In Renssellaer’s 4 th Int. Conf. on Comp. Integrated Manufacturznq and Automation Technology, Troy, NY October 1994, pp. 157~162. 7. S. Kowalewski, Synthesis of static controllers for forbidden states problems in Boolean C/E systems using the Boolean differential calculus, In 11 th Internatzonal Conference on Analysis and Optimzzation of Systems, Sophia-Antipolis. France, June 1994, LNCIS 199, pp. 122-128, Springer-Verlag, London, f 1994).