Structural identifiability of the parameters of a nonlinear batch reactor model

Structural identifiability of the parameters of a nonlinear batch reactor model

Structural Identifiability Batch Reactor Model of the Parameters of a Nonlinear MICHAEL J. CHAPPELL AND KEITH R. GODFREY Department of Engineering, ...

534KB Sizes 1 Downloads 59 Views

Structural Identifiability Batch Reactor Model

of the Parameters of a Nonlinear

MICHAEL J. CHAPPELL AND KEITH R. GODFREY Department of Engineering, University of Warwick, Coventry,

CV4 7AL, England

Received 20 March 1991; revised 17 August 1991

ABSTRACT The similarity transformation approach is used to analyze the structural identifiability of the parameters of a nonlinear model of microbial growth in a batch reactor in which only the concentration of microorganisms is measured. It is found that some of the model parameters are unidentifiable from this experiment, thus providing the first example of a real-life nonlinear model that turns out not to be globally identifiable. If it is possible to measure the initial concentration of growth-limiting substrate as well, all model parameters are globally identifiable.

1.

INTRODUCTION

In this paper, the structural identifiability of the parameters of a model of microbial growth in a batch reactor is considered. The model incorporates Michaelis-Menten-type nonlinearities and has previously been considered in this journal by Holmberg [5]. A comprehensive review of the kinetics of microbial growth processes is given by Button [ 11. Microbial growth in a batch reactor can be expressed as

(14 Pm=

‘= - Y(K,+s)



P-4

where x is the concentration of microorganisms; S, the concentration of growth-limiting substrate; pm, the maximum velocity of the reaction; K,, the Michaelis-Menten constant; Y, the yield coefficient; and K,, the decay rate coefficient. The experiment considered is one where the microorganisms are prepared in a mixture in which the concentration can be controlled, and the

MATHEMATICAL

BIOSCIENCES

108:241-251 (1992)

OElsevier Science Publishing Co., Inc., 1992 655 Avenue of the Americas, New York, NY 10010

241 0025-5564/92/$05.00

242

MICHAEL

substrate is also prepared controlled. Then, at time reactor; it is assumed that can reasonably be regarded t 2 0,

J. CHAPPELL

AND KEITH

R. GODFREY

in a mixture in which the concentration can be t = 0, the two mixtures are put into the batch this is a rapid procedure, so that the two inputs as impulsive. Thus the model equations are, for

i=$$-K,x+b,A(t), s

j=

-

pL,sx + bpqf),

Y(K,+s)

that is, the system can be considered as having zero initial conditions with impulsive concentration inputs of size b, and b,. In principle, the preliminary measurements (i.e., before the substances are placed in the reactor) allow b, and b, to be determined, but here it will be assumed, at least initially, that they are not known. With modern instrumentation, it should be possible to measure b, in almost all cases, but the measurement of b, depends on the exact nature of the growth-limiting substrate, and b, will be difficult to measure accurately in many cases. Holmberg [5] used the Taylor series approach [8] to analyze the structural identifiability of the parameters of this model on the assumption that both concentration of microorganisms x(t) and substrate concentration s(t) could be measured within the reactor (i.e., for t > 0). Both b, and b, are then globally identifiable from the values of x(t) and s(t) at t = O+, and by examining the expressions for the first and second derivatives of x(t) and s(t) at t = 0+ it is readily shown that the four parameters CL,,,, K,, K,, and Y are globally identifiable from the experiment. It is usually easy to measure x(t) within the reactor by means of, for example, optical density (absorbence) methods, but it is often difficult to obtain a measurement of s(t) within the reactor (i.e., for t > 0). In this paper, therefore, we consider identifiability of the parameters of the model of Equations (2a) and (2b) with measurement of x(t) only, that is, with observation y(t) = x(t) for t > 0. Thus the unknown parameter set is where x,, ( = bi) and sO ( = b2) are the initial {x,,,+,,P,,K,, Y,K,), concentrations of microorganisms and substrate, respectively. The Taylor series identifiability approach has been applied to this model and experiment [2]. Clearly y(t) at t = O+ is equal to x0, so that x,, ( = b,) is globally identifiable, reducing the number of unknowns to five. Thus at least five derivatives of y(t) are required to resolve the question of identifiability using this approach. More may be required because, as far as we are aware, there is no known upper bound to the number of derivatives required to obtain the maximum number of independent equations from this approach for the type of nonlinear system under consideration.

NONLINEAR

BATCH REACTOR

MODEL

243

Even with the help of a modem symbolic manipulation package (MATHEMATICA), the equations in the derivatives of y(t) at t = O+ fail to yield conclusive identifiability results. The situation is still not resolved if the additional assumption is made that se ( = b2) is known. The source of the problem is that three of the four remaining unknown parameters (pm, K,, and Kd) enter into the first derivative of the observation, while all four appear in the second and higher derivatives and in such a way that makes it extremely difficult to make progress toward a solution. In this paper, therefore, an alternative approach to structural identifiability analysis of nonlinear systems will be applied. This is the similarity transformation approach [3,10,11], which, as far as we are aware, is the only other approach to global identifiability analysis for systems with specified inputs. The basis of the approach has been described in previous papers in this journal [3,11], but it will be summarized briefly in the next section before proceeding with the particular application. 2.

SIMILARITY TRANSFORMATION STRUCTURAL IDENTIFIABILITY

The approach considers structural nonlinear models of the form

APPROACH ANALYSIS identifiability

TO of the parameters

of

~(t,P)=f(x(f,P),P)+u(t)g(x(t,P),P), J+,P) = +(t,P)J+ x(0, P) = X0(P) 7

(3)

where tE[O,T], x(t, ~)ER”, y(t, p@Rm, u(t)EU[O, T], and peQ c Rq, Q being the feasible parameter space. The approach is an extension of the exhaustive modeling approach for linear systems [12, 131, with the extension making use of the local state isomorphism theorem [4, 6, lo]. Before proceeding with the identifiability analysis using this approach we need to establish whether the system is controllable and observable (i.e., minimal) in the sense described by Hermann and Krener [4]. In several cases of practical interest, the minimality conditions for nonlinear systems can be established in the same manner as those for linear systems. This is a consequence of Lemma 1 in [ 111, which states that if at a particular p* in Q, the system described in Equations (3) becomes linear, and if this linear system is controllable and observable, then so is the corresponding nonlinear system (at x0 = 0), for almost all p in B (i.e., except on a set of measure zero in Q). However, if this result does not apply, then the controllability and observability rank criteria described below must be satisfied.

244

MICHAEL J. CHAPPELL AND KEITH R. GODFREY

CONTROLLABILITY

Consider,

RANK CRITERION

(CRC)

for vector fields (a’ and (o*, the Lie bracket [(o’ , (p*] defined

by

where 8 cp’/ a x denotes the Jacobian matrix of (oi, i = 1,2. For a piecewise constant control ui, the vector field is (oi is defined by pi=f(x)+Uig(x),

i=1,2,3

.

,...

(5)

Consider then the Lie algebra f , which has elements that can be represented by finite linear combinations of elements of the form

Note that g is in f . Let f(x) denote the space of vectors spanned by the vector fields of f at x. The system of Equations (3) is said to satisfy the controllability rank criterion (CRC) at x0 if the dimension of f (x,) is n [4]. OBSERVABILITY

RANK CRITERION

(ORC)

Consider a system with a single output; the output function h(x) in Equations (3) is assumed to be continuously differentiable in x. The Lie derivative of h along the vector field cpi is defined by

&i(h)(X) where dh(x)

= dh(x).cp’(x),

denotes the gradient vector field

ah(x)

dh(x) = [

Consider the space of combinations of elements

a X”

I

l-forms dg of the form

d(L,i(L,i-I(...

ah(x)

-J-y,“‘,--

whose

&(h(x)))...

I .

elements

)))

are

finite

linear

(9)

Note that dh is in dg . Let da(x) denote the space of vectors obtained by evaluating the elements of dg at x. The system of Equations (3) is said to satisfy the observability rank criterion (ORC) at x0 if the dimension of dg ( x0) is n [4]. The concept extends readily to multi-output systems.

NONLINEAR

BATCH REACTOR

MODEL

Finally, the system of Equations (3) is said to be locally (minimal) at x0 if it satisfies both the CRC and the ORC at x0. STRUCTURAL

245

reduced

IDENTIFIABILITY

The identifiability of the system of Equations (3) is first studied at some parameter value p in Q in the experiments specified by (x,,, U[O, T]), where the initial state x,, is well defined after selecting p and is generally parameter-dependent; note that, for the moment, u( t)~ U[O, T], the set of bounded and measurable functions defined on the time interval [0, T]. The notation p - p denotes that the parameter values p, p in Q are indistinguishable in the experiments (x0, U[O, T]). The system is globally identifiable at p E Q if d - p implies that j = p, and it is locally identifiable if there exists an open neighborhood W of p in s2 such that d - p for p in W implies that d = p. The method for testing for the identifiability of the unknown parameters is given in Theorem 1, the proof of which is given in [ 111. THEOREM

I

Assume that the system of Equations (3) is locally reduced at x,,(p) for almost all p in Q. Consider the parameter values of p, p in $2, an open neighborhood V of x0(p) in R”, and an analytical mapping X: V -+ R” defined on V G R” such that

ax(x)

(i) Rank 7

for all 2’EV

= n

(ii) Wxo(P)) = x0(P)

(iii) f(x(% g(h(Z),

(10) (11)

ax(n) P)=

an

UW

f(n,P)

p)= %$Qg(R,

h(X(% PI= 42, i%

P)

(12b) WC)

for all 2 in V. Then there exists T > 0 such that the system of Equations (3) is globally identifiable at p in the experiments (x,(p), U(0, TJ) if and only if conditions (i), (ii), and (iii) imply p = p. For a globally identifiable model, the only value fi in Q satisfying Equations (lo)-(12) is j5 = p, for which A(Z) = 2, that is, the identity mapping on R”. If the model is not globally identifiable to every solution @ # p of Equations (lo)-(12) there corresponds a local isomorphism X(Z) # 2. If the system is not locally identifiable at p, then every open neighborhood of p contains a point j such that j5 - p; thus the number of solutions is infinite. From Corollary 1 of [ 111, the above result can be extended to systems

246

MICHAEL

with specified input if all the following (a) vector (b) (c) is, the 3.

J. CHAPPELL

conditions

AND KEITH

R. GODFREY

hold:

g(x, p) = b(p); that is, the input is multiplied by a constant b. f(0, p) = 0; that is, with no input, the system stays at rest. X( 2) = TZ, where T is an n x n constant nonsingular matrix; that equivalence transformations are linear and time-invariant.

APPLICATION MODEL

OF THE APPROACH

TO THE BATCH REACTOR

It is noted that the system of Equations (2a) and (2b) is of the form of Equations (3) with g = b = (b,, b,)* and u(t) = s(t). In order to apply Theorem 1, it is necessary to show first that the system of Equations (2a) and (2b) with observation y = x is locally reduced. The system becomes linear if pm = 0, but this linear system is not observable, so it is necessary to check that the CRC and ORC are satisfied. To check the CRC, initially we have the vector b = (b,, b,)T# (O,O)T. Then consider the vector field cp’ = f and the Lie bracket PCLmS --K, K,+s

[f,b]= -

which, when evaluated

LLms Y(K,+s)

-

at (0,O) yields

[fJ](O)=(&h&

(14

Since K,b, # 0 generically, this implies that b and [f,b](O) are linearly independent so that dim f(0) = 2 and the CRC is satisfied. To check the ORC, initially we have the vector h = [ 1, 01. Then consider the Lie derivative L#z)=(l

O).f=+&K,x, s

(15)

so that

which when evaluated at (0,O) yields d(L,(h))(O) = (- K,,O), which is not linearly independent to (l,O). Denoting F = d(LJh)), we then

NONLINEAR

BATCH REACTOR

247

MODEL

consider the Lie derivative L&2)=(1

O).F=~-K,,

(17)

s

so that

which when evaluated at (0,O) yields d( LF( h))(O) = (0, pm /KS). Since pm /K, # 0 generically, this is linearly independent to (l,O), so that dim dg(0) = 2 and the ORC is satisfied. The system is thus locally reduced at (O,O), and we can proceed with the identifiability analysis using Theorem 1. IDENTIFIABILITY

ANALYSIS

As with the Taylor series analysis, we start by assuming that the inputs b, and b, are unknown. Thus the unknown parameter vector is p = (b,, b,, pm, K,, Y, KJT. As before, y = x for t > 0. The state isomorphism being sought is denoted by h= (X,, &)r with arguments (2, QT. Initially, it is assumed that u( t)~U[0, T]. From Equation (12c), we have, for the system of Equations (2a) and (2b) 3

[l

o]

[;I=11

0]

[;I,

which implies that A,=.%,

(20)

so that

ah,

an=’ From Equation

and

ah

x=0.

(21)

(12b), we have

which implies that b, = &,

(23)

MICHAEL J. CHAPPELL AND KEITH R. GODFREY

248

and

Using the fact that A, = 2, the first equation

of (12a) gives

(25) This equation is true for all 2, so that 5? cancels out; hence a& /a 5? = 0, giving, from Equation (24),

ax,

b,

==z, Substituting

in the second equation

From Equations

Multiplying

(26)

of (12a),

(25) and (27), with 2 cancelled

out,

through by Z?S + S,

~(~)p,s-K,(~,+s)=~,S-R,(R,+s).

(29)

2

By comparing

coefficients

and since, generically,

we have from the constant term

kS is nonzero.

K, = Ed.

(30)

From the term in S, b,/Y=&,/i?

(31)

NONLINEAR BATCH REACTOR MODEL

From Equations

249

(27) and (31),

so that

(32) But aX2 /a$=

b2 /“b2, so that, equating terms in S-*, Pm = Pm.

There is then no term in s”; equating

(33)

constant terms,

(34) Thus, from this experiment, the individual parameters b,, pm, and K, are globally identifiable and so are the ratios b, / Y and b, /KS, but the individual values of b, , K,, and Y are unidentifiable. If the additional assumption is made that b, is known, then K, and Y are also globally identifiable. The transformations h in both cases are linear, so that condition (c) of Section 2 as well as conditions (a) and (b) apply for these models and experiments. Hence the results also apply to the specified form of input, that is, with impulsive input of microorganism and substrate. 4.

CONCLUSIONS

The case with b, unknown is of particular interest because it is the first example of a real-life nonlinear model that turns out not to be globally identifiable. The source of the unidentifiability is that 2 cancels out in Equations (25) and (27), thus reducing the number of coefficients that can be equated. The first generation of unidentifiable nonlinear models consisted of examples that were trivial mathematically, for example, with parametric parts that were not connected to the outputs. The second generation, which included the bilinear models of Examples 6 and 8 of [7] and the polynomial model of Example 1 of [l l] (considered earlier in [9]), was no longer trivial, but the examples were still purely artificial exercises. The example in the present paper could be regarded as the start of the third generation of unidentifiable nonlinear models, because the model is widely used in practice and was considered earlier in identifiability studies [5].

250

MICHAEL J. CHAPPELL AND KEITH R. GODFREY

The experiment with s(t) unknown has been considered precisely because in many cases it is difficult to obtain a measurement of substrate concentration within a reactor. In some cases (depending on the nature of the substrate), it is possible to determine b, from a preliminary measurement (i.e., before the substances are placed in the reactor), and if b, is known, the parameters pm, K,, K,, and Y are globally identifiable. If only x is observed (but with b, = so known), Taylor series identifiability analysis proves very complicated [2], and it does not appear to be possible to achieve a conclusive result even with the help of a symbolic manipulation package. In contrast, the similarity transformation analysis has proved to be straightforward, once the controllability and observability rank criteria have been checked. This is by no means always the case, and in general it is far from easy to predict which approach will prove the easier for a particular example [3].

The work described in this paper is financed by grant GR /F 6381.7 from the U.K. Science and Engineering Research Council. We are grateful to Dr. David Hodgson of the Department of Biological Sciences at the University of Warwick for advice on the problems of measuring microbial growth in batch reactors.

REFERENCES 1

D. K. Button, Kinetics of nutrient-limited transport and microbial growth, Microbio. Rev. 49:210-291 (1985). 2 M. J. Chappell and K. R. Godfrey, Structural identifiability of nonlinear systems: application to a batch reactor model, 9th IFACIIFORS Symposium on Zdentifcation and System Parameter Estimation, Budapest, 8-12 July 1991. 3 M. J. Chappell, K. R. Godfrey, and S. Vajda, Global identifiability of the parameters of nonlinear systems with specified inputs: a comparison of methods, Math. Biosci. 102:41-73 (1990). 4 R. Hermann and A. J. Krener, Nonlinear controllability and observability, IEEE Trans. Autom. Control AC-22:728-X0 (1977). 5 A. Holmberg, On the practical identifiability of microbial growth models incorporating Michaelis-Menten type nonlinearities, Math. Biosci. 62:23-43 (1982). 6 A. Isidori, Nonlinear Control Systems: An Introduction, Springer-Verlag, New York, 1985. 7 Y. Lecourtier, F. Lamnabhi-Lagarrigue, and E. Walter, Volterra and generating power series approaches to identifiability testing, in Identifiability of Parametric Models, E. Walter, Ed., Pergamon, Oxford, 1987, Chapter 5, pp. 50-66. 8 H. Pohjanpalo, System identifiability based on the power series expansion of the solution, Math Biosci. 41:21-33 (1978).

NONLINEAR BATCH REACTOR MODEL 9

10 11 12 13

251

S. Vajda, Identifiability of polynomial systems: structural and numerical aspects, in Identifiability of Parametric Models, E. Walter, Ed., Pergamon, Oxford, 1987, Chapter 4, pp. 42-49. S. Vajda and H. Rabitz, State isomorphism approach to global identifiability of nonlinear systems, IEEE Trans. Autom. Control AC-34:220-223 (1989). S. Vajda, K. R. Godfrey, and H. Rabitz, Similarity transformation approach to structural identifiability of nonlinear models, Math. Biosci. 93:217-248 (1989). E. Walter, Identifiability of State Space Models (Lect. Notes Biomath. Vol. 46), Springer-Verlag, New York, 1982. E. Walter and Y. Lecourtier, Unidentifiable compartmental models: what to do? Math. Biosci. 56:1-25 (1981).