Chapter 10 Estimation of the membership function

Chapter 10 Estimation of the membership function

C h a p t e r 10 Estimation of the membership function People think that mathematics is precise and exact. It seems that worldwide scientists and en...

1MB Sizes 0 Downloads 60 Views

C h a p t e r 10

Estimation of the membership function

People think that mathematics is precise and exact. It seems that worldwide scientists and engineers focus their attentions on finding the most exact numbers in their studies with errors smaller than 0.1%, 0.01% and even 0.001%. On the other hand, vague values and fuzzy language are more often used. Words like "tall" in "he is tall", "fat" in "she is fat" are all fuzzy words without clear definitions. 190 cm is definitely tall, but 175 may be tall or may not. Being tall cannot be measured by single index like height, and it is also influenced by one's figure, weight, face or even clothes. Mathematics is not unable to handle fuzzy phenomena. There is a mathematics branch, called fuzzy set theory able to describe such fuzzy things. Fuzzy set theory is introduced into mathematics by Zadeh in 1970s (Zimmermann, 1985). Since then, fuzzy set theory has been applied to language studies, control etc. The most important concept in fuzzy set theory is the membership function. Although fuzzy set theory and statistical estimation are two different branches of mathematics, the former can also be studied by use of the latter. So in this chapter, the method introduced in previous chapters is applied to determining membership functions. The membership function is usually determined by the user in applications. It seems that direct determination of the membership function based on sample data was proposed by Fujimoto et al (1994) and Zong et al (1995). 10.1 Introduction

In traditional set theory, the boundaries between two sets are crisp, meaning that an element is either in set A or set B, but not in both. In Figure 10.1(a) is shown three crisp sets intervals A~,A2,A 3 on the real line, that is, A~ = [0,30], A2 = (30, 70],

A3 =

(70,100]. 237

238

10. Estimation of the membership function

Al

A2 .

.

.

.

A3 .

.

.

.

9.:::. ~ i:::.li: ):i::..:.:.:.i:i:i:i 30 P

X

,,

v

70

(a) Crisp sets

20

40

60

80

(b) Fuzzy sets

Figure 10.1 Crisp sets vs. fuzzy sets If we introduce step functions defined by 1

if

x ~ [0,301

gt,(x) = 0

/f

x ~ (0,301'

1 /f P2(x)= 0 /f

x~(30,701

/13(x) =

10 /f /f

(lO.la)

xr

(10.1b)

xe(70,1001 x ~ (70,100]'

(1O.lc)

then AI, A2 and A 3 may be redefined by /~ = {x" ~,(x) ~ 0} , i= 1,2,3.

(10.2)

In words, ,4, is a collection of those points which do not make the function/~, vanish. We give a special name membership function to/t,, which is characterized by

(1)

/~, (x) > O,

(10.3a)

3

(2/

~ fl, (x) = 1.

(10.3b)

t=l

We do not have any reason to say that membership functions p, must be step functions. In fact, functions satisfying equations (10.3) can be used as membership functions. For example, we may define/1, by

10.1. Introduction

0

x<20

2 ( x - 2 20 0)....

20
1- 2 ( x - 4 0 ) 2 30
20

2 ( x - 27 20 0)

~,(x) =

50 < x <

70
1

x<20

60

l - 2 ( x - 2 0 ; 2 20
0

(lO.4a)

60
0

2 ( x - 24 20 0)

239

(10.4b)

30
0

(x-5o)

2k, 20

/~(x) =

l'

x<50 (10.4c)

-

20' 1

50
These functions are plotted in Figure 10.1 (b). Similar to Equation (10.2), we define three sets, specially written in the form of AI, A2 and _'J3, by _A, = {x : ,u, (x) 4: 0}, i = 1, 2, 3.

(10.5)

They are fuzzy sets. For crisp sets, if point P e A2 , then P r A~ and P r A3 , see figure 10.1(a). But for fuzzy sets, a point can be in two or even more sets

240

10. Estimation of the membership function

simultaneously. For Point P in figure 10.1 (b), both Pl * 0

and P2 s 0 .

Thus P e A! and P ~ A2 from definition (10.5). So fuzzy sets do not have clear boundaries. Detailed presentation of the theory can be found in Zimmermann (1985) One of prominent applications of fuzzy set theory is in the field of quantitative description of language variables due to the fact that language itself is fuzzy. Take age for instance. "Young", "middle-aged" and "old" are such language variables that their boundaries cannot be clearly defined. A person being 30 years old may either be young or be middle-aged. It is hard to draw a clear line between "young" and "middle-aged", so is "middle-aged" and "old". Therefore, ~ = young, A2 = middle- aged and A3 = old are three fuzzy sets. If we want to describe language variables like "young", "middle-aged" and "old" in the framework of crisp set theory, we have to define two critical numbers. For example, 30 years old and 70 years old are defined as the two critical numbers. Below 30 is young and above 30 is middle-aged. Below 70 is middle-aged and above 70 is old. This is in fact what is shown in Figure 10.1 (a) and defined in equation (10.1). Such definition is easy, but a little wired due to the absurdness that a person one day past his thirtieth birthday is middle-aged and another person one day to his thirtieth birthday is young. They are, however, may be only two days different in age. In fuzzy set theory, 30 years old is considered half young and half middle-aged. It may be interpreted at two levels. At the first level, half young and half middle-aged means that he is at the transition stage of life from young to middle-aged. At the second level, half young and half old means that if a survey is made of those 30 years old, half of them may be considered young while half of them are considered middle-aged. In this sense, fuzzy sets with unclear boundaries are in fact more informative and reasonable than crisp sets. This is schematically shown in Figure 10.1 (b) and defined by equation (10.4). Once the rationale for fuzzy sets is justified, the remaining question is how to determine the membership functions. Equation (10.4) and Figure 10. l(b) is in fact a way to define the membership functions for these three fuzzy sets. How to find them? To answer the question is more difficult than to ask. Over the years, several forms of functions have been employed for defining membership functions. Those in Figure 10.1 are two examples. In general, the forms of membership functions may be problem-dependent. Even for the same problem, they are also user-dependent. For the same problem, two users may use quite different forms of membership functions. Take height for instance. In Figure 10.2 are shown two different forms of membership functions for assessing one's height. Three fuzzy sets are defined: "short", "medium" and "tall". One may choose those in figure 10.2 (a) for membership functions and the other may well choose those in Figure 10.2.(b) as membership functions. Using different forms of membership functions will definitely influence the subsequent

10.1. Introduction short

medium

tall

241

short

medium

tall

Height (cm)

Height

(cm) v

150

160

170

180

160

150

170

180

Figure 10.2 Two different membership functions

M

~OO

$

OON

20 (a) randomly prepared triangles

9

30

40

50

60

70

80

(b) fuzzy sample

Figure 10.3 Experiment on the perception of triangular area by a student assessments. Therefore, a systematic methodology is needed to build and determine membership functions on an objective base. Membership functions/-6 (x) may be interpreted as the probability of a point to be in a specific fuzzy set. That is, for a point x, the probability for it to be is p, (x). Note that p~ (x) + kt2(x) + P3 (x) = l, meaning that a point must be in a set. In summary, the membership function lying at the heart of fuzzy set theory serves as two purposes. They define fuzzy sets themselves and they determine the probability of a point belonging to a specific fuzzy set. A membership function is in fact a probability density function determining the possibility for a point to be in a fuzzy set. Therefore, the methods proposed in the previous chapters are also applicable to the determination of the membership functions.

242

10. Estimation of the membershipfunction

By aid of measured entropy analysis, the membership function can be determined in an objective way.

10.2 Fuzzy experiment and fuzzy sample To statistically determine membership functions, we need to design a fuzzy experiment. A fuzzy experiment is slightly different from an ordinary statistical experiment. Fujimoto (1994) designed a fuzzy experiment. Here his experiment procedure is slightly adapted for general purposes.

10.2.1 How large is large? A survey designed to collect the information about people's perception of fuzzy concept like "large" and "small" is to prepare sixty one triangular sheets, the areas of which ranged from 20 cm 2 to 80 cm 2, to say. The side lengths and interior angles of the triangles were randomly determined so that all the triangles had different shapes, as shown in figure 10.3 (a). Three reference triangles, representing "large" with actual area 70, "medium" with actual area 50 and "small" with actual area 25, are also prepared. In the survey, the three reference triangles are shown to the people under test first. Then these triangular sheets are shown to each person in the survey one by one. For each sheet, he or she is requested to assess it using "small", "medium" or "large". After all the sheets are shown to him, a fuzzy sample shown in Figure 10.3 (b) is obtained. In the figure, the horizontal axis is the real triangle area and the vertical axis is the classification result. The experiment may be repeated to another person, and so on. Finally, a fuzzy sample as shown in Figure 10.3 (b) is obtained, from which membership functions are to be estimated. What is remarkable in the figure is that there are overlapping regions among each class. That is, the boundaries among each class are fuzzy. So, the data are called fuzzy data. The fuzzy data roughly indicate the correlation between the linguistic expressions and the physical quantity under consideration. The question asked at the very beginning of this section on "how large is large" can now be answered based on the data in Figure 10.3. For this problem, "large" is a set ranging roughly from 60 to 80 cm 2. It is clear from the figure that triangles with area below 55 cm 2 has never been classified as "large". Thus, it is safe to say that triangles with area above 55 cm 2 are large while those between 55 and 65 cm 2 are in transition stage from "large" to "medium".

10.2.2 Fuzzy data in physical sciences The above example is somewhat arbitrary because the experiment has been performed on humans. In physical sciences, however, such fuzzy data are also

10.2. Fuzzy experiment and fuzzy sample

243

available. Fluid flow in a pipe is a topic which has been of extensive interest in fluid mechanics. A parameter describing the flow state is called Reynolds number Re defined by Re =

Ud

(10.6)

V

where U is maximum flow velocity in the pipe, d is the diameter of the pipe and v is the kinetic viscosity of the fluid in the pipe. For water, v = 10-6 .

Turbulent AAAA AAA

A

90 O 0 0 ( X ~ Laminar v

logRe Turbulent Figure 10.4 Fuzzy data for Laminar and Turbulent flow Reynolds number Re dominates flow state in the pipe. If it is small, the flow is laminar, a state fluid particles smoothly flow in the pipe. If it is large, the flow is turbulent, a state fluid particles irregularly flow in the pipe. It has been experimentally found that transition l~om laminar flow to turbulent flow is not unique. Over a wide range of Reynolds number, the transition may happen depending pipe wall smoothness. If the pipe wall is very smooth, the transition does not occur until Re=40000, while if the pipe wall is rough, the transition may occur around 2100. When many experimental results are plotted in one single figure, we obtain fuzzy data schematically shown in Figure 10.4. Note this figure is schematic rather than physically accurate. Fuzzy set theory also provides us with a new look at some old problems. A rod under compression load P may become unstable suddenly as the load gradually increases. This is a well known buckling problem in mechanics of materials. The critical load at which the rod becomes unstable is theoretically a deterministic value, but in real-world application is random. If many rods of same size and same material are tested, we obtain data showing large scatters

10. Estimation of the membership function

244

around the theoretical critical value of the rod. From the viewpoint of mathematics stood away from physical background, fuzzy data in Figures 10.4 and 10.5 are no different in nature.

Figure 10.5 Fuzzy data for Stable and Unstable rod

Quite some examples in a variety of engineering fields and science disciplines can be given showing similar patterns as in Figures 10.3---10.5. All these phenomena can be treated using fuzzy set theory. But figure 10.3 and Figures 10.4---10.5 are slightly different in that the latter is physics-based phenomena requiring membership functions to be objectively determined while the former is human-related the membership functions of which are better to be determined objectively. This justifies the need to find membership functions in an objective way.

10.2.3 B-spline Approximation of the membership functions In this section, we will determine the membership function through the fuzzy data obtained in the previous section. Suppose the universe of discourse is X. Let x~,(e = 1,2,...,x,) be a sample from X. Further, suppose the fuzzy sets are A, ,

A2 ,

...,

AM and the corresponding membership functions are

/~(x),/u2(x),...,/~ M(x). In the triangle experiment, x, is the area of a triangle and A,, A2 and A3 correspond to "Small", "Medium" and "Large", respectively. As done in the previous Chapters, we again assume that the membership function/1 (x) can be expressed in the form of a linear combination of B-spline functions in the universe of discourse X, i.e.,

10.2. Fuzzy experiment and fuzzy sample

245

~, (x) = a,,B, (x) + a,,B~ (x) +... + a,~B~ (x) ~, (x) = a,, B, (x) + a~B~ (x) +... + a ~ B ~ (x) ...

(10.7)

/.tM (x) = aM,B , (X) + aM2B 2 (X) +'" + aMNBN (X) In concise form we have N

kt,(x) = Z o t , , B , ( x ) , i= 1,...,M

(10.8)

/=1

where N is the number of B-spline functions which consist of the membership functions, a, are the combination coefficients, and B~ (x) is the B-spline functions of chosen order and is of the following form if order 3 B-spline function is chosen 3 (x., - x) 2H(x,+, B,(x) = (x,-x,_3)~--' ~ +' -X).H(x-x,), w'(x,+,) t=0

(10.9)

3

(lO.lO)

w, (x) = l-[ ( x - x~+,) . t=O

Now we are to determine the parameters a, . From the process of classification, the membership function ,u,(xr) is regarded as same as the probability that a sample point x~ is classified into fuzzy set ~ , that is, Pr[x, ~ ,4,] =/t, (xc).

(10.11)

Therefore, we employ the likelihood analysis for the determination of the membership functions. The probability of the classification event for all the sample points x~(e = 1,2,-..,n, ) is expressed by the following likelihood function.

L= H

FI

FI

xt ~dl

xt eA_2

xr eA_,w

(10.12)

The log-likelihood function is M

/, = logL = ~ ~ log/~,(xr). ~=1x~~4,

(lo.13)

10. Estimation of the membership function

246 M

M

N

N

M

where ~--'p, (x)= Z ~-" o,,,B, (x)= Z ( Z a , , ) B , (x)= 1. t=l

t=l

I=i

/=l

t=l

According to the B-spline function properties we have N

~-" B,(x) = 1.

(10.14)

/=!

From the above two equations the following relationship is obtained, M

~--'c% = 1 , j = 1,...,N.

(10.15)

t=!

A membership function is always greater than or equal to zero. To guarantee this we simply impose the restriction that all parameters a,, are greater than or equal to zero, that is, a',, >0

i= 1,...,M;j = 1,...,N.

(10.16)

Usually, we hope r is a decreasing function of x and /tM(x ) is an increasing function. It is obvious that these are guaranteed by the following equations:

>crl, /+1

t~'l I - -

~

j=l,

"'''

N-l,

C~M~< C~M,~+i , j = 1,..., N - 1.

(10.17a) (10.17b)

Based on the above analyses the best estimates of the unknown parameters a,j must satisfy M

/, = ~ ~ log/.t, (x,.) ~ max

(10.18)

a=i x~ ~d,

subject to M

~ a , , =1 I=!

j=I,...,N,

(10.19a)

10.3. ME analysis a,j >__a~.,+,

t)t~ <_aM.,+' a,, >0

j = 1,...,N-1,

247 (10.19b)

j = 1,...,N-I,

(10.19C)

i= I , . . . , M ; j = 1,...,N.

(10.19d)

This optimization model has a good property. The optimum solution a,jo is unique (the proof is given in appendix 10.A). That is, if we can find a local maximum point a,~0 by some method it must be the global optimum solution. The optimization method like the Flexible Tolerance Method introduced in Himmeiblau (1972) may be employed to solve the maximization problem ( 10.8)~(10.9). 10.3 ME analysis

In the above optimization problem, the optimization parameters are N and

a,j. If N is fixed, this problem can be solved by ordinary nonlinear programming methods. Because N is also an optimization parameter, measured entropy analysis must be used. Consider a fuzzy sample of size n,. n) denotes the number of unknown parameters for the i-th fuzzy set. Without equation (10.15), n) would be equal to N . Equation (10.15) is an interlink among the M fuzzy sets. It is hard to distribute these N equalities among the M fuzzy sets. We may, however, avoid this difficulty using the method in the following. The entropy for i-th fuzzy set is (10.20)

H, = - I,u, (x) log,u, (x)dx.

The corresponding asymptotically unbiased estimator of the measured entropy

t

3n t

ME, = - I/~, (x[ c?) log/t, (x[ c?)dx + - ~ 2 n,

9

(10.21)

Then the total measured entropy should be M

MI

ME=~-'H,,=, =-)-",_,

3 nf /~,(xl c~)log/~,(xl c~)dx+~,=, n,

(10.22)

248

10. Estimation o f the membership function

From the definition, the second term become

Z' 3 ,=l nf M

M

t

2=

2

3 (M-1)N

n,

2

(10.23)

/7

Therefore, the best N solves the minimization problem M

ME = - ~

,=l

q,

~fl

3 (M-1)N

, (xl&)log ~,, (x [ d')dx + -

2

n

(10.24)

If AIC is used, M

AIC = - m a x {L} + ~ n't = - m a x {L} + N ( M - 1) ~ min

(10.25)

t=l

where the number of free parameters n t = M x N - N

because there are N

equality constraints in the above model. In the actual optimization process, the best N which makes M E or AIC minimum is numerically searched by the following procedure. First, a positive integer N is assumed. Then the maximum likelihood analysis is carried out to obtain the best estimates of a , . The value of M E or AIC is calculated. Next, changing N gradually, and the corresponding a,j and MEs or AICs are calculated. By the comparison of these ME or AIC values, the best N is found.

10.4 Numerical Examples Example 10.1 Triangular area problem Figure 10.6 (a) shows the fuzzy data obtained from the experiment described in section 10.2. Using these data, we may perform the ME analysis and the likelihood analysis using the method presented before. The B-spline functions of order 3 are used. For the optimization of the likelihood function, the Flexible Tolerance Method (FTM) as outlined in the appendix to this chapter is used. In many nonlinear programming methods a considerable portion of the computational time is spent on satisfying rather rigorous constraint requirements. However, the FTM does not satisfy the constraints first. But, the constraints are gradually satisfied as the search process proceeds toward the true solution. Tables 10.1 gives the results of the analysis. From the table, the minimum M E value is obtained if eleven B-splines are used, and minimum AIC value is

10. 4 Numerical Examples

249

obtained if eight B-splines are used. The estimated membership functions based on minimum ME and minimum AIC are plotted in Figures 10.6 (b) and (c). The results obtained from ME analysis and AIC analysis are quite consistent although the numbers of B-splines in the two analyses are different.

Table 10.1 Results for three fuzzy sets

N

-L

ME

AIC

6 7 8 9 10 11 13 12 14 15

19.7 16.7 14.5 15.5 14.3 13.6 13.1 14.2 13.1 13.2

5.54 4.11 2.95 3.80 3.19 2.52* 3.00 3.27 2.65 3.18

33.65 32.69 32.50* 35.57 36.26 37.63 41.13 40.22 43.09 45.20

L

(a) fuzzy data

M

S

1

,,N

tmO0

m

O0N

9

20

an

30

9

40

50

|

60

9

70

80

Area .

=

(c)A]C

(b) ME

I--4

..Q

0.5

E 0.5

0

0 20

30

40

50 Area

60

70

80

20

30

40

50 60 Area

70

80

Figure 10.6 Estimation of the membership functions for the problem of triangular areas

Example 10.2 Analysis of the fuzzy data offive classifications In shipbuilding industry, welding is a very important process, taking more than 40% workloads. Welding quality is directly related to the life of a ship.

10. Estimation of the membershipfunction

250

Thus, welding quality must be controlled within allowable errors. One factor controlling welding quality is called misalignment 8 as schematically shown in Figure 10.7. The figure shows that two vertical plates are welded to a horizontal plate. The two vertical plates are required to be in one line. This is difficult because a welding worker cannot see the upper plate as he is welding the lower plate to the horizontal plate, and he cannot see the lower plate as he is welding upper plate to the horizontal plate. So after the welding work, a surveyor must do quality examination. The examination results are classified into five classes: "Very Good" (VG), "Good" (G), "Medium" (M), "Bad" (B) and "Very Bad" (VB). Because of several factors involved, the quality assessment is not a simple correlation, but a complicated link as shown in Figure 10.7 (a), in which is given one hundred sample points to show the correlation between misalignment d and quality assessments. VG

(a)fuzzydata

G '

~

!,ql-- 8

8: misalignment t: thickness

9 9 u~m 00

M

9 9

Oll~mN

9 t

DIID OIIIDO

O mOtlO0

D

b" 0

1

b) ME

0.5

1

1.5

0.5

1

1.5

1

0.5

0.5

0

0 0.5

1

1.5

0

Figure 10.7 Five classifications of misalignment Using fuzzy data in Figure 10.7 (a), the membership functions are estimated. Order 3 B-spline functions have been in the analysis. Table 10.2 gives the results of the analysis.

I O.4. Numerical examples

251

Table 10.2 Results for five fuzzy sets N -L Me AIC N -L Me AIC 4 82.4 1.73 98.4 10 56.3 1.49 98.3 5 71.7 1.68 93.7 11 55.5 1.54 101.5 6 67.0 1.48 91.5 12 55.0 1.64 105.0 7 62.0 1.57 91.9 13 55.1 1.71 109.1 8 57.0 1.45 92.0" 14 51.7 1.73 109.7 9 56.1 1.49 94.0 15 53.4 1.78 115.4 The minimum ME value is obtained at N=8 and the minimum AIC is given as N=6. Figures 10.7 (b) and (c) compare the membership functions for the two cases. Example 10.3 Sample Size Influences In this example, the sample size influences of fuzzy data on the forms of membership functions are discussed. Suppose membership functions of the forms shown in Figure 10.8 (a) are given, representing three fuzzy sets A~, d2 and A3 . Then 100, 200 and 300 pairs of uniform random numbers (u,, v,)are generated in the range 0 _ ~/, (u,), then u, e ,~; If vr <_ll,(u,,) < ~t,(ur), then u, e A,. If v~ >/,t, (u~,), then u s is rejected;

1

1 ~..~(b) :o ,o, ~--r---,x s ,,: kT(.r\~.

0

0.5 !

0

2

4

6

8

10

0

, ~. u

2

V.

~., \~

~,.

" 9 ~ ' , ~ " Y , ' s9 clue9 e~

4

6

8

Figure 10.8 Given membership functions (a) and random numbers (b)

10

10. Estimation of the membershipfunction

252

n~ assumes three values, 100, 200 and 300. The three corresponding fuzzy samples are given in the left figures in Figure 10.9. Using the method presented in this Chapter, the membership functions corresponding to the three generated samples can be estimated. The results for ME values are given in Table 10.3 and plotted in Figure 10.9. In the Table, only ME results are given and AIC values are neglected.

Table 10.3 Sample size influences (a):Sample size:100

(b) Sample size:200

-L 39.5 34.5 35.1 33.8 32.0 32.0 32.0 32.0 31.5 31.4 31.6

-L 73.4 69.0 69.2 68.7 68.1 68.3 67.5 67.6 66.9 66.9 67.5

N 5 6 7 8 9 10 11 12 13 14 15

ME 5.70 4.07 4.50 4.14 4.24 3.76 3.86 3.82 3.83 3.72 3.90

ME 5.63 3.76 4.03 3.84 3.62 3.72 3.67 3.63 3.65 3.64 3.63

(c):Sample size:300 -L ME 123.1 5.60 107.9 3.77 109.9 4.12 107.3 3.88 107.2 3.54 127.4 3.65 106.4 3.58 106.7 3.61 105.9 3.69 105.4 3.68 106.5 3.69

For n - 1 0 0 , minimizing ME yields N-14. The profiles of the membership functions as shown in Figure 10.9 (a) are complicated because too many Bsplines are used. If sample size is over-small, statistical fluctuations have significant influence on the shapes of the membership functions. For n, - 2 0 0 and n , - 3 0 0 , minimizing ME yields same results at N-9. The estimated membership functions are shown in Figures 9 (b) and 9(c), which look much better than Figure 10.9 (a). Figures 10.9 (b) and (c) do not exhibit significant differences, showing the convergence as sample size increases. Both are close to the given membership functions as shown in Figure 10.8.

253

10.5. Concluding remarks

(a) n, = 1O0 9

M

9 1 4 9

~

ooq~oIN

OlBO, 9149149

9

9

0.5

-

0

2

-

-

4

9

-

6

9

-

O

10

0

2

4

6

8

10

0

2

4

6

8

10

0

2

4

6

8

10

260

(b),,,

9

M

-

8

9

Ill;

Oe

. . . . . . . . . . . . . . . .

9149149149

qlDO 9

0.5

.

0

.

.

.

2

.

.

i

4

6

i

0

i

8

l0

(c) h, 300 M

9

9

__L_ZL~2~

-

Dim

0.5 u l n ~ o g n ~ o l N

9

0

2

-

o

D

i

4

i

i

6

i

9 ,

8

i

10

0

Figure 10.9 Given membership functions (right) and fuzzy data (left)

10.5 Concluding Remarks In this chapter, a probabilistic model that can determine the forms of the membership functions based on experimental data is presented. In the method, the membership functions are approximated by a linear combination of B-spline functions. The best number of B-splines which compose the membership functions under consideration is determined by minimizing ME. And the best combination coefficients are determined probabilistically based on the likelihood analysis.

10. Estimation of the membershipfunction

254

The features of the membership functions presented in this paper can be characterized as follows: (1)

The membership functions can be automatically determined from the fuzzy data by the proposed method. No prior knowledge of the form of the membership functions is necessary in the estimation. (2) The method works well irrelevant to the number of classifications. Also, numerical calculations are easily performed because the optimum solution is unique. (3) If the sample size of fuzzy data becomes larger, the estimated membership function becomes more accurate.

Appendix

255

Appendix: Proof of uniqueness of the optimum solution Consider the following nonlinear programming problem.

f (X,,X2,''',Xn) --~ max

(10.A.1)

subject to the following constraints, g,(xi,x2,...,x,) >_0, i = 1,2,...,m,

(lO.a.2)

xj > 0, j = 1,2,.--,n.

(10.a.3)

The Lagrangian is m

L = f(x, ,x2," ",x,,) + ~ u,g, (x,,xz," ",x,,).

(10.A.4)

t=i

If x ~ is a local optimum solution in the above problem, then the sufficient condition for x ~ to be the global maximum point is:

L(x,u~ <_L(x~ u~ ~ 8L(x~176(x,-x, 0 ). a=l

(10.A.5)

~X I

where u ~ satisfies

(aL)

,

OL >O for allu,>O; u, ~u, =0' i = l , . . , m. Ou,

(10.A.6)

The Lagrangian for our problem is M

N

M

Z Z I=! x~ eA.s

I=1

t=!

2•

E .~=1

in which g, 's are constraints given by the following equations.

(10.A.7)

256

10. Estimation o f the membership function

g, (ail,...,aqu) = a'l/-al.j+ I > 0;s = 1 , . . . , N - 1 g, (OtMi ,''''aMN

) = ~[M,l+l

--O[U,I

~-- 0

(10.A.8)

s=N-I+j,j=I,2,...,N-1 ( ~ "- ( ~ I

I , " " " , O(IN , ~ Z I , " " " , ~ Z N

, " " " , ~Mi

, OIM Z , " " " , ~ M N

)

According to Taylor expansion of multivariate function we have M N 0L(o~o) t(~) -- t(~ o) + Z.__.Z,~. o~., (~,, _~o)

|M

2

N

M

t=l /=1

N

O~L(OotO+(l_O)~)

o

o

(a,, - a,, )(a,, - ~, )

(10.A.9)

= I=1

U N,~., OL

1 U N U ~

= L(~~

+ ZEZZ

82L Aot,jAotkt 8a,s Oakl

where 0 < 0 < 1 Because, Bj

2•

(10.A.10)

aL(~)_ Z Z +~, + Z ..h. in which i f (s - j, i = 1)or(s = N - 1 + j, i = M ) i f (s = j + 1, i = 1)or(s - N + j, i= M ) .

h, = 0~,, -

otherwise

So

e~t(a) =_ ~ OCr,lOOtkt 82L(a)

~ = 0

Oa, Oa~

~,~

B,& It 2,

fork = i (10.A.I 1) fork ~ i

Therefore, the third term on right hand side of equation (10.A.9) becomes

Appendix IMNM

N

257

02L

~Aa,jAau 2

t=l

/=1

1M =

= N

i=1 N

~2/-

Aa',1Aa,i I=! I=1

_

1 M

u u B1BIAc%A~.

-

2 Z Z Z Z I=i xt~ 4 I=!

= - ~ ,:, _ 1

2

I=!

(10.A.12)

1"/i

~ ( : 8,A~,,)(Z,:,~,A~,,) Z 7(

8,A~,, <0

2 ,--t x,~4

From equations (10.A.9) and (10.A.12), the following relationship is obtained. M

N

t(oO < L(cr ~ + ~ ~ aL zXa,,. ,-I i-1 c3a~,j

(10.A.13)

The above equation indicates that if a ~ is a local optimum solution it must be the global solution.