Study on segmented distribution for reliability evaluation

Study on segmented distribution for reliability evaluation

CJA 741 27 December 2016 Chinese Journal of Aeronautics, (2016), xxx(xx): xxx–xxx No. of Pages 21 1 Chinese Society of Aeronautics and Astronautics...

4MB Sizes 0 Downloads 57 Views

CJA 741 27 December 2016 Chinese Journal of Aeronautics, (2016), xxx(xx): xxx–xxx

No. of Pages 21

1

Chinese Society of Aeronautics and Astronautics & Beihang University

Chinese Journal of Aeronautics [email protected] www.sciencedirect.com

4

Study on segmented distribution for reliability evaluation

5

Li Huaiyuan a,b, Zuo Hongfu b,*, Su Yan b, Xu Juan b, Yin Yibing b

3

6 7

8

a b

School of Software Engineering, Jinling Institute of Technology, Nanjing 211169, China College of Civil Aviation, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

Received 17 December 2015; revised 31 January 2016; accepted 18 October 2016

9

11 12

KEYWORDS

13

Bayesian estimates; Gibbs sampling method; Maximum likelihood estimation; Proximity sensors in leadingedge flap in aircraft; Reliability; Segmented distribution; Weibull distribution

14 15 16 17 18 19 20 21

22

Abstract In practice, the failure rate of most equipment exhibits different tendencies at different stages and even its failure rate curve behaves a multimodal trace during its life cycle. As a result, traditionally evaluating the reliability of equipment with a single model may lead to severer errors. However, if lifetime is divided into several different intervals according to the characteristics of its failure rate, piecewise fitting can more accurately approximate the failure rate of equipment. Therefore, in this paper, failure rate is regarded as a piecewise function, and two kinds of segmented distribution are put forward to evaluate reliability. In order to estimate parameters in the segmented reliability function, Bayesian estimation and maximum likelihood estimation (MLE) of the segmented distribution are discussed in this paper. Since traditional information criterion is not suitable for the segmented distribution, an improved information criterion is proposed to test and evaluate the segmented reliability model in this paper. After a great deal of testing and verification, the segmented reliability model and its estimation methods presented in this paper are proven more efficient and accurate than the traditional non-segmented single model, especially when the change of the failure rate is time-phased or multimodal. The significant performance of the segmented reliability model in evaluating reliability of proximity sensors of leading-edge flap in civil aircraft indicates that the segmented distribution and its estimation method in this paper could be useful and accurate. Ó 2016 Production and hosting by Elsevier Ltd. on behalf of Chinese Society of Aeronautics and Astronautics. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).

* Corresponding author. E-mail addresses: [email protected] (H. Li), [email protected], [email protected] (H. Zuo), [email protected] (J. Xu). Peer review under responsibility of Editorial Committee of CJA.

Production and hosting by Elsevier

1. Introduction

23

As one of the essential devices affecting aircraft safety, the proximity sensors in leading-edge flap is an important part of aircraft operating system. However, proximity sensors in leading-edge flap are also one of the devices with higher failure rate in aircraft.1 In order to ensure aircraft safety and reduce its maintenance costs, it’s necessary to accurately evaluate the reliability of proximity sensors in leading-edge flap.2

24

http://dx.doi.org/10.1016/j.cja.2016.12.008 1000-9361 Ó 2016 Production and hosting by Elsevier Ltd. on behalf of Chinese Society of Aeronautics and Astronautics. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

25 26 27 28 29 30

CJA 741 27 December 2016

No. of Pages 21

2 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92

However, like most other equipment, the failure rate of the proximity sensors in leading-edge flap is complicated, exhibiting different changes in different operation phases.3 In order to accurately evaluate the reliability of devices such as proximity sensors in leading-edge flap, whose failure rate is multimodal or multi-stage, only segmented distribution is applicable and inevitable.4 In this paper, two segmented reliability models are proposed: (1) failure rate is a piecewise linear function about lifetime, namely, segmented linear failure rate mode and (2) failure rate in any stage is of the same formula as the failure rate function of Weibull distribution, called segmented Weibull distribution. Currently, the application of segmented distribution to reliability evaluation is mainly classified into three categories: describing the degradation process of devices; creating reliability model; and testing and detecting change-points.5 (1) Now, how to fit the equipment degradation process by segmented distribution is discussed in a lot of literature. For example, Bae et al. 6 proposed a hierarchical Bayesian change-point regression model to fit two-phase degradation patterns and derived the failure-time distribution of a unit randomly selected from its population. Kvam 7 introduced a log-linear model with random coefficients and a change-point to describe the nonlinear degradation path. (2) Since the failure rates of many products may perform different trends at different stages, an application of segmented distribution to evaluating product reliability has been widely concerned. Li et al. 8 estimated the change-point for a piecewise hazard regression model in the presence of right censoring and long-term survivors. In a study by He et al. 9, a sequential testing approach to detect multiple change-points in the hazard function by likelihood ratio statistics and resampling was proposed, which was applicable to both right-censored and interval-censored data. Uhm et al. 10 studied a weighted least squares estimator for Aalen’s additive risk model with right-censored survival data which may allow for very flexible handling of covariates. (3) In addition, another important application of segmented distribution is to test whether there are change-points in product failure rate or reliability. For a segmented regression system with an unknown change-point over two domains of a predictor, a new empirical likelihood ratio statistic was proposed to test the null hypothesis of no change by Liu and Qian 11. Goodman et al. 12 expanded the set of alternatives to allow for multiple change-points, and proposed a model selection algorithm using sequential testing for the piecewise constant hazard model. Suresh 13 derived a test statistic and its asymptotic distribution, and compared the power of the test with other existing tests such as likelihood ratio, Weibull, and log Gamma tests. Nosek and Szkutnik 14 discussed a regression model with a possible structural change and a small number of measurements. Recently, segmented distribution and its estimation method have caught more attentions. To minimize the expected sum of manufacturing cost, bum-in cost, and warranty cost of failed items found during their warranty period, a cost model was formulated to find the optimal bum-in time based on Weibull hyperexponential distribution by Chou and Tang 15. Considering different prior densities for parameters and censored survival data, Achcar and Loibel 16 discussed Bayesian analysis on segmented constant hazard function models and put forward their inference methods by Metropolis algorithms. Weibull-exponential distribution, a common segmented

H. Li et al. distribution, was discussed and its accurate calculation formula by Bayes estimation was proposed by Boukai 17. Patra and Dey 18 proposed a general class of change-point hazard models for survival data, which included and extended many different types of segmented distribution. However, the existent segmented distribution is generally considered as a two-segment distribution, or their failure rate is supposed as a constant in recent literature 6–18. Compared with many other current segmented distributions, the contributions of this paper are as follows. (1) In this paper, not only two kinds of universal n-segment distribution have been discussed, but also two general methods to estimate parameters in the segmented distributions, namely Bayesian estimation and maximum likelihood estimation (MLE), are given. (2) An information criterion adequate for the n-segment distribution is proposed. It can not only test whether a change-point exists or not, but also measure the appropriateness and correctness of the segmented reliability model. (3) A new bathtub curve model is put forward to verify the effects of the segmented distributions.

93

2. Segmented distribution

113

In fact, the failure rate of equipment not only is changeable but also often manifests different trends at different phases. Reliability-centered maintenance (RCM) recommended six failure models as shown in Fig. 1.19 As Fig. 1 shows, apart from failure models C and E, the failure rates of the other failure models show different patterns at different phases, and change-points obviously exist in failure rate curves. Therefore, if one and same distribution is utilized to evaluate a failure whose failure rate is characterized by multiple change-points and multi-peak, evaluation may be rather difficult and inaccurate.20 Due to unevenness and inconsistency of failure rate, it is not appropriate to describe the change of reliability by one and same analytical function in the entire lifetime cycle. However, if we divide lifetime into some time intervals according to failure rate trend, the failure rate in every different interval can be described by a different function. Just as piecewise interpolation with a simple function can improve approximation precision, we can more accurately approximate the true change of failure rate. For example, as for failure model A, i.e., bathtub curve, if three different failure rate functions are in correspondence to each phase, then the change of failure rate in the

114

Fig. 1

Six kinds of failure model.

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112

115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution 136 137 138 139 140 141 142 143

145 146 147 148 149 150 151 152 153 154 155 156 157 158 160

entire lifetime cycle of equipment can be accurately fitted by segmented distribution. Supposing that lifetime is divided into n intervals and the failure rate function in the ith interval ½si ; siþ1 Þ is expressed by hi ðtÞ, then the segmented failure rate function hðtÞ can be expressed by Eq. (1) below, where hi ðtÞ > 0, t 2 ½si ; siþ1 Þ is requisite. 8 0 t < s1 > > > > > > > > h1 ðtÞ s1 6 t < s2 > > > > > > > > . > > < .. hðtÞ ¼ ð1Þ > > > hi ðtÞ si 6 t < siþ1 > > > > > > > .. > > > . > > > > > : hn ðtÞ sn 6 t < 1 From Eq. (1), devices have no failure probability before s1 , but may fail after s1 . Usually, s1 is a time point when devices are likely to begin degeneration, such as the first use of devices, namely, s1 ¼ 0.17,18 However, s1 –0 is possible in rare particular cases. If hi1 ðsi Þ–hi ðsi Þ, then Eq. (1) is discontinuous at si , or Eq. (1) is continuous. In most cases, si may be a changepoint of failure rate. For convenience, notation Ji and Ki(t) are introduced, expressed by Eqs. (2) and (3). According to Rt Rt fðtÞ ¼ hðtÞ expð s1 hðuÞduÞ and RðtÞ ¼ expð s1 hðuÞduÞ, we can get the probability density function fi ðtÞ and the reliability function Ri ðtÞ in the ith interval as Eqs. (4) and (5), respectively. Z siþ1 Ji ¼ hi ðuÞdu ð2Þ si

161

163

Z Ki ðtÞ ¼

si

t

hi ðuÞdu;

t 2 ½si ; siþ1 Þ

164 i1 X fi ðtÞ ¼ hi ðtÞ exp  Jj  Ki ðtÞ

166

i1 X Ri ðtÞ ¼ exp  Jj  Ki ðtÞ

170 171 172 173 174

176

ð3Þ

si 6 t < siþ1

ð7Þ

.. . sn 6 t < 1 179

2.1. Segmented linear failure rate model

180

2.1.1. n-segment linear failure rate model

181

As calculation of piecewise linear interpolation is easy and can meet most engineering requirements, we divide lifetime into n intervals and set the failure rate function hi ðtÞ in the ith interval ½si ; siþ1 Þ as Eq. (8), where li 6 si is required. Usually, li ¼ 0 or li ¼ si . According to Eqs. (2) and (3), we get Ji and Ki ðtÞ shown as Eqs. (9) and (10), respectively.

182

hi ðtÞ ¼ ai þ bi ðt  li Þ

li 6 si ;

t 2 ½si ; siþ1 Þ

ð8Þ

183 184 185 186 187 188 190 191

1 1 Ji ¼ ai siþ1  bi li siþ1 þ bi s2iþ1  ai si þ bi li si  bi s2i 2 2

ð9Þ

193 194

1 1 Ki ðtÞ ¼ ai t  bi li t þ bi li si þ bi t2  ai si  bi s2i 2 2

t 2 ½si ; siþ1 Þ ð10Þ

Furthermore, according to Eqs. (6)–(10), we can obtain the probability density function fi ðtÞ and the reliability function Ri ðtÞ in the ith interval, respectively, shown as Eqs. (11) and (12). fi ðtÞ ¼ hi ðtÞe



i1 P

196 197 198 199 200 201

ðaj sjþ1 bj lj sjþ1 þ12bj s2jþ1 aj sj þbj lj sj 12bj s2j Þðai tbi li tþ12bi t2 ai si þbi li si 12bi s2i Þ

j¼1

t 2 ½si ;siþ1 Þ

ð11Þ

203 204

t 2 ½si ; siþ1 Þ

ð4Þ

! t 2 ½si ; siþ1 Þ

ð5Þ

j¼1

Furthermore, fi ðtÞ and Ri ðtÞ can piecewise compose the segmented probability density function fðtÞ and the reliability function RðtÞ in the entire lifetime cycle, shown as Eqs. (6) and (7), respectively. 8 h1 ðtÞ expðK1 ðtÞÞ s1 6 t < s2 > > > > > h2 ðtÞ expðJ1  K2 ðtÞÞ s2 6 t < s3 > > > > .. .. > > > . > > ! . > i1 > X < si 6 t < siþ1 fðtÞ ¼ hi ðtÞ exp  Jj  Ki ðtÞ ð6Þ > j¼1 > > > > > .. .. > > . > > ! . > > n1 X > > > > sn 6 t < 1 : hn ðtÞ exp  Jj  Kn ðtÞ j¼1

177

s1 6 t < s2 s2 6 t < s3 .. .

j¼1

!

j¼1

167

169

3 8 expðK1 ðtÞÞ > > > > > expðJ1  K2 ðtÞÞ > > > > .. > > > . > > ! > > i1 X < RðtÞ ¼ exp  Jj  Ki ðtÞ > j¼1 > > > > > . > .. > > > ! > > n1 X > > > > : exp  Jj  Kn ðtÞ



i1 X

ðaj sjþ1 bj lj sjþ1 þ12bj s2jþ1 aj sj þbj lj sj 12bj s2j Þðai tbi li tþ12bi t2 ai si þbi li si 12bi s2i Þ

Ri ðtÞ ¼ e j¼1 t 2 ½si ;siþ1 Þ

ð12Þ

If bi > 0, then the failure rate in the ith interval will linearly increase; if bi ¼ 0, the failure rate in the ith interval will be constantly kept as ai ; if bi < 0, then the failure rate in the ith interval will linearly decrease. If t 2 ½tn ; 1Þ, then bn > 0 is required in order to ensure hn ðtÞ > 0; t 2 ½tn ; 1Þ, but if hðtÞ is defined in finite support on some special occasions, bn 6 0 may be acceptable.

206 207 208 209 210 211 212 213

2.1.2. Linear failure rate model

214

Particularly, if lifetime isn’t segmented, namely, nonsegmented, then the failure rate will linearly change in the entire lifetime cycle. hðtÞ, fðtÞ and RðtÞ of the linear failure rate mode are expressed as Eqs. (13)–(15), respectively. Fig. 2 shows some typical function curves of this distribution, from which we can see that l1 and s1 can adjust the curve shape and its starting point. Specially, when b1 ¼ 0 and l1 ¼ s1 , the distribution will become an exponential distribution.

215

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

216 217 218 219 220 221 222 223 224

CJA 741 27 December 2016

No. of Pages 21

4

H. Li et al.

Fig. 2

226

Some curves of linear failure rate model.

hðtÞ ¼ a1 þ b1 ðt  l1 Þ s1 6 t < 1

ð13Þ

227

fðtÞ ¼ ða1 þ b1 ðt  l1 ÞÞeða1 tb1 l1 tþ2b1 t 1

2 a s þb l s 1b s2 Þ 1 1 1 1 1 2 1 1

232

233 234 235 236 237 238 239 240 241 242 243 244 245 246

248

eða1 tb1 l1 tþ2b1 t 1

2 a

1 2 1 s1 þb1 l1 s1 2b1 s1 Þ

s1 6 t < s2

eða1 s2 b1 l1 s2 þ2b1 s2 a1 s1 þb1 l1 s1 2b1 s1 Þða2 tb2 l2 tþ2b2 t 1

2

1

2

1

2 a

1 2 2 s2 þb2 l2 s2 2b2 s2 Þ

s1 6 t < 1

s2 6 t < 1 ð17Þ

252

ð14Þ

229 230

( RðtÞ ¼

RðtÞ ¼ eða1 tb1 l1 tþ2b1 t 1

2 a

1 2 1 s1 þb1 l1 s1 2b1 s1

Þ

s1 6 t < 1

ð15Þ

2.1.3. Two-segment linear failure rate model In general, n ¼ 2 or n ¼ 3, namely two- or three-segment distribution can meet most of reliability evaluation. For twosegment distribution, hðtÞ and RðtÞ will be Eqs. (16) and (17), respectively. Supposing that l1 ¼ s1 ¼ 0, l2 ¼ s2 ¼ 1:5, some curves of the two-segment linear failure rate model are shown in Fig. 3. From Fig. 3, it can be seen that the segmented distribution may degenerate into a non-segmented mode when the failure rate changes consistently and continuously in the entire lifetime cycle. When the failure rate changes in an obviously phased manner, this segmented distribution can also precisely reflect the different changes of the failure rate in its every interval.  a1 þ b1 ðt  l1 Þ s1 6 t < s2 hðtÞ ¼ ð16Þ a2 þ b2 ðt  l2 Þ s2 6 t < 1

2.1.4. Three-segment linear failure rate model

253

When n=3, hðtÞ and R(t) of the three-segment linear failure rate model are respectively shown as Eqs. (18) and (19). When l1 ¼ s1 ¼ 0, l2 ¼ s2 and l3 ¼ s3 ¼ 2:5, some function curves of the three-segment linear failure rate model are shown in Fig. 4. From Fig. 4, it’s clear that the adaptability of segmented distribution is extensive, as it can express both the case where the failure rate changes sharply and the case where the failure rate has no obvious change when ai  aiþ1 ; bi  biþ1 . From Figs. 3 and 4, it’s apparent that fðtÞ seems more fluctuating when the trend of the failure rate is obviously in a staged or piecewise manner.

254

8 > < a1 þ b1 ðt  l1 Þ s1 6 t < s2 hðtÞ ¼ a2 þ b2 ðt  l2 Þ s2 6 t < s3 > : a3 þ b3 ðt  l3 Þ s3 6 t < 1

265

255 256 257 258 259 260 261 262 263 264

ð18Þ 267

249 250

268 269

8 ða1 tb1 l1 tþ12b1 t2 a1 s1 þb1 l1 s1 12b1 s21 Þ > : ða1 s2 b1 l1 s2 þ1b1 s2 a1 s1 þb1 l1 s1 1b1 s2 Þða2 s3 b2 l2 s3 þ1b2 s2 a2 s2 þb2 l2 s2 1b2 s2 Þða3 tb3 l3 tþ1b3 t2 a3 s3 þb3 l3 s3 1b3 s2 Þ 2 2 2 2 2 2 2 1 3 2 3 e

s1 6 t < s2 s2 6 t < s3 s3 6 t < 1

ð19Þ

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution

270

271 272 273 274 275 276 277 278 279 280 281

5

Fig. 3

Some curves of two-segment linear failure rate model.

Fig. 4

Some curves of three-segment linear failure rate model.

2.2. Segmented Weibull distribution Although the n-segment linear failure rate function hðtÞ can express the change of failure rate in a staged manner, it can only express a linear relation. Meanwhile, linear fitting is not precise enough and its adaptability is not good enough. Nevertheless, by adjusting shape parameters, Weibull distribution can not only express a distribution with an increasing failure rate, but also model a distribution whose failure rate is decreasing.21,22 Therefore, the failure rate in each interval is fitted by Weibull failure rate functions with different parameters, and better precision and adaptability can be achieved.23,24

2.2.1. n-segment Weibull distribution

282

Let the failure rate function hi ðtÞ in the ith interval ½si ; siþ1 Þ be Eq. (20). From Eqs. (2) and (3), we obtain Ji and Ki ðtÞ, respectively shown as Eqs. (21) and (22). hi ðtÞ ¼ ai bi ðai ðt  li ÞÞbi 1

t 2 ½si ; siþ1 Þ

ð20Þ

283 284 285 286 288 289

Ji ¼ ðai ðsiþ1  li ÞÞbi  ðai ðsi  li ÞÞbi

ð21Þ

291 292

Ki ðtÞ ¼ ðai ðt  li ÞÞbi  ðai ðsi  li ÞÞbi

t 2 ½si ; siþ1 Þ

ð22Þ

Here li 6 si ; ai > 0; bi > 0. Furthermore, according to Eqs. (6), (7) and (20)–(22), the probability density function fi ðtÞ

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

294 295 296

CJA 741 27 December 2016

No. of Pages 21

6 297 298 299

H. Li et al.

and the reliability function Ri ðtÞ in the ith interval are respectively obtained as Eqs. (23) and (24). 

301

fi ðtÞ ¼ ai bi ðai ðt  li ÞÞ t 2 ½si ;siþ1 Þ

bi 1

e

i1 X

ððaj ðsjþ1 lj ÞÞbj ðaj ðsj lj ÞÞbj Þððai ðtli ÞÞbi ðai ðsi li ÞÞbi Þ

j¼1

ð23Þ

Let l1 ¼ s1 ¼ 0, some curves of the two-segment Weibull distribution are drawn in Fig. 5. ( a1 b1 ða1 ðt  l1 ÞÞb1 1 s1 6 t < s2 ð25Þ hðtÞ ¼ a2 b2 ða2 ðt  l2 ÞÞb2 1 s2 6 t < 1

302

318 319 320

322 323 324

( fðtÞ ¼

b

b

a1 b1 ða1 ðt  l1 ÞÞb1 1 eðða1 ðtl1 ÞÞ 1 ða1 ðs1 l1 ÞÞ 1 Þ

s1 6 t < s2

b2 1 ðða1 ðs2 l1 ÞÞb1 ða1 ðs1 l1 ÞÞb1 þða2 ðtl2 ÞÞb2 ða2 ðs2 l2 ÞÞb2 Þ

a2 b2 ða2 ðt  l2 ÞÞ

e

i1 X

ððaj ðsjþ1 lj ÞÞbj ðaj ðsj lj ÞÞbj Þððai ðtli ÞÞbi ðai ðsi li ÞÞbi Þ



Ri ðtÞ ¼ e t 2 ½si ; siþ1 Þ

RðtÞ ¼

j¼1

304

ð24Þ

313

If bi > 1, then hi ðtÞ increases in the interval ½si ; siþ1 Þ; if bi ¼ 2, then hi ðtÞ becomes a linearly increasing function; if bi ¼ 1, then hi ðtÞ is fixed at ai in the interval ½si ; siþ1 Þ; if bi < 1, then hi ðtÞ decreases in the interval ½si ; siþ1 Þ. In this way, the segmented Weibull distribution can flexibly approximate the changeful failure rate. If lifetime is non-segmented, it will become a classical Weibull distribution. Usually, the segment number n 6 3 can satisfy the requirement of reliability evaluation in most cases.

314

2.2.2. Two-segment Weibull distribution

315

In particular, if lifetime is divided into two intervals, according to Eqs. (23) and (24), hðtÞ; fðtÞ and R(t) of the two-segment Weibull distribution are respectively shown as Eqs. (25)–(27).

305 306 307 308 309 310 311 312

316 317

Fig. 5

ð26Þ

s2 6 t < 1

325

8 ððk ðtl ÞÞb1 ðk ðs l ÞÞb1 Þ 1 1 1 1 1 > :

b

b

s 1 6 t < s2 ð27Þ b

b

eððk1 ðs2 l1 ÞÞ 1 ðk1 ðs1 l1 ÞÞ 1 þðk2 ðtl2 ÞÞ 2 ðk2 ðs2 l2 ÞÞ 2 Þ s2 6 t < 1

327

2.2.3. Three-segment Weibull distribution

328

Similarly, when n = 3, then hðtÞ and RðtÞ of the three-segment Weibull distribution can be respectively written as Eqs. (28) and (29). When l1 ¼ s1 ¼ 0, some three-segment curves of the Weibull distribution are shown in Fig. 6.

329

8 > a1 b1 ða1 ðt  l1 ÞÞb1 1 > > > > > < hðtÞ ¼ a2 b2 ða2 ðt  l2 ÞÞb2 1 > > > > > > : a3 b3 ða3 ðt  l3 ÞÞb3 1

330 331 332 333 334

s1 6 t < s2 s2 6 t < s3

ð28Þ

s3 6 t < 1

Some curves of two-segment Weibull distribution.

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

336

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution

7

8 ðða ðtl ÞÞb1 ða1 ðs1 l1 ÞÞb1 Þ > : ðða1 ðs2 l1 ÞÞb1 ða1 ðs1 l1 ÞÞb1 þða2 ðs3 l2 ÞÞb2 ða2 ðs2 l2 ÞÞb2 þða3 ðtl3 ÞÞb3 ða3 ðs3 l3 ÞÞb3 Þ e

337 338 339 340 341 342 343 344 345 346 347 348 349 350

351

From Figs. 5 and 6, the segmented Weibull distribution can precisely approximate the sharp fluctuation of failure rate. As for multimodal distribution or varying failure rate, such as bathtub curve, the segmented Weibull distribution is very appropriate. However, if the parameters in the segmented distribution are not appropriate, fðtÞ of the segmented distribution at t ¼ si may incur a discontinuity point. For example, when li ¼ si , hðtÞ and fðtÞ of the segmented Weibull distribution at t ¼ si are discontinued, but this discontinuity may not affect the measurement of reliability, because RðtÞ is continuous. As a whole, the segmented distribution in this paper can be fit to all kinds of failure mode recommended by RCM. 3. Estimation of segmented distribution

s1 6 t < s2 s2 6 t < s3 s3 6 t < 1

3.1. Bayesian regression estimation of segmented distribution

361

Usually, a piecewise-linear regression model is defined as Eq. (30). Namely, the response of any input xj 2 ½si ; siþ1 Þ contains two terms: the linear function ai þ bi xj and the stochastic disturbance ei that obeys normal distribution Nð0; r2i Þ. The aim of piecewise-linear regression is to infer parameters ai and bi under the circumstance that m input samples xj and their correspondingly output samples yj are known. Denoted by a = (ai)n1, b = (bi)n1, r2 = (r2i )n1, s = (si)n1, X = (xi)m1, Y = (yi)m1, under the condition that parameters a; b; r2 and s are given, then the maximum likelihood function of nsegment distribution is written as Eq. (31). In addition, the residual sum of squares Q in Eq. (32) can reflect precision of fitting.

362

y ¼ ai þ bi x þ ei 352 353 354 355 356 357 358 359 360

If the sample set S ¼ ft1 ; t2 ; . . . ; tk ; . . . ; tm g of segmented distribution is known, in order to estimate parameters in segmented distribution, we should, first of all, estimate change-points s1 ; s2 ; . . . ; sn and location parameters l1 ; l2 ; . . . ; ln .25 Usually, let li ¼ 0ði ¼ 1; 2; . . . ; nÞ or li ¼ si ði ¼ 1; 2; . . . ; nÞ. Of course, there are many other more accurate methods to estimate li and si . In this paper, Bayesian estimation and MLE are utilized to preliminarily estimate parameters in segmented distribution.26

Fig. 6

ð29Þ

ei 

Nð0; r2i Þ;

ð30Þ

fðX; Yja; b; r ; sÞ ¼ ð2pÞ

m2

n Y

364 365 366 367 368 369 370 371 372 373 374 375

x 2 ½si ; siþ1 Þ;

i ¼ 1; 2; . . . ; n 2

363

mi

 ðr2i Þ 2



e

1 2r2 i

P xj 2½si ;siþ1 Þ

377 378

ðyj ai bi xj Þ2

i¼1

ð31Þ Q¼

n X X

380 381

ðyj  ðai þ bi xj ÞÞ

2

ð32Þ

i¼1 xj 2½si ;siþ1 Þ

Some curves of three-segment Weibull distribution.

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

383

CJA 741 27 December 2016

No. of Pages 21

8

H. Li et al. n Y

384

3.1.1. Prior distribution of Bayesian regression

385

Usually, properly determining prior distribution is a key of Bayesian estimation. Although ai , bi , r2i , si in most Bayesian estimation are supposed to be independent27,28, in fact, there are close relations among ai , bi , r2i , si .

386 387 388

389 390 391 392 393 394

395 396 397 398

400 401 402

404 405 406 407 408

Theorem 1. In the ith interval ½si ; siþ1 Þ of the piecewise-linear regression model in Eq. (30), under the condition that r2i and s are given, parameters bi jr2i ; s obey normal distribution pffiffiffi 2 NðBi =A ði ¼ 1; 2; . . . ; nÞ, P where i =Ai Þ Þ Pi ; ð 2mi rP Ai ¼ xj 2½si ;siþ1 Þ xj 2½si ;siþ1 Þ ðxj1  xj2 Þ, Bi ¼ xj 2½si ;siþ1 Þ 1 2 1 P ðy  y Þ. j2 xj 2½si ;siþ1 Þ j1 2

Proof. From Eq. (30), if xj1 2 ½si ; siþ1 Þ; xj2 2 ½si ; siþ1 Þ, then their stochastic disturbances ej1 and ej2 will also obey normal distribution Nð0; r2i Þ, namely, (

ej1 ¼ yj1  ðai þ bi xj1 Þ  Nð0; r2i Þ ej2 ¼ yj2  ðai þ bi xj2 Þ  Nð0; r2i Þ

:

Thus, we get yj1  yj2  bi ðxj1  xj2 Þ pffiffiffi  Nð0; 1Þ: 2ri Furthermore, if any two samples are combined, then summation of all combinations will also obey normal distribution, i.e., X

yj1  yj2  bi ðxj1  xj2 Þ pffiffiffi  Nð0; m2i Þ: 2ri 2½si ;siþ1 Þ

xj1 2½si ;siþ1 Þ xj2

411

After deduction and simplification, it’s concluded that bi obeys normal distribution when r2i ; s are given, namely,

413 415

416 417 418 419 420

421 422 423 424 425 426 427

428 429 430 431 432 433

pffiffiffi 2 bi jr2i ; s  NðBi =Ai ; ð 2mi ri =Ai Þ Þ i ¼ 1; 2; . . . ; n

1 Ai pffiffiffiffiffiffiffiffi 2 e 2ri pða; b; r2 ; sÞ ¼ r 2p 2m i i i¼1

mi ðai Yi þbi Xi Þ þ12 2

434

A b B 2  i i mi

i

pðr2i jsÞpðsi Þ ð33Þ

436

Proof. From the topic supposes, as the parameters in different segments are independent of each other, we get

437

n Y pða; b; r2 ; sÞ ¼ pðsÞpða; b; r2 jsÞ ¼ pðsÞ pðai ; bi ; r2i jsÞ

438 439

i¼1

¼ pðsÞ

n Y

pðai jbi ; r2i ; sÞpðbi ; r2i jsÞ

i¼1

¼ pðsÞ

n Y

pðai jbi ; r2i ; sÞpðbi jr2i ; sÞpðr2i jsÞ

i¼1

¼

n Y

pðai jbi ; r2i ; sÞpðbi jr2i ; sÞpðr2i jsÞpðsi Þ:

i¼1

From Theorems 1 and 2, the prior distribution of the model in Eq. (30) is obtained as Eq. (33). Unlike other linear regression models, the above prior distribution in Theorem 3 can express relations among parameters ai ; bi ; r2i ; si . Usually, inverse Gamma distribution IGðui ; vi Þ or uniform distribution Uðui ; vi Þ or non-information distribution can be taken as a prior distribution of r2i js. If samples xj and yj have been correctly divided, namely s is known, the following Algorithm 1 can estimate hyper-parameters ui and vi of the ith segment. h

441 442 443 444 445 446 447 448 449 450 451 452 454

X

410

412



:

Theorem 2. In the ith interval ½si ; siþ1 Þ of the piecewise-linear regression model in Eq. (30), under the condition that bi , r2i , s are given, parameters ai jbi ; r2i ; sði ¼ 1; 2; . . . ; nÞ will obey normal P distribution NðYi  bi Xi ; r2i =mi Þ, where X ¼ m1i xj 2½si ;siþ1 Þ xj , P Y ¼ m1i xj 2½si ;siþ1 Þ yj . Proof. From Eq. (30), the stochastic disturbance ej of the jth sample xj and yj obeys normal distribution Nð0; r2i Þ, so it’s easy to know that ej ¼ yj  ðai þ bi xj Þ  Nð0; r2i Þ. Furthermore, we get Yi  ai  bi Xi  Nð0; r2i =mi Þ. bi ; r2i

Therefore, when and s are given, parameters ai will obviously obey normal distribution, denoted as ai jbi ; r2i ; s  NðYi  bi Xi ; r2i =mi Þi ¼ 1; 2; . . . ; n: h Theorem 3. Assuming that pðsi Þ is the prior distribution of si ; the distribution of r2i js is denoted as pðr2i jsÞ under the condition that s are given; and parameters ai , bi , r2i , si in the ith segment are independent from parameters aj , bj , r2j , sj in the jth segment, then the prior distribution of the piecewise-linear regression model in Eq. (30) can be written as

Algorithm 1

455

Step 1 Ascendingly sort all samples in the ith segment according to xj þ yj ðj ¼ 1; 2; . . . ; mi Þ, and obtain ðxð1Þ ; yð1Þ Þ; ðxð2Þ ; yð2Þ Þ; . . . ; ðxðjÞ ; yðjÞ Þ; . . . ; ðxðmi Þ ; yðmi Þ Þ ðj ¼ 1; 2; . . . ; mi Þ. Step 2 Determine empirical samples of r2i . For l ¼ 1 : l

456

 For m ¼ 1 : m y yðlÞ y yðlÞ Set alm ¼ yðlÞ  xðNmþ1Þ xðlÞ , blm ¼ xðNmþ1Þ . ðNmþ1Þ xðlÞ ðNmþ1Þ xðlÞ

Set elmj ¼ yj  ðalm þ blm xj Þ ðj ¼ 1; 2; . . . ; mi Þ. Calculate the variance of elmj denoted as rlm i , namely, Pmi lmj 1 PN lmj 2 lm 1 ri ¼ mi 1 j¼1 ðe  mi j¼1 e Þ . End End 2 Step 3 If rlm i can be taken as the samples of ri , then estimate hyper-parameters ui and vi by moment estimation.

457 458 459 460 461 462 463 464 465 466 467 468 469 470 471

 lines, where land m  are pre-set conAlgorithm 1 constructs lm stants. These lines are supposed as an approximation of optimal fitting, and then residuals caused by the samples relative to these lines can be regarded as an approximation of the stochastic disturbance ei . Therefore, rlm can be taken as an i approximation of r2i js, which can be utilized to estimate its hyper-parameters ui and vi of r2i js. The prior distribution of change-points si , denoted as pðsi Þ, is often considered as a discrete distribution defined in samples xi ði ¼ 1; 2; . . . ; mÞ.16 The greater the first- and second-order differences of ðxj ; yj Þ are, the higher the probability that change-points occur at xj will be, and thus it should be at xj

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

472 473 474 475 476 477 478 479 480 481 482 483

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution 484 485

9

that samples are divided. In this paper, the flow of determination of pðsi Þ is shown as the following Algorithm 2.

486 488 489

Algorithm 2

494

Step 1 Ascendingly sort all samples ðxj ; yj Þ according to xj ðj ¼ 1; 2; . . . ; mÞ and get ðxð1Þ ; yð1Þ Þ; ðxð2Þ ; yð2Þ Þ; . . . ; ðxðjÞ ; yðjÞ Þ; . . . ; ðxðmÞ ; yðmÞ Þ ðj ¼ 1; 2; . . . ; mÞ. y yðj1Þ Step 2 Set @ 0j ¼ xðjþ1Þ ðj ¼ 2; 3; . . . ; m  1Þ, @ 01 ¼ 0, @ 0m ¼ 0, ðjþ1Þ xðj1Þ

495

and then normalize @ 0j , namely set p0j ¼ Pm j

490 491 492 493

j@ 0 j

496 497 498

499

500 501 502

503 504

Step 3 Set

@ 00j

¼

@ 0jþ1 @ 0j1 xðjþ1Þ xðj1Þ

and then normalize

@ 00j ,

j@ 0 j i¼1 i

ðj ¼ 2; 3; . . . ; m  1Þ,

namely set

p00j

. @ 001

j@ 00j j

¼ Pm

j@ 00 j i¼1 i

¼ 0,

@ 00m

Eq. (35) merely includes hyper-parameters ui , vi ði ¼ 1; 2; . . . ; nÞ; however, the posteriori distribution of parameters is difficult to be explicitly expressed. Therefore, only the Gibbs sampling method can find out the numerical solution.

525 526 527

Theorem 4. In the joint distribution in Eq. (35), if parameters al , b, r2 , s and samples X; Y are given, then the full conditional distribution of parameters ai ; i–l will obey normal distribution, namely,

528



r2 ai jal ; b; r ; s; X; Y  N Yi  bi Xi ; i ; 2mi

532

2

¼ 0,

523 524

l–i:

529 530 531

534

.

Proof. if al ; b; r2 ; s; X; Y; l–i in Eq. (35) are fixed, then we obtain

Step 4 Calculate wj . When m is an even number, set ( 4ðj1Þ j ¼ 1; 2; . . . ; m2 wj ¼ mðm2Þ ; when m is an odd 4ðmjÞ m m mðm2Þ j ¼ 2 þ 1; 2 þ 2; . . . ; m 8 < 4ðj1Þ2 j ¼ 1; 2; . . . ; mþ1 2 . number, set wj ¼ ðm1Þ : 4ðmjÞ2 j ¼ mþ3 ; mþ5 ; . . . ; m 2 2 ðm1Þ  m   m  Step 5 Set lpi ¼ max 1; ði  2Þ n , rpi ¼ min m; i n , and get the prior distribution 8 < Prwj ðp0j þp00j Þ j ¼ lpi ; lpi þ 1; . . . ; rpi pi w ðp0 þp00i Þ pðsi ¼ xj Þ ¼ i¼lpi i i : 0 Others j ¼ 1; 2; . . . ; m; i ¼ 2; 3; . . . ; n.

0 

fðai jal ; b; r ; s; X; YÞ / e 2

1 2r2 i

1

X

@

ðyj ai bi xj Þ2 þmi ðai Yi þbi Xi Þ

2

537

:

Assuming that Ci ¼ m1i xj 2½si ;siþ1 Þ xj yj , Di ¼ m1i P Ei ¼ m1i xj 2½si ;siþ1 Þ y2j , then we get

536

A

xj 2½si ;siþ1 Þ

P

535

539

P

2 xj 2½si ;siþ1 Þ xj ,

540 541 542

mi X

ðyj  ðai þ bi xj ÞÞ2 ¼ mi a2i  2ai mi Yi þ 2ai bi mi Xi  2bi mi Ci

j¼1

þ b2i mi Di þ mi Ei :

505 506 507 508 509 510 511

513

In Algorithm 2, Step 4 ensures that the middle samples are more likely to be regarded as si , which is inclined to make all segmentation as equalized as possible. In addition, pðs1 Þ and pðsnþ1 Þ are deemed as constants, for example, its distribution may be often written as the following formula.   1 s1 ¼ 0 1 s1 ¼ minðxi Þ pðs1 Þ ¼ or pðs1 Þ ¼ : 0 Others 0 Others

Pmi 2 2 If j¼1 ðyj  ðai þ bi xj ÞÞ þ mi ðai  Yi þ bi Xi Þ is transformed into a perfect square expression about ai , then fðai jal ;b;r2 ;s;X;YÞ / e

 12 2r i



2mi ðai ðYi bi Xi ÞÞ

2

515

If

r2i js

obeys inverse Gamma distribution IGðui ; vi Þ, namely, v

516 517 518 519 520

ðu þ1Þ 

ui

vi 2

pðr2i jsÞ ¼ Cðui i Þ ðr2i Þ i e ri , from Eqs. (31) and (33), it can be seen that the prior distribution and joint distribution of the piecewise-linear regression model are written as respectively Eqs. (34) and (35). n 1 Y Ai vui i pðsi Þ ðui þ2Þ 2ri pffiffiffiffiffiffiffiffi pða; b; r ; sÞ ¼ e ðr2i Þ 2p 2m Cðu Þ i i i¼1 2



mi ðai Yi þbi Xi Þ þ12 2

A b B 2 i i mi

i



þ2vi

ð34Þ

522

Þ2mi ðYi bi XÞ

2



is obtained. Furthermore, we get ð

ai  Yi bi Xi ri

pffiffiffiffi

fðai jal ; b; r2 ; s; X; YÞ / e

3.1.2. Bayesian piecewise-linear regression estimation

ð

þmi 2bi Xi Yi 2bi Ci þb2i X2i þb2i Di þEi þY2i

545 546 547 549

12

514

544

Þ

2mi

550 551

!2 :

553

Obviously, it is a kernel of normal distribution. Thus we get

r2i 2 : ai jal ; b; r ; s; X; Y  N Yi  bi Xi ; 2mi

554 555



557

Theorem 5. In the joint distribution in Eq. (35), if parameters a, bl , r2 , s and samples X, Y are given, then the full conditional distribution of parameters bi , i–l will obey normal distribution, namely,

558 559 560 561 562

0 n Y



m Ai vui i pðsi Þ ð i þui þ2Þ ðr2i Þ 2 e fðX; Y; a; b; r ; sÞ ¼ m pffiffiffiffiffiffiffiffi 2 2mi Cðui Þ i¼1 ð2pÞ

2

1 2r2 i

@

X xj 2½si ;siþ1 Þ

ðyj ai bi xj Þ2 þmi ðai Yi þbi Xi Þ þ12 2

A b B 2 i i mi

i

1 þ2vi A

ð35Þ

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

CJA 741 27 December 2016

No. of Pages 21

10

564

H. Li et al.

bi ja; bl ; r2 ; s; X; Y  N 3

2mi Ci þ 2m3i Xi Yi þ Ai Bi  4m3i ai Xi 2m2i r2i ; ; 2m3i Di þ 2m3i X2i þ A2i 2m3i Di þ 2m3i X2i þ A2i

bi ja; bl ; r2 ; s; X; Y  N

565

where

566

Ei ¼

1 mi

P

Di ¼ m1i

xj 2½si ;siþ1 Þ xj yj ,

586 587 588

3

2mi Ci þ 2m3i Xi Yi þ Ai Bi  4m3i ai Xi 2m2i r2i : ; 2m3i Di þ 2m3i X2i þ A2i 2m3i Di þ 2m3i X2i þ A2i

P

Ci ¼ m1i

is obtained. Obviously, it is also a kernel of normal distribution. Therefore, we get

P

2 xj 2½si ;siþ1 Þ xj ,

2 xj 2½si ;siþ1 Þ yj .



Theorem 6. In the joint distribution in Eq. (35), if parameters a, b, r2l , s and samples X, Y are given, then the full conditional distribution of parameters r2i , i–l will obey inverse Gamma distribution, namely,

589 590 591 592 593

 0 1  2  2 3 2 þ ðAi bi  Bi Þ2 þ 4m2i vi 2m a  2a Y þ 2a b X  2b C þ b D þ E þ a  Y þ b X i i i i i i i i i i i i i i i i m i 2 2 A: ri ja; b; rl ; s; X; Y  IG@ þ ui þ 1; 2 4m2i

599 567 568

Proof. Supposing that X, Y, a, bl , r2 , s, l–i in Eq. (35) are constants, then we get

569

0 

1 2r2 i

@

X

ðyj ai bi xj Þ2 þmi ðai Yi þbi Xi Þ þ12 2

i

576

After derivation and simplification, we get

A2 mi Di þmi X2i þ i2 2m i

1 2r2 i

0

A :

½fðbi ja;bl ;r2 ;s;X;YÞ / e



i i mi

xj 2½si ;siþ1 Þ

575 574 573 572 571

577 578

A b B 2

1

b2i þ

fðbi ja;bl ;r2 ;s;X;YÞ / e

581

In addition, because it’s easy to know

fðr2 ja;b;r2l ;s;X;YÞ / ðr2i Þ

ð 2i þui þ2Þ



1 2r2 i

m

X

@

e

ðyj ai bi xj Þ þmi ðai Yi þbi Xi Þ þ12 2

2

A b B 2 i i mi

xj 2½si ;siþ1 Þ

i

601 602 603

1 þ2vi A

:

605 606



580

600 Proof. Supposing that a, b, r2l , s, X, Y in Eq. (35) are constants, then we get

607



4ai mi Xi 2mi Ci 2mi Xi Yi 

Ai Bi bi m2 i

bi

608

After further derivation and simplification, we get

:

609 610





A2 Ai B i mi Di þ mi X2i þ i 2 b2i þ 4ai mi Xi  2mi Ci  2mi Xi Yi  2 bi 2mi mi 00 12 0 12 1 2 mi Ci þ mi Xi Yi þ A2mi B2i  2ai mi Xi mi Ci þ mi Xi Yi þ A2mi B2i  2ai mi Xi A i i A @ A A; ¼ mi Di þ mi X2i þ i 2 @@bi  A2 A2 2mi mi Di þ mi X2i þ 2mi 2 mi Di þ mi X2 þ 2mi 2 i

582 583

0 B

12@

585

fðbi ja; bl ; r2 ; s; X; YÞ / e

bi 

2m3 Ci þ2m3 Xi Yi þAi Bi 4m3 ai Xi i i i 2m3 Di þ2m3 X2 þA2 i p i i i 2mi ri

ffi pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 3 3 2 2

12 C A

i

fðr

2

ja; b; r2l ; s; X; YÞ

/

ð þui þ2Þ ðr2i Þ 2 e mi



2m3 i



ð

a2 2ai Yi þ2ai bi Xi 2bi Ci þb2 Di þEi þ ai Yi þbi Xi i i 4m2 r2 i i

Þ

2



þðAi bi Bi Þ2 þ4m2 vi i

Therefore, it’s tenable that

2m Di þ2m X þA i i i i

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

:

612 613 614

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution Table 1

11

Results of Bayesian estimation in simulation.

Prior distribution ai jbi ; r2i ; s

bi jr2i ; s

NðYi  bi Xi ; mii Þ

 pffiffi 2 N ABii ; ð 2Ami i ri Þ  pffiffi 2  N ABii ; 2Ami i ri

Uðuai ; vai Þ

Uðubi ; vbi Þ

r2

NðYi  bi Xi ; mii Þ r2

EðkÞ

EðtÞ

EðQÞ

IGðui ; vi Þ

455.14

0.098106

0.559669

Uðui ; vi Þ

476.13

0.098392

0.559673

Uðui ; vi Þ

672.85

0.098240

0.560546

r2i js

Fig. 7

Illustration of a sampling result.

 0 1  2  þ ðAi bi  Bi Þ2 þ 4m2i vi 2m3i a2i  2ai Yi þ 2ai bi Xi  2bi Ci þ b2i Di þ Ei þ ai  Yi þ bi Xi m i A: r2 ja; b; r2l ; s; X; Y  IG@ þ ui þ 1; 2 4m2i

618 619 620 621

Theorem 7. In the joint distribution in Eq. (35), if parameters a, b, r2 , sl and samples X, Y are given, then the full conditional distribution of si ja; b; r2 ; sl ; X; Y about parameters si , i–l is same as the prior distribution pðsi Þ, namely,

fðsi ja; b; r2 ; sl ; X; YÞ / pðsi Þ:

fðsi ja; b; r2 ; sl ; X; YÞ ¼ pðsi Þ:

633

fðsi ja; b; r2 ; sl ; X; YÞ ¼ pðsi Þ:

According to the above full conditional distribution, if the model in Eq. (30) is inferred by Bayesian estimation, then the Gibbs sampling method is shown as the following Algorithm 3. h

634

622 624

625 626



Proof. Similarly, if parameters a, b, r , sl , X, Y in Eq. (35) are deemed as constants, then we obtain 2

Because pðsi Þ is a discrete distribution, we get

629 630 631

627

635 636 637 638

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

CJA 741 27 December 2016

No. of Pages 21

12

H. Li et al.

640 641

Algorithm 3

642

Step 1 Infer pðsi Þ by Algorithm 2, and set k ¼ 0. Step 2 According to Theorem 7, generate n1 discrete random numbers obeying the full conditional distribution

643 644

ðkÞ

ðkÞ

ðkÞ ; sl ; X; YÞ,

645

fðsi ja ; b ; r

646

ði ¼ 1; 2; . . . ; nÞ, which

647 648 649 650 651 652 653 654 655

2ðkÞ

ðkþ1Þ

ðkþ1Þ ðkþ1Þ and satisfy si 6 siþ1 ðkþ1Þ can be taken as si ði ¼ 2; 3; . . . ; nÞ. ðkþ1Þ 0, snþ1 ¼ maxðxi Þ, and then sðkþ1Þ is

Moreover, set s1 ¼ obtained. Step 3 The prior distribution of r2i ði ¼ 1; 2; . . . ; nÞ is estimated according to Algorithm 1. Step 4 Compute Ai , Bi , Ci , Di , Ei , Xi , Yi , mi , li ði ¼ 1; 2; . . . ; nÞ. ð0Þ

ð0Þ

2ð0Þ

Step 5 Determine initial values ai , bi , ri ði ¼ 1; 2; . . . ; nÞ, and set s ¼ 0. Step 6 According to Theorem 4, generate n normal random numbers obeying the full conditional distribution ðsÞ

659

fðai jal ; bðsÞ ; r2ðsÞ ; sðkþ1Þ ; X; YÞ ði ¼ 1; 2; . . . ; nÞ, which are taken as aðsþ1Þ . Step 7 According to Theorem 5, generate n normal random numbers obeying the full conditional distribution

660

fðbi jaðsþ1Þ ; bl ; r2ðsÞ ; sðkþ1Þ ; X; YÞ ði ¼ 1; 2; . . . ; nÞ, which are taken

656 657 658

661 662 663 664 665 666 667 668 669 670 671 672 673 674

ðsÞ

as bðsþ1Þ . Step 8 According to Theorem 6, generate n inverse Gamma random numbers obeying the full conditional distribution

fðr2i jaðsþ1Þ ; bðsþ1Þ ; rl ; sðkþ1Þ ; X; YÞ ði ¼ 1; 2; . . . ; nÞ, which are taken as r2ðsþ1Þ . Step 9 If aðsþ1Þ , bðsþ1Þ , r2ðsþ1Þ tend to converge to a standstill, then set aðkþ1Þ ¼ aðsþ1Þ , bðkþ1Þ ¼ bðsþ1Þ , r2ðkþ1Þ ¼ r2ðsþ1Þ and go to Step 10; otherwise, set s ¼ s þ 1 and go to Step 6. Step 10 If aðkþ1Þ , bðkþ1Þ , r2ðkþ1Þ , sðkþ1Þ satisfy the termination condition, then go to Step 11; otherwise, set k ¼ k þ 1 and go to Step 2. Step 11 Calculate expectations of random sequences aðkÞ , bðkÞ , r2ðkÞ , sðkÞ , denoted as a, b, r 2 , s, which are taken as estimation 2 ^ values ^ a, b, r ^ , ^s. 2ðsÞ

675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699

Assuming that parameters and their estimated values are ^2 Þ, then respectively denoted as h ¼ ða; b; r2 Þ and ^ h ¼ ð^ a; ^ b; r t ¼ k^ h  hk=khk can reflect estimation accuracy. Q in Eq. (32) can measure fitting precision. In addition, iterations k can imply algorithm efficiency. Therefore, the averages of k, t, Q from all simulating calculations, denoted as EðkÞ, EðtÞ, EðQÞ, can indicate the effectiveness and efficiency of Algorithm 3. From Table 1, the effectiveness and efficiency of the algorithm in this paper are satisfying, because the relations among parameters are fully considered.

700

3.1.3. Bayesian regression estimation of segmented distribution

710

In this paper, estimation of segmented distribution by Bayesian regression is based on the following hypotheses: (1) li are given or known and (2) parameters ai , bi , r2i , si in the ith segment are independent of parameters aj , bj , r2j , sj in the jth segment. Usually, these hypotheses can be met in most practical cases. Assuming that failure rate is linear, shown as Eq. (8), according to Eq. (36), Eq. (8) can be transformed into a linear relationship y ¼ ai þ bi x, which can meet the requirements of regression. If the estimation values of ai , bi with the Bayesian ^i ¼ b^i are regression model are denoted as a^i , b^i , then ^ ai ¼ a^i , b achieved.

711

yj ¼ hðtj Þ;

xj ¼ tj  li ;

tj 2 ½si ; siþ1 Þ;

ai ¼ ai ;

¼ bi

ð36Þ

Under the circumstance that samples follow segmented Weibull distribution, if we transform Eq. (20) according to Eq. (37), then the failure rate of samples in the ith interval can be expressed as Eq. (30). Therefore, after all samples are classified, if li , si are known and estimation values a^i , b^i have already been obtained by Bayesian estimation, moreover we  ^ ^ 1 ai bi þ1 ^ i ¼ b^i þ 1, ^ get the estimate values of ai , bi , i.e., b ai ¼ b^eþ1 . yj ¼ ln hi ðtj Þ; ¼

xj ¼ lnðtj  li Þ;

lnðabi i bi Þ;

tj 2 ½si ; siþ1 Þ;

702 703 704 705 706 707 708 709

712 713 714 715 716 717 718 719 720 721 722 723

bi

i

In Algorithm 3, there are many methods to estimate parameters li . In brief, we can set li ¼ 0 or li ¼ si . If n ¼ 1, Step 2 can be skipped, what’s more, Algorithm 3 only needs one loop. In Step 5, the expectations of ai , bi , r2i , which are worked out according to their full conditional distribution, can be ð0Þ ð0Þ 2ð0Þ regarded as initial values ai , bi , ri . In addition, if the dis2 tribution of ri js is a uniform or non-information distribution, then we can get a conclusion similar to Theorems 4–6. Under the same parameters circumstance, compute 1024 groups of simulation sample set. The average results are compared with those of other methods so as to verify the effects of the algorithms in this paper. In simulation verification, set n ¼ 1, a ¼ 2, b ¼ 2, r2 ¼ 0:1, s ¼ 0, l ¼ 0 in Eq. (30). Then set m ¼ 16, namely, every 16 pairs random number ðxj ; yj Þ compose one group sample set. Simulation results are listed in Table 1. The results of non-information and uniform prior distributions are also tabulated in Table 1. Fig. 7 is a scatter diagram illustrating all sampling samples in a certain simulating calculation, which shows that the sampling samples are concentratedly distributed in the neighborhood of the true values of parameters. Fig. 8 illustrates sampling samples in course of iteration, which implies the algorithm begins to converge only at a cost of small quantity of iteration.

701

725 726 727 728 729 730 731 732 733

ai

bi ¼ ðbi  1Þ

ð37Þ

Thus, the failure rate hðtj Þ of sample tj is inferred firstly, and then ðtj ; hðtj ÞÞ is transformed into sample ðxj ; yj Þ according to Eqs. (36) and (37). Next, regression coefficients can be estimated by Algorithm 3. In the end, the regression coefficients are transformed into parameters of segmented distribution.

735 736 737 738 739 740

3.2. MLE of segmented distribution

741

The unknown parameters in the ith interval are written as hi ¼ ½ai bi li T . Supposing that n, si ði ¼ 1; 2; . . . ; nÞ are known, then the likelihood function LðhÞ of n-segment distribution is written as Eq. (38) according to Eq. (6). Taking the logarithm of Eq. (38), we get its logarithmic likelihood function ln L shown as Eq. (39). We take the partial derivative of Eq. (39) with respect to hi , which is the parameter in the ith interval, and get Eq. (40).

742

!

n1 n n X X X X LðhÞ ¼ exp  Jj Ni  Ki ðtk Þ j¼1

i¼jþ1

i¼1 si 6tk
! n Y Y

743 744 745 746 747 748 749 750

hi ðtk Þ

i¼1 si 6tk
ð38Þ

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

752

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution

13

Fig. 8

!

753

ln L ¼ 

n1 X j¼1

þ 755 756

758

Jj

n X

Ni

i¼jþ1

n X X



n X X

Parameters changing with iteration in course of sampling.

Ki ðtk Þ

i¼1 si 6tk
ln hi ðtk Þ

ð39Þ

i¼1 si 6tk
ð40Þ

759

3.2.1. Test change-points by improved information criterion

760

Usually, only when the sample number of each interval is approximately equal or the difference among the lengths of different intervals is slight, partitioning is most reasonable.29,30 In addition, the lower the number of interval is, the more concise the model will be.31 In order to measure the appropriateness of segmented distribution, learning from the idea that the information criterion SIC penalizes the complexity of a model, the information criterion SIC is improved. Thus a novel information criterion ICSD appropriated for segmented distribution is proposed in this paper, which is shown as Eq. (43).32,33 In Eq. (43), ln LðhÞ means the logarithm likelihood function shown as Eq. (39); h are the unknown parameters to be estimated; and dh is the dimensions of unknown parameters h; m is the total number of samples; n is the number of intervals, i.e., the segmentation number; cpenalty is the penalty

761 762 763 764 765 766 767 768 769 770 771 772 773 774

item expressed by the standard deviation, which is calculated by Eq. (41) or (42). In Eq. (41), mi indicates the sample number of the ith interval; snþ1 in Eq. (42) is usually supposed as the maximum sample, and of course, there are other methods to estimate snþ1 . In the special case of n ¼ 1, Eq. (43) is exactly the same as SIC information criterion.34 Therefore, when n and si are given, the MLE of the n-segment distribution can be taken as an optimization problem, which is shown as Eq. (44). sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi

2 1 Xn 1 Xn n cpenalty ¼ m  m ; snþ1 ¼ maxðti Þ ð41Þ i j i¼1 j¼1 i¼1 n n

cpenalty

vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi !2ffi u n u1 Xn X 1 ¼t ðsiþ1  si Þ  ðsjþ1  sj Þ ; i¼1 n n j¼1

775 776 777 778 779 780 781 782 783 784

786 787

snþ1

n

¼ maxðti Þ i¼1

ð42Þ

789 790

ICSDðhÞ ¼ 2 ln LðhÞ þ ðdh þ ðn  1Þ2 þ cpenalty Þ ln m

ð43Þ

792 793

min ICSDsi ;n ðhÞ ¼ min 2 ln LðhÞ þ ðdh þ ðn  1Þ2 þ cpenalty Þ ln m

ð44Þ

Eq. (44) can also be utilized to test whether change-points exist or not. Namely, min ICSDsi ;n ðhÞ < SIC indicates that there obviously exist some change-points and estimating by the seg-

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

795 796 797 798

CJA 741 27 December 2016

No. of Pages 21

14

H. Li et al.

805

mented distribution is appropriate. Let Eq. (40) be 0, we can get likelihood equations. Although the solution of likelihood equations can be regarded as an estimation of unknown parameters, usually we may get no solution at all or multiple solutions due to the too high complexity and dimensions of the likelihood equations, especially in the case of the estimation of li .

806

3.2.2. MLE of segmented linear failure rate model

807

When failure rate is expressed by Eq. (8), the derivatives of Eqs. (8)–(10) are substituted into Eq. (40), and we can get the likelihood equation with respect to ai ; bi in the ith interval of the linear failure rate model, shown as Eq. (45). As the likeln L lihood equation @@l ¼ 0 is an inconsistent equation, only i other methods are utilized to estimate li . In this paper, we set li ¼ minðtk Þ.

799 800 801 802 803 804

808 809 810 811 812 813 814

816

8 n X X X > 1 > > ðsiþ1  si Þ mj þ ðtk  si Þ  ¼0 > ai þbi ðtk li Þ > > si 6tk > > > > > > > < n X 1 X   1 mj þ ðtk  si Þ 12 tk þ 12 si  li > ðsiþ1  si Þ 2 siþ1 þ 2 si  li > si 6tk j¼iþ1 > > > > > > > X > > tk li > ¼0 > ai þbi ðtk li Þ : si 6tk
Fig. 9

In particular, when n = 2, the likelihood equation about a1 ; b1 ; a2 ; b2 of the two-segment linear failure rate model is expressed as Eq. (46). X X 8 1 > ðs2  s1 Þm2 þ ðtk  s1 Þ  ¼0 > a1 þb1 ðtk l1 Þ > > s1 6tk > > > > > >   > > > l1 s2 þ 12 s22 þ l1 s1  12 s21 m2 > > > > > > > X X   > > tk l1 <þ l1 tk þ 12 t2k þ l1 s1  12 s21  ¼0 a1 þb1 ðtk l1 Þ s 6t
819 820

s 6t
1 2 1 2 k k > > > > > X X > > 1 > > ðtk  s2 Þ  ¼0 > a2 þb2 ðtk l2 Þ > > s2 6tk s2 6tk > > > > > > >  X tk l2 > > X > l2 tk þ 12 t2k þ l2 s2  12 s22  ¼0 > a2 þb2 ðtk l2 Þ :

s2 6tk

s2 6tk

ð46Þ ð45Þ

817 818

822

3.2.3. MLE of segmented Weibull distribution

823

For n-segment Weibull distribution, we can substitute the derivative of Eqs. (20)–(22) into Eq. (40), and thus we can get the likelihood equation with respect to ai ; bi ; li in the ith interval of segmented Weibull distribution, shown as Eq. (47).

824

Some distribution curves of bathtub mode.

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

825 826 827 828

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution Table 2

15

Verification results of segmented distribution in bathtub model.

Distribution

n

Bayesian estimation E(ICSD)

E(ln L)

E(Q)

E(ICSD)

E(ln L)

Linear failure rate model

1 2 3

0.763215 0.013991 0.005517

712.025561 1272.742802 1237.180250

351.368390 626.278686 598.496703

0.771646 0.017703 0.009220

586.574952 1136.994683 1236.104312

288.776617 558.576941 597.986081

Segmented Weibull distribution

1 2 3

50.175355 15.869131 2.610704

1210.214449 1227.338421 1251.915052

600.103278 601.092377 600.921539

78.316891 43.976807 48.027381

1082.648733 1113.451129 1137.161508

536.320420 544.134123 543.476008

8 n X > b b > mj > ððsiþ1  li Þ i  ðsi  li Þ i Þ > > > j¼iþ1 > > X > > i > þ ðtk  li Þbi  ðab þ ðsi  li Þbi Þmi ¼ 0 > i > > > si 6tk > > > n X > > > > ððsiþ1  li Þbi lnðai ðsiþ1  li ÞÞ  ðsi  li Þbi lnðai ðsi  li ÞÞÞ mj > > > j¼iþ1 < X i þ ððtk  li Þbi  ab i Þ lnðai ðtk  li ÞÞ > > > si 6tk > > > > > mi ðsi  li Þbi lnðai ðsi  li ÞÞ  bmi i ¼ 0 > ai bi > > > > n > X X > bi 1 bi 1 > > ððs  l Þ  ðs  l Þ Þ mj þ ððtk  li Þbi 1 > iþ1 i i i > > > s 6t > > > :  b bi 1 Þ  mi ðsi  li Þbi 1 ¼ 0 i ai bi ðtk li Þ

ð47Þ

830 831 832 833

In particular, if n = 2, we get the likelihood equation of two-segment Weibull distribution as Eq. (48). 8 X b1 b1 > ððs ðtk  l1 Þb1 2  l1 Þ  ðs1  l1 Þ Þm2 þ > > > s1 6tk > > > > b1 b1 > ða1 þ ðs1  l1 Þ Þm1 ¼ 0 > > > > > > ððs2  l1 Þb1 lnða1 ðs2  l1 ÞÞ  ðs1  l1 Þb1 lnða1 ðs1  l1 ÞÞÞm2 > > > X > > 1 > þ ððtk  l1 Þb1  ab > 1 Þ lnða1 ðtk  l1 ÞÞ > > > s1 6tk > > > > > m1 ðs1  l1 Þb1 lnða1 ðs1  l1 ÞÞ  bm1 1 ¼ 0 > a1 b1 > > > > > b1 1 b1 1 >  ðs1  l1 Þ Þm2 < ððs2  l1 Þ

X b1 1 b1 1 > >  m1 ðs1  l1 Þb1 1 ¼ 0 þ ðt  k  l1 Þ b1 > > ðtk l1 Þa1 b1 > s 6t 1 2 k > > > >X 2 > ðtk  l2 Þb2  m2 ðab þ ðs2  l2 Þb2 Þ ¼ 0 > 2 > > > s 6t 2 k > > > >X > 2 > ððtk  l2 Þb2  ab > 2 Þ lnða2 ðtk  l2 ÞÞ > > s2 6tk > > > > > > m2 ðs2  l2 Þb2 lnða2 ðs2  l2 ÞÞ  bm2 2 ¼ 0 > > a2 b2 > > > X > b2 1 b 1 > 2 > ððtk  l2 Þ  Þ  m2 ðs2  l2 Þb2 1 ¼ 0 > b : ðtk l2 Þa2 2 b2 s2 6tk

835

836

837 838

MLE

E(Q)

and MLE, the detailed steps of the method to estimate parameters are shown as the following Algorithm 4.

839 840 841

Algorithm 4 Step 1 Ascendingly sort lifetime samples to form sample set S ¼ ft1 ; t2 ; . . . ; tk ; . . . ; tm g and then estimate the failure rate hðtk Þ of the kth sample tk . Step 2 Determine the feasible region of si ði ¼ 1; 2; . . . ; nÞ and initial values of parameters. There are usually two methods to estimate the feasible region of si . (1) The parameters of segmented distribution are estimated by Algorithm 3. If the results of Bayesian estimation are ^si ¼ t^i , then set Wi ¼ ft^in0 ; . . . ; t^i1 ; t^i ; t^iþ1 ; . . . ; t^iþn0 g, namely, there are n0 samples around t^i which are regarded as the feasible region of si . (2) ðtk ; hðtk ÞÞ ðk ¼ 1; 2; . . . ; mÞ are categorized by a certain clustering method. If the best number of categories is n, then the distribution is also divided into n intervals. If the center of the ith class is denoted as ci , then we let Wi ¼ ft; t 2 S ^ ci1 < t < ci g ði ¼ 2; 3; . . . ; nÞ. In addition, we usually set W1 ¼ f0g or W1 ¼ fmint2S ðtÞg. We take Wi as the feasible solutions of si , namely si 2 Wi ði ¼ 1; 2; . . . ; nÞ. Step 3 Estimate the distribution parameters by MLE. Step 3.1 Initialize ICSDbest , abest , bbest , sbest , lbest . Step 3.2 Choose a group si ði ¼ 1; 2; . . . ; nÞ from Wi ði ¼ 1; 2; . . . ; nÞ. Step 3.3 The sample set S is divided into n intervals according to si ði ¼ 1; 2; . . . ; nÞ. Step 3.4 Solve the optimization problem in Eq. (44) or the likelihood equation in Eq. (45) or (47), and the result is denoted as ICSDsi ;n . If ICSDsi ;n < ICSDbest , set ICSDbest ¼ ICSDs;n , abest ¼ a, bbest ¼ b, sbest ¼ s, lbest ¼ l. Step 3.5 Go to Step 3.2 and compute repeatedly until all si ði ¼ 1; 2; . . . ; nÞ have been searched. Step 4 Output the solutions abest , bbest , sbest , lbest . 843

Steps 3.2–3.5 construct a loop, which can search optimal si ði ¼ 1; 2; . . . ; nÞ from their feasible region Wi . In order to estimate the parameters of segmented distribution, Algorithm 4 will solve the optimization problem or its likelihood equation Q for ni¼1 jWi j times.

844 845 846 847 848

4. Verification by simulation

849

Because the failure rate of a bathtub curve changes in an obvious phased manner over the entire lifetime cycle, a bathtub curve is suitable to test the segmented distribution and its estimation method.35,36 In order to test the performance of segmented distribution, a new bathtub curve model is put forward in this paper. Unlike what Lai et al. 37, 38 proposed,

850

ð48Þ 3.3. Flow of parameters estimation of segmented distribution If the sample set of segmented distribution is S ¼ ft1 ; t2 ; . . . ; tk ; . . . ; tm g, according to Bayesian estimation

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

851 852 853 854 855

CJA 741 27 December 2016

No. of Pages 21

16 Table 3

H. Li et al. Lifetime sample of proximity sensors in leading-edge flap.

Sample (day)

22

35

43

45

47

69

89

112

211

213

231

254

269

271

287

304

391

Frequency

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

Fig. 10

Table 4

Empirical reliability of proximity sensors in leading-edge flap estimated by product limit method.

Results of proximity sensors in leading-edge flap when li ¼ si .

Model

Bayesian estimation

n

MLE

Q

ICSD

Q

ICSD

Linear failure rate model

1 2 3

0.385113 0.391301 0.050524

998.322910 1495.454200 983.191595

0.461216 0.283457 0.603541

742.080213 1234.126147 1156.456345

Segmented Weibull distribution

1 2 3

3.013787 3.401046 1.392337

834.545652 1206.334961 929.041437

30.792925 23.915894 15.149311

329.938043 733.151436 554.069265

Table 5

Distribution parameters of proximity sensors in leading-edge flap when li ¼ si .

Method

Model

n

[a1, a2, a3]

[b1, b2, b3]

[s1, s2, s3]

[l1, l2, l3]

Bayesian estimation

Linear failure rate model

1 2 3 1 2 3

0.0003 0.0456, 0.0023 0.0432, 0.0830, 0.0118 0.0309 0.0431, 0.1000 0.0445, 0.0887, 0.1056

0.0015 0.0006, 0.0025 0.0006, 0.0004, 0.0049 1.7219 1.3331, 1.2538 1.2972, 1.0384, 1.3957

22 0, 89 0, 69, 211 0 0, 112 0, 69, 211

21.78 0, 88.11 0, 68.31, 208.89 0 0, 110.88 0, 68.31, 208.89

1 2 3 1 2 3

0.0004 0.0470, 0.0046 0.0455, 0.0664, 0.0095 0.0206 0.0287, 0.0667 0.0297, 0.0591, 0.0704

0.001 0.0003, 0.0022 0.0004, 0.0003, 0.0033 1.3776 1.0665, 1.0031 1.0378, 0.8307, 1.1165

22 22, 112 22, 69, 211 0 0, 112 0, 69, 211

10.89 21.78, 92.57 21.50, 36.58, 107.05 0 0, 55.02 0, 33.87, 104.43

Segmented Weibull distribution

MLE

Linear failure rate model

Segmented Weibull distribution

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

CJA 741 27 December 2016

No. of Pages 21

Study on segmented distribution Table 6

17

Results of proximity sensors in leading-edge flap when li ¼ 0.

Model

n

Bayesian Estimation

MLE

Q

ICSD

LnL

Q

ICSD

LnL

Linear failure rate model

1 2 3

0.023444 0.004479 0.000207

524.299062 826.590032 804.948647

259.441481 238.972833 250.450112

0.651190 0.493083 0.183909

680.265902 667.489418 765.032881

337.299737 337.507106 363.828470

Segmented Weibull distribution

1 2 3

0.736003 0.078336 0.068887

524.709035 818.894616 714.654662

259.646467 235.125125 237.598933

10.156623 6.117920 1.393693

509.523560 646.610284 842.849242

251.928567 245.101964 299.779892

Table 7

Distribution parameters of proximity sensors in leading-edge flap when li ¼ 0.

Method

Model

n

a1

Bayesian estimation

Linear failure rate model

1 2 3 1 2 3

0.032900 0.058192 0.049372 0.035840 0.043426 0.049048

1 2 3 1 2 3

0.029909 0.053672 0.043542 0.032581 0.039478 0.046568

Segmented Weibull distribution MLE

Linear failure rate model

Segmented Weibull distribution

a2

a3

b1

0.634507 0.171940

1.161458

0.006345 0.045332

0.006345

0.483327 0.018532

1.161420

0.005768 0.037705

0.007923

b2

0.000658 0.000299 0.000467 1.532463 1.327502 1.177229 0.000598 0.000257 0.000455 1.393148 1.206820 1.027791

b3

0.003251 0.001363

0.005208

4.797477 1.306900

4.797477

0.002298 0.000510

0.004735

4.361343 1.243400

3.675273

s1 0 0 0 0 0 0 0 0 0 0 0 0

s2

s3

231 211

271

231 45

231

213 112

271

231 43

211 885

856 857 858 859

the failure rate of the new bathtub curve model in this paper is not two additive Weibull distributions, but a sum of one increasing exponential function and another decreasing exponential function.

t

hðtÞ ¼ c1 ed1 t þ c2 ed2 > 0;

c1 > 0; c2 > 0; d1 > 0; d2

c1 d1 d2 > 1; t 2 ½0; 1Þ c2

RðtÞ ¼ e

4.1. A novel bathtub curve

861

A recent bathtub curve model proposed by Jiang 39 is proven to achieve a good effect, however, it is finite support. Learning from the thought of Jiang, a novel bathtub curve model is proposed in this paper, whose definition domain is ½0; þ1Þ, so as to verify the segmented distribution herein. If the failure rate decreases, h0 ðtÞ < 0, hðtÞ > 0 are obviously tenable. Therefore, supposition h0 ðtÞ ¼ d1 hðtÞ, d1 > 0 could be considered reasonable, which is a simplest differential equation. So when the failure rate decreases, the failure rate function hdec ðtÞ can be expressed as hdec ðtÞ ¼ c1 ed1 t . When the failure rate increases, then there exist h0 ðtÞ > 0, hðtÞ > 0, so assuming that hðtÞ ¼ d2 h0 ðtÞ, d2 > 0 is reasonable. After the differential equation is solved, we obtain the increasing

887 888

t c1 d t c e 1 c2 d2 ed2 d1 þc2 d2 d1 1

860

ð49Þ ð50Þ

890 891

862 863 864 865 866 867 868 869 870 871 872 873

t

874 875 876 877 878 879 880 881 882 883 884

failure rate function hinc ðtÞ ¼ c2 ed2 . If add hinc ðtÞ to hdec ðtÞ, then the failure rate function hðtÞ of the bathtub curve model is obtained as Eq. (49). Obviously, 2 ln c2 Þ hðtÞ has only one stationary point t ¼ d2 ðln cd11dd12dþ1 in the whole domain ½0; þ1Þ. hðtÞ is monotonously decreasing in the interval ½0; t Þ, but it is monotonously increasing in the interval ½t ; 1Þ. Meanwhile, it is easy to know that the reliability function and probability density function of the novel bathtub curve model can be expressed as Eqs. (50) and (51), respectively. Some curves of hðtÞ, fðtÞ, RðtÞ of the bathtub model are shown in Fig. 9

d1 t

fðtÞ ¼ ðc1 e

t d2

t c1 d t c e 1 c2 d2 ed2 d1 þc2 d2 d1 1

þ c2 e Þe

ð51Þ

4.2. Verifying segmented distribution under circumstance of bathtub curve Supposing that the random number r  Uð0; 1Þ is known, then the solution of equation

893

894 895

896 897

t

lnð1  rÞ ¼ dc11 ed1 x  c2 d2 ed2  dc11 þ c2 d2 , namely x, is a random number obeying the distribution in Eq. (50). Every 80 random numbers that obey the bathtub curve model are taken as a group of sample. Let li ¼ 0 and cpenalty is calculated by Eq. (42), and the simulation sample is estimated by the segmented model. Then randomly generate 100 groups of sample set, and the average results of estimation of 100 times are listed in Table 2. Q, ICSD, and ln L should be worked out according to Eqs. (32), (39), and (43) after each estimation, and the averages of Bayesian estimation and MLE from the estimation results of 100 times are denoted as EðQÞ, EðICSDÞ, Eðln LÞ, respectively. In addition, the results of Bayesian estimation are taken as the initial values of MLE, and directly solve the optimization problem in Eq. (44), rather than its likelihood equation. The experiment shows that the effect of segmented distribution is satisfying. (1) No matter Bayesian estimation or MLE,

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

898 899 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914

CJA 741 27 December 2016

No. of Pages 21

18

H. Li et al.

Fig. 11

Comparison of fitting effect of proximity sensors in leading-edge flap when li ¼ si .

Fig. 12

Comparison of fitting effect of proximity sensors in leading-edge flap when li ¼ 0.

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

CJA 741 27 December 2016

Study on segmented distribution

Fig. 13 The best segmentation fitting of proximity sensors in leading-edge flap.

936

the effect of segmented distribution is much better than that of non-segmented distribution. When the failure rate changes sharply, it is hard to fit with the Bayesian estimation under the circumstance of one and same distribution, because samples are too disorderly and unsystematic to meet the fitting prerequisites. Sometimes, even if the distribution can be estimated, the approximation error is very great. (2) It’s not like that the larger segmented number is the better the estimation will be. The number of segments in essence is determined by the nature of sample itself, and it is also closely related with the distribution model. When the distribution model is different, the optimal segment number n may also be different. If si is inconsistent with the characteristics of the sample itself, even though the segment number n is large, the estimation error is serious, too. (3) The parameters si in segmented distribution are critical and sensitive. Even though si slightly change, the results are obviously different. Such as being the case, the subtle nuance of the segmentation pattern can lead to very different results. (4) In measuring the pros and cons of segmented distribution and its estimation, Q is a more accurate index than ICSD when n is not the same. Only when the number of segments n is the same, can ICSD be comparable.

937

5. Application instance of segmented distribution

915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935

938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953

The leading-edge flap in aircraft is an airfoil, moveable device installed on the front part of the wing. It can deflect downward or (and) slide backward (forward), which is mainly utilized to increase lifting force during flight. On flaps/slats, 30 proximity sensors are fixed, among which there are eight proximity sensors in leading-edge flap. A flap/slat electronics unit (FSEU) can monitor the impedance of the proximity sensors, which is related with the positions of the sensors. Once the leadingedge flap moves, the target will move together and the impedance will change too. In this way, the proximity sensors can measure the position of the target that moves with the control surface. The proximity sensors provides the position data of the leading-edge flap to the FSEU. According to these data, the FSEU in turn controls the indicator panel of the leadingedge device and the lamps inside the cockpit, indicating the position. The FSEU also provides these data to the flight data

No. of Pages 21

19 acquisition system, a computer in a stall management yaw damper, and the proximity switch electronic unit. Therefore, the proximity sensors in leading-edge flap are very important for the manipulation of aircraft. Considering the importance of the proximity sensors in leading-edge flap to the aircraft safety, accurate evaluation of their reliability is very necessary. Here taking a B737-700 fleet of 25 aircraft as an example, the failure data of the proximity sensors in leading-edge flap from 2001 to 2012 have been collected from an airlines. After preprocessing, the lifetime samples of the proximity sensors in leading-edge flap are listed in Table 3, where the unit of the lifetime data is day. Firstly, the lifetime samples in Table 3 are estimated by the product limit method, and the estimation results, namely, failure rate and reliability are shown in Fig. 10. In Fig. 10, it can be seen that the failure rate function hðtÞ of the proximity sensors in leading-edge flap obviously is in a phase-wise manner: in the early period of their operation, the failure rate constantly keeps at a relatively low level; in the later period, the failure rate rises sharply, which is similar to the failure model B recommended by RCM from Fig. 1. Therefore, only segmented distribution can accurately be utilized to evaluate their reliability. Firstly, the distribution of the proximity sensors in the leading-edge flap is inferred by Bayesian estimation, namely Algorithm 3. Next, the parameters in segmented distribution are estimated by MLE according to Algorithm 4. Then under the condition that cpenalty is calculated with Eq. (42) and li ¼ 0, one-, two-, and three-segment linear failure rate models and Weibull models are respectively utilized to fit the reliability of the proximity sensors in leading-edge flap. The results of Bayesian estimation and MLE are respectively listed in Tables 4 and 5 when li ¼ si . In addition, estimation results are tabulated in Tables 6 and 7 when li ¼ 0. Comparison with various distributions and methods is shown in Figs. 11 and 12, after summarizing estimation results. In order to determine the most appropriate distribution of the proximity sensors in leading-edge flap, we must find out the optimal model from all the estimation results of MLE and Bayesian estimation. In this paper, the priority sequence is sorted respectively according to Q, ICSD obtained from each model. Then add the two sequence numbers of each model together. The model whose summation of sequence number is minimum is taken as the optimal model. Thus, the optimal distribution of the proximity sensors in leading-edge flap is a two-segment linear failure rate model estimated by Bayesian estimation, i.e., Eq. (17). The estimation results are tabulated in Table 4. From Table 4, we can know that the parameters of the optimal distribution are a ¼ ½0:058192; 0:634507, b ¼ ½0:000299; 0:003251, s ¼ ½0; 231, and l ¼ ½0; 0. Fig. 13 shows the best fitting of the failure rate hðtÞ of the proximity sensors in leading-edge flap. If all samples are not divided, the best distribution is a one-segment linear failure model, whose parameters are a ¼ 0:057239, b ¼ 0:001525. From Fig. 13, the error of non-segment fitting is very great, and even it is not a valid distribution. By comparison, segment distribution has an obvious advantage. In addition, the following can be concluded from this example. (1) Usually, no matter what distribution samples follow, they are measured from perspective of the whole lifetime. However, after the samples are divided, the local property of

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010 1011 1012 1013 1014

CJA 741 27 December 2016

No. of Pages 21

20

H. Li et al.

1024

the samples plays a more important role. The segmented distribution emphasizes the local characteristics of the samples, thus the expression of segmented distribution is often not the same as that of non-segmented distribution. For instance, even if the samples follow Weibull distribution, they may no longer follow segmented Weibull distribution after segmentation. (2) Although the reliability function of segmented distribution is continuous, the segmented failure rate function hðtÞ may be often discontinuous whether it is a linear failure rate mode or segmented Weibull distribution.

1025

6. Conclusion and discussion

1015 1016 1017 1018 1019 1020 1021 1022 1023

1026 1027 1028 1029 1030 1031 1032 1033

1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060

In this paper, two n-segment segmented reliability models are proposed, and then how to estimate parameters of these segmented reliability models by Bayesian estimation and MLE is discussed. Whatever in the simulation verification or in the application examples, the segmented distribution and its estimation methods have achieved good effects. Nevertheless, there are still some problems related to the segmented distribution, which deserve our further study. (1) The unknown parameters contained in the segmented distribution are relatively too many, which makes the application of the segmented distribution become more difficult. (2) Although the estimation methods for the segmented distribution are discussed in this paper, these methods are still not accurate enough in estimating li , si , so in future research, better estimation methods should be studied, especially some methods to accurately estimate li , si . (3) The segmented function hðtÞ discussed in this article is usually discontinuous, which may be inconsistent with real circumstances. Although the reliability function is continuous, errors may still occur, so continuous segmented distribution should be discussed in the future. (4) Only when there exist change-points in failure rate, the segmented distribution can take its advantage. Thus there is a problem of detecting change-points before estimation by the segmented distribution. In this paper, merely MLE is preliminarily discussed to test changepoints, which is not sufficient. In the future, methods to detect change-points should deserve a careful study, especially Bayesian method and likelihood ratio. (5) From the experiments in this paper, it’s concluded that precision of the Gibbs sampling method is not sufficiently high. In future research, it is necessary that the Gibbs sampling method should be modified to improve its accuracy.

1061

1062

Acknowledgements

1063

This work was supported by the National Natural Science Foundation of China (Nos. 60672164, 60939003, 61079013, 60879001, 90000871), the Special Project about Humanities and Social Sciences in Ministry of Education of China (No. 16JDGC008), National Natural Science Funds, and Civil Aviation Mutual Funds (Nos. U1533128 and U1233114).

1064 1065 1066 1067 1068

References

1069

1. Zhao XL, Qian T, Mei G, Kwan C, Zane R, Walsh C, et al. Active health monitoring of an aircraft wing with an embedded piezoelectric sensor/actuator network: II. Wireless approaches. Smart Mater Struct 2007;16(4):1218–25. 2. Wu B, Tian Z, Chen M. Condition-based maintenance optimization using neural network-based health condition prediction. Qual Reliab Eng Int 2012;29(8):1151–63. 3. Bae SJ, Mun BM, Kim KY. Change-point detection in failure intensity: a case study with repairable artillery systems. Comp Indust Eng 2013;64(1):11–8. 4. Toms JD, Lesperance ML. Piecewise regression: a tool for identifying ecological thresholds. Ecology 2003;84(8):2034–41. 5. Atashgar K. Identification of the change point: an overview. Int J Adv Manuf Technol 2013;64(9–12):1663–83. 6. Bae SJ, Yuan T, Ning SL, Kuo W. A Bayesian approach to modeling two-phase degradation using change-point regression. Reliab Eng Syst Safety 2015;134:66–74. 7. Kvam PH. A change-point analysis for modeling incomplete burnin for light displays. IIE Trans 2006;38:489–98. 8. Li YX, Qian LF, Zhang W. Estimation in a change-point hazard regression model with long-term survivors. Statist Probab Lett 2013;83(7):1683–91. 9. He P, Kong G, Su Z. Estimating the survival functions for rightcensored and interval-censored data with piecewise constant hazard functions. Contemp Clin Trials 2013;35(2):122–7. 10. Uhm D, Huffer FW, Park C. Additive risk model using piecewise constant hazard function. Commun Statist—Simul Comput 2011;40(9):1458–77. 11. Liu ZH, Qian LF. Changepoint estimation in a segmented linear regression via empirical likelihood. Commun Statist - Simul Comput 2009;39(1):85–100. 12. Goodman MS, Li Y, Tiwari RC. Detecting multiple change points in piecewise constant hazard functions. J Appl Statist 2011;38 (11):2523–32. 13. Suresh RP. A test for constant hazard against a change-point alternative. Commun Statist - Theory Meth 2012;41(9):1583–9. 14. Nosek K, Szkutnik Z. Change-point detection in a shape-restricted regression model. Statist: A J Theoret Appl Statist 2013;48 (3):641–56. 15. Chou K, Tang K. Burn-in time and estimation of change-point with Weibull-Exponential mixture distribution. Dec Sci 1992;23 (4):973–90. 16. Achcar JA, Loibel S. Constant hazard function models with a change point: a Bayesian analysis using Markov chain Monte Carlo methods. Biomet J 1998;40(5):543–55. 17. Boukai B. Bayes sequential procedure for estimation and for determination of burn-in time in a hazard rate model with an unknown change point parameter. Seq Anal 1987;6(1):37–53. 18. Patra K, Dey DK. A general class of change point and change curve modeling for life time data. Ann Inst Statist Math 2002;54 (3):517–30. 19. Igba J, Alemzadeh K, Anyanwu-Ebo I, Gibbons P, Friis J. A systems approach towards reliability-centred maintenance (RCM) of wind turbines. Proc Comp Sci 2013;16(1):814–23. 20. Dehghanian P, Fotuhi-Firuzabad M, Aminifar F, Billinton R. A comprehensive scheme for reliability centered maintenance in power distribution systems—Part I: methodology. IEEE Trans Power Deliv 2013;28(2):761–70. 21. Li YX, Qian LF. Likelihood ratio test for a piecewise continuous Weibull model with an unknown change point. J Math Anal Appl 2014;412(1):498–504. 22. Pandya M, Jani PN. Bayesian estimation of change point in inverse Weibull sequence. Commun Statist-Theory Meth 2006;35 (12):2223–37.

1070

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126 1127 1128 1129 1130 1131 1132 1133

CJA 741 27 December 2016

Study on segmented distribution 1134 1135 1136 1137 1138 1139 1140 1141 1142 1143 1144 1145 1146 1147 1148 1149 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 1160 1161 1162 1163 1164

23. Zhang T, Xie M. On the upper truncated Weibull distribution and its reliability implications. Reliab Eng Syst Safety 2011;96 (1):194–200. 24. Sarhan AM, Apaloo J. Exponentiated modified Weibull extension distribution. Reliab Eng Syst Safety 2013;112(4):137–44. 25. Maboudou-Tchao Edgard M, Hawkins Douglas M. Detection of multiple change-points in multivariate data. J Appl Statist 2013;40 (9):1979–95. 26. Hu Y, Zhao K, Lian H. Bayesian quantile regression for partially linear additive models. Statist Comput 2015;25(3):651–68. 27. Alhamzawi R, Yu K. Conjugate priors and variable selection for Bayesian quantile regression. Comput Statist Data Anal 2013;64 (4):209–19. 28. Reich BJ, Smith LB. Bayesian quantile regression for censored data. Biometrics 2013;69(3):651–60. 29. Nosek K. Schwarz information criterion based tests for a changepoint in regression models. Statist Papers 2010;51(4):915–29. 30. Pan JM, Chen JH. U-statistic based modified information criterion for change point problems. Commun Statist - Theory Meth 2008;37(17):2687–712. 31. Chen JH, Pan JM. Information criterion and change point problem for regular models. Ind J Statist 2006;68(2):252–82. 32. Shen G, Ghosh JK. Developing a new BIC for detecting changepoints. J Statist Plan Inf 2011;141(4):1436–47. 33. Ninomiya Y. Information criterion for Gaussian change-point model. Statist Probab Lett 2005;72(3):237–47. 34. Hannart A, Naveau P. An improved Bayesian information criterion for multiple change-point models. Technomet A J Statist Phys Chem Eng Sci 2012;54(3):256–68. 35. El-Gohary A, Alshamrani A, Al-Otaibi AN. The generalized Gompertz distribution. Appl Math Model 2013;37(1–2):13–24.

No. of Pages 21

21 36. Mazucheli J, Coelho-Barros EA, Achcar JA. Inferences for the change-point of the exponentiated Weibull hazard function. REVSTAT 2012;10(3):309–22. 37. Xie M, Lai CD. Reliability analysis using an additive Weibull model with bathtub-shaped failure rate function. Reliab Eng Syst Safety 1996;52(1):87–93. 38. Bebbington M, Lai CD, Wellington M, Zitikis R. The discrete additive Weibull distribution: a bathtub-shaped hazard for discontinuous failure data. Reliab Eng Syst Safety 2012;106 (5):37–44. 39. Jiang R. A new bathtub curve model with a finite support. Reliab Eng Syst Safety 2013;119:44–51. Huaiyuan Li is a lecturer in Jinling Institute of Technology. He graduated and obtained a doctor degree in engineering from Nanjing University of Aeronautics and Astronautics in 2016. He received the Master of Engineering in computer application and technology from Beifang University of Nationalities in 2008. His current research interests include reliability statistics, artificial intelligence and software engineering.

1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185

Hongfu Zuo is currently a professor and PhD student supervisor in Nanjing University of Aeronautics and Astronautics. Presently, his research interests include reliability engineering, vehicle application engineering, man-machine maintainability and so on

1186

Juan Xu is a lecturer in Nanjing University Of Aeronautics And Astronautics. She is a Ph.D. student in Nanjing University of Aeronautics and Astronautics. She interest in reliability, safety and maintainability of equipment, supportability engineering.

1191

Please cite this article in press as: Li H et al. Study on segmented distribution for reliability evaluation, Chin J Aeronaut (2016), http://dx.doi.org/10.1016/j. cja.2016.12.008

1187 1188 1189 1190 1192 1193 1194 1195