Comparison of estimation methods for the parameters of the weighted Lindley distribution

Applied Mathematics and Computation 220 (2013) 463–471 Contents lists available at SciVerse ScienceDirect Applied Mathematics and Computation journa...

Download PDF

835KB Sizes 0 Downloads 69 Views

Report

PDF Reader
Full Text

Applied Mathematics and Computation 220 (2013) 463–471

Contents lists available at SciVerse ScienceDirect

Applied Mathematics and Computation journal homepage: www.elsevier.com/locate/amc

Comparison of estimation methods for the parameters of the weighted Lindley distribution J. Mazucheli a, F. Louzada b, M.E. Ghitany c,⇑ a

Universidade Estadual de Maringá, DEs, PR, Brazil Universidade de São Paulo, ICMC, SP, Brazil c Department of Statistics & O.R., Faculty of Science, Kuwait University, Kuwait b

a r t i c l e

i n f o

a b s t r a c t The aim of this paper is to compare through Monte Carlo simulations the ﬁnite sample properties of the estimates of the parameters of the weighted Lindley distribution obtained by four estimation methods: maximum likelihood, method of moments, ordinary leastsquares, and weighted least-squares. The bias and mean-squared error are used as the criterion for comparison. The study reveals that the ordinary and weighted least-squares estimation methods are highly competitive with the maximum likelihood method in small and large samples. Statistical analysis of two real data sets are presented to demonstrate the conclusion of the simulation results. Ó 2013 Elsevier Inc. All rights reserved.

Keywords: Weighted Lindley distribution Maximum likelihood Method of moments Ordinary least-squares Weighted least-squares

1. Introduction The Lindley distribution was introduced by Lindley [7], see also [8], in the context of ﬁducial distributions and Bayes’ theorem. Its probability density function (p.d.f.) is given by

f1 ðxjaÞ ¼

a2 ð1 þ xÞ eax ; aþ1

x > 0;

a > 0:

ð1Þ

The corresponding cumulative distribution function (c.d.f.) and hazard rate function (h.r.f.), respectively, are given by

ax eax ; F 1 ðxjaÞ ¼ 1 1 þ aþ1

x > 0;

a2 ð1 þ xÞ ; að1 þ xÞ þ 1

a > 0:

a > 0;

ð2Þ

and

h1 ðxjaÞ ¼

x > 0;

ð3Þ

Note that h1 ðxjaÞ is an increasing function in x, for all a > 0. Ghitany et al. [5] studied many statistical properties of the Lindley distribution from the reliability/survival analysis point of view. They also showed, using a real data set, that the Lindley distribution provides a better ﬁt than the exponential distribution. Application of the Lindley distribution in the competing risks analysis and in stress-strength reliability studies are considered by Mazucheli and Achcar [9] and Al-Mutairi et al. [1], respectively. ⇑ Corresponding author. E-mail address: [email protected] (M.E. Ghitany). 0096-3003/$ - see front matter Ó 2013 Elsevier Inc. All rights reserved. http://dx.doi.org/10.1016/j.amc.2013.05.082

464

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

Recently, Ghitany et al. [4] introduced the weighted Lindley distribution with p.d.f.

f ðxja; bÞ ¼ where CðaÞ ¼

abþ1 ða þ bÞCðbÞ

R1

xb1 ð1 þ xÞeax ;

x > 0;

a; b > 0;

ð4Þ

xa1 ex dx; a > 0, is the gamma function. The corresponding c.d.f. and h.r.f., respectively, are given by

0

Fðxja; bÞ ¼ 1

ða þ bÞCðb; axÞ þ ðaxÞb eax ða þ bÞCðbÞ

x > 0;

a; b > 0;

ð5Þ

and

hðxja; bÞ ¼

abþ1 xb1 ð1 þ xÞeax ða þ bÞCðb; axÞ þ ðaxÞb eax

x > 0;

a; b > 0;

ð6Þ

R1 where Cða; bÞ ¼ b xa1 ex dt; a > 0; b P 0, is the upper incomplete gamma function. Note that the Lindley distribution is a special case of the weighted Lindley distribution when b ¼ 1. Also, its h.r.f. hðxja; bÞ is bathtub-shaped (increasing) function in x if 0 < b < 1 (b P 1) for all a > 0. For reviews about bathtub-shaped hazard rate functions, see [11,10]. As far as the estimation of the parameters of the weighted Lindley distribution, [4] considered only the maximum likelihood estimation (MLE) method. It is of interest to compare the MLE method with other estimation methods such as the method of moments estimation (MME), ordinary least-squares estimation (OLSE) and weighted least-squares estimation (WLSE) methods. The main aim of this paper is to identify the most efﬁcient estimators among four estimators for different shape parameters values and sample sizes. The originality of this study comes from the fact that there has been no previous work comparing all of these estimators for the two-parameter weighted Lindley distribution. Related studies for other distributions can be found in, for example, [6,12,13]. In Section 2 we discuss the four estimation methods considered in this paper. The comparison of these methods in terms of bias and mean-squared error is presented in Section 3. The four estimation methods are used in ﬁtting two real data sets in Section 4. Some concluding remarks are presented in Section 5. 2. Methods of estimation In this section we describe the four considered estimation methods to obtain the estimates of the parameters a and b of the weighted Lindley distribution. 2.1. Maximum likelihood Let x1 ; x2 ; . . . ; xn be a random sample of size n from the weighted Lindley distribution with parameters a and b with p.d.f. (4). b MLE and b b MLE , of a and b, respectively, are given Ghitany et al. [4] showed that the maximum likelihood estimates (MLEs) a by

ab MLE ¼

b MLE ðx 1Þ þ b

qﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ 2 ½b b MLE ðx 1Þ þ 4 b b MLE ð b b MLE þ 1Þ x b MLE Þ; ¼ nð b 2x

ð7Þ

where x is the sample mean, b b MLE is the solution of the non-linear equation:

n ln ðnðbÞÞ

X n 1 wðbÞ þ lnðxi Þ ¼ 0; nðbÞ þ b i¼1

ð8Þ

d and wðbÞ ¼ db ln CðbÞ is the digamma function.

2.2. Method of moments The method of moments is another technique commonly used in parameter estimation. For the weighted Lindley distribution, we have

EðXja; bÞ ¼

b ða þ b þ 1Þ ; a ða þ bÞ

VarðXja; bÞ ¼

ðb þ 1Þða þ bÞ2 a2

a2 ða þ bÞ2

:

b MME and b b MME for a and b, respectively, are obtained by solving the equations: The method of moments estimates a

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

b MME ; b b MME Þ ¼ x; EðXj a

465

b MME ; b b MME Þ ¼ s2 ; VarðXj a

x and s2 are the sample mean and (biased) sample variance, respectively. where It follows that

ab MME ¼

b MME ðx 1Þ þ b

qﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ 2 b MME ðx 1Þ þ 4x b b MME ð b b MME þ 1Þ ½b ; 2x

ð9Þ

b MME is given by where b

b b MME ¼

bðx; s2 Þ þ

rﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ h i ﬃ 2 ½bðx; s2 Þ þ 16s2 s2 þ ðx þ 1Þ2 x 3 h i ; 2s2 s2 þ ðx þ 1Þ2

ð10Þ

and

bðx; s2 Þ ¼ s4 x ðx 3 þ 2 x 2 þ x 4 s2 Þ:

ð11Þ

2.3. Ordinary and weighted least-squares Let xð1Þ 6 xð2Þ 6 6 xðnÞ be the order statistics of a random sample of size n from the weighted Lindley distribution with c.d.f. (5). It is well known that:

E F xð i Þ ¼

i iðn i þ 1Þ and Var F xðiÞ ¼ : nþ1 ðn þ 1Þ2 ðn þ 2Þ

b OLSE and b b OLSE of the parameters a and b, respectively, are obtained by minimizing the function The least square estimates a

Sða; bÞ ¼

n X FðxðiÞ ja; bÞ i¼1

2 i : nþ1

These estimates can also be obtained by solving the non-linear equations: n X FðxðiÞ ja; bÞ

i D1 ðxðiÞ ja; bÞ ¼ 0; nþ1 i¼1 n X i FðxðiÞ ja; bÞ D2 ðxðiÞ ja; bÞ ¼ 0; nþ1 i¼1

ð12Þ ð13Þ

where

D1 ðxja; bÞ ¼ ½ða þ bÞð1 þ xÞ þ 1

D2 ðxja; bÞ ¼

eax ðaxÞb ða þ bÞ2 CðbÞ

ð14Þ

;

CðbÞJðaxjbÞ Cðb; axÞJð0jbÞ CðbÞ þ ða þ bÞJð0jbÞ ðaxÞb eax ; logðaxÞ 2 ða þ bÞCðbÞ ða þ bÞCðbÞ C ðbÞ

ð15Þ

and

JðtjbÞ ¼

Z

1

logðyÞ yb1 ey dy;

t P 0:

ð16Þ

t

b WLSE and b b WLSE of the parameters a and b, respectively, are obtained by minimizing The weighted least-squares estimates a the function

Wða; bÞ ¼

n X ðn þ 1Þ2 ðn þ 2Þ i¼1

iðn i þ 1Þ

FðxðiÞ ja; bÞ

2 i : nþ1

These estimates can also be obtained by solving the non-linear equations:

1 i FðxðiÞ ja; bÞ D1 ðxðiÞ ja; bÞ ¼ 0; iðn i þ 1Þ nþ1 i¼1 n X 1 i FðxðiÞ ja; bÞ D2 ðxðiÞ ja; bÞ ¼ 0; iðn i þ 1Þ nþ1 i¼1

n X

where D1 ðxðiÞ ja; bÞ and D2 ðxðiÞ ja; bÞ are given by (14) and (15), respectively.

ð17Þ ð18Þ

466

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

3. Simulations In this section we present results of some numerical experiments to compare the performance of the different estimators discussed in the previous section. We have taken sample sizes n ¼ 20; 30; . . . ; 100; a ¼ 0:5; 2, and b ¼ 0:5; 1; 2. For each combination ðn; a; bÞ, we have generated N ¼ 20; 000 pseudo-random samples from the weighted Lindley distribution using the acceptance–rejection algorithm proposed by Ghitany et al. [4]. The estimates were obtained using

b (: MLE, h: MME, M: OLSE and O: WLSE). Fig. 1. Bias of a

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

467

Ox version 6.20, (see [3]), using the maximization package MaxBFGS. To assess the performance of the methods, we calculated the bias and the mean-squared error for the simulated estimates of a and b:

hÞ ¼ Biasðb

N 1X h i hÞ; ðb N i¼1

hÞ ¼ MSEðb

N 2 1X h i hÞ ; ðb N i¼1

h ¼ a; b:

Figs. 1 and 2 show, respectively, the bias of the simulated estimates of a and b. From these two ﬁgures, we observe that:

Fig. 2. Bias of b b (: MLE, h: MME, M: OLSE and O: WLSE).

468

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

1. all estimators of the parameters are positively biased, i.e. the estimators exceed the true value of the parameters, 2. the biases of all estimators of the parameters tend to zero for large n, i.e. the estimators are asymptotically unbiased for the parameters, 3. the OLS (MM) estimator has the smallest (largest) biase among the considered four estimators, 4. the LS and WLS estimators have almost identical biases.

b (: MLE, h: MME, M: OLSE and O: WLSE). Fig. 3. MSE of a

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

469

Figs. 3, 4 show, respectively, the MSE of the simulated estimates of a and b. From these two ﬁgures, we observe that: 1. the MSE of all estimators of the parameters tend to zero for large n, i.e. all estimators are consistent for the parameters, 2. the MM estimators have the largest MSE among the considered four estimators,

Fig. 4. MSE of b b (: MLE, h: MME, M: OLSE and O: WLSE).

470

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

Table 1 Parameters estimates, K–S test, A–D test, and SSR for data set 1. Method

MLE MME OLSE WLSE

Estimated parameters

ab

b b

0.0073 0.0106 0.0049 0.0054

0.2769 0.8367 0.0702 0.0912

K–S

p-value

A–D

p-value

SSR

0.1157 0.1598 0.1338 0.1139

0.9468 0.6899 0.8627 0.9530

0.4250 1.0768 0.3674 0.3182

0.8214 0.3170 0.8788 0.9229

0.0549 0.1139 0.0553 0.0410

K–S

p-Value

A–D

p-Value

SSR

0.0723 0.0723 0.0702 0.0619

0.8620 0.8619 0.8831 0.9517

0.3325 0.3574 0.3179 0.2957

0.9112 0.8893 0.9246 0.9412

0.0494 0.0529 0.0474 0.0430

Table 2 Parameters estimates, K–S test, A–D test, and SSR for data set 2. Method

MLE MME OLSE WLSE

Estimated parameters

ab

b b

12.8227 13.2878 11.4272 11.7905

28.0882 29.1317 25.2070 25.9071

3. the ML, LS and WLS estimators have almost identical MSE, 4. All estimators have very close MSE when the parameter b > 1. 4. Examples In this section we analyze two real data sets for comparing the considered four estimation methods for the weighted Lindley distribution. Data set 1: ([14]). This data set represents the time to failure of 18 electronic devices:

5; 1; 21; 31; 46; 75; 98; 122; 145; 165; 195; 224; 245; 293; 321; 330; 350; 420: Data set 2: ([2]). This data set represents the failure stresses (in GPa) of 65 single carbon ﬁbers of length 50 mm:

1:339; 1:434; 1:549; 1:574; 1:589; 1:613; 1:746; 1:753; 1:764; 1:807; 1:812; 1:84; 1:852; 1:852; 1:862; 1:864; 1:931; 1:952; 1:974; 2:019; 2:051; 2:055; 2:058; 2:088; 2:125; 2:162; 2:171; 2:172; 2:18; 2:194; 2:211; 2:27; 2:272; 2:28; 2:299; 2:308; 2:335; 2:349; 2:356; 2:386; 2:39; 2:41; 2:43; 2:431; 2:458; 2:471; 2:497; 2:514; 2:558; 2:577; 2:593; 2:601; 2:604; 2:62; 2:633; 2:67; 2:682; 2:699; 2:705; 2:735; 2:785; 3:02; 3:042; 3:116; 3:174: In comparing the four considered estimation methods for the weighted Lindley distribution, we adopt the following criteria: (i) highest p-value of the Kolmogorov–Smirnov test (which considers the greatest difference between the theoretical and empirical distributions), (ii) highest p-value of the Anderson–Darling test (which gives more weight to the tails of the distribution), (iii) smallest sum of squares of the residuals (SSR) between the theoretical and empirical distributions, i.e. P ^ F n ðxðiÞ Þ2 . ^ ; bÞ SSR ¼ ni¼1 ½FðxðiÞ ja Tables 1 and 2 show that the WLSE method satisﬁes the above adopted criteria, i.e. highest p-values of the K–S and A–D tests as well as smallest sum of squares of the residuals. Hence, for each of the given data sets, we conclude that the weighted least-squares estimation method is the most adequate among the four considered estimation methods. 5. Conclusions In this paper we compared, via intensive simulation experiments, the estimation of the parameters of the weighted Lindley distribution using four well known estimation methods, namely the maximum likelihood, method of moments, ordinary least-squares, and weighted least-squares. The simulations show that the weighted least-squares is a highly competitive method compared to the maximum likelihood method. This is also supported by the applications of two real data sets.

J. Mazucheli et al. / Applied Mathematics and Computation 220 (2013) 463–471

471

Acknowledgements J. Mazucheli and F. Louzada gratefully acknowledge the ﬁnancial support from the National Council for Scientiﬁc and Technological Development (CNPq) and São Paulo Research Foundation (FAPESP). References [1] D.K. Al-Mutairi, M.E. Ghitany, D. Kundu, Inferences on stress-strength reliability from Lindley distributions, Communications in Statistics – Theory and Methods 42 (2013) 1443–1463. [2] M.G. Bader, A.M. Priest, Statistical aspects of ﬁber and bundle strength in hybrid composites, in: T. Hayashi, K. Kawata, S. Umekawa (Eds.), Progress in Science and Engineering Composites, vol. ICCM-IV, 1982, pp. 1129–1136. [3] J.A. Doornik, Object-Oriented Matrix Programming Using Ox, 3rd ed., Timberlake Consultants Press and Oxford, London, 2007. [4] M.E. Ghitany, F. Alqallaf, D.K. Al-Mutairi, H.A. Husain, A two- parameter weighted Lindley distribution and its applications to survival data, Mathematics and Computers in Simulation 81 (2011) 1190–1201. [5] M.E. Ghitany, B. Atieh, S. Nadarajah, Lindley distribution and its application, Mathematics and Computers in Simulation 78 (4) (2008) 493–506. [6] D. Kundu, M.Z. Raqab, Generalized Rayleigh distribution: different methods of estimations, Computational Statistics and Data Analysis 49 (1) (2005) 187–200. [7] D.V. Lindley, Fiducial distributions and Bayes’ theorem, Journal of the Royal Statistical Society, B 20 (1958) 102–107. [8] D.V. Lindley, Introduction to Probability and Statistics from a Bayesian Viewpoint, Part II: Inference, Cambridge University Press, New York, 1965. [9] J. Mazucheli, J.A. Achcar, The Lindley distribution applied to competing risks lifetime data, Computer Methods and Programs in Biomedicine 104 (2) (2011) 189–192. [10] S. Nadarajah, Bathtub-shaped failure rate functions, Quality and Quantity 43 (5) (2008) 855–863. [11] S. Rajarshi, M.B. Rajarshi, Bathtub distributions: A review, Communications in Statistics – Theory and Methods 17 (1988) 2597–2621. [12] A.I. Shawky, R.A. Bakoban, Exponentiated gamma distribution: different methods of estimations, Journal of Applied Mathematics, Art. ID 284296 (2012) 23 pp. [13] M. Teimouri, S.M. Hoseini, S. Nadarajah, Comparison of estimation methods for the Weibull distribution, Statistics 47 (1) (2013) 93–109. [14] F.K. Wang, A new model with bathtub-shaped failure rate using an additive Burr XII distribution, Reliability Engineering and System Safety 70 (2000) 305–312.

Comparison of estimation methods for the parameters of the weighted Lindley distribution

Comparison of estimation methods for the parameters of the weighted Lindley distribution

Recommend Documents