A new class of score generating functions for regression models
Statistics & Probability Letters 57 (2002) 205–214
A new class of score generating functions for regression models & & urkb; ∗ Young Hun Choia , Omer...
Statistics & Probability Letters 57 (2002) 205–214
A new class of score generating functions for regression models & & urkb; ∗ Young Hun Choia , Omer Ozt& a
b
Department of Statistics, Hanshin University, South Korea 447-791 Department of Statistics, The Ohio State University, 1958 Neil Avenue, Columbus, Oh 43210, USA Received June 2001; received in revised form December 2001
Abstract In this paper we introduce a new score generating function for the rank dispersion function in a multiple linear regression model. The score function compares the rth and sth power of the tail probabilities of the underlying probability model. We show that the rank estimator of the regression parameter based on the proposed score function converges asymptotically to a multivariate normal distribution. Further, we discuss the selection of the appropriate r and s to improve the e5ciency of the rank estimate of the regression parameter. It is shown that for right- (left-) skewed distributions the values of r ¡ s (s ¡ r) provide higher c 2002 Elsevier Science B.V. All rights reserved. e5ciency than the Wilcoxon scores. Keywords: Wilcoxon score; Rank estimate; Asymptotic normality; Pitman e5ciency; Score selection
c 2002 Elsevier Science B.V. All rights reserved. 0167-7152/02/$ - see front matter PII: S 0 1 6 7 - 7 1 5 2 ( 0 2 ) 0 0 0 6 1 - 5
. Ozt. . urk / Statistics & Probability Letters 57 (2002) 205–214 Y.H. Choi, O.
206
rth and sth power of the left tail probabilities of the underlying distribution. On the other hand, & urk and Hettmansperger (1997) state that if there is no knowledge about the outlier pattern, or Ozt& if the sample has both small and large outliers, the weight function that controls the robustness and & urk e5ciency of the estimator must reEect both on right and left tail probabilities. Therefore, Ozt& (2001) considered another class of Mann–Whitney–Wilcoxon test statistic by incorporating both right and left tail behavior of the underlying distributions. & urk and The main purpose of this paper is to introduce the weight function presented in Ozt& & Hettmansperger (1997) and Ozt&urk (2001) into the rank estimate of the regression parameters in & urk and Hettmansperger (1997) is embedded into the score linear models. The weight function of Ozt& generating function of rank dispersion function which produces similar results as in the minimum distance estimators. In Section 2, we propose our new score generating function. We deKne the dispersion function Dr; s (), see for example Eq. (2), and show that its minimizer ˆr; s converges to a multivariate normal distribution. In Section 3, we compare the e5ciency of the rank estimator based on our proposed score generating function with the e5ciency of the rank estimator based on the Wilcoxon scores and McKean and Sievers (1989) scores, respectively. In Section 4, we provide guidance for the selection of r and s that provides improvement over the Wilcoxon scores.
2. Assumption and score function Consider the linear regression model yi = + xi + ei , i = 1; : : : ; n, where xi and are p × 1 vectors of explanatory variables and unknown regression parameters, respectively, and ei is a random variable with density f and distribution function F with F(0) = 1=2. In this model, we consider the rank regression estimate of the regression parameter . In its general form, Jaeckel’s (1972) rank dispersion function can be stated as D() =
n i=1
(yi − xi ) a[R(yi − xi )];
where a(1) 6 a(2) 6 · · · 6 a(n) is a set of scores generated by a(i) = (i=(n + 1)) and ni a(i) = 0. We assume that the score generating function (u) is a nondecreasing, square-integrable and bounded function on (0; 1). Under fairly general conditions, the minimizer of D() produces a robust estimator in y-space. The property of such estimator is studied in detail for a general score function (·) in Jure;ckov
. Ozt. . urk / Statistics & Probability Letters 57 (2002) 205–214 Y.H. Choi, O.