Selection of predictor variables in assessing the severity of malocclusion

Selection of predictor variables in assessing the severity of malocclusion

T. J. Freer, Brisbane, B.D.Sc., Ph.D., Queensland, F.D.S.R.C.S., D.Orth.* Awtralia I n attempting to measure the over-all severity of a person’...

487KB Sizes 0 Downloads 13 Views

T. J. Freer, Brisbane,

B.D.Sc., Ph.D.,

Queensland,

F.D.S.R.C.S.,

D.Orth.*

Awtralia

I

n attempting to measure the over-all severity of a person’s malocclusion, several aspects of the problem are not always explicitly stated. Severity is generally accepted to be a continuous variable, ranging from ideal to very severe, although only a limited number of discrete points along the continuous scale are employed. However, very little is known about the processes by which a clinician arrives at an over-all severity score when it is demanded of him. The abstract nature of an over-all severity assessment is almost certainly a source of intraexaminer and interexaminer variation. It has been shown that this variability may be high.l A severity assessment may consist initially of a search for certain pathognomonic traits. Depending on the presence or absence of these traits, the clinician then looks at other secondary traits in arriving at a recommendation for or against treatment. It is also probable that previous clinical experience influences this decision. If, for example, the clinician can be reasonably sure that a certain condition will relapse if treated, he is more likely to recommend against treatment. Quantitative methods for estimating a person’s over-all severity score, usually implying an estimation of the need for orthodontic treatment, are dependent on the selection of a set of measured variables of the occlusion which may be used to determine that score. A large number of variables may be measured. Some method is required to determine which of these are used by the clinician in making a severity assessment. Observation of the different variables employed in previous indices of malocclusion strongly suggests that selection of the best set of predictors by means of clinical experience alone is not reliable. It is assumed that, a comprehensive set of variables will include the important *Dental

School,

University

of

Queensland,

Brisbane,

australia.

155

156

Table

Am. J. Orthod. August 1973

Freer I. Variables

used

in the

calculation

of

regression

equations

Abbreviation

Original Variables 1.

Overjet

2.

Labiolingual

OVJET spread ** hnaximum

value upper)

3.

Labiolingual

spread

(sum of upper

4.

Labiolingual

spread

(maximum

5.

Labiolingual

spread

(sum of lower

6.

Anterior

spacing

upper

spacing

lower

7.

Anterior Arch

length-upper

right

9.

Arch

length-upper

left minus

width

10.

Arch

Lingual

12.

Buccal

13.

Missing

14.

Upper

Composite

m

value lower)

LLSML

values)

LLSSL

ASL minus

minus

lower lower

right

ALURMLR ALULMLL

left

AWL3M4

m

crossbite-number

of posterior

crossbite-number

of posterior

teeth-number median

LLSSU

ASU

8.

11.

LLSMU

values)

of posterior

teeth

LXBT

teeth

BXBT MPTNO

teeth

diastema

UMD

Variables OVJET**2

1.

Overjet

squared

2.

Overjet

cubed

3.

Labiolingual

spread

(maximum

upper

value)

squared

LLSMU**2

4.

Labiolingual

spread

(maximum

upper

value)

cubed

LLSMU**3

5.

Cross-product (maximum

OVJET**3

of overjet upper

and labiolingual

spread

OVJ*LLS

value)

--

* XI

All distance Draker.

measurements

made to nearest

whole

millimetre.

1 9604.

predictors. For obvious reasons, however, the minimum number of predictors consistent with accuracy should be employed. Multiple regression offers a pragmatic and simple approach to the investigation of this problem, although th’e assumption of a linear model is not necessarily correct. A perplexing diEculty in applying multiple regression methods is the magnitude of the error of measurement in the criterion variable (severity). It has been shown not only that this error may be large but also that the error variance in the criterion is dependent on the magnitude of the criterion. As the severity of malocclusion increases, error variance decreases1 The methods which may be used to deal with the problem of unequal error variance, such as transformations, will not be discussed further here. In spite of any shortcomings, multiple regression techniques are useful in the preliminary investigation of a set of measured variables in order to determine which variables influence the clinician’s assessment of severity and are therefore most likely to be useful in

Volume Number

Table

64 2

Xelection of predictor

II. Stepwise

Step

multipie

Variable Entered

regression

OF severity

R

score

(O-10)

K2

on twelve

variables predictor

I

variables

increase in R*

ResiduaE Mean Square

OVJET

0.622

0.387

0.387

3.355

2

LLSMU

0.764

0.584

0.207

0.933

3

BXBT

0.780

0.608

0.024

0.891

4

AWL3M4

0.791

0.626

0.018

0.864

5

ASU

0.797

0.635

0.009

0.856

6

0.643

0.008

0.849

ASL

0.802

7

MPTNO

0.807

0.651

0.008

0.842

8

UMD

0.809

0.654

0.003

0.850

9

LLSSU

0.811

0.658

0.004

0.852

10

LLSSL

0.813

0.661

0.003

0.861

11

LXBT

0.815

0.664

0.003

0.867

12

LLSML

0.815

0.665

0.001

0.878

any quantitative assessment system. The present report concerns such an investigation, and its aim is purely expository. It is not designed to provide an index of the severity of malocclusion. Materials

and

methods

A sample of seventy-two sets of study models taken in upstate New York and Iowa was used. The present study was confined to persons with distocclusion as defined by Grainger. 2 Fourteen predictor variables were measured on each set of study models. In addition, five new predictor variables mere derived from two of the original fourteen predictor variables. These five new L’composite” variables were the second-order (squared) and third-order (cubed) terms of the original measurement of overjet, the second- and third-order terms of the original measurement of labiolingual spread (maximum upper value), and the cross-product term of overjet and labiolingual spread (maximum upper value). All predictor variables are shown in Table I. The dependent or criterion variable used in all analyses was the 0 to POseverity score of Grainger.” Each set of study models was evaluated by three orthodontists, and a score on the 0 to 10 scale was given, The mean of the three scores, to the nearest whole number, was used as the criterion variable. Stepwise multiple linear regressions were performed on the data, using the BMD” and SPSSt statistical packages. Standard methods were used to evaluate the regression analyses, including plots of normalized residuals and plots of residuals against the criterion and predictor variables. “Dixon, W. J. (Editor) : Biomedical California Press. SNie, N., Bent, D. H., and Hull, Company, Inc.

Computer C.

H.

Programs,

(Editors)

: New

Los York,

Angeles, 1970,

1970,

University

M&raw-Hill

of Book

158

Am. J. Orthod. August 1973

Freer

l’able III. Stepwise multiple including --

higher

order

regression severity score (O-10) on thirteen predictor and cross-product terms of the original variables

Variable Entered

1

OVJ’LLS

0.668

0.446

0.446

1.224

2

OVJET**2

0.716

0.513

0.067

1.091

I32

Increase in R2

Residual Mean Square

step

R

variables,

3

LLSMU

0.812

0.660

0.147

0.774

4

LLSML

0.827

0.684

0.024

0.730

5

MPTNO

0.836

0.700

0.016

0.704

6

LLSMU**2

0.843

0.710

0.010

0.689

7

BXBT

0.850

0.723

0.012

0.670

8

OVJET

0.864

0.729

0.006

0.666

9

LLSSU

0.857

0.734

0.005

0.664

10

LLSMU*“3

0.861

0.741

0.007

0.658

11

ASU

0.862

0.744

0.003

0.660

12

ASL

0.864

0.747

0.003

0.662

13

AWL3M4

0.866

0.749

0.002

0.668

Results

A preliminary regression on the original fourteen predictor variables showed that two variables added virtually nothing to the fit of the f%ial equation. These two variables (arch length-upper right minus lower right ; arch lengthupper left minus lower left) also contained missing data values. Because the residual plotting facilities of the program BMDOBR were to be employed and this program would not delete missing data, the two variables containing missing values were not considered further. The stepwise regression on the remaining twelve variables is summarized in Table II. A final multiple correlation of 0.82 was obtained ; however, the residual mmean square value began to rise after Step 8. The increase in the percentage of explained variance (R2) after Step 2 was, small, and after Step 4 it was less than 1 per cent at each step. The 65 per cent of explained variance at Step 8 has risen only 3 per cent from Step 4. A plot of normalized residuals from this regression showed that the fit of the model for negative residuals was poor and that second- and third-order terms fo-r some of the predictors might be more appropriate. Previous plots of the criterion against two of the predictors also suggested the presence of a seeondor third-order relationship. Accordingly, second- and third-order terms for the first two variables were investigated. In addition, the cross-product term of these two variables was investigated. Therefore, five more composite predictor variables were added to the original twelve variables, making a total of seventeen variables. A stepwise regression of severity on these seventeen variables was then carried out, giving a final multiple correlation coefficient of 0.866. Evaluation of the residual mean square values and the increase in the percentage of ex-

Selection

Volume Nunzbei-

64 2

Table

IV.

Stepwise

multiple

regression

of

severity

score

of predictor (O-10)

an

vartibles thirteen

1

predictor

variables Step

Variable Entered

I32

R

Increase in R2

Yean Square

1

DUMOOl NM002

0.195

0.038

0.038

2.156

2

OVJ*LLS

0.691

0.478

0.440

1.187

3

DVJET**2

0.740

0.547

0.069

1.045

4

LLSMU

0.830

0.688

0.141

0,730

5

LLSML

0.848

8.718

0.030

0.670

6

OVJET

0.859

0.739

0.021

0.632

7

LLSSU

0.868

0.753

0.014

0.607

8

MPTMO

0.873

0.763

0.010

0.592

9

ASU

0.876

0.768

0.005

0.589

10

LLSMU**2

0.878

0.772

0.004

0.588

11

LLSMU**3

0.885

0.782

0.011

0.570

12

ASL

0.886

0.786

0.004

0.571

13

LAW3M4

0.887

0.787

0.001

0.579

*One of these variables present = 1).

(buccal cross-bite)

was used as a dummy variable1 (a.bsent = 0 3

plained variance due to regression at each step suggested that at least four of these seventeen variables should be discarded. The four variables discarded were the values for upper median diastema, lingual cross-bite, third-order (cubed) term for overjet, and labiolingual spread (sum of lower values). The stepwise regression on the remaining thirteen variables is shown in Table III. This gave a final multiple correlation coefficient of 0.866, the same as for the regression on all seventeen variables. It should be noted that the residual mean square value begins to rise after Step 10. This suggests that even more of these variables could be discarded. Inclusion of the new composite terms has given an improvement over the regression on twelve variables (Table II). Of particular interest is the importance of the cross-product term and the second-order term for overjet. Previous plots of the criterion (severity) and residuals against raw scores of buccal cross-bite indicated that this variable was more important when considered on a presence-absence basis rather than as a continuous variable (number of teeth in buccal cross-bite) ~ -4 further regression was carried out using buceal cross-bite as a dummy variable (absent = 0, present = 1) which was forced in on the first step (Table IV). A further improvement in the result was seen and the residual mean square value began to rise after Step 11, the value of R at this step being 0.885. Discussion

The quantitative estimation of the over-all severity of malocclusion requires, for reasons of efficiency and cost, the minimum number of predictors consistent

160

Freer

Am. J. Orthod. August 1973

with accuracy. It is desirable, therefore, not only to eliminate those variables which do not influence the clinician’s assessment but also to reduce the numbers of highly intercorrelated variables, which are equally good predictors. Multiple stepwise regression is a reasonably simple method for achieving such reduction. The danger is that some variables will be overlooked because of the particular composition of the sample used. The other technique for identifying important variables, which is now widely available in statistical computer packages, is discriminant function analysis. This, however, suffers from the disadvantage of requiring the prior identification of groups, which, at present, cannot be done confidently. In the initial stages, however, these two techniques may be complementary, particularly with the rigorous analysis of residuals in regression in I order to identify outliers. One of the predictor variables used in the present study has not previously been used extensively-labiolingual spread.4 Although it reflects anterior crowding, it is more highly correlated with severity than other measures of crowding, such as metric assessments of space shortage. Labiolingual spread values will be increased with relative displacement of anterior teeth from any cause such as ectopic eruption. It may be speculated, therefore, that the clinician is not so much interested in crowding per se as in the displacement that it causes. Of the twelve original variables, the highest correlation for the severity score was with overjet (0.62) and the next highest was with labiolingual spreadmaximum upper value (0.55). The correlation of severity with the cross-product term of these two variables (0.67) was higher than for either variable alone. Also of interest were the correlations of severity with the four variables dependent on this same feature, that is, labiolingual spread. Severity was more highly correlated with the maximum upper value for labiolingual spread (0.55) than with the sum of the upper values (0.49). The correlations with the equivalent lower arch variables were much lower (0.20 and 0.21). The present sample is composed of distocclusion cases only, but earlier studies3 indicate that overjet is an important predictor of severity in all types of malocclusion. This point draws attention to the difficulty of interpreting estimating equations for samples of mixed buccal segment classes. Almost certainly, clinicians give different weights to various predictors, depending on the buccal segment classification. Again, experience has shown that estimations of severity, regardless of technique, will be more accurate if different buccal segment classes are separated first. It may be that Angle’s classification is so ingrained in orthodontic diagnosis that it influences the clinician unduly. Alternatively, Angle’s classification may be based on just that variable which clinicians tend to assessfirst before making an o-ver-all assessment. The importance of the cross-product term in the present analyses suggests that the clinician assesses at least some variables in the light of the values of other variables-that is, there is a strong interactive effect. While the researcher is usually limited in the number of regression analyses that may be made for any one set of data, more attention should be given to the influence of combinations of variables on the clinician. The introduction of buccal cross-bite as a

Volume Number

64 2

Selection

of

predictor

variables

1

dummy variable and t,he resulting improvement in the regression solution support the idea that the clinician does some prior sorting of variables before making an assessment. Summary

Multiple regression techniques have been used to investigate the importance to the clinician of different variables when making an over-a,11 assessment of the severity of malocclusion. The results indicated that most of the variance in the criterion (severity) was explained by a limited number of predictors. One of the predictors which has not been widely used (labiolingual spread) was shown to be very important. There was shown to be an important interactive effect of at least two of the predictors (overjet and labiolingual spread). When one of the predictors (buccal cross-bite) was used as a dummy variable, the regression fit was further improved. In future investigations, attention should be given to the effects of combinations of variables on the clinician’s assessments of severity. and

I would like Dr. Naham

to express Cons for

my their

thanks to Dr. James help with all phases

P. Carlos, Dr. J. Grewe, of this project.

Mr.

R’. Senning,

REFERENCES

1. Freer, T. J., Grewe, J. M., and Little, R. M.: Agreement among the subjective severity asses’sments of 10 orthodontists, Angle Orthod. 43: 42-45, 1973. 2. Grainger, R. M.: Orthodontic treatment priority index, Ser. 2, No. 25, Washington, 1967, United States Department of Health, Education and Welfare. 3. Freer, T. J.: Assessment The matched pair similarity technic, Int. of occlusal status: Dent. J. 22: 412-422, 1972. 4. Draker, H. L. : Handicapping labiolingual deviations : A proposed index for public health purposes, AM. J. ORTHOD. 46: 295-305, 1960. Twrbot

St.