Personality and Individual Differences 39 (2005) 1383–1393 www.elsevier.com/locate/paid
The spiritual well-being questionnaire: Testing for model applicability, measurement and structural equivalencies, and latent mean differences across gender Rapson Gomez *, John W. Fisher School of Behavioural and Social Sciences and Humanities, University of Ballarat, P.O. Box 663, Ballarat, 3353, Victoria, Australia Received 14 April 2004; accepted 22 March 2005 Available online 15 September 2005
Abstract Fisher (1998) proposed a spiritual well-being model, comprising primary factors for the domains of personal, communal, environmental and transcendental well-being, that cohere to form a single higher order or global spiritual well-being dimension. In line with this model, Gomez and Fisher (2003) published the Spiritual Well-Being Questionnaire (SWBQ), with scales for measuring personal, communal, environmental and transcendental spiritual well-being. This study used multi-group confirmatory factor analysis (CFA) to examine gender equivalencies of the measurement and structural models of the SWBQ, and the latent mean in the four SWBQ factors. A total of 3101 females and 1361 males, with age ranging from 15 to 32 years, completed the SWBQ. The statistical fit results supported the invariance of the measurement model, and some aspects of the structural model. The practical fit indices results provided support for the invariance of both the measurement and structural models. The results also showed little gender differences. Together, these findings support gender equivalencies for the SWBQ. Ó 2005 Published by Elsevier Ltd. Keywords: Spiritual well-being questionnaire; Gender; Invariance; Latent mean differences
*
Corresponding author. Tel.: +61 3 532 79 760; fax: +61 3 532 79 754. E-mail address:
[email protected] (R. Gomez).
0191-8869/$ - see front matter Ó 2005 Published by Elsevier Ltd. doi:10.1016/j.paid.2005.03.023
1384
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
1. Introduction Spirituality and other spiritual related areas are now emerging as an important area of research (Miller, 2003; Seeman, Dubin, & Seeman, 2003). Recently, Fisher (1998) proposed a hierarchical multidimensional model of spiritual well-being. The model comprises four oblique primary order factors, namely personal, communal, environmental and transcendental, that all cohere to form a higher order secondary overall or global spiritual well-being factor. In FisherÕs (1998; see also Gomez & Fisher, 2003) model, the personal domain deals with how one intra-relates with oneself with regard to meaning, purpose and values in life. The communal domain is expressed in the quality and depth of inter-personal relationships, between self and others, and includes love, justice, hope, and faith in humanity. The environmental domain deals with enjoyment, care and nurture for the physical and biological world, including a sense of awe, wonder and unity with the environment. The transcendental domain deals with the relationship of self with some-thing or some-One beyond the human level, such as a cosmic force, transcendent reality, or God, and involves faith towards, adoration and worship of, the source of mystery of the universe. FisherÕs multidimensional model has been supported in a number of studies (Gomez & Fisher, 2003; Fisher, 1998, 2001; Fisher, Francis, & Johnson, 2000). Gomez and Fisher (2003) have recently published the Spiritual Well-Being Questionnaire (SWBQ). The SWBQ comprises 20 items, five items for each of the four domains identified by Fisher. In a series of four studies reported in the same paper, Gomez and Fisher (2003) provided evidence for the factorial validity of the SWBQ. Both exploratory factor analyses and confirmatory factor analyses supported the four-factor oblique model. The four-factor oblique model showed statistically better fit than a four factor orthogonal model and also a one-factor model comprising all 20 SWBQ items. A joint factor analysis of the four SWBQ domains with EysenckÕs personality dimensions (Eysenck & Eysenck, 1991) showed that the spiritual well-being domains were independent of the personality dimensions, thereby supporting their factorial independence. Also, consistent with predictions from existing theory and data, the SWBQ domain scores for personal, communal, and environmental spiritual well-being correlated as expected with extraversion, neuroticism, psychoticism, and happiness. The SWBQ factor scores also contributed additional variance over that of the personality dimensions in the prediction of happiness, thereby indicating support for their incremental validity. The results also showed that SWBQ scores correlated appropriately with the scores of EllisonÕs (1983) Spiritual Well-Being Scale. In addition, the four studies supported the reliabilities of the four subscales in terms of internal consistency, composite reliability, and variance extracted. Overall, these findings indicate good support for validity and reliability of the SWBQ. However for the SWBQ to be useful when gender is considered, we will need first of all to demonstrate the invariance of the SWBQ and comparability of the factor scores across the gender groups. In our previous study (Gomez & Fisher, 2003), exploratory factor analyses showed support for a similar factor structure and loadings for males and females. However, exploratory factor analysis does not provide a clear test of invariance since the analysis is conducted separately for each group and uses observed scores, which contain measurement error (Byrne, 1998). A more valid approach is to use multi-group CFA as this approach tests for invariance for both groups simultaneously, and uses latent scores that are free of measurement error. So far there are no data on how females and males compare on the SWBQ factors of personal, communal, environmental
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
1385
and transcendental spiritual well-being. Exploration of this using latent scores that are free of measurement error would be valuable in this respect. This can also be done using multi-group CFA. The current study used multi-group CFA to examine the equivalencies (invariance) of the measurement (i.e., number of factors and factor loadings) and structural (i.e., factor variances and covariances) models of the SWBQ in males and females, and also the latent mean differences for these factors across these groups. The path model for the SWBQ is shown in Fig. 1. As shown, there are four factors, namely personal, communal, environmental and transcendental, that correlate with each other. Each of the factors had five indicators (the appropriate items from the SWBQ), and there was no loading of indicators across factors or correlation between residuals.
P1 P2 P3 P4 P5 PER C1 C2 C3 COM C4 C5 E1 ENV E2 E3 E4 TRA E5 T1 T2 T3 T4 T5
Fig. 1. The four-factor oblique model of the SWBQ. (Note: P, C, E and T are questionnaire items for personal, communal, environmental and transcendental spiritual well-being, respectively. PER, COM, ENV and TRA are latent factors for personal, communal, environmental and transcendental spiritual well-being, respectively.)
1386
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
2. Method 2.1. Participants The sample in this study consisted of 4462 participants from mainly secondary schools and universities, and some participants from the general community (mainly church groups) in Australia. There were also university students from the UK and Ireland. Their ages ranged from 15 to 32 years. The sample represented approximately 70% of individuals invited to participate in the study. There were 3101 females and 1361 males. The mean age (SD) for females and males were 22.16y (4.56) and 21.63y (4.80), respectively. All age categories had more or less equal proportions of females and males. 2.2. Measure All respondents completed the SWBQ. This questionnaire has already been described in detail in the introduction. The SWBQ has scales for personal, communal, environmental, and transcendental spiritual well-being. In all there are 20 items, with five items for each of the four scales. All items are positively worded. Respondents were instructed to indicate how they felt the statements provided in the questionnaire described their personal experience over the last 6 months. All items were rated on a five-point Likert scale, ranging from very low (rated 1) to very high (rated 5). The personal items were ‘‘developing a sense of identity’’, ‘‘developing self-awareness’’, ‘‘developing joy in life’’, ‘‘developing inner peace’’, and ‘‘developing meaning in life’’. The communal items were ‘‘developing a love for other people’’, ‘‘developing forgiveness towards others’’, ‘‘developing trust between individuals’’, ‘‘developing respect for others’’, and ‘‘developing kindness towards other people’’. The environmental items were ‘‘developing connection with nature’’, ‘‘developing awe at breathtaking view’’, ‘‘developing oneness with nature’’, ‘‘developing harmony with the environment’’, and ‘‘developing a sense of ÔmagicÕ in the environment’’. The transcendental items were ‘‘developing a personal relationship with the Divine/God ’’, ‘‘developing worship of the Creator’’, ‘‘developing oneness with God’’, ‘‘developing peace with God’’ and ‘‘developing prayer life’’. There is evidence of good reliability and validity for all four subscales of the SWBQ. For the sample in the current study, the internal reliability for the personal, communal, environmental and transcendental spiritual well-being subscales were .81, .80, .86 and .95, respectively for females, and .76, .80, .84 and .94, respectively for males. 2.3. Procedure The plain language statement to potential participants indicated that the study was addressing aspects of human spiritual experience and behavior. Following consent, participants were asked to complete the SWBQ either in groups at the end of lectures (for mainly university students) or at some time during school hours (for secondary school students), or individually (for mainly participants from the general community). In all instances, the completed questionnaire was collected immediately after it was completed. A total of 4572 ratings were obtained, of which there were 4462 complete ratings. Only the questionnaires with complete ratings were included in the study.
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
1387
2.4. Analytic strategy 2.4.1. Analysis of measurement and structural invariance Invariance was examined using the procedure suggested by Byrne (1998). The procedure involves computing at least three models with CFA multi-group analyses. These models relate to the invariance in the number of underlying factors and factor loadings (components of the measurement model), and the invariance in factor variances and covariances (structural component).1 In general, when two groups are involved, the model relevant to the invariance in the number of underlying factors involves specifying the baseline models in both groups so that the parameters are free to take on any value (model 1). The fit indices for this model provide a test for invariance of the number of factors. The test relevant to the invariance in factor loadings involves specifying the baseline model in the first group so that the parameters are free to take on any value, while in the second group, the factor loadings are constrained equal to the first group (model 2). The model relevant to the invariance in factor variances and covariances also involves specifying the baseline model for the first group with parameters free to take on any value, while constraining the factor loadings (if found tenable) and factor variances and covariances of the second group to be equal to the first group (model 3). Following this, the hypotheses relating to the invariance for factor loadings, and factor variances and covariances are tested. At the statistical level, the test for the invariance of factor loading involves using v2 statistics to determine the difference in statistical fit between models 2 and 1, while the test for the invariance of factor variances and covariances involves the comparison of v2 statistics for models 3 and 1. For both comparisons, non-significant difference indicates statistical support for the hypotheses being tested. The invariance for both factor loadings and factor variances and covariances can also be examined by comparing the other indices (root mean square error of approximation—RMSEA and Comparative Fit Index-CFI) of the models compared. Such comparisons provide a test for invariance at the practical level, with small differences supporting invariance for groups compared. In line with the method proposed by Byrne (1998), the analyses here tested for differences between females and males in the number of factors, the pattern of factor loadings of the 20 SWBQ items on their respective latent factors, and the pattern of variances and covariances among the four latent factors. 2.4.2. Analysis of mean latent factor difference In general for covariance structural analysis, it is assumed that all observed variables are measured as deviations from their mean, i.e., the means are equal to zero. Thus the intercepts associated with them are irrelevant in the analysis. However when latent mean difference is of interest, the observed mean scores take on nonzero values, and consequently the intercept parameters need to be included. In LISREL, parameterizing the necessary intercepts into CFA models is accomplished by incorporating into the model a dummy variable called constant. The intercepts are the factor loadings of the regression of the observed variables (called tau-X) and the regression of the latent factors (called kappa) on the constant. The kappa values of the model represent the 1 Although some researchers also test for invariance of error variances and covariances, it is now widely accepted that this is an overly restrictive test (Byrne, 1998). Thus this test was not conducted here.
1388
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
latent mean parameters. The LISREL approach to evaluation of latent mean difference requires that the number of latent factors and their loadings, and the regression of the observed variables on the constant, be constrained equal across the two groups. Also, the variances and covariances of the latent factors, and the variances (and covariances when included) of error terms are to be freely estimated across the groups. In addition, the kappa values are to be freely estimated in one group and constrained equal to zero in the other group. The latter is regarded as the reference group. Thus LISREL provides the relative differences between the latent factor scores of groups rather than differences between actual mean latent factor scores. In this study, latent mean differences across the gender groups were examined for personal, communal, environmental and transcendental latent factors, based on the 4-factor oblique model of the SWBQ. For this analysis, the number of latent factors (i.e., four) and their factor loadings were constrained equal across the two groups, as were the intercepts and loadings for the ratings on all the 20 SWBQ items. The variances and covariances of the latent factors, the variances of the error terms in the baseline model were freely estimated across the groups. Also, latent mean values for females were freely estimated, while these values for males (the reference group) were set at zero.
3. Results 3.1. SWBQ: Descriptive, skewness and kurtosis Table 1 shows the mean, standard deviation, skewness and kurtosis values of all the 20 items of the SWBQ for both females and males. The significance of the skewness and kurtosis values were examined using PRELIS 2.51 (Joresko¨g & Sorbo¨m, 1996b). As shown in the table, all scale scores for both groups showed statistically significant kurtosis. With the exception of the transcendental scale, the other three scales also showed significant skewness. Also, most of the questionnaire items showed statistically significant skewness and kurtosis. The test of multivariate normality for continuous variables for females showed significant multivariate skewness (17.91, z = 68.13, p < .001) and multivariate kurtosis (572.00, z = 55.70, p < .001). There was also significant multivariate skewness (23.40, z = 42.57, p < .001) and multivariate kurtosis (576.75, z = 37.64, p < .001) for males. Thus, the assumption of multivariate normality in both the groups was violated. With maximum likelihood estimation, lack of multivariate normality can cause several problems for model testing. These include inflated chi-square values, underestimation of fit indices, and inappropriately low standard errors leading to inflated loadings and correlations (West, Finch, & Curran, 1995). Although there is no perfect solution to the lack of multivariate normality, one solution is to use maximum likelihood with robust estimation (West et al., 1995). This procedure corrects for the lack of normality, resulting in a robust chi-square statistic referred to as the Satorra–Bentler chi-square statistic (S–Bv2). Like the v2 likelihood ratio test statistics, the S–Bv2 also tests the closeness of fit between the unrestricted sample covariance matrix and the restricted (postulated model) covariance matrix, after correcting for multivariate nonnormality. Thus this study used the maximum likelihood procedure with robust estimation.
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
1389
Table 1 Descriptive information of the SWBQ for females and males Items/scales
Female (N = 3101)
Male (N = 1362)
Mean
SD
S
K
Mean
SD
Personal P1 (self-identity) P2 (self-awareness) P3 (joy in life) P4 (inner peace) P5 (meaning in life) Scale score
4.09 4.01 4.17 3.86 3.98 20.13
0.91 0.86 0.92 1.02 0.95 3.53
0.85 0.69 1.02 0.60 0.71 0.72
Communal C1 (love others) C2 (forgive others) C3 (trust others) C4 (respect others) C5 (kindness—others) Scale score
4.13 4.02 4.25 4.40 4.33 21.12
0.81 0.87 0.84 0.75 0.77 3.01
Environmental E1 (connect to nature) E2 (awe at nature) E3 (oneness—nature) E4 (magic in nature) E5 (harmony—nature) Scale score
3.45 3.65 3.25 3.38 3.22 16.94
Transcendental T1 (relation with God) T2 (worship of God) T3 (one with God) T4 (peace with God) T5 (prayer life) Scale score
3.03 2.92 2.91 3.14 2.84 14.83
S
K
0.37 0.31 0.71 0.18 0.12 0.52
3.87 3.84 3.98 3.60 3.79 19.07
0.92 0.93 1.00 1.02 1.02 3.50
0.61 0.59 0.88 0.48 0.69 0.58
0.20 0.16 0.40 0.14 0.19 0.69
0.67 0.66 1.01 1.27 1.08 0.90
0.25 0.21 0.84 1.79 1.20 1.31
3.80 3.79 3.96 4.01 4.02 19.65
0.87 0.94 0.95 0.87 0.88 3.34
0.47 0.56 0.81 0.80 0.75 0.66
0.26 0.16 0.53 0.52 0.49 0.87
1.07 1.09 1.10 1.07 1.23 4.45
0.27 0.45 0.17 0.32 0.16 0.27
0.48 0.41 0.51 0.35 0.83 0.30
3.29 3.53 3.11 3.27 2.88 16.07
1.14 1.01 1.19 1.12 1.29 4.65
0.30 0.41 0.18 0.25 0.04 0.15
0.54 0.39 0.70 0.54 1.00 0.37
1.33 1.35 1.34 1.35 1.33 6.13
0.05 0.03 0.04 0.18 0.12 0.04
1.09 1.14 1.12 1.10 1.07 1.05
3.16 3.16 3.07 3.33 2.92 15.63
1.38 1.36 1.36 1.35 1.34 6.14
0.21 0.22 0.13 0.41 0.03 0.24
1.10 1.16 1.12 0.96 1.11 0.97
Note: SD = standard deviation, S = skewness, K = kurtosis. All S and K values were significant unless underlined.
3.2. Testing for measurement and structural invariance LISREL 8.51 (Joresko¨g & Sorbo¨m, 1996a) was used to perform all the CFA in the study. All analyses were based on the covariance matrices with maximum likelihood estimation. As already pointed out maximum likelihood procedure with robust estimation was used to minimize the effects of multivariate non-normality. Thus statistical model fit was evaluated with the S–Bv2, and the differences in statistical fit between models were based on differences in S–Bv2. As the difference between two S–Bv2 values is not distributed as a chi-square, it is necessary to adjust for this difference. The formula for this adjustment as proposed by Satorra and Bentler (1999) was therefore used.
1390
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
As v2 values are inflated by large sample sizes, two practical fit indices were also used. These were the Root Mean Square Error of Approximation (RMSEA), and the Comparative Fit Index (CFI). The RMSEA provides a measure of model fit relative to the population covariance matrix when the complexity of the model is also taken into account. Values less than .06 indicate good fit, values .06 to .08 indicate reasonable fit, values from .08 to .10 indicate mediocre fit, and values greater than .10 indicate poor fit (Byrne, 1998). The CFI provides a measure of the fit of the hypothesized model relative to the independent model, with values ranging from 0.00 to 1.00. CFI values greater than .90 suggest well fitting models. In general, before testing for measurement and structural invariance, and differences in latent mean scores, it is necessary to ensure well fitting models for the groups involved. The results of CFA of the four-factor oblique model (Fig. 1) showed excellent fit for males [v2(df = 164) = 1654.02, S–Bv2(df = 164) = 1356.00, RMSEA = .048 (90% CI for RMSEA = .046–.051), CFI = .96] and females [v2(df = 164) = 791.02, S–Bv2(df = 164) = 628.89, RMSEA = .046 (90% CI for RMSEA = .042–.049), CFI = .96]. Thus this model was used as the baseline models for both groups. Table 2 shows the results of analyses for testing the measurement and structural invariance across gender. As shown, the practical fit indices for model 1 were very good. Thus the invariance for the number of factors was supported. These indices for model 2 also showed very good fit, and their values were very close to those for model 1. In addition, the DS–Bv2 between models 2 and 1 was not significant. These results provide support for the invariance in the pattern of factor loadings across gender. The practical indices for model 3 were also very good and very close to those for model 1. However, the DS–Bv2 between models 3 and 1 was significant. Thus while there was practical support for invariance in factor variances and covariances for the groups, there was no statistical support for this (Table 3).
Table 2 Summary of test for invariance of SWBQ measurement and structural models Competing models (M)
df
1. Number of factors invariant 328 2. M2 with pattern of factor 344 loadings held invariant 3. M2 with all factor variance and 354 covariance held invariant 4. M2 with equivalent factor variances and Personal/Personal 345 Communal/Communal 346 Environmental/Environmental 346 Transcendental/Transcendental 347 Communal/Personal 348 Environmental/Personal 348 Environmental/Communal 348 Transcendental/Personal 348 Transcendental/Communal 349 Transcendental/Environmental 349
v2
S–Bv2
Ddf
DS–Bv2
RMSEA (90% CI)
CFI
2445.05 2464.03
2315.16 2368.68
– 16
– 26.54
.052 (.050–.054) .051 (.049–.053)
.96 .96
2569.06
2486.88
10
133.76*
.052 (.050–.054)
.96
invariant 1.96ns 33.13* 5.71ns 6.59ns 27.66* 26.88* 41.87* 10.97ns 89.44* 18.48*
.051 .052 .051 .051 .051 .051 .052 .051 .052 .051
.96 .96 .96 .96 .96 .96 .96 .96 .96 .96
covariances 2465.95 2487.90 2469.04 2469.04 2489.64 2486.61 2499.12 3473.69 2524.48 2478.58
held cumulatively 2370.95 1 2395.90 2 2375.65 2 2379.04 3 2399.15 4 2395.70 4 2407.80 4 2382.18 4 2439.4 5 2391.01 5
(.049–.053) (.050–.053) (.049–.053) (.049–.053) (.049–.053) (.049–.053) (.050–.053) (.049–.053) (.050–.054) (.049–.053)
Note: S–Bv2 = Satorra–Bentler v2; RMSEA = root mean square error of approximation; CFI = comparative fit index. All v2 and S–Bv2 values significant at p = .001. For DS–Bv2, *p < .001; ns = not significant.
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
1391
Table 3 Gender differences for the SWBQ factors based on observed scores SWBQ factors Personal Communal Environmental Transcendental
Group mean (standard deviation)
t-Value
Effect size
Female
Male
(df = 4463)
(CohenÕs d)
20.13 21.12 16.94 14.83
19.07 19.67 16.07 15.03
9.26 14.59 6.00 4.02
.20 .47 .20 .13
(3.53) (3.01) (4.45) (6.13)
(3.50) (3.34) (4.65) (6.14)
Note: All t values significant at p = .001.
Further analyses were conducted to ascertain the sources contributing to the non-invariance in the factor variances and covariances matrix. Following Byrne (1998), this procedure involved testing, independently, the invariance of each factor variance and covariance, and comparing the fit of this model to that for Model 2. As suggested by Byrne (1998), we conducted this set of tests by cumulatively constraining all parameters found to be equivalent across the groups. The results for all these analyses are also shown in Table 2. As shown, the results indicated no significant S–Bv2 differences for the variances for personal, environmental and transcendental, and the covariance between transcendental and personal. There were significant differences for the variance for communal, and the covariances for communal and personal, environmental and personal, communal and environmental, transcendental and communal, and transcendental and environmental. Examination of Table 2 shows however that for all comparisons, there was virtually no difference between the practical fit indices.2 3.3. Test for differences in latent factor mean Initially, for comparative reasons, gender differences for the personal, communal, environmental and transcendental spiritual well-being raw scale scores were computed. Table 1 shows the mean and standard deviation raw scale scores for males and females. These results of the t-test for gender differences showed that females had significantly higher raw scale scores than males for personal, t(4463) = 9.26, p < .00, communal, t(4433) = 14.59, p < .001, and environmental t(4463) = 6.00, p < .00, spiritual well-being. The transcendental raw scale score for males was significantly higher than females, t(4463) = 4.02, p < .001. The effect size (CohenÕs d) for personal, environmental and transcendental were all low at .20, .20 and .13, respectively. For communal, the effect size was medium, at .47. For an examination of latent mean differences, a prerequisite is invariance for the measurement model for the groups being compared. As reported earlier, there was invariance for the measurement model of the SWBQ for females and males, thereby justifying the examination of latent mean differences across the gender groups for the four subscales of the SWBQ. The analysis for the model to test the differences in mean latent scores across gender resulted in the following fit indices: v2(df = 360) = 2561.52, S–Bv2(df = 360) = 1989.07, RMSEA = .045 (90% CI for 2 Multigroup models can often show well-fitting practical fit indices, yet still have items that are statistically noninvariant across groups.
1392
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
RMSEA = .043–.047), and CFI = .96. These fit indices suggest a good fit for the model, thereby increasing the reliability of the results. For personal, communal and environmental well-being factors, the kappa or latent mean difference were all .01, with t values of 2.81, 4.24 and 2.09, respectively. The transcendental well-being factor had a kappa value of .01, with a t value of 1.68. The results indicate that only the communal well-being factor showed gender differences at p < .001. The positive kappa value for this factor implies a higher latent mean score for females.
4. Discussion The results of this study found support for the invariance in factor loadings across males and females for the SWBQ. For the invariance in factor variances and covariances, the study showed somewhat mixed findings. In terms of statistical fit, the results supported no differences for the variances for personal, environmental and transcendental, and the covariance between transcendental and personal. There were significant differences for the variance for communal, and the covariances for communal and personal, environmental and personal, communal and environmental, transcendental and communal, and transcendental and environmental. However, the practical fit indices supported the invariance for all variances and covariances. The test for latent mean differences showed a difference for only the communal well-being factor, with females scoring higher. These findings were comparable to the results for gender differences based on the raw scale scores as the differences for personal, environmental and transcendental were low, while the difference for the communal was medium. Overall, these findings can be inferred as providing reasonable support for the equivalencies of the SWBQ across males and females. Thus, overall, the SWBQ can be considered worthy of consideration in spiritual well-being research when gender is considered. It needs to be stressed, however, that the findings in the current study need to be viewed with certain limitations and controversies in mind. Firstly, as most participants in the study were secondary school and university students, between 17 and 32 years, it is uncertain if the results are applicable to the wider community and across a wider age range. Secondly, the use of self-ratings may have biased ratings, thereby confounding the ratings. Thirdly, the results of this study need also to be seen in the context that to date there is no agreement in the literature on what are the appropriate set of tests or sequence of tests that are needed for establishing group invariance and latent mean difference (Bentler, 1993; Bollen, 1989; Byrne, 1998; Joresko¨g & Sorbo¨m, 1989; Marsh, 1994). Given this, it can be expected that some researchers may not agree with our analytic approach. However we wish to make the point that our approach for testing both group invariance and latent mean differences is in accord with acceptable standards that have been used previously (Byrne, 1998). In concluding, this study may be of general methodological interests to researchers. Currently, most researchers generally apply theoretical models and use observed scores across gender groups without first testing their equivalencies across these groups. However, before groups can be validly compared on observed measures, there is need to establish group invariance (Byrne, 1998). It is clear that this has not been established for many measures in education, psychology and other social science research where groups have been compared using observed scores. As shown here, multi-group CFA methodology can be used to establish group invariance. It is hoped that this study has provided a basis for other researchers to use multi-group CFA methodology to further
R. Gomez, J.W. Fisher / Personality and Individual Differences 39 (2005) 1383–1393
1393
our understanding of group invariance and latent mean differences for a wider range of constructs used currently in education, psychology and other social science research.
References Bentler, P. M. (1993). EQS structural equations program manual. Los Angeles, CA: BMDP Statistical Software. Bollen, K. A. (1989). Structural equations with latent variables. New York: Wiley. Byrne, B. M. (1998). Structural equation modelling with LISREL, PRELIS, and SIMPLIS: basic concepts, applications, and programming. New Jersey: Lawrence Erlbaum Associates, Inc., Publishers. Ellison, C. (1983). Spiritual well-being: Conceptualization and measurement. Journal of Psychology and Theology, 11, 330–340. Eysenck, H. J., & Eysenck, S. B. G. (1991). Eysenck Personality Scales (EPS adult). London: Hodder & Stoughton. Fisher, J. W. (1998). Spiritual health: Its nature, and place in the school curriculum. Unpublished doctoral dissertation. The University of Melbourne, Melbourne, Victoria, Australia. Fisher, J. W. (2001). Comparing levels of spiritual well-being in state, catholic and independent schools in Victoria, Australia. Journal of Beliefs and Values, 22, 113–119. Fisher, J. W., Francis, L. J., & Johnson, P. (2000). Assessing spiritual health via four domains of spiritual wellbeing: The SH4DI. Pastoral Psychology, 49, 133–145. Gomez, R., & Fisher, J. W. (2003). Domains of spiritual well-being and development and validation of the Spiritual Well-Being Questionnaire. Personality and Individual Differences, 35, 1975–1991. Joresko¨g, K. G., & Sorbo¨m, D. (1989). LISREL 7: A guide to the program and applications (second ed.). Chicago, IL: SPSS Inc. Joresko¨g, K. G., & Sorbo¨m, D. (1996a). LISREL 8: UserÕs reference guide. Chicago, IL: Scientific Software. Joresko¨g, K. G., & Sorbo¨m, D. (1996b). PRELIS 2: UserÕs reference guide. Chicago, IL: Scientific Software. Marsh, H. W. (1994). Confirmatory factor analysis models of factorial invariance: A multifaceted approach. Structural Equation Modeling, 1, 5–34. Miller, W. R. (2003). Spirituality, religion, and health: An emerging research field. American Psychologist, 58, 24–35. Satorra, A., Bentler, P. M. (1999). A scaled difference chi-square test statistic for moment structure analysis. (UCLA Statistics Series 260). Los Angeles: University of California, Department of Psychology. Seeman, T. E., Dubin, L. F., & Seeman, M. (2003). Religiosity/spirituality and health. American Psychologist, 58, 53–63. West, S. G., Finch, J. F., & Curran, A. (1995). Structural equation models with nonnormal variables: Problems and remedies. In R. H. Hoyle (Ed.), Structural equation modelling: Concepts, issues, and applications (pp. 56–75). Thousand Oaks, CA: Sage.