Accepted Manuscript Teacher burnout in pre-schools: A cross-cultural factorial validity, measurement invariance and latent mean comparison of the Maslach Burnout Inventory, Educators Survey (MBI-ES)
Michael Osei Aboagye, Qin Jinliang, Qayyum Abdul, Collins Opoku Antwi, Yasin Jababu, Emmanuel Affum-Osei PII: DOI: Reference:
S0190-7409(18)30508-5 doi:10.1016/j.childyouth.2018.09.041 CYSR 4012
To appear in:
Children and Youth Services Review
Received date: Revised date: Accepted date:
28 June 2018 28 September 2018 28 September 2018
Please cite this article as: Michael Osei Aboagye, Qin Jinliang, Qayyum Abdul, Collins Opoku Antwi, Yasin Jababu, Emmanuel Affum-Osei , Teacher burnout in pre-schools: A cross-cultural factorial validity, measurement invariance and latent mean comparison of the Maslach Burnout Inventory, Educators Survey (MBI-ES). Cysr (2018), doi:10.1016/ j.childyouth.2018.09.041
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT Teacher burnout in pre-schools: a cross-cultural factorial validity, measurement invariance and latent mean comparison of the Maslach Burnout Inventory, Educators Survey (MBI-ES)
Michael Osei Aboagye, a, Qin Jinlianga, Qayyum, Abdula, Collins Opoku Antwi b, Yasin
PT
Jababuc
[email protected], Emmanuel Affum-Oseid
a
c
University of Shanghai for Science and Technology
SC
b
North Carolina A&T State University The Chinese University of Hong Kong
MA
NU
d
RI
Zhejiang Normal University, China
AC C
EP T
ED
Corresponding author at: 1601 East Market Street, Greensboro, NC, 27411, US
1
ACCEPTED MANUSCRIPT
Recent development in research on teacher wellbeing has been associated with the interest in assessing teacher burnout with the widely used Maslach Burnout Inventory, Educators Survey (MBI-ES). The increasing application of the MBI-ES, in and out of the Western world, stresses the need to investigate the cross-cultural applicability and factorial
PT
validity of the scale. The present study investigated whether (a) the MBI-ES is applicable in a
RI
cross-cultural context (i.e., the three-factor model of the scale is equivalent across cultures) and (b) burnout syndrome differs significantly across these cultures and gender. Results of
SC
confirmatory factor analysis (CFA) did not support the 22-item MBI-ES. Subsequent
NU
exploratory factor analysis resulted in significant item reduction. The reduced-item (adjusted) three-factor model fitted the data, and invariant across cultures and gender.
Significant
MA
differences in teacher burnout symptoms were found across cultures and gender. The alterations made to the MBI-ES further reinforce cultural influences on the assessment of
ED
cross-cultural teacher burnout dimensions. Further implications for ECE teacher burnout
EP T
management in cross-cultural contexts are discussed.
AC C
Keywords: Teacher burnout; Preschool Education; Maslach Burnout Inventory, Educator Survey (MBI-ES); Cross-culture; structured means comparison
2
ACCEPTED MANUSCRIPT
1.0 Background of teacher burnout The prevalence of burnout, and specifically, teacher burnout is not a 21st -century development. In fact, the concept was duly acknowledged in the 20 th -century American society (McGeary & McGeary, 2012; Sarros & Sarros, 1990), where teachers’ emotional,
PT
mental and physical well-being were severely evaded (Affrunti, Mehta, Rusch, & Frazier,
RI
2018; Wilmar B. Schaufeli, 2017; Cunningham, 1983; Freudenberger, 1974), threatening job ; Kotova, Rozanov, Alexandrov, & Ivanovo,
SC
outcome (Medvedskaya & Sheryagina, 2017;
2017; Heydari & Abbasian, 2016; Lauermann & König, 2016; Salloum, Kondrat, Johnco, &
NU
Olson, 2015; Cunningham, 1983). Published inquiries and teachers’ participation in burnout workshops, soundly echoed burnout spread, severity and prevailing sentiments of the period
MA
(McGeary & McGeary, 2012; Cunningham, 1983), and perhaps, awareness of its debilitating
ED
consequences (Schaufeli, 2017; Freudenberger, 1974). The assessment of teacher burnout, then gained popularity in academia (Oh & Lee, 2009;
EP T
Pines, 2002; Schwarzer et al., 2000; Sarros & Sarros, 1990), across distinct geographic contexts (Salmela-Aro & Read, 2017; Denton, Chaplin, & Wall, 2013; Maslach, Schaufeli,
AC C
& Leiter, 2001). Empirical studies have revealed that burnout is a worldwide work-related phenomenon (Oh & Lee, 2009; Schwarzer et al., 2000), that closely influences the quality of teacher-work performance such as job efficacy (Lizano & Mor Barak, 2015; Pines, 2002), interpersonal relationships, and the overall teacher wellbeing (Griffiths, Royse, & Walker, 2018; Antonopoulou, Killian, & Forrester, 2017; Squires et al., 2014). Theoretical models for the assessment of cross-cultural teacher burnout studies then became significant determinants of the outcomes of these studies (Sarros & Sarros, 1990; Schwarzer et al., 2000). The Maslach Burnout Inventory (MBI) (Maslach & Jackson, 1981), has been adjudged as having strong psychometric properties (Worley et al., 2008), employed in over 90% of all empirical 3
ACCEPTED MANUSCRIPT burnout studies (Maslach, 2015), and recognized as a key model that facilitated burnout advancement in literature. In spite of the scale’s acceptability among scholars, Worley & Colleagues' (2008) metaanalysis of 24 exploratory and 13 confirmatory factor-analytic (EFA) (CFA) studies, in
PT
addition to 8 studies combining the two methods evinced the issue of inconsistent dimensionality of MBI scale where some research findings indicate internal structure of two
RI
(Galanakis, Moraitou, Garivaldis, & Stalikas, 2009; Walkey & Green, 1992), four (Firth,
SC
McIntee, McKweown, & Britton, 1985; Golembiewski, Munzenrider, & Carter, 1983; Palmira Faraci, 2018; Valente, Wang, & Menezes, 2018), and five factors (Densten, 2001), Kokkinos, 2006; Byrne, 1993) which is
NU
instead of three (Pérez-Mármol & Brown, 2018;
MA
consistent with the original factor-structure in Maslach & Jackson (1981). It is noteworthy that, recent studies have produced similar conflicting patterns evidenced in Worley & Colleagues' (2008) meta-analysis. Though Denton, Chaplin, & Wall's (2013) cross-cultural
ED
studies of New York City teachers (N=150) and Jamaica W.I. teachers (150) confirmed the
EP T
three-factor structure of MBI scale, findings point to structural- and item-level differences between the two cultures. Mészáros, Ádám, Szabõ, Szigeti, & Urbán's (2013) study on 653
AC C
Hungarian healthcare professionals using the Hungarian version of the MBI-HSS revealed a bifactor model as most appropriate for explaining considerable variance in observed scores after adopting CFA to test the proposed one-factor (Kanste, Miettunen, & Kyngäs, 2006), two-factor (Brookings, Bolton, Brown, & McEvoy, 1985), three-factor (Christina Maslach, Jackson, & Leiter, 1997), four-factor (Iwanicki & Schwab, 1981), and five-factor (Densten, 2001) models’ fit. Using a sample of 309 Italian high school teachers and a 20-item rather than the 22-item MBI-ES scale due to 2 items being problematic, Faraci (2018)replicated Firth, McIntee, McKweown, & Britton's (1985) four-factor structure where the emotional exhaustion component was broken into two (i.e., strain and frustration). Additionally, 4
ACCEPTED MANUSCRIPT Valente, Wang, & Menezes (2018) with two emotional exhaustion factor (i.e., Exhaustion & Strain), reported a four-factor structure as the most suitable in terms of fit on 1046 Brazilian bank employees’ sample even though the three-factor was fairly reasonable, as they observed. These studies, except Mészáros et al. (2013), employed both EFA and CFA techniques.
PT
Additionally, a good number of studies have investigated MBI scale’s cross-cultural measurement invariance (Denton et al., 2013; Xanthopoulou, Bakker, Kantas, & Demerouti,
RI
2012; Schutte, Toppinen, Kalimo, & Schaufeli, 2000; Horn, Schaufeli, Greenglass, & Burke,
SC
1997). The outcome of Schutte et al.'s (2000) three-nation comparison of MBI-GS on Finnish, Swedish and Dutch samples (N= 9055) established the scale’s non-invariance across
NU
the three contexts at item level – recording low factor loadings and item error correlations. Xanthopoulou & Associates (2012) structural and item level invariance test of the MBI-GS
MA
on Dutch and Greek samples with CFA resorted to some modifications – that, items 1, and 2 from exhaustion and 5 from cynicism as well as 6 from professional efficacy were not
ED
invariant. Again, results from Denton & Associates' (2013) comparative study between New
EP T
York City (a cosmopolitan of a first world – U.S.) and Jamaica W.I. (a developing country on the Caribbean) displayed structural- and items- level differences. These findings are
AC C
indicative of the perspective that, cross-cultural comparisons should not assume invariance in the constructs and items nomological network of the phenomenology of burnout (Squires et al., 2014; Denton et al., 2013). Particularly, little attention has been given to the applicability of the scale in economies with low socio-economic conditions as most of these studies have concentrated on teachers’ burnout in the North American or European geographic regions (Poghosyan, Aiken, & Sloane, 2009; Schutte et al., 2000). In this regard, the generalizability of the measurement properties of the scale in developing economies such as those in subSaharan Africa (i.e., Ghana), and South-eastern Asia (i.e., Pakistan and China) with
5
ACCEPTED MANUSCRIPT challenging
socio-economic,
and
broadly
distinct
cultural conditions may be highly
challenged. In this respect, Worley et al., (2008) and Squires et al.'s (2014) conclusion that, the need to assess the cross-cultural fit of the most prominent burnout model (i.e., MBI-ES) in a
PT
distinct educational environment other than the original cultural context of the scale (U.S.), to evaluate the content, context, conceptual, semantic, and technical equivalence (Squires et al.,
RI
2014), is one of urgency. Also, Kristensen et al., (2005) declaring the MBI-ES conceptually
SC
flawed and unusable on Danish samples in its original state, further stressed on the untenable nature of the assumption that, the event of burnout or its antecedents stay unchanged across
NU
cultures and suggested structural- and item-level comparisons to clear such assumptions.
MA
These calls are in agreement with later scholars (Schimmenti, 2016; Craparo, Faraci, & Gori, 2015), that the test of factorial structure of popular socio-psychological research instruments across cultures for structural and item equivalence is a necessity. Therefore, the test of the
ED
MBI-ES scale’s cross-cultural (Ghana in West Africa, and China and Pakistan in Asia)
EP T
applicability on the three low socio-economic contexts is imperative, and is expected to elicit unique insights that could inform burnout assessment. Given that the proper assessment of the
AC C
burnout syndrome in these cultural settings requires culturally-adapted and theoreticallymeaningful measures, this study seeks to fill this gap. Accordingly, the first objective for undertaking this research is to assess whether or not the factor structure of the MBI-ES fits three cultural contexts (i.e., possesses factorial invariance across cultures). 1.1 Comparing teacher burnout across cultures and gender The international nature of burnout has necessitated considerable cross-cultural inquiries into the concept given its impact on employee well-being and organizational outcomes (Squires et al., 2014; Xanthopoulou et al., 2012; Poghosyan et al., 2009; Perrewé et al.,
6
ACCEPTED MANUSCRIPT 2002). Findings from these investigations indicate cross-cultural differences in teacher burnout experience. First, North American samples exhibit higher mean levels of cynicism (depersonalization) and exhaustion as compared to that of Europe with Polish sample being the only deviation, whilst the highest burnout incidences are recorded in Taiwanese and Japanese samples (Denton et al., 2013; Maslach et al., 2001; Schwarzer et al., 2000). For
PT
instance, Schwarzer et al. (2000) identified significant differences between German and Hong
RI
Kong on the emotional exhaustion subscale of the MBI-ES, where the Germans exhibited less
SC
exhaustion than their counterparts in Hong Kong with a post hoc t test analysis. However, the difference between German and Hong Kong cultures were found to be minor when the two
NU
cultures were compared to the American cultural setting. The authors’ found the American side recording significantly higher burnout than found in Germany or Hong Kong. Also, (2002) discovered
American teachers,
MA
Pines
having less stressful work
conditions,
experienced higher burnout than Israeli teachers who work in more stressful conditions. The
ED
author’s finding was explained by the fact that cultural differences in work attitude and work
EP T
significance to life explain the observed differences. Differences in burnout syndrome across cultures are not unique to the teaching
AC C
profession, and there have been numerous studies in other human services professions such as health services with similar findings. For example, Schaufeli & Janczur's (1994) examination of burnout among practicing nurses in Poland (N=200) and The Netherlands (N=183), discovered that Poles experienced higher burnout than their Dutch counterparts, even when the differential work conditions’ potential confounding effects were accounted for. Again, Suñer-Soler et al. (2014) found Spanish healthcare professionals in Spain to be, fairly differently, burned out compared to their counterparts in Latin America. The authors implied differences in culture to be a potential explanation for their findings. However, Pienaar &
7
ACCEPTED MANUSCRIPT Wyk (2006) and Kaur & Noman (2015) attributed cases of cultural variability to social support or the lack thereof, whilst Koki (2000) emphasizes the role of religion. Burnout researchers have indicated that the role of gender is a necessary consideration in teacher burnout syndrome (Droogenbroeck et al., 2014; Byrne, 1991; Russell et al., 1987). In
PT
most studies, female teachers have been found to experience higher levels of emotional exhaustion (Grayson & Alvarez, 2008; Lau et al., 2005), and lower levels of personal
RI
accomplishment than their male counterparts (Droogenbroeck et al., 2014; Grayson &
SC
Alvarez, 2008; Lau et al., 2005; Byrne, 1991), as a consequence of non-school environment responsibilities (for example, family duties that are deemed in certain cultures to be
NU
performed mainly by females) (Woodside et al., 2008). These societal female-oriented duties
MA
coupled with teacher job characteristics quicken the rate and magnitude of stress experience and the consequent burnout (Lau et al., 2005; Borthwick, Thornell, & F. Wilkinson, 1982). On another breadth, male teachers are known to keep their feelings to themselves, and as a
ED
result experience far greater depersonalization (Stoeber & Rennert, 2008; Lau et al., 2005;
EP T
Byrne, 1991), and cynicism than female teachers (Woodside et al., 2008). These findings on burnout differences across cultures and gender have been discovered in
AC C
the developed world, with little to no attention to low and middle income countries (i.e., Ghana, China and Pakistan), where children and youth education remain a major policy issue (Unesco, 2015; UNESCO, 2008). Therefore, comparing teacher burnout across LMICs will inform preventive and treatment intervention programs and policies through the pursuit, and discovery of distinct contextual resources and practices, and their adaptation in other contexts (Denton et al., 2013). In the light of this, the second objective of this study, therefore, was to examine whether or not burnout syndrome differs significantly across these cultures and gender.
8
ACCEPTED MANUSCRIPT 2.0 Methods 2.1 Participants and Procedure The data used in this study was a baseline data on pre-school teachers’ wellbeing and teacher-child relationship quality in Ghana, Pakistan and China. The schools were randomly
PT
selected using two stratification selection criteria: administrative level (provinces/ regions and district/ division), and sources of funding (government/ public and non-government/
RI
private). Between 145 to 163 schools were selected from provincial/regional and district
Schools
were
sampled
with
respect
SC
capitals, one second-tier level town and two rural settlements of Ghana, China and Pakistan. to
socio-economic and
cultural compositions.
NU
Compositionally, the provincial/regional and district capitals serve as the microcosms of
MA
ethnic, cultural, technological, economic and administrative hubs of the countries, with schools enlisting pupils and staffs from diverse socio-cultural and economic backgrounds.
ED
The majority of the schools had two kindergarten (i.e., KG 1 for four years olds and KG 2 for 5 to 6 years old) teachers, though the range was from one to five in all three countries.
EP T
No more than two teachers were sampled per school, and where teachers in the school were above two, two were randomly sampled for data collection (one each from KG 1 and KG 2).
AC C
Additionally, in instances where two teachers managed a class, only the lead teacher was sampled. In the five, eight and three schools from Ghana, China and Pakistan, which had two teachers in a class, the lead teachers were sampled in all cases. The schools’ classroom compositions are of a typical inclusive character; composed of students with or without special developmental needs and teachers with different Sex and ages. A sample size of 869 was used for the study. Prior to data collection, research permit was obtained from the municipal directorate of education in the three countries. In line with the declaration of Helsinki, consent letters with information about the fundamental goals of
9
ACCEPTED MANUSCRIPT the study were sent to the head teachers and principals of the schools and teacher consents were also obtained to ensure voluntary participation. The study protocol was approved by the Zhejiang Normal University Graduate Review Committee. Table 1 presents detailed description of the participants and schools’ characteristics. 2.2 Measures Burnout Scale
PT
2.2.1
RI
Teacher burnout was measured with the 22-item 3-factor Maslach Burnout Inventory, Educator’s Survey (MBI-ES), rated on a 7-point scale, ranging from (1= Never to 7=
SC
Everyday) (Maslach & Jackson, 1981). The emotional exhaustion sub-scale measures
NU
teachers’ feelings of being emotionally overextended and exhausted by their work (e.g., “I feel used up at the end of the workday”). Depersonalization sub-scale measures teacher’s
MA
perception of impersonal, and insensitive response to his/her students (e.g. “I feel I treat students as if they were impersonal objects”). Accomplishment on the other hand, assessed
ED
teachers’ feeling of competence in their work (e.g. “I have accomplished many worthwhile things in this job”). Reliability and validity of the MBI-ES have been established in previous
Teacher Stress
AC C
2.2.2
EP T
studies (Boles et al., 2000; Aluja et al., 2005).
Teachers responded to a collection of widely-used and validated self-report measures developed by previous scholars to measure work-related stressors. Workload was assessed with 5 items (e.g. “My work requires too much from me”: a=.71) on 4-point scale (1=never to frequently) (Siegrist et al., 2004), emotional demands with 7 items (e.g. “My work demands a lot from me emotionally”: a=.86) on 4-point scale (1=never to 4=always) (Veldhoven & Meijman, 1994), work-family conflict with 5 items (e.g. “My work takes up time that I’d like to spend with my family/friend”: a=.81) on 5-point scale (1=never to 5=very often) (Frone, Russell, & Barnes, 1996), and role conflict with 8 items (e.g. “I work with groups who 10
ACCEPTED MANUSCRIPT operate quite differently”: a=.88) on 7-point scale (1=very false to 4=very true) (Rizzo et al., 1970). 2.3 Data preparation, descriptive statistics and analytic approach
2.3.1
Missing data and Descriptive statistics
PT
Cases with missing data (2.42%) were retained and treated as missing at random (MAR) by using full-information maximum likelihood estimators (FIML) (Allison, 2003). In line with
RI
Curran et al., (1996) and Allison, (2003), item descriptive statistics (mean and standard
SC
deviation), and normality (skewness and kurtosis) were examined. Two items (MBI 5, 15) appeared to be marginally skewed, with skewness values greater than the recommended
NU
threshold (±2.0). Logarithmic transformation was employed to produce a near-normal
MA
distributions (Field, 2005). Univariate normality assumptions were not violated, because maximum likelihood statistics are robust to mild violations of normality (Jackson et al.,
2.3.2
Data analytic approach
ED
2009). That notwithstanding, the items were carefully monitored in subsequent analyses.
EP T
CFA, MGCFA and structured means analyses were performed in a structural equation modelling, using Amos 25 (Arbuckle, 2017). For the CFA, three competing models were
AC C
compared: (1) proposed three-factor model, (2) an alternative two-factor model, and (3) another alternative one-factor model. Model fit was evaluated, using the Chi-Square (χ2 ) statistics (Cochran, 1952; Bentler & Bonett, 1980), the Standardized Root Mean Square Residual (SRMR), Comparative Fit Index (CFI), Tucker Lewise Index (TLI), the Root Mean Square Error of Approximation (RMSEA) (Bentler & Bonett, 1980; Bentler, 1990; Mueller & Hancock, 2008), and Akaike’s information criterion (AIC) (Akaike, 1987; Bryk & Raudenbush, 1992). CFI ratio ≥.90, SRMR≤.08 and RMSEA ≤.08 show acceptable fit, whereas good fit obtains CFI≥.95, SRMR≤.06 and RMSEA≤.06 (Maccallum et al., 1996; Hu,
11
ACCEPTED MANUSCRIPT Bentler, & Hu, 1999), and the model with lower AIC value had a best fit (Akaike, 1987; Bryk & Raudenbush, 1992). Multi-group
confirmatory
factor
analyses
(MGCFAs)
were
performed
to
test
measurement invariance (MI) (Meredith, 1993), of the MBI-ES across gender, and cultures.
PT
Five invariance types of Byrne, Shavelson, & Muthen (1989), were tested: configural, metric, scalar, residual, and structural invariance (Mellenbergh, 1983; Meredith, 1993; Chen, 2007;
RI
Meredith & Teresi, 2006). Consistent with Cheung & Rensvold (2002), and Chen (2007),
SC
∆CFI, ∆RMSEA and ∆SRMR indices were examined for MI model fit. Considering the sample size (N>500), for testing metric invariance, a change of .010 in CFI, supplemented
NU
by changes of .015 in RMSEA and .030 in SRMR indicated equivalence. For test of scalar
MA
invariance and the other restricted models, ∆CFI.010, ∆RMSEA.015 and ∆SRMR.010 indicated equivalence (Cheung & Rensvold, 2002; Chen, 2007; Sass, 2011) (see Appendix B
ED
for summary of the MI tests steps).
Comparisons in the MBI-ES subscales were made using structured means analysis (Aiken
EP T
et al., 1994; Hancock, 1997). One group was set as reference group, treating the others as comparison groups. The latent mean for the reference group was fixed at zero (0), and the
AC C
latent group means for the comparison groups estimated as deviations from the reference group (Dimitrov, 2006). Differences in latent means are not influenced by the decision to fix reference groups’ mean at zero while freely estimating those of the comparison groups (Aiken et al., 1994; Hancock, 1997; Dimitrov, 2006). 3. Results 3.1 Assessing the factor structure of the MBI-ES across cultures Based on the hypothesised three-factor, alternative two-factor and one-factor structures of the MBI-ES, we performed CFAs on the whole sample. Factors, but not residual variances were allowed to be correlated. Factor loadings of all first indicator items were fixed to one.
12
ACCEPTED MANUSCRIPT Results revealed that restricting the model from three to two dimensions (M3 to M2 ) and from two to one dimension (M2 to M1 ), significantly reduced the model fit (see Table1). Although the three-dimensional model (M3 ) fitted the data best, the fit was considered unacceptable. Item-level analysis
(Reise et al., 1993; Vandenberg & Lance, 2000; Sass, 2011), revealed
some items as potentially problematic: low factor loadings (<.40), R2 (<.20) (see Table2),
PT
and cross-loadings on other factors (Tabachnick & Fidell, 2007).
RI
3.2 Country-level and Cross-cultural exploration of the original MBI-ES factor structure
SC
Because the original 22-item three-factor MBI-ES model had psychometric problems, we
NU
restarted the evaluation process with an exploratory approach at both country-level and crosscultural level, as recommended by Squires et al. (2014). To examine the context-appropriate
MA
number of MBI-ES factors, we performed parallel analyses (J. L. Horn, 1965), and a graphical solution to scree test. Oblique rotation (Quartimax) was applied to examine item
ED
loadings on their respective factors, because the factors were found to be inter-correlated in the initial CFA and previous studies using the MBI-ES (Costello & Osborne, 2005). In line
EP T
with Floyd & Widaman, (1995) and Costello & Osborne, (2005), three criteria were used to evalaute the items and their factors: a) items were required to extract on their theoritically
AC C
meaningful underlying component, b) obtain a minimum loadings .40 or greater and, c) items with crossloadings to be droped as suggested by Costello & Osborne, (2005) and Tabachnick & Fidell, (2007).
Ghana: The parallel analysis and graphical solution to scree test produced three-factor solution. Factor1 was represented, in total, by 5 items (MBI-ES 1-3, 6, 8) that have described personal accomplishment subscale in the MBI-ES. Factor 2 was interpreted as emotional exhaustion as it was represented by seven substantive items: (MBI-ES 4, 7, 9, 12, 17, 19, & 21) belonging to the emotional exhaustion subscale of the MBI-ES. Factor 3 was represented
13
ACCEPTED MANUSCRIPT by 4 items: (MBI-ES 10, I1, 15, 22), originally described as depersonalization subscale in the MBI-ES. Six items; four of them (MBI-ES 13, 14, 16, & 20) from the emotional exhaustion scale, one (MBI-ES 18) from the personal accomplishment subscale, and (MBI-ES 5) from the Depersonalization subscale were considered potentially problematic in the factor
PT
extraction as they substantially cross-loaded on more than one factor. China: As a result of the EFA, three-factor structure was extracted. Factor 1 was interpreted
Factor2 was interpreted as emotional
SC
belonging to the personal accomplishment subscale.
RI
as personal accomplishment on the bases of 8 items (MBI-ES 4, 7, 9, 12, 17-19, 21)
exhaustion; represented by 5 items (MBI-ES 1-3, 6, 8), all being the original items described
NU
in the MBI-ES as emotional exhaustion subscale. Factor 3 was named depersonalization and was represented by 4 items (MBI-ES 10, 11, 15, 22), which originally formed the
MA
depersonalization subscale of the MBI-ES. Five items, four of them (MBI-ES 13, 14, 16, 20) from the emotional exhaustion scale, and one (MBI-ES 5) from the depersonalization scale
ED
were identified as having problematic loadings; had low loadings on their respective
EP T
constructs, and cross-loaded on other factors. Pakistan: EFA results revealed three-factor structure. Factor 1 was represented by seven items (MBI-ES 4, 7, 9, 12, 17, 19, 21) belonging to the personal accomplishment scale of the
AC C
MBI-ES. Factor2 was represented, in total, by 4 items (MBI-ES 1-3, 8); these were among the original items described in the MBI-ES as capturing emotional exhaustion. Factor 3 was represented by 5 items (MBI-ES 5, 10, 11, 15, 22) all belonging to the MBI-ES’ depersonalization subscale. Six items, five of them (MBI-ES 6, 13, 14, 16, 20) from the emotional exhaustion scale and one (MBI-ES 18) from the personal accomplishment scale had low loadings on their respective constructs and cross-loaded on other factors. Cross-cultural EFA:
Results revealed similar factor structure on the cross-cultural sample.
Factor1 was represented by seven items (MBI-ES 4, 7, 9, 12, 17, 19, 21) from the personal 14
ACCEPTED MANUSCRIPT accomplishment, factor2 by 5 items (MBI-ES 1-3, 6, 8) from the emotional exhaustion, and factor 3 by four items (MBI-ES 10, 11, 15, 22) belonging to depersonalization of the original MBI-ES subscales. Items (MBI-ES 5, 13, 14, 16, 18, 20), were verified as having problems in the extraction.
PT
3.3 Post hoc scale reduction Moving forward, the country-level and cross-cultural factor structures were compared.
RI
Problematic items found in two-third (2/3) or all of the countries and verified in the cross-
SC
cultural EFA were deleted. Results revealed that items (MBI-ES 13, 14, 16, 20) had problems in Ghana, China and Pakistan, (MBI-ES I5) China and Ghana, and (MBI-ES 18) Ghana and
NU
Pakistan samples (see Table3). However, item (MBI-ES I6) appeared to be problematic in only one out of the three countries. These measurement problems are not unique to this study;
Aluja
et
al.,
2005).
Thus
MA
some of these items have been characterized same in several other studies (Boles et al., 2000; considering
previous
empirical studies
and
conceptual
ED
considerations, we proceeded with reduced 16-items EFA without items (MBI-ES -ES 5, 13,
EP T
14, 16, 18, & 20).
To obtain a statistically substantive and viable factor structure, having measurement items a) interpreted the same, and b) with similar meaning in the three cultures, we have first
AC C
established that six items are unfit for this cross-cultural study. As a result of the EFAs, these items may be lacking the same meaning and interpretation across-cultures. Excluding them in the final extraction resulted in a clean factor-structure.
The adjusted factor-structure for the
cross-cultural study generated a consistent picture of the original MBI-ES factor-structure having three factors: emotional exhaustion (Factor1; 5 items), personal accomplishment (Factor 2; 7 items), and depersonalisation (Factor3; 4 items), sharing similar content, conceptual, semantic, and technical meaning with the original MBI-ES instrument (Maslach & Jackson, 1981). Oblique rotated EFA factor loadings and communalities (h2 ) for the 15
ACCEPTED MANUSCRIPT reduced 16-item three-factor MBI-ES are presented in Table3, whiles Appendix C presents the parallel analysis and graphical solution to scree test for the three samples 3.4 Confirmation of the reduce 16-item three-factor structure on cross-cultural sample To confirm the reduced 16-tems three-factor structure, CFAs were performed. Problems in single-sample post hoc modifications were resolved by applying the two-step strategy
PT
recommended by Arlot & Celisse, (2010) and Freire et al. (2017). Accordingly, taking
RI
advantage of the sample size, we split the sample into strictly random independent
SC
homogenous halves (Calibration sample: N=435 and Validation sample: N=434). This allowed for validation and modifications of the adjusted model in one half (Calibration
NU
sample) and cross-validation on the second half (Validation sample). Again, testing the three priori models in the calibration sample, we first established that
MA
the three-dimensional model fitted the data as compared to the two- and one-dimensional models. However, item-level analysis with the modification indices revealed some items
subscale)
as
having
measurement
error-covariance.
The
modification
EP T
accomplishment
ED
(MBI-ES 1, 2 & 6 in the emotional exhaustion scale and all the items in the personal
allowed these errors to be correlated in order to improve the model fits, as done in previous studies (Boles et al., 2000; Aluja et al., 2005). The fit of the three-dimensional model
AC C
substantially improved, whilst the two- and one-dimensional models remained misfit (see M1a Calibration sample;
Table 4). Cross-validation on the validation sample (M1b Validation
as well as the whole sample verified the initial results (M1
whole sample;
sample;
Table 4)
Table 4). All factor
loadings were significant. Inter-factor correlation patterns of the reduced 16-item latent scale scores did not reveal any appreciable change from the 22-item latent scale scores, and the coefficients were consistent with those reported in previous studies (Aluja et al., 2005; Worley et al., 2008). Emotional exhaustion correlated positively with depersonalisation (roriginal 16
MBI-ES=.36;
rreduced
ACCEPTED MANUSCRIPT MBI-ES
=.29, p<.01), and negatively with personal accomplishment (roriginal
MBI-ES=
-.17,
p<.01); whereas depersonalization correlated
MBI-ES=
-.20; rreduced
negatively with personal
accomplishment (roriginal MBI-ES=.-26; rreduced MBI-ES =-.23, p<.01). 3.5 Validity and reliability of the reduced 16-item MBI-ES
PT
To test for construct, convergent and discriminant validity and reliability, average variance explained, maximum shared variance, composite reliability and Cronbach's alpha for
RI
internal consistency ratios were computed and compared to their thresholds (Hair, Black,
SC
Babin, & Anderson, 2010; Fornell & Larcker, 1981). Results are comparable to those reported in previous studies (see Table 2). Further validity evidence based on the scale’s
NU
relation with other variables was achieved by performing separate structural equation models (SEM). The latent scores of the reduced 16-item MBI-ES subscales were regressed on work
MA
stressors. The regression weights were in the expected directions, and statistically significant: work stressors (workload, emotional demands, work-family conflict and role conflict) were
ED
positively associated with emotional exhaustion and depersonalization. On the other hand, the
EP T
work stressors were negatively related to personal accomplishment, but only emotional demands and the total stress scores were statistically significant (see Table 5).
AC C
3.6 Invariance test of the MBI-ES across cultures and gender Multigroup confirmatory factor analyses (MGFAs) were performed on the adjusted measurement model to test whether or not there are: 1) equality of factor structure, 2) equality of magnitude of factor loadings/ scaling units, 3) cross-group difference in the means of the observed items, 4) equality of error-variance, and 5) equality of factor variance across cultures and gender. 3.6.1
Invariance across cultures
17
ACCEPTED MANUSCRIPT We first established that the adjusted three-factor model fitted the groups for configural invariance test (see M1 Whole sample; M1a Calibration sample; M1b Validation sample; Table 6). Simultaneous analysis of least restricted solution revealed strong metric invariance (see M2 , M2a, M2b; Table 6). Moving forward, additional constraints were imposed to test for scalar invariance. The model fit indices showed poor fit (see M3 , M3a, M3b, Table 6), signalling non-invariant
PT
items in the model. Item-level analysis (Sass, 2011), revealed one item of the emotional
RI
exhaustion and depersonalization subscales (MBI-ES 2 “I feel emotionally drained from my work” and MBI-ES 11 “I worry that this job is hardening me emotionally) and three items of
SC
the personal accomplishment subscale (MBI-ES 7 “I feel very energetic”, 9 “In my work I
NU
deal with emotional problems very calmly” and 12 “I can easily understand how my students feel”) lacked invariance across cultures. This result was further cross-verified on the
MA
validation and calibration samples. Consequently, the intercepts for the
items were freely
estimated, and partial scalar invariance model tested (Vandenberg & Lance, 2000). Result
ED
showed acceptable fit for partial strong scalar invariance (see M4 , M4a, M4b; Table 6). Residual invariance and structural invariance were finally tested by imposing additional
EP T
constraints. Results did not support the residual invariance model (see M5 , M5a, M5b; Table 6) and structural invariance (see M6 ; M6a, M6b; Table 6), because the additional constraints
AC C
altered their model fits significantly, Taken together, the invariance tests supported partial strong factorial invariance across the three cultures (Meredith, 1993; Chen, 2007; Meredith & Teresi, 2006).
3.6.1.1 Structured means analyses of the modified MBI-ES across cultures Based on the MI test results, we performed structured means analysis to test latent mean differences across cultures. Ghana and China had higher latent mean scores than Pakistan on the emotional exhaustion subscale, accomplishment
and
depersonalization
and
lower latent mean score on the personal
subscales. 18
These differences in the emotional
ACCEPTED MANUSCRIPT exhaustion subscale were significant (z=3.617, p<.01) and (z=6.532, p<.01) for Ghana and China respectively. On the other hand, the difference in the personal accomplishment subscale was significant for China (z= -3.073, p<.01), but was not significant for Ghana (z= 1.503, p>.05). Similarly, the differences in the depersonalization subscale was significant for Ghana (z=-1.933, p<.05), but not significant for China (z= -.728, p>.05). These latent means
Invariance across gender
RI
3.6.2
PT
differences were further explored in simple graphs (Fig. 1).
sample;
M1a
calibration sample ;
M1b
validation sample ;
whole
SC
Results showed good model fit for configural invariance test across gender (see M1
Table 7). After imposing constraints on factor
NU
loadings, AIC decreased and CFI was lower than the threshold, indicating strong metric
MA
invariance (see M2 , M2a and M2b; Table 7). The model for scalar invariance (M3 , M3a, M3b; Table 7) was also supported, indicating strong factorial invariance. Residual invariance (M5, M5a, M5b, Table 7) and structural invariance (M6 , M6a, M6b;Table 7) were then tested and
ED
found to be fit, together supporting that the MBI-ES possesses strict factorial invariance
EP T
across gender.
3.6.2.1 Structured means analyses of the MBI-ES across gender
AC C
We then tested latent mean differences across gender. The comparison group (male teachers) scored lower on the emotional exhaustion subscale and higher on the personal accomplishment and depersonalization subscales compared to the reference group (female teachers). These differences were significant; emotional exhaustion (z=-3.399, p<.001), personal accomplishment (z=2.595, p<.01) and depersonalization (z=3.169, p<.001), and further plotted in simple graph (Fig. 2). 4.0 Discussion
19
ACCEPTED MANUSCRIPT The present study validated the three-dimensional structure; emotional exhaustion, depersonalization and personal accomplishment of the MBI-ES, using parallel analysis, CFA and MGCFA. We verified and extended the cross-cultural validity and measurement invariance evidence of the MBI-ES across LMICs (i.e., Chinese, Pakistani and Ghanaian). Our study adds to investigations on culturally-specific issues: cross-cultural meaning and and the conceptual equivalence of burnout
PT
interpretations of the MBI-ES’s items,
RI
dimensions, important to understanding burnout phenomenon to add to the scale’s global
SC
applicability. The study, employing statistically sophisticated methods, highlights insightful revelations of the factorial and structural validity as well as measurement equivalence of the
NU
hypothesized three-factor MBI-ES model.
First, the three-factor pattern and item loadings of the MBI-ES were verified in a cross-
MA
cultural context. In line with previous studies (Denton et al., 2013; Schaufeli & Dierendonck, 1993; Schaufeli et al., 1994), our study did not support the 22-item three-factor structure; as potentially problematic in the cross-cultural context.
ED
some items were reported
EP T
Specifically, these items had poor psychometric properties; low factor loadings and crossloadings on more than one factor. Similar results have been reported by previous studies that assessed teacher burnout with the MBI-ES in different geographic regions, testing the MBI-
AC C
ES’s three-factor structure in Europe with samples from Catalonia (Aluja et al., 2005), Netherlands (Schaufeli & Dierendonck, 1993; Schaufeli et al., 1994), Norway (Richardsen & Martinussen, 2004), and also North America including the U.S. (Boles et al., 2000), and Canada (Byrne, 1991, 1993, 1994). This reveals the items’ inadequate predictive capacity for their latent constructs, and clear-cut interpretability of the 22-item MBI-ES three-factor solution (Destin, 2001). Maslach and Jackson (1981) advised burnout researchers to exclude some items (12 and 16) from the MBI-ES when assessing its measurement structure. Similarly, several scholars 20
ACCEPTED MANUSCRIPT of teacher burnout hinted of the need to exclude some of these same items, including (items 5, 13, 14, 16, 18, 20) in burnout assessment in different geographic regions that adopt the MBI-ES. A recent study on a cross-cultural adoption of the MBI-ES in eleven European countries with ten languages found several items, including (5, 14, 16, 20) problematic as they failed to possess similar interpretation across cultures (Squires et al., 2014). This
PT
magnitude of the 22-item MBI-ES measurement problems suggest the need for an adjusted
RI
model which is context- and culturally-relevant for the understanding of the underlying structure of burnout (Worley et al., 2008). The replicability of the MBI-ES structure and
SC
items’ validation across the three cultures, therefore verified the constructs’ cross-cultural
NU
meaning and usefulness.
The reduced 16-item three-dimensional factor model produced acceptable fit across the
MA
three cultures when some item error variances and the latent factors were correlated. This is consistently evidenced in previous studies that tested the MBI-ES’s three-factor model, and
ED
found it fit by allowing its factors to correlate, and permitting residual covariance (Worley et
EP T
al., 2008; Denton et al., 2013). We found comparable relationship patterns between the burnout factors and those reported in previous studies (Worley et al., 2008). Support for the two subscales: emotional exhaustion and depersonalization as being the core of burnout was
AC C
achieved (Hawrot & Koniewski, 2017), as they shared strong positive relation, and correlated negatively with personal accomplishment. These inter-factor correlations and the associations found between the burnout subscales and work stressors, together with validity and reliability evidence verified the scale’s psychometric strength, when adopted in cross-cultures (Boles et al., 2000; Aluja et al., 2005). Thus, we found the underlying factors in the reduced 16-item MBI-ES model substantially reliable and valid, fitting the Chinese, Pakistani and Ghanaian cultural contexts, allowing for cross-cultural invariance assessment and possible teacher burnout comparison. 21
ACCEPTED MANUSCRIPT The reduced 16-item MBI-ES in the cross-cultural context revealed partial strong factorial invariance across cultures and strong factorial invariance across gender (Reise et al., 1993; Cheung & Rensvold, 2002), serving as pre-requisites for multi-group comparison (Meredith & Teresi, 2006), for researchers to discern differences in teacher burnout in respect of the dimensions explored (Cheung & Rensvold, 2002; Meredith & Teresi, 2006; Chen,
PT
2007; Sass, 2011). That notwithstanding, our study found non-invariance items (MBI-ES 2,
RI
7, 9, 11, 12) across cultures. Thus, teachers’ perception of these items varied among those in
SC
Pakistan, Ghana and China. Possible reasons for this variations might be attributable to contextual elements of borderline distinctiveness in semantics, content, technicality and
NU
concepts of burnout as advised by (Squires et al., 2014).
MA
Significant differences in teacher burnout across countries and gender were revealed in our study. Ghanaian and Chinese preschool teachers reported higher experience of emotional exhaustion than their Pakistani counterparts. For personal accomplishment, the Pakistani
ED
preschool teachers recorded higher levels than their Ghanaian and Chinese counterparts. It
EP T
was further revealed that, the Pakistani preschool teachers experienced higher levels of depersonalization than the Ghanaian and Chinese preschool teachers. Despite this variability
AC C
in teacher burnout experience across these three low and middle income countries, classroom and school characteristics as presented in table 1 show fairly similar contextual conditions. Given the findings of empirical cultural analysis of burnout that assigned marked teacher burnout differences to cultural perspectives including the level of professional status of teaching in society (Denton et al., 2013), religious orientation (Koki, 2000), social support (Pienaar & Wyk, 2006), and existential significance of teaching profession in certain cultures (Pines, 2002), it is therefore indicative that, the results of this study may be due to the role of the broader cultural and systemic influence on teacher burnout reactions among pre-school teachers (Sarros & Sarros, 1990). This resonates with the theorisation by Bronfenbrenner & 22
ACCEPTED MANUSCRIPT Morris (1998), who highlighted not just the micro ecosystem (i.e., immediate contextual factors) of the school, but also, the macro level (i.e., broader cultural conditions) of economic, technological, legal, scientific and institutional ecosystem of people-environment relations.
PT
Gender differences where female teachers reported higher emotional exhaustion than their male counterparts, lower perceptions of personal accomplishment and depersonalization than
RI
male teachers have been found in most studies (Russell et al., 1987; Byrne, 1991; Lau et al.,
SC
2005; Grayson & Alvarez, 2008; Stoeber & Rennert, 2008; Droogenbroeck et al., 2014). The gender differences reported in our study might be explicable by the theses offered by
NU
Woodside et al. (2008) and Borthwick, Thornell, & Wilkinson, (1982), that extra societal gender roles such as excessive family-level responsibilities overburden females to be
MA
emotionally drained (Woodside et al., 2008), whilst men are expected to contain their emotions leading to alienation (Barbara M. Byrne, 1991; Lau et al., 2005b; Woodside et al.,
EP T
(Woodside et al., 2008).
ED
2008), is distinctively gender roles socialization issues which is contextually defined
Altogether, our study results support the cross-cultural factorial validity of a shortened
AC C
16-item MBI-ES, and its use plausible for cross-cultural teacher burnout evaluation. This possible cross-cultural evidence allows for discovery and understanding of key cross-country differences and similarities that may be important for teacher burnout management. Nevertheless, the interpretability of our study’s results should be made within the context of limitations. A number of limitations have been noted, and prominent amongst them is the generalizability of the results. The study samples are from Ghana, China, and Pakistan and by no means generalizable to the rest of Africa and Asia. The low correlations among some
23
ACCEPTED MANUSCRIPT items could relate to the complex nature of the constructs measured; emotional exhaustion, depersonalization and personal accomplishment (Squires et al., 2014). We recommend that futher tests in different African and Asian contexts are necessary. Regardless of these limitations, this study offers practical implications for ECE teacher burnout management.
PT
Our study’s results patterns indicate that, cross-cultural learning and sharing among preschool teachers through preschool teacher exchanges and continuous collaborations
RI
among ECE stakeholders can generate unique understanding of the best ways to fight against
SC
teacher burnout. For instance, Chinese and Ghanaian preschool teachers can adopt some unique and applicable Pakistani practices that make them less emotionally exhaustive,
NU
whereas higher depersonalization and personal accomplishment in the Pakistani contexts can be managed by looking into applicable practices in the Ghanaian and Chinese contexts. Our
MA
study has implication for a relook at the gender roles in Ghana, China and Pakistan that serve to facilitate and/ or deepen emotional exhaustion gap between men and women and that of
ED
depersonalization and personal accomplishment.
EP T
On policy, the findings of our study can shape policy discussions and formulation in creating harmonious educational environment through close study of the differences in the
AC C
three cultural settings and promote positive cultural learning and adaptions of relevant, and applicable ECE practices in other cultures. These investigations have become necessary in the light of the findings relating to differences in teacher burnout experience in different countries. Our findings also add to existing literature on the theoretical development and cross-cultural assessment of the teacher burnout with the MBI-ES. The results provoke discussions on pressing matters, including why emotional exhaustion is higher among Ghanaian and Chinese pre-school teachers than among the Pakistani preschool teachers in order to come up with sound explanation for these differences and ways to address them.
24
ACCEPTED MANUSCRIPT Reference Affrunti, N. W., Mehta, T., Rusch, D., & Frazier, S. (2018). Job demands, resources, and stress among staff in after school programs: Neighborhood characteristics influence associations in the job demands-resources model. Children and Youth Services Review,
PT
88, 366–374. https://doi.org/10.1016/j.childyouth.2018.03.031 Aiken, L. S., Stein, J. A., & Bentler, P. M. (1994). Structural equation analyses of clinical
RI
subpopulation differences and comparative treatment outcomes: Characterizing the daily
SC
lives of drug addicts,. Journal of Consulting and Clinical Psychology, 62, 488–499.
332. https://doi.org/10.1007/BF02294359
NU
AKAIKE, H. (1987). FACTOR ANALYSIS AND AIC. PSYCHOMETRIKA, 52(3), 317–
MA
Allison, P. D. (2003). Missing Data Techniques for Structural Equation Modeling. Journal of
ED
Abnormal Psychology, 112(4), 545–557. https://doi.org/10.1037/0021-843X.112.4.545 Aluja, A., Blanch, A., & Garcia, L. F. (2005). Dimensionality of the Maslach Burnout
EP T
Inventory in School Teachers : A Study of Several Proposals . European Journal of Psychological Assessment, (January). https://doi.org/10.1027/1015-5759.21.1.67
AC C
Antonopoulou, P., Killian, M., & Forrester, D. (2017). Levels of stress and anxiety in child and family social work: Workers’ perceptions of organizational structure, professional support and workplace opportunities in Children’s Services in the UK. Children and Youth Services Review. https://doi.org/10.1016/j.childyouth.2017.02.028 Arbuckle, J. L. (2017). Amos 25 User’s Guide. SPSS IBM. Arlot, S., & Celisse, A. (2010). A survey of cross-validation procedures for model selection. Statistics Surveys, 4, 40–79. https://doi.org/10.1214/09-SS054 Bentler, P. (1990). Comparative fit indices in structural models. Psychological Bulletin, 25
ACCEPTED MANUSCRIPT 107(2), 238–246. https://doi.org/0.1037/0033-2909.107.2.238 Bentler, P. M., & Bonett, D. G. (1980). Significance tests and goodness of fit in the analysis of covariance structures. Psychological Bulletin, 88(3), 588–606. https://doi.org/10.1037/0033-2909.88.3.588
PT
Boles, J. S., Dean, D. H., Ricks, J. M., Short, J. C., & Wang, G. (2000). The Dimensionality of the Maslach Burnout Inventory across Small Business Owners and Educators.
RI
Journal of Vocational Behavior, 56(1), 12–34. https://doi.org/10.1006/jvbe.1999.1689
SC
Borthwick, P., Thornell, J., & F. Wilkinson. (1982). Teacher burnout: A study of professional
NU
and personal variables. In Proceedings of the Annual Meeting of the American Association for Teacher Education. Houston, TX.
MA
Bronfenbrenner, U., & Morris, P. (1998). The ecology of de velopmental processes. In R. M. Lerner (Ed.), Handbook of child psychology: Vol. 1. Theoretical models of human devel-
ED
opment (pp. 993–1028). New York: Wiley. Brooks-Gunn, J., Berlin, L. J., Leve.
EP T
Brookings, J. B., Bolton, B., Brown, C. E., & McEvoy, A. (1985). Self-Reported Job Burnout Among Female Human Service Professionals. Journal of Occupational Behaviour, 6(2),
AC C
143–150.
Bryk, A. S., & Raudenbush, S. W. (1992). Hierarchical linear models in social and behavioral research: Applications and data analysis methods. Newbury Park, CA: Sage. Buys.
Byrne, B. M. (1991). Burnout: Investigating the impact of background variables for elementary, intermediate, secondary, and university educators. Teaching and Teacher Education, 7(2), 197–209. https://doi.org/10.1016/0742-051X(91)90027-M Byrne, B. M. (1993). The Maslach burnout inventory:Testing for factorial validity and 26
ACCEPTED MANUSCRIPT invariance across elementary, intermediate and secondary teachers. Journal of Occupational and Organizational Psychology, 66, 197–212. Byrne, B. M. (1994). Testing for the Factorial Validity, Replication, and Invariance of a Measuring Instrument: A Paradigmatic Application Based on the Maslach Burnout
PT
Inventory. Multivariate Behavioral Research, 29(3), 263288. https://doi.org/10.1207/s15327906mbr2903
RI
Byrne, M. B., Shavelson, R. J., & Muthen, B. O. (1989). Testing for the equivalence of factor
SC
covariance and mean structures: The issue of partial measurement invariance.
NU
Psychological Bulletin, 105, 456–466.
Chen, F. F. (2007). Sensitivity of goodness of fit indexes to lack of measurement invariance.
MA
Structural Equation Modeling, 14(3), 464–504. https://doi.org/10.1080/10705510701301834
ED
Cheung, G. W., & Rensvold, R. B. (2002). Evaluating Goodness-of- Fit Indexes for Testing
EP T
Measurement Invariance. Structural Equation Modeling: A Multidisciplinary Journal, 9(2), 233–255. https://doi.org/10.1207/S15328007SEM0902
AC C
Cochran, W. G. (1952). Test of Goodness of Fit. The Annals of Mathematical Statistics, 23(3), 315–345. https://doi.org/10.1214/aoms/1177729380 Costello, A. B., & Osborne, J. W. (2005). Best Practices in Exploratory Factor Analysis : Four Recommendations for Getting the Most From Your Analysis. Practical Assessment, Research & Evaluation, 10(7). Retrieved from http://pareonline.net/getvn.asp?v=10&n=7 Craparo, G., Faraci, P., & Gori, A. (2015). Psychometric properties of the 20-item toronto alexithymia scale in a group of Italian younger adolescents. Psychiatry Investigation,
27
ACCEPTED MANUSCRIPT 12(4), 500–507. https://doi.org/10.4306/pi.2015.12.4.500 Cunningham, W. G. (1983). Teacher burnout - solutions for the 1980’s: A review of the literature. The Urban Review, 15(1), 37–51. Curran, P. J., West, S. G., & Finch, J. F. (1996). The Robustness of Test Statistics to
PT
Nonnormality and Specification Error in Confirmatory Factor Analysis. Psychological
RI
Methods, 1(1), 16–29. https://doi.org/10.1037/1082-989X.1.1.16
Densten, I. L. (2001). Re-Thinking Burnout. Journal of Organizational Behavior, 22(8), 833–
SC
847. https://doi.org/10.1002/job.
NU
Denton, E. ge, Chaplin, W. F., & Wall, M. (2013). Teacher burnout: a comparison of two cultures using confirmatory factor and item response models. International Journal of
MA
Quantitative Research in Education, 1(2), 147.
ED
https://doi.org/10.1504/IJQRE.2013.056463
Dimitrov, D. M. (2006). Comparing groups on latent variables: A structural equation
EP T
modeling approach. Work, 26(2006), 429–436. Droogenbroeck, F. Van, Spruyt, B., & Vanroelen, C. (2014). Burnout among senior teachers :
AC C
Investigating the role of workload and interpersonal relationships at work. Teaching and Teacher Education, 43, 99–109. https://doi.org/10.1016/j.tate.2014.07.005 Field, A. (2005). Discovering Statistics Using IBM SPSS Statistics. 2nd Edition. London: Sage Publications. Firth, H., McIntee, J., McKweown, P., & Britton, P. G. (1985). Maslach Burnout Inventory: Factor Structure and Norms for British Nursing Staff. Psychological Reports, 57, 147– 150. Floyd, F. J., & Widaman, K. F. (1995). Factor Analysis in the Development and Refinement 28
ACCEPTED MANUSCRIPT of Clinical Assessment Instruments. Psychological Assessment, 7(3), 286–299. https://doi.org/10.1037/1040-3590.7.3.286 Fornell, & Larcker. (1981). structural equation models with unobservable variables and measurement error. Journal of Marketing Research, 48(1), 39–50.
PT
Freire, C., Ferradás, M. del M., Núñez, J. C., & Valle, A. (2017). Estructura factorial de las
https://doi.org/10.1016/j.ejeps.2016.10.001
SC
Journal of Education and Psychology, (10), 1–8.
RI
Escalas de Bienestar Psicológico de Ryff en estudiantes universitarios. European
NU
Freudenberger, H. J. (1974). Staff Burn-Out. Journal of Social Issues, 30(1), 159–165. https://doi.org/10.1111/j.1540-4560.1974.tb00706.x
MA
from the SAGE Social Science Collections . All Rights Reserved . (n.d.).
ED
Frone, M. R., Russell, M., & Barnes, G. M. (1996). Work--family conflict, gender, and health-related outcomes: A study of employed parents in two community samples.
8998.1.1.57
EP T
Journal of Occupational Health Psychology, 1(1), 57–69. https://doi.org/10.1037/1076-
AC C
Galanakis, M., Moraitou, M., Garivaldis, F. J., & Stalikas, A. (2009). Factorial Structure and Psychometric Properties of the Maslach Burnout Inventory (MBI) in Greek Midwives. Europe’s Journal of Psychology, 5(4), 52–70. https://doi.org/10.5964/ejop.v5i4.240 Golembiewski, R. T., Munzenrider, R., & Carter, D. (1983). Phases of progressive burnout and their work site covariants: Critical issues in OD researcg and Praxis. The Journal of Applied Behavioral Science, 19(4), 461–481. Grayson, Ã. J. L., & Alvarez, H. K. (2008). School climate factors relating to teacher burnout : A mediator model. Teacher and Teaching Education, 24, 1349–1363. 29
ACCEPTED MANUSCRIPT https://doi.org/10.1016/j.tate.2007.06.005 Griffiths, A., Royse, D., & Walker, R. (2018). Stress among child protective service workers: Self-reported health consequences. Children and Youth Services Review, 90(May), 46– 53. https://doi.org/10.1016/j.childyouth.2018.05.011
RI
7th. ed. Prentice-Hall, Inc., Upper Saddle River, NJ, USA.
PT
Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate Data Analysis,
Hancock, G. R. (1997). Structural equation modeling methods of hypothesis testing of latent
SC
variable means. Measurement and Evaluation in Counseling and Development, 30, 91–
NU
105.
Hawrot, A., & Koniewski, M. (2017). Factor Structure of the Maslach Burnout Inventory –
MA
Educators Survey in a Polish-Speaking Sample. Journal of Career Assessment, 1–16.
ED
https://doi.org/10.1177/1069072717714545
Hernández, T. J., & Sánchez, G. (2012). Prevalence of Burnout Syndrome in Employees,
EP T
(1996), 207–225.
Heydari, S., & Abbasian, G.-R. (2016). The relationship between Iranian EFL teachers’
AC C
professional development and their job burnout 1. Basic Research Journal of Education Research and Review Basic Research Journal of Education Research and Review ISSN, 4(2), 2315–6872.
Horn, J. E., Schaufeli, W. B., Greenglass, E. R., & Burke, R. J. (1997). A CANADIANDUTCH COMPARISON OF TEACHERS’ BURNOUT. Psychological Reports, 81, 371–382. Horn, J. L. (1965). FACTORS IN FACTOR ANALYSIS. PSYCHOMETRIKA, 30(2), 179– 185. 30
ACCEPTED MANUSCRIPT Hu, L. T., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis : Conventional criteria versus new alternatives Cutoff Criteria for Fit Indexes in Covariance Structure Analysis : Conventional Criteria Versus New Alternatives. A Multidisciplinary Journal, 6(1), 1–55. https://doi.org/10.1080/10705519909540118
PT
Iwanicki, E. F., & Schwab, R. L. (1981). A Cross Validation Study of the Maslach Burnout Inventory. Educational and Psychological Measurement, 41.
RI
https://doi.org/10.1177/001316448104100425
SC
Jackson, D. L., Gillaspy, J. A., & Purc-Stephenson, R. (2009). Reporting Practices in Confirmatory Factor Analysis: An Overview and Some Recommendations.
NU
Psychological Methods, 14(1), 6–23. https://doi.org/10.1037/a0014694
MA
Kanste, O., Miettunen, J., & Kyngäs, H. (2006). Factor structure of the Maslach Burnout Inventory among Finnish nursing staff. Nursing and Health Sciences, 8(4), 201–207.
ED
https://doi.org/10.1111/j.1442-2018.2006.00283.x
EP T
Kaur, A., & Noman, M. (2015). Exploring Classroom Practices in Collectivist Cultures Through the Lens of Hofstede ’ s Model Exploring Classroom Practices in Collectivist
AC C
Cultures Through the Lens. The Qualitative Report, 20(11), 1794–1811. Koki, S. (2000). Prevention and Intervention for Effective Classroom Organization and Management in Pacific Classrooms, (808), 1–12. Kokkinos, C. M. (2006). Factor structure and psychometric properties of the Maslach Burnout Inventory-Educators Survey among elementary and secondary school teachers in Cyprus. Stress and Health, 22(1), 25–33. https://doi.org/10.1002/smi.1079 KotovA, M. B., Rozanov, K. B., Alexandrov, A. A., & Ivanovo, E. I. (2017). Professional burnout and quality of life in teachers. Voprosy Psikhologii, 2017–Janua(2), 67–79.
31
ACCEPTED MANUSCRIPT Kristensen, T. S., Borritz, M., Villadsen, E., & Christensen, B. (2005). The Copenhagen Burnout Inventory : A new tool for the assessment of burnout The Copenhagen Burnout Inventory : A new tool for the. Work & Stress: An International Journal of Work, Health & Organisations, 19(3), 192–207. https://doi.org/10.1080/02678370500297720
PT
Lau, P. S. Y., Yuen, M. A. N. T. A. K., & Chan, R. M. C. (2005a). Do demographic characteristics make a difference to burnout among hong kong secondary school
RI
teachers? Social Indicators Research, (71), 491–516. https://doi.org/10.1007/s11205-
SC
004-8033-z
Lau, P. S. Y., Yuen, M. A. N. T. A. K., & Chan, R. M. C. (2005b). Do demographic
NU
characteristics make a difference to burnout among hong kong secondary school
MA
teachers?, 491–516. https://doi.org/10.1007/s11205-004-8033-z Lauermann, F., & König, J. (2016). Teachers’ professional competence and wellbeing:
ED
Understanding the links between general pedagogical knowledge, self-efficacy and burnout. Learning and Instruction, 45, 9–19.
EP T
https://doi.org/10.1016/j.learninstruc.2016.06.006 Lizano, E. L., & Mor Barak, M. (2015). Job burnout and affective wellbeing: A longitudinal
AC C
study of burnout and job satisfaction among public child welfare workers. Children and Youth Services Review, 55, 18–28. https://doi.org/10.1016/j.childyouth.2015.05.005 Maccallum, R. C., Browne, M. W., & Sugawara, H. M. (1996). Power Analysis and Determination of Sample Size for Covariance Structure Modeling of fit involving a particular measure of model. Psychological Methods, 1(2), 130–149. Maslach, C. (2015). Burnout , Psychology of. International Encyclopedia of Social & Behavioral Sciences (Second Edi, Vol. 2). Elsevier. https://doi.org/10.1016/B978-0-08097086-8.26009-1 32
ACCEPTED MANUSCRIPT Maslach, C., & Jackson, S. E. (1981a). Maslach Burnout Inventory--ES Form. PsycTESTS. https://doi.org/10.1037/t05190-000 Maslach, C., & Jackson, S. E. (1981b). The measurement of experienced burnout. JOURNAL OF OCCUPATIONAL BEHAVIOUR, 2, 99–113.
Evaluating Stress: A Book of Resources (pp. 191–218).
RI
https://doi.org/10.1017/S0033291798257163
PT
Maslach, C., Jackson, S. E., & Leiter, M. P. (1997). Maslach Burnout Inventory. In
SC
Maslach, C., Schaufeli, W. B., & Leiter, M. P. (2001). Job burnout. Annual Review of
NU
Psychology, 52, 397–422.
McGeary, C. A., & McGeary, D. D. (2012). Occupational Burnout. In R. J. Gatchel & I. Z.
MA
Schultz (Eds.), Handbook of Occupational Health and Wellness (pp. 3–563).
ED
https://doi.org/10.1007/978-1-4614-4839-6
Medvedskaya, E. I., & Sheryagina, E. V. (2017). Empathy and professional burnout of
EP T
Russian and Belarusian teachers. Konsul’tativnaia Psikhologiia I Psikhoterapiia, 25(2), 59–74. https://doi.org/http://dx.doi.org/10.17759/cpp.2017250204
AC C
Mellenbergh, J. (1983). ITEM BIAS AND ITEM RESPONSE. International Journal of Educational Research, 13(2), 127–143. https://doi.org/https://doi.org/10.1016/08830355(89)90002-5
Meredith, W. (1993). MEASUREMENT INVARIANCE, FACTOR ANALYSIS AND FACTORIAL INVARIANCE. PSYCHOMETRIKA, 58(4), 525–543. https://doi.org/10.1007/BF02294825 Meredith, W., & Teresi, J. A. (2006). An essay on measurement and factorial invariance. Medical Care, 44(11), S69–S77. https://doi.org/10.1097/01.mlr.0000245438.73837.89 33
ACCEPTED MANUSCRIPT Mészáros, V., Ádám, S., Szabõ, M., Szigeti, R., & Urbán, R. (2013). The bifactor model of the maslach burnout inventory-human services survey (MBI-HSS) - An alternative measurement model of burnout. Stress and Health, 30(1), 82–88. https://doi.org/10.1002/smi.2481
PT
Mueller, R. O., & Hancock, G. R. (2008). Best practices in structural equation modeling. Best Practices in Quantitative Methods, 488–508. https://doi.org/10.4135/9781412995627
RI
Oh, S. H., & Lee, M. (2009). Examining the psychometric properties of the Maslach Burnout
Youth Services Review, 31(2), 206–210.
NU
https://doi.org/10.1016/j.childyouth.2008.07.012
SC
Inventory with a sample of child protective service workers in Korea. Children and
MA
Palmira Faraci. (2018). TESTING FOR THE DIMENSIONALITY OF THE MASLACH BURNOUT INVENTORY (MBI) ON A SAMPLE OF HIGH SCHOOL TEACHERS.
ED
Clinical Neuropsychiatry, 15(1), 50–59.
EP T
Pérez-Mármol, J. M., & Brown, T. (2018). An Examination of the Structural Validity of the Maslach Burnout Inventory-Student Survey (MBI-SS) Using the Rasch Measurement
AC C
Model. Health Professions Education. https://doi.org/10.1016/j.hpe.2018.05.004 Perrewé, P. L., Hochwarter, W. A., Rossi, A. M., Wallace, A., Maignan, I., Castro, S. L., Van Deusen, C. A. (2002). Are work stress relationships universal? A nine-region examination of role stressors, general self-efficacy, and burnout. Journal of International Management, 8(4), 423. https://doi.org/10.1016/S1075-4253(02)00107-2 Pienaar, J., & Wyk, D. Van. (2006). Teacher burnout : construct equivalence and the role of union membership. South African Journal of Education, 26(4), 541–551. Pines, A. M. (2002a). Teacher Burnout : A psychodynamic existential perspective. Teachers
34
ACCEPTED MANUSCRIPT and Teaching: Theory and Practice, 8(2), 37–41. https://doi.org/10.1080/13540600220127331 Pines, A. M. (2002b). Theory and Practice Teacher Burnout : A psychodynamic existential perspective. Teachers and Teaching: Theory and Practice, 8(2), 37–41.
PT
https://doi.org/10.1080/13540600220127331 Poghosyan, L., Aiken, L. H., & Sloane, D. M. (2009). Factor structure of the Maslach
RI
burnout inventory: An analysis of data from large scale cross-sectional surveys of nurses
NU
https://doi.org/10.1016/j.ijnurstu.2009.03.004
SC
from eight countries. International Journal of Nursing Studies, 46(7), 894–902.
Reise, S. P., Widaman, K. F., & Pugh, R. H. (1993). Confirmatory factor analysis and item
MA
response theory: two approaches for exploring measurement invariance. Psychological Bulletin, 114(3), 552–566. https://doi.org/10.1037/0033-2909.114.3.552
ED
Richardsen, A. M., & Martinussen, M. (2004). The Maslach Burnout Inventory : Factorial
EP T
validity and consistency across occupational groups in Norway. Journal of Occupational and Organizational Psychology, (77), 377–384.
AC C
https://doi.org/0.1348/0963179041752691 Rizzo, J. R., House, R. J., & Lirtzman, S. I. (1970). Role Conflict and Ambiguity in Complex Organizations. Administrative Science Quarterly, 15(2), 150–163. Russell, D. W., Altmaier, E., & Vanvelzen, D. (1987). Job-related stress, social support, and burnout among classroom teachers. Journal of Applied Psychology, 72(2), 269–274. Salloum, A., Kondrat, D. C., Johnco, C., & Olson, K. R. (2015). The role of self-care on compassion satisfaction, burnout and secondary trauma among child welfare workers. Children and Youth Services Review, 49, 54–61.
35
ACCEPTED MANUSCRIPT https://doi.org/10.1016/j.childyouth.2014.12.023 Salmela-Aro, K., & Read, S. (2017). Study engagement and burnout profiles among Finnish higher education students. Burnout Research, 7, 21–28. https://doi.org/10.1016/j.burn.2017.11.001
RI
Study. Australian Journal of Education, 34(2), 145–152.
PT
Sarros, A. M., & Sarros, J. (1990). How Burned out are our Teachers ? A Cross-cultural
SC
``, J. C. (1990). How Burned out are our Teachers ? A Cross-cultural Study, 34(2), 145–152. Sass, D. A. (2011). Testing measurement invariance and comparing latent factor means
NU
within a confirmatory factor analysis framework. Journal of Psychoeducational
MA
Assessment, 29(4), 347–363. https://doi.org/10.1177/0734282911406661 Schaufeli, W. B. (2017). Applying the Job Demands-Resources model: A “how to” guide to
ED
measuring and tackling work engagement and burnout. Organizational Dynamics, 46(2), 120–132. https://doi.org/10.1016/j.orgdyn.2017.04.008
EP T
Schaufeli, W. B., Daamen, J., & Mierlo, H. Van. (1994). Burnout among Dutch teachers: An MBI-Validity study. Educational and Psychological Measurement, 54(3), 803–812.
AC C
https://doi.org/10.1177/0013164494054003027 Schaufeli, W. B., & Van Dierendonck, D. (1993). The construct validity of two emotional exhaustion measures. J Ournal of Organizational Behavior, 14(February 1991), 631– 647. https://doi.org/10.1002/job.4030140703 Schimmenti, A. (2016). Psychometric properties of the Adolescent Dissociative Experiences Scale in a sample of Italian adolescents. Journal of Trauma and Dissociation, 17(2), 244–257. https://doi.org/10.1080/15299732.2015.1064507 Schutte, N., Toppinen, S., Kalimo, R., & Schaufeli, W. B. (2000). The factorial validity of the 36
ACCEPTED MANUSCRIPT Maslach Burnout Inventory - General Survey (MBI-GS) across occupational groups and nations. Schwarzer, R., Schmitz, G. S., & Tang, C. (2000). Teacher Burnout in Hong Kong and Germany: A Cross-Cultural Validation of the Maslach Burnout Inventory. Anxiety,
PT
Stress & Coping, 13(3), 309–326. https://doi.org/10.1080/10615800008549268 Siegrist, J., Starke, D., Chandola, T., Godin, I., Marmot, M., Niedhammer, I., & Peter, R.
RI
(2004). The measurement of effort-reward imbalance at work: European comparisons.
SC
Social Science and Medicine, 58(8), 1483–1499. https://doi.org/10.1016/S0277-
NU
9536(03)00351-4
Squires, A., Finlayson, C., Gerchow, L., Cimiotti, J. P., Matthews, A., Schwendimann, R., …
MA
Sermeus, W. (2014). Methodological considerations when translating "burnout" Burnout Research, 1(2), 59–68.
ED
https://doi.org/10.1016/j.burn.2014.07.001
EP T
Stoeber, J., & Rennert, D. (2008). Perfectionism in school teachers : Relations with stress appraisals , coping styles, and burnout. Anxiety, Stress & Coping: An International
AC C
Journal, 21(1), 37–53. https://doi.org/10.1080/10615800701742461 Suñer-Soler, R., Grau-Martín, A., Flichtentrei, D., Prats, M., Braga, F., Font-Mayolas, S., & Gras, M. E. (2014). The consequences of burnout syndrome among healthcare professionals in Spain and Spanish speaking Latin American countries. Burnout Research, 1(2), 82–89. https://doi.org/10.1016/j.burn.2014.07.004 Tabachnick, B. G., & Fidell, L. S. (2007). Using Multivariate Statistics, (5th ed.). Needham Heights, MA: Allyn and Bacon Publishers. Needham Heights, MA: Allyn and Bacon Publishers.
37
ACCEPTED MANUSCRIPT Unesco. (2015). Education for all 2015 National Review. Unesco, 1–67. UNESCO. (2008). Countries on the Move. UNESCO Education for All GlobalMonotoring Report. Retrieved from: http://www.unesco.org/education/gmr2008/press/chapter3.pdf. Valente, M. do S. da S., Wang, Y. P., & Menezes, P. R. (2018). Structural validity of the
PT
Maslach Burnout Inventory and influence of depressive symptoms in banking workplace: Unfastening the occupational conundrum. Psychiatry Research, 267(March),
RI
168–174. https://doi.org/10.1016/j.psychres.2018.05.069
SC
Vandenberg, R. J., & Lance, C. E. (2000). A Review and Synthesis of the Measurement
NU
Invariance Literature: Suggestions, Practices, and Recommendations for Organizational Research. Organizational Research Methods, 3(1), 4–70.
MA
https://doi.org/10.1177/109442810031002
Veldhoven, M., & Meijman, T. (1994). Het meten van psychosociale arbeidsbelasting met
ED
een vragenlijst: de vragenlijst beleving en beoordeling van de arbeid (VBBA).
EP T
Measuring psychosocial workload with a survey: the questionnaire on the experience and evaluation of work (QEEW). Amsterdam: NIA.
AC C
Walkey, F. H., & Green, D. E. (1992). An Exhaustive Examination of the Replicable Factor Structure of the Maslach Burnout Inventory. Educational and Psychological Measurement, 52.
Woodside, J. R., Miller, M. N., Floyd, M. R., McGowen, K. R., & Pfortmiller, D. T. (2008). Observations on burnout in family medicine and psychiatry residents. Academic Psychiatry, 32(1), 13–19. https://doi.org/10.1176/appi.ap.32.1.13 Worley, J. A., Vassar, M., Wheeler, D. L., & Barnes, L. L. B. (2008). Factor Structure of Scores From the Maslach Burnout Inventory. Educational and Psychological
38
ACCEPTED MANUSCRIPT Measurement, 68(5), 797–823. https://doi.org/10.1177/0013164408315268 Xanthopoulou, D., Bakker, A. B., Kantas, A., & Demerouti, E. (2012). Measuring burnout and work engagement: Factor structure, invariance, and latent mean differences across Greece and the Netherlands. International Journal of Business Science and Applied
AC C
EP T
ED
MA
NU
SC
RI
PT
Management, 7(2), 40–52. https://doi.org/10.1080/09585192.2012.751438
39
ACCEPTED MANUSCRIPT Acknowledgement This study was funded by the Zhejiang Normal University (Graduate School PhD candidates’ sponsorship). We further extend our sincere thank you, to prof. Fan Chong Jun, Business School, the University of Shanghai for Science & Technology (USST) and prof. Yun-Fei Shao, University of Electronic Science and Technology of China for their statistical support and tutelage in the conduct of this study.
AC C
EP T
ED
MA
NU
SC
RI
PT
Declaration of Conflicting Interests The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
40
ACCEPTED MANUSCRIPT Highlights
Explored the factorial validity of the MBI-ES and burnout differences across three cultural contexts and gender. The MBI-ES fitted the three cultures after item-level modifications were made.
Significant differences in teacher burnout across countries and gender were found.
AC C
EP T
ED
MA
NU
SC
RI
PT
41
ACCEPTED MANUSCRIPT Table 1: Sample demographics and work characteristics, stratified by country (N=869) Descriptive statistics: China(n=274)
Participants’ demographics & work characteristics
Frequency (%)
Demographic Profile of ECTs Gender M ale Female Age category <30 years 30 to 39 40
Descriptive statistics: Ghana(n=285) Frequency (%)
M ean±SD
101(%)
93(%) 181(%) 33.1 ±7.59
34.45±10.03
34(12%) 81(28%) 93(33%) 77(27%)
55(17.7%) 23(7.4%) 206(66.5%) 26(8.4%)
145(51%) 129(45%) 9(3%) 2(1%)
188(60.6%) 116(37.4%) 5(1.6%) 1(0.3%)
NU
250(81%) 52(17%) 7(2%) 0(0%)
8.7±6.84
MA
149(48%) 106(34%) 54(18%) 259(84%) 50(16%)
10.62±8.52
8.07±6.30
132(46%) 70(25%) 83(29%)
165(53.2%) 88(28.4%) 57(18.4%)
141(49.5%) 144(50.5%)
154(49.7%) 156(50.3%) 32.96±6.20
32.30±8.3
ED
33.33±8.40 29(10%) 58(21%) 92(32%) 106(37%)
17(5.5%) 23(7.4%) 115(37.1%) 155(50%)
2(1%) 95(31%) 212(68%)
51(18%) 192(67%) 42(15%)
40(12.9%) 259(83.5%) 11(3.5%)
3(1%) 21(7%) 157(51%) 128(41%)
EP T
AC C
2.24±1.18
2.74±.99
SC
78(25%) 55(18%) 174(56%) 2(1%)
31.82±6.6
RI
1.94±.67
M ean±SD
140(45.2%) 122(39.4%) 48(15.5%)
PT
112(39%) 75(26%) 98(35%)
Educational level
Frequency (%)
97(%) 213(%)
184(%)
109(35%) 120(39%) 80(26%)
Junior High School High/Technical School College/Undergraduate/Diploma M asters + M arital Status M arried Single, Never M arried Single, Divorced Single, Widow (er) ECTs Work-related Characteristics Work experience 6 years or below 7 to 15 years >15 years School Type Public Private Class Size 15 16 to 25 26 to 35 >35 Job Task Child Caring Child Teaching Caring & Teaching Work hours/day 5 hours/day >5 hours/day Notes: ECTs= early childhood education range 18 to 59 (M age =33.27, SD=10.03)
M ean±SD
Descriptive statistics: Pakistan(n=310)
8.65±1.13 6.21±1.01 3(1%) 0(0%) 13(4.2%) 306(99%) 285(100) 297(95.8%) teachers. The final data comprised of male (n=291) and female (n=578) with age
6.28±2.27
Table 2: Fit of the original 22-item MBI-ES factor model across cultures: result of confirmatory factor analysis
Models M1 . 3-Factor proposed M2 . 2-Factor alternative M3 . 1-Factor alternative
χ2 [df] 1601.62[206] 1922.04[208] 2417.86[209]
X/df 7.78 9.24 11.57
CFI/TLI .67/.63 .60/.56 .48/.43
RMSEA .08 .09 .11
SRMR .09 .10 .11
Comparison M1 - M2 M1 - M3
Note: df= degree of freedom; TLI= Tucker Lewise Index; CFI= comparative fit index; RM SE= root mean squar approximation; SRM R=standardized root mean square residuals; AIC= Akaike’s information criterion. ***p<.001
42
Dχ2[Dd
320.41[ 816.23
ACCEPTED MANUSCRIPT Table 3: Descriptive statistics, CFA factor loadings, validity and reliability of the original 22-item and reduced 16-item MBI-ES Overall item statistics Skewness Kurtosis MeanSD 3.382.02
.41 -.02 .12 .91 .39 .97 -.48 1.26 1.25
4.342.06 3.842.23
-1.11 -.56 -1.44 -.48 -1.17 -.26 .06 .66 .46
RI
2.731.96 3.442.04 2.711.93 5.082.14 2.421.74 2.381.83 5.911.54
-1.53 -1.17 -.55 -.45 -1.82 -.71 -.81 -.63
1.58 .17 -1.19 -1.05 2.55 -.99 -.38 -.87
2.16 2.01 1.60 2.65 1.39
5.53 3.95 1.51 6.32 1.24
SC
5.691.75 4.912.18 4.721.99
NU
6.091.50 4.942.22 5.351.69 5.071.88 2.953.35
MA
1. Emotional Exhaustion MBI 1. I feel used up MBI 2. Emotionally drained MBI 3. I’m working too hard MBI 6. frustrated MBI 8. Fatigued when I get up MBI 13. burned out MBI 14. Working with people MBI 16. At the end of my rope MBI 20. People puts too much stress on me Personal Accomplishment MBI 4. I feel exhilarated MBI 7. I feel energetic MBI 9. I deal with emotional problems MBI 12. I can understand my students MBI 17. I have accomplished many things MBI 18. I deal effectively with problems MBI 19. I’ m positively influencing MBI 21. I can create atmosphere 3. Depersonalization MBI 5. students blame me ♦ MBI 10. More callous towards people♦ MBI 11. Job is hardening me MBI 15. I don’ t really care ♦ MBI 22. Some students were impersonal object
1.911.59 2.051.64
Standardized CFA Load 22-Item MBI-ES 16-i .85 a .77(.60) .73 .72(.52) .71 .74(.55) .78 .58(.33) .60 .67(.45) .66 .21(.04) __ .04(.00) __ .36(.13) __ .45(.21) __ .71 a .51(.26) .45 .61(.37) .54 .59(.34) .65 .39(.15) .35 .49(.24) .64 -.11(.01) __ .41(.17) .38 .41(.17) .40 .77 a .17(.03) __ .75(.56) .74 .57(.33) .59 .49(.24) .49 .35(.12) .34
PT
Items: shortened items
EP T
ED
1.611.44 2.271.39 Note: CFA Standardized estimates and squared multiple correlations (R2) were recorded from the three-factor M BI-ES model in the cross-cultural sample. Items: 5, 13, 14, 16, 18, & 20 are excluded in the reduced M BI-ES due to low loadings and cross loadings. a= Cronbach’s alpha estimates for internal consistency ; b= square roots of average variance explained for discriminant validity; CR= Composite reliability, AVE= average variance explained, M SV= maximum shared variance; ♦Items with violations of normality (Log10 transformed). Item loadings are significant at ***p<.001 and **p<.01.
ms abbreviated
ional Exhaustion S 1 S 2 S 3 S 6 S 8 S 13 S 14 S 16 S 20 nalAccomplishment S 4 S 7 S 9 S 12
AC C
Table 4: Country-level and Cross-cultural exploration of the MBI-ES factor structure: Results of exploratory factor analyses (EFA) Factor Loadings
1 .82 .80 .85 .63 .70 .31 .31 .38 .41
-.20 -.21
GHANA
2
3
.15
.12 .12
.56
.60 .71 .68 .36
.24 .32 .65 -.10 .64 .49
Factor Loadings
1
2
.81 .84 .82 .59 .70 .35 .26 .32 .58
.21 .16 .12
.19 .14
.52 .52 .48 .55
.25 -.24
.54 -.16
CHINA
3
.28 .24 .61 -.21 .57 .41
.21
43
Factors Loadings
1 .74 .73 .65 .34 .68 .38 .28 .30 .26
-.15 -.17
2
-.13 -.10
-.19
.61 .50 .67 .52
P AKISTAN
Factor Loadings CROSS
3
1
.21
.80 .77 .82 .54 .63 .32 .30 .35 .33
.27 .57 .29 .38 .52 .55 .52
-.21 -.22 -.24
2 .10
.11
.60 .65 .64 .44
3 .16 .11 .11 .28 .27 .59 .61 .65 .55
.19
-CULTU
ACCEPTED MANUSCRIPT
S 17 S 18 S 19 S 21 rsonalization S 5 S 10 S 11 S 15 S 22
.39 -.14
.42
.57 .10 .47 .36
-.18 -.42 -.24 -.19
-.31 -.21 -.10 -.20 -.12
-.24 .51 .62 .43 .40
.14
.52 .67 .72 .60 .28
.14 -.15 .12
-.10
.18 -.24
-.14 .25 .74 .66 .54 .50
.17 .14
.52 .15 .45 .45
-.35 .27
.60 .48 .50 .48
.48 .73 .62 .69 .55
-.12
.30 -.10 -.19 -.14
-.22 -.31 -.18 -.13 -.25 .63 .68 .56 .44
SC
RI
PT
Note: M BI-ES= M aslach Burnout Inventory, Educators Survey; substantial loadings rounded .40 are bolded, crossloadings rounded30 are italicized. h2= item communalities (recorded from the cross-cultural sample).
Table 5: Fit of the reduced 16-item MBI-ES factor model: result of confirmatory factor analysis (χ2 [df])
CFI/TLI
NU
X/df
RMSEA SRMR
Comparison
(∆χ2 [df])
2.53 4.35 5.72
.91/.90 .79/.73 .70/.62
.06 .09 .10
.06 .09 M1a-M2a .11 M1a-M3a
176.37[2]** 310.89[3]**
244.02[92] 402.65[94] 635.24[95]
2.65 4.28 6.69
.90/.90 .80/.75 .65/.56
.06 .09 .11
.06 .09 M1b -M2b .11 M1b -M3b
158.63[2]** 391.22[3]**
335.69[92] 660.54[94] 1027.27[95]
3.65 7.03 10.81
.92/90 .81/.76 .69/.61
.06 .08 .11
.05 .09 M1 -M2 .11 M1 -M3
324.85[2]** 691.58[3]**
ED
MA
232.77[92] 409.14[94] 543.66[95]
EP T
Models Calibration sample M1a. 3-Factor proposed M2a. 2-Factor alternative M3a. 1-Factor alternative Validation sample M1b . 3-Factor proposed M2b . 2-Factor alternative M3b . 1-Factor alternative Cross-Validation Whole sample M1 . 3-Factor proposed M2 . 2-Factor alternative M3 . 1-Factor alternative
AC C
Note: df= degree of freedom; TLI= Tucker Lewise Index; CFI= comparative fit index; RM SE= root mean squar approximation; SRM R=standardized root mean square residuals; AIC= Akaike’s information criterion. ***p<.001
Table 6: Validity of the reduced 16-item MBI-ES tested within several structural equation modelling Model
Cross-cultural model Workload Emotional demands Work family conflict Role conflict Total stress
Standardized Regression Coefficients Emotional Exhaustion Depersonalization Personal Accomplishment S.t Est (S.E)/R2 S.t Est (S.E)/R2 S.t Est (S.E)/R2 .18***(.11)/.03 .14***(.07)/.02 .29***(.07))/.09 .15***(.06)/.02 .27***(.03)/.07
.07+(.06)/.004 .07*(.04)/.01 .13***(.04)/.02 .11**(.03)/.01 .14***(.02)/.02
-.01(.08)/00 -.08*(.03)/.02 -.02(.06)/00 -.05(.04)/.002 -.12**(.03)/.06
Model fit RMSEA
.02 .03 .04 .02 .02
Note: St. Est.= Standardized Estimates, S.E=Standard Error, R=Explained variance of the M BI-ES latent factors due to the predictors (work stressors). Each row represent one SEM , Degree of freedom for all model =1, RM SEA= root mean squar approximation; CFI= comparative fit index; TLI = Tucker Lewise Index. + p<.10 *p<.05
44
Statistic
CFI/TLI
.99/.97 .99/.97 .99/.96 .99/.99 .99/.99
ACCEPTED MANUSCRIPT **p<.01 ***p<.001
Table 7: Test of invariance of the reduced 16-item three-factor MBI-ES across cultures; result of MGCFA
χ2 [df]
Models (g: Pk., Cn., Gh.)
CFI
RMSEA
SRMR
Comparison
CFI
.030 .028 .034 .030 .033 .031
.0623 .0628 .0623 .0629 .0634 .0640
M 1a-M 2a M 2a-M 3a M 2a-M 4a M 4a-M 5a M 4a-M 6a
.00 .06 .01 .07 .01
1.76 1.68 2.02 1.79 1.97 1.81
.91 .91 .85 .90 .83 .89
663.40[368] 699.37[407] 888.25[455] 796.97[440] 1061.61[533] 817.84[458]
1.80 1.72 1.95 1.81 1.99 1.79
.91 .91 .86 .90 .83 .89
.030 .029 .033 .031 .034 .030
.0655 .066 .0661 .066 .0656 .0659
M 1b -M 2b M 2b -M 3b M 2b -M 4b M 4b -M 5b M 4b -M 6b
.00 .05 .01 .07 .01
.92 .92 .86 .91 .84 .89
.027 .026 .032 .028 .032 .029
.0585 .0589 .0587 .059 .0591 .0595
M 1 -M 2 M 2 -M 3 M 2 -M 4 M 4 -M 5 M 4 -M 6
.00 .06 .01 .07 .02
RI
SC
NU
830.99[368] 873.46[407] 1263.03[455] 1055.93[440] 1459.17[533] 1110.02[458]
2.26 2.15 2.78 2.40 2.74 2.42
PT
646.35[368] 683.59[407] 917.49[455] 786.01[440] 1049.53[533] 829.79[458]
MA
Calibration sample M 1a. Configural inv. M 2a. Metric inv. M 3a. Scalar inv. M 4a. Partial Scalar inv.a M 5a. Residual inv. M 6a. Structural inv. Cross-validating Validation sample M 1b . Configural inv. M 2b . Metric inv. M 3b . Scalar inv. M 4b . Partial inv.a M 5b . Residual inv. M 6b . Structural inv. Whole sample M 1 . Configural inv. M 2 . Metric inv. M 3 . Scalar inv. M 4 . Partial inv.a M 5 . Residual inv. M 6 . Structural inv.
χ2 /df
-
-
-
AC C
EP T
ED
Note: g = groups involved in the measurement invariance test, Pk.= Pakistan, Cn.=China, Gh.= Ghana, inv.= invariance, TLI= Tucker Lewise Index; CFI= comparative fit index; RM SEA= root mean square approximation; SRM R= standardize root mean square residuals; AIC= Akaike’s information criterion. a. Intercepts of the items “M BI-ES 2, 7, 9, 11, 12” were freely estimated across groups.
Table 8: Test of invariance of the reduced 16-item three-factor MBI-ES across gender; result of MGCFA Models (g: male & female) Calibration sample M 1a. Configural inv. M 2a. Metric inv. M 3a. Strong factorial inv. M 4a. Residual inv. M 5a. Structural inv. Cross-validation Validation sample M 1b . Configural inv. M 2b . Metric inv. M 3b . Strong factorial inv.
χ2 [df]
χ2 /df
CFI
RMSEA
SRMR
Comparison
CFI
RMSEA
568.37[276] 578.82[302] 637.03[334] 688.85[396] 651.85[346]
2.06 1.92 1.91 1.74 1.88
.90 .91 .90 .90 .90
.035 .033 .032 .029 .032
.0623 .0622 .0622 .0623 .0622
M 1a-M 2a M 2a-M 3a M 3a-M 4a M 3a-M 5a
-.01 .01 .00 .00
.0 .0 .0 .0
567.75[276] 604.10[302] 679.55[330]
2.06 2.00 2.04
.91 .90 .90
.035 .034 .035
.0655 .0655 .0657
M 1b -M 2b M 2b -M 3b
.01 .00
.0 -.0
45
ACCEPTED MANUSCRIPT M 4b . Residual inv. M 5b . Structural inv. Whole sample M 1 . Configural inv. M 2 . Metric inv. M 3 . Strong factorial inv. M 4 . Residual inv. M 5 . Structural inv.
755.58[396] 686.68[346]
1.91 1.99
.88 .89
.032 .034
.0655 .0657
M 3b -M 4b M 3b -M 5b
.02 .01
.0 .0
762.59[276] 788.16[302] 911.47[334] 999.29[396] 932.73[346]
2.76 2.61 2.73 2.52 2.69
.92 .92 .91 .90 .90
.032 .030 .032 .030 .031
.0585 .0583 .0584 .0585 .0585
M 1 -M 2 M 2 -M 3 M 3 -M 4 M 3 -M 5
.00 .01 .00 .01
.0 -.0 .0 .0
PT
Note: g = groups involved in the measurement invariance test, inv.= invariance, TLI = Tucker Lewise Index; CFI= comparative fit index; RM SEA= root mean squar approximation; SRM R= standardized root mean square residuals; AIC= Akaike’s information criterion.
AC C
EP T
ED
MA
NU
SC
RI
Fig.1 Trend of burnout syndrome across cultures Fig.2 Trend of burnout syndrome across gender
46
Figure 1
Figure 2