Volume 13 • Number 2 • 2010 VA L U E I N H E A LT H
Reliability and Validity of a Chinese Version’s Health-Related Quality of Life Questionnaire for Hepatitis B Patients vhe_615
324..327
Siew Chin Ong, BPharm (Hons), PhD,1 Seng Gee Lim, MBBS (Hons), FRACP, FRCP, FAMS, MD,2,3,4* Shu Chuen Li, BPham, GradDipBus(Tech Mgt), MApplSc, MBA, PhD5* 1
Department of Pharmacy, National University of Singapore, Singapore; 2Department of Gastroenterology and Hepatology, National University Hospital, Singapore; 3Yong Loo Lin School of Medicine, National University of Singapore, Singapore; 4Centre for Molecular Medicine, Agency for Science, Technology and Research, Biopolis, Singapore; 5Discipline of Pharmacy & Experimental Pharmacology, School of Biomedical Sciences, University of Newcastle, Callaghan, New South Wales, Australia
A B S T R AC T Objectives: To culturally adapt a Chinese version of the Hepatitis Quality of Life Questionnaire (HQLQ) and assess its suitability for use in Chinesespeaking hepatitis B virus (HBV patients in Singapore. Study: Reliability was assessed using Cronbach’s alpha coefficients and intra-class correlation coefficients. Item-to-scale correlation was assessed using Spearman’s rank correlations (r) between scale scores and their constituent items. Convergent and divergent construct validities were tested in three and two a priori hypotheses, respectively, and the correlations were assessed using Spearman’s rank correlation coefficients.
Results: When tested in 134 HBV patients, the test-retest reliability was supported with all scales showing acceptable correlation coefficients (i.e., a > 0.7). Item-to-scale correlations were good with most items highly correlated with their hypothesized scales. Convergent and divergent construct validities were supported by the hypothesized correlations between the HQLQ and the EQ-5D domains. Conclusions: The culturally adapted questionnaire has good validity and reliability for use in Singapore. Keywords: disease-specific instrument, EQ-5D, health-related quality of life, outcomes research, SF-36.
Introduction
Methods
Currently, Health-related Quality of Life (HRQoL) instruments are not routinely used in clinical settings because of various reasons; one of them is the lack of validation study. Recently, we published two studies of HRQoL in patients with chronic hepatitis B using both generic and disease-specific instruments; results from both studies showed a gradual deterioration in HRQoL with progression of liver disease [1,2]. We adapted the Hepatitis Quality of Life Questionnaire (HQLQ) [2], which covers hepatitis specifically and contains the SF-36 health survey as the generic core [3]. The major adaptation of the English version was the changes of English terms not commonly used in Singapore. For example, “weighted down” in the original questionnaire are seldom used locally, and has been replaced with “down” or depressed which are more colloquial and well accepted in Singapore. Similarly, “a wonderful adventure” in the original HQLQ was replaced with “exciting” as the original phrase is found to be abstract by focus group participants. As Singapore is a multiracial country with a sizeable ethnic Chinese population (75%), validating a Chinese version of the HQLQ for those who cannot understand English well would be necessary. Therefore, the main objective of this study was to validate a Chinese version of the HQLQ for use in Chinesespeaking hepatitis B patients in Singapore.
Study Design
Address correspondence to: Shu Chuen Li, Discipline of Pharmacy & Experimental Pharmacology, School of Biomedical Science, University of Newcastle, Callaghan, NSW 2308, Australia. E-mail: Shuchuen.li@ newcastle.edu.au 10.1111/j.1524-4733.2009.00615.x *These senior authors contributed equally to this article.
324
A cross-sectional, multi-phase, institutional review board approved study was conducted at the National University Hospital (NUH) with informed consent obtained from all participating subjects.
Translation and Cultural Adaptation of the Instrument As the Singapore Chinese version of the SF-36 has been validated, Chinese translation of the original English version of the HQLQ was only performed for the additional scales from the HQLQ (five generic items, two generic scales: health distress and positive well-being, as well as hepatitis-specific limitations and hepatitis-specific health distress scales) according to the standard guidelines suggested by Guillemin [4] as follows: forward translations were performed independently by two bilingual translators; reconciliation of both forward versions; another two translators independently translated the reconciled Chinese version back into English. The final version was obtained after a pilot testing of the revised questionnaire in eight local Chinesespeaking individuals.
Validation Study Patient recruitment. The finalized Chinese version of HQLQ was validated in a sample of hepatitis B virus (HBV) patients attending outpatient clinics at NUH. Other inclusion criteria were above age 18 and ability to self-complete the questionnaires in Chinese. Aside from providing their socio-demographic information, participants were asked to fill in the HQLQ and EQ-5D questionnaires before or after their clinic appointment. For test-retest reliability, HBV patients were given a second set of the questionnaire, and were asked to complete the
© 2009, International Society for Pharmacoeconomics and Outcomes Research (ISPOR)
1098-3015/10/324
324–327
325
A Questionnaire for Hepatitis B Patients Table 1
Descriptive statistics and reliability for each scale of Hepatitis Quality of Life Questionnaire (HQLQ; n = 134)
SF-36 components Physical functioning Role physical Bodily pain General health Vitality Social functioning Role emotional Mental health HQLQ-specific component scales Health distress Positive well-being Hepatitis-specific limitations Hepatitis-specific health distress
Number of items
Missing data (%)
Minimum score
Maximum score
% showing floor effect
% showing ceiling effect
Cronbach’s alpha
10 4 2 5 4 4 4 5
2.2 0.0 0.7 1.5 0.7 0.0 0.0 0.7
28.57 0.0 22.0 0.0 5.0 0.0 0.0 8.0
100.0 100.0 100.0 100.0 95.0 87.5 100.0 100.0
0.0 3.7 0.0 0.7 0.0 0.7 3.0 0.0
40.3 63.4 44.0 0.7 0.0 0.0 63.4 0.7
0.864 0.830 0.720 0.750 0.764 0.775 0.777 0.798
4 4 3 4
0.7 0.7 0.7 0.0
0.0 5.0 26.7 0.0
100.0 100.0 100.0 100.0
1.5 0.0 0.0 1.5
20.9 3.7 44.8 31.3
0.964 0.929 0.912 0.966
questionnaire at least 3 days from their first test and return the questionnaire using a return-business envelope. Psychometric properties analyses. Reliability was assessed by Cronbach’s alpha coefficients (for internal consistency) and intraclass correlation coefficients (for test-retest reliability) [5,6]. For interpretation, Cronbach’s alpha coefficients above 0.7 and 0.9 are generally regarded as acceptable for conducting group comparisons and assessing individual subjects, respectively [6]. Scoring assumptions of the HQLQ were investigated at item level by examining item-scale convergent validity [7]. Item-toscale correlation (corrected for overlap) was assessed using Spearman’s rank correlations (r) between scale scores and their constituent items with r ⱖ 0.4 considered as acceptable [5]. The scale score distributions were evaluated by computing the percentage of respondents achieving either the highest possible score (ceiling) or the lowest possible score (floor) [3]. Construct validity of the HQLQ at scale level was investigated by examining the correlations between HQLQ and EQ-5D [8] domains. As the SF-36 has been validated previously in Singapore [9], only those additional HQLQ scales were tested. Convergent and divergent validities were assessed using Spearman’s rank correlation coefficients (r), with a rho value >0.5 considered as strong correlation, 0.35 to 0.5 as moderate correlation, and 0.2 to 0.34 as weak correlation [10]. Two other a priori hypotheses based on clinical expectation were generated for convergent construct validity where moderate-to-strong correlations (i.e., correlation coefficient ⱖ0.35) were expected between domains measuring similar constructs, namely: 1) HQLQ Health Distress, Positive Well-being, and Hepatitis-specific Health Distress with EQ-5D Anxiety/ depression; and 2) HQLQ Hepatitis-specific Limitations with EQ-5D Mobility and Usual Activities. Another two a priori hypotheses were generated for divergent construct validity where weak correlations were expected between domains measuring dissimilar constructs, namely: 1) HQLQ Health Distress, Positive Well-being, and Hepatitis-specific Health Distress with EQ-5D Mobility, Self-care, Usual Activities, and Pain/discomfort; and 2) HQLQ Hepatitis-specific Limitations with EQ-5D Anxiety/ depression and Self-care. All statistical analyses were performed using SPSS for Windows version 13.0 (SPSS Inc., Chicago, IL). Statistical significance for all tests was set at 0.05.
Results
Cultural Adaptation of Instrument The Chinese version of the culturally adapted HQLQ was well accepted in the pilot testing; the respondents generally found the
questionnaire clear and easy to understand and to answer. Hence, the adapted questionnaire was used in the subsequent validation study without any further revisions.
Sample Characteristics for Cross-Sectional Validation Study From November 2003 to November 2006, all Chinese outpatients with chronic hepatitis B infections were approached to participate in this study. In total, 134 chronic hepatitis B patients (47 asymptomatic carrier, 33 chronic hepatitis B, 24 compensated cirrhosis, 9 decompensated cirrhosis, 9 hepatocellular carcinoma, and 12 post-liver transplants) completed the questionnaires. The percentage of nonresponders and patients with reported comorbidities was less than 1% and 6%, respectively, so the data have not been analyzed further. The average time taken to complete the questionnaire was 20 minutes (range: 10–30 minutes). The mean age of the participants was 50.7 years (SD: 12.3; range of 22–80 years) with 64.2% being male. Additional socio-demographic information will be made available upon request.
Psychometric Properties The descriptive statistics and reliability coefficient (Cronbach’s alpha) of the scales were detailed as per Table 1. Missing data for each scale were low (<2%). Ceiling effect was observed for most scales with Role Physical and Role Emotional scales showing the highest percentage (63.4%) of ceiling effect, respectively. The other notable ceiling effects were observed in the Hepatitisspecific Limitation, Bodily Pain, and Physical Functioning scales (44.8%, 44.0%, and 40.3%, respectively). Although floor effects were observed in six scales, the percentage of respondents scoring the floor in these scales was generally less than 4%. The internal consistency reliability coefficients, alpha values were good with a > 0.7 in all the scales and half of these were >0.8. Notably, alpha values for all the HQLQ-specific component scales were even >0.9. A total of 28 subjects participated in the test-retest analysis. Acceptable correlation coefficients (i.e., a > 0.7) were observed in all the scales, with half of these showed a value greater than 0.8 (Table 2). Item-to-scale correlations were good with all items highly correlated with their hypothesized scales, with only one item each from Physical Functioning and Role Emotional scales (r = 0.21 and 0.36, respectively) below the cutoff value (r ⱖ 0.4) recommended for scale construction (Table 2). For convergent construct validity, moderate-to-high correlations were presented for six of nine subhypotheses (r between
Ong et al.
326 Table 2 Test-retest reliability of scales (n = 28) and item-to-scales correlation (n = 134)
SF-36 scales Physical functioning Role physical Bodily pain General health Vitality Social functioning Role emotional Mental health Hepatitis Quality of Life Questionnaire-specific scales Health distress Positive well-being Hepatitis-specific limitations Hepatitis-specific health distress
Range of convergent item-scale correlation
Median convergent item-scale correlation‡
Single measure ICC*
95% CI†
0.82 0.76 0.70 0.93 0.75 0.79 0.90 0.87
0.64–0.91 0.54–0.88 0.45–0.85 0.85–0.97 0.53–0.88 0.60–0.90 0.80–0.95 0.73–0.94
§
0.21–0.89 0.44–0.83 0.89–0.94 0.69–0.78 0.64–0.84 0.64–0.86 § 0.36–0.84 0.65–0.79
0.69 0.82 0.91 0.71 0.75 0.73 0.83 0.76
0.85 0.75 0.74 0.88
0.65–0.94 0.53–0.88 0.49–0.87 0.76–0.94
0.92–0.93 0.88–0.94 0.91–0.94 0.92–0.96
0.93 0.91 0.94 0.92
*Intra-class Correlation Coefficient. †95% confidence interval (CI) of single measure ICC. ‡Medians were calculated from a range of correlation coefficient for each item in a scale. §The item from the Physical Functioning scale asked patients about their limitation in bathing or dressing themselves, while the item from the Role Emotional scale ascertained patients’ ability in performing work or other activities.
0.39 and 0.51) of the two a priori hypotheses. On the other hand, divergent construct validity was supported by all a priori hypotheses, with all scales from HQLQ correlated weakly (r between 0.01 and 0.27) with EQ-5D domains measuring dissimilar constructs (data will be made available upon request).
Discussions To the best of our knowledge, this is the first Chinese diseasespecific HRQoL instrument adapted and validated for HBV patients in Asia. The significant ceiling effect observed in the present study especially in Role Physical and Role Emotional scales were also reported previously by Bayliss et al. [3] and our validation study of the English version of this questionnaire [2]. Nevertheless, this effect might be overcome by substituting the current dichotomous response choices with five level response options used in SF-36 V2 or HQLQ™ (Version 2) [11,12]. In the current study, the internal consistency reliability coefficients, Cronbach’s alpha, of the multi-item scales were good with a > 0.7 in all the scales. This is consistent with the results achieved by our previous validation study of the English version and supports the assertion that the Chinese version of HQLQ has good internal consistency reliability. Test-retest reliability was also strongly supported with all the 12 scales showing acceptable correlation coefficients (i.e., a > 0.7). In the measurement of item-to-scale correlations, one item each from the Physical Functioning and Role Emotional scales showed correlation coefficient below the cutoff value. The item from Role Emotional scale also reported the below cutoff value in our previous validation study for the English version of HQLQ [2]. Likewise, this item was also shown to have a relatively low correlation with its parent scale in the previous validation study in hepatitis C patients, probably due to its low rates of endorsement [3]. Nevertheless, as correlation coefficient of 0.36 from this item is close to the recommended cutoff value, i.e., r ⱖ 0.4, more validation studies are needed to allow a conclusive decision whether this item should be removed or moved to other scales. The construct validities of the instrument are supported by the expected correlations between the HQLQ scales and the EQ-5D domains measuring similar or dissimilar constructs, again consistent with our validation study of the adapted English version of HQLQ [2].
Finally, we acknowledged several limitations of the present study, the first being the cross-sectional study design of the study which precluded the evaluation of the instrument’s responsiveness. In addition, background information of nonrespondents was not collected in this study. Nevertheless, given the low nonresponse rate of this study (<1%) and the consistency in response from the groups, it seemed that our sample would be a reasonably good representation of the target population.
Conclusions Our study found evidence to support the current Chinese version of HQLQ as culturally appropriate, valid, and reliable in a sample of Chinese-speaking Singapore residents with various stages of HBV infection.
Acknowledgments We would like to express our gratitude to research nurses Amelia Cheok, Belinda Mak and Winnie Chua for their assistance during data collection. We would also like to thank all of the nurses and staffs from Unit Digestive Centre, National University Hospital for their help and cooperation in facilitating the study. Source of financial support: This study was funded by NHG grant NHGRPR/01086 and National University of Singapore Faculty Research Fund.
References 1 Ong SC, Mak B, Aung MO, et al. Health related quality of life in chronic hepatitis b patients. Hepatology 2008;47:1108–17. 2 Ong SC, Lim SG, Li SC. Cultural adaptation and validation of a questionnaire for used in hepatitis B patients. J Viral Hepat 2009;16:272–8. 3 Bayliss MS, Gandek B, Bungay KM, et al. A questionnaire to assess the generic and disease-specific health outcomes of patients with chronic hepatitis C. Qual Life Res 1998;7:39–55. 4 Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol 1993;46:1417–32. 5 Fayers PM, Machin D. Quality of Life: Assessment, Analysis and Interpretation. Chichester: John Wiley & Sons, 2000. 6 Nunnally JC, Bernstein IH. Psychometric Theory (3rd ed.). New York: McGraw-Hill, 1994.
A Questionnaire for Hepatitis B Patients 7 Ware JE, Gandek B. Methods for testing data quality, scaling assumptions, and reliability: the IQOLA project approach. J Clin Epidemiol 1998;51:925–32. 8 Rabin R, de Charro F. EQ-5D: a measure of health status from the EuroQol Group. Ann Med 2001;33:337–43. 9 Thumboo J, Fong KY, Machin D, et al. A community-based study of scaling assumptions and construct validity of the English (UK) and Chinese (HK) SF-36 in Singapore. Qual Life Res 2001;10: 175–88. 10 Juniper EF, Gordon HG, Roman J. How to develop and validate a new health-related quality of life instrument. In: Spilker B,
327 ed. Quality of Life and Pharmacoeconomics in Clinical Trials (2nd ed.). Philadelphia, PA: Lippincott-Raven Publishers, 1996. 11 Ware JE, Kosinski M, Dewey JE. How to Score Version 2 of the SF-36 Health Survey. Lincoln, RI: QualityMetric Incorporated, 2000. 12 Bayliss MS, Maruish ME. Hepatitis Quality of Life QuestionnaireTM, (Version 2) [HQLQv2TM] Administration and Scoring Supplement. Lincoln, RI: QualityMetric Incorporated, 2004.