Evaluating symptoms in chronic obstructive pulmonary disease: validation of the breathlessness, cough and sputum scale©

Evaluating symptoms in chronic obstructive pulmonary disease: validation of the breathlessness, cough and sputum scale©

Vol. 97 (2003) (SUPPLEMENT A), $59-$70 IIIl~11111lllll]Ui~lll][lll|l~ Evaluating symptoms in chronic obstructive pulmonary disease: validation of th...

9MB Sizes 0 Downloads 64 Views

Vol. 97 (2003) (SUPPLEMENT A), $59-$70

IIIl~11111lllll]Ui~lll][lll|l~

Evaluating symptoms in chronic obstructive pulmonary disease: validation of the Breathlessness, Cough and Sputum Scale© N. K. LEIDY,I* J. K. SCHMIER, I M. K. C.JONES, 2 J. LLOYD 3 AND

K. ROCCHICaOLP I Center for Health Outcomes Research, MEDTAP International, Bethesda, MD, U.S.A.; 2AstraZeneca Pharmaceuticals LP,Wilmington, DE, U.S.A.; 3AstraZeneca R&D Charnwood, Loughborough, U.K.

Abstract This study examined the reliability and validity of a new measure for evaluating symptoms in patients with chronic obstructive pulmonary disease (COPD): the Breathlessness, Cough and Sputum Scale (BCSS©). Designed as a daily diary, the BCSS is a patient-reported outcome measure that asks patients to rate the severity of the three symptoms, each on a 5-point scale; higher scores indicate more severe symptoms. Item scores are summed to yield a total score. Secondary analysis of data were from two multinational trials (n=629; 765). The BCSS item and total scores exhibited evidence of internal consistency (oc = 0.70 daily; 0"95 to 0.99 over time) and reproducibility (ICC = 0"77 to 0'88). Correlations (r) with pulmonary function (FEV 1% predicted, PEF) were -0"01 (n.s.) to -0"36 (P < 0.001). Correlations with the St George's Respiratory Questionnaire total and SF-36 Physical Functioning subscale were 0.44to0-59 (P<0.001). Breathlessness and total scores differentiated patients by disease severity (P < 0"0 I) and rescue medication use (P < 0"0 I). Cough, sputum, and total scores increased with sputum volume (r=0-27, 0.30, 0.3 I; P < 0.001). Patients for whom treatment was moderately or highly effective reported significant improvements in BCSS scores (P < 0.0001). Results suggest that the BCSS is a reliable, valid, and responsive patient-reported outcome measure of symptom severity in patients with COPD. © 2003 ElsevierScienceLtd

INTRODUCTION Chronic obstructive pulmonary disease (COPD) refers to a cluster of diseases (including chronic bronchitis and emphysema) in which patients experience progressive reduction in expiratory airflow with symptoms of breathlessness, cough, and sputum production. Although pulmonary function is an important objective marker of the degree of airway obstruction, the patient's self-report of the severity of these symptoms is key to successfully monitoring disease activity, adjusting treatment, and evaluating outcomes (I). There are surprisingly few instruments to assess symptom severity and symptom outcomes in this population. The Baseline Dyspnoea Correspondenceshouldbe addressedto: Nancy Kline Leidy,PhD, Center for Health Outcomes Research,MEDTAPInternational,710 I WisconsinAvenue,Suite600, Bethesda,MD 20814,USA.Telephone: + I 301 654-9729;Facsimile:+0 301 654-9864; E-mail:[email protected]

Index (BDI)/Transition Dyspnoea Index (TDI) (2), modified Medical Research Council (MRC) Dyspnoea Scale (3), Oxygen Cost Diagram (4) and Borg Scale (5) have been used to evaluate breathlessness, but do not address cough or sputum.The length and 3-month recall period of the Chronic Lung Disease Severity Index (6) is not conducive to evaluating treatment outcomes in short-term clinical trials. Condition-specific healthrelated quality of life (HRQL) measures, such as the St George's Respiratory Questionnaire (SGRQ) (7) or the Chronic Respiratory Questionnaire (CRQ) (8), assess the impact of symptoms on daily functioning and wellbeing rather than symptom severity per se.To date, there has been no published, validated (reliable, valid, responsive) measure to assess day-to-day symptom severity in COPD. The purpose of this study was to assess the reliability, validity, and responsiveness of the Breathlessness, Cough and Sputum Scale (BCSS©). The BCSS is a patientreported outcome measure that was designed as a daily

RESPIRATORYM EDICINE

$60

diary in which subjects are asked to record the severity of three symptoms of COPD: breathlessness, cough, and sputum (AppendixA). Each symptom is represented by a single item which is evaluated on a 5-point Likert-type scale ranging from 0 to 4, with higher scores indicating a more severe manifestation of the symptom.Total score is expressed as the sum of three item scores, with a range of 0 to 12. Weekly and period (baseline, treatment, follow-up) scores can be computed by aggregating daily scores over time.

METHODS Design This study used secondary analyses of blinded data from two multicentre, multinational, randomized, double-blind, placebo-controlled clinical trials evaluating the safety and efficacy of AR-C68397AA (Viozan TM, sibenadet HCI), a novel dual dopamine D2-receptor/132-adrenoreceptor agonist for use in patients with COPD (9-14). Both trials involved a 7- to 14-day baseline period followed by 4 (trial I) or 6 (trial 2) weeks of treatment. The data analysis plan was formulated by the first two authors (NKL, JKS) based on the study protocols before receiving the data; these authors performed the analyses and were blind to treatment throughout the evaluation.

Study subjects Patient selection criteria were similar in both trials. All patients were between 45 and 70 years of age with a medical diagnosis of COPD = 2 years, a smoking history, and a per cent predicted forced expiratory volume in one second (FEV1% predicted) between 20% and 70%.Exclusion criteria included a complicating comorbid condition or need for domiciliary oxygen. In trial 2, patients must have recorded a BCSS total score of 2 or more on 7consecutive days during the baseline period. The research protocols were approved by the appropriate institutional review boards, and all patients gave written informed consent prior to enrolment in the trial.

Measures Clinical assessments Baseline FEV1% predicted, peak expiratory flow (PEF), rescue medication use, and sputum volume were used as clinical indicators for assessing the construct validity of the BCSS. Evening PEF values (Mini-Wright~ PEF meter) and rescue medication use during the previous 24 hours were recorded by patients each day in a diary. For the purposes of these analyses,these values were expressed as the mean daily PEF value and mean number of actuations per day during the baseline period. Sputum volume was gathered in trial I, during the last 24 hours of the baseline period.

Symptoms and Health-Related Quality of Life The modified Borg scale was used ~to assess breathlessness prior to spirometry at baseline in trial I (I 5). Specifically, patients were asked to describe their level of dyspnoea on a scale ranging from 0, indicating 'nothing at all', to 10, indicating 'maximal impairment'. The SGRQ, a condition-specific HRQL measure, was used to evaluate the validity of the BCSS.The instrument consists of 76items and three subscales (Symptoms, Activity, and Impacts [on daily life]) that can be aggregated to yield a total score. Seven of the items address activity-related dyspnoea (Part 2, Section 2), and four items address cough (Part 2, Section 3, 4). Subscale and total scores range from 0 to 100, with higher scores indicating poorer HRQL (16).The SGRQ has been used widely in research involving patients with COPD (7,16-24). It has been suggested that a change of 4.0 or greater in SGRQ total score be considered clinically meaningful (I 6). The Medical Outcomes Study Short-Form 36 Health Survey (SF-36), a generic HRQL measure, was also used to validate the BCSS (data gathered in trial I only).This 36-item measure comprises eight subscales that can be aggregated into two summary scores: the Physical Component Summary (PCS) and the Mental Component Summary (MCS) (24,25). Scores range from 0 t o 100, with higher scores indicating better HRQL.The SF-36 is widely used to evaluate burden of illness and outcomes of treatment in a variety of health conditions, including COPD (2,26-33), and is often used to assess construct validity of newly developed patient outcome measures.

Analyses Unless otherwise specified, baseline data were used in the analyses.

Missing data and distributional characteristics The rate of missing data and the distributional properties of the BCSS scores were examined first. Of particular interest were floor (the number and percentage of patients reporting the lowest possible score on the measure) and ceiling (number and percentage of patients reporting the highest possible score; worst symptoms possible) effects. These values indicate the extent to which an instrument is capable of detecting symptomatic improvement or deterioration (respectively) over time or with treatment.

Reliability The measurement error associated with the BCSS was examined by assessing its internal consistency and reproducibility (test-retest reliability). The internal

EVALUATING SYMPTOMS IN COPD

consistency reliability of the daily BCSS total score and total and item scores aggregated over time were assessed using the Cronbach's formula for coefficient alpha (0c). Reproducibility of scores from baseline to treatment was assessed using the intraclass correlation coefficient (random-effects model), with Pearson product-moment correlations and mean score differences used to further assess score stability. These analyses were performed across treatment groups in a subsample of patients for whom both physician and patient agreed that the treatment produced no change or was only mildly effective.

Validity Pearson product-moment correlations between BCSS scores and FEV1 % predicted, evening PEF, sputum volume (trial 2) and HRQL (SGRQ and SF-36 [trial I]) were used to assess the construct validity of the BCSS. Correlations between the BCSS and the Symptom subscale of the SGRQ, dyspnoea- and cough-specific items from the SGRQ, and the Borg scale (trial I) were used to assess the measure's concurrent, convergent and divergent validity. The BCSS was also subjected to two tests of discriminant validity.Analysis of variance (ANOVA) with post hoc Scheff~ test was used to test the extent to which BCSS scores varied by level of disease severity (35,36). The Student's t-test was used to compare BCSS scores for patients in the lower 25th and upper 25 th percentile of rescue medication use.The primary concern in these analyses was the performance of the Breathlessness item of the BCSS, although the discriminant validity of the Cough, Sputum and total scores was also assessed.

Responsiveness The responsiveness (sensitivity) of the BCSS to withingroup change (symptom improvement) and betweengroup differences (improved vs. stable) was also examined using the paired t-test and Student's t-test, respectively. Data were analysed across treatment group in a subsample of patients categorized as improved based on patient and physician agreement that treatment was moderately or highly effective.The method for identifying stable patients was described previously.

Exploratory analyses Three alternative methods for assessing symptom outcomes in COPD using BCSS diary data were also examined: symptom-free days (a score of 0 on BCSS total score), symptom relief ('good') days (0 or I on each BCSS item), and easy ('EZ') breathing days (Breathlessness score of 0 o r I, regardless of other symptoms). 'Good' and 'EZ' breathing days would offer

$61

two new methods for evaluating symptom outcomes in COPD. Because results from the two trials were comparable for these exploratory measures, only results from analyses of data from trial I are presented.

Significance testing and interpretation All statistical analyses were conducted using Statistical Analysis Software (SAS) version 6.12 (SAS Institute, Cary, NC), a significance level of 0-05, and 2-tailed tests. Because this was a psychometric evaluation, no adjustments were made for multiple comparisons. Correlation coefficients (r) and effect size estimates for within- and between-group differences were interpreted using guidelines proposed by Cohen (36), in which + 0" I 0 is considered small, ± 0"30 moderate, and + 0-50 large.To assist in interpretation of change scores, data were examined for convergence between mean change in BCSS total and item scores, per cent change, statistical effect size and change in HRQL.

RESULTS Sample Demographic and clinical characteristics of the two study populations used in this analysis are presented in Table I. Across trial populations, most subjects were male, mean FEV 1% predicted was less than 45%, and mean duration of COPD was greater than 5 years.

Missing data and distributional characteristics Missing data for patients who did not withdraw from the trial was less than 5% across all days. Mean baseline BCSS total scores were 4.14 + 2.20 and 5.09 ± 1.99 in the two studies. The distribution was near normal, with median values of 4 and 5 and modes of 3 and 6, respectively. Although the distribution fell on the low half of the 13-point scale, floor effects were small (< 2%), indicating symptomatic improvement could be detected.The small ceiling effects (< I% of the patients reported the highest score possible) also indicate there is room for score movement with increases in symptom severity. Of the three items, Breathlessness mean scores were the highest (trial I: 1.7 ± 0.9; trial 2:2-0 ± 0.7), followed by Cough (I .4 ± 0.9; 1.7 + 0"9) and Sputum (I. 10 + 0"88; 1.37 ± 0-87). Floor effects were lowest for the Breathlessness item (8%; I%), followed by Cough (9%; 4%) and Sputum ( 19%; 12%).

Reliability Internal consistency levels (~) of the day-to-day BCSS total score during the baseline and treatment periods ranged from 0-70 to 0.76 in trial I and from 0"68 to 0-80

$62

in trial 2 (data not shown).The range of ~ levels for BCSS scores over the baseline period ranged from 0.94 to 0.97 (Table 2). Intraclass correlation coefficients for the BCSS total score ranged from 0-77to0.78 (Table2). Pearson product-moment correlation coefficients were similar. BCSS total scores changed from 4. I (± 2.2) during the baseline period to 3"7 (± 2.0) during treatment in trial I (9% change, effect size of 0" 12) and from 5"0 (± 1.9) at baseline to 4.9 (± 1.9) during treatment in trial 2 (3% change; effect size = -0-07).

Validity Pearson product-moment correlation coefficients testing the construct and concurrent validity of the BCSS are shown in Tables 3 and 4. Correlations between the BCSS scores, FEV 1% predicted and PEF were in the predicted direction and small to moderate in magnitude, with the Breathlessness score the strongest correlate (Table 3). Correlations between the BCSS and sputum volume were also in the predicted direction, with Sputum score the strongest correlate. Sputum volumes by BCSS item responses for the same 24-hour period are shown in Figure la,b,c. Patients recording high scores on the Breathlessness, Cough, or Sputum items during this time

RESPIRATORYMEDICINE

period (scores of 3 or4) produced significantly more sputum than those reporting little or no symptom difficulty (scores of 0 or I; P < 0.05, P < 0-0001, and P < 0"000 I, respectively). Correlations between BCSS total and item scores and SGRQ total, Activity, and Impacts scores, tests of construct validity, were moderate and statistically significant, with the strongest correlations between BCSS and SGRQ total scores (Table4). The relationships between BCSS total and item scores and the generic SF-36 were also in the predicted direction, small to moderate in magnitude, and statistically significant. Correlations between BCSS scores and Symptoms subscale, dyspnoea- and cough-specific scores from the SGRQ and the modified Borg scale (trial I) indicate concurrent validity and provide data on the convergent and divergent validity of the three items comprising the BCSS. Correlations between the BCSS and the SGRQ Symptoms subscale were statistically significant, moderate to large in magnitude, and consistent across the two trials (Table 3). The correlation between the Breathlessness score and Borg scale score was also moderate (Table 3); the fact that this relationship was stronger than the correlation between the Borg scale score and either Cough or Sputum scores is evidence of the convergent and divergent validity of the BCSS.

EVALUATING SYMPTOMS IN COPD

Similar results were observed with dyspnoea- and cough-specific items from the SGRQ (Table3). Correlations between the dyspnoea-specific item scores from the SGRQ and BCSS Breathlessness item were moderate and stronger than correlations between these SGRQ item scores and the BCSS Cough or Sputum scores. Similarly, correlations between cough-specific item scores from the SGRQ and BCSS Cough scores were stronger than correlations between the SGRQ Cough item scores and BCSS Breathlessness score. SGRQ Cough scores also correlated with the BCSS Sputum score. Mean BCSS total scores increased as disease severity (FEV1% predicted, American Thoracic Society severity classification (34) increased (Figure 2).The Breathlessness score exhibited the strongest evidence of discriminant

$63

validity, differentiating among the four severity groups in both trials (F=7.03, R2=0.04, P < 0.001; F-20.61, R2=0• 10, P < 0.001, respectively; data not shown). Results were similar using the European Respiratory Society severity classification (35),with mean BCSS total scores increasing as disease severity increased (from 3-6 to 4.3 in trial I [F=2.53, R2=0-0 I, P < 0.09] and from 4.6 to 5.3 in trial 2 [F=6.45, R2=0.02, P < 0.01 ]). The BCSS total score was also able to discriminate between high and low rescue medication use in both data sets (Figure3). The BCSS Breathlessness score differentiated the two groups (t=-3.26, P< 0.01; t=-10.28, P<0"0001) as did Sputum score (t=-2.34, P < 0"05; -3.65, P < 0.001). The BCSS Cough scores differentiated the groups in trial 2 only (t=-2.91, P < 0-0 I; data not shown).

RESPIRATORYMEDICINE

$64

(a) 60A

50,m

4028 ± 54

~> 30. E20-

12±14

~10.

1

0

20± 19

17±19

I

Mild

None

1

"1"

Moderate

25 + 22

1 1 Marked

Severe

Response option for breathlessness

(b) 60 Q

E 40

~

E 20 3O. 10 ~

1

29 ± 24

3o

0

8±13

16 ±33

1

/

None

r Rare

35 ± 33

1 1

28 ± 24

, Occasional

1

Frequent

Almost constant

Response option for cough

(c) 60A

50.

31 ± 24

~ 30-

20 ± 36

~ 20•" 10O,

/ 1

39 ± 32

~'40-

7±11

1 None

1

Mild

T

1

~l

Moderate

1

18±26

r

Marked

1

Severe

Response option for sputum FIGURE I.

Mean sputum volume (£ SD) by BCSS response option; (a) breathlessness,(b) cough and (c) sputum.

Responsiveness Significant improvements in BCSS total and item scores were observed for patients categorized as having improved during the course of treatment (Figure4). Patients in the 'improved' group reported significant improvement in BCSS total and item scores from baseline to the end of treatment, with scores significantly better than those of the stable group. With the exception of the Sputum score in trial I, all betweengroup differences (improved vs. stable patients) were statistically significant. To facilitate the interpretation of changes in BCSS scores, data were examined for points of convergence between mean change, per cent change, and statistical effect size for BCSS scores and change in HRQL (Table 5). In trial I, the mean score change o f - I " I on the BCSS total score corresponded to 28% improvement in

symptom severity and a moderate to large effect size. For patients with HRQL data, the mean improvement in SGRQ total score exceeded the guideline of 4.0, indicating a clinically meaningful change. In trial 2, the mean change in BCSS total score of-0"9 corresponded to an 18% improvement and a moderate to large effect size, and the mean change in SGRQ total score exceeded the guideline for clinically meaningful change.

Exploratory analyses Patients who improved during trial l experienced significantly more symptom-free days than patients who were stable during the 28-day treatment period (5.7 + 8.5 vs. 2" I + 5.9 days; P < 0.05).These patients also experienced significantly more symptom-relief ('good') days (16.1 + 11.2 vs. 10.6 + 10.8) and'EZ' breathing days (18.7+ 10.6 vs. 13.4+ 11.3;P<0.05).

EVALUATING SYMPTOMS IN COPD

$65

7.0 ~, 6.0 i-

5.0

~

4.0

0 ~w

3.0

~

2.0

o

1.0

in

5.1

5.5

0.0

t ~0~

+o,O t o ~

I Trial 2

Trial 1 Disease severity*

Trial 1: NS; Trial 2: F=5.39, R2=0.03, P < 0.001. *ATS severity criteria based on FEV1% predicted (34). FIGURE 2.

Mean BCSStotal scores of patientsclassifiedby diseaseseveritybased on FEV1% predicted and ATS severity criteria.

DISCUSSION This secondary analysis examined the distributional properties, reliability, validity, and responsiveness of the BCSS, a new three-item measure for evaluating the severity of three symptoms associated with COPD: breathlessness, cough and sputum. The distribution of BCSS scores was near normal, with low floor and ceiling effects, indicating the measure is capable of detecting

improvement or deterioration in symptoms. Internal consistency reliability levels of the daily BCSS total score exceeded the recommended criterion of 0-70 considered sufficient for predictive or construct validation research (13). Internal consistency levels over time exceeded the 0"80 criterion for experimental research and the standard of 0.95 when individual-level data are being used for decision-making purposes (37).

$66

RESPIRATORYMEDICINE

7.0

5.8

~ ' 6.0

4.6

5.0 t¢u

~2~22222, "////////~

4.5

4.0

~////////,

o o 3.0

//.~/.~////.

.=

Y//////,~ /////////~

•"

2,0

~///////~

0 =1 1 . 0

9"////////

"////////~ 0.0

!

Lowest quartile 1.9 (0.9)

Highest quartile 10.7 (4.3)

Lowest quartile 0.5 (0.6)

Mean actuations per day (± SD)

Highest quartile 10.2 (4.0)

Mean actuations per day (+

Trial 1

so)

Trial 2 Rescue medication

use

Trial 1 : t = -2.82, P < 0.01; Trial 2: t = -6.58, P < 0.001

FIGURE 3.

Mean BCSStotal score of patients classifiedby rescue medication use.

Trial 1 Total* Breathlessness* Cough t I* S* I* S* I* S*

Trial 2

Sputum I"

Total* Breathlessness* Cough*

S*

I

S

I

S

I

S

Sputum* I

--0.2

-0.3 -0.4

///z

-0.4

Y///

-0.4

~//.~ -0.6

i / / / / / / ,

/ / / /

-0.8

"////

-0.9 -1 -1.1 -1.2

Patients classified as I = Improved or S = Stable according to patient and physician opinion of efficacy; *P < 0.01, t p < 0-001, P < 0.05. FIGURE 4.

Mean BCSSchangescore by changegroup (improved vs. stable):within-group and between-groupdifferences.

Reproducibility estimates for the BCSS suggest scores are replicable with values meeting or exceeding values reported for measures such as the Symptoms subscale of the SGRQ (21,38,39).

The validity of the BCSS was evaluated by examining the relationship between the item and total scores and FEV 1% predicted, PEF, sputum volume, modified Borg scale, and indicators of HRQL, including the specific

EVALUATING SYMPTOMS IN COPD

impact of cough and dyspnoea. Results suggest the BCSS item scores are valid indicators of the severity of patient breathlessness, cough and sputum, respectively. Correlations between FEV1% predicted and the BCSS Breathlessness score were small to moderate in magnitude and consistent with correlations between FEV1% predicted and BDI and CRQ Dyspnoea subscale score (r=0.31 and 0.35, respectively) reported by Hajiro etal. (21) and with the MRC dyspnoea scale score (Spearman p=0.37) reported by Eltayara et al. (3). One of the advantages of the BCSS is the simultaneous inclusion of breathlessness, cough and sputum assessments. Results suggest BCSS scores are valid indicators of the severity of cough and sputum in patients with COPD. Scores for both items were correlated with sputum volume. Patients who reported 'severe' cough and sputum (scores of 3 or 4) produced significantly more sputum than those reporting little or no symptom difficulty (scores of 0 or I). Scores on these items also correlated with cough-specific HRQL items from the SGRQ. Correlations between the BCSS total scores and indicators of HRQL suggest that the BCSS total score is a valid summary indicator of symptom severity.Correlations between the SGRQ Symptom and BCSStotal scores were similar to reported correlations between this SGRQ subscale and resting Borg (r=0.43) (40) and the Oxygen Cost Diagram (r=0.46) (41) and larger than correlations between SGRQ symptoms score and daily assessments of dyspnoea (r=0.27), sputum (r=0.30) and bronchitic symptoms (r=0"26) (42). The BCSS total score also increased as the degree of airway obstruction increased and discriminated between high and low rescue medication users. These associations suggest that the measure is sensitive to variation in disease severity. Results of this study also suggest that the BCSS total and item scores are sensitive to within-group change and between-group differences. Patients in the 'improved' group experienced 5 more 'good' days and over 5 more

$67

'EZ' breathing days during the 28-day treatment period than those in the stable group. This suggests that the daily diary approach may offer a new method for understanding symptomatic improvement in the COPD population in terms of number of 'good' and 'EZ' breathing days gained with treatment. Based on per cent change, effect size, mean HRQL changes and the number of 'good' days observed in the improved patients, the mean change of + I point on the BCSS total score appears to be a substantial improvement in symptom severity for patients with moderate to severe COPD. The BCSS was designed to evaluate the severity of three symptoms associated with COPD: breathlessness, cough, and sputum. Results of this study suggest that the BCSS is reliable, valid and responsive to change. Information about an instrument's reliability and validity accrues over time, and confidence in the instrument increases as it is used in repeated investigations by various scientists and in diverse conditions and settings. Results of these secondary analyses indicate that further use of this patient-reported outcome measure as a clinical tool and outcome measure in intervention studies involving patients with COPD is warranted.

ACKNOWLEDGMENTS The BCSS© was developed by Jane Lloyd and Karen Rocchiccioli of AstraZeneca. Copyright for the BCSS is owned by AstraZeneca. Nancy Kline Leidy and Jordana Schmier were funded by AstraZeneca to design and perform the analyses described. M. Kathryn C.Jones,Jane Lloyd and Karen Rocchiccioli are employees of AstraZeneca. The authors thank Vivian Alderfer, Chris Blango, Mitch Goldman,Trish Hepburn, Libby Keating, and Libby Pethick of AstraZeneca for their insightful comments on the data analysis plan, study report, and manuscript; Christine Thompson for SAS programming; and Dr Leah Kleinman of MEDTAP International Inc. for comments on an earlier version of the manuscript.

$68

REFERENCES I. National Heart, Lung, and Blood Institute. Global Initiative for Chronic Obstructive Lung Disease. Bethesda, MD: US Department of Health and Human Services, Public Health Service, National Institutes of Health; 2001. Publication Number 2701A. Available from: http://www.goldcopd.com/workshop/index.html. 2. Mahler DA, Faryniarz K, Tomlinson D, et aL Impact of dyspnea and physiologic function on general health status in patients with chronic obstructive pulmonary disease.Chest 1992; 102:395-40 I. 3. Eltayara L, Beck.lace MR, Volta CA, Milic-Emili J. Relationship between chronic dyspnea and expiratory flow limitation in patients with chronic obstructive pulmonary disease. Am J Respir Crit Care Meal 1996; 154: 1726-1734. 4. McGavin CR, Artvinli M, Naoe H, McHardy GJR. Dyspnoea, disability, and distance walked: comparison of estimates of exercise performance in respiratory disease.BMJ 1978; 2:241-243. 5. Borg GA. Psychophysical basesof perceived exertion. Med Sci Sports Exerc 1982; 14:377-38 I. 6. Selim AJ, Ren XS, Fincke G, Rogers W, Lee A, Kazis L.A symptombased measure for the severity of chronic lung disease:results from theVeterans Health Study. Chest 1997; I I I: 1607-1614. 7. Jones PW, Quirk FH, Baveystock CM, Littlejohns P.A self-complete measure of health status for chronic airflow limitation. The St. George's Respiratory Questionnaire. Am Rev Respir Dis 1992; 145: 1321-1327. 8. Guyatt GH, Berman LB,Townsend M, Pugsley SO, Champers LW.A measure of quality of life for clinical trials in chronic lung disease. Thorax 1987; 42: 773-778. 9. Hasani A,Toms N,Agnew JE, Lloyd J, SabinA, Dilworth JRThe effect of multiple doses of ViozanT M (AR-C68397AA) compared with salbutamol on lung mucociliary clearance and sputum production in smokers with COPD [abstract]. Am J Respir Crit Care Med 2000; 161: A820. 10. Ind PW, Sandstrom T,Wallaert B, Deamer L, Nystrom P,Rocchiccioli K.Tolerability profile ofViozanT M (AR-C68397AA):A novel therapy for symptomatic treatment of COPD [abstract]. Am J Respir Crit Care Med 2000; 161: A82 I. II. Laitinen LA, Laursen LC0 Wouters E, Lloyd J, Blackshaw R, Rocchiccioli K. Efficacy of ViozanT M (AR-C68397AA) versus salbutamol and ipratropium bromide in the management of COPD [abstract]. Am J Respir Crit Care Med 2000; 161: A490. 12. Wenzel S, Ind PW, Laursen LC, Deamer L, Nystrom P, Rocchiccioli K. ViozanT M (AR-C68397AA) reduces breathlessness, cough and sputum production in COPD patients [abstract]. Am J Respir Crit Care Meal 2000; 161: A490. 13. Wenzel S, Lindquist A, Sandstrom T, Deamer L, Nystrom P, Rocchiccioli K.ViozanT M (AR-C68397AA) improves health-related quality of life in patients with COPD [abstract].AmJ Respir Crit Care Med 2000; 161: A490. 14. YoungA, Dougall I, BlackhamA, Hallam C, Harper S,Taylor C, Brown R, Ince E Novel dual D2-receptor and Beta2-adrenoceptor agonists for the treatment of airways diseases [abstract]. Am J Respir Crit Care Med 2000; 161: A436. 15. Mahler DA, Donohue JF, Barbee RA, et al. Efficacy of salmeterol xinafoate in treatment of COPD. Chest 1999; I 15" 957-965. 16. Jones PW, Bosh TK. Quality of life changesin COPD patients treated with salmeterol. Am J Respir Crit Care Med 1997; 155:1283-1289. 17. Bestall JC, Paul EA, Garrod R, Garnham R, Jones PW,Wedzicha JA. Usefulness of the Medical Research Council (MRC) dyspnoea scale as a measure of disability in patients with chronic obstructive pulmonary disease. Thorax 1999; 54: 581-586. 18. Eiser N,West C, Evans S,JeffersA, Quirk E Effects of psychotherapy in moderately severe COPD: a pilot study. Eur RespirJ 1997; I 0: 1581-1584. 19. Gallefoss F, Bakke PS, Rsgaard PK. Quality of life assessment after patient education in a randomized controlled study on asthma and chronic obstructive pulmonary disease.Am J Respir Crit Care Med 1999; 159: 812-817.

RESPIRATORYMEDICINE

20. Hajiro T, Nishimura K, Tsukino M, Ikeda A, Koyama H, Izumi T. Analysis of clinical methods used to evaluate dyspnea in patients with chronic obstructive pulmonary disease.Am J Rest~irCrit Care Meal 1998; 158:1185-1189. 21. Hajiro T, Nishimura K, Tsukino M, Ikeda A, Koyama H, Izumi T. Comparison of discriminative properties among disease-specific questionnaires for measuring health-related quality of life in patients with chronic obstructive pulmonary disease.Am J Respir Crit Care Med 1998; 157: 785-790. 22. Ketelaars CA,Abu-Saad HH, Schlosser MA, Mostert R,Wouters EE Long-term outcome of pulmonary rehabilitation in patients with COPD. Chest 1997;112: 363-369. 23. National Emphysema Treatment Trial Research Group. Rationale and design of The National Emphysema Treatment Trial: a prospective randomized trial of lung volume reduction surgery.The National EmphysemaTreatment Trial Research Group. Chest 1999; 116: 1750-1761. 24. Ware JE, Snow KK, Kosinski M, Gandek B. SF-36 Health Survey Manual & Interpretation Guide. Boston, MA: The Health Institute, New England Medical Center, 1993. 25. Ware JE Jr., Kosinski M, Keller SD. SF-36 Physicaland Mental Health Summary Scales:A User's Manual. Boston, MA:The Health Institute, New England Medical Center, 1994. 26. Alonso J, Prieto L, Ferrer M, et al for the Quality of Life in COPD Study Group.Testing the measurement properties of the Spanish version of the SF-36 Health Survey among male patients with chronic obstructive pulmonary disease.J Clin Epidemiol 1998; 51: 1087-1094. 27. Mahler DA, Mackowiak JI. Evaluation of the short-form 36-item questionnaire to measure health-related quality of life in patients with COPD. Chest 1995; 107:1585-1589. 28. McKenna RJ, Brenner M, Gelb AR, et aI.A randomized, prospective trial of stapled lung reduction versus laser bullectomy for diffuse emphysema.J Thorac Cardiovasc Surg 1996; I I I: 317-322. 29. May ML, Ingenito EP, Mentzer SJ, Evans RB, Reilly JJ Jr. Healthrelated quality of life improves following pulmonary rehabilitation and lung volume reduction surgery. Chest 1999; I 15: 383-389. 30. Prieto L, Alonso J, Ferrer M, Anoto JM. Are results of the SF-36 health survey and the Nottingham Health Profile similar? A comparison in COPD patients. Quality of Life in COPD Study Group.J Clin Epidemiol 1997; 50: 463-473. 3 I. Stavem K, Boe J, ErikssenJ. Health status, dyspnea,lung function and exercise capacity in patients with chronic obstructive pulmonary disease. IntJ Tuberc Lung Dis 1999; 3: 920-926. 32. Stavem K, Erikssen J, Boe J. Performance of a short lung-specific health status measure in outpatients with chronic obstructive pulmonary disease.Respir Med 1999; 93: 467-475. 33. Wyrwich KW, Tierney WM, Wolinsky FD. Further evidence supporting an SEM-basedcriterion for identifying meaningful intraindividual changes in health-related quality of life.J Clin Epidemiol 1999; 52:861-873. 34. American Thoracic Society. Lung function testing: selection of reference values & interpretive strategies. Am Rev Respir Dis 1991; 144: 1202-1218. 35. Siafakas NM, Vermeire P, Pride NB, et aL Optimal assessment and management of chronic obstructive pulmonary disease (COPD). The European Respiratory SocietyTask Force. Eur RespirJ 1995; 8: 1398-1420. 36. Cohen J. Stati~cal PowerAnalysis far the Behavioral Sciences2rid edn. Hillsdale, NJ: Lawrence Erlbaum Associates, 1988. 37. Nunnally JC, Bernstein IH. Psychometric Theory 3rd edn. New Yorlc McGraw-Hill, lnc, 1994. 38. Harper R, Brazier JE, Waterhouse JC, Waiters sJ, Jones NMB, Howard R Comparison of outcome measures for patients with chronic obstructive pulmonary disease (COPD) in an outpatient setting. Thorax 1997;52: 879-887.

E V A L U A T I N G SYMPTOMS IN C O P D

39. Rutten-van Molken M, Roos B, Van Noord JA. An empirical comparison of the St George's Respiratory Questionnaire (SGRQ) and the Chronic Respiratory Disease Questionnaire (CRQ) in a clinical trial setting. Thorax 1999; 54:995-1003. 40. Ferret M,Alonso J, Prieto L, et a/.Validity and reliability of the St George's Respiratory Questionnaire after adaptation to a different language and culture: the Spanish example. Eur Respir J 1996; 9: 1160-1166.

$69

41. JanssensJP, Rochat T, Frey JG, et al. Health-related quality of life in patients under long-term oxygen therapy:,a home-based descriptive study. Respir Meal 1997; 91 : 592-602. 42. SeemungalTA, Donaldon GC, Paul EA, et al. Effect of exacerbation on quality of life in patients with chronic obstructive pulmonary disease.Am J Respir Crit Care Med 1998; 151: 1418-1422.

$70

RESPIRATORYMEDICINE