Beyond memory impairment: cognitive changes in Alzheimer's disease

Beyond memory impairment: cognitive changes in Alzheimer's disease

Archives of Clinical Neuropsychology 17 (2002) 179 – 193 Beyond memory impairment Cognitive changes in Alzheimer’s disease$ Edward Helmesa,*, Truls Ø...

121KB Sizes 4 Downloads 30 Views

Archives of Clinical Neuropsychology 17 (2002) 179 – 193

Beyond memory impairment Cognitive changes in Alzheimer’s disease$ Edward Helmesa,*, Truls Østbyeb,1 a

b

Department of Psychology, University of Western Ontario, London, ON, Canada Department of Epidemiology and Biostatistics, University of Western Ontario, London, ON, Canada Accepted 9 November 2000

Abstract In addition to memory impairment, deficits in other cognitive processes are common in the advanced stages of Alzheimer’s disease (AD). The diagnosis of AD does not consider the relative prevalence of deficits in cognitive areas other than memory. We report on the prevalence of aphasia, apraxia, and other cognitive changes in individuals from a large representative sample of elderly Canadians. The proportion of these symptoms and the relevant neuropsychological test performance were compared in a group of 749 people over 65 years in age with AD and a control group of 563 people without cognitive impairment. Agnosia was less common in both groups than were deficits in complex visuomotor tasks, abstract thinking. aphasia, and constructional defects. The occurrence of all symptoms increased, and levels of performance on relevant neuropsychological tests decreased, with severity of Alzheimer disease. The tests did not, however, distinguish between possible and probable AD. Both these diagnostic groups showed similar levels of performance, which suggests that this distinction is not clinically meaningful. D 2001 National Academy of Neuropsychology. Published by Elsevier Science Ltd. Keywords: Dementia; Alzheimer’s disease; Aphasia; Apraxia; Construction defect; Agnosia

$ An earlier version of this paper was presented at the 103rd Annual Convention of the American Psychological Association, 14 August 1995, New York, NY. * Corresponding author. School of Psychology, Edith Cowan University, 100 Joondalup Drive, Joondalup, WA 6027, Australia. Tel.: 618-9400-5543; fax: 618-9400-5834. E-mail address: [email protected] (E. Helmes). 1 Present address: Department of Community and Family Medicine, Duke University, Durham, NC 27710, USA.

0887-6177/01/$ – see front matter D 2001 National Academy of Neuropsychology. PII: S 0 8 8 7 - 6 1 7 7 ( 0 0 ) 0 0 1 0 9 - 8

180

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

1. Introduction A basic feature of all forms of dementia is the gradual loss of both general and specific cognitive functions (Burns, Jacoby, & Levy, 1991; Zec, 1993). Disorders of language are common in the more advanced stages of many forms of dementia (Miller, 1989a; Thompson, 1987), as are apraxia (Rapcsak, Croswell, & Rubens, 1989), constructional defects (Edwards, Deuel, Baum, & Morris, 1991), and, less commonly, agnosia, the lack of recognition with adequate perception (Lincoln, 1995). In dementia of the Alzheimer type, impairments in these specific domains are part of common diagnostic criteria (American Psychiatric Association, 1987). The criteria do not, however, consider the relative frequency of occurrence of these disorders in dementing older people as the disorder progresses. If some disorders co-occur with memory problems, if they are as prevalent, but are present in different individuals, or occur very infrequently, then their inclusion in the diagnostic criteria does not add to the diagnostic power of the criteria. This report provides further insights into these matters, as well as the occurrence of the various disorders in a comparison group of cognitively intact older people. Language disorders in people with Alzheimer’s disease (AD) are extensive, and include impaired naming, word fluency, comprehension, and word associations (Hart, Smith, & Swash, 1988; Miller, 1989a, 1989b; Murdoch, Chenery, Wilks, & Boyle, 1987; Nicholas, Barth, Obler, Au, & Albert, 1997; Price et al., 1993). The deficits in language arise from the widespread neuronal loss in AD, as the specific language centers become affected. Thus, the term aphasia (Greek: ‘‘no speaking’’) may not be fully appropriate for the range of language difficulties in AD, although the term is often used more broadly to refer also to impairments in speaking (Bayles & Kaszniak, 1987). Apraxia (Greek: ‘‘no action’’) is a term applied to disorders of the organization of complex sequences of movements in space (Lezak, 1995). A distinction is often made between constructional apraxia, which involves drawing tasks and the assembly of two- and three-dimensional objects, and ideomotor apraxia, which is a failure to imitate or perform coordinated movements upon command in the absence of defects in motor control (Lucchelli, Lopez, Faglioni, & Boller, 1993). Both types of deficit have been noted to be impaired in AD (Edwards, Baum, & Deuel, 1991; Kirk & Kertesz, 1991; Rapcsak et al., 1989; Travniczek-Marterer, Danielczyk, Simanyi, & Fischer, 1993). Agnosia (Greek: ‘‘no knowledge’’), the absence of meaning in the presence of normal perception (Lezak, 1995), has not been the focus of much specific research in the context of AD, perhaps because as a complex concept, it has been viewed as part of the general deterioration in abstract thinking and judgement that are common in AD (Zec, 1993). Some specific agnostic disorders, such as anosognosia, the denial of illness, have recently been noted to be fairly common in AD (Starkstein et al., 1995). Abstract thinking (concept formation) and judgement (reasoning) are also abilities that deteriorate with AD. Some authors regard these as distinctly different faculties (e.g., Lezak, 1995), while others treat them as closely related, in part because they are both highly correlated with age and education (Zec, 1993). Impaired reasoning ability is among the more common changes observed in AD, whether finer distinctions are made or not.

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

181

Few studies, if any, have compared the relative frequency of these cognitive disorders in older adults with or without dementia. Here, we describe the frequency of occurrence of these conditions in the Canadian Study of Health and Aging, a large, national representative sample of elderly Canadians with probable or possible AD.

2. Method 2.1. Participants This study was based on a stratified random sample of 10 263 participants from the 10 Canadian provinces (Canadian Study of Health and Aging Working Group, 1994). The working group comprised multidisciplinary teams of clinical and methodological investigators from across Canada. Medicare systems files and electoral rolls were used to select the sample for the 18 study centers. The five major geographic regions were sampled equally, with the age groups over 75 being oversampled because of the higher prevalence of dementia in these groups. Individuals living in the community, as well as institutions, were included. Details of the study design and the sampling methodology are given by the Canadian Study of Health and Aging Working Group (1994). All community participants received an initial screening interview that included the Modified Mini-Mental State Examination (3MS; Teng & Chui, 1987). Individuals in the community scoring below 78/100 and above 50/100 on the 3MS and a subsample of 494 who scored 78 or above, as well as all 1255 institutionalized participants, were invited to participate in a more detailed clinical assessment (N = 2914). This assessment included a clinical neurological examination, nurse’s evaluation, blood tests, and an extensive neuropsychological assessment. Diagnostic criteria from the American Psychiatric Association (1987) were used, and the final diagnosis was based upon a consensus conference of the neuropsychologist, nurse, and physician using all the clinical findings together with a computer algorithm to make a final diagnosis (Graham et al., 1996). This final diagnostic conference also used the NINCDS-ADRDA criteria for severity of dementia, and in the case of AD, whether this diagnosis was possible or probable. A total of 861 people undergoing the clinical assessment were determined to have some form of cognitive impairment but who did not meet the criteria of dementia, while a further 208 were diagnosed with vascular dementia and 175 with other forms of specific or unclassified dementia (Graham et al., 1996). The lack of imaging studies may have led to lower rates of diagnosis of other forms of dementia than if such tests had been performed routinely. Kappa (Cohen, 1960) was 0.81 (84.1% agreement) for the diagnosis of AD/no AD in a subsample of 210 participants who had an independent chart review (Graham et al., 1996). This reliability analysis did not include evaluations of the severity of the dementia or for the distinction between possible and probable AD. MacKnight, Graham, and Rockwood (1999) discuss the factors underlying some of the differences in diagnoses that were observed between physicians and neuropsychologists. The diagnoses used in the current analyses are based upon the final consensus of the case conferences.

182

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

Using the criteria of McKhann et al. (1984), a total of 749 people (193 men and 556 women) with a mean age of 84.3 years (S.D. = 6.83) were determined to have probable (n = 448) or possible AD (n = 301). A comparison group, found to be cognitively normal after the clinical consensus conference, consisted of 563 people (181 males and 382 females), 398 from the community sample and 165 from the institutional sample. The mean age in this group was 78.9 years (S.D. = 6.62). 2.2. Procedure After the clinical assessment, those who scored above 50 on a second 3MS administered during the nurse’s examination underwent a neuropsychological assessment (n = 1843, Tuokko, Kristjansson, & Miller, 1995). The battery of tests used reflected the constructs included in the diagnostic criteria. The final selection of tests was based upon a feasibility study and comprised short forms of three Wechsler Adult Intelligence Scale (WAIS)-R (Wechsler, 1981) subtests (Block Design, Comprehension, and Similarities; Satz & Mogel, 1962), the WAIS-R Digit Symbol subtest, and other measures of memory and higher cognitive functions (Tuokko et al., 1995) (see below). In accordance with the diagnostic criteria used, the study distinguished ideational apraxia (performing integrated serial acts or tool use) or ideomotor apraxia (responding to verbal commands) from constructional apraxia (Zec, 1993) or constructional defect (Lezak, 1995). The terminology used here follows Lezak (1995): constructional defect refers to disorders of drawing and object assembly; apraxia refers to other forms of complex movement disorder. Both constructional defect and apraxia were diagnosed separately at the diagnosis meeting, along with the other diagnostic entities of aphasia and agnosia. Additional consensus judgements were made about defects in judgement and abstract thinking. Aphasia was assessed by an 11-item Token test (Benton & Hamsher, 1989), Animal Naming (Rosen, 1980), naming 12 objects (Visual Identification) from the cued recall paradigm of Buschke (1984), and Verbal Fluency (Controlled Oral Word Association Test; Spreen & Benton, 1977). Verbal fluency is not normally considered a measure of aphasia, but is included here with other measures of language as a result of the decisions made in the formation of the battery (Tuokko et al., 1995). The naming tests were open-ended, with achieved maximum scores of 62 for Verbal Fluency and 27 for Animal Fluency. Visuomotor skills comprised assembly, coding, and drawing tests. The measures used were two tests from the WAIS (Block Design and Digit Symbol) and the sum of three scores (maximum of 10) derived from a figure of interlocking pentagons used in the 3MS screening test. These tests involve primarily construction and object manipulation tasks. The battery used in the study did not have any direct measures of ideational or ideomotor apraxia. Agnosia was measured by the color naming section of the Token test (maximum score of 5) and the number of objects correctly identified by category from the object naming section of the Buschke paradigm (maximum score of 12). Finally, the Similarities and Comprehension subtests of the WAIS-R assessed abstract thinking and judgement, respectively. This test battery has been shown to be able to discriminate among various forms of dementia (Steenhuis & Østbye, 1995). All calculations for the present study were performed using SYSTAT (SPSS, 1998).

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

183

3. Results 3.1. Diagnosis Not all of the 749 patients with probable or possible AD also had the individual cognitive disorders of aphasia, constructional defect, impaired abstract thinking, agnosia, or apraxia. In other words, the number of cases with a diagnosis for a given cognitive disorder was always less than the number with the diagnosis of the severity of AD. Excluding eight cases with missing data, 58.0% (430/741) of those with diagnosed AD were determined by the consensus conference to have aphasia, 51.7% (383/741) apraxia, 34.1% (253/741) agnosia, 87.2% (646/741) impaired abstract thinking, 77.3% (573/741) impaired judgment, and 65.0% (482/741) problems with construction skills. The last column of Table 1 indicates the number of cases with completed diagnoses of the specific disorders from the consensus conference and the total number who were evaluated at the consensus conference. The theoretical maximum for the latter figure is 1304. The denominator for the above figures for each disorder is therefore smaller than the denominators used in Table 1. Individuals who did not complete the full clinical assessment did not receive diagnoses of the individual cognitive disorders. Their data are treated as missing in the analyses of diagnoses of cognitive dysfunction and are omitted from the analyses of the neuropsychological tests (n = 294). An examination of possible bias due to these missing data used a type I error rate of 0.01 to avoid spurious findings due to the large number of comparisons made in this section. Comparisons of gender ratios between those diagnosed with a disorder and those who were not were not statistically significant for any of the six cognitive disorders. While those missing a diagnosis did not differ in age from those with a diagnosis of agnosia, those lacking diagnoses for aphasia, apraxia, constructional and judgement defect, and impaired abstract thinking were younger than those with diagnoses of aphasia: 80.3 vs. 82.6 years, t(1310) = 5.07, P < .001; apraxia: 80.8 vs. 82.6 years, t(1310) = 4.20, P < .001; construction defect: 81.1 vs. 82.4 years, t(1310) = 3.15, P < .001; judgement: 79.7 vs. 82.9 years, t(1310) = 7.45; abstract thinking: 80.0 vs. 82.7 years, t(1310) = 5.93, P < .001).

Table 1 Proportion of diagnoses of cognitive dysfunction by severity of AD Level of impairment Diagnosed disorder

None (N = 563)

Mild (N = 154)

Moderate (N = 279)

Severe (N = 308)

Number of AD cases diagnosed/total

Agnosia Apraxia Aphasia Judgement Constructional defect Abstract thinking

.000 .003 .011 .004 .007 .011

.07 .30 .48 .52 .62 .80

.30 .52 .51 .88 .77 .97

.87 .90 .82 .996 .97 1.00

253/834 384/884 433/952 574/942 484/861 649/968

184

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

3.2. Demographic factors The initial rows of Tables 2 and 3 report age, years of education and 3MS scores for the control, severity level, and probable vs. possible AD groups. The groups with mild and moderate AD showed fewer years of education [ F(2,1218 df ) = 49.8, P < .001], and were older than the control group [ F(2,1271 df ) = 43.1, P < .001]. Post-hoc Tukey HSD tests showed that all three groups differed on 3MS scores [ F(2,746 df ) = 926.3, P < .001]. The same pattern was evident for the possible vs. probable AD grouping: the two groups with AD were older [ F(2,1309 df ) = 103.6, P < .001] and had fewer years of education [ F(2,1168 Table 2 Scores on demographic measures and neuropsychological tests with increasing severity of AD Level of severity None

Mild

Measure or test

n

Age Years of education 3MS Abstract reasoning and judgement Similarities

563 78.9 556 10.6 398 89.7

6.62 267 3.74 257 5.61 179

517

7.8

3.85

114

Comprehension

516

9.7

3.06

112

501 38.6

6.04

109

Aphasia Token test

Mean

S.D.

n

Verbal Fluency

512 28.4

11.57 106

Animal Fluency

508 14.6

3.96 109

Naming

506

11.8

0.47

112

Agnosia Objects

506

11.9

0.43

112

0.17

110

Colors

501

5.00a

Visuomotor Block Design

499 10.2

Digit Symbol

476 28.7

Drawing

537

a

9.1

4.57 107 11.49

93

1.58 139

Effect Mean 82.0 6.96 65.4

3.1a 1.2 6.1a 1.2 30.5a 1.3 16.9a 0.9 8.9 1.4 11.2 1.3

S.D.

Size

Moderate n

5.72 0.47 444 3.90 0.97 408 11.58 4.33 101

2.83 1.22 0.73 3.09 1.18 1.01 9.47 1.56 10.02 0.87 3.30 0.83 1.69 3.63

114 115

1.34 107 0.99 106 1.44 107 1.28 107

Mean 82.7 8.3 53.5

2.5a 1.4 5.5a 1.4

Effect S.D.

Size

h2

7.09 0.57 0.137 4.04 0.61 0.077 14.73 6.54 0.700

2.56 1.38 0.422 0.67 3.01 1.37 0.350 1.00

30.7a 1.4 14.0a 1.2 7.1 1.9 10.8 2.1

8.28 1.77 8.89 0.77 3.13 0.79 1.52 3.26

1.31 0.283 1.24 0.295 1.89 0.660 2.13 0.159

11.5 0.8 4.96a 0.2

1.31 0.93 108 3.04 0.23 0.24 109 1.55

11.2 1.7 4.91a 0.6

1.40 1.63 0.108 3.26 0.46 0.53 0.011 3.08

4.3a 1.3 13.8a 1.3 6.4 1.7

3.75 1.29 102 0.70 8.07 1.30 78 0.70 3.27 1.71 238 2.07

3.9a 1.4 10.6a 1.5 5.4 2.4

4.11 1.68 0.410 0.90 9.10 1.58 0.452 0.96 3.24 2.34 0.484 2.05

Means sharing the same superscript are not significantly different by Tukey’s honestly significant difference procedure. The second row for each variable gives the standard scores based upon the mean and S.D. for the control group. The mean for the comparison group was 0.0, with an S.D. of 1.0 within rounding error.

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

185

Table 3 Comparison of controls with patients having possible and probable AD Control Mean

Possible Alzheimer’s

Test or measure

n

Age Years of education 3MS Abstract Reasoning and Judgement Similarities Comprehension Aphasia Token test Verbal Fluency Animal Fluency Naming Agnosia Objects Colors Visuomotor Block Design Digit Symbol Drawing

563 78.9 556 10.6 398 89.7

6.62 448 84.5a 6.80 0.85 3.74 360 8.4a 3.79 0.59 5.61 154 54.5 16.97 6.27

517 516

7.8 9.6

3.85 104 3.06 105

501 512 508 506

38.6 28.4 14.6 11.8

506 11.9 501 4.98a 499 10.2 476 28.7 537 9.1

S.D.

n

Mean S.D.

3.0a 5.8a

Probable Alzheimer’s

Effect size n

Effect size h2

301 84.1a 6.88 0.79 255 8.4a 3.66 0.59 101 59.8 13.22 5.33

2.73 1.25 3.08 1.24

132 131

8.68 9.18 3.35 1.67

1.24 1.16 1.72 1.70

0.43 101 11.5a 0.17 99 4.9a 4.57 93 4.3a 11.49 73 11.9a 1.58 235 4.3a

6.04 97 31.1a 11.57 96 15.0a 3.96 100 7.8a 0.47 101 11.0a

Mean S.D.

2.6a 5.7a

0.076 0.126 0.713

2.66 1.35 3.04 1.27

0.437 0.354

126 30.1a 123 15.8a 124 8.1a 126 10.9a

9.23 9.94 3.33 1.59

1.41 1.09 1.64 1.91

0.277 0.294 0.659 0.154

0.98 0.93 0.22 0.47

127 11.2a 127 4.9a

1.60 1.63 0.44 0.47

0.102 0.007

4.28 1.29 8.24 1.46 3.72 3.04

123 3.9a 103 12.4a 321 4.4a

3.66 1.38 9.08 1.42 3.59 2.97

0.467 0.454 0.706

a Means sharing the same superscript are not significantly different by Tukey’s honestly significant difference procedure.

df ) = 48.9, P < .001]. All three groups differed on the 3MS by post-hoc Tukey HSD testing [ F(2,650 df ) = 758.0, P < .001]. 3.3. Severity level The hypothesis that the prevalence of all types of cognitive dysfunction would increase with the severity of AD was tested by use of the consensus rating at each study center of the severity of the AD (mild, moderate, or severe) using DSM-III-R criteria (American Psychiatric Association, 1987). Eight cases did not receive severity ratings, leaving a total sample of 1304 for this analysis (370 males and 934 females). At the level of clinical diagnosis, not all individuals showed the deficits, even at the most severe level of impairment. Table 1 reports the relative frequency of the diagnoses of aphasia, apraxia, agnosia, and constructional defects, excluding the 294 individuals from the comparison group who did not undergo the neuropsychological assessment and consensus diagnosis. The linear increase in frequency of diagnosed cognitive disorder was statistically significant for all diagnosed disorders: abstract thinking [c2(1, n = 968) = 120.4, P < .001]; judgement [c2(1, n = 942) = 623.5, P < .001], agnosia [c2(1, n = 834) = 398.1, P < .001]; aphasia [c2(1, n = 952) = 10.9, P < .001], apraxia [c2(1, n = 884) = 460.3, P < .001], and constructional defects [c2(1, n = 861) = 208.8, P < .001], using a statistic for trends in contingency tables due to Maxwell, described by Marascuilo and McSweeney (1977; pp. 198–202). In all six

186

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

cases, the values of the residual c2 were sufficiently large to suggest that the actual relationship was curvilinear. This finding is consistent with the common observation of a relatively slow decline in the early stages of AD that is followed by a period of more rapid deterioration. Table 2 gives the means for the age, education, and mean MMSE scores and for the neuropsychological tests at the different levels of severity of AD, excluding the group with the most severe dementia because of the comparatively small number of cases in that group (n = 4). (A copy of the table containing the results pertaining to this group can be obtained by contacting the first author). It should be noted that higher scores indicate better performance on all the tests used, and the overall pattern of results followed the expectation of lower performance with increasing severity of AD, with the exception of the Token test, which showed a modest increase in scores in the moderate AD group over the mild AD group. In addition, not all test scores showed parallel rates of decline, and some were better preserved than others at the level of moderate impairment. In order to demonstrate this, the scores for the two AD groups were converted to z scores by using the mean and S.D. of the control group. These values are also reported in Table 2, along with values of h2 and simple effect sizes to indicate the relative amount of variance accounted for by the independent variable for that particular test. An initial multivariate analysis of variance was performed to determine if there were significant differences across levels of severity. This test was significant [Pillai’s trace = 0.517, F(22,1222) = 19.37, P < .001]. A type I error rate of 0.01 was selected for the univariate tests because of the number of comparisons being made. The Tukey honestly significant difference procedure was used to maintain the type I error rate within each posthoc analysis. The abstract reasoning and judgement measures both showed the same pattern of decline. The overall effect for Similarities was significant, F(2,742) = 156.40, P < .001. The post-hoc tests showed that the scores of the two AD groups did not differ from one another. Comprehension showed a similar pattern, F(2,740) = 129.67, P < .001, with Tukey’s test showing the same pattern as for Similarities. Language test scores showed roughly comparable declines, except for the verbal fluency measure, which did not decline as quickly as the other measures in mild AD. Token test scores showed two outliers by the criterion of more than four Studentized residual units, and these individuals were deleted from the analysis, although the interpretation of the results remained the same. The corrected test showed significant differences, F(2,712) = 100.85, P < .001, with the control group significantly greater than both groups with dementia, which did not differ from one another by Tukey’s test. The naming test showed all three groups to differ among one another (Table 2) by Tukey’s test [overall F(2,722) = 53.48, P < .001]. Scores on this measure were quite skewed. Consequently, a nonparametric Kruskal–Wallis analysis was performed. This showed the same results [c2(2, n = 725) = 116.1, P < .001]. The fluency measures also had overall statistically significant differences, but these were more pronounced with the animal fluency measures than with the Verbal Fluency scores, F(2,721) = 238.07, P < .001 and F(2,724) = 106.24, P < .001, respectively. The post-hoc Tukey tests also showed more differences among groups for the Animal Fluency measure than for the F-A-S measure. All three groups differed significantly from one another on Animal Fluency, while only the controls differed from the two AD groups for Verbal Fluency.

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

187

Both language tests that showed lower scores in the moderate AD group, Naming and Verbal Fluency, also were statistically significantly different from the mild AD group. The measures of agnosia showed little absolute differences across the three groups. The large samples led to statistically significant results for Object Identification, F(2,723) = 39.22, P < .001, but not for Color Identification, F(2,720) = 4.21, P > .01. Object Identification scores for all three groups differed significantly from one another by the post-hoc Tukey tests. Distributions of both agnosia measures were skewed. Kruskal–Wallis nonparametric analysis of variance showed the same results. Object Identification remained statistically significant [c2(2, n = 720) = 94.6, P < .001], but Color Identification did not differ significantly across the three groups [c2(2, n = 720) = 7.1, P > .01]. Visuomotor test scores showed more pronounced changes with increased severity of AD. Block Design scores dropped to less than half the level of the nondemented group, F(2,705) = 144.44, P < .001, after the deletion of one outlier. The control group differed significantly from both AD groups, which did not differ from one another on the Tukey tests. One outlier was also deleted from the Digit Symbol analysis, which showed similar levels of decline, F(2,644) = 145.41, P < .001. The post-hoc Tukey tests showed that the AD groups differed significantly from the control group, but not from each other. The Drawing test showed the greatest sensitivity to the degree of severity, F(2,911) = 220.66, P < .001. The Tukey post-hoc test showed all groups differed significantly from one another. The neuropsychological tests clearly did not show equivalent changes in performance from mild to moderate AD. The standard scores for the two AD groups (Table 2) show the effects most clearly. Some tests showed modest rates of decline of about one S.D. from no cognitive impairment to mild AD, such as Similarities, Comprehension, Verbal Fluency, and Object Identification. Others, such as Drawing and Animal Fluency, showed larger changes with mild AD. Some tests continued to decline from mild to moderate AD, such as Animal Fluency, Naming, Object Identification, and Drawing, while others showed a much less pronounced change (Comprehension, Similarities, Digit Span, Block Design, and the Token test). Animal Fluency showed the greatest association with the severity of AD, while Color and Object Identification showed the least association. 3.4. Probable vs. possible AD Table 3 compares the means for the nondemented comparison group with the subgroups of possible and probable AD for the same basic demographic information and test scores as in Table 2. The same conventions were used for this set of analyses as for those reported in Table 2. In this case, the multivariate analysis of variance was also significant, Pillai’s trace = 0.492, F(22,1232) = 18.29, P < .0001. Values of h2 are also given in Table 3 for the strength of association of the probability independent variable with the neuropsychological tests. The measures of abstract reasoning and judgement showed no distinction across the two groups with AD, both of which differed from the control group. Both effects were roughly the same size, although the proportionate change was less for Comprehension, F(2,750) = 163.9, P < .001, than it was for Similarities, F(2,749) = 132.50, P < .001. Tukey tests showed that the control group differed from both AD groups, which did not differ from each other.

188

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

A similar pattern was shown with the aphasia measures. The control group had the highest scores on the Token test, while the two AD groups did not differ, F(2,721) = 99.95, P < .001, with one outlier removed. The retention of that person did not change the results of the analysis. Verbal Fluency and Animal Fluency showed the same pattern, F(2,728) = 106.96, P < .001 for Verbal Fluency, and F(2,729) = 239.78, P < .001 for Animal Fluency. This result was also the case for the object naming measure, F(2,730) = 56.01, P < .001. The distribution of scores for the naming test was highly skewed, and the Kruskal–Wallis statistic for this variable replicated the parametric result [c2(2, N = 733) = 113.7, P < .001]. Scores on the measures of agnosia showed little differences across groups, although the overall tests were significant for Object Identification, but not for Color Identification, F(2,731) = 37.44, P < .001 and F(2,724) = 2.67, P > .01, respectively. Post-hoc Tukey tests showed that the control group differed from both AD groups on Object Identification. As was the case in the severity analyses, the agnosia measures both had negatively skewed distributions. Kruskal–Wallis analyses done in both cases led to the same conclusions as with the parametric analyses, c2(2, N = 734) = 85.9, P < .001 for Object Identification and c2(2, N = 727) = 5.7, P > .01 for Color Identification. One outlier was removed from the final analysis for each of the Block Design and Digit Symbol visuomotor measures. In both cases, the conclusions remained the same as with all cases included. Both AD groups scored lower than the control group on Block Design, F(2,712) = 148.34, P < .001, and Digit Symbol, F(2,649) = 148.43, P < .001. The same pattern of results with no differences between the two AD groups was found on the Drawing measure, F(2,1090) = 384.58, P < .001 using the Tukey test. The associations between the probability of AD and the neuropsychological measures were stronger than in the case of severity as reflected in the values of h2. The Drawing measure and Animal Fluency both accounted for over two-thirds of the variance, while the agnosia measures again were at 10% or lower. A 2  2 table was formed for the association between the diagnostic severity levels of mild and moderate AD and that of possible/probable AD in order to determine if the probability of AD was significantly associated with the severity of dementia. It was not; Fisher’s exact test, P = .58, Cohen’s kappa = 0.03.

4. Discussion Overall, the results showed that the frequency of problems in the organization and regulation of visuomotor behavior and of abstract thinking and judgement were higher than that of problems with language functions in the group with AD. This was true both for the absolute level of performance on the neuropsychological tests and for the relative frequency of diagnosis. The increase in impairment with increasing severity of dementia is well known, but has been quantified less often, particularly for visuomotor tests. Edwards, Deuel, et al. (1991) reported levels of apraxia at different levels of severity. However, the figures obtained here were somewhat lower at all levels of severity than those reported by Edwards, Deuel, et al., who also used a different method of staging the severity of the dementia, Berg’s (1988)

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

189

Clinical Dementia Rating. Yesavage, Brooks, Taylor, and Tinklenberg (1993) noted that the development of apraxia or aphasia was associated with more rapid deterioration than was the case in people with AD who did not have those disorders. At the same time, Forstl, Burns, Levy, and Cairns (1993) concluded that constructional defects were not specifically related to changes in the parietal lobes of people with advanced AD. More rapid deterioration with severe aphasia was reported by Bracco et al. (1994) and by Kraemer, Tinklenberg, and Yesavage (1994). The present study did not specifically examine rates of deterioration using longitudinal data, but more severe changes in the semantic fluency measure (animals) than for the phonemic fluency (letters) measure were found. These measures are differentially sensitive to frontal and temporal lobe damage (Troyer, Moscovitch, Winocur, Alexander, & Stuss, 1998). The present results are thus consistent with the noted deterioration in both frontal and temporal lobes in AD, and highlight the disruption of frontal–temporal systems. However, this pattern of results is not unique to AD, but is seen both in normal adults and other forms of dementia (Hart et al., 1988; Monsch et al., 1994; Suhr & Jones, 1998). Our results are also consistent with Hodges et al. (1999) in finding greater impairment in semantic fluency over phonemic fluency (see Zec (1993) for a review on this issue). Scores on the agnosia measures remained close to the maximum possible score in all groups. These measures were among the least sensitive to the progression of the disorder, although statistically significant differences in performance were found. There is little known about agnosia in AD, and basic references on the neuropsychology of AD, such as Salmon and Bondi (1997) and Zec (1993) do not refer to the assessment of agnosia. In this study, it was diagnosed least often of the various cognitive dysfunctions at the mild and moderate stages. It is also possible that the object and color identification tasks used in CSHA were not the most sensitive measures that could have been used. These two tasks can be classed as measures of anomia (Greek: ‘‘no name,’’ the inability to report the names of objects) and can be regarded more as measures of language function than of agnosia. In any event, it is clear that further study of agnosia and the other cognitive disorders in dementia are as worthy of study as those of memory functions. More research into agnosia in AD would certainly appear to be in order, as agnosia remains as one of the disorders that can contribute to the diagnosis of AD. The mean scores of the neuropsychological tests suggested that all the test scores did not decline in parallel from the comparison group through the groups with mild and moderate AD. An analysis in which the scores were transformed to standard scores using the comparison group’s mean and S.D. confirmed the impression of differential sensitivity to the general change in overall cognitive functioning. The agnosia measures showed the least change while the Drawing test showed the largest change. The measures of abstract reasoning and judgement from the WAIS-R, Token test, and Verbal Fluency measures showed a pronounced decline in the group with mild AD, but then did not show a further sharp decline, whereas that was not the case for the semantic fluency and naming measures, which showed further declines in moderate AD. Such unequal rates of decline suggest that efforts to find stable subtypes of AD will be highly dependent upon the nature of the tests used in addition to the difficulty of obtaining samples at equivalent stages of cognitive decline. The distinction between possible and probable AD is one that logically is made early in the diagnostic process, prior to there being sufficient evidence of impairment in several

190

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

areas of physical and cognitive functioning to confirm a diagnosis of definite AD. The distinction is commonly repeated in educational material (e.g., Rosser, 1993), and so becomes more widely used in daily practice. The distinction can refer to the likelihood of AD being the cause of the observed cognitive deficits (Blacker et al., 1994). In practice, however, it may be used instead in a less precise manner to describe the severity of the deficits. The present study demonstrates that this latter usage is inaccurate and invalid, in that there were no neuropsychological tests that showed any difference in performance between groups described as possible and probable AD, and that there was no empirical agreement between the two variables. Blacker et al. (1994) found that the distinction between possible and probable AD was notably less reliable than other NINCDS-ADRDA diagnoses. We also replicated Hom and Brewer’s (1991) report of similar results with a different test battery in a small group (n = 42) of people with AD. The autopsy data that would confirm the diagnosis of AD was not collected by the CSHA, and so the more definitive analysis of the utility of the probable/possible AD distinction was not possible. The distinction between mild and moderate AD was supported to a greater extent in that generally test scores were lower in the group classified as moderately demented. This was not uniformly the case, however. One likely cause for this finding is that different cognitive functions are affected at different rates by the disease process. It is likely that clinicians primarily base the distinction between mild and moderate AD upon the severity of the memory impairment and of functional impairment. Cognitive functions that are affected later in the disorder, such as some language and motor functions, would not readily discriminate mild from moderate AD. This point could be addressed better in a longitudinal study that assessed distinct cognitive functions. The procedure used to select cases for neuropsychological assessment led to reduced numbers of people with severe AD in the sample and the number of cases with missing diagnoses of the specific cognitive disorders may both raise concerns over the generalizability of the results. No differences in the frequency of missing diagnoses were noted for gender, and the differences in age were never more than 2 years, a relatively minor difference in view of the advanced age of the sample, with the missing cases invariably being younger for those whose diagnosis was not made. While it is not possible to be entirely confident of the generality of the results, the absence of any major differences between those with missing diagnoses and those with complete data is of some reassurance. The missing data for the neuropsychological tests were most frequent at the highest level of severity, leading to the deletion of this group from some analyses. This group is obviously also the most impaired and the presence of missing data is not surprising. In large part, the missing data reflect the original decision that individuals scoring below 50 on the 3MS would not undergo the neuropsychological test battery (Canadian Study of Health and Aging Working Group, 1994; Graham et al., 1996). Such individuals would be very likely to have moderate to severe cognitive impairment and to be missing from these analyses. Tuokko et al. (1995) detail the causes for missing data from the clinical examination and the neuropsychological assessment. The single largest cause (n = 281) was refusal to participate. The missing data do have implications for the generalizability of the results, and should be considered in their interpretation, although no more than roughly 10% of the sample was missing from any given analysis.

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

191

Overall, this study shows the progression of impairment in specific cognitive functions both in terms of the frequency of diagnosis and in quantitative terms of impaired test performance with increasing severity of AD. The general pattern of results suggests that the likelihood of identifying stable subtypes of AD based upon cognitive dysfunction is not likely due to the apparent differential sensitivity of tests to the overall rate of cognitive decline. The implication of these results for clinical practice is that the choice of tests in the assessment of individuals suspected of having AD should be made with particular care. It is clear that some commonly used tests are comparatively insensitive to the changes in function in AD, particularly during the early stages. Tests that have been shown to be sensitive in the early stages, such as fluency and drawing tasks, are more useful for purposes of early diagnosis than are less sensitive ones. It is also clear that much more research on changes in cognitive functions with increasing severity of AD is in order, particularly with the increasing number of medications being marketed for use in the early treatment of people with AD.

Acknowledgments We thank anonymous reviewers for suggesting the analysis comparing possible and probable AD, the addition of effect sizes, and the use of standard scores to compare the tests.

References American Psychiatric Association. (1987). Diagnostic and statistical manual of mental disorders (3rd ed., revised). Washington, DC: American Psychiatric Association. Bayles, K. A., & Kaszniak, A. W. (1987). Communication and cognition in normal aging and dementia. Boston: Little, Brown. Benton, A. L., & Hamsher, K. (1989). Multilingual aphasia examination: manual of instructions. New York: Psychological Corporation. Berg, I. (1988). Clinical Dementia Rating (CDR). Psychopharmacology Bulletin, 24, 637 – 639. Blacker, D., Albert, M. S., Bassett, S. S., Go, R. C. P., Harrell, L. E., & Folstein, M. F. (1994). Reliability and validity of NINCDS-ARDRA criteria for Alzheimer’s disease. Archives of Neurology, 51, 1198 – 1204. Bracco, L., Gallato, R., Grigorletto, F., Lippi, A., Lepore, V., Bino, G., Lazzaro, M. P., Carella, F., Piccolo, T., Pozzilli, C., Giomotto, B., & Amaducci, L. (1994). Factors affecting course and survival in Alzheimer’s disease: a 9-year longitudinal study. Archives of Neurology, 51, 1213 – 1219. Burns, A., Jacoby, R., & Levy, R. (1991). Progression of cognitive impairment in Alzheimer’s disease. Journal of the American Geriatrics Society, 39, 39 – 45. Buschke, H. (1984). Cued recall and amnesia. Journal of Clinical Neuropsychology, 6, 433 – 440. Canadian Study of Health and Aging Working Group. (1994). Canadian study of health and aging: study methods and prevalence of dementia. Canadian Medical Association Journal, 150, 899 – 913. Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37 – 46. Edwards, D. F., Baum, C. M., & Deuel, R. K. (1991). Constructional apraxia in Alzheimer’s disease: contributions to functional loss. Special Issue: The mentally impaired elderly: strategies and interventions to maintain function. Physical and Occupational Therapy in Geriatrics, 9, 53 – 68. Edwards, D. F., Deuel, R. K., Baum, C. M., & Morris, J. C. (1991). A quantitative analysis of apraxia in senile dementia of the Alzheimer type: stage-related differences in prevalence and type. Dementia, 2, 142 – 149.

192

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

Forstl, H., Burns, A., Levy, R., & Cairns, N. (1993). Neuropathological basis for drawing disability (constructional apraxia) in Alzheimer’s disease. Psychological Medicine, 23, 623 – 629. Graham, J. E., Rockwood, K., Beattie, B. L., McDowell, I., Eastwood, R., & Gauthier, S. (1996). Standardization of the diagnosis of dementia in the Canadian Study of Health and Aging. Neuroepidemiology, 15, 246 – 256. Hart, S., Smith, C. M., & Swash, M. (1988). Word fluency in patients with early dementia of the Alzheimer type. British Journal of Clinical Psychology, 27, 115 – 124. Hodges, J. R., Patterson, K., Ward, R., Garrard, P., Bak, T., Perry, R., & Gregory, C. (1999). The differentiation of semantic dementia and frontal lob dementia (temporal and frontal variants of frontotemporal dementia) from early Alzheimer’s disease: a comparative neuropsychological study. Neuropsychology, 13, 31 – 40. Hom, J., & Brewer, K. K. (1991). Possible vs. probable Alzheimer’s disease: a meaningful distinction? Paper presented at the meeting of the American Psychological Association, San Francisco (August). Kirk, A., & Kertesz, A. (1991). On drawing impairment in Alzheimer’s disease. Archives of Neurology, 48, 73 – 77. Kraemer, H. C., Tinklenberg, J., & Yesavage, J. A. (1994). ‘How far’ vs. ‘How fast’ in Alzheimer’s disease: the question revisited. Archives of Neurology, 51, 275 – 279. Lezak, M. D. (1995). Neuropsychological assessment (3rd ed). Oxford: Oxford Univ. Press. Lincoln, N. B. (1995). The assessment and treatment of disorders of visual perception. Reviews in Clinical Gerontology, 5, 77 – 82. Lucchelli, F., Lopez, O. L., Faglioni, P., & Boller, F. (1993). Ideomotor and ideational apraxia in Alzheimer’s disease. International Journal of Geriatric Psychiatry, 8, 413 – 417. MacKnight, C., Graham, J., & Rockwood, K. (1999). Factors associated with inconsistent diagnosis of dementia between physicians and neuropsychologists. Journal of the American Geriatrics Society, 47, 1294 – 1299. Marascuilo, L. A., & McSweeney, M. (1977). Nonparametric and distribution-free methods for the social sciences. Monterey, CA: Brooks/Cole. McKhann, G., Drachman, D., Folstein, M., Katzman, R., Price, D., & Stadlan, E. M. (1984). Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of the Department of Health and Human Services Task Force on Alzheimer’s disease. Neurology, 34, 939 – 944. Miller, E. (1989a). Language impairment in Alzheimer type dementia. Clinical Psychology Review, 9, 181 – 195. Miller, E. (1989b). Language in dementia of the Alzheimer type. In: J. R. Crawford, & D. M. Parker (Eds.), Developments in clinical and experimental neuropsychology ( pp. 75 – 85). New York: Plenum. Monsch, A. U., Bondi, M. W., Butters, N., Paulsen, J. S., Salmon, D. P., Brugger, P., & Swenson, M. R. (1994). A comparison of category and letter fluency in Alzheimer’s disease and Huntington’s disease. Neuropsychology, 8, 25 – 30. Murdoch, B. E., Chenery, H. J., Wilks, V., & Boyle, R. S. (1987). Language disorders in dementia of the Alzheimer type. Brain and Language, 31, 122 – 137. Nicholas, M., Barth, C., Obler, L. K., Au, R., & Albert, M. L. (1997). Naming in normal aging and dementia of the Alzheimer’s type. In: H. Goodglass, & A. Wingfield (Eds.), Anomia: neuroanatomical and cognitive correlates ( pp. 166 – 188). San Diego: Academic Press. Price, B. H., Gurvit, H., Weintraub, S., Geula, C., Leimkuhler, E., & Mesulam, M. (1993). Neuropsychological patterns and language deficits in 20 consecutive cases of autopsy-confirmed Alzheimer’s disease. Archives of Neurology, 50, 931 – 937. Rapcsak, S. Z., Croswell, S. C., & Rubens, A. B. (1989). Apraxia in Alzheimer’s disease. Neurology, 39, 664 – 668. Rosen, W. G. (1980). Verbal fluency in aging and dementia. Journal of Clinical Neuropsychology, 2, 135 – 146. Rosser, M. (1993). Alzheimer’s disease. British Medical Journal, 307, 779 – 802. Salmon, D. P., & Bondi, M. W. (1997). The neuropsychology of Alzheimer’s disease. In: P. D. Nussbaum (Ed.), Handbook of neuropsychology and aging ( pp. 141 – 158). New York: Plenum. Satz, P., & Mogel, S. (1962). An abbreviation of the WAIS for clinical use. Journal of Clinical Psychology, 18, 77 – 79. Spreen, O., & Benton, A. L. (1977). Neurosensory Center Comprehensive Examination for Aphasia (NCCEA). 1977 revision. Manual of instructions. Victoria, B.C.: Department of Psychology, University of Victoria. SPSS. (1998). SYSTAT 8.0. Chicago: SPSS.

E. Helmes, T. Østbye / Archives of Clinical Neuropsychology 17 (2002) 179–193

193

Starkstein, S. E., Vasquez, S., Migliorelli, R., Teson, A., Sabe, L., & Leiguarda, R. (1995). A single-photon emission computed tomographic study of anosognosia in Alzheimer’s disease. Archives of Neurology, 52, 415 – 420. Steenhuis, R. E., & Østbye, T. (1995). Neuropsychological test performance of specific diagnostic groups in the Canadian Study of Health and Aging (CSHA). Journal of Clinical and Experimental Neuropsychology, 17, 773 – 785. Suhr, J. A., & Jones, R. D. (1998). Letter and semantic fluency in Alzheimer’s, Huntington’s, and Parkinson’s dementias. Archives of Clinical Neuropsychology, 13, 447 – 454. Teng, E. L., & Chui, H. C. (1987). The Modified Mini-Mental State (3-MS) examination. Journal of Clinical Psychiatry, 48, 314 – 318. Thompson, I. M. (1987). Language in dementia: I. A review. International Journal of Geriatric Psychiatry, 2, 145 – 161. Travniczek-Marterer, A., Danielczyk, W., Simanyi, M., & Fischer, P. (1993). Ideomotor apraxia in Alzheimer’s disease. Acta Neurologica Scandinavica, 88, 1 – 4. Troyer, A. K., Moscovitch, M., Winocur, G., Alexander, M. P., & Stuss, D. (1998). Clustering and switching on verbal fluency: the effects of focal frontal- and temporal-lobe lesions. Neuropsychologia, 36, 499 – 504. Tuokko, H., Kristjansson, E., & Miller, J. (1995). Neuropsychological detection of dementia: an overview of the neuropsychological component of the Canadian Study of Health and Aging. Journal of Clinical and Experimental Neuropsychology, 17, 352 – 373. Wechsler, D. (1981). WAIS-R manual. New York: Psychological Corporation. Yesavage, J. A., Brooks, J. O., Taylor, J., & Tinklenberg, J. (1993). Development of aphasia, apraxia, and agnosia and decline in Alzheimer’s disease. American Journal of Psychiatry, 150 (5), 742 – 747. Zec, R. F. (1993). Neuropsychological functioning in Alzheimer’s disease. In: R. W. Parks, R. F. Zec, & R. S. Wilson (Eds.), Neuropsychology of Alzheimer’s disease and other dementias ( pp. 3 – 80). New York: Oxford Univ. Press.