Validity of a multi-domain computerized cognitive assessment battery for patients with multiple sclerosis

Multiple Sclerosis and Related Disorders 30 (2019) 154–162 Contents lists available at ScienceDirect Multiple Sclerosis and Related Disorders journa...

Download PDF

1MB Sizes 1 Downloads 43 Views

Report

PDF Reader
Full Text

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

Contents lists available at ScienceDirect

Multiple Sclerosis and Related Disorders journal homepage: www.elsevier.com/locate/msard

Clinical trial

Validity of a multi-domain computerized cognitive assessment battery for patients with multiple sclerosis

T

Daniel Golana,b, Jeﬀrey Wilkenc,d, Glen M. Donigere,f, Timothy Frattoc, Robert Kanec, Jared Srinivasang, Myassar Zarifg, Barbara Bumsteadg, Marijean Buhseg, Lori Fafardg, ⁎ Ilir Topallig, Mark Gudesblattg, a

Department of Neurology & Multiple Sclerosis Center, Lady Davis Carmel Medical Center, Haifa, Israel Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel c Washington Neuropsychology Research Group, Fairfax, VA, USA d Department of Neurology, Georgetown University, Washington DC, USA e Department of Clinical Research, NeuroTrax Corporation, Modiin, Israel f Joseph Sagol Neuroscience Center & Center of Advanced Technologies in Rehabilitation, Sheba Medical Center, Ramat Gan, Israel g South Shore Neurologic Associates, 77 Medford Avenue, Patchogue, NY 11772, USA b

A R T I C LE I N FO

A B S T R A C T

Keywords: Cognitive function Computerized cognitive assessment Multiple sclerosis Response time Validity

Background: Computerized cognitive batteries may facilitate the integration of neuropsychological assessments into routine clinical care of patients with multiple sclerosis (PwMS). Objective: To assess the construct and criterion validity of a computerized, multi-domain cognitive assessment battery (CAB, NeuroTrax) in MS. Methods: 81 PwMS and 15 healthy controls (HC) completed the CAB and a set of traditional neuropsychological tests recommended for MS on the same day. Principal component factor analysis was used to assess construct validity. For criterion validity, the gold standard deﬁnition of cognitive impairment was a score of ≥1.5SD below average on at least one cognitive domain, based upon traditional test normative data. Receiver operating characteristic (ROC) analysis was used to examine the ability of the CAB to discriminate cognitively impaired PwMS. Results: Traditional and computerized tests of memory, processing speed, visuospatial and executive function converged by factor analysis. Computerized tests detected cognitive impairment with 85% sensitivity and 70% speciﬁcity. PwMS classiﬁed as impaired on only the computerized battery had signiﬁcantly prolonged response times and a higher rate of unemployment compared with PwMS classiﬁed as unimpaired on both batteries. Poor executive function was more likely to be revealed by the CAB. Conclusion: The speciﬁc computerized assessment battery evaluated is valid for cognitive screening of people with MS and may be more likely to detect prolonged response times and impaired executive function.

1. Introduction Cognitive dysfunction is a prevalent debilitating symptom of Multiple Sclerosis (MS). More than 50% of people with MS (PwMS) experience some form of cognitive impairment (Sumowski et al., 2018). MS-related cognitive dysfunction may prevent maintaining gainful employment (Rao et al., 1991a), adversely impact quality of life (Rao et al., 1991b) and may be a manifestation of disease activity (Benedict et al., 2014). Routine neurological assessment lacks sensitivity in identifying cognitive impairment. Neurologists' prediction of cognitive impairment based on a typical clinical visit is not signiﬁcantly ⁎

diﬀerent from chance (Romero et al., 2015). It is therefore important to include some form of objective cognitive assessment in the routine care and management of PwMS. In 2001 an expert panel of MS neuropsychologists recommended a 90-min battery, Minimal Assessment of Cognitive Function in MS (MACFIMS), covering 5 cognitive domains (processing speed/working memory, learning and memory, executive function, visual-spatial processing, and word retrieval) (Benedict et al., 2002). Unfortunately, this battery has several limitations which have resulted in its infrequent use in PwMS routine care and clinical trials. Among these limitations are: dependence upon neuropsychologists to supervise testing, labor-

Corresponding author. E-mail addresses: [email protected] (D. Golan), [email protected] (M. Gudesblatt).

https://doi.org/10.1016/j.msard.2019.01.051 Received 10 September 2018; Received in revised form 4 January 2019; Accepted 29 January 2019 2211-0348/ © 2019 Elsevier B.V. All rights reserved.

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

by the traditional neurocognitive tests. A 10-min break was given between the two testing batteries. Participants were instructed not to take psychotropic medications (benzodiazepines, opioids) in the 6 h prior to testing. Cognitive assessments were supervised by a neuropsychologist.

intensive manual computation of standardized scores, lack of alternate forms for some components, and lengthy administration times. One proposed solution is to decrease the number of domains tested (Langdon et al., 2012). This approach may lead to loss of critical information for identifying the spectrum and degree of cognitive impairment in the population of PwMS and for any individual PwMS. Computerized screening tools oﬀer the opportunity to incorporate such objective cognitive screening measures into routine PwMS clinical care while retaining broad multi-domain assessment. Periodic computerized cognitive screening is already integrated into our routine for PwMS care, and data from ∼700 PwMS has provided further insight into the impact of fatigue and depression on MS-related cognitive impairment (Golan et al., 2018). Additional merits of computerized tools include: inherent administration standardization, instant result normalization, multiple alternate forms, and testing session supervision not requiring physician/neuropsychologist involvement. The computerized cognitive assessment battery (CAB) utilized in this study (NeuroTrax) has been validated and used to screen for cognitive impairment in other neurological diseases (Dwolatzky et al., 2003; Schweiger et al., 2003) (see the supplementary materials). Signiﬁcant correlations have been shown between CAB domain scores and test results from Rao's neuropsychological screening battery among 58 PwMS (Achiron et al., 2007). In that study, most CAB index scores signiﬁcantly discriminated PwMS from healthy individuals (Achiron et al., 2007). In this study we further evaluated the construct and criterion validity of this CAB among PwMS relative to a set of recommended, traditional neuropsychological tests, mainly from the Minimal Assessment of Cognitive Function in MS (MACFIMS) battery.

2.3. Traditional cognitive testing Traditional neuropsychological tests were mainly from the Minimal Assessment of Cognitive Function in MS (MACFIMS) battery, which was developed for use in PwMS by expert consensus (Benedict et al., 2002) with subsequent empirical validation (Benedict et al., 2006). The only diﬀerence was inclusion of the Selective Reminding Test (SRT) instead of the California Verbal Learning Test-II (CVLT-II). These two tests have shown comparable sensitivity in discriminating PwMS from demographically-matched controls (Strober et al., 2009). The SRT is also recommended for PwMS as part of the MS-COG battery (Sumowski et al., 2018; Erlanger et al., 2014). We opted for the SRT due to substantially reduced administration time than for the CVLT-II, and availability of multiple forms of SRT for longitudinal follow-up. The traditional cognitive assessment consisted of 7 tests which can be grouped into 5 domains (Benedict et al., 2002): information processing speed and working memory—Paced Auditory Serial Addition Test (PASAT) Trial 1: three second and Trial 2: two second (Rao et al., 1991b); SDMT (Sheridan et al., 2006). Memory—SRT for verbal memory (Ehrenreich, 1995); Brief Visuospatial Memory Test-Revised (BVMT-R) (Benedict, 1997) for visual memory. Verbal ﬂuency—Controlled Oral Word Association Test (COWAT) (Heaton et al., 2004). Visuospatial processing—Judgment of Line Orientation (JLO) (Mitrushina et al., 2005). Executive function—Delis-Kaplan Executive Function System (D-KEFS) Sorting Test (Delis et al., 2001). For all tests, z-scores were calculated based on published normative data. To improve the robustness of cognitive classiﬁcation, the z-scores of all tests within a particular cognitive domain were averaged to obtain summary cognitive domain z-scores.

2. Methods 2.1. Participants 81 PwMS, according to the revised McDonald criteria, under treatment at one MS Care Center (South Shore Neurologic Associates, New York, USA) were recruited for this study. Inclusion criteria were: age 25–55, vision corrected to at least 20/50, 12+ years of education, able to write and press buttons on a computer mouse. Exclusion criteria were: unable to understand and follow all test instructions, history of head injury, seizures, neurological conditions involving the central nervous system (other than MS), colorblindness, history of psychosis or other severe mental illness (other than mild-to-moderate depression), current alcohol/substance abuse and MS relapse < 30 days prior to enrollment. 15 healthy controls (HC) underwent equivalent study procedures. Inclusion and exclusion criteria for HC were similar to those of PwMS. All HC had to be free of neurological conditions known to impact cognitive function. For each participant, all evaluations (see below) were carried out on the same day. Institutional review board (IRB) approval was obtained, and all participants gave informed consent.

2.4. Gold standard deﬁnition of cognitive impairment PwMS with a cognitive domain z-score of −1.5 or lower on at least one cognitive domain were deﬁned as cognitively impaired by ‘gold standard’ traditional tests (Sumowski et al., 2018). 2.5. Computerized assessment battery This CAB consists of 10 tests: Verbal Memory, Non-Verbal Memory, Go-NoGo, Stroop, Verbal Function (naming, rhyming), Problem Solving, Visuospatial Processing, Staged Information Processing Speed (IPS), Finger Tapping, and a psychomotor “Catch Game”. For tests measuring response time, participants respond by clicking one mouse button on a two-button mouse (to avoid confounding eﬀects of weakness or poor coordination, no movement of the mouse is required). Further detailed description of these tests is available in the supplementary materials. To minimize diﬀerences in age and education and to permit averaging performance across diﬀerent types of outcome parameters (e.g., accuracy, response times), each CAB outcome parameter is normalized and ﬁt to an IQ-style scale (mean: 100, SD: 15) in an age- and education-speciﬁc fashion. Normative data is from individuals classiﬁed as cognitively healthy in controlled studies conducted at academic research centers. Normalized subsets of outcome parameters were averaged to produce nine summary scores, each indexing a diﬀerent cognitive domain (Table 1). A Global Cognitive Score was computed as the average of the 9 domain index scores.

2.2. Study procedures To examine ability of the CAB to detect various degrees of cognitive impairment, PwMS participants were ﬁrst screened for cognitive dysfunction using the Symbol Digit Modalities Test (SDMT). Based on SDMT results and normative data (Smith, 1982; Sheridan et al., 2006), PwMS participants were stratiﬁed into 4 subgroups: the ﬁrst subgroup had performance better than 1.00 standard deviation below the age expected mean. The other 3 subgroups had SDMT scores 1.00 to 1.50, 1.51 to 2.00 and >2.00 standard deviations below the age expected mean. An equal number of participants were recruited to each impairment-based subgroup. All participants completed the computerized battery ﬁrst, followed

3. Statistical analysis Data analysis was performed using SPSS, version 23.0 (IBM Corp., 155

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

Table 1 Cognitive domain scores components of the computerized assessment battery. Domain

Averaged componentsa

Memory

Verbal Memory: Total Accuracy Delayed Verbal Memory: Accuracy Non-Verbal Memory: Total Accuracy Delayed Non-Verbal Memory: Accuracy Go-NoGo: Composite Score Verbal Memory: Accuracy, Repetition 1 Non-Verbal Memory: Accuracy, Repetition 1 Go-NoGo: Composite Score Stroop: Composite Score, Level 3 Catch Game: Total Score Go-NoGo: Response Time Go-NoGo: Response Time Standard Deviation Stroop Interference: Response Time, Level 2 Staged Information Processing Speed: Response Time, Level 1.2 Staged Information Processing Speed: Accuracy, Level 2.3 Staged Information Processing Speed: Composite Scores, Levels 1.1, 1.3, 2.1 and 2.2 Visual Spatial Processing: Accuracy Verbal Function: Rhyming, Accuracy Finger Tapping: Inter-Tap Interval Finger Tapping: Tap Interval Standard Deviation Catch Game: Time to Make First Move Problem Solving: Accuracy

Working Memory

Executive Function

Attention

Information Processing Speed Visual Spatial Verbal Function Motor Skills

Problem Solving

Table 2 Patient characteristicsa. N Age Female Race Education (years) EDSSb Disease durationc (years) Disease course

Disease modifying medication

a b c

81 45 ± 8.1 59 (73%) White: 77 (95%) African American: 4 (5%) 14.5 ± 2.4 4 ± 1.7 10.2 ± 7.3 Relapsing remitting: 74 (91%) Secondary progressive: 5 (6%) Primary progressive: 2 (3%) Natalizumab 48 (59%) Dimethyl Fumarate 11 (14%) Fingolimod 6 (7%) Ocrelizumab 4 (5%) Glatiramer acetate 3 (4%) Alemtuzumab 2 (3%) No medication 7 (8%)

Summary statistics are mean ± standard deviation. EDSS, Expanded Disability Status Scale. Time from ﬁrst conﬁrmed clinical episode suggestive of multiple sclerosis.

COWAT, which taps search and retrieval of verbal information). For the second factor (F2), tests loading on executive functions converged (Computerized: Problem Solving; traditional: D-KEFS sorting and COWAT). In the third factor (F3), computerized and traditional tests of visuospatial processing clustered. The fourth factor (F4) represents memory, wherein traditional and computerized tests of verbal and visual memory converged. The last factor (F5) contained only computerized tests of verbal function (naming and rhyming); this domain was not covered by recommended traditional tests.

a Details regarding the individual tests are provided in the supplementary materials.

Armonk, N.Y., USA). Principal components factor analysis with varimax rotation was conducted to assess how raw scores from traditional and computerized tests in PwMS clustered. Continuous variables were analyzed using between-groups t-test, with corresponding eﬀect sizes (Cohen's d). Receiver operating characteristic (ROC) analysis was used to evaluate the ability of the CAB cognitive domain (index) scores to discriminate PwMS with cognitive impairment from unimpaired PwMS, as deﬁned by traditional tests. Sensitivity, speciﬁcity and accuracy of the CAB domain scores for predicting the ‘gold standard’ classiﬁcation of cognitive impairment based on traditional measures were calculated, using two common cutoﬀs: ≤1 (domain score ≤85) and ≤1.5 (domain score ≤77.5) standard deviations below the age and education expected mean. Finally, sensitivity, speciﬁcity and accuracy of various combinations of the computerized domain scores for predicting the ‘gold standard’ classiﬁcation of cognitive impairment were tested. Domain scores were analyzed individually, using the cutoﬀ that yielded the best classiﬁcation accuracy. Domains were retained if they improved the ability of the CAB to discriminate among groups. Hence, the optimal combination of domain scores was ascertained. Demographics by subgroup of cognitive impairment classiﬁcation were compared using analysis of variance (ANOVA) for continuous variables and Fisher's exact test for categorical variables, followed by Bonferroni-adjusted pairwise comparisons. Descriptive statistics are reported as mean ± standard deviation.

4.2. Criterion validity Computerized domain scores discriminated PwMS from healthy controls (Table 4). Age- and education-adjusted computerized domain scores were signiﬁcantly lower among PwMS as compared to healthy controls, with large eﬀect sizes for the computerized memory, information processing speed, working memory, attention, executive functions, visuospatial processing and motor domains. Verbal and problem solving domain scores were not signiﬁcantly diﬀerent between the groups. Notably, PwMS and healthy controls were comparable in years of education and gender (Table 2); however, controls were signiﬁcantly younger than patients [33.3 ± 8.1 vs. 45.0 ± 8.1 years, P < 0.001]. Notably, age-adjusted scores were analyzed mitigating this diﬀerence. 58 PwMS were cognitively impaired and 23 were cognitively normal, by 'gold standard' traditional tests. All the computerized domain scores discriminated cognitively impaired from cognitively unimpaired PwMS (Table 5). For all computerized domain scores, a cutoﬀ for impairment of ≥1 standard deviation below the age- and educationadjusted mean (domain score ≤85) was superior, deﬁned as improved sensitivity and accuracy without substantially reduced speciﬁcity, as compared to the alternative ≥1.5 standard deviations below the normative mean (Table 5). Consequently, we used the domain score ≤85 cutoﬀ for analysis of the discriminability of various domain score combinations (Table 6) and for describing frequency of impairment (Figs. 1 and 3a). The optimal computerized domain combination for predicting cognitive impairment among PwMS, as deﬁned by traditional tests, included memory, information processing speed, attention, working memory and visuospatial indices. Computerized domain score ≤85 in at least one of these 5 domains predicted 'gold standard' cognitive impairment with 85% sensitivity and 70% speciﬁcity. Using the complete set of 9 computerized domains improved the sensitivity to 86% but decreased speciﬁcity to 57% (Table 6).

4. Results 4.1. Construct validity Patient characteristics are shown in Table 2. Traditional and computerized tests converged by factor analysis. Five components were rotated, together explaining 66.4% of the variance (Table 3). The ﬁrst factor (F1) included tests related to attention, processing speed and response times (Computerized: IPS, Stroop, Go-NoGo, “Catch Game”; Traditional: SDMT). Tests from the ﬁrst factor also rely on executive abilities (Computerized: Stroop and Go-NoGo, demanding response inhibition and the “Catch Game”, which requires planning; Traditional: 156

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

Table 3 Construct validity of traditional and computerized batteries by principal components factor analysis. Testa

F1

C: Staged Information Processing Speed: Composite Score, Level 2.2 C: Stroop: Composite Score, Level 3 C: Go-NoGo: Composite Score C: Catch Game: Total Score C: Visual Spatial Processing: Accuracy C: Problem Solving: Accuracy C: Delayed Non-Verbal Memory: Accuracy C: Delayed Verbal Memory: Accuracy C: Verbal Function: Matching, Accuracy C: Verbal Function: Rhyming, Accuracy T: Symbol Digit Modalities Test (SDMT), Oral T: Controlled Oral Word Association Test (COWAT), Animals T: Controlled Oral Word Association Test (COWAT), F-A-S T: Delis-Kaplan Executive Function System (D-KEFS), Correct Sorts T: Paced Auditory Serial Addition Test (PASAT), 3 s T: Brief Visuospatial Memory Test-Revised (BVMT-R), Delayed Recall T: Judgment of Line Orientation (JLO), Number Correct T: Selective Reminding Test (SRT), Delayed Recall % variance explainedb

0.76 0.73 0.77 0.64 0.4

F2

F3

F4

0.4 0.4

0.77 0.56 0.44

F5

0.4 0.66 0.84 0.78

0.58 0.45

0.54 0.67 0.76 0.4 0.71 0.5

0.41 0.77

17

15.6

0.83 10.8

12.4

10.7

C, computerized; T, traditional. a Loadings <0.4 are omitted. b Percent of variance explained by all 18 variables for each factor. Table 4 Group data comparing age- and education-adjusted computerized cognitive scores of patients with multiple sclerosis and normal healthy controls.

Global Cognitive Score Memory Information Processing Speed Working Memory Attention Executive Function Visual Spatial Motor Skills Verbal Function Problem Solving a b c

MS Na

Score

81 81 76 81 81 81 79 77 80 79

89.1 87.7 85.8 89.4 86.6 90.0 96.5 89.9 87.6 91.8

± ± ± ± ± ± ± ± ± ±

13.9 18.4 17.5 13.4 18.3 14.9 16.8 18.5 26.6 17.7

Controls (N = 15) Score

Cohen's db

P valuec

101.2 ± 6.9 101.5 ± 9.2 98.0 ± 11.8 102.0 ± 123 100.5 ± 7.6 103.5 ± 10.2 109.0 ± 11.8 104.7 ± 4.3 95.5 ± 21.7 95.9 ± 15.4

1.1 0.93 0.82 0.97 0.96 1.1 0.86 1.1 0.33 0.25

<0.001 <0.001 0.002 0.001 <0.001 0.001 0.007 <0.001 0.28 0.4

Outcome parameter scores may be missing if patient responses are insuﬃcient on relevant tests. By convention, Cohen’s d values of 0.8, 0.5, and 0.2 represented large, medium and small eﬀect sizes, respectively (Fritz et al., 2012). Between-groups t-test.

Table 5 Validity of computerized domain scores in discriminating cognitively impaired from cognitively unimpaired participants with multiple sclerosis. Domain score

Area under curve

Standard error

Asymptotic signiﬁcance

95% conﬁdence interval

Sensitivity (Sn), Speciﬁcity (Sp) and Accuracy (Ac) of selected cutoﬀsa Score ≤ 85 Score ≤ 77.5 Sn Sp Ac Sn Sp Ac

Global Cognitive Score Memory Information Processing Speed Attention Working Memory Executive Function Motor Skills Problem Solving Verbal Function Visual Spatial

0.80 0.80 0.79

0.062 0.058 0.056

<0.001 <0.001 <0.001

0.68–0.92 0.68–0.91 0.68–0.90

41% 52% 52%

87% 87% 83%

54% 62% 60%

24% 33% 43%

91% 91% 87%

43% 49% 56%

0.78 0.75 0.72 0.69 0.68 0.67 0.66

0.061 0.064 0.069 0.071 0.071 0.068 0.077

<0.001 0.001 0.003 0.01 0.01 0.02 0.03

0.66–0.90 0.63–0.88 0.59–0.85 0.55–0.83 0.54–0.82 0.53–0.80 0.51–0.81

43% 36% 36% 35% 38% 35% 24%

87% 87% 83% 78% 83% 87% 83%

56% 51% 49% 47% 51% 49% 41%

29% 26% 24% 28% 21% 21% 14%

96% 96% 96% 87% 83% 91% 87%

48% 46% 44% 44% 38% 41% 40%

a Cutoﬀs for impairment were derived from research analyses in the present study, and no recommendation in this regard has been made by the developers of the computerized battery.

PwMS had impaired motor skills and attention, as well as verbal naming and rhyming (Fig. 1). The lower speciﬁcity associated with using all 9 CAB domain scores arises from 10 PwMS classiﬁed as: impaired by the CAB but unimpaired by traditional tests (Table 6). Interestingly, PwMS who were found to be

The frequency of cognitive impairment and count of aﬀected cognitive domains on both batteries among PwMS is shown in Fig. 1. Information processing speed and memory were the most commonly impaired domains on both batteries. For some PwMS, poor performance on domains not covered by traditional tests was evident: 28–35% of

157

Multiple Sclerosis and Related Disorders 30 (2019) 154–162 77% 77% 78% 80% 78%

impaired by the CAB only, had signiﬁcantly longer age- and educationadjusted response times on various tasks, such as time taken to decide whether to move left or right to catch an on-screen object or the time to add (or subtract) 2 numbers and decide if the sum is >4 (Fig. 2). Comparison of demographics by cognitive classiﬁcation is shown in Table 7. EDSS of participants impaired on only one battery, whether traditional or computerized, was not diﬀerent from EDSS of participants who were normal on both batteries. More participants unimpaired on both batteries were employed, as compared to participants impaired on both batteries. Employment status of those classiﬁed as impaired on only one battery, either computerized or traditional, was not signiﬁcantly diﬀerent from those classiﬁed as impaired on both batteries (Table 7). Taken together, PwMS classiﬁed as impaired on the CAB but not ‘gold standard' tests seem to be more disabled than participants classiﬁed as cognitively intact on both batteries. Details of the impaired cognitive domains for participants classiﬁed as impaired by only one battery is presented in Fig. 3. A classiﬁcation of impairment on the CAB only was usually associated with impairment in multiple domains (Fig. 3a). Impaired executive function was common among those classiﬁed as impaired on the CAB alone. Impairment on traditional tests only was due to reduced information processing speed and working memory, verbal ﬂuency or memory (Fig. 3b).

70% 70% 74% 70% 57%

72% 76% 78% 78%

This study substantiated the validity of a particular computerized cognitive screening battery for PwMS (Achiron et al., 2007). Construct validity is supported by the convergence of recommended traditional tests and computerized tests by factor analysis into 4 distinct domains, conceptually representing information processing speed, executive function, memory and visuospatial processing. Criterion validity was conﬁrmed by the ability of CAB to detect cognitive impairment among PwMS, as deﬁned by recommended traditional tests, with 85% sensitivity, as well as by signiﬁcantly lower scores of PwMS compared to healthy controls, with generally large eﬀect sizes. This CAB was not designed to mimic MACFIMS. It was meant to be comprehensive, with broad rather than disease-speciﬁc coverage of cognitive domains. This study oﬀers the opportunity to explore how tests not traditionally included (e.g., tests of response inhibition, goaldirected planning, and verbal naming and rhyming) perform in detecting cognitive impairment among PwMS. Executive deﬁcit as determined by the recommended D-KEFS sorting test was uncommon among our PwMS, despite the high overall frequency of cognitive impairment. Previous research with MACFIMS tests also identiﬁed low failure rates on this particular test, with at best moderate eﬀect sizes compared to healthy controls (Kim et al., 2017; Migliore et al., 2017). The frequency of executive deﬁcit on the CAB was 31%, arising from tests of response inhibition (Go-NoGo, Stroop) and from a 'Catch Game' that measures goal-directed planning. Traditional verbal ﬂuency tests may rely on executive abilities, but as they also rely on information processing speed, they may not be speciﬁc enough for detecting executive decline. These ﬁndings may call for a reconsideration of the optimal approach for identifying executive deficit among PwMS. This CAB includes a ‘motor skills’ domain, derived from a combination of a ﬁnger tapping task and the time it takes to make the ﬁrst move to catch a falling object on the screen. This domain score discriminated PwMS from healthy controls with a large eﬀect size (Table 4) and also discriminated impaired PwMS from cognitively normal patients (Table 5). As primary motor pathways (e.g., cerebellar circuits, pyramidal tracts) may be damaged in MS, the degree to which this outcome is inﬂuenced by pure physical disability requires further exploration. It is well recognized that ﬁnger tapping may be impaired in neurological diseases without an overt motor disability until later stages, like Alzheimer's disease (Roalf et al., 2018), implying that this domain is at least partly independent from primary motor modalities.

79% 79% 79% 85% 86% 12 12 12 9 8

IPS, Information Processing Speed. CAB, computerized cognitive assessment battery. b

a

7 7 6 7 10 46 46 46 49 50

16 16 17 16 13

5 5 5 6 42 44 45 45

18 18 18 17

5. Discussion

IPSa, Memory IPS, Memory, Attention IPS, Memory, Attention, Working Memory IPS, Memory, Attention, Working Memory, Executive Function IPS, Memory, Attention, Working Memory, Motor Skills IPS, Memory, Attention, Working Memory, Problem Solving IPS, Memory, Attention, Working Memory, Verbal Function IPS, Memory, Attention, Working memory, Visual Spatial All 9 CABb Domains

Impaired on traditional battery only Unimpaired on both batteries Impaired on computerized battery only Impaired on both batteries Computerized domain combination (at least 1 domain score ≤85):

Table 6 Sensitivity (Sn), speciﬁcity (Sp) and accuracy (Ac) of various computerized domain combinations for classiﬁcation of patients with multiple sclerosis as cognitively impaired.

Sn

16 14 13 13

Sp

78% 78% 78% 74%

Ac

74% 77% 78% 77%

D. Golan, et al.

158

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

Fig. 1. (a) Percent of PwMS (N = 81) classiﬁed as impaired on traditional tests in each domain. Domain scores were the averaged z-scores of relevant traditional tests as described in the text. Cognitive domain z-score of −1.5 or lower was considered impaired. (b) Distribution of count of impaired domains on traditional tests per MS patient. 28% of PwMS (23 patients) had no impaired domains, thus classiﬁed as cognitively unimpaired on traditional tests. (c) Percent of PwMS (N = 81) classiﬁed as impaired on computerized tests in each domain. For all domains, a cutoﬀ score of ≤85 (at least 1 SD below average) was superior in predicting traditional classiﬁcation of impairment and was used to determine the frequencies shown. (d) Distribution of count of impaired domains on computerized tests per MS patient. 26% of PwMS (21 patients) had no impaired domains, thus classiﬁed as cognitively unimpaired on computerized tests. IPS, Information Processing Speed.

PwMS from healthy controls (Table 4) but did discriminate impaired PwMS from cognitively normal PwMS (Table 5). This test is intended to be a measure of non-verbal IQ and is conceptually and empirically correlated to Raven's standard progressive matrices (Raven, 2000; Doniger et al., 2008). This test imposes demands upon executive functions and visuospatial processing, which may decline in cognitively impaired PwMS. This decline is the likely explanation for decreased performance on the computerized problem solving test among cognitively impaired PwMS in this study; a similar decline has been reported for the Raven's standard progressive matrices (Zakzanis, 2000). The 57% speciﬁcity of the complete CAB (9 domains), arises from PwMS who were classiﬁed as impaired by the CAB but classiﬁed as unimpaired by traditional tests. This discordance may be due to ‘overdiagnosis’ by the CAB or to ‘under-diagnosis’ (e.g., under-recognition) of certain aspects of cognitive dysfunction by the recommended ‘gold standard’ tests. The high rate of unemployment among PwMS who were classiﬁed as impaired only on the CAB (Table 7) lends credence to the later explanation. Computerized tests are very sensitive to prolonged response times of patients (Fig. 2), which is not true of traditional pen and paper tests that lack the millisecond precision in response time measurement. Prolonged response times among PwMS were found by others using the same CAB (Achiron et al., 2007) and other computerized tests (Lapshin et al., 2013; Papathanasiou et al., 2014; Wilken

Furthermore, it has been shown that 73% of PwMS who failed the 9hole peg test, were still able to perform ﬁnger tapping (Tanigawa et al., 2017), implying that only a severe degree of physical disability actually interferes with ﬁnger tapping. Indeed 77/81 (95%) of patients in this study were able to complete the CAB motor skills tests, despite a rather high average EDSS (4.0 ± 1.7). Finally, functional MRI studies showed that among PwMS who perform ﬁnger tapping, activation of primary motor brain areas (e.g., cerebellum, motor cortex) occurs, but there is also activation of subcortical structures, such as the basal ganglia, thalamus and amygdala (Bonzano et al., 2017). These additional structures may be relevant to cognitive dysfunction (Batista et al., 2012). Taken together, the CAB ‘motor skills’ domain is likely valid for detection of cognitive impairment in PwMS, although in those with severe pyramidal or cerebellar deﬁcits, this measure should be interpreted with caution. The computerized verbal function score represents ability to name an object and select a rhyming word. Although verbal ﬂuency is conspicuously reduced in MS, confrontation naming, as assessed by such tests as the Boston Naming Test, is only modestly impaired (Zakzanis, 2000). Accordingly, this CAB domain did not discriminate PwMS from healthy controls (Table 4) but did discriminate impaired from cognitively normal PwMS (Table 5), with a modest eﬀect size. Likewise, the computerized problem solving test did not discriminate 159

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

Further, the diﬀerence in complex response times between older drivers who passed or failed an on-road driving test has been reported to have a large eﬀect size (Cohen's d = 1.32) (Mathias and Lucas, 2009) similar in magnitude to the diﬀerence in response times between PwMS classiﬁed as impaired on the CAB only compared to those classiﬁed as unimpaired on both batteries (Fig. 2). Additional work is needed to deﬁne clinical thresholds for prolonged response times among PwMS. EDSS of participants classiﬁed as impaired on the CAB only was not diﬀerent from EDSS of participants classiﬁed as normal on both batteries (Table 7). Therefore, neither classiﬁcation as impaired on the CAB only nor prolonged response times among those classiﬁed as impaired on the CAB only were due to confounding by physical disability. Impaired executive function was also common among those classiﬁed as impaired on the CAB (Fig. 3a). Greater sensitivity of computerized tests to prolonged response times and impaired executive function may spuriously inﬂate the 'false positive' rate and decrease the speciﬁcity of computerized tests when compared with currently endorsed traditional batteries. This study has several limitations. Our control group was rather small and controls were younger than PwMS. However, the CAB domain scores are age-adjusted which mitigates this diﬀerence. Predictive validity (e.g., of vocational status or MRI parameters) and reliability are not addressed in this report and should be further explored. The 9-hole peg test (9HPT) was not part of this research protocol. Impaired manual dexterity, as measured by the 9HPT may aﬀect ﬁnger tapping and thus reduce this CAB motor domain score. However, only one participant was classiﬁed as impaired on the CAB due to impairment in the motor domain alone (Fig. 3a), which reassures us that impaired manual dexterity is not the main reason for classiﬁcation as impaired on the CAB only. Notwithstanding the merits of computerized batteries, their role is to screen for cognitive impairment, not to diagnose or manage such impairment. Ideally, whenever a computerized screening tool detects cognitive impairment, referral to a neuropsychologist or to a cognitive neurologist is important to critically conﬁrm the result, address differential diagnosis issues, weigh the contribution of comorbidities and tailor potential intervention or rehabilitation programs (Rao, 2018). In conclusion, this study demonstrates acceptable construct and criterion validity for this particular multi-domain computerized cognitive screening battery, which should encourage its incorporation in the routine care of PwMS. This CAB also detects certain aspects of MS-related cognitive impairment that may evade traditional tests, such as prolonged response times and impaired executive function.

Fig. 2. PwMS who were classiﬁed as impaired on the computerized cognitive assessment battery only had signiﬁcantly longer age- and education-adjusted response times on various tasks, including time taken to decide whether to move left or right to ‘catch’ an object on the screen (“Catch Game”) or the time to add/subtract 2 numbers and decide if the sum is greater than 4 (Staged Information Processing Speed [IPS] test). Smaller standardized scores reﬂect longer response times. P values are for a between-group t-test. By convention, Cohen's d ≥ 0.8 is considered a large eﬀect size (Fritz et al., 2012).

et al., 2003). Consequently, PwMS with subtly prolonged response times may be classiﬁed as impaired on computerized batteries but as intact on traditional tests. Notably, prolonged response times have been shown to predict driving ability among the elderly (Mathias and Lucas, 2009), illustrating the functional relevance of these measures.

Acknowledgment This study was supported by a grant from Biogen Inc.

Table 7 Demographics by cognitive classiﬁcationa. Impaired on both batteries (N = 50)1 Age Education (years) EDSSc Disease durationd (years) Percent unemployed

Impaired on computerized battery only (N = 10)2

Impaired on traditional battery only (N = 8)3

Unimpaired on both batteries (N = 13)4

Group diﬀerences P valueb

46 ± 7.4 14.2 ± 2 4.5 ± 1.7(4) 11.3 ± 7.9

42.3 ± 9.1 15.3 ± 2.6 3.2 ± 1.5 7.2 ± 4.9

48.7 ± 7.8 15.9 ± 2.9 4.1 ± 1.7 8.5 ± 6.1

44.3 ± 9.9 14.6 ± 3 2.9 ± 1.4(1) 9.2 ± 6.6

0.37 0.23 0.006 0.32

35 (70%)(4)

5 (50%)

3 (38%)

3 (23%)(1)

0.01

a

1,2,3,4

Summary statistics are mean ± standard deviation or frequency (%). Superscripts indicate the group numbers for signiﬁcant pairwise comparisons. Signiﬁcant pairwise comparisons are indicated by bold with the relevant group number shown in parentheses. b P values are for diﬀerences across all groups (ANOVA for continuous variables, Fisher's exact tests for categorical variable). c EDSS, Expanded Disability Status Scale. d Time from ﬁrst conﬁrmed clinical episode suggestive of multiple sclerosis. 160

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

Fig. 3. (a) Ten patients were classiﬁed as impaired on the computerized battery only. The impaired domains for each patient are shown as shaded boxes. (b) Eight patients were classiﬁed as impaired on the traditional battery only. The impaired domains for each patient are shown as shaded boxes. IPS, Information Processing Speed; CAB, computerized cognitive assessment battery; Working M, Working Memory.

Supplementary materials

co-normed estimate of premorbid intelligence: implementation using mindstreams computerized testing. Appl. Neuropsychol. 15 (4), 250–263. Dwolatzky, T., Whitehead, V., Doniger, G.M., et al., 2003. Validity of a novel computerized cognitive battery for mild cognitive impairment. BMC Geriatr. 3, 4. Ehrenreich, J.H., 1995. Normative data for adults on a short form of the Selective Reminding Test. Psychol. Rep. 76, 387–390. Erlanger, D.M., Kaushik, T., Caruso, L.S., et al., 2014. Reliability of a cognitive endpoint for use in a multiple sclerosis pharmaceutical trial. J. Neurol. Sci. 340 (1–2), 123–129. Fritz, C.O., Morris, P.E., Richler, J.J., 2012. Eﬀect size estimates: current use, calculations, and interpretation. J. Exp. Psychol. Gen. 141 (1), 2–18. Golan, D., Doniger, G.M., Wissemann, K., et al., 2018. The impact of subjective cognitive fatigue and depression on cognitive function in patients with multiple sclerosis. Mult. Scler. J. 24 (2), 196–204. Heaton, R.K., Miller, S.W., Taylor, M.J., et al., 2004. Revised Comprehensive Norms for an Expanded Halstead–Reitan Battery: Demographically Adjusted Neuropsychological Norms for African American and Caucasian Adults Professional Manual. In: Lutz, F.L. (Ed.), Psychological Assessment Resources, Inc.. Kim, S., Zemon, V., Rath, J.F., et al., 2017. Screening instruments for the early detection of cognitive impairment in patients with multiple sclerosis. Int. J. MS Care 19 (1), 1–10. Langdon, D.W., Amato, M.P., Boringa, J., et al., 2012. Recommendations for a Brief International Cognitive Assessment for Multiple Sclerosis (BICAMS). Mult. Scler. 18 (6), 891–898. Lapshin, H., Lanctôt, K.L., O'Connor, P., et al., 2013. Assessing the validity of a computergenerated cognitive screening instrument for patients with multiple sclerosis. Mult. Scler. J. 19 (14), 1905–1912. Mathias, J.L., Lucas, L.K., 2009. Cognitive predictors of unsafe driving in older drivers: a meta-analysis. Int. Psychogeriatr. 21 (4), 637–653. Migliore, S., Ghazaryan, A., Simonelli, I., et al., 2017. Cognitive impairment in relapsing-

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.msard.2019.01.051. References Achiron, A., Doniger, G.M., Harel, Y., et al., 2007. Prolonged response times characterize cognitive performance in multiple sclerosis. Eur. J. Neurol. 14 (10), 1102–1108. Batista, S., Zivadinov, R., Hoogs, M., et al., 2012. Basal ganglia, thalamus and neocortical atrophy predicting slowed cognitive processing in multiple sclerosis. J. Neurol. 259 (1), 139–146. Benedict, R., 1997. Brief Visuospatial Memory Test – Revised: Professional Manual. Psychological Assessment Resources, Inc., Lutz, FL. Benedict, R., Morrow, S., Rodgers, J., et al., 2014. Characterizing cognitive function during relapse in multiple sclerosis. Mult. Scler. 20 (13), 1745–1752. Benedict, R.H.B., Cookfair, D., Gavett, R., et al., 2006. Validity of the minimal assessment of cognitive function in multiple sclerosis (MACFIMS). J. Int. Neuropsychol. Soc. 12 (4), 549–558. Benedict, R.H.B., Fischer, J.S., Beatty, W.W., et al., 2002. Minimal neuropsychological assessment of MS patients: a consensus approach. Clin. Neuropsychol. 16 (3), 381–397. Bonzano, L., Pardini, M., Roccatagliata, L., et al., 2017. How people with multiple sclerosis cope with a sustained ﬁnger motor task: a behavioural and fMRI study. Behav. Brain Res. 325, 63–71. Delis, D.C., Kaplan, E., Kramer, J., 2001. Delis-Kaplan Executive Functioning System Examiner's Manual. NCS Pearson, Inc., San Antonio, TX. Doniger, G.M., Simon, E.S., Schweiger, A., 2008. Adjustment of cognitive scores with a

161

Multiple Sclerosis and Related Disorders 30 (2019) 154–162

D. Golan, et al.

cognitive impairment in multiple sclerosis. Mult. Scler. Relat. Disord. 4 (4), 291–295. Schweiger, A., Doniger, G.M., Dwolatzky, T., et al., 2003. Reliability of a novel computerized neuropsychological battery for mild cognitive impairment. Acta Neuropsychol 1, 407–413. Sheridan, L.K., Fitzgerald, H.E., Adams, K.M., et al., 2006. Normative Symbol Digit Modalities Test performance in a community-based sample. Arch. Clin. Neuropsychol. 21 (1), 23–28. Smith, A., 1982. Symbol Digit Modalities Test: Manual. Western Psychological Services, Los Angeles. Strober, L., Englert, J., Munschauer, F., et al., 2009. Sensitivity of conventional memory tests in multiple neuropsychological battery and the minimal assessment. Mult. Scler. 15, 1077–1084. Sumowski, J.F., Benedict, R., Enzinger, C., et al., 2018. Cognition in multiple sclerosis. Neurology 0 10.1212/WNL.0000000000004977. Tanigawa, M., Stein, J., Park, J., et al., 2017. Finger and foot tapping as alternative outcomes of upper and lower extremity function in multiple sclerosis. Mult. Scler. J. Exp. Transl. Clin. 3 (1), 205521731668893. Wilken, J.A., Kane, R., Sullivan, C.L., et al., 2003. The utility of computerized neuropsychological assessment of cognitive dysfunction in patients with relapsing remitting multiple sclerosis. Mult. Scler. J. 9 (2), 119–128. Zakzanis, K.K., 2000. Distinct neurocognitive proﬁles in multiple sclerosis subtypes. Arch. Clin. Neuropsychol. 15 (2), 115–136.

remitting multiple sclerosis patients with very mild clinical disability. Behav. Neurol. 1–10. Mitrushina, M., Boone, B.K., Razani, J., et al., 2005. Locator and data tables for the Judgment of Line Orientation test. Handbook of Normative Data for Neuropsychological Assessment, second ed. Oxford university press, New York, pp. 798–804. Papathanasiou, A., Messinis, L., Georgiou, V.L., et al., 2014. Cognitive impairment in relapsing remitting and secondary progressive multiple sclerosis patients: eﬃcacy of a computerized cognitive screening battery. ISRN Neurol. 2014, 1–7. Rao, S.M., 2018. Role of computerized screening in healthcare teams: why computerized testing is not the death of neuropsychology. Arch. Clin. Neuropsychol. 33 (3), 375–378. Rao, S.M., Leo, G.J., Bernardin, L., et al., 1991b. Cognitive dysfunction in multiple sclerosis. I. Frequency, patterns, and prediction. Neurology 41 (5), 685–691. Rao, S.M., Leo, G.J., Ellington, L., et al., 1991a. Cognitive dysfunction in multiple sclerosis. II. Impact on employment and social functioning. Neurology 41 (5), 692–696. Raven, J., 2000. The Raven's progressive matrices: change and stability over culture and time. Cogn. Psychol. 41 (1), 1–48. Roalf, D.R., Rupert, P., Mechanic-Hamilton, D., et al., 2018. Quantitative assessment of ﬁnger tapping characteristics in mild cognitive impairment, Alzheimer's disease, and Parkinson's disease. J. Neurol. 04 (0123456789), 1–4. Romero, K., Shammi, P., Feinstein, A., 2015. Neurologists accuracy in predicting

162

Validity of a multi-domain computerized cognitive assessment battery for patients with multiple sclerosis

Validity of a multi-domain computerized cognitive assessment battery for patients with multiple sclerosis

Recommend Documents