Neural network topology in ADHD; evidence for maturational delay and default-mode network alterations
Accepted Manuscript Neural network topology in ADHD; evidence for maturational delay and default-mode network alterations T.W.P. Janssen, A. Hillebran...
Accepted Manuscript Neural network topology in ADHD; evidence for maturational delay and default-mode network alterations T.W.P. Janssen, A. Hillebrand, A. Gouw, K. Geladé, R. Van Mourik, A. Maras, J. Oosterlaan PII: DOI: Reference:
Please cite this article as: Janssen, T.W.P., Hillebrand, A., Gouw, A., Geladé, K., Van Mourik, R., Maras, A., Oosterlaan, J., Neural network topology in ADHD; evidence for maturational delay and default-mode network alterations, Clinical Neurophysiology (2017), doi: https://doi.org/10.1016/j.clinph.2017.09.004
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
T.W.P. Janssen - Neural network topology in ADHD
Neural network topology in ADHD; evidence for maturational delay and default-mode network alterations T.W.P. Janssena, A. Hillebrandb, A. Gouwb, K. Geladéc, R. Van Mourikd, A. Marase, J. Oosterlaanc a
Corresponding author:
Vrije Universiteit Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam, The Netherlands Tel.: +31 20 598 8962 E-mail: [email protected] b
VU University Medical Center
De Boelelaan 1117 1081 HV Amsterdam, The Netherlands c
Vrije Universiteit Amsterdam
Van der Boechorststraat 1 1081 BT Amsterdam, The Netherlands d
Mental Health Care Organisation Noord-Holland-Noord,
Stationsplein 138, 1703 WC Heerhugowaard, The Netherlands e
Yulius Academy
Dennenhout 1 2994 GC Barendracht, The Netherlands
1
T.W.P. Janssen - Neural network topology in ADHD
2
Highlights
Functional brain networks were explored in ADHD using source-reconstructed EEG and graph analysis.
Evidence was found for maturational delay and underlying default-mode network alterations.
This study provides new temporal and spatial insights in neural network topology in ADHD.
Abstract Objective: Attention-deficit/hyperactivity disorder (ADHD) has been associated with widespread brain abnormalities in white and grey matter, affecting not only local, but global functional networks as well. In this study, we explored these functional networks using source-reconstructed electroencephalography in ADHD and typically developing (TD) children. We expected evidence for maturational delay, with underlying abnormalities in the default mode network. Methods: Electroencephalograms were recorded in ADHD (n=42) and TD (n=43) during rest, and functional connectivity (phase lag index) and graph (minimum spanning tree) parameters were derived. Dependent variables were global and local network metrics in theta, alpha and beta bands. Results: We found evidence for a more centralized functional network in ADHD compared to TD children, with decreased diameter in the alpha band (ηp2=.06) and increased leaf fraction (ηp2=.11 and .08) in the alpha and beta bands, with underlying abnormalities in hub regions of the brain, including default mode network. Conclusions: The finding of a more centralized network is in line with maturational delay models of ADHD and should be replicated in longitudinal designs. Significance: This study contributes to the literature by combining high temporal and spatial resolution to construct EEG network topology, and associates maturational-delay and default-mode interference hypotheses of ADHD.
1. Introduction Throughout childhood and adolescence, healthy brain development is characterized by a range of neurobiological changes, such as synaptic pruning and myelination of long-distance axons (Craik and Bialystok, 2006) that ultimately lead to a matured brain that enables fast signal transduction while maintaining relatively low energy costs (Boersma et al., 2011). The organization of normal adult brain networks is described as an intermediate structure between tree extremes: (1) a locally connected, highly ordered (regular) network, (2) a random network and (3) a scale-free network, which is characterized by highly connected brain areas, or ‘hubs’ (Stam, 2014). Most graph theoretical research to date has focused on intermediate topological structures between the first two extremes, regular and random networks (Bullmore and Sporns, 2012). These so called small-world networks are optimally efficient, having a delicate balance between dense local connections and a few long-distance connections, and seem sensitive in capturing underlying neurobiological changes during normal and deviant development. Recently, EEG graph analysis studies demonstrated a shift from more random toward more regular small-world configurations with increasing age (Boersma et al., 2013, 2011; Smit et al., 2016). A useful research approach would be to map developmental disorders that are associated with developmental delay, such as attention-deficit/hyperactivity disorder (ADHD) (Shaw et al., 2007), on this changing network configuration with age. A growing literature conceptualizes ADHD from a structural and functional network perspective (Konrad and Eickhoff, 2010). Structurally, noninvasive neuroimaging methods such as magnetic resonance imaging (MRI) and diffusion tensor imaging (DTI) have revealed widespread reductions in white and grey matter volume (Valera et al., 2007) and white matter (WM) integrity in children with ADHD (van Ewijk et al., 2012). Graph analysis of WM connectivity, based on DTI, has demonstrated a less optimal WM topological organization in children with ADHD (Cao et al., 2013); although children with ADHD had a small-world brain network, this network was more regular in topology and therefore less efficient compared to typically developing (TD) children. The abnormal wiring of white matter networks may provide a crucial structural substrate that underlies abnormal functional connectivity and network topology in ADHD (Cao et al., 2014). Functional networks alterations in ADHD have been found using resting-state fMRI (rsfMRI) (Cao et
T.W.P. Janssen - Neural network topology in ADHD
4
al., 2014; Wang et al., 2009a) and electroencephalography (EEG) (Ahmadlou et al., 2012; Liu et al., 2015), and have been interpreted as evidence for a shift towards more regular networks in ADHD in line with structural DTI-based findings (Cao et al., 2013). Cao and colleagues (2014) concluded that these findings support the developmental delay model of ADHD, considering that maturation of healthy brains follows a “local to distributed” principle. However, based on developmental EEG studies in healthy children, which show a shift from random to more regular networks during maturation, one would expect a more random network in ADHD to reflect maturational delay, rather than a more regular network (Boersma et al., 2013, 2011; Smit et al., 2016). As graph analysis is mostly applied to resting-state data, the question arises whether the default-mode network (DMN) is implicated as underlying source of topological network alterations in ADHD. The DMN is an extended network of interconnected brain areas that are associated with internally focused cognition, showing higher activity and connectivity during rest than during externally driven tasks (Buckner et al., 2008). Studies are inconsistent about the direction of abnormal connectivity in ADHD, either suggesting hyperconnectivity or hypoconnectivity of the DMN (Konrad and Eickhoff, 2010). Others argue that the DMN is undisturbed during rest, but fails to be attenuated during the transition from rest-to-task (Sonuga-Barke and Castellanos, 2007), interrupting task-related processing. Abnormal intrinsic oscillation patterns during rest have been found for task-positive neuronal network as well, such as the ventral attention network (VAN), which was furthermore hyperconnected to the DMN (Sripada et al., 2014). This hyperconnectivity may be a reflection of reduced anti-correlations between task-negative (DMN) and task-positive networks, in support of the “default network interference hypothesis”, or impaired regulation of the VAN over the DMN (Sripada et al., 2014). Although the current literature has demonstrated interesting new avenues of investigating brain connectivity and network topology in children with ADHD, several methodological issues limit the conclusions that can be drawn from these data. Firstly, most EEG-based methods use coherence to measure connectivity (Barry et al., 2011), yet coherence fails to capture intrinsic nonlinearities of brain activity, is sensitive to spurious correlations due to volume conduction/field spread and different choices for the reference electrode (van Diessen et al., 2015). Moreover, changes in coherence can be
T.W.P. Janssen - Neural network topology in ADHD
5
induced by both changes in amplitude and connectivity (Stam et al., 2007). The phase lag index (PLI) has been developed as a connectivity measure to address these issues (Stam et al., 2007). Secondly, most EEG connectivity and network studies have been performed in signal space (scalp electrodes), which makes interpretation of findings difficult due to the mixture of signals arising from spatially separated sources at a single electrode, as well as the spread of signals from a single source over multiple electrodes (van Diessen et al., 2015). Thirdly, traditional graph metrics that have been used in fMRI and EEG studies, such as clustering coefficient, shortest path length and degree distribution, are hampered by methodological issues that impede comparability between studies (Papo et al., 2016). A recent application in graph analysis of brain networks is the minimum spanning tree (MST), which may be used to achieve unbiased estimates of brain networks in ADHD (Tewarie et al., 2015). The current study was conducted to further explore neural network topology in children with ADHD, while utilizing several new developments in network analysis. We recorded dense array EEG (128 electrodes) in relatively large numbers of children with ADHD (n=42) and TD controls (n=43) and performed connectivity and graph analysis at the source level. Based on the maturational delay model of ADHD and literature on typical development (Boersma et al., 2013; Smit et al., 2016), we expected to find more centralized, star-like networks in ADHD compared to TD controls, which have been associated with random networks (Tewarie et al., 2015). Furthermore, we expected underlying alterations in hub regions of the brain, specifically in the DMN. 2. Methods 2.1 Participants Eighty-five children with ADHD and 72 TD controls met our inclusion criteria, which required an estimated full scale IQ > 80, measured with a short version of the Wechsler Intelligence Scale for Children (WISC-III; Wechsler, 1991), using the subtests Vocabulary, Arithmetic, Block Design and Picture Arrangement. Children were excluded if there was a known history of neurological conditions. Recruitment procedures have been described in a previous study: “The ADHD group was recruited through mental health outpatient facilities in the west of the Netherlands. All children obtained a clinical diagnosis of ADHD according to the DSM-IV (American Psychiatric Association
T.W.P. Janssen - Neural network topology in ADHD
6
1994) as established by a child psychiatrist. ADHD diagnosis was confirmed by parent and teacher ratings on the Disruptive Behavior Disorders Rating Scale (DBDRS; Pelham et al., 1992), which required at least one of the scores on the Inattention or Hyperactivity/Impulsivity scales to be in the clinical (>90th percentile) range for both informants. Seventy-six percent of children were naive for stimulant medication and the remaining children discontinued use of stimulants at least four weeks before testing. Children with a clinical DSM-IV diagnosis of autism spectrum disorder were excluded. The TD group was recruited through three primary schools and a sports club in the same recruitment area as the ADHD group. Control children were required to obtain normal scores on the DBDRS (<90th percentile) for both informants and to be free of any psychiatric or neurological disorder” (Janssen et al., 2016). Complete data were available for 85 children in the age range 7 to 14 years with 42 children in the ADHD group (33 boys, 9 girls) and 43 children in the TD group (32 boys, 11 girls), see Table 1. Participants were excluded from the final data analysis due to technical EEG issues (ADHD: n=10; TD: n=6), excessive eye movement, muscle or movement artefacts as detected with automatic artifact rejection (ADHD: n=25; TD: n=20) or no sufficient epoch quality (ADHD: n=8; TD: n=3), see paragraph on epoch selection. Data quality is especially paramount for connectivity-based analysis, hence our stringent selection criteria and resulting data loss. The remaining dataset (49% ADHD, 60% TD) was not different compared to the initial dataset on age, IQ and DBDRS scales.
2.2 Procedure The study was conducted according to the Declaration of Helsinki, and was approved by the local ethics committee. Parents and children aged 12 years or older signed informed-consent. The current study sample partly overlaps with a sample participating in a randomized controlled trial on the effects of neurofeedback, methylphenidate and physical exercise as treatments for ADHD (trial number: NCT01363544). The resting EEG recording was followed by the stop-signal task (30 minutes) and oddball task (20 minutes), which are described elsewhere (Janssen et al., 2016, 2015). 2.3 Behavioral assessment
T.W.P. Janssen - Neural network topology in ADHD
7
Parent and teacher reports on the Strengths and Weaknesses of ADHD symptoms and Normal behavior scale (SWAN; Arnett et al., 2013; Swanson et al., 2006) were used for correlational analyses with the primary outcome measures. The SWAN employs 18 items on a seven-point scale ranging from ‘far below average’ (3) to ‘far above average’ (-3), to allow for ratings of relative strengths as well as weaknesses on the two scales comprising the SWAN: Attention and Impulse Control. 2.4 Electrophysiological recordings EEG recording is similar to a previous study, which was described as follows: “Continuous EEG was recorded at 512Hz using the ActiveTwo Biosemi system and ActiView software (Biosemi, Amsterdam, The Netherlands) from 128 scalp electrodes according to the ABC labelling system, referenced to the active common mode and grounded to the passive driven right leg, which functions as a feedback loop to drive average potentials across electrodes to the amplifier zero. Electrooculogram (EOG) was obtained using two electrodes at the external canthi, and two electrodes at infra- and supra-orbital sides” (Janssen et al., 2016). EEG was consecutively recorded during eyesopen (EO, 3 minutes), eyes-closed (EC, 3 minutes) and task conditions (50 minutes). Only 3 minutes of EC data were used in this study, considering that this condition is more stable over sessions, easier to standardize in children and that EC is characterized by robust activity in the alpha band (van Diessen et al., 2015). Off-line analyses were performed with Brain Vision Analyzer 2 software (Brain Products, Gilching, Germany, Version 2.1). A Butterworth Zero Phase band-pass filter of 0.1-30 Hz at 48 dB/oct was applied, and scalp electrodes were re-referenced to the average of 128 electrodes. A zero phase filter was used to avoid phase distortions and the average reference was chosen to reduce confounding effects of the reference (van Diessen et al., 2015). Broken electrodes were interpolated with the spherical splines method (Perrin et al., 1989). Ocular artefacts were detected with the method of Gratton and Coles (1983) and marked in the data, which were used to inform manual epoch selection. Finally, the continuous EEG was segmented into epochs of 4096 time frames (tf; ~8 seconds of data with 512Hz). 2.5 Epoch selection
T.W.P. Janssen - Neural network topology in ADHD
8
Epoch selection was performed in two steps: automatic and manual. First, automatic artifact rejection was applied to segments based on the following criteria: maximum allowed voltage step of 50 µV/ms, maximal peak-to-peak amplitude difference of 200 µV, maximal amplitude of ±150 µV, and minimal low activity of 0.50 µV for 100 ms intervals. If any of the 128 electrodes within an epoch contained an artefact according to these criteria, the entire epoch was rejected. Remaining epochs after the first step were subsequently manually rated for quality: (1) very good quality; no ocular (EOG), electromyographic (EMG), movement or other artefacts; (2) good quality; minimal presence of artefacts; (3) medium quality; moderate presence of artefacts; (4) poor quality; clear and strong presence of artefacts. Participants were included in the final dataset according to the following criteria: 5 epochs of very good or good quality (rated 1 or 2) with a maximum of 1 epoch of medium quality (rated 3). Epochs were chosen based on the best quality available and as close to the start of the recording as possible to minimize effects of increasing variance in vigilance (van Diessen et al., 2015), although we choose not to use a cutoff time for including epochs. 2.6 LAURA source estimation Sources underlying each selected epoch were estimated using the LAURA (Local AutoRegressive Averages) distributed linear inverse solution method (Grave de Peralta Menendez et al., 2004, 2001; Michel et al., 2004). The analysis was performed on broad-band (0.1-30Hz) data using the Cartool software by Denis Brunet (brainmapping.unige.ch/cartool). Source-based analysis was considered advantageous compared to signal-based analysis for the following reasons: (1) scalp-based network analysis may result in erroneous inferences about the underlying network topology due to volume conduction/field spread (Antiqueira et al., 2010), and (2) source-based analysis allows to associate local network metrics (such as the ‘hubness’ of regions) with brain anatomy. In a previous study we have described the method in more detail: “LAURA is a source reconstruction method that incorporates biophysical laws to obtain the optimal solution that fulfills both the observed data and bio-electromagnetic constraints. In this approach, the relationship between brain activity at one point and its neighbors is expressed in terms of a local autoregressive estimator with coefficients depending upon a power of the distance from the point (Grave de Peralta Menendez et al., 2004). Cartool software uses the L-curve method to find the optimal regularization parameter
T.W.P. Janssen - Neural network topology in ADHD
9
for a given data file (Hansen, 1992). We used the Locally Spherical Model with Anatomical Constraints (LSMAC) as lead field model, which has been shown to perform as well as more computationally intensive models like the Boundary Element Model (BEM) (Birot et al., 2014). Inverse solutions were calculated for each participant and epoch separately on a realistic head model that included 5004 equally distributed nodes within the gray matter of the Montreal Neurological Institute (MNI) transformed NIHPD pediatric brain atlas based on 7.5-13.5 years old children (Fonov et al., 2011, 2009)” (Janssen et al., 2016). Finally, 78 regions-of interest (ROIs) out of 5004 nodes were selected based on centroids in the regions of the automated anatomical labeling (AAL) atlas (Gong et al., 2009) in order to reduce the dimensionality of the data and to allow comparability with connectivity/network studies using other techniques such as MEG and fMRI (Hillebrand et al., 2016; Tewarie et al., 2015). This resulted in five epochs, each containing 78 timeseries (one for each AAL ROI) of 4096 samples (~8 seconds of data), containing intensity at each node as index of activation, where the intensity was computed as the vector norm of the source strength in tree orthogonal directions. These epochs were separately analyzed with Brainwave software (version 0.9.152.4.1) to obtain spectral, functional connectivity and graph metrics. 2.7 Spectral, functional connectivity and graph analysis Spectral, functional connectivity and graph analyses of individual epochs were performed for each frequency band separately: theta (4-8Hz), alpha (8-13Hz), and beta (13-25Hz), and averaged over 5 epochs. We refrained from analysing delta and gamma frequency bands, to limit any potential effects of electrophysiological confounds, respectively movement and electromyographic activity, that may have remained in the data despite stringent selection criteria. Connectivity was calculated using the Phase Lag Index (PLI) (Stam et al., 2007). PLI measures the strength of statistical interdependencies between pairs of time series, and was designed to reduce the effects of volume conduction/field spread by ignoring zero-lag (instantaneous) connectivity between signals. Another advantage of PLI over other connectivity methods is that this method is not sensitive for disease specific spectral changes in amplitude that have been documented in ADHD (Snyder and Hall, 2006), which can confound connectivity estimates (van Diessen et al., 2015).
T.W.P. Janssen - Neural network topology in ADHD
10
PLI connectivity matrices were subsequently used to construct functional networks using graph theory. For this purpose, the 78 ROIs were used as nodes and the functional connections between nodes as edges. In this study, we reconstructed the minimum spanning tree (MST) graph, for which we used 1/PLI as input. The MST is an acyclic sub-network that connects all nodes of weighted, undirected connectivity matrices, using Kruskal’s algorithm (Kruskal, 1956). This results in a graph containing (mainly) the strongest connections of the original network that can be considered as backbone of the functional network and which is unique if the link weights are unique (Stam et al., 2014). The MST addresses several methodological issues of conventional graph analyses (van Wijk et al., 2010), such as confounding effects of alterations in connection strength and link density (Tewarie et al., 2015). From the individual MSTs both global and local network metrics can be derived. Global network metrics can be informative about the integration and segregation of the entire network, while local network metrics can specify the level of importance of individual nodes within the network. We limited global network analyses to MST parameters (1) diameter, (2) leaf fraction, and (3) tree hierarchy (Th). Diameter is defined as the longest shortest path between any two nodes in the MST, and leaf fraction is the fraction of nodes with a degree of one in the MST, with degree being the number of links for a given node (Tewarie et al., 2015), see Figure 1 for an illustration of various MST parameters. The lower limit of the number of leafs in a MST is two, and the upper bound (m) is the number of nodes (n) minus one (in our network 77). Diameter is inversely related to leaf number. A small diameter and high leaf number are characteristic for a star-like topology (centralized network), and large diameter and low leaf number are characteristic for a line-like topology (decentralized network). An optimal tree requires a small diameter, but without overloading central nodes within the network (hubs), which is quantified with Th (Boersma et al., 2013; Tewarie et al., 2015). As a local network metric we used betweenness centrality (BC), which is a ‘hub’ measure based on the fraction of shortest paths that run through a node. It has been shown that important hub regions in the brain have high BC (Bullmore and Sporns, 2012). We restricted BC analysis to frequency bands where the global network parameters differed between groups. Although not a network parameter, local relative power spectra for theta, alpha and beta were calculated as well and compared between groups,
T.W.P. Janssen - Neural network topology in ADHD
11
to validate the distributed source localization solutions, and to assess whether BC differences depend on power differences despite using PLI and the MST.
2.8 Statistical analysis Statistical analyses were performed with SPSS 23 (IBM, 2015). Significance was assumed if p<.05 (two-tailed). Demographic data were compared between groups with one-way ANOVA or χ2 test with Fisher exact correction. Twelve separate General Linear Model (GLM) ANOVAs were used to test for group differences between TD and ADHD in mean PLI, MST diameter, leaf fraction and Th for theta, alpha and beta frequency bands. For the main outcomes, mean difference and 95% confidence interval [95% CI] are reported. Effect sizes are reported as partial eta-squared (ηp2), with effects interpreted as small (.01), medium (.06) or large (.14). Within each frequency band, the false discovery rate (FDR) criterion (q=.05), using the Benjamini-Hochberg procedure, was applied to control for multiple comparisons. Pearson correlations were calculated between connectivity/network metrics and IQ and age. To reduce the number of statistical tests, correlations between connectivity/network measures and SWAN scales were limited to measures with significant group differences. BC and relative power were explored for group differences using 5000 F-test permutations to account for skewed data, for each of the 78 nodes and visualized with MATLAB (The Mathworks Inc., Natick, Massachusetts, USA). Significance was assumed if p<.01 (two-tailed).
3. Results 3.1 Group characteristics and data quality Table 1 summarizes the group characteristics. Groups did not differ on age or gender. As expected, IQ was lower in the ADHD group. Groups did not differ in data quality, with no differences in epoch quality (rated 1-4, ADHD: mean=1.31; TD: mean=1.38), F(1,83)=0.63, p=.430, or number of interpolated channels (ADHD: mean=4.17; TD: mean=3.51), F(1,83)=1.98, p=.163. 3.2 Global network metrics
T.W.P. Janssen - Neural network topology in ADHD
12
Table 2 and Figure 2 show the main results of the study. The ADHD group demonstrated decreased MST diameter in the alpha band, mean difference(ADHD-TD)=-.009, 95%CI=[-.017, -.001], with a medium effect size, ηp2=.06. Furthermore, the ADHD group showed increased MST leaf fraction in the alpha band, mean difference(ADHD-TD)=.014, 95%CI=[.005, .023], with a medium/large effect size, ηp2=.11, and an increased MST Th in the alpha band, mean difference(ADHD-TD)=.010, 95%CI=[.002, .017], with a medium effect size, ηp2=.08. An increase in MST leaf fraction was found in the beta band as well; mean difference(ADHD-TD)=.017, 95%CI=[.005, .029], with a medium effect size, ηp2=.08. A strong correlation was found between IQ and MST leaf fraction in the theta band, only for the TD group, reflecting lower mean leaf fraction with increasing IQ. Another correlation between IQ and PLI in the beta band changed to non-significant when removing one outlier, and was therefore not considered a genuine effect. For the behavioral measures, moderate and significant correlations were found between MST diameter in the alpha band and the SWAN Attention scale according to parents for the ADHD and TD groups. The signs of these correlations were opposite for the groups, with a negative association for ADHD, in line with the ANOVA results, reflecting decreasing diameter with increasing inattention problems. In contrast, TD children demonstrated a positive association, reflecting decreasing diameter with increasing attentional skills. Figure 2 denotes that regression lines of the TD and ADHD groups cross near average attentional skills on the x-axis. Exploratory hierarchical linear regression analysis was performed to test whether a quadratic function of the SWAN Attention scale could explain alpha diameter better than a linear function for all children. Although a linear function did not fit the data, F(1,83)=2.71, p=.104, R2=.03, adding a quadratic function significantly improved the model, F(1,83)=6.71, p=.002, R2=.14, with R2 change of .11, F(1,83)=10.40, p=.002. At last, MST leaf fraction in the beta band and SWAN Attention scale according to parents, correlated positively for the ADHD group.