AnatomiCuts: Hierarchical clustering of tractography streamlines based on anatomical similarity

AnatomiCuts: Hierarchical clustering of tractography streamlines based on anatomical similarity

Accepted Manuscript AnatomiCuts: Hierarchical clustering of tractography streamlines based on anatomical similarity Viviana Siless, Ken Chang, Bruce F...

13MB Sizes 0 Downloads 125 Views

Accepted Manuscript AnatomiCuts: Hierarchical clustering of tractography streamlines based on anatomical similarity Viviana Siless, Ken Chang, Bruce Fischl, Anastasia Yendiki PII:

S1053-8119(17)30892-3

DOI:

10.1016/j.neuroimage.2017.10.058

Reference:

YNIMG 14436

To appear in:

NeuroImage

Received Date: 22 March 2017 Revised Date:

18 October 2017

Accepted Date: 26 October 2017

Please cite this article as: Siless, V., Chang, K., Fischl, B., Yendiki, A., AnatomiCuts: Hierarchical clustering of tractography streamlines based on anatomical similarity, NeuroImage (2017), doi: 10.1016/ j.neuroimage.2017.10.058. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

RI PT

AnatomiCuts: Hierarchical Clustering of Tractography Streamlines Based on Anatomical Similarity Viviana Silessa , Ken Changa , Bruce Fischla,b , Anastasia Yendikia a

SC

Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA b Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA

M AN U

Abstract

Diffusion MRI tractography produces massive sets of streamlines that contain a wealth of information on brain connections. The size of these datasets creates a need for automated clustering methods to group the streamlines into meaningful bundles. Conventional clustering techniques group streamlines

D

based on their spatial coordinates. Neuroanatomists, however, define white-

TE

matter bundles based on the anatomical structures that they go through or next to, rather than their spatial coordinates. Thus we propose a similarity

EP

measure for clustering streamlines based on their position relative to cortical and subcortical brain regions. We incorporate this measure into a hierarchical clustering algorithm and compare it to a measure that relies on Euclidean

AC C

distance, using data from the Human Connectome Project. We show that the anatomical similarity measure leads to a 20% improvement in the overlap

of clusters with manually labeled tracts. Importantly, this is achieved without introducing any prior information from a tract atlas into the clustering algorithm, therefore without imposing the existence of any named tracts. Keywords: hierarchical clustering, normalized cuts, tractography, diffusion

Preprint submitted to Neuroimage

October 27, 2017

ACCEPTED MANUSCRIPT

MRI 1. Introduction

RI PT

1

Diffusion MRI (dMRI) allows us to estimate the preferential direction of

3

water molecule diffusion at each voxel in the white matter (WM) (Le Bihan

4

et al., 1986). Tractography algorithms follow these directions to reconstruct

5

continuous paths of diffusion (Mori et al., 1999; Basser et al., 2000). The

6

most common approach to segmenting the WM from dMRI data is to use

7

every voxel in the brain as a seed for tractography and to group the resulting

8

streamlines into bundles. Recent advances in dMRI acquisition hardware and

9

software (Wright & Wald, 1997; Setsompop et al., 2013) have increased both

10

spatial and angular resolution, yielding large tractography datasets that are

11

difficult to parse manually. This creates a need for computational methods

12

that can group streamlines into anatomically meaningful bundles automati-

13

cally.

TE

D

M AN U

SC

2

Several unsupervised clustering algorithms have been proposed to assign

15

tractography streamlines into bundles. Each method is characterized by (i)

16

the measure that it uses to quantify the similarity between two streamlines

17

and (ii) the algorithm that it follows to group streamlines based on their

18

similarity.

20

21

AC C

19

EP

14

Typical similarity measures for unsupervised clustering of streamlines are

functions of the streamline coordinates in Euclidean space. These measures express either the distance between streamlines (Ding et al., 2003; Gerig

22

et al., 2004; Moberts et al., 2005; Tsai et al., 2007; Wassermann et al., 2010;

23

Visser et al., 2011; Siless et al., 2013) or the similarity of some shape de-

2

ACCEPTED MANUSCRIPT

scriptor computed from spatial coordinates (Gerig et al., 2004; Brun et al.,

25

2004; Ros et al., 2013; Zhang et al., 2014). A subset of these methods use the

26

distance metrics to project the streamlines onto an eigenspace and perform

27

the clustering in that space (Tsai et al., 2007; O’Donnell & Westin, 2007).

RI PT

24

However, bundling streamlines based on their proximity or their shape

29

is not consistent with the approach followed by neuroanatomists. In neu-

30

roanatomy, WM bundles are defined based on the brain structures that they

31

go through or next to, rather than their coordinates in a reference space. For

32

example, “The uncinate fasciculus extends between the anterior temporal

33

lobe and the frontal lobe passing though the floor of the insula in an area be-

34

tween the anterior claustrum and the amygdala” (Makris & Pandya, 2008).

35

Here we propose a similarity measure that adopts the neuroanatomist’s ap-

36

proach of comparing streamlines based on their anatomical neighborhood.

M AN U

SC

28

Previous attempts to incorporate anatomical information in unsupervised

38

streamline clustering used the structures that the streamlines terminate in,

39

either in the similarity measure itself (Tunc et al., 2014), in a post-hoc man-

40

ner (Wassermann et al., 2010), or for initialization (Guevara et al., 2011).

41

Alternatively, supervised clustering approaches introduced prior information

42

on WM anatomy from an atlas of predefined bundles, labeled by an ex-

43

pert (O’Donnell & Westin, 2007; Maddah et al., 2008; Ziyan et al., 2009;

45

46

TE

EP

AC C

44

D

37

Guevara et al., 2012; Wang et al., 2013; Ros et al., 2013; Jin et al., 2014). Another class of supervised methods used deterministic rules on the position of streamlines with respect to anatomically defined regions, thus going

47

directly from the whole-brain tractography data to a set of pre-determined

48

WM bundles (Xia et al., 2005; Li et al., 2010; Wassermann et al., 2016). In

3

ACCEPTED MANUSCRIPT

Xia et al. (2005) and Li et al. (2010) clustering based on Euclidean dis-

50

tance was performed as a subsequent step, to refine these bundles, but the

51

anatomical regions were only used in the form of deterministic rules before

52

that clustering step.

RI PT

49

In this work we propose an anatomical similarity measure for unsuper-

54

vised streamline clustering that uses not only the anatomical regions that

55

the streamlines terminate in, but all regions that form the anatomical neigh-

56

borhood of a streamline, everywhere along its trajectory. We have previously

57

used a similar description of anatomical neighborhoods to incorporate prior

58

information from a set of training subjects into the tractography step it-

59

self (Yendiki et al., 2011). However, that was a supervised approach, limited

60

to a set of predefined bundles from an atlas. Here we are interested in ex-

61

ploratory studies that do not rely on training data. Therefore, we do not

62

impose the existence of any named pathways based on a WM atlas; instead,

63

we cluster whole-brain tractography streamlines based on their position with

64

respect to a set of cortical and subcortical structures.

TE

D

M AN U

SC

53

The algorithms that have been used for unsupervised streamline cluster-

66

ing are either partitioning or hierarchical. Partitioning approaches include

67

K-means (Tsai et al., 2007; O’Donnell & Westin, 2007; Siless et al., 2013),

68

affinity propagation (Zhang et al., 2014), maximum likelihood (Maddah et al.,

70

71

AC C

69

EP

65

2008), and maximum a posteriori (Wang et al., 2011; Tunc et al., 2014). Hierarchical algorithms that have been applied to unsupervised streamline clustering are either agglomerative or divisive. Agglomerative algorithms fol-

72

low a bottom-up approach, first assigning each streamline to its own cluster

73

and then merging clusters iteratively (Ding et al., 2003; Gerig et al., 2004;

4

ACCEPTED MANUSCRIPT

Wassermann et al., 2010; Visser et al., 2011; Guevara et al., 2012; Ros et al.,

75

2013). Divisive algorithms follow a top-down approach, first assigning all

76

streamlines to a single cluster and then splitting clusters iteratively (Brun

77

et al., 2004; Ziyan et al., 2009; Wu et al., 2012). Hierarchical methods orga-

78

nize streamlines into a tree structure, where larger clusters are composed of

79

smaller clusters. This is particularly suitable for describing WM organization,

80

as anatomical tracing studies have revealed that the large WM pathways are

81

subdivided into multiple smaller bundles that share a large part of their tra-

82

jectory but originate or terminate in different brain regions (Lehman et al.,

83

2011). Hence we follow the hierarchical approach here.

M AN U

SC

RI PT

74

Our proposed method, AnatomiCuts, incorporates an anatomical similar-

85

ity measure into normalized cuts, an algorithm that has been used previously

86

for streamline clustering with conventional similarity measures (Brun et al.,

87

2004; O’Donnell et al., 2006; Ziyan et al., 2009). We follow an implementa-

88

tion of normalized cuts where a hierarchy of clusters is generated by recursive

89

bipartitions of the full data set (Shi & Malik, 2000; Brun et al., 2004). The

90

present work expands on a prior conference publication (Siless et al., 2016),

91

by adding much more extensive experimental results on the performance of

92

the proposed method and the effects of various tuning parameters that are

93

involved in its implementation.

95

96

TE

EP

AC C

94

D

84

Specifically, we compare the performance of our anatomical similarity

measure to one based on Euclidean distance between streamlines, using data from the MGH-USC Human Connectome Project. We show that cluster-

97

ing streamlines based on their anatomical neighborhood rather than their

98

spatial coordinates leads to a 20% improvement in the agreement of the clus-

5

ACCEPTED MANUSCRIPT

ters with WM bundles labeled manually by a human rater. Importantly, we

100

achieve this improvement without using prior information from clusters la-

101

beled manually in training subjects; we only use a subject’s own anatomical

102

segmentation. Our approach is unsupervised, allowing exploratory studies

103

of whole-brain tractography data, without being constrained to a predeter-

104

mined set of bundles.

105

2. Methods

106

2.1. Similarity measures

M AN U

SC

RI PT

99

107

Let fi be a tractography streamline, defined as a sequence of N points,

108

fi = [xi1 , . . . , xiN ], where xik ∈ R3 , k = 1, ..., N . A tractography dataset is a

109

set of M streamlines, F = {f1 , ..., fM }.

In the following, we assume that all streamlines have the same number of

111

points N . Although the similarity measures can be adapted to streamlines

112

with variable numbers of points, the approach of downsampling streamlines

113

to an equal number of points has been used previously by others to make

114

computation tractable (O’Donnell & Westin, 2007; Visser et al., 2011; Gary-

115

fallidis et al., 2012; Guevara et al., 2012; Wu et al., 2012; Siless et al., 2013).

116

2.1.1. Euclidean similarity

118

119

120

TE

EP

AC C

117

D

110

We define a similarity measure based on the mean Euclidean distance

between pairs of points on two streamlines fi and fj , as previously used in the literature (Visser et al., 2011; Garyfallidis et al., 2012; Guevara et al., 2012; Wu et al., 2012; Siless et al., 2013): N −1 1 X ωE (fi , fj ) , 1 + kxik − xjk k2 . N k=1



6

ACCEPTED MANUSCRIPT

Since the ordering of points is not consistent across streamlines, it is possible

122

for the first point of fi to be closer to the last point of fj and vice versa. We

123

account for this by also evaluating the similarity between fi and the reversed

124

fj . This leads to the following definition for the similarity measure:

RI PT

121

125

(1)

SC

wE (fi , fj ) , max{ωE (fi , fj ), ωE (fi , fjrev )}, where fjrev = [xjN , . . . , xj1 ].

Another option for converting a Euclidean distance into a similarity mea-

127

sure is by means of a Gaussian kernel. However that formulation involves the

128

choice of a kernel width, and the optimal width can vary greatly depending

129

on the input (Brun et al., 2004; O’Donnell & Westin, 2007). Here we opt

130

for a similarity measure that does not involve free parameters (Sammut &

131

Webb, 2017).

132

2.1.2. Anatomical similarity

D

M AN U

126

We introduce a new similarity measure that uses anatomical information

134

in the form of a cortical and subcortical segmentation, S(x), x ∈ R3 . This is

135

done by finding the segmentation labels that each streamline goes through

136

or next to, at each point along its trajectory. Specifically, each point x on a

137

streamline is associated with a set of segmentation labels, S(x + dl (x)vl ), l =

139

140

EP

AC C

138

TE

133

1, ..., P , where dl (x) is the minimum d > 0 such that S(x+dvl ) 6= S(x). That is, for each point x, we find the nearest neighboring segmentation labels in a set of directions vl , l = 1, ..., P . A neighborhood of P = 26 elements includes

141

neighboring labels in the directions vl = [e1 , e2 , e3 ], where e1,2,3 ∈ {−1, 0, 1}.

142

We also define v0 = [0, 0, 0] to represent the segmentation label that the

143

streamline passes through. Fig. 1 shows an illustration of this neighborhood. 7

ACCEPTED MANUSCRIPT

144

Fig. 2 shows an example of a point in the WM and some of its neighboring

SC

6 neighbors

RI PT

segmentation labels in different directions.

14 neighbors

M AN U

26 neighbors

Figure 1: Directions in which nearest neighboring segmentation labels are found, for a streamline point that lies in the center of the cube. Red directions belong to the 6, 14 and 26-element neighborhoods. Blue directions belong to the 14 and 26-element neighbor-

D

hoods. Green directions belong only to the 26-element neighborhood. 145

For each direction l = 0, ..., P , we compute a label histogram Hil ∈ RK ,

147

where K is the total number of labels in the anatomical segmentation. This

148

histogram represents the frequency with which different segmentation labels

149

are the l-th neighbor across all points on the i-th streamline. We introduce a

150

similarity measure between two streamlines fi and fj that expresses the joint

151

probability of their anatomical neighborhoods. We define this anatomical

EP

AC C

152

TE

146

similarity measure as: wA (fi , fj ) , |Li ∩ Lj |

P X

hHil , Hjl i,

(2)

l=0

153

where h·, ·i is the inner product, and Li , Lj are the sets of all labels found to

154

be neighbors of streamlines fi , fj . The normalization factor |Li ∩ Lj |, which 8

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

vl

Neighbor

0

[0, 0, 0]

Left unsegmented white matter

1

[1, 0, 0]

Left pallidum

2

[−1, 0, 0]

Left thalamus proper

3

[0, 0, 1]

Left paracentral white matter

4

[0, 0, −1]

Left ventral diencephalon

5

[1, 0, 1]

Left insular white matter

6

[1, 0, −1]

Left hippocampus

7

[−1, 0, 1]

Left caudate

EP

TE

D

l

8

[−1, 0, −1] Third ventricle

...

...

AC C

...

Figure 2: Example of the anatomical neighborhood of a point in the corticospinal tract. Neighboring FreeSurfer segmentation labels in the L-R and I-S directions are shown in the table.

9

ACCEPTED MANUSCRIPT

is the number of common neighbors between the two streamlines, penalizes

156

trivial streamlines with too few neighbors.

157

2.2. Normalized cuts

RI PT

155

We approach clustering as a graph partitioning problem, where the nodes

159

of the graph are the elements to be clustered and the weights of the edges

160

between nodes are the similarities between elements. These weights are used

161

to form a similarity matrix, and clusters are defined based on the eigenvectors

162

of this matrix. Given a connected graph G, the normalized cuts algorithm

163

searches for a graph cut that divides G into sets A and B (A ∩ B = ∅), by

164

minimizing similarity between A and B and maximizing similarity within

165

A and B (Shi & Malik, 2000). Clusters are split recursively, generating a

166

top-down hierarchical structure.

M AN U

SC

158

When clustering tractography data, each streamline represents a node on

168

a graph G, and the weight of each edge is the similarity between the nodes

169

it connects. In this case graph G is a clique, and thus each streamline is

170

connected to every other streamline. For the purpose of finding the optimal

171

cut of the graph into A and B, we need to quantify the similarity between A

174

175

176

TE

EP

173

and B. This is done by summing the weights of the edges that connect them: P s(A, B) = u∈A,v∈B w(u, v), where w(·, ·) is a similarity function between two

AC C

172

D

167

streamlines. To avoid trivial solutions where A or B is a single isolated node, P s is normalized by an association measure a(A) = u∈A,t∈G w(u, t). Thus the minimum cut is defined based on the following cost function: min snorm (A, B), A,B

177

snorm (A, B) =

s(A, B) s(A, B) + . a(A) a(B)

(3)

We can formulate this equation with matrix notation where W is the similar10

ACCEPTED MANUSCRIPT

P

178

ity matrix with Wij = w(i, j), D is a diagonal matrix with Dii =

179

and x the partitioning vector, where xi = 1 if the i-th element belongs to A

180

and xi = −1 is the i-th element belongs to B. The association measure will

181

be encoded in k =

183

:

(1 + x)T (D − W)(1 + x) (1 + x)T (D − W)(1 + x) + . k1T D1 (1 − k)1T D1

The problem is embedded in the real-valued domain and can be efficiently solved by the following equation (Shi & Malik, 2000): 1

M AN U

182

di

SC

snorm (A, B) =

xi >0 P di

1

D 2 (D − W)D 2 z = λz, 184

w(i, j)

RI PT

P

j

(4)

where z = Dy, and D − W is a Laplacian matrix.

The second eigenvector of the Laplacian matrix is a solution to the above

186

equation. The optimal cut is approximated by assigning the i-th node to A

187

if yi > 0, where yi ∈ R is the i-th element of y, and to B otherwise (Golub

188

& Van Loan, 1996). The algorithm is run iteratively by cutting the largest

189

remaining cluster, until a desired number of clusters is reached or a threshold

190

for minimum cluster size or dispersion is met.

191

2.3. Prototype streamlines

193

194

TE

EP

AC C

192

D

185

Building an affinity matrix requires computing the similarity between

every pair of streamlines, which is extremely time- and memory-intensive for the large, high-resolution data sets that are increasingly common. In

195

previous work, the size of the affinity matrix was reduced by processing left,

196

right, and inter-hemisphere streamlines separately (Guevara et al., 2012);

197

by using a random subset of streamlines to guide the clustering of the full 11

ACCEPTED MANUSCRIPT

set (O’Donnell & Westin, 2007; Siless et al., 2013; Ros et al., 2013; Zhang

199

et al., 2014); or by generating clustering solutions from multiple subsets of

200

streamlines and merging them (Visser et al., 2011).

RI PT

198

The approach of using a subset of the streamlines to compute the affinity

202

matrix has been advocated as a way to reduce sensitivity to outliers in hier-

203

archical clustering (Guha et al., 2001). When clustering tractography data,

204

each bundle contains hundreds to thousands of similar streamlines, resulting

205

in redundancy. Therefore it is reasonable to expect a subset of streamlines

206

to be sufficient for capturing the general shape of the clusters. In the divi-

207

sive algorithm that we use in this work, we evaluate the similarity matrix to

208

make a binary cut. In early stages of the algorithm, where computational de-

209

mands are highest because of the large number of streamlines to be divided,

210

these divisions are rather coarse (for example, separating the left hemisphere

211

from the right hemisphere, or the SLF from the arcuate fasciculus). This

212

means that we only need to select enough streamlines to ensure that we have

213

representive examples from these coarse sets.

TE

D

M AN U

SC

201

Here we select a random subset of the streamlines to be clustered at each

215

iteration. We generate the affinity matrix from this subset, which we refer to

216

as prototype streamlines. After finding the minimum cut for the prototype

217

streamlines, we assign each non-prototype streamline to the same cluster as

219

220

AC C

218

EP

214

its closest prototype streamline. As part of the experiments described below, we investigate the effect that the number of prototype streamlines has on the performance of our method.

12

ACCEPTED MANUSCRIPT

221

2.4. Data acquisition For the experimental evaluation of our unsupervised clustering, we use

223

dMRI and structural MRI (sMRI) data from 32 healthy subjects, avail-

224

able publicly through the MGH-USC Human Connectome Project (http:

225

//humanconnectomeproject.org). The data was acquired on the MGH

226

Siemens Connectom, a Skyra 3T MRI system with a custom gradient capa-

227

ble of maximum strength 300 mT/m and slew rate 200 T/m/s (Setsompop

228

et al., 2013). The sMRI data was acquired with MEMPRAGE (Mugler &

229

Brookeman, 1990; Van der Kouwe et al., 2008), TR=2530ms, TE=1.15ms,

230

TI=1100ms, 1mm isotropic resolution. The dMRI data was acquired with

231

2D EPI, TR=8800ms, TE=57ms, 1.5mm isotropic resolution, 512 gradient

232

directions, bmax = 10, 000s/mm2 . The data were pre-processed as described

233

in Fan et al. (2016).

234

2.5. Data analysis

D

M AN U

SC

RI PT

222

We reconstruct orientation distribution functions from the dMRI data us-

236

ing the generalized q-sampling imaging model (Yeh et al., 2010) and perform

237

whole-brain deterministic tractography using DSI Studio (Yeh et al., 2013).

238

We seed every voxel in the segmentation map computed by FreeSurfer. Previ-

239

ously, separate processing strategies have been proposed for longer vs. shorter

241

242

EP

AC C

240

TE

235

streamlines (Guevara et al., 2012; Zhang et al., 2014). Here we focus on long-range connections, as short-range connections, such as those representing U-fibers, could be clustered simply based on the anatomical regions that

243

they terminate in. To obtain a large number of long-range connections while

244

keeping computation tractable, we generate a total of 500,000 streamlines

245

per subject and then exclude any streamlines shorter than 55mm, leaving 13

ACCEPTED MANUSCRIPT

between 100,000 and 150,000 streamlines per subject. We downsample all

247

streamlines to N = 10 equispaced points.

RI PT

246

For comparison with our unsupervised clustering, we use a set of ma-

249

jor WM bundles labeled manually in each subject by a trained rater, fol-

250

lowing the protocol described in Wakana et al. (2007). This includes the

251

following bundles: corticospinal tract (CST), inferior longitudinal fascicu-

252

lus (ILF), uncinate fasciculus (UNC), anterior thalamic radiation (ATR),

253

cingulum - supracallosal bundle (CCG), cingulum - infracallosal (angular)

254

bundle (CAB), superior longitudinal fasciculus - parietal bundle (SLFP), su-

255

perior longitudinal fasciculus - temporal bundle (SLFT), corpus callosum -

256

forceps major (FMAJ), corpus callosum - forceps minor (FMIN). The intra-

257

hemispheric bundles, i.e., all except FMAJ and FMIN, are labeled on both

258

hemispheres (L and R), leading to a total of 18 manually labeled WM bun-

259

dles per subject. The rater interacts with the whole-brain tractography data

260

in TrackVis (http://trackvis.org), using the fractional anisotropy map and

261

the color-coded map of the primary eigenvector of the tensor for anatomical

262

guidance. For each of the bundles, at least two inclusion masks are hand-

263

drawn on slices of those maps (as specified in Wakana et al. (2007)) to isolate

264

the streamlines that belong to the bundle. Additional inclusion and exclu-

265

sion masks are drawn as needed to refine the labeling. Fig. 3 shows the 18

267

268

M AN U

D

TE

EP

AC C

266

SC

248

manually labeled WM bundles from an example subject. Anatomical segmentations are obtained by processing the sMRI data with

the automated cortical parcellation and subcortical segmentation tools in

269

FreeSurfer (Fischl et al., 2002, 2004). As part of this processing, subcortical

270

WM labels are defined by classifying each WM voxel that is within 5mm

14

(b)

SC

(a)

RI PT

ACCEPTED MANUSCRIPT

Figure 3: Sagittal (a) and axial (b) view of the 18 manually labeled bundles that we use for

M AN U

comparison to the unsupervised clustering results, shown in a randomly selected subject.

from the cortex based on its nearest cortical label. This results in a total

272

of K = 261 cortical and subcortical labels. Each subject’s dMRI and sMRI

273

data are aligned with a within-subject, boundary-based, affine registration

274

method, which optimizes the contrast of the b = 0 dMRI volumes across the

275

grey-white boundary surfaces reconstructed from the sMRI data (Greve &

276

Fischl, 2009). The subject’s anatomical segmentation is then mapped from

277

sMRI space to dMRI space by applying the inverse of that transformation.

TE

D

271

For each subject, we perform unsupervised clustering with normalized

279

cuts and each of the two similarity measures defined in section 2.1. As the

280

selection of the total number of clusters is arbitrary, we evaluate the effect

281

of this number throughout our experiments. We do this by running the

283

284

AC C

282

EP

278

clustering algorithm until a total of 200 clusters are generated, and then pruning the hierarchical clustering tree to keep the first 75, 100, 125, 150 or 200 clusters.

15

ACCEPTED MANUSCRIPT

2.6. Evaluation metrics

286

2.6.1. Comparison with manual labeling

RI PT

285

For each subject, we evaluate the extent to which the clusters obtained

288

with each of the two similarity measures resemble bundles labeled by a human

289

rater in the same subject. First, we compute the overlap, as quantified by the

290

Dice coefficient (Dice, 1945; Sørensen, 1948), between each manually labeled

291

bundle and the union of all clusters for which at least 5% of streamlines

292

belong to the manually labeled bundle. The goal of this evaluation is to

293

determine the extent to which streamlines that a human rater would classify

294

into a single anatomical structure become divided among disparate clusters.

295

Second, we compute the homogeneity and the completeness of each clus-

296

ter of streamlines with respect to the 18 manually labeled bundles in each

297

subject. These measures are defined in Rosenberg & Hirschberg (2007). Ho-

298

mogeneity is highest when each cluster contains elements from a single manu-

299

ally labeled bundle only and completeness is highest when elements from the

300

same manually labeled bundle are assigned to a single cluster. It is important

301

to evaluate both measures, as it is possible to achieve perfect homogeneity

302

by compromising completeness, i.e., by assigning each streamline to its own

303

cluster.

305

306

M AN U

D

TE

EP

AC C

304

SC

287

2.6.2. Anatomical and spatial consistency of clusters One might expect streamlines that are close to each other in Euclidean

space to also have common anatomical neighbors, and vice versa. Hence,

307

we evaluate the extent to which streamlines that are clustered based on

308

the anatomical similarity measure wA are also similar with respect to the

309

Euclidean similarity measure wE and vice versa. To this end, we find the 16

ACCEPTED MANUSCRIPT

centroid of each cluster, defined as the streamline closest to the average of all

311

streamlines in a cluster. We then evaluate wE and wA between every element

312

in a cluster and the cluster centroid.

RI PT

310

We also compute a distance metric that neither the wE -optimal nor the

314

wA -optimal clusters are optimized for. This is the mean of the closest-point

315

distance (Corouge et al., 2004; Moberts et al., 2005; O’Donnell et al., 2006;

316

Guevara et al., 2012) between two streamlines fi and fj , defined as:

317

N 1 X N min kxik − xjl k2 , N k=1 l=1

M AN U

ωCP (fi , fj ) ,

SC

313

where ωCP (fi , fj ) and ωCP (fj , fi ) are averaged to ensure symmetry: 1 wCP (fi , fj ) , (ωCP (fi , fj ) + ωCP (fj , fi )). 2 In our case, fj is the centroid of the cluster that fi belongs to.

319

2.6.3. Robustness to errors in the anatomical segmentation

D

318

(5)

As our proposed similarity measure relies on an anatomical segmentation,

321

we evaluate the robustness of the clustering to errors in the boundaries of the

322

anatomical segmentation labels. To this end, we perturb the boundaries of

323

the FreeSurfer cortical and subcortical segmentation labels in each subject,

324

and repeat the clustering using the perturbed segmentation. We perform

326

327

EP

AC C

325

TE

320

the perturbations by replacing the segmentation label of each voxel with a label selected randomly among the labels of its neighboring voxels. This is done on segmentations that have been mapped into dMRI space, thus the

328

perturbations are in the order of the dMRI voxel size (1.5mm). To establish

329

that this is comparable to the actual accuracy of the FreeSurfer anatomical

330

segmentation, we also show plots of the mean closest point distance between 17

ACCEPTED MANUSCRIPT

automatically segmented and manually labeled brain structures in 26 sub-

332

jects from the FreeSurfer atlas. We note that no manual editing has been

333

performed on the FreeSurfer reconstructions that we use in our experiments.

334

2.6.4. Effect of prototype streamlines

RI PT

331

For the evaluations described above, we used 500 randomly selected pro-

336

totype streamlines to compute the similarity matrices. We assess the impact

337

of this choice by repeating the clustering using 5 different sets of 500 proto-

338

type streamlines, and also by varying the number of prototype streamlines

339

between 50 and 3000.

340

2.6.5. Effect of anatomical neighborhood size

M AN U

SC

335

For the evaluations described above, we computed our anatomical similar-

342

ity measure using the 26-element neighborhood from Fig. 1. We investigate

343

the impact of this neighborhood size by repeating the clustering with the

344

6-element and 14-element anatomical neighborhoods.

345

2.6.6. Effect of streamline downsampling

TE

D

341

When computing the distance of tractography streamlines in Euclidean

347

space, it is common for clustering algorithms to downsample streamlines to

348

a constant number of points N (O’Donnell & Westin, 2007; Visser et al.,

350

351

AC C

349

EP

346

2011; Garyfallidis et al., 2012; Guevara et al., 2012; Wu et al., 2012; Siless et al., 2013). This reduces computation time for Euclidean distances dramatically, as it makes the task of finding corresponding points between two

352

streamlines trivial. Although our anatomical similarity measure does not re-

353

quire all streamlines to have an equal number of points, we did downsample

354

streamlines for the purpose of comparing to the Euclidean distance measure. 18

ACCEPTED MANUSCRIPT

We evaluate the effect of this downsampling on our anatomical similarity

356

measure by comparing results obtained with streamlines sampled to N = 10

357

and N = 50 points.

358

2.6.7. Comparison to an alternative segmentation

RI PT

355

Our method is unsupervised, i.e., it does not impose the existence of

360

a set of named WM bundles from an atlas. Nonetheless, we explore here

361

the use of labels from a WM atlas, as an alternative to the subjects’ indi-

362

vidual FreeSurfer cortical parcellation and subcortical segmentation labels.

363

Specifically, we use the well-known JHU-ICBM WM atlas (Wakana et al.,

364

2007; Hua et al., 2008; Oishi et al., 2010), as distributed with the FSL soft-

365

ware package (Jenkinson et al., 2012). We map this WM atlas to each for

366

our subjects’ individual dMRI space by a non-linear transformation that we

367

obtain from co-registering the fractional anisotropy maps of the atlas and

368

the individual using the FNIRT tool from FSL. We then cluster each sub-

369

ject’s streamlines by computing our anatomical similarity measure based on

370

the WM labels from this atlas, instead of the labels from the subject’s own

371

FreeSurfer cortical parcellation and subcortical segmentation.

372

2.6.8. Comparison to shape similarity

EP

TE

D

M AN U

SC

359

AC C

Finally, we compared our anatomical similarity to a similarity measure

376

377

of prototype streamlines.

373

374

375

based on streamline shape descriptors, as defined in Brun et al. (2004). Shape descriptors reduce the dimensionality of streamlines, so that the full set of streamlines can be used for clustering, as an alternative to using only a set

19

ACCEPTED MANUSCRIPT

378

2.6.9. Computation time The computational complexity of evaluating the elements of the similarity

380

matrix is O(Mp2 )Ow , where Mp the number of prototype streamlines and Ow

381

the computational complexity of the similarity measure for a pair of stream-

382

lines. The latter is Ow = O(N ) for the Euclidean similarity measure and

383

Ow = O(L) for the anatomical similarity measure, where L ≤ N the number

384

of different anatomical labels that are neighbors of a streamline in a single

385

direction, assuming that computations for neighbors in different directions

386

are parallelized. The Euclidean distance measure can benefit from a substan-

387

tial speed-up when all streamlines have the same number of points N , which

388

is why we chose to downsample streamlines to a constant N here. However,

389

the complexity of our anatomical similarity measure does not depend on

390

whether streamlines have equal lengths or not. Furthermore, our implemen-

391

tation saves histograms of anatomical neighbors using the unordered map

392

class of the standard C++ library, where the complexity of element search

393

is on average O(1).

394

3. Results

395

3.1. Comparison with manual labeling

399

AC C

EP

TE

D

M AN U

SC

RI PT

379

400

all subjects for each WM bundle, when the total number of clusters is 200.

401

The anatomical similarity measure performed 20% better than the Euclidean

396

397

398

Fig. 4(a) shows the Dice coefficient between the manually labeled WM

bundles and the streamline clusters obtained with each of the two similarity measures, averaged over all 18 bundles and 32 subjects, as a function of the total number of clusters. Fig. 4(b) shows the average Dice coefficient over

20

ACCEPTED MANUSCRIPT

similarity measure in terms of its agreement with bundles defined by a human

403

rater. A repeated measures analysis of variance, with factors of similarity

404

measure and number of clusters, showed a statistically significant effect of

405

similarity measure on the Dice coefficient (p < 0.0001).

M AN U

SC

RI PT

402

(a)

(b)

D

Figure 4: (a) Dice coefficient between the manually labeled WM bundles and the streamline clusters obtained with each of the two similarity measures, averaged over all 18 bundles

TE

and 32 subjects, as a function of the total number of clusters. (b) Average Dice coefficient over all subjects by tract, when the total number of clusters is 200.

Fig. 5 and Fig. 6 depict the clusters used in the above evaluation, aver-

407

aged across the 32 subjects in the space of the FMRIB58 FA template (Smith

408

et al., 2006) and displayed as isosurfaces. (Note that we mapped clusters to

410

411

AC C

409

EP

406

this template only for the purposes of this illustration. Otherwise, all computations were performed in the native dMRI space of each individual.) For reference, the figures also show population averages of the corresponding

412

manually labeled bundles. Although individual variability and finer differ-

413

ences between methods cannot be appreciated from these population aver-

414

ages, the figures suggest that the 5% threshold that we use to select the 21

ACCEPTED MANUSCRIPT

415

clusters included in this evaluation is reasonable. Fig. 7 shows plots of homogeneity (a) and completeness (b) for both

417

the anatomical and Euclidean similarity measure. Our anatomical similarity

418

measure outperforms the Euclidean similarity measure in both homogeneity

419

and completeness. A repeated measures analysis of variance, with factors of

420

similarity measure and number of clusters, showed a statistically significant

421

effect of similarity measure on both homogeneity and completeness (p <

422

0.0001).

SC

RI PT

416

We also evaluated a combined similarity measure, summing the anatom-

424

ical and Euclidean similarities after normalizing them to account for their

425

different variances. We did not find any statistically significant difference

426

between the performance of the combined similarity measure and the anatom-

427

ical similarity itself (p = 0.85 for Dice coefficient, p = 0.60 for homogeneity,

428

and p = 0.72 for completeness, based on a repeated measures analysis of

429

variance with factors of similarity measure and number of clusters). This

430

suggests that, once the anatomical neighborhood of the streamlines has been

431

taken into account, the spatial coordinates of the streamlines may not con-

432

tribute additional information to the clustering. Hence, no further results

433

are shown for the combination of similarity measures.

435

436

437

D

TE

EP

Finally, we evaluated clustering the streamlines after mapping them to

AC C

434

M AN U

423

the subject’s sMRI space, instead of mapping the cortical parcellation and subcortical segmentation labels to dMRI space. We did not find statistically significant differences between the performance of the anatomical similarity

438

measure in sMRI and dMRI space (p = 0.62 for Dice coefficient, p = 0.55 for

439

homogeneity, and p = 0.9 for completeness, based on a repeated measures

22

ACCEPTED MANUSCRIPT

L-ATR

L-CCG

M AN U

SC

Anatomical

AC C

EP

TE

D

Euclidean Euclidean Anatomical Manual

L-CAB

RI PT

FMIN

Manual

FMAJ

Figure 5: Population averages of the manually labeled bundles and the streamline clusters that contain more than 5% streamlines overlapping with the corresponding manually labeled bundle. Each color represents a different WM pathway. The population averages across all subjects are shown as isosurfaces in axial and sagittal views.

23

ACCEPTED MANUSCRIPT

L-ILF

L-SLFP

Manual

L-UNC

M AN U

SC

Anatomical

AC C

EP

TE

D

Euclidean Euclidean Anatomical Manual

L-SLFT

RI PT

L-CST

Figure 6: Population averages of the manually labeled bundles and the streamline clusters that contain more than 5% streamlines overlapping with the corresponding manually labeled bundle (continued from Fig. 5). Each color represents a different WM pathway. The population averages across all subjects are shown as isosurfaces in axial and sagittal views.

24

SC

RI PT

ACCEPTED MANUSCRIPT

(b)

M AN U

(a)

Figure 7: Homogeneity (a) and completeness (b) of unsupervised clustering, averaged over 18 WM bundles and 32 subjects, as a function of the number of clusters.

analysis of variance with factors the number of clusters and the space that

441

the clustering was performed in). Throughout the paper, the anatomical

442

similarity measure is computed in dMRI space.

443

3.2. Anatomical and spatial consistency of clusters

TE

D

440

Fig. 8(a) shows plots of wE for clusters derived by optimizing either wE

445

or wA . Fig. 8(b) shows plots of wA for clusters derived by optimizing either

446

wE or wA . Finally, Fig. 8(c) shows plots of the mean closest-point distance,

447

wCP , for clusters derived by optimizing either wE or wA .

451

AC C

EP

444

452

of clusters (Fig. 8(a)). This is confirmed by a repeated measures analy-

453

sis of variance on these differences, with factors of similarity measure and

448

449

450

Of course we expect that the wE -optimal clusters have higher wE than the

wA -optimal clusters, and vice versa. This is confirmed in the plots. However, we find that the difference in wA between the wE -optimal and wA -optimal clusters (Fig. 8(b)) is greater than the difference in wE between the two sets

25

ACCEPTED MANUSCRIPT

number of clusters (p < 10−8 ). Furthermore, there is no difference in the

455

within-cluster mean closest-point distance between the two sets of clusters

456

(Fig. 8(c)). This is confirmed by a repeated measures analysis of variance

457

on wCP , with factors of similarity measure and number of clusters (p = 0.3).

458

These results imply that the anatomical similarity measure tends to group

459

together streamlines that are also close to each other in Euclidean space,

460

whereas the Euclidean distance similarity measure may not necessarily yield

461

clusters that are anatomically consistent.

M AN U

SC

RI PT

454

(b)

(c)

D

(a)

TE

Figure 8: Euclidean similarity measure wE (a), anatomical similarity measure wA (b), and mean closest-point distance wCP (c), for streamlines clustered together either by the Euclidean or the anatomical similarity measure (wE -optimal vs. wA -optimal). All measures

EP

are plotted as a function of the number of clusters, averaged over all subjects.

A visual illustration of this is shown in Fig. 9. For four pairs of anatomical

463

segmentation labels, we show the clusters for which at least 5% of streamlines

466

AC C

462

467

For example, streamlines that lie on opposite sides of the midline but are

468

close to each other in space may be erroneously clustered together by wE ,

464

465

pass through both labels, when the number of clusters is 200. As seen in the figure, the Euclidean distance similarity measure tends to produce noisier and less anatomically consistent clusters than the anatomical similarity measure.

26

ACCEPTED MANUSCRIPT

RI PT

Euclidean similarity

M AN U

SC

Anatomical similarity

(a)

(b)

(c)

(d)

Figure 9: Clusters from a single subject, obtained with each of the two similarity measures.

D

Clusters were selected so that at least 5% of the streamlines in a cluster pass through a pair of anatomical segmentation labels: (a) precentral and brainstem, (b) superior parietal and

cingulate.

TE

brainstem, (c) superior temporal and precentral, (d) isthmus cingulate and rostral anterior

but not by wA (see Fig. 9(d)).

470

3.3. Robustness to errors in the anatomical segmentation

474

AC C

EP

469

475

unsupervised clustering, when the original and perturbed segmentations are

476

used to compute the anatomical similarity measure. Based on repeated mea-

471

472

473

Fig. 10 shows examples of an original and perturbed segmentation that we

used to assess the robustness of the clustering to errors in the boundaries of the anatomical segmentation labels. Fig. 11 shows plots of the Dice coefficient with manually labeled bundles, the homogeneity, and the completeness of the

27

RI PT

ACCEPTED MANUSCRIPT

SC

Figure 10: Example of an original (left) and perturbed (right) indivudual anatomical

M AN U

segmentation used to evaluate robustness to segmentation errors.

(b)

(c)

D

(a)

Figure 11: Dice coefficient with respect to manually labeled bundles (a), homogeneity (b),

TE

and completeness (c) of unsupervised clustering with our anatomical similarity measure, when the subjects’ original and perturbed anatomical segmentations are used.

sures analyses of variance with factors of segmentation type and number of

478

clusters, the effect of the segmentation on the performance measures was not

479

statistically significant (p = 0.79 for Dice coefficient, p = 0.35 for homogene-

481

482

AC C

480

EP

477

ity, and p = 0.36 for completeness). This suggests that our results are robust

to errors in the individual anatomical segmentation. The perturbations that we performed for the above evaluation were in the

483

order of one dMRI voxel. For reference, Fig. 12 shows the accuracy of the au-

484

tomated FreeSurfer segmentation for a set of internal brain structures. The

28

ACCEPTED MANUSCRIPT

plots show the mean closest-point distance between the automatically seg-

486

mented and manually labeled structures. The mean error over all structures

487

is 0.9mm, which is less than the sMRI resolution, and much less than the

488

dMRI resolution. Hence, typical FreeSurfer segmentation errors are smaller

489

than the perturbations that we performed in our experiment.

490

3.4. Effect of prototype streamlines

SC

RI PT

485

Supplemental Fig. S1 shows how the number of prototype streamlines

492

affects the Dice coefficient with manually labeled bundles, the homogeneity,

493

the completeness, and the within-cluster anatomical similarity. As seen in

494

the plots, results obtained with the anatomical similarity measure are very

495

similar for the different numbers of prototype streamlines (50, 100, 250, 500,

496

1000, 3000). The anatomical similarity measure with 50 prototype stream-

497

lines still outperforms the Euclidean distance similarity measure with 500

498

prototype streamlines (p = 0.002 for Dice and p < 0.0001 for homogene-

499

ity and completeness, based on a repeated measures analysis of variance

500

with factors of similarity measure and number of clusters). Supplemental

501

Fig. S2 shows results using 5 different randomly selected sets of 500 pro-

502

totype streamlines to compute the anatomical similarity measure. For all

503

sets, the anatomical similarity measure outperforms the Euclidean distance

505

506

D

TE

EP

AC C

504

M AN U

491

similarity measure with 500 prototype streamlines (p < 0.0001 based on a repeated measures analysis of variance with factors of similarity measure and number of clusters). Moreover, the difference between the results obtained

507

with the 5 random sets of prototype streamlines used for the anatomical

508

similarity measure was not significant (p > 0.6 for Dice, homogeneity and

509

completeness, based on a repeated measures analysis of variance with factors 29

ACCEPTED MANUSCRIPT

of prototype set and number of clusters, for each pair of sets).

511

3.5. Effect of anatomical neighborhood size

RI PT

510

Supplemental Fig. S3 shows how the size of the anatomical neighborhood

513

used for clustering affects the Dice coefficient with manually labeled bundles,

514

the homogeneity, the completeness, and the within-cluster anatomical sim-

515

ilarity. Although the 6-element neighborhood leads to somewhat degraded

516

performance, all three anatomical neighborhood sizes perform better than

517

the Euclidean distance similarity measure (p < 0.0001 based on repeated

518

measures analysis of variance with factors of similarity measure and number

519

of clusters).

520

3.6. Effect of streamline downsampling

M AN U

SC

512

Supplemental Fig. S4 shows how downsampling streamlines to N = 10

522

vs. N = 50 points affects the Dice coefficient with manually labeled bundles,

523

the homogeneity, the completeness, and the within-cluster anatomical simi-

524

larity. Although the plots show a modest improvement in the Dice coefficient

525

and homogeneity when N = 50 points are used, the difference is not signif-

526

icant (p = 0.1 for Dice coefficient, p = 0.2 for the homogeneity, p = 0.7 for

527

the completeness and p = 0.1 for the anatomical similarity evaluated with

529

530

TE

EP

AC C

528

D

521

repeated measures analysis of variance with factors of similarity measure and number of clusters). 3.7. Comparison to an alternative segmentation

531

Supplemental Fig. S5 shows results from performing clustering with our

532

anatomical similarity measure, after replacing the subjects’ individual FreeSurfer

30

ACCEPTED MANUSCRIPT

cortical parcellation and subcortical segmentation with the JHU-ICBM WM

534

atlas. We find that, for the purposes of unsupervised clustering of whole-brain

535

tractography with our anatomical similarity measure, using the individual’s

536

cortical and subcortical labels from FreeSurfer outperforms using labels from

537

a WM atlas (p = 0.0007 for Dice coefficient, and p = 0.0001 for homogeneity

538

and completeness, based on a repeated measures analysis of variance with

539

factors of label set and number of clusters).

540

3.8. Comparison to shape similarity

M AN U

SC

RI PT

533

Results from the comparison of our anatomical similarity measure to a

542

shape similarity measure are shown in Supplemental Fig. S6. The anatomical

543

similarity measure outperformed the shape similarity measure (p = 0.0003 for

544

Dice coefficient, and p < 0.0001 for homogeneity and completeness, based on

545

a repeated measures analysis of variance with factors of similarity measure

546

and number of clusters). As seen in the figure, the anatomical similarity

547

measure yielded a 30% improvement of the Dice coefficient between clusters

548

and manually labeled WM bundles, when compared to the shape similarity

549

measure.

550

3.9. Computation time

554

AC C

EP

TE

D

541

555

putations of anatomical similarity for different neighborhood elements are

556

parallelized. Otherwise, the computation time for the anatomical similarity

551

552

553

Table 1 shows computation times for clustering a randomly selected sub-

ject’s data with the anatomical and Euclidean distance similarity measure, for six different numbers of prototype streamlines and two different numbers of points per streamline. These times are obtained by assuming that com-

31

ACCEPTED MANUSCRIPT

Number of prototype streamlines

50

100

250

500

1000

3000

9.09

21.37 60.07 243.46 1951.30

Euclidean similarity (N = 10) 2.45

4.16

10.14 41.26 282.30 1878.08

RI PT

Anatomical similarity (N = 10) 5.31

Anatomical similarity (N = 50) 8.46 12.57 27.44 74.16 260.20 2347.28 Euclidean similarity (N = 50) 9.20 15.26 36.03 93.29 321.49 2392.37

SC

Table 1: Computation times (in minutes) for clustering a randomly selected subject’s

data with the two similarity measures and different numbers of prototype streamlines or numbers of points N , assuming that computations of anatomical similarity for different

M AN U

neighborhood elements are parallelized.

measure would also depend on the size of the anatomical neighborhood that

558

is used. Computation is faster for the Euclidean similarity measure when the

559

number of sampled points is small, but it is faster for the anatomical similar-

560

ity measure when the number of sampled points is large. Note, however, that

561

it would be possible to accelerate computation for the Euclidean similarity

562

measure by parallelizing over the sampled points. Times are reported for a

563

quad-core Xeon 5472 with 3.0GHz CPUs and 7GB of RAM.

564

3.10. Visual illustration

EP

TE

D

557

Fig. 13 shows an example of the hierarchical tree generated with the

566

anatomical similarity measure for a randomly selected subject. The root of

567

568

569

AC C

565

the tree is the full set of streamlines and the leaves of the tree are the 200 streamline clusters. For a set of nodes that are marked (a)-(i) on the tree, we show images of the clusters at the corresponding iterations of the algorithm:

570

• Low-level nodes represent coarse anatomical divisions of the stream-

571

lines: (a) left hemisphere; (b) posterior components of the corpus cal32

ACCEPTED MANUSCRIPT

losum; (c) right hemisphere; (d) anterior and middle components of the

573

corpus callosum.

RI PT

572

• Mid-level nodes represent streamline groupings with greater anatomical

575

specificity: (e) left corticospinal tract and thalamic radiations; (f) left

576

superior longitudinal fasciculus and arcuate fasciculus; (g) left cingulum

577

bundle.

579

580

581

• High-level nodes are individual WM pathways: (h) the right corticospinal tract.

M AN U

578

SC

574

• Leaves (terminal-level nodes) are subdivisions of larger WM pathways: (i) sub-bundle of the left arcuate fasciculus.

In the supplemental material, we provide a video with an example of the

583

tree hierarchy for the anatomical similarity measure. The video shows the left

584

hemisphere clusters, including the subdivisions of the superior longitudinal

585

fasciculus and the arcuate fasciculus.

586

4. Discussion

EP

TE

D

582

In this work we propose a novel anatomical similarity measure for un-

588

supervised clustering of tractography streamlines. Our similarity measure

591

AC C

587

592

anatomical similarity measure into a divisive hierarchical clustering algo-

593

rithm, normalized cuts, and compare its performance to that of a conven-

589

590

does not use the spatial coordinates of the streamlines, relying instead on histograms of the anatomical segmentation labels that each streamline goes through or next to, at every point along its trajectory. We incorporate this

33

ACCEPTED MANUSCRIPT

tional similarity measure based on the Euclidean distance between stream-

595

lines.

RI PT

594

Our results show that streamline clusters obtained with our anatomical

597

similarity measure are in better agreement with WM bundles labeled manu-

598

ally by a trained rater. Specifically, there is a 20% improvement in the overlap

599

with the manually labeled bundles when streamlines where clustered based

600

on their anatomical similarity than their Euclidean distance (Fig. 4). Visual

601

inspection of average clusters across subjects suggests that, when streamlines

602

are clustered based on Euclidean distance, clusters that overlap with these

603

manually labeled bundles include streamlines that are close to the bundles

604

but that would not be assigned to these bundles by a human rater (Fig. 5,

605

Fig. 6). Furthermore, we find that the anatomical similarity measure leads

606

to an increase in both the homogeneity and the completeness of clusters with

607

respect to the manually labeled bundles, when compared to the Euclidean

608

distance-based similarity (Fig. 7). Importantly, the increased agreement be-

609

tween clusters and manually labeled bundles is achieved without introducing

610

any prior information from manual labeling. Our method is unsupervised, us-

611

ing only the subject’s own cortical and subcortical segmentation labels. The

612

improvement stems from the fact that, by not grouping streamlines based on

613

their proximity but based on their anatomical neighborhood, the algorithm

615

616

M AN U

D

TE

EP

AC C

614

SC

596

follows an approach that resembles the anatomical criteria used by human raters to identify WM bundles. We also evaluate the anatomical and spatial consistency of clusters ob-

617

tained by optimizing the anatomical or Euclidean distance similarity mea-

618

sure (Fig. 8). We find that the clusters obtained with the two methods differ

34

ACCEPTED MANUSCRIPT

significantly more in terms of within-cluster anatomical similarity than they

620

differ in terms of within-cluster Euclidean distance. Furthermore, the within-

621

cluster mean closest-point distance is not significantly different between the

622

clusters obtained with the two methods. This suggests that streamlines that

623

share common anatomical neighbors tend to also be close together in space,

624

but the reverse is not necessarily true (see also Fig.9). It has been pointed

625

out previously that clustering based on spatial coordinates cannot capture

626

all anatomical subdivisions, if the streamlines in the subdivisions do not vary

627

significantly in terms of shape (O’Donnell et al., 2006).

M AN U

SC

RI PT

619

We assess the robustness of our anatomical similarity measure to errors

629

in the anatomical segmentation by repeating the clustering after perturbing

630

the boundaries of the segmentation labels. We find that there is no signif-

631

icant deterioration in the performance of the anatomical similarity measure

632

when the perturbed segmentations are used instead of the original segmen-

633

tations (Fig. 11). We also show that errors in the automated anatomical

634

segmentation labels obtained by FreeSurfer are smaller than the range of the

635

perturbations performed in our experiment, and smaller than typical dMRI

636

voxel sizes (Fig. 12).

EP

TE

D

628

We have chosen to use a randomly selected set of prototype streamlines

638

to compute the similarity matrix at each iteration of the normalized cuts

639

640

641

AC C

637

algorithm. This makes computation tractable for the large data sets that result from high-resolution dMRI acquisitions such as the one used here. The number of prototype streamlines has a modest impact on the results,

642

and does not change the conclusions of our performance comparison between

643

similarity measures (Supplemental Fig. S1 and Supplemental Fig. S2). We

35

ACCEPTED MANUSCRIPT

conclude that a choice of Np = 500 prototype streamlines is sensible. Note

645

that the number of streamlines to be divided with normalized cuts decreases

646

at each iteration of the algorithm. At the final stages of the clustering (as

647

we get further away from the root of the hierarchical tree of Fig. 13, thus

648

the clusters become finer and greater accuracy becomes crucial), Np = 500

649

represents a large portion of the total number of streamlines to be divided.

SC

RI PT

644

The number of anatomical neighbors included in our proposed similarity

651

measure is another factor that affects computational complexity, although

652

histogram computations for different neighbors are fully parallelizable. We

653

find a modest improvement in performance when anatomical neighborhoods

654

of size greater than 6 are used (Supplemental Fig. S3), but the anatomical

655

similarity measure outperforms the Euclidean distance similarity measure

656

for any neighborhood size. Finally, for all the results shown here, only long-

657

range connections (streamlines longer than 55mm) are included to reduce

658

computation. Our approach can be applied to full tractography data sets

659

without a reduction in performance at the expense of longer computation

660

times for both similarity measures.

TE

D

M AN U

650

Our evaluation of the anatomical similarity measure after downsampling

662

streamlines to N = 10 vs. N = 50 equispaced points shows that increas-

663

ing this number may improve results somewhat (Supplemental Fig. S4),

665

666

AC C

664

EP

661

although this improvement is not statistically significant. Here we downsampled streamlines to a fixed N for a more balanced comparison to the Euclidean distance similarity measure, which relies on downsampling to re-

667

duce computation time. However, our proposed anatomical similarity mea-

668

sure does not require streamlines to have the same number of points, and its

36

ACCEPTED MANUSCRIPT

669

computational complexity is not impacted by this number. We have chosen to compute the anatomical similarity of streamlines with

671

respect to a set of cortical parcellation and subcortical segmentation labels,

672

instead of labels from a WM atlas, for two reasons. First, our goal was to

673

develop an unsupervised method, rather than impose the existence of certain

674

WM bundles through the use of an atlas. Our approach allows exploratory

675

analyses to be performed on tractography data from the whole brain, instead

676

of being limited to a set of pre-defined bundles. Second, our similarity mea-

677

sure relies not only on the structures that a streamline goes through, but also

678

on the structures that surround it, in a set of different directions. Therefore,

679

one might expect a segmentation that includes cortical and subcortical struc-

680

tures to perform well in that setting. Indeed, our results suggest that our

681

anatomical similarity measure works better when we compute it using labels

682

of cortical and subcortical structures, rather than labels from a WM atlas

683

(Supplemental Fig. S5). That is, although a WM atlas can be an extremely

684

valuable tool for supervised reconstruction of specific pathways, it does not

685

appear to be optimal for the purposes of whole-brain tractography clustering

686

based on anatomical similarity.

EP

TE

D

M AN U

SC

RI PT

670

The similarity measure that we have proposed involves the histograms

688

of the anatomical neighbors of tractography streamlines. These histograms

689

690

691

AC C

687

provide an “anatomical signature” that allows us to describe streamlines without relying on their spatial coordinates in a native or template space. We expect this signature to be useful for finding corresponding bundles, e.g.,

692

between subjects or between hemispheres, without the need for an exact

693

point-by-point transformation between them. This is an exciting prospect

37

ACCEPTED MANUSCRIPT

for large population studies, and it will be very important to evaluate our

695

method on larger data sets than the one used here. Early results from apply-

696

ing AnatomiCuts to a much larger set of data from the MGH-Harvard-USC

697

pilot Lifespan Human Connectome Project show promise for using the pro-

698

posed anatomical similarity measure to find corresponding clusters across

699

subjects aged 8–90 without co-registration (Siless et al., 2017). We intend to

700

investigate this further and to evaluate our method on other large data sets

701

from different disease populations in the future.

SC

RI PT

694

In previous work, we used a similar description of the anatomical neigh-

703

borhood of a path to introduce priors into a global probabilistic tractography

704

algorithm (Yendiki et al., 2011). In that work, histograms of the anatomical

705

neighbors of tractography streamlines were computed from training subjects,

706

where the bundles of interest had been labeled manually, and used as anatom-

707

ical priors when performing tractography in a novel subject. In the present

708

work, we do not rely on training data; we introduce anatomical informa-

709

tion only from the subject’s own anatomical segmentation, and only after

710

the tractography has been performed. On the one hand, this allows us to

711

perform exploratory, whole-brain tractography analyses, without being con-

712

strained to a set of bundles included in an atlas. On the other hand, because

713

the tractography is performed in an unconstrained way, it is prone to all the

715

716

D

TE

EP

AC C

714

M AN U

702

usual errors due to crossing WM bundles, such as streamlines ending prematurely or jumping from one bundle to another. Post hoc clustering methods, like the one presented here, cannot recover from such errors, as they do not

717

modify the tractography streamlines—they only assign the streamlines into

718

groups. It would be possible to use the two approaches in a complementary

38

ACCEPTED MANUSCRIPT

way. One could apply the unsupervised clustering presented here to high-

720

quality dMRI data (e.g., from the Connectom scanner) to generate clusters

721

from the whole brain, include the clusters that occur consistently across sub-

722

jects into an atlas to generate priors for the approach in Yendiki et al. (2011),

723

and then apply the latter to reconstruct the same pathways robustly in more

724

routine-quality dMRI data.

725

5. Conclusion

M AN U

SC

RI PT

719

We present AnatomiCuts, a method for unsupervised hierarchical cluster-

727

ing of dMRI tractography data based on anatomical similarity. We compare

728

this to the conventional approach of using a similarity based on Euclidean

729

distance. We find that the anatomical similarity yields results that are more

730

consistent with manual labeling relative to the Euclidean similarity. That

731

is, without introducing any training data from human raters, we are able to

732

obtain results that are in closer agreement with such a rater. We achieve this

733

simply by using a similarity measure that is better at replicating how a hu-

734

man with neuroanatomical expertise would segment WM tracts, i.e., based

735

on the anatomical structures that they either intersect or neighbor, every-

736

where along the tracts’ trajectory. This allows us to obtain anatomically

738

739

TE

EP

AC C

737

D

726

meaningful WM bundles without being limited to a set of tracts included in an atlas. Therefore we expect our approach to facilitate fully unsupervised analyses of whole-brain tractography data in large population studies.

39

ACCEPTED MANUSCRIPT

740

Acknowledgements Support for this research was provided in part by the National Institute

742

for Biomedical Imaging and Bioengineering (P41EB015896, 1R01EB023281,

743

R01EB006758, R21EB018907, R01EB019956), the National Institute for Men-

744

tal Health (U01-MH108168; Boston Adolescent Neuroimaging of Depres-

745

sion and Anxiety project), the National Institute on Aging (R01AG008122,

746

R01AG016495), the National Institute of Diabetes and Digestive and Kidney

747

Diseases (R21DK108277), the National Institute for Neurological Disorders

748

and Stroke (R01NS0525851, R21NS072652, R01NS070963, R01NS083534,

749

U01NS086625), and was made possible by the resources provided by Shared

750

Instrumentation Grants 1S10RR023401, 1S10RR019307, and 1S10RR023043.

751

Additional support was provided by the NIH Blueprint for Neuroscience Re-

752

search (U01-MH093765; part of the multi-institutional Human Connectome

753

Project, T90DA022759/R90DA023427). In addition, BF has a financial in-

754

terest in CorticoMetrics, a company whose medical pursuits focus on brain

755

imaging and measurement technologies. BF’s interests were reviewed and

756

are managed by Massachusetts General Hospital and Partners HealthCare in

757

accordance with their conflict of interest policies.

758

Basser, P.J., Pajevic, S., Pierpaoli, C., Duda, J., & Aldroubi, A. 2000. In vivo

760

761

762

SC

M AN U

D

TE

EP

AC C

759

RI PT

741

fiber tractography using DT-MRI data. Magnetic Resonance in Medicine, 44(4), 625–632.

Brun, A., Knutsson, H., Park, H.J., Shenton, M.E., & Westin, C.F. 2004. Clustering Fiber Traces Using Normalized Cuts. International Confer-

40

ACCEPTED MANUSCRIPT

ence on Medical Image Computing and Computer-Assisted Intervention,

764

3216/2004(3216), 368–375.

RI PT

763

Corouge, I., Gouttard, S., & Gerig, G. 2004 (April). Towards a shape model

766

of white matter fiber bundles using diffusion tensor MRI. Pages 344–

767

347 Vol. 1 of: 2004 2nd IEEE International Symposium on Biomedical

768

Imaging: Nano to Macro (IEEE Cat No. 04EX821).

770

Dice, L.R. 1945. Measures of the Amount of Ecologic Association Between Species. Ecology, 26(3), 297–302.

M AN U

769

SC

765

771

Ding, Z., Gore, J.C., & Anderson, A.W. 2003. Classification and quantifi-

772

cation of neuronal fiber pathways using diffusion tensor MRI. Magnetic

773

Resonance in Medicine, 49(4), 716–721.

Fan, Q., Witzel, T., Nummenmaa, A., Van Dijk, K.R.A., Van Horn, J.D.,

775

Drews, M.K., Somerville, L.H., Sheridan, M.A., Santillana, R.M., Snyder,

776

J., Hedden, T., Shaw, E.E., Hollinshead, M.O., Renvall, V., Zanzonico, R.,

777

Keil, B., Cauley, S., Polimeni, J.R., Tisdall, D., Buckner, R.L., Wedeen,

778

V.J., Wald, L.L., Toga, A.W., & Rosen, B.R. 2016. MGH-USC Human

779

Connectome Project datasets with ultra-high b-value diffusion MRI. Neu-

780

roImage, 124, 1108–1114.

782

783

TE

EP

AC C

781

D

774

Fischl, B., Salat, David H., Busa, E., Albert, M., Dieterich, M., Haselgrove, C., Van Der Kouwe, A., Killiany, R., Kennedy, D., Klaveness, S., Montillo, A., Makris, N., Rosen, B., & Dale, A.M. 2002. Whole brain segmentation:

784

Automated labeling of neuroanatomical structures in the human brain.

785

Neuron, 33(3), 341–355. 41

ACCEPTED MANUSCRIPT

Fischl, B., Van Der Kouwe, A., Destrieux, C., Halgren, E., S´egonne, F., Salat,

787

D.H., Busa, E., Seidman, L.J., Goldstein, J., Kennedy, D., Caviness, V.,

788

Makris, N., Rosen, B., & Dale, A.M. 2004. Automatically Parcellating the

789

Human Cerebral Cortex. Cerebral Cortex, 14(1), 11–22.

RI PT

786

Garyfallidis, E., Brett, M., Correia, M.M., Williams, G.B., & Nimmo-Smith,

791

I. 2012. QuickBundles, a Method for Tractography Simplification. Fron-

792

tiers in neuroscience, 6(December), 175.

SC

790

Gerig, G., Gouttard, S., & Corouge, I. 2004. Analysis of brain white matter

794

via fiber tract modeling. Pages 4421–4424 of: Engineering in Medicine and

795

Biology Society, 2004. IEMBS ’04. 26th Annual International Conference

796

of the IEEE, vol. 2.

M AN U

793

Golub, G.H., & Van Loan, C.F. 1996. Matrix Computations.

798

Greve, D.N., & Fischl, B. 2009. Accurate and robust brain image alignment using boundary-based registration. NeuroImage, 48(1), 63–72.

TE

799

D

797

Guevara, P., Poupon, C., Rivi`ere, D., Cointepas, Y., Descoteaux, M.,

801

Thirion, B., & Mangin, J.F. 2011. Robust clustering of massive trac-

802

tography datasets. NeuroImage, 54(3), 1975–1993.

806

AC C

EP

800

807

Guha, S., Rastogi, R., & Shim, K. 2001. CURE: An efficient clustering

803

804

805

808

Guevara, P., Duclap, D., Poupon, C., Marrakchi-Kacem, L., Fillard, P., Le Bihan, D., Leboyer, M., Houenou, J., & Mangin, J.F. 2012. Automatic fiber bundle segmentation in massive tractography datasets using a multisubject bundle atlas. NeuroImage, 61(4), 1083–1099.

algorithm for large databases. Information Systems, 26(1), 35–58. 42

ACCEPTED MANUSCRIPT

Hua, K., Zhang, J., Wakana, S., Jiang, H., Li, X., Reich, D.S., Calabresi,

810

P. A., Pekar, J.J., Van Zijl, P.C.M., & Mori, S. 2008. Tract probability

811

maps in stereotaxic spaces: Analyses of white matter anatomy and tract-

812

specific quantification. NeuroImage, 39(1), 336 – 347.

814

Jenkinson, M., Beckmann, C.F., Behrens, T.E.J., Woolrich, M.W., & Smith, S.M. 2012. FSL.

SC

813

RI PT

809

Jin, Y., Shi, Y., Zhan, L., Gutman, B.A, de Zubicaray, G.I., McMahon,

816

K.L., Wright, M.J., Toga, A.W., & Thompson, P.M. 2014. Automatic

817

clustering of white matter fibers in brain diffusion MRI with an application

818

to genetics. NeuroImage, 100, 75–90.

M AN U

815

Le Bihan, D., Breton, E., Lallemand, D., Grenier, P., Cabanis, E., & Laval-

820

Jeantet, M. 1986. MR imaging of intravoxel incoherent motions: appli-

821

cation to diffusion and perfusion in neurologic disorders. Radiology, 161,

822

401–407.

TE

D

819

Lehman, J.F., Greenberg, B.D., McIntyre, C.C., Rasmussen, S.A., & Haber,

824

S.N. 2011. Rules ventral prefrontal cortical axons use to reach their tar-

825

gets: implications for diffusion tensor imaging tractography and deep brain

826

stimulation for psychiatric illness. Journal of Neuroscience, 31(28), 10392–

828

829

830

AC C

827

EP

823

10402.

Li, H., Xue, Z., Guo, L., Liu, T., Hunter, J., & Wong, S.T.C. 2010. A hybrid approach to automatic clustering of white matter fibers. NeuroImage, 49(2), 1249 – 1258.

43

ACCEPTED MANUSCRIPT

Maddah, M., Z¨ollei, L., Grimson, W.E.L., Westin, C.F., & Wells, W.M. 2008.

832

A mathematical framework for incorporating anatomical knowledge in DT-

833

MRI analysis. Pages 105–108 of: 2008 5th IEEE International Symposium

834

on Biomedical Imaging: From Nano to Macro, Proceedings, ISBI.

RI PT

831

Makris, N., & Pandya, D.N. 2008. The extreme capsule in humans and

836

rethinking of the language circuitry. Brain Structure and Function, 213(3),

837

343–358.

SC

835

Moberts, B., Vilanova, A., & Van Wijk, J.J. 2005 (Oct). Evaluation of

839

fiber clustering methods for diffusion tensor imaging. Pages 65–72 of:

840

Visualization, 2005. VIS 05. IEEE.

M AN U

838

Mori, S., Crain, B.J., Chacko, V.P., & Zijl, P.V. 1999. Three-dimensional

842

tracking of axonal projections in the brain by magnetic resonance imaging.

843

Ann. Neurol., 45(2), 265–9.

TE

D

841

Mugler, J.P., & Brookeman, J.R. 1990. Three-dimensional magnetization-

845

prepared rapid gradient-echo imaging (3D MP RAGE). Magnetic Reso-

846

nance in Medicine, 15(1), 152–157.

848

849

850

O’Donnell, L., Kubicki, M., Shenton, M. E., Dreusicke, M., Grimson, W.E.L.,

AC C

847

EP

844

& Westin, C.F. 2006. A Method for Clustering White Matter Fiber Tracts. AJNR, 27(5), 1032–1036.

O’Donnell, L.J., & Westin, C.F. 2007. Automatic tractography segmenta-

851

tion using a high-dimensional white matter atlas. IEEE Transactions on

852

Medical Imaging, 26(11), 1562–1575.

44

ACCEPTED MANUSCRIPT

854

Oishi, K., Faria, A.V., Van Zijl, P.C.M., & Mori, S. 2010. MRI Atlas of Human White Matter. Academic Press.

RI PT

853

Ros, C., G¨ ullmar, D., Stenzel, M., Mentzel, H.J., & Reichenbach, J.R. 2013.

856

Atlas-guided cluster analysis of large tractography datasets. PLoS ONE,

857

8(12).

SC

855

Rosenberg, A., & Hirschberg, J. 2007. V-measure: A conditional entropy-

859

based external cluster evaluation measure. Pages 410–420 of: Proceedings

860

of the Joint Conference on Empirical Methods in Natural Language Pro-

861

cessing and Computational Natural Language (EMNLP-CoNLL’07), vol.

862

1.

864

Sammut, C., & Webb, G. I. (eds). 2017. Classification Algorithms. Boston, MA: Springer US. Pages 208–209.

D

863

M AN U

858

Setsompop, K., Kimmlingen, R., Eberlein, E., Witzel, T., Cohen-Adad, J.,

866

McNab, J.A., Keil, B., Tisdall, M.D., Hoecht, P., Dietz, P., Cauley, S.F.,

867

Tountcheva, V., Matschl, V., Lenz, V.H., Heberlein, K., Potthast, A.,

868

Thein, H., Van Horn, J., Toga, A., Schmitt, F., Lehne, D., Rosen, B.R.,

869

Wedeen, V., & Wald, L.L. 2013. Pushing the limits of in vivo diffusion

870

MRI for the Human Connectome Project. NeuroImage, 80, 220–233.

873

AC C

EP

TE

865

874

Siless, V., Medina, S., Varoquaux, G., & Thirion, B. 2013 (June). A Com-

871

872

Shi, J., & Malik, J. 2000. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8), 888– 905.

45

ACCEPTED MANUSCRIPT

parison of Metrics and Algorithms for Fiber Clustering. Pages 190–193 of:

876

2013 International Workshop on Pattern Recognition in Neuroimaging.

877

Siless, V., Chang, K., Fischl, B., & Yendiki, A. 2016. Hierarchical Clustering

878

of Tractography Streamlines Based on Anatomical Similarity. Pages 184–

879

191 of: Medical Image Computing and Computer-Assisted Intervention.

880

Springer International Publishing.

SC

RI PT

875

Siless, V., Davidow, J.Y., Nielsen, J., Fan, Q., Hedden, T., Hollinshead,

882

M., Bustamante, C.V., Drews, M.K., Van Dijk, K.R.A., Sheridan, M.A.,

883

Buckner, R. L., Fischl, B., Somerville, L., & Yendiki, A. 2017. Registration-

884

free analysis of diffusion MRI tractography data across subjects through

885

the human lifespan. In: Proc. Intl. Soc. Mag. Reson. Med., vol. 25.

M AN U

881

Smith, S.M., Jenkinson, M., Johansen-Berg, H., Rueckert, D., Nichols, T.E.,

887

Mackay, C.E., Watkins, K.E., Ciccarelli, O., Cader, M.Z., Matthews, P.M.,

888

& Behrens, T.E.J. 2006. Tract-based spatial statistics: Voxelwise analysis

889

of multi-subject diffusion data. NeuroImage, 31(4), 1487–1505.

TE

D

886

Sørensen, T.J. 1948. A method of establishing groups of equal amplitude in

891

plant sociology based on similarity of species and its application to analyses

892

of the vegetation on Danish commons. Biol. Skr., 5, 1–34.

894

895

896

AC C

893

EP

890

Tsai, A., Westin, C.F., Hero, A.O., & Willsky, A.S. 2007. Fiber tract clustering on manifolds with dual rooted-graphs. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

46

ACCEPTED MANUSCRIPT

Tunc, B., Parker, W.A., Ingalhalikar, M., & Verma, R. 2014. Automated

898

tract extraction via atlas based adaptive clustering. NeuroImage, 102(P2),

899

596–607.

RI PT

897

Van der Kouwe, A., Benner, T., Salat, D.H., & Fischl, B. 2008. Brain

901

morphometry with multiecho MPRAGE. NeuroImage, 40(2), 559–569.

902

Visser, E., Nijhuis, E.H.J., Buitelaar, J.K., & Zwiers, M.P. 2011. Partition-

903

based mass clustering of tractography streamlines. NeuroImage, 54(1),

904

303–312.

M AN U

SC

900

Wakana, S., Caprihan, A., Panzenboeck, M.M., Fallon, J.H., Perry, M., Gol-

906

lub, R.L., Hua, K., Zhang, J., Jiang, H., Dubey, P., Blitz, A., van Zijl,

907

P., & Mori, S. 2007. Reproducibility of quantitative tractography methods

908

applied to cerebral white matter. NeuroImage, 36(3), 630–644.

D

905

Wang, Q., Yap, P.T., Wu, G., & Shen, D. 2013. Application of neuroanatom-

910

ical features to tractography clustering. Human Brain Mapping, 34(9),

911

2089–2102.

EP

TE

909

Wang, X., Grimson, W.E.L, & Westin, C.F. 2011. Tractography segmenta-

913

tion using a hierarchical Dirichlet processes mixture model. NeuroImage,

914

915

916

AC C

912

54(1), 290–302.

Wassermann, D., Bloy, L., Kanterakis, E., Verma, R., & Deriche, R. 2010. Unsupervised white matter fiber clustering and tract probability map gen-

917

eration: Applications of a Gaussian process framework for white matter

918

fibers. NeuroImage, 51(1), 228–241.

47

ACCEPTED MANUSCRIPT

Wassermann, D., Makris, N., Rathi, Y., Shenton, M., Kikinis, R., Kubicki,

920

M., & Westin, C.F. 2016. The white matter query language: a novel

921

approach for describing human white matter anatomy. Brain Structure

922

and Function, 221(9), 4705–4721.

924

Wright, S.M., & Wald, L.L. 1997. Theory and application of array coils in MR spectroscopy. NMR in Biomedicine, 10(8), 394–410.

SC

923

RI PT

919

Wu, X., Xie, M., Zhou, J., Anderson, A.W., Gore, J.C., & Ding, Z. 2012.

926

Globally optimized fiber tracking and hierarchical clustering – a unified

927

framework. Magnetic resonance imaging, 30(4), 485–495.

M AN U

925

Xia, Y., Turken, U., Whitfield-Gabrieli, S.L., & Gabrieli, J.D. 2005.

929

Knowledge-Based Classification of Neuronal Fibers in Entire Brain. Berlin,

930

Heidelberg: Springer Berlin Heidelberg. Pages 205–212.

932

Yeh, F.C., Wedeen, V.J., & Tseng, W.Y.I. 2010. Generalized q-sampling

TE

931

D

928

imaging. IEEE Transactions on Medical Imaging, 29(9), 1626–1635. Yeh, F.C., Verstynen, T.D., Wang, Y., Fern´andez-Miranda, J.C., & Tseng,

934

W.Y.I. 2013. Deterministic diffusion fiber tracking improved by quantita-

935

tive anisotropy. PLoS ONE, 8(11).

937

938

AC C

936

EP

933

Yendiki, A., Panneck, P., Srinivasan, P., Stevens, A., Z¨ollei, L., Augustinack, J., Wang, R., Salat, D., Ehrlich, S., Behrens, T., Jbabdi, S., Gollub, R., & Fischl, B. 2011. Automated Probabilistic Reconstruction of White-

939

Matter Pathways in Health and Disease Using an Atlas of the Underlying

940

Anatomy. Frontiers in Neuroinformatics, 5, 23.

48

ACCEPTED MANUSCRIPT

Zhang, T., Chen, H., Guo, L., Li, K., Li, L., Zhang, S., Shen, D., Hu, X., &

942

Liu, T. 2014. Characterization of U-shape streamline fibers: Methods and

943

applications. Medical Image Analysis, 18(5), 795–807.

RI PT

941

Ziyan, U., Sabuncu, M.R., Grimson, W.E.L., & Westin, C.F. 2009. Consis-

945

tency clustering: A robust algorithm for group-wise registration, segmen-

946

tation and automatic atlas construction in diffusion MRI. Pages 279–290

947

of: International Journal of Computer Vision, vol. 85.

AC C

EP

TE

D

M AN U

SC

944

49

AC C

EP

TE

D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Figure 12: Mean closest-point distance between automatically segmented and manually labeled brain structures that are included in the FreeSurfer subcortical segmentation.

50

D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

(b)

(c)

(d)

(e)

(f)

(g)

(h)

(i)

AC C

EP

TE

(a)

Figure 13: Top: Tree representation of the hierarchical cuts performed on a subject’s

51

whole-brain tractography data using our anatomical similarity measure. Bottom: Clusters associated with the tree nodes labeled (a)-(i).

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

(b)

TE

D

(a)

(d)

EP

(c)

Figure S1: Dice coefficient with respect to manually labeled bundles (a), homogeneity (b), completeness (c), and anatomical similarity (d) of unsupervised clustering, using different

AC C

numbers of prototype streamlines to compute the similarity matrix for normalized cuts. Results are shown for clustering with the anatomical similarity measure and 50, 100, 250, 500, 100 or 3000 prototype streamlines (A-50, A-100, A-250, A-500, A-1000, A-3000). Results for clustering with the Euclidean distance similarity measure and 500 prototype streamlines (E-500) are also shown for reference.

52

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

(b)

TE

D

(a)

(d)

EP

(c)

Figure S2: Dice coefficient with respect to manually labeled bundles (a), homogeneity

AC C

(b), completeness (c), and anatomical similarity (d) of unsupervised clustering, using randomly selected sets of 500 prototype streamlines to compute the similarity matrix for normalized cuts. Results are shown for clustering with the anatomical similarity measure and 5 different sets of 500 prototype streamlines (A-500). Results for clustering with the Euclidean distance similarity measure and a single set 500 prototype streamlines (E-500) are also shown for reference.

53

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

(b)

EP

TE

D

(a)

(a)

(b)

AC C

Figure S3: Dice coefficient with respect to manually labeled bundles (a), homogeneity (b), completeness (c), and anatomical similarity (d) of unsupervised clustering, using different sizes of anatomical neighborhoods. Results are shown for clustering with the anatomical similarity measure and a 6-, 14-, or 26-element neighborhood. Results for clustering with the Euclidean distance similarity measure are also shown for reference.

54

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

(b)

EP

TE

D

(a)

(c)

(d)

Figure S4: Dice coefficient with respect to manually labeled bundles (a), homogeneity

AC C

(b), completeness (c), and anatomical similarity (d) of unsupervised clustering, using the anatomical similarity measure on streamlines downsampled to N = 10 vs. N = 50 points. Results for clustering with the Euclidean distance similarity measure for streamlines downsampled to N = 10 points are also shown for reference.

55

TE

D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Figure S5: Dice coefficient between the manually labeled WM bundles and the streamline

EP

clusters obtained with the anatomical similarity measure and two different sets of labels: the subjects’ individual FreeSurfer cortical parcellation and subcortical segmentation la-

AC C

bels (FS) or the labels from the JHU-ICBM WM atlas, mapped into individual space (JHU). The plot shows Dice coefficients averaged over all subjects by tract, when the total number of clusters is 200.

56

D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

TE

Figure S6: Dice coefficient between the manually labeled WM bundles and the streamline clusters obtained with our proposed anatomical similarity and a similarity measure based on shape descriptors. The plot shows Dice coefficients averaged over all subjects by tract,

AC C

EP

when the total number of clusters is 200.

57