Test-retest reliability of white matter structural brain networks: a multiband diffusion MRI study

Zhao, Tengda; Duan, Fei; Liao, Xuhong; Dai, Zhengjia; Cao, Miao; He, Yong; Shu, Ni

doi:10.3389/fnhum.2015.00059

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 17 February 2015

Sec. Brain Imaging and Stimulation

Volume 9 - 2015 | https://doi.org/10.3389/fnhum.2015.00059

Test-retest reliability of white matter structural brain networks: a multiband diffusion MRI study

Tengda Zhao^1,2

Fei Duan^1,2

Xuhong Liao³

Zhengjia Dai^1,2

Miao Cao^1,2

Yong He^1,2

Ni Shu^1,2^*

¹State Key Laboratory of Cognitive Neuroscience and Learning and IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
²Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China
³Center for Cognition and Brain Disorders, Hangzhou Normal University, Hangzhou, China

The multiband EPI sequence has been developed for the human connectome project to accelerate MRI data acquisition. However, no study has yet investigated the test-retest (TRT) reliability of the graph metrics of white matter (WM) structural brain networks constructed from this new sequence. Here, we employed a multiband diffusion MRI (dMRI) dataset with repeated scanning sessions and constructed both low- and high-resolution WM networks by volume- and surface-based parcellation methods. The reproducibility of network metrics and its dependence on type of construction procedures was assessed by the intra-class correlation coefficient (ICC). We observed conserved topological architecture of WM structural networks constructed from the multiband dMRI data as previous findings from conventional dMRI. For the global network properties, the first order metrics were more reliable than second order metrics. Between two parcellation methods, networks with volume-based parcellation showed better reliability than surface-based parcellation, especially for the global metrics. Between different resolutions, the high-resolution network exhibited higher TRT performance than the low-resolution in terms of the global metrics with a large effect size, whereas the low-resolution performs better in terms of local (region and connection) properties with a relatively low effect size. Moreover, we identified that the association and primary cortices showed higher reproducibility than the paralimbic/limbic regions. The important hub regions and rich-club connections are more reliable than the non-hub regions and connections. Finally, we found WM networks from the multiband dMRI showed higher reproducibility compared with those from the conventional dMRI. Together, our results demonstrated the fair to good reliability of the WM structural brain networks from the multiband EPI sequence, suggesting its potential utility for exploring individual differences and for clinical applications.

Introduction

The concept of the “human connectome” has been recently proposed and has provided a new perspective to investigate the brain's structural and functional systems (Sporns et al., 2005). As the anatomical substrate of brain function, the structural brain connectome describes brain wiring patterns and is fundamentally important for revealing the mechanisms of how the brain works. Recent studies have suggested that the human white matter (WM) structural network can be mapped in vivo using diffusion MRI (dMRI) tractography techniques and quantified by graph-theoretical analysis (Hagmann et al., 2007; Bullmore and Sporns, 2009; Gong et al., 2009a). The quantitative graph metrics of structural brain networks are suggested to be closely related to individual cognitive performances (Li et al., 2009; Wen et al., 2011) and sensitive to the processes of normal development (Hagmann et al., 2010) and aging (Gong et al., 2009b), as well as neuropsychiatric diseases (Lo et al., 2010; Shu et al., 2011; Zalesky et al., 2011; Bai et al., 2012; Cao et al., 2013), suggesting that network metrics may be potential biomarkers for clinical applications.

Recently, some promising fast-collecting imaging techniques, such as multiband EPI (mEPI), have been applied in the dMRI data acquisition (Moeller et al., 2010). This new sequence can accelerate acquisition by simultaneously imaging multiple slices in the human brain, while not significantly sacrificing spatial resolution or the SNR (Moeller et al., 2010; Xu et al., 2013). This sequence is being applied in the recently launched human connectome project aiming to acquire a large sample of healthy subjects with the goal of uncovering individual differences in brain circuitry related to behavior (van Essen et al., 2012). However, before successfully charting the human connectome using this new sequence, studies must determine whether connectivity properties conserved across the population can be reproducibly quantified in an individual over multiple scanning sessions and whether that reproducibility can be potentially influenced by methodological variations.

Previous network studies have suggested that many factors may influence the accuracy and reliability of the network metrics, such as various choices of the structural descriptions of the WM network elements and connections. Specifically, the nodes can be defined by the parcellation of the cortex into hundreds or thousands of regions using an atlas (Zalesky et al., 2010) or the landmarks of gyri and sulci (Hagmann et al., 2008). The connections can be reconstructed by dMRI deterministic or probabilistic tractography approaches (Gong et al., 2009a,b; Shu et al., 2011). Additionally, the network construction and analysis involve other procedures that may also introduce certain variances, such as node scales and weighting schemes. Until now, only a subset of studies has investigated the intra- and inter-variability and reliability of network metrics from dMRI data using a conventional EPI sequence (Vaessen et al., 2010; Zalesky et al., 2010; Bassett et al., 2011; Cheng et al., 2012; Buchanan et al., 2014; Duda et al., 2014); moderate to high reliability was indicated for the global network metrics, and different procedures have large effects on the intra- and inter-subject variability. However, for the mEPI sequence, whether multiband dMRI scans can effectively identify the conserved topological organization of the WM structural network in the brain and whether they can exhibit good test-retest (TRT) reliability remains largely unknown.

In the present study, we aim to investigate the TRT reliability of network metrics from fast collecting dMRI data with hundreds of gradient directions as acquired by a mEPI sequence. The multiband dMRI dataset consists of 11 healthy subjects who were each scanned twice with approximately 1 week apart. Based on different parcellation approaches, both low- and high-resolution WM structural networks were constructed to examine the reliability of network properties from global and local perspectives. The reproducibility of network properties and its dependence on types of procedures (cortical parcellation and nodal scales) were assessed by the intra-class correlation coefficient (ICC).

Materials and Methods

Test-Retest Datasets

The multiband test-retest pilot dataset was publicly available from INDI (http://fcon_1000.projects.nitrc.org/indi/pro/eNKI_RS_TRT/FrontPage.html). The dataset includes 24 subjects whose phenotype information is presented in Table 1. All individuals included in the sample underwent semi-structured diagnostic psychiatric interviews and completed a battery of psychiatric, cognitive and behavioral assessments. Written informed consents were obtained from all participants. The study was approved by the Nathan Kline Institute Institutional Review Board. Recently, the test-retest resting-state functional MRI (rs-fMRI) data in this dataset has been used to examine the reliability of regional functional homogeneity (Zuo et al., 2013) and the reliability of global hubs in human voxel-wise functional networks (Liao et al., 2013). To exclude the potential effects of health issues, the data of seven subjects with current/past psychiatric disorders and four subjects without diagnostic information were discarded. Moreover, one subject was excluded due to brain atrophy and one subject lacked one repeated session; therefore, data from 11 healthy subjects (3 females, mean age 32.9 ± 12.5 years) were left for further analyses (marked in Table 1).

TABLE 1

Table 1. Summary of phenotype information of subjects.

Data Acquisition

Each participant received test-retest dMRI scans (at least 1 week apart) using a Siemens Trio 3T scanner. The dMRI data were acquired using a recently developed mEPI sequence (Moeller et al., 2010; Xu et al., 2013): repetition time (TR) = 2400 ms, echo time (TE) = 85 ms, 64 slices, slice thickness of 2 mm, FOV = 212 × 180 mm², voxel size of 2 mm isotropic, b value = 1500 s/mm², 128 gradient directions with 9 b = 0 images, multiband acceleration factor = 4, averages = 1, total acquisition time = 5:58 min. A T1-weighted image was obtained with an magnetization prepared rapid gradient echo (MPRAGE) sequence [TR = 2500 ms, TE = 3.5 ms, inversion time (TI) = 1200 ms, acquisition matrix = 256 × 256, voxel size of 1 mm isotropic]. Additionally, the test-retest rs-fMRI data were also acquired, but were not used in the present study. For each dMRI scan, the data quality was checked by visual inspection to avoid the distortions caused by magnetic field inhomogeneities.

Data Preprocessing

The preprocessing of dMRI data consisted of the following steps: eddy current and motion artifact correction, estimation of the diffusion tensor, calculation of the fractional anisotropy (Smith et al.). The eddy current distortions and motion artifacts in the dMRI dataset were corrected by applying an affine alignment of each diffusion-weighted image to the b = 0 image. After that, the diffusion tensor elements were estimated by solving the Stejskal and Tanner equation; then, the reconstructed tensor matrix was diagonalized to obtain three eigenvalues (λ₁, λ₂, λ₃) and eigenvectors, and the corresponding FA of each voxel was calculated. All of the processes were performed with the FDT toolbox (Behrens et al., 2003) of FMRIB Software Library (FSL, http://www.fmrib.ox.ac.uk/fsl) (Smith et al., 2004).

Structural Segmentation and WM Tractography

First, the structural T1-weighted image was first segmented into gray matter (GM), WM and cerebrospinal fluid (CSF) in the CIVET pipeline (http://wiki.bic.mni.mcgill.ca/index.php/CIVET). Then the individual T1-weighted image was coregistered to the b = 0 image through a linear transformation which is applied to the segmented WM mask. Within each WM voxel, eight seeds were started and evenly distributed over the volume of the voxel. A streamline was started from each seed following the primary diffusion direction from voxel to voxel, thus reconstructing the WM fibers. The tractography was terminated if it turned at an angle greater than 45 degrees (Mori et al., 1999). Tens of thousands of streamlines were generated to etch out all of the major WM tracts. Diffusion tensor tractography was implemented with the Diffusion Toolkit (http://trackvis.org/) using the “fiber assignment by continuous tracking” method (Mori et al., 1999) and was visualized in the TrackVis program (http://trackvis.org/).

Network Node Definition

To investigate the effects of different parcellation schemes on the network topological architecture and reliability, we used the two most common cortical parcellation methods (surface- and volume-based parcellations) to define network nodes. Both parcellation methods were based on the volumetric Automated Anatomical Labeling (AAL) atlas (Tzourio-Mazoyer et al., 2002) in which 80 cortical areas were selected (Table 2).

1) Volume-based parcellation: the detailed procedure of the volume-based parcellation has been previously described (Gong et al., 2009a; Shu et al., 2011) and was performed using SPM software (http://www.fil.ion.ucl.ac.uk/spm/software/spm8). Briefly, the coregistered T1-weighted image was nonlinearly normalized to the nonlinear asymmetric ICBM152 T1 template (Fonov et al., 2009) in the Montreal Neurological Institute (MNI) space. The inverse transformations were used to warp the AAL atlas from the MNI space to the diffusion native space. Discrete labeling values were preserved with the nearest neighbor interpolation method.

2) Surface-based parcellation: The surface-based parcellation was performed using the CIVET pipeline (www.bic.mni.mcgill.ca/ServicesSoftware/CIVET). A detailed description of the analysis can be found in He et al. (2007). The T1-weighted image was registered into the stereotaxic space using a linear transformation (Collins et al., 1994) and was further segmented into GM, WM, CSF and background using an advanced neural net classifier (Zijdenbos et al., 2002). The internal surfaces of GM and the interface of WM and GM, each consisting of 40,962 vertices in the brain per hemisphere, were then automatically extracted using the Constrained Laplacian-based Automated Segmentation with Proximities (CLASP) algorithm (MacDonald et al., 2000; Kim et al., 2005). The labels of the cortex were assigned by a surface-based AAL atlas on the average 150 normal brains template (MacDonald et al., 2000).

TABLE 2

Table 2. Cortical region-of-interest defined in the study.

Using the above procedures, we obtained 80 cortical regions (40 for each hemisphere; Table 2) of each subject in diffusion native space through two parcellation methods, each representing a node of the network. In addition to the parcellation scheme using 80 nodes in AAL template (L-AAL), we also used a high-resolution (~1000 parcels) parcellation (H-1024) by randomly subdividing the AAL atlas into 1024 regions with equal size both in the volume and in the average cortical surface of 150 normal brains. Therefore, for surface and volume-based parcellations, both L-AAL and H-1024 WM networks with different nodal scales were constructed (Figure 1).

FIGURE 1

Figure 1. The flowchart of the construction of four WM networks under two parcellation methods and two resolutions. (1) The b = 0 image (A) and the individual T1-weighted image (B) were coregistered through a linear transformation. (2) The T1 images were then nonlinearly normalized to the ICBM152 T1 template (D) in the MNI space. (3) Each vertex on the average cortical surface of 150 normal brains was assigned with the value of the label in the volumetric AAL (F) to generate an atlas of surface parcellation (E). (4) The inverse transformations were used to warp the AAL atlas to the native diffusion space. (5) Both surface and volume atlases were subdivided into 1024 regions with equal size to define a high resolution nodal scale. (6) The reconstruction of all WM fibers in the brain was performed using deterministic tractography using the Diffusion Toolkit (C). (7) The weighted networks of each subject were created by computing the number of streamlines that connected each pair of brain regions. Both low- (L-AAL) and high-resolution (H-1024) WM networks based on different parcellation approaches (surface and volume) were constructed for each subject (H), which are represented by the abbreviations of SurL, SurH, VolL, and VolH, respectively.

Network Edge Definition

Based on whole-brain tractography and cortical parcellation, two regions were considered structurally connected if at least one fiber streamline with two end points were located in these two regions. For the weighted WM networks, we defined the fiber number (FN) of interconnecting streamlines between two regions as the weights of the network edges (Shu et al., 2011; Cheng et al., 2012; van den Heuvel et al., 2012). Therefore, both L-AAL and H-1024 FN-weighted WM networks from surface- and volume-based parcellations were constructed for each participant, respectively (Figure 1).

Network Analysis

To characterize the topological organization of WM structural networks, several graph measures were considered, as follows: network strength (S_p), global efficiency (E_glob), local efficiency (E_loc), shortest path length (L_p), clustering coefficient (Van Essen et al.) and small-world parameters (λ, γ, and σ) (Rubinov and Sporns, 2010). For regional characteristics, we considered the nodal strength and nodal efficiency (Achard and Bullmore, 2007). Moreover, we investigated the rich-club organization of WM networks (van den Heuvel and Sporns, 2011). For a recent review on the uses and interpretations of these network measures, refer to Rubinov and Sporns (2010). See Appendix for the detailed definitions and mathematical expressions of the graph metrics used in the present study. All network analyses were performed using in-house GRETNA software (http://www.nitrc.org/projects/gretna/) and visualized using BrainNet Viewer software (http://www.nitrc.org/projects/bnv/) (Xia et al., 2013).

TRT Reliability

To evaluate the TRT reliability of the network metrics between two sessions, a measurement of ICC was employed. The ICC value was calculated as (Shrout and Fleiss, 1979):

I C C = \frac{σ_{b s}^{2} - σ_{w s}^{2}}{σ_{b s}^{2} + (m - 1) σ_{w s}^{2}}

where σ_bs is the between-subject variance, σ_ws is the within subject variance, and m represents the number of repeated measurements (here, m = 2).

ICC is a normalized measure which has a maximum of 1. The ICC values were categorized into five common intervals (Landis and Koch, 1977): 0 < ICC ≤ 0.2 (slight), 0.2 < ICC ≤ 0.4 (fair), 0.4 < ICC ≤ 0.6 (moderate), 0.6 < ICC ≤ 0.8 (substantial), and 0.8 < ICC ≤ 1.0 (almost perfect). Negative ICCs, implying negative reliability (i.e., completely non-reliable), are theoretically difficult to interpret (Rousson et al., 2002) and reasons for negative ICC values are unclear (Muller and Buttner, 1994). Therefore, we set negative ICCs to zero, as suggested in other test-retest studies using the ICC (Kong et al., 2007; Braun et al., 2012).

Statistical Analysis

To test the differences of the reliability of network properties derived from different procedures of network construction and the reliability differences across regions and edges, the repeated ANOVA was performed with SPSS software (version 13.0; SPSS, Chicago, Ill). Moreover, the correlation of the network metrics between the two sessions was calculated by Pearson's correlation using an in house Matlab (The MathWorks, Inc.) program.

TRT Reliability from Conventional dMRI and Subsampled Multiband dMRI

To compare the reproducibility of network metrics between multiband dMRI and conventional dMRI, we further investigated the TRT reliability of WM networks constructed from a conventional dMRI dataset with 30 gradient directions (conv-dMRI-30grad), Moreover, to remove the possible effects of the number of gradient directions on the reliability and make results more comparable, we also investigated the TRT reliability of WM networks constructed from subsampled multiband dMRI data with 30 gradient directions (multi-dMRI-30grad).

1) Conventional dMRI dataset: Eleven right-handed subjects (3 females, mean age 28.0 ± 5.0 years) without history of neurological or psychiatric disorders were included. Each participant received test-retest dMRI scans (at least 1 week apart) using a Siemens Trio 3T scanner at the Imaging Center for Brain Research, Beijing Normal University. The dMRI images were acquired using a single-shot twice-refocused spin-echo conventional EPI sequence (TR = 8,000 ms, TE = 89 ms, FOV = 282 × 282 mm², voxel size of 2.2 mm isotropic, b value = 1000 s/mm², 30 gradient directions with one b = 0 images, average = 2, total acquisition time = 8:06 min). The T1-weighted images were acquired using a MPRAGE sequence (TR = 2530 ms, TE = 3.39 ms, TI = 1100 ms, matrix size = 256 × 256, voxel size = 1 × 1 × 1.33 mm³).

2) Subsampled multiband dMRI dataset: From the original multiband dMRI data with 128 gradient directions, we selected 30 diffusion-weighted images with uniformly distributed gradient directions and one b = 0 image to compose a subsampled multiband dMRI data for each participant.

Based on the conventional and subsampled multiband dMRI datasets, both the high- and low-resolution weighted WM networks with surface and volume based parcellations were constructed with the same procedures as performed for the original multiband dMRI dataset (multi-dMRI-128grad). Then the ICC values of the global network metrics from each dMRI dataset were calculated.

TRT Reliability of Binary WM Networks

To remove the possible effects of the weighting scheme on the inter-subject variability, both the high- and low-resolution WM networks with surface and volume based parcellations from the multiband dMRI were binarized and global metrics based on the unweighted networks were calculated. Then the ICC values of the global network metrics from two sessions were computed.

Results

First, we examined the architectural characteristics of weighted WM structural networks for the new multiband sequence. Then the TRT reliability of WM structural networks derived from the multiband dMRI data was investigated and reported in four levels: global metrics, regional metrics, structural connectivity and rich-club organization.

Conserved Topological Architecture

For the L-AAL network constructed from surface- and volume-based parcellations, the WM networks are sparse with a group mean sparsity of 17.5 and 20.1%, respectively. For the H-1024 WM network, the sparsities are about 1.4 and 1.9% for different parcellations. Low wiring cost of the structural connectivity network is observed, consistent with findings from conventional EPI sequence (Gong et al., 2009a; Bullmore and Sporns, 2012). Compared with random networks, the brain WM networks showed the similar shortest path length and higher clustering (Table 3), suggesting a prominent small-world architecture regardless of different strategies for network construction. Together, these results indicate that WM networks obtained from multiband dMRI data exhibit conserved topological architecture as those derived from conventional dMRI data (Table 3).

TABLE 3

Table 3. Global properties of WM network constructed from mEPI sequence.

TRT Reliability of Global Network Metrics

Figure 2A shows the TRT reliability of global network metrics under different procedure choices. Generally, most global network parameters exhibited moderate to high reliability (ICC > 0.52) regardless of the construction procedure. Only the lambda from L-AAL network with surface-based parcellation had a relatively low reproducibility (ICC = 0.22). Global network measures can be further classified into first and second order metrics where the first order metrics include strength, L_p, C_p, global and local efficiency, and the second order metrics include small-world parameters (λ, γ, and σ), which are normalized by the metrics of random networks (Bassett et al., 2011). Using a repeated ANOVA in which order was treated as a categorical factor and parcellation and resolution were treated as repeated measures, we found that the first order metrics, such as strength and efficiency, are more reliable than the second order metrics (p = 0.0009, Partial Eta Squared = 0.86) (Figure 2B).

FIGURE 2

Figure 2. The TRT reliability of global network properties. (A) The ICC values of global network metrics from low to high were presented with colorbars from blue to red. Multiple network metrics showed moderate to high reliability regardless of construction procedures. (B) Statistical analysis of the effects of network construction procedures on the reliability of first order and second order graph metrics. The bars and errorbars represent the mean values and standard errors, respectively, of the ICC values of first order and second order network metrics.

Given that particular choices of construction options (i.e., cortical parcellation and network resolution) can make significant differences in network topological parameters, we next evaluated which construction scheme performed the best at modeling the brain networks from the perspective of TRT reliability. A Two-Way repeated ANOVA in which parcellation and resolution were treated as repeated measures showed a significant main effect of parcellation (p = 0.002, Partial Eta Squared = 0.81), where post-hoc comparisons confirmed that the volume-based parcellation yielded more reproducible results than the surface-based parcellation (Figure 2B). Meanwhile, a significant main effect of resolution was found, which revealed an increasing reproducibility of global metrics at finer spatial resolutions (p = 0.002, Partial Eta Squared = 0.82) regardless of parcellations (Figure 2B). No significant interactions of parcellation × resolution were found (p > 0.1) (Figure 2B).

TRT Reliability of Regional Strength and Efficiency

Figure 3 shows the nodal strength (A) and efficiency (B) of all regions (averaged over subjects) from the surface- (top) and volume-based parcellations (bottom). Between two sessions, highly significant correlations of nodal properties across all nodes were observed (all r > 0.94). Moreover, highly similar distributions of hub regions (nodal strength > mean + std) were observed between the two sessions, regardless of the network construction procedures (Figure 3). For the L-AAL network, the hub regions were mainly located in the bilateral middle temporal gyri, superior and middle frontal gyri, precuneus, precentral gyrus, postcentral gyrus and supplementary motor area for both parcellations. While for the H-1024 network from surface-based parcellation, the hub regions were distributed in the bilateral temporal gyri, superior and middle frontal gyri, precuneus, anterior and median cingulate and paracingulate gyri, precental and postcentral gyrus, fusiform gyrus and insula. For the volume-based parcellation, more regions in the bilateral temporal gyri, superior and middle occipital gyrus and fewer regions in the superior and middle frontal gyri were identified as hubs compared with the network from the surface-based parcellation (Figure 3).

FIGURE 3

Figure 3. The correlation of nodal properties between sessions. (A) Similar spatial patterns of nodal strength across regions and high correlation of nodal strength between two sessions are demonstrated. On the 3D surface, the nodes with strengths from low to high are represented with colors from blue to red. In the plot, the blue dots represent nodal strength and are linearly fitted with a red line between sessions. (B) Similar spatial patterns of nodal efficiency across regions and high correlation of nodal efficiency between two sessions are demonstrated. On the 3D surface, the nodes with efficiency from low to high are represented with colors from blue to red. In the plot, the blue dots represent nodal efficiency and are linearly fitted with a red line between sessions.

Figure 4 shows the TRT reliability of nodal strength (A) and efficiency (B) under different construction procedures. Across parcellations, most of regions of the L-AAL network exhibited moderate to high reproducibility (surface: nodal strength ICC = 0.70; nodal efficiency ICC = 0.70; volume: nodal strength ICC = 0.75; nodal efficiency ICC = 0.75) except the right posterior cingulate cortex, left insula, right superior parietal gyrus and paracentral lobule. For the H-1024 network, the ICC values across most regions also ranged from moderate to high (surface: nodal strength ICC = 0.56; nodal efficiency ICC = 0.58; volume: nodal strength ICC = 0.62; nodal efficiency ICC = 0.72). When categorizing the cortical regions into three regional classes (primary, association and paralimbic) (Mesulam, 1998) (Figure 5A), a repeated ANOVA was performed in which nodal metric was treated as repeated measures while regional class, parcellation and resolution were treated as categorical factors. An interaction between regional class and network resolution (p < 0.0001, Partial Eta Squared = 0.02) and a significant main effect of regional class (p < 0.0001, Partial Eta Squared = 0.37) in the L-AAL network were observed (Figure 5B). Further post-hoc comparisons showed that the association and primary cortices exhibit a higher reliability than the paralimbic/limbic regions (p < 0.0001) for only the L-AAL network (Figure 5B). Additionally, the relationship between the nodal properties and their corresponding ICC values was investigated. The correlation results indicated that under both low- and high-resolutions, regions with higher nodal strength or efficiency tend to have larger ICC values (all p < 0.001) (Figure 4). In other words, the properties of densely connected hub regions show higher reproducibility than those of peripheral non-hub regions.

FIGURE 4

Figure 4. The TRT reliability of regional network properties. (A) 3D representations of spatial distribution of ICC values of nodal strength across regions. The plots show the correlation between nodal strength and ICC values, with blue dots representing the nodes and the red line representing the linear fit. (B) 3D representations of the spatial distribution of ICC values of nodal efficiency across regions. The plots show the correlation between nodal efficiency and ICC values, with blue dots representing the node and the red line representing the linear fit. Notably, the nodal properties across all nodes were resampled into a Gaussian distribution.

FIGURE 5

Figure 5. The TRT reliability of nodal properties across different regional classes. (A) The regions are shown in red, blue and green on a 3D surface, indicating the association, primary and paralimbic/limbic cortices. (B) Statistical analysis of the nodal reliability between regional classes in the WM network. The bars and errorbars represent the mean values and standard errors, respectively, of the ICC values of all regions in each regional class. The ICCs of nodal strength and efficiency from surface- and volume-based networks were represented by Sur_nStr, Sur_nEff, Vol_nStr, and Vol_nEff, respectively.

When focusing on the effects of cortical parcellation and network resolution on the reproducibility of nodal strength and efficiency, a repeated ANOVA was performed in which nodal metric was treated as repeated measure, parcellation and resolution were treated as categorical factors while the effect of regional class was averaged. The L-AAL network showed higher nodal ICCs than the H-1024 network (p < 0.0001, Partial Eta Squared = 0.02) (Figure 6). And the volume-based parcellation yielded higher nodal ICCs than the surface-based parcellation (p = 0.0003, Partial Eta Squared = 0.01) (Figure 6). An interaction between nodal metric and network resolution (p = 0.001, Partial Eta Squared = 0.01) was observed and nodal efficiency showed significantly higher ICCs than the nodal strength (p < 0.0001, Partial Eta Squared = 0.06) in the H-1024 network (Figure 6). Overall, the L-AAL network with volume-based parcellation exhibited the highest reproducibility in terms of nodal properties.

FIGURE 6

Figure 6. The effects of different parcellation and resolution on the reliability of nodal strength and efficiency. The bars and errorbars represent the mean values and standard errors, respectively, of the ICC values of all nodal properties from different construction procedures. The ICCs of nodal strength and efficiency from surface- and volume-based networks were represented by Sur_nStr, Sur_nEff, Vol_nStr, and Vol_nEff, respectively.

TRT Reliability of Structural Connectivity

Figure 7A shows the average matrices of WM connections across subjects for each session. Between two sessions, highly significant correlations of edge weights across all edges were observed, especially for the L-AAL network (all r > 0.9) (Figure 7B). To assess the intra-session reliability of the WM connectivity, we first detected significantly consistent connections across subjects, by performing a nonparametric one-tailed sign test. For each pair of brain regions, the sign test was performed with the null hypothesis that no connection exists [“fiber bundle number = 0” (p < 0.05)]. Nonzero connections within either session groups were detected and assigned the average edge weight (number of interconnecting streamlines between two regions) across subjects and sessions to combine as a backbone network. Figure 8A shows the reliability of edge weights of the backbone network under different construction procedures. The histogram distributions of edge ICCs are shown in Figure 8B. At least 52% of the edges of WM networks under all construction methods exhibited moderate to high ICCs. The average ICC values across all backbone connections were greater than 0.4 (SurL: mean ICC = 0.51; SurH: mean ICC = 0.42; VolL: mean ICC = 0.51; VolH: mean ICC = 0.44). A Two-Way ANOVA in which parcellation and resolution were treated as categorical factors revealed that surface- and volume-based parcellations have similar edge ICCs (p = 0.6), but the L-AAL network showed higher edge ICCs than the H-1024 network (p < 0.0001, Partial Eta Squared = 0.02). Additionally, we found that the ICC values are positively correlated with the edge weights (connection strength) (Figure 8C), suggesting that the stronger connections tend to be more reproducible than the weak ones.

FIGURE 7

Figure 7. The correlation of structural connection matrices between sessions. (A) For each session, the backbone of the WM network under different construction procedures was shown in a matrix. (B) Between the two sessions, high correlations of connection strength across all edges were shown in the plots (all p < 10⁻¹⁰). The blue dots represent the edge weights and are linearly fitted with a red line.

FIGURE 8

Figure 8. The TRT reliability of structural connections. (A) Spatial distribution of the edge ICCs of WM networks constructed from different procedures. (B) Normalized histograms of edge ICCs from 0 to 1, with an interval of 0.1. (C) The correlation between connection strength (edge weight) and ICC values is shown in the plot. The blue dots represent the edge weights and are linearly fitted with a red line. Notably, the connection strengths across all edges were resampled into a Gaussian distribution.

TRT Reliability of Rich-Club Organization

To quantify the reliability of the rich-club organization, we calculated the normalized rich-club coefficient (RC) of the backbone network according to van den Heuvel and Sporns (2011) under a range of thresholds. The normalized RC values were greater than 1 under each network construction procedure (Table 4), suggesting a characteristic rich-club organization. Furthermore, the nodes of the backbone network were classified into hubs (nodal strength > mean + std) and non-hubs. Correspondingly, edges were classified onto rich-club connections, which link hub nodes to hub nodes; feeder connections, which link hub nodes to non-hub nodes; and local connections, which link between non-hub nodes (Figure 9A). The reliability of the different hub categories of regions and edges were investigated using a Three-Way ANOVA in which parcellaion, resolution and hub category were treated as categorical factors. ANOVA analyses indicated that the reliability of hub regions was higher than that of non-hub regions (p < 0.0001, Partial Eta Squared = 0.01) regardless of the construction procedure (Figure 9B), consistent with the above finding that regions with higher nodal strength tend to have greater ICC values. For the connections, a significant effect of the edge category was observed (p < 0.0001, Partial Eta Squared = 0.01), and post-hoc comparisons confirmed that the reliability of rich-club connections is significantly higher than that of feeder (p = 0.0001) and local connections (p < 0.0001), and the reliability of feeder connections is significantly higher than that of local connections (p < 0.0001) (Figure 9C).

TABLE 4

Table 4. RC and normalized RC of the WM backbone networks under a range of thresholds.

FIGURE 9

Figure 9. The TRT reliability of rich-club organization. (A) The classification of hub/non-hub nodes and rich-club/feeder/local connections of WM networks constructed from different procedures. (B) The reproducibility of nodal strength of hub regions is significantly higher than the nodal strength of non-hub regions regardless of construction procedures. The bars and errorbars represent the mean values and standard errors, respectively, of the ICC values of the nodal strength of the hub and non-hub regions. (C) Statistical analysis of the reliability difference of edge weight among rich-club, feeder and local connections of WM networks constructed from different procedures. The bars and errorbars represent the mean values and standard errors, respectively, of the ICC values of the connection strengths of rich-club, feeder and local connections.

TRT Reliability from Conventional dMRI and Subsampled Multiband dMRI

Figure 10 shows the TRT reliability of global network metrics from the conventional dMRI and subsampled multiband dMRI datasets. A significantly progressive increase of ICC values in the global network metrics from the conventional dMRI, the subsampled multiband dMRI to the original multiband dMRI was identified by a repeated ANOVA (p < 0.0001, Partial Eta Squared = 0.54). The conventional dMRI dataset showed an overall decrease of reproducibility in all network metrics regardless of the construction procedures, except for the low-resolution network with volume-based parcellation. The subsampled multiband dMRI data also exhibited significantly decreased reliability than the original multiband dMRI, especially in the small-world parameters from the volume-based low-resolution networks.

FIGURE 10

Figure 10. The TRT reliability from conventional dMRI and subsampled multiband dMRI dataset. The ICC values of global network metrics from low to high were presented with colorbars from blue to red. The results showed a progressive increase of ICC values in the global network metrics from the conventional dMRI (conv-dMRI-30grad), the subsampled multiband dMRI (multi-dMRI-30grad) to the original multiband dMRI (multi-dMRI-128grad).

TRT Reliability of Graph Metrics of Binary WM Networks

Figure 11 shows the TRT reliability of global network metrics for both binarized and weighted WM networks from the multiband dMRI dataset. Lower ICC values of the global network metrics were found for the binary networks compared with the weighted WM networks by a paired two-sample t-test (p = 0.002, Partial Eta Squared = 0.27).

FIGURE 11

Figure 11. The TRT reliability of global network metrics of binary and weighted structural networks. The ICC values of global network metrics from low to high were presented with colorbars from blue to red. Most of global metrics of binary networks showed lower ICC values compared with those of the weighted networks.

Discussion

In the present study, we investigated the reliability of weighted WM structural networks constructed from multiband dMRI data with two repeated scanning sessions. Our primary results can be summarized as follows: First, conserved topological architecture of WM structural networks constructed from the mEPI sequence was observed, such as low wring cost, small-worldness and highly connected hub regions. Second, most of the weighted WM network metrics exhibited a high TRT reliability, especially the first order metrics are more reliable than the second order metrics (a partial eta squared value around 0.8), suggesting the potential utility in clinical applications of the new sequence. Third, different procedures of network construction have an effect on the network reliability. For example, networks with volume-based parcellation and high spatial resolution are more reliable than those with surface-based parcellation and low resolution, respectively. Moreover, WM networks from the multiband dMRI showed higher reproducibility compared with those from the conventional dMRI. Additionally, the network reliability varies across regions and edges, although with relatively low effect sizes (partial eta squared values less than 0.1). These findings provide reference and guidance for the future network studies using this new sequence.

Generally, the ICC values obtained in our study are comparable with the findings of previous WM network studies (Vaessen et al., 2010; Bassett et al., 2011; Cheng et al., 2012; Buchanan et al., 2014). Compared with the conventional dMRI, the multiband dMRI data showed higher reliability of global metrics of WM networks, and with a large effect size (Partial Eta Squared = 0.54). For the mEPI sequence, the high reproducibility of network metrics may be attributed to the relatively short scan time that can minimize the effects of head motion and can increase the reliability of fiber orientation estimation from the dMRI data with hundreds of gradient directions. However, the differences in the subjects and acquisition parameters (e.g., different slice thickness of T1 images) between the conventional and multiband dMRI datasets may have an effect on the comparison of the TRT reliability. Future comparisons with the same cohort and same acquisition parameters should be warranted.

The comparisons of parcellation methods and network resolutions offer certain insights into network reliability. First, in all cases, networks with volume-based parcellation showed better TRT reliability than the surface-based parcellation, in terms of both the global (Partial Eta Squared = 0.81) and local ICCs (Partial Eta Squared = 0.01). These results may be due to more WM seed voxels in volume-based parcellation. More WM seed points produce more robust tractography results, which can be supported by the findings of improved TRT reliability of structural networks seeding from WM rather than GM (Buchanan et al., 2014). However, investigation of other parcellation approaches merits further investigation; notably, approaches based on the individual landmarks of gyri and sulci without a template (Hagmann et al., 2008) may reduce the bias caused by registration errors. Second, the high resolution network exhibited an overall higher TRT performance than the low resolution network in terms of global network metrics with a large effect size (Partial Eta Squared = 0.82), whereas the low resolution performs better in terms of local (region and edge) properties with a relatively low effect size (Partial Eta Squared = 0.02). Consistent with our findings, Bassett et al. (2011) also found an increasing reproducibility of global metrics in all atlases at finer spatial resolutions. For the local properties, ROIs in low-resolution networks with bigger size are more possible to be connected by larger fiber tracts, avoiding the contamination from different structures, whereas smaller ROIs in high-resolution network are more easily impaired by the false positive streamlines with a lower SNR ratio but a more homogeneous fiber distribution (Parker et al., 2003). Therefore, specific methodological choice will affect the applicability of network topology-related approaches.

Moreover, the weighting scheme also has an effect on the network reliability. We found the binary WM networks showed poorer reliability than the weighted networks. The increased reliability of weighted networks may be partly due to the increased inter-subject variability introduced by the weighting scheme, which contains both real connectome differences and other biases, such as the effects of brain size on the fiber tractography. Binary network can partly overcome such problem by avoiding the variability in fiber numbers, which also has its own drawbacks, such as how to threshold the network (Buchanan et al., 2014; Duda et al., 2014). Detailed investigation of the effects of different weighting schemes on the reproducibility of graph metrics for the multiband sequence is needed in the future.

On a more methodological note, we found significant differences in reliability between graph metrics. For global metrics, the first order graph metrics (such as shortest path length and efficiency) were more reliable than second order metrics (such as small-world parameters), with a large effect size (Partial Eta Squared = 0.86). This result is consistent with the findings from MEG data (Deuker et al., 2009), but in contrast with results obtained from rs-fMRI data (Braun et al., 2012). The worse reliability of second order metrics may be caused by the normalization of the metrics of random networks, which may also indicate an increased sensitivity to measurements such as short term changes in the WM structure (Tang et al., 2010). For nodal metrics, the nodal efficiency is more reliable than the nodal strength, especially for the high-resolution WM networks, with a relatively low effect size (Partial Eta Squared = 0.06). However, a previous rs-fMRI study (Wang et al., 2011) showed that the nodal degree showed higher reliability than other nodal metrics in the binary functional networks. These results suggest that the reliability of the same graph metrics can be influenced by the imaging modalities, strategy of nodal or edge definitions and network construction procedures. In future studies, selecting specific metrics with high reliability for specific modality and methodological choice should have high priority.

The reproducibility varied across regions and exhibited spatially heterogeneous distribution. We found that most of the regions (>75%) showed moderate to high reproducibility under all construction methods, except several regions located in the left olfactory cortex, left insula, left middle temporal gyrus, right gyrus rectus, right orbital frontal gyrus, right posterior cingulate cortex, right superior parietal gyrus and paracentral lobule. Some of those regions were also identified as showing poor estimated ICC values in a recent test-retest study of the dMRI network obtained from conventional EPI sequence (Buchanan et al., 2014). Bassett et al. (2011) also found certain less reproducible regions in the inferior temporal and occipital cortices. These similar results revealed that certain regions with inherent instability are driven by anatomy or technique limitations, such as magnetic susceptibility (Vargas et al., 2009). Moreover, we found that the more densely connected regions tend to have higher reliability, due to less influence by the bias from noise or limitations of tractography algorithms. In future studies with mEPI, results regarding these regions especially which showed low reliability in our study should be interpreted with caution.

According to the functional roles in information processing (Mesulam, 1998), the brain regions can be categorized into three classes, including association, primary and paralimbic/limbic regions. For the low-resolution WM network, the ICCs of association and primary regions were significantly higher than the paralimbic/limbic regions (Partial Eta Squared = 0.37) and 72% of regions that show low ICC values were located in the paralimbic/limbic cortices. This result may be induced by the smaller ROI size in the paralimbic/limbic regions in the AAL template (surface: association = 2.9 × 10³ mm³, primary = 2.7 × 10³ mm³, paralimbic/limbic = 1.4 × 10³ mm³; volume: association = 1.9 × 10⁴ mm³, primary = 1.8 × 10⁴ mm³, paralimbic/limbic = 8.9 × 10³ mm³). As mentioned above, the smaller ROI size can be easily biased by the image noise, partial volume effects and registration errors. Another possible reason is the high anatomical variability of paralimbic/limbic tracts, such as the uncinate fasciculus and cingulum bundles (Burgel et al., 2006).

For the structural connectivity, the reliability varies across edges. There are several sources that contribute to the variation of the edge weights (number of streamlines). Image noise, spatial resolution, dMRI gradient encoding, and partial volume effects may affect the quality of fiber quantification. The tractography algorithm (Bastiani et al., 2012), including the number of random seeds in fiber tracking, can also have a slight effect on the variance of the network. Specifically, fewer random seeds will lead to a larger variance in the number of fibers from fiber tracking, although the effect in this study was diminished by choosing eight seeds per voxel in fiber tracking. In addition, the reliability of network construction also relies on the accuracy of parcellation and the mapping during image registration. The parcellation can have errors due to SNR limitations of the T1-weighted image or the algorithm itself. The registration between the T1-weighted image and the dMRI image can also have errors due to image distortion and partial volume effects. All of these factors affect the TRT reliability of the structural connectivity and networks.

Importantly, we investigated the reliability of the rich-club organization of WM networks. First, we found hubs regions and rich-club connections were more reliable than non-hub ones with a low effect size (Partial Eta Squared = 0.01). This is consistent with the findings of positive correlations between ICCs with nodal strength and edge weights. As the hub regions are more densely interconnected than the other brain regions and have a large influence on overall network organization, hubs are essential in supporting the performance of high cognitive functions of the human brain by integrating specialized brain regions into coordinated networks (van den Heuvel and Sporns, 2013). Buckner et al. (2009) demonstrated that the topography of human brain cortical hubs is highly similar across populations and robust against task states, therefore reflecting a stable property of brain functional architecture. Previous studies consistently revealed similar and stable hub distributions of WM networks across subjects from different samples (Hagmann et al., 2008; Gong et al., 2009a; Zalesky et al., 2010; Bassett et al., 2011; van den Heuvel and Sporns, 2013). This result is also in parallel with the findings from functional MRI data (Wang et al., 2011; Liao et al., 2013), which indicate that the reliable regions qualitatively tend to serve as hubs in intrinsic functional brain networks. The high reliability of hub regions and rich-club connections indicated that rich-club organization is a stable metric with commendable potential utility in clinical applications.

There are some methodological issues need to be addressed. First, we included only 11 subjects in the present study, large samples with more subjects in practical studies is necessary to obtain sufficient statistical power. Second, investigation of the effects of different acquisition parameters, gradient sampling schemes and advanced diffusion modeling approaches, such as application of higher order models to disentangle crossing fiber structures (Tournier et al., 2008), on the reproducibility of network metrics for this new sequence would be interesting, but was unfortunately outside the scope of this paper. Finally, when considering the influence of potential variations in WM structure, it is important to consider the tradeoff between the reliability and sensitivity of network metrics. In future studies, several measures (e.g., the coefficient of variation) can be further developed to comprehensively characterize the sensitivity of network metrics over scanning sessions (Lachin, 2004; Bassett et al., 2011).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank Dr. Michael Milham for kindly providing the dataset. This study is supported by the 973 program (2013CB837300, NS), the Natural Science Foundation of China (Grant Nos. 81471732, NS, 11205041, XHL, 31221003 and 81030028, YH), the Beijing New Medical Discipline Based Group (Grant No. 100270569, NS), the Fundamental Research Funds for the Central Universities (Grant No. 2013YB28, NS), and the National Science Fund for Distinguished Young Scholars of China (Grant No. 81225012, YH).

References

Achard, S., and Bullmore, E. (2007). Efficiency and cost of economical brain functional networks. PLoS Comput. Biol. 3:e17. doi: 10.1371/journal.pcbi.0030017

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bai, F., Shu, N., Yuan, Y., Shi, Y., Yu, H., Wu, D., et al. (2012). Topologically convergent and divergent structural connectivity patterns between patients with remitted geriatric depression and amnestic mild cognitive impairment. J. Neurosci. 32, 4307–4318. doi: 10.1523/JNEUROSCI.5061-11.2012

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bassett, D. S., Brown, J. A., Deshpande, V., Carlson, J. M., and Grafton, S. T. (2011). Conserved and variable architecture of human white matter connectivity. Neuroimage 54, 1262–1279. doi: 10.1016/j.neuroimage.2010.09.006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bastiani, M., Shah, N. J., Goebel, R., and Roebroeck, A. (2012). Human cortical connectome reconstruction from diffusion weighted MRI: the effect of tractography algorithm. Neuroimage 62, 1732–1749. doi: 10.1016/j.neuroimage.2012.06.002

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Behrens, T. E., Woolrich, M. W., Jenkinson, M., Johansen-Berg, H., Nunes, R. G., Clare, S., et al. (2003). Characterization and propagation of uncertainty in diffusion-weighted MR imaging. Magn. Reson. Med. 50, 1077–1088. doi: 10.1002/mrm.10609

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Braun, U., Plichta, M. M., Esslinger, C., Sauer, C., Haddad, L., Grimm, O., et al. (2012). Test-retest reliability of resting-state connectivity network characteristics using fMRI and graph theoretical measures. Neuroimage 59, 1404–1412. doi: 10.1016/j.neuroimage.2011.08.044

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Buchanan, C. R., Pernet, C. R., Gorgolewski, K. J., Storkey, A. J., and Bastin, M. E. (2014). Test-retest reliability of structural brain networks from diffusion MRI. Neuroimage 86, 231–243. doi: 10.1016/j.neuroimage.2013.09.054

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Buckner, R. L., Sepulcre, J., Talukdar, T., Krienen, F. M., Liu, H., Hedden, T., et al. (2009). Cortical hubs revealed by intrinsic functional connectivity: mapping, assessment of stability, and relation to Alzheimer's disease. J. Neurosci. 29, 1860–1873. doi: 10.1523/JNEUROSCI.5062-08.2009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bullmore, E., and Sporns, O. (2009). Complex brain networks: graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 10, 186–198. doi: 10.1038/nrn2575

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bullmore, E., and Sporns, O. (2012). The economy of brain network organization. Nat. Rev. Neurosci. 13, 336–349. doi: 10.1038/nrn3214

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Burgel, U., Amunts, K., Hoemke, L., Mohlberg, H., Gilsbach, J. M., and Zilles, K. (2006). White matter fiber tracts of the human brain: three-dimensional mapping at microscopic resolution, topography and intersubject variability. Neuroimage 29, 1092–1105. doi: 10.1016/j.neuroimage.2005.08.040

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cao, Q., Shu, N., An, L., Wang, P., Sun, L., Xia, M. R., et al. (2013). Probabilistic diffusion tractography and graph theory analysis reveal abnormal white matter structural connectivity networks in drug-naive boys with attention deficit/hyperactivity disorder. J. Neurosci. 33, 10676–10687. doi: 10.1523/JNEUROSCI.4793-12.2013

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cheng, H., Wang, Y., Sheng, J., Kronenberger, W. G., Mathews, V. P., Hummer, T. A., et al. (2012). Characteristics and variability of structural networks derived from diffusion tensor imaging. Neuroimage 61, 1153–1164. doi: 10.1016/j.neuroimage.2012.03.036

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Colizza, V., Flammini, A., Serrano, M., and Vespignani, A. (2006). Detecting rich-club ordering in complex networks. Nat. Phys. 2, 110–115. doi: 10.1038/nphys209

CrossRef Full Text | Google Scholar

Collin, G., Sporns, O., Mandl, R. C., and van den Heuvel, M. P. (2014). Structural and functional aspects relating to cost and benefit of rich club organization in the human cerebral cortex. Cereb. Cortex 24, 2258–2267. doi: 10.1093/cercor/bht064

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Collins, D. L., Neelin, P., Peters, T. M., and Evans, A. C. (1994). Automatic 3D intersubject registration of MR volumetric data in standardized Talairach space. J. Comput. Assist. Tomogr. 18, 192–205. doi: 10.1097/00004728-199403000-00005

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Deuker, L., Bullmore, E. T., Smith, M., Christensen, S., Nathan, P. J., Rockstroh, B., et al. (2009). Reproducibility of graph metrics of human brain functional networks. Neuroimage 47, 1460–1468. doi: 10.1016/j.neuroimage.2009.05.035

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Duda, J. T., Cook, P. A., and Gee, J. C. (2014). Reproducibility of graph metrics of human brain structural networks. Front. Neuroinform. 8:46. doi: 10.3389/fninf.2014.00046

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Fonov, V., Evans, A., McKinstry, R., Almli, C., and Collins, D. (2009). Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. Neuroimage 47, S102. doi: 10.1016/S1053-8119(09)70884-5

CrossRef Full Text | Google Scholar

Gong, G., He, Y., Concha, L., Lebel, C., Gross, D. W., Evans, A. C., et al. (2009a). Mapping anatomical connectivity patterns of human cerebral cortex using in vivo diffusion tensor imaging tractography. Cereb. Cortex 19, 524–536. doi: 10.1093/cercor/bhn102

CrossRef Full Text | Google Scholar

Gong, G., Rosa-Neto, P., Carbonell, F., Chen, Z. J., He, Y., and Evans, A. C. (2009b). Age- and gender-related differences in the cortical anatomical network. J. Neurosci. 29, 15684–15693. doi: 10.1523/JNEUROSCI.2308-09.2009

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hagmann, P., Cammoun, L., Gigandet, X., Meuli, R., Honey, C. J., Wedeen, V. J., et al. (2008). Mapping the structural core of human cerebral cortex. PLoS Biol. 6:e159. doi: 10.1371/journal.pbio.0060159

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hagmann, P., Kurant, M., Gigandet, X., Thiran, P., Wedeen, V. J., Meuli, R., et al. (2007). Mapping human whole-brain structural networks with diffusion MRI. PLoS ONE 2:e597. doi: 10.1371/journal.pone.0000597

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hagmann, P., Sporns, O., Madan, N., Cammoun, L., Pienaar, R., Wedeen, V. J., et al. (2010). White matter maturation reshapes structural connectivity in the late developing human brain. Proc. Natl. Acad. Sci. U.S.A. 107, 19067–19072. doi: 10.1073/pnas.1009073107

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

He, Y., Chen, Z. J., and Evans, A. C. (2007). Small-world anatomical networks in the human brain revealed by cortical thickness from MRI. Cereb. Cortex 17, 2407–2419. doi: 10.1093/cercor/bhl149

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Humphries, M. D., and Gurney, K. (2008). Network ‘small-world-ness’: a quantitative method for determining canonical network equivalence. PLoS ONE 3:e0002051. doi: 10.1371/journal.pone.0002051

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Kim, J. S., Singh, V., Lee, J. K., Lerch, J., Ad-Dab'bagh, Y., MacDonald, D., et al. (2005). Automated 3-D extraction and evaluation of the inner and outer cortical surfaces using a Laplacian map and partial volume effect classification. Neuroimage 27, 210–221. doi: 10.1016/j.neuroimage.2005.03.036

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Kong, J., Gollub, R. L., Webb, J. M., Kong, J. T., Vangel, M. G., and Kwong, K. (2007). Test-retest study of fMRI signal change evoked by electroacupuncture stimulation. Neuroimage 34, 1171–1181. doi: 10.1016/j.neuroimage.2006.10.019

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lachin, J. M. (2004). The role of measurement reliability in clinical trials. Clin. Trials 1, 553–566. doi: 10.1191/1740774504cn057oa

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Landis, J. R., and Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics 33, 159–174. doi: 10.2307/2529310

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Latora, V., and Marchiori, M. (2001). Efficient behavior of small-world networks. Phys. Rev. Lett. 87:198701. doi: 10.1103/PhysRevLett.87.198701

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Li, Y., Liu, Y., Li, J., Qin, W., Li, K., Yu, C., et al. (2009). Brain anatomical network and intelligence. PLoS Comput. Biol. 5:e1000395. doi: 10.1371/journal.pcbi.1000395

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Liao, X. H., Xia, M. R., Xu, T., Dai, Z. J., Cao, X. Y., Niu, H. J., et al. (2013). Functional brain hubs and their test-retest reliability: a multiband resting-state functional MRI study. Neuroimage 83, 969–982. doi: 10.1016/j.neuroimage.2013.07.058

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lo, C. Y., Wang, P. N., Chou, K. H., Wang, J., He, Y., and Lin, C. P. (2010). Diffusion tensor tractography reveals abnormal topological organization in structural cortical networks in Alzheimer's disease. J. Neurosci. 30, 16876–16885. doi: 10.1523/JNEUROSCI.4136-10.2010

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

MacDonald, D., Kabani, N., Avis, D., and Evans, A. C. (2000). Automated 3-D extraction of inner and outer surfaces of cerebral cortex from MRI. Neuroimage 12, 340–356. doi: 10.1006/nimg.1999.0534

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Maslov, S., and Sneppen, K. (2002). Specificity and stability in topology of protein networks. Science 296, 910–913. doi: 10.1126/science.1065103

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

McAuley, J., Da Fontoura Costa, L., and Caetano, T. (2007). Rich-club phenomena across complex network hierachies. Appl. Phys. Lett. 91:084103. doi: 10.1063/1.2773951

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mesulam, M. M. (1998). From sensation to cognition. Brain 121(Pt 6), 1013–1052.

Pubmed Abstract | Pubmed Full Text | Google Scholar

Moeller, S., Yacoub, E., Olman, C. A., Auerbach, E., Strupp, J., Harel, N., et al. (2010). Multiband multislice GE-EPI at 7 tesla, with 16-fold acceleration using partial parallel imaging with application to high spatial and temporal whole-brain fMRI. Magn. Reson. Med. 63, 1144–1153. doi: 10.1002/mrm.22361

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mori, S., Crain, B. J., Chacko, V. P., and van Zijl, P. C. (1999). Three-dimensional tracking of axonal projections in the brain by magnetic resonance imaging. Ann. Neurol. 45, 265–269.

Pubmed Abstract | Pubmed Full Text | Google Scholar

Muller, R., and Buttner, P. (1994). A critical discussion of intraclass correlation coefficients. Stat. Med. 13, 2465–2476. doi: 10.1002/sim.4780132310

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Opsahl, T., Colizza, V., Panzarasa, P., and Ramasco, J. J. (2008). Prominence and control: the weighted rich-club effect. Phys. Rev. Lett. 101:168702. doi: 10.1103/PhysRevLett.101.168702

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Parker, G. J., Haroon, H. A., and Wheeler-Kingshott, C. A. (2003). A framework for a streamline-based probabilistic index of connectivity (PICo) using a structural interpretation of MRI diffusion measurements. J. Magn. Reson. Imaging 18, 242–254. doi: 10.1002/jmri.10350

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Rousson, V., Gasser, T., and Seifert, B. (2002). Assessing intrarater, interrater and test-retest reliability of continuous measurements. Stat. Med. 21, 3431–3446. doi: 10.1002/sim.1253

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Rubinov, M., and Sporns, O. (2010). Complex network measures of brain connectivity: uses and interpretations. Neuroimage 52, 1059–1069. doi: 10.1016/j.neuroimage.2009.10.003

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Shrout, P. E., and Fleiss, J. L. (1979). Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. 86, 420–428. doi: 10.1037/0033-2909.86.2.420

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Shu, N., Liu, Y., Li, K., Duan, Y., Wang, J., Yu, C., et al. (2011). Diffusion tensor tractography reveals disrupted topological efficiency in white matter structural networks in multiple sclerosis. Cereb. Cortex 21, 2565–2577. doi: 10.1093/cercor/bhr039

CrossRef Full Text | Google Scholar

Smith, S. M., Jenkinson, M., Woolrich, M. W., Beckmann, C. F., Behrens, T. E., Johansen-Berg, H., et al. (2004). Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23(Suppl. 1), S208–S219. doi: 10.1016/j.neuroimage.2004.07.051

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Sporns, O., Tononi, G., and Kotter, R. (2005). The human connectome: a structural description of the human brain. PLoS Comput. Biol. 1:e42. doi: 10.1371/journal.pcbi.0010042

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Tang, Y. Y., Lu, Q., Geng, X., Stein, E. A., Yang, Y., and Posner, M. I. (2010). Short-term meditation induces white matter changes in the anterior cingulate. Proc. Natl. Acad. Sci. U.S.A. 107, 15649–15652. doi: 10.1073/pnas.1011043107

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Tournier, J. D., Yeh, C. H., Calamante, F., Cho, K. H., Connelly, A., and Lin, C. P. (2008). Resolving crossing fibres using constrained spherical deconvolution: validation using diffusion-weighted imaging phantom data. Neuroimage 42, 617–625. doi: 10.1016/j.neuroimage.2008.05.002

CrossRef Full Text | Google Scholar

Tzourio-Mazoyer, N., Landeau, B., Papathanassiou, D., Crivello, F., Etard, O., Delcroix, N., et al. (2002). Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage 15, 273–289. doi: 10.1006/nimg.2001.0978

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Vaessen, M. J., Hofman, P. A., Tijssen, H. N., Aldenkamp, A. P., Jansen, J. F., and Backes, W. H. (2010). The effect and reproducibility of different clinical DTI gradient sets on small world brain connectivity measures. Neuroimage 51, 1106–1116. doi: 10.1016/j.neuroimage.2010.03.011

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

van den Heuvel, M. P., Kahn, R. S., Goni, J., and Sporns, O. (2012). High-cost, high-capacity backbone for global brain communication. Proc. Natl. Acad. Sci. U.S.A. 109, 11372–11377. doi: 10.1073/pnas.1203593109

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

van den Heuvel, M. P., and Sporns, O. (2011). Rich-club organization of the human connectome. J. Neurosci. 31, 15775–15786. doi: 10.1523/JNEUROSCI.3539-11.2011

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

van den Heuvel, M. P., and Sporns, O. (2013). Network hubs in the human brain. Trends Cogn. Sci. 17, 683–696. doi: 10.1016/j.tics.2013.09.012

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

van Essen, D. C., Ugurbil, K., Auerbach, E., Barch, D., Behrens, T. E., Bucholz, R., et al. (2012). The Human Connectome Project: a data acquisition perspective. Neuroimage 62, 2222–2231. doi: 10.1016/j.neuroimage.2012.02.018

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Vargas, M. I., Delavelle, J., Kohler, R., Becker, C. D., and Lovblad, K. (2009). Brain and spine MRI artifacts at 3Tesla. J. Neuroradiol. 36, 74–81. doi: 10.1016/j.neurad.2008.08.001

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wang, J. H., Zuo, X. N., Gohel, S., Milham, M. P., Biswal, B. B., and He, Y. (2011). Graph theoretical analysis of functional brain networks: test-retest evaluation on short- and long-term resting-state functional MRI data. PLoS ONE 6:e21976. doi: 10.1371/journal.pone.0021976

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Watts, D. J., and Strogatz, S. H. (1998). Collective dynamics of 'small-world' networks. Nature 393, 440–442. doi: 10.1038/30918

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wen, W., Zhu, W., He, Y., Kochan, N. A., Reppermund, S., Slavin, M. J., et al. (2011). Discrete neuroanatomical networks are associated with specific cognitive abilities in old age. J. Neurosci. 31, 1204–1212. doi: 10.1523/JNEUROSCI.4085-10.2011

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Xia, M., Wang, J., and He, Y. (2013). BrainNet Viewer: a network visualization tool for human brain connectomics. PLoS ONE 8:e68910. doi: 10.1371/journal.pone.0068910

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Xu, J., Moeller, S., Auerbach, E. J., Strupp, J., Smith, S. M., Feinberg, D. A., et al. (2013). Evaluation of slice accelerations using multiband echo planar imaging at 3 T. Neuroimage 83, 991–1001. doi: 10.1016/j.neuroimage.2013.07.055

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Zalesky, A., Fornito, A., Harding, I. H., Cocchi, L., Yucel, M., Pantelis, C., et al. (2010). Whole-brain anatomical networks: does the choice of nodes matter? Neuroimage 50, 970–983. doi: 10.1016/j.neuroimage.2009.12.027

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Zalesky, A., Fornito, A., Seal, M. L., Cocchi, L., Westin, C. F., Bullmore, E. T., et al. (2011). Disrupted axonal fiber connectivity in schizophrenia. Biol. Psychiatry 69, 80–89. doi: 10.1016/j.biopsych.2010.08.022

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Zijdenbos, A. P., Forghani, R., and Evans, A. C. (2002). Automatic “pipeline” analysis of 3-D MRI data for clinical trials: application to multiple sclerosis. IEEE Trans. Med. Imaging 21, 1280–1291. doi: 10.1109/TMI.2002.806283

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Zuo, X. N., Xu, T., Jiang, L., Yang, Z., Cao, X. Y., He, Y., et al. (2013). Toward reliable characterization of functional homogeneity in the human brain: preprocessing, scan duration, imaging resolution and computational space. Neuroimage 65, 374–386. doi: 10.1016/j.neuroimage.2012.10.017

CrossRef Full Text | Google Scholar

Appendix

Network Strength

For a network (graph) G with N nodes and K edges, we calculated the strength of G as follows:

S_{p} (G) = \frac{1}{N} \sum_{i \in G} S (i)

where S(i) is the strength of a node, which is the sum of the edge weights w_ij (fiber number) linking to node i. The strength of a network is the average of the strength across all of the nodes in the network.

Small-World Properties

Small-world network parameters (clustering coefficient, C_p, and shortest path length, L_p) were originally proposed by Watts and Strogatz (1998). In this study, we investigated the small-world properties of the weighted brain networks. The clustering coefficient of a node i, C(i), which was defined as the likelihood of whether the neighborhoods were connected with each other, was computed as follows:

C (i) = \frac{2}{k_{i} (k_{i} - 1)} \sum_{j, k} {({\bar{w}}_{i j} {\bar{w}}_{j k} {\bar{w}}_{k i})}^{1 / 3}

where k_i is the degree of node i and w is the weight, which is scaled by the mean of all weights to control each participant's cost at the same level. The clustering coefficient is zero (C(i) = 0) if the nodes are isolated or have just one connection, i.e., k_i = 0 or k_i = 1. The clustering coefficient, C_p, of a network is the average of the clustering coefficient over all nodes and indicates the extent of the local interconnectivity or cliquishness in a network (Watts and Strogatz, 1998).

The path length between any pair of nodes (e.g., node i and node j) is defined as the sum of the edge lengths along this path. For weighted networks, the length of each edge was assigned by computing the reciprocal of the edge weight, 1/w_ij. The shortest path length, L_ij, is defined as the shortest length among the lengths of all possible paths between node i and node j. The shortest path length of a network was computed as follows:

L_{p} (G) = \frac{1}{N (N - 1)} \sum_{i \neq j \in G} L_{i j}

where N is the number of nodes in the network. The L_p of a network quantifies the ability for information to propagate in parallel.

To examine the small-world properties, the clustering coefficient, C_p, and the shortest path length, L_p, of the brain networks were compared with those of random networks. In this study, we generated 100 matched random networks that had the same number of nodes and edges and the same degree distribution as real networks (Maslov and Sneppen, 2002). Notably, we retained the weight of each edge during the randomization procedure such that the weight distribution of the network was preserved. Furthermore, we computed the normalized L_p, λ = L^real_p/L^rand_p_p, and the normalized C_p, γ = C^real_p/C^rand_p, where L^rand_p and C^rand_p are the mean C_p and the mean L_p of 100 matched random networks, respectively. Importantly, the two parameters correct the differences in the edge number and degree distribution of the networks across individuals. A real network would be considered small-world if γ > 1 and λ ≈ 1 (Watts and Strogatz, 1998). Thus, a small-world network not only has a higher local interconnectivity but also has a shortest path length approximately equivalent to random networks. These two measurements can be summarized into a simple quantitative metric, small-worldness, σ = γ/λ, which is typically greater than 1 for small-world networks (Humphries and Gurney, 2008).

Network Efficiency

The global efficiency of G measures the global efficiency of the parallel information transfer in the network (Latora and Marchiori, 2001), which can be computed as follows:

E_{g l o b} (G) = \frac{1}{N (N - 1)} \sum_{i \neq j \in G} \frac{1}{L_{i j}}

where L_ij is the shortest path length between node i and node j in G.

The local efficiency of G reveals how much the network is fault tolerant and shows how efficient the communication is among the first neighbors of the node i when it is removed. The local efficiency of a graph is defined as follows:

E_{l o c} (G) = \frac{1}{N} \sum_{i \in G} E_{g l o b} (G_{i})

where G_i denotes the subgraph composed of the nearest neighbors of node i.

Regional Characteristics

To determine the nodal (regional) characteristics of the WM networks, we computed the nodal strength and efficiency. The nodal strength S(i) is defined as the sum of all of the edge weights between this node and all of the other nodes in the network. The nodal efficiency, E_nodal(i) is defined as (Achard and Bullmore, 2007):

E_{n o d a l} (i) = \frac{1}{N - 1} \sum_{i \neq j \in G} \frac{1}{L_{i j}}

where L_ij is the shortest path length between node i and node j in G. E_nodal(i) measures the average shortest path length between a given node i and all of the other nodes in the network.

Rich-Club Organization

A “rich-club” in networks is defined as the phenomenon that the high-degree nodes of a network tend to be more densely connected among themselves than is expected by chance (Colizza et al., 2006; McAuley et al., 2007). The brain's rich-club has been described previously (van den Heuvel and Sporns, 2011; van den Heuvel et al., 2012; Collin et al., 2014). For the weighted networks, the rich-club coefficient (RC) ϕ^w(k) (Opsahl et al., 2008) is given by the following equation:

ϕ^{w} (k) = \frac{W_{> k}}{\sum_{l = 1}^{E_{> k}} w_{l}^{r a n k e d}}

where E_>k denotes the subset of the edges between the hub nodes with a strength > k, W_>k denotes the total sum weights of this subset, and W_ranked denotes the ranked collection of weights in the network, with weights W representing the number of fiber streamlines of the edges. ϕ(k) was normalized relative to the ϕ_random(k) of a set of comparable random networks (n = 1000) of equal size and degree sequence, providing a normalized RC (Colizza et al., 2006; McAuley et al., 2007):

ϕ_{n o r m} (k) = ϕ (k) / ϕ_{r a n d o m} (k)

Here, the threshold k is defined as the mean plus one standard deviation (mean + std) of nodal strength across regions.

Keywords: brain connectome, diffusion tensor imaging, graph theory, multiband EPI, reproducibility, tractography, white matter

Citation: Zhao T, Duan F, Liao X, Dai Z, Cao M, He Y and Shu N (2015) Test-retest reliability of white matter structural brain networks: a multiband diffusion MRI study. Front. Hum. Neurosci. 9:59. doi: 10.3389/fnhum.2015.00059

Received: 02 October 2014; Accepted: 21 January 2015;
Published online: 17 February 2015.

Edited by:

John J. Foxe, Albert Einstein College of Medicine, USA

Reviewed by:

Christian Stephan-Otto, Fundació Sant Joan de Déu, Spain
Philip A. Cook, University of Pennsylvania, USA

Copyright © 2015 Zhao, Duan, Liao, Dai, Cao, He and Shu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ni Shu, State Key Laboratory of Cognitive Neuroscience and Learning and IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, China e-mail:bnNodUBibnUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.