A Baseline for the Multivariate Comparison of Resting-State Networks

As the size of functional and structural MRI datasets expands, it becomes increasingly important to establish a baseline from which diagnostic relevance may be determined, a processing strategy that efficiently prepares data for analysis, and a statistical approach that identifies important effects in a manner that is both robust and reproducible. In this paper, we introduce a multivariate analytic approach that optimizes sensitivity and reduces unnecessary testing. We demonstrate the utility of this mega-analytic approach by identifying the effects of age and gender on the resting-state networks (RSNs) of 603 healthy adolescents and adults (mean age: 23.4 years, range: 12–71 years). Data were collected on the same scanner, preprocessed using an automated analysis pipeline based in SPM, and studied using group independent component analysis. RSNs were identified and evaluated in terms of three primary outcome measures: time course spectral power, spatial map intensity, and functional network connectivity. Results revealed robust effects of age on all three outcome measures, largely indicating decreases in network coherence and connectivity with increasing age. Gender effects were of smaller magnitude but suggested stronger intra-network connectivity in females and more inter-network connectivity in males, particularly with regard to sensorimotor networks. These findings, along with the analysis approach and statistical framework described here, provide a useful baseline for future investigations of brain networks in health and disease.

moderate to high test-retest reliability (Franco et al., 2009;Shehzad et al., 2009;Zuo et al., 2010), suggesting a robust examination of the intrinsic functional architecture, or "connectome," of the human brain .
Because functional connectivity between regions is believed to characterize large-scale system integrity (Van Dijk et al., 2010), there is great interest in understanding the variability of these networks in normal development and clinical contexts. Studies of the default-mode network (DMN), a set of brain regions preferentially active when subjects are not focused on the external environment (Raichle et al., 2001;Buckner et al., 2008), have established that this network not only shows a high degree of heritability (Glahn et al., 2010), but also shows alterations in a number of different neurological disorders (see Greicius, 2008 andBroyd et al., 2009 for recent reviews). For example, in autism, functional connectivity between DMN regions is substantially reduced, though coactivation within the dorsal attention network, a set of brain regions implicated in directing attention during cognitively demanding tasks, appears relatively unaffected (Kennedy and Courchesne, 2008). The parallel between altered connectivity (specific to regions associated with internal, self-referential processes) and symptoms that characterize autism suggests that straightforward investigations into functional connectivity can elucidate the etiology of complex disorders. Similar success has been found with regard to schizophrenia, where impaired modulation of the DMN has been observed in schizophrenia patients as well as their first-degree relatives, identifying an endophenotype based on large-scale connectivity (Whitfield-Gabrieli et al., 2009;Abbott et al., 2010). Furthermore, increased connectivity between particular DMN regions is associated with the severity of positive symptoms, suggesting a correspondence between specific "hyperconnectivity" and psychosis (Garrity et al., 2007;Whitfield-Gabrieli et al., 2009). The spectral properties of network activation in schizophrenia have also been explored, revealing a signature of reduced low frequency power and increased high frequency power in the DMN as well as many other RSNs (Garrity et al., 2007;Calhoun et al., 2008bCalhoun et al., , 2009. While multiple aspects of intrinsic functional connectivity show potential for clinical applications, the utility of network evaluation as a reliable diagnostic tool depends on the ability to interpret aberrant findings in the presence of an appropriate baseline. Fundamental factors, such as age and gender, are expected to exert large influences on functional connectivity based on their strong associations with underlying anatomy. For instance, most cortical regions show rapid gray matter loss as the brain matures through adolescence, followed by more gradual reductions in adulthood and advanced aging (Good et al., 2001;Sowell et al., 2003;Tamnes et al., 2010), though this trend is heterogeneous across structures and particularly variable in subcortical regions (Østby et al., 2009). White matter shows a different developmental trajectory, with volume and tract integrity peaking in adulthood (approximately 25-35 years of age) then declining slowly with age (Sowell et al., 2003;Sullivan and Pfefferbaum, 2006;Tamnes et al., 2010). Structural differences are also observed between genders; effects are smaller and some findings lack consistency, however studies concur that females show modest increases in gray matter volume localized to frontal, temporal, and parietal cortices and basal ganglia (BG) structures (Good et al., 2001;Luders et al., 2005Luders et al., , 2009Sowell et al., 2007).
As anticipated, recent investigations have identified effects of age and gender on functional connectivity. With regard to age, reports suggest network maturation in childhood (Szaflarski et al., 2006;Karunanayaka et al., 2007;Fair et al., 2008), progressive decreases in network mutual influences throughout adolescences into adulthood (Stevens et al., 2009), followed by decreases in functional connectivity and coherence in middle and late adulthood (Andrews-Hanna et al., 2007;Damoiseaux et al., 2008;Esposito et al., 2008;Koch et al., 2009;Biswal et al., 2010). Gender-related differences have received less attention but there appears to be some consensus of slightly greater connectivity in females localized to the precuneus and posterior cingulate cortex (Bluhm et al., 2008;Biswal et al., 2010). While these studies establish the influence of age and gender on functional connectivity, most unfortunately limit their investigations to the DMN, creating a dearth of reported effects with regard to other regions. In part, this bias reflects the unique function of the DMN related to internal mental processes and the desire to explicitly explore the "cognitive baseline" (Raichle et al., 2001). However, the relatively narrow scope of prior studies may also reflect the difficulty and somewhat overwhelming nature of investigations of full brain connectivity (Bullmore and Sporns, 2009). As the dimensions of data increase, so do the challenges associated with each analysis step, extending from data collection and processing to interpretation and visualization Costafreda, 2010).
Given the need for a more comprehensive understanding of functional connectivity and the methodological challenges associated with such a pursuit, the current study has two primary goals. First, we aim to present a statistical framework optimized for the analysis of large datasets that can be easily applied to investigations in other areas. We advocate a hierarchical approach where multivariate models are used first to identify important covariates, reducing the number of subsequent univariate tests and decreasing the risk of spurious findings. For multivariate analyses, we exploit the autoregressive structure common to many types of data and recommend appropriate dimension reduction of response variables to enhance the sensitivity and specificity of model estimation.
Our second goal is to apply this statistical framework in a detailed and careful investigation of the effects of age and gender on largescale resting-state functional connectivity throughout the brain. To this end, we focus our analysis on data from a large number of healthy subjects (M = 603) collected on a single instrument, and employ group independent components analysis (GICA) to identify a set of robust and reliable RSNs (Calhoun et al., 2001). We examine the effects of age and gender on three ICA-derived outcome variables describing distinct but complementary facets of functional connectivity. These include (1) the power spectra of RSN time course (TCs), related to level of coherent activity within a network; (2) the intensities of RSN spatial map (SMs), related to the connectivity and degree of coactivation within a network; and (3) the functional network connectivity (FNC; Jafri et al., 2008), related to the connectivity between networks. Furthermore, we consider these outcome measures as a function of local gray matter concentration (GMC) to determine the extent to which functional changes reflect those observed in the structural domain (Damoiseaux et al., 2008;Glahn et al., 2010). 13 and 30 years old). We therefore use a normalizing transformation, log(age), to reduce the leverage of older subjects in regression analyses (Figure 2A).

data acquIsItIon
All images were collected on a 3-Tesla Siemens Trio scanner with a 12-channel radio frequency coil. High resolution T1-weighted structural images were acquired with a five-echo MPRAGE sequence with TE = 1. 64, 3.5, 5.36, 7.22, 9.08 ms, TR = 2.53 s, TI = 1.2 s, flip angle = 7°, number of excitations = 1, slice thickness = 1 mm, field of view = 256 mm, resolution = 256 × 256. T2 * -weighted functional images were acquired using a gradient-echo EPI sequence with TE = 29 ms, TR = 2 s, flip angle = 75°, slice thickness = 3.5 mm, slice gap = 1.05 mm, field of view 240 mm, matrix size = 64 × 64, voxel size = 3.75 mm × 3.75 mm × 4.55 mm. Resting-state scans were a minimum of 5 min, 4 s in duration (152 volumes). Any additional volumes were discarded to match data quantity across participants. Subjects were instructed to keep their eyes open during the scan and stare passively at a foveally presented fixation cross, as this is suggested to facilitate network delineation compared to eyes-closed conditions (Van Dijk et al., 2010).

data PreProcessIng
Functional and structural MRI data were preprocessed using an automated preprocessing pipeline and neuroinformatics system (Figure 1, step 1) developed at MRN (Bockholt et al., 2009) and based around SPM5 1 . Following the completion of a scan, data are automatically archived and copied to an analysis directory where preprocessing is performed. In the functional data pipeline, the first four volumes are discarded to remove T1 equilibration effects, images are realigned using INRIalign, and slice-timing correction is applied using the middle slice as the reference frame. Data are then spatially normalized into the standard Montreal Neurological Institute (MNI) space (Friston et al., 1995), resliced to 3 mm × 3 mm × 3 mm voxels, and smoothed using a Gaussian kernel with a full-width at half-maximum (FWHM) of 10 mm. To ensure quality and consistency of spatial normalization across subjects we calculate the spatial correlation between each subjects normalized data and the EPI template, as well as the degree of intersection between the EPI mask (determined by retaining voxels greater than the mean of the distribution) and the subject mask (determined by the same criteria). For the analysis presented here, these two metrics flagged datasets from 35 subjects, three of which were uncorrectable due to incomplete brain coverage and one that was unusable due to large signal dropout. The remaining 31 scans were corrected by manually reorienting the original images (shift in the yaw direction), then were re-preprocessed through the pipeline. Subsequent to automated preprocessing, the data were intensitynormalized to improve the accuracy and test-retest reliability of independent components analysis (ICA) output . Intensity normalization divides the time series of each voxel by its average intensity, converting data to percent signal change units.
For the structural data pipeline, tissue classification, bias correction, image registration, and spatial normalization were automatically performed using voxel-based morphometry (VBM) in Using the described statistical approach, we identify numerous effects of age and gender on different aspects of functional connectivity throughout cortical and subcortical structures. Our results corroborate previous observations and provide novel findings that motivate future in-depth investigations.

PartIcIPants
This analysis combines existing data from 603 subjects scanned on the same scanner and spread across 34 studies and 18 principal investigators at the Mind Research Network (MRN). Informed consent was obtained from all subjects according to institutional guidelines at the University of New Mexico (UNM) and all data were anonymized prior to group analysis. None of the participants were taking psychoactive medications at the time of the scan or had a history of neurological or psychiatric disorders. Subjects were excluded from analysis if their functional scans showed extreme motion (maximum translation >6 mm, roughly two voxels) or showed poor spatial normalization to the EPI template (see below). Subjects were also excluded if they maintained high levels of substance use (smoking an average of 11 or more cigarettes per day; drinking 2.5 or more drinks per day). Table 1 provides characteristics of the participants under investigation. The sample is nearly balanced on gender (305 females), and the age distributions for genders are very similar. Because the sample is overwhelmingly right-handed (46 ambidextrous or lefthanded individuals) and preliminary tests showed no handedness effects, we do not consider handedness from this point forward. Similarly, because participants in the white racial category are overrepresented, and some studies did not collect racial information, we do not consider race from this point forward. Age is right skewed with only seven people older than 50 and the majority of individuals in adolescence or young adulthood (80% of subjects between 1 http://www.fil.ion.ucl.ac.uk/spm/software/spm5 yielded similar findings to those found with GMC, and the choice of GMC or GMV would have little impact on our general results or interpretation.

grouP IndePendent coMPonent analysIs
Data were decomposed into functional networks using spatial ICA. Spatial ICA applied to fMRI data identifies temporally coherent networks by estimating maximally independent spatial sources, referred to as SMs, from their linearly mixed fMRI signals, referred to as TCs (Figure 1, step 2). We use ICA, rather than seed-based approaches, to identify networks as this multivariate data-driven method eliminates the somewhat arbitrary choice of seed regions and simultaneously takes into account the relationships between all voxels (as opposed to simple pairwise correlations). Furthermore, compared to seed-based methods, ICA may provide increased sensitivity to detect subtle differences between subjects (Koch et al., 2009). A detailed description of the GICA implemented in this study is provided in Appendix A. Here, we describe the choices we made to prioritize detailed and reliable ICA outcome measures, as well as a processing strategy to accommodate our large dataset and resource-intensive analysis.
Group independent components analysis was performed using the GIFT toolbox 2 . We chose relatively high model order ICA (number of components, C = 75) as previous studies have demonstrated that such models yield refined components that correspond to known anatomical and functional segmentations (Kiviniemi et al., 2009;Smith et al., 2009;Abou-Elseoud et al., 2010;Ystad et al., 2010). Subject-specific data reduction principal components analysis (PCA) retained T 1 = 100 principal components (PCs) using a standard economy-size decomposition. The relatively large number of subject-specific PCs has been shown to stabilize subsequent back-reconstruction . Group data reduction retained C = 75 PCs using the expectation-maximization (EM) algorithm, included in GIFT, to avoid otherwise prohibitive memory requirements (Roweis, 1998). The Infomax ICA algorithm (Bell and Sejnowski, 1995) was repeated 20 times in Icasso 3 and resulting components were clustered to estimate the reliability of the decomposition (Himberg et al., 2004). The quality of component clusters was quantified using the index I q , which ranges from 0 to 1 and reflects the difference between intra-cluster and extra-cluster similarity (Himberg et al., 2004). Aggregate SMs were estimated as the centrotypes of component clusters to reduce sensitivity to initial algorithm parameters. Subject-specific SMs and TCs were estimated using the recently developed GICA3 back-reconstruction method based on PCA compression and projection (Calhoun et al., 2001(Calhoun et al., , 2002bErhardt et al., 2010). There are desirable properties in GICA3 not available in other methods, including that the aggregate SM is the sum of the subject-specific SMs, analogous to a random effects model where the subject-specific effects are zero-mean distributed deviations from the group mean effect. Compared with dual regression, a least-squares alternative to back-reconstruction , evidence suggests that noise-free PCA with noise-free ICA using GICA3 provides more robust results with a more intuitive and natural interpretation . SPM5, wherein the above steps are integrated into a unified model (Ashburner and Friston, 2005). Unmodulated gray matter images, estimating local GMC were then smoothed using a Gaussian kernel with a FWHM of 10 mm and resliced to 3 mm × 3 mm × 3 mm to match the functional image dimensions. Our choice of unmodulated rather than modulated images is based on a previous study showing that modulated images confer greater inter-subject variability and thus reduced sensitivity to detect differences between groups (see Figure 2 of Meda et al., 2008). For the analyses presented here, modulated data, i.e., gray matter volume (GMV), 2 http://icatb.sourceforge.net/groupica.htm 3 http://www.cis.hut.fi/projects/ica/icasso FIGuRe 1 | Schematic of the analysis pipeline. Boxes on the left indicate general steps potentially applicable to a variety of data and analysis types; boxes on the right indicate particular choices made for the data and analysis presented here. See Section 2 for details and abbreviations.

Frontiers in Systems Neuroscience
www.frontiersin.org on the response matrices as a whole (Figure 1, step 4). Analogous to a standard ANOVA F-test with subsequent pairwise univariate contrasts, our strategy performs backward selection by testing whether each predictor in our model explains variability in the multivariate response using a multivariate analysis of covariance (MANCOVA), and for the reduced model of significant predictors proceeds to perform univariate tests corrected for multiple comparisons.

Design matrix
The design matrix, D, includes gender (coded as one for female and zero for male) and age as covariates of interest, as well as a gender by age interaction. In addition, we include three nuisance predictors related to motion and spatial normalization. Although spatial ICA successfully identifies motion-related sources which are removed from analysis (McKeown et al., 2003;Kochiyama et al., 2005), residual motion-related variance may also be present in RSNs. Thus we incorporate motion covariates, defined as the average scanto-scan rotation and translation from INRIalign motion estimates, to improve our evaluation of age and gender effects. Similarly, we include a measure of spatial normalization accuracy, defined as the Spearman correlation between the warped T2 * -weighted image and the EPI template. We apply normalizing transformations to all continuous variables to improve symmetry and reduce disproportionate influence (leverage) of outlying values on the model fit. Age, rotation, and translation are log-transformed, while spatial correlation is Fisher-transformed. The distributions of these variables and effects of transformation are displayed in Figure 2.
In general, covariates of interest were not strongly associated with nuisance predictors. No significant correlations were observed between (transformed) age and translation (r = −0.02; p > 0.69), rotation (r = −0.04; p > 0.27), or spatial normalization (r = 0.01; p > 0.80). Genders did not show a difference in translation (t 601 = 1.25; p > 0.20) or rotation (t 601 = 1.22; p > 0.20), but showed a slight difference in spatial normalization (t 601 = 2.53; p < 0.01), with females having higher correlations to the EPI template than males. Accurate estimation of model coefficients is unlikely to be affected by the relatively low correlation between these columns in the design matrix (r = 0.10).
In follow-up analyses, we also considered GMC (averaged over each component region) as a predictor variable. However, because GMC and age were highly negatively correlated over all regions (r ≈ −0.7, Figure A2A), we used a modified approach to disambiguate the predictive power of these covariates (see Appendix C) and did not include GMC in our standard design matrix.

Response variables
For each of i = 1, …, M subjects, we have c = 1, …, C 1 power spectra (P ic ), c = 1, …, C 1 SMs (S ic ), and a single vector of FNC correlations (K i ). Each of these response variables is modeled separately. Prior to modeling, response variables are transformed and dimensionreduced, as simulations suggest that these steps optimize model selection (see Appendix D and Figure A3). Spectra are elementwise log-transformed, which is useful because it normalizes the highly skewed power distribution and facilitates dimension estimation with minimum description length (MDL) in the next step.

RSN selection
We identified a subset of C 1 components considered to be RSNs (as opposed to physiological artifacts) by inspecting the aggregate SMs and average power spectra (see below; Figure 1, step 3). Four viewers rated the components from 0 (definite artifact) to 1 (certain RSN) based on expectations that RSNs should exhibit peak activations in gray matter, low spatial overlap with known vascular, ventricular, motion, and susceptibility artifacts, and TCs dominated by low frequency fluctuations (Cordes et al., 2000). To facilitate evaluation, spectra were characterized with two metrics used previously to classify components (Robinson et al., 2009): dynamic range, the difference between the peak power and minimum power at frequencies to the right of the peak, and low frequency to high frequency power ratio, the ratio of the integral of spectral power below 0.10 Hz to the integral of power between 0.15 and 0.25 Hz (Figure 3). Tallied votes from the four raters were used to separate components into three broad classes: artifact (score equal to zero), mixed (score between zero and three), and RSN (score of three or greater and no votes equal to zero). This classification scheme constitutes somewhat conservative selection criteria for RSNs, though we feel it is appropriate given our larger goal of discovering associations with gender and age.

Outcome measures
For the set of selected RSNs, we considered three outcome variables: (1) component power spectra, (2) component SMs, and (3) between component connectivity (FNC). Spectra were estimated on the detrended subject-specific TCs (involving removal of the mean, slope, and period p and 2p sines and cosines over each TC) using the multi-taper approach as implemented in Chronux 4 , with the time-bandwidth product set to 3 and the number of tapers set to 5 (Mitra and Bokil, 2008).
Component SMs were thresholded to focus our analysis on the subset of voxels most representative of each network. Thresholding was based on the distribution of voxelwise t-statistics to identify voxels with strong and consistent activation across subjects, as explained in Appendix B ( Figure A1). From this point forward, descriptions of component SMs refer to the thresholded maps, which include regions most associated with component TCs.
Functional network connectivity was estimated as the Pearson's correlation coefficient between pairs of TCs (Jafri et al., 2008). Subject-specific TCs were detrended and despiked using 3dDespike 5 , then filtered using a fifth-order Butterworth low-pass filter with a high frequency cutoff of 0.15 Hz. Pairwise correlations were computed between RSN TCs, resulting in a symmetric C 1 × C 1 correlation matrix for each subject. For all FNC analyses, correlations were transformed to z-scores using Fisher's transformation, z = atanh(k), where k is the correlation between two component TCs.

Overview
Our primary aim is to develop a statistical approach optimized for the large dimensions of the three ICA-derived outcome measures. We propose a multivariate model selection strategy to reduce the total number of statistical tests performed and facilitate testing predictors 4 http://chronux.org 5 http://afni.nimh.nih.gov/afni

FIGuRe 2 | Distributions of continuous covariates of interest (A) and nuisance predictors (B).
Distributions of covariates are skewed (light gray, left panels) so are transformed to have more symmetric distributions (dark gray, right panels). This reduces disproportionate influence of more extreme observations on the MANCOVA and univariate model fits.
Because of the large number of columns of each response matrix, and because of autoregressive correlation structure between columns, we perform a PCA dimension reduction on each matrix. For dimension reduction, we use MDL to estimate the "true" number of dimensions, u. Let P c * be the dimension-reduced matrix of power spectra, keeping the first u PCs, with S c * and K * defined similarly. Over the C 1 components, spectra with 129 frequency bins were reduced to a range of 25-27 dimensions. SMs with 2000-6000 voxels were reduced to 10-25 dimensions, and the FNC matrix with 378 pairwise correlations was reduced to 31 dimensions.

Backward selection
In this section, we describe model selection in terms of the spectra; the procedure is identical for SM and FNC response matrices. For each of the C 1 components, the MANCOVA model predicting spectral power is P DB E, c * = + where P c * is the M × u response matrix, D is the design matrix, B is the matrix of regression coefficients, and E is the matrix of errors. Backward selection is implemented in mStepwise from the MANCOVAN toolbox 6 . Each step performs an F-test using the Wilks' Lambda likelihood ratio test statistic (Christensen, 2001) comparing the current full model with each reduced model, defined by removing one column of the current D and the associated interactions, if present. The F-test evaluates whether the associated row(s) of B are simultaneously equal to zero. When all reduced models have been assessed, the term associated with the least significant reduced model (largest p-value) is removed from the design matrix, and this reduced model becomes our full model in the next iteration. The final reduced model, D r , has all terms not associated with higher-order interactions significant at a = 0.01. Note that reduced models are independently selected for each response matrix, i.e., P c * and S c * , for c = 1, …, C 1 , and K * . There are criticisms of stepwise selection methods, including that redundant (or correlated) predictors can adversely affect model selection and that resulting models have an inflated risk of capturing chance features of the data (Mantel, 1970;Henderson and Velleman, 1981;Judd et al., 1989;Derksen and Keselman, 1992). We minimize the potential negative impact a stepwise procedure can have on a resulting model by carefully considering a small full model (for example, not blindly including all pairwise interactions or higher-order terms), by performing only backward selection for the fewest model comparisons, and by not including highly correlated predictors of interest (see Appendix C and Figure A2). Simulations evaluating the performance of backward selection on spectra-like data show that it typically identifies the correct model (see Appendix D and Figure A3) suggesting that backward selection of a well-considered full model is an effective heuristic for model selection.

Univariate tests
At last, we wish to discover which spectral bins, SM voxels, or FNC correlations are associated with gender and age. Given the reduced model terms, univariate models are fit to the original (not dimension-reduced) response data,     P S K c c , , , , , c C = 1 1 and , to test the association with gender and age (Figure 1, step 5). Associations are visualized by plotting the log of the p-value with the sign of the associated t-statistic, −sign(t) log 10 (p), which provides information on both the directionally and statistical strength of the result. Univariate tests were corrected for multiple comparisons at an a = 0.01 significance level using false discovery rate (FDR; Genovese et al., 2002). We calculate partial correlation coefficients to measure the strength of the linear relationship between two variables [e.g., log(power) and log(age)] after adjusting for other predictors (the remaining regressors in the model). This is calculated as the correlation between the residualized variables, obtained by fitting models between the remaining predictors and each of the variables as response vectors (Christensen, 1996). For visualization, scatter plots display the response adjusted for nuisance predictors, where the fitted nuisance terms of the model are subtracted from the response vector. 6 http://www.mathworks.com/matlabcentral/fileexchange/27014-mancovan Frontiers in Systems Neuroscience www.frontiersin.org component focused at the temporo-parietal junction (IC 71) and overlapping the alerting system (Fan et al., 2005), and an anterior cingulate and insular network (IC 55) observed to activate during demanding tasks and conflict processing (Ridderinkhof et al., 2004;Klein et al., 2007;Eichele et al., 2008). Finally, we observe a number of frontal networks (FRONT; ICs 42, 20, 47, and 49) known to mediate executive as well as memory and language functions (Koechlin et al., 2003;Koechlin and Summerfield, 2007). Consistent with previous findings (Krienen and Buckner, 2009), we observe coactivation between medial prefrontal cortex and the contralateral cerebellum (IC 49). We note that a similar frontocerebellar network was observed as the bilateral homologue of IC 49, however this component (IC 18) was not classified as an RSN due to contamination with motion. Functional connectivity between RSNs is presented in Figure 4B. Correlations between component TCs reveal patterns of connectivity highly consistent with known functional relationships. For example, clusters of positive correlations correspond to greater connectivity within functional domains, and negative correlations between DMN components and sensoryrelated networks reflect the natural opposition of these systems as attention drifts between external and internal mental states .

results
We performed a 75-component GICA using resting-state fMRI data from 603 healthy participants. Demographic information for these subjects is provided in Table 1. Each of the 75-components had a cluster quality index greater than 0.8, indicating a highly stable ICA decomposition. Based on visual inspection of SMs and power spectra, we identified 35 components as ventricular, vascular, susceptibility or motion-related artifacts, 28 components as RSNs, and 12 components as mixed, representing a mixture of RSNs and artifactual sources (see Figure 3 and Section 2 for details). The set of mixed components is composed of networks with activation in subcortical regions (e.g., thalamus and caudate) that extends into neighboring ventricles, cerebellar components showing spatial overlap with sinuses, and components at frontal and occipital poles contaminated by motion. To reduce the likelihood of spurious results, only components classified as RSNs are considered further.

restIng-state networks
Spatial maps of the 28 selected RSNs are shown in Figure 4A.
Coordinates of their peak activations are provided in Table 2. The observed networks are similar to those identified previously with low model order ICA (roughly 20 components; Beckmann et al., 2005;Damoiseaux et al., 2006;Calhoun et al., 2008a;Smith et al., 2009) and nearly identical to those identified using high model order (roughly 70 components; Kiviniemi et al., 2009;Smith et al., 2009;Abou-Elseoud et al., 2010). For this reason, we describe the RSNs only briefly here, and provide citations to more comprehensive references.
Resting-state networks are grouped by their anatomical and functional properties. The BG are represented by a single component (IC 21) with activation focused in the putamen and pallidum (Robinson et al., 2009;Ystad et al., 2010). IC 17 forms a rather prototypical representation of the large parts of the auditory system (AUD), including bilateral activation of the superior temporal gyrus, superior temporal sulcus, and middle temporal gyrus (Seifritz et al., 2002;Specht and Reul, 2003). Motor and somatosensory functions (MOT) are captured by six components (ICs 7,23,24,38,56,and 29) situated in the vicinity of the central sulcus. Similar to previous studies (Krienen and Buckner, 2009;Abou-Elseoud et al., 2010), we find corresponding cerebellar coactivation in bilateral and lateralized networks. The visual system (VIS) is also represented by six components (ICs 46,64,67,48,39,and 59) in good agreement with the anatomical and functional delineations of occipital cortex (Grill-Spector and Malach, 2004). The DMN is captured in four components, separating the full map reported initially by Raichle et al. (2001) along the anterior-posterior and inferior-superior axes Harrison et al., 2008). We classify several RSNs known to be involved in directing and monitoring behavior as attentional networks (ATTN). These include lateralized fronto-parietal networks (IC 34 and 60) similar to the ventral attention network (Corbetta and Shulman, 2002;Vincent et al., 2008), a parietal and frontal-eye field network (IC 52) reminiscent of the dorsal attention network (Corbetta and Shulman, 2002), a component centered in the central and anterior precuneus (IC 72) which is implicated in directing attention (Cavanna and Trimble, 2006;Margulies et al., 2009), a bilateral

MultIvarIate results
We applied a multivariate model selection strategy to determine the effects of age and gender on RSN power spectra, SMs, and FNC while accounting for variance associated with nuisance regressors. Results from the multivariate analysis are displayed in Figure 5, which provides the significance of model terms gender, log(age), their interaction, and several nuisance parameters related to motion and spatial normalization in predicting response vari-

unIvarIate results
The strength of a multivariate selection process is its ability to identify which covariates are important when each separate response vector (power spectrum, SM, or FNC matrix) is considered as a whole. The multivariate results of dimension-reduced responses, however, are difficult to interpret. Thus, we performed univariate tests on each of the covariates of interest retained in the final model to understand the nature and extent of the relationship between these variables and RSN properties.

Power spectra
Univariate results summarizing the effects of age on RSN power spectra are shown in Figure 6A. We observe a decrease in low frequency power with age across all RSNs ( Figure 6A1). Significant effects are found primarily between 0 and 0.15 Hz, consistent with the frequency range over which synchronous fluctuations between brain regions are detected (Cordes et al., 2000;Sun et al., 2004). Agerelated reductions in spectral power are slightly more significant in attentional and DMN components, however the rate of decrease is fairly consistent over networks (Figure 6A2), suggesting that a single process underlies these somewhat global trends. Given the profound structural changes that occur with age (Good et al., 2001;Sowell et al., 2003;Østby et al., 2009;Tamnes et al., 2010), we performed additional analyses to address the extent to which decreases in low frequency power could be explained by decreases in GMC (see Appendix C). These analyses demonstrated that age has more explanatory power over and above that of GMC than GMC has over and above that of age (Figure A2B), meaning that age is a better predictor of spectral power than GMC. While this could be due to the error associated with measuring and computing local GMC, it may also imply that age includes information related not only to gray matter but also other factors, such as vascular compliance or degree of neural activation, that are related to differences in RSN spectral power.
Information regarding the effects of gender on spectral power is displayed in Figure 6B. Gender differences are found in only a few sensorimotor and attention-related RSNs and indicate slightly greater spectral power in males than females at very low frequencies (<0.05 Hz). Effects of gender are considerably smaller than those observed for age, both in terms of the test statistics at individual frequency bins and the fraction of the spectrum showing significant effects (Figures 6B1,B2).
Typical examples of the relationships between spectral power and the covariates of interest are shown in the rightmost panels of Figure 6. IC 53, representing the "core" of the posterior DMN, shows no difference with regard to gender ( Figure 6A3, as expected from the multivariate results in Figure 5), and a fairly linear relationship between log(age) and log(power; Figure 6A4). The partial correlation coefficient, r p = −0.38, is related to the variance accounted for by log(age) that is not accounted for by other regressors and indicates modest explanatory power. IC 72, composed largely of the central and anterior precuneus, shows a similar relationship with age and an additional gender effect (Figures 6B3,B4). Though the gender difference is statistically significant, distributions of power for males and females are highly overlapping and the gender term accounts for a relatively small portion of the response variance ( Figure 6B4,   ables for all RSNs. Darker colors correspond to more significant effects, while white squares indicate terms that were not retained in the backward selection process (a = 0.01). Gender was found to be mildly significant for a few of the spectra and slightly more significant for some of the SMs. Age was found to be highly significant in all models. Interestingly, the interaction term between age and gender was never retained in the backward selection process, suggesting that the effects of age are somewhat equivalent between genders for the age range and response variables investigated here. It should also be noted that either translation or rotation were retained in most models, meaning that these terms consistently accounted for significant variability in the outcome measures. This indicates considerable motion contamination of RSN SMs and, particularly, TCs, and strongly supports the incorporation of motion-related regressors to improve the estimates related to the covariates of interest.
are shown in Figures 7B3,B4, indicating slightly greater SM intensity in females than in males centered within the bilateral globus pallidus (IC 21) and left IFG (IC 20). As before, the distributions of intensity largely overlap between genders and the partial correlation coefficients (r p = 0.22 and r p = 0.31 for IC 21 and IC 20, respectively) suggest relatively low explanatory power. Figure 8 shows differences in FNC related to age and gender. The majority of between-network correlations decrease with age, and significant reductions appear to largely involve motor and attention networks ( Figure 8A1). In particular, the supplementary motor area (IC 56) and precuneus (IC 72), two of the brain's most interconnected regions (Hagmann et al., 2008;Gong et al., 2009), exhibit decreased correlations with a large number of other RSNs as a function of age. Gender-related differences in FNC are less significant than those observed for age, but show a consistent pattern of slightly greater correlations in males than females for connections involving motor networks ( Figure 8B1). This is true for correlations between components within the motor system, for example IC 24 (right sensorimotor cortex) and IC 7 (bilateral precentral gyri), and between motor RSN and other sensory-related networks, for example IC 24 and IC 17 (auditory cortex). Examples of age and gender effects on FNC correlations are displayed in Figures 8A2,B2. Correlation magnitude between IC 38 (bilateral postcentral gyri) and IC 56 (supplementary motor area) decreases very slightly with age (r p = −0.20), and shows no difference with regard to gender ( Figure 8A2). Correlations between IC 24 (right sensorimotor cortex) and IC 72 (precuneus) reveal a similar trend with age (r p = −0.19) and a very modest gender difference with males showing slightly stronger correlations than females (r p = −0.15) ( Figure 8B2). These effects are noticeably weaker than those found in spectra and SMs (see Figures 6 and 7), suggesting that functional connectivity between regions may be less affected by age and gender than connectivity within regions.

dIscussIon
In the present study, we used a high model order ICA decomposition of resting-state functional imaging data to delineate a set of RSNs on a very large sample (n = 603). We outlined a framework inspection of the scatter plots between log(age) and log(power) reveals that decreases in power do not begin until roughly 15 or 16 years of age, with power staying approximately constant or possibly showing a slight increase between 12 and 15 years. This trend, which was observed in additional RSNs, suggests a more complex relationship between age and spectral power during development (see Section 4).

Spatial maps
The effects of age on SM intensities are summarized in Figure 7A. Significant age-related decreases were observed in all RSNs and extend across large areas of cortex. Decreases were particularly pronounced (with regard to significance and rate of change) in several motor networks, frontal regions, and the precuneus (Figures 7A1,A2). Increases in SM intensity with age were found in a few sensory-related components, however only the BG network (IC 21) showed this effect in large clusters comprising a substantial portion of the component. Additional analyses investigating the contribution of GMC to the changes in SM intensities again suggested that age has more explanatory power than GMC ( Figure A2C). In Figure 7A, right panels, we show examples of regions with age-related decreases (IC 25, centered in anterior cingulate cortex) and increases (IC 21,putamen) in SM intensity. Note that both these regions also showed slight gender effects, seen in the different intercepts of the regression lines for males and females. For IC 25, the relationship between log(age) and intensity ( Figure 7A3) is approximately linear and the partial correlation coefficient (r p = −0.45) indicates moderate explanatory ability. For IC 21, the scatter plot ( Figure 7A4) suggests a positive linear relationship between log(age) and intensity at younger ages (roughly 12-27 years) that flattens or even decreases after 30 years of age. This pattern again implies the presence of more complex relationships between age and functional connectivity that should be investigated in future studies. Figure 7B describes differences in SMs with regard to gender. Similar to the results for power spectra, gender effects are weaker and more sparse than those observed for age. We find evidence of gender effects in both directions (i.e., males > females and males < females), however the majority of regions passing significance and those with the largest spatial extents show greater intensity in females (Figures 7B1,B2). Examples of the most pronounced gender effects FIGuRe 5 | Results from the reduced MANCoVA models, depicting the significance of covariates of interest (top) and nuisance predictors (bottom) for power spectra (left), SMs (middle), and the FNC matrix (right) in log 10 (p) units. White cells indicate terms that were removed from the full model during backward selection process. Note that the term labels refer to continuous covariates following normalizing transformations (e.g., log(age); see Figure 2). Also note that the range of log 10 (p) is limited by computational precision. In our analysis, epsilon is 2 −52 , which corresponds to a maximal −log 10 (p) value of 15.65.  (B,IC 72,precuneus). Line plots of the power spectra (A3,B3) show the mean log(power) ± 1 SE for males (blue) and females (red). Horizontal bars on the frequency axis denote bands with significant effects for age (white bar, solid line) and gender (gray bar, dotted line), and correspond to the range over which log(power) was averaged in the scatter plots. Scatter plots (A4, B4) show the covariate of interest versus log(power) after adjusting for nuisance regressors and age (for gender effects). The model fit is shown by colored lines and squares for age and gender, respectively. We indicate the number of frequency bins contributing to the data displayed (b  ) and the partial correlation coefficient (r p ) between the covariate of interest and log(power).

restIng-state networks
Similar to recent reports, we found that high model order ICA yielded extremely stable sources with sufficient spatial segregation of functional regions (Kiviniemi et al., 2009;Smith et al., 2009; to systematically analyze such high-dimensional datasets and estimated the effects of age and gender on the connectivity properties of RSNs. We discuss the findings of each of these steps and their relevance to the current literature in turn.

Frontiers in Systems Neuroscience www.frontiersin.org
For the set of RSNs, we further characterized large-scale functional connectivity by computing temporal correlations between networks. Our examination of a large sample of healthy subjects may provide the best estimate of FNC to date, and the structure of the correlation matrix identifies a number of interesting features (see Figure 4). As would be expected, there is strong clustering within functional domains (e.g., VIS, MOT, and DMN), and components within the DMN are anti-correlated with other systems. Note that negative correlations are observed without the application of global mean regression in preprocessing Fox et al., 2009), supporting the interpretation that temporally anti-correlated networks subserve opposing functions (Fox et al., 2005;Fox and Raichle, 2007). We also observe that the precuneus (IC 72) exhibits heterogeneous functional connectivity with numerous components in motor, visual, and attentional systems. This is consistent with recent reports implicating the precuneus as a central core in the cortical anatomical network (Hagmann et al., 2008;Bullmore and Sporns, 2009;Gong et al., 2009;Margulies et al., 2009). Another notable feature is the distinction in connectivity between the left and right fronto-parietal networks (IC 34 and 60, respectively). The left network is positively correlated with the DMNs and anti-correlated with visual and motor networks. In contrast, the right fronto-parietal network shows weak positive correlations with visual and motor components. Disparate behavior of these lateralized networks has been noted previously (Calhoun et al., 2008a;Smith et al., 2009) and Abou-Elseoud et al., 2010;Ystad et al., 2010). From 75 components, we identified 28 RSNs whose peak activation clusters were confined to cortical gray matter that describe direct cortico-cortical axonal pathways, indirect polysynaptic connections, and common feed-forward projections among cortical, subcortical, and cerebellar structures. These components are highly reminiscent of those described in previous studies regardless of the method of determination (e.g., spatial ICA, tensor ICA, or seed-based correlation), suggesting that they are fundamental components of the human connectome (Damoiseaux et al., 2006;Kiviniemi et al., 2009;Smith et al., 2009;Abou-Elseoud et al., 2010;Van Dijk et al., 2010). An important limitation of the set of RSNs is their breadth. Due to contamination of putative RSNs and artifacts (see Figure 3), we lack "full coverage" of all brain regions. This is particularly problematic for areas neighboring known susceptibility artifacts (orbitofrontal cortex), motion edges (frontal/occipital poles), and ventricles (subcortical nuclei). Future analyses could employ preprocessing steps optimized for subcortical regions (e.g., masking areas not of interest, using a smaller Gaussian smoothing kernel) or potentially use higher model order to better separate artifactual sources. An alternative method to increase coverage is to be less conservative with regard to component selection. Though this is a reasonable approach for a primary analysis identifying and characterizing networks, our ultimate goal is a secondary association analysis thus we chose to err on the side of caution using highly selective criteria. Allen et al. A baseline for network comparisons corresponds well with task-based findings, which demonstrate that the ventral attentional system involved in stimulus-driven orienting is right-lateralized (Corbetta and Shulman, 2002;He et al., 2007;Vincent et al., 2008), whereas the left fronto-parietal network is more implicated in explicit memory retrieval (Iidaka et al., 2006). Overall, these findings suggest highly distinct functional roles for these networks in a variety of mental states.

age and gender eFFects
For our sample of healthy controls largely composed of adolescents and young adults, age accounted for 10-20% of the variance in all investigated properties of resting-state functional connectivity.
With the exception of the BG (see below), age-related decreases were observed across all RSNs. These findings are consistent with previous studies focusing on DMN regions, though our examination of a large complement of RSNs suggests that age-related changes are most prominent in motor and attentional networks (Andrews-Hanna et al., 2007;Damoiseaux et al., 2008;Esposito et al., 2008;Koch et al., 2009;Biswal et al., 2010). A near-ubiquitous decrease in functional connectivity across cortex may appear at odds with Biswal et al. (2010), who in a large multi-site investigation report both age-related increases and decreases in connectivity. However, it should be noted that the cortical areas showing increases with age also exhibited negative SM loadings, i.e., were anti-correlated with seed regions or the component TC (e.g., see their Figure 2). Therefore, age-related increases in Biswal et al. (2010) correspond to reductions in the magnitude of anti-correlation, consistent with the notion that network activity decreases with age. In the current study, we confined analysis only to voxels with positive SM loadings, thus the directionality of age-related effects have an unambiguous interpretation. In our analysis only the bilateral putamen (IC 21) showed pronounced age-related increases in functional connectivity (see Figure 7A). The etiology of this distinction is unclear and certainly warrants future in-depth investigation. It is interesting to note that gray matter volumes in subcortical regions show very different agerelated trajectories from those found in cortex (Østby et al., 2009), suggesting that were they evaluated, additional BG and thalamic components would show changes with age that are distinct from the cortical trends. As the methodology to delineate these structures improves, so can investigations into the development of subcortical networks and cortical-subcortical interactions (Robinson et al., 2009;Fair et al., 2010;Ystad et al., 2010).
The specific relationship between age and measures of functional connectivity warrants special attention. As mentioned in the Section 3, inspection of the scatter plots for spectra and SMs suggest that reductions with age in cortical RSNs do not typically start until mid to late adolescence. Furthermore, the relatively linear relationships between log(age) and connectivity measures following adolescence indicate more rapid changes in young adulthood than in late adulthood. These trends are somewhat reminiscent of developmental changes in cortical gray matter (Good et al., 2001;Sowell et al., 2003;Østby et al., 2009;Tamnes et al., 2010), consistent with the notion that structural substrates underlie function (Hagmann et al., 2008). However, as indicated in previous studies and our own analysis (see Figure A2), alterations in functional connectivity cannot be explained by differences in gray matter alone (Damoiseaux et al., 2008;Glahn et al., 2010), and may be better predicted when including changes in white matter tracts and possible interactions. Future studies should examine changes in functional connectivity while considering the underlying structure and be designed specifically to probe developmental and transitional periods. Regression models might also incorporate higher-order age terms to capture curvature and non-monotonic trends which have been found previously (Szaflarski et al., 2006;Kelly et al., 2009;Stevens et al., 2009). It is also important to note that cross-sectional analyses, such as those used here, can reduce sensitivity to estimate trajectories and may increase bias; future studies should consider a longitudinal approach to alleviate these issues (Kraemer et al., 2000).
An important consideration for interpreting age-related effects in functional connectivity measures, particularly power spectra, is the influence of non-neural factors. Changes in the BOLD signal are a function of complex metabolic and vascular reactions and are only indirectly related to changes in resting-state neural activity (Schölvinck et al., 2010). The magnitude of spontaneous fluctuations and stimulus-evoked responses is affected by a multitude of physiological variables (e.g., baseline levels of blood flow, vascular compliance, cardiac and respiratory rhythms) that are known to change with age and therefore preclude a full interpretation of age-related findings (D' Esposito et al., 2003;Birn et al., 2006Birn et al., , 2008Ances et al., 2009). In our analysis, the influence of non-neural factors may be mitigated by the use of an ICA framework, which appears to partially separate physiological noise sources at higher model orders (Birn et al., 2008;Beall and Lowe, 2010;Starck et al., 2010), however there is almost certainly some remaining contribution of non-neural variables to the age-related trends in functional connectivity measures. In an effort to assess the magnitude of this contribution, we performed additional statistical analyses on the power spectra and SMs of vascular, ventricular and white matter regions, as detailed in Appendix E. As expected, age was found to be a significant predictor for nearly all vascular compartments and tissue types, but based on the size of the observed effects these results suggest at least a partial neural origin for age-related findings in power spectra and a likely neural origin in SMs. Note, however, that without direct measurements or the concurrent use of additional non-invasive techniques such as EEG (Whitford et al., 2007), NIRS (Huppert et al., 2006), or calibrated-fMRI (Ances et al., 2009), it is generally not feasible to fully dissociate neural from confounding physiological factors. We further note that regardless of their etiology, our study demonstrates robust age-related effects in functional connectivity measures that must be considered in future analyses, even when the age range across subjects is relatively narrow (e.g., 20-30 years).
Compared to age, gender differences in functional connectivity were smaller (accounting for between 5 and 15 percent of response variance) and confined to fewer areas. Despite relatively small effects, we note that the inclusion of gender and appropriate interaction terms in regression models may be critical for discovering and interpreting the effects of other covariates, particularly when investigating complex phenotypes such as general intelligence (Haier et al., 2005;Holland, 2006, 2007). Similar to previous studies, we found evidence for greater connectivity in females versus males within DMN regions (Bluhm et al., 2008;Biswal et al., 2010), though these effects were not nearly as pronounced as those in the left IFG or bilateral BG. Gender differences in the left IFG (Broca's area) are consistent with earlier Frontiers in Systems Neuroscience www.frontiersin.org reports showing increased GMC in the same region for females (Good et al., 2001;Luders et al., 2005). Given the importance of the left IFG in various aspects of speech generation (for review see Bookheimer, 2002), the difference likely reflects gender distinctions in the use or organization of language networks at rest. It is possible that the observed increases in GMC and functional connectivity relate to a slight enhancement of language skills in adult females (Weiss et al., 2003), though the existence and extent of this gender difference is debated (Wallentin, 2009). Regardless, our results motivate further investigation into the relationship between verbal fluency and resting-state activity in language networks (Hampson et al., 2002(Hampson et al., , 2006. Gender differences in the BG centered at the pallidum appear to be a novel finding and it is not clear why this network might be more coherent in females. Sexual dimorphisms in the structural aspects of the BG are found throughout development (Good et al., 2001;Luders et al., 2009), and further examination of gender-specific function in these regions is warranted. Gender differences were also observed in FNC and spectral power within the motor and sensory systems, where males showed increases relative to females. A greater level of basal activity in motor networks, as well as greater coactivation between different networks, may be related to increased use and coordination between these systems, as suggested by recent studies of motor learning and functional connectivity (Albert et al., 2009;Xiong et al., 2009). Alternatively, this distinction may reflect an inherent gender difference in sensorimotor connectivity, possibly related to the enhancement of motor and visuospatial skills in males (Weiss et al., 2003;Hamilton, 2008).

statIstIcal aPProach
Investigations of resting-state functional connectivity provide an exciting and potentially rich look into intrinsic architecture of the human brain, however this approach brings with it some methodological challenges that must be acknowledged. Unlike analysis of task-related data, which typically involves a limited number of predefined contrasts and is often constrained to regions of interest, analysis of intrinsic connectivity presents an overwhelming number of possible comparisons to be made within and between regions. To avoid mass univariate testing with inherent multiple comparison correction problems, we propose a hierarchical framework to focus analysis and effectively reduce the number of tests performed. At the initial level, we adopt a multivariate approach with backward selection on a carefully selected set of predictors which refines the model and limits the number of statistical tests performed. Using power spectra as an example, the full spectrum is PCA compressed over frequency bins and used as a single multivariate response for model selection. Only when relevant regressors are retained (age and/or gender) are univariate regressions performed, saving many tests in the case of gender (see Figures 5  and 6B1). Note that the use of ICA has also greatly refined the testing procedure by initially delineating relevant features: power spectra are evaluated for a relatively small number of select components (here, C 1 = 28), rather than the comparatively large number of voxels (here, V ≈ 67000) that might be tested in a seed-based coherence approach. Simulations indicate that the MANCOVA results are robust to false positives and that sensitivity is optimized by PCA reduction of response variables to the MDL estimate (see Figure A3). There are, of course, limitations to this method, chiefly that the effects of interest must be captured in the retained principal components. Thus it is possible that age or gender-related differences in functional connectivity were not identified in the current analysis if the relevant features contributed a small fraction of variance to the response. To avoid this limitation, one could use alternative multivariate approaches such as discriminatory PCA (Caprihan et al., 2008) or coefficient-constrained ICA (Sui et al., 2009) that "optimize" dimension reduction based on the desired contrast. Though such approaches can improve feature identification and classification, we opt to use PCA on the response vectors independently of predictors to retain statistical tests and interpretations that are unbiased by an optimization procedure. Regarding model selection, it is interesting to note that in most cases at least one of the motion-related nuisance regressors was retained as a predictor (see Figure 5). This indicates mild contamination of motion-related variance in nearly all ICA outcome measures and suggests that covariates summarizing motion should generally be included in design matrices to improve estimates for predictors of interest. Ideally, one would include additional nuisance regressors, such as average heart rate or variability in respiration depth, as these variables also contribute to variance in ICA, TCs, and SMs (Birn et al., 2008;Beall and Lowe, 2010;Starck et al., 2010). Though cardiac and respiratory measures were unfortunately not acquired for the group of subjects studied here, their inclusion would almost certainly improve estimation and inference of gender and age effects.
As a final note on the statistical approach described here, we comment on its broad applicability. The proposed method can be used to investigate the effects of numerous covariates on functional connectivity, and we anticipate that it will enhance the ability to detect differences related to neuropsychiatric disorders while controlling for demographic and other factors. Perhaps more importantly, this method is easily applied to a variety of datasets where response variables have high dimensionality but show considerable collinearity. These include outcome measures estimating functional and effective connectivity with various techniques (Sun et al., 2004;Karunanayaka et al., 2007;Deshpande et al., 2009), multidimensional parameters summarizing full brain network topology (Gong et al., 2009), as well as large-scale structural datasets such as those describing cortical thickness or white matter integrity. Though this multivariate framework provides a relatively minor departure from the mass univariate approach typically used, we feel it represents an important step toward the development of more sophisticated statistical and inferential methods that are necessary to comprehend increasingly large and complex data (Costafreda, 2010).
where N(·) and G(·) indicate normal and gamma distributions. The six distribution parameters (m c , s c , a c1 , b c1 , a c2 , b c2 ) and the two proportions (p c1 and p c2 ) were estimated by minimizing the residual sum of squares using the lsqnonlin function in Matlab with the trust-region-reflective Newton algorithm. Conditional on the fit of the Gaussian parameters, component-related voxels were determined as those more extreme than the threshold m c ± 4s c . Because the distributions of t-statistics for RSNs are typically skewed with more extreme positive values, we further limit our statistical analyses to voxels exceeding the threshold on the right tail: t > m + 4s. This strategy provided results consistent with what had previously been selected by eye and provides an excellent fit to the center of the voxel distribution ( Figure A1).

C. GrAy mAtter ConCentrAtion And AGe
Age is a highly significant predictor of RSN connectivity (see e.g., Figure 5), however given the profound structural changes that accompany age (Good et al., 2001;Sowell et al., 2003;Tamnes et al., 2010;Ostby et al., 2009), we ask whether gray matter concentration (GMC) might be a better predictor. Because age and GMC were highly correlated (Figure A2A), the inclusion of both terms in a full model for variable selection would lead to reduced models that sometimes include one or the other as a strong effect, or both predictors together weakly. It is difficult to interpret and make inferences from such reduced models, thus we instead assess the additional contributions of each variable over the other to determine which predictor has more explanatory power for the power spectra and SMs.
To determine their explanatory power, we first orthogonalize age and GMC by fitting two linear regressions, GMC onto age to obtain residualized GMC (GMC r , GMC information not explained by log(age)) and residualized age (age r , log(age) information not Figure A1 | A typical example of the normal-gamma-gamma (Nggs) model, fit to the distribution of t-statistics for iC 38. The distribution (gray) is relatively well described by a mixture of a normal (green), positive gamma (red), and negative gamma (blue). The full model fit is shown in black, and cutoffs (m ± 4s) are determined from the estimated mean (m) and SD (s) of the normal. Thresholded SMs include only voxels with positive t-statistics: t > m + 4s.

Frontiers in Systems Neuroscience
www.frontiersin.org

D. BackwarD selection anD collinearity simulation
To investigate the effects of backward selection and highly collinear data on model estimation, we created a simulation approximately matching the size and properties of our data. We begin with 600 subjects and 3000 response "bins," but only 13 true Gaussian sources uniformly spaced and with moderate overlap over the bins. Our design matrix includes two grouping predictors (g 1 , g 2 ) each with two levels, four continuous predictors (c 1 , c 2 , c 3 , c 4 ) normally distributed, and all group-covariate interactions as predictors, yielding 15 model terms including the constant. Gaussian explained by GMC). We then consider two models: model M 1 which includes age as a predictor and M 2 which includes GMC in place of age. Similar to our main approach, backward selection is performed on each model to select a reduced model. We next add GMC r to the reduced model M 1 and add age r to reduced model M 2 and test for significance of the added variables in each model, The F-statistics and associated p-values indicate the variability in the response explained by the residualized predictor, for example in M 1 , by GMC not explained by age.
This process is performed for the spectra and SMs of each RSN. Figures A2B,C show the −log 10 (p) values for the addition of GMC r in model M 1 and age r in model M 2 over all components. With the exception of a few component SMs, age r almost always contributes over GMC alone while GMC r rarely contributes over age alone. These model comparisons demonstrate that while age can often account for GMC-related changes component spectra and SMs, GMC is not sufficient to account for age-related effects. This result also justifies the inclusion of age and omission of GMC in the primary design matrix used for model selection.

Figure A2 | evaluation of age and gray matter concentration as predictors. (A)
Typical examples of the relationship between log(age) and GMC (averaged over voxels in the thresholded SM). (B,C) Significance of the residualized GMC (GMC r , gray) and residualized log(age; age r , black) terms in models predicting spectral power (B) and SM intensity (C) for each RSN. Wilcoxon signed-rank statistics (W), based on the difference between −log 10 (p) values, are displayed on each plot.

Figure A3 | Simulations showing benefits of dimension reduction using the MDL estimate. (A)
Average p-values from the MANCOVA F-test for each model term over different number of components used, ranging from 1 to 100. Dashed black line shows the true number of dimensions estimated correctly as 13 by MDL for all 100 simulations. (B) Hit rate (fraction of times each model term appeared in the reduced model, following backward selection) as a function of components used. (C) True positives (average hit rate for true effects) and false positives (average hit rate for false effects) as a function of components used. Though difficult to see given the scale, the false positive rate was lowest at 11 and 13 components (0.0067 and 0.0078, respectively), and never rose above 0.024 (21 components). sources are related to the covariates in the following way: source 1 is q 1i = 0.6c 3i g 2i + e 1i , source 6 is q 6i = 0.7g 1i + e 2i , source 7 is q 7i = 0.25c 3i + 0.8g 2i + e 7i , source 13 is q 13i = 0.21c 2i + e 13i , and the remaining sources are Gaussian noise. Thus, subject response vectors are associated with covariates g 1 , g 2 , c 2 , c 3 , and a g 2 -by-c 3 interaction. The 13 sources are linearly mixed by addition, Gaussian noise is added, and subject data are concatenated for Q. A PCA is performed resulting in Q*, and the number of PCs retained is varied from 1 to 100. MANCOVA backward selection is performed to determine a reduced model and the p-value for each term in that reduced model is retained. Simulations were repeated 100 times, randomizing the predictors and noise. MDL correctly estimates the dimension as 13 for all 100 simulations. Results from the simulations are displayed in Figure A3. Figure A3A illustrates that predictors are most significant (as determined from the average p-value) when the correct number of dimensions are used, but significance is not greatly sensitive to this variable, even with collinear data. With respect to the observed level of significance, the simulation is similar to what we observe for component spectra. Figure A3B shows that over the 100 simulations, the correct reduced model was often selected for a range of dimensions from 13 to 25 PCs, and the four strongest predictor effects were well estimated over a wide range of dimensions. Figure A3C emphasizes the same point by showing that the reduced model correctly included the important predictors most often when the correct dimension was estimated, with a slow decline in true positive model reduction with an overestimate of dimension. Incorrectly including non-important predictors, false positives, was rare for the full range of PCs retained.

e. effects of age on non-rsn components
To examine the effects of age on non-neural signals, we identified components with peak activations overlapping large arterial vessels ( Figure A4A, left panel) and the ventricular system ( Figure A4A, middle panel), where physiological noise sources should be most prominent (e.g., see Figure 5 of Beall and Lowe, 2010). The selected components approximately capture the basilar artery (IC 3), vertebral arteries (IC 6), fourth ventricle and neighboring cisterns (IC 16), and lateral ventricles (IC 44). Effects of age on the component spectra and SMs were then determined using the MANCOVA model with backward selection, identical to the approach used for RSNs (see Section 2.6.4).
In addition, we examined the spectra of time series from white matter (WM) and cerebrospinal fluid (CSF) regions, as defined by anatomical segmentations of the T1-weighted images. To compute the regional time series, we began with the spatially normalized, unsmoothed group averages of WM and CSF segmentations from the VBM analysis (see Section 2.3), resliced to the 3-mm isotropic EPI grid. We conservatively thresholded the WM and CSF images at probabilities of 0.99 and 0.87, respectively, and spatially restricted the CSF mask to the lateral ventricles to maximize temporal coherence across voxels. Figure A4A (right panel) displays the WM and CSF masks comprising 3350 and 329 voxels, respectively. We averaged the time series of masked voxels from each subject/s EPI data to obtain a representative WM and CSF time course. Power spectra were then estimated from the time courses and multivariate model selection was performed using the procedures detailed in Sections 2.6.3 and 2.6.4.
Multivariate results for the power spectra and SMs are displayed in Figures A4B,C. The −log 10 (p) values in each plot indicate the significance of age in predicting the power spectra ( Figure A4B) and SMs ( Figure A4C) of RSNs (gray circles; same data as shown in Figure 5) and non-RSNs (orange squares). As discussed in the main text, age accounts for a significant portion of response variance in all RSN spectra and SMs. Age is also a significant covariate for non-RSN spectra and SMs, though, notably, age is not retained in the models predicting the spectra of IC 16 or the SM of IC 3. Furthermore, the significance of the age terms for non-RSN spectra are largely equivalent to or less than the significance level observed for RSN power spectra. Because contributions of vascular, cardiac and respiratory signals should be maximal in non-RSN power spectra, we can infer that the larger age-related effects in RSN spectra result from additional (putatively neural) sources. For SMs, three of the four examined components show little or no effect of age; no voxel clusters from ICs 6 or 16 passed FDR correction. In contrast, age is a highly significant predictor for IC 44 (lateral ventricles). This result is expected given the known increase in ventricular volume over the lifespan (Barron et al., 1976), as demonstrated in Figure A4D. With age, the component distribution appears to expand more posteriorly, increasing SM intensity in the trigone of the lateral ventricles and decreasing intensity in the frontal horns. These results suggest that age-related effects in SMs reflect changes in the spatial distributions of activity, whether due to putative alterations in neural connectivity or large-scale morphological changes.