Diagnosis of Autism Spectrum Disorder Using Central-Moment Features From Low- and High-Order Dynamic Resting-State Functional Connectivity Networks

Zhao, Feng; Chen, Zhiyuan; Rekik, Islem; Lee, Seong-Whan; Shen, Dinggang

doi:10.3389/fnins.2020.00258

ORIGINAL RESEARCH article

Front. Neurosci., 28 April 2020

Sec. Brain Imaging Methods

Volume 14 - 2020 | https://doi.org/10.3389/fnins.2020.00258

This article is part of the Research TopicTemporal Features in Resting State fMRI DataView all 11 articles

Diagnosis of Autism Spectrum Disorder Using Central-Moment Features From Low- and High-Order Dynamic Resting-State Functional Connectivity Networks

Feng Zhao^1,2

Zhiyuan Chen^1,2

Islem Rekik³

Seong-Whan Lee⁴

Dinggang Shen^5,4*

¹School of Computer Science and Technology, Shandong Technology and Business University, Yantai, China
²Shandong Co-Innovation Center of Future Intelligent Computing, Yantai, China
³BASIRA Lab, CVIP Group, Computing, School of Science and Engineering, University of Dundee, Dundee, United Kingdom
⁴Department of Brain and Cognitive Engineering, Korea University, Seoul, South Korea
⁵Department of Radiology and Biomedical Research Imaging Central, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States

The sliding-window-based dynamic functional connectivity networks (D-FCNs) derived from resting-state functional magnetic resonance imaging (rs-fMRI) are effective methods for diagnosing various neurological diseases, including autism spectrum disorder (ASD). However, traditional D-FCNs are low-order networks based on pairwise correlation between brain regions, thus overlooking high-level interactions across multiple regions of interest (ROIs). Moreover, D-FCNs suffer from the temporal mismatching issue, i.e., subnetworks in the same temporal window do not have temporal correspondence across different subjects. To address the above problems, we first construct a novel high-order D-FCNs based on the principle of “correlation’s correlation” to further explore the higher level and more complex interaction relationships among multiple ROIs. Furthermore, we propose to use a central-moment method to extract temporal-invariance properties contained in either low- or high-order D-FCNs. Finally, we design and train an ensemble classifier by fusing the features extracted from conventional FCN, low-order D-FCNs, and high-order D-FCNs for the diagnosis of ASD and normal control subjects. Our method achieved the best ASD classification accuracy (83%), and our results revealed the features extracted from different networks fingerprinting the autistic brain at different connectional levels.

Introduction

Autism spectrum disorder (ASD) is a serious childhood neurodevelopmental disease, characterized by the impairment in social interaction, communication, and many other behavioral and cognitive functions in varying degrees (Geschwind and Levitt, 2007). According to the 2018 community report from the Centers for Disease Control and Prevention (CDCP)¹, about 1 in 59 American children has been identified with some form of ASD, with about four times more common among boys than among girls. Thus, accurate early diagnosis and timely intervention of ASD, especially for the infants under 12 months old, may have pivotal importance in preventing the progression of detrimental symptoms (Jin et al., 2015). However, ASD is a very complex and highly heterogeneous neurological disorder, which affects many higher-level brain functions and sometimes whole-brain structures, making it challenging for accurate diagnosis. To address this, extensive research efforts (Geschwind and Levitt, 2007; Anagnostou and Taylor, 2011; Jin et al., 2015; Wang et al., 2018) have been dedicated to analyzing the neuroimaging data with different modalities, including structural magnetic resonance imaging (s-MRI) (Wee et al., 2013), functional MRI (fMRI) (Zhao et al., 2018), diffusion tensor imaging (DTI) (Deshpande et al., 2013), and positron emission tomography (PET) (Zürcher et al., 2015), to investigate ASD-related biological or neurological mechanisms. In this way, the respective biomarkers could be identified for characterizing ASD.

Recently, resting-state fMRI (rs-fMRI) uses blood-oxygenation-level-dependent (BOLD) signals to probe brain activity, which has shown great potential in exploring the in vivo neuronal underpinnings of ASD (Fornito et al., 2015; Liu et al., 2016; Huang et al., 2018; Zhao et al., 2018). Since BOLD signals are sensitive to the spontaneous and intrinsic neural activities within the brain, re-fMRI can be used as an efficient and noninvasive way for investigating neuropathological substrates of many neurological and psychiatric disorders at a whole-brain system level (Admon et al., 2012; Ganella et al., 2017; Li et al., 2017). Temporal correlation of the BOLD signals between different pairs of brain regions of interest (ROIs) is often used to define brain functional connectivity (FC), which can be used to explore how brain ROIs interact with each other. In practice, FC is often modeled as a FC network (FCN), with each specific brain ROI as a node in the network, and the strength of FC between a pair of brain ROIs as an edge (or link). In terms of both topological structures and connection strength, the differences between normal and disrupted FCN caused by certain pathological attacks reveal potential biomarkers to understand pathological underpinnings of ASD. Therefore, FCN has charted out a promising research direction to investigate the brain’s functional differences between control and disease groups (Zhang et al., 2015, 2016; Qiao et al., 2018).

To date, researchers have developed many FCN models to capture rich information exchange across ROIs so that functional neurological biomarkers can be reliably identified for ASD diagnosis (Jie et al., 2014; Ha et al., 2015a; Plitt et al., 2015). The most commonly adopted FCN, namely, conventional FCN (C-FCN), is usually rooted in the assumption that the strength of FC is temporally stationary in the entire rs-fMRI scan duration (Achard, 2006; Zhao et al., 2018). Under such an assumption, FC is quantified with the correlation (e.g., Pearson’s correlation) between a pair of rs-fMRI time series from two ROIs. As a result, C-FCN captures the functional connectivity between two ROIs in a static manner, which unfortunately overlooks the dynamic interaction between brain ROIs during the scan period.

In fact, recent studies have demonstrated that the dynamic changes of FC throughout the entire scan time may be an intrinsic property of brain function (Damaraju et al., 2014a; Kudela et al., 2017). Given the increasing evidence that dynamic FC during the entire scan time is very important for understanding the fundamental properties of brain network and the underpinnings of disordered brain connectivity changes, different studies have resorted to dynamic FC networks (D-FCNs) to characterize dynamic changes of FC, as well as the association of these dynamic changes with brain diseases (Damaraju et al., 2014b; Wee et al., 2015; Guo et al., 2017).

The most commonly used strategy of constructing D-FCNs is the sliding-window approach (Hutchison et al., 2013). The detailed contracture process of D-FCNs [i.e., low-order dynamic functional connectivity networks (Lo-D-FCNs), which will be discussed in the following section) is shown in Figure 1. Specifically, the entire rs-fMRI time series from a subject were segmented into multiple overlapping subseries by a sliding window with prefixed window length and step size between two successive windows (Figure 1A1). For each subseries, a FC subnetwork is constructed by calculating the short-term correlation between different ROIs, which is similar to the construction of C-FCN. As an example, the construction process of the second subnetwork is shown in Figures 1A2,B2, where x_i and x_j, respectively, denote the average rs-fMRI time series across all voxels within the ith and the jth ROIs, and their correlation ρ_ij(2) is computed as the FC strength between the ith and the jth ROIs. In such a way, we can obtain a FC subnetwork (Figure 1B2), which reflects a short-term FC relationship between two ROIs. Repeating the above process, we can obtain a temporal FC subnetwork series, which is called dynamic FC networks (D-FCNs, i.e., Lo-D-FCNs) (Figure 1B1). Obviously, the correlation series (e.g., [ρ_ij(1),ρ_ij(2),⋯,ρ_ij(K)] in Figure 1B1) along the scanning time between a pair of ROIs can represent the temporal change of FC between the two ROIs, which indicates that D-FCNs can capture the dynamic properties of FC throughout the scan time and can provide rich discriminative information for ASD diagnosis.

FIGURE 1

Figure 1. Flow chart of constructing low- (Lo-D-FCNs) and high-order dynamic functional connectivity networks (Ho-D-FCNs), where (A1) denotes the resting-state functional MRI (rs-fMRI) time series associated with each region of interest (ROI), (A2) denotes the second rs-fMRI subseries based on a sliding window, (B1) is the Lo-D-FCNs, (B2) is the second subnetwork of Lo-D-FCNs, (C1) is the Ho-D-FCNs, and (C2) denotes the second subnetwork from Ho-D-FCNs.

While D-FCNs opens a new avenue for us to comprehensively understand brain activities, it still has the following two issues need to be addressed.

First, D-FCNs cannot reveal the potentially much complex and high-level relationship among multiple ROIs. Similar to C-FCN, D-FCNs is also based on computing pairwise correlation between neural signals, such as Pearson’s correlation and partial correlation, between a pair of rs-fMRIs from two ROIs to estimate the FC strength (Figure 1A2). Although such simple FC network representation has been widely utilized for examining brain functional activity, it dramatically ignores much complex and high-level interactions across multiple ROIs. In such a sense, C-FCN and D-FCNs are referred to as the low-order FCN, and thus, D-FCNs also will be named as Lo-D-FCNs in this paper. Recently, emerging connectomic studies have demonstrated that examining more complex interactions involving multiple ROIs can provide more valuable insights into brain disease fingerprinting and diagnosis (Chen et al., 2016; Zhang et al., 2016, 2017a,b, 2017c; Guo et al., 2017; Morris and Rekik, 2017; Soussia and Rekik, 2018; Zhao et al., 2018). Correspondingly, those FCNs, reflecting complex interactions across multiple ROIs, are referred as the high-order FCN (Ho-FCN).

By far, much attention has been dedicated to construct Ho-FCN models for exploring the interactions among multiple ROIs. For instance, Chen et al. (2016) constructed a Ho-FCN model based on the correlations between each pair of dynamic FC time series from sliding-window-based Lo-D-FCNs. Guo et al. (2017) modeled a Ho-FCN using a minimum spanning tree for Alzheimer’s disease (AD) classification. Based on a more simple and intuitive way, i.e., correlation’s correlation strategy, a new Ho-FCN was developed by Zhang et al. (2016) for more sensitive early AD detection. Different from Lo-FCN or Lo-D-FCNs, Ho-FCN presented by Zhang et al. defines another correlation between two brain regions based on their FC profiles, rather than BOLD signals. Here, the FC profile of a brain region means the traditional low-order FC of this region. In such a way, the correlation’s correlation is able to reveal some interesting information; for example, some brain regions may exhibit stronger correlation with each other in a feature space (defined by FC profile) than the raw neural signal space. Consequently, Ho-FCN is able to provide another source of information for diagnosis (Zhang et al., 2016).

Inspired from the principle of the correlation’s correlation, we construct a novel high-order dynamic FCNs (Ho-D-FCNs) for exploring the high-order dynamic FC relationships among multiple ROIs. Figures 1C1,C2 display the flowchart of constructing Ho-D-FCNs. For each subnetwork from the Lo-D-FCNs, such as the second one shown in Figure 1B2, we regard the correlations series between a ROI and all other ROIs as its short-time FC profile, which reflects the FC relationship between this ROI and all other ROIs in a short scanning time. For example, ρ_i(2) is the short-time FC profile of the ith ROI and ρ_j(2) is that of the jth ROI (Figure 1B2). Then, the high-order correlation is computed for each pair of ROI based on the associated short-time FC profiles, such as hp_ij(2) shown in Figure 1C2. Intuitively, such correlation reflects the relatively shorter time resemblance between a pair of FC profiles from two ROIs (i.e., correlation’s correlation) and thus involves multiple ROIs. By doing so, we can obtain a corresponding high-order subnetwork (e.g., Figure 1C2) from each low-order subnetwork (e.g., Figure 1B2), which reflects how the low-order temporal correlations between different brain ROIs interact with each other during a short scan time. Accordingly, the high-order subnetwork series (Figure 1C1) is referred as Ho-D-FCNs and utilized to reveal some new characteristics for biomarker detection. In fact, the experimental result in The Most Discriminative Features for ASD Diagnosis shows that Ho-D-FCNs can provide complementary information to C-FCN and Lo-D-FCNs.

Second, Lo-D-FCNs is sensitive to the chronological order of its subnetworks, which limits its use in comparative studies. Specifically, due to the unconstrained mental activity during the brain resting state, we cannot establish the temporal correspondence among these FC subnetworks from the same temporal window across different subjects. Therefore, the subnetwork series concatenated along scanning time (i.e., Lo-D-FCNs) might be dynamically mismatched across different subjects, which somewhat hinders the investigation and comparison of dynamic FC at a population level. It is noteworthy that Ho-D-FCNs presented in previous section also faces the same problem. By far, no method is proved to be effective in addressing this issue (Zhang et al., 2017a).

Statistical moment methods, including central, Hu, Zernike moments, and so on, have been broadly used in many areas for detecting and deriving various invariant properties of random signals (Hu, 1962; Hung et al., 2006). For the processing of a one-dimensional random sequence generated from a random variable, central-moment method owns the following merits: (Geschwind and Levitt, 2007) although central moment of different order partly characterizes some dynamic properties of a random sequence from its distinct view, their integration can provide a comprehensive characterization of the fluctuation properties of this sequence. (Jin et al., 2015) Most of central-moment features have the clear mathematical interpretability, e.g., for a sequence, its first-order central moment (i.e., mean) can reflect the fluctuation central; second-order central moment (i.e., variance) can reflect the fluctuation level; third-order central moment can reflect the skewness; and the fourth-order central moment can reflect the kurtosis. In theory, the change characteristics of a random sequence can be better represented by central-moment features. Usually, these central-moment features with the range from first- to seventh order are enough for us to analyze and describe the wave profile distribution of a random variable implicated in the sequence (Anagnostou and Taylor, 2011). More importantly, central-moment features are invariant to the temporal order of a sequence. In other words, as one expressional form of a random variable’s probability distribution, central-moment features of a random sequence are immune to the order of its elements (in a mathematical sense).

To clarify the characteristic of central moment, we show the calculated central-moment values of four sequences Y1–Y4 in Figure 2, where the values in the parentheses following each sequence (Y1–Y4) sequentially denote the mean, variance, and third- and forth-order central moment. In Figure 2A, Y1 and Y2 denote two sequences with reversed order. We can see that Y1 and Y2 have the same values of central moment, demonstrating the invariance of central-moment features with respect to the sequence order. In Figure 2B, Y3 and Y4 are two symmetric sequences with identical symmetry axis but rather different fluctuating range. From the calculated central moments for Y3 and Y4, we can see that, except for the mean, the other central moments have noticeable difference, which means that central-moment features are able to reflect the dynamic change of a sequence. Based on the analysis of Figure 2, we can see that the central-moment features is invariant to sequence order and is able to capture the dynamic variation of a sequence.

FIGURE 2

Figure 2. Illustration of the calculated mean, variance, and third- and forth-order central moment (sequentially denoted in the corresponding parentheses) for four sequences Y1–Y4. (A) Two sequences (Y1 and Y2) with reversed order. (B) Two symmetric sequences (Y3 and Y4) with identical symmetry axis but different fluctuating range.

Inspired by the advantages of central-moment method, we put forward a new approach that employs central-moment technique to excavate the temporal-invariance discriminative features of Lo-D-FCNs. Specifically, we treat each FC correlation time series of a pair of ROIs in a Lo-D-FCNs (such as [ρ_ij(1),ρ_ij(2),⋯,ρ_ij(K)] in Figure 1B1), which reflects the temporal changes of FC between two ROIs, as a one-dimensional random sequence that is generated from a random variable, and then, we extract the central-moment features of the sequence for further classification. Similarly, for Ho-D-FCNs, we regard the connection strength (i.e., the connection weight of an edge) series along the scanning time (such as[hρ_ij(1),hρ_ij(2),⋯,hρ_ij(K)] in Figure 1C1) as a one-dimensional sequence and extract corresponding central-moment features.

Using the central-moment features, we can summarize the dynamic variation of either low- or high-order FC among multiple ROIs along the scanning time and give a general physiological interpretation to some extent. For example, if the value of the first-order central moment (i.e., mean value) from the FC correlation time series between a pair of ROIs in Lo-D-FCNs or among multiple ROIs in Ho-D-FCNs is relatively large, these ROIs may have strong functional correlation with each other. Similarly, if the value of the second-order central moment (i.e., variance value) is relatively large, it means that the correlations among the corresponding ROIs is very unstable during the whole scanning time; in other words, the periods of high correlation among all the corresponding ROIs may alternate with the periods of low correlation. Contrarily, such an interpretation is very hard to be obtained by directly analyzing Lo-D-FCNs or Ho-D-FCNs due to the large-scale and dynamic network structure.

In summary, there are three parts of contribution in this paper: (Geschwind and Levitt, 2007) proposing new Ho-D-FCNs (never used in previous ASD diagnosis) to reflect high-level connectivity information across multiple ROIs; (Jin et al., 2015) utilizing a central-moment method to capture FC properties derived from Lo-D-FCNs or Ho-D-FCNs without performing chronological time matching; (Anagnostou and Taylor, 2011) employing three multilevel FCN models (i.e., C-FCN, Lo-D-FCNs, and Ho-D-FCNs) to comprehensively investigate complex and multilevel functional associations among brain ROIs.

Materials and Preprocessing

Subjects

The rs-fMRI dataset used in this paper was downloaded from a publicly available Autism Brain Imaging Data Exchange (ABIDE) database (Di Martino et al., 2013). To alleviate data heterogeneity, we only consider the rs-fMRI data acquired from 45 ASD patients and 47 normal controls (NCs) with ages ranging from 7- to 15 years old, scanned at New York University Langone Medical Center. All these considered subjects had no excessive head motion with a displacement of <1.5 mm or an angular rotation of <1.5° in any of three directions. The detailed demographic information of these subjects is summarized in Table 1. As shown in Table 1, there were no significant differences (p > 0.05) in gender, age, and FIQ between two groups. ASD subjects were diagnosed based on the autism criteria in Diagnostic and Statistical Manual of Mental Disorders, 4th Edition, Text Revision (DSM-IV-TR) (American Psychiatric Association, 2000). More details on the data collection, exclusion criteria, and scan parameters can be obtained from the ABIDE website².

TABLE 1

Table 1. Demographic information of the subjects.

Data Acquisition and Preprocessing

All included subjects were scanned using a 3-T Siemens Allegra scanner at the NYU Langone Medical Center. During the 6 min rs-fMRI scan procedure, most subjects were instructed to relax with their eyes and stare at a white fixation cross at the center of the black screen. Their eye statuses were monitored by an eye tracker. The mean framewise displacement (FD) was computed to describe head motion for each individual. The individuals were excluded if their mean FD is >1 mm (Lin et al., 2015; Ray et al., 2015). On the other hand, head motion effect was further corrected with the Friston 24-parameter model in the following process. The main scanning parameters used in this dataset include the flip angle = 90, 33 slices, TR/TE = 2,000/15 ms, 180 volumes, and voxel thickness = 4 mm.

For rs-fMRI data preprocessing, we used the Statistical Parametric Mapping (SPM8) software³. Specifically, the first 10 rs-fMRI volumes were removed to ensure magnetization stabilization. Then, all rs-fMRI volumes were normalized to the Montreal Neurological Institute (MNI) space with the resolution of 3 × 3 × 3 mm³. Subsequently, ventricle, global signals were regressed out as nuisance signals, while head motion was corrected with the Friston 24-parameter model (i.e., 6 head motion parameters, 6 head motion parameters from the previous time point, and the 12 corresponding squared items) for decreasing head motion effects (Satterthwaite et al., 2013; Yan et al., 2013). Furthermore, the band-pass filtering (0.01–0.08 Hz) and signal detrending were also performed to avoid physiological noise (Cordes et al., 2001), measurement error (Achard et al., 2008), and magnetic field drifts of the scanner (Tomasi and Volkow, 2010). Finally, the brain was parcellated into 116 brain ROIs using the Automated Anatomical Labeling (AAL) atlas (Tzourio-Mazoyer et al., 2002). Next, the average rs-fMRI time series was calculated for each brain ROI and then represented in a data matrix X ∈ R^170× 116, where 170 denotes the total number of temporal image volumes and 116 denotes the total number of all brain ROIs.

Method

In this section, we mainly detail how to construct our Ho-D-FCNs based on the “correlation’s correlation” principle. As mathematical notations, we use uppercase bold letters (e.g., G, C) to denote FC networks or matrices, lowercase bold letters (e.g., x) to denote vectors, and lower case letters (e.g., i, j) to denote scalars.

Figure 3 displays the flowchart of our proposed classification framework, including the following four steps: ① constructing various FC networks, including C-FCN, Lo-D-FCNs, and Ho-D-FCNs; ② extracting the central-moment features, ranging from the first- to the seventh-order, from Lo-D-FCNs and Ho-D-FCNs (central-moment extracted from Lo-D-FCNs and Ho-D-FCNs can be regarded as the network feature since each of its elements is derived from a correlation time series of a pair of ROIs); ③ selecting the most discriminative features in a two-stage feature selection process for reducing feature dimensionality and eliminating irrelevant features to the target classification task; and ④ classification fusion. We construct an ensemble classifier with three linear support vector machines (SVM) classifiers (Cortes and Vapnik, 1995), each being trained with a specific type of FC features. The classification scores by all SVM classifiers are finally fused, by weighted averaging, to predict the target class label (ASD or NC) for a given testing subject.

FIGURE 3

Figure 3. Overview of our proposed classification framework, including four main steps: ① constructing multiple functional connectivity networks (FCNs), ② extracting central-moment features, ③ feature selection, and ④ classification fusion. Lo-CM denotes the central-moment features from low-order dynamic functional connectivity networks (Lo-D-FCNs), and Ho-CM is from high-order dynamic functional connectivity networks (Ho-D-FCNs). The means of other symbols are the same with those presented in Introduction.

Multilevel FC Networks Construction

A network structure can be modeled as a graph comprising a set of vertexes and edges linking them. Let G denote a FC network where each vertex represents a specific ROI, and each edge is weighted by the strength of FC between its end vertices (i.e., ROIs). Let C denote the connectivity matrix of G, where each column (resp. row) denotes a specific ROI, and each element of C denotes the strength of FC between two ROIs. The structure of G is encoded in C. Next, we will detail how the corresponding connectivity matrices of C-FCN, Lo-D-FCNs, and Ho-D-FCNs are constructed.

C-FCN Construction

For each subject, let x_i = (x_i1,x_i2,⋯,x_iM)(i = 1,2,⋯,N) denote the average rs-fMRI time series across all voxels within the ith ROI, where M denotes the total number of temporal image volumes, and N denotes the total number of all ROIs. We can generate the conventional correlation-based FC network (C-FCN) G_C by a symmetric matrixC_C, defined as:

C_{C} = {(ρ_{i j})}_{1 \leq i, j \leq N}, (1)

where ρ_ij denotes the Pearson’s correlation between the average rs-fMRI time series from the ith and the jth ROIs, defined as:

ρ_{i j} = corr (x_{i}, x_{j}), (2)

It can be seen from Equation (1) that each row or column of C_C denotes the Pearson correlation series between a specific ROI and all other ROIs. Notably, G_C encodes the static interactions between any pair of ROIs during the entire scanning duration, which fails to capture the dynamic nature of neural activity.

Lo-D-FCNs Construction

To encode the nonstationary interactions between different ROIs, we adopt the sliding-window strategy to generate Lo-D-FCNs. Specifically, suppose that the length of the sliding window is T and the step size between two successive windows is S, thus the entire rs-fMRI time series x_i = (x_i1,x_i2,⋯,x_iM)(i = 1,2,⋯,N) corresponding to the ith ROI are partitioned into K overlapping segments with a predefined sliding window, where K = [(M−T)/S] + 1.

Letting x_i(k) = [x_i1(k),x_i2(k),⋯,x_iT(k)](k = 1,2,⋯,[cpsbreak]K) denote the kth time subseries ofx_i, we can calculate the kth submatric C_Lo−D(k) as Equation (1).

C_{Lo - D} (k) = {[ρ_{i j} (k)]}_{1 \leq i, j \leq N} (k = 1, 2, \dots, K) (3)

where ρ_ij(k) is computed as:

ρ_{i j} (k) ρ_{i j} (k) = corr [x_{i} (k), x_{j} (k)] (4)

Obviously, C_Lo−D(k) reflects the interaction between two ROIs during a relatively shorter time period. The submatrix series ${C_{Lo - D} (k)}_{k = 1}^{K}$ along the scanning time describes the temporal change of the connectivity strength for all ROI pairs. The corresponding FCN of ${C_{Lo - D} (k)}_{k = 1}^{K}$ is called Lo-D-FCNs and denoted asC_Lo−D(k) (see Figure 3).

Ho-D-FCNs Construction

To fully capture high-order functional interactions across brain ROIs, we adopt the “correlation’s correlation” principle (Zhang et al., 2016; Morris and Rekik, 2017; Soussia and Rekik, 2018; Zhao et al., 2018) to generate Ho-D-FCNs. Specifically, for the ith ROI of a subject, we can get a correlation series ρ_i(k) = [ρ_i1(k),ρ_i2(k),…,ρ_iN(k)] from the kth submatrixC_Lo−D(k) (see Equation 3). Mathematically, ρ_i(k) denotes the ith row or column of the symmetric matrixC_Lo−D(k). We regard ρ_i(k) as the short-time FC profile of the ith ROI on the kth time subseries, reflecting the correlations between the ith ROI and all other ROIs during the kth time section. Then, the correlation is computed between the short-time FC profile ρ_i(k) of the ith ROI and the short-time FC profile ρ_j(k) of the jth ROI as follows:

h ρ_{i j} (k) = corr [ρ_{i} (k), ρ_{j} (k)], (5)

Obviously, hρ_ij(k) denotes the “correlation’s correlation” between the ith ROI and the jth ROI in the kth time section, quantifying how the correlation series ρ_i(k) [i.e., the FC profiles ρ_i(k) between the ith ROI and all other ROIs resemble the correlation series ρ_i(k)[i.e., the FC profiles ρ_j(k)] between the jth ROI and all other ROIs. As a result, hρ_ij(k) can reveal more complex relationship between the FC profiles ρ_i(k) andρ_j(k), not just the original rs-fMRI time series x_i(k) andx_j(k). Thus, the correlation coefficient hρ_ij(k) can characterize more complex and abstract interactions among multiple ROIs, which occur in a relatively shorter time period. We further define a submatrix C_Ho−D(k) in the kth time section as follows:

C_{Ho - D} (k) = {[ρ h_{i j} (k)]}_{1 \leq i, j \leq N}, (6)

Based on Equation (6), we can construct a Ho-D-FCNs, denoted asC_Ho−D(k), where the submatrices series ${C_{Ho - D} (k)}_{k = 1}^{K}$ is regarded as the associated dynamic FC of C_Ho−D(k) along the scanning time. Obviously, C_Ho−D(k) can capture high-level interactions across multiple ROIs while preserving the dynamic aspect of brain functional activity. Similar toG_Lo−D, Figure 3 displays the main steps for constructingG_Ho−D(k).

Feature Extraction and Selection

With the above-mentioned methods in Multilevel FC Networks Construction, three different types of FCN, i.e., G_C,G_Lo−D and G_Lo−D, are obtained to form multilevel representations of functional interactions across multiple ROIs. In this section, we mainly introduce how to extract and select features from these FCNs.

Central-Moment Feature Extraction

We note that both FC networks G_Lo−D and G_Ho−D are out of temporal synchrony across different subjects. In other words, the kth time subseries, $ρ_{i j}^{l} (k) (k = 1, 2, \dots, K)$ [or $h ρ_{i j}^{l} (k)$ ] from the lth subject may be inconsistent with $ρ_{i j}^{r} (k)$ [or $h ρ_{i j}^{r} (k)$ ] from the rth subject due to the unconstrained mental activities during resting state. To extract consistent dynamic connectomic features across subjects, we propose to extract the central-moment features of G_Lo−D and carry out the same procedure forG_Ho−D. Specifically, we first construct a FC time series ρ_ij between the ith ROI and the jth ROI by concatenating the elements ρ_ij(k) (see Equation 3) as follows:

ρ_{i j} = [ρ_{i j} (1), ρ_{i j} (2), \dots ρ_{i j} (k), \dots, ρ_{i j}

(K)] (1 \leq i, j \leq N, 1 \leq k \leq K), (7)

where ρ_ij reflects the FC dynamic changes along the scanning time between the ith ROI and the jth ROI. We calculate its dth order central-moment m_ij(d) of ρ_ijas follows:

m_{i j} (d) = \sqrt[d]{\frac{\sum_{k = 1}^{K} {[ρ_{i j} (k) - {\bar{ρ}}_{i j}]}^{d}}{K}} (d = 1, 2, \dots D), (8)

where D denotes the highest order. We further get a central-moment matrix series ${M_{Lo - D} (d)}_{d = 1}^{D}$ from G_Lo−D [i.e., ${C_{Lo - D} (k)}_{k = 1}^{K}$ ] by the following definition:

M_{Lo - D} (d) = {[m_{i j} (d)]}_{1 \leq i, j \leq N} (d = 1, 2, \dots D), (9)

It can be seen from Equation (8) that m_ij(d) is invariant to the element order of ρ_ij = [ρ_ij(1),ρ_ij(2),⋯ρ_ij(k),⋯,ρ_ij(K)]. Thus, ${M_{Lo - D} (d)}_{d = 1}^{D}$ is insensitive to temporal asynchrony across subject.

We use the same strategy to derive central-moment matrix series ${M_{H o - D} (d)}_{d = 1}^{D}$ of G_Ho−D [i.e., ${C_{Ho - D} (k)}_{k = 1}^{K}$ ] using the following formula:

M_{Ho - D} (d) = {[h m_{i j} (d)]}_{1 \leq i, j \leq N} (d = 1, 2, \dots D), (10)

where hm_ij(d) is computed as follows:

h m_{i j} (d) = \sqrt[d]{\frac{\sum_{k = 1}^{K} {[h ρ_{i j} (k) - {\bar{h ρ}}_{i j}]}^{d}}{K}} (d = 1, 2, \dots D), (11)

hρ_ij(k) denotes the “correlation’s correlation” between the ith ROI and the jth ROI in the kth time section (see Equation 5). We also give a brief illustration of M_Lo−D(d) and M_Ho−D(d) construction in Figure 3.

Feature Selection Using a Two-Stage Approach

For the lth subject, we obtain three types of raw features, i.e., the features $C_{C}^{(l)}$ of C-FCN, the central-moment features $M_{Lo - D}^{(l)} (d)$ of Lo-D-FCNs, and the central-moment features $M_{Ho - D}^{(l)} (d)$ of Ho-D-FCNs, each of which is a N×N symmetric matrix. Here, N denotes the number of ROIs, and N = 116 is set in our case. Since each matrix is symmetric, we only vectorize their lower off-diagonal triangular part to define the feature vector set ${y_{0}^{(l)}, y_{1}^{(l)}, y_{2}^{(l)}}$ , for representing the lth subject, where $y_{0}^{(l)}$ , $y_{1}^{(l)}$ , and $y_{2}^{(l)}$ denote the vectorization of $C_{C}^{(l)}$ , $M_{Lo - D}^{(l)} (d)$ , and $M_{Ho - D}^{(l)} (d)$ , respectively. The dimensionality of $y_{c}^{(l)} (0 \leq c \leq 2)$ is $\frac{N (N - 1)}{2}$ , and it is 6,670 in our case, where c denotes the type of feature vector. Obviously, the feature dimensionality is much larger than the total number of subjects. More importantly, many features may be irrelevant to ASD diagnosis.

To remove the redundant features while preserving a small subset of discriminative features that are most likely relevant to ASD pathology, we design a two-stage feature selection strategy. Specifically, in the first stage, for each feature from $y_{c}^{(l)} (0 \leq i \leq 2)$ , we perform a two-sample t-test between NC and ASD subjects, due to its simplicity and efficiency. Then, we select the features only with their p-values smaller than a certain threshold. In such a way, we can get a preliminary set of features that are highly correlated with the class label, while the rest features not correlated with classification well be eliminated. However, some feature may be still correlated to each other, thus causing feature redundancy. Therefore, to further remove features from these correlated features, we adopt the L₁-norm regularized least squares regression, known as LASSO (Tibshirani, 1996), to further optimize the feature subset in the second stage. Note that the t-test is performed on each feature individually, while LASSO regression considers all features jointly such that the correlation between features can be taken into account. Specifically, let ${\bar{y}}_{c}^{(l)} (0 \leq c \leq 2)$ denote the features selected by the t-test. I^(l) is the class labels of ${\bar{y}}_{c}^{(l)}$ , where I^(l) = 1 if the lth subject is ASD and I^(l) = −1 if the lth subject is NC. Let w_c represent the weight vector for the feature selection task. Mathematically, the LASSO model can be formalized as energy functional to optimize (Tibshirani, 1996):

min \frac{1}{2} \sum_{l = 1}^{L} | | I^{(l)} - < y_{c}^{(l)}, W_{c} > | |^{2} + λ | | W_{c} | |_{1} (12)

where ⟨∙,∙⟩ denotes the inner operator, L denotes the number of subjects, and λ is a parameter, controlling the model’s sparsity based on the L₁-norm regularization. The larger the value of λ, the sparser the model is. In this way, we can jointly achieve sparse feature selection. In other words, those features with nonzero elements of w_i were eventually retained. Let ${\bar{\bar{y}}}_{c}^{(l)} (0 \leq c \leq 2)$ denote the final selected set of feature from the original pool of feature vectors $y_{c}^{(l)} (0 \leq c \leq 2)$ .

Classifier Learning and Fusing

After selecting the most important features by the two-stage approach, we use SVM with linear kernel for ASD classification. Considering these features ${\bar{\bar{y}}}_{c}^{(l)} (0 \leq c \leq 2)$ are generated from three FCNs with different level, we train an SVM classifier for each type of features ${\bar{\bar{y}}}_{c}^{(l)} (0 \leq c \leq 2)$ . SVM seeks a maximum margin hyper-plane to separate the samples from two different classes. The empirical risk on the training data and the complexity of the model can be balanced by the hyperparameter γ, thus ensuring good generalization ability on the unseen data. Finally, we can fuse these three SVM classifiers together for making the final result. Specifically, each type of features ${\bar{\bar{y}}}_{c}^{(l)}$ are used to train a specific classifier. Then, for a test subject, each SVM will output an associated decision score, indicating the probability of that subject belonging to a class. Finally, to obtain classification result, we calculate the weighted average of the three decision scores from these SVM models with weight α tuned for each SVM, which reflects the reliability of corresponding decision score. In Figure 3, we provide a brief illustration of the classifier learning and fusing.

Experimental Analysis

For evaluating the performance of our proposed method, we adopted a sixfold cross-validation (CV) strategy to perform experiments. For example, all training subjects were randomly partitioned into six subsets (each subset with a roughly equal number of samples), and each time the samples within one subset are selected as the testing dataset, while the remaining samples within the other five subsets are combined together as the training dataset for feature selection and classifier training. For evaluation, we reported the average accuracy of classification results across all six CV cases. Furthermore, to avoid any possible bias in fold selection, the entire sixfold CV process was repeated 10 times, with a different random partitioning of samples each time. Finally, the average statistics of the 10 repetitions was reported. To carry out our proposed method and other competing algorithms, some parameters need to set, such as p-values in the two-sample t-test model, λ in the LASSO model (Feature Selection Using a Two-Stage Approach), and γ and α in the linear SVM model (Classifier Learning and Fusing). For fair comparison, we use nested CV to tune the parameters in each method. In particular, for each fold in the above sixfold CV, we perform another fivefold CV on the five subsets, which is used for training for the selection of parameters. The optimal values can be determined by this inner fivefold CV when the average classification accuracy reaches its optimum. Then, the selected parameters are used to learn a model based on the entire training dataset, which is further utilized for classification on the testing dataset. For our approach, we determine the optimal values for the parameters in the following range: p–values ∈ [0.01:0.01:0.1],λ ∈ [0.1:0.1:0.7], γ ∈ [2⁻⁵,2⁻⁴,⋯,2⁵], and α ∈ [0.1:0.1:0.9].

As usual, we adopt six evaluation measures, i.e., classification accuracy (ACC), sensitivity or true positive rate (TPR), specificity or true negative rate (TNR), positive predictive value (PPV), negative predictive value (NPV), and F1 score, to comprehensively evaluate classification performance. Their definitions are given as follows:

ACC = \frac{T P + T N}{T P + F P + T N + F N}, (13)

TPR = \frac{T P}{T P + F N}, (14)

TNR = \frac{T N}{F P + T N}, (15)

PPV = \frac{T P}{T P + F N}, (16)

NPV = \frac{T N}{F N + T N}, (17)

F 1 = \frac{2 \times T P}{2 \times T P + F N + F P}, (18)

where TP, TN, FP, and FN indicate the true positive, true negative, false positive, and false negative, respectively. Note that we treat ASD patients as positive samples and NC as negative samples in this paper.

The Influence of Parameters on D-FCNs

In the construction of D-FCNs (including Lo-D-FCNs and Ho-D-FCNs) and feature extraction, there are three parameters to tune: (1) sliding window length T, (2) the step size between two successive windows S, and (3) the order of central moment d, which jointly affects the diagnosis accuracy of Lo-D-FCNs and Ho-D-FCNs. To evaluate the impact of these parameters on classification performance and select a suitable combination of parameters for the subsequent multiclassifier fusion, we vary the values of these parameters in specific range (i.e., T = [40:10:90],S = [2:2:12],d = [1:1:7]) and repeat the classification experiments based on different combinations of these parameters. It is worth noting that when d = 1, we use the mean value instead of the first-order moment so that the method can better reflect the sample characteristics.

Here, we use the average classification accuracy (ACC) to evaluate the applicability of parameter combination to ASD diagnosis. Figure 4 displays the ACC achieved by Lo-D-FCNs and Ho-D-FCNs using different combinations of T, S, and d values. The higher the accuracy is, the longer the length and the warmer the color are.

FIGURE 4

Figure 4. The average classification (ACC) using different combinations of T, S, and d values. (A) The histogram of different ACC in Lo-D-FCNs. (B) The histogram of different ACC in Ho-D-FCNs.

As shown in Figure 4A, the optimal parameter combination for Lo-D-FCNs is T = 60,S = 2,andd = 4, its ACC is 79.4, while the minimum value of ACC is 54.0 when T = 60,S = 10,andd = 3. Likewise, from Figure 4B, we can see that the optimal parameter combination for Ho-D-FCNs is T = 40,S = 12,andd = 2, its ACC is 77.6, while the minimum is 56.1 when T = 70,S = 8,andd = d. Therefore, based on Figure 4, we can observe that the classification preformation is rather sensitive to these parameters. For boosting the final classification accuracy, we set these optimal parameters (i.e., T = 60,S = 2,andd = 4 for Lo-D-FCNs and T = 40, S = 12, and d = 2 for Ho-D-FCNs) as the default parameter for the following experiments.

Fusion Results of the C-FCN, Lo-D-FCNs, and Ho-D-FCNs

We select the combination of parameters that can lead to the highest ACC from the SVMs of C-FCN, Lo-D-FCNs, and Ho-D-FCNs, respectively, and obtain the final classification result by linear fusion of the SVM ensemble decision scores. In addition to our model, we also added another recently developed high-order FC network approach (Zhou et al., 2018) for comparison. Similar to our approach, this method also used sliding window approach to capture the dynamic variation of FC, and a series of traditional FC networks are constructed. Then, both low-order (termed as LoM) and high-order FC (termed as HiO) networks are constructed by maximum likelihood estimation with the assumption that these D-FCNs follow the matrix variate normal distribution.

Table 2 shows the average classification performance of nine models. Among them, C_C denotes the feature derived from the conventional correlation-based FC network (C-FCN), and C_C + C_Lo–D denotes the fusion of C-FCN and Lo-D-FCNs. The number following C_Lo–D denotes the order of central moment used to extract features. For example, C_Lo–D(1) means the low-order dynamic FC network with mean as central moment. Notice that the constructed LoM network in Zhou et al. (2018) is equivalent to our proposed Lo-D-FCNs when the order of central moment equals to 1, i.e., C_Lo–_D(1). We also report the standard deviation of the classification accuracy. The best results are highlighted in bold.

TABLE 2

Table 2. Autism spectrum disorder (ASD) classification using different feature types and evaluation measures.

Based on Table 2, we can draw the conclusions below. (1) In terms of ACC and other evaluation measures, the performance of feature types derived from D-FCNs (i.e., Lo-D-FCNs and Ho-D-FCNs) are superior to that of C-FCN, in which ACC is increased by 4 and 5%, respectively, and other performance are also improved accordingly. This result indicates that the sliding-window-based D-FCNs can provide better features for ASD classification. (2) The classification result of ensemble classifier consistently outperforms that of single feature type, which supports the assumption of integrating multiorder connectional features for boosting classification results. (3) The fusion of C-FCN, Lo-D-FCNs, and Ho-D-FCNs achieved the best classification performance, indicating that different-level FCNs can provide complementary relevant information for ASD diagnosis and classification, and the fusion of this information can further improve the classification performance. This result will also be reflected in the following experiments. (4) By comparing our model with the approach proposed in Zhou et al. (2018), we also find that our central-moment-based approach performs better in terms of accuracy. Actually, the performance of HiO is inferior to the corresponding low-order FC network [i.e.,C_Lo–D(1)], which is consistent with the results given in Zhou et al. (2018). This comparison also verifies the effectiveness of our central-moment features.

The Most Discriminative Features for ASD Diagnosis

We used t-test, followed by LASSO regression, to identify the most discriminative features in C-FCN, Lo-D-FCNs, and Ho-D-FCNs, respectively. In this study, we used the frequency, at which features are selected in all cross-validation cases, to quantify feature relevance to the target classification. The higher the feature frequency, the more reliable and discriminative it is regarded.

Figures 5A–C visualizes the top 10 most discriminative features of C-FCN, Lo-D-FCNs, and Ho-D-FCNs in the form of circular graphs, where each link corresponds to a connectional feature and represents the correlation between two brain regions (Krzywinski et al., 2009). Figure 5D also shows the mutual comparison among three sets of connections. We use link thickness to encode the degree of their correlation. The thicker the link is, the stronger the correlation is; also, the higher the frequency of the connection selected in cross-validation is, the greater the contribution to the target classification tasks is. For the abbreviations of brain regions in Figure 5, please refer to Table 3. In addition, we mark L (or R) following a brain region (or ROI) name to denote that it lies in the left hemisphere (or the right hemisphere), such as ANG^R means the right angular gyrus.

FIGURE 5

Figure 5. The circular graphs and the involved brain regions of interest (ROIs) of the top 10 discriminative connections selected by our proposed method. (A) The correlation-based functional connectivity (FC) network (C-FCN), (B) the low-order dynamic FC network (Lo-D-FCNs), (C) the high-order dynamic FC network (Ho-D-FCNs), and (D) the mutual comparison among three sets of connections. The selection frequency is encoded by the thickness of each connecting curve, i.e., thicker curves indicate higher selection frequency. For brain region abbreviations, please refer to Table 3.

TABLE 3

Table 3. Abbreviations of ROIs selected from conventional functional connectivity network (C-FCN), low-order dynamic FCNs (Lo-D-FCNs), and high-order D-FCNs (Ho-D-FCNs).

From Figure 5 and Table 3, we can derive the following. (1) The discriminative connections is not limited to connect the same hemisphere or brain lobe but also includes transhemisphere and all brain lobe, which indicates that the brain function of ASD patients has an abnormal distribution pattern over the whole brain. (2) Most selected brain regions are associated with emotional expression, language understanding, and motion coordination, such as precentral gyrus, middle frontal gyrus, middle cingulate gyrus, posterior cingulate gyrus, amygdala, angular gyrus, and others. These observations are consistent with previous studies (Qiu et al., 2010; Ecker et al., 2015; Ha et al., 2015b; Huang et al., 2018). For example, we found that SFGmed^L (Andrews-Hanna et al., 2014), ANG^R (Andrews-Hanna et al., 2014), PCUN^L (Urbain et al., 2015), CAL^L (Perkins et al., 2015), FFG^R (Urbain et al., 2016), INS^L (Leung et al., 2015; Urbain et al., 2016) contributed more to ASD identification, which is in line with the recent finding reported in the existing literatures. (3) Features selected from C-FCN, Lo-D-FCNs, and Ho-D-FCNs have significant differences, which can be seen from three aspects: first, the selected connected features by each FCN (i.e., the connectional lines in Figures 5A–C are almost entirely different from each other, except for the connected features (IX-Cb^L-PCUN^R) selected by both Lo-D-FCNs and Ho-D-FCNs although with different strength; second, according to the affiliation relation of the selected ROIs with respect to corresponding FCNs (Figure 5D), we find that most of the selected ROIs merely belong to one FCN, except one ROI (PUCN^R) that is jointly selected by all the three FCNs, four ROIs by C-FNC and Lo-D-FCNs (or Ho-D-FCNs), and five ROIs by Lo-D-FCNs and Ho-D-FCNs; and third, the regional distribution of the selected features has huge difference among the three FCNs. For example, the connectional features selected by C-FCN mainly distribute in TEM^L, PAR^L, OCC^L, SBC^L–R, LIM^L–R, INS^L–R, and FOR^L–R (Figure 5A). The features selected by Lo-D-FCNs mainly locate in INS^R, LIM^R, SBC^R, OCC^R, PAR^R, TEM^R–L, CRE^L–R, and VER^L–R (Figure 5B) and that of Ho-D-FCNs is in INS^L, LIM^L, SBC^L, TEM^R–L, and CER^L–R (Figure 5C). In summary, the above analysis of difference among three FCNs show that their network infrastructures exist significantly different, which indicate that FCNs of different level can provide complementary information for diagnosis. We think that the main reason causing the huge difference among the three FCNs is that each FCN actually reflects the correlation between brain regions from rather different viewpoints. C-FCN generally captures the static connectional feature since its FC is measured using the whole scanning time rs-fMRI series from any pair of ROIs, while Lo-D-FCNs reveals the dynamically connectional relationship between a pair of ROIs because its FC metric is similar to C-FCN, just using a short-time rs-fMRI series. Compared with C-FCN and Lo-D-FCNs, Ho-D-FCNs uses a vastly different metric to measure the connectional relationship between a pair of ROIs, i.e., using the synchronization of the short-time FC profile between two ROIs to represent their temporary correlation. Therefore, Ho-D-FCNs can reveal some new FC interaction among ROIs, thus providing supplementary information to C-FCN and Lo-D-FCNs.

Conclusion

In this paper, we proposed new Ho-D-FCNs and used the central-moment method to eliminate the phase mismatch problem of dynamic networks. Through the analysis of feature selection, we believed that the presented Ho-D-FCNs could provide complementary information to our previous research (C-FCN, Lo-D-FCNs). Therefore, we fused these three methods and got the optimal classification results. The experimental results have shown that: (1) Ho-D-FCNs was indeed helpful for mining the relevant information for ASD diagnosis; (2) different level FCNs could provide complementary information and improve the disease recognition rate through fusion; and (3) the central-moment method could help to solve the phase mismatch problem in dynamic networks, including Lo-D-FCNs and Ho-D-FCNs, which were covered in the paper. In addition, in the analysis of feature selection, we also found that most brain regions contributing to classification are related to emotional expression, language understanding, and motion coordination. These findings agree with the behavioral phenotype of ASD (Geschwind and Levitt, 2007; American Psychiatric Association, 2013).

Finally, it should be indicated that the fusion of the three methods based on the decision value of SVM might not adequately integrate the complementary information and thus have an impact on the classification accuracy. Therefore, feature fusion is a direction for future improvement, which will be our future work.

Data Availability Statement

The datasets generated for this study can be found in the Autism Brain Imaging Data Exchange (ABIDE) database (http://fcon_1000.projects.nitrc.org/indi/abide/abide_I.html).

Author Contributions

All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

Funding

FZ was supported in part by the National Natural Science Foundation of China (61773244, 61976125, 61272319, 61873117), Yantai Key Research and Development Program of China (2017ZH065, 2019XDHZ081), and Shandong Provincial Key Research and Development Program of China (2019GGX101069).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

References

Achard, S. (2006). A resilient, low-frequency, small-world human brain functional network with highly connected association cortical hubs. J. Neurosci. 26, 63–72.

PubMed Abstract | Google Scholar

Achard, S., Bassett, D. S., Meyer-Lindenberg, A., and Bullmore, E. (2008). Fractal connectivity of long-memory networks. Phys. Rev. 77(3 Pt 2):036104.

PubMed Abstract | Google Scholar

Admon, R., Bleich-Cohen, M., Weizmant, R., Poyurovsky, M., Faragian, S., and Hendler, T. (2012). Functional and structural neural indices of risk aversion in obsessive–compulsive disorder (OCD). Psychiatry Res. Neuroimaging 203, 207–213. doi: 10.1016/j.pscychresns.2012.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

American Psychiatric Association (2013). Diagnostic, and Statistical Manual of Mental Disorders, 5th Edn. (DSM-5). Text Revision. Washington, DC: American Psychiatric Association.

Google Scholar

Anagnostou, E., and Taylor, M. J. (2011). Review of neuroimaging in autism spectrum disorders: what have we learned and where we go from here. Mol. Autism 2:4. doi: 10.1186/2040-2392-2-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Andrews-Hanna, J. R., Smallwood, J., and Spreng, R. N. (2014). The default network and self-generated thought: component processes, dynamic control, and clinical relevance. Ann. N. Y. Acad. Sci. 1316, 29–52. doi: 10.1111/nyas.12360

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., Zhang, H., Gao, Y., Wee, C.-Y., Li, G., and Shen, D. (2016). High-order resting-state functional connectivity network for MCI classification. Hum. Brain Mapp. 37, 3282–3296. doi: 10.1002/hbm.23240

PubMed Abstract | CrossRef Full Text | Google Scholar

Cordes, D., Haughton, V. M., Arfanakis, K., Carew, J. D., Turski, P. A., Moritz, C. H., et al. (2001). Frequencies contributing to functional connectivity in the cerebral cortex in “resting-state” data. Am. J. Neuroradiol. 22, 1326–1333.

PubMed Abstract | Google Scholar

Cortes, C., and Vapnik, V. (1995). Support-vector networks. Machine Learn. 20, 273–297.

Google Scholar

Damaraju, E., Allen, E. A., Belger, A., Ford, J. M., McEwen, S., Mathalon, D. H., et al. (2014). Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia. NeuroImage Clin. 5, 298–308. doi: 10.1016/j.nicl.2014.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Deshpande, G., Libero, L. E., Sreenivasan, K. R., Deshpande, H. D., and Kana, R. K. (2013). Identification of neural connectivity signatures of autism using machine learning. Front. Hum. Neurosci. 7:670. doi: 10.3389/fnhum.2013.00670

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Martino, A., Yan, C.-G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., et al. (2013). The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol. Psychiatry 19, 659–667. doi: 10.1038/mp.2013.78

PubMed Abstract | CrossRef Full Text | Google Scholar

Ecker, C., Bookheimer, S. Y., and Murphy, D. G. M. (2015). Neuroimaging in autism spectrum disorder: brain structure and function across the lifespan. Lancet Neurol. 14, 1121–1134. doi: 10.1016/S1474-4422(15)00050-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Fornito, A., Zalesky, A., and Breakspear, M. (2015). The connectomics of brain disorders. Nat. Rev. Neurosci. 16, 159–172. doi: 10.1038/nrn3901

PubMed Abstract | CrossRef Full Text | Google Scholar

Ganella, E. P., Bartholomeusz, C. F., Seguin, C., Whittle, S., Bousman, C., Phassouliotis, C., et al. (2017). Functional brain networks in treatment-resistant schizophrenia. Schizophrenia Res. 184, 73–81. doi: 10.1016/j.schres.2016.12.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Geschwind, D. H., and Levitt, P. (2007). Autism spectrum disorders: developmental disconnection syndromes. Curr. Opin. Neurobiol. 17:103111.

Google Scholar

Guo, H., Liu, L., Chen, J., Xu, Y., and Jie, X. (2017). Alzheimer classification using a minimum spanning tree of high-order functional network on fMRI dataset. Front. Neurosci. 11:639. doi: 10.3389/fnins.2017.00639

PubMed Abstract | CrossRef Full Text | Google Scholar

Ha, S., Sohn, I.-J., Kim, N., Sim, H. J., and Cheon, K.-A. (2015). Characteristics of brains in autism spectrum disorder: structure, function and connectivity across the lifespan. Exp. Neurobiol. 24:273. doi: 10.5607/en.2015.24.4.273

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, M. K. (1962). Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8, 179–187.

Google Scholar

Huang, H., Liu, X., Jin, Y., Lee, S.-W., Wee, C.-Y., and Shen, D. (2018). Enhancing the representation of functional connectivity networks by fusing multi-view information for autism spectrum disorder diagnosis. Hum. Brain Mapp. 40, 833–854. doi: 10.1002/hbm.24415

PubMed Abstract | CrossRef Full Text | Google Scholar

Hung, V. V., Lee, J., Masuda-Jindo, K., and Kim, L. (2006). First principles study of tantalum thermodynamics by the statistical moment method. Comp. Mater. Sci. 37, 565–571.

Google Scholar

Hutchison, R. M., Womelsdorf, T., Allen, E. A., Bandettini, P. A., Calhoun, V. D., Corbetta, M., et al. (2013). Dynamic functional connectivity: promise, issues, and interpretations. Neuroimage 80, 360–378. doi: 10.1016/j.neuroimage.2013.05.079

PubMed Abstract | CrossRef Full Text | Google Scholar

Jie, B., Zhang, D., Gao, W., Wang, Q., Wee, C. Y., and Shen, D. (2014). Integration of network topological and connectivity properties for neuroimaging classification. IEEE Trans. Biomed. Eng. 61, 576–589.

Google Scholar

Jin, Y., Wee, C.-Y., Shi, F., Thung, K.-H., Ni, D., Yap, P.-T., et al. (2015). Identification of infants at high-risk for autism spectrum disorder using multiparameter multiscale white matter connectivity networks. Hum. Brain Mapp. 36, 4880–4896. doi: 10.1002/hbm.22957

PubMed Abstract | CrossRef Full Text | Google Scholar

Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., et al. (2009). Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645. doi: 10.1101/gr.092759.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Kudela, M., Harezlak, J., and Lindquist, M. A. (2017). Assessing uncertainty in dynamic functional connectivity. Neuroimage 149, 165–177. doi: 10.1016/j.neuroimage.2017.01.056

PubMed Abstract | CrossRef Full Text | Google Scholar

Leung, R. C., Pang, E. W., Cassel, D., Brian, J. A., Smith, M. L., and Taylor, M. J. (2015). Early neural activation during facial affect processing in adolescents with autism spectrum disorder. Neuroimage Clin. 7, 203–212.

Google Scholar

Li, W., Wang, Z., Zhang, L., Qiao, L., and Shen, D. (2017). Remodeling Pearson’s correlation for functional brain network estimation and autism spectrum disorder identification. Front. Neuroinform. 11:55. doi: 10.3389/fninf.2017.00055

CrossRef Full Text | Google Scholar

Lin, H. Y., Tseng, W. Y. I., Lai, M. C., Matsuo, K., and Gau, S. S. F. (2015). Altered resting-state frontoparietal control network in children with attention-deficit/hyperactivity disorder. J. Int. Neuropsychol. Soc. 21, 271–284. doi: 10.1017/S135561771500020X

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, F., Wang, Y., Li, M., Wang, W., Li, R., Zhang, Z., et al. (2016). Dynamic functional network connectivity in idiopathic generalized epilepsy with generalized tonic-clonic seizure. Hum. Brain Mapp. 38, 957–973. doi: 10.1002/hbm.23430

PubMed Abstract | CrossRef Full Text | Google Scholar

Morris, C., and Rekik, I. (2017). “Autism spectrum disorder diagnosis using sparse graph embedding of morphological brain networks,” in Proceedings of the Graphs in Biomedical Image Analysis, Computational Anatomy and Imaging Genetics: First International Workshop, GRAIL 2017, 6th International Workshop, MFCA 2017, and Third International Workshop, MICGen 2017, Held in Conjunction with MICCAI 2017, eds M. J. Cardoso and T. Arbel (Québec City, QC: Springer).

Google Scholar

Perkins, T. J., Bittar, R. G., McGillivray, J. A., Cox, I. I., and Stokes, M. A. (2015). Increased premotor cortex activation in high functioning autism during action observation. J. Clin. Neurosci. 22, 664–669. doi: 10.1016/j.jocn.2014.10.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Plitt, M., Barnes, K. A., and Martin, A. (2015). Functional connectivity classification of autism identifies highly predictive brain features but falls short of biomarker standards. Neuroimage Clin. 7, 359–366. doi: 10.1016/j.nicl.2014.12.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiao, L., Zhang, L., Chen, S., and Shen, D. (2018). Data-driven graph construction and graph learning: a review. Neurocomputing 312, 336–351.

Google Scholar

Qiu, A., Adler, M., Crocetti, D., Miller, M. I, and Mostofsky, S. H. (2010). Basal ganglia shapes predict social, communication, and motor dysfunctions in boys with autism spectrum disorder. J. Am. Acad. Child Adolesc. Psychiatry 49, 539–551.e4. doi: 10.1016/j.jaac.2010.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Ray, S., Gohel, S., and Biswal, B. B. (2015). Altered functional connectivity strength in abstinent chronic cocaine smokers compared to healthy controls. Brain Connect. 5, 476–486. doi: 10.1089/brain.2014.0240

PubMed Abstract | CrossRef Full Text | Google Scholar

Satterthwaite, T. D., Elliott, M. A., Gerraty, R. T., Ruparel, K., Loughead, J., Calkins, M. E., et al. (2013). An improved framework for confound regression and filtering for control of motion artifact in the preprocessing of resting-state functional connectivity data. Neuroimage 64, 240–256. doi: 10.1016/j.neuroimage.2012.08.052

PubMed Abstract | CrossRef Full Text | Google Scholar

Soussia, M., and Rekik, I. (2018). Unsupervised manifold learning using high-order morphological brain networks derived From T1-w MRI for autism diagnosis. Front. Neuroinform. 12:70. doi: 10.3389/fninf.2018.00070

PubMed Abstract | CrossRef Full Text | Google Scholar

Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Series B (Methodological) 58, 267–288.

Google Scholar

Tomasi, D., and Volkow, N. D. (2010). Functional connectivity density mapping. Proc. Natl. Acad. Sci. U.S.A. 107, 9885–9890. doi: 10.1073/pnas.1001414107

PubMed Abstract | CrossRef Full Text | Google Scholar

Tzourio-Mazoyer, N., Landeau, B., Papathanassiou, D., Crivello, F., Etard, O., Delcroix, N., et al. (2002). Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage 15, 273–289.

PubMed Abstract | Google Scholar

Urbain, C., Vogan, V. M., Ye, A. X., Pang, E. W., Doesburg, S. M., and Taylor, M. J. (2016). Desynchronization of fronto-temporal networks during working memory processing in autism. Hum. Brain Mapp. 37, 153–164. doi: 10.1002/hbm.23021

PubMed Abstract | CrossRef Full Text | Google Scholar

Urbain, C. M., Pang, E. W., and Taylor, M. J. (2015). Atypical spatiotemporal signatures of working memory brain processes in autism. Transl. Psychiatry 5:e617. doi: 10.1038/tp.2015.107

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Wang, Q., Zhang, H., Chen, J., Wang, S., and Shen, D. (2018). Sparse multiview task-centralized ensemble learning for ASD diagnosis based on age- and sex-related functional connectivity patterns. IEEE Trans. Cybernetics 49, 3141–3154. doi: 10.1109/TCYB.2018.2839693

PubMed Abstract | CrossRef Full Text | Google Scholar

Wee, C.-Y., Wang, L., Shi, F., Yap, P.-T., and Shen, D. (2013). Diagnosis of autism spectrum disorders using regional and interregional morphological features. Hum. Brain Mapp. 35, 3414–3430.

PubMed Abstract | Google Scholar

Wee, C.-Y., Yang, S., Yap, P.-T., and Shen, D. (2015). Sparse temporally dynamic resting-state functional connectivity networks for early MCI identification. Brain Imaging Behav. 10, 342–356. doi: 10.1007/s11682-015-9408-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, C.-G., Cheung, B., Kelly, C., Colcombe, S., Craddock, R. C., Di Martino, A., et al. (2013). A comprehensive assessment of regional variation in the impact of head micromovements on functional connectomics. Neuroimage 76, 183–201. doi: 10.1016/j.neuroimage.2013.03.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, H., Chen, X., Shi, F., Li, G., Kim, M., Giannakopoulos, P., et al. (2016). Topographical information-based high-order functional connectivity and its application in abnormality detection for mild cognitive impairment. J. Alzheimer’s Dis. 54, 1095–1112.

PubMed Abstract | Google Scholar

Zhang, H., Chen, X., Zhang, Y., and Shen, D. (2017c). Test-retest reliability of “High-Order” functional connectivity in young healthy adults. Front. Neurosci. 11:439. doi: 10.3389/fnins.2017.00439

CrossRef Full Text | Google Scholar

Zhang, X., Hu, B., Ma, X., and Xu, L. (2015). Resting-state whole-brain functional connectivity networks for MCI classification using L2-regularized logistic regression. IEEE Trans. NanoBiosci. 14, 237–247. doi: 10.1109/TNB.2015.2403274

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., Zhang, H., Chen, X., Lee, S.-W., and Shen, D. (2017b). Hybrid high-order functional connectivity networks using resting-state functional MRI for mild cognitive impairment diagnosis. Sci. Rep. 7:6530. doi: 10.1038/s41598-017-06509-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., Zhang, H., Chen, X., and Shen, D. (2017a). Constructing multi-frequency high-order functional connectivity network for diagnosis of mild cognitive impairment. Lecture Notes Comp. Sci. 10511, 9–16. doi: 10.1007/978-3-319-67159-8_2

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, F., Zhang, H., Rekik, I., An, Z., and Shen, D. (2018). Diagnosis of autism spectrum disorders using multi-level high-order functional networks derived from resting-state functional MRI. Front. Hum. Neurosci. 12:184. doi: 10.3389/fnhum.2018.00184

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Y., Qiao, L., Li, W., Zhang, L., and Shen, D. (2018). Simultaneous estimation of low-and high-order functional connectivity for identifying mild cognitive impairment. Front. Neuroinform. 12:3. doi: 10.3389/fninf.2018.00003

PubMed Abstract | CrossRef Full Text | Google Scholar

Zürcher, N. R., Bhanot, A., McDougle, C. J., and Hooker, J. M. (2015). A systematic review of molecular imaging (PET and SPECT) in autism spectrum disorder: current state and future research opportunities. Neurosci. Biobehav. Rev. 52, 56–73. doi: 10.1016/j.neubiorev.2015.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: autism spectrum disorder, dynamic functional connectivity networks, resting-state functional MRI, central-moment features, conventional FC network

Citation: Zhao F, Chen Z, Rekik I, Lee S-W and Shen D (2020) Diagnosis of Autism Spectrum Disorder Using Central-Moment Features From Low- and High-Order Dynamic Resting-State Functional Connectivity Networks. Front. Neurosci. 14:258. doi: 10.3389/fnins.2020.00258

Received: 07 December 2019; Accepted: 09 March 2020;
Published: 28 April 2020.

Edited by:

Xiaoping Philip Hu, University of California, Riverside, United States

Reviewed by:

Liang Wang, Institute of Psychology (CAS), China
Delin Sun, Duke University, United States
Jun Shi, Shanghai University, China
Mingli Zhang, Mcgill University, Canada

Copyright © 2020 Zhao, Chen, Rekik, Lee and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dinggang Shen, ZGdzaGVuQG1lZC51bmMuZWR1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.