Support Vector Machine for Analyzing Contributions of Brain Regions During Task-State fMRI

Wang, Mengyue; Li, Chunlin; Zhang, Wenjing; Wang, Yonghao; Feng, Yuan; Liang, Ying; Wei, Jing; Zhang, Xu; Li, Xia; Chen, Renji

doi:10.3389/fninf.2019.00010

ORIGINAL RESEARCH article

Front. Neuroinform., 06 March 2019

Volume 13 - 2019 | https://doi.org/10.3389/fninf.2019.00010

This article is part of the Research Topic: Brain-inspired Machine Learning and Computation for Brain-Behavior Analysis

Mengyue Wang^1†

Chunlin Li^1†

Wenjing Zhang^2

Xia Li^1*

Renji Chen^2*

¹Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, School of Biomedical Engineering, Capital Medical University, Beijing, China
²Beijing Stomatological Hospital, Capital Medical University, Beijing, China
³Beijing Institute of Technology, Beijing, China

The mainstream approach to analyzing task functional magnetic resonance imaging (fMRI) data is to identify task-related active brain regions with a general linear model (GLM). Machine learning, as a data-driven technique, is increasingly used in fMRI data analysis. This work used the language task data of the Human Connectome Project (HCP), which comprise a math task and a story task. We chose a linear support vector machine (SVM) to classify the math and story tasks and compared the regions used for classification with the activated brain regions from an SPM statistical analysis. Of the 25 regions used for classification by the SVM, 13 were activated regions and 12 were non-activated regions. In particular, the right Paracentral Lobule and right Rolandic Operculum, which belong to the non-activated regions, contributed most to the classification. Therefore, the differences found by machine learning can provide a new understanding of the physiological mechanisms of brain regions under different tasks.

Introduction

In functional magnetic resonance data analysis, the general linear model (GLM) is one of the most common model-based methods for relating measured hemodynamic signals to controlled experimental variables (Friston et al., 1994; Holmes and Friston, 1998). Specifically, the time series of each voxel in the functional Magnetic Resonance Imaging (fMRI) image is regressed on the experimental paradigm with a GLM, so that each voxel has a regression coefficient beta (β), and the coefficients of all voxels are combined to form a statistical parameter map (Yan et al., 2011; Wu et al., 2012). In a group analysis, a one-sample t-test is performed on the statistical parameter maps of all subjects to determine the activated regions of the group (Beckmann et al., 2003). Although the GLM is currently the dominant approach to brain activation detection, there is growing interest in multivariate approaches (Zhang et al., 2009). For example, machine learning as a data-driven technology is not only sensitive to subtle spatial differentiation patterns, but also capable of exploring the inherent multivariate nature of high-dimensional image data (Norman et al., 2006). Since machine learning can find the features that contribute most to classification (Meier et al., 2012; Lv et al., 2015), the differences it finds can provide a new understanding of the physiological mechanisms of brain regions under different tasks.
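
As a reference point for the voxel-wise regression described above, the textbook form of the model can be written as follows. This is a sketch under simplifying assumptions (a full-column-rank design matrix X and independent, identically distributed errors); the symbols y, X, c, n and the ordinary least-squares estimator are notation introduced here, not taken from the article, and SPM's actual estimation adds steps such as temporal filtering and whitening.

```latex
y = X\beta + \varepsilon, \qquad
\hat{\beta} = (X^{\top}X)^{-1}X^{\top}y, \qquad
\hat{\sigma}^{2} = \frac{\lVert y - X\hat{\beta}\rVert^{2}}{n - \operatorname{rank}(X)}, \qquad
t = \frac{c^{\top}\hat{\beta}}{\sqrt{\hat{\sigma}^{2}\, c^{\top}(X^{\top}X)^{-1}c}}
```

Here y is the time series of one voxel with n time points, the columns of X are the modeled task regressors, and c is a contrast vector (for example, math minus story). Evaluating t at every voxel yields the statistical parameter map.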

Applying machine learning methods to neuroimaging data began with the work of Haxby et al. (2001), who recognized the distribution characteristics of visual cortex activation patterns from functional MRI. At present, machine learning has been widely used in fMRI data classification (Yan et al., 2017a,b) to explore the cognitive state of the brain (Yan et al., 2018). Under different visual stimulation conditions, the stimulus may be different visual pictures (objects or people, shoes or bottles), raster stimulation at different angles, etc., and the type of task received by the subject is determined by classifying the collected fMRI data (Haxby et al., 2001; Kamitani and Tong, 2005; Norman et al., 2006). Machine learning is also used in psychiatry to distinguish patients from controls. Patients with severe depression (Fu et al., 2008) were classified with an accuracy of 70 to 80%. Individuals with autism spectrum disorder were distinguished from controls based on two fMRI experiments (Chanel et al., 2016). Machine learning is therefore a promising method for detecting brain state (Ecker and Murphy, 2014). Support vector machines are the most commonly used classifiers in functional magnetic resonance data classification (De et al., 2008; Pereira et al., 2009; Ecker et al., 2010; Xin et al., 2013).

When the number of features far exceeds the number of subjects, a problem arises that is common in machine learning and is known as the curse of dimensionality (Bellman, 1961). If the dimensionality of the features is not reduced, over-fitting is likely (Guyon, 2003). Over-fitting means that the model has poor generalization ability, that is, it predicts new samples poorly (Mayer et al., 2009). Therefore, feature selection is required before training the model (De et al., 2007; Pereira et al., 2009; Mwangi et al., 2014).

In this study, we sought to explore the effects of activated and non-activated brain regions on the classification of functional magnetic resonance data for different tasks. We extracted the average t value of the general linear model in each brain region as the feature vector and chose the Lasso regression algorithm (Tibshirani, 1996) for feature dimension reduction. Using a linear support vector machine, we took the classification weight as an index of the importance of each brain region in the classification and compared it with the group analysis results. The results revealed two brain regions that did not appear among the activated brain regions but contributed significantly to the classification, namely the right Paracentral Lobule and the right Rolandic Operculum.

Materials and Methods

Participants

Experimental data for 1046 healthy subjects were obtained from the open-source database WU-Minn Human Connectome Project (HCP) Data - 1200 Subjects (HCP_1200), published by the Public Connectome Data¹. Most participants were between the ages of 22 and 35. No participant had a previously documented history of psychiatric, neurological or medical disorders that affected brain function. Of the 1046 participants, 560 were female and 486 were male; 223 were between the ages of 22–25, 455 were between the ages of 26–30, 357 were between the ages of 31–35 and 11 were over the age of 36. We used the 3T MR Language Task fMRI Preprocessed sessions.

Experimental Paradigms

The language task contained an auditory story presentation with comprehension questions and math problems. It consisted of two runs that each had eight blocks (four story blocks and four math blocks) randomly combined. The length of each block varied, but the average length was about 30 s. In order to complete a 3.8 min run, the math task blocks needed to match the length of the story task blocks, and additional math tasks were added when the total length was less than 3.8 min. The story blocks presented participants with a brief auditory story (around 5–9 sentences) adapted from a collection of Aesop’s fables. After each story, the participant was asked about the topic of the story, in the form of a 2-alternative forced-choice question. For example, after a story about an eagle that saves a man who had done him a favor, participants were asked, “Was that about revenge or reciprocity?” Participants pressed a button under the right index finger to select the first choice or a button under the right middle finger to select the second choice. Math tasks were also presented aurally and required participants to complete simple addition and subtraction problems. Each series of arithmetic operations ended with the word “equals” followed by two alternative choices, e.g., “Four plus twelve, minus two plus nine, equals twenty-two or twenty-three?” The participants pushed a button to select either the first or the second answer (Binder et al., 2011; Barch et al., 2013).

fMRI Data Acquisition

Whole-brain EPI data were acquired with a 32-channel head coil on a modified 3T Siemens Skyra with TR = 720 ms, TE = 33.10 ms, flip angle = 52°, BW = 2290 Hz/Px, in-plane FOV = 208 × 180 mm, 72 slices, 2.0 mm isotropic voxels, and a multi-band acceleration factor of 8 (Feinberg et al., 2010; Moeller et al., 2010). For an overview of the acquisition details of the task fMRI, please refer to Ugurbil et al. (2013). Two runs of each task were acquired, one with right-to-left phase encoding and the other with left-to-right phase encoding.

fMRI Data Processing

Preprocessing

We used the 3T MR Language Task fMRI Preprocessed data. These data were processed using FSL and FreeSurfer. The steps included gradient unwarping, motion correction, fieldmap-based EPI distortion correction, brain-boundary-based registration of EPI to a structural T1-weighted scan, non-linear (FNIRT) registration into MNI152 space, and grand-mean intensity normalization. In addition, spatial smoothing was done with an 8 mm full-width at half-maximum Gaussian kernel (Figure 1) for GLM analysis.

Figure 1. Data processing flowchart for SPM and machine learning analysis.

SPM Statistical Analysis

In order to identify the differences between the two tasks and to evaluate the significance of functional activation, we used a GLM analysis. In the first-level (within-subject) analysis, the data were modeled with a GLM. Four kinds of contrast images were created for each participant: math task, story task, math vs. story task and story vs. math task. In the second-level analysis, the contrast images (con files) from the first-level analyses of all 1046 subjects were used. The four conditions were analyzed by one-sample t-tests. The SPM(T) maps of the math and story tasks were obtained with a threshold of p < 0.05 (FWE) at the voxel level. To eliminate artifacts, we used the math contrast and the story contrast as masks for the math vs. story and story vs. math tasks, respectively, with a mask threshold of p < 0.001 at the voxel level. The SPM(T) maps of the math vs. story and story vs. math tasks were then obtained with a threshold of p < 0.05 (FWE) at the voxel level. These results were used to analyze the activation of brain functions and were compared with the results of machine learning.

Classification Using Machine Learning

After SPM² processed the individual data, an spmT file was generated for each of the two experimental conditions. Using GRETNA (Wang et al., 2015), the AAL90³ (Automated Anatomical Labeling) template was used to parcellate each spmT file into brain regions, and the average T value of each region was extracted to generate a 90 × 1 feature vector. Across all 1046 participants, the feature matrices were 1046 × 90 for the math task and 1046 × 90 for the story task. The features of 800 subjects were selected as the training set, with the math task labeled 1 and the story task labeled -1, and the remaining 246 subjects were used as the test set. Before classification, the training set was normalized with a z-score, and the Lasso regression algorithm was used for feature selection. A support vector machine with a linear kernel was then used as the classifier, and 10-fold cross-validation was used to calculate the training accuracy. Brain region contributions could also be obtained while establishing the classification model. Finally, the test set was sent to the classifier to obtain the predicted labels, and the accuracy of the prediction was calculated. To obtain the optimal classification result, the classification parameters were tuned, with the accuracy of the prediction as the tuning criterion. There were two parameters. One was the regularization parameter α of the Lasso algorithm, which directly determined the number of features: the larger α was, the sparser the model, because more regression coefficients β were set to 0, which deleted the corresponding features and thereby achieved feature selection. The other was the penalty coefficient C of the linear support vector machine, which directly determined the training accuracy. The value of C was generally between 0.01 and 0.1.
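
The link between α and sparsity can be illustrated with a minimal sketch. It assumes the common objective (1/2n)·||y − Xβ||² + α·||β||₁ solved by coordinate descent; the article does not state which Lasso implementation was used, and the function names and synthetic data below are hypothetical.

```python
import random
import statistics


def zscore_columns(X):
    """Standardise each column to zero mean and unit (population) variance."""
    cols = list(zip(*X))
    means = [statistics.fmean(c) for c in cols]
    stds = [statistics.pstdev(c) or 1.0 for c in cols]  # guard constant columns
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in X]


def soft_threshold(rho, alpha):
    """Shrink rho towards zero by alpha; values within [-alpha, alpha] become 0."""
    if rho > alpha:
        return rho - alpha
    if rho < -alpha:
        return rho + alpha
    return 0.0


def lasso_cd(X, y, alpha, n_iter=200):
    """Coordinate descent for (1/2n)||y - X b||^2 + alpha * ||b||_1 (no intercept)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X @ beta, with beta initialised to zero
    for _ in range(n_iter):
        for j in range(p):
            col = [row[j] for row in X]
            z = sum(v * v for v in col) / n
            if z == 0.0:
                continue
            # Correlation of feature j with the residual that ignores feature j.
            rho = sum(c * (r + c * beta[j]) for c, r in zip(col, resid)) / n
            new = soft_threshold(rho, alpha) / z
            delta = new - beta[j]
            if delta:
                resid = [r - c * delta for r, c in zip(resid, col)]
                beta[j] = new
    return beta


# Hypothetical data: 40 samples, 6 features, only the first two carry signal.
random.seed(0)
labels = [1.0] * 20 + [-1.0] * 20
X = [[lab + random.gauss(0, 1), 0.5 * lab + random.gauss(0, 1)]
     + [random.gauss(0, 1) for _ in range(4)] for lab in labels]
Xz = zscore_columns(X)
for alpha in (0.01, 0.1, 0.3, 2.0):
    kept = sum(b != 0.0 for b in lasso_cd(Xz, labels, alpha))
    print(f"alpha={alpha}: {kept} features kept")
```

Features whose coefficient is exactly zero are dropped before the classifier is trained, which is why α controls how many brain regions reach the SVM.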
The contribution of a brain region was defined under two preconditions. First, the features were extracted from the regions partitioned by the brain template, so that each feature was associated with the three-dimensional brain structure and corresponded to one brain region. Second, the linear support vector machine was selected as the classifier, because its weights are in one-to-one correspondence with the elements of the feature vector. The larger the weight value, the more important the corresponding feature was to the establishment of the classification decision surface. Combining the relationship between features and brain regions with the relationship between features and classification weights established the correspondence between brain regions and weights. In simple terms, the contribution of a brain region was the weight value of the optimal decision function of the linear support vector machine classifier.
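
The correspondence between weights and regions can be sketched as follows. This is an illustrative implementation, not the authors' code: it trains a soft-margin linear SVM by plain subgradient descent, and it takes the absolute weight as the contribution, which is an assumption because the article does not say how the sign of a weight was handled. The function names, learning rate and region names are hypothetical.

```python
def train_linear_svm(X, y, C=0.09, lr=0.01, epochs=500):
    """Subgradient descent on 0.5*||w||^2 + C * sum(max(0, 1 - y_i*(w.x_i + b)))."""
    p = len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(epochs):
        grad_w = list(w)  # gradient of the 0.5*||w||^2 term
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1.0:  # sample violates the margin, so the hinge term is active
                for j in range(p):
                    grad_w[j] -= C * yi * xi[j]
                grad_b -= C * yi
        w = [wj - lr * g for wj, g in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def region_contributions(w, region_names):
    """Pair each region with |weight| (assumed definition), largest first."""
    pairs = zip(region_names, (abs(wj) for wj in w))
    return sorted(pairs, key=lambda item: item[1], reverse=True)


# Hypothetical two-region example: only the first feature separates the classes.
X = [[2.0, 0.1], [1.5, -0.1], [-2.0, 0.1], [-1.5, -0.1]]
y = [1, 1, -1, -1]  # 1 = math, -1 = story, following the labels in the text
w, b = train_linear_svm(X, y, C=1.0)
print(region_contributions(w, ["region_a", "region_b"]))
```

Because the decision function is a weighted sum of the region features, each weight belongs to exactly one region, which is what allows the weights to be read as region contributions.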

Results

Behavioral Data

The behavioral data were collected from 1046 participants during the fMRI experiments. The data of one subject were lost during the experiment, so we used the average reaction time and accuracy data of 1045 participants for statistical analysis. The mean reaction time (RT) (Figure 2A) and the mean accuracy (Figure 2B) were 3.79 ± 0.38 s and 83.28% (SD 3.42), respectively, for the math task and 3.50 ± 0.39 s and 92.57% (SD 12.94), respectively, for the story task. Two-tailed two-sample t-tests were performed to compare the mean RTs and the mean accuracy between the math task and the story task. The results showed that the math task had a slower reaction time than the story task (t = 17.260, P < 0.001) and that the accuracy of the math task was significantly lower than that of the story task (t = 15.834, P < 0.001).
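
As a rough plausibility check on the reaction-time comparison, a two-sample t statistic can be computed from the rounded summary values alone. The sketch assumes that the ± values are standard deviations and that both groups contain 1045 participants; the function name is hypothetical.

```python
import math


def two_sample_t(mean1, sd1, n1, mean2, sd2, n2):
    """Two-sample t statistic from summary statistics.

    Uses the unpooled (Welch) standard error, which equals the pooled
    standard error when both groups have the same size.
    """
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error


# Reaction times reported in the text: math 3.79 (SD 0.38) s, story 3.50 (SD 0.39) s.
t_rt = two_sample_t(3.79, 0.38, 1045, 3.50, 0.39, 1045)
print(round(t_rt, 1))  # 17.2
```

The value is close to the reported t = 17.260; a small gap is expected because the inputs here are rounded to two decimals.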

Figure 2. Behavioral results. (A) Mean reaction time for the math stimuli and story stimuli. (B) Mean accuracy rates for the math stimuli and story stimuli.

Imaging Data

Group Analysis Results

The group results for the four contrasts are shown in Table 1. The activations of the math and story tasks showed that both the left and right temporal lobes were activated (Figures 3A,B). In addition to the temporal lobe, in the math task, the brain areas with a greater activation intensity included the left Precentral Gyrus, left Middle Temporal Gyrus, left Superior Temporal Gyrus, right Inferior Frontal Gyrus and the right Middle Frontal Gyrus (Wang et al., 2007). In the story task, the brain areas with a greater activation intensity included the left Inferior Frontal Gyrus, left Middle Frontal Gyrus and the right Inferior Semi-Lunar Lobule. In the math vs. story comparison (Figure 3C), the left Inferior Frontal Gyrus, left Inferior Parietal Lobule and left Superior Parietal Lobule had a higher activation intensity in the math task than in the story task, while the Superior Parietal Lobule and Inferior Parietal Lobule were activated only in the math task. In the story vs. math comparison (Figure 3D), the left Inferior Temporal Gyrus, Superior Temporal Gyrus and Middle Temporal Gyrus had a significantly higher activation intensity in the story task than in the math task, and the Parahippocampal Gyrus and Amygdala on the left and right sides were activated only in the story task (Binder et al., 2011; Barch et al., 2013).

Table 1. Activated regions during the two auditory stimuli and the different activated regions between them.

Figure 3. Global brain activation of the group analysis. (A) Math shows a three-dimensional brain activation map in the math task. (B) Story shows a three-dimensional brain activation map in the story task. (C) Math vs. Story shows the difference of activated brain regions between the Math task relative to the Story task. (D) Story vs. Math shows the difference of activated brain regions between the Story task relative to the Math task. WM = working memory, IPS = Intraparietal sulcus, AC = Auditory cortex, SMA = Supplementary Motor Area.

Parameter Debugging Result

As shown in Figure 4A, the number of features decreased exponentially as α increased. Therefore, to mitigate the curse of dimensionality and improve the performance of the classifier, an appropriate number of important features was selected: α was set to 0.001, 0.002, 0.003, 0.005, 0.007, and 0.01, and the corresponding numbers of features were 38, 25, 19, 11, 9, and 8. Next, the penalty coefficient C of the linear support vector machine was tuned, with the accuracy of the prediction result as the criterion for evaluating the performance of the classifier. As shown in Figure 4B, the highest classification accuracy, 87.60%, was obtained with α = 0.002 and C = 0.09. The chosen parameters and the trained model could be evaluated visually by plotting the ROC curve and computing the AUC. As shown in Figure 4C, the area under the curve was 0.96, which was close to 1, indicating that the classifier performed well.
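
The AUC reported above can be understood as the probability that a randomly chosen math sample receives a higher decision score than a randomly chosen story sample. A minimal sketch of that rank-based definition follows; the function name and example scores are hypothetical, and the labels follow the 1 / -1 coding used in the Methods.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positive and negative scores.

    Counts the fraction of (positive, negative) pairs in which the positive
    sample scores higher; ties count as half a win.
    """
    positives = [s for lab, s in zip(labels, scores) if lab == 1]
    negatives = [s for lab, s in zip(labels, scores) if lab == -1]
    wins = sum((p > q) + 0.5 * (p == q) for p in positives for q in negatives)
    return wins / (len(positives) * len(negatives))


# Hypothetical decision scores for four test samples.
print(roc_auc([1, -1, 1, -1], [0.9, 0.8, 0.3, 0.1]))  # 0.75
```

A value of 1 means every math sample outranks every story sample, and 0.5 corresponds to chance-level ranking.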

Figure 4. (A) The relationship between the regularization parameter α of the Lasso regression algorithm and the number of selected features. (B) The relationship between the penalty coefficient C of the linear support vector machine and the accuracy of the prediction result under different α values. (C) ROC curve of the optimal classification result.

Machine Learning Results

Figure 5 shows the three-dimensional distribution of brain region contributions from six directions. Some regions tended to exhibit higher classification weights than others. In particular, if the weight of an area was greater than the average weight of all areas plus one standard deviation, we considered the area to have a significant weight (Tian et al., 2011). The mean value plus one standard deviation of the contribution was equal to 0.0614. Brain regions with a contribution greater than 0.0614 were considered significant; these were the right Paracentral Lobule, the right Rolandic Operculum and the right Inferior Parietal Lobule (excluding the supramarginal and angular gyri).
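
The mean-plus-one-standard-deviation rule can be sketched as below. The contribution values are hypothetical placeholders, not the article's data, and the use of the sample standard deviation is an assumption because the article does not say which estimator was used.

```python
import statistics


def significant_regions(contributions):
    """Return the threshold (mean + 1 SD) and the regions whose contribution exceeds it."""
    values = list(contributions.values())
    # Assumption: sample standard deviation (n - 1 denominator).
    threshold = statistics.fmean(values) + statistics.stdev(values)
    selected = [name for name, value in contributions.items() if value > threshold]
    return threshold, selected


# Hypothetical contributions for four regions.
example = {"region_a": 0.01, "region_b": 0.02, "region_c": 0.03, "region_d": 0.10}
threshold, selected = significant_regions(example)
print(round(threshold, 4), selected)
```

Applied to the 25 selected regions in the article, this rule gives the threshold of 0.0614 quoted above.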

Figure 5. Three-dimensional contribution of brain regions for classification. Each node represents a brain region of the AAL90 (Automated Anatomical Labeling) template. The node colors represent different regions, and the node size is scaled according to the weight value of the brain region: the greater the contribution of the brain region, the larger the radius of the node.

Comparing the brain region contribution results of the classification with the activated regions of the group analysis, as shown in Table 2, we found that 13 of the 25 feature brain regions overlapped with the brain regions activated in the group analysis. Of these 13 brain regions, 11 overlapped with the differential activation maps between the math task and the story task. The 11 brain regions were: the left and right Inferior Parietal Lobule (excluding the supramarginal and angular gyri); the left and right Middle Frontal Gyrus; the left Supramarginal Gyrus; the right Superior Parietal Gyrus; the right Superior Frontal Gyrus, dorsolateral; the right Inferior Frontal Gyrus, opercular part; the right Angular Gyrus; the left Amygdala; and the left Heschl Gyrus. Moreover, these coincident regions showed strong activation in the group analysis results (t values were greater than 18). The remaining 12 brain regions did not overlap with the activated regions of the group analysis; they included the two brain regions with significant contributions, the right Paracentral Lobule and the right Rolandic Operculum.

Table 2. Comparison between the brain region contribution and the group analysis results. Label and Regions give the brain region label and brain region name of the classification result under the AAL90 template.

Discussion

One of the experimental paradigms designed by Wang et al. was an auditory calculation task in Mandarin Chinese and English. The calculation included addition and multiplication, which makes it similar to the math task used here. Study participants included 19 adult native Mandarin Chinese speakers with no history of speech or hearing impairments. The active brain regions of the calculation task in English after the group analysis included the left Precentral Gyrus, left Middle Temporal Gyrus, right Inferior Frontal Gyrus, and the right Middle Frontal Gyrus (Wang et al., 2007). Barch et al. (2013) chose 77 participants (58 women and 19 men), all aged between 22 and 35, with no previously documented history of psychiatric, neurological or medical disorders known to influence brain function. Binder et al. (2011) chose 34 healthy, right-handed adults (17 women and 17 men), aged between 18 and 50 years (mean 29 years), as participants. Both studies used the same experimental paradigm as this article and obtained similar results: the story vs. math results showed that the largest activation cluster involved the temporal lobe, and strong medial temporal activation involved the uncus, amygdala, and the anterior hippocampus, extending posteriorly into the parahippocampal and posterior fusiform gyrus.

Comparative Analysis of Brain Region Contribution and Group Results

The contribution of a brain region combines the partitioning of the brain's three-dimensional anatomical structure with the weights of the classifier. The brain region contribution therefore reflects the importance of each brain region to the classification results: the higher the contribution value, the more important the region is for classification. Because classification compares the differences between two categories, the classification results mostly coincided with the differentially activated brain regions. The overlapping brain regions were as follows. The Middle Frontal Gyrus is involved in expressive language processes including semantics (Brown et al., 2010), grammar and syntax. Broca’s area plays a role in syntactic processing during Chinese reading comprehension, verbal fluency (Abrahams et al., 2003), and verbal working memory (Leung et al., 2002). The Inferior Parietal Lobule has been implicated in the perception of emotions and facial stimuli and in the interpretation of sensory information. The left Supramarginal Gyrus is most likely involved in language perception and processing (Gazzaniga et al., 2013). The left Heschl Gyrus, which lies in the area of the primary auditory cortex buried within the lateral sulcus of the human brain, is the first cortical structure to process incoming auditory information; it has been shown with fMRI to be active during auditory processing in tone and semantic tasks (Warrier et al., 2009). The right Superior Frontal Gyrus, dorsolateral, is involved in self-awareness, in coordination with the action of the sensory system (Goldberg and Harel, 2006; Wang et al., 2017). The left Amygdala, which is thought to be part of the limbic system, plays a major role in memory, decision making, and emotional response (including fear, anxiety, and aggression) (Amunts et al., 2005). Moreover, the strong activation of these overlapping brain regions in the group analysis supported the validity of the classification features and showed that the classifier could identify brain regions with large activation differences between the two tasks.

Twelve of the feature brain regions did not coincide with the group activation results, including the two brain regions with significant contributions. The first is the right Paracentral Lobule, which is concerned with motor and sensory innervation of the contralateral lower extremity (Spasojević et al., 2013) and is also responsible for the control of defecation and urination. The second is the right Rolandic Operculum; some studies have shown that articulatory disorders correspond with lesions of the Rolandic Operculum (Tonkonogy and Goodglass, 1981). The reason for the marked difference between the classification result and the group analysis result can be explained using the Paracentral Lobule as an example. On the one hand, when the differences between the two tasks were compared in the group analysis, a mask (Gajdoš et al., 2016) was added to eliminate pseudo-activation. The mask was defined by the regions activated in the math or story task, so any differential activation had to fall within the regions activated by a single task. As shown in Figure 6, the T value of this brain region (label number 70) was negative for both tasks. The main function of the Paracentral Lobule is to control the movement and sensory innervation of the contralateral lower limb. This function was independent of the tasks, and the region was not activated in the separate analyses of the math and story tasks. Therefore, the differential maps of the two tasks were unlikely to show activation in the Paracentral Lobule. On the other hand, from the classification principle (Cherkassky, 1997), machine learning did not need to consider the problem of pseudo-activation: the selection of features was not limited to the activated regions but covered the whole brain.
The linear support vector machine mapped the feature vector from the Euclidean space to the Hilbert space, making the data set linearly separable in the high-dimensional space. In the Hilbert space, the decision surface that was found not only separated the two types of features but also made the distance from each type of feature to the surface as large as possible (Schölkopf, 2000; Huang et al., 2012). The greater the distance between the two types of features, the greater the weight of the classifier, and the greater the contribution value of the brain region corresponding to the feature. Therefore, the contribution essentially reflected the difference between the two types of features corresponding to the brain region in the Hilbert space. The Paracentral Lobule had the highest contribution, indicating that the corresponding features of the two tasks were very far apart in the high-dimensional space. We speculated that the difference in this brain region was not obvious in the low-dimensional space, so the statistical analysis did not show any significance.
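
For reference, the margin argument above corresponds to the standard soft-margin support vector machine, stated here in its textbook form rather than as the authors' exact formulation. The symbols w, b, ξ_i and the feature map φ are notation introduced for this sketch; C is the penalty coefficient tuned in the Methods.

```latex
\min_{w,\,b,\,\xi}\ \frac{1}{2}\lVert w\rVert^{2} + C\sum_{i=1}^{N}\xi_{i}
\quad \text{subject to} \quad
y_{i}\bigl(w^{\top}\phi(x_{i}) + b\bigr) \ge 1 - \xi_{i},\qquad \xi_{i}\ge 0,
\qquad
f(x) = w^{\top}\phi(x) + b
```

The margin between the two classes has width 2/||w||, so minimizing ||w|| maximizes the distance of the features from the decision surface. With a linear kernel, φ is the identity map, so each component w_j multiplies exactly one regional feature x_j, and it is this component that is read as the contribution of the corresponding brain region.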

Figure 6. The averaged T value in inactivated brain regions under the two tasks. The numbers on the 12-column chart represent the brain area numbers of the AAL90 template, the gray boxes represent the math task, and the orange boxes represent the story task. The number of asterisks indicates the level of the p value. *p < 0.05, **p < 0.01, ***p < 0.001.

Machine learning used the difference between the two tasks for classification. Among the negatively activated brain regions, the difference was more obvious, so their contribution to classification was higher than that of the activated brain regions. However, the mechanism of these negatively activated brain regions in task execution remains unclear. One possible reason is that the mechanisms engaged in the brain regions involved in the two tasks were quite different from those of the negatively activated brain regions, so the negatively activated regions were not needed for task execution. In terms of cerebral blood flow, the more strongly the function of a region is related to the task, the greater its blood supply.

We compared the T values of the 12 inactive brain regions for the two tasks, as shown in Figure 6. The T values of these brain regions were mostly negative in both tasks, and the paired-sample t-tests mostly yielded p values of less than 0.05. This showed that there was a significant difference between the two tasks in the negative activation of brain regions. The negative activation of brain regions varied greatly among different tasks, suggesting that, in addition to activated brain regions, negatively activated brain regions play an important role in brain research.

In order to study the contribution of each brain region to the classification, the linear support vector machine was selected as the classifier, because the weight values of the classifier reflect the importance of the features to the classifier. In addition, Lasso regression was selected as the feature selection method. Lasso is tied to model training: a model is fitted to the input training data, and after training the features are ranked by their importance in the fitted model. It is only a screening process. A feature that strongly influences the classification performance is retained, whereas the coefficient of a feature that has no effect on the classifier is set to zero. This method did not change the correspondence between brain regions and features.

Conclusion

In this paper, the average T value of the general linear model was extracted as the feature vector. The Lasso regression algorithm and the linear support vector machine were used for classification, and the result was compared with the activation result of the SPM group analysis. We found both coincident and non-coincident brain regions: the coincident brain regions were mostly regions differentially activated between the tasks, with strong activation intensity, whereas the non-coincident brain regions included the regions with significant classification contributions, the right Paracentral Lobule and the right Rolandic Operculum. The difference between the two results was mainly due to the difference between the algorithms. In the statistical analysis, differential activation was limited to the range activated by a single task in order to eliminate pseudo-activation. Machine learning did not need to consider pseudo-activation and could draw on the whole brain, so it found feature brain regions that were not related to task activation but contributed significantly to classification. In summary, the brain region contribution analyzes the difference between the two states of brain activity from another perspective and finds important brain regions that show no statistical difference. This suggests an important role for negatively activated brain regions in brain research.

Data Availability

Publicly available datasets were analyzed in this study. This data can be found here: https://db.humanconnectome.org/.

Author Contributions

MW, CL, JW, YL, XZ, and XL analyzed the data using SPM. MW, WZ, RC, YW, and YF analyzed the data using machine learning. MW and CL prepared the figures and drafted the manuscript. WZ and RC contributed substantially to writing and revising the manuscript. All authors contributed to manuscript development, and read and approved the final manuscript.

Funding

This study was financially supported by the grants from the National Natural Science Foundation of China (Grant Nos. 61727807, 81771909, 31600933, 61701323, 81671776, and 61633018), the Beijing Municipal Science and Technology Commission (Grant Nos. Z161100002616020, Z131100006813022, and PXM2017_026283_000002), the Yang Fan Plan of Beijing Municipal Administration of Hospitals (Clinical Innovation Project, Grant No. XMLX201714), the Capital Medical University Fundamental and Clinical Foundations of China (Grant Nos. 16JL-L08 and 17JL68), and the Excellent Talents Programme of Beijing (Grant No. 2016000020124G098).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Abrahams, S., Goldstein, L. H., Simmons, A., Brammer, M. J., Williams, S. C., Giampietro, V. P., et al. (2003). Functional magnetic resonance imaging of verbal fluency and confrontation naming using compressed image acquisition to permit overt responses. Hum. Brain Mapp. 20, 29–40. doi: 10.1002/hbm.10126

Amunts, K., Kedo, O., Kindler, M., Pieperhoff, P., Mohlberg, H., Shah, N. J., et al. (2005). Cytoarchitectonic mapping of the human amygdala, hippocampal region and entorhinal cortex: intersubject variability and probability maps. Anat. Embryol. 210, 343–352. doi: 10.1007/s00429-005-0025-5

Barch, D. M., Burgess, G. C., Harms, M. P., Petersen, S. E., Schlaggar, B. L., Corbetta, M., et al. (2013). Function in the human connectome: task-fMRI and individual differences in behavior. Neuroimage 80, 169–189. doi: 10.1016/j.neuroimage.2013.05.033

Beckmann, C. F., Jenkinson, M., and Smith, S. M. (2003). General multilevel linear modeling for group analysis in FMRI. Neuroimage 20, 1052–1063. doi: 10.1016/S1053-8119(03)00435-X

Bellman, R. (1961). Adaptive Control Processes: A Guided Tour. Princeton, NJ: Princeton University Press. doi: 10.1515/9781400874668

Binder, J. R., Gross, W. L., Allendorfer, J. B., Bonilha, L., Chapin, J., Edwards, J. C., et al. (2011). Mapping anterior temporal lobe language areas with fMRI: a multicenter normative study. Neuroimage 54, 1465–1475. doi: 10.1016/j.neuroimage.2010.09.048

Brown, S., Martinez, M. J., and Parsons, L. M. (2010). Music and language side by side in the brain: a PET study of the generation of melodies and sentences. Eur. J. Neurosci. 23, 2791–2803. doi: 10.1111/j.1460-9568.2006.04785.x

Chanel, G., Pichon, S., Conty, L., Berthoz, S., Chevallier, C., and Grezes, J. (2016). Classification of autistic individuals and controls using cross-task characterization of fMRI activity. Neuroimage Clin. 10, 78–88. doi: 10.1016/j.nicl.2015.11.010

Cherkassky, V. (1997). The nature of statistical learning theory. Technometrics 38, 409–409.

De, M. F., Gentile, F., Esposito, F., Balsi, M., Di, S. F., Goebel, R., et al. (2007). Classification of fMRI independent components using IC-fingerprints and support vector machine classifiers. Neuroimage 34, 177–194. doi: 10.1016/j.neuroimage.2006.08.041

De, M. F., Valente, G., Staeren, N., Ashburner, J., Goebel, R., and Formisano, E. (2008). Combining multivariate voxel selection and support vector machines for mapping and classification of fMRI spatial patterns. Neuroimage 43, 44–58. doi: 10.1016/j.neuroimage.2008.06.037

Ecker, C., and Murphy, D. (2014). Neuroimaging in autism–from basic science to translational research. Nat. Rev. Neurol. 10, 82–91. doi: 10.1038/nrneurol.2013.276

Ecker, C., Rocha-Rego, V., Johnston, P., Mourao-Miranda, J., Marquand, A., Daly, E. M., et al. (2010). Investigating the predictive value of whole-brain structural MR scans in autism: a pattern classification approach. Neuroimage 49:44. doi: 10.1016/j.neuroimage.2009.08.024

Feinberg, D. A., Moeller, S., Smith, S. M., Auerbach, E., Ramanna, S., Glasser, M. F., et al. (2010). Multiplexed echo planar imaging for sub-second whole brain FMRI and fast diffusion imaging. PLoS One 5:e15710. doi: 10.1371/journal.pone.0015710

Friston, K. J., Holmes, A. P., Worsley, K. J., Poline, J. P., Frith, C. D., and Frackowiak, R. S. J. (1994). Statistical parametric maps in functional imaging: a general linear approach. Hum. Brain Mapp. 2, 189–210. doi: 10.1002/hbm.460020402

Fu, C. H., Mourao-Miranda, J., Costafreda, S. G., Khanna, A., Marquand, A. F., Williams, S. C., et al. (2008). Pattern classification of sad facial processing: toward the development of neurobiological markers in depression. Biol. Psychiatry 63, 656–662. doi: 10.1016/j.biopsych.2007.08.020

Gajdoš, M., Mikl, M., and Mareček, R. (2016). Mask_explorer: a tool for exploring brain masks in fMRI group analysis. Comput. Methods Progr. Biomed. 134, 155–163. doi: 10.1016/j.cmpb.2016.07.015

Gazzaniga, M. S., Ivry, R. B., Mangun, G. R., and Steven, M. S. (2013). Cognitive Neuroscience: The Biology of the Mind. New York, NY: W. W. Norton & Company, Inc.

Goldberg, I. I., and Harel, M. R. (2006). When the brain loses its self: prefrontal inactivation during sensorimotor processing. Neuron 50, 329–339. doi: 10.1016/j.neuron.2006.03.015

Guyon, I. (2003). An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182.

Haxby, J. V., Gobbini, M. I., Furey, M. L., Ishai, A., Schouten, J. L., and Pietrini, P. (2001). Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science 293, 2425–2430. doi: 10.1126/science.1063736

Holmes, A. P., and Friston, K. J. (1998). Generalisability, random effects & population inference. Neuroimage 7:S754. doi: 10.1016/S1053-8119(18)31587-8

Huang, G. B., Zhou, H., Ding, X., and Zhang, R. (2012). Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. B 42, 513–529. doi: 10.1109/TSMCB.2011.2168604

Kamitani, Y., and Tong, F. (2005). Decoding the visual and subjective contents of the human brain. Nat. Neurosci. 8, 679–685. doi: 10.1038/nn1444

Leung, H. C., Gore, J. C., and Goldman-Rakic, P. S. (2002). Sustained mnemonic response in the human middle frontal gyrus during on-line storage of spatial memoranda. J. Cogn. Neurosci. 14, 659–671. doi: 10.1162/08989290260045882

Lv, J., Yan, T., Tao, L., and Zhao, L. (2015). The role of configural processing in face classification by race: an ERP study. Front. Hum. Neurosci. 9:679. doi: 10.3389/fnhum.2015.00679

Mayer, G., Lohberger, A., Butzen, S., Pofahl, M., Blind, M., and Heckel, A. (2009). From selection to caged aptamers: identification of light-dependent ssDNA aptamers targeting cytohesin. Bioorg. Med. Chem. Lett. 19, 6561–6564. doi: 10.1016/j.bmcl.2009.10.032

Meier, T. B., Desphande, A. S., Vergun, S., Nair, V. A., Song, J., Biswal, B. B., et al. (2012). Support vector machine classification and characterization of age-related reorganization of functional brain networks. Neuroimage 60, 601–613. doi: 10.1016/j.neuroimage.2011.12.052

Moeller, S., Yacoub, E., Olman, C. A., Auerbach, E., Strupp, J., Harel, N., et al. (2010). Multiband multislice GE-EPI at 7 tesla, with 16-fold acceleration using partial parallel imaging with application to high spatial and temporal whole-brain FMRI. Magn. Reson. Med. 63, 1144–1153. doi: 10.1002/mrm.22361

Mwangi, B., Tian, T. S., and Soares, J. C. (2014). A review of feature reduction techniques in neuroimaging. Neuroinformatics 12, 229–244. doi: 10.1007/s12021-013-9204-3

Norman, K. A., Polyn, S. M., Detre, G. J., and Haxby, J. V. (2006). Beyond mind-reading: multi-voxel pattern analysis of fMRI data. Trends Cogn. Sci. 10, 424–430. doi: 10.1016/j.tics.2006.07.005

Pereira, F., Mitchell, T., and Botvinick, M. (2009). Machine learning classifiers and fMRI: a tutorial overview. Neuroimage 45(1 Suppl.), S199–S209. doi: 10.1016/j.neuroimage.2008.11.007

Schölkopf, B. (2000). “The kernel trick for distances,” in Proceedings of the 13th International Conference on Neural Information Processing Systems, (Cambridge, MA: MIT Press), 283–289.

Spasojević, G., Malobabic, S., Pilipović-Spasojević, O., Djukić-Macut, N., and Maliković, A. (2013). Morphology and digitally aided morphometry of the human paracentral lobule. Folia Morphol. 72, 10–16. doi: 10.5603/FM.2013.0002

Tian, L., Wang, J., Yan, C., and He, Y. (2011). Hemisphere- and gender-related differences in small-world brain networks: a resting-state functional MRI study. Neuroimage 54, 191–202. doi: 10.1016/j.neuroimage.2010.07.066

Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. 58, 267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x

Tonkonogy, J., and Goodglass, H. (1981). Language function, foot of the third frontal gyrus, and rolandic operculum. Arch. Neurol. 38, 486–490. doi: 10.1001/archneur.1981.00510080048005

Ugurbil, K., Xu, J. Q., Auerbach, E. J., Moeller, S., Vu, A. T., Duarte-Carvajalino, J. M., et al. (2013). Pushing spatial and temporal resolution for functional and diffusion MRI in the human connectome project. Neuroimage 80, 80–104. doi: 10.1016/j.neuroimage.2013.05.012

Wang, J., Wang, X., Xia, M., Liao, X., Evans, A., and He, Y. (2015). GRETNA: a graph theoretical network analysis toolbox for imaging connectomics. Front. Hum. Neurosci. 9:386. doi: 10.3389/fnhum.2015.00386

Wang, L., Wang, W., Yan, T., Song, J., Yang, W., Wang, B., et al. (2017). Beta-band functional connectivity influences audiovisual integration in older age: an EEG study. Front. Aging Neurosci. 9:239. doi: 10.3389/fnagi.2017.00239

Wang, Y., Lin, L., Kuhl, P., and Hirsch, J. (2007). Mathematical and linguistic processing differs between native and second languages: an fMRI study. Brain Imaging Behav. 1, 68–82. doi: 10.1007/s11682-007-9007-y

Warrier, C., Wong, P., Penhune, V., Zatorre, R., Parrish, T., Abrams, D., et al. (2009). Relating structure to function: Heschl’s gyrus and acoustic processing. J. Neurosci. 29, 61–69. doi: 10.1523/JNEUROSCI.3489-08.2009

Wu, J., Yan, T., Zhen, Z., Jin, F., and Guo, Q. (2012). Retinotopic mapping of the peripheral visual field to human visual cortex by functional magnetic resonance imaging. Hum. Brain Mapp. 33, 1727–1740. doi: 10.1002/hbm.21324

Xin, L., Duygu, T., Weiner, M. W., and Norbert, S. (2013). Locally linear embedding (LLE) for MRI based Alzheimer’s disease classification. Neuroimage 83, 148–157. doi: 10.1016/j.neuroimage.2013.06.033

Yan, T., Dong, X., Mu, N., Liu, T., Chen, D., Deng, L., et al. (2017a). Positive classification advantage: tracing the time course based on brain oscillation. Front. Hum. Neurosci. 11:659. doi: 10.3389/fnhum.2017.00659

Yan, T., Feng, Y., Liu, T., Wang, L., Mu, N., Dong, X., et al. (2017b). Theta oscillations related to orientation recognition in unattended condition: a vMMN study. Front. Behav. Neurosci. 11:166. doi: 10.3389/fnbeh.2017.00166

Yan, T., Jin, F., He, J., and Wu, J. (2011). Development of a wide-view visual presentation system for visual retinotopic mapping during functional MRI. J. Magn. Reson. Imaging 33, 441–447. doi: 10.1002/jmri.22404

Yan, T., Wang, W., Yang, L., Chen, K., Chen, R., and Han, Y. (2018). Rich club disturbances of the human connectome from subjective cognitive decline to Alzheimer’s disease. Theranostics 8, 3237–3255. doi: 10.7150/thno.23772

Zhang, J., Anderson, J. R., Liang, L., Pulapura, S. K., Gatewood, L., Rottenberg, D. A., et al. (2009). Evaluation and optimization of fMRI single-subject processing pipelines with NPAIRS and second-level CVA. Magn. Reson. Imaging 27, 264–278. doi: 10.1016/j.mri.2008.05.021

Keywords: generalized linear models, support vector machine, contribution of brain region, task fMRI, lasso regression

Citation: Wang M, Li C, Zhang W, Wang Y, Feng Y, Liang Y, Wei J, Zhang X, Li X and Chen R (2019) Support Vector Machine for Analyzing Contributions of Brain Regions During Task-State fMRI. Front. Neuroinform. 13:10. doi: 10.3389/fninf.2019.00010

Received: 15 November 2018; Accepted: 12 February 2019;
Published: 06 March 2019.

Edited by:

Tianyi Yan, Beijing Institute of Technology, China

Reviewed by:

Bin Wang, Taiyuan University of Technology, China
Takahashi Satoshi, Okayama University, Japan

Copyright © 2019 Wang, Li, Zhang, Wang, Feng, Liang, Wei, Zhang, Li and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xia Li; Renji Chen

†These authors have contributed equally to this work
