Machine learning classification of resting state functional connectivity predicts smoking status

Pariyadath, Vani; Stein, Elliot A.; Ross, Thomas J.

doi:10.3389/fnhum.2014.00425

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 16 June 2014

Sec. Brain Imaging and Stimulation

Volume 8 - 2014 | https://doi.org/10.3389/fnhum.2014.00425


Vani Pariyadath*

Elliot A. Stein

Thomas J. Ross

Neuroimaging Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, Baltimore, MD, USA

Machine learning-based approaches are now able to examine functional magnetic resonance imaging data in a multivariate manner and extract features predictive of group membership. We applied support vector machine (SVM)-based classification to resting state functional connectivity (rsFC) data from nicotine-dependent smokers and healthy controls to identify brain-based features predictive of nicotine dependence. By employing a network-centered approach, we observed that within-network functional connectivity measures offered maximal information for predicting smoking status, as opposed to between-network connectivity, or the representativeness of each individual node with respect to its parent network. Further, our analysis suggests that connectivity measures within the executive control and frontoparietal networks are particularly informative in predicting smoking status. Our findings suggest that machine learning-based approaches to classifying rsFC data offer a valuable alternative technique for understanding large-scale differences in addiction-related neurobiology.

Introduction

Conventional univariate methods of fMRI analysis have been used to identify differences in neural processing between various diseased populations and healthy controls over a plethora of tasks. However, not all such group differences are guaranteed to be predictive; there may be significant overlap between the two group distributions of the pertinent metric. Further, traditional univariate approaches to fMRI analysis by definition overlook multivariate patterns in the data. Machine learning offers a variety of tools to address the above limitations. Support vector machine (SVM)-based algorithms (Vapnik, 2000), for example, have been used successfully to identify neural patterns of activation (Haxby, 2012) as well as group-level differences (Craddock et al., 2009).

Attempts to apply machine learning-based approaches to classify individuals based on various disease states have gained significant traction for screening and diagnosis (Vemuri et al., 2008; Stonnington et al., 2010), and for monitoring disease trajectory (Hobbs et al., 2010). Applications of machine learning to this end include classification of schizophrenia from task activation maps (Demirci et al., 2008), using structural images to classify individuals as addicted or not (Zhang et al., 2005), and using task activation maps to classify schizophrenia, Alzheimer's, and mild traumatic brain injury (Ford et al., 2003). Depending on the specific method involved, and the neurobiological disease in question, such attempts have met with moderate to high success, i.e., ranging from 60 to 100% classification accuracy (Orrù et al., 2012).

Addiction in particular stands to benefit from the application of machine learning-based approaches. Multiple gene × environment interactions go into determining susceptibility to addiction at various stages—initiation of drug use, transition to repeated drug use and then on to compulsive use (Kreek et al., 2005). Aside from this, the drug of abuse itself interacts with neural systems to modulate drug-related circuitry and resulting cognition (Volkow et al., 2009). Not surprisingly, multiple brain networks have been implicated in this complex disease, and, to date, very few neural biomarkers have been identified for predicting vulnerability to addiction and treatment outcome (Pariyadath et al., 2013). Importantly, with rare exceptions (Zhang et al., 2005), the search for such biomarkers through neuroimaging has thus far been restricted to univariate approaches. Through a multivariate approach, we can begin to identify complex interactions within and between brain networks, and some day explore parallel contributions from genetic and environmental sources to neural function.

When comparing groups, differences in task performance can sometimes confound the interpretation of differences in neural activation patterns. Of late, resting state functional connectivity (rsFC) analysis has proven immensely valuable for extracting differences in neural function in the absence of an explicit task. Functional connectivity analyses indicate that there are significant differences in neural architecture and functioning in substance dependent individuals (Sutherland et al., 2012). This analysis approach has been combined with machine learning tools to extract a neural metric for maturity (Dosenbach et al., 2010) within a healthy cohort. Of relevance here, SVM-based approaches have been shown to be successful in classifying major depressive disorder (Craddock et al., 2009) and schizophrenia (Shen et al., 2010) from rsFC data. To date, however, rsFC data has not been explored using machine learning in addicted individuals.

In this study, we sought to identify neural features that may explain addiction in a multivariate fashion (those that speak to addiction when examined in tandem as opposed to in isolation) and that are predictive of the addicted state. To this end, we applied a linear SVM-based method to rsFC data from nicotine-dependent individuals and controls. Linear SVM algorithms have been shown to be an effective approach for large-dimensional problems, especially those where the number of features exceeds the number of samples (Hsu et al., 2003). We employed a network-centered approach, capitalizing on a previous attempt at reducing neural activity to resting state networks (Smith et al., 2009). Recent research suggests that cognition, and psychopathologies thereof, may be better understood as involving distributed brain areas that function as part of large-scale networks, as opposed to a single, focal brain region (Bressler and Menon, 2010). Research on other complex diseases, such as Alzheimer's, major depression, schizophrenia, and autism, has benefited from parcellating the brain in terms of large-scale functional networks (Bressler and Menon, 2010). Further, this approach permitted us to explore differences in functional connectivity without restricting the analysis to regions that have previously been identified through univariate approaches as relevant to addiction. We compared three different network-centered measures: (1) the extent to which each node within a network is representative of the parent network, (2) functional connectivity between different nodes within a network, and (3) functional connectivity between different networks.

Materials and Methods

Participants

Participants included 21 smokers (9 female) and 21 non-smoking controls (11 female), whose details are shown in Table 1. All smokers scored at least 6 on the Fagerstrom Test for Nicotine Dependence (FTND). Individuals with a history of pre-morbid neurological disease, major medical, or axis I psychiatric diagnosis other than substance use disorder, assessed by the computer-administered Structured Clinical Interview (SCID) for the Diagnostic and Statistical Manual of Mental Disorders IV (DSM-IV) screening version and clinician interview, or who had current substance dependence other than nicotine or cannabis, based on DSM IV criteria, were excluded from the study. Smokers were allowed to smoke ad libitum prior to the scan session. Participants gave written informed consent to this study approved by the Institutional Review Board at the National Institute on Drug Abuse-Intramural Research Program.


Table 1. Demographic characteristics of study population.

fMRI Acquisition and Pre-Processing

During resting state scans, participants were instructed to rest and keep their eyes open but not to think about anything in particular. Functional MRI data were collected on a 3-T Siemens Allegra MR scanner (Siemens, Erlangen, Germany) equipped with a quadrature volume head coil. Thirty-nine slices, positioned at 30° to the AC-PC line, were prescribed to cover the whole brain. The data were acquired using a single-shot gradient-echo echo-planar imaging (EPI) sequence with repetition time (TR) of 2 s, echo time (TE) of 27 ms, flip angle (FA) of 78°, field of view (FOV) of 220 × 220 mm, and an in-plane resolution of 3.44 × 3.44 mm with a slice thickness of 3.5 mm. For registration purposes, high-resolution anatomical images were acquired using a 3D magnetization prepared rapid gradient-echo (MPRAGE) T1-weighted sequence with TR of 2.5 s, TE of 4.38 ms, FA of 7°, and a voxel size of 1 × 1 × 1 mm.

Data processing and analyses were conducted in AFNI (Cox, 1996). Preprocessing included slice-timing and motion correction. Data were inspected for motion using censor.py (http://brainimaging.waisman.wisc.edu/~perlman/code/censor.py), employing a censoring threshold of 0.3 mm for translation and 0.3° for rotation between consecutive TRs. Data were then spatially normalized to the standard Talairach space. Spatial smoothing with a 6 mm FWHM Gaussian kernel was performed to increase the spatial signal-to-noise ratio. Global fluctuations, originating presumably from such systemic effects as respiration and cardiac-induced pulsations, were accounted for individually by orthogonalizing the time-courses with respect to the first three principal components from the white matter voxel time course ensemble and the first three principal components from the time course ensemble of the cerebrospinal fluid (CSF) voxels (Behzadi et al., 2007). In addition to these physiological regressors, participants' time courses were also orthogonalized with respect to the six motion parameters. Time courses were band-pass filtered (0.01–0.15 Hz) to retain only the low frequency components in the signal. Although it is common to use a narrower frequency band (e.g., cutoff frequency = 0.08 Hz), many studies do employ a higher cutoff frequency, such as 0.15 Hz, and in some rsFC analysis methods, a broader frequency band might even be preferable (Wu et al., 2008; Braun et al., 2012). We therefore chose to employ a broad frequency range for our network-based analysis. To address any concerns that the findings here may be driven by physiological noise (stemming from the use of this frequency range), we tested the classifier after band-pass filtering the signal with a lower cutoff frequency (0.1 Hz). We did not observe any significant difference in classifier performance.
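The orthogonalization step described above can be illustrated with a minimal sketch. This is not the AFNI pipeline used in the study; it assumes NumPy, synthetic data, and a hypothetical helper named `orthogonalize` that removes the least-squares fit of a set of nuisance regressors (for example, three white matter components, three CSF components, and six motion parameters) from each time course.

```python
import numpy as np

def orthogonalize(timecourses, nuisance):
    """Remove the part of each time course explained by nuisance regressors.

    timecourses: (n_timepoints, n_voxels)
    nuisance:    (n_timepoints, n_regressors)
    """
    # Include an intercept column so the mean is removed as well.
    design = np.column_stack([np.ones(len(nuisance)), nuisance])
    beta, *_ = np.linalg.lstsq(design, timecourses, rcond=None)
    # The residuals are orthogonal to every column of the design matrix.
    return timecourses - design @ beta

# Synthetic example (all values are illustrative, not study data).
rng = np.random.default_rng(0)
n_timepoints = 180
nuisance = rng.standard_normal((n_timepoints, 12))
data = rng.standard_normal((n_timepoints, 5)) + nuisance @ rng.standard_normal((12, 5))
cleaned = orthogonalize(data, nuisance)
```

After this step, `nuisance.T @ cleaned` is zero up to floating-point error, which is what "orthogonalized with respect to" means in practice.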

Recently, there has been some concern regarding motion-related artifacts in rsFC computation, specifically manifesting as decreases in estimated long-distance connectivity as a result of increased head motion (Power et al., 2012). To ensure that our results were not artifacts induced by head motion, we were careful to remove volumes with head motion above a stringent threshold during the pre-processing stage (0.3 mm for translation and 0.3° for rotation between consecutive TRs), and time courses were orthogonalized with respect to the participant's motion parameters. Additionally, we computed the root mean squared (RMS) head position change for smokers and controls, as well as the final number of volumes that were included in the computation of correlation coefficients.
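The censoring rule can be sketched as follows. The exact logic of censor.py is not reproduced here; this sketch assumes that a volume is flagged when any translation parameter changes by more than 0.3 mm, or any rotation parameter by more than 0.3°, relative to the preceding volume. The function name `censor_mask` and the column order of the motion array are assumptions.

```python
import numpy as np

def censor_mask(motion, trans_thresh=0.3, rot_thresh=0.3):
    """Return a boolean mask of volumes to keep.

    motion: (n_volumes, 6) array; columns 0-2 are assumed to be
    translations (mm) and columns 3-5 rotations (degrees).
    """
    # Absolute change of each parameter between consecutive volumes.
    diffs = np.abs(np.diff(motion, axis=0))
    bad = (diffs[:, :3] > trans_thresh).any(axis=1) | (diffs[:, 3:] > rot_thresh).any(axis=1)
    keep = np.ones(len(motion), dtype=bool)
    keep[1:] = ~bad  # flag the later volume of each offending pair
    return keep

# Toy example: a 0.5 mm jump at volume 3 that reverts at volume 4.
motion = np.zeros((6, 6))
motion[3, 0] = 0.5
keep = censor_mask(motion)
n_retained = int(keep.sum())
```

The number of retained volumes per participant (`n_retained` here) is the quantity that was later compared between groups.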

RSN Node Selection

Sixteen resting state networks (RSNs) were selected from a 20-component ICA decomposition of task fMRI data from the BrainMap database and resting data from 36 participants carried out in a previous study (Smith et al., 2009). Four RSNs were discarded from the original 20 as they had previously been identified as artifactual. Of the 16 spatial maps, four were not categorized in the original study. Three of them were speculated to overlap with multiple other RSNs—specifically sensorimotor, frontoparietal, and executive control networks (ECNs); we refer to these three here as Higher Order Networks or HONs (Figure 1). The fourth one comprises the cuneus and surrounding occipital regions, and is categorized here as Visual-4 (Figure 1). The 16 spatial maps were reduced to 56 node regions by thresholding at Z = 6 with a minimum cluster size restriction of 50 (1 × 1 × 1 mm³) voxels (Figure 1; Table 2) using the AFNI program 3dROIMaker (Taylor and Saad, 2013). This level of thresholding was chosen so as to qualitatively capture the networks observed in Smith et al. (2009), as these networks consistently appear in the literature and are temporally stable (Damoiseaux et al., 2006; Chen et al., 2008).
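The reduction of a spatial map to node regions can be sketched with a threshold followed by a cluster-size filter. This is an illustration using SciPy rather than 3dROIMaker; the connectivity rule (face-connected voxels, the `scipy.ndimage.label` default), the strict inequality at the threshold, and the function name `extract_nodes` are assumptions.

```python
import numpy as np
from scipy import ndimage

def extract_nodes(zmap, z_thresh=6.0, min_voxels=50):
    """Label supra-threshold clusters that contain at least min_voxels voxels."""
    mask = zmap > z_thresh
    labels, n_clusters = ndimage.label(mask)   # connected components
    sizes = np.bincount(labels.ravel())        # sizes[0] is the background
    nodes = np.zeros_like(labels)
    next_id = 1
    for lab in range(1, n_clusters + 1):
        if sizes[lab] >= min_voxels:
            nodes[labels == lab] = next_id     # renumber surviving clusters
            next_id += 1
    return nodes, next_id - 1

# Toy map: one 64-voxel cluster (kept) and one 8-voxel cluster (discarded).
zmap = np.zeros((20, 20, 20))
zmap[2:6, 2:6, 2:6] = 8.0
zmap[12:14, 12:14, 12:14] = 8.0
nodes, n_nodes = extract_nodes(zmap)
```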


Figure 1. The 16 resting state networks and their corresponding node regions. Resting state networks were selected and thresholded from a 20-component ICA decomposition of task fMRI data from the BrainMap database and resting data from 36 participants carried out in a previous study (Smith et al., 2009) (DMN, Default Mode Network; ECN, Executive Control Network; HON, Higher Order Network).


Table 2. The 16 RSNs and their corresponding node regions.

Three separate classifiers were built that each focused on a separate functional connectivity-based feature.

Representativeness of RSN (REP)

The dual regression method (Zuo et al., 2010) was used, with the thresholded RSN map as a template, to extract participant-level component maps. The participant-level component maps were then standardized into Z-score maps. As a measure of RSN representativeness, the average Z-score was calculated for each node region in the 16 RSNs, for each participant. This resulted in 56 REP features.
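A minimal sketch of the two regression stages and the resulting node-level feature is given below, under several assumptions: data are arranged as a time-by-voxel matrix, the standardization is a simple Z-score across voxels (the exact standardization used in the study is not specified here), and the function names `dual_regression` and `rep_feature` are hypothetical.

```python
import numpy as np

def dual_regression(data, group_maps):
    """data: (n_timepoints, n_voxels); group_maps: (n_components, n_voxels)."""
    # Stage 1: spatial regression yields one time course per component.
    tcs = np.linalg.lstsq(group_maps.T, data.T, rcond=None)[0].T   # (T, K)
    # Stage 2: temporal regression yields participant-level spatial maps.
    subj_maps = np.linalg.lstsq(tcs, data, rcond=None)[0]          # (K, V)
    return tcs, subj_maps

def rep_feature(subj_map, node_mask):
    """Average Z-score of a participant-level map within one node region."""
    z = (subj_map - subj_map.mean()) / subj_map.std()
    return float(z[node_mask].mean())

# Synthetic example with three non-overlapping template maps.
rng = np.random.default_rng(0)
maps = np.zeros((3, 200))
maps[0, :20] = 1.0
maps[1, 50:80] = 1.0
maps[2, 120:150] = 1.0
true_tcs = rng.standard_normal((100, 3))
data = true_tcs @ maps + 0.1 * rng.standard_normal((100, 200))

tcs, subj_maps = dual_regression(data, maps)
node_mask = maps[0] > 0
rep = rep_feature(subj_maps[0], node_mask)
```

Repeating `rep_feature` over every node region of every network would give one feature per node for each participant.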

Between-RSN Connectivity (B-RSN)

To obtain a measure of functional connectivity between networks, each group-ICA map was regressed against each participant's 4D dataset to extract the time-course corresponding to that component. Functional connectivity was computed as the temporal correlation between each pair of RSN time-courses. We employed this procedure, as opposed to calculating the correlation between every pair of nodes within any two RSNs and using the average correlation, to extract the time-course corresponding to the network as a whole. In this way, we are able to avoid extracting correlations that may arise from components in a node's time-course that do not correspond to its parent network. This procedure resulted in 120 B-RSN features.
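The feature count follows from the number of unordered network pairs: 16 networks give 16 × 15 / 2 = 120 correlations. The sketch below illustrates the procedure with NumPy on random data; the function name `between_network_features` and the array layout are assumptions.

```python
import numpy as np

def between_network_features(data, group_maps):
    """data: (n_timepoints, n_voxels); group_maps: (n_networks, n_voxels)."""
    # One time course per network, obtained by regressing the maps onto each volume.
    tcs = np.linalg.lstsq(group_maps.T, data.T, rcond=None)[0].T   # (T, n_networks)
    corr = np.corrcoef(tcs, rowvar=False)                          # network x network
    upper = np.triu_indices(corr.shape[0], k=1)                    # each pair once
    return corr[upper]

# Random data, used only to show the shapes involved.
rng = np.random.default_rng(0)
group_maps = rng.standard_normal((16, 500))
data = rng.standard_normal((180, 500))
features = between_network_features(data, group_maps)
```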

Within-RSN Connectivity (W-RSN)

To compute functional connectivity within an RSN, reference time courses from each of the node regions within a network were generated by averaging the time courses of all voxels within the region. Subsequently, correlation coefficients were computed between each pair of node time-courses within each RSN. As we wanted to analyze node pair connectivity only in the context of a given RSN, only pairs of nodes within the same network were analyzed. Two RSNs contained only a single node each (Visual-1 and Visual-3), and were therefore excluded from this classifier. This resulted in 119 W-RSN correlation features.
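The within-network computation can be sketched as below. The inputs (boolean voxel masks per node and a network label per node) and the function name `within_network_features` are illustrative assumptions; a network with a single node contributes no pairs, which is why such networks drop out.

```python
from itertools import combinations

import numpy as np

def within_network_features(data, node_masks, node_network):
    """Correlate node-averaged time courses for node pairs in the same network.

    data: (n_timepoints, n_voxels)
    node_masks: list of boolean arrays of length n_voxels, one per node
    node_network: list giving the parent network of each node
    """
    # Reference time course of a node = mean over its voxels.
    node_tcs = [data[:, mask].mean(axis=1) for mask in node_masks]
    features = {}
    for i, j in combinations(range(len(node_masks)), 2):
        if node_network[i] == node_network[j]:
            features[(i, j)] = float(np.corrcoef(node_tcs[i], node_tcs[j])[0, 1])
    return features

# Toy layout: network A has 3 nodes, B has 2, C has 1 (so C yields no pairs).
rng = np.random.default_rng(0)
data = rng.standard_normal((120, 60))
node_masks = [np.zeros(60, dtype=bool) for _ in range(6)]
for k, mask in enumerate(node_masks):
    mask[10 * k:10 * (k + 1)] = True
node_network = ["A", "A", "A", "B", "B", "C"]
feats = within_network_features(data, node_masks, node_network)
```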

Support Vector Machine (SVM) Classifier

SVM training and testing were carried out using the Scikit-learn package in Python (Pedregosa et al., 2011), whose SVM classifier is built on the LIBSVM library (Chang and Lin, 2011). A linear SVM was employed in all models (with soft margin parameter C = 1). Classification performance was tested using leave-one-out cross-validation (LOOCV). On each run, the training data were first scaled, and the same scaling transformation was then applied to the test data. Feature selection was carried out prior to classifier training through recursive feature elimination (Guyon et al., 2002) with either 0, 50, or 90% feature elimination; this provided a comparison of performance with no, medium, and high degrees of feature elimination. Features deemed critical by this method were carried forward to the classifier-training stage (Figure 2). Without feature elimination, even with superior classification performance, it would be impossible to make any meaningful inferences about the underlying neurobiology owing to the large number of features involved. Narrowing the set of features to 10% of the original set permits a more detailed understanding of the key circuits involved.
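One way to arrange scaling, recursive feature elimination, and a linear SVM inside LOOCV is a scikit-learn pipeline, so that scaling and feature selection are refit on the training samples of every fold. The sketch below is not the authors' code: the data are synthetic, the RFE `step` of 0.1 and the function name `loocv_predictions` are assumptions, and the boosting stage is omitted.

```python
import numpy as np
from sklearn.feature_selection import RFE
from sklearn.model_selection import LeaveOneOut, cross_val_predict
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for 21 smokers and 21 controls with 119 features.
rng = np.random.default_rng(0)
n_per_group, n_features = 21, 119
X = rng.standard_normal((2 * n_per_group, n_features))
y = np.repeat([0, 1], n_per_group)
X[y == 1, :5] += 1.0  # artificial group difference in five features

def loocv_predictions(X, y, keep_fraction=0.1):
    """LOOCV predictions; keep_fraction=0.1 corresponds to 90% elimination."""
    n_keep = max(1, int(round(keep_fraction * X.shape[1])))
    model = make_pipeline(
        StandardScaler(),                                   # fit on training folds only
        RFE(SVC(kernel="linear", C=1.0),
            n_features_to_select=n_keep, step=0.1),         # step size is an assumption
        SVC(kernel="linear", C=1.0),
    )
    return cross_val_predict(model, X, y, cv=LeaveOneOut())

y_pred = loocv_predictions(X, y)
accuracy = float((y_pred == y).mean())
```

With 119 features, `keep_fraction=0.1` retains 12 features per run. Keeping every step inside the pipeline means the held-out participant never influences scaling or feature selection.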


Figure 2. Classification algorithm for predicting smoking status using SVM-AdaBoost.

AdaBoost

The linear SVM classifier was supplemented by a boosting algorithm—AdaBoost (Freund and Schapire, 1995). This algorithm involves an iterative process of training the SVM classifier on a weighted set of samples, where the weights are determined by the accuracy of the classifier for those samples on the previous iteration. The final classification is obtained through a linear combination of individual classifiers, where the classification of each SVM classifier is weighted by its performance accuracy. In this manner, AdaBoost builds a non-linear classifier ensemble from a weighted combination of multiple linear SVM classifiers.
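The boosting loop can be written out explicitly. The sketch below implements the standard discrete AdaBoost update with linear SVMs as base learners, using the `sample_weight` argument of scikit-learn's `SVC.fit`. The number of boosting rounds, the function names, and the synthetic data are assumptions; the study's exact AdaBoost settings are not reproduced here.

```python
import numpy as np
from sklearn.svm import SVC

def fit_adaboost_svm(X, y, n_rounds=10):
    """Discrete AdaBoost with linear SVM base learners; y must be coded -1/+1."""
    n = len(y)
    w = np.full(n, 1.0 / n)                       # start with uniform sample weights
    ensemble = []
    for _ in range(n_rounds):
        # Rescale weights to mean 1 so the effective C stays close to 1.
        clf = SVC(kernel="linear", C=1.0).fit(X, y, sample_weight=w * n)
        pred = clf.predict(X)
        err = float(np.sum(w * (pred != y)))      # weighted training error
        if err >= 0.5:                            # no better than chance: stop
            break
        err = max(err, 1e-10)                     # guard against log of infinity
        alpha = 0.5 * np.log((1.0 - err) / err)   # weight of this classifier
        ensemble.append((alpha, clf))
        w = w * np.exp(-alpha * y * pred)         # up-weight misclassified samples
        w = w / w.sum()
    return ensemble

def predict_adaboost(ensemble, X):
    """Sign of the alpha-weighted vote of all base classifiers."""
    score = np.zeros(len(X))
    for alpha, clf in ensemble:
        score += alpha * clf.predict(X)
    return np.where(score >= 0, 1, -1)

# Synthetic two-group data (illustrative only).
rng = np.random.default_rng(0)
X = rng.standard_normal((40, 5))
y = np.repeat([-1, 1], 20)
X[y == 1, 0] += 1.5
ensemble = fit_adaboost_svm(X, y)
preds = predict_adaboost(ensemble, X)
```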

Classification Performance

To ascertain the performance of a classifier, we calculated accuracy and precision, defined as below:

\begin{array}{l} \text{Accuracy} = \dfrac{\text{number of true positives} + \text{number of true negatives}}{\text{number of all samples}} \\ \text{Precision} = \dfrac{\text{number of true positives}}{\text{number of true positives} + \text{number of false positives}} \end{array}
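As a small worked example, the two metrics can be computed directly from predicted and true labels. The sketch uses the standard definition of precision, true positives divided by all predicted positives (true positives plus false positives); the function name and the toy labels are illustrative.

```python
def accuracy_and_precision(y_true, y_pred, positive=1):
    """Return (accuracy, precision) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    accuracy = (tp + tn) / len(pairs)
    # Precision is undefined when nothing is predicted positive.
    precision = tp / (tp + fp) if (tp + fp) else float("nan")
    return accuracy, precision

# Three smokers (1) and three controls (0): 2 TP, 2 TN, 1 FP, 1 FN.
acc, prec = accuracy_and_precision([1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 0])
```

Here accuracy is 4/6 and precision is 2/3.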

To test whether classification performance was significantly above chance, we randomly classified each participant as a smoker or non-smoker, and trained and tested each classifier on this dataset. This process was executed 1000 times to obtain random distributions of accuracy and precision. Z-tests were then performed between the actual accuracy/precision values and the generated random distributions to determine statistical significance. Additionally, we tested whether actual accuracy and precision scores were 2 standard deviations above the mean of the generated random distribution.
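A chance distribution of this kind can be sketched as follows. Two assumptions are made: random class assignment is implemented as a permutation of the true labels (which preserves group sizes), and the Z-test is one-sided. A plain linear SVM stands in for the full classifier, and the data, function names, and small number of permutations are chosen only to keep the example fast; the study used 1000 repetitions.

```python
import numpy as np
from scipy.stats import norm
from sklearn.model_selection import LeaveOneOut, cross_val_score
from sklearn.svm import SVC

def loocv_accuracy(X, y):
    """Mean LOOCV accuracy of a linear SVM (each fold scores 0 or 1)."""
    clf = SVC(kernel="linear", C=1.0)
    return float(cross_val_score(clf, X, y, cv=LeaveOneOut()).mean())

def permutation_z_test(X, y, n_perm=1000, seed=0):
    rng = np.random.default_rng(seed)
    observed = loocv_accuracy(X, y)
    # Null distribution: retrain and retest on shuffled labels.
    null = np.array([loocv_accuracy(X, rng.permutation(y)) for _ in range(n_perm)])
    z = (observed - null.mean()) / null.std(ddof=1)
    p = float(norm.sf(z))                                   # one-sided p-value
    above_2sd = bool(observed > null.mean() + 2 * null.std(ddof=1))
    return observed, null, z, p, above_2sd

# Small synthetic dataset (illustrative only).
rng = np.random.default_rng(1)
X = rng.standard_normal((20, 4))
y = np.repeat([0, 1], 10)
X[y == 1, 0] += 2.0
observed, null, z, p, above_2sd = permutation_z_test(X, y, n_perm=25)
```

The same procedure applies to precision by swapping the scoring function.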

To identify features maximally contributing to improved classification performance with the within-RSN classifier, we extracted features that were utilized in the classifier following 90% feature elimination on 15 or more runs of LOOCV. By chance, each feature had a probability of 12/119 of appearing in the critical 12 on each run. Features that showed up in 15 or more runs of LOOCV were therefore appearing far more frequently than would be predicted by chance (p < 0.000007).
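The chance level for this selection-frequency criterion can be approximated with a binomial tail. The sketch below rests on assumptions that should be stated plainly: 12 of 119 features are retained on each run, so a given feature is retained by chance with probability 12/119; there are 42 LOOCV runs (one per participant); and runs are treated as independent, which is only approximately true because the training sets overlap heavily. The function name is hypothetical.

```python
from math import comb

def binomial_tail(k_min, n_runs, p):
    """P(X >= k_min) for X ~ Binomial(n_runs, p)."""
    return sum(
        comb(n_runs, k) * p**k * (1 - p) ** (n_runs - k)
        for k in range(k_min, n_runs + 1)
    )

p_chance = 12 / 119          # assumed per-run chance of a feature being retained
tail = binomial_tail(15, 42, p_chance)
```

Under these assumptions, `tail` is the probability that a feature would be retained on 15 or more of the 42 runs purely by chance.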

Results

Smokers and controls did not differ statistically in age, gender, or ethnicity (Table 1), reducing the probability that the classifier's performance was biased by demographic features irrelevant to nicotine addiction. After applying the SVM-AdaBoost algorithm to 21 smokers and 21 controls, accuracy and precision were calculated for the REP, between-RSN, and within-RSN classifiers with and without feature elimination.

Based on the above metrics, we concluded that the within-RSN and REP classifiers can reliably be used to classify smokers from non-smokers (Table 3). On the other hand, the between-RSN classifier's performance was not consistently above chance. This suggests that there is limited predictive information for nicotine addiction in the functional connectivity between RSNs, at least based on the current method of defining network nodes.


Table 3. Performance (accuracy and precision) of the three SVM-AdaBoost Classifiers.

As can be observed from Table 3, classification performance significantly improved with feature elimination for the within-RSN classifier. To identify the features maximally contributing to this improved classification performance, we extracted the features that were utilized in the classifier following 90% feature elimination on 15 or more runs of LOOCV (Figure 3). This process revealed that connectivity within HON-3 (6 circuits), HON-2 (4 circuits), executive control (2 circuits), and frontoparietal (1 circuit) networks specifically were predictive of smoking status. These circuits involved parts of the middle and superior frontal gyri, posterior cingulate cortex, precuneus, middle temporal gyri, and inferior parietal gyri.


Figure 3. Features maximally contributing to SVM classification performance. Features that were utilized in the within-RSN classifier following 90% feature elimination on 15 or more runs of LOOCV were identified, and these consisted of circuits within the (A) ECN, (B) FP, (C) HON-2, and (D) HON-3. Red and blue lines indicate circuits in which connectivity was greater and lower, respectively, in smokers relative to controls. Thick lines indicate circuits that were individually statistically different between smokers and controls, as inferred from t-tests. Inset brains indicate the orientation of the larger configuration (ECN, Executive Control Network; FP, Frontoparietal Network; HON, Higher Order Network).

To confirm that our results were not artifacts induced by head motion, we computed the RMS head position change for smokers and controls and verified that the two groups did not differ statistically on this measure for translational [t₍₄₀₎ = −0.58; p = 0.564] or rotational [t₍₄₀₎ = 0.86; p = 0.394] head motion. Further, the two groups did not differ statistically in the final number of volumes that were included in the computation of correlation coefficients [t₍₄₀₎ = −1.97; p = 0.056; Mean_smokers = 173.67 ± 9.67; Mean_non-smokers = 178.19 ± 3.43].

Discussion

We employed a machine learning-based approach to identify functional connectivity measures that are predictive of nicotine dependence. A comparison of three network-centered functional connectivity measures revealed that the functional connectivity between nodes within resting state networks is most informative in predicting nicotine dependence. Classification based on functional connectivity between these networks, on the other hand, resulted in performance accuracy not consistently above chance levels. It should be noted, however, that classification performance of the within-network classifier improved when many features were eliminated; this suggests that specific within-network circuits, or a combination thereof, warrant further investigation in the context of nicotine addiction. We are not implying that within-network connectivity in general would be more informative about the disease.

The REP assessments were comparably successful in predicting smoking status vis-à-vis the within-RSN classifier. However, a similar examination of the critical nodes is difficult as classification performance did not improve with feature elimination, suggesting that all 56 nodes (Figure 1; Table 2) need to be considered when predicting smoking status. As the REP measure indicates the extent to which a specific node behaves like the network in general, strong classification performance here might reflect differences in within-network connectivity in smokers.

On further examination of the within-network connectivity classifier, we found that HON RSNs, including the frontoparietal and ECNs, were critical to the classification process. Within these networks, functional connectivity of the middle/inferior frontal gyrus, posterior cingulate cortex, and precuneus were especially informative regarding nicotine dependence, i.e., were utilized in the classification processes following 90% feature elimination on 15 or more runs of LOOCV. That predictive features were observed in frontal regions is not surprising. Multiple lines of research suggest impairments in this area as being critical to addiction (Goldstein and Volkow, 2002; Volkow et al., 2002; Hester and Garavan, 2004; Kober et al., 2010). The differences observed here may reflect impairments in frontal disinhibition, a deficit thought to be critical to the disease (Goldstein and Volkow, 2011).

Network-centered approaches have identified several large scale networks whose functional connectivity patterns at rest strongly correspond with network activity during various cognitive processes (Smith et al., 2009). Of these, two sets of networks have been implicated in task performance: task-positive networks, which are engaged during task performance, such as the ECN, and the task-negative “default mode network” (DMN). The DMN decreases in activity during task performance relative to its baseline at rest (Greicius et al., 2003). These task-positive and task-negative networks exhibit an anti-correlated relationship (Fox et al., 2005). Importantly, the degree to which activity between these networks is anti-correlated predicts variability in performance on cognitive tasks (Kelly et al., 2008), suggesting that the integrity of this between-network coordination is critical to efficient cognitive function. Our data suggest a breakdown of normal connectivity patterns within the ECN and DMN RSNs. This difference in functional connectivity could have two important implications: (1) it could influence the manner in which each network responds to a cognitive task or pharmacological manipulation (e.g., Cole et al., 2010), and consequently the observed interaction between the two networks; (2) similarly the within-network connectivity differences could affect ECN-DMN homeostasis under high-craving states. In support of such a possibility, DMN-ECN connectivity is weakened during smoking abstinence (Lerman et al., 2014). In other words, although the differences observed here are within a network, under various manipulations, they could manifest as between-network effects. (It is tempting to infer that the PCC-related circuits that showed up frequently as critical features speak to a DMN involvement along exactly these lines. However, in our hands, the DMN RSN itself does not appear to be a key player in the classification process, and the PCC-related circuits observed here were located in other higher order RSNs.) Here, the data were from smokers who were allowed to smoke ad libitum and were thus unlikely to be experiencing withdrawal. To address whether the connectivity disruptions reflect frontal disinhibition problems or DMN-ECN antagonism differences, classification performance needs to be compared between abstinent and sated smokers. Frontal disinhibition problems are seen to predate addiction (De Wit, 2009; Ersche et al., 2010, 2012), and are exacerbated by chronic drug use (Volkow et al., 2009), but are unlikely to be affected by acute effects of nicotine (Bekker et al., 2005). Thus, middle/inferior frontal circuits should still provide important information for classification in both sated and abstinent states. On the other hand, DMN-ECN dynamics should be markedly different in the two states, and thus functional connectivity within the ECN in one state should not be particularly informative to the other.

It is important to note that some of the features critical to the classification performance may be changes induced by chronic nicotine consumption, while others are likely pre-existent. Parts of the middle frontal gyrus, for example, have been shown to decrease in gray matter volume as a function of lifetime exposure to cigarette smoke (Brody et al., 2004; see also Gallinat et al., 2006). Importantly, classification performance in this study was guided by a combination of different neural features, and not driven by any one feature in particular. In support of this assertion, within-network classification dropped in performance when the number of features was limited to 1 (accuracy = 57.14%, precision = 56.0%). Nicotine addiction is likely an interaction of pre-existing vulnerabilities and nicotine-induced impairments; while machine learning allows us to uncover such interactions, future work will need to disentangle individual contributions from both these sources.

Classification performance here was not as high as has sometimes been reported from rsFC data (see Dosenbach et al., 2010, for example, in which the authors achieved classification accuracy over 90% when classifying individuals as either children or adults); there may be several reasons for this. Firstly, great care was taken here to eliminate head motion-induced artifacts that have previously been shown to influence functional connectivity estimates (Power et al., 2012). Efforts included removal of time points where head motion was above a stringent threshold, inclusion of head motion parameters as nuisance regressors, and a comparison of head motion data between the two groups. By diminishing head motion-related confounds, we have likely reduced any artificial enhancement of classification performance from irrelevant motion artifacts. Secondly, we followed an agnostic approach of including networks/nodes that are involved in a wide range of cognitive processing, and not merely those shown to be distinguishing features within the same sample set. In this way, we have avoided any inadvertent enhancement of classification performance through “double dipping” (Kriegeskorte et al., 2009). Finally, the sample sizes used here, although standard for machine-learning approaches to fMRI data (Zhang et al., 2005; Yang et al., 2010), may have been insufficient for probing neural differences more subtle than those seen in schizophrenia or Parkinson's disease.

To obtain a more complete picture of disruptions in functional connectivity as a consequence of nicotine addiction, it would be illuminating to examine such data on a continuum of addiction severity. Although we analyzed the data here as binary classes of “high severity of nicotine addiction” and “no nicotine addiction,” additional insight could be gleaned by using support vector regression, for example, to extract features predictive of the FTND (Fagerström et al., 2012). Such a regression-based approach would allow for a more nuanced understanding of individual differences in the severity of addiction, especially when partnered with different treatment strategies. However, as this is a relatively crude measure of disease severity, it is likely that a much larger sample size and variance in FTND would be necessary for such a regression approach. Similarly, support vector regression with a focus on lifetime smoking exposure could provide valuable insights into the consequences of chronic nicotine consumption.

One potential concern regarding our findings is that, currently, there are limited data supporting the value of within-RSN centered analysis in nicotine addiction research. As already mentioned, a previous publication from our lab (Sutherland et al., 2012) examined nicotine-related changes in within- (and between-) DMN and ECN connectivity. Prior to this, Cole et al. (2010) examined the effects of nicotine replacement therapy on within- (and between-) DMN and ECN connectivity. However, although studies like these that focus on RSNs as defined in Smith et al. (2009) are not common, it is not unusual for addiction studies to focus on within-network connectivity in pre-specified networks. For example, many rsFC studies in addiction have limited their analyses to frontal and mesocorticolimbic circuits (see Sutherland et al., 2012, for a review).

Machine learning-based approaches offer both basic and clinical applications for addiction research: the capability to identify neural features critical to predicting addiction, and the potential for using such a classifier in clinical settings to predict treatment outcome or future substance dependence. For the latter, classification performance needs to approach 95–100% accuracy (Orrù et al., 2012), although for screening future substance dependence, high sensitivity (the true positive rate) is likely more important than overall accuracy, even at the expense of specificity (the true negative rate). In any case, a potential limitation of this study is that rsFC by itself may not be powerful enough to predict smoking status with close to perfect performance. Perhaps by including task-based data, or features involving other modalities (e.g., genetics), we may obtain superior predictive capabilities. It has been shown that by combining fMRI and genetics information, classification of schizophrenics from controls can be significantly enhanced (Yang et al., 2010). Similarly, by including genetic information that has previously been shown to differentiate smokers from non-smokers (Kreek et al., 2005; Hong et al., 2010), perhaps the predictive power of such classifiers can be augmented to the extent required for clinical applications. Nevertheless, our data suggest that there is tremendous potential in combining rsFC data with machine learning-based techniques for advancing our understanding of network-level predictive differences critical to addiction.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank Steve Smith of the FMRIB Analysis Group at the University of Oxford for the use of the resting state network maps originally published in Smith et al. (2009). This work was supported by the National Institute on Drug Abuse, Intramural Research Program, NIH/DHHS.

References

Behzadi, Y., Restom, K., Liau, J., and Liu, T. T. (2007). A component based noise correction method (CompCor) for BOLD and perfusion based fMRI. Neuroimage 37, 90–101. doi: 10.1016/j.neuroimage.2007.04.042

Bekker, E., Böcker, K., Van Hunsel, F., Van Den Berg, M., and Kenemans, J. (2005). Acute effects of nicotine on attention and response inhibition. Pharmacol. Biochem. Behav. 82, 539–548. doi: 10.1016/j.pbb.2005.10.009

Braun, U., Plichta, M. M., Esslinger, C., Sauer, C., Haddad, L., Grimm, O., et al. (2012). Test–retest reliability of resting-state connectivity network characteristics using fMRI and graph theoretical measures. Neuroimage 59, 1404–1412. doi: 10.1016/j.neuroimage.2011.08.044

Bressler, S. L., and Menon, V. (2010). Large-scale brain networks in cognition: emerging methods and principles. Trends Cogn. Sci. 14, 277–290. doi: 10.1016/j.tics.2010.04.004

Brody, A. L., Mandelkern, M. A., Jarvik, M. E., Lee, G. S., Smith, E. C., Huang, J. C., et al. (2004). Differences between smokers and nonsmokers in regional gray matter volumes and densities. Biol. Psychiatry 55, 77–84. doi: 10.1016/S0006-3223(03)00610-3

Chang, C.-C., and Lin, C.-J. (2011). LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2:27. doi: 10.1145/1961189.1961199

Chen, S., Ross, T. J., Zhan, W., Myers, C. S., Chuang, K.-S., Heishman, S. J., et al. (2008). Group independent component analysis reveals consistent resting-state networks across multiple sessions. Brain Res. 1239, 141–151. doi: 10.1016/j.brainres.2008.08.028

Cole, D. M., Beckmann, C. F., Long, C. J., Matthews, P. M., Durcan, M. J., and Beaver, J. D. (2010). Nicotine replacement in abstinent smokers improves cognitive withdrawal symptoms with modulation of resting brain network dynamics. Neuroimage 52, 590–599. doi: 10.1016/j.neuroimage.2010.04.251

Cox, R. W. (1996). AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. Comput. Biomed. Res. 29, 162–173. doi: 10.1006/cbmr.1996.0014

Craddock, R. C., Holtzheimer, P. E., Hu, X. P., and Mayberg, H. S. (2009). Disease state prediction from resting state functional connectivity. Magn. Reson. Med. 62, 1619–1628. doi: 10.1002/mrm.22159

Damoiseaux, J., Rombouts, S., Barkhof, F., Scheltens, P., Stam, C., Smith, S. M., et al. (2006). Consistent resting-state networks across healthy subjects. Proc. Natl. Acad. Sci. U.S.A. 103, 13848–13853. doi: 10.1073/pnas.0601417103

Demirci, O., Clark, V. P., and Calhoun, V. D. (2008). A projection pursuit algorithm to classify individuals using fMRI data: application to schizophrenia. Neuroimage 39, 1774–1782. doi: 10.1016/j.neuroimage.2007.10.012

De Wit, H. (2009). Impulsivity as a determinant and consequence of drug use: a review of underlying processes. Addict. Biol. 14, 22–31. doi: 10.1111/j.1369-1600.2008.00129.x

Dosenbach, N. U., Nardos, B., Cohen, A. L., Fair, D. A., Power, J. D., Church, J. A., et al. (2010). Prediction of individual brain maturity using fMRI. Science 329, 1358–1361. doi: 10.1126/science.1194144

Ersche, K. D., Jones, P. S., Williams, G. B., Turton, A. J., Robbins, T. W., and Bullmore, E. T. (2012). Abnormal brain structure implicated in stimulant drug addiction. Science 335, 601–604. doi: 10.1126/science.1214463

Ersche, K. D., Turton, A. J., Pradhan, S., Bullmore, E. T., and Robbins, T. W. (2010). Drug addiction endophenotypes: impulsive versus sensation-seeking personality traits. Biol. Psychiatry 68, 770–773. doi: 10.1016/j.biopsych.2010.06.015

Fagerström, K., Russ, C., Yu, C. R., Yunis, C., and Foulds, J. (2012). The Fagerström test for nicotine dependence as a predictor of smoking abstinence: a pooled analysis of varenicline clinical trial data. Nicotine Tob. Res. 14, 1467–1473. doi: 10.1093/ntr/nts018

Ford, J., Farid, H., Makedon, F., Flashman, L. A., McAllister, T. W., Megalooikonomou, V., et al. (2003). “Patient classification of fMRI activation maps,” in Medical Image Computing and Computer-Assisted Intervention-MICCAI 2003 (Berlin; Heidelberg: Springer), 58–65. doi: 10.1007/978-3-540-39903-2_8

Fox, M. D., Snyder, A. Z., Vincent, J. L., Corbetta, M., Van Essen, D. C., and Raichle, M. E. (2005). The human brain is intrinsically organized into dynamic, anticorrelated functional networks. Proc. Natl. Acad. Sci. U.S.A. 102, 9673–9678. doi: 10.1073/pnas.0504136102

Freund, Y., and Schapire, R. E. (1995). “A decision-theoretic generalization of on-line learning and an application to boosting,” in Computational Learning Theory (Berlin; Heidelberg: Springer), 23–37. doi: 10.1007/3-540-59119-2_166

Gallinat, J., Meisenzahl, E., Jacobsen, L. K., Kalus, P., Bierbrauer, J., Kienast, T., et al. (2006). Smoking and structural brain deficits: a volumetric MR investigation. Eur. J. Neurosci. 24, 1744–1750. doi: 10.1111/j.1460-9568.2006.05050.x

Goldstein, R. Z., and Volkow, N. D. (2002). Drug addiction and its underlying neurobiological basis: neuroimaging evidence for the involvement of the frontal cortex. Am. J. Psychiatry 159:1642. doi: 10.1176/appi.ajp.159.10.1642

Goldstein, R. Z., and Volkow, N. D. (2011). Dysfunction of the prefrontal cortex in addiction: neuroimaging findings and clinical implications. Nat. Rev. Neurosci. 12, 652–669. doi: 10.1038/nrn3119

Greicius, M. D., Krasnow, B., Reiss, A. L., and Menon, V. (2003). Functional connectivity in the resting brain: a network analysis of the default mode hypothesis. Proc. Natl. Acad. Sci. U.S.A. 100, 253–258. doi: 10.1073/pnas.0135058100

Guyon, I., Weston, J., Barnhill, S., and Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422. doi: 10.1023/A:1012487302797

Haxby, J. V. (2012). Multivariate pattern analysis of fMRI: the early beginnings. Neuroimage 62, 852–855. doi: 10.1016/j.neuroimage.2012.03.016

Hester, R., and Garavan, H. (2004). Executive dysfunction in cocaine addiction: evidence for discordant frontal, cingulate, and cerebellar activity. J. Neurosci. 24, 11017–11022. doi: 10.1523/JNEUROSCI.3321-04.2004

Hobbs, N. Z., Henley, S. M., Ridgway, G. R., Wild, E. J., Barker, R. A., Scahill, R. I., et al. (2010). The progression of regional atrophy in premanifest and early Huntington's disease: a longitudinal voxel-based morphometry study. J. Neurol. Neurosurg. Psychiatry 81, 756–763. doi: 10.1136/jnnp.2009.190702

Hong, L. E., Hodgkinson, C. A., Yang, Y., Sampath, H., Ross, T. J., Buchholz, B., et al. (2010). A genetically modulated, intrinsic cingulate circuit supports human nicotine addiction. Proc. Natl. Acad. Sci. U.S.A. 107, 13509–13514. doi: 10.1073/pnas.1004745107

Hsu, C. W., Chang, C. C., and Lin, C. J. (2003). A practical guide to support vector classification. Technical Report. Available online at: http://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf.

Kelly, A., Uddin, L. Q., Biswal, B. B., Castellanos, F. X., and Milham, M. P. (2008). Competition between functional brain networks mediates behavioral variability. Neuroimage 39, 527–537. doi: 10.1016/j.neuroimage.2007.08.008

Kober, H., Mende-Siedlecki, P., Kross, E. F., Weber, J., Mischel, W., Hart, C. L., et al. (2010). Prefrontal–striatal pathway underlies cognitive regulation of craving. Proc. Natl. Acad. Sci. U.S.A. 107, 14811–14816. doi: 10.1073/pnas.1007779107

Kreek, M., Nielsen, D., Butelman, E., and Laforge, K. (2005). Genetic influences on impulsivity, risk taking, stress responsivity and vulnerability to drug abuse and addiction. Nat. Neurosci. 8, 1450–1457. doi: 10.1038/nn1583

Kriegeskorte, N., Simmons, W. K., Bellgowan, P. S., and Baker, C. I. (2009). Circular analysis in systems neuroscience: the dangers of double dipping. Nat. Neurosci. 12, 535–540. doi: 10.1038/nn.2303

Lerman, C., Gu, H., Loughead, J., Ruparel, K., Yang, Y., and Stein, E. A. (2014). Large-scale brain network coupling predicts acute nicotine abstinence effects on craving and cognitive function. JAMA Psychiatry 71, 523–530. doi: 10.1001/jamapsychiatry.2013.4091

Orrù, G., Pettersson-Yeo, W., Marquand, A. F., Sartori, G., and Mechelli, A. (2012). Using support vector machine to identify imaging biomarkers of neurological and psychiatric disease: a critical review. Neurosci. Biobehav. Rev. 36, 1140–1152. doi: 10.1016/j.neubiorev.2012.01.004

Pariyadath, V., Paulus, M. P., and Stein, E. A. (2013). “Brain, reward, and drug addiction,” in Neurobiology of Mental Illness, 4th Edn., eds D. S. Charney, E. J. Nestler, P. Sklar, and J. D. Buxbaum (New York, NY: Oxford University Press), 732–741.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830. Available online at: http://jmlr.org/papers/v12/pedregosa11a.html

Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L., and Petersen, S. E. (2012). Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage 59, 2142–2154. doi: 10.1016/j.neuroimage.2011.10.018

Shen, H., Wang, L., Liu, Y., and Hu, D. (2010). Discriminative analysis of resting-state functional connectivity patterns of schizophrenia using low dimensional embedding of fMRI. Neuroimage 49, 3110–3121. doi: 10.1016/j.neuroimage.2009.11.011

Smith, S. M., Fox, P. T., Miller, K. L., Glahn, D. C., Fox, P. M., Mackay, C. E., et al. (2009). Correspondence of the brain's functional architecture during activation and rest. Proc. Natl. Acad. Sci. U.S.A. 106, 13040–13045. doi: 10.1073/pnas.0905267106

Stonnington, C. M., Chu, C., Klöppel, S., Jack Jr, C. R., Ashburner, J., and Frackowiak, R. S. (2010). Predicting clinical scores from magnetic resonance scans in Alzheimer's disease. Neuroimage 51, 1405–1413. doi: 10.1016/j.neuroimage.2010.03.051

Sutherland, M. T., McHugh, M. J., Pariyadath, V., and Stein, E. A. (2012). Resting state functional connectivity in addiction: lessons learned and a road ahead. Neuroimage 62, 2281–2295. doi: 10.1016/j.neuroimage.2012.01.117

Taylor, P. A., and Saad, Z. S. (2013). FATCAT: (an efficient) functional and tractographic connectivity analysis toolbox. Brain Connect. 3, 523–535. doi: 10.1089/brain.2013.0154

Vapnik, V. N. (2000). The Nature of Statistical Learning Theory. New York, NY: Springer-Verlag.

Vemuri, P., Gunter, J. L., Senjem, M. L., Whitwell, J. L., Kantarci, K., Knopman, D. S., et al. (2008). Alzheimer's disease diagnosis in individual subjects using structural MR images: validation studies. Neuroimage 39, 1186–1197. doi: 10.1016/j.neuroimage.2007.09.073

Volkow, N., Fowler, J., Wang, G., Baler, R., and Telang, F. (2009). Imaging dopamine's role in drug abuse and addiction. Neuropharmacology 56, 3–8. doi: 10.1016/j.neuropharm.2008.05.022

Volkow, N. D., Fowler, J. S., Wang, G.-J., and Goldstein, R. Z. (2002). Role of dopamine, the frontal cortex and memory circuits in drug addiction: insight from imaging studies. Neurobiol. Learn. Mem. 78, 610–624. doi: 10.1006/nlme.2002.4099

Wu, C. W., Gu, H., Lu, H., Stein, E. A., Chen, J.-H., and Yang, Y. (2008). Frequency specificity of functional connectivity in brain networks. Neuroimage 42, 1047–1055. doi: 10.1016/j.neuroimage.2008.05.035

Yang, H., Liu, J., Sui, J., Pearlson, G., and Calhoun, V. D. (2010). A hybrid machine learning method for fusing fMRI and genetic data: combining both improves classification of schizophrenia. Front. Hum. Neurosci. 4:192. doi: 10.3389/fnhum.2010.00192

Zhang, L., Samaras, D., Tomasi, D., Volkow, N., and Goldstein, R. (2005). “Machine learning for clinical diagnosis from functional magnetic resonance imaging,” in IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2005 (San Diego, CA), 1211–1217.

Zuo, X.-N., Kelly, C., Adelstein, J. S., Klein, D. F., Castellanos, F. X., and Milham, M. P. (2010). Reliable intrinsic connectivity networks: test–retest evaluation using ICA and dual regression approach. Neuroimage 49, 2163–2177. doi: 10.1016/j.neuroimage.2009.10.080

Keywords: biomarkers, fMRI, machine learning, nicotine addiction, support vector machines

Citation: Pariyadath V, Stein EA and Ross TJ (2014) Machine learning classification of resting state functional connectivity predicts smoking status. Front. Hum. Neurosci. 8:425. doi: 10.3389/fnhum.2014.00425

Received: 13 February 2014; Accepted: 28 May 2014;
Published online: 16 June 2014.

Edited by:

Shuhei Yamaguchi, Shimane University, Japan

Reviewed by:

Vince D. Calhoun, University of New Mexico, USA
Keiichi Onoda, Shimane University, Japan

Copyright © 2014 Pariyadath, Stein and Ross. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Vani Pariyadath, Neuroimaging Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, 251 Bayview Blvd., Suite 200, Rm. 07A505.08, Baltimore, MD 21224, USA e-mail: vani.pariyadath@nih.gov
