ORIGINAL RESEARCH article
Sec. Brain Imaging and Stimulation
Volume 11 - 2017 | https://doi.org/10.3389/fnhum.2017.00555
Predicting Long-Term Cognitive Outcome Following Breast Cancer with Pre-Treatment Resting State fMRI and Random Forest Machine Learning
- 1Department of Neuro-Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, United States
- 2Department of Bioinformatics and Computational Biology, University of Texas MD Anderson Cancer Center, Houston, TX, United States
- 3Division of Medical Oncology, School of Medicine, Stanford University, Palo Alto, CA, United States
- 4Cancer Prevention Institute of California, Fremont, CA, United States
- 5Department of Breast Medical Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, United States
- 6Department of Psychiatry and Behavioral Sciences, School of Medicine, Stanford University, Palo Alto, CA, United States
We aimed to determine if resting state functional magnetic resonance imaging (fMRI) acquired at pre-treatment baseline could accurately predict breast cancer-related cognitive impairment at long-term follow-up. We evaluated 31 patients with breast cancer (age 34–65) prior to any treatment, post-chemotherapy and 1 year later. Cognitive testing scores were normalized based on data obtained from 43 healthy female controls and then used to categorize patients as impaired or not based on longitudinal changes. We measured clustering coefficient, a measure of local connectivity, by applying graph theory to baseline resting state fMRI and entered these metrics along with relevant patient-related and medical variables into random forest classification. Incidence of cognitive impairment at 1 year follow-up was 55% and was predicted by classification algorithms with up to 100% accuracy (p < 0.0001). The neuroimaging-based model was significantly more accurate than a model involving patient-related and medical variables (p = 0.005). Hub regions belonging to several distinct functional networks were the most important predictors of cognitive outcome. Characteristics of these hubs indicated potential spread of brain injury from default mode to other networks over time. These findings suggest that resting state fMRI is a promising tool for predicting future cognitive impairment associated with breast cancer. This information could inform treatment decision making by identifying patients at highest risk for long-term cognitive impairment.
Breast cancer and its treatments can have neurotoxic effects in some patients, resulting in acute, persistent and/or late onset cognitive impairments such as difficulties with attention, memory, processing speed, executive function and verbal fluency (Wefel et al., 2015). Defining cognitive impairment has historically been a challenge for cancer-related neurotoxicity as well as other neurologic syndromes. The Diagnostic Statistical Manual-5 (DSM-5) includes a category, “mild neurocognitive disorder” (Saykin et al., 2013; Sachs-Ericsson and Blazer, 2015), which is often used in clinical practice for patients with cancer-related cognitive impairment. Currently there is no diagnostic criteria specific to patients with cancer, however, the International Cognition and Cancer Task Force (ICCTF) recommends an approach for defining cancer-related cognitive impairment (Wefel et al., 2011), which we employed in this study. Clinically, the ICCTF definition corresponds to “mild to moderate impairment”, which is consistent with the DSM-5 criteria for mild neurocognitive disorder and with the level of deficit commonly observed in patients with cancer (Wefel et al., 2015). A previous study observed that the ICCTF definition is almost twice as sensitive to impairment as other methods (Vardy et al., 2017) and we have shown that it corresponds with measures of brain network connectivity (Kesler et al., 2015, 2016).
There are many potential etiologies for cancer-related cognitive impairment, but the final common biologic pathway is altered brain structure and function, which can be evaluated using neuroimaging. In addition to providing valuable insights regarding these neural mechanisms, baseline neuroimaging biomarkers can serve as predictors of future outcome. Several examples of this application exist in other conditions including predicting Alzheimer’s disease conversion, development of dyslexia, and response to interventions such as cognitive rehabilitation (Strangman et al., 2010; Hoeft et al., 2011; Moradi et al., 2015; Thompson et al., 2015). Cancer-related cognitive impairment is one of the only syndromes where a potential brain injury is known in advance and is therefore ideal for prediction of outcome from early or pre-treatment, baseline data. This information could be practice-changing by assisting oncologists with treatment decision-making based on a patient’s individual risk for negative cognitive effects.
Resting state functional magnetic resonance imaging (fMRI), which is one of the most sensitive neuroimaging techniques currently available, is non-invasive and simple to acquire (Kesler, 2014). Resting state fMRI data characterize spontaneous, spatially and temporally coherent functional activity in the brain and are typically used to measure intrinsic functional network connectivity (Raichle, 2011). Intrinsic functional networks reflect various cognitive states, represent the majority of energy usage in the brain and are associated with the expression of genes that regulate synaptic function (Fox and Greicius, 2010; Shirer et al., 2012; Buckner et al., 2013; Richiardi et al., 2015). We previously demonstrated that resting state fMRI data in combination with machine learning can be used to automatically distinguish chemotherapy-treated breast cancer survivors from chemotherapy naïve survivors and healthy female controls (Kesler et al., 2013). Others have shown that resting state fMRI is sensitive to brain networks that recover following chemotherapy vs. those that do not (Dumas et al., 2013).
We recently demonstrated subtle disruption of intrinsic functional network organization in patients newly diagnosed with breast cancer who were evaluated prior to any treatment, including surgery (Kesler et al., 2017a). These findings indicate that resting state fMRI can detect brain changes that are likely associated with aspects of tumor pathology and/or pre-existing patient characteristics that are important for cognitive trajectory. These early neural deficits may make the brain more vulnerable to the effects of chemotherapy, other adjuvant treatments and/or aging, resulting in long-term cognitive impairment.
The application of machine learning algorithms (Jordan and Mitchell, 2015) frequently provides increased prediction accuracy since these models tend to be nonparametric and able to learn complex interactions among predictors. These characteristics are especially important when studying cognition since brain function arises from complex interactions among various neuronal communities. Additionally, machine learning approaches tend to be more suitable than traditional statistical methods for problems such as cancer-related cognitive impairment that involve a large number of potential predictors (Strobl et al., 2007). There are many different machine learning approaches. Random forest modeling uses random subsets of features to grow an ensemble of decision trees that predict the outcome of interest for classification and regression problems (Breiman et al., 1984; Breiman, 2001). We have previously demonstrated that random forest models are highly useful for evaluating cancer-related cognitive impairment (Kesler et al., 2016, 2017b). For example, we showed that random forest models are superior to traditional linear models for examining the relationships between neuroimaging metrics and cancer-related cognitive impairment (Kesler et al., 2016).
In this study, we aimed to predict chronic cognitive impairment (observed at 1 year post-chemotherapy) from baseline intrinsic functional network characteristics obtained prior to treatment initiation. There are many different characteristics of intrinsic functional connectivity that can be measured from resting state fMRI. Based on our prior studies of breast and other cancers (Kesler et al., 2015, 2016, 2017b), we focused on clustered connectivity (i.e., clustering coefficient), a property of brain networks derived from graph theoretical analysis (Rubinov and Sporns, 2010). We examined the accuracy of regional clustering coefficients for predicting categorical cognitive impairment (impaired, unimpaired) by entering them into random forest classification models. We hypothesized that regional clustering coefficients alone or in combination with patient/medical factors would more accurately predict future cognitive outcome than patient/medical factors alone.
Materials and Methods
As part of our ongoing, prospective longitudinal study of breast cancer and cognition, we enrolled 31 newly diagnosed patients with primary breast cancer age 34–65 years and 43 frequency matched healthy control females (Table 1). Patients were assessed prior to initiation of any treatment (including surgery with general anesthesia), 1 month after completing chemotherapy and again 1 year later. Controls were assessed at yoked intervals. Participants were included in the present study if they had completed both the baseline and 1 year follow-up assessments (see Kesler et al., 2017a or Supplementary Methods). Chemotherapy regimens included doxorubicin, cyclophosphamide and paclitaxel (N = 16), cyclophosphamide, doxorubicin and fluorouracil (N = 2), cyclophosphamide and paclitaxel (N = 9), doxorubicin, carboplatin and paclitaxel (N = 2), fluorouracil, epirubicin and cyclophosphamide (N = 2). The Stanford University Institutional Review Board approved this study and all procedures performed were in accordance with the ethical standards of the Declaration of Helsinki. Written informed consent was obtained from all participants included in the study.
Table 1. Participant data at pre-treatment baseline shown as mean (standard deviation) unless otherwise noted.
Cognitive Impairment Assessment
Cognitive function was measured using the following standardized tests: Rey Auditory Verbal Learning Test (RAVLT) for verbal learning and verbal memory retention (Schmidt, 2012), Comprehensive Trail Making Test (CTMT) for attention, processing speed and executive function (Moses, 2004), and Controlled Oral Word Association (COWA) for verbal fluency (Ruff et al., 1996). This is consistent with the testing battery recommended by the ICCTF for harmonizing studies of cancer and cognition (Wefel et al., 2011). This is also the battery we have shown previously to be sensitive to cognitive deficits in patients with breast cancer (Kesler and Blayney, 2016; Kesler et al., 2017a). Psychological distress (depression, anxiety, cognitive fatigue) was assessed using the Total Score from the Clinical Assessment of Depression (CAD; Aghakhani and Chan, 2007). We also examined self-ratings from our Mobile Cognitive Assessment Battery Adjustment Index, a questionnaire regarding functional capacity (i.e., occupational, home, leisure and social function (Kesler and Blayney, 2014)). Additional self-report questionnaires, as well as several non-standardized, experimental computerized tests, were administered but are not reported here (total testing time = 1.5 h).
Test scores were converted to z-scores based on the control group’s mean and standard deviation. Cognitive impairment was defined as having any two z-scores of −1.5 or lower or any one z-score of −2.0 or lower, based on the ICCTF recommendations (Wefel et al., 2011) and our prior studies (Kesler et al., 2015, 2016, 2017b). A patient was categorized as impaired if her performance was impaired at both baseline and 1 year follow-up (persistent impairment) or if she demonstrated impaired performance at 1 year follow-up that was not present at baseline (late onset impairment). As noted above, we have previously demonstrated that this impairment definition is associated with measures of brain network organization. Impairment was also moderately associated with elevated symptoms on the Adjustment Index (r = 0.29, p = 0.059), suggesting further ecological validity.
Neuroimaging Acquisition and Preprocessing
Neuroimaging data were acquired using a GE Discovery MR750 3.0 Tesla whole body scanner (GE Medical Systems) on the same day as the cognitive testing session (see Supplementary Methods for further details). Functional connectivity preprocessing was performed with Statistical Parametric Mapping 8 (SPM8) and CONN Toolboxes as previously described (Kesler et al., 2013, 2014, 2017a; Kesler and Blayney, 2016). The resulting connectivity matrices were binarized to minimum connection density and submitted to graph theoretical analysis using our Brain Networks Toolbox1. As in our previous studies, 90 regions of interest (ROIs) were defined using the Automated Anatomical Labeling Atlas (Tzourio-Mazoyer et al., 2002) and we measured the clustering coefficient of each brain ROI. Clustering coefficient is the ratio of connections to all possible connections among a region’s neighbors (Rubinov and Sporns, 2010). We have previously demonstrated significant clustering deficits in patients with breast and other cancers (Bruno et al., 2012; Hosseini et al., 2012; Kesler et al., 2015, 2016).
Incidence of cognitive impairment was compared between groups using a two-sample test for equality of proportions (Chi squared, two-tailed). Change in CAD and Adjustment Index scores were evaluated using paired t-test.
For random forest classification, the square root of the number of features were split at each node and an ensemble of 500 trees was grown by bootstrapping the features with replacement. Feature selection/reduction was conducted on a training set (A + B) consisting of a 75% random sample of the breast cancer group obtained after stratified class sampling. Recursive feature elimination was used to remove minimally contributing features and optimize the models. Recursive feature elimination was conducted on this set with A = training data and B = testing data with leave-one-out cross-validation across 100 random partitions of A and B. Features that provided the best accuracy across these partitions were used to re-train a model on A + B with out-of-bag error estimation (Liaw and Wiener, 2002). The resulting model was then applied to the held-out 25% of the breast cancer group to test prediction accuracy.
We tested three different models (Figure 1). Model 1 included only the following baseline patient and medical features: age, education, cancer stage at diagnosis, minority status, menopausal status and CAD score. Model 2 combined the above patient/medical features with clustering coefficients for three brain regions; right middle orbitofrontal gyrus, right inferior parietal lobule (RIPL) and right mesial superior frontal gyrus. We previously showed these regions to have subtly altered clustering prior to treatment in patients with breast cancer (Kesler et al., 2017a). For Model 3, we tested an expanded feature set that included clustering coefficients for all major cortical and subcortical regions (N = 90) in addition to patient/medical features.
Figure 1. Random forest models. We tested and compared three different random forest models for predicting 1 year post-chemotherapy cognitive outcome from pre-treatment data. ROIs, connectome regions of interest.
The significance of model accuracy was evaluated using a two-sided exact binomial test in addition to the area under the curve (AUC) of the receiver operating characteristic (ROC). Feature importance was determined using mean decrease in Gini index (Wright et al., 2016; Kesler et al., 2017b). To determine the most accurate model, we compared model AUCs using the bootstrapping method described by Hanley and McNeil (1983).
Brain regions identified as important predictors were evaluated for network hub status based on degree, betweenness centrality and/or clustering coefficient greater than 1 standard deviation above network mean (Sporns et al., 2007). We also evaluated modularity to provide insight regarding hub relationships. Modularity involves decomposing the brain into non-overlapping groups of regions (modules) that have maximal within-group connections and minimal between-group connections (Sporns and Betzel, 2016). Hubs were further classified as provincial or connector type based on module participation coefficient per previously established criteria (Guimerà and Amaral, 2005; Sporns et al., 2007).
Because there is no standard definition of cognitive impairment, we supplemented classification analysis with random forest regression to determine if features identified by classification could accurately predict individual cognitive test z-scores at 1 year follow-up. Regression model accuracy was determined using the adjusted R squared statistic. Feature importance for regression models was determined using percent increase in mean squared error. All statistical analyses were performed in the R Statistical Package (R Foundation) including the “randomForest”, “caret” and “pROC” libraries.
Patients with breast cancer demonstrated 55% (N = 17/31) incidence of cognitive impairment and healthy controls demonstrated 26% (N = 11/43). The difference in incidence was significant (X2 = 6.56, p = 0.010). Of those impaired in the breast cancer group, 59% (N = 10/17) had persistent impairment while 41% (N = 7/17) had late onset impairment. Depression, anxiety and fatigue decreased over time based on patients’ self-ratings, but not significantly (p > 0.725) and was not clinically elevated at any time point. Self-rating of functional capacity decreased over time but not significantly (p > 0.193). Cognitive testing and self-report data are presented in Table 2 and Supplementary Table S1.
Predicting Future Cognitive Impairment
Model 1: the final model retained all six features and performed with 71% accuracy (p = 0.453, ROC = 0.67, sensitivity = 0.75, specificity = 0.67, Figure 2). Model 2: the final model retained the three brain regions as well as age, minority status, menopausal status and CAD and demonstrated 88% accuracy (p = 0.070, ROC = 0.75, sensitivity = 1.0, specificity = 0.75, Figure 2). Model 3: the final model retained five brain regions (left lingual gyrus, left calcarine, right insula, right middle temporal gyrus and right olfactory area) but no patient/medical features and performed with 100% accuracy (p = 0.008, ROC = 1.0, sensitivity = 1.0, specificity = 1.0, Figure 2).
Figure 2. Random forest classification of cognitive impairment at 1 year post-chemotherapy follow-up from pre-treatment predictors. Model 1: patient and medical variables; Model 2: clustering coefficients from a priori brain regions and patient/medical variables; Model 3: clustering coefficients from 90 brain regions and patient/medical variables. Top row shows receiver operating characteristic (ROC) curve. Bottom row shows relative feature importance. Features displayed are those retained after recursive feature elimination. Higher mean decrease in Gini index indicates greater importance of that feature in the model. CAD, Clinical Assessment of Depression; RMIOFC, right middle orbitofrontal cortex; RIPL, right inferior parietal; RMESF, right mesial superior frontal; RMTG, right middle temporal; LCAL, left calcarine; RINS, right insula; LLING, left lingual; ROLF, right olfactory.
Comparing Classification Models
The AUC of Model 3 was significantly greater than the AUCs of Models 1 (z = 2.83, p = 0.005) and 2 (z = 2.42, p = 0.015). Models 1 and 2 were not significantly different (z = 0.514, p = 0.607).
Predicting Future Cognitive Test Scores
Using the five brain regions from Model 3 above, individual RAVLT verbal learning scores were accurately predicted with an adjusted R2 = 0.79 (p < 0.0001). RAVLT verbal retention scores were predicted with an adjusted R2 = 0.78 (p < 0.0001). The model for CTMT Trail 1 scores had an adjusted R2 = 0.70 (p < 0.0001). CTMT Trail 5 scores were predicted at adjusted R2 = 0.75 (p < 0.0001). The model for COWA scores had an adjusted R2 = 0.64 (p < 0.0001). The relative contributions of the five clustered connectivity features to these regression models are provided in Supplementary Table S2.
Characteristics of Predictive Brain Regions
As shown in Table 3, all but two of the eight brain regions included in Models 2 and 3 were categorized as hubs (e.g., globally connected regions) and all were connector type hubs. Modularity analysis indicated that all regions from Model 2 were in the default mode network while regions from Model 3 were members of other networks including sensory/motor, executive/attention and salience networks (Figure 3).
Figure 3. Brain regions whose clustering coefficients at pre-treatment predicted cognitive impairment at 1 year post-chemotherapy follow-up. Regions are shown as spheres with color indicating module membership. Labeled regions are those that were included in random forest classification models and found to be predictive of cognitive impairment. RMIOF, right middle orbitofrontal; RIPL, right inferior parietal; RMESF, right mesial superior frontal; RMTG, right middle temporal; LCAL, left calcarine; RINS, right insula; LLING, left lingual; ROLF, right olfactory.
The aim of this study was to determine if baseline, pre-treatment resting state fMRI could be used to accurately predict long-term cognitive outcome in chemotherapy-treated patients with breast cancer. Based on our prior work, we measured clustering coefficient, a characteristic of brain network connectivity obtained from resting state fMRI data. We observed that most patients (55%) demonstrated cognitive impairment at 1 year post-chemotherapy follow-up. This incidence is consistent with previous studies (Wefel et al., 2010, 2015). We examined three different classification models for predicting this impairment: Model 1 included only patient/medical variables, Model 2 combined patient/medical variables with clustering coefficients from selected, a priori regions, and Model 3 included the entire brain with patient/medical variables.
Model 1 results indicated that patient and medical factors, particularly age and CAD score, were independently useful at predicting impairment with 71% accuracy, although specificity was suboptimal and the overall model was not significant. The addition of clustered connectivity data improved accuracy to 85% with increased specificity, though this improvement was not significant. Model 3 performed with perfect sensitivity and specificity and included only clustered connectivity of brain regions selected in a data driven manner. Model 3 was significantly more accurate than Models 1 and 2. Further, regression models using Model 3 features were associated with significant adjusted R2 values for predicting individual test scores suggesting that the model may be relatively robust to impairment definition. Perfect (100%) accuracy in machine learning applications is rare but not unprecedented when neuroimaging features are included (Gothelf et al., 2011; Marzelli et al., 2011; Weygandt et al., 2012; Zhang et al., 2013). However, our model was built using a small sample and therefore requires subsequent validation.
Many of the regions selected in Models 2 and 3 have been noted to be altered in prior studies of breast cancer as well as other conditions that affect cognitive function (Kaiser et al., 2014; Kesler, 2014; Lepage et al., 2014; Stouten-Kemperman et al., 2015; Wang et al., 2016). These regions are known to be associated with the cognitive domains we measured. For example, orbitofrontal regions are involved in executive control and other cognitive processes (Nestor et al., 2015; Ohtani et al., 2017). The insula is a key region of the salience network, important for various functions including attention, language, interoception and social-emotional behaviors (Menon and Uddin, 2010; Seeley, 2010). Middle temporal gyrus supports memory, language, semantic and visual processing, among others, and is part of the ventral attention network (Deslauriers et al., 2017). One novel finding was the importance of the right olfactory area. Olfaction has a well-known and important role in memory through conditional and emotional learning systems (Mouly and Sullivan, 2010) and as a site of ongoing adult neurogenesis (Lledo and Valley, 2016). Cancer treatments, including chemotherapy and radiation interfere with neurogenesis (Monje and Dietrich, 2012) and have been associated with changes in olfactory function in patients with breast and other cancers (Steinbach et al., 2010).
Modularity analysis indicated that all Model 2 regions were part of the default mode network, consistent with prior studies (Fox et al., 2005; Seeley et al., 2007; Grayson and Fair, 2017). Default mode network subserves a wide variety of cognitive processes and is therefore characterized by high connectivity and functional activity (Hagmann et al., 2008; Cole et al., 2010; Lord et al., 2013). The “nodal stress” theory of neurodegeneration suggests that high traffic regions, like hubs of the default mode network, are more vulnerable to aging, disease and injury (Zhou et al., 2012). Our findings suggest that certain default mode network hubs are injured by breast cancer and this injury does not adequately recover over time and/or is exacerbated by adjuvant therapies such that it is associated with long-term cognitive impairment.
The regions in Model 3 most accurately predicted outcome but had no overlap with those in Model 2; we did not previously observe them to be different between patients with breast cancer and healthy controls at pre-treatment baseline (Kesler et al., 2017a). Whereas Model 2 regions were members of default mode network, Model 3 regions were included in salience, executive/attention and sensory/motor networks. These regions may be very subtly vulnerable pre-treatment such that differences are difficult to detect and/or alternative methods are required to detect them. Otherwise, default mode network injury existing at pre-treatment baseline might extend to other brain networks via “trans-neuronal spread” (Zhou et al., 2012). The brain regions included in Model 2 were all identified as hubs; regions with high connectivity that are vital for network resilience and regulation of information flow (Vertes and Bullmore, 2015). Further, Model 2 regions were all connector type hubs, which, unlike provincial type hubs, form bridges between different networks (van den Heuvel and Sporns, 2013). Taken together, these findings suggest that the initial site of injury involves default mode network hubs that potentially spread the injury to other networks via their connector status. Most Model 3 regions were also identified as hubs and as such would be the most vulnerable areas of these “secondary” networks. These mechanisms were not the primary focus of this study and therefore further investigation of longitudinal changes in connectome organization is required.
The main limitation of this study is the small sample size which can result in model over-fitting. We conducted random forest modeling using a conservative approach that included cross-validation and careful separation of training and testing samples. However, further evaluation of our models’ validity requires a new, unseen and larger sample of patients to which we can apply our algorithms. We are currently acquiring such a sample and have also made our algorithms available for others to apply to their own data as appropriate2. Other considerations include our choice of brain parcellation scheme and connectome property. The 90 AAL parcellation is one of the most commonly used and is the one we have employed previously in our studies of chemotherapy-related cognitive impairment (Kesler et al., 2015, 2016, 2017a,b; Amidi et al., 2017). As noted above, we focused on clustering coefficient because we have shown this connectome property to be the most consistently altered in patients with breast cancer. Future studies with larger samples could include evaluation of alternative connectome properties to determine if they improve predictive models. These might include connectome properties derived from other neuroimaging modalities such as diffusion tensor imaging (DTI), for example. We used a cognitive testing battery and impairment definition recommended by the ICCTF to increase consistency across studies of cancer and cognition including data pooling initiatives. Further investigation is required to examine the effects of alternate tests and impairment categories. Finally, other machine learning approaches may yield different results. For example, support vector machine (SVM) is a common method used in neuroimaging studies and we have previously demonstrated its usefulness for distinguishing chemotherapy-treated from chemotherapy naïve patients (Kesler et al., 2013). However, SVMs are much more difficult to interpret than random forest models, particularly in terms of feature importance. Feature importance contributed to evaluation of our hypothesis regarding relative importance of patient/medical vs. neuroimaging features and was essential for understanding specific brain network patterns involved in cognitive impairment. Future studies could include comparison of different machine learning approaches, which was beyond the scope of this preliminary study.
Patients undergoing chemotherapy tend to be monitored for various toxicities including cardiac, hepatic and hematologic problems, among others. Given the high incidence of cognitive impairments, it seems reasonable that neurologic monitoring be included as well. We have demonstrated that this impairment can potentially be predicted from baseline, pre-treatment data. Resting state fMRI may be a particularly promising tool for this purpose, improving our ability to identify patients at risk for long-term cancer-related brain injury. If inclusion of resting state fMRI data continues to result in the most accurate predictions of future cognitive outcome, we have already demonstrated that it is feasible to obtain these data from patients pre-treatment. Connectome metrics derived from resting state fMRI show good to excellent test-retest reliability (Braun et al., 2012; Cao et al., 2014; Termenon et al., 2016). Our resting state fMRI acquisition required only 7 min making this scan a practical possibility. Prediction of cognitive outcome could inform treatment decision-making and prioritize patients for early intervention. With further validation, our findings could support the use of one of our algorithms as standard of care for patients with breast cancer to determine risk for cognitive neurotoxicity.
SRK designed the study, wrote the Matlab code for graph theoretical analysis and the R code for random forest analysis, conducted the statistical analyses and prepared the manuscript. AR assisted with R code and statistical analyses and edited the manuscript. IAO-G and MK edited the manuscript. DWB provided oncology consultation, assisted with participant recruitment and edited the manuscript. OP supervised participant recruitment, study coordination and data acquisition and edited the manuscript.
This research was funded by a grant from the National Cancer Institute (National Institutes of Health, 1R01CA172145: SRK).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors wish to thank the faculty and staff at the Stanford University Richard M. Lucas Center for their assistance with neuroimaging acquisitions, Marjorie Adams, Melissa Packer and Bingjie Tong for assisting with study coordination and data management and Vikram Rao for assistance with neuroimaging preprocessing.
- ^ https://github.com/srkesler/bnets.git
- ^ https://www.dropbox.com/sh/df3akk7wr4wl2uv/AACYvBdZnnThXLqT-3OQgJxla?dl=0
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnhum.2017.00555/full#supplementary-material
Aghakhani, A., and Chan, E. K. (2007). Test Reviews: Bracken, B. A., and Howell, K. (2004). Clinical assessment of depression. Odessa, FL: Psychological Assessment Resources. J. Psychoeduc. Assess. 25, 416–422. doi: 10.1177/0734282907300383
Amidi, A., Hadi Hosseini, S. M., Leemans, A., Kesler, S. R., Agerbæk, M., Wu, L. M., et al. (2017). Changes in brain structural networks and cognitive functions in testicular cancer patients receiving cisplatin-based chemotherapy. J. Natl. Cancer Inst. 109:djx085. doi: 10.1093/jnci/djx085
Braun, U., Plichta, M. M., Esslinger, C., Sauer, C., Haddad, L., Grimm, O., et al. (2012). Test-retest reliability of resting-state connectivity network characteristics using fMRI and graph theoretical measures. Neuroimage 59, 1404–1412. doi: 10.1016/j.neuroimage.2011.08.044
Breiman, L. (2001). Random forests. Mach. Learn. 45, 5–32. doi: 10.1023/A:1010933404324
Breiman, L., Friedman, J., Olshen, R., and Stone, C. (1984). Classification and Regression Trees. New York, NY: Chapman and Hall.
Bruno, J., Hosseini, S. M., and Kesler, S. (2012). Altered resting state functional brain network topology in chemotherapy-treated breast cancer survivors. Neurobiol. Dis. 48, 329–338. doi: 10.1016/j.nbd.2012.07.009
Buckner, R. L., Krienen, F. M., and Yeo, B. T. (2013). Opportunities and limitations of intrinsic functional connectivity MRI. Nat. Neurosci. 16, 832–837. doi: 10.1038/nn.3423
Cao, H., Plichta, M. M., Schäfer, A., Haddad, L., Grimm, O., Schneider, M., et al. (2014). Test-retest reliability of fMRI-based graph theoretical properties during working memory, emotion processing, and resting state. Neuroimage 84, 888–900. doi: 10.1016/j.neuroimage.2013.09.013
Cole, M. W., Pathak, S., and Schneider, W. (2010). Identifying the brain’s most globally connected regions. Neuroimage 49, 3132–3148. doi: 10.1016/j.neuroimage.2009.11.001
Deslauriers, J., Ansado, J., Marrelec, G., Provost, J. S., and Joanette, Y. (2017). Increase of posterior connectivity in aging within the Ventral Attention Network: a functional connectivity analysis using independent component analysis. Brain Res. 1657, 288–296. doi: 10.1016/j.brainres.2016.12.017
Dumas, J. A., Makarewicz, J., Schaubhut, G. J., Devins, R., Albert, K., Dittus, K., et al. (2013). Chemotherapy altered brain functional connectivity in women with breast cancer: a pilot study. Brain Imaging Behav. 7, 524–532. doi: 10.1007/s11682-013-9244-1
Fox, M. D., and Greicius, M. (2010). Clinical applications of resting state functional connectivity. Front. Syst. Neurosci. 4:19. doi: 10.3389/fnsys.2010.00019
Fox, M. D., Snyder, A. Z., Vincent, J. L., Corbetta, M., Van Essen, D. C., and Raichle, M. E. (2005). The human brain is intrinsically organized into dynamic, anticorrelated functional networks. Proc. Natl. Acad. Sci. U S A 102, 9673–9678. doi: 10.1073/pnas.0504136102
Gothelf, D., Hoeft, F., Ueno, T., Sugiura, L., Lee, A. D., Thompson, P., et al. (2011). Developmental changes in multivariate neuroanatomical patterns that predict risk for psychosis in 22Q10.2 deletion syndrome. J. Psychiatr. Res. 45, 322–331. doi: 10.1016/j.jpsychires.2010.07.008
Grayson, D. S., and Fair, D. A. (2017). Development of large-scale functional networks from birth to adulthood: a guide to neuroimaging literature. Neuroimage doi: 10.1016/j.neuroimage.2017.01.079 [Epub ahead of print].
Guimerà, R., and Amaral, L. A. (2005). Cartography of complex networks: modules and universal roles. J. Stat. Mech. 2005:nihpa35573. doi: 10.1088/1742-5468/2005/02/p02001
Hagmann, P., Cammoun, L., Gigandet, X., Meuli, R., Honey, C. J., Wedeen, V. J., et al. (2008). Mapping the structural core of human cerebral cortex. PLoS Biol. 6:e159. doi: 10.1371/journal.pbio.0060159
Hanley, J. A., and McNeil, B. J. (1983). A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 148, 839–843. doi: 10.1148/radiology.148.3.6878708
Hoeft, F., McCandliss, B. D., Black, J. M., Gantman, A., Zakerani, N., Hulme, C., et al. (2011). Neural systems predicting long-term outcome in dyslexia. Proc. Natl. Acad. Sci. U S A 108, 361–366. doi: 10.1073/pnas.1008950108
Hosseini, S. M., Koovakkattu, D., and Kesler, S. R. (2012). Altered small-world properties of gray matter networks in breast cancer. BMC Neurol. 12:28. doi: 10.1186/1471-2377-12-28
Jordan, M. I., and Mitchell, T. M. (2015). Machine learning: trends, perspectives, and prospects. Science 349, 255–260. doi: 10.1126/science.aaa8415
Kaiser, J., Bledowski, C., and Dietrich, J. (2014). Neural correlates of chemotherapy-related cognitive impairment. Cortex 54, 33–50. doi: 10.1016/j.cortex.2014.01.010
Kesler, S. R. (2014). Default mode network as a potential biomarker of chemotherapy-related brain injury. Neurobiol. Aging 35, S11–S19. doi: 10.1016/j.neurobiolaging.2014.03.036
Kesler, S. R., Adams, M., Packer, M., Rao, V., Henneghan, A. M., Blayney, D. W., et al. (2017a). Disrupted brain network functional dynamics and hyper-correlation of structural and functional connectome topology in patients with breast cancer prior to treatment. Brain Behav. 7:e00643. doi: 10.1002/brb3.643
Kesler, S. R., Noll, K., Cahill, D. P., Rao, G., and Wefel, J. S. (2017b). The effect of IDH1 mutation on the structural connectome in malignant astrocytoma. J. Neurooncol. 131, 565–574. doi: 10.1007/s11060-016-2328-1
Kesler, S. R., and Blayney, D. W. (2014). Mobile cognitive assessment battery (MCAB) for assessment of cancer-related cognitive changes. J. Clin. Oncol. 32:9571. doi: 10.1007/978-3-319-50604-3_5
Kesler, S. R., and Blayney, D. W. (2016). Neurotoxic effects of anthracycline- vs. nonanthracycline-based chemotherapy on cognition in breast cancer survivors. JAMA Oncol. 2, 185–192. doi: 10.1001/jamaoncol.2015.4333
Kesler, S. R., Gugel, M., Huston-Warren, E., and Watson, C. (2016). Atypical structural connectome organization and cognitive impairment in young survivors of acute lymphoblastic leukemia. Brain Connect. 6, 273–282. doi: 10.1089/brain.2015.0409
Kesler, S. R., Gugel, M., Pritchard-Berman, M., Lee, C., Kutner, E., Hosseini, S. M., et al. (2014). Altered resting state functional connectivity in young survivors of acute lymphoblastic leukemia. Pediatr. Blood Cancer 61, 1295–1299. doi: 10.1002/pbc.25022
Kesler, S. R., Watson, C. L., and Blayney, D. W. (2015). Brain network alterations and vulnerability to simulated neurodegeneration in breast cancer. Neurobiol. Aging 36, 2429–2442. doi: 10.1016/j.neurobiolaging.2015.04.015
Kesler, S. R., Wefel, J. S., Hosseini, S. M., Cheung, M., Watson, C. L., and Hoeft, F. (2013). Default mode network connectivity distinguishes chemotherapy-treated breast cancer survivors from controls. Proc. Natl. Acad. Sci. U S A 110, 11600–11605. doi: 10.1073/pnas.1214551110
Lepage, C., Smith, A. M., Moreau, J., Barlow-Krelina, E., Wallis, N., Collins, B., et al. (2014). A prospective study of grey matter and cognitive function alterations in chemotherapy-treated breast cancer patients. Springerplus 3:444. doi: 10.1186/2193-1801-3-444
Liaw, A., and Wiener, M. (2002). Classification and regression by random forest. R News 2, 18–22.
Lledo, P. M., and Valley, M. (2016). Adult olfactory bulb neurogenesis. Brain Behav. Immun. 8:a018945. doi: 10.1101/cshperspect.a018945
Lord, L. D., Expert, P., Huckins, J. F., and Turkheimer, F. E. (2013). Cerebral energy metabolism and the brain’s functional network architecture: an integrative review. J. Cereb. Blood Flow Metab. 33, 1347–1354. doi: 10.1038/jcbfm.2013.94
Marzelli, M. J., Hoeft, F., Hong, D. S., and Reiss, A. L. (2011). Neuroanatomical spatial patterns in Turner syndrome. Neuroimage 55, 439–447. doi: 10.1016/j.neuroimage.2010.12.054
Menon, V., and Uddin, L. Q. (2010). Saliency, switching, attention and control: a network model of insula function. Brain Struct. Funct. 214, 655–667. doi: 10.1007/s00429-010-0262-0
Monje, M., and Dietrich, J. (2012). Cognitive side effects of cancer therapy demonstrate a functional role for adult neurogenesis. Behav. Brain Res. 227, 376–379. doi: 10.1016/j.bbr.2011.05.012
Moradi, E., Pepe, A., Gaser, C., Huttunen, H., Tohka, J., and Alzheimer’s Disease Neuroimaging Initiative (2015). Machine learning framework for early MRI-based Alzheimer’s conversion prediction in MCI subjects. Neuroimage 104, 398–412. doi: 10.1016/j.neuroimage.2014.10.002
Moses, J. (2004). Comprehensive Trail Making Test (CTMT)By Cecil R. Reynolds. Austin, Texas: PRO-ED, Inc., 2002. Arch. Clin. Neuropsychol. 19, 703–708. doi: 10.1016/j.acn.2004.02.004
Mouly, A. M., and Sullivan, R. (2010). “Memory and plasticity in the olfactory system: from infancy to adulthood,” in The Neurobiology of Olfaction, ed. A. Menini (Boca Raton, FL: CRC Press), 367–394.
Nestor, P. G., Nakamura, M., Niznikiewicz, M., Levitt, J. J., Newell, D. T., Shenton, M. E., et al. (2015). Attentional control and intelligence: MRI orbital frontal gray matter and neuropsychological correlates. Behav. Neurol. 2015:354186. doi: 10.1155/2015/354186
Ohtani, T., Nestor, P. G., Bouix, S., Newell, D., Melonakos, E. D., McCarley, R. W., et al. (2017). Exploring the neural substrates of attentional control and human intelligence: diffusion tensor imaging of prefrontal white matter tractography in healthy cognition. Neuroscience 341, 52–60. doi: 10.1016/j.neuroscience.2016.11.002
Raichle, M. E. (2011). The restless brain. Brain Connect. 1, 3–12. doi: 10.1089/brain.2011.0019
Richiardi, J., Altmann, A., Milazzo, A.-C., Chang, C., Chakravarty, M. M., Banaschewski, T., et al. (2015). Correlated gene expression supports synchronous activity in brain networks. Science 348, 1241–1244. doi: 10.1126/science.1255905
Rubinov, M., and Sporns, O. (2010). Complex network measures of brain connectivity: uses and interpretations. Neuroimage 52, 1059–1069. doi: 10.1016/j.neuroimage.2009.10.003
Ruff, R. M., Light, R. H., Parker, S. B., and Levin, H. S. (1996). Benton controlled oral word association test: reliability and updated norms. Arch. Clin. Neuropsychol. 11, 329–338. doi: 10.1016/0887-6177(95)00033-x
Sachs-Ericsson, N., and Blazer, D. G. (2015). The new DSM-5 diagnosis of mild neurocognitive disorder and its relation to research in mild cognitive impairment. Aging Ment. Health 19, 2–12. doi: 10.1080/13607863.2014.920303
Saykin, A. J., de Ruiter, M. B., McDonald, B. C., Deprez, S., and Silverman, D. H. (2013). Neuroimaging biomarkers and cognitive function in non-CNS cancer and its treatment: current status and recommendations for future research. Brain Imaging Behav. 7, 363–373. doi: 10.1007/s11682-013-9283-7
Schmidt, M. (2012). Rey Auditory Verbal Learning Test (RAVLT): A Handbook. Lutz, FL: Psychological Assessment Resources.
Seeley, W. W. (2010). Anterior insula degeneration in frontotemporal dementia. Brain Struct. Funct. 214, 465–475. doi: 10.1007/s00429-010-0263-z
Seeley, W. W., Menon, V., Schatzberg, A. F., Keller, J., Glover, G. H., Kenna, H., et al. (2007). Dissociable intrinsic connectivity networks for salience processing and executive control. J. Neurosci. 27, 2349–2356. doi: 10.1523/JNEUROSCI.5587-06.2007
Shirer, W. R., Ryali, S., Rykhlevskaia, E., Menon, V., and Greicius, M. D. (2012). Decoding subject-driven cognitive states with whole-brain connectivity patterns. Cereb. Cortex 22, 158–165. doi: 10.1093/cercor/bhr099
Sporns, O., and Betzel, R. F. (2016). Modular brain networks. Annu. Rev. Psychol. 67, 613–640. doi: 10.1146/annurev-psych-122414-033634
Sporns, O., Honey, C. J., and Kötter, R. (2007). Identification and classification of hubs in brain networks. PLoS One 2:e1049. doi: 10.1371/journal.pone.0001049
Steinbach, S., Hundt, W., Zahnert, T., Berktold, S., Böhner, C., Gottschalk, N., et al. (2010). Gustatory and olfactory function in breast cancer patients. Support. Care Cancer 18, 707–713. doi: 10.1007/s00520-009-0672-9
Stouten-Kemperman, M. M., de Ruiter, M. B., Koppelmans, V., Boogerd, W., Reneman, L., and Schagen, S. B. (2015). Neurotoxicity in breast cancer survivors ≥10 years post-treatment is dependent on treatment type. Brain Imaging Behav. 9, 275–284. doi: 10.1007/s11682-014-9305-0
Strangman, G. E., O’Neil-Pirozzi, T. M., Supelana, C., Goldstein, R., Katz, D. I., and Glenn, M. B. (2010). Regional brain morphometry predicts memory rehabilitation outcome after traumatic brain injury. Front. Hum. Neurosci. 4:182. doi: 10.3389/fnhum.2010.00182
Strobl, C., Boulesteix, A. L., Zeileis, A., and Hothorn, T. (2007). Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinformatics 8:25. doi: 10.1186/1471-2105-8-25
Termenon, M., Jaillard, A., Delon-Martin, C., and Achard, S. (2016). Reliability of graph analysis of resting state fMRI using test-retest dataset from the Human Connectome Project. Neuroimage 142, 172–187. doi: 10.1016/j.neuroimage.2016.05.062
Thompson, D. G., Kesler, S. R., Sudheimer, K., Mehta, K. M., Thompson, L. W., Marquett, R. M., et al. (2015). FMRI activation during executive function predicts response to cognitive behavioral therapy in older, depressed adults. Am. J. Geriatr. Psychiatry 23, 13–22. doi: 10.1016/j.jagp.2014.02.001
Tzourio-Mazoyer, N., Landeau, B., Papathanassiou, D., Crivello, F., Etard, O., Delcroix, N., et al. (2002). Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage 15, 273–289. doi: 10.1006/nimg.2001.0978
van den Heuvel, M. P., and Sporns, O. (2013). Network hubs in the human brain. Trends Cogn. Sci. 17, 683–696. doi: 10.1016/j.tics.2013.09.012
Vardy, J. L., Stouten-Kemperman, M. M., Pond, G., Booth, C. M., Rourke, S. B., Dhillon, H. M., et al. (2017). A mechanistic cohort study evaluating cognitive impairment in women treated for breast cancer. Brain Imaging Behav. doi: 10.1007/s11682-017-9728-5 [Epub ahead of print].
Vertes, P. E., and Bullmore, E. T. (2015). Annual research review: growth connectomics—the organization and reorganization of brain networks during normal and abnormal development. J. Child Psychol. Psychiatry 56, 299–320. doi: 10.1111/jcpp.12365
Wang, L., Yan, Y., Wang, X., Tao, L., Chen, Q., Bian, Y., et al. (2016). Executive function alternations of breast cancer patients after chemotherapy: evidence from resting-state functional MRI. Acad. Radiol. 23, 1264–1270. doi: 10.1016/j.acra.2016.05.014
Wefel, J. S., Kesler, S. R., Noll, K. R., and Schagen, S. B. (2015). Clinical characteristics, pathophysiology, and management of noncentral nervous system cancer-related cognitive impairment in adults. CA Cancer J. Clin. 65, 123–138. doi: 10.3322/caac.21258
Wefel, J. S., Saleeba, A. K., Buzdar, A. U., and Meyers, C. A. (2010). Acute and late onset cognitive dysfunction associated with chemotherapy in women with breast cancer. Cancer 116, 3348–3356. doi: 10.1002/cncr.25708
Wefel, J. S., Vardy, J., Ahles, T., and Schagen, S. B. (2011). International cognition and cancer task force recommendations to harmonise studies of cognitive function in patients with cancer. Lancet Oncol. 12, 703–708. doi: 10.1016/s1470-2045(10)70294-1
Weygandt, M., Blecker, C. R., Schäfer, A., Hackmack, K., Haynes, J. D., Vaitl, D., et al. (2012). fMRI pattern recognition in obsessive-compulsive disorder. Neuroimage 60, 1186–1193. doi: 10.1016/j.neuroimage.2012.01.064
Wright, M. N., Ziegler, A., and König, I. R. (2016). Do little interactions get lost in dark random forests? BMC Bioinformatics 17:145. doi: 10.1186/s12859-016-0995-8
Zhang, D., Liu, B., Chen, J., Peng, X., Liu, X., Fan, Y., et al. (2013). Determination of vascular dementia brain in distinct frequency bands with whole brain functional connectivity patterns. PLoS One 8:e54512. doi: 10.1371/journal.pone.0054512
Zhou, J., Gennatas, E. D., Kramer, J. H., Miller, B. L., and Seeley, W. W. (2012). Predicting regional neurodegeneration from the healthy brain functional connectome. Neuron 73, 1216–1227. doi: 10.1016/j.neuron.2012.03.004
Keywords: breast cancer, chemotherapy, connectome, resting state fMRI, random forest, cognition
Citation: Kesler SR, Rao A, Blayney DW, Oakley-Girvan IA, Karuturi M and Palesh O (2017) Predicting Long-Term Cognitive Outcome Following Breast Cancer with Pre-Treatment Resting State fMRI and Random Forest Machine Learning. Front. Hum. Neurosci. 11:555. doi: 10.3389/fnhum.2017.00555
Received: 05 June 2017; Accepted: 01 November 2017;
Published: 15 November 2017.
Edited by:Irina Strigo, University of California, San Francisco, United States
Reviewed by:Huaidong Cheng, Second Hospital of Anhui Medical University, China
Xuntao Yin, Third Military Medical University, China
Copyright © 2017 Kesler, Rao, Blayney, Oakley-Girvan, Karuturi and Palesh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shelli R. Kesler, firstname.lastname@example.org