Prediction of Clinical Outcomes in Acute Ischaemic Stroke Patients: A Comparative Study

Rajashekar, Deepthi; Hill, Michael D.; Demchuk, Andrew M.; Goyal, Mayank; Fiehler, Jens; Forkert, Nils D.

doi:10.3389/fneur.2021.663899

BRIEF RESEARCH REPORT article

Front. Neurol., 06 May 2021

Sec. Stroke

Volume 12 - 2021 | https://doi.org/10.3389/fneur.2021.663899

This article is part of the Research Topic: Machine Learning in Action: Stroke Diagnosis and Outcome Prediction

Deepthi Rajashekar^1,2,3,*

Michael D. Hill^2,3,4,5,6

Andrew M. Demchuk^2,5

Mayank Goyal^2,5

Jens Fiehler^7

Nils D. Forkert^2,3,4,8

¹Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB, Canada
²Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
³Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
⁴Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
⁵Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
⁶Department of Community Health Sciences, University of Calgary, Calgary, AB, Canada
⁷Department of Diagnostic and Interventional Neuroradiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
⁸Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, Canada

Background: Clinical stroke rehabilitation decision making relies on multi-modal data, including imaging and other clinical assessments. However, most previously described methods for predicting long-term stroke outcomes do not make use of the full multi-modal data available. The aim of this work was to develop and evaluate the benefit of nested regression models that utilise clinical assessments as well as image-based biomarkers to model 30-day NIHSS.

Method: A total of 221 subjects with follow-up MRI or CT scans were pooled from two prospective trials, with NIHSS assessed at baseline as well as 48 hours and 30 days after symptom onset. Three prediction models for 30-day NIHSS were developed using support vector regression: one clinical model based on modifiable and non-modifiable risk factors (M_CLINICAL) and two nested regression models that aggregate clinical and image-based features and differ in the method used to select important brain regions for the modelling task. The first nested model used the widely accepted RReliefF machine learning method for this purpose (M_RELIEF), while the second employed a lesion-symptom mapping technique (M_LSM) often used in neuroscience to investigate structure-function relationships and identify eloquent regions in the brain.

Results: The two nested models achieved a similar performance while considerably outperforming the clinical model. However, M_RELIEF required fewer brain regions and achieved a lower mean absolute error than M_LSM while being less computationally expensive.

Conclusion: Aggregating clinical and imaging information leads to considerably better outcome prediction models. While lesion-symptom mapping is a useful tool to investigate structure-function relationships of the brain, it does not lead to better outcome predictions compared to a simple data-driven feature selection approach, which is less computationally expensive and easier to implement.

Introduction

The prognosis of clinical and functional outcome in acute ischemic stroke patients is typically made based on multi-modal information such as demographic, clinical, laboratory, and radiological data. Theoretically, machine learning models can identify patterns in high-dimensional data that can be used to make data-driven and reproducible stroke outcome predictions in new patients and support patient management. However, despite the ability to integrate multimodal information, recent machine learning models have mostly utilized clinical data or image-based biomarkers alone (1) to predict stroke outcome. So far, the benefit of using true multi-modal data for stroke outcome prediction has not been investigated comprehensively. One of the few multi-modal predictive models of stroke outcome is described by Brugnara et al. (2). However, that model uses clinical assessments at various timepoints as input features without addressing the issue of feature collinearity. Furthermore, previous studies often predict the stroke outcome in a binary classification scheme (good vs. bad), which ignores the incremental, yet relevant non-linear differences in stroke severity scores.

Integration of image-based biomarkers for stroke outcome prediction is in most cases more complex than using other clinical assessments, but has the potential to add considerable predictive power. A key aspect to consider within this context is the selection of regions-of-interest (ROIs) in the brain that are critically associated with the clinical deficit of interest, since non-informative and redundant features can degrade the prediction accuracy considerably (3). Lesion-symptom mapping (LSM) (4) is able to identify brain regions that are important for a clinical outcome score of interest but has rarely been used to select brain regions for stroke outcome prediction (5). The more common ROI selection approach is to use classical feature selection methods during the training process. However, these two general approaches have not been compared to date with respect to stroke outcome prediction.

The aim of this work is to compare different setups of machine learning models for predicting the 30-day NIH Stroke Scale (NIHSS): a model using clinical information only, and nested models combining clinical features with radiological features selected using either lesion-symptom mapping or a classical feature selection method.

Methods

Data

The datasets used in this study were pooled from the ESCAPE (6) and iKNOW (7) trials. Patients with remote hemorrhages, bilateral lesions, and severe white matter hyperintensities were excluded from this secondary analysis, and only patients with a follow-up MRI or CT scan (18 hours to one week from baseline) and complete clinical information (obtained after stroke and up to 6 hours post randomization) were included, leading to a final sample of 221 patients. The clinical outcome of interest in this study is the NIHSS assessed at 30 days after stroke symptom onset. The patient characteristics are summarized in Table 1. The measurable clinical and laboratory features used in the nested regression model include age, sex, and the modifiable and non-modifiable risk factors suggested in the evidence-based review of stroke rehabilitation (8). These include blood pressure, glucose, hematocrit, hypertension, diabetes, smoking status, hyperlipidemia, and atrial fibrillation (see Supplementary Table 1). Additionally, the baseline (pre-treatment) NIHSS score was included as part of the clinical data to model stroke outcome (5, 9).

Table 1. Characteristics of patients pooled (N = 221) from the ESCAPE (6) and iKNOW (7) datasets.

All lesions were manually delineated by an expert observer using the ITK-SNAP tool. Each image sequence was skull stripped and non-linearly registered to the common FLAIR-NCCT atlas of the elderly (10) using the ANTs toolkit. The grey matter (GM) and white matter (WM) parcellations from the probabilistic BNA atlas (11) and the JHU atlas (12), respectively, were fused and transformed to the FLAIR-NCCT atlas. All image-based features were computed in the FLAIR-NCCT atlas space.

Model Design

Nested regression models were developed to predict the 30-day NIHSS outcome based on clinical data and image-based biomarkers. The first (inner) model predicts the 48-hour NIHSS using imaging features alone. The output of this model is then used together with the clinical features in a second (outer) model to predict the 30-day NIHSS.

\begin{array}{l} \mathrm{NIHSS}_{\text{30-days}} \sim (\mathrm{Features}_{\text{Clinical}} \\ + (\mathrm{NIHSS}_{\text{48-hours}} \sim \mathrm{Features}_{\text{Imaging}})) \end{array}
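The nesting in the formula above can be illustrated with a minimal sketch. It is not the implementation used in this study: to stay dependency-free, a nearest-neighbour regressor is swapped in for the radial-kernel SVR, the class names `NearestNeighbourRegressor` and `NestedRegressor` are hypothetical, the toy values are invented, and it is assumed that the outer model is trained on in-sample predictions of the inner model.

```python
import math


class NearestNeighbourRegressor:
    """Stand-in regressor (NOT an SVR): mean target of the k closest training rows."""

    def __init__(self, k=1):
        self.k = k

    def fit(self, X, y):
        self.X_ = [list(row) for row in X]
        self.y_ = list(y)
        return self

    def predict(self, X):
        preds = []
        for row in X:
            # Rank training rows by Euclidean distance to the query row.
            order = sorted(range(len(self.X_)), key=lambda i: math.dist(row, self.X_[i]))
            nearest = order[: self.k]
            preds.append(sum(self.y_[i] for i in nearest) / len(nearest))
        return preds


class NestedRegressor:
    """Inner model: imaging -> 48-hour NIHSS. Outer model: clinical + inner output -> 30-day NIHSS."""

    def __init__(self, make_regressor):
        self.inner = make_regressor()
        self.outer = make_regressor()

    def _stack(self, clinical, imaging):
        # Append the predicted 48-hour NIHSS to each patient's clinical features.
        nihss_48h_hat = self.inner.predict(imaging)
        return [list(c) + [p] for c, p in zip(clinical, nihss_48h_hat)]

    def fit(self, clinical, imaging, nihss_48h, nihss_30d):
        self.inner.fit(imaging, nihss_48h)
        self.outer.fit(self._stack(clinical, imaging), nihss_30d)
        return self

    def predict(self, clinical, imaging):
        return self.outer.predict(self._stack(clinical, imaging))


# Invented toy data: [age, baseline NIHSS] and two lesion-overlap features per patient.
clinical = [[70, 12], [55, 4], [81, 20]]
imaging = [[0.4, 0.1], [0.0, 0.05], [0.8, 0.6]]
nihss_48h = [9, 2, 18]
nihss_30d = [6, 1, 15]

model = NestedRegressor(lambda: NearestNeighbourRegressor(k=1))
model.fit(clinical, imaging, nihss_48h, nihss_30d)
predictions = model.predict(clinical, imaging)
```

Because the measured 48-hour NIHSS never enters the outer model directly, only its image-based estimate does. Any regressor exposing `fit` and `predict` could replace the stand-in.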

For both models, epsilon-regression was implemented in a radial-kernel support vector regression (SVR) framework. Using follow-up imaging acquired between 18 hours and 5 days from symptom onset to identify regions-of-interest (ROIs) that maximally correlate with a long-term assessment might introduce confounding effects and bias the results. Therefore, the ROIs included in the predictive models were identified with respect to the 48-hour NIHSS to ensure that the identified structure-function relationships are related to the primary stroke-induced deficits alone. This also ensures that the identified ROIs are not selected because of secondary comorbidities (not directly related to the primary stroke) developed either in-hospital or post-discharge.

The two approaches for ROI selection are: (i) the LSM method using the Brunner-Munzel test (13) and (ii) RReliefF (14), a widely accepted machine learning-based feature selection method that accounts for collinearity. The LSM method was implemented using the LESYMAP package (15) with its default parameters: a p-value threshold of 0.05, exclusion of voxels not injured in at least 10% of the sample data, and false discovery rate (FDR) correction for multiple comparisons, which controls the expected proportion of false positives among the voxels declared significant. For ease of comparison, brain regions that were not affected in at least 10% of the sample data were also removed prior to the RReliefF feature selection. The RReliefF feature selector was also employed using default parameters from the FSelector package (16), with a sample size of 10 and a neighbor count of five.

The result of the LSM is a statistical map of clusters of significant voxels that survive the FDR correction with non-zero voxel weights. Regions in the BNA-JHU parcellation that were assigned non-zero voxel weights by the LSM analysis were included as ROIs in the proposed regression analyses.
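To make the FDR correction step concrete, the following sketch implements the Benjamini-Hochberg step-up procedure, a common way of controlling the FDR. Which FDR variant LESYMAP applies internally is not stated here, so the choice of Benjamini-Hochberg is an assumption, and the function name and p-values are purely illustrative.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans: True where the test is declared significant at FDR level alpha."""
    m = len(p_values)
    # Rank p-values from smallest to largest, remembering their original positions.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose p-value satisfies p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    # Every test ranked at or below the cutoff is declared significant.
    significant = [False] * m
    for idx in order[:cutoff_rank]:
        significant[idx] = True
    return significant


# Invented voxel-wise p-values, one per tested voxel.
voxel_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.60, 0.74, 0.90]
survives_fdr = benjamini_hochberg(voxel_p, alpha=0.05)
uncorrected = [p < 0.05 for p in voxel_p]
```

In this toy example five voxels fall below the uncorrected 0.05 threshold, but only the two smallest p-values survive the correction, which illustrates how the correction can reduce the number of voxels that contribute to ROI selection.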

For each brain region identified by LSM as being important in the training set, the relative lesion overlap was computed and used as an image-based feature. Moreover, in the case of WM tracts, the cross-sectional width of the tract spared by the lesion was also calculated and used as an additional feature (17). Therefore, the final set of image-derived input features used in this study includes GM overlap, WM overlap, and WM tract integrity for all the selected ROIs.
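The relative lesion overlap described above can be sketched as the fraction of a region's voxels that lie inside the lesion. The sketch assumes hard (integer) region labels and a binary lesion mask that have already been brought into the same atlas space and flattened to one value per voxel. The probabilistic parcellation used in this study and the WM tract integrity feature are not covered, and the function name and toy arrays are hypothetical.

```python
from collections import Counter


def relative_lesion_overlap(atlas_labels, lesion_mask, background=0):
    """Fraction of each atlas region's voxels that fall inside the lesion mask.

    atlas_labels: flat sequence of integer region labels, one per voxel.
    lesion_mask:  flat sequence of 0/1 values aligned with atlas_labels.
    """
    if len(atlas_labels) != len(lesion_mask):
        raise ValueError("atlas and lesion mask must have the same number of voxels")
    region_size = Counter()
    lesioned = Counter()
    for label, hit in zip(atlas_labels, lesion_mask):
        if label == background:
            continue  # voxels outside the parcellation are ignored
        region_size[label] += 1
        if hit:
            lesioned[label] += 1
    return {label: lesioned[label] / size for label, size in region_size.items()}


# Invented 10-voxel example with three regions (labels 1, 2, 3) and background (0).
atlas = [0, 1, 1, 1, 1, 2, 2, 3, 3, 3]
lesion = [1, 1, 1, 0, 0, 0, 0, 1, 1, 1]
overlap = relative_lesion_overlap(atlas, lesion)
```

Here half of region 1 is lesioned, region 2 is spared, and region 3 is fully covered, so each selected ROI contributes one value between 0 and 1 to the feature vector.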

For RReliefF feature selection, the lesion overlap (GM and WM) and tract integrity (WM only) were calculated for each atlas region and used for feature selection based on the training set.
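Before feature selection, regions affected in fewer than 10% of the patients were removed, as described in the model design. A minimal sketch of such a prevalence filter is given below. It assumes that a region counts as affected when its relative lesion overlap is greater than zero, and the function name, region names, and values are illustrative only.

```python
def regions_meeting_prevalence(overlap_by_patient, min_fraction=0.10):
    """Keep regions that are lesioned (overlap > 0) in at least min_fraction of patients.

    overlap_by_patient: list of dicts mapping region name -> relative lesion overlap.
    """
    n_patients = len(overlap_by_patient)
    counts = {}
    for overlaps in overlap_by_patient:
        for region, value in overlaps.items():
            if value > 0:
                counts[region] = counts.get(region, 0) + 1
    # A region is retained when enough patients have a lesion touching it.
    return sorted(r for r, c in counts.items() if c >= min_fraction * n_patients)


# Invented cohort of 20 patients: one region hit in 6 patients, another in only 1.
patients = [
    {"insula": 0.3 if i < 6 else 0.0, "thalamus": 0.2 if i == 0 else 0.0}
    for i in range(20)
]
kept_regions = regions_meeting_prevalence(patients)
```

Only the retained regions would then be passed on to the feature selector, so that both selection approaches start from a comparable set of candidate regions.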

Model Evaluation

To allow a qualitative comparison of the brain regions selected for stroke outcome prediction by the two methods, the data were randomly split into completely independent training and test sets. This resulted in only one set of features selected for each method, which greatly enhances the interpretability and comparison of the models. Specifically, the entire dataset was partitioned into two mutually exclusive subsets for model training (80%) and testing (20%) using a stratified split that preserves the representation of stroke severity across both groups. Three models were evaluated in this framework: (i) an un-nested SVR model with clinical features alone (M_CLINICAL) selected using RReliefF; (ii) a nested model using clinical and imaging data with RReliefF as feature selector (M_RELIEF); and (iii) a nested model using clinical and imaging data with LSM as feature selector (M_LSM). The resulting models were compared for predictive performance with respect to the model's mean absolute error (MAE) and coefficient of determination (R²).
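The evaluation scheme can be sketched as follows. How stroke severity was binned for stratification is not specified, so the severity labels below are hypothetical, as are the function names and toy values. The two metrics follow their standard definitions: MAE is the mean of |y − ŷ|, and R² is 1 − SS_res / SS_tot.

```python
import random
from collections import defaultdict


def stratified_split(strata, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) with each stratum contributing about test_fraction to the test set."""
    by_stratum = defaultdict(list)
    for idx, s in enumerate(strata):
        by_stratum[s].append(idx)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train_idx, test_idx = [], []
    for s in sorted(by_stratum):
        members = by_stratum[s]
        rng.shuffle(members)
        n_test = round(len(members) * test_fraction)
        test_idx.extend(members[:n_test])
        train_idx.extend(members[n_test:])
    return sorted(train_idx), sorted(test_idx)


def mean_absolute_error(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical severity bins for 20 patients.
severity = ["mild"] * 10 + ["moderate"] * 5 + ["severe"] * 5
train_idx, test_idx = stratified_split(severity, test_fraction=0.2)

# Invented observed and predicted 30-day NIHSS values for four test patients.
observed = [2, 5, 9, 14]
predicted = [3, 5, 7, 13]
mae = mean_absolute_error(observed, predicted)
r2 = r_squared(observed, predicted)
```

With these toy values the split holds out two mild, one moderate, and one severe patient, and the MAE is one NIHSS point.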

Results

The overlap of all individual patient lesions in the atlas space shows that maximum incidence of stroke in this dataset occurs in the brain regions supplied by the middle cerebral artery (see Supplementary Figure 1). The median recovery profile of patients in this database is shown in Supplementary Figure 2.

The model using clinical features only resulted in a rather poor predictive performance (R² = 0.13). The optimal prediction results were achieved using age, baseline NIHSS, blood glucose and hematocrit levels, sex, presence of atrial fibrillation, hypertension, and hyperlipidemia, treatment decision (endovascular thrombectomy or tissue plasminogen activator), symptom onset to admission time, and blood pressure as features. However, the iterative feature selection procedure using RReliefF did not select presence of diabetes and smoking status, which are usually considered important predictors. Only the clinical features selected in this model were included in the two nested models to enable a direct comparison.

Compared to the simple predictive model using clinical features only, the two nested (M_RELIEF and M_LSM) models performed better and resulted in comparable R² and MAEs (see Table 2). No statistically significant MAE differences (p > 0.05) were found comparing the two nested models. However, M_RELIEF used only 44 ROIs in comparison to the 106 ROIs selected in M_LSM (see Figure 1). The plots of the predicted and ground truth scores for both models are shown in Supplementary Figure 3.

Table 2. Model performances for each setup.

Figure 1. Selected regions of interest (ROIs) for the RReliefF-based (red) and LSM-based (blue) feature selection. The LSM-based ROIs are hemispherically asymmetrical and include regions outside of the subcortical nuclei.

Discussion

This study demonstrates that conventional machine learning feature selection methods (M_RELIEF) can identify important brain regions for stroke outcome prediction as effectively as conventional lesion-symptom mapping methods (M_LSM).

The advantages of the M_RELIEF model over the M_LSM model are two-fold. First, the M_RELIEF model is simpler since it uses <50% of the features used by the M_LSM model while achieving similar predictive performance. Second, the M_RELIEF setup does not require extensive LSM computations to derive structure-function relationships and identify eloquent brain regions. Notably, despite using fewer regions, the ROIs chosen by the M_RELIEF model are largely in the left hemisphere and include regions that correspond to the dominance of left-hemispheric functions assessed by NIHSS.

Importantly, using LSM for ROI selection has additional limitations that the RReliefF feature selection overcomes. First, LSM analyses suffer from low statistical power due to the corrections for multiple comparisons and do not account for violations of the normality assumption in the outcome score. Second, the LSM analysis results in individual voxel weights, which are not needed for region-level inferences about the critical brain regions associated with a deficit. While LSM is a powerful tool to investigate the neural correlates of stroke-induced clinical deficits, its usefulness for selecting ROIs in stroke outcome prediction tasks seems rather limited. For these reasons, and by applying the Occam's razor principle in model selection, traditional feature selection methods seem to be better suited for future research in stroke outcome prediction.

The proposed framework has a design advantage in comparison to the existing prognostic models of stroke outcome. A recent review on predictive models of stroke outcome (18) reports that: (i) the target outcome of the predictive model is usually a categorized version of functional outcome¹; (ii) the variables used to model this score include prognostic parameters², stroke risk factors, and baseline stroke severity measured by the NIHSS. An obvious limitation is that classification models predicting binarized functional outcome likely ignore the gradation of stroke severity, which is relevant information for stroke prognostication. Furthermore, the functional outcomes, prognostic parameters, and the baseline severity measures may be strongly correlated, resulting in inflated classification accuracies. In the proposed work, both of these limitations (loss of relevant information and collinearity) are addressed by employing the nested regression model. For instance, since the 48-hour NIHSS is highly correlated with the 30-day NIHSS, using it directly as a predictor might bias the regression model. Therefore, having a nested model that utilizes the short-term outcome to derive image-based ROIs that in turn predict the long-term outcomes seems to be a promising way to reduce the effects of collinearity. Furthermore, it is important to note that the results of different studies describing predictive models are not comparable because of different sample sizes, different evaluation methods, different assessment time points, and different imaging time points. For this reason, the predictive model using clinical data only was included in this study as a means of baseline comparison.

One of the limitations of the proposed work is that the findings are population-specific and are likely to change with the stroke cohort used (type of stroke and sample size), choice of parcellation atlas, LSM technique, and/or training scheme employed. This study is also exploratory in the sense that, subject to availability, the clinical descriptors included are a subset of all potential stroke risk factors reported in the literature. Power calculations for using LSM in predictive analysis have not been explored in this study. Additionally, the burden of preprocessing each patient scan for registration, lesion segmentation, and feature computation is extensive. State-of-the-art deep learning methods have the potential to use 3D MRI or CT scans directly, without handcrafted image-based features and possibly even without manual lesion segmentations. Furthermore, the results of this study can be considered a relevant first step toward building a computer-aided prognosis support tool using explainable machine learning methods. However, the predictive accuracy of the models generated in this study needs to be further improved using additional datasets and should be evaluated prospectively using a completely independent dataset.

An important recommendation for future work is to model stroke outcomes using ordinal regression models, which can account for the relative ordering between two values in the NIHSS scale. However, ordinal regression models are more complex and typically require the definition of interval thresholds, which can either be derived from the training data or based on domain knowledge. That said, the results described in this paper will generally hold true for ordinal regression models as well. Confirmatory research in this direction may also benefit from investigating the utility of convolutional neural networks without requiring lesion segmentation to predict long-term stroke outcome as an ordinal regression problem.

Conclusions

In summary, this study shows that combining clinical and imaging data leads to better stroke outcome predictions compared to using clinical data alone. While lesion-symptom mapping is a powerful neuroscience tool to investigate structure-function relationships in stroke patients, these methods do not appear to have an additional benefit for selecting brain regions important for stroke outcome prediction compared to rather simple and data-driven feature selection methods.

Data Availability Statement

The acquisition of the datasets for the two trials was approved by the respective local ethics board at each site contributing to the two trials. All datasets used in this secondary study were made available after complete anonymization. Requests to access these datasets should be directed to Nils Forkert, nils.forkert@ucalgary.ca.

Author Contributions

DR conceptualized, conducted experiments, and drafted the paper. MH contributed data, validated the design of experiments, and critically reviewed the paper. JF and MG contributed data, and reviewed the paper. AD contributed data. NF validated the design of experiments, and critically reviewed the paper. All authors contributed to the article and approved the submitted version.

Funding

This work was funded by the Heart and Stroke Foundation of Canada Grant in aid (G-17-0018368), the Canada Research Chairs program, the River Fund at Calgary Foundation, and the Hotchkiss Brain Institute.

Conflict of Interest

JF (all unrelated): Research support: EU, BMBF, BMWi, DFG, Acandis, Medtronic, Microvention, Stryker, Consultancy: Acandis, Cerenovus, Medtronic, Microvention, Penumbra, Phenox, Stryker, Stock: Tegus, Executive functions: University Medical Center Hamburg-Eppendorf, Eppdata GmbH. MH reports grants from Covidien (Medtronic LLC), during the conduct of the ESCAPE study; personal fees from Merck, non-financial support from Hoffmann-La Roche Canada Ltd, grants from Covidien (Medtronic), grants from Boehringer-Ingelheim, grants from Stryker Inc., grants from Medtronic LLC, grants from NoNO Inc., outside the submitted work; In addition, MH has a patent Systems and Methods for Assisting in Decision-Making and Triaging for Acute Stroke Patients pending to US Patent office Number: 62/086,077 and owns stock in Calgary Scientific Incorporated, a company that focuses on medical imaging software, is a director of the Canadian Federation of Neurological Sciences, a not-for-profit group and has received grant support from Alberta Innovates Health Solutions, CIHR, Heart & Stroke Foundation of Canada, National Institute of Neurological Disorders and Stroke.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2021.663899/full#supplementary-material

Footnotes

1. Examples include: the modified Rankin Scale (mRS), NIHSS, Barthel Index, etc.

2. Examples include: Preadmission Comorbidities, Level of Consciousness, Age, and Neurological Deficit (PLAN); Stroke Prognostication Using Age and National Institutes of Health Stroke Scale (SPAN); Totaled Health Risks in Vascular Events (THRIVE), etc.

References

1. Price CJ, Hope TM, Seghier ML. Ten problems and solutions when predicting individual outcome from lesion site after stroke. Neuroimage. (2017) 145:200–8. doi: 10.1016/j.neuroimage.2016.08.006

2. Brugnara G, Neuberger U, Mahmutoglu MA, Foltyn M, Herweh C, Nagel S, et al. Multimodal predictive modeling of endovascular treatment outcome for acute ischemic stroke using machine-learning. Stroke. (2020) 51:3541–51. doi: 10.1161/STROKEAHA.120.030287

3. Vercio LL, Amador K, Bannister JJ, Crites S, Gutierrez A, MacDonald ME, Moore J, et al. Supervised machine learning tools: a tutorial for clinicians. J. Neural Eng. (2020) 17:062001. doi: 10.1088/1741-2552/abbff2

4. de Haan B, Karnath H-O. A hitchhiker's guide to lesion-behaviour mapping. Neuropsychologia. (2018) 115:5–16. doi: 10.1016/j.neuropsychologia.2017.10.021

5. Forkert ND, Verleger T, Cheng B, Thomalla G, Hilgetag CC, Fiehler J. Multiclass support vector machine-based lesion mapping predicts functional outcome in ischemic stroke patients. PLOS ONE. (2015) 10:e0129569. doi: 10.1371/journal.pone.0129569

6. Demchuk AM, Goyal M, Menon BK, Eesa M, Ryckborst KJ, Kamal N, et al. Endovascular treatment for small core and anterior circulation proximal occlusion with emphasis on minimizing CT to recanalization times (ESCAPE) trial: methodology. Int J Stroke. (2015) 10:429–38. doi: 10.1111/ijs.12424

7. Cheng B, Forkert ND, Zavaglia M, Hilgetag CC, Golsari A, Siemonsen S, et al. Influence of stroke infarct location on functional outcome measured by the modified rankin scale. Stroke. (2014) 45:1695–702. doi: 10.1161/STROKEAHA.114.005152

8. Cotoi A, Batey C, Hussein N, Janzen S, Teasell R. Rehabilitation of younger patients post stroke. In: Evidence Review Handbook. EBRSR (2018). p. 53. Available online at: http://www.ebrsr.com/evidence-review/21-rehabilitation-younger-patients-post-stroke

9. Choi JC, Kim BJ, Han M-K, Lee SJ, Kang K, Park J-M, et al. Utility of items of baseline national institutes of health stroke scale as predictors of functional outcomes at three months after mild ischemic stroke. J Stroke Cerebrovasc Dis. (2017) 26:1306–13. doi: 10.1016/j.jstrokecerebrovasdis.2017.01.027

10. Rajashekar D, Wilms M, MacDonald ME, Ehrhardt J, Mouches P, Frayne R, et al. High-resolution T2-FLAIR and non-contrast CT brain atlas of the elderly. Sci Data. (2020) 7:1–7. doi: 10.1038/s41597-020-0379-9

11. Jiang T. Brainnetome: a new -ome to understand the brain and its disorders. Neuroimage. (2013) 80:263–72. doi: 10.1016/j.neuroimage.2013.04.002

12. Oishi K, Faria AV, van Zijl PCM, Mori S. MRI Atlas of Human White Matter. Elsevier Science & Technology (2010).

13. Rorden C, Karnath H-O, Bonilha L. Improving lesion-symptom mapping. J Cogn Neurosci. (2007) 19:1081–8. doi: 10.1162/jocn.2007.19.7.1081

14. Robnik-Šikonja M, Kononenko I. An adaptation of Relief for attribute estimation in regression. In: Machine Learning: Proceedings of the Fourteenth International Conference (ICML'97). San Francisco, CA (1997) 5:296–304.

15. LESYMAP. LESYMAP. Available online at: https://dorianps.github.io/LESYMAP/

16. FSelector: Selecting Attributes version 0.33 from CRAN. Available online at: https://rdrr.io/cran/FSelector/

17. Rajashekar D, Mouchès P, Fiehler J, Menon BK, Goyal M, Demchuk AM, et al. Structural integrity of white matter tracts as a predictor of acute ischemic stroke outcome. Int J Stroke. (2020) 15:965–72. doi: 10.1177/1747493020915251

18. Gao MMY, Wang J, Saposnik G. The art and science of stroke outcome prognostication. Stroke. (2020) 51:1358–60. doi: 10.1161/STROKEAHA.120.028980

Keywords: support vector machine, lesion symptom mapping, NIHSS (National Institute of Health Stroke Scale), nested regression, ischemic stroke

Citation: Rajashekar D, Hill MD, Demchuk AM, Goyal M, Fiehler J and Forkert ND (2021) Prediction of Clinical Outcomes in Acute Ischaemic Stroke Patients: A Comparative Study. Front. Neurol. 12:663899. doi: 10.3389/fneur.2021.663899

Received: 03 February 2021; Accepted: 09 April 2021;
Published: 06 May 2021.

Edited by:

Vida Abedi, Geisinger Health System, United States

Reviewed by:

Mohammad Adibuzzaman, Purdue University, United States
Ghasem Farahmand, Tehran University of Medical Sciences, Iran
Durgesh Prasad Chaudhary, Geisinger Health System, United States

Copyright © 2021 Rajashekar, Hill, Demchuk, Goyal, Fiehler and Forkert. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Deepthi Rajashekar, deepthi.rajasheka1@ucalgary.ca

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.