Prediction of conversion to dementia using interpretable machine learning in patients with amnestic mild cognitive impairment

Chun, Min Young; Park, Chae Jung; Kim, Jonghyuk; Jeong, Jee Hyang; Jang, Hyemin; Kim, Kyunga; Seo, Sang Won

doi:10.3389/fnagi.2022.898940

ORIGINAL RESEARCH article

Front. Aging Neurosci., 05 August 2022

Sec. Alzheimer's Disease and Related Dementias

Volume 14 - 2022 | https://doi.org/10.3389/fnagi.2022.898940

Min Young Chun 1 §†
Chae Jung Park 2,3 §†
Jonghyuk Kim 2 §†
Jee Hyang Jeong 4 §
Hyemin Jang 1,3 §
Kyunga Kim 2,5 §‡*
Sang Won Seo 1,2,3,6,7 §‡*

1. Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
2. Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
3. Alzheimer’s Disease Convergence Research Center, Samsung Medical Center, Seoul, South Korea
4. Department of Neurology, Ewha Womans University Seoul Hospital, Ewha Womans University College of Medicine, Seoul, South Korea
5. Biomedical Statistics Center, Data Science Research Institute, Research Institute for Future Medicine, Samsung Medical Center, Seoul, South Korea
6. Department of Health Sciences and Technology, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
7. Department of Intelligent Precision Healthcare Convergence, Sungkyunkwan University, Suwon, South Korea

Abstract

Purpose:

Amnestic mild cognitive impairment (aMCI) is a transitional state between normal aging and Alzheimer’s disease (AD). However, not all aMCI patients are observed to convert to AD dementia. Therefore, developing a predictive algorithm for the conversion of aMCI to AD dementia is important. Parametric methods, such as logistic regression, have been developed; however, they have difficulty reflecting complex patterns, such as non-linear relationships and interactions between variables. Therefore, this study aimed to improve the prediction of aMCI patients’ conversion to dementia by using an interpretable machine learning (IML) algorithm and to identify the factors that increase the risk of conversion to dementia in each individual patient.

Methods:

We prospectively recruited 705 patients with aMCI who had been followed up for at least 3 years after undergoing baseline neuropsychological tests at the Samsung Medical Center between 2007 and 2019. We used neuropsychological tests and apolipoprotein E (APOE) genotype data to develop a predictive algorithm. The model-building and validation datasets were composed of data of 565 and 140 patients, respectively. For global interpretation, five algorithms (logistic regression, random forest, support vector machine, artificial neural network, and extreme gradient boosting) were compared. For local interpretation, individual conditional expectations (ICE) and SHapley Additive exPlanations (SHAP) were used to analyze individual patients.

Results:

Among the five algorithms, the extreme gradient boost model showed the best performance, with an area under the receiver operating characteristic curve of 0.852 and an accuracy of 0.807. Variables such as age, education, the scores of visuospatial and memory domains, the sum of boxes of the Clinical Dementia Rating scale, Mini-Mental State Examination, and APOE genotype were important features for creating the algorithm. Through ICE and SHAP analyses, it was also possible to interpret which variables acted as strong factors for each patient.

Conclusion:

We were able to propose a predictive algorithm for each aMCI individual’s conversion to dementia using the IML technique. This algorithm is expected to be useful in clinical practice and the research field, as it can suggest conversion with high accuracy and identify the degree of influence of risk factors for each patient.

Introduction

Amnestic mild cognitive impairment (aMCI) refers to a transitional state between normal aging and dementia (Flicker et al., 1991; Petersen et al., 2001; Sarazin et al., 2007). Previous studies showed that within 3 years, approximately 50% of aMCI patients converted to dementia (Fischer et al., 2007; Espinosa et al., 2013), with an annual conversion rate of 5–25% (Larrieu et al., 2002; Mitchell and Shiri-Feshki, 2009; Alegret et al., 2014). However, some aMCI patients maintain a stable state of cognitive function or revert to normal cognition (Busse et al., 2006; Mitchell and Shiri-Feshki, 2009). Several factors, including age, sex, neuropsychological test results, and apolipoprotein E (APOE) genotype, were found to affect the rate of conversion to dementia (Petersen et al., 1995; Daly et al., 2000; DeCarli et al., 2004; Yaffe et al., 2006). Thus, as the clinical outcomes of aMCI patients are heterogeneous, it is important to consider the risk factors of each patient individually while predicting their conversion to dementia.

Several studies have been conducted to create algorithms that predict the conversion of aMCI to dementia (Ravaglia et al., 2006; Tabert et al., 2006; De Simone et al., 2019). Specifically, Jang et al. developed a dementia risk prediction algorithm by using traditional statistical methods, such as multivariate logistic regression (LR) and the nomogram (Jang et al., 2017). However, when the LR is applied to complex multivariate non-linear relationships, it may have low robustness because of the multicollinearity between the variables (Tu, 1996).

Machine learning (ML) techniques, a form of artificial intelligence that is increasingly used in the medical research field, have also been considered in developing prediction algorithms for conversion to dementia (Chen and Herskovits, 2010; Mattila et al., 2012; Hall et al., 2015; So et al., 2017; Zhu et al., 2020; Lian et al., 2021; Qiao et al., 2021). These algorithms learn complex relationships from empirical data and can thereby make more accurate decisions (Bishop, 2006; Waljee et al., 2014). Compared to the traditional statistical methods, ML is less likely to overlook unexpected predictors and potential interactions between variables (Waljee et al., 2014). However, unlike nomograms, ML techniques are not able to show which factors play a major role in the conversion. Thus, interpretable ML (IML) was developed to provide understandable explanations of complex learned models with predictive accuracy, descriptive accuracy, and relevancy (Murdoch et al., 2019).

Therefore, in the present study, we aimed to develop an IML algorithm with a higher predictive power than that of LR, which predicts conversion to dementia in aMCI participants in an accurate manner. We used clinical demographics, APOE genotype, and neuropsychological results as features that are easily accessible in clinical practice. We also attempted to develop a graphic-based interpretable method to show which risk factors influence conversion to dementia, and to what extent, in individual aMCI participants.

Materials and methods

Participants

We conducted a cohort study among participants with aMCI who visited the Samsung Medical Center (SMC) in South Korea from June 2007 to December 2019 and were followed up for at least 3 years after baseline neuropsychological tests. In total, 705 participants with aMCI were enrolled in this study. All subjects met the following criteria for aMCI (Albert et al., 2011): (1) subjective memory complaints by participants or caregivers; (2) objective memory decline below −1.0 standard deviation (SD) on either verbal or visual memory tests; (3) normal activities of daily living (ADL), as judged clinically; and (4) not demented.

All the subjects underwent neurological examination, laboratory tests, including APOE genotype, and neuropsychological tests. We excluded participants with secondary causes of cognitive impairment through laboratory tests, such as vitamin B₁₂/folate determination, syphilis serology, and thyroid function tests. In addition, participants with structural lesions, such as territorial infarction, intracranial hemorrhage, brain tumor, traumatic brain injury, hydrocephalus, or severe white matter hyperintensities on brain magnetic resonance imaging (MRI), were excluded.

The study was approved by the Institutional Review Board of SMC, and informed consent was obtained from all participants and caregivers.

Neuropsychological assessments

All the participants underwent the Seoul Neuropsychological Screening Battery (SNSB), a standardized neuropsychological battery widely used in South Korea (Kang and Na, 2003; Kang et al., 2016). Four major cognitive domains were evaluated: memory, language, visuospatial, and frontal/executive function. A test score was considered impaired if its z-score was below −1.0 SD of the age- and education-adjusted norms.
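The impairment rule can be written as a short calculation. The sketch below is illustrative only: the normative mean and SD are hypothetical placeholders, not values from the SNSB norms, and the function names are introduced here for the example.

```python
def z_score(raw, norm_mean, norm_sd):
    """Standardize a raw test score against age- and education-matched norms."""
    return (raw - norm_mean) / norm_sd

def is_impaired(raw, norm_mean, norm_sd, cutoff=-1.0):
    """Apply the rule from the text: impaired when the z-score is below -1.0 SD."""
    return z_score(raw, norm_mean, norm_sd) < cutoff

# Hypothetical norm (mean 6.0, SD 2.0) for a delayed recall score.
low_score = is_impaired(raw=2.0, norm_mean=6.0, norm_sd=2.0)   # z = -2.0
mild_score = is_impaired(raw=5.0, norm_mean=6.0, norm_sd=2.0)  # z = -0.5
```

Under these assumed norms the first score would count as impaired and the second would not.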

The scorable tests comprised the Korean version of the Boston Naming Test (Kim and Na, 1999); the Rey-Osterrieth Complex Figure Test (RCFT) (Kang and Na, 2003), which involves copying, immediate and 20-min delayed recall, and recognition; the Seoul Verbal Learning Test (SVLT) (Kang and Na, 2003), which includes three learning-free recall trials of 12 words, a 20-min delayed recall trial of these 12 items, and a recognition test; the contrasting program (instructing the patient to raise the second and third fingers when the examiner raises the second finger, and to raise the second finger when the examiner raises the second and third fingers); the go/no-go test (changing the initial rule as follows: instructing the patient to make a fist in response to the examiner’s raising the second and third fingers) (Dubois et al., 2000); and the phonemic and semantic Controlled Oral Word Association Tests (COWAT) (Kang et al., 2000). In addition, the ideomotor praxis and the total calculation score were evaluated. The Korean version of the Mini-Mental State Examination (K-MMSE) and clinical dementia rating-sum of boxes (CDR-SOB) of all the participants were investigated (Kang et al., 2016).

Follow-up

All the participants underwent two or more SNSB during a follow-up period of at least 3 years. Dementia was diagnosed on the basis of the criteria of the fourth edition of the Diagnostic and Statistical Manual of Mental Disorders and required evidence of cognitive deficits (confirmed by neuropsychological testing) and social and/or occupational dysfunction (confirmed by ADL impairment). The criteria of the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association were used for the diagnosis of probable AD (McKhann et al., 2011). A consensus panel and an experienced neurologist reviewed the interview records and neuropsychological results of each aMCI patient and confirmed the conversion to dementia in the SMC cohort.

The primary outcome was defined as conversion to dementia within 3 years of the baseline neuropsychological test. The predictive algorithm used variables, such as age, gender, years of education, neuropsychological features, APOE ε2, and APOE ε4 status as the potential predictors.

Feature selection

Variables were selected in three major steps. First, domain knowledge was used to remove unnecessary variables from the results of neuropsychological tests. Second, each remaining variable was tested in a single-variable LR analysis, and the insignificant variables were removed. Third, where multicollinearity was suspected on the basis of the correlation coefficient, one of the correlated variables was removed or the variables were integrated. We specified the primary outcome as 3-year dementia conversion and included features, such as demographics, APOE genotypes, and neuropsychological features (including K-MMSE and CDR-SOB), selected using the above process. The selected features were used as inputs for predictive model building, and as potential predictors for model interpretation.
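The third step, removing one of a pair of highly correlated variables, might look like the following minimal sketch. The correlation cutoff of 0.9, the column names, and the data values are assumptions made for illustration; the text does not report the threshold that was used.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def drop_correlated(columns, threshold=0.9):
    """Keep a column only if it is not highly correlated with one already kept."""
    kept = []
    for name in columns:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept

# Hypothetical data: the second column is a rescaled copy of the first.
columns = {
    "rcft_copy_score": [10, 20, 25, 30, 36],
    "rcft_copy_score_scaled": [v / 36 for v in [10, 20, 25, 30, 36]],
    "education_years": [12, 6, 16, 9, 12],
}
selected = drop_correlated(columns)
```

This greedy filter keeps the first variable of each correlated pair; which member of a pair to keep is a domain decision that the sketch does not address.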

Algorithm constructions

Eighty percent of the total data was randomly selected, with the class imbalance matched between the two subsets, and used to develop the predictive algorithm; the remaining 20% was used to test the algorithm. Stratified 5-fold cross-validation was repeated five times with random dataset splitting, and Bayesian optimization was used for hyperparameter tuning. Five types of ML models were developed: multivariable LR, random forest (RF), support vector machine (SVM), artificial neural network (ANN), and extreme gradient boost (XGB).
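A class-stratified 80/20 split followed by repeated stratified 5-fold cross-validation can be sketched in plain Python as below. This is not the authors' code (the study used R); the helper names, the seeds, and the synthetic label vector (254 converters among 705 participants, taken from the reported totals) are assumptions for illustration.

```python
import random
from collections import defaultdict

def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) with the class ratio preserved in both parts."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    train_idx, test_idx = [], []
    for idx in by_class.values():
        rng.shuffle(idx)
        n_test = int(len(idx) * test_fraction)  # floor, applied per class
        test_idx.extend(idx[:n_test])
        train_idx.extend(idx[n_test:])
    return sorted(train_idx), sorted(test_idx)

def stratified_folds(labels, indices, k=5, seed=0):
    """Deal each class's shuffled indices round-robin into k folds."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i in indices:
        by_class[labels[i]].append(i)
    folds = [[] for _ in range(k)]
    for idx in by_class.values():
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds

# Synthetic labels: 1 = converted within 3 years, 0 = did not convert.
labels = [1] * 254 + [0] * 451
train_idx, test_idx = stratified_split(labels, test_fraction=0.2, seed=42)

# Five repeats of stratified 5-fold CV, each with a different random seed.
repeats = [stratified_folds(labels, train_idx, k=5, seed=r) for r in range(5)]
```

In each repeat, every fold serves once as the held-out part while the other four are used for fitting; hyperparameter search (Bayesian optimization in the study) would sit on top of this loop and is not shown.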

Statistical analyses

The performance of the model was compared by using areas under the receiver operating characteristic curve (AUCs) with DeLong test (P-value < 0.05 indicated statistical significance) (DeLong et al., 1988). Statistical analyses were performed using the Daim (v1.1.0) package in R 4.1.2 (R Core Team, 2021).
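For readers less familiar with the two reported metrics, the sketch below computes accuracy and the AUC from predicted probabilities. The AUC is computed through its rank interpretation, the probability that a randomly chosen converter receives a higher score than a randomly chosen non-converter. The scores and labels are made up, and the DeLong test itself is not implemented here.

```python
def auc(scores, labels):
    """AUC as P(score of a positive > score of a negative), ties counted as 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def accuracy(scores, labels, threshold=0.5):
    """Fraction of cases classified correctly at the given probability threshold."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)

# Made-up predicted probabilities for three converters and three non-converters.
scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
example_auc = auc(scores, labels)       # 8 of 9 positive-negative pairs ordered correctly
example_acc = accuracy(scores, labels)  # 4 of 6 cases correct at threshold 0.5
```

Accuracy depends on the chosen threshold, whereas the AUC summarizes ranking quality over all thresholds, which is why the two can order classifiers differently.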

Interpretation methods

The interpretation of the developed ML models was based on both global and local perspectives. IML analysis was carried out using R 4.1.2 (R Core Team, 2021) with the caret (v6.0-90), iml (v0.10.1), vip (v0.3.2), pdp (v0.7.0), breakDown (v0.2.1), SHAPforxgboost (v0.1.1), DALEX (v2.3.0), and modelStudio (v3.0.0) packages.

Global interpretation

The global analysis method was used to evaluate the developed model as a whole, through model performance, feature importance (Breiman, 2001; Fisher et al., 2019), and partial dependence (Friedman, 2001). The performance of the ML model in four groups divided by gender and age was measured by accuracy and AUC. Feature importance was measured as the decrease in model performance observed when the values of a specific feature were randomly permuted. The partial dependence plot (PDP) is a global interpretation method that shows the marginal effect of one or two features on the prediction result of an ML model (Friedman, 2001).
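Permutation-based feature importance can be illustrated with a toy example. The scoring function, the data, and the feature named `noise` below are all hypothetical; the sketch only shows the mechanism of shuffling one feature and measuring the resulting drop in AUC, not the study's model or results.

```python
import random

def auc(scores, labels):
    """AUC as P(score of a positive > score of a negative), ties counted as 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def toy_model(row):
    """Hypothetical risk score: rises with age, falls with RCFT delayed recall."""
    return 0.04 * (row["age"] - 70) - 0.06 * (row["rcft_delayed_recall"] - 7)

def permutation_importance(model, rows, labels, feature, n_repeats=20, seed=0):
    """Mean drop in AUC after randomly permuting one feature's values."""
    rng = random.Random(seed)
    baseline = auc([model(r) for r in rows], labels)
    drops = []
    for _ in range(n_repeats):
        shuffled = [r[feature] for r in rows]
        rng.shuffle(shuffled)
        permuted = [dict(r, **{feature: v}) for r, v in zip(rows, shuffled)]
        drops.append(baseline - auc([model(r) for r in permuted], labels))
    return sum(drops) / n_repeats

# Deterministic toy data; "noise" is a feature the model never uses.
rows = [
    {"age": 60 + (i * 7) % 25, "rcft_delayed_recall": (i * 5) % 20, "noise": (i * 11) % 13}
    for i in range(60)
]
labels = [1 if toy_model(r) > 0 else 0 for r in rows]
importance = {f: permutation_importance(toy_model, rows, labels, f) for f in rows[0]}
```

A feature the model ignores has an importance of exactly zero because shuffling it leaves every prediction unchanged; the spread of the drops across repeats corresponds to the interval bands usually drawn around each bar.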

Local interpretation

The local analysis method interpreted the prediction results for individual participants. In this study, we implemented Individual Conditional Expectations (ICE) (Goldstein et al., 2015), Break-down (Robnik-Šikonja and Kononenko, 2008), and SHapley Additive exPlanations (SHAP) (Lundberg and Lee, 2017). First, ICE (or Ceteris-paribus) plots display one line per individual that shows how the individual’s prediction changes when a feature changes (Goldstein et al., 2015). Other feature values are fixed at the individual’s own values. Second, Break-down plots show feature attributions; that is, the prediction is decomposed into contributions that can be attributed to different interpretive features (Robnik-Šikonja and Kononenko, 2008). A plot can be drawn by adding or subtracting each feature contribution one by one, starting from the average predicted value for the whole dataset. Finally, SHAP explains individual predictions by computing the contribution of each feature to the prediction. This is based on the game-theoretically optimal Shapley values (Lundberg and Lee, 2017). Unlike break-down plots, the order of adding features is varied over numerous trials; therefore, the mean and SD of each contribution are estimated.
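The difference between a break-down attribution and a Shapley value can be made concrete with a small model that contains an interaction. Everything in the sketch is hypothetical: the binary features, the coefficients, and the use of a single reference patient in place of the dataset average are simplifying assumptions, and the exact enumeration of orderings shown here is only feasible for a handful of features.

```python
from itertools import permutations

def model(x):
    """Hypothetical risk model with an interaction between two features."""
    return (0.2
            + 0.1 * x["cdr_sob_high"]
            + 0.1 * x["rcft_recall_low"]
            + 0.2 * x["cdr_sob_high"] * x["rcft_recall_low"]
            + 0.05 * x["age_over_75"])

def break_down(model, patient, reference, order):
    """Attribute the prediction by switching features on one at a time, in order."""
    current = dict(reference)
    previous = model(current)
    contributions = {}
    for name in order:
        current[name] = patient[name]
        updated = model(current)
        contributions[name] = updated - previous
        previous = updated
    return contributions

def shapley(model, patient, reference):
    """Average the break-down contributions over every possible feature ordering."""
    names = list(patient)
    orders = list(permutations(names))
    totals = {n: 0.0 for n in names}
    for order in orders:
        for n, c in break_down(model, patient, reference, order).items():
            totals[n] += c
    return {n: t / len(orders) for n, t in totals.items()}

patient = {"cdr_sob_high": 1, "rcft_recall_low": 1, "age_over_75": 1}
reference = {"cdr_sob_high": 0, "rcft_recall_low": 0, "age_over_75": 0}

cdr_first = break_down(model, patient, reference,
                       ["cdr_sob_high", "rcft_recall_low", "age_over_75"])
rcft_first = break_down(model, patient, reference,
                        ["rcft_recall_low", "cdr_sob_high", "age_over_75"])
shap_values = shapley(model, patient, reference)
```

With the interaction present, the break-down contribution of `cdr_sob_high` is 0.1 when it is added first and 0.3 when it is added after `rcft_recall_low`, whereas the Shapley value averages over orderings and splits the interaction evenly (0.2 each). In both cases the contributions sum to the gap between the patient's prediction and the reference prediction.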

We plotted three local interpretations above with the XGB model using six exemplary patients. Supplementary Table 1 shows demographic and dementia conversion information. Also, we collected all IML results and developed dashboards with a graphical view of each patient’s analysis results.

Results

Demographics and clinical characteristics

Table 1 shows the patient demographics and clinical characteristics. The model-building and validation datasets were composed of 565 and 140 participants, respectively. Among the aMCI participants of the development set, 36.1% (204/565) of the participants were observed to convert to dementia within 3 years. In the validation set, 50 out of 140 participants (35.7%) converted to dementia, which is similar to the conversion rate in the development set. Among participants who converted to dementia, 90.2% (n = 229) progressed to clinical AD–type dementia by meeting the core clinical criteria for probable AD (McKhann et al., 2011), and 9.8% to other types of dementia including subcortical vascular dementia (n = 12, 4.7%), frontotemporal dementia (n = 2, 0.8%), dementia with Lewy bodies (n = 2, 0.8%), and others (n = 9, 3.5%).
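The percentages in this paragraph can be cross-checked from the reported counts. The short sketch below uses only numbers stated in the text; the variable names are introduced for the example.

```python
def pct(part, whole):
    """Percentage rounded to one decimal place, as reported in the text."""
    return round(100 * part / whole, 1)

train_n, train_converters = 565, 204
valid_n, valid_converters = 140, 50
all_converters = train_converters + valid_converters  # 254

ad_dementia = 229
other_dementia = {
    "subcortical vascular dementia": 12,
    "frontotemporal dementia": 2,
    "dementia with Lewy bodies": 2,
    "others": 9,
}

train_rate = pct(train_converters, train_n)            # 36.1
valid_rate = pct(valid_converters, valid_n)            # 35.7
ad_share = pct(ad_dementia, all_converters)            # 90.2
other_share = pct(sum(other_dementia.values()), all_converters)  # 9.8
```

The AD and non-AD counts add up to the 254 converters across both datasets, so the subtype percentages are expressed relative to all converters rather than to either dataset alone.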

TABLE 1

Feature	Training set (N = 565)		Validation set (N = 140)	
	Mean or n	SD or (%)	Mean or n	SD or (%)
Conversion to dementia	204	(36.1%)	50	(35.7%)
Age (years)	71.6	7.8	72.2	7.6
Sex – Women	348	(61.6%)	84	(60.0%)
Education (years)	11.1	5.2	11.1	4.8
APOE ε4 carrier	214	(37.9%)	45	(32.1%)
APOE ε2 carrier	46	(8.1%)	9	(6.4%)
K-BNT	39.9	10.1	39.6	10.3
Ideomotor praxis	4.2	1.2	4.2	1.2
Calculation total score	10.9	2.0	10.6	2.1
RCFT copy score	29.7	6.3	29.7	5.7
RCFT copy time (seconds)	258.5	124.3	273.5	139.4
SVLT delayed recall	2.6	2.5	2.5	2.4
SVLT recognition score	18.3	2.8	18.4	2.4
RCFT delayed recall	6.9	5.4	6.8	4.8
RCFT recognition score	18.2	2.3	18.3	2.3
Contrasting program	19.1	2.8	19.0	2.9
Go/no-go	16.9	5.0	16.8	4.9
COWAT animal	12.5	4.2	12.6	4.3
K-MMSE	25.9	3.2	25.6	3.2
CDR-SOB	1.5	0.9	1.5	0.9

Demographics of the study.

The numbers are mean and standard deviation (or percentage in parenthesis) of the training and validation sets.

APOE, apolipoprotein E; K-BNT, Korean version of the Boston Naming Test; RCFT, Rey–Osterrieth Complex Figure Test; SVLT, Seoul Verbal Learning Test; COWAT, Controlled Oral Word Association; K-MMSE, Korean version of the Mini-Mental State Examination; SD, standard deviation; CDR-SOB, clinical dementia rating-sum of boxes.

The following 19 features were used for model building: age, gender, education, APOE ε2, APOE ε4, K-BNT, ideomotor praxis, calculation total score, RCFT copy score, RCFT copy time, SVLT delayed recall, SVLT recognition score, RCFT delayed recall, RCFT recognition score, contrasting program, go/no-go test, COWAT animal, K-MMSE, and CDR-SOB.

Global interpretation

The global interpretation results on the three methods are as follows:

Algorithm performance

The performance of the developed classifiers on the validation set and the optimized hyperparameters are shown in Table 2. The XGB model showed the highest performance (accuracy 0.807, AUC 0.852) compared to the other models. Figure 1A shows the receiver operating characteristic curve of the developed classifiers. Statistical tests showed that the AUCs of the XGB and the LR models were significantly different (P-value < 0.05). The hyperparameters of the best-performing XGB model were as follows: booster = gbtree, eta = 0.1, max_depth = 6, min_child_weight = 17, subsample = 0.81, colsample_bytree = 0.66. The hyperparameters of the other models were as follows: mtry = 4 for RF, sigma = 0.020 and C = 0.849 for SVM, and size = 4 and decay = 0.32 for ANN. We determined XGB to be the best-performing classifier and proceeded with the model interpretation. We also divided the validation set into four groups by gender and age: (1) age < 70 and male (n = 20), (2) age < 70 and female (n = 29), (3) age ≥ 70 and male (n = 36), and (4) age ≥ 70 and female (n = 55). The prediction performance of the XGB model in each group was (1) 0.902, (2) 0.838, (3) 0.865, and (4) 0.828, respectively (Figure 1B).
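The reported XGB settings and the subgroup rule can be written down as plain data, for example as below. The dictionary simply restates the values given in the text and is not passed to any library here; the `subgroup` helper and the `"male"`/`"female"` coding are assumptions for illustration.

```python
# Hyperparameters of the best XGB model, as reported in the text.
xgb_params = {
    "booster": "gbtree",
    "eta": 0.1,
    "max_depth": 6,
    "min_child_weight": 17,
    "subsample": 0.81,
    "colsample_bytree": 0.66,
}

def subgroup(age, sex):
    """Return the subgroup number (1-4) defined by the age cutoff of 70 and sex."""
    if age < 70:
        return 1 if sex == "male" else 2
    return 3 if sex == "male" else 4

# Subgroup sizes reported for the validation set.
reported_sizes = {1: 20, 2: 29, 3: 36, 4: 55}
total_validation = sum(reported_sizes.values())
```

The four subgroup sizes add up to the 140 participants of the validation set, so every validation participant falls into exactly one group.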

TABLE 2

Classifier	Accuracy	AUC
Logistic regression	0.743	0.813
Random forest	0.771	0.834
Support vector machine	0.800	0.830
Artificial neural network	0.757	0.841
Extreme gradient boost	0.807	0.852

Performance of classifiers on validation set.

Each classifier’s accuracy, area under the receiver operating characteristic curve, and optimized hyperparameters are presented.

AUC, area under the receiver operating characteristic curve.

FIGURE 1

Feature importance

Figure 2 shows the feature importance of the XGB model, where the bars indicate feature importance and the interval bands indicate the differences due to random permutations. According to the results, the clinical neuropsychological features of RCFT and CDR-SOB, as well as age, were important contributors to the global performance.

FIGURE 2

Partial dependence

In Figure 3, the PDP of six features is shown with the XGB and LR models. It can be explained that under the condition that other features are fixed, the possibility of dementia conversion increases with age, while it decreases when the RCFT delayed recall score increases. The slope patterns of the XGB and LR were similar.

FIGURE 3

Local interpretation

The local interpretation results on three methods are as follows.

Individual conditional expectations

Figure 4 shows the ICE plots of eight features for six individuals. Taking patient number 3 (green line) as an example, the probability of dementia conversion increases between the ages of 70 and 75 years. This patient is 75 years old, as marked by the blue dot on the green line, and the corresponding prediction value (y-axis), that is, the probability of conversion within 3 years, is approximately 0.5. Likewise, this patient scored 5 on RCFT delayed recall, at which the conversion probability was approximately 0.5. If the patient had performed the test better and obtained a higher score, the conversion probability would have been lower.
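How an ICE curve is produced, and how it relates to a partial dependence curve, can be sketched with a toy model. The logistic function, its coefficients, and the three patients are hypothetical and were chosen only so that a 75-year-old with an RCFT delayed recall score of 5 has a predicted probability of 0.5; they are not fitted to the study data.

```python
import math

def toy_risk(row):
    """Hypothetical model: conversion probability from age and RCFT delayed recall."""
    z = 0.15 * (row["age"] - 75) - 0.2 * (row["rcft_delayed_recall"] - 5)
    return 1.0 / (1.0 + math.exp(-z))

def ice_curve(model, row, feature, grid):
    """Vary one feature over a grid while holding the patient's other values fixed."""
    return [model(dict(row, **{feature: v})) for v in grid]

def partial_dependence(model, rows, feature, grid):
    """The PDP is the pointwise average of the individual ICE curves."""
    curves = [ice_curve(model, r, feature, grid) for r in rows]
    return [sum(c[i] for c in curves) / len(curves) for i in range(len(grid))]

patients = [
    {"age": 75, "rcft_delayed_recall": 5},
    {"age": 68, "rcft_delayed_recall": 12},
    {"age": 80, "rcft_delayed_recall": 2},
]
age_grid = [65, 70, 75, 80, 85]
recall_grid = [0, 5, 10, 15]

ice_age = ice_curve(toy_risk, patients[0], "age", age_grid)
ice_recall = ice_curve(toy_risk, patients[0], "rcft_delayed_recall", recall_grid)
pdp_age = partial_dependence(toy_risk, patients, "age", age_grid)
```

Each patient gets their own curve, and the marked point on a curve is simply the model's prediction at the patient's observed value. Averaging the curves hides individual differences, which is why ICE is treated as a local method and the PDP as a global one.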

FIGURE 4

Break-down plots

Figure 5 shows the break-down plots of six individuals with the XGB model. Patient number 1 (upper-left plot) had a CDR-SOB value of 3, which adds as much as 0.127 to the baseline mean prediction value of 0.36. In the same way, the RCFT delayed recall value of 0 contributes as much as 0.127 to the prediction.

FIGURE 5

SHapley Additive exPlanations

Figure 6 shows the Shapley value plots of six individuals. In patient number 1 (upper-left plot), the feature that contributed the most to predicting dementia conversion is the CDR-SOB. In patient number 5 (lower-middle plot), RCFT delayed recall contributed most to the conversion.

FIGURE 6

Graphic-based overall interpretation on individuals

Figure 7 shows the dashboard displaying the global and the local interpretation of patient 1. We collected all the IML results above and developed a dashboard that provides a graphical view of each patient’s analysis results by displaying them on a screen (Figure 7). It not only provides the probability of aMCI to dementia conversion, but also presents quantitative information on the risk factors attributed to the conversion.

FIGURE 7

Discussion

In the present study, using the clinical and neuropsychological features of carefully phenotyped aMCI patients, we developed an algorithm to predict conversion to dementia by applying the IML technique. Our major findings are as follows. First, among the ML techniques, the XGB model showed the best accuracy, which was superior to that of LR. Second, variables such as visual memory delayed recall, CDR-SOB, age, K-MMSE score, frontal executive function, education, verbal memory delayed recall, visuospatial function, and APOE genotype were important features for creating the algorithm. Finally, ICE and SHAP analyses allowed for the interpretation of variables that acted as important factors in the conversion to dementia of each aMCI patient. Taken together, our findings suggest that an algorithm using the IML technique enables us to individually predict the conversion of patients with aMCI to dementia within 3 years in clinical practice and the research field. Using our newly developed IML algorithm, we predict that, with the aid of visualized graphs, patients will be able to more easily understand the neuropsychological factors that are at risk, which would be a further step toward precision medicine.

In the present study, when compared with other algorithms including LR, the XGB model showed the best performance with an AUC of 0.852 and an accuracy of 0.807. Thus, these findings suggest that our newly developed algorithm with the XGB model overcomes the difficulty of LR in reflecting non-linear relationships and interactions between variables, and results in better AUC and accuracy than LR. If the predictive algorithm is applied to the electronic medical record system, the conversion rate could be readily calculated in clinical practice with more accuracy.

The second major finding was that RCFT delayed recall, CDR-SOB, age, K-MMSE, COWAT-animal, education, SVLT delayed recall, RCFT copy time, and APOE genotype were the important factors in the IML algorithm, which is consistent with previous studies. Consistent with our findings, MMSE (Hou et al., 2019), CDR-SOB (Daly et al., 2000; Dickerson et al., 2007; Montano et al., 2013; Woolf et al., 2016), and frontal/executive dysfunction, which can be examined by the COWAT-animal test (Lezak et al., 2004), were found to be the predictors of conversion to dementia in other studies (Tabert et al., 2006; Jung et al., 2020). The APOE ε4 genotype was also found to play an important role in conversion to dementia, which was again consistent with previous studies (Petersen et al., 1995; Mosconi et al., 2004; Elias-Sonnenschein et al., 2011).

In our previous studies (Ye et al., 2015; Jang et al., 2017), the odds ratio of conversion to dementia was higher in Verbal-aMCI patients than in Visual-aMCI patients. However, our global interpretation results showed that the RCFT delayed recall score (visual memory) had higher feature importance than the SVLT delayed recall score (verbal memory), which is thought to be due to differences in the classification of participants. The previous studies defined Visual-aMCI as only visual memory impairment, Verbal-aMCI as only verbal memory impairment, and Both-aMCI as visual and verbal memory impairment, and then analyzed the odds ratio compared to Visual-aMCI. On the other hand, we analyzed the variables of the RCFT delayed recall score and SVLT delayed recall score together with other neuropsychological test scores of all participants without classification.

There is also some debate among studies on the effects of education in participants with aMCI. Specifically, a previous study (Cooper et al., 2015) did not show that high educational levels predict conversion to dementia in participants with aMCI. However, another study from our group showed that highly educated aMCI participants were at a higher risk of conversion to AD dementia than less educated aMCI participants (Ye et al., 2013). Furthermore, early-stage aMCI participants with higher levels of education showed a slower cognitive decline, while late-stage aMCI participants with higher levels of education showed a more rapid cognitive decline. Thus, our present finding that aMCI patients with higher education levels were more likely to convert to dementia should be replicated in future studies with larger samples of MCI participants.

Some studies have proposed an algorithm for differentiating cognitive decline using ML methods, including the Disease State Index, naïve Bayes, Bayesian network classifier with inverse tree structure, decision tree, SVM, multilayer perceptrons, bagging, RF, and rule-based classifier (Chen and Herskovits, 2010; Hall et al., 2015; So et al., 2017; Bansal et al., 2018; Bhagyashree et al., 2018; Zhu et al., 2020). Beheshti et al. also developed a predictive algorithm with feature ranking and a genetic algorithm, which can predict the conversion rate to dementia after 3 years (Beheshti et al., 2017). However, compared to previous studies, the present study is meaningful in that we predicted the conversion of aMCI to dementia with IML, especially by presenting the attribution of each feature to the prediction. Thus, the IML predictive algorithm used in our study might be more useful in clinical practice because it is composed of clinical data that are widely and commonly used for evaluating cognition status.

Our final major finding was that our IML, which consisted of the ICE and SHAP analyses, allowed for the interpretation of variables that acted as important factors in the conversion to dementia in each patient. Therefore, we suggest that our IML is an improved predictive algorithm that has both the high accuracy of ML and the advantage of the nomogram. Identifying the specific factors that influence conversion to dementia for each aMCI patient will be helpful for the development of personalized intervention strategies in the future.

To our knowledge, our study is the first to develop an IML algorithm to predict conversion to dementia in a large sample of well-phenotyped aMCI patients. Another strength of this study is that the IML algorithm was based on variables that are most commonly used in clinical practice, specifically neuropsychological test results and APOE genotype. However, this study has some limitations. First, MRI volumetry and cortical thickness, which are highly correlated with neurodegenerative dementia, were not used in this algorithm. Future studies incorporating structural brain MRI information are required to achieve higher predictive power. Second, since we did not perform amyloid and tau positron emission tomography in all participants, we could not determine the biomarker-guided diagnosis in our participants. Third, the number of samples used to train the model might not be large enough because of the limited number of subjects with 3 years of follow-up. Finally, since this study was conducted only at SMC, there is a limitation regarding the generalizability of the outcomes. External validation in an independent cohort should be conducted in the future. Nevertheless, our study is noteworthy in demonstrating that the IML algorithm is able to estimate the individual risk of conversion to dementia in each aMCI patient.

Conclusion

This study was able to develop an IML algorithm to predict conversion to dementia in aMCI patients. This IML algorithm is expected to be useful in clinical practice and the research field as it can identify the degree to which individual risk factors influence each patient.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by the Institutional Review Board of Samsung Medical Center. The patients/participants provided their written informed consent to participate in this study.

Author contributions

MC, CP, and JK: conceptualization and formal analysis and investigation. CP and JK: methodology. MC and CP: writing – original draft preparation. JJ, HJ, KK, and SS: writing – review and editing. SS: funding acquisition. KK and SS: supervision. All authors contributed to manuscript revision, read, and approved the submitted version.

Funding

This research was supported by a grant of the Korean Health Technology R&D Project, Ministry of Health and Welfare, Republic of Korea (HI19C1132); a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare and Ministry of Science and ICT, Republic of Korea (grant numbers: HU20C0111 and HU22C0170); the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (NRF-2019R1A5A2027340); Institute of Information and Communications Technology Planning and Evaluation (IITP) grant funded by the Korea government (MSIT) (No. 2021-0-02068, Artificial Intelligence Innovation Hub); Future Medicine 20*30 Project of the Samsung Medical Center (#SMX1220021); and the “National Institute of Health” research project (2021-ER1006-01).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2022.898940/full#supplementary-material

Keywords

Alzheimer’s disease, amnestic mild cognitive impairment, prediction algorithm, interpretable machine learning, artificial intelligence, clinical decision-support system, SHapley Additive exPlanations (SHAP)

Citation

Chun MY, Park CJ, Kim J, Jeong JH, Jang H, Kim K and Seo SW (2022) Prediction of conversion to dementia using interpretable machine learning in patients with amnestic mild cognitive impairment. Front. Aging Neurosci. 14:898940. doi: 10.3389/fnagi.2022.898940

Received

18 March 2022

Accepted

18 July 2022

Published

05 August 2022

Volume

14 - 2022

Edited by

Xiuqin Jia, Capital Medical University, China

Reviewed by

Lin Chen, Chongqing Institute of Green and Intelligent Technology (CAS), China; Jin San Lee, Kyung Hee University, South Korea

Copyright

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kyunga Kim, kyunga.j.kim@samsung.com; Sang Won Seo, sangwonseo@empas.com

†These authors have contributed equally to this work and share first authorship

‡These authors have contributed equally to this work

§ORCID: Min Young Chun, orcid.org/0000-0003-3731-6132; Chae Jung Park, orcid.org/0000-0002-1261-307X; Jonghyuk Kim, orcid.org/0000-0001-5496-0152; Jee Hyang Jeong, orcid.org/0000-0001-7945-6956; Hyemin Jang, orcid.org/0000-0003-3152-1274; Kyunga Kim, orcid.org/0000-0002-0865-2236; Sang Won Seo, orcid.org/0000-0002-8747-0122

This article was submitted to Alzheimer’s Disease and Related Dementias, a section of the journal Frontiers in Aging Neuroscience
