Predicting mild cognitive impairment among Chinese older adults: a longitudinal study based on long short-term memory networks and machine learning

Background Mild cognitive impairment (MCI) is a transitory yet reversible stage of dementia. Systematic, scientific and population-wide early screening system for MCI is lacking. This study aimed to construct prediction models using longitudinal data to identify potential MCI patients and explore its critical features among Chinese older adults. Methods A total of 2,128 participants were selected from wave 5–8 of Chinese Longitudinal Healthy Longevity Study. Cognitive function was measured using the Chinese version of Mini-Mental State Examination. Long- short-term memory (LSTM) and three machine learning techniques, including 8 sociodemographic features and 12 health behavior and health status features, were used to predict individual risk of MCI in the next year. Performances of prediction models were evaluated through receiver operating curve and decision curve analysis. The importance of predictors in prediction models were explored using Shapley Additive explanation (SHAP) model. Results The area under the curve values of three models were around 0.90 and decision curve analysis indicated that the net benefit of XGboost and Random Forest were approximate when threshold is lower than 0.8. SHAP models showed that age, education, respiratory disease, gastrointestinal ulcer and self-rated health are the five most important predictors of MCI. Conclusion This screening method of MCI, combining LSTM and machine learning, successfully predicted the risk of MCI using longitudinal datasets, and enables health care providers to implement early intervention to delay the process from MCI to dementia, reducing the incidence and treatment cost of dementia ultimately.


Introduction
With an increasing older adult population worldwide, geriatric health concerns cannot be ignored.Aging results in declining physical and cognitive functions, leading to a high risk of disability and death (Klimova et al., 2017).Distinguishing between pathological and normal cognitive decline, generally referred to as dementia or cognitive impairment, remains challenging.As an inevitable human phenomenon, aging is a significant factor in deteriorating cognitive function.With a global increase in life expectancy, older adults have an increased likelihood of developing dementia and cognitive impairment.The World Health Organization (WHO) stated that >55 million older adults had a diagnosis of dementia in 2021, with >139 million older adults estimated to be diagnosed with dementia in 2050 worldwide.In 2019, the annual cost of dementia-related treatment exceeded US $1.3 trillion (World Health Organization, 2021).China has the greatest population of people with dementia, comprising 25% of the global population.Aggregate expenditure on dementia in China reached US $195 billion in 2019 (Jia et al., 2020b;Mattap et al., 2022).
With no reversal therapies available, prevention of dementia remains a priority.Mild cognitive impairment (MCI), a risk factor for dementia, is considered a transitional stage between normal cognitive function and dementia, where there is objective cognitive decline but with a capacity to live independently.However, approximately 10-20% of older adults aged ≥65 years with MCI are diagnosed with dementia after 1 year (Langa and Levine, 2014).Delaying the progression of MCI to dementia is currently the most effective approach, as diverse treatments for MCI have proven to be effective and less costly (Langa and Levine, 2014;Anderson, 2019;Huang et al., 2022), with early identification and intervention in high-risk groups shown to prevent dementia onset in 40% of such cases.
Currently, screening techniques and questionnaires for MCI are limited.On account of the fact that neurodegenerative disease starts to develop many years before the symptoms are observed, while applying MCI screening to the population with normal cognitive function, imaging examinations, and fluid biomarkers can detect the neurodegenerative and pathological changes most accurately.Imaging techniques, such as magnetic resonance imaging (MRI), positron emission computed tomography (PET), and single photon emission computed tomography (SPECT), are capable of showing the tiny changes in brain structure, blood flow, metabolism, and neurotransmitters in patients with MCI.Nevertheless, due to the rarity and inaccessibility of these techniques for the general public, they cannot be used as a common screening tool for MCI (Dunne et al., 2021), with limited coverage in terms of MCI questionnaires [Mini-Mental State Examination (MMSE) and the Montreal Cognitive Assessment (MoCA)] that generally require a significant investment in manpower and their training.Therefore, an effective, systematic, and convenient MCI screening method to identify high-risk older adults in the general population is urgently needed.Effective screening could be conducive to targeted interventions for those at high risk of MCI.One study reported significant changes through implementing appropriate early intervention for potential patients in England, namely, an 8.5% decrease in the incidence of dementia and a reduction in dementia-related expenditure of approximately $180 million (Mukadam et al., 2020).Owing to the irreversible nature of dementia, treatment for patients with dementia places considerable financial and psychological pressure on families and caregivers (Chiao et al., 2015).Given the significant negative effects of dementia, it is critical to identify high-risk individuals at an early stage.
Some studies have adopted multiple perspectives to identify risk factors in people with MCI.A national cross-sectional study in China that comprised 46,011 older adults showed that MCI was associated with sociodemographic characteristics, including age, sex, parental history, education level, residence, and marital status (Jia et al., 2020a).Several cohort studies have shown a causal relationship between health status and behaviors that contribute to MCI.Chronic diseases, such as hypertension, stroke, and diabetes as well as harmful lifestyle behaviors, such as smoking and alcohol consumption, significantly increase the risk of MCI, while regular physical exercise, tea/coffee consumption, and playing Mahjong can prevent cognitive impairment (Kivipelto et al., 2018;Kakutani et al., 2019;Zhang et al., 2020Zhang et al., , 2022)).Owing to limitations in conventional regression methods in terms of collinearity potentially affecting predictors, some studies have applied machine learning based on imaging data or biomarkers to further determine whether an individual has MCI and to explore key features of MCI (Mirzaei et al., 2016;Wang et al., 2022;Alamro et al., 2023).However, most machine learning studies have only used single-wave panel data, and neurodegenerative disorders have a natural history of progression, thus ignoring the dynamic and longitudinal nature of these diseases, such that early identification and intervention could be sufficient.
Consequently, to address those deficiencies in previous studies, we used long short-term memory networks (LSTMs) in this study to capture the interdependence of predictors in longitudinal data.In combination with machine learning, it is possible to generate a model that can forecast the likelihood of conversion to MCI after several years.This model facilitates convenient and efficient screening for MCI and identification of risk groups for targeted intervention procedures.LSTMs are a form of recurrent neural network that address long-term dependencies and gaps between significant events in sequential data.Compared to traditional times series analysis like the Autoregressive Integrated Moving Average model (ARIMA), LSTMs models generally generate better outcomes in nonlinear and volatile time series data (Lou et al., 2022;Liu X. D. et al., 2023) despite the complexity of model interpretations and the long duration of model training.LSTMs were originally introduced into medically relevant applications to forecast the incidence and prevalence of diseases with considerable success during the COVID-19 pandemic (Borges and Nascimento, 2022;Gautam, 2022;Liu X. D. et al., 2023).Simultaneously, several studies have shown the feasibility of using LSTMs prediction in relation to individual characteristics in machine learning techniques to predict depression in older adults through applying longitudinal sequence data (Su et al., 2020;Lin et al., 2022).
No previous studies have used multiple sequence data waves to predict potential MCI in older Chinese adults.On the basis of the traits that LSTMs could effectively capture the temporal dependencies and trends of individual characters in longitudinal data from multiple data waves, and the capability that machine learning could extract important variables with significant trends related to MCI, therefore, this study assumes that the combination of LSTMs and machine learning could successfully identify the older adults at high risk for MCI and indicate instructions of implementing early interventions to prevent dementia.The data used in this study were Waves 5-8 (2008Waves 5-8 ( , 2011Waves 5-8 ( , 2014Waves 5-8 ( , 2018) ) Data, 1998Data, -2018)).Respondents in the CLHLS among the selected waves were randomly sampled from approximately half of the counties and city districts of China's 23 mainland provinces.The CLHLS questionnaire includes a wide range of instruments, such as the Mini-Mental State Examination (MMSE), the Center for Epidemiologic Studies Depression Scale, and the Self-Rating Anxiety Scale.Previous studies have confirmed that the design of questionnaire and quality of datasets are excellent (Gu, 2008;Zeng, 2012).
The Wave 5 questionnaire of the CLHLS was used to obtain baseline characteristics of the older adults, including 2,334 homebased interviewees who continuously responded until Wave 8.After excluding respondents lacking answers or records for cognition measurement, that is, the MMSE questionnaire in this study, and respondents who had been diagnosed with dementia in Waves 5-7 based on the their MMSE scores, 2,128 eligible participants were included in the ultimate data preprocessing and statistical analysis.

Assessment of MCI and outcome variables
The MMSE has been widely applied to screen for cognitive dysfunction among older adults.In the CLHLS questionnaires, the MMSE was modified into a Chinese version, including 24 items within six dimensions: five items for orientation (five points in total), one for naming (seven points in total, one point for naming each kind of food), three for registration (three points in total), five for attention and calculation (five points in total), three for recall (three points in total), and seven for language (seven points in total).The final cognitive function score was the sum of the scores of the six dimensions, with a possible total of 30 points.

Predictors
We considered three levels of individual characteristics to fit the LSTMs and machine learning models from Waves 5-8, namely (Supplementary Table 1), (i) sociodemographic characteristics, such as age, sex, geographical area, education level, marital status, residence, income level, and living status; (ii) health behavior factors, including active smoking, alcohol consumption, exercise, self-rated health [SRH], and sleep quality; and (iii) health status factors, such as a history of hypertension, diabetes, cardiopathy, stroke, chronic respiratory disease, cancer, or gastrointestinal ulcer.

Processing of missing values
In order to reduce the probability of bias during the imputation procedure, variables with >20% information were abandoned to guarantee good performance (Jakobsen et al., 2017).The ultimate predictors included from CLHLS Waves 5-7 were imputed utilizing a MICE package in R studio 4.2.3 software, applying multivariate iterative random forest ("RF" method) imputation algorithms with five iterations to produce datasets with the least variance compared with datasets being imputed before.

Statistical analysis
Statistical analyses were performed using Keras package (version 2.6.0)software for deep learning and Scikit-Learn package (version 1.1.2) for machine learning in Python (version 3.9) software.We randomly partitioned the data into three disjoint sets: training, testing, and validation, with proportions of 60, 20, and 20%, respectively.Details about hyperparameters of LSTMs and parameters of three machine learning models were listed in Supplementary Tables 2, 3.

The multivariate LSTMs models
Machine learning techniques are generally applied to panel data from a cross-sectional perspective, but are not able to capture features with time sensitivity.To forecast the development of predictors and explore potential outcomes, recurrent neural networks (RNNs) are used to capture the inputs of predictors from specific time periods and transfer information to subsequent time periods through combining the interdependence among predictors.However, traditional RNNs cannot cope with gradient vanishing and gradient exploding in longterm dependency issues owing to their simple neuron structure, whereas LSTMs can successfully handle these disadvantages in RNNs through the use of "forget gate" and the sigmoid function in each LSTMs unit.The LSTMs model has been validated as a powerful and precise model for forecasting time-series data in longitudinal studies.As shown in Figure 1, time-sensitivity predictors in CLHLS Waves 5-6 were randomly split such that 70% of the samples were used to train the LSTMs model to forecast the values of the predictors in Wave 7, and the remaining 30% of the samples were used to test our LSTMs model.The model was then fitted to CLHLS Waves 6-7 to forecast predictors in Wave 8, combining invariable features such as age, sex, education level, and geographical area that did not need to be predicted over time to constitute a new dataset.

Synthetic minority oversampling technique
Imbalanced data were a challenge for machine learning as the proportion of older adults with MCI was only 16.92% in this study.A common issue is that models tend to be biased toward the majority class, resulting in suboptimal performance.To address this problem, we applied the synthetic minority oversampling technique (SMOTE).SMOTE creates synthetic samples from the existing minority class through interpolation from its nearest neighbors, thereby increasing the number of minority samples in the datasets.

Gradient boosting decision tree (GBDT)
The GBDT is an ensemble machine learning approach for classification and regression based on the CART algorithm.The GBDT improves prediction accuracy through gradually improving estimation using a boosting method.In addition, the GBDT utilizes a nonlinear regression procedure to improve tree accuracy.A series of decision trees was created, which produced a set of weak prediction models and generated loss functions.The final classification model was the weighted sum of all weak prediction models through each round of training.

Extreme gradient boosting
XGBoost is a scalable and efficient implementation of gradient boosting, a popular machine learning technique that combines weak learners (typically decision trees) into a strong ensemble model.XGBoost offers several advantages over other gradient boosting frameworks, such as parallelization, regularization, and missing value handling.In addition, XGBoost can handle encoded categorical variables.

Random Forest algorithm
Random Forest (RF) is a machine learning technique that builds an ensemble of decision trees and aggregates their predictions.RF can handle both classification and regression problems, as well as categorical and numerical features.It also provides measures of feature importance and variable selection.RF introduces randomness in two ways: by bootstrapping the training data for each tree, and by selecting a random subset of features for each split.To analyze the ultimate result, each decision tree was accessed in the final decision to obtain a reliable result.Based on majority selection for all decision trees, each sample was classified into two classes.

Model assessment
To assess the outcomes of each machine learning model, we calculated the area under the receiver operating characteristic curve (ROC; AUC) and sensitivity (equation 1), specificity (equation 2), accuracy (equation 3), and balanced accuracy (equation 4).True positives and true negatives indicate older adults who were correctly identified as patients with MCI or the normal cognitive function group, respectively; false positives and false negatives indicate older adults who were inaccurately identified as patients with MCI or the normal cognitive function group, respectively.Each machine learning model could predict the probability of cognitive impairment in older adults.If the probability of an individual was greater than the threshold, then older adults were regarded as patients with MCI, and vice versa.To further evaluate and understand the prediction models, we calculated the net benefit of the machine learning models using decision curve analysis (DCA).This method indicated the proportion of patients who received a correct diagnosis minus the percentage of patients who were misdiagnosed under different threshold values.The predictors of the LSTMs model for older adults with MCI from CLHLS wave 5 to wave 8.

SHapley Addictive explanation models
For ensemble machine learning models applied in this study, the processes of their predictions are generally opaque.Unlike the traditional statistical models, it is difficult for people to understand their working mechanisms and certain positive or negative contributions of predictors to the outcomes.To address this problem, post-hoc interpretations of the model output should be proposed for machine learning studies.Based on the individual and joint contributions among players, Shapley values are a way of fairly allocating the payoff of a game in cooperative game theory, which was introduced into machine learning techniques to explain the attribution of each input feature toward the outcome.SHapley Addictive explanation models (SHAP) is able to be used to provide various types of visualized explanations for machine learning models, including global feature importance, feature interaction, and feature dependence.SHAP was performed in Python using shap package (Version 0.42.1) in this study and was used to visualize the importance of each predictor and the association between predictors and MCI quantitatively (Ekanayake et al., 2022).

Results
As presented in Table 1, 2,146 older adults in the baseline CLHLS wave of 2008 participated in this study (older adults with MCI, 17.29%).The median age of patients with MCI was 92 years (range, 86-97 years), which was 10 years older than that of older adults with normal cognitive function (82 years, range, 78-88 years).The proportions of older adult males (46.62%) and females (53.38%) were relatively equal, with approximately two-thirds of the participants with MCI being female.Of older adults with MCI, 71.67% were illiterate, and 75.28% were single older adults.Older adults with low or very low-income levels comprised the majority of participants with MCI.The percentage of individuals living alone was higher among those with normal cognitive function than among those with MCI.Only 13.61% of older adults regularly exercised among those with MCI.People with normal cognitive function generally rated their health and sleep quality as better than those with MCI.A higher percentage of older adults in the normal group had a diagnosis of hypertension.A total of 14.17% of older adults with a history of stroke had poorer MMSE scores.
For further descriptive analysis, odds ratios (ORs) for each predictor were evaluated using univariate and multivariate logistic regression analyses.Among sociodemographic variables, the analysis showed that age was a risk factor for MCI (adjusted OR [aOR] 1.123, 95% CI 1.103-1.143).Compared with literate older adults, illiterate older adults had a higher risk of developing MCI (aOR 1.641, 95% CI 1.199-2.247).Older adults with very low income levels had a higher risk of MCI than their wealthier counterparts (aOR 5.673, 95% CI 1.067-30.180).Among health behavior/health status variables, older adults who did not regularly exercise had a high risk of MCI (aOR 2.277, 95% CI 1.596-3.248).Older adults with poor or very poor selfrated health had a higher risk of MCI compared with those who had very good self-rated health (aOR 2.069, 95% CI 1.145-3.740and aOR 3.874, 95% CI 1.527-9.826,respectively).Moreover, older adults with no history of stroke had a reduced risk of MCI (aOR 0.515, 95% CI 0.347-0.776).
LSTMs model performance is illustrated in Figure 2. The mean squared errors of both the training and validation sets were generally equal (approximately 0.08) after 30 rounds of training, and the inflection points of both sets were close, indicating that the LSTMs model could be utilized to forecast characteristics of older adults three years later.Table 2 and Figure 3A shows the ROC curves and AUC values of the three machine learning models in the testing set (GBDT 0.902, 95% CI 0.879-0.925;XGBoost 0.928, 95% CI 0.908-0.948;and RF 0.938, 95% CI 0.919-0.956).Table 3 and Figure 3B shows the performance of the three models in the validation set.The AUC values of all three machine learning models in the test sets were >0.9.The three machine learning models produced equal results in the validation sets, indicating that they were robust models for classifying patients with MCI and healthy people.XGBoost had the highest and most balanced accuracy and the second-highest sensitivity using 0.3 as a threshold (Table 2), and RF produced the highest sensitivity under this condition.The DCA results (Figure 4) showed that the XGboost and RF models were close, within the range of 0-0.8, and the net benefit values were higher than 0.4 using 0.3 as a threshold.
Figure 5 illustrates the ranking of feature importance in MCI prediction.Age, education, and chronic respiratory disease were the first, second, and third-most important characteristics of older adults when predicting MCI in all three models, respectively.Younger literate older adults with no history of chronic respiratory disease had a lower probability of developing MCI.Self-rated health was also an important feature that presented a direct trend in MCI output.All three SHAP models indicated that having a gastrointestinal ulcer was one of the most important features for predicting potential MCI in patients; however, it did not show a clear tendency in relation to MCI progression.

Discussion
To our knowledge, this study is the first to forecast cognitive impairment in older Chinese adults using an LSTMs model and machine learning based on CLHLS Waves 5-8, with predictions that included sociodemographic health behaviors and health status characteristics.In total, 2,128 older adults were included in this study.Our LSTMs model produced robust results in the validation set; thus, it was capable of forecasting the feature values of older adults in the next wave using the SMOTE algorithm and three machine learning approaches that performed well in predicting MCI. Figure 6 depicts the conceptual framework discussed, summarizes the accuracy of the prediction models, presents the results, and presents multiple perspective values.
Regarding model precision, this prediction method combining LSTMs and machine learning can be successfully applied to longitudinal data to capture temporal information, thus improving the accuracy of MCI predictions in older adults (Chae et al., 2018;Wang 10.3389/fnagi.2023.1283243Frontiers in Aging Neuroscience 06 frontiersin.org, 2019;Su et al., 2020).To date, most studies have used LSTMs to forecast the prevalence and incidence rates or temporal trends in medical-related applications (Borges and Nascimento, 2022;Gautam, 2022;Liu X. D. et al., 2023).In addition, LSTMs have shown excellent performance when predicting high-dimensional data such as air and water pollution (Kim et al., 2022;Middya and Roy, 2022).Thus, building on previous LSTMs applications, some studies have used LSTMs to detect early health deterioration in individual clinical data (da Silva et al., 2021).Furthermore, the utilization of LSTMs to forecast individual features, followed by machine learning to construct predictive models, has been shown to be useful in disease prediction; for example, in the prediction of depression (Su et al., 2020;Lin et al., 2022) and in glaucoma assessment (Dixit et al., 2021).To date, no studies have utilized LSTMs and machine learning to establish a prediction model for MCI and explore its risk factors.Compared to the previous two prediction models using CLHLS, this study revealed relatively high accuracy and robustness with the AUCs of 0.902 to 0.938 for the test set and high sensitivity and specificity, and from 0.890 to 0.914 for the validation test.One longitudinal study proposed to use The Growth Mixed Model (GMM) and machine learning combination to forecast the MMSE trajectory of older adults.Due to the time effect bias for the application of constant baseline individual character in forecasting models, the AUCs of their models ranged from 0.51 to 0.66 in eight machine learning techniques (Wu et al., 2022).The other study utilized sociodemographic and life behavioral features of Chinese older adults to construct prediction models, achieving an accuracy of 0.7540 and the AUC of 0.8269 at maximum (Wang et al., 2022).To conclude, the outcomes of LSTMs and machine learning framework demonstrates the feasibility and effectiveness of the study hypothesis.Three decision tree-based models (GBDT, XGBoost, and RF) were used with SHAP to interpret individual predictions.Age, education level, chronic respiratory disease, gastrointestinal ulcers, and self-rated health were identified as the five most important predictors in this study.Age and education level have been reported in previous studies to be important predictors of MCI (Chun et al., 2022;Liu H. et al., 2023).Physiological decline in cognitive function is inevitable as people age (Langa and Levine, 2014) and age is a major predictor of MCI.Lower educational levels have been shown to be significantly associated with cognitive decline, and education in later life may also contribute to improved cognitive function (Peeters et al., 2020).According to our results, older adults with a formal education performed well in terms of MMSE scores.The other three features were not found to be strong predictors in other studies; however, they have all been shown to be closely associated with MCI.Older adults with no history of chronic respiratory disease are less likely to develop MCI.Common chronic respiratory diseases, such as chronic obstructive pulmonary disease and obstructive sleep apnea-hypopnea syndrome, lead to perennial hypoxia and hypercarbia (Olaithe et al., 2018), causing damage to brain functions, including language, execution, and attention.
Ultimately, cognitive function continues to decline under these pathological conditions.Gastrointestinal ulcers did not show a clear trend in Figure 5, whereas changes in metabolic substances in the gastrointestinal tract under pathological conditions are reported to impair brain function via the gut-brain axis (Zeng et al., 2022).Moreover, a healthy gastrointestinal tract can guard against cognitive decline and mitigate neuroinflammation (Xiang et al., 2022); hence, this result needs to be verified in another study.The SHAP analysis illustrated a positive correlation between self-rated health and MCI; that is, good self-rated health may represent good cognitive function and vice versa, which is consistent with previous cohort studies (Bond et al., 2006).MCI prediction models could provide references for clinical practice and bring broad benefits to society; however, they still need adjustment and practice to meet the standards for realworld application.When applied for MCI screening, the most appropriate prediction model requires striking a balance between sensitivity and specificity to achieve high precision and costeffectiveness.Consequently, it is critical to determine the threshold for identifying patients with MCI and conducting further interventions.As shown in Figure 4, the XGBoost prediction model had the greatest net benefit and balanced accuracy when the threshold probability was <0.6.When the threshold probability was 0.3, RF had the highest sensitivity and identifies most patients with MCI with relatively low costeffectiveness owing to the proportion of misdiagnoses.Determining the ultimate thresholds require constant evaluation and collaboration between governments and healthcare providers to obtain optimal clinical, economic, and social outcomes.
Ongoing application of this approach and cooperation can be viewed from three perspectives: the nation (macro), healthcare providers (medium), and individuals (micro).As a macroregulator, the government should enhance the utilization of big data and incorporate prediction models into various healthcare provider and public Internet platforms.This screening method could promote population health and reduce the disease burden.Various healthcare providers can select different thresholds in terms of specific medical conditions and testing technologies and change their criteria according to local prevalence and incidence.As psychiatric hospitals are generally equipped with adequate medical resources, the threshold for machine learning models could be relatively low to achieve suitable resource allocation.Once MCI predictive models become more sophisticated with continuous training and with more individual information available, such as risk genes or biomarkers, the threshold can be adjusted to pursue relatively high cost-effectiveness.In terms of the micro perspective, the general public could benefit through becoming more aware of their own and their families' risk of MCI through the application of this prediction model, avoiding additional examinations and ameliorating individual MCI risk.
This study contributes to the prevention of MCI and dementia.First, the combination of an LSTMs model and machine learning could precisely identify patients with MCI and their critical features several years earlier.Age, literacy level, chronic respiratory disease, gastrointestinal ulcers, and self-rated health were good predictors of MCI.Second, MCI prediction models have substantial clinical,   Finally, this study contributes to the prevention of dementia and MCI and promotes healthy aging.

Study limitations
This study had some limitations.First, we examined the robustness of both LSTMs and machine learning models and included four waves of data; however, our findings need to be validated in another cohort.Lacking external validation may affect the performance and adaptability of prediction models in different scenarios, as well as the confidence in the predictive ability of the models.Therefore, future researchers need to use other sources or types of data to validate this method framework and explore possibilities for improvement.Second, most predictors in this study were self-reported, which could have led to information bias.Third, the MMSE has a ceiling effect, meaning that it may not detect subtle changes in cognition that occur during MCI.Furthermore, MMSE scores could be affected by certain individual sociodemographic background factors (Arevalo-Rodriguez et al., 2021;Wu et al., 2022); therefore, MCI evaluations should be more comprehensive and include using Montreal Cognitive Assessment and the Clinical Dementia Rating evaluations, in addition to detecting biomarkers and undertaking imaging examinations for a more accurate clinical diagnosis in future studies.While this study proposes a convenient screening method using accessible individual features for the general public, outcomes obtained using this method are for reference only and cannot replace acknowledged MCI diagnosis standards.

Conclusion
This study showed that individual features could be predicted through combining LSTMs and machine learning models.The risk of MCI could be accurately predicted through exploring critical risk factors, such as age, education level, chronic respiratory disease, gastrointestinal ulcer, and self-rated health, in patients with MCI using three SHAP models among older Chinese adults based on four waves of CLHLS datasets.The combination of LSTMs and machine learning models captured Decision curve analysis.The x-axis indicates the threshold probability of MCI.The y-axis indicates the net benefit.Huang et al. 10.3389/fnagi.2023.1283243Frontiers in Aging Neuroscience 11 frontiersin.orgthe interdependence of predictors and generated an effective decision support system for healthcare providers to identify patients at high risk of MCI.With macro-direction undertaken at a governmental level, this screening method can continue to be optimized to obtain better thresholds for MCI screening.Our study findings may offer healthcare providers MCI screening support to implement early interventions to delay the progression from MCI to dementia, increase test availability among the population, and reduce incidence rates and treatment costs, ultimately contributing to healthy aging.Conceptual framework of discussion in this study.

FIGURE 2
FIGURE 2The training and validation curve of LSTMs from CLHLS wave5 to wave 7 (MSE, Mean squared error).

FIGURE 3
FIGURE 3Performance of machine learning models in test set (A) and validation set (B).

FIGURE 5
FIGURE 5Importance of predictors analysis by SHAP model.SHAP (SHapley Additive exPlanation) values are ranked by value of a feature to the predictions made by the GBDT/XGboost/RF.

TABLE 1
Predicted characteristics in 2018 and odds ratio of older adults with MCI.

TABLE 2
Performance of machine learning models in test set of predicting MCI among Chinese older adults.
GBDT, Gradient boosting decision tree; XGboost, Extreme gradient boosting; RF, Random Forest; AUC, Area Under the Curve; TP, True Positives; TN, True Negatives; FP, False Positives; FN, False Negatives.economic, and social value through optimizing prediction under governmental direction and adjusting thresholds for MCI probability according to the specific needs of different healthcare providers.

TABLE 3
Performance of machine learning models in validation set of predicting MCI among Chinese older adults.