Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Nutr., 14 January 2026

Sec. Nutrition, Psychology and Brain Health

Volume 12 - 2025 | https://doi.org/10.3389/fnut.2025.1680290

This article is part of the Research TopicImpact of nutrition on brain healthView all 26 articles

Interpretable machine learning models for predicting cognitive impairment using NHANES neuropsychological tests: nutritional and sociodemographic associations

Li SongLi Song1Chenlu LiChenlu Li2Xiaojiao Xiang
&#x;Xiaojiao Xiang3*Peijia Lin
&#x;Peijia Lin4*
  • 1Department of Neurology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
  • 2Department of Neurology, The First Affiliated Hospital Of Chongqing Medical University, Chongqing Key Laboratory of Major Neurological and Mental Disorders, Chongqing Key Laboratory of Neurology, Chongqing, China
  • 3Department of Nuclear Medicine, the Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
  • 4Department of Otolaryngology Head and Neck Surgery, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Children and Adolescents’ Health and Diseases, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing, China

Background: Early identification of individuals at risk for cognitive impairment is essential for timely intervention and public health planning. While sociodemographic and clinical predictors are well recognized, the role of nutrition and its interactions in cognitive health remains less explored.

Methods: Using data from the 2011–2014 National Health and Nutrition Examination Survey (NHANES, n = 2,208), we developed ensemble machine learning models (LightGBM, XGBoost, Random Forest) to predict cognitive impairment across three neuropsychological assessments (CERAD-WL, DSST, AFT). SHapley Additive exPlanations (SHAP) were applied to quantify and interpret the contribution of demographic, clinical, and nutritional predictors, as well as their interactions. To validate the nutrient interactions identified by our models, we conducted exploratory in vitro experiments assessing oxidative stress and neuroprotective pathways in SH-SY5Y neuronal cells.

Results: Ensemble models demonstrated excellent predictive performance, consistently outperforming traditional classifiers. Key predictors included education, age, socioeconomic status, and chronic disease conditions. Among nutritional factors, vitamin B2 emerged as consistently associated with lower predicted cognitive impairment risk across all three models, with notable interactions observed with copper and vitamin E. Exploratory in vitro experiments supported these associations, showing reduced oxidative stress and increased expression of neuroprotective genes (SIRT1, BDNF) under vitamin B2 treatment, particularly when combined with copper or vitamin E.

Conclusion: Interpretable machine learning models integrating cognitive tests with demographic, clinical, and nutritional variables can accurately predict cognitive impairment. Nutritional predictors, particularly vitamin B2 and its interactions, may contribute to model performance and biological plausibility, suggesting potential avenues for stratified monitoring strategies.

1 Introduction

With an aging population and increasing life expectancy, dementia has emerged as a major global health challenge, with its incidence and prevalence continuously rising (1, 2). It is estimated that by 2050, there will be more than 150 million people affected by dementia worldwide (3). Dementia is a progressive neurodegenerative disorder characterized by sequential decline in cognitive function, memory impairment, and behavioral changes, eventually leading to the loss of independent living ability (4). Therefore, early identification of dementia is crucial for timely intervention and precision treatment, which may help slow disease progression and improve patients’ quality of life (5). Current methods for early diagnosis include cognitive assessments, neuroimaging techniques, and biomarker detection (68). However, these approaches have notable limitations, such as low sensitivity and specificity of biomarkers, nonspecific early symptoms, and the high cost of PET scans (911). There is thus an urgent need for affordable, convenient, and highly specific early screening tools.

Neuropsychological testing plays a vital role in assessing cognitive function (12). These assessments are advantageous due to their low cost, ease of administration, and ability to evaluate multiple domains of cognitive decline (13). Although single tests have their limitations, combining multiple assessments with advanced statistical analyses can significantly enhance early detection and prognosis prediction of cognitive impairment (14). The CERAD Word List (CERAD-WL) test evaluates word learning and delayed recall and is highly sensitive to early dementia, but its performance may be influenced by educational and cultural background. The Digit Symbol Substitution Test (DSST) primarily measures processing speed and executive function, making it effective for identifying vascular cognitive impairment, though its specificity for dementia pathology is relatively low. The Animal Fluency Test (AFT) assesses semantic fluency, but is limited by language ability (1517). Using these three assessments in combination enables a comprehensive evaluation of cognitive changes across multiple domains. Developing an efficient and practical large-scale screening tool by integrating multiple cognitive tests represents a current and promising strategy for early prevention of cognitive impairment.

Accumulating evidence suggests that specific nutrients may influence neurocognitive resilience in later life. Among these, vitamin B2, copper, and vitamin E have garnered particular attention due to their close involvement in neuronal survival, antioxidative capacity, and synaptic function (1820). Taken individually, vitamin B2 (as the FAD/FMN cofactor) supports mitochondrial metabolism and the glutathione antioxidant cycle and has been linked to better cognitive performance (18); vitamin E, a membrane-phase chain-breaking antioxidant, is associated with lower dementia risk or slower functional decline (21, 22); and copper supports antioxidant enzymes such superoxide dismutase (23); while sufficient intake is necessary, excess exposure has been linked to faster cognitive decline (24, 25). Research indicates that these micronutrients often do not act in isolation but form a collaborative antioxidant network: copper-dependent enzymes initiate the clearance of superoxide anions, riboflavin-dependent flavoproteins sustain intracellular redox cycling capacity, and vitamin E halts the chain reaction of lipid peroxidation at the membrane level (23, 26, 27). These processes collectively contribute to mitochondrial stability, redox homeostasis, and the maintenance of neurotrophic signaling—such as SIRT1 and BDNF—pathways that are critical for memory, processing speed, attention, executive function, and semantic fluency (26, 2830). This implies that cognitive health may be influenced not only by individual nutrients but also by the interactive patterns among them.

However, most population-based studies continue to examine nutrients in isolation, seldom directly testing nutrient-nutrient synergies (31). This approach is suboptimal in practice, as nutrients often coexist within dietary patterns and are closely linked to socioeconomic status and comorbidity profiles (32). Traditional regression models are further limited by high collinearity among nutritional variables (33). Thus, although vitamin B2, copper, and vitamin E have each been linked to mitochondrial support, antioxidative stress, neuroprotection, and neuroinflammatory regulation, it remains unclear which nutrient combinations carry the most informative value for cognitive screening in the community.

In this study, we aim to address this gap. We integrated multidomain cognitive performance (CERAD-WL, DSST, AFT) with sociodemographic, clinical, and dietary nutrient data from the 2011–2014 National Health and Nutrition Examination Survey (NHANES), a nationally representative U.S. health and nutrition surveillance program that systematically captures adults’ health status and dietary intake (3436). We trained multiple machine learning models and applied SHapley Additive exPlanations (SHAP) to: (i) quantify the relative contribution of established risk factors (age, educational attainment, poverty-income ratio, hypertension, diabetes); (ii) identify nutrition-related predictive features, particularly vitamin B2; and (iii) reveal potential interaction patterns between vitamin B2 and copper and vitamin E. Furthermore, motivated by these model-derived interactions, we conducted exploratory in vitro experiments in human SH-SY5Y neuroblastoma cells to examine whether vitamin B2 alone, or in combination with copper or vitamin E, modulates oxidative stress levels and neuroprotective pathways. Taken together, this approach is intended to support a practical early screening strategy for cognitive risk: one that can be used at the community level, is grounded in biology, and also identifies nutritional factors that people could realistically change.

2 Materials and methods

2.1 Data source and study population

This study used publicly available data from the NHANES, covering two continuous cycles (2011–2014, total sample N = 19,931). NHANES employs a multistage, stratified probability sampling design and is nationally representative of the non-institutionalized U.S. population’s health and nutritional status.

Layered inclusion/exclusion strategy:

1. Cognitive tests: Required complete results for CERAD-WL, DSST, and AFT.

2. Dietary data: Required a 24-h dietary recall. Participants missing any cognitive test or dietary intake were excluded, yielding the initial analytic sample N = 2,461.

3. Predictor set for community applicability: To simplify the model and enhance generalizability, predictors were limited to interview-obtainable covariates. Exclusions at this step were triggered by missing data in the covariates shown in Figure 1 (education, PIR, BMI, hypertension, smoking, diabetes, alcohol use), removing n = 253 and resulting in the final sample N = 2,208.

Figure 1
Flowchart depicting participant selection from NHANES 2011-2014. Initial participants numbered 19,931. Excluded were those without CERAD (16,808), DSST (149), AFT (40), and with missing dietary data (221), resulting in 2,461. Further exclusions for education (2), BMI (208), poverty (204), hypertension (4), smoking (2), diabetes (1), and alcohol use (257) led to a final inclusion of 2,208 participants.

Figure 1. Study participant selection flowchart.

The sample size at each step and participant flow are shown in Figure 1. Reporting follows STROBE/TRIPOD recommendations (37, 38).

As NHANES data are publicly available and fully de-identified, and its study protocol was approved by the NCHS Research Ethics Review Board, this secondary data analysis did not require additional ethical approval (39).

2.2 Outcome definition: cognitive impairment

In NHANES 2011–2014, cognition was assessed with three brief screening tests—CERAD, DSST, and AFT—which sample distinct domains (episodic memory, processing speed/executive attention, and semantic fluency, respectively) (1517). Consistent with NHANES analytic/reporting practice and prior NHANES studies, we operationalized low cognitive performance per test as a score at or below the 25th percentile of that test’s distribution in our analytic sample (40). We chose this threshold a priori based on the existing literature, where the lowest-quartile rule is commonly used to flag low performance in population-based screening contexts and to maintain comparability across sociodemographic strata in a multiethnic survey (41). These instruments are intended for epidemiologic screening rather than clinical diagnosis, and fixed raw-score cutoffs vary by age and education without a single standard applicable to the multiethnic NHANES population; therefore, we did not apply diagnostic cutoffs. Analyses were conducted separately for each test-defined outcome.

2.3 Dietary intake assessment

Dietary intake was assessed using two 24-h dietary recall interviews recalls administered with the USDA Automated Multiple-Pass Method. Each recall covered the day before the interview (00:00 – 24:00; midnight-to-midnight). The first recall was conducted in person at the Mobile Examination Center, and the second was completed via telephone interview 3–10 days later. To approximate usual intake, we used the mean of the two recalls; when only one recall was available, the Day 1 value was used instead (42). In our analytic sample, 92.35% of participants completed one recall and 7.65% completed two recalls.

A total of 48 dietary nutrients were included in the analysis, covering energy intake, macronutrients, micronutrients, and other dietary components. Specifically, the categories were as follows: (1) Energy intake: total energy; (2) Macronutrients: protein, carbohydrates, total sugars, dietary fiber, total fat, saturated fatty acids, monounsaturated fatty acids, polyunsaturated fatty acids, and cholesterol; (3) Vitamins: vitamin A and its subtypes (retinol, α-carotene, β-carotene, β-cryptoxanthin), vitamin D (D2 and D3), vitamin E (as α-tocopherol), vitamin K, vitamin C, and B-complex vitamins (vitamin B1, B2, B6, B12, niacin, folate and its forms, and total choline); (4) Minerals: calcium, phosphorus, magnesium, iron, zinc, copper, sodium, potassium, and selenium; (5) Other dietary components: caffeine, theobromine, alcohol, and moisture intake (including plain water, other beverages, and the water content of foods—assessed by a 24-h dietary recall). All nutrient intake levels were expressed as mean ± standard deviation (Mean ± SD) to describe the distribution characteristics of dietary intake in the population.

2.4 Model development and evaluation

We developed predictive models using nationally representative cross-sectional data from the 2011–2014 NHANES cycles (initial sample size N = 19,931). We first restricted the sample to participants who had completed all three cognitive assessments (CERAD-WL, DSST, and AFT) and who had at least one valid 24-h dietary recall interview, yielding 2,461 eligible participants. We then further required complete information on key interview-based sociodemographic and clinical variables (including age, sex, race/ethnicity, education, PIR, BMI, smoking status, drinking status, and self-reported diabetes and hypertension), resulting in a final analytic sample of N = 2,208.

To enhance feasibility in community screening and routine clinical encounters, we intentionally limited candidate predictors to variables that can be obtained through brief interview or standard self-report (sociodemographic characteristics, lifestyle behaviors, and common chronic conditions), supplemented by nutrient intake derived from 24-h dietary recall. All continuous predictors were standardized using z-score transformation (subtracting the training-fold mean and dividing by the training-fold standard deviation), and the same scaling parameters were applied to the corresponding validation/test folds during cross-validation. This approach was used to improve numerical stability across variables with different scales and to prevent information leakage. To reduce redundancy and multicollinearity among predictors, we computed variance inflation factors (VIFs) and iteratively removed predictors with VIF > 3, refitting after each step, in line with established guidance on handling multicollinearity (43).

Because cognitive impairment represented the minority class, we addressed class imbalance within each training fold using a combination of synthetic minority oversampling (SMOTE) and random undersampling of the majority class (44). This strategy was intended to improve the model’s ability to identify cognitively impaired individuals.

We compared seven machine learning models with diverse inductive biases and algorithmic characteristics: (1) Random Forest, (2) LightGBM, (3) K-Nearest Neighbors (K-NN), (4) Naive Bayes, (5) Support Vector Machine (SVM) and (6) XGBoost. All models followed a unified training procedure: (i) stratified 5-fold cross-validation was applied to preserve the proportion of cognitively impaired vs. non-impaired cases in each fold, thereby improving the stability and representativeness of performance estimates; and (ii) hyperparameters were optimized using grid search.

To comprehensively assess predictive performance, we adopted a multidimensional evaluation framework: (1) Discriminative ability was quantified using overall accuracy and the area under the receiver operating characteristic curve (AUC-ROC); (2) Clinical utility was evaluated using sensitivity and specificity, reflecting the model’s ability to correctly identify true positives (cognitive impairment) and exclude true negatives (no impairment), respectively; and (3) Predictive reliability was evaluated using the Fβ score and the area under the precision–recall curve (AUC-PR), both of which are particularly informative in class-imbalanced settings. The Fβ score balances precision and recall (with β = 1 corresponding to the F1 score), while the AUC-PR reflects the stability of positive-class detection in the minority (impaired) group (45).

2.5 Model explainability

In terms of model interpretation, we conducted an in-depth analysis of the best-performing model using SHAP (46). By calculating SHAP values for each sample, we generated three core outputs: (1) a global feature importance ranking based on the mean absolute SHAP values; (2) individualized explanation plots (force plots) illustrating the key drivers of individual predictions; and (3) an interaction effect heatmap revealing relationships between features. All analyses were performed in a Python 3.8 environment. Machine learning models were implemented using scikit-learn version 1.0.2, SHAP analyses were conducted using version 0.41.0, and visualizations were generated with Matplotlib 3.5 and Seaborn 0.11. The overall analytical pipeline—including characterization of missingness using Little’s MCAR test, applying an individual-level complete-case strategy within each analysis (i.e., excluding only participants missing a required variable rather than discarding the variable itself), controlling multicollinearity via VIF and confirming feature contribution using Boruta, performing z-score standardization of continuous variables based on parameters estimated from the training set, and interpreting the final models using SHAP—was predefined prior to model training. This design was intended to obtain stable and clinically meaningful predictions without relying on the assumption that individual predictors follow a normal distribution. The study followed the TRIPOD guidelines to ensure completeness and transparency in reporting the prediction model (38).

2.6 In vitro validation of key nutrient effects

Human SH-SY5Y neuroblastoma cells cell lines were purchased from the Type Culture Collection of the Chinese Academy of Sciences (Shanghai, China) (47). Cells were cultured in DMEM/F12 (1:1, Gibco, Cat#11220033) + 10% heat-inactivated FBS (FBS, H clone, Cat# SV30087.03) + 1% penicillin at 37 °C, 5% CO₂, and used at passages 5–20. To validate model-motivated neuroprotective hypotheses, cells were pre-incubated for 6 h with vitamin B2 (5 μM), vitamin E (20 μM), and/or CuSO₄ (5 μM) (4850). Unless otherwise specified, pretreatments were maintained during the oxidative challenge to mimic continuous exposure.

Cells were pre-loaded with DCFH-DA (10 μM, 30 min, 37 °C, serum-free, protected from light), washed twice with PBS, and transferred to phenol-red- and pyruvate-free medium. Oxidative stress was then elicited by freshly diluting H2O2 to 200 μM and incubating for 2 h (no wash after addition) (51). Fluorescence (Ex/Em 485/535 nm) was recorded on a plate reader either kinetically (every 2–5 min for 30–60 min) or at a fixed endpoint (15–30 min). Background (dye-free wells) was subtracted, and signals were normalized to cell content by SRB staining or to total protein by BCA in matched wells. t-BHP (50–100 μM) served as a positive control in selected runs. Non-cytotoxicity at working doses was verified on parallel plates using CCK-8. Where indicated, NAC (2–5 mM) or catalase (200–500 U/mL) were included as antioxidant controls to confirm ROS-dependent signals.

Total RNA was extracted with TRIzol and treated with RNase-free DNase I. RNA quantity and purity were assessed (A260/280 ≈ 2.0; A260/230 > 1.8). One microgram of RNA was reverse-transcribed with random hexamers; minus-RT controls (no reverse transcriptase) were prepared in parallel. RT-qPCR was performed using SYBR Green chemistry in 10–20 μL reactions (primer 0.2–0.5 μM; 10–50 ng cDNA per well) on a 96-well instrument with the following program: 95 °C 2 min; 40 cycles of 95 °C 15 s and 60 °C 30 s; followed by a melt curve (65–95 °C, 0.5 °C increments). No-template controls (NTC) were included for each primer pair. Primer specificity and efficiency were verified by single-peak melt curves, a single band of the expected size (agarose gel), and 5-point standard curves (efficiency 90–110%, R2 ≥ 0.99). GAPDH served as the reference gene (stability verified under the present treatments); relative expression was calculated by 2^-ΔΔCt. Human primer sequences were: SIRT1 F: TAGACACGCTGGAACAGGTTGC, R: CTCCTCGTACAGCTTCACAGTC; BDNF (exon IX, total) F: ACTCTGGAGAGCGTGAATGG, R: CGTAGAAGTATTGCTTCAGTTGG; GAPDH F: GTCTCCTCTGACTTCAACAGCG, R: ACCACCCTGTTGCTGTAGCCAA (52, 53). Experiments used ≥3 biologically independent replicates, each measured in technical triplicate (technical replicates averaged per sample).

2.7 Statistical analysis

We performed all statistical analyses to compare baseline characteristics between participants with and without cognitive impairment. Continuous variables were summarized as mean ± standard deviation (Mean ± SD), and categorical variables were summarized as counts and percentages [n (%)]. Between-group comparisons were conducted using: (i) chi-square tests for categorical variables; (ii) t-tests or analysis of variance (ANOVA) for continuous variables that were approximately normally distributed; and (iii) Kruskal–Wallis H tests for continuous variables that were not normally distributed.

To compare predictive performance across machine learning models, we analyzed model evaluation metrics (accuracy, Fβ score, sensitivity, specificity, AUC-PR, and AUC-ROC) across models. For metrics that were approximately normally distributed, we applied one-way ANOVA; for metrics that were non-normally distributed, we used the Kruskal–Wallis H test. When an overall difference across models was detected (p < 0.05), we performed post hoc pairwise comparisons with appropriate correction for multiple testing (Tukey’s honestly significant difference test following ANOVA, or Dunn-type procedures following Kruskal–Wallis).

For in vitro experiments, data from at least three biologically independent replicates are presented as mean ± standard deviation (Mean ± SD). Group differences were assessed using one-way ANOVA followed by Tukey’s post hoc test.

3 Results

3.1 Baseline characteristics of participants

A total of 2,208 participants were included in this study, among whom 1,084 (49.09%) were female, with a mean age of 69.23 ± 6.71 years. The baseline characteristics of the participants, grouped by cognitive status based on the AFT, are detailed in Table 1. DSST and CERAD-WL by group are provided in Supplementary Tables 1, 2.

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of participants by AFT.

3.2 AFT

3.2.1 Model discriminative performance

This study systematically evaluated the predictive performance of seven machine learning models for cognitive impairment using the AFT (Table 2; Figures 2AF). The results showed that the Random Forest model demonstrated optimal overall performance, achieving an accuracy of 93.5%, an AUC-ROC of 0.981, while maintaining high sensitivity (0.983) and specificity (0.867). Both LightGBM and XGBoost models also exhibited excellent performance, with accuracy reaching 92.2% and AUC exceeding 0.965, though with slightly lower specificity. While the K-NN model achieved the best specificity (0.996), its overall predictive efficacy was compromised by relatively low sensitivity (0.729). The Naive Bayes model performed poorly, with accuracy of only 55.7% and sensitivity as low as 0.366. SVM demonstrated moderate but stable predictive performance (85.7% accuracy). All inter-model performance differences were statistically significant (p < 0.001). Comprehensive analysis indicates that ensemble learning methods (particularly Random Forest) achieved the best balance between sensitivity and specificity, providing a reliable algorithmic choice for AFT-based cognitive impairment screening.

Table 2
www.frontiersin.org

Table 2. Performance comparison of seven machine learning models in predicting cognitive impairment based on the AFT.

Figure 2
Panel A displays two plots: a ROC curve with sensitivity versus 1-specificity for different models, and a box plot of AUC values with model comparison using Kruskal-Wallis test. Panel B shows a box plot for accuracy across models, analyzed by ANOVA. Panel C includes another ROC curve and box plot for AUC with ANOVA analysis. Panel D presents an F-Beta score box plot. Panel E shows specificity values across models. Panel F displays sensitivity values, both analyzed by ANOVA. Each panel compares six models: Random Forest, Light GBM, K-NN, Naive Bayes, SVM, and XGBoost.

Figure 2. Machine learning model performance on AFT-based cognitive impairment prediction. (A) Receiver operating characteristic curve with area under the curve values. (B) Classification accuracy across models. (C) Area under the precision-recall curve. (D) Fβ score (β = 1) comparison. (E) Model specificity in identifying healthy controls. (F) Model sensitivity in detecting cognitive impairment.

3.2.2 Variable importance and interpretation

3.2.2.1 Bee swarm plot

In the XGBoost classification model for the AFT, SHAP analysis identified the top 15 features most strongly associated with predicted cognitive impairment outcomes (Figure 3A), ranked by descending mean SHAP values: Race/Ethnicity 0.0766; Education 0.0760; Hypertension 0.0650; Sex 0.0499; Drinking status 0.0425; Diabetes 0.0297; Age 0.0200; PIR 0.0173; Moisture 0.0148; Caffeine 0.0144; Zinc 0.0132; Sodium 0.0121; Total sugar 0.0115; BMI 0.0101; Vitamin B2 0.0099.

Figure 3
Panel A features two charts: a scatter plot and a bar chart, both displaying SHAP values for various factors impacting a model (e.g., race, education, hypertension). Panel B includes scatter plots correlating SHAP values with factors like moisture, caffeine, zinc, and others, color-coded by specific attributes. Panel C depicts decision plots for predicting cognitive impairment, comparing no cognitive impairment and cognitive impairment scenarios, showing feature contributions.

Figure 3. SHAP analysis for cognitive impairment prediction using the AFT. (A) Top 15 features ranked by mean absolute SHAP values. (B) Dependence plots illustrating nonlinear relationships and interactions between key nutritional predictors and SHAP values. (C) Individual prediction explanation using force plots.

The results indicated that race/ethnicity and education level had the strongest associations with model predictions, with contributions greater than those of other factors. Hypertension, sex, and drinking status formed the second group of variables with notable associations. Among metabolic-related factors, diabetes showed stronger associations than BMI, while nutritional factors overall contributed smaller associations, with moisture and caffeine being most prominent. For sodium and total sugar, higher values predominantly show negative SHAP contributions, indicating lower predicted impairment risk after adjustment for covariates; this pattern is consistent with Table 1, where participants with cognitive decline exhibit lower mean sodium and total sugar. Overall, AFT scores were not only associated with nutritional intake patterns but also with socioeconomic status and chronic disease conditions, emphasizing the influence of multiple factors.

3.2.2.2 Dependency plot

The SHAP analysis based on the XGBoost model (Figure 3B) indicated differential associations of nutrition-related variables with predicted cognitive impairment. Moisture, caffeine, zinc, sodium, total sugar, and vitamin B2 showed negative correlation trends with predicted risk. The association between higher moisture intake and lower predicted risk appeared stronger in participants with lower education levels. Zinc demonstrated an interaction with iron: lower zinc was associated with higher predicted risk when iron was also low, whereas higher zinc was associated with lower predicted risk when iron was high. In hypertensive participants, the association between caffeine intake and lower predicted risk was more evident. Among diabetic participants, lower sodium intake was associated with higher predicted risk, while the association was weaker in non-diabetic individuals. In alcohol-consuming participants, higher sugar intake was paradoxically associated with lower predicted risk. The association between vitamin B2 and lower predicted risk appeared stronger under higher copper levels.

3.2.2.3 Force plot

Analysis based on SHAP force plots showed that under AFT assessment (Figure 3C), the model predicted a 95.4% probability of no cognitive impairment for this individual, which was substantially higher than the baseline probability of 38.8%. Among the important features associated with this prediction, education level (+0.105), race (+0.0904), and gender (+0.052) were the main contributors toward predictions of normal cognition. The predicted probability of cognitive impairment was only 4.62%, below the baseline probability of 61.2%. Notably, the contribution of this individual’s nutritional characteristics (including vitamin B12 and zinc levels) accounted for less than 5% of the model output, suggesting that in individuals with favorable sociodemographic profiles, the influence of nutritional factors may be comparatively smaller relative to higher-level social determinants.

3.3 CERAD-WL

3.3.1 Model discrimination performance

Baseline characteristics of participants grouped by CERAD-WL cognitive status are provided in Supplementary Table 1. Multiple machine learning models were constructed based on the CERAD-WL scale to predict the risk of cognitive impairment. The performance metrics of different models are as follows (Table 3; Figures 4AF).

Table 3
www.frontiersin.org

Table 3. Model performance comparison on CERAD-WL cognitive impairment screening.

Figure 4
Group of figures comparing different machine learning models using various metrics.A: ROC curves with corresponding AUC values. B: Box/strip plot of classification accuracy across models. C: Box/strip plot of PR-AUC. D: Box/strip plot comparing Fβ scores (β=1; F1-score). E: Box/strip plot of model specificity for identifying healthy controls. F: Box/strip plot of model sensitivity for detecting cognitive impairment.

Figure 4. Performance metrics of machine learning models for CERAD-WL-based cognitive impairment detection. (A) Receiver operating characteristic curve with area under the curve values. (B) Classification accuracy across models. (C) Area under the precision-recall curve. (D) Fβ score (β = 1) comparison. (E) Model specificity in identifying healthy controls. (F) Model sensitivity in detecting cognitive impairment.

Among all models, LightGBM showed the best internal discrimination (AUC-ROC 0.996) and recall of impaired individuals (sensitivity 0.999), while maintaining acceptable specificity (0.923). This indicates that, within NHANES, the model can almost always flag individuals classified as impaired by CERAD-WL while still excluding most unimpaired individuals. The XGBoost model also demonstrated high performance, with all metrics approaching or exceeding 0.98. The Random Forest model achieved an accuracy of 0.976, an AUC-ROC of 0.999, and a sensitivity of 1.000, but its specificity was relatively lower (0.856). The SVM and K-KNN models also performed well, with accuracies of 0.966 and 0.939 and AUC-ROCs of 0.990 and 0.991, respectively. In contrast, the Naive Bayes model performed significantly worse, with an accuracy of only 0.490, a sensitivity of 0.431, and a specificity of 0.785.

The differences in major metrics between models were all statistically significant (p < 0.001). Accuracy, F-beta score, sensitivity, specificity, and AUC-PR were tested using ANOVA, while AUC-ROC was tested using the Kruskal–Wallis test. Overall, ensemble algorithms such as LightGBM and XGBoost demonstrated extremely high accuracy and stability in predicting cognitive impairment based on the CERAD-WL test.

3.3.2 Variable importance and interpretation

3.3.2.1 Bee swarm plot

The SHAP bee swarm plot revealed the relative importance of each variable in the predictive model (Figure 5A). Among all variables, age had the highest mean absolute SHAP value (0.0295), showing the strongest association with model predictions. Hypertension (0.0263) and education (0.0238) also showed notable associations. BMI (0.0192), diabetes (0.0188), and sex (0.0170) demonstrated moderate associations.

Figure 5
Panel A shows a SHAP summary plot with age, hypertension, and education as top predictors. Panel B includes scatter plots highlighting Vitamin C, moisture, theobromine, copper, Vitamin D, and Vitamin E related to SHAP values across various factors. Panel C presents SHAP waterfall charts for cognitive impairment predictions, indicating feature impacts on prediction scores.

Figure 5. Interpretable machine learning analysis for CERAD-WL-based cognitive impairment prediction. (A) Feature importance ranking, (B) nonlinear predictor effects and interactions, (C) case-specific prediction interpretation.

Nutrition-related factors, such as vitamin C (0.0122), moisture (0.0117), theobromine (0.0093), copper (0.0091), vitamin D (0.0087), drinking status (0.0081), vitamin E (0.0066), vitamin B12 (0.0061), and vitamin B2 (0.0053), had relatively lower mean SHAP values, indicating smaller but still observable associations with model predictions. Overall, sociodemographic variables-particularly age and hypertension-were the strongest predictors in the model, while various micronutrients also showed contributory associations with the model’s discriminative ability.

3.3.2.2 Dependency plot

Analysis of SHAP dependence plots uncovered complex nonlinear associations between key predictors and predicted cognitive impairment risk (Figure 5B). The relationship between vitamin E and vitamin B2 showed an interaction, where higher vitamin B2 levels were associated with stronger contributions of vitamin E in the model, suggesting a potential synergistic pattern. In hypertensive populations, higher vitamin D levels were more strongly associated with predictions of no cognitive impairment. Other features, such as vitamin C, moisture, theobromine, and copper, showed weaker associations, but their effects varied depending on age, smoking status, or diabetes. Smoking appeared to modify the association of water intake with model predictions, while the SHAP contribution of theobromine was influenced by diabetes status. In older adults, higher copper levels were associated with higher predicted risk, whereas the association of vitamin C with lower predicted risk appeared weaker.

3.3.2.3 Force plot

Using the predictive model based on the CERAD-WL test, we employed SHAP force plots to interpret the predictions for “no cognitive impairment” and “cognitive impairment” in the same subject(Figure 5C).

For the “no cognitive impairment” prediction, the subject’s baseline predicted value (E[f(x)]) was 0.743, and the final predicted value was 0.992, suggesting performance consistent with no cognitive impairment on the CERAD-WL test. The main features associated with this prediction and their corresponding SHAP values were: age, education, BMI, vitamin C, and moisture.

For the “cognitive impairment” prediction, the subject’s baseline predicted value was 0.257, and the final predicted value decreased to 0.00842, indicating that the model predicted a very low probability of cognitive impairment. In this subject, the model yielded highly consistent results across the CERAD-WL test predictions, suggesting normal cognitive function.

3.4 DSST

3.4.1 Model discrimination performance

Participants’ baseline characteristics grouped by DSST cognitive status are provided in Supplementary Table 2. In the prediction of cognitive impairment based on the DSST, various machine learning models demonstrated distinct performance characteristics (Table 4; Figures 6AF). Ensemble learning methods exhibited superior performance. The Random Forest, LightGBM, and XGBoost models achieved accuracies of 0.877, 0.875, and 0.873, respectively, with AUC-ROC approaching or exceeding 0.94. These models maintained high sensitivity (>0.90; 0.923, 0.920, and 0.906, respectively) while preserving specificity between 0.811–0.826. Furthermore, all three models demonstrated excellent predictive stability, with AUC-PR above 0.95.

Table 4
www.frontiersin.org

Table 4. DSST-based cognitive impairment prediction: model performance comparison.

Figure 6
Six-panel image comparing machine learning models. Panel A shows ROC curves for sensitivity vs. specificity for Random Forest, LightGBM, K-NN, Naive Bayes, SVM, and XGBoost. Panel B compares models based on accuracy using ANOVA. Panel C displays PR-AUC with ROC curves, while Panel D evaluates the F-beta score. Panel E assesses specificity, and Panel F focuses on sensitivity, both analyzed with ANOVA. Each panel uses color-coded dots for model comparison.

Figure 6. Clinical performance assessment of machine learning models using DSST. (A) Diagnostic discrimination (ROC curves), (B) overall accuracy, (C) positive predictive value (PR curves), (D) balanced metric (F1-score), (E) healthy control identification, (F) cognitive impairment detection.

In contrast, the SVM model showed moderate performance (accuracy = 0.829, AUC-ROC = 0.903, sensitivity = 0.862), albeit with slightly lower specificity (0.782). The K-KNN algorithm achieved comparable accuracy (0.827) but displayed lower sensitivity (0.757) offset by higher specificity (0.927). The Naive Bayes classifier performed substantially worse than other models across all metrics, with particularly low accuracy (0.666), AUC-ROC (0.783), and sensitivity (0.550).

Statistical comparisons revealed significant between-model differences in all performance metrics (p < 0.001). Post-hoc analysis for the DSST models revealed that the ensemble methods (Random Forest, LightGBM, XGBoost) did not differ significantly from one another in accuracy or AUC-ROC (p > 0.05). However, all three ensemble models significantly outperformed the K-KNN, Naive Bayes, and SVM models (p < 0.01 for all pairwise comparisons). The K-KNN and SVM models also performed significantly better than the Naive Bayes model across all primary metrics (p < 0.001). These findings collectively demonstrate that ensemble algorithms (Random Forest, LightGBM, XGBoost) provide more accurate and robust prediction of cognitive impairment from DSST performance compared to conventional machine learning approaches.

3.4.2 Variable importance and interpretation

3.4.2.1 Bee swarm plot

SHAP analysis revealed significant differences in the contributions of various variables to the cognitive impairment prediction model (Figure 7A). Among all features, education had the highest mean absolute SHAP value (0.108), showing the strongest association with model predictions. The PIR (0.053) and age (0.040) also showed strong associations, underscoring the role of socioeconomic status and age in the model’s outputs. In addition, sex (0.038), hypertension (0.036), diabetes (0.036), and race/ethnicity (0.033) were also associated with notable contributions. Moisture (0.027) showed a moderate association.

Figure 7
Panel A shows SHAP values indicating the impact of various features like education and poverty-to-income ratio on predictions. Panel B consists of several scatter plots with SHAP values plotted against features such as moisture, vitamin B2, caffeine, sodium, vitamin E, and vitamin D. Color intensity indicates related factors like hypertension and diabetes. Panel C shows force plots for predicting cognitive impairment, illustrating the contribution of features like education and poverty-to-income ratio. Each feature's impact varies for scenarios with and without cognitive impairment.

Figure 7. Clinically interpretable features of DSST-based cognitive screening models. (A) Key predictor ranking, (B) metabolic and sociodemographic interactions, (C) individual case risk profiling.

Nutritional and lifestyle-related variables such as vitamin B2 (0.012), vitamin E (0.012), caffeine (0.011), drinking status (0.011), sodium (0.011), vitamin D (0.007), and zinc (0.007) had relatively lower mean SHAP values, indicating smaller but still observable associations with model predictions. However, these variables may still play a role in shaping predictions for specific individuals.

Overall, the results suggest that socioeconomic and clinical factors were more strongly associated with the model’s predictive outputs, whereas nutritional and lifestyle factors contributed supplementary associations under certain conditions.

3.4.2.2 Dependence plots

The dependence visualization revealed several critical insights into cognitive impairment risk prediction (Figure 7B). SHAP dependence analysis showed that higher levels of moisture were associated with lower predicted cognitive impairment risk. Moreover, in individuals with hypertension, an interaction was observed between moisture and hypertension, with stronger associations in model predictions. Vitamin E also showed an interaction with diabetes, contributing jointly to prediction patterns among diabetic individuals. Vitamin B2 showed an overall negative association with predicted cognitive impairment risk, with stronger associations in individuals with lower education levels, while higher education appeared to attenuate the independent contribution of vitamin B2. Higher caffeine intake was associated with lower predicted risk, and this association was more evident in lower-income groups, suggesting an interaction between caffeine intake and family income. Additionally, higher sodium levels were associated with lower predicted risk, with this association being more prominent in diabetic participants. The association of vitamin D with lower predicted risk was strongest in individuals with moderate BMI, while higher BMI appeared to weaken this association, suggesting possible interactions between nutritional factors and metabolic status.

Overall, these results indicate that multiple nutritional and lifestyle factors were associated with predicted cognitive impairment risk not only independently but also through interactions with diverse clinical and socioeconomic characteristics, collectively contributing to the model’s predictive performance.

3.4.2.3 Force plot

The SHAP force plot analysis revealed distinct predictive patterns for normal cognition versus cognitive impairment (Figure 7C). For normal cognition prediction, the model assigned a 96.3% probability (substantially higher than the 43.9% baseline), with key associations coming from socioeconomic factors (education +0.121 SHAP value, PIR + 0.104) and the nutritional factor vitamin B2 (+0.0158). Moisture (+0.0645) also showed a notable association with lower predicted risk. Conversely, the cognitive impairment prediction probability was only 3.71% (below the 56.1% baseline). Notably, vitamin B2 appeared as the strongest individual factor associated with lower predicted risk, while socioeconomic variables collectively showed nearly equivalent contributions. These results illustrate how multiple protective factors were associated with lower predicted cognitive impairment risk, with vitamin B2 and socioeconomic status particularly influential in this example.

3.5 Cross-scale performance comparison

Optimal models: Ensemble learning methods (particularly LightGBM, XGBoost, and Random Forest) consistently showed the best performance across all assessment scales, with higher metrics compared to traditional models (e.g., SVM, K-NN) and Naive Bayes. For CERAD-WL: LightGBM achieved the highest overall performance (accuracy 0.987, AUC-ROC 0.996). For DSST: Random Forest performed slightly better (accuracy 0.877, AUC-ROC 0.94), though the differences among the three ensemble models were minimal. For AFT: Random Forest showed the strongest performance (accuracy 93.5%, AUC-ROC 0.981). The ensemble models consistently yielded AUC-ROC and AUC-PR scores above 0.95, reflecting strong discriminative ability and robustness. Naive Bayes showed the lowest performance (e.g., only 0.49 accuracy for CERAD-WL).

3.6 Summary of core predictors and key findings across models

To synthesize findings from our machine learning models, this section summarizes predictors associated with cognitive impairment and their interaction patterns across the CERAD-WL, DSST, and AFT assessments. SHAP analysis revealed a hierarchy of predictor importance and identified interactive patterns among variables.

3.6.1 Core predictors demonstrating cross-model consistency

Sociodemographic and clinical factors showed the strongest predictive capacity. Advanced age maintained a stable association with increased risk of cognitive impairment across all models, while higher educational attainment was consistently correla ted with reduced risk. Lower family PIR was persistently associated with elevated risk scores. Among clinical indicators, hypertension and diabetes status showed stable associations with increased risk scores across all models.

After controlling for these strong predictors and applying multicollinearity filtering (VIF ≤ 3), Vitamin B2 (Riboflavin) was the only nutritional marker that demonstrated protective associations across all three cognitive domains, with higher intake levels correlating with lower model-predicted risk scores.

3.6.2 Complex patterns: variable interactions and association variations

Beyond core predictors, the analysis revealed model-specific association patterns.

3.6.2.1 Inter-variable interactions

SHAP analysis indicated interactive patterns among variables. A positive association was observed between vitamin B2 and vitamin E; the protective association of vitamin E was more pronounced in individuals with higher vitamin B2 levels. Similarly, the protective association of vitamin B2 was stronger under conditions of higher copper levels. In the AFT model, an interaction between zinc and iron was observed, manifesting as lower zinc being associated with higher risk when iron levels were low, while higher zinc correlated with lower risk when iron levels were high.

3.6.2.2 Subgroup association differences

The strength of nutritional associations varied across population subgroups. The association between higher moisture intake and risk scores was more evident among individuals with lower education levels and those with hypertension. The protective association of vitamin B2 was more prominent in individuals with lower education. The association between caffeine intake and risk scores was more marked in low-income groups. Among different BMI groups, the protective association of vitamin D was most prominent in individuals with moderate BMI, while the association between sodium intake and risk scores was primarily observed in diabetic participants.

3.6.3 Interpretation guidelines

These findings require cautious interpretation, particularly regarding nutritional factors. A primary consideration is the multicollinearity among dietary nutrients, as they typically coexist in foods. This makes it difficult to determine the independent role of any single nutrient from observational data. Therefore, while vitamin B2 demonstrates a stable cross-model signal, for other individual nutrients, the results suggest focusing on the implications of overall dietary patterns.

3.7 In vitro validation of neuroprotective nutrients identified by SHAP analysis

To functionally validate the nutrient associations identified through SHAP analysis, we examined whether vitamin B2 and its predicted interactions with vitamin E and copper could modulate oxidative stress and neuroprotective pathways in vitro. As shown in Figure 8A, treatment with H2O2 was associated with increased intracellular ROS levels, indicating successful establishment of the oxidative stress model. Pretreatment with vitamin B2 was associated with lower ROS levels. Furthermore, combined treatment with vitamin B2 and vitamin E or vitamin B2 and copper was associated with further decreases in ROS generation, consistent with a potential synergistic antioxidant pattern.

Figure 8
Three bar graphs (A. ROS, B. SIRT1, C. BDNF) compare relative levels across five treatments: Control, Hâ‚‚Oâ‚‚, and combinations with vitamins B2, E, and CuSOâ‚„. The graphs indicate significant differences, marked by asterisks, in levels across treatments. Graph A shows the highest ROS level with Hâ‚‚Oâ‚‚ while B. SIRT1 and C. BDNF indicate increased levels with vitamin treatments.

Figure 8. Effects of different treatments on ROS, SIRT1, and BDNF levels. (A) Relative level of ROS under different treatment conditions, (B) relative level of SIRT1 under different treatment conditions, (C) relative level of BDNF under different treatment conditions (*p < 0.05, **p < 0.01).

To further examine molecular responses, we measured the mRNA expression levels of SIRT1 and BDNF using qPCR. As shown in Figures 8B,C, H2O2 treatment was associated with reduced expression of SIRT1 and BDNF, whereas vitamin B2 was associated with higher expression levels compared to H2O2 alone. Combined treatment with vitamin E or copper was associated with further increases in SIRT1 and BDNF expression, suggesting enhanced upregulation under co-treatment conditions.

These findings provide experimental evidence consistent with the nutrient interactions suggested by SHAP analysis and support the biological plausibility of vitamin B2 as a factor associated with oxidative stress modulation and neurotrophic pathway activation.

4 Discussion

This study demonstrates the value of integrating interpretable machine learning with population-scale data to advance cognitive impairment screening. By systematically combining three neuropsychological tests with extensive NHANES data, we developed predictive models that not only achieved strong performance but also provided meaningful insights into the complex interplay between sociodemographic, clinical, and nutritional factors in cognitive health.

Established sociodemographic factors—particularly age, education level, and socioeconomic status—consistently dominated across all cognitive assessments, reinforcing their fundamental role in cognitive risk stratification (54). More importantly, our approach precisely quantified how these non-modifiable factors establish a baseline risk profile, thereby highlighting the urgent need to identify modifiable protective factors that can benefit individuals across different risk categories.

Among nutritional factors, vitamin B2 emerged as the most robust and consistent predictor, maintaining significant associations across all three cognitive domains even after rigorous adjustment for confounders and multicollinearity. This primary finding was further refined through the identification of nutrient synergy patterns, particularly between vitamin B2 and copper/vitamin E. While the individual biological pathways involved in vitamin-mediated neuroprotection are well-documented (26, 5557), our contribution lies in demonstrating how interpretable machine learning can identify specific nutrient combinations worthy of experimental investigation. Subsequent validation experiments confirmed that these computationally predicted interactions produce measurable synergistic effects on oxidative stress reduction and neurotrophic pathway activation (1214), thereby strengthening the biological plausibility of our population-level findings.

It is crucial to emphasize that multicollinearity among nutritional factors, comorbidities, and BMI represents an important limitation in interpreting our results. This complex covariance structure primarily stems from: (1) the inherent coexistence of dietary nutrients in natural foods; (2) socioeconomic status as a common upstream determinant influencing both dietary patterns and disease risk; and (3) the bidirectional network of associations between nutritional status, metabolic diseases (such as diabetes and hypertension), and BMI. Although we applied stringent criteria (VIF ≤ 3) to reduce statistical multicollinearity, accurately isolating the independent physiological effects of specific nutrients from these complex observational associations remains challenging.

Based on this methodological limitation, the interpretation of SHAP values for most individual nutritional factors should be approached with caution. Except for vitamin B2, which remained robust after multicollinearity control, the association results for other nutrients are better understood as emphasizing the importance of overall dietary patterns rather than providing conclusive evidence for the independent effects of any single nutrient.

The context-dependent nature of nutritional associations represents another key insight. The presence of modified effects of nutrients such as vitamin B2, caffeine, and vitamin D across different socioeconomic and metabolic subgroups further demonstrates that the impact of nutrition on cognition must be understood within the complex network of individual health status and social determinants (5862). This understanding aligns with the concept of precision nutrition and suggests that effective dietary recommendations should consider these interacting factors.

Methodologically, our work confirms several important advantages of the integrated approach we employed. The complementary nature of the three neuropsychological tests enabled comprehensive cognitive assessment across multiple domains, while ensemble machine learning methods effectively handled the high-dimensional, nonlinear relationships inherent in these complex data (63, 64). Most significantly, SHAP analysis transformed these advanced models into interpretable tools that not only provided accurate predictions but also generated biologically plausible hypotheses about nutrient interactions—hypotheses that were subsequently validated through targeted experimentation.

We classified low cognitive performance using a ≤ 25th percentile per-test rule rather than age−/education-normed clinical cutoff (40, 41). Although this aligns with NHANES practice and facilitates population comparisons, alternative cut points (e.g., norm-referenced thresholds or z-score composites) may shift case assignments. Therefore, our findings should be interpreted as screening-oriented associations in survey data rather than clinical diagnoses; future work should compare definitions across thresholds and validate in longitudinal cohorts.

Another limitation relates to the cognitive assessments used in NHANES. The CERAD-WL, DSST, and AFT are brief screening tools designed for large-scale epidemiologic surveys rather than comprehensive neuropsychological evaluation. Although they capture key domains relevant to aging—episodic memory, processing speed/executive attention, and semantic fluency—they do not include more in-depth measures of short-term memory, long-term memory consolidation, or phonemic fluency (1517, 65, 66). As a result, our models may not fully characterize how vitamin B2, vitamin E, or copper relate to more specific cognitive subcomponents. Future studies incorporating extended neuropsychological batteries or domain-specific tests could provide finer-grained insight into nutrient–cognition relationships and help determine whether distinct nutrient interactions differentially influence particular cognitive processes.

In addition, the cross-sectional design also limits the establishment of causal inference. Future longitudinal studies and expanded mechanistic investigations—particularly exploring pathways beyond oxidative stress, such as neuroinflammation and mitochondrial function—will be valuable next steps.

Although the ensemble models achieved very high discrimination across tasks, with AUC-ROC and AUC-PR values frequently exceeding 0.95 (and up to 0.99 in the CERAD-WL task), these values should be interpreted with caution and generalized cautiously. All reported metrics reflect internal stratified 5-fold cross-validation within NHANES under our case definition (cognitive impairment defined as ≤25th percentile on each cognitive test). This setup yields relatively clear separation between “impaired” and “non-impaired” groups and may inflate apparent model performance compared with prospective clinical screening. Even though we minimized information leakage by fitting all preprocessing steps (including standardization and class balancing) within each training fold, the present results may still overestimate real-world generalizability. External validation in independent cohorts, ideally with longitudinal cognitive outcomes, will be required to determine clinical utility.

Despite these limitations, our findings have important implications for cognitive health strategies. The strong predictive performance of models based on easily obtainable interview data supports their potential utility in community screening settings. More importantly, the identification of vitamin B2 as a robust nutritional factor that withstood multicollinearity testing, along with its synergistic partners, provides precise targets for developing stratified nutritional approaches to cognitive preservation. By successfully translating computational predictions into experimentally validated biological effects, this work demonstrates how to integrate data-driven discovery and mechanistic research to generate actionable insights for public health.

5 Conclusion

This study demonstrates that multidimensional predictive models based on interpretable machine learning can effectively identify populations at high risk for cognitive impairment. Our model analysis reveals the prominent value of vitamin B2 as a robust predictor across cognitive domains and identifies its synergistic patterns with copper and vitamin E. Targeting the key nutrient interactions identified by the model, in vitro experiments provided functional validation at the mechanistic level: combined intervention with vitamin B2 and copper/vitamin E indeed exerts synergistic effects through modulating oxidative stress levels and activating neurotrophic pathways. This “computational discovery-experimental validation” research paradigm not only strengthens the application value of vitamin B2 in the field of cognitive health but, more importantly, establishes a complete evidence chain from population data discovery to biological mechanism verification, providing new insights for developing stratified prevention and control strategies for cognitive impairment based on nutritional interventions.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found at: https://www.cdc.gov/nchs/nhanes/index.htm.

Ethics statement

The studies involving humans were approved by National Center for Health Statistics. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

LS: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Writing – original draft. CL: Investigation, Methodology, Project administration, Resources, Writing – original draft. XX: Funding acquisition, Project administration, Supervision, Validation, Visualization, Writing – review & editing. PL: Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the National Natural Science Foundation of China (NSFC) (grant no. 82201578) to XX, and NSFC (grant no. 82401691) to LS.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1680290/full#supplementary-material

References

1. Marvi, F, Chen, YH, and Sawan, M. Alzheimer's disease diagnosis in the preclinical stage: Normal aging or dementia. IEEE Rev Biomed Eng. (2025) 18:74–92. doi: 10.1109/RBME.2024.3376835,

PubMed Abstract | Crossref Full Text | Google Scholar

2. Song, L, Pan, QL, Zhou, GF, Liu, SW, Zhu, BL, Lin, PJ, et al. SHMT2 mediates small-molecule-induced alleviation of Alzheimer pathology via the 5′UTR-dependent ADAM10 translation initiation. Adv Sci. (2024) 11:5260. doi: 10.1002/advs.202305260,

PubMed Abstract | Crossref Full Text | Google Scholar

3. Collaborators GBDDF. Estimation of the global prevalence of dementia in 2019 and forecasted prevalence in 2050: an analysis for the global burden of disease study 2019. Lancet Public Health. (2022) 7:e105–25. doi: 10.1016/S2468-2667(21)00249-8,

PubMed Abstract | Crossref Full Text | Google Scholar

4. Tenchov, R, Sasso, JM, and Zhou, QA. Alzheimer's disease: exploring the landscape of cognitive decline. ACS Chem Neurosci. (2024) 15:3800–27. doi: 10.1021/acschemneuro.4c00339,

PubMed Abstract | Crossref Full Text | Google Scholar

5. Reuben, DB, Kremen, S, and Maust, DT. Dementia prevention and treatment: a narrative review. JAMA Intern Med. (2024) 184:563–72. doi: 10.1001/jamainternmed.2023.8522,

PubMed Abstract | Crossref Full Text | Google Scholar

6. Janelidze, S, Barthelemy, NR, Salvado, G, Schindler, SE, Palmqvist, S, Mattsson-Carlgren, N, et al. Plasma phosphorylated tau 217 and Abeta42/40 to predict early brain Abeta accumulation in people without cognitive impairment. JAMA Neurol 2024, 81:947–957, doi:DOI: doi: 10.1001/jamaneurol.2024.2619

Crossref Full Text | Google Scholar

7. Ding, J, Shen, C, Wang, Z, Yang, Y, El Fakhri, G, Lu, J, et al. Tau-PET abnormality as a biomarker for Alzheimer's disease staging and early detection: a topological perspective. Cereb Cortex. (2023) 33:10649–59. doi: 10.1093/cercor/bhad312,

PubMed Abstract | Crossref Full Text | Google Scholar

8. Busch, RM, Hogue, O, Postle, AF, and Floden, DP. Automated detection of cognitive impairment in clinical practice. J Neurol. (2024) 271:5187–96. doi: 10.1007/s00415-024-12444-8,

PubMed Abstract | Crossref Full Text | Google Scholar

9. Palmqvist, S, Tideman, P, Mattsson-Carlgren, N, Schindler, SE, Smith, R, Ossenkoppele, R, et al. Blood biomarkers to detect Alzheimer disease in primary care and secondary care. JAMA. (2024) 332:1245–57. doi: 10.1001/jama.2024.13855,

PubMed Abstract | Crossref Full Text | Google Scholar

10. Li, B, Shi, K, Ren, C, Kong, M, and Ba, M. Detection of tau-PET positivity in clinically diagnosed mild cognitive impairment with multidimensional features. J Alzheimer's Dis. (2022) 94:627–40. doi: 10.3233/JAD-230180,

PubMed Abstract | Crossref Full Text | Google Scholar

11. Laske, C, Sohrabi, HR, Frost, SM, Lopez-de-Ipina, K, Garrard, P, Buscema, M, et al. Innovative diagnostic tools for early detection of Alzheimer's disease. Alzheimers Dement. (2015) 11:561–78. doi: 10.1016/j.jalz.2014.06.004,

PubMed Abstract | Crossref Full Text | Google Scholar

12. Yang, YM, Zhao, ZM, Wang, W, Dong, FM, Wang, PP, Jia, YJ, et al. Trends in cognitive function assessed by a battery of neuropsychological tests after mild acute ischemic stroke. J Stroke Cerebrovasc Dis. (2020) 29:104887. doi: 10.1016/j.jstrokecerebrovasdis.2020.104887,

PubMed Abstract | Crossref Full Text | Google Scholar

13. Gallagher, D, Mhaolain, AN, Coen, R, Walsh, C, Kilroy, D, Belinski, K, et al. Detecting prodromal Alzheimer's disease in mild cognitive impairment: utility of the CAMCOG and other neuropsychological predictors. Int J Geriatr Psychiatry. (2010) 25:1280–7. doi: 10.1002/gps.2480,

PubMed Abstract | Crossref Full Text | Google Scholar

14. Creavin, ST, Wisniewski, S, Noel-Storr, AH, Trevelyan, CM, Hampton, T, Rayment, D, et al. Mini-mental state examination (MMSE) for the detection of dementia in clinically unevaluated people aged 65 and over in community and primary care populations. Cochrane Database Syst Rev. (2016) 2016:CD011145. doi: 10.1002/14651858.CD011145.pub2

Crossref Full Text | Google Scholar

15. Moms, JC, and Heyman, AM. The consortium to establish a registry for Alzheimer's disease (CERAD). Part I. Clinical and neuropsychological assessment of Alzheimer's disease. Neurology. (1989) 39:59–65.

Google Scholar

16. Wechsler, D: WAIS-R: Manual: Wechsler adult intelligence scale-revised. New York: Psychological Corporation. (1981).

Google Scholar

17. Tombaugh, TN, Kozak, J, and Rees, L. Normative data stratified by age and education for two measures of verbal fluency: FAS and animal naming. Arch Clin Neuropsychol. (1999) 14:167–77. doi: 10.1093/arclin/14.2.167,

PubMed Abstract | Crossref Full Text | Google Scholar

18. Suwannasom, N, Kao, I, Pruß, A, Georgieva, R, and Riboflavin, HB. Riboflavin: The health benefits of a forgotten natural vitamin. Int J Mol Sci. (2020) 21:950. doi: 10.3390/ijms21030950

Crossref Full Text | Google Scholar

19. Teleanu, RI, Chircov, C, Grumezescu, AM, Volceanov, A, and Teleanu, DM. Antioxidant therapies for neuroprotection—a review. J Clin Med. (2019) 8:1659. doi: 10.3390/jcm8101659,

PubMed Abstract | Crossref Full Text | Google Scholar

20. Opazo, CM, Greenough, MA, and Bush, AI. Copper: from neurotransmission to neuroproteostasis. Front Aging Neurosci. (2014) 6:143. doi: 10.3389/fnagi.2014.00143,

PubMed Abstract | Crossref Full Text | Google Scholar

21. Traber, MG, and Atkinson, J. Vitamin E, antioxidant and nothing more. Free Radic Biol Med. (2007) 43:4–15. doi: 10.1016/j.freeradbiomed.2007.03.024,

PubMed Abstract | Crossref Full Text | Google Scholar

22. Dysken, MW, Sano, M, Asthana, S, Vertrees, JE, Pallaki, M, Llorente, M, et al. Effect of vitamin E and memantine on functional decline in Alzheimer disease: the TEAM-AD VA cooperative randomized trial. JAMA. (2014) 311:33–44. doi: 10.1001/jama.2013.282834,

PubMed Abstract | Crossref Full Text | Google Scholar

23. Harris, ED. Copper as a cofactor and regulator of copper, zinc superoxide dismutase. J Nutr. (1992) 122:636–40. doi: 10.1093/jn/122.suppl_3.636,

PubMed Abstract | Crossref Full Text | Google Scholar

24. Gil Martínez, V, Avedillo Salas, A, and Santander Ballestín, S. Vitamin supplementation and dementia: a systematic review. Nutrients. (2022) 14:1033. doi: 10.3390/nu14051033,

PubMed Abstract | Crossref Full Text | Google Scholar

25. Lei, P, Ayton, S, and Bush, AI. The essential elements of Alzheimer’s disease. J Biol Chem. (2021) 296:100105. doi: 10.1074/jbc.REV120.008207,

PubMed Abstract | Crossref Full Text | Google Scholar

26. Ashoori, M, and Saedisomeolia, A. Riboflavin (vitamin B2) and oxidative stress: a review. Br J Nutr. (2014) 111:1985–91. doi: 10.1017/S0007114514000178,

PubMed Abstract | Crossref Full Text | Google Scholar

27. Buettner, GR. The pecking order of free radicals and antioxidants: lipid peroxidation, α-tocopherol, and ascorbate. Arch Biochem Biophys. (1993) 300:535–43. doi: 10.1006/abbi.1993.1074,

PubMed Abstract | Crossref Full Text | Google Scholar

28. Michan, S, Li, Y, Chou, M, Parrella, E, Ge, H, Long, J, et al. SIRT1 is essential for normal cognitive function and synaptic plasticity. J Neurosci. (2010) 30:9695–707. doi: 10.1523/JNEUROSCI.0027-10.2010,

PubMed Abstract | Crossref Full Text | Google Scholar

29. Lane, AR, Roberts, BR, Fahrni, CJ, and Faundez, V. A primer on copper biology in the brain. Neurobiol Dis. (2025) 212:106974. doi: 10.1016/j.nbd.2025.106974

Crossref Full Text | Google Scholar

30. Villalón-García, I, Álvarez-Córdoba, M, Povea-Cabello, S, Talaverón-Rey, M, Villanueva-Paz, M, Luzón-Hidalgo, R, et al. Vitamin E prevents lipid peroxidation and iron accumulation in PLA2G6-associated neurodegeneration. Neurobiol Dis. (2022) 165:105649. doi: 10.1016/j.nbd.2022.105649,

PubMed Abstract | Crossref Full Text | Google Scholar

31. Hu, FB. Dietary pattern analysis: a new direction in nutritional epidemiology. Curr Opin Lipidol. (2002) 13:3–9. doi: 10.1097/00041433-200202000-00002,

PubMed Abstract | Crossref Full Text | Google Scholar

32. Darmon, N, and Drewnowski, A. Does social class predict diet quality? Am J Clin Nutr. (2008) 87:1107–17. doi: 10.1093/ajcn/87.5.1107,

PubMed Abstract | Crossref Full Text | Google Scholar

33. Freedman, LS, Kipnis, V, Schatzkin, A, Tasevska, N, and Potischman, N. Can we use biomarkers in combination with self-reports to strengthen the analysis of nutritional epidemiologic studies? Epidemiol Perspect Innovat. (2010) 7:2. doi: 10.1186/1742-5573-7-2,

PubMed Abstract | Crossref Full Text | Google Scholar

34. Yuan, M, He, J, Hu, X, Yao, L, Chen, P, Wang, Z, et al. Hypertension and NAFLD risk: insights from the NHANES 2017-2018 and Mendelian randomization analyses. Chin Med J. (2024) 137:457–64. doi: 10.1097/CM9.0000000000002753,

PubMed Abstract | Crossref Full Text | Google Scholar

35. Ma, R, Cui, L, Cai, J, Yang, N, Wang, Y, Chen, Q, et al. Association between systemic immune inflammation index, systemic inflammation response index and adult psoriasis: evidence from NHANES. Front Immunol. (2024) 15:1323174. doi: 10.3389/fimmu.2024.1323174,

PubMed Abstract | Crossref Full Text | Google Scholar

36. Zhang, X, Yang, L, Wang, Z, Wang, H, Nie, S, Zhao, C, et al. Associations between nutritional status and cognitive impairment in older adults: results from the NHANES 2011-2014 cycles. Front Nutr. (2025) 12:1571990. doi: 10.3389/fnut.2025.1571990,

PubMed Abstract | Crossref Full Text | Google Scholar

37. Von Elm, E, Altman, DG, Egger, M, Pocock, SJ, Gøtzsche, PC, Vandenbroucke, JP, et al. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Int J Surg. (2014) 12:1495–9. doi: 10.1016/j.ijsu.2014.07.013,

PubMed Abstract | Crossref Full Text | Google Scholar

38. Collins, GS, Reitsma, JB, Altman, DG, and Moons, KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. J Br Surg. (2015) 102:148–58. doi: 10.1002/bjs.9736,

PubMed Abstract | Crossref Full Text | Google Scholar

39. Smolen, JS, Landewe, RBM, Bergstra, SA, Kerschbaumer, A, Sepriano, A, Aletaha, D, et al. EULAR recommendations for the management of rheumatoid arthritis with synthetic and biological disease-modifying antirheumatic drugs: 2022 update. Ann Rheum Dis. (2023) 82:3–18. doi: 10.1136/ard-2022-223356,

PubMed Abstract | Crossref Full Text | Google Scholar

40. Brody, DJ, Kramarow, EA, Taylor, CA, and McGuire, LC. Cognitive performance in adults aged 60 and over: National Health and Nutrition Examination Survey, 2011–2014. Natl Health Stat Report. (2019) 126:1–23.

Google Scholar

41. Huang, G, and Ren, G. Interaction between ω-6 fatty acids intake and blood cadmium on the risk of low cognitive performance in older adults from National Health and nutrition examination survey (NHANES) 2011–2014. BMC Geriatr. (2022) 22:292. doi: 10.1186/s12877-022-02988-7,

PubMed Abstract | Crossref Full Text | Google Scholar

42. Zhu, G, Li, Z, Tang, L, Shen, M, Zhou, Z, Wei, Y, et al. Associations of dietary intakes with gynecological cancers: findings from a cross-sectional study. Nutrients. (2022) 14:5026. doi: 10.3390/nu14235026,

PubMed Abstract | Crossref Full Text | Google Scholar

43. Chong, I-G, and Jun, C-H: Performance of some variable selection methods when multicollinearity is present. Chemom Intell Lab Syst 2005, 78:103–112, 112, doi: 10.1016/j.chemolab.2004.12.011,.

PubMed Abstract | Crossref Full Text | Google Scholar

44. Ni, Z, Zhu, Y, Qian, Y, Li, X, Xing, Z, Zhou, Y, et al. Synthetic minority over-sampling technique-enhanced machine learning models for predicting recurrence of postoperative chronic subdural hematoma. Front Neurol. (2024) 15:1305543. doi: 10.3389/fneur.2024.1305543,

PubMed Abstract | Crossref Full Text | Google Scholar

45. Pedregosa, F, Varoquaux, G, Gramfort, A, Michel, V, Thirion, B, Grisel, O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. (2011) 12:2825–30.

Google Scholar

46. Sokolova, M, and Lapalme, G. A systematic analysis of performance measures for classification tasks. Inf Process Manag. (2009) 45:427–37. doi: 10.1016/j.ipm.2009.03.002

Crossref Full Text | Google Scholar

47. Li, C-L, Zhou, G-F, Xie, X-Y, Wang, L, Chen, X, Pan, Q-L, et al. STAU1 exhibits a dual function by promoting amyloidogenesis and tau phosphorylation in cultured cells. Exp Neurol. (2024) 377:114805. doi: 10.1016/j.expneurol.2024.114805,

PubMed Abstract | Crossref Full Text | Google Scholar

48. Tripathi, AK, Dwivedi, A, Pal, MK, Rastogi, N, Gupta, P, Ali, S, et al. Attenuated neuroprotective effect of riboflavin under UV-B irradiation via miR-203/c-Jun signaling pathway in vivo and in vitro. J Biomed Sci. (2014) 21:39. doi: 10.1186/1423-0127-21-39,

PubMed Abstract | Crossref Full Text | Google Scholar

49. Trela-Makowej, A, Leśkiewicz, M, Kruk, J, Żądło, A, Basta-Kaim, A, and Szymańska, R. Antioxidant and neuroprotective activity of vitamin E homologues: in vitro study. Meta. (2022) 12:608. doi: 10.3390/metabo12070608,

PubMed Abstract | Crossref Full Text | Google Scholar

50. Cai, Y, Xiao, R, Zhang, Y, Xu, D, Wang, N, Han, M, et al. DHPA protects SH-SY5Y cells from oxidative stress-induced apoptosis via mitochondria apoptosis and the Keap1/Nrf2/HO-1 signaling pathway. Antioxidants. (2022) 11:1794. doi: 10.3390/antiox11091794,

PubMed Abstract | Crossref Full Text | Google Scholar

51. Piras, F, Sogos, V, Pollastro, F, and Rosa, A. Protective effect of arzanol against H2O2-induced oxidative stress damage in differentiated and undifferentiated SH-SY5Y cells. Int J Mol Sci. (2024) 25:7386. doi: 10.3390/ijms25137386,

PubMed Abstract | Crossref Full Text | Google Scholar

52. Zhang, X, Chen, H, Pang, T, Liang, K, Mei, J, Zhu, Y, et al. A preliminary study of sirtuin-1 on angiotensin II-induced senescence and inflammation in abdominal aortic aneurysms. Cytojournal. (2024) 21:32. doi: 10.25259/Cytojournal_80_2024,

PubMed Abstract | Crossref Full Text | Google Scholar

53. Donnici, L, Tiraboschi, E, Tardito, D, Musazzi, L, Racagni, G, and Popoli, M. Time-dependent biphasic modulation of human BDNF by antidepressants in neuroblastoma cells. BMC Neurosci. (2008) 9:61. doi: 10.1186/1471-2202-9-61,

PubMed Abstract | Crossref Full Text | Google Scholar

54. Yang, Z-G, Sun, X, Han, X, Wang, X, and Wang, L. Relationship between social determinants of health and cognitive performance in an older American population: a cross-sectional NHANES study. BMC Geriatr. (2025) 25:25. doi: 10.1186/s12877-024-05672-0,

PubMed Abstract | Crossref Full Text | Google Scholar

55. Zhang, M, Chen, H, Zhang, W, Liu, Y, Ding, L, Gong, J, et al. Biomimetic remodeling of microglial riboflavin metabolism ameliorates cognitive impairment by modulating neuroinflammation. Adv Sci. (2023) 10:e2300180. doi: 10.1002/advs.202300180,

PubMed Abstract | Crossref Full Text | Google Scholar

56. Packer, L. Protective role of vitamin E in biological systems. Am J Clin Nutr. (1991) 53:1050S–5S. doi: 10.1093/ajcn/53.4.1050S,

PubMed Abstract | Crossref Full Text | Google Scholar

57. Olfat, N, Ashoori, M, and Saedisomeolia, A. Riboflavin is an antioxidant: a review update. Br J Nutr. (2022) 128:1887–95. doi: 10.1017/S0007114521005031,

PubMed Abstract | Crossref Full Text | Google Scholar

58. Stern, Y. Cognitive reserve in ageing and Alzheimer's disease. Lancet Neurol. (2012) 11:1006–12. doi: 10.1016/S1474-4422(12)70191-6,

PubMed Abstract | Crossref Full Text | Google Scholar

59. French, SA, Tangney, CC, Crane, MM, Wang, Y, and Appelhans, BM. Nutrition quality of food purchases varies by household income: the SHoPPER study. BMC Public Health. (2019) 19:231. doi: 10.1186/s12889-019-6546-2,

PubMed Abstract | Crossref Full Text | Google Scholar

60. Zhang, R, Zhang, L, Du, W, Tang, J, Yang, L, Geng, D, et al. Caffeine alleviate lipopolysaccharide-induced neuroinflammation and depression through regulating p-AKT and NF-κB. Neurosci Lett. (2024) 837:137923. doi: 10.1016/j.neulet.2024.137923,

PubMed Abstract | Crossref Full Text | Google Scholar

61. Imerbsin, N, Shantavasinkul, PC, Witoonpanich, P, Sirivarasai, J, Taonam, N, Phanachet, P, et al. Vitamin D and cognitive impairment. Nutrients. (2025) 17:8. doi: 10.3390/nu17081301,

PubMed Abstract | Crossref Full Text | Google Scholar

62. Sano, R, Shinozaki, Y, and Ohta, T. Sodium–glucose cotransporters: functional properties and pharmaceutical potential. J Diabetes Invest. (2020) 11:770–82. doi: 10.1111/jdi.13255,

PubMed Abstract | Crossref Full Text | Google Scholar

63. Yang, Y, Hu, Y, Chen, Y, Gu, W, and Nie, S. Identifying leukoaraiosis with mild cognitive impairment by fusing multiple MRI morphological metrics and ensemble machine learning. J Imaging Informat Med. (2024) 37:666–78. doi: 10.1007/s10278-023-00958-y,

PubMed Abstract | Crossref Full Text | Google Scholar

64. Dolcet-Negre, MM, Imaz Aguayo, L, García-de-Eulate, R, Martí-Andrés, G, Fernández-Matarrubia, M, Domínguez, P, et al. Predicting conversion from subjective cognitive decline to mild cognitive impairment and Alzheimer’s disease dementia using ensemble machine learning. J Alzheimer's Dis. (2023) 93:125–40. doi: 10.3233/JAD-221002,

PubMed Abstract | Crossref Full Text | Google Scholar

65. Tombaugh, TN. Trail making test a and B: normative data stratified by age and education. Arch Clin Neuropsychol. (2004) 19:203–14. doi: 10.1016/S0887-6177(03)00039-8,

PubMed Abstract | Crossref Full Text | Google Scholar

66. Quinn, CM, Kasibante, J, Namudde, A, Bangdiwala, AS, Kabahubya, M, Nakasujja, N, et al. Neurocognitive outcomes of tuberculous meningitis in a primarily HIV-positive Ugandan cohort. Wellcome Open Res. (2022) 6:208. doi: 10.12688/wellcomeopenres.16967.2,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: vitamin B2, cognitive impairment, nutrient synergy, NHANES, SHAP, oxidative stress, SIRT1, BDNF

Citation: Song L, Li C, Xiang X and Lin P (2026) Interpretable machine learning models for predicting cognitive impairment using NHANES neuropsychological tests: nutritional and sociodemographic associations. Front. Nutr. 12:1680290. doi: 10.3389/fnut.2025.1680290

Received: 05 August 2025; Revised: 14 November 2025; Accepted: 22 December 2025;
Published: 14 January 2026.

Edited by:

Enrico Collantoni, University of Padua, Italy

Reviewed by:

Laura Beth McIntire, Weill Cornell Medical Center, NewYork-Presbyterian, United States
Anna Maria Berardi, Université de Lorraine, France
Asmaa Shams Eldeen, Cairo University, Egypt

Copyright © 2026 Song, Li, Xiang and Lin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Peijia Lin, bGlucGVpamlhMTIzQDE2My5jb20=; Xiaojiao Xiang, MzA1ODc3QGhvc3BpdGFsLmNxbXUuZWR1LmNu

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.