Predicting Age From Behavioral Test Performance for Screening Early Onset of Cognitive Decline

Background: Neuronal reactions and cognitive processes slow down during aging. The onset, rate, and extent of changes vary considerably from individual to individual. Assessing the changes throughout the lifespan is a challenging task. No existing test covers all domains, and batteries of tests are administered. The best strategy is to study each functional domain separately by applying different behavioral tasks whereby the tests reflect the conceptual structure of cognition. Such an approach has limitations that are described in the article. Objective: Our aim was to improve the diagnosis of early cognitive decline. We estimated the onset of cognitive decline in a healthy population, using behavioral tests, and predicted the age group of an individual. The comparison between the predicted (“cognitive”) and chronological age will contribute to the early diagnosis of accelerated aging. Materials and Methods: We used publicly available datasets (POBA, SSCT) and Pearson correlation coefficients to assess the relationship between age and tests results, Kruskal-Wallis test to compare distribution, clustering methods to find an onset of cognitive decline, feature selection to enhance performance of the clustering algorithms, and classification methods to predict an age group from cognitive tests results. Results: The major results of the psychophysiological tests followed a U-shape function across the lifespan, which reflected the known inverted function of white matter volume changes. Optimal values were observed in those aged over 35 years, with a period of stability and accelerated decline after 55–60 years of age. The shape of the age-related variance of the performance of major cognitive tests was linear, which followed the trend of lifespan gray matter volume changes starting from adolescence. There was no significant sex difference in lifelong dynamics of major tests estimates. The performance of the classification model for identifying subject age groups was high. Conclusions: ML models can be designed and utilized as computer-aided detectors of neurocognitive decline. Our study demonstrated great promise for the utility of classification models to predict age-related changes. These findings encourage further explorations combining several tests from the cognitive and psychophysiological test battery to derive the most reliable set of tests toward the development of a highly-accurate ML model.


INTRODUCTION
The slowing of neuronal reactions and cognitive processes is a typical functional outcome of aging. However, the onset, rate, and extent of changes vary considerably from individual to individual. Furthermore, the breadth of cognitive function has led physiologists to describe cognitive performance in terms of domains of functioning; there is no single test that covers all domains, and batteries of tests are usually administered. Therefore, assessing the changes in the cognitive function throughout the lifespan of an individual is a challenging task. The best strategy is to study each functional domain by applying different conditions and behavioral tasks whereby the tests reflect the conceptual structure of cognition. This makes them suitable for both scientific research and practical studies. However, such an approach has limitations, which will be described in the article. memory updating, and information speed processing, which are EF domains, or alternatively, cognitive subdomains.
Researchers consider dependent variables of executive functioning tests (EFTs) to be more sensitive to age-related changes than estimates of other types of cognitive functioning (Salthouse et al., 2003). Classical psychophysiological tasks are used to test EF target-specific functions. Assessments typically reflect subdomains of each ability, and careful combinations of tasks reveal patterns of performance that are consistent with a variety of neurological and neuropsychiatric conditions (Harvey, 2019). Typical limitations of the tasks are as follows: • Despite the perfect usability of tests, many agree that practice effects influence follow-up performance on EFTs, which leads to potential overestimation of cognitive abilities in young people and underestimation of cognitive decline in older adults and patients (Overman et al., 2017). • Because cognitive subdomains (both basic and higher-order) are closely interconnected, detecting changes that account for mutual compensation (e.g., speed-accuracy trade-off) can be difficult. However, such phenomena are common in physiology and may benefit permanent adjustments to variant conditions. Changing performance tactics may serve the surviving strategy. A solution was proposed by Beghali, where the Stroop switching task was modified by adding additional switching conditions to allow the assessment of overall EF using a single test (Belghali and Decker, 2019;Belghali et al., 2020).
Although age-related effects are more pronounced in EF than in other cognitive functions, the assumption that EF represents a distinct construct has received criticism (Salthouse et al., 2003;Salthouse, 2005). In a study of 261 cases, authors found "only weak evidence for the existence of distinct constructs corresponding to EF or to aspects of executive control concerned with inhibition, updating, or time sharing, " suggesting that researchers should not merely assume that variables reflect a particular hypothesized concept without relevant empirical evidence. To overcome such implications, we validated the Stroop switching card test (SSCT) in a recent study by comparing Stroop variables with the digital symbol substitution test, the digit span forward and backward test (DSFBT), the trail making test (TMT), and the classical Stroop test (Belghali et al., 2020). Age-related cognitive changes are the key points of interest in interdisciplinary studies within the medical and behavioral sciences. Neurophysiologists, neurologists, and psychiatrists categorize cognitive processes into functional domains that have a hierarchical structure. The higher-order cognitive domains are cognitive control and EF, which account for the acquisition and processing of 'information.
Accurate assessment of cognitive status is important in neuroscience. To estimate cognition, EFTs are commonly used; however, there is no strong consensus that EFTs are reliable. In fact, some researchers have criticized the assumption that EF represents a distinct construct (Salthouse et al., 2003;Salthouse, 2005).

Psychophysiological Status and Tests, Functional Systems, Neural Hypernets
Psychophysiological tests (PTs) are alternative tools for assessing cognition and are also aimed at quantifying cognitive functioning domains, such as EF, information-processing speed, attentional control, and working memory. Commonly, a battery of PTs is composed of tests that cover all the constituents of cognition. However, they do not provide a summary assessment of whether the test results are associated with aging or disease. Instead, PTs provide an insight into an individual's psychophysiological status (PS). PS offers information on overall test performance, neuropathological changes, type of temperament, and trait features.
The idea of PS is closely linked to the theory of functional systems, which is a framework that describes the structure of an individual's behavior at the physiological and informational levels. Furthermore, it clarifies the cognitive architecture of an individual (Red'ko et al., 2004;Vityaev and Demin, 2018). According to the theory, goal-motivated activity comprises afferent synthesis, making a decision, and accepting the final result of an action (response selection). Thus, to estimate the PS of a person, clinicians should use a test battery that assesses all three components: the sensory component of a simple action, decision-making time, and response selection. A common EF test comprises these three elements.
All behavioral tests consist of consequent elements: afferent synthesis as a constituent of cognition, decision-making (an estimate of information-processing speed), and response selection (the core of attentional control). Additionally, PTs estimate the stability of regulatory system functions, which is also known as the level of neuropsychological stability.
Using a battery of PTs, neurophysiologists do not aim to target separate cognitive functions. Instead, they target physiological characteristics of the processes that underlie higher-order cognitive functions (e.g., EF). Sensorimotor response assessment in PTs is used to study the mechanisms of memory, information perception, and information processing, and by placing time limits or changing task complexity, it is possible to evaluate performance under various conditions. This allows psychophysiological compliance to be determined with some professional requirements. PTs have been validated as a cost effective and reliable tool to screen for professional maladjustment in sports and extreme professions (Li et al., 2019;Boichuk et al., 2020;Myroshnychenho et al., 2020). Unfortunately, clinical psychology does not meet the unconditioned cutoff criteria for major tests (Statsenko and Charykova, 2010).
Modern neurophysiological and neuropsychological studies have shown that specialized operations and systematic interactions of brain structures underlie cognition and behavior. The brain is structured and organized systematically and it includes projective, associative, integrative, and limbic-reticular function-specific systems. The systems closely interact with structures that are excited either simultaneously or alternatively. The functional elements are dispersed throughout the brain and separated, but not isolated, from each other. They maintain close cooperation, so that activation of one element can activate other elements. The basic unit of a functional system is a neuron, and a network of interconnected neurons is called a cooperative or cognitive group (cog). These networks contain an individual's innate and acquired knowledge and experience. The complete set of cogs forms a cognitome. The theory of functional systems has been further developed into the theory of neural hypernets, which describes the mind as a network in which the vertices are networks of functionally connected neurons. The representation of the mind as an organic and mathematical structure has fostered research applying experimental and theoretical physics, graph theory, and statistical mechanics approaches (Sudakov, 1997(Sudakov, , 2015.

Onset of Cognitive Decline
Cognitive abilities (e.g., memory, thinking, and attention) begin declining from the age of 30. However, the rate of decline varies among individuals depending on genetics, lifestyle, regular mental activity, and somatic diseases. Compared with young and middle-aged adults, the elderly are more prone to lower mental performance, emotional lability, higher threshold of unconditioned reflexes, difficulties in developing conditioned reflexes, and fading of reflexes (Nelson and Luciana, 2001;Park and Gutchess, 2002). Because cognition reflects the integrated activity of the whole brain, cognitive impairment develops with focal and diffuse deterioration across various brain regions. The incidence of cognitive disorders increases with age, where 3-20% of people aged over 65 years have severe cognitive impairment (dementia) (Damulin, 2008). The incidence of mild cognitive impairment in the elderly ranges from 40 to 80% across different age groups (Larrabee and Crook, 1994). Usually, a diagnosis is made when an individual presents with evident cognitive deterioration and irreversible brain changes (e.g., dementia). Therefore, there is a need for improvements in diagnosis that allow the tracking of minor changes to detect early neurodegeneration. This will help to provide early prophylactic interventions and preventive measures to the elderly for sustaining a high level of intelligence.

OBJECTIVES
The overall aim of this study was to improve the diagnosis of early cognitive decline by applying a machine learning (ML) approach to psychophysiological and cognitive tests. We estimated the approximate age of onset of cognitive decline in a healthy population based on behavioral test performance and predicted individuals' age groups to compare with the their chronological age. Our objectives were:

Methodology of the Study
To address the first objective, we assessed the relationship between age and test performance. To do so, we calculated Pearson's correlation coefficients. For each age group, the relationships between the continuous features were assessed using the Kruskal-Wallis test.
For the second objective, we studied the distribution of test performance values by age. Trendlines that approximate the distribution functions were determined with the least squares method to estimate second-order polynomial coefficients. The parabolic trendline functions were displayed using 95% confidence intervals, which were calculated using the bootstrap method. We developed a descriptive model of cognitive decline by comparing the polynomial regression function fits for the different tests. To find a possible onset of cognitive decline we assessed mean values and variance of tests results in age groups. For this we used descriptive statistics methods.
To address the third objective, we analyzed the patterns of the sex-specific features of lifelong performance dynamics of the psychophysiological and cognitive tests. We built ordinary least squares regression trendlines and expressed results as IQR, mean ± std or number of cases, and their percentage out of the observed group. With Kruskal-Wallis test we assessed whether sex affected the impact of age on test performance (i.e., whether there was an interaction effect). To examine differences between the slopes and intercepts we used a t-test.
The fourth objective was multifold. We hypothesized that in normal aging there is a cutoff age from when cognitive decline begins. Some clustering techniques allow solutions to be built based on the number of clusters which can be predefined by the user. This allows one to test several possible divisions to determine the optimal model with clear separation of the identified groups.
To achieve the first part of the fourth objective of determining the age at which cognitive decline can be identified from test performance, we utilized a ML approach. We used an exploratory analysis by assessing the separability of datasets using unsupervised ML algorithms. We used clustering methods, such as Simple K-means (Arthur andVassilvitskii, 2006), canopy (McCallum et al., 2000), expectation-maximization (Dempster et al., 1977), and GenClus++ (Islam et al., 2018). Testing different numbers of clusters based on performance allowed us to determine the possible onset of cognitive decline. Then we built pairwise distributions of each attribute by age. The battery of PT that we used resulted in a large number of dependent variables (e.g., time estimates and accuracy metrics). For the analysis, we employed the major tests results explained in section 3.2.1.
For the second part of objective four, we studied the informative value of the tests for detecting cognitive changes in the elderly. To enhance the performance of the clustering algorithms, we used feature-selection methods, which are designed to minimize overfitting and reduce the time needed for training, while increasing model performance metrics by eliminating less informative features from the dataset. We employed the genetic algorithm (Hall, 1998) and information gain attribute evaluation (Kononenko and Hong, 1997). The genetic algorithm retrieves the most relevant features, whereas information gain attribute evaluation-based ranker lists the attributes in descending order based on their informative value for the final prediction. These values are considered as a useful measure of feature importance in the final model decision.
In the third, final part of the fourth objective, we built an ML algorithm to predict the age group from an individual's cognitive test performance. This fulfills the final aim of detecting misclassified cases that are susceptible to accelerated brain aging based on cognitive status assessment. To build the desired solution, we used several binary classification algorithms, such as support vector machines (Platt, 1999) with linear and nonlinear (radial basis function) kernels, Gaussian Naive Bayes (John and Langley, 2013), Bagging meta-estimator (Louppe and Geurts, 2012), an extra-trees classifier (Geurts et al., 2006), a random forest classifier (Breiman, 2001), and multilayer perceptron (Glorot and Bengio, 2010). Because of the relatively small size of the datasets, we used a stratified five-fold crossvalidation technique to have confidence that the predictions will generalize to unseen data. To evaluate the performance of the predictive models, we generated a receiver operating characteristic (ROC) curve averaged over five folds. We also calculated mean sensitivity, specificity, balanced accuracy (BAC), and area under the curve (AUC) values with respect to class. These performance measures were suitable as the datasets were balanced across the age attribute. Finally, we determined the cases that were misclassified by the best predictive model. We used the confusion matrix and calculated false-positive (FP) and false-negative values (FN).

POBA Dataset
We used the dataset called Psychophysiological outcomes of brain atrophy (POBA; see section Acknowledgments). The methodology of the neurophysiological tests used for the dataset is well-defined and relevant to research on age-related functional changes. The accurate computerized assessment of PS was strongly aligned with the purpose of the study. The POBA dataset does not contain any complicated tests and comprises simple tasks that are suitable for those with different intelligence levels. The dataset consisted of 231 cases which included MRI examinations and psychophysiological testing results of people aged 4-84 years. Written patient or parental consent for minors for participation was obtained from each case. All participants were either patients who suffered from periodic headaches or were anxious about having organic brain pathology, or healthy participants who were examined at the beginning of their professional sports career. The exclusion criteria were as follows: organic brain pathology, mental disorder, or head injury. The dataset is available on demand (see section 7). A thorough description of the dataset has been previously published .
We have highlighted only the features used in this study to determine PT dynamics across the lifespan and for ML analysis. We used the following PTs: 1. Simple visual-motor reaction (SVMR): Reaction time (RT) is recorded for a single type of stimuli requiring an identical response. The result of the test is mean RT (SVMR_mean), which reflects the participants current functional state and indicates overall working capacity, type of temperament, and level of excitability of the central nervous system. 2. A type ofgo/no-go test with similar visual and motor components as the SVMR but with two types of stimuli that require different responses. For this reason, it is also called the complex visual-motor reaction (CVMR). The mean RT (CVMR_mean) correlates negatively with psychometric measurements of intelligence (Colman, 2015). 3. Decision-making time (DMT) is defined as the time taken for response selection. It is measured as CVMR_mean subtracted by SVMR_mean. 4. Attention study technique: To test attention, identical triggering stimuli are presented subsequently in different locations on a computer screen. The mean response time (AST_mean) reflects the level of attention to visual objects, stability, concentration of attention, speed of information processing, and work efficiency. 5. Interference resilience technique: In contrast to the previous task, this technique includes additional interfering objects (e.g., circles of different color and size) that overlapping each other and the targeted stimuli, which requires additional time for the participant to notice the triggering signal, and respond. The system calculates the average response time (IRT_mean). 6. The time delay in responding to the targeted stimulus due to visual interfering objects (TRVI) is the subtraction of AST_mean from IRT_mean (see Formula 2). 7. Reaction to a moving object (RMO) technique: A circle appears on the screen with one red and one green colored mark arranged radially. It becomes quickly filled with a yellow color in a clockwise direction from a starting point to the finishing line. The participant responds when the yellow sector passes through the red finishing mark. The result is measured as a mean value (RMO_mean) of the positive (time delays) and negative values (premature responses). A negative RMO_mean indicates a predominance of excitation of the central nervous system, whereas a positive RMO_mean indicates a predominance of inhibition of the central nervous system. Although RMO test results include a time parameter, the variable is an additional indicator of reaction accuracy (e.g., a delayed or proactive reaction). 8. RT variability: The dependent variables mentioned above measure the mean RTs calculated over 30 subsequent episodes of testing with varied time intervals. The standard deviation of RT conveys unique information beyond that offered by mean performance (Graveson et al., 2016). We analyzed the SD for each task as a separate dependent variable (SVMR_variance, CVMR_variance, AST_variance, IRT_variance, RMO_variance). 9. We used wrist dynamometry to measure the maximum muscular strength of the right (WDR_MMS) and left hand (WDL_MMS). 10. Asymmetry coefficient (AC) is calculated as the ratio of the maximum muscular strength of the wrists (see Formula 3). A study showed an association between the depth of the central sulcus (anatomic brain asymmetry) and the predominant use of the right or left hand for skilled and unskilled activities (Amunts et al., 2000). As anatomic brain asymmetry accounts for the functional asymmetry of the extremities, AC may reflect the difference in power between hands.

Stroop Switching Card Test Dataset
We used the SSCT dataset available on demand (see Data Availability section). A sample of 103 participants aged 15-75 years volunteered for the experiment. The battery consisted of standardized neuropsychological tests evaluating cognitive flexibility (TMT), inhibition (Stroop color and word test [SCWT]; Golden and Freshwater, 1978), the SSCT (Belghali and Decker, 2019), updating (forward and backward digit span test; Wechsler, 1955), and information speed processing (digit symbol substitution test [DSST]; Wechsler et al., 1997). A single testing session lasted for approximately 1 h, and each participant was tested individually. The dataset and the methodology of the study is described in Belghali et al. (2020). Below is a brief description of the dataset.

Cognitive Flexibility
Cognitive flexibility is the mental ability to switch between thinking about multiple concepts simultaneously. It is based on executive functions that involve conscious changes in attention (cognitive shifting) and unconscious shifts of attention between tasks (task switching). The TMT was used to assess flexibility. It is a neuropsychological test of visual attention and task switching. The subject connects 25 consecutive targets in a sequential order. TMT consists of two parts (A and B). In the first part, the targets are presented as numbers and the participant is required to connect them. In the second part, the participant is required to alternate between numbers and letters (i.e., 1-A-2-B-3-C, etc.). The time of completion for each part is recorded. The SSCT dataset contains the final outcome of the TMT test, which is measured as the Switch Score (SS) or TMT_BA_Time, which is the time delay between switching attention between numbers and letters (see Formula 5). Other studies have also used the ratio of performance (see Formula 6) based on evidence that the ratio of performance provides an index of EF; although the parts of the TMT differ in motor control and perceptual complexity (Arbuthnott and Frank, 2000). The TMT reflects cognitive abilities (visual-conceptual, visuospatial, and visual-motor tracking) as well as sustained attention and task alternation. The results predict physical impairment and mortality in older adults because poor cognitive function is associated with shorter life expectancy (Vazzana et al., 2010).

Inhibition
Inhibition was assessed using the classical Stroop test and its modified forms.
The first test was the classical SCWT starting with two basic tasks: color naming (part A) and word reading (part B). The third task (part C) contains an interference condition whereby individuals are asked to name the ink color, which does not correspond to the written word (e.g., "yellow" written in green ink). The incongruity between the ink color and the meaning of the word causes a time delay when performing part C compared with A and B. The examiner records the completion times of each task (i.e., STROOP A, STROOP B, and STROOP C) and the total number of errors in part C. The interference score (IS) is the dependent variable of interest (see Formula 4).
The SSCT developed by Belghali is a modified version of the SCWT. In addition to the classic interference condition, it includes a switching condition, where subjects are instructed to act in different ways depending on where the words are printed. The instructions are to either read the conflicting words (e.g., if "blue" is written in another color, the individual is instructed to read "blue") or name the incongruently colored ink (e.g., if "yellow" is written in green ink, the participant is instructed to say "green"). The main reasoning behind this task is that inhibition and switching share brain networks, notably the prefrontal network. Moreover, inhibition and switching have been considered two sides of the same coin (Mostofsky and Simmonds, 2008). Age-related decreases in response inhibition accounts for rising up of interference on Stroop tasks (Troyer et al., 2006). Older adults whose executive performance reduces within 1 year have shown larger switch discrepancy scores (i.e., the difference in performance between the SSCT performance and the classical Stroop task) compared with those whose executive performance remains stable (Fine et al., 2008).
The following outcomes of the SSCT were used: 1. RT (SSCT_TIME): the global RT to complete the SSCT. 2. The total number of response errors (SSCT_ERROR): reflects accuracy. 3. The inverse efficiency score (SSCT_IES) by Bruyer and Brysbaert (Bruyer and Brysbaert, 2011): reflects the RT of the correct responses and combines the proportion of errors and RT into one variable (see Formula 7).
Responses are faster and more accurate when incongruent trials occur immediately after incongruent trials (conflict resolution) than when they occur after the congruent ones (conflict adaptation). Some studies have measured conflict resolution by the difference in response errors (i.e., accuracy) between incongruent and congruent trials and gauge conflict adaptation based on the response error difference between congruent trials following incongruent trials and incongruent trials following incongruent trials (Puccioni and Vallesi, 2012a,b). However, we used the following approach: 4. Conflict resolution (SSCT_Conflict_Resolution) was measured as the total number of response errors in incongruent trials that followed incongruent trials, which is inhibition without a change in congruence and refers to the ability to select relevant information while suppressing distracting information that is irrelevant to the current goal of the task. The subsequent tasks were congruent regarding the required response. 5. Conflict adaptation (SSCT_Conflict_Adaptation) was measured as the total number of response errors in congruent trials that followed incongruent trials, which is inhibition with a change in congruence and refers to the ability to adjust responses in accordance with the congruence of both current and previous trials. 6. Inhibition and switching (SSCT_I_S) is another metric of the conflict resolution process. In the SSCT, conflict resolution is applied in two ways. The first involves a cognitive sequence that involves inhibition exclusively without a change in congruence (e.g., naming an incongruent ink color preceded by an incongruent ink color). The second involves both inhibition and switching without a change in congruence (i.e., switching between calling the incongruent ink of colors and reading the words). 7. Working memory updating (SSCT_Updating) was measured by the total number of errors while classifying cards after each trial. It assessed inhibition only and inhibition with switching.

Updating
Updating was assessed using the DSFBT, which is a widelyused neuropsychological test for short-term verbal memory and is a component of the Wechsler memory scale (Woods et al., 2011). DSFBT includes two sequences: forward and backward. For the forward one, the participant repeats a series of numbers presented by the examiner in the same order. In the backward sequence, the participant recalls the numbers in the reverse order. The length of the sequence increases in subsequent trials. Two trials are presented for each list length. Each trial starts with two digits until the limit in list length is reached (nine forward and eight backward). The examiner stops when the subject fails both trials of the same list length successively or when the maximal list length is reached. The dependent variable of interest (DIGIT_SPAN_FWBW) is the total number of lists reported correctly for both sequences.

Information Speed Processing
Information speed processing assessed with the DSST, which is sensitive to many domains of cognitive dysfunction. It is also sensitive to changes in cognitive functioning across a wide range of clinical populations. Symbol-coding paradigms that are similar to the DSST are included as subtests in the Brief Assessment of Cognition in Schizophrenia and Repeatable Battery for the Assessment of Neuropsychological Status. However, DSST has low specificity for determining which cognitive domain is affected (Jaeger, 2018). Performance on the DSST can be affected by associative learning, motor speed, attention, visuoperceptual functions (e.g., scanning and ability to write or draw), executive functions of planning and strategizing, and working memory. The DSST consists of nine digit symbol pairs (e.g., 1/-, 2/∼ 7/{, 8/X, 9/=), followed by a list of digits. Under each digit, the subject is required to write down the corresponding symbol as fast as possible. The number of correctly processed symbols within the allocated time (Processing_speed) is measured.

Preprocessing of Data
The POBA dataset consists of a list of deidentified subject records, with one patient per row, which are stored in a commaseparated value format file. To convert data into a format suitable for ML applications, several preprocessing steps are performed. We cleaned the data by removing missing, unknown, or inappropriate values. In 26% of cases, values for wrist dynamometry attributes (WDL_MMS, WDR_MMS, and AC) were missing. We generated the values of missing attributes by using a linear regression model, which was trained on the available data as predictors and missing attributes as outcome variables. Then, the value of the AC feature was calculated using Formula 3. Then the numerical variables were normalized by subtracting the mean value and scaling to the attribute variance.

To Form Clusters and Groups of Participants
To form clusters and groups of participants, we initially used four age groups. The range of years corresponding to each group was as follows: Adolescents were aged [0, 20) years, Young adults were aged [20, 40) years, Midlife adults were aged [40, 60) years, and Older adults were aged ≥ 60 years. As shown in Figure 1, the distribution of subjects by age group was similar. Subsequently we enlarged the groups into two major clusters. The clusters of the young (<40 years) and older (40 years and above) adults were almost balanced: 48.5/51.5%. While solving the last task, we excluded the demographic features from the dataset because they risked biasing the prediction.

Performance Evaluation Metrics
We used several objective measures to evaluate the performance of the clustering and classification methods. Confusion and error matrices were built for each predictive model to show how they distinguished between the younger and older classes. The ROC curve and AUC were used to evaluate the performance of the classifiers and summarize the trade-off between the truepositive (TPR) and false-positive rates (FPR), using different probability thresholds. The medical decision-making community has extensively published on the use of ROC graphs for the diagnostic testing (Fawcett, 2004) of balanced data (Saito and Rehmsmeier, 2015). Thus, we found that this metric was appropriate for our needs. We used: Here we use: The overall accuracy of the model was defined as follows: where TP, TN, FP, and FN are true-positive, true-negative, falsepositive, and false-negative values, respectively, representing the confusion matrix of the classification model. All metrics were calculated for each fold separately, and averaged values were used as the final measure.

Hardware and Software
All experiments were conducted using a Linux Ubuntu 18.04 workstation with 24 CPU cores and two NVIDIA GeForce GTX 1080 Ti GPU with 11 GB GDDR5X memory each, using the Python programming language and its libraries for data processing, ML, and data visualization, such as scikit-learn, NumPy, Pandas, Matplotlib, Seaborn, and Plotly. For the POBA dataset collection, we used NS-Psychotest by Neurosoft. Figure 2A describes the association between age and performance of participants for the PTs (i.e., the POBA dataset). Figure 2B shows the relationship between age and cognitive test performance (i.e., the SSCT dataset). The color intensity and size of the ellipses are proportional to the correlation coefficients.

Association Between Test Performance and Age
The analysis of the PTs showed a positive correlation between age and all features except AC, which was negatively associated with age. Age was significantly  associated with all psychophysiological parameters (p < 0.05) except for TRVI, RMO_mean, and wrist power.
For the analysis of cognitive tests, associations between test performance and age were significant and stronger compared with those between PT performance and age. Test output values increased with age because they reflected either the time taken to complete the task or the number of errors (inaccuracy). The exceptions were information speed processing from the DSST and accuracy in updating, reflected by the dependent variable of the DSFB test. Poorer test performance resulted in lower speed and accuracy estimates. Performance in the DSST and DSFB were negatively associated with age. All these changes demonstrate the inevitable decline in mental processes with age.
Apart from the correlations between age and basic neurophysiologic and cognitive functions, the diagrams showed strong associations of age with various attributes of behavioral test performance. Cognitive domains undergo agerelated changes in parallel; therefore, such associations are not  The significant associations between features are marked in bold.

Lookup for the Onset of Psychophysiological and Cognitive Decline
Most dependent variables of the test batteries are represented by low values for high performance and vice versa. However, several dependent variables have lower values for poor performance, which include the muscle strength parameters and outputs of the DSST and DFBW tests. To maintain consistency in the diagrams, we reversed the values for subsequent analyses (i.e., 1/WDL_MMS, 1/Processing_speed, and 1/DIGIT_SPAN_FWBW). Table 3 shows the lifelong dynamics of PT performance. The minimal values of the variables in young adults indicated better performance than other groups across all PTs (Figure 3). The U-shaped curve of the minimal values in those aged 30-45 years was the common pattern for all age-related changes, except for those of AC and RMO_mean. AC values showed a slight descending trend toward 55 years and a similar ascending trend after 55 years. RMO_mean remained almost unchanged throughout life.
The lifelong dynamics of cognitive test performance showed a different pattern. The performance metrics of the cognitive tests shared a similar overall trend, as seen in Table 4. Most values showed a rise from adolescence and an increase throughout life. However, several test estimates showed a small improvement in young adults, followed by steady worsening with age (e.g., SSCT_TIME, SSCT_IES, SSCT_Conflict adaptation, TMT_BA_TIME, and 1/DIGIT_SPAN_FWBW).
Figures 4, 5 illustrate the data in table. Only the SSCT_Conflict adaptation and SSCT_I_S curves presented an optimal value in those aged over 25 years with the following worsening of the parameters. All other dependent variables of the cognitive tasks progressed steadily throughout life. Table 3 shows the variance of PT performance by age and sex. From the averaged group data, men outperformed women in all PTs except for IRT_variance, which was similar across both sexes, with a slightly lower value in women (Figure 6). Table 4 and Figures 4, 7 show data of the cognitive test performance. No significant variance was related to sex. Table 5 summarizes sex-specific lifelong changes of the variables. No significant differences were found among slopes or intercepts except for choice RT (CVMR_variance). Figure 6 shows that during adolescence, CVMR_variance remains unchanged throughout life in men. In contrast, in women, CVMR_variance increases with age. Figure 7 illustrates the different trends of changes with age for SSCT_Updating. In men, it remains relatively stable, whereas in women it increases. The significant difference in slope indicates different rates of deterioration between the sexes for this cognitive feature (see Table 5). There were no sex-related differences in the dynamics of age-related changes of psychophysiological or cognitive tests.

Prediction of the Age Group Using Machine Learning
To estimate the onset of cognitive decline, we used cluster analysis. After assessing the outcome metrics of clustering into several groups, we obtained the best performance using two clusters when the cutoff value was set to 40 years of age (see Table 6). We achieved the best performance using the GenClus++ method (a combination of K-Means and the genetic algorithm). The misclassification of young participants was less frequent than that of older adults. This may account for the cumulative effect of individual lifestyle on cognitive status. Neurodevelopment in youth appears to be a more standardized process than brain aging of diverse origin, pace, and extent.
Initially, the clustering generated low prediction accuracy (68.4%). To improve the performance of clustering of the POBA dataset, we resorted to using the feature-selection method. The genetic algorithm returned the following list of features that maximized prediction accuracy: AST_mean, IRT_mean, SVMR_mean, and CVMR_mean (see Table 7). When we ran the information gain-based ranker for the SSCT dataset, we retrieved the following informative features: SSCT_TIME, SSCT_IES, Processing_speed, TMT_BA_TIME, and SSCT_ERROR. When we fed the unsupervised ML clustering models with the aforementioned features, the separability of the subjects by age group improved considerably.
To estimate the utility of a novel battery of tests for diagnosing age-related cognitive changes, we built an ML classification model, which identified the age group of participants as either below or above 40 years of age. If the prediction is reliable, it may reflect a subtle biomarker for accelerated aging (neurodegeneration) in those misclassified by the algorithm. A cognitive disorder may be diagnosed by estimating the gap between the chronological and predicted (biological) age. To make such predictions, a larger dataset is required in future studies using ML.
In Table 8 methods known for their high performance in classifying numerical data are compared. Figure 8 shows the ROC curves and AUC values that represent the performance of classifiers in both datasets. The accuracy of age group prediction from cognitive test performance was higher than that of PT performance (maximal AUC for the SSCT dataset was 0.9962 vs. 0.9382 for the POBA dataset).  The significant differences between cohorts are marked in bold.

Dynamics of Psychophysiological Attributes Throughout Life
We used a battery of PTs comprising of cognitive domains and subdomains. SVMR reflected information processing, DMT represented task switching and inhibitory control, and IRT and AST measured attention. RT variability (RTV) across a set of trials reflected functional stability. To include the largest possible number of cognitive functions and components, we applied a comprehensive neurophysiological battery to study aging. However, we did not intend to explore specific cognitive subdomains, using specific tests. The idea was to study the basic neurophysiology that underlies complex behavior. The acquisition of visual-motor RT and its variance is straightforward, and the estimates provide an accurate physiologic assessment of individual neurodynamic properties.

RT
In our study, RTs for visual-motor response, attention, switching, and inhibition were positively associated with age, which demonstrates age-related neurocognitive slowing.
Measuring RT provides an insight into information processing that typically includes signal acquisition, decision-making, and response. There is a fundamental processing speed, i.e., the rate at which cognitive operations are executed. The definition suggests independence of higher-level cognitive operations from motor operations (Salthouse, 1996). Because the basic psychophysiological tasks that we used included motor responses, RTs may reflect EF rather than fundamental processing speed (Nilsson et al., 2014). The RTs that we measured were the aggregate output of a series of complex information-processing transactions that were initiated by the presentation of a stimulus and terminated by an overt response (Bashore et al., 1997).

Choice RT
Choice RT (CRT) conveys information regarding concentration and processing speed. By subtracting simple RT from CRT, central processing time can be assessed, which accounts for 80% of age-related CRT slowing (Woods et al., 2015;Chintapalli and Romero-Ortuno, 2021). However, our data showed no significant difference between the rate of CVMR_mean and SVMR_mean slowing, which indicates impaired sensorimotor functions, rather than cognitive performance, and which at baseline explains the overall CRT slowing.

RT Variability
Apart from RT, we also noticed age-related acceleration of its inconsistency (RTV) (Graveson et al., 2016). A psychophysiological explanation for this variability relates to fluctuations in executive control mechanisms. It may also reflect attentional sustainability (Bunce et al., 1993(Bunce et al., , 2004West et al., 2002). Typical age-related neurocognitive slowing results in an increase in RTV across the lifespan (Hultsch et al., 2002), where RTV increases with response slowing (Haynes et al., 2017). This increase can be reduced by physical exercise Haynes et al., 2017). The effect of aging on RTV may increase with the complexity of a cognitive task (West et al., 2002;Bunce et al., 2004;Dixon et al., 2007), which we observed in the comparison between AST_variance and IRT_variance slopes and SVMR_variance and CVMR_variance slopes (see Figure 6 and Table 5).
There are varied opinions regarding the plausible metrics of RTV. In this study, we used standard deviation, which is in line with a systematic review that showed similar results independently, with different ways of assessing RTV (Haynes et al., 2017). However, another systematic review supported measures controlled for mean RT (e.g., coefficient of variation) (Graveson et al., 2016). The idea of adjusting RTV to RT is based on the high correlations between these variables. Therefore, agerelated changes in RTV may reflect a general slowing of responses (Myerson et al., 2007). However, associations between RTV and   clinical outcomes (e.g., dementia, falls, and death) suggest that neurocognitive variance is not simply related to general slowing (Graveson et al., 2016;Haynes et al., 2017).

Attention
Rather than using choice and simple reactions exclusively, we employed attention study techniques (e.g., AST and IRT), which stemmed from the importance of evaluating attention. By driving goal-oriented behavior, attention determines the performance of any activity. Aging results in a reduced ability to concentrate on an object. Typical symptoms of advanced age neurodynamic disorders are talking around, inability to sustain attention, being easily distracted, and difficulty recalling information against a noisy background (Tanila et al., 1997). Age-related decline in attention may reduce the performance for simultaneously carrying out tasks. For example, RT increases in the elderly when they are asked to concurrently achieve postural stability. Furthermore, slower responses during a choice reaction test are a potential predictor of faster decline in mobility (Chintapalli and Romero-Ortuno, 2021). Although it is not wellunderstood, mobility and cognitive impairment accompany each other throughout life.

Visual-Motor Task Performance
Although we used PTs with visual paradigms, it was still challenging to analyze performance of tasks that relied on visual sensory functioning because they comprise several components: sensory acquisition, cognitive appraisal, and processing. Each component may undergo age-related changes that result in the wrong behavior of the entire system. Despite being derived from sensory components, poor performance may be misinterpreted as a sign of cognitive decline. Increased age results in a decline in visual search performance; however, the reasons for this association remain unclear (Monge et al., 2017).

Brain Functional Asymmetry
A possible reason for a negative association between AC and age is that the force of each wrist differs because of the asymmetrical atrophy of the brain during aging. This reduces the dominant position of the motor cortex of one side. Another reason is a reduction in white matter (WM) connectivity during life. Fiber loss in the corpus callosum disconnects the two hemispheres and reduces the suppression of the non-dominant hemisphere by the contralateral hemisphere (Teipel et al., 2009). If motor cortex activity becomes equal across hemispheres, AC will reduce.

Lifelong Trend of Cognitive Performance
Tasks vary across POBA and SSCT studies depending on cognitive complexity. They range from those involving low cognitive demands (e.g., SVMR in the POBA dataset) to those requiring more complex cognition (e.g., cognitive tasks in the SSCT dataset). On average, the cognitive tasks comprised in the SSCT dataset are more demanding compared with the battery of PTs. This may also explain why decline starts early in life according to cognitive tests, whereas psychophysiological findings begin to worsen only from middle age.

Task Switching and Conflict Resolution
Conflict resolution is an important cognitive ability that enables the suppression of automatic responses that may have been suitable previously but are inappropriate in a new context (Ho et al., 2019). Our findings on the lifelong dynamics of SSCT_Conflict_Resolution, SSCT_I_S, and 1/DIGIT_SPAN_FWBW are consistent with previous studies that showed poor dual-tasking abilities in older adults. The ability to switch between concurrent tasks is supported by executive control, which becomes weaker with age (Graveson et al., 2016). Executive cognitive control requires operations, such as conflict monitoring and response inhibition. Conflict monitoring is the evaluative component of cognitive control, as it detects the occurrence and level of conflict. The disproportionate deficits in inhibitory processing have shown to discriminate individuals with normal aging and MCI. Conflict resolution follows conflict monitoring and inhibits task-irrelevant responses and screens for task-relevant information (Cullen et al., 2007). The SSCT_I_S reliably reflects the ability to inhibit irrelevant responses and switch to a correct behavior (Belghali et al., 2020).

Interference Score
Interference score is a dependent variable of SCWT, the diagnostic value of which is based on the interference effect.
The effect leads to slower cognitive speed during incongruent trials compared with congruent trials. The effect is larger in cognitively-impaired people compared with cognitivelypreserved individuals (Ho et al., 2019). In our study, IS increased steadily across the lifespan (see Figure 5).

Processing Speed
In the battery of cognitive tests we used, DSST enabled us to assess information-processing speed, which showed significant age-related changes. Processing speed is particularly sensitive to age and mediates the decline in higher-order cognitive domains (Nilsson et al., 2014). An alternative point of view is that under appropriate control, processing speed accounts for most agerelated differences in executive deficits (Verhaeghen and Cerella, 2002). For example, a study of three groups of participants, with mean ages of 22, 70, and 85 years, respectively, showed common perceptual and orienting attention patterns, and differences were observed for processing speed only (Muiños et al., 2016).

Attention
Attention as a cognitive domain was also measured by the cognitive tests. This is because major complex activities require attentional resources. Multitasking suffers with advanced age because of the reduced ability to flexibly switch attention (known as intellectual rigidity). TMT reflects cognitive flexibility and involves attention. Thus, it is not surprising that its dependent variable, TMT_BA_TIME, decreased across the lifespan. Numerous studies have shown that aging results in disturbances of concentration and attention.
Similarly to attention, working memory is also involved in many cognitive tasks (e.g., the SSCT and DSST). Changes in the dependent variables of the tests (SSCT_Updating and Processing_speed) signify working memory decline with age.

Onset of Decline in Psychophysiological and Cognitive Performance
Cognitive decline may start at different ages. There is no common age of onset in any population. Previous studies have shown that the decline is already evident at middle age. The most affected functions are EF (Singh-Manoux et al., 2012) and processing speed (Salthouse, 2009;Zimprich and Mascherek, 2010). A recent study revealed that most age-related cognitive changes occur at the age of 50-65 years, with only a few age-related differences being evident before the age of 50 years (Ferreira et al., 2015). However, in healthy educated adults, some aspects of cognitive impairment have shown to start during their 20s and 30s (Salthouse, 2009). Some studies have suggested that crystallized intelligence continues to increase during adulthood, whereas decline in physiological cognitive functions (e.g., fluid intelligence, memory, and especially processing speed) starts earlier (Zimprich and Mascherek, 2010).
There is a considerable body of evidence that has implicated brain structural changes in age-related EF deficits. Below is the discussion of our findings of previous brain morphology studies, which give insight into the pathomorphological mechanisms of neurocognitive slowing.

Reaction Time and Processing Speed
RT and processing speed. RT estimates and RTV of the PT that we used followed a U-shaped function across the lifespan, which is consistent with the inverted U-shaped function of WM volume changes. WM volume increases until early middle age (35 years of age), which is followed by a period of stability, and finally an accelerated decline after late middle age (55-60 years of age). Furthermore, there is evidence that indices of WM integrity from diffusion tensor imaging (DTI) strongly correlate with processing speed. WM integrity changes start early in adulthood and show greater decline after the age of 60 (Ferreira et al., 2014;Nilsson et al., 2014).
A recent study found that age-related differences in two components of processing speed do not occur simultaneously. The cognitive component of processing speed integrates all the results. The slowing of the component occurs before the age of 50 years, whereas the motor component slows during the age of 50-65 years (Ferreira et al., 2015).

RT Variance
The trial-to-trial volatility of RT across a task (RTV) is closely related to brain structural features and provides an insight into brain changes across the lifespan. RTV occurs because of the consequences of WM decline, such as less distinct cortical representations and increased neural noise. Independent of RT, RTV has been shown to be associated with the prevalence of WM lesions (hyperintensities on FLAIR). RTV is a measure of WM integrity alone (in DTI studies) and general neurological integrity at a biological level (Deary et al., 2006;Nilsson et al., 2014). Findings regarding the age at which RTV begins to increase are inconsistent. However, in line with our findings, age-related increases in variability are thought to begin in middle age or earlier (Haynes et al., 2017).

Gray Matter Atrophy and Cognitive Decline
Age-related changes in gray matter (GM) also mediate cognitive performance across the lifespan (Ferreira et al., 2014). The shape of age-related variance of major cognitive test performance is close to a straight line, which is similar to the linear trend of decreases in GM volume across the lifespan. The neural centers that comprise GM are responsible for information synthesis (e.g., decision making) and establishing links (e.g., associative thinking and working memory).

Inhibitory Process
A study using the Stroop test showed that age-related differences in cognitive inhibition occur at age 50-65 years alongside the onset of verbal fluency and premotor function decline (Ferreira et al., 2015). However, in our study, impairment started early in life (from adolescence), with a slow progression throughout life (see IS changes in Figure 5).

Attention
Attention is an internal cognitive process for directing focus toward objects or locations while managing distractions. According to its sources, attention can be broken down into networks that carry out alerting and executive control functions. Several studies have suggested that age-related cognitive decline, especially EF, begins early in life, childhood (Finch, 2009;Salthouse, 2009), or middle age (Zhou et al., 2011). Age-related deterioration of the prefrontal lobe and dopaminergic system accounts for the impairment of executive attention after the age of 40 (Zhou et al., 2011). In a recent study, decline in the attentional domain was found to occur during the transition from middle age (50 years) to old age (65 years) (Ferreira et al., 2015).
Our data showed that performance in PTs that utilize attention begins to deteriorate at middle age. In contrast, performance in the cognitive tests used in the study shows an early onset of decline. Estimating the onset of decline in attention is difficult because of the lack of tasks that purely measure attention.

Working Memory
EF decline in working memory occurs before the age of 50, which is reflected by changes in the manipulation of visual and verbal information. Changes in the latter are less prominent, as the visual modality is more demanding than the verbal modality, and more complex components are more vulnerable to change across the lifespan. Difficulties in verbal learning that start during middle age are likely to be more related to frontal lobe impairment than middle lobe impairment (Ferreira et al., 2015). The procedural memory component undergoes age-related changes that manifest as an increase in errors and time of execution, which are more related to inhibitory control (performed by the frontal lobe) and processing speed (Lezak et al., 2004;Ferreira et al., 2015).
It is difficult to estimate the onset of working memory decline using the tests that we included because the battery of tests used do not specifically measure working memory status.

Socio-Demographic Correlates in Age-Related Cognitive Impairment
Sociodemographic correlates, such as sex and education, can influence cognitive test performance. Literacy and higher educational level correlates with superior test performance on FIGURE 5 | The distribution of the Stroop switching card test, trial making test, Stroop color and word test, digit span forward and backward test, and digit symbol substitution test by age.  several cognitive domains (Ho et al., 2019). For our analysis of both datasets, we took into consideration the educational level of participants. For instance, we used literacy as an inclusion criterion. Moreover, we included adults who indicated that they completed a professional course after finishing general education. Another way to control for the level of intelligence is by considering the years of formal education. However, this has several limitations. Firstly, the intensity of training is not considered. Secondly, it does not reflect the level of intellectual activity after completion of formal education (e.g., postgraduate study). Furthermore, there is evidence that the effects of intelligence and formal education on the development of dementia differ. Schmand showed that a low reading test score predicted incident dementia better than a low level of education. Furthermore, the study found that a high occupational level had a protective effect (Schmand et al., 1997).

Sex Differences in Age-Related Cognitive Impairment
Despite some variation in psychophysiological and cognitive findings across the lifespan, we found significant test performance differences related to sex. The linear trends of  the age-related changes for choice RT and updating in the SSCT test (see Table 5) had significantly different slopes. However, we did not observe significant sex differences in the lifelong dynamics of major test estimates. There is currently limited agreement in the literature on sex differences. Longitudinal studies comparing changes in cognitive function and probability of Alzheimer's dementia in men and women have revealed confronting results (Barnes et al., 2003).  The largest AUC value for each dataset is marked in bold.

Identification of Accelerated Decline
The idea of studying EF in normal aging was motivated by the increasing evidence that deficits in certain EFs may arise at early stages of neurodegenerative disease (Ho et al., 2019). The most commonly-documented cognitive changes associated with old age are decline in memory, attention, and speed of processing of incoming information. All components of processing, from stimulus acquisition to response execution, decline with age. There is no strong consensus regarding whether the rate of decline is a process specific or common across all components (Bashore et al., 1997). Previous studies have shown that a set of influences on information-processing speed occur with advancing age. The best way to characterize these influences is by combining a variety of processing speed measures, which may be acquired from different tasks or acquisition methods (e.g., latency of evoked potentials and RT) (Bashore et al., 1997).
In our study, the linear trendlines for the variance of test estimates across the lifespan represent tendencies toward decline of functioning, specifically for information processing, attention, and switching. They all followed a common trend with a similar rate of progression. The scatterplots indicate that age-related neurocognitive slowing is an unavoidable process that occurs at a permanent rate.
The graphs show that age-related decline in functioning does not separate the population into obvious cohorts that can be easily observed in scatterplots. Visually, we could not estimate a threshold value that would indicate the onset of cognitive decline. Nevertheless, the results obtained are promising. For now, the classification algorithms can be used for screening purposes (see section 5.4.2). The short acquisition time allows testing of each patient for signs of enforced brain aging. Future studies of patients with dementia using the same PTs FIGURE 8 | Classification performance of each method in terms of the mean receiver operating characteristic curve using a stratified five-fold cross-validation technique (area under the curve values close to 1 indicate a high level of diagnostic rate, whereas a value close to 0.5 shows poor performance).
will provide further support for our findings. Using a larger sample size will improve the performance metrics so that the battery of PTs will be a reliable predictor of the age group. Although, cognitive monitoring is not a replacement for a thorough neuropsychological assessment, its use as a supplement may provide indices of key cognitive domains during a brief consultation (Ho et al., 2019).

The Separability of Data and Onset of Decline
To improve prediction accuracy, we used feature selection. We expected that the highly-prevalent decline in some cognitive domains would be largely responsible for age-related functional fading. The time estimates for the attention study technique and motor-visual reaction tests were ranked the highest. All selected features reflected information-processing speed. Additionally, other cognitive domains and subdomains (e.g., attention and task switching) were involved. This is relevant to the recentlyformulated assumption that slowing of functioning is a major outcome of aging. The dependent variables that were derived from studies of attention (TRVI), motor reaction (DMT), and RMO_mean had a value output of zero. From the perspective of the ranking method, these features can be considered as redundant because they do not provide additional information for the final model decision.

Reflection of Age-Related Cognitive Changes With the Psychophysiological Tests
Multilayer perceptron or traditional fully-connected three-layer NN models show significantly higher AUC values (89.6%) compared with other methods using the POBA dataset. The NN is a model that mimics the behavior of data and finds hidden patterns within it. The sensitivity and specificity of the young class were retrieved at 89 and 81%, respectively. The model was more sensitive to the young class than to the older class, which is in line with the trend observed in our assessment of the separability of the data.
Our results demonstrate that cognitive tests and PTs may serve a diagnostic purpose as a screening tool. They are rapid, standardized, fully automated, easy to administer, highly reproducible, and have sufficient sensitivity and specificity (see section 5.4.2).

STRENGTHS AND LIMITATIONS OF THE STUDY
A limitation of the study is its cross-sectional design. This implies that participants of different ages were born and raised at a different time. Thus, the Flynn or antiFlynn effect may have an influence, which is the change in intelligence test scores across generations, and it may influence mean RTs . Such effects have been reported during previous decades in various countries (Woodley, 2012). However, there is increasing evidence that the effect is due to changes in test-taking behavior over time rather than significant variability in intelligence (Must and Must, 2013).
Another limitation of the study is that the datasets we used were not completely comparable, as they were acquired within the last 5 years in different countries. However, the last decade of research has shown that cognitive differences between countries are becoming smaller (Meisenberg and Woodley, 2013), which reduces the potential differences between societies and partly overcomes the limitation of this study.
In contrast to studies that used no specific criterion for normative performance, we proposed an approach that may have clinical utility. The ML classification algorithm may serve as a reliable tool for detecting individuals with accelerated cognitive impairment. If the algorithm misclassifies a participant into an incorrect age group, the individual may be considered at risk of cognitive deterioration. This potential application of our approach for clinical purposes is a strength of the study. To implement the classifier in practice, the study will need to be extended in a larger sample of healthy participants. Further research is required to investigate the dynamics of the identified measures in normal and pathological-aging populations.
Comprehensive psychophysiological assessment and detailed analysis of cognitive functions using ML-based modeling allowed us to detect early cognitive decline. Our study is one of few that have explored a broad variety of cognitive measures in a cohort of young and middle-aged adults. Knowledge of the early stages of normal aging will facilitate early advanced diagnostics and prevention of pathological aging.

CONCLUSION
• The study introduced the concept of a predicted "cognitive" age that can be forecast from a set of tests and compared with the chronological age. In cases where there is a significant difference between the predicted and actual age, the participant may be considered susceptible to accelerated brain aging. This will allow the individual to undergo advanced diagnostic procedures and follow-up examinations. • In our study all RT and variance estimates followed a Ushaped function across the lifespan, which reflected the known inverted U-shaped function of WM volume changes, with optimal values observed in early middle age (35 years), followed by a period of stability, and accelerated decline after late middle age (55-60 years). The shape of the age-related variance of the major cognitive test performance was close to a straight line, which was similar to the linear trend of the decrease in GM volume across the lifespan. The neural centers comprising GM are responsible for information synthesis (e.g., decision making) and establishing links (e.g., associative thinking and working memory). • Overall, the battery of cognitive tasks we used was more demanding compared with the PTs, which may explain why the analysis of the cognitive tests showed a decline starting early in life. In contrast, the psychophysiological findings (simple and complex RT and its variance across trials) suggested that the onset of functional decline occurred at middle age. • Our study suggested that cognitive aging results from the convergence of several processes described in recent findings. These processes were a decline in EF, overall cognitive slowing, and impairment in visual processing. The tests we used may serve a diagnostic purpose as a screening tool for early neurocognitive slowing. The batteries may be used as subtle biomarkers of neurodegeneration for individuals who are misclassified by the algorithm. • The study did not show considerable sex differences in the lifelong dynamics of major test estimates, except for choice RT and updating in the SSCT test that showed a significantlyfaster decline in women than in men. • The performance of the classification model to identify the subjects' age group was promising. The sensitivity and specificity of the identification of the young class were 97 and 86%, using the PTs. The metrics of the cognitive tests were 95 and 98%, respectively. We observed better performance of the ML algorithms with the cognitive tests than the PTs as predictors (balanced accuracy was 96.5 vs. 94%, respectively), which is because of the linear change in cognitive estimates compared with the U-shaped change of the lifelong neurophysiological dynamics. • ML models can be designed and utilized as a computer-aided detector of neurocognitive decline. Our study showed great promise for the use of classification models as predictors of age-related changes. Our results encourage us to explore a combination of tests from the battery to derive a more reliable set of tests based on performance metrics. Moreover, further investigations of other cognitive and PTs are warranted. • Future research is required to improve the performance characteristics of the ML model by using a larger sample size and an enriched test dataset that includes patients with dementia.

DATA AVAILABILITY STATEMENT
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: The datasets generated for this study are available on request at the site of Big Data Analytics Center (BIDAC) at https://bi-dac.com.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by United Arab Emirates University Human Research Ethics Committee (Notice Number: ERH_2019_4006 19_11) and CERSTAPS (Ethical Committee of Sport and Physical Activities Research (Notice Number: 2016-26-04-13). Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.

AUTHOR CONTRIBUTIONS
YS formulated the objectives, collected POBA dataset, and wrote the manuscript. TH did machine learning, formulated the methodology, and prepared graphs and tables. IC constructed the batteries of tests. KG, NZ, and TA contributed to literature review and data analysis. GB and ML supervised the research and formulated the conclusion. MB constructed the test battery, collected the SSCT dataset, and participated in manuscript writing.