Artificial neural networks for non-linear age correction of diffusion metrics in the brain

Kocar, Thomas D.; Behler, Anna; Leinert, Christoph; Denkinger, Michael; Ludolph, Albert C.; Müller, Hans-Peter; Kassubek, Jan

doi:10.3389/fnagi.2022.999787

ORIGINAL RESEARCH article

Front. Aging Neurosci., 20 October 2022

Sec. Neurocognitive Aging and Behavior

Volume 14 - 2022 | https://doi.org/10.3389/fnagi.2022.999787

This article is part of the Research Topic: Quantifying Aging and Dementia


Thomas D. Kocar^1,2,3*

Anna Behler¹

Christoph Leinert^2,3

Michael Denkinger^2,3

Albert C. Ludolph^1,4

Hans-Peter Müller^1†

Jan Kassubek^1,4*†

¹Department of Neurology, University of Ulm, Ulm, Germany
²Geriatric Center Ulm, Agaplesion Bethesda Ulm, University of Ulm, Ulm, Germany
³Institute of Geriatric Research, Ulm University Medical Center, Ulm, Germany
⁴German Center for Neurodegenerative Diseases (DZNE), Ulm, Germany

Human aging is characterized by progressive loss of physiological functions. To assess changes in the brain that occur with increasing age, the concept of brain aging has gained momentum in neuroimaging with recent advancements in statistical regression and machine learning (ML). A common technique to assess the brain age of a person is to first fit a regression model to neuroimaging data from a group of healthy subjects and then use the resulting model for age prediction. Although multiparametric MRI-based models generally perform best, models solely based on diffusion tensor imaging have achieved similar results, with the benefits of faster data acquisition and better replicability across scanners and field strengths. In the present study, we developed an artificial neural network (ANN) for brain age prediction based upon tract-based fractional anisotropy (FA). Subsequently, we investigated whether this age-prediction model could also be used for non-linear age correction of white matter diffusion metrics in healthy adults. The brain age prediction accuracy of the ANN (R² = 0.47) was similar to established multimodal models. The comparison of the ANN-based age-corrected FA with the tract-wise linear age-corrected FA resulted in an R² value of 0.90 [0.82; 0.93] and a mean difference of 0.00 [−0.04; 0.05] for all tract systems combined. In conclusion, this study demonstrated the applicability of complex ANN models to non-linear age correction of tract-based diffusion metrics as a proof of concept.

Introduction

Human aging is characterized by progressive loss of physiological functions (López-Otín et al., 2013). In the brain, aging specifically affects the frontal lobe and, in contrast, relatively spares posterior areas and infratentorial pathways (Raz and Rodrigue, 2006; Cox et al., 2016; Behler et al., 2021). Recently, the concept of brain aging has gained momentum (Baecker et al., 2021), as advanced statistical regression and machine learning (ML) models have opened new possibilities in analyzing neuroimaging data. A common approach to assess the so-called brain age of a person is fitting a regression model to neuroimaging data from a group of healthy subjects with their chronological age as the target variable and then using the resulting model for individual age prediction (Cole and Franke, 2017). This prediction, rather than chronological age, is considered to reflect a person’s brain age which may, to a certain extent, reflect brain “health.” The difference between the chronological age and the estimated brain age is often referred to as the “brain age gap” (Baecker et al., 2021) or “brain age delta” (Smith et al., 2019), as an approximation for accelerated or delayed aging of the brain. Correlation of the “brain age gap” with clinical factors indicated systolic/diastolic blood pressure, smoking habits, and cardiac function as predictors for accelerated aging (Cole, 2020). In contrast, higher bone mineral density was shown to be associated with delayed brain aging (Smith et al., 2019).

A comparison of different MRI approaches showed that diffusion tensor imaging (DTI) reached the best single modality performance in brain age estimation (R² = 0.53, MAE = 3.9 years) which was similar to a multimodal approach using six MRI modalities (R² = 0.62, MAE = 3.5 years) (Cole, 2020). Such multimodal models are based on a wide range of features, resulting in the need for a very high number of imaging data for good performance. In addition, the acquisition of multiparametric MRI is time consuming and thus limiting the usability in clinical routine. In comparison, DTI is acquired fast and provides comparable results across different scanners and field strengths, as demonstrated in a large-scale multicenter study with pooled data (Müller et al., 2016). This may become important in terms of a general application of the model since many brain age algorithms are sensitive to field strength or scanner type (Cole et al., 2017; Baecker et al., 2021).

In addition to applying ML to DTI data to assess aging (Cole, 2020), ML models based on DTI data have been utilized in clinical settings, especially in neurodegenerative diseases (Dyrba et al., 2013; Sarica et al., 2017) or in order to stratify patient subgroups (Behler et al., 2022; Münch et al., 2022). For this purpose, DTI data sets of patients and healthy participants are usually preprocessed to retrieve relevant anatomical/morphological information. For instance, such features can be extracted in a tract-based approach (Kocar et al., 2021b; Münch et al., 2022). As an alternative to clinical considerations, feature selection can also be done using statistical techniques (Talai et al., 2021). Neural networks, as well as support vector machines, were shown to perform very well in this context and in multiparametric MRI approaches (Castellazzi et al., 2020; Kocar et al., 2021b; Tsai et al., 2022).

Quantification of physiological brain aging also plays a role in the analysis of clinical DTI studies. The regional differences in age-related changes in diffusion metrics and higher-order age dependencies, i.e., non-linear changes (Hsu et al., 2008; Westlye et al., 2010), result in the need to perform an optimized tract-based age correction (Behler et al., 2021). Given the limits of multimodal MRI-based brain age models, the questions arise: (1) whether a model that captures the interactions between white matter pathways might be just as accurate in the brain age estimation based on diffusion metrics and (2) whether such a model is also applicable to perform a tract-based age correction of DTI data in an automated approach. Since artificial neural networks (ANN) are powerful ML models, capable of finding patterns and interactions within data (Bishop, 2006), they are commonly used for brain age prediction. In contrast, age correction is almost exclusively done by simple linear regression, presumably because of the simplicity in generating its inverse function. This is impossible in non-linear multivariate regression models such as ANN; therefore, an algorithmic improvement is needed.

To this end, we fitted a multilayer perceptron (MLP) regression model as a type of ANN to a training set of tract-based diffusion metrics gathered from healthy adults. We examined the predictive performance of this model, compared the results to a ridge regression model, and conducted a thorough model inspection. Finally, we investigated how the ANN could be used for a tract-based non-linear age correction of diffusion metrics.

Materials and methods

Data collection and processing

The data set consisted of 219 healthy adults (103 male/116 female, mean age 51.6 ± 15.9 years, range 19.5–81.9 years, no diagnosed diseases) who underwent brain DTI and was previously used in a study by Behler and colleagues (Behler et al., 2021). The data were collected using three different scanners (1.5 T and 3 T field strength) with four different protocols, as summarized in Table 1. All subjects gave written informed consent. The present study was in accordance with institutional guidelines and approved by the Ethics Committee of Ulm University, Germany (reference # 19/12 and 279/19).

TABLE 1

Table 1. Study population and imaging data.

For data preprocessing, the data were assessed for completeness and, according to an established analysis quality control (Müller et al., 2014), corrupted gradient directions (GD) as well as motion artifacts were excluded from further analysis prior to correction of eddy current-induced geometric distortions. Following a standardized iterative stereotaxic normalization process, using scanning protocol-specific DTI template sets, data were transformed to the Montreal Neurological Institute (MNI) stereotaxic standard space. Maps of FA were calculated from MNI-normalized DTI data, and a Gaussian smoothing filter of 8 mm full width at half maximum (FWHM) was applied to the normalized individual FA maps. To account for differences between scanning protocols at the group level, FA maps were harmonized according to an established harmonization procedure (Rosskopf et al., 2015; Müller et al., 2016). The FA maps of age-matched subsets of participants were arithmetically averaged separately for each scanning protocol. The resulting averaged FA maps were then used to calculate voxel-wise difference maps between protocol D and each of the other protocols, since most participants underwent protocol D. The averaged three-dimensional (3-D) difference matrices, i.e., linear correction matrices, were then applied accordingly to the FA maps of all participants who underwent scanning protocols A, B, or C. This procedure resulted in the recalibration of all FA maps acquired with different protocols and the harmonization of subject groups.
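The harmonization step can be illustrated with a minimal sketch. It assumes that the linear correction matrix is the voxel-wise difference of the protocol-averaged FA maps and that it is applied additively; the function names, the toy volume size, and the simulated FA values are hypothetical and serve only as an illustration.

```python
import numpy as np

def correction_matrix(reference_maps, other_maps):
    """Voxel-wise difference between the averaged FA maps of two protocols."""
    return reference_maps.mean(axis=0) - other_maps.mean(axis=0)

def harmonize(fa_map, correction):
    """Apply the additive correction matrix to a single subject's FA map (assumption)."""
    return fa_map + correction

# Toy data: age-matched subsets scanned with the reference protocol D and protocol A.
rng = np.random.default_rng(0)
shape = (4, 4, 4)  # toy 3-D volume instead of a full MNI-space FA map
protocol_d = 0.45 + 0.02 * rng.standard_normal((10, *shape))
protocol_a = 0.40 + 0.02 * rng.standard_normal((8, *shape))

corr_a = correction_matrix(protocol_d, protocol_a)
harmonized_a = np.stack([harmonize(m, corr_a) for m in protocol_a])
```

After this step, the group-average FA map of protocol A coincides with that of protocol D, while between-subject differences within protocol A are preserved.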

For the analysis of fiber tracts (FTs), specific tracts were identified by using a seed-based approach based on an averaged DTI dataset. The modified deterministic streamline tracking approach (Mori et al., 2002; Müller et al., 2009) used an eigenvector scalar product threshold of 0.9 and considered only voxels with an FA value above a threshold of 0.2 [cortical gray matter shows FA values up to 0.2 (Kunimatsu et al., 2004)]. Regions of interest (ROIs) with a radius of between 6 and 10 mm were defined for the seed regions. All FTs originating in the seed ROI, or in multiple ROIs for extended seed regions (e.g., callosal areas), define the corresponding tracts of interest (TOIs). The following 21 TOIs were identified using this seed-based fiber tracking approach: corticospinal tract (CST), frontooccipital tract, fasciculus uncinatus, optic radiation, superior longitudinal fasciculus (SLF), inferior longitudinal fasciculus (ILF), cingulum, superior cerebellar peduncle (SCP), middle cerebellar peduncle (MCP), corticostriatal tract, corticopontine tract, corticorubral tract, perforant path, the tract from temporal lobe to hypothalamus, anterior limb of the internal capsule, posterior limb of the internal capsule, and the tracts associated with the corpus callosum areas I to V. FA values were calculated by arithmetic averaging of the bihemispheric data.

Machine learning model construction

The orientational dependence of the voxel-wise diffusion tensor can be summarized by several combinations of its eigenvalues, resulting in the scalar parameters fractional anisotropy (FA), mean diffusivity, axial diffusivity, and radial diffusivity, each of which has advantages in specific research contexts. In order to avoid redundancy, we selected FA for the analyses performed in the current study, as it has already been shown to be a robust representation of (age-related) diffusion properties (Salat et al., 2005; Behler et al., 2021; Kocar et al., 2021a; Münch et al., 2022). The TOIs were selected in such a way that various functional areas and diffusion directions were covered. Clinical significance of these tracts has been shown, both for aging (Behler et al., 2021) and neurodegenerative diseases (Kocar et al., 2021b). Beyond this preprocessing, no further restrictions preceded the construction of our models. Although incomplete samples were not a problem in the regression analysis of the original data set, they are a problem when applying more complex ML models. Therefore, two samples had to be discarded from the initial data set due to a missing FA value in one tract. From the remaining 217 samples, a training and a test data set were defined by a random 80:20 split. Within the training data set, leave-one-out cross validation (LOOCV) was used for hyperparameter tuning (Hastie et al., 2017). All FA values were z-transformed based on the training data set for calibration. The target variable (age) was rescaled to reduce the computational load during ML model calculation. For data presentation, this transformation was reversed.
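The data preparation can be sketched as follows. The FA values and ages are simulated, and the z-scaling of age is an assumption, as the exact rescaling of the target variable is not specified here; the essential point is that all scaling parameters are derived from the training data only.

```python
import numpy as np

rng = np.random.default_rng(42)
n_samples, n_tracts = 217, 21
fa = 0.4 + 0.05 * rng.standard_normal((n_samples, n_tracts))  # simulated tract-based FA
age = rng.uniform(19.5, 81.9, size=n_samples)                 # simulated chronological age

# Random 80:20 split into training and test data.
idx = rng.permutation(n_samples)
n_test = round(0.2 * n_samples)
test_idx, train_idx = idx[:n_test], idx[n_test:]

# z-transform of the FA values based on the training data set only.
mu, sd = fa[train_idx].mean(axis=0), fa[train_idx].std(axis=0)
fa_train = (fa[train_idx] - mu) / sd
fa_test = (fa[test_idx] - mu) / sd

# Rescaling of the target variable (assumed here to be a z-scaling as well).
age_mu, age_sd = age[train_idx].mean(), age[train_idx].std()
age_train = (age[train_idx] - age_mu) / age_sd

def to_years(scaled_age):
    """Reverse the target rescaling for data presentation."""
    return scaled_age * age_sd + age_mu
```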

Using the scikit-learn 0.24.2 library for Python (Pedregosa et al., 2011), two ML models were implemented: a ridge regression and, by introducing a hidden layer as an extension to the ridge regression, an MLP regression. Ridge regression is a robust model for statistical regression even in the presence of collinearity (Leeuwenberg et al., 2022). MLPs, as a type of ANN, are highly efficient in finding complex interactions within data and are overall powerful algorithms, given enough data and a sound construction process (Bishop, 2006). Unless stated otherwise, the default hyperparameters of the Ridge class were used: fit_intercept = True, max_iter = None, tol = 0.001, solver = "auto", positive = False, random_state = None. The same applies to the MLPRegressor class: solver = "adam", batch_size = "auto", learning_rate_init = 0.001, shuffle = True, random_state = None, tol = 0.0001, warm_start = False, early_stopping = False, beta_1 = 0.9, beta_2 = 0.999, epsilon = 1e-08, n_iter_no_change = 10. In the ridge regression model, the L2-regularization parameter was determined by an exhaustive grid search, ranging from 100 to 0 with a decrement of 0.1. Each value in the grid was tested by LOOCV, and the value with the lowest squared error as a performance metric was chosen for the final model, which was trained on the entire training data set.
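The grid search for the ridge regression can be sketched with scikit-learn as follows. The toy data, the helper name loocv_squared_error, and the coarse grid (steps of 10 instead of 0.1, to keep the example fast) are assumptions for illustration only.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import LeaveOneOut, cross_val_predict

def loocv_squared_error(alpha, X, y):
    """Mean squared error of leave-one-out predictions for a given L2 parameter."""
    pred = cross_val_predict(Ridge(alpha=alpha), X, y, cv=LeaveOneOut())
    return float(np.mean((y - pred) ** 2))

# Toy training data standing in for z-transformed FA values and rescaled age.
rng = np.random.default_rng(0)
X = rng.standard_normal((60, 5))
y = X @ np.array([1.0, -0.5, 0.0, 0.0, 0.3]) + 0.5 * rng.standard_normal(60)

# Coarse grid from 100 down to 0 (the study used a decrement of 0.1).
alphas = np.arange(100, -1, -10, dtype=float)
errors = [loocv_squared_error(a, X, y) for a in alphas]
best_alpha = float(alphas[int(np.argmin(errors))])

# The final model is refitted on the entire training data set.
final_model = Ridge(alpha=best_alpha).fit(X, y)
```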

The MLP contained only one hidden layer with a single weight matrix, given the amount of data and overall model complexity. For every TOI in the input layer, we chose to have one neuron in the hidden layer, resulting in 21 neurons and a square weight matrix. A graphical abstraction of the model building process is presented in Figure 1A. As an activation function, a rectified linear unit (ReLU) (Glorot et al., 2011) was used, as it resembles the relationship between age and tract-based FA values: during adulthood, an age range without changes in the FA values is followed by a linear decrease (Westlye et al., 2010; Behler et al., 2021). In addition to L2-regularization, non-random initialization and pruning were used to prevent overfitting (Kukačka et al., 2017). For initializing the weight matrix, an identity matrix was used (Le et al., 2015). This not only aimed to prevent overfitting but also increased the explicability of the model, as the neurons in the hidden layer may retain some resemblance to their respective input neuron after training. All biases between the input layer and the hidden layer were set to 0. Between the hidden layer and the output layer, the coefficients of the trained ridge regression model were used for weight initialization and its intercept for bias initialization, serving as a pre-trained last layer. Training was conducted by first forward propagating the training samples' data through the initialized model and then backpropagating the loss function (= squared error + L2-regularization) through the hidden layer by calculating the partial derivatives with respect to the model parameters. Then, the weights and biases were adjusted using the gradient descent algorithm. This procedure was repeated for up to a maximum of 100,000 iterations to ensure convergence. After training the entire model, the weights of the neural network were pruned by successively setting the weight with the lowest absolute value to 0 until there was at least one sample for each remaining parameter (Han et al., 2015; Blalock et al., 2020). Similar to the ridge regression model, the L2-regularization parameter was determined by an exhaustive grid search, ranging from 20 to 0 with a decrement of 0.1 and choosing the best value by LOOCV.
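The initialization and pruning scheme can be sketched in plain NumPy. The function names and the n_keep budget are hypothetical; the sketch only illustrates the identity initialization, the ridge-based last layer, and magnitude-based pruning, not the training itself.

```python
import numpy as np

def init_mlp(ridge_coef, ridge_intercept):
    """Initialize a one-hidden-layer MLP from a trained ridge regression."""
    n = ridge_coef.size
    return {
        "W1": np.eye(n),               # input -> hidden: identity matrix
        "b1": np.zeros(n),             # hidden biases start at 0
        "w2": ridge_coef.copy(),       # hidden -> output: ridge coefficients
        "b2": float(ridge_intercept),  # output bias: ridge intercept
    }

def forward(params, X):
    """Forward pass with ReLU activation in the hidden layer."""
    hidden = np.maximum(0.0, X @ params["W1"] + params["b1"])
    return hidden @ params["w2"] + params["b2"]

def prune_smallest(W, n_keep):
    """Zero the smallest-magnitude weights until at most n_keep non-zero weights remain."""
    W = W.copy()
    magnitudes = np.abs(W).ravel()
    n_drop = max(magnitudes.size - n_keep, 0)
    W.flat[np.argsort(magnitudes)[:n_drop]] = 0.0
    return W

coef = np.array([-3.0, 1.5, -0.5])
params = init_mlp(coef, 50.0)
W_trained = np.array([[1.0, 0.2, -0.05], [0.1, 0.9, 0.3], [-0.4, 0.01, 1.1]])
W_pruned = prune_smallest(W_trained, n_keep=5)
```

With this initialization, the untrained network reproduces the ridge prediction for any input whose z-transformed values are all non-negative; for negative values, the ReLU sets the respective hidden neuron to 0.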

FIGURE 1

Figure 1. Development of the ANN and the ANN age-correction algorithm. Circles represent neurons; the bias unit is marked with “1”. For every tract system (input layer, blue), there is exactly one neuron in the hidden layer (light blue). Weights are displayed as lines in grayscale, with 0 as white and 1 as black. (A) The weights between the input and the hidden layer were initialized as the identity matrix. The weights between the hidden layer and the output layer (dark blue) were initialized as the coefficients from the ridge regression (light blue box). After training, the hidden layer processes multiple interactions from the input layer, which are represented by gray lines. (B) ANN age correction algorithm: First, the input data x were forward propagated through the ANN with the weight matrix W, resulting in the brain age prediction y. Then, the difference between y and the age correction target y_target (= error E) was backpropagated to calculate the modified weight matrix W’. Forward propagation and backpropagation are identical to those of a regular gradient descent algorithm, as commonly used for ANN training. Last, the input data x were updated by matrix multiplication (xW’W^{-1}). These steps were repeated until the error E reached an absolute value of 1 month or less, at which point the algorithm terminated. dE/dθ_ij = partial derivatives of the error with respect to the weights in the weight matrix W.

Machine learning model inspection

The coefficients of the ridge regression model were analyzed as a proxy for feature importance. For the ANN, the permutation importance for each feature was calculated using the test data set and 1,000 iterations (Breiman, 2001). For further insights, the weights of the ANN were inspected. Weights converging on a single hidden layer neuron were taken as an indication of possible interactions between the respective input TOIs.
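Permutation importance can be sketched as the mean drop in a performance score after shuffling one feature at a time. The use of R² as the score, the function names, and the toy model below are assumptions for illustration.

```python
import numpy as np

def r2(y_true, y_pred):
    """Coefficient of determination."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def permutation_importance(predict, X, y, n_repeats=1000, seed=0):
    """Mean drop in R² after randomly permuting each feature column."""
    rng = np.random.default_rng(seed)
    baseline = r2(y, predict(X))
    drops = np.zeros((X.shape[1], n_repeats))
    for j in range(X.shape[1]):
        for r in range(n_repeats):
            X_perm = X.copy()
            X_perm[:, j] = rng.permutation(X[:, j])  # break the link between feature j and y
            drops[j, r] = baseline - r2(y, predict(X_perm))
    return drops.mean(axis=1)

# Toy model: feature 0 matters most, feature 1 is ignored, feature 2 matters a little.
rng = np.random.default_rng(1)
X_test = rng.standard_normal((43, 3))
y_test = 50.0 - 8.0 * X_test[:, 0] - 2.0 * X_test[:, 2]
toy_predict = lambda X: 50.0 - 8.0 * X[:, 0] - 2.0 * X[:, 2]
importance = permutation_importance(toy_predict, X_test, y_test)
```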

Brain age prediction

Brain age prediction was conducted on the validation and test data set using both ridge regression and the ANN. For evaluating the performance of the models, the R² value and the mean absolute error (MAE) were calculated. The 95%-confidence intervals (CI) were determined by bootstrapping with 1,000 iterations (Efron and Tibshirani, 1993). In addition, the distributions of the prediction error, i.e., the difference between chronological age and predicted brain age, were tested for normality using the Shapiro-Wilk test (Shapiro and Wilk, 1965).
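The bootstrap estimate of a performance metric can be sketched as follows. The percentile method for the 95% CI, the function names, and the simulated predictions are assumptions; the bootstrap variant used in the study is not specified beyond the number of iterations.

```python
import numpy as np

def bootstrap_ci(y_true, y_pred, metric, n_boot=1000, seed=0):
    """Bootstrap mean and 95% percentile interval of a performance metric."""
    rng = np.random.default_rng(seed)
    n = len(y_true)
    stats = np.empty(n_boot)
    for b in range(n_boot):
        i = rng.integers(0, n, size=n)  # resample subjects with replacement
        stats[b] = metric(y_true[i], y_pred[i])
    lower, upper = np.percentile(stats, [2.5, 97.5])
    return float(stats.mean()), float(lower), float(upper)

def mean_absolute_error(y_true, y_pred):
    return float(np.mean(np.abs(y_true - y_pred)))

# Simulated chronological ages and brain age predictions for a test set.
rng = np.random.default_rng(0)
age = rng.uniform(20, 80, size=43)
predicted = age + rng.normal(0, 9, size=43)
mae_mean, mae_lower, mae_upper = bootstrap_ci(age, predicted, mean_absolute_error)
```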

Age correction

Age correction of the tract-based FA values was conducted for samples in the test data set with the test samples’ mean chronological age as the global target age. The basic structure of the age correction approach outlined here is adopted from an image generation algorithm that combines the artistic style from one image and the content from another image (Gatys et al., 2015, 2017). In summary, the gradient descent algorithm was applied not to update the weights and biases of the ANN, but to modify the input data (Figure 1B). First, the tract-based FA values (= input data x) from each sample were forward propagated through the ANN, yielding the predicted brain age y. Then, the age difference between the sample’s chronological age and the global target age was added to the predicted brain age y to define the sample’s age correction target y_target. Note that y_target cannot be set to the global target age, as this would lead not only to the desired age correction but also to the correction of the “brain age gap”. To modify the input data x, the difference between y and y_target (= error E) was backpropagated through the ANN. The hyperparameters of the ANN were set to warm_start = True, max_iter = 1, solver = "sgd", and alpha = 0, and exactly one step of gradient descent was conducted to calculate the modified weight matrix W’. Finally, instead of updating the weights and biases of the model as in a regular gradient descent algorithm, the input data were updated. The updated input data were calculated as the dot product of the input data x, the modified weight matrix W’, and the inverse of the original weight matrix W. Adjustment of the input data was restricted by the following rules:

1. If the global target age is higher than the chronological age, FA values of the individual sample must decrease.

2. If the global target age is lower than the chronological age, FA values of the individual sample must increase.

3. The FA values derived from the superior and middle cerebellar peduncles (SCP and MCP) should remain unchanged, as these tracts generally do not exhibit age-associated alterations (Cox et al., 2016; Behler et al., 2021).

For any value that did not meet these requirements, the initial value was retained. Forward propagation, backpropagation and updating the input data were repeated until the error E reached an absolute value of less than 1 month. Then, the algorithm was terminated and the modified input data were considered age-corrected. When processing these new age-corrected input data with the ANN, the output age differs from the original predicted age by the difference between the sample’s chronological and the global target age. The algorithm corrects all tract-based FA values to the same target age and cannot introduce tract-specific age targets, as in a tract-wise age correction.
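The iterative procedure and its three rules can be sketched in plain NumPy for a single sample. This is a simplified illustration under several assumptions: a plain gradient step on the loss 0.5·E² replaces the scikit-learn SGD step, the model output is given in years, the "initial value" is taken to be the sample's original input value, and all names, weights, and the learning rate are hypothetical.

```python
import numpy as np

def forward(x, W, b1, w2, b2):
    """Return hidden pre-activations and the predicted brain age (in years)."""
    pre = x @ W + b1
    return pre, np.maximum(pre, 0.0) @ w2 + b2

def age_correct(x, age, target_age, W, b1, w2, b2,
                fixed=(), lr=0.01, tol=1 / 12, max_steps=1000):
    x0 = np.asarray(x, dtype=float)
    fixed = np.asarray(fixed, dtype=int)
    x_cur, W_inv = x0.copy(), np.linalg.inv(W)
    _, y0 = forward(x0, W, b1, w2, b2)
    y_target = y0 + (target_age - age)  # shifts the prediction, keeps the brain age gap
    must_decrease = target_age > age    # rules 1 and 2
    for _ in range(max_steps):
        pre, y = forward(x_cur, W, b1, w2, b2)
        err = y - y_target
        if abs(err) < tol:              # stop below 1 month
            break
        # One gradient step on W for the loss 0.5 * err**2 (assumed plain SGD).
        W_mod = W - lr * np.outer(x_cur, err * w2 * (pre > 0))
        x_new = x_cur @ W_mod @ W_inv   # update the input instead of the weights
        violates = x_new > x0 if must_decrease else x_new < x0
        x_new[violates] = x0[violates]  # retain the initial value if a rule is violated
        x_new[fixed] = x0[fixed]        # rule 3: e.g., SCP and MCP stay unchanged
        x_cur = x_new
    return x_cur

# Toy network with 4 tracts; the last tract is treated as fixed.
W = 0.95 * np.eye(4) + 0.05 * np.ones((4, 4))
b1, w2, b2 = np.full(4, 2.0), np.array([-4.0, -3.0, -2.0, -1.0]), 60.0
x = np.array([0.5, -0.2, 0.3, 0.1])
x_corrected = age_correct(x, age=45.0, target_age=55.0, W=W, b1=b1, w2=w2, b2=b2, fixed=[3])
```

In this toy example, the target age is 10 years above the chronological age, so the free FA values decrease until the predicted brain age has increased by 10 years, while the fixed tract keeps its original value.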

The ANN-based age-corrected FA values were compared with those that were age-corrected using a tract-specific linear regression approach proposed by Behler and colleagues (Behler et al., 2021). As a performance metric, the R² values and 95% confidence intervals were calculated by bootstrapping with 1,000 iterations, both for the individual tracts and the combined data set. In addition, Bland-Altman analysis was performed for the individual tracts and the combined data set (Bland and Altman, 1986).
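A Bland-Altman analysis of two age-correction approaches can be sketched as follows, assuming the conventional limits of agreement of the mean difference ± 1.96 standard deviations; the function name and the toy values are hypothetical.

```python
import numpy as np

def bland_altman(values_a, values_b):
    """Mean difference and 95% limits of agreement between two methods."""
    diff = np.asarray(values_a, dtype=float) - np.asarray(values_b, dtype=float)
    mean_diff = diff.mean()
    sd = diff.std(ddof=1)  # sample standard deviation of the differences
    return mean_diff, mean_diff - 1.96 * sd, mean_diff + 1.96 * sd

# Toy values standing in for ANN-based and tract-wise linear age-corrected FA.
fa_ann = np.array([1.0, 2.0, 3.0, 4.0])
fa_linear = np.array([1.1, 1.9, 3.2, 3.8])
mean_diff, lower_limit, upper_limit = bland_altman(fa_ann, fa_linear)
```

A mean difference close to 0 indicates the absence of a systematic offset between the two methods, whereas the width of the limits reflects their sample-wise disagreement.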

Results

Model construction and inspection

The exhaustive grid searches determined the L2-regularization term as alpha = 62.4 for the ridge regression and alpha = 10.5 for the ANN. Pruning of the ANN resulted in a sparse weight matrix with 67% zeroes. The coefficients of the ridge regression are reported in Table 2, the intercept was 0. Parameter inspection of the ANN showed that the main diagonal of the weight matrix contained the highest value for each hidden layer neuron (Figure 1 and Supplementary Table 1), retaining the initial relationship to their input TOI. In the hidden layer neurons 4, 8, 14, and 21, respectively, 10 or more non-zero weights from the input layer converged. With respect to their largest weight, these hidden layer neurons corresponded to the optic radiation, SCP, temporal lobe to hypothalamus, and corpus callosum area V-associated tracts. Permutation importance indicated that features with large coefficients in the ridge regression were also important for brain age prediction in the ANN, most notably the corticorubral tract, the SCP, and the corticostriatal tract (Figure 2).

TABLE 2

Table 2. Ridge regression coefficients.

FIGURE 2

Figure 2. Important features in the ridge regression and the artificial neural network (ANN). Feature importance is displayed as the absolute value of the ridge regression coefficients (Left) and the mean permutation importance (PI) values from the ANN (Right). For better visualization, PI values were rescaled and negative PI values were set to 0. CST, corticospinal tract; SLF, superior longitudinal fasciculus; ILF, inferior longitudinal fasciculus; SCP, superior cerebellar peduncle; MCP, middle cerebellar peduncle; AIC, anterior limb of internal capsule; PIC, posterior limb of internal capsule; CC, corpus callosum.

Age prediction

The following results are reported as the mean, with the 95% CI in square brackets. Ridge regression achieved an R² value of 0.38 [0.27; 0.48] in the LOOCV and 0.38 [0.10; 0.59] in the test data set. The MAE was 10.59 [9.53; 11.76] years in the LOOCV and 8.36 [6.71; 10.29] years in the test data set. The ANN achieved an R² value of 0.47 [0.36; 0.56] in the LOOCV and 0.47 [0.23; 0.66] in the test data set. The MAE was 9.93 [8.95; 10.98] years in the LOOCV and 7.61 [6.01; 9.53] years in the test data set. Prediction errors in the ANN and in the ridge regression were normally distributed with p > 0.05 (also see Supplementary Figure 1).

Age correction

The comparison of the ANN-based age-corrected FA with the tract-wise linear age-corrected FA resulted in an R² value of 0.90 [0.82; 0.93] and a mean difference of 0.00 [−0.04; 0.05] for all tract systems combined. The tract-wise comparison of both approaches showed an inconsistent pattern, with the frontal tracts showing more concordance than the posterior ones. Low or negative R² values were noted in the frontooccipital tract, the optic radiation, the ILF, the cingulum, and the tracts associated with area IV of the corpus callosum. The Bland-Altman analysis indicated that FA values were systematically lower after ANN vs. tract-wise linear age correction in the corticostriatal tract. For a complete overview, see Table 3 and Figure 3.

TABLE 3

Table 3. Performance metrics of the ANN-based age correction of fractional anisotropy (FA) compared to a tract-wise linear age-correction.

FIGURE 3

Figure 3. Tract-wise comparison of two different approaches for age correction of fractional anisotropy (FA) values. The tract-wise linear age-corrected FA values are plotted against the artificial neural network (ANN) age-corrected FA values of each subject in the test data set (N = 43). The bisector is displayed as a “guide for the eye.” CST, corticospinal tract; SLF, superior longitudinal fasciculus; ILF, inferior longitudinal fasciculus; AIC, anterior limb of internal capsule; PIC, posterior limb of internal capsule; CC, corpus callosum.

Discussion

Artificial neural network for age correction based on diffusion metrics

In the present study, we provided the proof of concept for non-linear age correction of tract-based diffusion metrics data using an ANN. First, the ANN was trained on tract-based DTI metrics gathered from healthy adults; then, the ANN was applied to a separate test data set for age correction, using a modified gradient descent algorithm. The comparison with a state-of-the-art linear tract-wise age correction (Behler et al., 2021) gave an R² value of 0.90. As an overall pattern, ANN age correction was more concordant with tract-wise age correction in frontal areas and less so in occipital or infratentorial areas.

Artificial neural network explained about 50% of the subjects’ age

In the current approach, the ANN explained about 50% of the subjects’ chronological age by white matter diffusion metrics alone, which is in line with previously reported performance metrics (Cole, 2020; Niu et al., 2020).

In brain age prediction models, the difference between the predicted brain age and the chronological age is often referred to as the “brain age gap” and is considered an indicator for accelerated or delayed brain aging. The age prediction errors of the ANN were normally distributed. Biological markers and their measuring errors generally follow a normal distribution, which is often explained by the central limit theorem (Lyon, 2014). We regard the normal distribution of the age prediction errors of the ANN as consistent with the idea that both brain age and the “brain age gap” can be regarded as biomarkers (Franke and Gaser, 2019).

Some lifestyle and biographical factors such as alcohol consumption (McEvoy et al., 2018), body composition (Beck et al., 2022), or previous childbirths (Voldsbekk et al., 2021) are associated with white matter alterations. If a detailed description of these factors is available in a given study population, this information could be included in the model and corrections for factors other than age could be possible. Given the difficulties in obtaining these data in retrospective clinical studies, we suggest applying the proposed ANN age-correction at the group level only, where the influence of different biographies and lifestyles should be averaged out. In addition to physiological aging, diseases such as alcohol dependence can have a long-term impact on DTI metrics (Pfefferbaum et al., 2014). As the ANN was trained on data of healthy participants with no known pathological conditions or cognitive deficits, it cannot take disease-specific alterations into account. In patient data, disease-specific tract data would not be modified beyond what can be explained by physiological aging alone. In turn, disease-age interactions cannot be accounted for.

Tract-specific predictive contributions

Model inspection revealed high importance of the corticorubral tract, the SCP, and the corticostriatal tract in brain age prediction and indicated interaction effects in optic radiation, SCP, the tract from temporal lobe to hypothalamus, and corpus callosum area V-associated tracts. The importance of the SCP was surprising, as previous studies have shown no age-related changes in diffusion metrics in cerebellar tracts (Behler et al., 2021). This result could be explained by the SCP serving as a reference against which the frontal areas were compared. Accordingly, non-specific changes in whole-brain FA (e.g., due to lifestyle) would not automatically change the age prediction, since the SCP would partially level out the changes in the frontal lobe. If this interpretation is correct, the proposed ANN may even account for some of the lifestyle factors, even if they are not explicitly presented to the model.

Non-linear age correction can be performed by artificial neural network

The comparison of ANN age-corrected FA values with age-corrected FA based on an established tract-wise method showed similar results with a good overall performance (R² = 0.90). Nevertheless, the ANN struggled to accurately correct tracts in posterior brain areas, namely the frontooccipital tract, the optic radiation, the ILF, the cingulum, and the tracts associated with area IV of the corpus callosum. Inaccuracies during ANN age-correction compound with every iteration of gradient descent and there is no mechanism to limit the range of possible values. For training ML models, this limitation is generally implemented by L2-regularization. The lack of such a feature in our ANN age correction algorithm might explain the more pronounced deviations found in the cingulum and the corpus callosum area IV associated tracts (see Figure 3), where implausibly high/low FA-values were suggested as age-corrected by the ANN. The feed-forward nature of the age-correction algorithm (see Figure 1B) may provide an explanation for some of the outliers. In theory, the issue could be addressed by introducing L2-regularization to the outlined method of ANN age correction. However, the implementation might prove challenging, as high L2-parameters would prioritize FA-values close to 0 (z-transformed mean) over reducing the actual error term. A more practical approach could be to use ANN age-correction only when the amount of intended age-correction is low.

ANN-based non-linear age correction generates synthetic data for the intended target age. These synthetic tract-based diffusion data could not only be used for age correction in group studies but could also be used for data augmentation and sharing. Data augmentation, which can also be considered a form of regularization (Kukačka et al., 2017), is common in advanced ML techniques such as deep neural networks (Abadi et al., 2016). In various neurodegenerative diseases, ML models based on neuroimaging data can strengthen diagnostic accuracy (Lampe et al., 2022). However, collecting a large amount of neuroimaging data in rare brain diseases is often challenging, as is training complex yet accurate ML models (Castiglioni et al., 2021). The data generation used here for age correction might be used for data augmentation of limited diffusion metric data sets by using the existing data sets to create new ones.

Limitations

The present study is not without limitations. The relatively small sample size (n = 217) prompted us to prune the weight matrix to reduce model complexity. Given this sample size, we also decided against performing subgroup analyses, e.g., of gender differences. While we assumed that our subjects were healthy based on the absence of diseases, the finding of accelerated brain aging in some subjects may contradict this notion. Possible underlying factors were not investigated in the present study. There were no signs of impaired neurological and neurocognitive functioning among the study participants according to medical history and clinical impression; however, no standardized neurocognitive screening was performed.

DTI measures a physical parameter, i.e., the results should theoretically be the same for every protocol, independent of the scanner. However, DTI parameters differ with different values for TE, B0, and voxel volume. To reduce these effects, a (linear) harmonization of FA values from different protocols was applied. The differences measured in FA maps of age-matched (healthy) controls can be used for this task (Müller et al., 2016). The use of age-matched control groups enabled us to calculate the protocol-based differences, at the expense of small differences between the age-matched subject groups, which are assumed to be one or two orders of magnitude lower than the protocol differences.
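As an illustration of the harmonization idea, the following minimal sketch assumes a purely additive per-tract offset estimated from the mean FA of age-matched controls in two protocols. The procedure in Müller et al. (2016) operates on FA maps and may differ in detail; the function names and all FA values below are hypothetical.

```python
import numpy as np


def protocol_offset(fa_controls_ref, fa_controls_other):
    """Per-tract mean FA difference between age-matched controls of two protocols."""
    return np.mean(fa_controls_ref, axis=0) - np.mean(fa_controls_other, axis=0)


def harmonize(fa_other, offset):
    """Shift FA values from the other protocol onto the reference protocol's scale."""
    return np.asarray(fa_other, dtype=float) + offset


# Made-up FA values (rows: subjects, columns: tracts), for illustration only.
controls_ref = np.array([[0.50, 0.40], [0.52, 0.42]])
controls_other = np.array([[0.45, 0.38], [0.47, 0.40]])
offset = protocol_offset(controls_ref, controls_other)

subjects_other = np.array([[0.44, 0.37]])
harmonized = harmonize(subjects_other, offset)
```

After the shift, the control groups of both protocols have the same mean FA per tract by construction. Any residual difference between the age-matched groups themselves is absorbed into the offset, which is the cost mentioned above.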

Conclusion

In conclusion, the present study provided a proof of concept for the use of ANN in non-linear age correction of tract-based diffusion metrics. Future studies could extend the proposed method of ANN age correction to data augmentation.

Data availability statement

The data that support the findings of this study are available from the corresponding author, TK, upon reasonable request. Requests to access these datasets should be directed to thomas.kocar@uni-ulm.de.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of Ulm University, Germany (reference # 19/12 and 279/19). The patients/participants provided their written informed consent to participate in this study.

Author contributions

TK: study concept and design, analyses and interpretation of data, and drafting of manuscript. AB and JK: study concept and design, interpretation of data, and critical revision of manuscript for intellectual content. CL, MD, and AL: interpretation of data and critical revision of manuscript for intellectual content. H-PM: study concept and design, analyses and interpretation of data, and critical revision of manuscript for intellectual content. All authors contributed to the article and approved the submitted version.

Acknowledgments

We thank the Ulm University Center for Translational Imaging MoMAN and the Ulm University Institute for Geriatric Research for their support.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2022.999787/full#supplementary-material

References

Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., et al. (2016). “TensorFlow: Large-scale machine learning on heterogeneous distributed systems,” in TensorFlow: A system for large-scale machine learning (Savannah, GA: USENIX Association), 265–283.

Baecker, L., Garcia-Dias, R., Vieira, S., Scarpazza, C., and Mechelli, A. (2021). Machine learning for brain age prediction: introduction to methods and clinical applications. EBioMedicine 72:103600. doi: 10.1016/j.ebiom.2021.103600

Beck, D., de Lange, A.-M. G., Alnæs, D., Maximov, I. I., Pedersen, M. L., Leinhard, O. D., et al. (2022). Adipose tissue distribution from body MRI is associated with cross-sectional and longitudinal brain age in adults. NeuroImage: Clin. 33:102949. doi: 10.1016/j.nicl.2022.102949

Behler, A., Kassubek, J., and Müller, H.-P. (2021). Age-Related alterations in DTI metrics in the human brain—consequences for age correction. Front. Aging Neurosci. 13:682109. doi: 10.3389/fnagi.2021.682109

Behler, A., Müller, H.-P., Ludolph, A. C., Lulé, D., and Kassubek, J. (2022). A multivariate Bayesian classification algorithm for cerebral stage prediction by diffusion tensor imaging in amyotrophic lateral sclerosis. Neuroimage Clin. 35:103094. doi: 10.1016/j.nicl.2022.103094

Bishop, C. M. (2006). Pattern Recognition and Machine Learning. New York, NY: Springer.

Blalock, D., Ortiz, J. J. G., Frankle, J., and Guttag, J. (2020). What is the State of Neural Network Pruning? arXiv [preprint]. Available online at: http://arxiv.org/abs/2003.03033 (accessed February 28, 2022).

Bland, J. M., and Altman, D. G. (1986). Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1, 307–310.

Breiman, L. (2001). Random forests. Mach. Learn. 45, 5–32. doi: 10.1023/A:1010933404324

Castellazzi, G., Cuzzoni, M. G., Cotta Ramusino, M., Martinelli, D., Denaro, F., Ricciardi, A., et al. (2020). A Machine learning approach for the differential diagnosis of Alzheimer and vascular dementia fed by MRI selected features. Front. Neuroinform. 14:25. doi: 10.3389/fninf.2020.00025

Castiglioni, I., Rundo, L., Codari, M., Di Leo, G., Salvatore, C., Interlenghi, M., et al. (2021). AI applications to medical images: from machine learning to deep learning. Phys. Med. 83, 9–24. doi: 10.1016/j.ejmp.2021.02.006

Cole, J. H. (2020). Multimodality neuroimaging brain-age in UK biobank: relationship to biomedical, lifestyle, and cognitive factors. Neurobiol. Aging 92, 34–42. doi: 10.1016/j.neurobiolaging.2020.03.014

Cole, J. H., and Franke, K. (2017). Predicting age using neuroimaging: innovative brain ageing biomarkers. Trends Neurosci. 40, 681–690. doi: 10.1016/j.tins.2017.10.001

Cole, J. H., Poudel, R. P. K., Tsagkrasoulis, D., Caan, M. W. A., Steves, C., Spector, T. D., et al. (2017). Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. NeuroImage 163, 115–124. doi: 10.1016/j.neuroimage.2017.07.059

Cox, S. R., Ritchie, S. J., Tucker-Drob, E. M., Liewald, D. C., Hagenaars, S. P., Davies, G., et al. (2016). Ageing and brain white matter structure in 3,513 UK Biobank participants. Nat. Commun. 7:13629. doi: 10.1038/ncomms13629

Dyrba, M., Ewers, M., Wegrzyn, M., Kilimann, I., Plant, C., Oswald, A., et al. (2013). Robust automated detection of microstructural white matter degeneration in Alzheimer’s disease using machine learning classification of multicenter DTI data. PLoS One 8:e64925. doi: 10.1371/journal.pone.0064925

Efron, B., and Tibshirani, R. (1993). An Introduction to the Bootstrap. New York, NY: Chapman & Hall.

Franke, K., and Gaser, C. (2019). Ten years of BrainAGE as a neuroimaging biomarker of brain aging: what insights have we gained? Front. Neurol. 10:789. doi: 10.3389/fneur.2019.00789

Gatys, L. A., Ecker, A. S., and Bethge, M. (2015). A neural algorithm of artistic style. arXiv [preprint]. Available online at: http://arxiv.org/abs/1508.06576 (accessed April 28, 2022).

Gatys, L. A., Ecker, A. S., and Bethge, M. (2017). Texture and art with deep neural networks. Curr. Opin. Neurobiol. 46, 178–186. doi: 10.1016/j.conb.2017.08.019

Glorot, X., Bordes, A., and Bengio, Y. (2011). “Deep sparse rectifier neural networks,” in Proceedings of the fourteenth international conference on artificial intelligence and statistics proceedings of machine learning research, eds. G. Gordon, D. Dunson, and M. Dudík (Fort Lauderdale, FL, USA: PMLR), 315–323. Available online at: https://proceedings.mlr.press/v15/glorot11a.html

Han, S., Pool, J., Tran, J., and Dally, W. J. (2015). Learning both Weights and Connections for Efficient Neural Networks. arXiv [preprint]. Available online at: http://arxiv.org/abs/1506.02626 (accessed February 28, 2022).

Hastie, T., Tibshirani, R., and Friedman, J. (2017). The Elements of Statistical Learning. Berlin: Springer.

Hsu, J.-L., Leemans, A., Bai, C.-H., Lee, C.-H., Tsai, Y.-F., Chiu, H.-C., et al. (2008). Gender differences and age-related white matter changes of the human brain: a diffusion tensor imaging study. NeuroImage 39, 566–577. doi: 10.1016/j.neuroimage.2007.09.017

Kocar, T. D., Behler, A., Ludolph, A. C., Müller, H.-P., and Kassubek, J. (2021a). Multiparametric microstructural MRI and machine learning classification yields high diagnostic accuracy in amyotrophic lateral sclerosis: proof of concept. Front. Neurol. 12:745475. doi: 10.3389/fneur.2021.745475

Kocar, T. D., Müller, H.-P., Ludolph, A. C., and Kassubek, J. (2021b). Feature selection from magnetic resonance imaging data in ALS: a systematic review. Ther. Adv. Chronic. Dis. 12:204062232110510. doi: 10.1177/20406223211051002

Kukačka, J., Golkov, V., and Cremers, D. (2017). Regularization for deep learning: a taxonomy. arXiv [preprint]. Available online at: http://arxiv.org/abs/1710.10686 (accessed February 28, 2022).

Kunimatsu, A., Aoki, S., Masutani, Y., Abe, O., Hayashi, N., Mori, H., et al. (2004). The optimal trackability threshold of fractional anisotropy for diffusion tensor tractography of the corticospinal tract. Magn. Reson. Med. Sci. 3, 11–17. doi: 10.2463/mrms.3.11

Lampe, L., Niehaus, S., Huppertz, H.-J., Merola, A., Reinelt, J., Mueller, K., et al. (2022). Comparative analysis of machine learning algorithms for multi-syndrome classification of neurodegenerative syndromes. Alzheimers Res. Ther. 14:62. doi: 10.1186/s13195-022-00983-z

Le, Q. V., Jaitly, N., and Hinton, G. E. (2015). A simple way to initialize recurrent networks of rectified linear units. arXiv [preprint]. Available online at: http://arxiv.org/abs/1504.00941 (accessed February 28, 2022).

Leeuwenberg, A. M., van Smeden, M., Langendijk, J. A., van der Schaaf, A., Mauer, M. E., Moons, K. G. M., et al. (2022). Performance of binary prediction models in high-correlation low-dimensional settings: a comparison of methods. Diagn. Progn. Res. 6:1. doi: 10.1186/s41512-021-00115-5

López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M., and Kroemer, G. (2013). The hallmarks of aging. Cell 153, 1194–1217. doi: 10.1016/j.cell.2013.05.039

Lyon, A. (2014). Why are normal distributions normal? Br. J. Philos. Sci. 65, 621–649. doi: 10.1093/bjps/axs046

McEvoy, L. K., Fennema-Notestine, C., Elman, J. A., Eyler, L. T., Franz, C. E., Hagler, D. J., et al. (2018). Alcohol intake and brain white matter in middle aged men: microscopic and macroscopic differences. NeuroImage: Clin. 18, 390–398. doi: 10.1016/j.nicl.2018.02.006

Mori, S., Kaufmann, W. E., Davatzikos, C., Stieltjes, B., Amodei, L., Fredericksen, K., et al. (2002). Imaging cortical association tracts in the human brain using diffusion-tensor-based axonal tracking. Magn. Reson. Med. 47, 215–223. doi: 10.1002/mrm.10074

Müller, H.-P., Kassubek, J., Grön, G., Sprengelmeyer, R., Ludolph, A. C., Klöppel, S., et al. (2014). Impact of the control for corrupted diffusion tensor imaging data in comparisons at the group level: an application in Huntington disease. Biomed. Eng. Online 13:128. doi: 10.1186/1475-925X-13-128

Müller, H.-P., Turner, M. R., Grosskreutz, J., Abrahams, S., Bede, P., Govind, V., et al. (2016). A large-scale multicentre cerebral diffusion tensor imaging study in amyotrophic lateral sclerosis. J. Neurol. Neurosurg. Psychiatry 87, 570–579. doi: 10.1136/jnnp-2015-311952

Müller, H.-P., Unrath, A., Riecker, A., Pinkhardt, E. H., Ludolph, A. C., and Kassubek, J. (2009). Intersubject variability in the analysis of diffusion tensor images at the group level: fractional anisotropy mapping and fiber tracking techniques. Magn. Reson. Imaging 27, 324–334. doi: 10.1016/j.mri.2008.07.003

Münch, M., Müller, H.-P., Behler, A., Ludolph, A. C., and Kassubek, J. (2022). Segmental alterations of the corpus callosum in motor neuron disease: a DTI and texture analysis in 575 patients. NeuroImage: Clin. 35:103061. doi: 10.1016/j.nicl.2022.103061

Niu, X., Zhang, F., Kounios, J., and Liang, H. (2020). Improved prediction of brain age using multimodal neuroimaging data. Hum. Brain Mapp. 41, 1626–1643. doi: 10.1002/hbm.24899

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830.

Pfefferbaum, A., Rosenbloom, M. J., Chu, W., Sassoon, S. A., Rohlfing, T., Pohl, K. M., et al. (2014). White matter microstructural recovery with abstinence and decline with relapse in alcohol dependence interacts with normal ageing: a controlled longitudinal DTI study. Lancet Psychiatry 1, 202–212. doi: 10.1016/S2215-0366(14)70301-3

Raz, N., and Rodrigue, K. M. (2006). Differential aging of the brain: patterns, cognitive correlates and modifiers. Neurosci. Biobehav. Rev. 30, 730–748. doi: 10.1016/j.neubiorev.2006.07.001

Rosskopf, J., Müller, H.-P., Dreyhaupt, J., Gorges, M., Ludolph, A. C., and Kassubek, J. (2015). Ex post facto assessment of diffusion tensor imaging metrics from different MRI protocols: preparing for multi-centre studies in ALS. Amyotroph. Lateral. Scler. Frontotemporal Degener. 16, 92–101. doi: 10.3109/21678421.2014.977297

Salat, D. H., Tuch, D. S., Greve, D. N., van der Kouwe, A. J. W., Hevelone, N. D., Zaleta, A. K., et al. (2005). Age-related alterations in white matter microstructure measured by diffusion tensor imaging. Neurobiol. Aging 26, 1215–1227. doi: 10.1016/j.neurobiolaging.2004.09.017

Sarica, A., Cerasa, A., Valentino, P., Yeatman, J., Trotta, M., Barone, S., et al. (2017). The corticospinal tract profile in amyotrophic lateral sclerosis. Hum. Brain Mapp. 38, 727–739. doi: 10.1002/hbm.23412

Shapiro, S. S., and Wilk, M. B. (1965). An analysis of variance test for normality (Complete Samples). Biometrika 52:591. doi: 10.2307/2333709

Smith, S. M., Vidaurre, D., Alfaro-Almagro, F., Nichols, T. E., and Miller, K. L. (2019). Estimation of brain age delta from brain imaging. NeuroImage 200, 528–539. doi: 10.1016/j.neuroimage.2019.06.017

Talai, A. S., Sedlacik, J., Boelmans, K., and Forkert, N. D. (2021). Utility of multi-modal MRI for differentiating of Parkinson’s disease and progressive supranuclear palsy using machine learning. Front. Neurol. 12:648548. doi: 10.3389/fneur.2021.648548

Tsai, C.-C., Chen, Y.-L., Lu, C.-S., Cheng, J.-S., Weng, Y.-H., Lin, S.-H., et al. (2022). Diffusion tensor imaging for the differential diagnosis of Parkinsonism by machine learning. Biomed. J. Online ahead of print. doi: 10.1016/j.bj.2022.05.006

Voldsbekk, I., Barth, C., Maximov, I. I., Kaufmann, T., Beck, D., Richard, G., et al. (2021). A history of previous childbirths is linked to women’s white matter brain age in midlife and older age. Hum. Brain Mapp. 42, 4372–4386. doi: 10.1002/hbm.25553

Westlye, L. T., Walhovd, K. B., Dale, A. M., Bjornerud, A., Due-Tonnessen, P., Engvig, A., et al. (2010). Life-Span changes of the human brain white matter: diffusion tensor imaging (DTI) and volumetry. Cereb. Cortex 20, 2055–2068. doi: 10.1093/cercor/bhp280

Keywords: diffusion tensor imaging, age dependence, magnetic resonance imaging, fractional anisotropy, diffusivity, machine learning, neural network

Citation: Kocar TD, Behler A, Leinert C, Denkinger M, Ludolph AC, Müller H-P and Kassubek J (2022) Artificial neural networks for non-linear age correction of diffusion metrics in the brain. Front. Aging Neurosci. 14:999787. doi: 10.3389/fnagi.2022.999787

Received: 21 July 2022; Accepted: 04 October 2022;
Published: 20 October 2022.

Edited by:

Vijay Venkatraman, The University of Melbourne, Australia

Reviewed by:

Eirini Messaritaki, Cardiff University, United Kingdom
Joseph M. Gullett, University of Florida, United States

Copyright © 2022 Kocar, Behler, Leinert, Denkinger, Ludolph, Müller and Kassubek. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Thomas D. Kocar, thomas.kocar@uni-ulm.de; Jan Kassubek, jan.kassubek@uni-ulm.de

^†These authors share senior authorship
