Deep Feature Selection and Causal Analysis of Alzheimer’s Disease

Liu, Yuanyuan; Li, Zhouxuan; Ge, Qiyang; Lin, Nan; Xiong, Momiao

doi:10.3389/fnins.2019.01198

METHODS article

Front. Neurosci., 15 November 2019

Sec. Brain Imaging Methods

Volume 13 - 2019 | https://doi.org/10.3389/fnins.2019.01198

This article is part of the Research TopicBrain-image Based Computation for Supporting Clinical Decision in Neurological and Psychiatric DisordersView all 17 articles

Deep Feature Selection and Causal Analysis of Alzheimer’s Disease

Momiao Xiong^*

Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center, Houston, TX, United States

Deep convolutional neural networks (DCNNs) have achieved great success for image classification in medical research. Deep learning with brain imaging is the imaging method of choice for the diagnosis and prediction of Alzheimer’s disease (AD). However, it is also well known that DCNNs are “black boxes” owing to their low interpretability to humans. The lack of transparency of deep learning compromises its application to the prediction and mechanism investigation in AD. To overcome this limitation, we develop a novel general framework that integrates deep leaning, feature selection, causal inference, and genetic-imaging data analysis for predicting and understanding AD. The proposed algorithm not only improves the prediction accuracy but also identifies the brain regions underlying the development of AD and causal paths from genetic variants to AD via image mediation. The proposed algorithm is applied to the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset with diffusion tensor imaging (DTI) in 151 subjects (51 AD and 100 non-AD) who were measured at four time points of baseline, 6 months, 12 months, and 24 months. The algorithm identified brain regions underlying AD consisting of the temporal lobes (including the hippocampus) and the ventricular system.

Introduction

Alzheimer’s disease (AD) causes progressive brain atrophy and memory loss, is a progressive, irreversible degenerative disease of the brain, and is the most common neurodegenerative disease in the world (Struyfs et al., 2015; Zhuang et al., 2017; Liu et al., 2018a,b). AD is an increasingly prevalent disease affecting an estimated 5.4 million Americans and more than 30 million people in the world. It is estimated that these numbers will be tripled by 2050. AD is the sixth leading cause of death in the United States (Alzheimer’s Association, 2016; Leandrou et al., 2018).

Diagnosis and prediction of AD via clinical and psychometric assessments are challenging (Leandrou et al., 2018). The AD patients cannot obtain early and accurate diagnosis through clinical dementia rating and cognitive tests. A final diagnosis of AD is confirmed by histological examination at postmortem biopsy. However, the histological examination of the brain for the living patients is infeasible. Individually varying brain structure, function, and pathological effects can be measured by images. Therefore, imaging plays an important role in improving diagnosis and prediction of AD. According to the recommendation by the National Institute of Neurological and Communicative Disorders and Stroke–AD and Related Disorders Association (NINCDS-ADRDA) Work Group, the clinical classification of AD should explore the image markers: magnetic resonance imaging (MRI), diffusion tensor imaging (DTI), positron emission tomography (PET), amyloid-PET, tau-PET, and abnormal neuronal cerebrospinal fluid (CSF) markers (tau and/or Aβ) (Dubois et al., 2007; Leandrou et al., 2018; Liu et al., 2018a,b).

As the size of the imaging datasets increases, manual analysis of imaging data is tedious and time-consuming. Computer-aided diagnosis (CAD) of AD that combines computational models and analytical tools for high-dimensional imaging data analysis is emerging as one of the major tools for diagnosis and prediction of AD (Dimitriadis et al., 2018; Leandrou et al., 2018). The widely used machine learning (ML) methods in CAD include discriminant analysis (DA), logistic regression (LR), random forest, neural networks, and support vector machine (SVM) (Lorenzi et al., 2017; Sarica et al., 2017; Dimitriadis et al., 2018; Leandrou et al., 2018). Deep learning, a rapidly resurging subfield of ML, outperforms many classical ML approaches and is emerging as a major analytic platform in ML (Esteva et al., 2019). Deep learning with massive amounts of computational power has produced a revolution in driverless cars, speech recognition, and imaging analysis (Waldrop, 2019) and demonstrated great potential for the diagnosis and predictive power in tuberculosis (Heo et al., 2019), cancer (Esteva et al., 2017; Haenssle et al., 2018; Ghatwary et al., 2019; Ladefoged et al., 2019), diabetic retinopathy (Gulshan et al., 2016), chronic kidney disease (Ravizza et al., 2019), AD (Payan and Montana, 2015; Hosseini-Asl et al., 2016; Sarraf and Tofighi, 2016; Ju et al., 2017; Ding et al., 2018; Wada et al., 2018; Spasov et al., 2019), and conversion from mild cognitive impairment (MCI) to AD (Choi et al., 2018; Spasov et al., 2019). There is a growing interest in the application of deep learning to health care and medicine.

Despite its great progresses in computer vision, natural language processing, control, decision making, diagnosis, and early detection of complex diseases, deep leaning is also well known as a “black box” owing to its low interpretability to humans and still has a serious opacity problem (Waldrop, 2019). Overcoming the limitation of the lack of transparency and interpretation remains a great challenge for deep learning (Dubois et al., 2007). In this paper, we develop a novel general framework that integrates deep leaning and causal inference for image classification. The new framework for image analysis consists of two stages: (1) develop convolutional neural networks (CNN) to classify AD status on the basis of DTI and use of occlusion map to find image regions that are most distinctive for disease status and (2) the use of state-of-the-art causal inference tools to determine if the selected image regions are causal for AD.

Brain anatomy, structural connectivity, and physical connection between brain regions that are characterized through water molecular diffusing within white matter tracts can be measured by DTI. The imaging signals provide intermediate endophenotypes. Genetic variants will influence brain microstructure, function, and disease development. Understanding the role that genetics has in imaging and disease variation is key to understanding the causal chain of complex diseases (Jahanshad et al., 2013; Bycroft et al., 2018; Elliott et al., 2018). Therefore, to further cover the genetic bases of brain structures and function, and mechanism of AD, a joint analysis of the genetic brain images and AD will be carried out. We will assess both association and causal relationships among genetic variants, brain regions, and AD.

Materials and Methods

Materials

The DTI images used in this study are downloaded from the Alzheimer’s Disease Neuroimaging Initiative (ADNI); the size of each image was 91 × 109 × 91. ADNI is a longitudinal multicenter study designed to develop clinical, imaging, genetic, and biomedical biomarkers for the early detection and tracking of AD¹ (Alzheimer’s Disease Neuroimaging Initiative, 2019). DTI images were recorded for every participant from different time points in which they joined the research study. The diagnostic results were normal control (NC), MCI, and AD. In this study, DTI images of 151 individuals from NCs (100 images) and AD (51 images) groups were chosen from four different diagnostic time points: baseline, 6 months, 12 months, and 24 months.

Image Preprocessing

To make sure that all the images for this analysis are comparable, we register all the DTI image data for every subject at every time point to the common template, which can be downloaded from the McConnell Brain Imaging Centre². We utilized a strategy of combination of linear and non-linear registration algorithm to map each individual DTI data to the common template. During the linear image registration procedure, we first map the image data to the common template to make sure all the images are within the standard brain region by using FLIRT (FMRIB’s Linear Image Registration Tool) from FSL (FMRIB software library) image analysis suite³. Then we further applied non-linear registration algorithm, which is implemented in RNiftyReg to map the image details within the standard brain. The linear image registration process helps us restrain each individual DTI image to a standard template, and the non-linear image registration helps us to make sure that the registered image maintains the structures details as the original data.

Genetic Data Preprocessing

We performed quality control (QC) in both individual level and single-nucleotide polymorphism (SNP) level QC in the plink binary format. For the individual level QC, the following steps were applied to the data:

1. Individuals with discordant gender information were removed from the data.

2. Individuals with missing rate >10% were removed from the data.

3. Individuals with heterozygosity rate of more than three standard deviations from the mean were excluded from the data.

4. Individuals with identity by descent (IBD) > 0.185 were excluded from the data.

After the individual level QC was conducted, the following steps for SNP level QC were further applied to the data:

1. SNPs with missing genotype rate >10% were excluded from the data being analyzed.

2. SNPs with P-value for Hardy–Weinberg equilibrium (HWE) test <1E-6 were excluded from the data.

3. SNPs without polymorphism were removed from the data.

Then pre-imputation QC tool from McCarthy Groups was further applied to check the data against 1000G reference data. The imputation of the genetic data was conducted under the SHAPEIT + IMPUTE2 framework in the internal computational clusters. The 1000G reference data were used as the reference panel for imputation. After the imputation, the SNP level QC steps were applied again to the data to produce the final genetic data for analysis. Finally, a total of 1,589,061 common SNPs in 36,480 genes genotyped in 151 individuals were included in analysis.

Architecture of Convolutional Neural Network

The CNN model Visual Geometry Group (VGG) that won the first and second places in the localization and classification tracks, respectively, in the ImageNet Challenge 2014 was chosen for image classification and prediction (Simonyan and Zisserman, 2014). To improve the classification accuracy, the VGG utilized smaller receptive window size and increased the depth of the network. Furthermore, to prevent overfitting and improve the image region recognition ability of the networks, global average pooling (GAP) layer was used as a structure regularizer and localizer in the model to identify the complete extent of the object and exactly which regions of an image are being used for classification (Zhou et al., 2016).

As is shown in Figure 1, the network contained five max pooling layers, followed with a GAP layer before a fully connected softmax layer with two nodes.

FIGURE 1

Figure 1. VGG-GAP model architecture. The CNNs in the model included five max pooling layers and one GAP layer before fully connected layer. VGG, Visual Geometry Group; GAP, global average pooling; CNN, convolutional neural network.

Three-dimensional (3D) whole brain images with 109 × 91 × 91 size were input into CNN. DTI measures microscopic random motion of water molecules, which uncovers the orientation of surrounding tissues, and provides tract information on brain structure. Convolution of an image with different filters can perform operations that capture various types of features and directional information of DTI images and can preserve tract of DTI and the relationship between pixels. 3D CNNs (3D-CNN) with five convolutional layers and three fully connected layers were used for AD prediction. A 3D filter was applied to the dataset, and the filter moves in three directions (X, Y, Z) to calculate the low-level feature representations. Specifically, 3D filters were arranged as in Table 1.

TABLE 1

Table 1. 3D filters in five convolutional layers.

To overcome the small sample size limitation of medical images, image augmentation techniques were used (Aderghal et al., 2017). The first technique we applied was Gaussian filters to blur the image to mimic the possible variations in the original images. A filter size of 3 × 3, 5× 5, and 7× 7 were used with spread parameters of 0.7, 0.7, and 0.6, respectively. The second augmentation technique we used was translation, where we shifted the images by ±1 pixel in each dimension. This imitates the possible variations in registration process where the images were aligned with the template. Finally yet importantly, the images were flipped horizontally because some regions of the brain (e.g., the hippocampus) are symmetrical to enlarge our sample size. To balance the data, we randomly duplicated some images from the under-sampled category. Data augmentation and class balancing produced over 20 times more data than the original dataset.

The model was trained in the Texas Advanced Computing Center (TACC) Maverick2 with NVIDIA GTX 1080 Ti GPUs.

Deep Feature Selection for Diffusion Tensor Imaging Images

Prediction difference analysis for visualizing the response of CNN to a specific input was used to select features for DTI image classification (Zintgraf et al., 2017). Specifically, prediction difference analysis estimates the importance of input pixels by calculating the effect of removing information from the imaging on the class prediction precision (Zeiler et al., 2014).

A sliding window (patch) of 3 × 3 × 3 was applied to each image. The imaging signals contained in the sliding window were taken as a feature. Each one 3 × 3 × 3 patch was replaced by randomly sampled values from multivariate normal distributions. The resulting new image where the imaging feature (information) was removed was input into a previously trained CNN model to obtain probability p₁ for predicting AD. Let p₀ be the probability of predicting AD using the original images [without removing the feature (information)]. The relative importance of the feature was evaluated by Zintgraf et al. (2017).

d = \log (\frac{\frac{p_{0}}{1 - p_{0}}}{\frac{p_{1}}{1 - p_{1}}}) (1)

The sliding window moved across the entire image and a relevance matrix, W of the same size as the whole image was generated, which reflected the relevance importance of all image pixels. A positive value indicated that the pixel contributed evidence for the classification of AD, whereas a negative value showed that the pixel contributed against the classification of AD. For details, please see Zintgraf et al. (2017).

Conditional Generative Adversarial Network and Classifier Two-Sample Tests for Causal Discovery

Three-dimensional functional principal component (FPC) scores were used to summarize the imaging signal information of the brain region (Xiong, 2018). Similarly, 1D FPCs can be used to summarize genetic information in the gene. Conditional generative adversarial networks (CGANs) will be used to discover causal relationships between the brain neuroimaging region and AD and causal relationships between the brain neuroimaging region and gene as well (Goodfellow et al., 2014; Lopez-Paz and Oquab, 2017) (Figure 2). Specifically, consider two variables X and Y, which can be binary disease status or continuous FPCs summarizing imaging signals in the brain region or genetic variation in the gene. If X causes Y, denoted by X → Y, then we have

FIGURE 2

Figure 2. Workflow of causal inference using CGAN and a classifier two-sample test. CGAN, conditional generative adversarial network. (A) A visual explanation of CGAN and (B) the complete workflow of causal discovery.

Y = f_Y(X, N_Y),

where f_Y is a non-linear function and realized by CGAN where a neural network is used to approximate the non-linear function f_Y(X, N_Y), and N_Y is a noise random variable and is independent of cause X. Similarly, if Y causes X(Y→X), then we have

X = f_X(X, N_X),

where f_X is a non-linear function and N_X is a noise random variable and is independent of cause Y. Assume that n subjects are sampled.

We define dataset D_w = {u_i, v_i, i = 1, …, n}. We assign label 0 to dataset D_u = {u_i, i = 1, …, n} and 1 to dataset D_v = {v_i, i = 1, …, n}. Let P be the distribution of u_i, i = 1, …, n and Q be the distribution of v_i, i = 1, …, n. We use the K nearest neighbor (KNN) as a binary classifier to classify two datasets and define the test statistic t as the classification accuracy to test the null hypothesis of equal distributions of two datasets P = Q. Let z be a random variable.

The procedures for bivariate causal discovery using CGAN are summarized as follows (Lopez-Paz and Oquab, 2017):

1. Use a CGAN from X→Y to generate the dataset D_{X → Y} = { (x_i, ${\hat{y}}_{i}$ = f_y (x_i,z_i)), i = 1,…,n}.

2. Use a CGAN from Y→X to generate the dataset D_{Y→ X} = {( ${\hat{x}}_{i}$ = f_X (y_i,z_i),y_i), i = 1,…,n}.

3. Divide the total samples into training samples and test samples.

4. Classify two datasets : D_u = D_y = {y_i, i = 1, …, n} versus D_v = D_{X → Y} = { ${\hat{y}}_{i}$ , i= 1,…,n} and calculate the two-sample statistic ${\hat{t}}_{X \to Y}$ .

5. Classify two datasets: D_u = D_x = {x_i, i = 1, …, n} versus D_v = D_{Y → X} = { ${\hat{x}}_{i}$ , i= 1,…,n} and calculate the two-sample statistic ${\hat{t}}_{Y \to X}$ .

6. Calculate the test statistic T = ${\hat{t}}_{X \to Y}$ $- {\hat{t}}_{Y \to X}$ . Under the null hypothesis of no causal relationship or test inconclusive, the statistic T is asymptotically distributed as

N(0,σ²), where $σ^{2} = \frac{0.5}{n_{test}} - 2 c o v ({\hat{t}}_{X \to Y}, {\hat{t}}_{Y \to X})$ and n_test is the number of subjects in the test set.

Association is defined as measuring the dependence or correlation between two variables and to use these dependencies for prediction that is not dealing with causal problems. Almost all currently used statistical methods in imaging genetics [such as sparse canonical correlation analysis (SCCA), sparse reduced rank regression (SRRR), and parallel independent component analysis (ICA)] are association analysis methods. These methods can detect association between genetic variation and imaging signals. It is well known that correlation or association analysis does not imply causation. The signals identified by association analysis may not have specific pathological relevance to diseases. Association signals provide limited information on the causal mechanism of diseases. Most genetic and imaging analysis questions to uncover the mechanism of the disease are causal in nature. Causation analysis is essential to the genetic analysis of complex phenotypes yet ignored for a long time.

Distinguishing causation from association is an age-old problem. Intuitively, causation implies that changes in one variable will directly make changes in the other. The essential distinction between association and causation relies on what the response will be if we intervene in the system (Lattimore and Ongv, 2018).

There are two types of causal inference: interventional causal inference and observational causal inference. Interventional causal inference learns the effect of taking an action directly via experiments, for example, randomized controlled trials. Interventional experiments are a gold standard for causal inference. However, because in human genetics we cannot change the genetic materials of human subjects, experimental interventions are unethical and infeasible. Therefore, it is essential to develop statistical methods and algorithms to predict the outcomes of an intervention from passive observation.

The additive noise models (ANMs) assume one causal direction X → Y but no reversible causal direction Y → X. Causation is asymmetric. However, the association of X and Y can be (1) X → Y, (2) Y → X, and (3) X → Y, Y→ X. Association is symmetric.

Additive noise models are based on the independence of cause and mechanism (ICM) principle. ICM assumes that causes and mechanisms are chosen independently by nature, which is a recently proposed principle for causal reasoning and causal learning (Peters et al., 2017). ICM assumes that the mechanism that generates effect from its cause contains no information about the cause, which implies that X and N_Y in the ANMs are independent. However, X and N_Y in the non-linear regression model Y = f_Y(X, N_Y) may be dependent.

In summary, association is studied by observed conditional distribution, and causation is investigated by interventional distribution where causal effect is determined by the effect of hypothetic manipulation of an input on an output. In other words, association is investigated by seeing, and causation is investigated by doing.

Results

Alzheimer’s Disease Classification and Prediction

The VGG network with 3D filters was used for classification and prediction of AD using 3D whole brain DTI images at four different time points: baseline, 6 months, 12 months, and 24 months. We consider two classes: AD and NC. AD prediction accuracy using VGG is listed in Table 2, and its sensitivity and specificity are shown in Table 3, where the first and second values in the brackets represent sensitivity and specificity, respectively. Tables 2, 3 demonstrate that the prediction accuracy, sensitivity, and specificity of VGG using the training dataset at baseline to predict AD in the test datasets at baseline, 6 months, 12 months, and 24 months were 0.8675 (0.6873, 0.9600), 0.8452 (0.6364, 0.9600), 0.8335 (0.7295, 0.8995), and 0.7463 (0.6294, 0.8853), respectively. In other cases, we can observe similar results. The area under the curve (AUC) using the training data at baseline, 6 months, 12 months, and 24 months for prediction of AD in the test datasets at the same time points was 0.8571, 0.8291, 0.8583, and 0.7756, respectively. The low sensitivity of prediction of AD may be due to small and imbalanced sample size (51 AD and 100 controls). A much higher proportion of non-AD controls have decreased sensitivity but increased specificity. Deep VGG that has a large number of parameters to be estimated requires large sample sizes. Although we used data augmentation methods to increase sample sizes, augmentation methods still did not provide large and reliable sample sizes. Large sample sizes are an important issue for increasing the prediction of accuracy.

TABLE 2

Table 2. AD prediction accuracy on fivefold cross validation.

TABLE 3

Table 3. Average sensitivity and specificity over fivefold cross validation.

Region Selection and Interpretation

Relative importance of value d was sorted. Image areas whose relative importance value was in the top 10th percentile were considered as features that contributed substantially to the prediction of AD. We identified 23 important brain regions that contributed substantially to AD prediction. The results are shown in Figure 3 where each subfigure has 91 × 109 pixel sizes, where the darker the red color is, the more important the brain region is to the prediction accuracy. The brain regions with red color included the temporal lobe (the left temporal, medial, and right temporal lobes), ventricles and enlarged ventricle, occipital lobe, and prefrontal area. To further interpret the image analysis results and increase their transparency, we tested the causal relationships between DTI image ROIs and AD disease at baseline, 6 months, 12 months, and 24 months using CGAN-based statistics. After Bonferroni correction, P-value < 0.0022 was the threshold to declare significance. The number of identified brain regions that showed significant causation to AD at baseline, 6 months, 12 months, and 24 months was 1, 1, 2, and 4, respectively. Table 4 lists ROIs where P-values for testing causation between the ROI and AD were <0.05. Three remarkable features emerged from these results. First, as time passed, AD progressed from mild (early stage), via moderate (middle stage), to severe (late stage), which resulted in atrophy of more and more brain regions. Therefore, we observed the increased number of significant causal brain regions with AD as the study time of AD increased from the baseline to 24 months. Second, in general, as AD progressed, the significance of causation between the brain region and AD increased (P-values for testing causation decreased). Third, the brain region in ROI 18 (the ventricles and enlarged ventricle) (Figure 4) showed significant causation to AD at all four time points (baseline, 6 months, 12 months, and 24 months). The brain regions in ROI 14 (the left temporal lobe) (Figure 4) showed significant causation at 12 and 24 months after Bonferroni correction. The literature reports that these regions are related to AD. The left temporal lobe is involved in language and AD (Cretin et al., 2015; Flick et al., 2018; Trimmel et al., 2018), and the right temporal lobe atrophy is involved in severe impairment in emotion recognition (Everhart et al., 2015) and causes frontotemporal dementia (Gliebus, 2014), with the brain ventricles often affected AD (Ferrarini et al., 2006). Ventricle enlargement is a useful structural biomarker for the diagnosis of AD (Anandh et al., 2014).

FIGURE 3

Figure 3. Visualization of the brain regions with relative importance values at the baseline, 6 months, 12 months, and 24 months. The deeper the red color of the brain region, the more important for AD prediction. AD, Alzheimer’s disease.

TABLE 4

Table 4. Causations between DTI image ROIs and AD disease status.

FIGURE 4

Figure 4. Three brain regions showed causation to AD. AD, Alzheimer’s disease.

Genetic Studies of Two Brain Regions

To uncover genetic architecture of brain regions, in addition to genetic-imaging association analysis, we conducted genetic-imaging causal analysis using the CGAN where imaging signals within the brain region and SNPs within the gene were summarized by 2D functional principle scores and classical functional principle scores, respectively (Lopez-Paz and Oquab, 2016). The total number of candidate genes being tested was 61. After Bonferroni correction, the P-value for declaring significance of both causation and association was 0.00082. We presented the results of P-values < 0.05 in causal analysis and association analysis of genetic variation in 61 candidate genes with two brain regions, the left temporal lobe and frontal and temporal left lobe, and the right temporal lobe as seen in Supplementary Tables S1, S2, respectively, where 61 genes were obtained from genome-wide causation studies of AD in the manuscript (Lin et al., unpublished). In Supplementary Tables S1, S2, the P-values in bold green denote significant causation or association after Bonferroni corrections. The majority of genes that had causal or association relationships with brain neuroimaging phenotypes were identified at all time points (baseline, 6 months, 12 months, and 24 months). We also observed that these identified genes had causal or association relationships with both the left temporal lobe and right temporal lobe regions. The identified genes CD33, COBL, and APP that had causal relationships with brain neuroimaging regions were confirmed multiple times in the literature (Bradshaw et al., 2013; Mez et al., 2017; Kovacs et al., 2018; Van Giau et al., 2018; Huang C.C. et al., 2019; Huang C.Y. et al., 2019). It was also reported that gene FGF4 was involved in neurodevelopmental disorders (Grillo et al., 2014), FRMD6 was implicated in AD (Hong et al., 2012), Dock9 played an important role in regulation of morphological changes in hippocampal neurons (Kuramoto et al., 2009), H3F3B was associated with a broad schizophrenia phenotype (Manley et al., 2018), SCYL1 was involved in cerebellar atrophy (Lenz et al., 2018), AKAP5 played a significant role in the regulation of sympathetic nerve activities (Han et al., 2016), and PIGC was involved in epilepsy and intellectual disability (Edvardson et al., 2017).

Discussion

In this paper, we presented a general artificial intelligence (AI) platform for prediction of AD using DTI images. Non-transparency could be a major challenge of deep learning for medical image analysis. To meet this challenge, we introduced three approaches to medical image interpretation: feature selection and visualization, causal analysis of neuroimaging region, and genetic-imaging analysis. Feature selection and visualization methods selected and visualized brain regions as a potential pathology of AD. Further CGAN evaluation and two-sample tests discovered potential causal relationships between the brain neuroimaging regions and AD. We observed the increased number of significant causal brain regions with AD when AD progressed. In general, as AD progressed, the significance of causation between the brain region and AD increased (P-values decreased). We observed that the ventricles and enlarged ventricle and the left and right temporal lobes had strong causal relationships with AD. Temporal lobes including the hippocampus are crucial in AD development at the early stages, whereas the ventricles and enlarged ventricle are a useful structural biomarker for the diagnosis of AD. Joint causal analysis of genetic and images of the left and right temporal regions using CGAN evaluation and two-sample tests mapped CD33, COBL, FRMD6, APP, and other genes to the left and right temporal brain regions.

Many findings in the paper can be confirmed in the literature. For example, both prediction analysis using deep learning and causal analysis using CGAN and a two-sample test identified the brain temporal lobe region that was involved in AD. The temporal lobe includes the hippocampus and its surrounding regions. It is well known that the temporal lobe consists of structures that are vital for long-term memory. There are numerous reports that the temporal lobe including the left, medial, and right temporal lobes are involved in AD pathology (Kakeda and Korogi, 2010; Li and Chen, 2015; Menéndez-González et al., 2015; Aggleton et al., 2016; Delgado-González et al., 2017; Pettigrew et al., 2017; Wolk et al., 2017; Jung et al., 2018; Kitchigina, 2018; Persson et al., 2018; Grajski and Bressler, 2019; Kenkhuis et al., 2019; Lam et al., 2019; Pasquini et al., 2019; Xie et al., 2019). DTI discovered the functional and structural connectivity between the medial temporal lobe (MTL) and posteromedial cortex (PMC) (Buckner et al., 2008; Pasquini et al., 2019). The MTL includes the hippocampal formation and other cortices. These regions underlie memory processing through interplay with neocortical areas from the PMC. AD-related pathological changes such as tau accumulation and amyloidβ deposition often affect the PMC and MTL regions. The functional and structural disconnections between the MTL and PMC cause the development and progression of AD.

The literature confirmed the identified pathological paths from genetic variants to AD via brain regions: CD33 → medial temporal and hippocampus (Wang et al., 2019) → AD (Pasquini et al., 2019) and CD33 → AD (Miles et al., 2019); APP → medial and lateral temporal lobe (Huang C.C. et al., 2019) → AD (Buckner et al., 2008) and APP → AD (Zhou et al., 2011); SCYL1 → cerebellar atrophy (Schmidt et al., 2015) → AD (Gallo et al., 2017); and SCYL1 → neurodegenerative disease (Schmidt et al., 2007). These provided indirect evidences of identified biomarkers for unraveling mechanism of AD.

The results in this paper are preliminary. Sample sizes need to be increased and additional datasets analyzed to replicate the results. The purpose of this paper is to stimulate further discussions regarding the great challenges we are facing in developing robust deep learning platforms that combine multiple modes of imaging tools and have high accuracy across multiple datasets and uncovering causal pathways from genetic variants to disease via brain imaging regions.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: http://adni.loni.usc.edu/.

Author Contributions

YL developed the software and conducted the data analysis. ZL contributed to the data preprocessing and partial writing. NL contributed to data preprocessing. QG conducted the partial data analysis. MX designed the study and wrote the manuscript.

Funding

YL was supported by the UTHealth Innovation for Cancer Prevention Research Training Program Pre-doctoral Fellowship (Cancer Prevention and Research Institute of Texas Grant No. RP160015).

Disclaimer

The content is solely the responsibility of the authors and does not necessarily represent the official views of the Cancer Prevention and Research Institute of Texas.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

In this study, data used in the preparation of this article were obtained from the ADNI database (http://adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by principal investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial MRI, PET, other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early AD. For up-to-date information, see www.adni-info.org. The authors thank TACC for providing computational resources.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2019.01198/full#supplementary-material

Footnotes

References

Aderghal, K., Boissenin, M., Benois-Pineau, J., Catheline, G., and Afdel, K. (2017). “Classification of sMRI for AD diagnosis with convolutional neuronal networks: a pilot 2-D + epsilon Study on ADNI,” in International Conference on Multimedia Modeling, (New York, NY: Springer) 690–701.

Google Scholar

Aggleton, J. P., Pralus, A., Nelson, A. J., and Hornberger, M. (2016). Thalamic pathology and memory loss in early Alzheimer’s disease: moving the focus from the medial temporal lobe to Papez circuit. Brain 139, 1877–1890. doi: 10.1093/brain/aww083

PubMed Abstract | CrossRef Full Text | Google Scholar

Alzheimer’s Association. (2016). 2016 Alzheimer’s disease facts and figures. Alzheimers Dement. 12, 459–509. doi: 10.1016/j.jalz.2016.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Alzheimer’s Disease Neuroimaging Initiative. (2019). About ADNI. Available at: http://adni.loni.usc.edu/about/#core-container (accessed January 12, 2019).

Google Scholar

Anandh, K. R., Sujatha, C. M., and Ramakrishnan, S. (2014). Segmentation of ventricles in Alzheimer mr images using anisotropic diffusion filtering and level set method. Biomed. Sci. Instrum. 50, 307–313.

PubMed Abstract | Google Scholar

Bradshaw, E. M., Chibnik, L. B., Keenan, B. T., Ottoboni, L., Raj, T., Tang, A., et al. (2013). CD33 Alzheimer’s disease locus: altered monocyte function and amyloid biology. Nat. Neurosci. 16, 848–850. doi: 10.1038/nn.3435

PubMed Abstract | CrossRef Full Text | Google Scholar

Buckner, R. L., Andrews-Hanna, J. R., and Schacter, D. L. (2008). The brain’s default network: anatomy, function, and relevance to disease. Ann. N. Y. Acad. Sci. 1124, 1–38. doi: 10.1196/annals.1440.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., et al. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209. doi: 10.1038/s41586-018-0579-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Choi, H., Jin, K. H., and Alzheimer’s Disease Neuroimaging Initiative. (2018). Predicting cognitive decline with deep learning of brain metabolism and amyloid imaging. Behav. Brain Res. 344, 103–109. doi: 10.1016/j.bbr.2018.02.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Cretin, B., Laure, D. L., Blanc, F., and Magnin, E. (2015). Left temporal lobe epilepsy revealing left posterior cortical atrophy due to Alzheimer’s disease. J. Alzheimers Dis. 45, 521–526. doi: 10.3233/JAD-141953

PubMed Abstract | CrossRef Full Text | Google Scholar

Delgado-González, J. C., Florensa-Vila, J., Mansilla-Legorburo, F., Insausti, R., and Artacho-Pérula, E. (2017). Magnetic resonance imaging and anatomical correlation of human temporal lobe landmarks, in 3d euclidean space: a study of control and Alzheimer’s disease subjects. J. Alzheimers Dis. 57, 461–473. doi: 10.3233/JAD-160944

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitriadis, S. I., Liparas, D., and Alzheimer’s Disease Neuroimaging Initiative. (2018). How random is the random forest? Random forest algorithm on the service of structural imaging biomarkers for Alzheimer’s disease: from Alzheimer’s disease neuroimaging initiative (ADNI) database. Neural Regen. Res. 13, 962–970. doi: 10.4103/1673-5374.233433

PubMed Abstract | CrossRef Full Text | Google Scholar

Ding, Y., Sohn, J. H., Kawczynski, M. G., Trivedi, H., Harnish, R., Jenkins, N. W., et al. (2018). A deep learning model to predict a diagnosis of Alzheimer Disease by using 18F-FDG PET of the brain. Radiology 290, 456–464. doi: 10.1148/radiol.2018180958

PubMed Abstract | CrossRef Full Text | Google Scholar

Dubois, B., Feldman, H. H., Jacova, C., DeKosky, S. T., Barberger-Gateau, P., Cummings, J., et al. (2007). Research criteria for the diagnosis of Alzheimer’s disease: revising the NINCDS-ADRDA criteria. Lancet Neurol. 6, 734–746. doi: 10.1016/S1474-4422(07)70178-3

CrossRef Full Text | Google Scholar

Edvardson, S., Murakami, Y., Nguyen, T. T. M., Shahrour, M., St-Denis, A., Shaag, A., et al. (2017). Mutations in the phosphatidylinositol glycan C (PIGC) gene are associated with epilepsy and intellectual disability. J. Med. Genet. 54, 196–201. doi: 10.1136/jmedgenet-2016-104202

PubMed Abstract | CrossRef Full Text | Google Scholar

Elliott, L. T., Sharp, K., Alfaro-Almagro, F., Shi, S., Miller, K. L., Douaud, G., et al. (2018). Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature 562, 210–216. doi: 10.1038/s41586-018-0571-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M., et al. (2017). Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118. doi: 10.1038/nature21056

PubMed Abstract | CrossRef Full Text | Google Scholar

Esteva, A., Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., et al. (2019). A guide to deep learning in healthcare. Nat. Med. 25, 24–29. doi: 10.1038/s41591-018-0316-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Everhart, D. E., Watson, E. M., Bickel, K. L., and Stephenson, A. J. (2015). Right temporal lobe atrophy: a case that initially presented as excessive piety. Clin. Neuropsychol. 29, 1053–1067. doi: 10.1080/13854046.2015.1104387

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferrarini, L., Palm, W. M., Olofsen, H., van Buchem, M. A., Reiber, J. H., and Admiraal-Behloul, F. (2006). Shape differences of the brain ventricles in Alzheimer’s disease. Neuroimage 32, 1060–1069. doi: 10.1016/j.neuroimage.2006.05.048

PubMed Abstract | CrossRef Full Text | Google Scholar

Flick, G., Oseki, Y., Kaczmarek, A. R., Al Kaabi, M., Marantz, A., and Pylkkänen, L. (2018). Building words and phrases in the left temporal lobe. Cortex 106, 213–236. doi: 10.1016/j.cortex.2018.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Gallo, M., Frangipane, F., Cupidi, C., De Bartolo, M., Turone, S., Ferrari, C., et al. (2017). The novel PSEN1 M84V mutation associated to frontal dysexecutive syndrome, spastic paraparesis, and cerebellar atrophy in a dominant Alzheimer’s disease family. Neurobiol. Aging 56:213.e7-213.e12. doi: 10.1016/j.neurobiolaging.2017.04.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Ghatwary, N., Zolgharni, M., and Ye, X. (2019). Early esophageal adenocarcinoma detection using deep learning methods. Int. J. Comput. Assist. Radiol. Surg. doi: 10.1007/s11548-019-01914-4 [Epub ahead of print].

CrossRef Full Text | PubMed Abstract | Google Scholar

Gliebus, G. (2014). A case report of anxiety disorder preceding frontotemporal dementia with asymmetric right temporal lobe atrophy. SAGE Open Med. Case Rep. 2:2050313X13519977. doi: 10.1177/2050313X13519977

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., et al. (2014). Generative adversarial nets. Adv. Neural Inform. Process. Syst. 2, 2672–2680.

Google Scholar

Grajski, K. A., and Bressler, S. L. (2019). Alzheimer’s disease neuroimaging initiative. Differential medial temporal lobe and default-mode network functional connectivity and morphometric changes in Alzheimer’s disease. Neuroimage Clin. 23:101860. doi: 10.1016/j.nicl.2019.101860

PubMed Abstract | CrossRef Full Text | Google Scholar

Grillo, L., Greco, D., Pettinato, R., Avola, E., Potenza, N., Castiglia, L., et al. (2014). Increased FGF3 and FGF4 gene dosage is a risk factor for craniosynostosis. Gene 534, 435–439. doi: 10.1016/j.gene.2013.09.120

PubMed Abstract | CrossRef Full Text | Google Scholar

Gulshan, V., Peng, L., Coram, M., Stumpe, M. C., Wu, D., Narayanaswamy, A., et al. (2016). Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410. doi: 10.1001/jama.2016.17216

PubMed Abstract | CrossRef Full Text | Google Scholar

Haenssle, H. A., Fink, C., Schneiderbauer, R., Toberer, F., Buhl, T., Blum, A., et al. (2018). Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists. Ann. Oncol. 29, 1836–1842. doi: 10.1093/annonc/mdy166

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, C., Tomita, H., Ohba, T., Nishizaki, K., Ogata, Y., Matsuzaki, Y., et al. (2016). Modified sympathetic nerve regulation in AKAP5-null mice. Biochem. Biophys. Res. Commun. 469, 897–902. doi: 10.1016/j.bbrc.2015.12.057

PubMed Abstract | CrossRef Full Text | Google Scholar

Heo, S. J., Kim, Y., Yun, S., Lim, S. S., Kim, J., Nam, C. M., et al. (2019). Deep learning algorithms with demographic information help to detect tuberculosis in chest radiographs in annual workers’ health examination data. Int. J. Environ. Res. Public Health 16:E250. doi: 10.3390/ijerph16020250

PubMed Abstract | CrossRef Full Text | Google Scholar

Hong, M. G., Reynolds, C. A., Feldman, A. L., Kallin, M., Lambert, J. C., Amouyel, P., et al. (2012). Genome-wide and gene-based association implicates FRMD6 in Alzheimer disease. Hum. Mutat. 33, 521–529. doi: 10.1002/humu.22009

PubMed Abstract | CrossRef Full Text | Google Scholar

Hosseini-Asl, E., Keynto, R., and El-Baz, A. (2016). “Alzheimer’s disease diagnostics by adaptation of 3D convolutional network,” in Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), (Piscataway, NJ: IEEE), 126–130.

Google Scholar

Huang, C. C., Hsiao, I. T., Huang, C. Y., Weng, Y. C., Huang, K. L., Liu, C. H., et al. (2019). Tau PET With 18F-THK-5351 Taiwan Patients With Familial Alzheimer’s Disease With the APP p.D678H Mutation. Front. Neurol. 10:503. doi: 10.3389/fneur.2019.00503

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, C. Y., Hsiao, I. T., Lin, K. J., Huang, K. L., Fung, H. C., Liu, C. H., et al. (2019). Amyloid PET pattern with dementia and amyloid angiopathy in Taiwan familial AD with D678H APP mutation. J. Neurol. Sci. 398, 107–116. doi: 10.1016/j.jns.2018.12.039

PubMed Abstract | CrossRef Full Text | Google Scholar

Jahanshad, N., Kochunov, P. V., Sprooten, E., Mandl, R. C., Nichols, T. E., Almasy, L., et al. (2013). Multi-site genetic analysis of diffusion images and voxelwise heritability analysis: a pilot project of the ENIGMA-DTI working group. Neuroimage 90, 470–471. doi: 10.1016/j.neuroimage.2013.04.061

PubMed Abstract | CrossRef Full Text | Google Scholar

Ju, R., Hu, C., Zhou, P., and Li, Q. (2017). Early diagnosis of Alzheimer’s disease based on resting-state brain networks and deep learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 16:99. doi: 10.1109/TCBB.2017.2776910

PubMed Abstract | CrossRef Full Text | Google Scholar

Jung, N. Y., Lee, J. H., Lee, Y. M., Shin, J. H., Shin, M. J., Lee, M. J., et al. (2018). Early stage memory impairment, visual hallucinations, and myoclonus combined with temporal lobe atrophy predict Alzheimer’s disease pathology in corticobasal syndrome. Neurocase 24, 145–150. doi: 10.1080/13554794.2018.1494290

PubMed Abstract | CrossRef Full Text | Google Scholar

Kakeda, S., and Korogi, Y. (2010). The efficacy of a voxel-based morphometry on the analysis of imaging in schizophrenia, temporal lobe epilepsy, and Alzheimer’s disease/mild cognitive impairment: a review. Neuroradiology 52, 711–721. doi: 10.1007/s00234-010-0717-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Kenkhuis, B., Jonkman, L. E., Bulk, M., Buijs, M., Boon, B. D. C., Bouwman, F. H., et al. (2019). 7T MRI allows detection of disturbed cortical lamination of the medial temporal lobe in patients with Alzheimer’s disease. Neuroimage Clin. 21:101665. doi: 10.1016/j.nicl.2019.101665

PubMed Abstract | CrossRef Full Text | Google Scholar

Kitchigina, V. F. (2018). Alterations of Coherent Theta and Gamma Network Oscillations as an Early Biomarker of Temporal Lobe Epilepsy and Alzheimer’s Disease. Front. Integr. Neurosci. 12:36. doi: 10.3389/fnint.2018.00036

PubMed Abstract | CrossRef Full Text | Google Scholar

Kovacs, M. D., Burchett, P. F., and Sheafor, D. H. (2018). App review: management guide for incidental findings on CT and MRI. J. Digit. Imaging. 31, 154–158. doi: 10.1007/s10278-017-0035-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuramoto, K., Negishi, M., and Katoh, H. (2009). Regulation of dendrite growth by the Cdc42 activator Zizimin1/Dock9 in hippocampal neurons. J. Neurosci. Res. 87, 1794–1805. doi: 10.1002/jnr.21997

PubMed Abstract | CrossRef Full Text | Google Scholar

Ladefoged, C. N., Marner, L., Hindsholm, A., Law, I., Højgaard, L., and Andersen, F. L. (2019). Deep learning based attenuation correction of PET/MRI in pediatric brain tumor patients: evaluation in a clinical setting. Front. Neurosci. 12:1005. doi: 10.3389/fnins.2018.01005

PubMed Abstract | CrossRef Full Text | Google Scholar

Lam, A. D., Cole, A. J., and Cash, S. S. (2019). New approaches to studying silent mesial temporal lobe seizures in alzheimer’s disease. Front. Neurol. 10:959. doi: 10.3389/fneur.2019.00959

PubMed Abstract | CrossRef Full Text | Google Scholar

Lattimore, F., and Ongv, C. S. (2018). A Primer on Causal Analysis. arXiv [Preprint].,

Google Scholar

Leandrou, S., Petroudi, S., Kyriacou, P. A., Reyes-Aldasoro, C. C., and Pattichis, C. S. (2018). Quantitative MRI brain studies in mild cognitive impairment and Alzheimer’s disease: a methodological review. IEEE Rev. Biomed. Eng. 11, 97–111. doi: 10.1109/RBME.2018.2796598

PubMed Abstract | CrossRef Full Text | Google Scholar

Lenz, D., McClean, P., Kansu, A., Bonnen, P. E., Ranucci, G., Thiel, C., et al. (2018). SCYL1 variants cause a syndrome with low γ-glutamyl-transferase cholestasis, acute liver failure, and neurodegeneration (CALFAN). Genet. Med. 20, 1255–1265. doi: 10.1038/gim.2017.260

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, B. Y., and Chen, S. D. (2015). Potential similarities in temporal lobe epilepsy and Alzheimer’s Disease: from clinic to pathology. Am. J. Alzheimers Dis. Other Demen. 30, 723–728. doi: 10.1177/1533317514537547

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, X., Chen, K., Wu, T., Weidman, D., Lure, F., and Li, J. (2018a). Use of multimodality imaging and artificial intelligence for diagnosis and prognosis of early stages of Alzheimer’s disease. Transl. Res. 194, 56–67. doi: 10.1016/j.trsl.2018.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, X., Hou, D., Lin, F., Luo, J., Xie, J., Wang, Y., et al. (2018b). The role of neurovascular unit damage in the occurrence and development of Alzheimer’s disease. Rev. Neurosci. 30, 477–484. doi: 10.1515/revneuro-2018-0056

PubMed Abstract | CrossRef Full Text | Google Scholar

Lopez-Paz, D., and Oquab, M. (2016). Revisiting classifier two-sample tests. arXiv [Preprint].

Google Scholar

Lopez-Paz, D., and Oquab, M. (2017). “Revisiting Classifier Two-Sample Tests for GAN Evaluation and Causal Discovery,” in Proceedings of the International Conference on Learning Representations (ICLR), Toulon.

Google Scholar

Lorenzi, M., Filippone, M., Frisoni, G. B., Alexander, D. C., Ourselin, S., and Alzheimer’s Disease Neuroimaging Initiative. (2017). Probabilistic disease progression modeling to characterize diagnostic uncertainty: application to staging and prediction in Alzheimer’s disease. NeuroImage 190, 56–68. doi: 10.1016/j.neuroimage.2017.08.059

PubMed Abstract | CrossRef Full Text | Google Scholar

Manley, W., Moreau, M. P., Azaro, M., Siecinski, S. K., Davis, G., Buyske, S., et al. (2018). Validation of a microRNA target site polymorphism in H3F3B that is potentially associated with a broad schizophrenia phenotype. PLoS One 13:e0194233. doi: 10.1371/journal.pone.0194233

PubMed Abstract | CrossRef Full Text | Google Scholar

Menéndez-González, M., de Celis Alonso, B., Salas-Pacheco, J., and Arias-Carrión, O. (2015). Structural neuroimaging of the medial temporal lobe in Alzheimer’s Disease clinical trials. J Alzheimers Dis. 48, 581–589. doi: 10.3233/JAD-150226

PubMed Abstract | CrossRef Full Text | Google Scholar

Mez, J., Chung, J., Jun, G., Kriegel, J., Bourlas, A. P., Sherva, R., et al. (2017). Two novel loci, COBL and SLC10A2, for Alzheimer’s disease in African Americans. Alzheimers Dement. 13, 119–129. doi: 10.1016/j.jalz.2016.09.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Miles, L. A., Hermans, S. J., Crespi, G. A. N., Gooi, J. H., Doughty, L., Nero, T. L., et al. (2019). Small molecule binding to Alzheimer risk factor CD33 promotes Aβ phagocytosis. Science 19, 110–118. doi: 10.1016/j.isci.2019.07.023

PubMed Abstract | CrossRef Full Text | Google Scholar

Pasquini, L., Rahmani, F., Maleki-Balajoo, S., La Joie, R., Zarei, M., Sorg, C., et al. (2019). Medial Temporal Lobe Disconnection and Hyperexcitability Across Alzheimer’s Disease Stages. J. Alzheimers Dis. Rep. 3, 103–112. doi: 10.3233/ADR-190121

PubMed Abstract | CrossRef Full Text | Google Scholar

Payan, A., and Montana, G. (2015). Predicting Alzheimer’s disease: a neuroimaging study with 3D convolutional neural networks. arXiv:1502.02506 [Preprint].

Google Scholar

Persson, K., Barca, M. L., Cavallin, L., Braekhus, A., Knapskog, A. B., Selbaek, G., et al. (2018). Comparison of automated volumetry of the hippocampus using NeuroQuant and visual assessment of the medial temporal lobe in Alzheimer’s disease. Acta Radiol. 8, 997–1001. doi: 10.1177/0284185117743778

PubMed Abstract | CrossRef Full Text | Google Scholar

Peters, J., Janzing, D., and Schölkopf, B. (2017). Elements of Causal Inference: Foundations and Learning Algorithms. Boston: The MIT Press.

Google Scholar

Pettigrew, C., Soldan, A., Sloane, K., Cai, Q., Wang, J., Wang, M. C., et al. (2017). Progressive medial temporal lobe atrophy during preclinical Alzheimer’s disease. Neuroimage Clin. 16, 439–446. doi: 10.1016/j.nicl.2017.08.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Ravizza, S., Huschto, T., Adamov, A., Böhm, L., Büsser, A., Flöther, F. F., et al. (2019). Predicting the early risk of chronic kidney disease in patients with diabetes using real-world data. Nat. Med. 25, 57–59. doi: 10.1038/s41591-018-0239-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Sarica, A., Cerasa, A., and Quattrone, A. (2017). Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: a systematic review. Front. Aging Neurosci. 9:329. doi: 10.3389/fnagi.2017.00329

PubMed Abstract | CrossRef Full Text | Google Scholar

Sarraf, S., and Tofighi, G. (2016). DeepAD: Alzheimer’s disease classification via deep convolutional neural networks using MRI and fMRI. BioRxiv [Preprint]. doi: 10.1101/070441,

CrossRef Full Text | Google Scholar

Schmidt, W. M., Kraus, C., Höger, H., Hochmeister, S., Oberndorfer, F., Branka, M., et al. (2007). Mutation in the Scyl1 gene encoding amino-terminal kinase-like protein causes a recessive form of spinocerebellar neurodegeneration. EMBO Rep. 8, 691–697. doi: 10.1038/sj.embor.7401001

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmidt, W. M., Rutledge, S. L., Schüle, R., Mayerhofer, B., Züchner, S., Boltshauser, E., et al. (2015). Disruptive SCYL1 mutations underlie a syndrome characterized by recurrent episodes of liver failure, peripheral neuropathy, cerebellar atrophy, and ataxia. Am. J. Hum. Genet. 97, 855–861. doi: 10.1016/j.ajhg.2015.10.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv [Preprint].

Google Scholar

Spasov, S., Passamonti, L., Duggento, A., Liò, P., and Toschi, N. (2019). A parameter-efficient deep learning approach to predict conversion from mild cognitive impairment to Alzheimer’s disease. Neuroimage 189, 276–287. doi: 10.1016/j.neuroimage.2019.01.031

PubMed Abstract | CrossRef Full Text | Google Scholar

Struyfs, H., Van Hecke, W., Veraart, J., Sijbers, J., Slaets, S., De Belder, M., et al. (2015). Diffusion kurtosis imaging: a possible MRI biomarker for ad diagnosis? J. Alzheimers Dis. 48, 937–948. doi: 10.3233/JAD-150253

PubMed Abstract | CrossRef Full Text | Google Scholar

Trimmel, K., van Graan, A. L., Caciagli, L., Haag, A., Koepp, M. J., Thompson, P. J., et al. (2018). Left temporal lobe language network connectivity in temporal lobe epilepsy. Brain 141, 2406–2418. doi: 10.1093/brain/awy164

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Giau, V., Senanarong, V., Bagyinszky, E., Limwongse, C., An, S. S. A., and Kim, S. (2018). Identification of a novel mutation in APP gene in a Thai subject with early-onset Alzheimer’s disease. Neuropsychiatr. Dis. Treat. 14, 3015–3023. doi: 10.2147/NDT.S180174

PubMed Abstract | CrossRef Full Text | Google Scholar

Wada, A., Tsuruta, K., Irie, R., Kamagata, K., Maekawa, T., Fujita, S., et al. (2018). Differentiating Alzheimer’s disease from dementia with Lewy bodies using a deep learning technique based on structural brain connectivity. Magn. Reson. Med. Sci. doi: 10.2463/mrms.mp.2018-0091 [Epub ahead of print].

CrossRef Full Text | PubMed Abstract | Google Scholar

Waldrop, M. M. (2019). News feature: what are the limits of deep learning? Proc. Natl. Acad. Sci. U.S.A. 116, 1074–1077. doi: 10.1073/pnas.1821594116

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y. J., Wan, Y., Wang, H. F., Tan, C. C., Li, J. Q., Yu, J. T., et al. (2019). Effects of CD33 variants on neuroimaging biomarkers in non-demented elders. J. Alzheimers Dis. 68, 757–766. doi: 10.3233/JAD-181062

PubMed Abstract | CrossRef Full Text | Google Scholar

Wolk, D. A., Das, S. R., Mueller, S. G., Weiner, M. W., and Yushkevich, P. A., Alzheimer’s Disease Neuroimaging Initiative (2017). Medial temporal lobe subregional morphometry using high resolution MRI in Alzheimer’s disease. Neurobiol. Aging 49, 204–213. doi: 10.1016/j.neurobiolaging.2016.09.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, L., Wisse, L. E., Pluta, J., de Flores, R., Piskin, V., Manjón, J. V., et al. (2019). Alzheimer’s Disease neuroimaging initiative. Automated segmentation of medial temporal lobe subregions on in vivo T1-weighted MRI in early stages of Alzheimer’s disease. Human brain mapping. Hum. Brain Mapp. 40, 3431–3451. doi: 10.1002/hbm.24607

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiong, M. M. (2018). Big Data in Omics and Imaging: (2) Integrated Analysis and Causal Inference. New York: CRC Press.

Google Scholar

Zeiler, M. D., and Fergus, R. (2014). “Visualizing and understanding convolutional networks,” in Proceedings of the European Conference on Computer Vision–ECCV, (Berlin: Springer), 818–833. doi: 10.1007/978-3-319-10590-1_53

CrossRef Full Text | Google Scholar

Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., and Torralba, A. (2016). “Learning deep features for discriminative localization,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (Berlin: Springer), 2921–2929.

Google Scholar

Zhou, Z. D., Chan, C. H., Ma, Q. H., Xu, X. H., Xiao, Z. C., and Tan, E. K. (2011). The roles of amyloid precursor protein (APP) in neurogenesis: implications to pathogenesis and therapy of Alzheimer disease. Cell Adh. Migr. 5, 280–292. doi: 10.4161/cam.5.4.16986

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhuang, Q. S., Zheng, H., Gu, X. D., Shen, L., and Ji, H. F. (2017). Detecting the genetic link between Alzheimer’s disease and obesity using bioinformatics analysis of GWAS data. Oncotarget 8, 55915–55919. doi: 10.18632/oncotarget.19115

PubMed Abstract | CrossRef Full Text | Google Scholar

Zintgraf, L. M., Cohen, T. S., Adel, T., and Welling, M. (2017). Visualizing deep neural network decisions: prediction difference analysis. arXiv [Preprint].

Google Scholar

Keywords: Alzheimer’s disease, diffusion tensor imaging images, deep learning, causal inference, feature selection, genetic-imaging data analysis

Citation: Liu Y, Li Z, Ge Q, Lin N and Xiong M (2019) Deep Feature Selection and Causal Analysis of Alzheimer’s Disease. Front. Neurosci. 13:1198. doi: 10.3389/fnins.2019.01198

Received: 18 March 2019; Accepted: 22 October 2019;
Published: 15 November 2019.

Edited by:

Lin Shi, The Chinese University of Hong Kong, China

Reviewed by:

Jingyun Chen, New York University, United States
Liang Zhan, University of Pittsburgh, United States

Copyright © 2019 Liu, Li, Ge, Lin and Xiong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Momiao Xiong, TW9taWFvLlhpb25nQHV0aC50bWMuZWR1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.