ORIGINAL RESEARCH article

Front. Neuroimaging, 14 March 2024
Sec. Brain Imaging Methods
Volume 3 - 2024 | https://doi.org/10.3389/fnimg.2024.1349415

Spherical convolutional neural networks can improve brain microstructure estimation from diffusion MRI data

  • 1UCL Great Ormond Street Institute of Child Health, University College London, London, United Kingdom
  • 2Department of Neurosurgery, Great Ormond Street Hospital, London, United Kingdom
  • 3Medical Radiation Physics, Clinical Sciences Lund, Lund University, Lund, Sweden

Diffusion magnetic resonance imaging is sensitive to the microstructural properties of brain tissue. However, estimating clinically and scientifically relevant microstructural properties from the measured signals remains a highly challenging inverse problem that machine learning may help solve. This study investigated if recently developed rotationally invariant spherical convolutional neural networks can improve microstructural parameter estimation. We trained a spherical convolutional neural network to predict the ground-truth parameter values from efficiently simulated noisy data and applied the trained network to imaging data acquired in a clinical setting to generate microstructural parameter maps. Our network performed better than the spherical mean technique and the multi-layer perceptron: it achieved higher prediction accuracy than the spherical mean technique and lower rotational variance than the multi-layer perceptron. Although we focused on a constrained two-compartment model of neuronal tissue, the network and training pipeline are generalizable and can be used to estimate the parameters of any Gaussian compartment model. To highlight this, we also trained the network to predict the parameters of a three-compartment model that enables the estimation of apparent neural soma density using tensor-valued diffusion encoding.

1 Introduction

Neuroimaging enables non-invasively measuring functional and structural properties of the brain, and it is essential in modern neuroscience. Diffusion magnetic resonance imaging (dMRI), the most commonly used imaging modality for quantifying microstructural properties of the brain, measures displacements of water molecules at the microscopic level and is thus sensitive to tissue microstructure. dMRI has been used to localize microstructural alterations associated with, for example, learning (Sagi et al., 2012), healthy development (Lebel et al., 2019), aging (Sullivan and Pfefferbaum, 2006), neurodevelopmental disorders (Gibbard et al., 2018), and neurodegenerative diseases (Zhang et al., 2009). However, accurately inferring clinically and scientifically relevant properties of tissue microstructure (e.g., cell morphology or distribution of cell types) from the measured signals remains a highly challenging inverse problem (Kiselev, 2017).

Most dMRI data analysis methods are based on signal models that express the measured signal as a function of parameters of interest and can be fit to data by numerically minimizing an objective function (Novikov et al., 2019). An essential requirement for microstructural neuroimaging methods is low rotational variance (i.e., estimated parameters should not depend on how the subject's head is oriented in the scanner). Furthermore, it is often desirable for the parameter estimates to be independent of the orientation distribution of the microscopic structures (e.g., an estimate of axon density should not depend on whether the axons are aligned or crossing). These two requirements are often achieved by acquiring high-angular resolution diffusion imaging (HARDI) data and averaging over the diffusion encoding directions, which is referred to as “powder-averaging”, a term borrowed from the field of solid-state nuclear magnetic resonance (NMR). The number of acquisition directions required for a nearly rotationally invariant powder-averaged signal depends on the properties of tissue microstructure and diffusion encoding (Szczepankiewicz et al., 2019a). Fitting models to powder-averaged signals is often referred to as the “spherical mean technique” (SMT), a term introduced by Kaden et al. (2016b). While powder-averaging enables the estimation of various microstructural parameters (Jespersen et al., 2013; Lasič et al., 2014; Kaden et al., 2016a,b; Szczepankiewicz et al., 2016; Henriques et al., 2020; Palombo et al., 2020; Gyori et al., 2021), a significant amount of information is lost during averaging. Therefore, it may be beneficial to estimate the parameters directly from full data without powder-averaging.

In recent years, microstructural parameter estimation using machine learning (ML) has received significant attention as a potential solution to issues with conventional fitting, such as slow convergence, poor noise robustness, and terminating at local minima (Golkov et al., 2016; Barbieri et al., 2020; Palombo et al., 2020; de Almeida Martins et al., 2021; Elaldi et al., 2021; Gyori et al., 2021, 2022; Karimi et al., 2021; Sedlar et al., 2021a,b; Kerkelä et al., 2022). ML models can be trained to predict microstructural parameter values from data using supervised or self-supervised learning. In the context of dMRI, a particularly promising development has been the invention of spherical convolutional neural networks (sCNNs) (Cohen et al., 2018; Esteves et al., 2018). sCNNs are SO(3)-equivariant (i.e., rotating the input changes the output according to the same rotation) artificial neural networks that perform spherical convolutions with learnable filters. They theoretically enable rotationally invariant classification and regression, making them potentially well-suited for predicting microstructural parameters from dMRI data.

This study aimed to investigate if sCNNs can improve microstructural parameter estimation. We focused on estimating the parameters of a constrained two-compartment model by Kaden et al. (2016a) regularly used in neuroscience to study human white matter in vivo (Collins et al., 2019; Toescu et al., 2021; Voldsbekk et al., 2021; Battocchio et al., 2022; Rahmanzadeh et al., 2022). An sCNN implemented according to Esteves et al. (2018) was trained to predict the neurite orientation distribution function (ODF) and scalar parameters (neurite diffusivity and density) from dMRI data. Training and testing were done using simulated data. The sCNN was compared to conventional fitting and a multi-layer perceptron (MLP) in terms of accuracy and orientational variance. The trained model was then applied to MRI data acquired in a clinical setting to generate microstructural maps. Furthermore, to highlight the fact that the sCNN and training pipeline are applicable to any Gaussian compartment model, the network was trained to estimate the parameters of a constrained three-compartment model by Gyori et al. (2021) that enables the estimation of apparent neural soma density using tensor-valued diffusion encoding (Topgaard, 2017).

2 Materials and methods

2.1 Spherical harmonics

Any square-integrable function on the sphere f:S2 → ℂ can be expanded in the spherical harmonic basis:

f(x) = \sum_{l=0}^{b} \sum_{m=-l}^{l} \hat{f}_l^m Y_l^m(x),    (1)

where x is a point on the unit sphere, b is the bandwidth of f, l is the degree, m is the order, f̂_l^m is an expansion coefficient, and Y_l^m is a spherical harmonic defined as

Y_l^m(\theta, \phi) = \sqrt{\frac{2l+1}{4\pi} \frac{(l-m)!}{(l+m)!}}\, P_l^m(\cos\theta)\, e^{im\phi},    (2)

where θ ∈ [0, π] is the polar coordinate, ϕ ∈ [0, 2π) is the azimuthal coordinate, and P_l^m is the associated Legendre function.

The expansion coefficients are given by the spherical Fourier transform (SFT):

\hat{f}_l^m = \int_{S^2} dx\, f(x)\, \overline{Y_l^m}(x).    (3)

The SFT of a band-limited function can be computed exactly as a finite sum using a sampling theorem (Driscoll and Healy, 1994). Equation 1 is the inverse spherical Fourier transform (ISFT).

Since reconstructed dMRI signals are real-valued and antipodally symmetric, we use the following basis:

S_l^m = \begin{cases} 0 & \text{if } l \text{ is odd} \\ \sqrt{2}\, \mathrm{Im}(Y_l^m) & \text{if } m < 0 \\ Y_l^0 & \text{if } m = 0 \\ \sqrt{2}\, \mathrm{Re}(Y_l^m) & \text{if } m > 0 \end{cases}    (4)

Considering that diffusion encoding directions do not usually follow a sampling theorem like the one by Driscoll and Healy (1994) that enables the SFT to be computed exactly as a finite sum, we use least squares to compute the expansion coefficients: indexing j = l(l + 1)/2 + m assigns a unique index j to every pair (l, m). Given f sampled at points x_1, x_2, ..., x_{n_points} and stored in a column vector X, the values of the spherical harmonics sampled at the same points are organized in an n_points × n_coefficients matrix B with B_ij = S_l^m(x_i). (B^T B)^{-1} B^T X then gives the vector of expansion coefficients minimizing the Frobenius norm (Brechbühler et al., 1995).
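As an illustration, the least-squares fit described above can be written in a few lines of NumPy/SciPy. This is a minimal sketch, not the code used in this study; the function names are illustrative, SciPy's sph_harm argument convention is assumed, and the sign convention of the real basis may differ from Equation 4 by the Condon-Shortley phase.

```python
import numpy as np
from scipy.special import sph_harm  # sph_harm(m, l, azimuthal, polar)

def real_sh_basis(l_max, polar, azimuth):
    """Evaluate a real, antipodally symmetric basis (cf. Equation 4) at the
    given polar and azimuthal angles. Returns an (n_points, n_coefficients)
    matrix B with B_ij = S_l^m(x_i), using only even degrees up to l_max."""
    columns = []
    for l in range(0, l_max + 1, 2):  # odd degrees vanish for symmetric signals
        for m in range(-l, l + 1):
            y = sph_harm(abs(m), l, azimuth, polar)
            if m < 0:
                columns.append(np.sqrt(2) * y.imag)
            elif m == 0:
                columns.append(y.real)
            else:
                columns.append(np.sqrt(2) * y.real)
    return np.stack(columns, axis=-1)

def fit_sh_coefficients(signals, polar, azimuth, l_max=8):
    """Least-squares expansion coefficients, i.e., (B^T B)^(-1) B^T X."""
    B = real_sh_basis(l_max, polar, azimuth)
    coefficients, *_ = np.linalg.lstsq(B, signals, rcond=None)
    return coefficients
```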

2.2 Spherical convolution

Convolution of a spherical signal f by a spherical filter h is defined as

(f * h)(x) = \int_{SO(3)} dR\, f(R\hat{e}_3)\, h(R^{-1}x),    (5)

where ê_3 is a unit vector aligned with the z-axis. If f and h are band-limited, the above equation can be evaluated efficiently as a pointwise product in the frequency domain (Driscoll and Healy, 1994). The spherical harmonic coefficients of the convolved signal y are

\hat{y}_l^m = 2\pi \sqrt{\frac{4\pi}{2l+1}}\, \hat{f}_l^m \hat{h}_l^0.    (6)

Spherical convolution is equivariant to rotations (i.e., R(f * h) = (Rf) * h for all R ∈ SO(3)), and the filter is marginalized around the z-axis (i.e., for every h, there exists a filter h_z that is symmetric with respect to the z-axis such that f * h = f * h_z).
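A minimal sketch of Equation 6, assuming the even-degree, degree-major coefficient ordering produced by the basis in Equation 4 and a zonal filter given by its ĥ_l^0 coefficients indexed by l/2; it is illustrative rather than the implementation used here:

```python
import numpy as np

def spherical_convolution(f_coeffs, h_zonal, l_max):
    """Spherical convolution as a pointwise product in the frequency domain
    (Equation 6): every coefficient of degree l is scaled by
    2 * pi * sqrt(4 * pi / (2l + 1)) * h_l0."""
    y_coeffs = np.zeros_like(f_coeffs)
    i = 0
    for l in range(0, l_max + 1, 2):  # even degrees only
        scale = 2 * np.pi * np.sqrt(4 * np.pi / (2 * l + 1)) * h_zonal[l // 2]
        y_coeffs[i:i + 2 * l + 1] = scale * f_coeffs[i:i + 2 * l + 1]
        i += 2 * l + 1
    return y_coeffs
```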

2.3 Compartment models

Compartment models represent the dMRI signal as a sum of signals coming from different microstructural environments (e.g., intra- and extra-axonal water). For details, see, for example, the review by Jelescu and Budde (2017). Here, we focus on models with non-exchanging Gaussian compartments following an ODF. The signal measured along n̂ is expressed as a spherical convolution of the ODF by a microstructural kernel response function K:

S(\hat{n}) = \int_{SO(3)} dR\, \mathrm{ODF}(R\hat{e}_3)\, K(R^{-1}\hat{n}),    (7)

where K is the microstructural kernel response function:

K(\hat{n}) = S_0 \left[ \sum_{i=1}^{N} f_i \exp(-\mathbf{b} : \mathbf{D}_i) \right],    (8)

where S_0 is the signal without diffusion-weighting, N is the number of compartments, f_i is a signal fraction, b is the b-tensor corresponding to n̂ and a b-value equal to Tr(b), : denotes the generalized scalar product (b : D = \sum_{i=1}^{3} \sum_{j=1}^{3} b_{ij} D_{ij}) (Westin et al., 2016), and D_i is an axially symmetric diffusion tensor aligned with the z-axis representing Gaussian diffusion in the compartment. The training pipeline presented in this paper is applicable to any compartment model that can be expressed using Equations 7 and 8. Given a different data generation method, the sCNN can be trained to predict the parameters of non-Gaussian models as well.
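For concreteness, the kernel of Equation 8 can be evaluated directly from a b-tensor and a list of compartment tensors. This is a minimal sketch assuming each compartment is described by an axially symmetric tensor aligned with the z-axis; the function names are illustrative and not part of any library:

```python
import numpy as np

def axially_symmetric_tensor(axial, radial):
    """Diffusion tensor aligned with the z-axis (units of, e.g., μm²/ms)."""
    return np.diag([radial, radial, axial])

def kernel_signal(b_tensor, fractions, tensors, s0=1.0):
    """Kernel response K = S0 * sum_i f_i * exp(-b : D_i) (Equation 8), where
    the generalized scalar product b : D is the sum of the element-wise
    products b_ij * D_ij."""
    return s0 * sum(
        f * np.exp(-np.sum(b_tensor * D)) for f, D in zip(fractions, tensors)
    )
```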

2.3.1 Two-compartment model

The so-called “standard model” of diffusion in white matter consists of a one-dimensional compartment representing diffusion inside neurites and a coaxial axially symmetric extra-cellular compartment (Novikov et al., 2019). We focus on a constrained version of the model by Kaden et al. (2016a) that enables model parameters to be estimated from powder-averaged data using the SMT. The model contains two parameters: intra-neurite diffusivity d and intra-neurite signal fraction f. Axial and radial diffusivities of the extra-cellular compartment are d and (1−f)d, respectively. Inserting this into Equation 8 gives

K(\hat{n}) = S_0 \left[ f \exp\left( -\mathbf{b} : \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & d \end{bmatrix} \right) + (1 - f) \exp\left( -\mathbf{b} : \begin{bmatrix} (1 - f)d & 0 & 0 \\ 0 & (1 - f)d & 0 \\ 0 & 0 & d \end{bmatrix} \right) \right].    (9)
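As a worked example, Equation 9 with S_0 = 1 is simply a stick and a zeppelin sharing the axial diffusivity d; the parameter values and measurement direction below are illustrative only:

```python
import numpy as np

d, f = 2.0, 0.6                                # example values: μm²/ms, unitless
b = 1.0 * np.outer([0, 0, 1], [0, 0, 1])       # linear b-tensor along z, b = 1 ms/μm²
D_stick = np.diag([0.0, 0.0, d])               # intra-neurite compartment
D_zeppelin = np.diag([(1 - f) * d, (1 - f) * d, d])  # extra-cellular compartment
K = f * np.exp(-np.sum(b * D_stick)) + (1 - f) * np.exp(-np.sum(b * D_zeppelin))
```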

2.3.2 Spherical mean technique

Kaden et al. (2016b) observed that for a fixed b-value, the spherical mean of the dMRI signal over the gradient directions does not depend on the ODF. By exploiting this invariance, the constrained two-compartment model can be fit to powder-averaged data, denoted here by S_PA, using the following signal equation (Kaden et al., 2016a):

S_\mathrm{PA} = S_0 \left[ f\, \frac{\sqrt{\pi}\, \mathrm{erf}(\sqrt{bd})}{2\sqrt{bd}} + (1 - f)\, e^{-b(1-f)d}\, \frac{\sqrt{\pi}\, \mathrm{erf}(\sqrt{bfd})}{2\sqrt{bfd}} \right].    (10)
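A minimal sketch of Equation 10 using SciPy's error function; the function name is illustrative, and the expression assumes b, d, f > 0 (the b = 0 and f = 0 limits need to be handled separately):

```python
import numpy as np
from scipy.special import erf

def smt_two_compartment_signal(b, d, f, s0=1.0):
    """Powder-averaged signal of the constrained two-compartment model (Equation 10)."""
    stick_term = np.sqrt(np.pi) * erf(np.sqrt(b * d)) / (2 * np.sqrt(b * d))
    zeppelin_term = (
        np.exp(-b * (1 - f) * d)
        * np.sqrt(np.pi) * erf(np.sqrt(b * f * d)) / (2 * np.sqrt(b * f * d))
    )
    return s0 * (f * stick_term + (1 - f) * zeppelin_term)
```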

2.3.3 Three-compartment model

Palombo et al. (2020) added a spherical compartment representing neural soma to the standard model to make it more suitable for gray matter. We use a constrained three-compartment model by Gyori et al. (2021) that uses tensor-valued diffusion encoding to make apparent neural soma imaging more feasible without high-performance gradient hardware. The model contains four parameters: intra-neurite diffusivity d_i, intra-neurite signal fraction f_i, spherical compartment diffusivity d_sph, and spherical compartment signal fraction f_sph. Axial and radial diffusivities of the extra-cellular compartment are d_i (1 − f_i − f_sph)^{(f_sph/2)/(f_sph + f_i)} and d_i (1 − f_i − f_sph)^{(f_sph/2 + f_i)/(f_sph + f_i)}, respectively. We omit explicitly writing out the kernel signal equation to save space, but it is trivial to construct from Equation 8.
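As an illustration of these constraints, the extra-cellular diffusivities can be computed directly from the model parameters. This is a minimal sketch based on the expressions as written above (the function name is illustrative); note that for f_sph = 0 it reduces to the two-compartment constraints, i.e., axial d_i and radial (1 − f_i)d_i:

```python
def extra_cellular_diffusivities(d_i, f_i, f_sph):
    """Axial and radial extra-cellular diffusivities of the constrained
    three-compartment model, following the expressions in the text.
    Assumes f_i + f_sph > 0 and f_i + f_sph <= 1."""
    f_e = 1.0 - f_i - f_sph                      # extra-cellular signal fraction
    axial = d_i * f_e ** (0.5 * f_sph / (f_sph + f_i))
    radial = d_i * f_e ** ((0.5 * f_sph + f_i) / (f_sph + f_i))
    return axial, radial
```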

2.4 Simulations

Simulated training data was generated by evaluating Equation 7 in the frequency domain according to Equation 6. The response function values were evaluated along 3072 directions uniformly distributed over the surface of the sphere according to the hierarchical equal area isolatitude pixelisation (HEALPix) (Gorski et al., 2005; Zonca et al., 2019) and expanded in the spherical harmonics basis. Rician noise was added to the simulated signals:

S_\mathrm{noisy} = \sqrt{(S + X)^2 + Y^2},    (11)

where S is the simulated signal without noise and X and Y are sampled from a normal distribution with zero mean and standard deviation of 1/SNR, where SNR is the signal-to-noise ratio. SNR was matched to the mean SNR in the imaging experiments.
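A minimal sketch of the noise model in Equation 11; the function name and the use of NumPy's random generator are illustrative:

```python
import numpy as np

def add_rician_noise(signal, snr, rng=None):
    """Add Rician noise (Equation 11): two independent Gaussian components with
    zero mean and standard deviation 1/SNR perturb the noise-free signal."""
    rng = np.random.default_rng() if rng is None else rng
    sigma = 1.0 / snr
    x = rng.normal(0.0, sigma, size=np.shape(signal))
    y = rng.normal(0.0, sigma, size=np.shape(signal))
    return np.sqrt((signal + x) ** 2 + y ** 2)
```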

2.5 Network architecture

Our sCNN, visualized in Figure 1, consists of six spherical convolution layers implemented according to Esteves et al. (2018) without enforcing localized filters. The network takes the expansion coefficients in the frequency domain as input and outputs the estimated ODF and scalar model parameters. The number of input channels is equal to the number of shells in the data. Each spherical convolution layer is followed by a leaky rectified linear unit (ReLU; slope 0.1 for negative values) applied in the spatial domain. The conversion between frequency and spatial domains is done using the 3072 HEALPix directions. Spherical harmonics up to degree 16 are used in the network because the non-linearity can increase signal bandwidth. Spectral pooling discards coefficients of the highest degrees. After the initial three convolutions, global mean pooling is applied in the spatial domain, and the resulting arrays are concatenated and passed to the fully connected network (FCN) that outputs the predicted scalar parameter. The FCN consists of three hidden layers with 128 units each. The first two layers of the FCN are followed by batch normalization (Ioffe and Szegedy, 2015) and a ReLU. The sCNN for estimating the two-compartment model parameters has 78,258 trainable parameters.
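The two operations that are easiest to miss in the description above are spectral pooling and the global mean pooling that feeds the FCN. The sketch below illustrates both in PyTorch; it is not the implementation used in this study, and the layer sizes and the handling of the final hidden layer are illustrative assumptions.

```python
import torch
import torch.nn as nn

def spectral_pool(coefficients, l_max_out):
    """Spectral pooling: keep only coefficients of (even) degree <= l_max_out.
    `coefficients` has shape (batch, channels, n_coefficients) with the
    degree-major ordering of Equation 4."""
    n_keep = sum(2 * l + 1 for l in range(0, l_max_out + 1, 2))
    return coefficients[..., :n_keep]

class ScalarHead(nn.Module):
    """Global mean pooling over the HEALPix directions followed by a small
    fully connected network predicting the scalar model parameters."""

    def __init__(self, in_channels, n_outputs, hidden=128):
        super().__init__()
        self.fcn = nn.Sequential(
            nn.Linear(in_channels, hidden), nn.BatchNorm1d(hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.BatchNorm1d(hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_outputs),
        )

    def forward(self, spatial_features):
        # spatial_features: (batch, channels, n_directions) evaluated on HEALPix
        pooled = spatial_features.mean(dim=-1)  # nearly rotation-invariant features
        return self.fcn(pooled)
```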

Figure 1. Network for two-compartment model parameter prediction. The input is normalized two-shell data expanded using spherical harmonics up to degree eight. The signals undergo spherical convolutions, non-linearities, and spectral pooling to produce the predicted orientation distribution function. After the initial three convolutions, global mean pooling is applied in the signal domain, and the resulting arrays are concatenated to create a nearly rotationally invariant feature vector passed on to the FCN that outputs the predicted scalar parameter.

2.6 Training

Training was done over 10⁵ batches of simulated data generated during training. Each batch contained signals from 500 microstructural configurations produced by random sampling (d ~ U(0, 3 μm²/ms) and f ~ U(0, 1)). ODFs were sampled from five volunteer scans. Validation and test datasets were constructed similarly, except that they contained 10⁴ and 10⁶ microstructural configurations, respectively, and the ODFs were sampled from different volunteer scans. Training was performed twice: with and without randomly rotating the ODFs. The ODFs in the validation and test datasets were randomly rotated. ADAM (Kingma and Ba, 2014) was used as the optimizer with an initial learning rate of 10⁻³, which was reduced by 90% after 50% and 75% of the training. Mean squared error (MSE) was the loss function. ODF MSE was calculated in the spatial domain.
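A minimal sketch of the parameter sampling and learning-rate schedule described above, assuming a PyTorch optimizer; `model` is a placeholder standing in for the sCNN:

```python
import numpy as np
import torch

rng = np.random.default_rng()

def sample_two_compartment_parameters(n):
    """One batch of ground-truth parameters: d ~ U(0, 3 μm²/ms), f ~ U(0, 1)."""
    return rng.uniform(0.0, 3.0, size=n), rng.uniform(0.0, 1.0, size=n)

n_batches = 100_000                                  # 10^5 training batches
model = torch.nn.Linear(10, 1)                       # placeholder for the sCNN
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
scheduler = torch.optim.lr_scheduler.MultiStepLR(    # reduce by 90% at 50% and 75%
    optimizer, milestones=[n_batches // 2, 3 * n_batches // 4], gamma=0.1)
```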

2.7 Baseline methods

The sCNN was compared to the SMT and an MLP that takes the normalized dMRI signals as inputs and outputs the spherical harmonic coefficients of the ODF and the model parameters. The SMT parameter estimation and the subsequent ODF estimation, using the estimated microstructural kernel and constrained spherical deconvolution (CSD), were done using Dmipy (Fick et al., 2019). The MLP consisted of three hidden layers with 512 nodes each. The hidden layers were followed by batch normalization and a ReLU. The MLP had 614,447 trainable parameters. It was trained like the sCNN, except that ten times more batches were used to account for the higher number of parameters and ensure convergence.

2.8 Imaging data

The brains of eight healthy adult volunteers were scanned on a Siemens Magnetom Prisma 3T (Siemens Healthcare, Erlangen, Germany) at Great Ormond Street Hospital, London, United Kingdom. Data was denoised (Veraart et al., 2016) using MRtrix3 (Tournier et al., 2019) and distortion- and motion-corrected using FSL (Jenkinson et al., 2012; Andersson and Sotiropoulos, 2016). SNR was estimated in each voxel as the inverse of the standard deviation of the normalized signals without diffusion-weighting.
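A minimal sketch of the voxel-wise SNR estimate described above; the array layout and function name are assumptions:

```python
import numpy as np

def estimate_snr(b0_volumes):
    """Voxel-wise SNR as the inverse of the standard deviation of the normalized
    b = 0 signals. `b0_volumes` has shape (n_b0, x, y, z)."""
    normalized = b0_volumes / b0_volumes.mean(axis=0)
    return 1.0 / normalized.std(axis=0)
```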

2.8.1 High-angular resolution diffusion imaging

Seven volunteers were scanned using a standard clinical two-shell HARDI protocol with two non-zero b-values of 1 and 2.2 ms/μm² with 60 directions over half a sphere each. Other relevant scan parameters were the following: diffusion time (Δ) = 28.7 ms; diffusion encoding time (δ) = 16.7 ms; echo time (TE) = 60 ms; repetition time (TR) = 3,050 ms; field of view (FOV) = 220 × 220 mm²; voxel size = 2 × 2 × 2 mm³; slice gap = 0.2 mm; 66 slices; phase partial Fourier = 6/8; multiband acceleration factor = 2. Fourteen images were acquired without diffusion-weighting, one of which had the phase encoding direction reversed to be used to correct for susceptibility-induced distortions. The total scan time was 7 minutes. Mean SNR in the brain was 50. Neurite ODFs were estimated using multi-tissue CSD (Jeurissen et al., 2014) with l_max = 8.

2.8.2 Tensor-valued diffusion imaging

One volunteer was scanned using a prototype spin echo sequence that enables tensor-valued diffusion encoding (Szczepankiewicz et al., 2019a). Data was acquired using numerically optimized (Sjölund et al., 2015) and Maxwell-compensated (Szczepankiewicz et al., 2019b) gradient waveforms encoding linear and planar b-tensors. The acquisitions with linear b-tensors were performed with b-values of 0.5, 1, 2, 3.5, and 5 ms/μm² with 12, 12, 20, 20, and 30 directions over half a sphere, respectively. The acquisitions with planar b-tensors were performed with b-values of 0.5, 1, and 2 ms/μm² with 12, 12, and 20 directions over half a sphere, respectively. Other relevant scan parameters were the following: TE = 82 ms; TR = 4.2 s; FOV = 220 × 220 mm²; voxel size = 2 × 2 × 2 mm³; slice gap = 0.2 mm; 66 slices; phase partial Fourier = 6/8; multiband acceleration factor = 2. Fourteen images were acquired without diffusion-weighting, one of which had the phase encoding direction reversed. The total scan time was 12 minutes. Mean SNR in the brain was 29.

3 Results

3.1 Two-compartment model

3.1.1 Prediction accuracy

MSE on the test dataset is reported in Table 1. The sCNN and MLP outperformed the SMT in estimating the ODF and scalar parameters. The sCNN predicted d and f best, while the MLP predicted the ODF marginally better than the sCNN. Both the sCNN and MLP benefited slightly from randomly rotating the training data. Figure 2 shows how prediction accuracy depends on the values of d and f. The sCNN and MLP outperformed the SMT in all parts of the parameter space. Although the largest errors with the SMT occurred for values of d and f not typically observed in the brain, the ML-based approaches were also more accurate for values observed in the brain (i.e., d roughly between 1 and 2 μm²/ms).

Table 1. Mean squared error of the estimated two-compartment model parameters on the test dataset.

Figure 2. Mean squared error of the estimated two-compartment model parameters on the test dataset for different values of intra-neurite diffusivity (d) and intra-neurite signal fraction (f). The first row (A–E) shows the results for d and the second row (F–J) shows the results for f. Deep learning-based methods outperformed the spherical mean technique in all parts of the parameter space. The asterisk (*) refers to models trained with randomly rotated training data.

3.1.2 Rotational variance

The rotational variance of the different methods was assessed by generating signals from 10³ random microstructural configurations rotated over 729 rotations given by the SO(3) sampling theorem by Kostelec and Rockmore (2008). No noise was added to the signals to exclude the effects of noise. The average standard deviation of the estimated parameters over the rotated data is shown in Table 2. The sCNN and SMT were much less sensitive to rotations than the MLP. The SMT had the lowest rotational variance for d, and the sCNN had the lowest rotational variance for f. However, the SMT's non-zero rotational variance was driven by low values of d or f for which the fit is unstable. For values typically observed in white matter, the standard deviation of the SMT's estimates was three orders of magnitude smaller than the average. Data augmentation by rotating the input signals improved prediction accuracy for both the sCNN and MLP. However, the sCNN without data augmentation was still much less rotationally variant than the MLP with data augmentation.
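The summary statistic in Table 2 can be computed as sketched below, assuming an array of parameter estimates with one row per microstructural configuration and one column per rotation:

```python
import numpy as np

def average_std_over_rotations(estimates):
    """`estimates` has shape (n_configurations, n_rotations). Returns the
    standard deviation over rotations averaged over configurations."""
    return estimates.std(axis=1).mean()
```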

Table 2. Average standard deviation of the estimated two-compartment model parameters over rotations of the input signals.

3.1.3 Application on real imaging data

Figure 3 shows parameter maps generated using the three methods. The maps produced by the ML-based methods appear less noisy. Overall, the sCNN estimated d to be greater than the MLP (mean difference = 2.4 · 10⁻² μm²/ms; std of difference = 8.1 · 10⁻² μm²/ms) and the SMT (mean difference = 0.9 · 10⁻² μm²/ms; std of difference = 12.7 · 10⁻² μm²/ms). However, in the CSF, the sCNN tended to estimate d to be less than the MLP or SMT. Overall, the sCNN estimated f to be greater than the MLP (mean difference = 0.5 · 10⁻²; std of difference = 3.6 · 10⁻²) and the SMT (mean difference = 0.1 · 10⁻²; std of difference = 4.5 · 10⁻²) while exhibiting a similar yet lesser tissue-dependent pattern as d. Figure 4 shows example ODFs generated by the trained sCNN.

Figure 3. Axial slices of the intra-neurite diffusivity (A–C) and intra-neurite signal fraction (G–I) maps generated using the spherical convolutional neural network, multi-layer perceptron, and spherical mean technique. The second row (D–F) shows the differences between the intra-neurite diffusivity maps and the fourth row (J–L) shows the differences between the intra-neurite signal fraction maps.

Figure 4. Neurite orientation distribution functions overlaid on a map of intra-neurite signal fraction generated by the spherical convolutional neural network trained with randomly rotating the training data. The color represents the principal direction, and the size is scaled according to neurite density. This coronal slice shows the intersection of the corticospinal tract and the corpus callosum.

3.2 Three-compartment model

To highlight the fact that the network and training pipeline are applicable to any Gaussian compartment model, the sCNN was trained to predict the three-compartment model parameters in the same way as for the two-compartment model. Informed by the two-compartment model results, the network was trained with randomly rotated training data. The parameters were sampled as d_i ~ U(0, 3 μm²/ms), f_i ~ U(0, 1), d_sph ~ U(0, max(d_i, 0.5 μm²/ms)), and f_sph ~ U(0, 1 − f_i). The upper limit of d_sph was chosen, using the Monte Carlo simulator Disimpy (Kerkelä et al., 2020), to correspond to a sphere with a diameter of 25 μm. Figure 5 shows maps that the sCNN generated from preprocessed dMRI data.
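A minimal sketch of the sampling scheme above (diffusivities in μm²/ms); NumPy's uniform sampler broadcasts over the per-configuration upper limits, and the function name is illustrative:

```python
import numpy as np

rng = np.random.default_rng()

def sample_three_compartment_parameters(n):
    """d_i ~ U(0, 3), f_i ~ U(0, 1), d_sph ~ U(0, max(d_i, 0.5)),
    f_sph ~ U(0, 1 - f_i), mirroring the distributions described above."""
    d_i = rng.uniform(0.0, 3.0, size=n)
    f_i = rng.uniform(0.0, 1.0, size=n)
    d_sph = rng.uniform(0.0, np.maximum(d_i, 0.5))
    f_sph = rng.uniform(0.0, 1.0 - f_i)
    return d_i, f_i, d_sph, f_sph
```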

Figure 5. Axial slices of the intra-neurite diffusivity (A), spherical compartment diffusivity (B), intra-neurite signal fraction (C), and spherical compartment signal fraction (D) maps generated by the spherical convolutional neural network trained with randomly rotating the training data.

4 Discussion

The primary purpose of this study was to investigate whether sCNNs can improve microstructural parameter estimation from noisy dMRI data, focusing on a constrained two-compartment model widely used in neuroscience research to study human white matter in vivo. The sCNN demonstrated superior accuracy and similar rotational variance compared to the SMT, and exhibited similar accuracy but considerably lower rotational variance than the MLP, which had significantly more trainable parameters. Our results show that sCNNs can offer substantial benefits over simpler artificial neural network architectures for ML-based microstructural parameter estimation from dMRI data.

We focused on comparing neural network architectures with a fixed training strategy, using the SMT as a baseline. Previous research by Gyori et al. (2022) has highlighted the significant impact of training data distribution on neural network predictions, which affects the performance of our sCNN when applied to real imaging data. We are aware of this limitation, and in future work, we aim to optimize the training data distribution. Another relevant key takeaway from the work by Gyori et al. (2022) is that at low SNR, ML-based parameter estimation can suffer from high bias, which manifests as maps that appear exceedingly smooth. Moreover, it is important to note the general limitation of microstructural models that deviations from model assumptions can lead to inaccuracies (Lampinen et al., 2017; Henriques et al., 2019; Kerkelä et al., 2021).

While it is crucial to sample the space of possible ODFs as exhaustively as possible when training the sCNN, the MLP's training requirements are even more demanding since its rotational variance can only be reduced through learning. Changes in b-values or in the angular resolution of the shells will necessitate retraining our network. Technically, the same network could be used as long as the b-values remain consistent, but the spherical harmonic expansion would vary with different angular resolutions (i.e., the number of b-vectors).

To the best of our knowledge, sCNNs have been used to analyze dMRI data only a few times prior to this study. Sedlar et al. (2021a) trained an sCNN to predict 'neurite orientation dispersion and density imaging' (NODDI) (Zhang et al., 2012) parameters from subsampled data, and Goodwin-Allcock et al. (2022) showed that sCNNs can improve the robustness of diffusion tensor estimation from data with just a few directions. sCNNs have also been used to estimate ODFs (Elaldi et al., 2021; Sedlar et al., 2021b). However, this study differs from the aforementioned studies in two important ways. First, our network and simulations were developed to estimate both the ODF and the scalar parameters of any Gaussian compartment model. Second, we carefully compared the sCNN to the SMT, a commonly used and nearly rotationally invariant conventional fitting method. Although we implemented spherical convolution layers as described by Esteves et al. (2018), other architectures also exist and warrant investigation in the context of microstructural parameter estimation. For example, the sCNNs by Cohen et al. (2018) use cross-correlation and can learn non-zonal (i.e., not symmetric with respect to the z-axis) filters, Kondor et al. (2018) developed efficient quadratic nonlinearities in the spherical harmonics domain, and the graph-based sCNN by Perraudin et al. (2019) is suitable for spherical data with very high angular resolution. Besides optimizing network architecture, future studies should also focus on optimizing hyperparameters and, especially, on carefully assessing the effects of the training data distribution and optimizing it.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by UCL Research Ethics Committee (https://www.ucl.ac.uk/research-ethics/committees-and-governance/about-ucl-research-ethics-committee). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

LK: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing—original draft, Writing—review & editing. KS: Data curation, Writing—review & editing. FS: Resources, Software, Writing—review & editing. CC: Funding acquisition, Writing—review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

FS is an inventor on a patent related to the study.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Andersson, J. L., and Sotiropoulos, S. N. (2016). An integrated approach to correction for off-resonance effects and subject movement in diffusion mr imaging. Neuroimage 125, 1063–1078. doi: 10.1016/j.neuroimage.2015.10.019

Barbieri, S., Gurney-Champion, O. J., Klaassen, R., and Thoeny, H. C. (2020). Deep learning how to fit an intravoxel incoherent motion model to diffusion-weighted mri. Magn. Reson. Med 83:312–321. doi: 10.1002/mrm.27910

Battocchio, M., Schiavi, S., Descoteaux, M., and Daducci, A. (2022). Bundle-o-graphy: improving structural connectivity estimation with adaptive microstructure-informed tractography. Neuroimage 263, 119600. doi: 10.1016/j.neuroimage.2022.119600

Brechbühler, C., Gerig, G., and Kübler, O. (1995). Parametrization of closed surfaces for 3-d shape description. Comput. Vis. Image Underst. 61, 154–170. doi: 10.1006/cviu.1995.1013

Cohen, T. S., Geiger, M., Köhler, J., and Welling, M. (2018). Spherical CNNs. arXiv [Preprint]. arXiv:1801.10130. doi: 10.48550/arXiv.1801.10130

Collins, S. E., Spencer-Smith, M., Mürner-Lavanchy, I., Kelly, C. E., Pyman, P., Pascoe, L., et al. (2019). White matter microstructure correlates with mathematics but not word reading performance in 13-year-old children born very preterm and full-term. NeuroImage. 24, 101944. doi: 10.1016/j.nicl.2019.101944

de Almeida Martins, J. P., Nilsson, M., Lampinen, B., Palombo, M., While, P. T., Westin, C.-F., et al. (2021). Neural networks for parameter estimation in microstructural mri: application to a diffusion-relaxation model of white matter. Neuroimage 244, 118601. doi: 10.1016/j.neuroimage.2021.118601

Driscoll, J. R., and Healy, D. M. (1994). Computing fourier transforms and convolutions on the 2-sphere. Adv. Appl. Math. 15, 202–250. doi: 10.1006/aama.1994.1008

Elaldi, A., Dey, N., Kim, H., and Gerig, G. (2021). “Equivariant spherical deconvolution: Learning sparse orientation distribution functions from spherical data,” in International Conference on Information Processing in Medical Imaging (Cham: Springer), 267–278.

Esteves, C., Allen-Blanchette, C., Makadia, A., and Daniilidis, K. (2018). “Learning so (3) equivariant representations with spherical CNNs,” in Proceedings of the European Conference on Computer Vision (ECCV). Available online at: https://openaccess.thecvf.com/content_ECCV_2018/html/Carlos_Esteves_Learning_SO3_Equivariant_ECCV_2018_paper.html

Fick, R. H., Wassermann, D., and Deriche, R. (2019). The dmipy toolbox: diffusion mri multi-compartment modeling and microstructure recovery made easy. Front. Neuroinform. 13, 64. doi: 10.3389/fninf.2019.00064

Gibbard, C. R., Ren, J., Skuse, D. H., Clayden, J. D., and Clark, C. A. (2018). Structural connectivity of the amygdala in young adults with autism spectrum disorder. Hum. Brain Mapp. 39, 1270–1282. doi: 10.1002/hbm.23915

Golkov, V., Dosovitskiy, A., Sperl, J. I., Menzel, M. I., Czisch, M., Sämann, P., et al. (2016). Q-space deep learning: twelve-fold shorter and model-free diffusion mri scans. IEEE Trans. Med. Imaging 35, 1344–1351. doi: 10.1109/TMI.2016.2551324

Goodwin-Allcock, T., McEwen, J., Gray, R., Nachev, P., and Zhang, H. (2022). “How can spherical CNNs benefit ML-based diffusion MRI parameter estimation?” in International Workshop on Computational Diffusion MRI (Cham: Springer Nature Switzerland). doi: 10.1007/978-3-031-21206-2_9

Gorski, K. M., Hivon, E., Banday, A. J., Wandelt, B. D., Hansen, F. K., Reinecke, M., et al. (2005). Healpix: a framework for high-resolution discretization and fast analysis of data distributed on the sphere. Astrophys. J. 622, 759. doi: 10.1086/427976

Gyori, N. G., Clark, C. A., Alexander, D. C., and Kaden, E. (2021). On the potential for mapping apparent neural soma density via a clinically viable diffusion mri protocol. Neuroimage 239, 118303. doi: 10.1016/j.neuroimage.2021.118303

Gyori, N. G., Palombo, M., Clark, C. A., Zhang, H., and Alexander, D. C. (2022). Training data distribution significantly impacts the estimation of tissue microstructure with machine learning. Magn. Reson. Med 87, 932–947. doi: 10.1002/mrm.29014

Henriques, R. N., Jespersen, S. N., and Shemesh, N. (2019). Microscopic anisotropy misestimation in spherical-mean single diffusion encoding mri. Magn. Reson. Med 81, 3245–3261. doi: 10.1002/mrm.27606

Henriques, R. N., Jespersen, S. N., and Shemesh, N. (2020). Correlation tensor magnetic resonance imaging. Neuroimage 211, 116605. doi: 10.1016/j.neuroimage.2020.116605

Ioffe, S., and Szegedy, C. (2015). “Batch normalization: accelerating deep network training by reducing internal covariate shift,” in International Conference on Machine Learning (PMLR). Available online at: https://proceedings.mlr.press/v37/ioffe15.html

Jelescu, I. O., and Budde, M. D. (2017). Design and validation of diffusion mri models of white matter. Front. Phys. 5, 61. doi: 10.3389/fphy.2017.00061

Jenkinson, M., Beckmann, C. F., Behrens, T. E., Woolrich, M. W., and Smith, S. M. (2012). Fsl. Neuroimage 62, 782–790. doi: 10.1016/j.neuroimage.2011.09.015

Jespersen, S. N., Lundell, H., Sønderby, C. K., and Dyrby, T. B. (2013). Orientationally invariant metrics of apparent compartment eccentricity from double pulsed field gradient diffusion experiments. NMR Biomed. 26, 1647–1662. doi: 10.1002/nbm.2999

Jeurissen, B., Tournier, J.-D., Dhollander, T., Connelly, A., and Sijbers, J. (2014). Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion mri data. Neuroimage 103, 411–426. doi: 10.1016/j.neuroimage.2014.07.061

Kaden, E., Kelm, N. D., Carson, R. P., Does, M. D., and Alexander, D. C. (2016a). Multi-compartment microscopic diffusion imaging. Neuroimage 139, 346–359. doi: 10.1016/j.neuroimage.2016.06.002

Kaden, E., Kruggel, F., and Alexander, D. C. (2016b). Quantitative mapping of the per-axon diffusion coefficients in brain white matter. Magn. Reson. Med 75, 1752–1763. doi: 10.1002/mrm.25734

Karimi, D., Jaimes, C., Machado-Rivas, F., Vasung, L., Khan, S., Warfield, S. K., et al. (2021). Deep learning-based parameter estimation in fetal diffusion-weighted mri. Neuroimage 243, 118482. doi: 10.1016/j.neuroimage.2021.118482

Kerkelä, L., Nery, F., Callaghan, R., Zhou, F., Gyori, N. G., Szczepankiewicz, F., et al. (2021). Comparative analysis of signal models for microscopic fractional anisotropy estimation using q-space trajectory encoding. Neuroimage 242, 118445. doi: 10.1016/j.neuroimage.2021.118445

Kerkelä, L., Nery, F., Hall, M. G., and Clark, C. A. (2020). Disimpy: a massively parallel monte carlo simulator for generating diffusion-weighted mri data in python. J. Open Source Softw. 5, 2527. doi: 10.21105/joss.02527

Kerkelä, L., Seunarine, K., Henriques, R. N., Clayden, J. D., and Clark, C. A. (2022). Improved reproducibility of diffusion kurtosis imaging using regularized non-linear optimization informed by artificial neural networks. arXiv [Preprint]. arXiv:2203.07327. doi: 10.48550/arXiv.2203.07327

Kingma, D. P., and Ba, J. (2014). Adam: a method for stochastic optimization. arXiv [Preprint]. arXiv:1412.6980. doi: 10.48550/arXiv.1412.6980

Kiselev, V. G. (2017). Fundamentals of diffusion mri physics. NMR Biomed. 30, e3602. doi: 10.1002/nbm.3602

Kondor, R., Lin, Z., and Trivedi, S. (2018). Clebsch-gordan nets: a fully fourier space spherical convolutional neural network. Adv. Neural Inf. Process. Syst.

Kostelec, P. J., and Rockmore, D. N. (2008). Ffts on the rotation group. J. Fourier Analy. Appl. 14, 145–179. doi: 10.1007/s00041-008-9013-5

Lampinen, B., Szczepankiewicz, F., Mårtensson, J., van Westen, D., Sundgren, P. C., and Nilsson, M. (2017). Neurite density imaging versus imaging of microscopic anisotropy in diffusion mri: a model comparison using spherical tensor encoding. Neuroimage 147, 517–531. doi: 10.1016/j.neuroimage.2016.11.053

Lasič, S., Szczepankiewicz, F., Eriksson, S., Nilsson, M., and Topgaard, D. (2014). Microanisotropy imaging: quantification of microscopic diffusion anisotropy and orientational order parameter by diffusion mri with magic-angle spinning of the q-vector. Front. Phys. 2, 11. doi: 10.3389/fphy.2014.00011

Lebel, C., Treit, S., and Beaulieu, C. (2019). A review of diffusion mri of typical white matter development from early childhood to young adulthood. NMR Biomed. 32, e3778. doi: 10.1002/nbm.3778

Novikov, D. S., Fieremans, E., Jespersen, S. N., and Kiselev, V. G. (2019). Quantifying brain microstructure with diffusion mri: theory and parameter estimation. NMR Biomed. 32, e3998. doi: 10.1002/nbm.3998

Palombo, M., Ianus, A., Guerreri, M., Nunes, D., Alexander, D. C., Shemesh, N., et al. (2020). Sandi: a compartment-based model for non-invasive apparent soma and neurite imaging by diffusion mri. Neuroimage 215, 116835. doi: 10.1016/j.neuroimage.2020.116835

Perraudin, N., Defferrard, M., Kacprzak, T., and Sgier, R. (2019). Deepsphere: efficient spherical convolutional neural network with healpix sampling for cosmological applications. Astron. Comp. 27, 130–146. doi: 10.1016/j.ascom.2019.03.004

Rahmanzadeh, R., Galbusera, R., Lu, P.-J., Bahn, E., Weigel, M., Barakovic, M., et al. (2022). A new advanced mri biomarker for remyelinated lesions in multiple sclerosis. Ann. Neurol. 92, 486–502. doi: 10.1002/ana.26441

Sagi, Y., Tavor, I., Hofstetter, S., Tzur-Moryosef, S., Blumenfeld-Katzir, T., and Assaf, Y. (2012). Learning in the fast lane: new insights into neuroplasticity. Neuron 73, 1195–1203. doi: 10.1016/j.neuron.2012.01.025

Sedlar, S., Alimi, A., Papadopoulo, T., Deriche, R., and Deslauriers-Gauthier, S. (2021a). “A spherical convolutional neural network for white matter structure imaging via dMRI,” in International Conference on Medical Image Computing and Computer-Assisted Intervention (Cham: Springer), 529–539.

Sedlar, S., Papadopoulo, T., Deriche, R., and Deslauriers-Gauthier, S. (2021b). “Diffusion mri fiber orientation distribution function estimation using voxel-wise spherical u-net,” in Computational Diffusion MRI (Cham: Springer), 95–106.

Sjölund, J., Szczepankiewicz, F., Nilsson, M., Topgaard, D., Westin, C.-F., and Knutsson, H. (2015). Constrained optimization of gradient waveforms for generalized diffusion encoding. J. Magne. Res. 261, 157–168. doi: 10.1016/j.jmr.2015.10.012

Sullivan, E. V., and Pfefferbaum, A. (2006). Diffusion tensor imaging and aging. Neurosci. Biobehav. Rev. 30, 749–761. doi: 10.1016/j.neubiorev.2006.06.002

Szczepankiewicz, F., Sjölund, J., Ståhlberg, F., Lätt, J., and Nilsson, M. (2019a). Tensor-valued diffusion encoding for diffusional variance decomposition (divide): technical feasibility in clinical mri systems. PLoS ONE 14, e0214238. doi: 10.1371/journal.pone.0214238

Szczepankiewicz, F., van Westen, D., Englund, E., Westin, C.-F., Ståhlberg, F., Lätt, J., et al. (2016). The link between diffusion mri and tumor heterogeneity: mapping cell eccentricity and density by diffusional variance decomposition (divide). Neuroimage 142, 522–532. doi: 10.1016/j.neuroimage.2016.07.038

Szczepankiewicz, F., Westin, C.-F., and Nilsson, M. (2019b). Maxwell-compensated design of asymmetric gradient waveforms for tensor-valued diffusion encoding. Magn. Reson. Med 82, 1424–1437. doi: 10.1002/mrm.27828

Toescu, S. M., Hales, P. W., Kaden, E., Lacerda, L. M., Aquilina, K., and Clark, C. A. (2021). Tractographic and microstructural analysis of the dentato-rubro-thalamo-cortical tracts in children using diffusion MRI. Cerebral Cortex 31, 2595–2609. doi: 10.1093/cercor/bhaa377

Topgaard, D. (2017). Multidimensional diffusion MRI. J. Magne. Res. 275:98–113. doi: 10.1016/j.jmr.2016.12.007

Tournier, J.-D., Smith, R., Raffelt, D., Tabbara, R., Dhollander, T., Pietsch, M., et al. (2019). Mrtrix3: A fast, flexible and open software framework for medical image processing and visualisation. Neuroimage 202, 116137. doi: 10.1016/j.neuroimage.2019.116137

Veraart, J., Novikov, D. S., Christiaens, D., Ades-Aron, B., Sijbers, J., and Fieremans, E. (2016). Denoising of diffusion mri using random matrix theory. Neuroimage 142, 394–406. doi: 10.1016/j.neuroimage.2016.08.016

Voldsbekk, I., Groote, I., Zak, N., Roelfs, D., Geier, O., Due-Tønnessen, P., et al. (2021). Sleep and sleep deprivation differentially alter white matter microstructure: a mixed model design utilising advanced diffusion modelling. Neuroimage 226, 117540. doi: 10.1016/j.neuroimage.2020.117540

Westin, C.-F., Knutsson, H., Pasternak, O., Szczepankiewicz, F., Özarslan, E., van Westen, D., et al. (2016). Q-space trajectory imaging for multidimensional diffusion mri of the human brain. Neuroimage 135, 345–362. doi: 10.1016/j.neuroimage.2016.02.039

Zhang, H., Schneider, T., Wheeler-Kingshott, C. A., and Alexander, D. C. (2012). Noddi: practical in vivo neurite orientation dispersion and density imaging of the human brain. Neuroimage 61, 1000–1016. doi: 10.1016/j.neuroimage.2012.03.072

Zhang, Y., Schuff, N., Du, A.-T., Rosen, H. J., Kramer, J. H., Gorno-Tempini, M. L., et al. (2009). White matter damage in frontotemporal dementia and alzheimer's disease measured by diffusion mri. Brain 132, 2579–2592. doi: 10.1093/brain/awp071

Zonca, A., Singer, L. P., Lenz, D., Reinecke, M., Rosset, C., Hivon, E., et al. (2019). healpy: equal area pixelization and spherical harmonics transforms for data on the sphere in python. J. Open Source Softw. 4, 1298. doi: 10.21105/joss.01298

Keywords: diffusion magnetic resonance imaging, geometric deep learning, microstructure, spherical convolutional neural network, MRI

Citation: Kerkelä L, Seunarine K, Szczepankiewicz F and Clark CA (2024) Spherical convolutional neural networks can improve brain microstructure estimation from diffusion MRI data. Front. Neuroimaging 3:1349415. doi: 10.3389/fnimg.2024.1349415

Received: 04 December 2023; Accepted: 26 February 2024;
Published: 14 March 2024.

Edited by:

Clemence Ligneul, University of Oxford, United Kingdom

Reviewed by:

Jaeseok Park, Sungkyunkwan University, Republic of Korea
Steven Baete, New York University, United States

Copyright © 2024 Kerkelä, Seunarine, Szczepankiewicz and Clark. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Leevi Kerkelä, leevi.kerkela.17@ucl.ac.uk
