- 1 UNIACT, NeuroSpin, CEA Paris-Saclay, Frederic Joliot Institute, Gif-sur-Yvette, France
- 2 NeuroSpin, CEA Paris-Saclay, Frederic Joliot Institute, Gif-sur-Yvette, France
- 3 InDEV, NeuroDiderot, Université Paris Cité, Inserm, Paris, France
- 4 BioMaps, Service Hospitalier Frédéric Joliot, CNRS, Inserm, Université Paris-Saclay, Orsay, France
- 5 CETAPS EA 3832, Université de Rouen, Rouen, France
- 6 CRIOBE, UAR 3278, CNRS-EPHE-UPVD, Mooréa, France
The hippocampal subfields, pivotal to episodic memory, are distinct in both their cyto- and myeloarchitecture. Studying the structure of hippocampal subfields in vivo is crucial to understand volumetric trajectories across the lifespan, from the emergence of episodic memory during early childhood to memory impairments found in older adults. However, segmenting hippocampal subfields on conventional MRI sequences is challenging because of their small size. Furthermore, there is to date no unified segmentation protocol for the hippocampal subfields, which limits comparisons between studies. Therefore, we introduced a novel segmentation tool called HSF, short for hippocampal segmentation factory, which leverages an end-to-end deep learning pipeline. First, we validated HSF against currently used tools (ASHS, HIPS, and HippUnfold). Then, we used HSF on 3,750 subjects from the HCP development, young adults, and aging datasets to study the effect of age and sex on hippocampal subfield volumes. We showed HSF to be closer to manual segmentation than other currently used tools (p < 0.001) regarding the Dice coefficient, Hausdorff distance, and volumetric similarity. We then showed differential maturation and aging across subfields, with the dentate gyrus being the most affected by age. We also found faster growth and decay in men than in women for most hippocampal subfields. Thus, in addition to introducing a new, fast, and robust end-to-end segmentation tool, our neuroanatomical results concerning the lifespan trajectories of the hippocampal subfields reconcile previous conflicting results.
1. Introduction
Episodic memory, the memory of specific episodes with spatiotemporal details, is critically underpinned by the hippocampal subfields, namely the dentate gyrus (DG), cornu ammonis from 1 to 3 (CA1/2/3), and the subiculum. Each subfield presents a distinct myelo- and cyto-architectony, and plays a critical role in episodic memory functions. For example, the DG and CA3 are involved in pattern separation, which allows the storage and retrieval of similar but distinct events (Yassa and Stark, 2011). CA1 and subiculum are necessary for pattern completion, i.e., the reconstruction of a full memory from partial elements. Since episodic memory performance correlates with variations in hippocampal subfields volume (Palombo et al., 2018), we hypothesize that hippocampal subfields volumetric trajectories are associated with the evolution of episodic memory performance across the lifespan.
Analyzing the dynamics of hippocampal subfields requires delineating their boundaries, which are often defined at a microscopic scale. Unfortunately, Magnetic Resonance Imaging (MRI) cannot resolve the unique myelo- and cyto-architectures of subfields, because structures such as CA1 and the subiculum have the same contrast (Yushkevich et al., 2015a). Numerous efforts have been made to use geometrical heuristics to map histological features to MRI, thereby providing manual segmentation guidelines (Berron et al., 2017; Dalton et al., 2017). Manual segmentation with these protocols is now considered the gold standard for studying the hippocampal subfields in vivo. However, it is a complex, time-consuming, and subjective task, which makes it error-prone and limits reproducibility. MRI segmentation of hippocampal subfields faces multiple difficulties, mainly caused by a lack of resolution, tissue ambiguity (notably in the head and the tail of the hippocampus), and noise. This problem is amplified by the lack of standardized segmentation protocols. For example, some protocols merge CA1, 2, and 3, while others delineate a separate CA4 or even exclude the hippocampal head or tail. This leads to multiple divergent protocols and considerable variability, notably in the boundary between DG and CA3/4 and in the boundary between CA1 and the subiculum, with inter-protocol differences of almost 2 mm (Yushkevich et al., 2015a).
Recent efforts have been made to standardize and automate the hippocampal subfields segmentation task (Yushkevich et al., 2015a; Wisse et al., 2017). New hippocampal subfields segmentation tools have recently been developed, such as ASHS (Yushkevich et al., 2015b), HIPS (Romero et al., 2017), or even more recently HippUnfold (DeKraker et al., 2021). They provide better segmentations, closer to manual segmentation, but none of them implements state-of-the-art end-to-end deep learning, which has been proven to be more fault-tolerant and adaptable to new observations, especially on complex and non-linear tasks (O'Mahony et al., 2020). Recent studies highlighted the possible gains of end-to-end deep learning for hippocampal segmentation (Qiu et al., 2019; Zhu et al., 2019; Yang et al., 2020), promising fast inference time (less than a minute per subject against several hours for FreeSurfer), higher accuracy, and higher robustness to anatomical variations. Unfortunately, most deep learning solutions are currently provided as a proof-of-concept, with no public implementation, no pre-trained models, or training on small and specific datasets, which limits generalizability. The current literature lacks an end-to-end deep learning segmentation protocol trained on a heterogeneous database to ensure segmentation quality across (i) contrast, (ii) magnetic field strength, (iii) age range, or (iv) health condition.
Even though segmentation protocols still need to be uniformized, there is a disparity of available segmentation tools for the hippocampal subfields. The current understanding of the effect of age and sex on volumetric changes in hippocampal subfields across the lifespan is based on manual or (semi-)automatic segmentation studies. Uematsu et al. (2012) found that the total hippocampal volume is increasing until early adulthood. Another study showed a differential maturation between the posterior and anterior hippocampal portions (Gogtay et al., 2006). Regarding sex difference, Suzuki (2004) showed that the myelination process, which is thought to contribute to the increase in volume during adolescence, takes place earlier in women (i.e., before the age of 18) than in men (i.e., after the age of 20), with a potentially more pronounced developmental dynamic in men than in women. Ziegler et al. (2012) noted an increase of gray matter volume during adulthood in the hippocampus up to 41 years old, with a maximum at 62 years old for the DG and CA, followed by fast atrophy. This is in accordance with Yang et al. (2013), who identified a quadratic relationship between the overall volume of the hippocampus and age, with an inflection point at 63 years old, followed by a strong negative correlation between volume and age.
Non-human primate studies (e.g., 20) have shown that subfields such as the DG, CA2/3, and the subiculum (but not the pre- nor para-subiculum) are growing asynchronously until adulthood. However, this question has only been recently addressed in human children and adolescents with inconsistent results. According to Ellis et al. (2021), the DG exhibits a very rapid growth in infants, doubling in size, associated with an increase in CA1 and CA3 volumes during development (8–14 years old). This contrasts with a stable or a slight linear decrease in subicular volumes (Ziegler et al., 2012; Lee et al., 2014). Concerning normal aging, data suggest a volumetric decrease of all subfields which predominates in the DG (de Flores et al., 2015; Foster et al., 2019). While the literature suggests a differential maturation and aging of hippocampal subfields, there is currently a lack of accurate automated segmentation tools which hinders the use of large datasets to study trajectories across the lifespan.
Here, we offer the first end-to-end deep learning pipeline to segment the hippocampal subfields. Hippocampal segmentation factory (HSF) is an open-source tool that leverages new computer vision segmentation methods. It was trained on a heterogeneous database comprising all public datasets with manually segmented hippocampal subfields and new manually segmented observations to ensure generalization. We hypothesized (i) that HSF provides a better overlap (Dice coefficient), fewer outliers (Hausdorff distance), and a better volumetric similarity than currently available tools abiding by Berron's protocol (Berron et al., 2017); (ii) that subfields such as DG and CA1 exhibit differential lifespan dynamics which can be divided into three periods (a. growth, b. stability, and c. decay); (iii) that all subfields show a fast decay starting from 60 to 65 years old; and (iv) that there are sex differences in the volumetric trajectories, with volumetric variations being more intense in men than in women.
2. Method
This section describes (i) the technical details of HSF development in terms of computational architecture, training regime, and inference peculiarities, (ii) how it differs from other state-of-the-art tools addressing the same segmentation problem, and (iii) how we leveraged HSF to study hippocampal subfield volumetric trajectories in large datasets of healthy individuals covering the lifespan (5–100+ years old). Please note that we conducted a speed test comparing the tools included in our benchmark. While FreeSurfer, one of the most used neuroimaging tools, possesses modules for hippocampal subfields segmentation, we chose not to include it in the segmentation-quality comparison. Although FreeSurfer (Iglesias et al., 2015) is still considered a classic neuroimaging tool, it has recently incorporated deep learning-based approaches. Because it has useful automated features outside the scope of this study, it produces many outputs leading to a long computing time, which makes it inconvenient for scientific studies interested in a single substructure of the human brain: its inference time of approximately 10 h per subject is slower than manual segmentation of the hippocampal subfields. While previous studies found the segmentation quality of FreeSurfer to be good enough to study the hippocampal subfields (Schmidt et al., 2018), others have demonstrated that FreeSurfer has poorer segmentation quality than the tools included in our benchmark (de Flores et al., 2015; DeKraker et al., 2022), with segmentations that mismatch known anatomical boundaries, leading to significantly different volumetry, especially in the head and the tail of the hippocampus (Wisse et al., 2014). Thus, as we are interested in fast tools dedicated to hippocampal subfields segmentation, we used FreeSurfer only as a reference for speed comparison.
2.1. HSF: description of the hippocampal segmentation factory
HSF is designed to be a fully customizable end-to-end pipeline, handling tasks from the preprocessing of raw anatomical images, to the segmentation of the hippocampal subfields through specialized and highly efficient deep learning models comprised in a “Model Hub” on any hardware acceleration platform such as CUDA, TensorRT, or OpenVINO. HSF also supports the DeepSparse compute engine to benefit from the AVX512 (VNNI) vector instruction set. HSF is distributed under the MIT license at https://github.com/clementpoiret/HSF.
2.1.1. Datasets description
The key strength of HSF lies in its training database, which consists of 12 datasets of manually segmented hippocampi by individual expert raters (Table 1), totaling 411 subjects.
2.1.2. Internal information processing
The HSF pipeline consists of three main steps: (1) a preprocessing step handled by ROILoc (a standalone by-product of HSF available at https://github.com/clementpoiret/ROILoc) to extract the hippocampi from a given MRI (Figure 1), (2) an augmentation pipeline, and (3) a segmentation by multiple expert models in order to produce both the segmentation and an uncertainty map (Figure 2).
 
  Figure 1. Technical description of ROILoc. ROILoc aims at locating and extracting any region of interest on a given MRI.
 
  Figure 2. Complete overview of HSF. A (T1w or T2w) MRI passes through ROILoc to extract the left and right hippocampi. Each subvolume is then randomly augmented to obtain 21 different versions of the same hippocampus. Each augmented version goes through five independent deep learning models, and the final segmentation is a voxel-wise plurality vote across all segmentations. A voxel-wise aleatoric uncertainty map is computed for further post hoc analysis.
To limit the computational cost of HSF, a preprocessing step extracts the hippocampi from the MRI. To do so, ROILoc registers the MNI152 09c Sym template (Fonov et al., 2009) to the T1w or T2w input MRI. Using the CerebrA atlas (Manera et al., 2020), this registration yields approximate coordinates of the hippocampus in native space. ROILoc then crops the MRI into two volumes corresponding to the right and left hippocampi from head to tail, with an arbitrary safety margin. To finish the preprocessing, the resulting crops are Z-normalized and padded to obtain shapes that are multiples of 8 to satisfy hardware acceleration constraints.
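The last two preprocessing operations (Z-normalization and padding to a multiple of 8) can be illustrated with a minimal sketch. This is not ROILoc's code: the function names, the symmetric split of the padding, the use of the population standard deviation, and the example crop size are all assumptions made for illustration.

```python
import math
import statistics


def z_normalize(values):
    """Rescale intensities to zero mean and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        # Constant image: nothing to scale, return zeros.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


def pad_widths(shape, multiple=8):
    """Return (before, after) padding per axis so each size becomes a multiple of `multiple`."""
    widths = []
    for size in shape:
        target = math.ceil(size / multiple) * multiple
        extra = target - size
        # Assumption: padding is split as evenly as possible between both sides.
        widths.append((extra // 2, extra - extra // 2))
    return widths


crop_shape = (53, 90, 41)  # hypothetical crop size in voxels
widths = pad_widths(crop_shape)
padded_shape = tuple(s + before + after for s, (before, after) in zip(crop_shape, widths))
print(widths, padded_shape)
```

Rounding every axis up to a multiple of 8 keeps the spatial sizes divisible by two at each downsampling stage, which is one plausible reason for such a constraint in an encoder-decoder network.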
HSF provides a “Model Hub” offering multiple pre-trained models that can handle preprocessed hippocampi. Our built-in models are 3D Residual UNets of depth 4, with ResNet building blocks (Zhang et al., 2018) and transposed convolutions as the upsampling method. We have replaced the additive skip connections with a self-attention mechanism inspired by the one introduced for 2D images by Oktay et al. (2018), with BatchNorm layers replaced by SwitchNorm layers (Luo et al., 2019). Each segmentation model has its efficient counterpart that can benefit from the AVX512-VNNI instruction set due to pruning (at 70%) and int8-Quantization through NeuralMagic's SparseML.
2.1.3. Training methodology
To improve the quality of the segmentation, we employed the widely used technique called bagging. We trained five “weak-learner” models, each of which was trained on N samples drawn at random, with replacement, from the original training set, which contained 822 hippocampi. The bagging technique then amalgamated the weak learners into a strong learner, which displayed a superior accuracy of prediction compared to each weak learner on its own. Bagging outperforms the conventional random split because it introduces more variability (i.e., some subjects can be observed multiple times during a single epoch), thereby enhancing the prediction of the strong learner (Opitz and Maclin, 1999). Each model is trained with an AdamW optimizer, a one-cycle learning rate scheduler, and stochastic weight averaging for 512 epochs with a batch size of 1 to handle heterogeneous input volumes. int8-Quantized models are trained with quantization-aware training.
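The resampling step of bagging can be sketched as follows. The integer IDs stand in for the 822 training hippocampi, and drawing N = 822 samples per model is an assumption (the text does not state the value of N); the seed is fixed only to make the sketch reproducible.

```python
import random


def bootstrap_sample(items, rng):
    """Draw len(items) items with replacement (one bootstrap replicate)."""
    return [rng.choice(items) for _ in items]


rng = random.Random(0)  # fixed seed, only for reproducibility of this sketch
hippocampi = list(range(822))  # placeholder IDs for the training hippocampi
replicates = [bootstrap_sample(hippocampi, rng) for _ in range(5)]

for k, replicate in enumerate(replicates):
    unique_fraction = len(set(replicate)) / len(hippocampi)
    print(f"model {k}: {unique_fraction:.1%} of hippocampi seen at least once")
```

When N equals the training set size, each replicate contains on average about 63% of the distinct observations (1 − 1/e), so every weak learner sees a different subset, with some observations repeated.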
Given an input x, our segmentation loss L is defined with TP and TN the true positives and negatives, FP and FN the false positives and negatives, and α = 0.3, β = 0.7, such as:
While the base loss function is a focal Tversky loss, it was modulated for each observation to handle different segmentation protocols. As HSF predicts CA1, CA2, and CA3, we merged classes (e.g., CA2 and CA3) at training time to learn from observations that do not distinguish them. For segmentation protocols having a separate head or tail class, all predictions are merged to form a single 'hippocampus' class, so that predicting any subfield outside the 'head' or 'tail' class is penalized, but not inside of them.
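To make the class-merging idea concrete, here is a minimal sketch built on one common form of the focal Tversky loss, (1 − TI)^(1/γ) with TI = TP / (TP + α·FP + β·FN). Several points are assumptions rather than the authors' implementation: the assignment of α to false positives and β to false negatives, the value of γ (not given in this text), the integer label codes, and the use of hard labels instead of soft network outputs.

```python
def confusion_counts(target, prediction, label):
    """Count true positives, false positives, and false negatives for one class."""
    pairs = list(zip(target, prediction))
    tp = sum(t == label and p == label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    return tp, fp, fn


def tversky_index(tp, fp, fn, alpha=0.3, beta=0.7, eps=1e-7):
    """Tversky index; alpha weights false positives, beta false negatives (assumed)."""
    return (tp + eps) / (tp + alpha * fp + beta * fn + eps)


def focal_tversky_loss(tp, fp, fn, gamma=4 / 3):
    """One common focal Tversky form; gamma is a hypothetical value."""
    return (1.0 - tversky_index(tp, fp, fn)) ** (1.0 / gamma)


def merge_labels(labels, mapping):
    """Relabel classes, e.g. fold CA3 into CA2 for protocols that do not split them."""
    return [mapping.get(label, label) for label in labels]


# Hypothetical codes: 1 = DG, 2 = CA1, 3 = CA2 (or merged CA2/3), 4 = CA3.
target = [3, 3, 3, 2, 2]      # manual labels from a protocol with a merged CA2/3 class
prediction = [3, 4, 4, 2, 1]  # model output distinguishing CA2 and CA3

loss_raw = focal_tversky_loss(*confusion_counts(target, prediction, 3))
merged = merge_labels(prediction, {4: 3})
loss_merged = focal_tversky_loss(*confusion_counts(target, merged, 3))
print(loss_raw, loss_merged)
```

After merging, the CA3 predictions are no longer counted as errors against a ground truth that cannot distinguish CA2 from CA3, so the loss for that class drops.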
2.1.4. Inference
To further enhance the segmentation pipeline, test-time augmentation is natively implemented, augmenting each hippocampus with random horizontal flips, and with affine and elastic deformations. The final segmentation is computed as a voxel-wise plurality vote, assigning to a given voxel the most frequent class. For the sake of further post hoc analysis of the segmentation quality, a voxel-wise aleatoric uncertainty $H(Y^i \mid X)$ is also computed (Wang et al., 2019). Given the set $Y^i$ of predictions obtained for voxel $i$, in HSF:
$$H(Y^i \mid X) \approx -\sum_{m=1}^{M} \hat{p}^i_m \ln \hat{p}^i_m$$
where $\hat{p}^i_m$ is the frequency of the $m$th unique value in $Y^i$, and $M$ is the number of unique values.
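The plurality vote and the entropy of the label frequencies can be sketched for a single voxel as follows. The label strings and the ten example predictions are hypothetical; in practice the same computation would run over every voxel of the stacked segmentations.

```python
import math
from collections import Counter


def plurality_vote(labels):
    """Return the most frequent label among the predictions for one voxel.

    Ties are resolved in favor of the label encountered first.
    """
    return Counter(labels).most_common(1)[0][0]


def voxel_entropy(labels):
    """Entropy (in nats) of the empirical label frequencies for one voxel."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


# Hypothetical predictions for one voxel across augmentations and models.
predictions = ["CA1"] * 8 + ["SUB"] * 2
label = plurality_vote(predictions)
uncertainty = voxel_entropy(predictions)
print(label, round(uncertainty, 3))
```

A voxel on which all predictions agree has zero entropy, while disagreement near a boundary (here between CA1 and the subiculum) raises it, which is what makes the map useful for post hoc quality checks.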
2.2. Benchmarking HSF against ASHS, HIPS, and HippUnfold
HSF has been assessed against the most recent and widespread tools for hippocampal segmentation: ASHS (Yushkevich et al., 2015b), HIPS (Romero et al., 2017), and HippUnfold (DeKraker et al., 2021). To compare it with manual segmentations, CP, AB, SP, and MF manually segmented 25 randomly selected subjects, excluded from our training set, from 5 different datasets: HiPlay7, MemoDev (Bouyeure et al., 2021) (Table 1), as well as HCP-Development (HCP-D), HCP-Young Adults (HCP-YA), and HCP-Aging (HCP-A). This segmentation process took approximately 5 h per hippocampus. In an earlier study on MemoDev, Bouyeure et al. (2021) assessed the reliability of the manual segmentations by computing an inter-rater reliability index, specifically the Dice coefficient, between two individual tracers who followed the same segmentation protocol. Both raters had no prior knowledge of the participants' age, sex, or memory performance. The obtained inter-rater reliability indices were notably high at 0.77 and 0.79 for the right and left hippocampi, respectively. Segmentations are compared on three metrics, where $y_m$ and $y_p$ denote the manual and predicted segmentations:
• the Dice coefficient (DC), an overlap metric ranging from 0 (no overlap) to 1 (full overlap), defined as $DC(y_m, y_p) = \frac{2\,|y_m \cap y_p|}{|y_m| + |y_p|}$,
• the Hausdorff distance (HD), a metric of surface distance ranging from 0 to $+\infty$. With the directed Hausdorff distance between two point sets $X$ and $Y$ defined as $hd(X, Y) = \max_{x \in X} \min_{y \in Y} \lVert x - y \rVert$, the HD is defined as $HD(y_m, y_p) = \max\big(hd(y_m, y_p), hd(y_p, y_m)\big)$,
• and the volumetric similarity (VS), a comparison between volumes of two segmentations ranging from 0 (complete dissimilarity between volumes) to 1 (exact match between volumes). With $|S|$ the volume of a region $S$, it is defined as $VS(y_m, y_p) = 1 - \frac{\big|\,|y_m| - |y_p|\,\big|}{|y_m| + |y_p|}$.
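The three metrics can be illustrated on toy data using their usual set-based definitions. In this sketch each segmentation is a set of voxel coordinates for one label; the two example sets are hypothetical, distances are in voxel units (they would need scaling by the voxel size to obtain millimeters), and empty segmentations are not handled.

```python
import math


def dice(a, b):
    """Dice coefficient between two sets of voxel coordinates."""
    return 2 * len(a & b) / (len(a) + len(b))


def directed_hausdorff(a, b):
    """Largest distance from a point of `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Symmetric Hausdorff distance (brute force, fine for small sets)."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


def volumetric_similarity(a, b):
    """1 minus the absolute volume difference over the summed volumes."""
    return 1 - abs(len(a) - len(b)) / (len(a) + len(b))


# Hypothetical manual and predicted masks along one axis.
manual = {(0, 0, 0), (1, 0, 0), (2, 0, 0)}
predicted = {(1, 0, 0), (2, 0, 0), (3, 0, 0), (4, 0, 0)}

print(dice(manual, predicted))                   # 4/7
print(hausdorff(manual, predicted))              # 2.0
print(volumetric_similarity(manual, predicted))  # 6/7
```

The example shows why the metrics are complementary: the volumes are close (VS of 6/7) even though the overlap is only moderate (DC of 4/7), and the HD is driven entirely by the single farthest voxel.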
As both T1w and T2w images can be segmented by HSF, we conducted an additional analysis to evaluate any discrepancies in quality across these contrasts using the same metrics. Given the strong correlation between contrast and resolution (e.g., an isotropic millimetric MPRAGE 3D T1w and an anisotropic 2D Coro-T2w), we limited our study to only 15 subjects from our test set sourced from the HCP databases, where T1w and T2w MRIs are in the same space and at the same resolution. Owing to the presence of either heteroscedasticity or non-normal distributions of scores, we compared segmentations using non-parametric Kruskal–Wallis or pairwise Wilcoxon–Mann–Whitney tests, with p-values corrected using the Benjamini–Hochberg false discovery rate.
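For readers unfamiliar with the Benjamini–Hochberg procedure, the following minimal sketch computes adjusted p-values for a family of tests. The four input p-values are hypothetical and unrelated to the results reported here; in practice a statistics library would typically be used instead.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values, in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


raw = [0.01, 0.04, 0.03, 0.005]  # hypothetical p-values from pairwise tests
print(benjamini_hochberg(raw))
```

Each p-value is scaled by the number of tests divided by its rank, and the running minimum guarantees that adjusted values never decrease as raw p-values increase.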
2.3. HSF: analyzing the Human Connectome Project
The following sections are specifically dedicated to explaining how we used HSF (process and inference) to study hippocampal subfields trajectories across the lifespan in the HCP datasets (HCP-D, HCP-YA, and HCP-A).
2.3.1. Datasets descriptions
All databases are acquired on a 3T Siemens Prisma (Skyra for HCP-YA) scanner:
- HCP-D: HCP-D contains 1,350 healthy children, adolescents, and young adults aged from 5 to 21 years. T1w and T2w MRIs are acquired at an isotropic resolution of 0.8 mm across four sites (Somerville et al., 2018),
- HCP-YA: HCP-YA includes 1,200 subjects with ages ranging from 22 to 35 years. T1w and T2w MRIs have been acquired on a single site at an isotropic resolution of 0.7 mm,
- HCP-A: HCP-A comprises 1,200 subjects from 36 to 100+ years old. T1w and T2w MRIs are acquired at an isotropic resolution of 0.8 mm across four different sites (Bookheimer et al., 2019).
The HCP datasets were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research and by the McDonnell Center for Systems Neuroscience at Washington University.
2.3.2. MRI segmentation
After validating HSF and before segmenting the HCP datasets, we retrained HSF's models with the manually segmented observations from the previous section (see Section 2.2.), including observations from the HCP datasets. We thereby improved the reliability of segmentations by including new and HCP-specific observations to ensure there was no mismatch between our training set's distribution and HCP's distribution of observations. All segmentations are performed on T2w images, with ROILoc's location algorithm using the 'Affine' registration and a margin of 16 voxels in all directions to ensure that whole hippocampi are included in their boxes.
2.3.3. Lifespan modeling
The whole hippocampus and each subfield were modeled for each sex as a natural cubic spline (NCS) regression between age and volume, a flexible, simple, and efficient model to describe trends (Greenland, 1995; Elhakeem et al., 2022). Cubic models have been validated to study developmental trajectories of the amygdala and the whole hippocampus (Uematsu et al., 2012; Bussy et al., 2021). NCS allowed us to model the growth and decay of hippocampal subfields by fitting a set of piecewise polynomial regressions smoothly joining at points called knots, with a linearity constraint at the extremity of the curve. Significance and goodness of fit for the NCS are computed similarly to linear regressions because NCS are fitted using an ordinary least-squares algorithm. We chose the number of degrees of freedom by minimizing an Akaike Information Criterion. Then, inflection points in the volumetric trajectories of the ROIs were detected as suggested by Satopaa et al. (2011). Finally, we computed an anteroposterior evolution of the subfield's volume on a per-slice basis averaged across every subject.
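The AIC-based choice of the number of degrees of freedom can be sketched as follows, using the least-squares form of the criterion, n·ln(RSS/n) + 2k (up to an additive constant). The residual sums of squares, the sample size, and the convention of counting one extra parameter for the intercept are all hypothetical values chosen for illustration, not figures from this study.

```python
import math


def aic_ols(rss, n_obs, n_params):
    """AIC of a least-squares fit with Gaussian errors, up to an additive constant."""
    return n_obs * math.log(rss / n_obs) + 2 * n_params


# Hypothetical residual sums of squares of spline fits with 3 to 6 degrees of freedom.
rss_by_df = {3: 520.0, 4: 480.0, 5: 478.5, 6: 478.0}
n_obs = 1000

# Assumption: one parameter per spline basis column plus one for the intercept.
scores = {df: aic_ols(rss, n_obs, df + 1) for df, rss in rss_by_df.items()}
best_df = min(scores, key=scores.get)
print(best_df, scores)
```

The criterion rewards a lower residual error but charges two units per added parameter, so a more flexible spline is retained only if it reduces the error enough to offset that penalty.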
2.3.4. Statistical analysis
Although lifespan dynamics of the hippocampus and its subfields are thought to be non-linear (e.g., 15,18,34), we assume that within a single period, defined as the uninterrupted period between two distinct inflection points (e.g., young adults), the relationship between age and volume is linear. Therefore, for each lifespan period, we tested (i) the relationship between age and volume, (ii) the relationship between sexes, and (iii) the interaction between these two independent variables using an ordinary least-squares regression. P-values are corrected using a Benjamini–Hochberg false discovery rate.
3. Results
3.1. Benchmarking HSF against ASHS, HIPS, and HippUnfold
First, we validated HSF against three state-of-the-art hippocampal subfields segmentation tools: ASHS, HIPS, and HippUnfold (Figure 3). While manual segmentation may require up to 5 h per subject, FreeSurfer 7 may take even longer, exceeding 10 h due to its all-inclusive pipeline, encompassing whole-brain segmentation and cortical morphometry. As we were interested solely in hippocampal subfields segmentation, we compared only the specialized tools, which are much faster: HIPS, ASHS, and HippUnfold can segment a new subject in under an hour, although HIPS requires the use of the volBrain service and can take up to a day to complete due to queueing. HSF is even faster, taking only minutes to segment a new subject from the HCP. In its most accurate mode, HSF takes only 5 min on a CPU and 90 s on an NVIDIA A100 GPU (Table 2). In its fast mode, HSF can segment a new subject in only 15 s on both CPU and GPU, with the main speed bottleneck being the registration tool ANTs, which is used to localize the hippocampus (ROILoc).
 
  Figure 3. Segmentation example from a random subject. The dentate gyrus is in red, CA1/2/3 are in green, yellow, and purple, and the subiculum is in blue.
We used the Dice coefficient, Hausdorff distance, and volumetric similarity (Figure 4) with manual segmentations as benchmarking metrics. We found HSF to exhibit a significantly better DC than ASHS (p = 4e−6; Hedges' g = 1.636), HIPS (p = 7e−9; Hedges' g = 4.934), and HippUnfold (p = 7e−9; Hedges' g = 5.440), with no differences between HippUnfold and HIPS.
 
  Figure 4. Cumming estimation plots comparing HSF (T2w) against ASHS (T1w and T2w), HIPS (T1w and T2w), and HippUnfold (HU) (T1w and T2w). The first row illustrates three performance metrics—the dice coefficient (higher is better), the Hausdorff distance (lower is better), and the volumetric similarity (higher is better). The vertical bars in this row represent the mean ±std for each metric group. The dashed line in this row represents the inter-rater reliability for manual segmentation obtained in the earlier study of Bouyeure et al. (2021). As this earlier study only computed the inter-rater comparison as the dice coefficient, it is not available for the other two metrics. The second row depicts the mean effect size (Cohen's d) with a black dot to facilitate statistical comparison between the groups. The black bars in this row represent 95% CIs for variability estimations. The 95% CIs are obtained through non-parametric bootstrap resampling to generate distributions of all possible effect sizes.
Regarding HD, which is sensitive to outlier voxels in the segmentation, we found HSF performing on par with HIPS, but better than HippUnfold (p = 7e−8; Hedges' g = −1.184). Importantly, ASHS was mainly penalized by poor segmentation results in a few observations: although estimation statistics may suggest a difference between HSF and ASHS (Figure 4), our statistical tests failed to reject the null hypothesis.
With respect to the VS, all three methods had similar volumes, but HSF was the closest to manual segmentations (VS = 0.862), better than ASHS (p = 2e−4; Hedges' g = 1.210), HIPS (p = 9e−9; Hedges' g = 3.391), and HippUnfold (p = 8e−9; Hedges' g = 3.550). We found no differences between HIPS and HippUnfold.
After this evaluation, we analyzed the disparities in segmentation quality between T1w and T2w images on a subset of our test set where both contrasts were acquired at the same resolution, as outlined in Table 3. While the effect sizes were negligible, we found a slight advantage for T2w images, with HSF producing segmentations closer to the manual ones, especially on the smallest regions, CA1, 2, and 3 (DC increased by 0.045, HD decreased by 2.386, and VS increased by 0.035).
3.2. Human Connectome Project
3.2.1. Lifespan development dynamics
After the HSF's retraining including new HCP subjects to ensure segmentation quality, we established lifespan trajectories (Figure 5) consisting of Natural Cubic Splines, from which we inferred inflection points reflecting lifespan critical periods. DG was the subfield whose developmental trajectory was the most correlated with age (p = 0.005). Total hippocampal volume was negatively correlated with age for both sexes starting from 70 years old (p = 0.03), which is also reflected in the subiculum (p = 2e−8). In addition to significant differences in volumes between sexes mostly during the “stable adulthood” period, except for CA2/3 (p = 0.120), we found differences between men and women during the “development” period in the DG (p = 0.01), and CA2/3 (p = 0.01), and during the aging period for the DG (p = 0.015) and CA1 (p = 0.04). Interestingly, we found differences in trajectories between men and women (i.e., interaction between age and sex), for the development period of CA2/3 (p = 0.017), for the aging period of the DG (p = 0.04), and before 60/70 years for the subiculum (p = 0.016).
 
  Figure 5. Lifespan dynamics of hippocampal subfields. Trend lines (surrounded by standard errors) are defined as natural cubic splines with a number of degrees of freedom minimizing an Akaike Information Criterion. Vertical dashed lines indicate inflection points.
3.2.2. From head to tail: subfields' distribution
Delineating the subfields in the head and the tail of the hippocampus is a complex task, with some protocols not even delineating subfields in the tail. Owing to its particular training method, HSF learned to segment the head and the tail even when there was no ground truth subfield segmentation in these regions. Using HSF, we created an overall normalized anteroposterior distribution of subfields across all three HCP datasets (Figure 6). We found no anatomical differences between lifespan periods and sexes.
 
  Figure 6. Normalized anteroposterior composition of subfields, going from 0% of the hippocampus (head) to 100% (tail). Vertical black lines are approximate delimiters of the head, body, and tail of the hippocampus.
According to HSF, the hippocampal head starts mostly with CA1, quickly followed by the subiculum and then the DG before the hippocampal body. After the body, CA2 and CA3 start to disappear, followed by the DG. The tail comprises mostly subiculum, CA1, and a small portion of DG which disappears near the middle of the tail.
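A normalized anteroposterior composition of this kind can be sketched for a single subject as follows. The exact normalization used for Figure 6 is not specified in this text, so the mapping of slice index to a 0 to 100% position, the background label, and the three toy slices are assumptions; averaging across subjects would follow as a separate step.

```python
from collections import Counter


def slice_composition(slices, background="BG"):
    """Per-slice label fractions, with slice position mapped to 0-100% (head to tail)."""
    n = len(slices)
    result = []
    for k, labels in enumerate(slices):
        counts = Counter(label for label in labels if label != background)
        total = sum(counts.values())
        position = 100 * k / (n - 1) if n > 1 else 0.0
        fractions = {label: c / total for label, c in counts.items()} if total else {}
        result.append((position, fractions))
    return result


# Hypothetical coronal slices of one labeled hippocampus, ordered from head to tail.
slices = [
    ["CA1", "CA1", "BG"],
    ["CA1", "SUB", "DG", "DG"],
    ["SUB", "CA1"],
]
composition = slice_composition(slices)
for position, fractions in composition:
    print(f"{position:5.1f}% {fractions}")
```

Expressing position as a percentage of hippocampal length makes subjects with different numbers of slices comparable before averaging.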
4. Discussion
This study had two main goals: (1) to introduce a new machine learning-based segmentation tool for the hippocampal subfields named hippocampal segmentation factory (HSF), which leverages the latest advances in computer vision, and (2) to study lifespan volumetric trajectories of hippocampal subfields in healthy individuals using the proposed tool. We developed and validated HSF, and demonstrated that it is faster than all previous tools while offering a better segmentation quality closer to manual segmentation. Then, applying our tool to data from 3,750 individuals (HCP-development, HCP-young adults, and HCP-aging), we show that hippocampal subfields have different volumetric trajectories across the lifespan. These trajectories are non-linear, and inflection points differ between males and females in accordance with prior literature (16).
First of all, we validated HSF in comparison to ASHS, HIPS, and HippUnfold. When looking at the DC, it has to be noted that, even in the absence of histological ground truth, HSF matches the inter-rater agreement (Figure 4). Moreover, its scalability benefits out-of-the-box from the latest advances in computing due to the open neural network exchange (ONNX) ecosystem and NeuralMagic's DeepSparse inference engine. HSF shows an unprecedented segmentation speed which makes it particularly suited to the processing of big datasets such as the HCP. The bootstrap aggregation strategy, coupled with the test-time augmentation, makes HSF more robust than ASHS and HippUnfold as suggested by our results, with a lower variance with respect to the DC, HD, and VS (Figure 4). One feature of interest is the ability of HSF to segment both T1w and T2w images. We obtained higher-quality segmentations with T2w images, a result that aligns with the existing literature. It is important to note, however, that our dataset contained more T2w images than T1w images. Therefore, we cannot definitively conclude whether the observed disparities in quality are a direct result of superior T2w contrast or of a potential bias within our dataset. In addition, because each tool was trained using data segmented with different protocols, it is difficult to compare their accuracy, especially regarding the boundary between CA1 and the subiculum (Yushkevich et al., 2015a). As HSF learned from multiple datasets, we interpret its segmentation as following a consensus between multiple segmentation guidelines, even if our results show it is very close to Berron's protocol (Berron et al., 2017). All tools segment the head and the body of the hippocampus in a similar manner, except HIPS, which, after manual verification, did not seem to respect the hippocampal subfields' boundaries visible to the naked eye.
HippUnfold underperforms compared to HSF and ASHS because it overrepresents CA2 and CA3 in the tail. The way HSF learned to segment the hippocampal tail (Figure 6) is very similar to the histology-based tail segmentations proposed by Dalton et al. (2017) and Flores et al. (2020), which both differ from Berron's protocol. However, there is no histological ground truth to support the superiority of HSF over HippUnfold regarding tail segmentation. If HSF were proved wrong on this particular point, future investigators could easily add new deep learning models to HSF's Model Hub in a plug-and-play fashion. Since the release of FreeSurfer 7, the original authors (Iglesias et al., 2015) have been working to improve their hippocampal subfield segmentation pipeline. Because this updated version is still in beta and currently limited to low-resolution T1 images, it was not included in our benchmark. We therefore strongly suggest that future studies thoroughly examine this update as soon as it exits the beta stage.
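To make the three agreement metrics used in this validation concrete (DC, HD, and VS), the following is a minimal sketch based on their standard textbook definitions. It is not the code used in this study: the function names, the representation of a segmentation as a set of voxel coordinates, and the toy masks are assumptions made for illustration only, and distances are expressed in voxel units rather than millimeters.

```python
import math

def dice(a, b):
    """Dice coefficient: overlap between two sets of voxel coordinates."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))

def volumetric_similarity(a, b):
    """1 - |Va - Vb| / (Va + Vb): compares volumes only, ignores overlap."""
    if not a and not b:
        return 1.0
    return 1 - abs(len(a) - len(b)) / (len(a) + len(b))

def hausdorff(a, b):
    """Symmetric Hausdorff distance between two non-empty voxel sets."""
    def directed(src, dst):
        # Largest distance from a voxel of src to its nearest voxel in dst.
        return max(min(math.dist(p, q) for q in dst) for p in src)
    return max(directed(a, b), directed(b, a))

# Hypothetical toy masks: a 4 x 4 square and the same square shifted by one voxel.
manual = {(x, y, 0) for x in range(4) for y in range(4)}
auto = {(x, y, 0) for x in range(1, 5) for y in range(4)}

dc = dice(manual, auto)
vs = volumetric_similarity(manual, auto)
hd = hausdorff(manual, auto)
print(dc, vs, hd)
```

In this toy case the two masks have identical volumes, so VS is perfect even though the overlap is not, which illustrates why the three metrics are reported together rather than any one of them alone.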
After validating HSF, we segmented and analyzed hippocampal ROIs obtained from the HCP-development, HCP-young adults, and HCP-aging datasets. This allowed us to study the developmental trajectories of hippocampal subfields across the lifespan over a wider age range than previous studies [e.g., (Yang et al., 2013; Bookheimer et al., 2019)]. Our AIC-based selection of natural cubic spline (NCS) models found three main patterns. The first pattern, as expected, divided the developmental trajectory of the hippocampus into three main periods: growth, stabilization, and decay (GSD). This is the overall developmental pattern of the hippocampus, showing a maximal volume at approximately 20 to 25 years old. This peak is earlier than in some previous studies [e.g., (Yang et al., 2013)], which may be due to the finer resolution of our model, allowing the observation of three distinct trends. After the stable period, we found a significant negative correlation between hippocampal volume and age from 70 years old onwards, which is approximately 8 years later than previously found (Ziegler et al., 2012; Yang et al., 2013; de Flores et al., 2015). Again, this discrepancy may be caused by modeling artifacts, survivor bias, or inclusion bias in the datasets used (inclusion of “super-healthy” individuals who age better than the general population). This GSD trajectory was observed in DG and CA1, which is consistent with previous studies showing growth during infancy and childhood (Lavenex and Banta Lavenex, 2013; Lee et al., 2014; Ellis et al., 2021), [up to a 2-fold increase in size for DG (Bachevalier, 2013)]. Moreover, the inflection points of DG and CA1 were very similar to those of the total hippocampus (Figure 5). However, we observed different trajectories for CA2/3 and the subiculum. Although the literature suggested a volumetric increase of CA2/3 (Lavenex and Banta Lavenex, 2013; Lee et al., 2014), we found this structure to be the most stable across the lifespan, with no clear trend.
This may be due to an insufficient resolution, forcing us to merge CA2 and CA3 and thus average their dynamics. Another possible factor is a segmentation made too noisy by partial volume effects, resulting in a lack of sensitivity to detect fine changes in these small and complex regions. Finally, our results for the subiculum are consistent with the literature, which reports either a mostly flat trajectory (i.e., no correlation between volume and age) or a slight quasi-linear negative correlation between age and volume (Ziegler et al., 2012; Lee et al., 2014; de Flores et al., 2015; Foster et al., 2019). Our wider age range and finer model allow us to refine those characteristics: we found a plateau, with no correlation between age and volume, until the age of 60 to 70 years, after which a fast decay occurs, similar to other subfields. Overall, this suggests that the DG, followed by CA1, is the most affected by development and aging. Most of the development of the subiculum appears to happen before the age of 5, which would relate to mnesic developments (Bouyeure and Noulhiane, 2021). While the subicular volume is positively correlated with the learning of the when, where, and what components of episodic memory (Chi et al., 2022), prior studies found correlations between episodic memory and the subiculum only up to 5 years old, which might be caused by the earlier maturation of the monosynaptic pathway (Canada, 2020). The subiculum not only appears to mature earlier but also decays earlier than the other subfields, which suggests that it might be a relevant biomarker for the early identification of age-related cognitive impairments. Furthermore, the fact that our findings are largely consistent with prior research strengthens the validity of HSF, our novel segmentation tool.
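The AIC-based selection among natural cubic spline (NCS) models discussed above can be illustrated with a minimal sketch. It is not the analysis code of this study: the evenly spaced knots, the truncated-power form of the NCS basis, the Gaussian AIC written as n ln(RSS/n) + 2p, and the synthetic growth, plateau, and decay curve (`toy_volume`) are all assumptions chosen to keep the example dependency-free and deterministic.

```python
import math

def ncs_basis(x, knots):
    """Natural cubic spline basis (truncated-power form) at a scalar x."""
    def cube_pos(v):
        return max(v, 0.0) ** 3
    def d(k):
        last = knots[-1]
        return (cube_pos(x - knots[k]) - cube_pos(x - last)) / (last - knots[k])
    n_knots = len(knots)
    return [1.0, x] + [d(k) - d(n_knots - 2) for k in range(n_knots - 2)]

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= f * M[c][j]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][j] * x[j] for j in range(r + 1, n))) / M[r][r]
    return x

def fit_aic(xs, ys, n_knots):
    """Least-squares NCS fit with n_knots evenly spaced knots; returns the AIC."""
    lo, hi = min(xs), max(xs)
    t = [(x - lo) / (hi - lo) for x in xs]            # rescale age to [0, 1]
    knots = [i / (n_knots - 1) for i in range(n_knots)]
    X = [ncs_basis(v, knots) for v in t]
    p = len(X[0])                                     # number of coefficients
    XtX = [[sum(r[i] * r[j] for r in X) for j in range(p)] for i in range(p)]
    Xty = [sum(r[i] * y for r, y in zip(X, ys)) for i in range(p)]
    beta = solve(XtX, Xty)
    rss = sum((y - sum(b * v for b, v in zip(beta, r))) ** 2
              for r, y in zip(X, ys))
    n = len(ys)
    return n * math.log(rss / n) + 2 * p

def toy_volume(age):
    """Synthetic growth / plateau / decay curve (invented, not study data)."""
    if age < 22:
        base = 3.0 + 0.05 * (age - 5)
    elif age < 70:
        base = 3.85
    else:
        base = 3.85 - 0.03 * (age - 70)
    return base + 0.02 * math.sin(7 * age)            # small deterministic wiggle

ages = list(range(5, 101))
volumes = [toy_volume(a) for a in ages]
aics = {k: fit_aic(ages, volumes, k) for k in range(2, 8)}  # 2 knots = straight line
best_knots = min(aics, key=aics.get)
print(best_knots, aics)
```

With two knots the natural spline reduces to a straight line, so comparing the AIC across knot counts directly tests whether a non-linear trajectory is worth its extra parameters. A production analysis would more likely rely on an established statistics package than on the hand-written solver used here.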
Finally, besides a sexual dimorphism in which men have bigger hippocampal subfields than women over the stable part of their life, we found differences in the developmental trajectories of hippocampal subfields between men and women. Such differences are debated in the literature, since some studies did not find interactions between volume, sex, and age (Sullivan et al., 2005; Mueller et al., 2007), while others did (). The present study suggests a complex relationship, since we did not find such an interaction for all subfields. During the growth period, we found significant differences only for the DG and CA2/3, with a faster growth in men than in women. This may be due to gonadal hormones modulating neurogenesis and increasing the survival of adult-born cells in the DG (Galea et al., 2006; Spritzer and Galea, 2007; Hamson et al., 2013). However, this literature suggests that this interaction also exists in CA1 (Leranth, 2004; Islam et al., 2020), which was not the case in our study. Interestingly, we also observed a stronger negative correlation between age and volume for the DG and CA1 in men than in women. Overall, our results add to the literature and reconcile previous results on the lifespan volumetric trajectories of hippocampal subfields.
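As a simplified illustration of what an age-by-sex interaction means over the growth period, the sketch below fits a separate straight line per sex. In a linear model with separate intercepts and slopes for each sex (volume ~ age + sex + age:sex), the interaction coefficient equals the difference between the two per-sex slopes. This is a linear simplification, not the spline-based analysis of this study, and the toy volumes, slopes, and intercepts are invented for the example.

```python
def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Hypothetical, noise-free toy volumes (arbitrary units) for ages 5 to 20.
ages = list(range(5, 21))
vol_men = [0.40 + 0.010 * a for a in ages]
vol_women = [0.38 + 0.006 * a for a in ages]

slope_men = ols_slope(ages, vol_men)
slope_women = ols_slope(ages, vol_women)

# Age-by-sex interaction: how much faster volume changes with age in men.
interaction = slope_men - slope_women
print(slope_men, slope_women, interaction)
```

A positive interaction here corresponds to faster growth in men than in women. Testing whether such a difference is statistically significant would additionally require standard errors, which this sketch does not compute.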
Our study suffers from several limitations. First, the lack of a standardized protocol to segment the hippocampal subfields negatively affects the way algorithms learn to segment. This is partly solved by learning from a consensus between guidelines, but we lack a better in vivo ground truth than the one provided by manual segmentations. Second, volume might not reflect all the age-related changes in hippocampal structures. Although we found no anteroposterior differences between subjects, we believe it is critical to go beyond volumetric analysis and assess additional information, such as shape, as suggested by Yang et al. (2013), Voineskos et al. (2015), and Lynch et al. (2019), or other complementary measures gathered through diffusion imaging or quantitative T1 relaxation maps, a proxy for intracortical myelin (Vos de Wael et al., 2018).
In conclusion, while the hippocampal subfields are critical in the physiology of episodic memory, the lack of efficient segmentation tools hinders the use of large datasets to study their role in health and disease. Here, we introduced a new segmentation tool, HSF, which is robust to changes in populations and in acquisition parameters such as contrast, resolution, or magnetic field strength. After validating it against existing tools (ASHS, HIPS, and HippUnfold), we used it to segment large datasets (HCP-development, HCP-young adults, and HCP-aging) in order to model volumetric trajectories of the hippocampal subfields from 5 to 100 years old. Our volumetric analysis showed that the volumes of most subfields, except the subiculum, are positively correlated with age until the early 20s, and that the subfield most strongly correlated with age is the dentate gyrus. This study also found a major inflection point at approximately 70 years old (even earlier in the subiculum), after which a fast and significant volumetric decrease occurs. Our results have yet to be correlated with evaluations of mnesic performance, which could help to validate subicular volume as a relevant biomarker for the early diagnosis of age-related cognitive decline.
Data availability statement
With the exception of the HCP datasets, the datasets presented in this article are not readily available because participants provided consent only for using their data under the supervision of the principal investigator. Requests to access the datasets should be directed to MN.
Ethics statement
The studies involving human participants were reviewed and approved by CPP 2011-A00058-33. Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.
Author contributions
CP: conceptualization, methodology, software, formal analysis, data curation, writing—original draft, and visualization. AB: investigation, resources, data curation, writing—review and editing, and funding acquisition. SP and MF: data curation and writing—review and editing. AG: methodology. ED: methodology and writing—review and editing. MB: resources, data curation, and writing—review and editing. FL: writing—review and editing. MN: conceptualization, investigation, resources, writing—original draft, supervision, project administration, and funding acquisition. All authors contributed to the article and approved the submitted version.
Funding
This study was funded by Fondation de France, Grant/Award Number: 00070721.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Bachevalier, J. (2013). The development of memory from a neurocognitive and comparative perspective. In: Bauer PJ, Fivush R, editors. The Wiley Handbook on the Development of Children's Memory. Chichester: John Wiley and Sons Ltd. p. 109–25.
Berron, D., Vieweg, P., Hochkeppler, A., Pluta, J. B., Ding, S. L., Maass, A., et al. (2017). A protocol for manual segmentation of medial temporal lobe subregions in 7 Tesla MRI. NeuroImage Clin. 15, 466–482. doi: 10.1016/j.nicl.2017.05.022
Bookheimer, S. Y., Salat, D. H., Terpstra, M., Ances, B. M., Barch, D. M., Buckner, R. L., et al. (2019). The lifespan human connectome project in aging: an overview. Neuroimage 185, 335–348. doi: 10.1016/j.neuroimage.2018.10.009
Bouyeure, A., and Noulhiane, M. (2021). Episodic memory development in normal and adverse environments. In: Factors Affecting Neurodevelopment. Elsevier. p. 517–27. Available online at: https://linkinghub.elsevier.com/retrieve/pii/B9780128179864000444 (accessed October 24, 2022).
Bouyeure, A., Patil, S., Mauconduit, F., Poiret, C., Isai, D., Noulhiane, M., et al. (2021). Hippocampal subfield volumes and memory discrimination in the developing brain. Hippocampus 31, 1202–1214. doi: 10.1002/hipo.23385
Bussy, A., Plitman, E., Patel, R., Tullo, S., Salaciak, A., Bedford, S. A., et al. (2021). Hippocampal subfield volumes across the healthy lifespan and the effects of MR sequence on estimates. Neuroimage. 233, 117931. doi: 10.1016/j.neuroimage.2021.117931
Canada, K. L. (2020). Examining the Co-Development of Episodic Memory and Hippocampal Subfields – A Longitudinal Study. Digital Repository at the University of Maryland. Available online at: http://drum.lib.umd.edu/handle/1903/26350 (accessed August 31, 2022).
Chi, C. H., Yang, F. C., and Chang, Y. L. (2022). Age-related volumetric alterations in hippocampal subiculum region are associated with reduced retention of the “when” memory component. Brain Cogn. 160, 105877. doi: 10.1016/j.bandc.2022.105877
Dalton, M. A., Zeidman, P., Barry, D. N., Williams, E., and Maguire, E. A. (2017). Segmenting subregions of the human hippocampus on structural magnetic resonance image scans: an illustrated tutorial. Brain Neurosci. Adv. 1:239821281770144. doi: 10.1177/2398212817701448
de Flores, R., La Joie, R., and Chételat, G. (2015). Structural imaging of hippocampal subfields in healthy aging and Alzheimer's disease. Neuroscience 309, 29–50. doi: 10.1016/j.neuroscience.2015.08.033
DeKraker, J., Haast, R. A., Yousif, M. D., Karat, B., Köhler, S., Khan, A. R., et al. (2021). HippUnfold: Automated Hippocampal Unfolding, Morphometry, and Subfield Segmentation. Neuroscience. Available online at: http://biorxiv.org/lookup/doi/10.1101/12,03.471134 (accessed July 19, 2022).
DeKraker, J., Haast, R. A., Yousif, M. D., Karat, B., Lau, J. C., Köhler, S., et al. (2022). Automated hippocampal unfolding for morphometry and subfield segmentation with HippUnfold. Elife. 11, e77945. doi: 10.7554/eLife.77945
Elhakeem, A., Hughes, R. A., Tilling, K., Cousminer, D. L., Jackowski, S. A., Cole, T. J., et al. (2022). Using linear and natural cubic splines, SITAR, and latent trajectory models to characterise nonlinear longitudinal growth trajectories in cohort studies. BMC Med. Res. Methodol. 22, 68. doi: 10.1186/s12874-022-01542-8
Ellis, C. T., Skalaban, L. J., Yates, T. S., Bejjanki, V. R., Córdova, N. I., Turk-Browne, N. B., et al. (2021). Evidence of hippocampal learning in human infants. Curr. Biol. 31, 3358–3364.e4. doi: 10.1016/j.cub.2021.04.072
Flores, R., Berron, D., Ding, S., Ittyerah, R., Pluta, J. B., Xie, L., et al. (2020). Characterization of hippocampal subfields using ex vivo MRI and histology data: Lessons for in vivo segmentation. Hippocampus. 30, 545–564. doi: 10.1002/hipo.23172
Fonov, V., Evans, A., McKinstry, R., Almli, C., and Collins, D. (2009). Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. Neuroimage 47, S102. doi: 10.1016/S1053-8119(09)70884-5
Foster, C. M., Kennedy, K. M., Hoagey, D. A., and Rodrigue, K. M. (2019). The role of hippocampal subfield volume and fornix microstructure in episodic memory across the lifespan. Hippocampus. 29, 1206–1223. doi: 10.1002/hipo.23133
Galea, L. A. M., Spritzer, M. D., Barker, J. M., and Pawluski, J. L. (2006). Gonadal hormone modulation of hippocampal neurogenesis in the adult. Hippocampus 16, 225–232. doi: 10.1002/hipo.20154
Gogtay, N., Nugent, T. F., Herman, D. H., Ordonez, A., Greenstein, D., Hayashi, K. M., et al. (2006). Dynamic mapping of normal human hippocampal development. Hippocampus 16, 664–672. doi: 10.1002/hipo.20193
Greenland, S. (1995). Avoiding power loss associated with categorization and ordinal scores in dose-response and trend analysis. Epidemiology 6, 450–454. doi: 10.1097/00001648-199507000-00025
Haeger, A., Mangin, J. F., Vignaud, A., Poupon, C., Grigis, A., Boumezbeur, F., et al. (2020). Imaging the aging brain: study design and baseline findings of the SENIOR cohort. Alz Res Therapy. 12, 77. doi: 10.1186/s13195-020-00642-1
Hamson, D. K., Wainwright, S. R., Taylor, J. R., Jones, B. A., Watson, N. V., Galea, L. A. M., et al. (2013). Androgens increase survival of adult-born neurons in the dentate gyrus by an androgen receptor-dependent mechanism in male rats. Endocrinology 154, 3294–3304. doi: 10.1210/en.2013-1129
Hindy, N. C., Ng, F. Y., and Turk-Browne, N. B. (2016). Linking pattern completion in the hippocampus to predictive coding in visual cortex. Nat. Neurosci. 19, 665–667. doi: 10.1038/nn.4284
Iglesias, J. E., Augustinack, J. C., Nguyen, K., Player, C. M., Player, A., Wright, M., et al. (2015). A computational atlas of the hippocampal formation using ex vivo, ultra-high resolution MRI: Application to adaptive segmentation of in vivo MRI. Neuroimage. 115, 117–137. doi: 10.1016/j.neuroimage.2015.04.042
Islam, M. N., Sakimoto, Y., Jahan, M. R., Ishida, M., Tarif, A. M. M., Nozaki, K., et al. (2020). Androgen affects the dynamics of intrinsic plasticity of pyramidal neurons in the CA1 hippocampal subfield in adolescent male rats. Neuroscience. 440, 15–29. doi: 10.1016/j.neuroscience.2020.05.025
Kulaga-Yoskovitz, J., Bernhardt, B. C., Hong, S. J., Mansi, T., Liang, K. E., van der Kouwe, A. J. W., et al. (2015). Multi-contrast submillimetric 3 Tesla hippocampal subfield segmentation protocol and dataset. Sci Data. 2, 150059. doi: 10.1038/sdata.2015.59
Lagarde, J., Olivieri, P., Tonietto, M., Gervais, P., Comtat, C., Caillé, F., et al. (2021). Distinct amyloid and tau PET signatures are associated with diverging clinical and imaging trajectories in patients with amnestic syndrome of the hippocampal type. Transl Psychiatry. 11, 498. doi: 10.1038/s41398-021-01628-9
Lavenex, P., and Banta Lavenex, P. (2013). Building hippocampal circuits to learn and remember: Insights into the development of human memory. Behav Brain Res. 254, 8–21. doi: 10.1016/j.bbr.2013.02.007
Lee, J. K., Ekstrom, A. D., and Ghetti, S. (2014). Volume of hippocampal subfields and episodic memory in childhood and adolescence. Neuroimage 94, 162–171. doi: 10.1016/j.neuroimage.2014.03.019
Leranth, C. (2004). Androgens increase spine synapse density in the CA1 hippocampal subfield of ovariectomized female rats. J. Neurosci. 24, 495–499. doi: 10.1523/JNEUROSCI.4516-03.2004
Luo, P., Ren, J., Peng, Z., Zhang, R., and Li, J. (2019). Differentiable Learning-to-Normalize via Switchable Normalization. arXiv:1806.10779. Available online at: http://arxiv.org/abs/1806.10779 (accessed May 31, 2023).
Lynch, K. M., Shi, Y., Toga, A. W., and Clark, K. A. (2019). Hippocampal shape maturation in childhood and adolescence. Cereb. Cortex. 29, 3651–3665. doi: 10.1093/cercor/bhy244
Manera, A. L., Dadar, M., Fonov, V., and Collins, D. L. (2020). CerebrA, registration and manual label correction of Mindboggle-101 atlas for MNI-ICBM152 template. Sci Data. 7, 237. doi: 10.1038/s41597-020-0557-9
Mueller, S. G., Stables, L., Du, A. T., Schuff, N., Truran, D., Cashdollar, N., et al. (2007). Measurement of hippocampal subfields and age-related changes with high resolution MRI at 4T. Neurobiol Aging 28, 719–726. doi: 10.1016/j.neurobiolaging.2006.03.007
Oktay, O., Schlemper, J., Folgoc, L. L., Lee, M., Heinrich, M., Misawa, K., et al. (2018). Attention U-Net: Learning Where to Look for the Pancreas. arXiv:1804.03999. Available online at: http://arxiv.org/abs/1804.03999 (accessed May 31, 2023).
O'Mahony, N., Campbell, S., Carvalho, A., Harapanahalli, S., Hernandez, G. V., Krpalkova, L., et al. (2020). Deep learning vs. traditional computer vision. In: Arai, K., Kapoor, S., editors. Advances in Computer Vision. Cham: Springer International Publishing. p. 128–44.
Opitz, D., and Maclin, R. (1999). Popular ensemble methods: an empirical study. J. Artif. Intell. Res. 11, 169–198. doi: 10.1613/jair.614
Palombo, D. J., Bacopulos, A., Amaral, R. S. C., Olsen, R. K., Todd, R. M., Anderson, A. K., et al. (2018). Episodic autobiographical memory is associated with variation in the size of hippocampal subregions. Hippocampus. 28, 69–75. doi: 10.1002/hipo.22818
Qiu, Q., Gong, G., Wang, L., Duan, J., and Yin, Y. (2019). Feasibility of Automatic Segmentation of Hippocampus Based on Deep Learning in Hippocampus-Sparing Radiotherapy. Int. J. Radiat. Oncol. Biol. Phy. 105, E137–E138. doi: 10.1016/j.ijrobp.2019.06.2177
Romero, J. E., Coupé, P., and Manjón, J. V. (2017). HIPS: A new hippocampus subfield segmentation method. Neuroimage. 163, 286–295. doi: 10.1016/j.neuroimage.2017.09.049
Satopaa, V., Albrecht, J., Irwin, D., and Raghavan, B. (2011). Finding a “Kneedle” in a haystack: detecting knee points in system behavior. In: 2011 31st International Conference on Distributed Computing Systems Workshops. Minneapolis, MN: IEEE (2011). p. 166–71.
Schmidt, M. F., Storrs, J. M., Freeman, K. B., Jack, C. R., Turner, S. T., Griswold, M. E., et al. (2018). A comparison of manual tracing and FreeSurfer for estimating hippocampal volume over the adult lifespan. Hum Brain Mapp. 39, 2500–2513. doi: 10.1002/hbm.24017
Shaw, T. B., York, A., Barth, M., and Bollmann, S. (2020). Towards optimising MRI characterisation of tissue (TOMCAT) dataset including all longitudinal automatic segmentation of hippocampal subfields (LASHiS) data. Data Brief 32, 106043. doi: 10.1016/j.dib.2020.106043
Somerville, L. H., Bookheimer, S. Y., Buckner, R. L., Burgess, G. C., Curtiss, S. W., Dapretto, M., et al. (2018). The lifespan human connectome project in development: a large-scale study of brain connectivity development in 5–21 year olds. Neuroimage. 183, 456–468. doi: 10.1016/j.neuroimage.2018.08.050
Spritzer, M. D., and Galea, L. A. M. (2007). Testosterone and dihydrotestosterone, but not estradiol, enhance survival of new hippocampal neurons in adult male rats. Devel Neurobio. 67, 1321–1333. doi: 10.1002/dneu.20457
Sullivan, E. V., Marsh, L., and Pfefferbaum, A. (2005). Preservation of hippocampal volume throughout adulthood in healthy men and women. Neurobiol Aging. 26, 1093–1098. doi: 10.1016/j.neurobiolaging.2004.09.015
Suzuki, M. (2004). Male-specific volume expansion of the human hippocampus during adolescence. Cereb Cortex. 15, 187–193. doi: 10.1093/cercor/bhh121
Uematsu, A., Matsui, M., Tanaka, C., Takahashi, T., Noguchi, K., Suzuki, M., et al. (2012). Developmental trajectories of amygdala and hippocampus from infancy to early adulthood in healthy individuals. Krueger F, editor. PLoS ONE 7, e46970. doi: 10.1371/journal.pone.0046970
Voineskos, A. N., Winterburn, J. L., Felsky, D., Pipitone, J., Rajji, T. K., Mulsant, B. H., et al. (2015). Hippocampal (subfield) volume and shape in relation to cognitive performance across the adult lifespan: hippocampal volume, shape, and age-related cognitive performance. Hum. Brain Mapp. 36, 3020–3037. doi: 10.1002/hbm.22825
Vos de Wael, R., Larivière, S., Caldairou, B., Hong, S. J., Margulies, D. S., Jefferies, E., et al. (2018). Anatomical and microstructural determinants of hippocampal subfield functional connectome embedding. Proc. Natl. Acad. Sci. USA. 115, 10154–10159. doi: 10.1073/pnas.1803667115
Wang, G., Li, W., Aertsen, M., Deprest, J., Ourselin, S., Vercauteren, T., et al. (2019). Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks. Neurocomputing. 338, 34–45. doi: 10.1016/j.neucom.2019.01.103
Winterburn, J. L., Pruessner, J. C., Chavez, S., Schira, M. M., Lobaugh, N. J., Voineskos, A. N., et al. (2013). A novel in vivo atlas of human hippocampal subfields using high-resolution 3T magnetic resonance imaging. Neuroimage. 74, 254–265. doi: 10.1016/j.neuroimage.2013.02.003
Wisse, L. E. M., Biessels, G. J., and Geerlings, M. I. A. (2014). Critical appraisal of the hippocampal subfield segmentation package in freesurfer. Front Aging Neurosci. 6, 261. doi: 10.3389/fnagi.2014.00261
Wisse, L. E. M., Daugherty, A. M., Olsen, R. K., Berron, D., Carr, V. A., Stark, C. E. L., et al. (2017). A harmonized segmentation protocol for hippocampal and parahippocampal subregions: Why do we need one and what are the key goals?: A harmonized hippocampal subfield protocol: key goals and impact. Hippocampus. 27, 3–11. doi: 10.1002/hipo.22671
Wisse, L. E. M., Kuijf, H. J., Honingh, A. M., Wang, H., Pluta, J. B., Das, S. R., et al. (2016). Automated Hippocampal Subfield Segmentation at 7T MRI. AJNR. Am. J. Neuroradiol. 37, 1050–1057. doi: 10.3174/ajnr.A4659
Yang, X., Goh, A., Chen, S. H. A., and Qiu, A. (2013). Evolution of hippocampal shapes across the human lifespan: hippocampal shapes in aging. Hum. Brain Mapp. 34, 3075–3085. doi: 10.1002/hbm.22125
Yang, Z., Zhuang, X., Mishra, V., Sreenivasan, K., and Cordes, D. (2020). CAST. A multi-scale convolutional neural network based automated hippocampal subfield segmentation toolbox. Neuroimage 218, 116947. doi: 10.1016/j.neuroimage.2020.116947
Yassa, M. A., and Stark, C. E. L. (2011). Pattern separation in the hippocampus. Trends Neurosci. 34, 515–525. doi: 10.1016/j.tins.2011.06.006
Yushkevich, P. A., Amaral, R. S. C., Augustinack, J. C., Bender, A. R., Bernstein, J. D., Boccardi, M., et al. (2015a). Quantitative comparison of 21 protocols for labeling hippocampal subfields and parahippocampal subregions in in vivo MRI: towards a harmonized segmentation protocol. Neuroimage. 111, 526–541. doi: 10.1016/j.neuroimage.2015.01.004
Yushkevich, P. A., Pluta, J. B., Wang, H., Xie, L., Ding, S. L., Gertje, E. C., et al. (2015b). Automated volumetry and regional thickness analysis of hippocampal subfields and medial temporal cortical structures in mild cognitive impairment: automatic morphometry of MTL subfields in MCI. Hum. Brain Mapp. 36, 258–287. doi: 10.1002/hbm.22627
Yushkevich, P. A., Wang, H., Pluta, J., Das, S. R., Craige, C., Avants, B. B., et al. (2010). Nearly automatic segmentation of hippocampal subfields in in vivo focal T2-weighted MRI. Neuroimage. 53, 1208–1224. doi: 10.1016/j.neuroimage.2010.06.040
Zhang, Z., Liu, Q., and Wang, Y. (2018). Road extraction by deep residual U-net. IEEE Geosci Remote Sensing Lett. 15, 749–753. doi: 10.1109/LGRS.2018.2802944
Zhu, H., Shi, F., Wang, L., Hung, S. C., Chen, M. H., Wang, S., et al. (2019). Dilated dense U-Net for infant hippocampus subfield segmentation. Front Neuroinform. 13, 30. doi: 10.3389/fninf.2019.00030
Keywords: deep learning, semantic segmentation, MRI, development, aging
Citation: Poiret C, Bouyeure A, Patil S, Grigis A, Duchesnay E, Faillot M, Bottlaender M, Lemaitre F and Noulhiane M (2023) A fast and robust hippocampal subfields segmentation: HSF revealing lifespan volumetric dynamics. Front. Neuroinform. 17:1130845. doi: 10.3389/fninf.2023.1130845
Received: 23 December 2022; Accepted: 22 May 2023; Published: 15 June 2023.
Edited by:
Xiaohao Cai, University of Southampton, United Kingdom
Reviewed by:
Christian Rummel, University of Bern, Switzerland
Chao Wang, Southern University of Science and Technology, China
Copyright © 2023 Poiret, Bouyeure, Patil, Grigis, Duchesnay, Faillot, Bottlaender, Lemaitre and Noulhiane. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Marion Noulhiane, bWFyaW9uLm5vdWxoaWFuZUBjZWEuZnI=