BRIEF RESEARCH REPORT article

Front. Neurosci., 09 February 2026

Sec. Brain Imaging Methods

Volume 20 - 2026 | https://doi.org/10.3389/fnins.2026.1729842

Multimodal connectivity-based cortical segmentation with graph neural networks

  • 1. Computer-Aided Medical Procedures and Augmented Reality, Technical University of Munich, Munich, Germany

  • 2. Athinoula A. Martinos Center for Biomedical Imaging, Radiology Department, Massachusetts General Hospital, Harvard Medical School, Boston, MA, United States

Article metrics

View details

490

Views

25

Downloads

Abstract

Due to the significant amount of time and expertise needed for manual segmentation of the brain cortex from magnetic resonance imaging (MRI) data, there is a substantial need for efficient and accurate algorithms to replace the need for human involvement. In this work, we explore the capabilities of Graph Neural Networks (GNNs) to segment the brain surface based on structural brain connectivity. We train three different GNN architectures, the Graph Convolutional Network (GCN), the Graph Attention Network (GAT), and the Graph U-Net, and evaluate their performances when trained on silver-standard cortical region labels created by FreeSurfer. We take a multimodal approach to brain segmentation by examining the influence of the structural connectivity values inferred from diffusion MRI (dMRI) in addition to using values from structural MRI (sMRI). Our results demonstrate the utility of GNN models, particularly the GAT architecture, which achieved Dice scores competitive to those reported in the literature with non-graph methods. Additionally, structural connectivity derived from dMRI revealed significant value in improving automatic segmentation, as models trained on combined attributes from dMRI and sMRI outperformed those trained only on sMRI. Finally, we compared the GNN-based and the FreeSurfer segmentations in their ability to predict demographic/clinical data, where neither of the two approaches was statistically significantly superior to the other.

1 Introduction

Development of in vivo imaging techniques such as magnetic resonance imaging (MRI) has given neuroscientists an unprecedentedly detailed and non-invasive view of the human brain. However, as the brain is a highly complex structure consisting of billions of neurons, interpreting MRI images remains a challenge. Brain segmentation provides a valuable way of reducing dimensionality by dividing the brain into a limited number of regions of interest (Gallardo Diez, 2018; Wang et al., 2018; Desikan et al., 2006). Further, segmentation greatly aids medicine and neuroscience by helping to advance our understanding of neuroanatomy and providing deeper insight into the connection between brain structure and human clinical and behavioral data (Gallardo Diez, 2018).

Manual brain labeling is a tedious and time-consuming process (Roy et al., 2018). As a result, there has been substantial research to fully automate the process. The vast majority of published methods use only structural MRI (sMRI) images as input, which provide important information about brain anatomy and allow measurements such as cortical thickness and curvature (Roy et al., 2018; Henschel et al., 2020, 2022; Fischl et al., 2004; Zhang et al., 2024). However, recent studies indicate that multimodal segmentation can provide novel insights and improve the utility of surface segmentation (Wang et al., 2018; Zhang et al., 2024). Specifically, the relationship between structural connectivity and cortical morphology has been exploited to leverage structural connectivity to segment (Zhang et al., 2024; Eschenburg et al., 2021; Wang et al., 2018) and align (Petrović et al., 2009; Isaev et al., 2017; Cole et al., 2025; Zhou et al., 2026) the cortex. Functional connectivity obtained from functional MRI has also been used for such purposes (Wang et al., 2018; Conroy et al., 2013). We explore here the impact of including structural connectivity, which is a measure of the strength of physical white-matter connections between brain regions, to segment the cortex.

We train Graph Neural Networks (GNNs), such as the Graph Convolutional Network (GCN), the Graph Attention Network (GAT), and the Graph U-Net architectures, based on information from sMRI alone as well as combined sMRI and dMRI connectivity features, using the segmentation results obtained from FreeSurfer (Fischl, 2012) as the silver standard. Doing so enables us to compare the segmentation quality of a model trained on data from a single modality, in our case sMRI, with a model trained with the addition of connectivity data derived from dMRI. Furthermore, we explore the accuracy of different GNN architectures, as well as the influence of different sMRI-derived parameters on the final segmentation results.

We also validate the best-performing model by examining its effectiveness in a downstream task, thereby evaluating the applicability of the multimodal GNN and its possible improvement upon the silver-standard training set. Given that FreeSurfer segmentation labels are created automatically, as opposed to manually, they are not considered gold-standard ground truth (Fischl et al., 2004), and, as such, they might contain errors. Overlap measures between the labels resulting from GNN and those of FreeSurfer, therefore, cannot gauge any potential improvement over the training labels, as a perfect Dice score would only indicate that the model has learned to perfectly reproduce the FreeSurfer results. Conversely, disparities between the two results could indicate either an error on the part of the trained model or possibly an improvement upon the FreeSurfer segmentation, something that cannot be estimated with silver-standard labels alone. Evaluating the usefulness of the final segmentation when applied to a downstream task, such as age prediction, however, provides us with a surrogate measure to compare the two segmentation methods.

We describe our methods in Section 2, provide our experimental results in Section 3, and discuss them and conclude the paper in Section 4.

2 Method

The basis of our cortical graph structure is the pial surface automatically created with FreeSurfer (Fischl, 2012) from the T1-weighted 3D MRI image. The final result is in the form of a triangular mesh approximating the pial surface, which is shown in Figure 1 for one of the test subjects. The resulting graph is unweighted. A feature vector is assigned to each vertex (node) in the graph and can contain a combination of the characteristics derived from sMRI and dMRI.

Figure 1

Visualization of a complex blue mesh structure resembling a human brain. Three rectangular insets on the right zoom into specific sections, showing detailed mesh patterns. Axes with numerical labels are visible.

Visualization of the pial surface of an example subject from the training dataset. The zoomed-in regions show the triangular tessellation of the surface in greater detail. Note that while the structure of the graph is that of the mesh, the positioning of the nodes in the 3D space (used here for visualization) is not part of the graph. The (x, y, z) coordinates are in fact used as part of the feature vector of each vertex.

Data collected from sMRI that we incorporated in the feature vector include (x, y, z) coordinates of the node, cortical thickness, and curvature values. Thickness is defined as the distance between the white and pial surfaces, and curvature is defined as the inverse of the radius of the circle tangent to the surface at the position of the corresponding vertex (mean curvature was used, which is the average of the curvatures corresponding to the smallest and the largest tangent circles). Thickness and curvature are scalar values, and the coordinates are a vector of length 3. The curvature is negative in a gyrus and positive in a sulcus. The features were min-max normalized across the entire dataset.

To compute structural connectivity of each vertex to the other vertices from dMRI, we used a conductance-based method (Frau-Pascual et al., 2019). This approach models white-matter pathways within the brain based on a combination of differential Maxwell's equations and Kirchoff's circuit laws. A high conductance (i.e., low resistance) value computed between two regions represents a high degree of connectivity between them. This method accounts for both direct and indirect brain connections and has been shown to predict age and dementia (Frau-Pascual et al., 2021). We applied this method to sMRI and dMRI of 97 subjects of the “100 Unrelated Subjects” group of the Human Connectome Project (HCP) Young Adult study, which consists of healthy individuals aged 22–35 (Van Essen et al., 2013). Three subjects were omitted due to failed connectivity computation. We were granted access to restricted HCP data, which contains, among other sensitive information, the subjects' exact age, Mini-Mental State Exam (MMSE) scores, and education level. During preprocessing, we transformed the data from the FreeSurfer uniform space, fsaverage3, where the brain connectivity data had been computed, to the native space of every subject via the mris_apply_reg command. The final connectivity map for each vertex was a feature vector of length 642 (the number of vertices in the hemisphere).

We used the following three different GNN architectures for segmentation:

  • Graph Convolutional Network (GCN) is based on the Convolutional Neural Network concept, but designed to work on graph structures. With each layer, the information from the neighboring nodes is incorporated in the next layer's feature vector (Kipf and Welling, 2017).

  • Graph Attention Network (GAT) allows to assign a degree of usefulness to each neighboring node, making it a more complex architecture than the GCN (Veličković et al., 2018).

  • Graph U-Net is based on the U-Net model, a deep-learning architecture well suited for the segmentation of biomedical images. It contains Graph Pooling layers, which down-sample the graph, followed by Graph Unpooling layers, which expand it back to the original size (Gao and Ji, 2019).

3 Results

We first report the outcomes of the GNN model training on the task of cortical segmentation with the FreeSurfer segmentation as ground truth. Next, we detail the results of predicting the demographic/clinical characteristics of the subjects based on the segmentations, which compares the usefulness of the model's and the FreeSurfer's outputs for a real-life task.

3.1 Graph-based cortical segmentation

All three models were evaluated on 97 subjects of the “100 Unrelated Subjects” group of the HCP Young Adult dataset. We held out eight subjects as the final validation set, and the remaining 89 subjects were included in a fivefold cross-validation. We monitored the training process in each split by examining the Dice scores of the model applied to both the training and the validation sets of the fold. In each fold, once training concluded, we applied the trained model to the final validation set (eight subjects). The mean Dice score on the final validation set over the five splits is reported and used as the primary metric to compare different models in the following.

For GNN model training, the Adam optimization algorithm and the ReduceLRonPlateau scheduler were used. The starting learning rate was lr = 0.01, and all models were trained for 100 epochs. Due to GPU constraints, a batch size of 1 was chosen. Gaussian noise was added to the input features during training for data augmentation.

Table 1 shows the final average Dice scores for the three model architectures and six different feature vectors, for the left and right hemispheres. For ease of reading, the model trained on coordinates, thickness, and curvature will be referred to as the “fully structural” model, and the model trained on all possible attributes, that is, coordinates, thickness, curvature, and structural connectivity, will be called the “multimodal” model.

Table 1

Feature vector GCN GAT Graph U-Net
Left hemisphere
(x, y, z) coordinates 0.712 ± 0.009 0.707 ± 0.017 –
(x, y, z) coordinates + thickness 0.743 ± 0.003 0.744 ± 0.005 –
(x, y, z) coordinates + curvature 0.742 ± 0.003 0.745 ± 0.007 –
(x, y, z) coordinates + thickness + curvature 0.761 Âą 0.008 0.768 Âą 0.005 0.744 Âą 0.003
(x, y, z) coordinates + thickness + curvature + conductance 0.9104 Âą 0.0008 0.927 Âą 0.003 0.882 Âą 0.003
(x, y, z) coordinates + conductance 0.8930 ± 0.0006 0.914 ± 0.003 –
Right hemisphere
(x, y, z) coordinates 0.708 ± 0.005 0.711 ± 0.004 –
(x, y, z) coordinates + thickness 0.735 ± 0.004 0.728 ± 0.008 –
(x, y, z) coordinates + curvature 0.725 ± 0.006 0.722 ± 0.009 –
(x, y, z) coordinates + thickness + curvature 0.753 Âą 0.007 0.760 Âą 0.003 0.734 Âą 0.004
(x, y, z) coordinates + thickness + curvature + conductance 0.900 Âą 0.001 0.918 Âą 0.002 0.876 Âą 0.009
(x, y, z) coordinates + conductance 0.886 ± 0.002 0.904 ± 0.002 –

Average Dice scores on the final validation set.

Dice scores (mean Âą StD) on the final validation dataset for GCN, GAT, and Graph U-Net architectures for both left and right hemispheres. The highest value in each row is highlighted in green. The row with the highest values is in bold. Graph U-Net was trained only for two types of input feature vectors due to the significantly longer training time required and poor initial results.

3.1.1 Influence of the feature vector

As can be seen in Table 1 for both hemispheres, using only (x, y, z) coordinates of the vertex produced the lowest Dice scores. Adding either the thickness or curvature scalar value slightly improved the results, with thickness often slightly more helpful than curvature. Adding them both further increased the final score, suggesting non-overlapping information between them. The most notable finding is the results by the multimodal model (when the connectivity values are added), where the Dice score is considerably increased, for example, up to a cross-hemisphere mean of 0.923 for the GAT model. Worth pointing out is also that the model trained on the combination of (x, y, z) + conductance still outperformed all models trained on data solely from sMRI.

3.1.2 Comparison of the model architectures

In terms of model performance, GAT outperformed GCN in 9 out of the 12 feature combinations, including all that include connectivity values. Among models only trained on structural data, GCN either outperformed the GAT model or scored slightly below it. Graph U-Net achieved the lowest final scores in all examined cases.

Two-sided paired t-tests across the results on the final validation set revealed the subject-mean Dice scores from the multimodal model to be significantly higher for GAT as compared to GCN (p = 2 × 10−5 and p = 10−6 for the left and right hemispheres, respectively) and Graph U-Net (p = 7 × 10−9 and p = 3 × 10−6 for the left and right hemispheres, respectively).

3.1.3 Influence of the number of parameters on the model performance

As detailed in Section 3.1.1, the addition of connectivity values to the feature vector considerably improved the segmentation results. For a fair comparison, the depth and width of the models were kept constant regardless of the size of the feature vector. However, as shown in Table 2, the connectivity data increase the size of the feature vector by 642, in turn increasing the number of trainable weights in our multimodal models, whereas thickness and curvature each only add a single scalar value per vector. In other words, when the sizes of the rest of the layers are kept constant, the number of trainable parameters will increase with the length of the input feature vector. For example, for the GCN architecture, the number of weights increased for a model trained solely on coordinates from 144k to 145k for a model trained on a combination of coordinates, thickness, and curvature, and finally up to 474k for a model trained on all available attributes.

Table 2

(x, y, z) coordinates + thickness + curvature
Number of parameters Dice score
145001 0.7410 Âą 0.0013
285801 0.7633 Âą 0.0031
499201 0.7630 Âą 0.0017
(x, y, z) coordinates + thickness + curvature + conductance
473705 0.9104 Âą 0.0008

Influence of the number of model parameters on the final Dice score in the GCN model.

It is worth pointing out that due to the possibility of overfitting, increasing the number of trainable parameters does not always lead to enhanced performance. Having more parameters is therefore not necessarily an advantage for a model trained on the connectivity values. However, for the sake of completeness, we trained a GCN model with an increased width of the second layer on a feature vector solely including attributes from sMRI, to see how much it would affect its performance. As can be seen in Table 2, the final Dice score of the fully structural model was still significantly lower than the multimodal model, even with more trainable weights in the former than in the latter. When the number of features in the fully structural model was doubled relative to the original size, the score increased by 0.0223. However, upon increasing the size again, the result plateaued.

3.2 Downstream task

We further evaluated the performance of the GNN model by employing it in a downstream task, which tests the usefulness of the final segmentation results. As the multimodal GAT was the best-performing model in both hemispheres in Section 3.1.2, we chose this model for the downstream task.

3.2.1 Statistical correlation

To examine the applicability of the segmentation outputs of the trained model and FreeSurfer, we correlated region-wise imaging attributes, namely average cortical thickness and average conductance, with demographic/clinical characteristics such as age, MMSE score, and education level. Specifically, we calculated the average thickness/conductance value for every segmented cortical region for both the FreeSurfer and the GAT model segmentations, and then determined how the resulting values correlated with the subjects' age/MMSE/education.

When correlating the average of all within-region mean thickness values with age, for FreeSurfer segmentation, the left hemisphere showed statistical significance with r = −0.35 and . The negative linear correlation is consistent with what is reported in the literature (Aycheh et al., 2018). For the right hemisphere, the values are slightly less significant, with r = −0.23 and . Model-based segmentation performed similarly to FreeSurfer-based segmentation; for the left hemisphere, r = −0.31 and , and for the right hemisphere r = −0.23 and .

In region-wise correlation, four FreeSurfer regions passed the Bonferroni-corrected threshold (ÎąB = 0.05/35 = 0.0014) in the left hemisphere and two in the right hemisphere. For model-based segmentation, five regions fulfilled the threshold for the left hemisphere, and two for the right hemisphere. Table 3 shows the p-values for the specific regions in which at least one of the segmentation sources passed the significance threshold ÎąB. While this shows certain regions' strong correlation with cortical thickness, the model and FreeSurfer performed similarly, as a paired Wilcoxon signed rank test on the region-wise p-values revealed no statistically significant difference (pW = 0.16 and pW = 0.88 for the left and right hemispheres, respectively).

Table 3

Region FreeSurfer GAT model
Left hemisphere
Superior frontal gyrus p = 0.002, r = −0.332 p = 0.0008, r = −0.334
Inferior frontal gyrus, Pars triangularis p = 1.86e−07,
r = −0.500
p = 9.3e−07,
r = −0.474
Inferior parietal cortex p = 0.002,
r = −0.315
p = 0.001,
r = −0.321
Superior temporal gyrus p = 0.001,
r = −0.321
p = 0.0009,
r = −0.332
Banks of the superior temporal sulcus p = 7.0e−05,
r = −0.392
p = 7.7e−05,
r = −0.390
Right hemisphere
Middle frontal gyrus, Rostral division p = 0.0011,
r = −0.325
p = 0.0009,
r = −0.332
Middle frontal gyrus, Caudal division p = 0.0001,
r = −0.383
p = 0.0002,
r = −0.366

Results of the correlation between the mean thickness and the subject's age.

Brain regions with mean cortical thickness statistically significantly correlated with age (by at least one method), for FreeSurfer and GNN-based (GAT) segmentation. The more significant result in each row is highlighted in green.

No correlation between other brain attributes in a brain region and/or other demographic/clinical features passed the ÎąB significance threshold.

3.2.2 Age prediction

For the prediction task, we combined regions from both hemispheres, resulting in an input vector of length 70. We used seven available machine learning models from the MATLAB Regression Learner Toolbox. We also trained different-sized Multi-Layer Perceptrons (MLPs). The performance was determined with the root mean square error (RMSE), which should be at the very least below the standard deviation, σage = 3.70 years, in order to show any form of utility.

FreeSurfer segmentation outperformed the GAT segmentation, achieving a lower RMSE for four out of seven models when using thickness as a brain characteristic and for all models when using conductance. However, the results for both methods were not satisfactory enough for a clear conclusion—the lowest reported RMSE was 3.44, i.e. only 0.26 below σage, for the Regression Trees when using the FreeSurfer segmentation and the average conductance.

MLP showed poor training results, either indicating over- or underfitting, and a large variation in the RMSE between different validation splits. This significant training instability suggests that the model was not capable of properly generalizing on the dataset, making the comparison between the two segmentation methods difficult to interpret. As such, no strong conclusions could be drawn from the final results.

4 Discussion

We demonstrated that GNNs are a suitable network architecture for the task of cortical surface segmentation, comparable to the non-deep-learning based algorithms such as the one employed by FreeSurfer. While the direct comparison with other methods in the literature is limited due to the difference in datasets and cortical atlases used, the final reported Dice scores of 0.768 for the model solely trained on sMRI-derived data and, especially, 0.927 for the model trained by including connectomic data are comparable to the currently reported state-of-the-art results (Henschel et al., 2020, 2022; Roy et al., 2018; Wu et al., 2019; Cucurull et al., 2018; Eschenburg et al., 2021).

The considerable increase in the final Dice scores for the model trained in a multimodal fashion indicates the significant utility of structural connectivity in the task of brain segmentation. Notably, the model trained on the combination of (x, y, z) and conductance outperformed models trained on all available features from sMRI, which suggests that connectivity values provide more useful information than the thickness or curvature, even though those are the parameters that are utilized by the FreeSurfer segmentation algorithm (Fischl et al., 2004).

When considering the network architecture, GAT was the strongest one when connectivity data were included. However, GCN performed comparatively well when only sMRI values were considered. This is possibly because the more complicated architecture of the GAT model performs better when it needs to generalize on a more complex training set, and the GCN is not powerful enough to process the significantly longer feature vector once the connectivity data are introduced, even when it has more training weights. When the training data include only attributes from sMRI, a less powerful model such as a GCN might be enough.

It is worth noting that the Graph U-Net layers needed to be kept narrower than those in the other two architectures due to memory constraints, resulting in a much smaller number of training weights, 170403, for a Graph U-Net, in comparison to 473705 for a GCN and 242863 for a GAT model (for a feature vector of length 647). However, the fact that the GAT model, while having significantly fewer training weights than the GCN model, still managed to outperform it, indicates that a suitable model architecture should still be able to perform well with fewer trainable parameters. While GPU time and memory were limiting during model training, they are often much less of a concern during inference (e.g., while applying a trained model in a clinical setting), where even CPUs can be used to segment an image in a reasonable time.

The influence of trainable weights on the network performance was further explored in Section 3.1.3. The results from Table 2 indicate that, as expected, simply increasing the training weights is not a suitable alternative to introducing new attributes in the feature vector and adding new information to the training set.

In the second part of this work, we aimed to examine whether the GNN model has the capability to generalize upon FreeSurfer labels well enough to show higher utility in a downstream task posed. However, the results failed to unequivocally prove an improvement in the GNN-based segmentation. The RMSE from the MATLAB toolbox models did not show clear superiority of one method over the other, and the MLP showed typical behavior of an overfitted model, suggesting that the model most likely simply learned to memorize the input vector and their age labels, and did not have enough data to generalize on. It is therefore not possible to state whether the model was able to improve upon the silver-standard labels it was trained on.

One significant limitation of the study is the lack of manually labeled data, which could be used as the gold standard during training, serving as more reliable labels than the silver-standard segmentation created with FreeSurfer, and also as a benchmark for the evaluation of the final results. Training on more subjects could also improve the performance of the model in the downstream task. Nonetheless, as graph segmentation is intrinsically a node classification task, 97 subjects were enough to produce high training and validation scores. For the model to improve upon FreeSurfer, however, it may require a larger amount of data to be able to generalize upon the variation in the input. It is possible that with the amount of data provided, the model was capable of matching FreeSurfer's segmentation, but not necessarily improving upon it.

Furthermore, it would be beneficial to see if the models are equally capable of generalizing on a more diverse group of subjects. As stated before, the brain surfaces used for training were all from a young and healthy population. However, for neuroscientific research as well as in clinical settings, patients and elderly individuals are often of more interest. Even more insightful would be to compare the influence of brain connectivity on the segmentation of the brains of patients with neurological disorders that are known to influence both the structure and connectivity of the brain, such as Alzheimer's disease.

Future work includes validating our approach on gold-standard manually annotated cortical labels, attention-weight and node-importance analysis of the GAT architecture to explore which regions or connectivity features drive improvements, and training an end-to-end segmentation model to optimize downstream tasks (such as age prediction). Finally, more recent architectures, such as GCNs with an added attention mechanism (Tan et al., 2025) or diffusion-based approaches (Zhu et al., 2024), have shown great potential in the task of cortical surface segmentation. It would be worth exploring whether such state-of-the-art architectures could also benefit from the inclusion of structural connectivity.

Statements

Data availability statement

Magnetic resonance images were provided by the Human Connectome Project (HCP) (Van Essen et al., 2013), WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil): https://www.humanconnectome.org/study/hcp-young-adult. The conductance model of structural brain connectivity (Frau-Pascual et al., 2019) was computed using our public toolbox at: https://github.com/ainafp/FVT4DWI. FreeSurfer (Fischl, 2012) is publicly available at: https://surfer.nmr.mgh.harvard.edu. The models employed here were built using the PyTorch Geometric library (Fey and Lenssen, 2019).

Author contributions

AŁ: Conceptualization, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing, Formal analysis. AK: Conceptualization, Supervision, Writing – review & editing. CP: Supervision, Writing – review & editing, Conceptualization, Resources. AF-P: Resources, Writing – review & editing, Data curation. IA: Data curation, Writing – review & editing, Conceptualization, Funding acquisition, Project administration, Resources, Supervision.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the National Institutes of Health (NIH), specifically the National Institute on Aging (RF1AG068261, R01AG068261). Computational resources were provided by the Chair for Computer-Aided Medical Procedures and Augmented Reality at the Technical University of Munich.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author IA declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1

    Aycheh H. M. Seong J.-K. Shin J.-H. Na D. L. Kang B. Seo S. W. et al . (2018). Biological brain age prediction using cortical thickness data: a large scale cohort study. Front. Aging Neurosci. 10:252. doi: 10.3389/fnagi.2018.00252

  • 2

    Conroy B. R. Singer B. D. Guntupalli J. S. Ramadge P. J. Haxby J. V. (2013). Inter-subject alignment of human cortical anatomy using functional connectivity. NeuroImage81, 400–411. doi: 10.1016/j.neuroimage.2013.05.009

  • 3

    Cucurull G. Wagstyl K. Casanova A. Veličković, P. Jakobsen E. Drozdzal M. Romero A. et al . (2018). “Convolutional neural networks for mesh-based parcellation of the cerebral cortex,” in Medical Imaging with Deep Learning. Amsterdam.

  • 4

    [Dataset] Cole M. Xiang Y. Consagra W. Srivastava A. Qiu X. Zhang Z. et al . (2025). Alignment of continuous brain connectivity.

  • 5

    Desikan R. S. Ségonne F. Fischl B. Quinn B. T. Dickerson B. C. Blacker D. et al . (2006). An automated labeling system for subdividing the human cerebral cortex on mri scans into gyral based regions of interest. NeuroImage31, 968–980. doi: 10.1016/j.neuroimage.2006.01.021

  • 6

    Eschenburg K. M. Grabowski T. J. Haynor D. R. (2021). Learning cortical parcellations using graph neural networks. Front. Neurosci. 15:797500. doi: 10.3389/fnins.2021.797500

  • 7

    Fey M. Lenssen J. E. (2019). Fast graph representation learning with PyTorch geometric.

  • 8

    Fischl B. (2012). FreeSurfer. Neuroimage62, 774–781. doi: 10.1016/j.neuroimage.2012.01.021

  • 9

    Fischl B. van der Kouwe A. Destrieux C. Halgren E. Ségonne F. Salat D. H. et al . (2004). Automatically parcellating the human cerebral cortex. Cerebral Cortex14, 11–22. doi: 10.1093/cercor/bhg087

  • 10

    Frau-Pascual A. Augustinack J. Varadarajan D. Yendiki A. Salat D. H. Fischl B. et al . (2021). Conductance-based structural brain connectivity in aging and dementia. Brain Connect. 11, 566–583. doi: 10.1089/brain.2020.0903

  • 11

    Frau-Pascual A. Fogarty M. Fischl B. Yendiki A. Aganj I. (2019). Quantification of structural brain connectivity via a conductance model. Neuroimage189, 485–496. doi: 10.1016/j.neuroimage.2019.01.033

  • 12

    Gallardo Diez G. A. (2018). Inferring and comparing structural parcellations of the human brain using diffusion MRI. Theses, UniversitĂŠ CĂ´te d'Azur.

  • 13

    Gao H. Ji S. (2019). Graph U-nets. arXiv [preprint]. arXiv:1905.05178. doi: 10.48550/arXiv.1905.05178

  • 14

    Henschel L. Conjeti S. Estrada S. Diers K. Fischl B. Reuter M. (2020). FastSurfer - a fast and accurate deep learning based neuroimaging pipeline. NeuroImage219:117012. doi: 10.1016/j.neuroimage.2020.117012

  • 15

    Henschel L. Kügler D. Reuter M. (2022). FastSurferVINN: building resolution-independence into deep learning segmentation methods–a solution for highres brain MRI. NeuroImage251:118933. doi: 10.1016/j.neuroimage.2022.118933

  • 16

    Isaev D. Gutman B. Moyer D. Faskowitz J. Thompson P. (2017). “Cortical connectome registration using spherical demons,” in 12th International Symposium on Medical Information Processing and Analysis, 2016, Vol. 101600M. Tandil, Argentina. doi: 10.1117/12.2256975

  • 17

    Kipf T. N. Welling M. (2017). Semi-supervised classification with graph convolutional networks.

  • 18

    Petrović A. Smith S. M. Menke R. A. Jenkinson M. (2009). Methods for tractography-driven surface registration of brain structures. Med. Image Comput. Comput, Assist. Interv. 12(Pt 1), 705–712. doi: 10.1007/978-3-642-04268-3_87

  • 19

    Roy A. G. Conjeti S. Navab N. Wachinger C. (2018). QuickNAT: A fully convolutional network for quick and accurate segmentation of neuroanatomy.

  • 20

    Tan J. Ren X. Chen Y. Yuan X. Chang F. Yang R. et al . (2025). Application of improved graph convolutional network for cortical surface parcellation. Sci. Rep. 15:16409. doi: 10.1038/s41598-025-00116-0

  • 21

    Van Essen D. C. Smith S. M. Barch D. M. Behrens T. E. Yacoub E. Ugurbil K. (2013). The WU-Minn human connectome project: an overview. NeuroImage80, 62–79. doi: 10.1016/j.neuroimage.2013.05.041

  • 22

    Veličković P. Cucurull G. Casanova A. Romero A. Liò P. Bengio Y. (2018). Graph Attention Networks.

  • 23

    Wang C. Ng B. Garbi R. (2018). Multimodal brain parcellation based on functional and anatomical connectivity. Brain Connect. 8, 604–617. doi: 10.1089/brain.2017.0576

  • 24

    Wu Z. Zhao F. Xia J. Wang L. Lin W. Gilmore J. H. et al . (2019). Intrinsic Patch-Based cortical anatomical parcellation using graph convolutional neural network on surface manifold. Med. Image Comput. Comput. Assist. Interv. 11766, 492–500. doi: 10.1007/978-3-030-32248-9_55

  • 25

    Zhang F. Cho K. I. K. Seitz-Holland J. Ning L. Legarreta J. H. Rathi Y. et al . (2024). DDParcel: deep learning anatomical brain parcellation from diffusion MRI. IEEE Trans. Med. Imaging43, 1191–1202. doi: 10.1109/TMI.2023.3331691

  • 26

    Zhou Z. Li J. Williams J. Fischl B. Aganj I. (2026). “Deep-learning cortical registration guided by structural and diffusion MRI and connectivity,” in Proceedings of the IEEE International Symposium on Biomedical Imaging 2026 (London).

  • 27

    Zhu Y. Lian C. Li X. Wang F. Ma J. (2024). “Efficient cortical surface parcellation via full-band diffusion learning at individual space,” in Proceedings of Medical Image Computing and Computer Assisted Intervention-MICCAI 2024, volume LNCS 15002. Cham: Springer Nature Switzerland. doi: 10.1007/978-3-031-72069-7_16

Summary

Keywords

connectome-based prediction, cortical segmentation, diffusion MRI (dMRI), FreeSurfer, graph neural networks, structural brain connectivity

Citation

Łabiak A, Kazi A, Pellegrini C, Frau-Pascual A and Aganj I (2026) Multimodal connectivity-based cortical segmentation with graph neural networks. Front. Neurosci. 20:1729842. doi: 10.3389/fnins.2026.1729842

Received

21 October 2025

Revised

19 December 2025

Accepted

14 January 2026

Published

09 February 2026

Volume

20 - 2026

Edited by

Ahmed Shalaby, University of Texas Southwestern Medical Center, United States

Reviewed by

Waldemar Karwowski, University of Central Florida, United States

Hamed Mohammadi, University of Central Florida, United States

Updates

Copyright

*Correspondence: Agata Łabiak, ; Iman Aganj,

† Present address: Aina Frau-Pascual, Better Care S.L., Sabadell, Spain; Universitat Oberta de Catalunya, Barcelona, Spain

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics