Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 20 August 2025

Sec. Precision Medicine

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1659653

This article is part of the Research TopicAI Innovations in Neuroimaging: Transforming Brain AnalysisView all 9 articles

A novel MRI-based deep learning–radiomics framework for evaluating cerebrospinal fluid signal in central nervous system infection

Ferhat Cüce&#x;Ferhat Cüce1Gkalp Tulum
&#x;Gökalp Tulum2*Muhammed Ikbal Isik&#x;Muhammed Ikbal Isik1Marziye Jalili&#x;Marziye Jalili3Güven Girgin&#x;Güven Girgin4
mer Karada&#x;Ömer Karadaş5Niray BaNiray Baş5Berza 
zcanBerza Özcan6Ümit Savaci&#x;Ümit Savaşci6Sena akir&#x;Sena Şakir6Akay 
vün Karada&#x;Akçay Övünç Karadaş7Eda TeometeEda Teomete8Onur OsmanOnur Osman2Jawad Rasheed,,,
Jawad Rasheed9,10,11,12*
  • 1Department of Radiology, Health Science University, Gulhane Training, and Research Hospital, Ankara, Türkiye
  • 2Department of Electrical and Electronics Engineering, Topkapi University, Istanbul, Türkiye
  • 3Department of Artificial Intelligence, Üsküdar University, Istanbul, Türkiye
  • 4Department of Neurology, Muğla Training, and Research Hospital, Muğla, Türkiye
  • 5Department of Neurology, Health Science University, Gulhane Training, and Research Hospital, Ankara, Türkiye
  • 6Department of Infection Disease, Health Science University, Gulhane Training, and Research Hospital, Ankara, Türkiye
  • 7Private Clinic, Ankara, Türkiye
  • 8Department of Classics, University of Michigan, Ann Arbor, MI, United States
  • 9Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul, Türkiye
  • 10Department of Software Engineering, Istanbul Nisantasi University, Istanbul, Türkiye
  • 11Applied Science Research Center, Applied Science Private University, Amman, Jordan
  • 12Research Institute, Istanbul Medipol University, Istanbul, Türkiye

Introduction: Accurate and timely diagnosis of central nervous system infections (CNSIs) is critical, yet current gold-standard techniques like lumbar puncture (LP) remain invasive and prone to delay. This study proposes a novel noninvasive framework integrating handcrafted radiomic features and deep learning (DL) to identify cerebrospinal fluid (CSF) alterations on magnetic resonance imaging (MRI) in patients with acute CNSI.

Methods: Fifty-two patients diagnosed with acute CNSI who underwent LP and brain MRI within 48 h of hospital admission were retrospectively analyzed alongside 52 control subjects with normal neurological findings. CSF-related signals were segmented from the ventricular system and sub-lentiform nucleus parenchyma, including perivascular spaces (PVSs), using semi-automated methods on axial T2-weighted images. Two hybrid models (DenseASPP-RadFusion and MobileASPP-RadFusion), fusing radiomics and DL features, were developed and benchmarked against base DL architectures (DenseNet-201 and MobileNet-V3Large) via 5-fold nested cross-validation. Radiomics features were extracted from both original and Laplacian of Gaussian–filtered MRI data.

Results: In the sub-lentiform nucleus parenchyma, the hybrid DenseASPP-RadFusion model achieved superior classification performance (accuracy: 78.57 ± 4.76%, precision: 84.09 ± 3.31%, F1-score: 76.12 ± 6.86%), outperforming its corresponding base models. Performance was notably lower in ventricular system analyses across all models. Radiomics features derived from fine-scale filtered images exhibited the highest discriminatory power. A strict, clinically motivated patient-wise classification strategy confirmed the sub-lentiform nucleus region as the most reliable anatomical target for distinguishing infected from non-infected CSF.

Discussion: This study introduces a robust and interpretable MRI-based deep learning–radiomics pipeline for CNSI classification, with promising diagnostic potential. The proposed framework may offer a noninvasive alternative to LP in selected cases, particularly by leveraging CSF signal alterations in PVS-adjacent parenchymal regions. These findings establish a foundation for future multicenter validation and integration into clinical workflows.

1 Introduction

Central nervous system infections (CNSIs) are neurological emergencies that demand prompt and accurate diagnosis to reduce morbidity and mortality. The gold standard for confirming CNSI involves isolating the microbial agent or detecting its antigen in cerebrospinal fluid (CSF), typically via culture or polymerase chain reaction (PCR) analysis following lumbar puncture (LP) (1, 2). However, in clinical practice, the turnaround time for these methods is often inadequate for urgent decision-making. As such, CSF pleocytosis observed on microscopy is frequently used as a proxy to initiate empirical therapy with antibiotics, antivirals, or antifungals (3). Yet, reactive or false-positive pleocytosis may occur—particularly following initial LPs or in immunocompromised patients—raising concerns about overtreatment and diagnostic uncertainty (1).

Furthermore, LP is an invasive procedure with contraindications, including the presence of intracranial mass lesions, bleeding diathesis, spinal malformations, or local infections at the puncture site (2). These factors highlight the need for reliable, noninvasive, and rapid diagnostic tools to support or replace traditional CSF sampling in specific clinical contexts.

MRI plays a vital complementary role in the evaluation of CNSI. Certain imaging patterns—such as asymmetric involvement of the temporal lobe, insula, and cingulum in herpes encephalitis; leptomeningeal enhancement in meningitis; or abscess formation and tuberculous granulomas—may suggest an infectious etiology (4). Nonetheless, normal MRI findings do not exclude infection, and the sensitivity of MRI for viral and bacterial meningitis ranges between 67.4 and 83.3% (57). Therefore, neuroimaging alone is insufficient, and there is an urgent demand for advanced image analysis tools that can extract diagnostic information beyond the visual capabilities of radiologists.

Radiomics addresses this gap by converting conventional medical images into high-dimensional quantitative data, capturing subtle image patterns such as intensity, texture, shape, and spatial relationships (810). These handcrafted features have shown promise in multiple domains, but their performance can be enhanced when fused with deep learning (DL)–derived features. DL models can automatically learn abstract, hierarchical representations from imaging data, offering complementary insights into disease phenotypes.

Recent studies have demonstrated the efficacy of DL–radiomics fusion models specifically within neurology, such as multimodal neuroimaging feature learning for Alzheimer’s disease diagnosis (11), deep radiomic analysis of MRI data for Alzheimer’s disease classification (12), and fusion of MRI and cognitive assessments for mild cognitive impairment diagnostics (13). Similarly, these approaches have shown promise in distinguishing multiple sclerosis lesions (14) and differentiating Parkinson’s disease patients from healthy individuals using radiomic features from MRI (15) and PET imaging (16). Additionally, deep learning radiomic frameworks have been effectively used for predicting hemorrhage progression in intracerebral hemorrhage (17), forecasting outcomes after acute ischemic stroke (18), and diagnosing temporal lobe epilepsy through FDG-PET imaging (19).

Despite the growing interest in end-to-end deep learning pipelines, current evidence suggests that combining DL with handcrafted radiomics yields more interpretable and robust results especially in datasets with limited sample sizes (2022). Consequently, standardization initiatives now recommend best practices for preprocessing, feature selection, and model validation to improve reproducibility across institutions (23).

In this study, we propose a hybrid DL–radiomics framework for classifying infected versus non-infected CSF regions in patients with suspected CNSI. We focus on two anatomical targets: the ventricular system and the sub-lentiform nucleus parenchyma, including the perivascular spaces (PVSs), which are implicated in glymphatic CSF circulation. We hypothesize that the fusion of radiomic descriptors and DL-based spatial features can enable noninvasive discrimination of CSF infection patterns, thereby supporting earlier diagnosis and potentially reducing the reliance on lumbar puncture.

2 Methods and materials

2.1 Patient

The local ethics committee approved this retrospective study, and written consent was waived.

This retrospective study included patients diagnosed with CNSI who underwent brain MRI as part of their routine clinical work-up between 2017 and 2024. Fifty-two patients in the infection group were diagnosed with acute bacterial, viral and aseptic meningitis based on a combination of clinical presentation (e.g., fever, headache, neck stiffness), CSF analysis, and microbiological testing. Importantly, none of the included patients met the diagnostic criteria for encephalitis or meningoencephalitis, and there were no findings suggestive of parenchymal involvement (such as diffusion restriction, edema, or signal abnormalities and contrast enhancement in the brain parenchyma) on MRI. Mild to moderate leptomeningeal enhancement was observed in the majority of cases on post-contrast T1-weighted images, which was consistent with active meningeal inflammation. No significant ventriculitis, abscess formation, or hydrocephalus was detected. Clinically, patients presented primarily with headache and fever, and none exhibited focal neurological deficits, altered mental status, or seizures at the time of imaging. This strict inclusion criterion ensured a clinically and radiologically homogeneous infection cohort, thereby allowing a focused evaluation of CSF-related signal features in isolated meningitis and minimizing potential confounding from parenchymal disease.

All patients diagnosed with CNSI underwent an LP on the day of admission and had brain MRIs performed within the first 48 h after being admitted to the hospital. We excluded patients who did not undergo LP, had no brain MRI, had MRIs taken more than 48 h after treatment commenced.

The control group consisted of 52 patients with chronic headaches with normal neurological examinations and normal brain MRI reports. A total of 104 patients, including both the patient and control groups, were included in the analysis.

2.2 Imaging parameters

All brain MRIs were performed on a Philips 3 T imaging system with a dedicated head coil. All studies included axial plane fat-saturated fast spin eco T2-weighted sequence with time repetition (TR): 2,600–5,600 millisecond (ms), time echo (TE): 70–90 ms, echo train length (ETL):10–12. The slice thickness was 5 millimeters (mm). To accurately evaluate subtle cerebrospinal fluid (CSF)-specific signal alterations and to minimize inadvertent segmentation errors arising from CSF flow artifacts, pre-contrast T2-weighted images were exclusively utilized in this study. T2-weighted imaging was selected for its inherent sensitivity and superior contrast resolution regarding fluid characteristics, enabling precise and artifact-aware segmentation of CSF regions. On the other hand, sequences such as T1-weighted, post-contrast T1-weighted, FLAIR, and diffusion-weighted images (DWI) were deliberately excluded. T1-weighted and post-contrast sequences primarily emphasize anatomical structures and contrast-enhanced parenchymal or meningeal lesions, providing limited utility in isolated CSF analysis without parenchymal involvement. Likewise, FLAIR imaging suppresses CSF signals, inherently limiting its applicability for dedicated CSF signal assessment. DWI is particularly sensitive to acute parenchymal lesions, but since our study specifically excluded patients with parenchymal abnormalities, its inclusion was not considered beneficial. Since no 3D modeling was employed in our study, the slice thickness of 5 mm did not constitute a significant limitation for our analysis. This selective approach ensured methodological consistency and enhanced reliability in analyzing isolated CSF-related radiomic and deep learning features.

2.3 Semi-automated segmentation procedure

Upon consensus, two independent radiologists determined the slices in the axial planes of T2-weighted images. Subsequently, MRI images were stored in the DICOM file format and imported to the ManSeg (v.2.7d) software (24). Initially, the radiologists focused on segmenting the CSF signal in both the upper and posterior sections of the lateral ventricles’ lumen, avoiding areas with visible flow artifacts. Next, to reduce the risk of missing any subtle, instantaneous changes in the normal CSF flow signal, they separately segmented the parenchyma of the sub-lentiform nucleus, which includes the perivascular spaces (PVSs) supplied by the lenticulostriate arteries. Sub-lentiform nucleus parenchyma with the PVSs would effectively represent the features of the CSF, including its contents. For each patient, the lateral ventricles’ lumen and the parenchyma of the sub-lentiform nucleus were segmented bilaterally. For the segmentation of suspicious regions, the radiologists roughly delineated the boundaries of the regions of interest independently, and the segmentation process was then performed automatically using the active contour algorithm (25). Final consensus segmentation masks were obtained after resolving discrepancies through joint review. Inter-observer agreement was assessed retrospectively on a randomly selected subset of 10 patients. Mean Dice similarity coefficients were 0.92 ± 0.03 for ventricular regions and 0.91 ± 0.04 for sub-lentiform parenchyma. Figure 1 depicts samples of infected CSF and normal CSF on T2-weighted images, respectively.

Figure 1
MRI images comparing normal and infected cerebrospinal fluid in the ventricular system and sub-lentiform nucleus. The ventricular system images display red-highlighted regions in the right and left hemispheres under normal and infected conditions. The sub-lentiform nucleus images show red-highlighted areas under the same conditions, with close-ups for detailed comparison. Panels are labeled by condition and hemisphere.

Figure 1. Segmented anatomical regions from the ventricular system and the sub-lentiform nucleus in both right and left hemispheres. The top row illustrates two representative cases from the control group with normal CSF, while the bottom row presents two cases from the CNSI group with infected CSF. Red-highlighted regions indicate the manually segmented areas used for radiomics feature extraction. The bounding boxes were generated as standardized input patches for deep learning models. All images are derived from T2-weighted MRI sequences. CNSI, Central Nervous System Infection; CSF, Cerebrospinal fluid.

2.4 Feature extraction

Radiomics features were extracted from the segmented regions on both the native T2-weighted MRI images and three Laplacian-of-Gaussian (LoG)–filtered counterparts generated with kernel sizes of 3 × 3 × 1 (fine), 5 × 5 × 2 (medium), and 7 × 7 × 3 (coarse). While 2D morphological features were derived solely from the original T2 images, both first-order and second-order statistical features, including those from gray level co-occurrence matrix (GLCM), gray level size zone matrix (GLSZM), gray level run length matrix (GLRLM), neighboring gray-tone difference matrix (NGTDM), and gray level dependence matrix (GLDM) were extracted from all image sources. A comprehensive list of the extracted features is presented in Table 1, comprising a total of 378 features.

Table 1
www.frontiersin.org

Table 1. The description and the total number of radiomics features.

2.5 Classification methodology

First, the region of interest (ROI) images and their corresponding radiomics features were imported. For the deep-learning analysis, each segmented region was centrally cropped into a 32 × 32 pixel patch, which was then resized to 224 × 224 pixels using bicubic interpolation. A patient-based 5-fold cross-validation (CV) approach was employed, ensuring that each patient’s ROI images and associated radiomics data remained grouped during the splitting process. One-fold was allocated as the test set, while the remaining folds were used for training and validation. Feature selection was conducted solely on the radiomics features derived from the training and validation sets. From a total of 378 radiomics features, the top 50 most discriminative features were selected using a filter-based approach. Subsequently, the training and validation sets were split into an internal 3-fold cross-validation (CV) to divide them into training and validation subsets further. Data augmentation techniques, including rotation, zooming, translation, and flipping, were applied to enhance the diversity of the training data.

In our preliminary analyses, we evaluated several advanced architectures, including Swin Transformer, Vision Transformer (ViT), and attention-based networks. However, these approaches yielded poor performance and instability due to the relatively limited size of our dataset. Therefore, DenseNet-201 and MobileNet-V3Large were selected as robust baseline architectures, given their known ability to generalize well on smaller datasets and their compatibility with our hybrid feature fusion strategy.

Model training was conducted in two phases. For the first outer fold, both the customized models DenseASPP-RadFusion and MobileASPP-RadFusion and the base models DenseNet-201 (26) and MobileNet-V3Large (27) were initialized from scratch. For the remaining folds, the weights from the previous fold were loaded to continue training. During the initial training phase, the learning rate was set to 1e-4 with a reduction factor of 0.5 and a minimum learning rate of 1e-7. Training proceeded for up to 200 epochs, with early stopping implemented after 10 epochs. During the fine-tuning phase, the learning rate was reduced to 1e-5, and the first 70% of the layers were frozen. Training was conducted for 20 epochs, with early stopping triggered after five epochs. These hyperparameters were empirically determined based on iterative experimentation within the internal training-validation splits to minimize overfitting. No hyperparameter tuning was performed on the external test sets. Throughout the process, training and validation loss, as well as accuracy metrics, were monitored. At the end of each fold, model weights and performance metrics were saved. During the testing phase, the feature selection obtained from the outer fold was applied to the test set, and model performance was evaluated using standard metrics, including accuracy, precision, recall, and F1-score. Finally, the results from all five folds were reported as mean ± standard deviation for each performance metric. Figure 2 depicts the flowchart of the classification process.

Figure 2
Flowchart depicting a machine learning workflow for radiomics features. It starts with importing samples, followed by patient-wise and train-validation processes. Key steps include data augmentation, model building, initial training with learning rate schedules, and fine-tuning. Evaluation and storage of performance metrics and models occur iteratively through cross-validation folds until completion. The process concludes with storing results and models.

Figure 2. Overview of the proposed classification framework.

In the baseline architecture, models such as DenseNet-201 and MobileNet-V3Large were employed as feature extractors. These base models processed the input MRI images to generate feature maps, which were subsequently passed through a global average pooling layer, followed by a fully connected layer with 256 neurons and a dropout layer (rate = 0.3), leading directly to the classification output. In contrast, the proposed fusion models were designed to integrate both deep image features and handcrafted radiomics features. In the image branch of the proposed models, the backbone feature map was processed through five parallel paths. Four of these paths constituted the Atrous Spatial Pyramid Pooling (ASPP) module, employing 3 × 3 convolutions with dilation rates of 1, 6, 12, and 18, each followed by batch normalization and ReLU activation, producing four parallel 7 × 7 × 512 feature maps. The fifth path was designed to inject global contextual information by applying global average pooling to the backbone feature map (resulting in 1 × 1 × 1920), followed by a 1 × 1 convolution with 512 filters, and then bilinear upsampling to reach a size of 7 × 7 × 512. All five outputs were concatenated to form a unified representation of size 7 × 7 × 2,560 and then compressed via a 1 × 1 convolution with 512 filters.

In parallel to the image pathway, radiomics features were processed through a separate branch. A total of 378 radiomics features were extracted and reduced to 50 using filter-based feature selection. These selected features passed through two fully connected layers [Dense (128) and Dense (512)] with dropout, reshaped into a 1 × 1 × 512 tensor and then upsampled to 7 × 7 × 512 to match the spatial resolution of the image features. Finally, the outputs from both the image and radiomics branches were concatenated along the channel axis, forming a 7 × 7 × 1,024 fused representation. This combined feature map was subjected to global average pooling, followed by a Dense (256) layer with dropout, and terminated with a softmax classification layer. This architecture effectively captured both spatial and contextual information from MRI data, enriched by complementary radiomics descriptors. As illustrated in Figure 3, the proposed model architecture integrates both ASPP-enhanced image features and spatially fused radiomics features. The implementation code for the proposed MRI-based deep learning–radiomics framework is publicly available at: https://github.com/DrGokalpTulum/MRI-Based-Deep-Learning-Radiomics-Framework-for-Evaluating-Cerebrospinal-Fluid-Signal-.git.

Figure 3
Comparison diagram showing the structure of base and proposed models. The base model uses DenseNet-201 and MobileNet-V3Large with global average pooling and a dense layer leading to class output. The proposed model includes multiple 3x3 convolution layers with different dilation rates, concatenated and passed through a 1x1 convolution. Radiomics features undergo feature selection before combining with the model output for class prediction.

Figure 3. Schematic representation of the baseline and proposed model architectures.

3 Results

In the CNSI group, 55.7% (n = 29) of the patients were male, 44.3% (n = 23) were female, and the mean age was 43.5 ± 22.5 years. In the control group, 33.9% (n = 18) of the patients were male, 66.1% (n = 34) were female, and the mean age was 46.7 ± 11 years.

The CSF analysis was performed on the patient’s admission to the health institution. The macroscopic appearance of the CSF, the amount of CSF glucose and protein, pleocytosis in microscopy, and the presence of microorganisms in the Gram stain were evaluated. High CSF protein, low glucose, leukocyte count of 100 or more cells/mm3, and neutrophil predominance are evaluated as bacterial meningitis; normal CSF glucose, borderline high protein levels, and lymphocytes being the predominant cell in the cell count were evaluated as viral meningitis; normal CSF findings were accepted as aseptic meningitis.

According to early biochemical and microscopy results, bacterial meningitis was observed in 37 patients, viral meningitis in 14 patients, and CSF findings of 1 patient were evaluated as aseptic meningitis. While no culture medium growth was detected in the CSF of 24 patients, Streptococcus was detected in 5 patients, E. coli in 3 patients, Brucella in 2 patients, Acinetobacter in 1 patient, Neisseria in 1 patient, and Proteus in 1 patient, according to CSF culture results. Varicella Zoster Virus PCR positivity was detected in the CSF of two patients. Based on clinical and laboratory results in the patient group, antimicrobial treatment for CNSI was empirically started. After the diagnosis of the agent was confirmed by culture, PCR, and serology, treatment revision was performed with de-escalation in three patients.

During the 5-fold outer cross-validation, a total of 378 radiomics features were subjected to feature selection, and the top 50 features were retained in each fold. Across all folds, a total of 92 unique features were selected. Among these, 20 features were consistently selected in all five folds, indicating strong discriminative capacity. These high-frequency features primarily originated from the Laplacian of Gaussian (LoG) filtered MRI with fine kernels (2 mm). In particular, features such as Energy, Maximum, Range, Long Run Emphasis, and High Gray Level Zone Emphasis repeatedly appeared across all folds.

Additionally, 16 features appeared in four folds and five features in three folds, most of which stemmed from LoG-filtered MRI with medium kernels (4 mm) or original T2-weighted images. These consistently selected features highlight the critical role of multiscale texture descriptors in capturing the heterogeneity of cerebrospinal fluid regions. Detailed feature selection results, including Feature Name, Image Source, Feature Class, and Frequency, are provided in the Supplementary file.

Upon investigating the classification results, the proposed fusion models (DenseASPP-RadFusion and MobileASPP-RadFusion) demonstrate improvements over their corresponding base architectures (DenseNet-201 and MobileNet-V3Large) in the sub-lentiform nucleus parenchyma region. DenseASPP-RadFusion achieved the highest mean accuracy (78.57 ± 4.76%) and precision (84.09 ± 3.31%), with relatively low standard deviations, indicating both high performance and consistency across folds. Although MobileASPP-RadFusion yielded the highest mean recall (77.05 ± 14.82%), the associated standard deviation was relatively large, suggesting instability in sensitivity across different validation folds.

In contrast, none of the models showed strong classification capability in the ventricular system. Accuracy values remained between 57.26 and 60.52%, while F1-scores were notably lower, particularly for DenseASPP-RadFusion (53.46 ± 11.04%) and DenseNet-201 (49.37 ± 15.32%). Moreover, the standard deviations in recall for all models were high (ranging from 12.94 to 20.66%), indicating a lack of reliability in detecting true positives in ventricular-level CSF signals.

The results show that the sub-lentiform nucleus parenchyma with PVSs provides more stable and discriminative information for classification tasks compared to the ventricular system. The performance of the models was statistically significantly different (p < 0.05). Detailed performance metrics for all models and anatomical regions are presented in Table 2, while the corresponding ROC curves are illustrated in Figure 4.

Table 2
www.frontiersin.org

Table 2. Performance metrics (mean ± std) of all models.

Figure 4
Two graphs showing average ROC curves with standard deviation for models in different brain areas. The top graph is for the sub-lentiform nucleus parenchyma, while the bottom is for the ventricular system. Both graphs feature spatial_densenet, spatial_mobilenet, std_densenet, and std_mobilenet models, each with respective AUC scores and standard deviations shown in the legend. The true positive rate is plotted against the false positive rate.

Figure 4. Mean ROC curves with standard deviation (shaded regions) for each model across the 5-fold cross-validation. The upper plot illustrates results for the sub-lentiform nucleus parenchyma with PVSs. The lower plot presents results for the ventricular system. Legend entries include average AUC ± standard deviation for each model.

To complement the fold-level evaluation, patient-wise classification performance was also assessed under clinically motivated assumptions. To assess patient-level diagnostic performance under clinical assumptions, strict patient-wise accuracy was calculated separately for each class (infection and control) across all outer folds. Since each patient had two separate ROIs from the right and left sub-lentiform nucleus levels, the following decision rules were applied: for infection cases (class 1), a prediction was considered correct if at least one of the two ROIs was classified as infected, reflecting a clinically cautious approach to minimize false negatives. Conversely, for control cases (class 0), a prediction was deemed correct only if both ROIs were classified as non-infected, ensuring stricter criteria for healthy labeling. This binary patient-wise accuracy was computed per case and averaged within each fold for all models.

Figure 5 presents box plots illustrating the distribution of strict patient-wise accuracy values for each model, separately for the sub-lentiform nucleus parenchyma and ventricular system. In the sub-lentiform nucleus parenchyma with PVSs, the proposed model DenseASPP-RadFusion yielded the most stable and accurate performance, with infection class accuracies tightly clustered within the 80 to 90% interquartile range and control accuracies between 70 and 80%, both showing low interfold variability. Similarly, DenseNet-201 achieved high median values, though with a slightly wider spread in the control group. Notably, both models exhibited limited presence of outliers, suggesting consistency in predictions across patient subsets.

Figure 5
Two box plot charts display strict clinical patient-wise accuracy by class, comparing infection and control groups across four models: DenseASPP-RadFusion, MobileASPP-RadFusion, DenseNet-201, and MobileNet-V3Large. The top chart is labeled

Figure 5. Box plots illustrating strict clinical patient-wise accuracy for CNSI and control classes across all models, evaluated separately for the sub-lentiform nucleus parenchyma (top) and the ventricular system (bottom). CNS, Central nervous system infection.

On the other hand, MobileNet-V3Large exhibited high variability and lower median accuracy, particularly for control patients. Its control group performance distribution dropped to a lower interquartile range (below 60%) and revealed several outliers, reflecting instability across folds. MobileASPP-RadFusion demonstrated acceptable median accuracy but higher dispersion, particularly in control cases, indicating less consistent generalization across folds.

In the ventricular system, all models demonstrated lower and more dispersed accuracy distributions, indicating reduced reliability in this anatomical region. For instance, although MobileNet-V3Large achieved reasonable infection accuracy, its control classification remained weak and inconsistent. MobileASPP-RadFusion and DenseNet-201 exhibited moderate accuracy with noticeably higher standard deviations, particularly in control predictions, highlighting the challenge of robust CSF signal interpretation in ventricular regions. The broader interquartile ranges and frequent outliers in the ventricular plots underscore the inconsistency of model behavior in this region. These findings further reinforce that the sub-lentiform nucleus parenchyma with PVSs provides a more clinically reliable classification, both at the level of the fold and the patient.

Under this realistic criterion, the proposed fusion models demonstrated high stability and accuracy, particularly in the sub-lentiform nucleus. By contrast, all models showed reduced and inconsistent performance in ventricular CSF classification, further underscoring the diagnostic limitations of relying solely on ventricular analysis. Discordant predictions between left and right sub-lentiform nucleus evaluations occurred in 22.6 ± 3.1% for DenseASPP-RadFusion, 30.4 ± 6.0% for MobileASPP-RadFusion, 29.9 ± 4.7% for DenseNet-201, and 39.5 ± 5.9% for MobileNet-V3Large, indicating varying levels of stability in bilateral predictions.

4 Discussion

In this study, we developed and evaluated a novel MRI-based deep learning–radiomics framework to classify CSF signals in patients with acute CNSIs. Our findings demonstrate that the fusion of handcrafted radiomic descriptors with DL features enables more accurate and reliable classification of infected versus non-infected CSF, particularly when analyzing the sub-lentiform nucleus parenchyma region. These results offer promising evidence for the utility of noninvasive imaging-based diagnostics as a potential complement or alternative to LP in selected clinical contexts.

Despite their central role in CNSI diagnosis, CSF analyses via LP remain invasive and carry procedural risks, including herniation, hemorrhage, or infection—especially in patients with intracranial mass lesions or bleeding disorders (13). Moreover, pleocytosis, often used as a surrogate marker of infection, may occasionally yield false-positive results, especially after repeated LPs or in immunocompromised individuals (1). These limitations necessitate the development of alternative diagnostic strategies that are rapid, noninvasive, and reproducible.

While MRI has proven valuable in detecting certain CNSI patterns—such as temporal lobe involvement in herpes encephalitis or leptomeningeal enhancement in meningitis—it lacks sufficient sensitivity to reliably detect all cases, particularly in early or ambiguous presentations (47). In our study, conventional visual inspection of ventricular CSF signals on MRI did not provide sufficient discriminatory power to distinguish infected from non-infected fluid. This is likely due to the inherent signal homogeneity and dynamic flow of CSF in the ventricles, which limits the effectiveness of static image-based analysis.

Indeed, previous AI-based studies evaluating body fluid segmentation—such as pleural or synovial effusions—have reported promising results (28, 29). However, these studies primarily focused on relatively static fluids that exhibit well-defined boundaries and textural consistency. CSF, on the other hand, is in constant motion, and its flow-dependent signal properties pose substantial challenges for conventional image segmentation and classification.

To address these limitations, our study focused on the sub-lentiform nucleus parenchyma, specifically targeting regions that include perivascular spaces (PVSs)—components of the glymphatic system that mediate convective CSF flow from penetrating arteries into the interstitial space. Unlike the ventricular system, these parenchymal regions are less affected by flow artifacts and may reflect more stable and informative imaging features. Additionally, inflammation in adjacent brain parenchyma during CNSI—though often invisible on routine MRI—may alter tissue texture and contribute to detectable radiomic changes.

Our results strongly support this hypothesis. The hybrid DenseASPP-RadFusion model, which integrates multiscale radiomics with spatially resolved DL features, achieved a mean classification accuracy of 78.6% in the sub-lentiform nucleus region—substantially outperforming both its base architecture (DenseNet-201) and all models applied to the ventricular system. Features derived from Laplacian of Gaussian (LoG)–filtered images, particularly with fine kernels (2 mm), contributed most significantly to model performance, suggesting that subtle intensity variations in the CSF-parenchyma interface are key discriminative elements.

Furthermore, we applied a clinically grounded, strict patient-wise classification strategy, wherein a diagnosis of infection was accepted if either hemisphere exhibited an infected CSF pattern, while a control classification required bilateral confirmation of non-infection. Under this realistic criterion, the proposed fusion models demonstrated high stability and accuracy, particularly in the sub-lentiform nucleus. By contrast, all models showed reduced and inconsistent performance in ventricular CSF classification, further underscoring the diagnostic limitations of relying solely on ventricular analysis.

The broader implication of our findings lies in the potential of hybrid DL–radiomics frameworks to improve CNSI diagnosis in settings where LP is delayed, contraindicated, or inconclusive. To our knowledge, this is the first study to apply a deep learning–radiomics fusion approach to analyze CSF signal patterns in brain MRI for the classification of CNSIs. Prior applications of AI to fluid-based diagnostics have largely centered around cancer-related effusions or synovial fluid segmentation in rheumatology (28, 29), whereas our study opens new directions for infectious disease imaging.

DenseNet-based models consistently outperformed MobileNet-based models across most performance metrics, likely due to their deeper and densely connected architectures enabling effective feature reuse and robust representation learning. Conversely, MobileNet’s design prioritizes computational efficiency and fewer parameters, potentially limiting its capability to capture subtle radiomic patterns. Thus, DenseNet architectures may be preferable for tasks demanding detailed representation of subtle imaging features, whereas MobileNet remains beneficial under computational constraints.

Nevertheless, our study has limitations. The relatively modest sample size (n = 104) and single-center design may limit generalizability. However, all MRIs were acquired using a uniform 3 T scanner and standardized imaging protocol, enhancing internal consistency. Future research should validate these findings using multicenter datasets with larger, more diverse populations and include longitudinal evaluation across various CNSI subtypes (e.g., bacterial, viral, fungal). Additionally, the integration of clinical metadata (e.g., laboratory markers, symptoms) with imaging features may further improve classification performance. Moreover, we used a slice thickness of 5 mm, which is relatively thicker than the thin-cut images (≤3 mm) typically preferred in current brain MRI research. Although this could potentially limit the segmentation accuracy and reliability in studies utilizing 3D modeling approaches, our analyses and segmentations were strictly performed on 2D images, reducing its impact within our study context. Future studies using thinner slice imaging might offer further improvements in segmentation detail and predictive performance. Future studies could further enhance the clinical impact and interpretability of the proposed fusion models by incorporating explainable AI (XAI) methodologies to identify and visualize the most influential radiomic and deep-learning-derived features. Integrating these techniques would significantly strengthen model transparency, improve clinical confidence, and facilitate a smoother translation into clinical practice.

In conclusion, our study introduces a novel, interpretable, and clinically relevant framework for noninvasive CNSI assessment using advanced radiomics and deep learning methods. The sub-lentiform nucleus parenchyma, inclusive of PVSs, emerges as a promising anatomical region for CSF evaluation. This approach has the potential to complement traditional LP-based diagnostics and support faster, safer, and more accurate CNSI management in clinical practice.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Clinical Research Ethics Committee of the University of Health Sciences, Gülhane Training and Research Hospital (Decision No.: 2023/18–amendment; Approval Date: 15 March 2023). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

FC: Conceptualization, Data curation, Resources, Writing – original draft. GT: Conceptualization, Methodology, Software, Visualization, Writing – original draft. MI: Data curation, Resources, Writing – original draft. MJ: Software, Writing – review & editing. GG: Conceptualization, Writing – review & editing. ÖK: Conceptualization, Project administration, Validation, Writing – review & editing. NB: Data curation, Resources, Validation, Writing – review & editing. BÖ: Data curation, Resources, Writing – review & editing. ÜS: Data curation, Validation, Writing – original draft. SŞ: Data curation, Resources, Validation, Writing – review & editing. AK: Conceptualization, Writing – review & editing. ET: Writing – review & editing. OO: Methodology, Software, Writing – review & editing. JR: Methodology, Software, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1659653/full#supplementary-material

References

1. Troendle, M, and Pettigrew, A. A systematic review of cases of meningitis in the absence of cerebrospinal fluid pleocytosis on lumbar puncture. BMC Infect Dis. (2019) 19:692. doi: 10.1186/s12879-019-4204-z

PubMed Abstract | Crossref Full Text | Google Scholar

2. Wright, BLC, Lai, JTF, and Sinclair, AJ. Cerebrospinal fluid and lumbar puncture: a practical review. J Neurol. (2012) 259:1530–45. doi: 10.1007/s00415-012-6413-x

PubMed Abstract | Crossref Full Text | Google Scholar

3. Bedetti, L, Marrozzini, L, Baraldi, A, Spezia, E, Iughetti, L, Lucaccioni, L, et al. Pitfalls in the diagnosis of meningitis in neonates and young infants: the role of lumbar puncture. J Matern Fetal Neonatal Med. (2019) 32:4029–35. doi: 10.1080/14767058.2018.1481031

PubMed Abstract | Crossref Full Text | Google Scholar

4. Nguyen, I, Urbanczyk, K, Mtui, E, and Li, S. Intracranial CNS infections: a literature review and radiology case studies. Semin Ultrasound CT MR. (2020) 41:106–20. doi: 10.1053/j.sult.2019.09.003

PubMed Abstract | Crossref Full Text | Google Scholar

5. Raza, MA, Tufail, M, Altuf, L, Chaudhary, K, Ghazanfar, S, Bukhari, SKA, et al. Diagnostic accuracy of magnetic resonance imaging for central nervous system associated infectious diseases. J Health Rehabil Res. (2024) 4:1–4. doi: 10.61919/jhrr.v4i3.1096

Crossref Full Text | Google Scholar

6. Kralik, SF, Vallejo, JG, Kukreja, MK, Salman, R, Orman, G, Huisman, TAGM, et al. Diagnostic accuracy of MRI for detection of meningitis in infants. AJNR Am J Neuroradiol. (2022) 43:1350–5. doi: 10.3174/ajnr.A7610

PubMed Abstract | Crossref Full Text | Google Scholar

7. Vaswani, AK, Nizamani, WM, Ali, M, Aneel, G, Shahani, BK, and Hussain, S. Diagnostic accuracy of contrast-enhanced FLAIR magnetic resonance imaging in diagnosis of meningitis correlated with CSF analysis. Indus J Biosci Res. (2025) 3. doi: 10.70749/ijbr.v3i5.1344

Crossref Full Text | Google Scholar

8. Van Timmeren, JE, Cester, D, Tanadini-Lang, S, Alkadhi, H, and Baessler, B. Radiomics in medical imaging—“how-to” guide and critical reflection. Insights Imag. (2020) 11:91. doi: 10.1186/s13244-020-00887-2

PubMed Abstract | Crossref Full Text | Google Scholar

9. Jing, R, Wang, J, Li, J, Wang, X, Li, B, Xue, F, et al. A wavelet features derived radiomics nomogram for prediction of malignant and benign early-stage lung nodules. Sci Rep. (2021) 11:22330. doi: 10.1038/s41598-021-01470-5

PubMed Abstract | Crossref Full Text | Google Scholar

10. Xia, C, Zuo, M, Lin, Z, Deng, L, Rao, Y, Chen, W, et al. Multimodal deep learning fusing clinical and Radiomics scores for prediction of early-stage lung adenocarcinoma lymph node metastasis. Acad Radiol. (2025) 32:2977–89. doi: 10.1016/j.acra.2024.12.018

PubMed Abstract | Crossref Full Text | Google Scholar

11. Liu, S, Liu, S, Cai, W, Che, H, Pujol, S, Kikinis, R, et al. Multimodal neuroimaging feature learning for multiclass diagnosis of Alzheimer’s disease. IEEE Trans Biomed Eng. (2015) 62:1132–40. doi: 10.1109/TBME.2014.2372011

PubMed Abstract | Crossref Full Text | Google Scholar

12. Chaddad, A, Desrosiers, C, and Niazi, T. Deep radiomic analysis of MRI related to Alzheimer’s disease. IEEE Access. (2018) 6:58213–21. doi: 10.1109/ACCESS.2018.2871977

Crossref Full Text | Google Scholar

13. Qiu, S, Chang, GH, Panagia, M, Gopal, DM, Au, R, and Kolachalama, VB. Fusion of deep learning models of MRI scans, Mini-mental state examination, and logical memory test enhances diagnosis of mild cognitive impairment. Alzheimer’s Dementia: Diagnosis, Assess Dis Monitor. (2018) 10:737–49. doi: 10.1016/j.dadm.2018.08.013

PubMed Abstract | Crossref Full Text | Google Scholar

14. Ye, Z, George, A, Wu, AT, Niu, X, Lin, J, Adusumilli, G, et al. Deep learning with diffusion basis spectrum imaging for classification of multiple sclerosis lesions. Annals Clin Translational Neurol. (2020) 7:695–706. doi: 10.1002/acn3.51037

PubMed Abstract | Crossref Full Text | Google Scholar

15. Wu, Y, Jiang, JH, Chen, L, Lu, JY, Ge, JJ, Liu, FT, et al. Use of radiomic features and support vector machine to distinguish Parkinson’s disease cases from normal controls. Annals Translational Med. (2019) 7:773. doi: 10.21037/atm.2019.11.26

PubMed Abstract | Crossref Full Text | Google Scholar

16. Sun, X, Ge, J, Li, L, Zhang, Q, Lin, W, Chen, Y, et al. Use of deep learning-based radiomics to differentiate Parkinson’s disease patients from normal controls: a study based on [18F]FDG PET imaging. Eur Radiol. (2022) 32:8008–18. doi: 10.1007/s00330-022-08799-z

PubMed Abstract | Crossref Full Text | Google Scholar

17. Song, L, Zhou, H, Guo, T, Qiu, X, Tang, D, Zou, L, et al. Predicting hemorrhage progression in deep intracerebral hemorrhage: a multicenter retrospective cohort study. World Neurosurg. (2023) 170:e387–401. doi: 10.1016/j.wneu.2022.11.022

PubMed Abstract | Crossref Full Text | Google Scholar

18. Hilbert, A, Ramos, LA, van Os, HJA, Olabarriaga, SD, Tolhuisen, ML, Wermer, MJH, et al. Data-efficient deep learning of radiological image data for outcome prediction after endovascular treatment of patients with acute ischemic stroke. Comput Biol Med. (2019) 115:103516. doi: 10.1016/j.compbiomed.2019.103516

PubMed Abstract | Crossref Full Text | Google Scholar

19. Zhang, Q, Liao, Y, Wang, X, Zhang, T, Feng, J, Deng, J, et al. A deep learning framework for 18F-FDG PET imaging diagnosis in pediatric patients with temporal lobe epilepsy. Eur J Nucl Med Mol Imaging. (2021) 48:2476–85. doi: 10.1007/s00259-020-05108-y

PubMed Abstract | Crossref Full Text | Google Scholar

20. Xie, C, Yu, X, Tan, N, Zhang, J, Su, W, Ni, W, et al. Combined deep learning and radiomics in pretreatment radiation esophagitis prediction for patients with esophageal cancer underwent volumetric modulated arc therapy. Radiother Oncol. (2024) 199:110438. doi: 10.1016/j.radonc.2024.110438

PubMed Abstract | Crossref Full Text | Google Scholar

21. Cheng, C, Wang, Y, Zhao, J, Wu, D, Li, H, and Zhao, H. Deep learning and Radiomics in triple-negative breast Cancer: predicting long-term prognosis and clinical outcomes. J Multidiscip Healthc. (2025) 18:319–27. doi: 10.2147/JMDH.S509004

PubMed Abstract | Crossref Full Text | Google Scholar

22. Cè, M, Chiriac, MD, Cozzi, A, Macrì, L, Rabaiotti, FL, Irmici, G, et al. Decoding Radiomics: a step-by-step guide to machine learning workflow in handcrafted and deep learning Radiomics studies. Diagnostics (Basel). (2024) 14:2473. doi: 10.3390/diagnostics14222473

PubMed Abstract | Crossref Full Text | Google Scholar

23. Cuce, F, Tulum, G, Yılmaz, KB, Osman, O, and Aralasmak, A. Radiomics method in the differential diagnosis of diabetic foot osteomyelitis and charcot neuroarthropathy. BJR. (2023) 96:20220758. doi: 10.1259/bjr.20220758

PubMed Abstract | Crossref Full Text | Google Scholar

24. Chan, TF, and Vese, LA. Active contours without edges. IEEE Trans on Image Process. (2001) 10:266–77. doi: 10.1109/83.902291

PubMed Abstract | Crossref Full Text | Google Scholar

25. Huang, G, Liu, Z, van der Maaten, L, and Weinberger, KQ. Densely connected convolutional networks. Proceed IEEE Conference Computer Vision Pattern Recog (CVPR). (2017):4700–8. doi: 10.1109/CVPR.2017.243

Crossref Full Text | Google Scholar

26. Howard, A, Sandler, M, Chu, G, Chen, L-C, Chen, B, Tan, M, et al. Searching for mobile NetV3. ar Xiv. (2019), 1314–1324. doi: 10.1109/ICCV.2019.00140

Crossref Full Text | Google Scholar

27. Ozcelik, N, Ozcelik, AE, Guner Zirih, NM, Selimoglu, I, and Gumus, A. Deep learning for diagnosis of malign pleural effusion on computed tomography images. Clinics (Sao Paulo). (2023) 78:100210. doi: 10.1016/j.clinsp.2023.100210

PubMed Abstract | Crossref Full Text | Google Scholar

28. Iqbal, I, Shahzad, G, Rafiq, N, Mustafa, G, and Ma, J. Deep learning-based automated detection of human knee joint’s synovial fluid from magnetic resonance images with transfer learning. IET Image Process. (2020) 14:1990–8. doi: 10.1049/iet-ipr.2019.1646

Crossref Full Text | Google Scholar

29. Das, P, Roy, SD, and Sangma, K. SFRSeg-net: synovial fluid region segmentation from rheumatoid arthritis affected small joints using USG for early detection In: A Antonacopoulos, S Chaudhuri, and R Chellappa, editors. Pattern recognition. Cham: Springer Nature Switzerland (2025). 127–46.

Google Scholar

Keywords: central nervous system infection, cerebrospinal fluid, brain MRI, Radiomics, deep learning, lumbar puncture, perivascular spaces

Citation: Cüce F, Tulum G, Isik MI, Jalili M, Girgin G, Karadaş Ö, Baş N, Özcan B, Savaşci Ü, Şakir S, Karadaş AÖ, Teomete E, Osman O and Rasheed J (2025) A novel MRI-based deep learning–radiomics framework for evaluating cerebrospinal fluid signal in central nervous system infection. Front. Med. 12:1659653. doi: 10.3389/fmed.2025.1659653

Received: 04 July 2025; Accepted: 08 August 2025;
Published: 20 August 2025.

Edited by:

Deepti Deshwal, Maharaja Surajmal Institute of Technology, India

Reviewed by:

Neeraj Kumar Pandey, Graphic Era University, India
Weiwei Jiang, Beijing University of Posts and Telecommunications (BUPT), China
Bo Kyu Choi, Yonsei University College of Medicine, Republic of Korea

Copyright © 2025 Cüce, Tulum, Isik, Jalili, Girgin, Karadaş, Baş, Özcan, Savaşci, Şakir, Karadaş, Teomete, Osman and Rasheed. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Gökalp Tulum, Z29rYWxwdHVsdW1AdG9wa2FwaS5lZHUudHI=; Jawad Rasheed, amF3YWQucmFzaGVlZEBpenUuZWR1LnRy

ORCID: Ferhat Cüce, orcid.org/0000-0003-1831-3868
Gökalp Tulum, orcid.org/0000-0003-1906-0401
Muhammed İkbal Işik, orcid.org/0000-0002-2132-8457
Marziye Jalili, orcid.org/0009-0002-4235-4416
Güven Girgin, orcid.org/0000-0002-6077-9824
Ömer Karadaş, orcid.org/0000-0003-0457-3722
Ümit Savaşci, orcid.org/0000-0003-3663-2053
Sena Şakir, orcid.org/0009-0003-5260-3145
Akçay Övünç Karadaş, orcid.org/0000-0003-3427-1207

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.