Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 12 January 2026

Sec. Multiple Sclerosis and Neuroimmunology

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1700604

Single-cell transcriptome-wide Mendelian randomization identifies mitochondrial targets in immune cells for Major Depressive Disorder

Yangyi Zhang,&#x;Yangyi Zhang1,2†Fei Zhang&#x;Fei Zhang3†Xingyi ShenXingyi Shen2Maoyuan NiMaoyuan Ni2Jiayi ZhangJiayi Zhang2Shunling ZhangShunling Zhang2Jingwen LinJingwen Lin2Zhaoyang Yang*Zhaoyang Yang2*Huangwei Lei*Huangwei Lei2*
  • 1Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China
  • 2Fujian University of Traditional Chinese Medicine, Fuzhou, Fujian, China
  • 3School of Rehabilitation Medicine, Gannan Medical University, Ganzhou, Jiangxi, China

Background: A critical need exists for objective biomarkers and novel therapeutic targets in major depressive disorder (MDD). Although dysfunction in mitochondrial immunometabolism is implicated in MDD, the specific causal genes suitable for clinical translation remain largely unidentified. This study aimed to bridge this gap by identifying mitochondria-related genes that have a causal impact on MDD risk through their expression in specific immune cells.

Methods: We integrated multi-omics data with machine learning to pinpoint key mitochondria-related energy metabolism genes (MEMRGs) linked to immune cell infiltration, assessed via ssGSEA and CIBERSORT algorithms. Cell-type-specific two-sample Mendelian randomization (MR) was employed to evaluate causal relationships between gene expression and MDD risk. Findings were validated in a chronic unpredictable mild stress (CUMS) rat model.

Results: Our analysis identified five genes—HK2, NDUFS4, NEU1, SOD1, and UCP2—whose expression in distinct immune populations had significant causal effects on MDD risk. Notably, HK2, NDUFS4, and NEU1 were identified as protective factors, while UCP2 and SOD1 were risk factors in specific cell types. The clinical relevance of this panel was supported by its diagnostic performance in an independent cohort, and the upregulation of the principal risk gene, UCP2, was confirmed in the hippocampus of CUMS rats.

Conclusion: This study provides robust genetic evidence establishing a causal link between the expression of specific mitochondrial genes in immune cells and the risk of MDD. By prioritizing UCP2, SOD1, HK2, NDUFS4, and NEU1, our findings highlight novel, immune-mediated pathways in depression and nominate promising targets for future diagnosis and therapeutic intervention.

1 Introduction

Major Depressive Disorder (MDD) is a prevalent mental illness, characterized by persistent low mood, diminished self-esteem, loss of interest in activities, reduced concentration, self-harm, and suicidal tendencies (1, 2). Its global prevalence imposes a substantial socioeconomic burden, particularly as it often manifests between ages 20 and 30, affecting quality of life and contributing to high suicide rates (3, 4). Diagnosis remains reliant on subjective symptom criteria, leading to potential misdiagnosis, and a substantial portion of patients fail to respond to existing antidepressant therapies (5, 6). Identifying reliable biomarkers and underlying mechanisms is thus imperative for early detection, precision diagnostics, and targeted therapies to mitigate suicide. Emerging artificial intelligence tools, such as machine learning for multi-omics integration, show promise in objective MDD detection by enhancing biomarker accuracy (7, 8), yet causal biological pathways remain underexplored.

A growing body of evidence implicates immune dysregulation and neuroinflammation as central pillars in the pathophysiology of MDD (9, 10). The function, differentiation, and activation of key immune cells, such as T cells and monocytes, are intrinsically linked to their metabolic state, with mitochondrial energy metabolism serving as a critical regulatory hub (11, 12). While mitochondrial dysfunction in the brain has been independently associated with MDD (13, 14), its specific role within the immune system’s contribution to depression remains a critical knowledge gap. Identifying the specific mitochondrial energy metabolism-related genes (MEMRGs) that drive immune cell-mediated risk for MDD could therefore uncover a new class of therapeutic targets at the interface of metabolism and immunity.

Addressing this gap could uncover novel pathways amenable to clinical intervention. To bridge this knowledge deficit, we integrated multi-omics data from the Gene Expression Omnibus (GEO) database and GeneCards to identify mitochondrial energy metabolism-related differentially expressed genes (MEMRDEGs) in MDD. Key genes were selected using LASSO, support vector machine (SVM), and logistic regression analyses. Their diagnostic potential was evaluated via receiver operating characteristic (ROC) curves. Immune infiltration profiles were assessed using ssGSEA and CIBERSORT algorithms with dual validation, and correlations between gene expression and immune cell abundance were analyzed. To formally test for causality between gene expression in specific immune lineages and MDD pathogenesis, we innovatively implemented a stratified Mendelian randomization (MR) framework. By leveraging summary statistics from major MDD GWAS and our curated single-cell eQTL (sc-eQTL) data across 14 immune cell types as genetic instruments, we systematically assessed the causal impact of the identified key genes. Therefore, the primary aim of this study was to identify MEMRGs that causally modulate MDD risk via their expression in distinct immune cell populations. We hypothesized that this high-resolution approach would reveal a panel of novel, cell-type-specific biomarkers and prioritize actionable targets for the future development of immunomodulatory therapies for MDD.

2 Materials and methods

2.1 Data acquisition and preprocessing

We downloaded two MDD-related datasets, GSE98793 (15) and GSE76826 (16), from the GEO database (17) using the R package GEOquery (18). Both datasets were derived from human blood samples and provided gene expression profiles. The GSE98793 dataset, generated on the GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array platform, included 192 samples (128 from MDD patients and 64 from healthy controls). The GSE76826 dataset, generated on the GPL17077 Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray 039381 platform, comprised 32 samples (12 from MDD patients and 20 from healthy controls). All samples were used in subsequent analyses. Detailed dataset information is provided in Table 1.

Table 1
www.frontiersin.org

Table 1. MDD dataset information list.

MEMRGs were obtained from the GeneCards database (19) using the keyword “Mitochondrial energy metabolism” and restricting to “Protein Coding” genes, yielding 218 MEMRGs. Additionally, we searched the Molecular Signatures Database (20) with the same keyword and identified 53 MEMRGs from the “KEGG_AMYOTROPHIC_LATERAL_SCLEROSIS_ALS” gene set. After merging and removing duplicates, a total of 266 unique MEMRGs were obtained. Detailed information is presented in Table 2.

Table 2
www.frontiersin.org

Table 2. List of gene symbols of MEMRGs.

2.2 Differential expression analysis

To increase statistical power, we merged datasets GSE98793 and GSE76826 using the R package sva (21) and performed batch effect removal to obtain a unified MDD dataset. Differential expression analysis was performed to identify DEGs between MDD and control samples. We then identified the intersection between these DEGs and the MEMRGs list. Genes exhibiting |logFC| > 0 and a p-value < 0.05 were defined as MEMRDEGs. The intersection was visualized using a Venn diagram. The differential analysis results were further visualized using a volcano plot and a differential ranking plot generated via the R package ggplot2.

2.3 Machine learning-based gene prioritization and model construction

Least absolute shrinkage and selection operator (LASSO) regression (22) was performed using the R package glmnet (23) with 10-fold cross-validation and a random seed of 2022 to screen MEMRDEGs from the combined MDD dataset. Genes with non-zero coefficients at the optimal lambda value were selected. Subsequently, a support vector machine (SVM) model (24) was constructed based on the LASSO-selected MEMRDEGs, optimized for maximum accuracy and minimum error rate.

The overlapping genes identified by both LASSO and SVM were considered key genes. A logistic regression model was then built using these key genes. To ensure the robustness of the identified biomarkers and avoid overfitting—especially given the absence of an external validation set—we employed a strict internal validation strategy. The model was validated using the Bootstrap method (1,000 resamplings) via the R package rms. A nomogram was created to visualize the multifactorial relationships and calculate total scores for predicting MDD probability. Model calibration was assessed with calibration curves (based on 1,000 bootstrap resamples), and clinical utility was evaluated via Decision Curve Analysis (DCA) using the ggDCA package (25).

2.4 ROC analysis

ROC curves (26) were generated using the R package pROC to evaluate the diagnostic potential of key gene expressions in distinguishing MDD patients from healthy controls. The area under the curve (AUC) was calculated for each gene to quantify discriminatory power, with values interpreted as follows: 0.5–0.7 indicating low accuracy, 0.7–0.9 moderate accuracy, and > 0.9 high accuracy.

2.5 Immune infiltration analysis

The immune cell landscape of each MDD sample was quantified using two distinct algorithms. We utilized single-sample Gene Set Enrichment Analysis (ssGSEA) via the R ‘GSVA’ package (27) to derive enrichment scores for various immune populations. In parallel, the CIBERSORT (28) deconvolution algorithm, referenced against the LM22 signature matrix, was used to estimate the relative fractions of 22 immune cell types. The resulting immune profiles from both methods were then compared between the MDD and control cohorts, and between high- and low-MEMscore subgroups, with differences visualized by boxplots. Pearson correlation was employed to assess the relationships among immune cell types and between immune cells and key genes, with findings visualized using dot plots and network diagrams via the ‘ggplot2’ package.

2.6 Single-cell eQTL MR analysis

To investigate the causal effects of 11 pre-identified key genes on MDD risk, we performed a cell-type-stratified two-sample MR analysis. Genetic instruments for gene expression in 14 immune cell types were sourced from the Yazar et al. single-cell eQTL (sc-eQTL) catalogue (29). Summary-level data for the MDD outcome were obtained from the Nagel et al. GWAS meta-analysis (N = 357,957 European-ancestry individuals) (30).

For each gene-cell pair, instrumental variables (IVs) were rigorously selected based on three primary criteria to ensure validity and specificity: (i) Cis-regulatory definition: only SNPs located within 1 Mb of the transcription start site of the target gene were considered to capture direct regulatory effects and minimize horizontal pleiotropy; (ii) Significance threshold: SNPs demonstrating a genome-wide significance of P < 5 × 10-5 were selected; and (iii) Independence: linkage disequilibrium (LD) clumping was performed (r² < 0.1 within a 1000 kb window) to remove correlated variants. To avoid weak instrument bias, the F-statistic was calculated for each gene-cell pair. In the final selection, F-statistics for individual SNPs ranged from 10.10 to 264.65, all exceeding the conventional threshold of 10 (F >10).

Primary causal estimates were calculated using the inverse-variance weighted (IVW) method utilizing the TwoSampleMR R package (31). To evaluate robustness, sensitivity analyses were conducted. Heterogeneity among IVs was assessed using Cochran’s Q statistic. Potential directional pleiotropy was evaluated using the MR-Egger intercept test, and horizontal pleiotropy was further examined via the MR-PRESSO global test. Additionally, leave-one-out analyses were performed to verify that the causal associations were not driven by any single influential SNP.

2.7 Animal experiments

2.7.1 Animal preparation

Animal experiments were conducted in accordance with the ARRIVE guidelines (32) and relevant institutional regulations, and were approved by the Research Ethics Committee of Fujian University of Traditional Chinese Medicine (approval no. 2019-FUCM-2022104). Given the higher prevalence of depression in females compared to males, female rats were used to model sex-specific biological and behavioral characteristics (33). Twelve 12-week-old healthy female Sprague-Dawley (SD) rats (body weight: 200 ± 10 g) were obtained from Shanghai SLAC Laboratory Animal Co., Ltd. (license no. SCXK (Shanghai) 2022-0004; qualification certificate no. 20220004002603). Rats were housed in the Animal Experiment Center at Fujian University of Traditional Chinese Medicine under controlled conditions: temperature of 22 ± 2 °C, 12-h light/dark cycle, and humidity of 40–60%. Animals were randomly assigned to two groups (n = 6 per group): a control group and a chronic unpredictable mild stress (CUMS) group. All rats had ad libitum access to food and water, except during experimental procedures.

2.7.2 CUMS modeling

The CUMS model is widely regarded as a classic paradigm for simulating the core pathological features of MDD (34). The CUMS model was established over 21 days to induce depressive-like behaviors in rats. One stressor was randomly selected each day, ensuring no repetition on consecutive days. Stressors included: 24-h food deprivation, 24-h water deprivation, 24-h cage tilting (45°from horizontal), 24-h wet bedding (200 mL water per 100 g sawdust), 24-h restraint, 5-min tail pinching (1 cm from the tail base), 10-min ice-water swimming (4 °C), and 15-min heat stress (45 °C). Rats were randomly assigned to control or CUMS groups (n = 6 per group) as described in Section 2.7.1, with ad libitum access to food and water except during stressors.

2.7.3 Sucrose preference test and open field test

The sucrose preference test (SPT) was performed to evaluate anhedonia in rats. On days 1–2, two bottles of 1% (w/v) sucrose solution were provided in each cage for habituation. Following 12-h food and water deprivation, one bottle was replaced with water. After 1 h, bottle positions were swapped to minimize positional bias. Consumption was measured after an additional 2 h, and sucrose preference was calculated as: (sucrose intake/total fluid intake) × 100%.

The open field test (OFT) was conducted to assess anxiety-like behaviors. In a quiet environment, each rat was placed in the center of an open field arena. Locomotor activity, including total distance traveled, was recorded for 5 min using TopScanLite software (version 2.00). The arena was cleaned with 75% ethanol between trials to eliminate olfactory cues.

2.7.4 Hematoxylin-eosin staining

Hippocampal tissues were fixed, paraffin-embedded, and sectioned coronally (6 μm thickness). Sections were baked at 60 °C for 2 h, deparaffinized in xylene (two changes, 10 min each), and rehydrated through a graded ethanol series (100% twice, 95%, 90%, 80%; 3 min each) followed by distilled water. Sections were stained with hematoxylin for 4 min, rinsed in tap water (three times), differentiated in 1% acid alcohol for 2 s, and counterstained with eosin for 50 s. Dehydration was performed through graded ethanol (80%, 90%, 95%, 100% twice; 2 s each), followed by clearing in xylene (two changes, 2 min each). Slides were mounted with neutral resin and observed under a light microscope at 200× magnification to assess morphological changes in the hippocampal tissue.

2.7.5 Nissl staining

Paraffin sections (6 μm) were deparaffinized and rehydrated as described in Section 2.7.4. Sections were stained with cresyl violet (Yuanye Bio-Technology, Shanghai, China) at 56 °C for 1 h, differentiated in 0.2% acetic acid until optimal contrast was achieved, and rapidly dehydrated in absolute ethanol (two changes). Slides were cleared in xylene (two changes) and mounted with neutral resin. Neuronal morphology in the hippocampus was examined under a light microscope at 200× magnification.

2.7.6 Reverse transcription quantitative polymerase chain reaction analysis

Total RNA was extracted from hippocampal tissues using TRIzol reagent (Invitrogen, Carlsbad, CA, USA), and RNA purity and concentration were assessed via spectrophotometry. Complementary DNA (cDNA) was synthesized using the Premix Ex Taq™ II kit (Takara Biotechnology Co., Ltd., Dalian, China). RT-qPCR was performed on a QuantStudio™ 6 Flex Real-Time PCR System (Applied Biosystems, Foster City, CA, USA) with the same kit, using β-actin as the internal reference gene. Relative mRNA expression levels were calculated using the 2-ΔΔCt method. Results are presented in Table 3.

Table 3
www.frontiersin.org

Table 3. Primer sequences used in the RT-qPCR.

2.7.7 Protein detection

SOD1, UCP2, and GAPDH (internal reference) were quantified using an automated capillary western blot system (ProteinSimple, San Jose, CA, USA). Hippocampal tissues were homogenized in RIPA lysis buffer supplemented with protease inhibitors, and total protein concentration was determined using a BCA assay kit (Beyotime Biotechnology, Shanghai, China). Samples were diluted to a standard concentration, mixed with 5× Fluorescent Master Mix (containing DTT and SDS) at a 1:4 ratio, and denatured at 95 °C for 5 min. The assay plate was loaded according to the manufacturer’s instructions: 3 μL of protein sample, 10 μL of antibody diluent, and 10 μL of primary antibodies were dispensed into respective wells. The primary antibodies used were: anti-SOD1 (1:50 dilution; Cat<ns/> YM8065, Immunoway), anti-UCP2 (1:50 dilution; Cat<ns/> YM9201, Immunoway), and anti-GAPDH (1:2200 dilution; Cat<ns/> 60004-1-Ig, Proteintech). Secondary antibody (anti-rabbit, 10 μL), luminol-peroxide solution (15 μL), and wash buffer (500 μL) were added to designated wells. Plates were centrifuged at 2500 rpm for 5 min at room temperature (25 °C) and analyzed on the Jess automated system (ProteinSimple). Band intensities were quantified using Compass software (version 4.0.0), with relative protein expression calculated as the ratio of target band intensity to GAPDH.

2.8 Statistical analysis

Data were analyzed using R software (version 4.2.3) and SPSS (version 26.0). Continuous variables were expressed as mean ± standard deviation (SD). Comparisons between two groups were performed using the independent Student’s t-test for normally distributed data or the Wilcoxon rank-sum test for non-normal data. For comparisons among three or more groups, the Kruskal-Wallis test was applied. Categorical variables were analyzed using the chi-square test or Fisher’s exact test, as appropriate. ROC curves were generated using the pROC package in R. Correlations were assessed via Spearman’s rank correlation coefficient. For Single-cell eQTL Mendelian randomization (Section 2.6), causal estimates were derived using the TwoSampleMR package in R, with sensitivity analyses as described. All tests were two-tailed, with statistical significance set at P < 0.05. In figures, significance levels are indicated as follows: ns (not significant, P ≥ 0.05); * (P < 0.05); ** (P < 0.01); *** (P < 0.001).

3 Results

3.1 Analysis flow chart

The technical approach for this bioinformatics analysis is illustrated below (Figure 1).

Figure 1
Flowchart illustrating the analysis process of MDD datasets GSE98793 and GSE76826 for mitochondrial energy metabolism gene sets. It includes differential expression analysis, LASSO, SVM, logistic regression, and identification of 11 key genes. Outputs are ROC analysis, ssGSEA CIBERSORT, single-cell eQTL MR analysis, and in vivo verification using CUMS rats.

Figure 1. Technology roadmap.

3.2 Differential expression analysis

Batch effects in datasets GSE98793 and GSE76826 were removed using the R package sva, resulting in a merged GEO dataset designated as the MDD dataset. The efficacy of batch effect removal was confirmed by comparing distribution boxplots and principal component analysis (PCA) plots before and after adjustment (Figures 2A–D), which demonstrated effective homogenization of sample distributions.

Figure 2
Panel A shows a bar graph of datasets GSE98793 and GSE76826 before processing, with distinct clusters. Panel B depicts the same datasets after processing, showing uniformity. Panel C features a scatter plot illustrating principal component analysis (PCA) before processing, highlighting two separate clusters. Panel D shows PCA after processing with overlapping clusters, indicating increased homogeneity.

Figure 2. Batch effect correction in the merged MDD dataset. (A) Boxplot of gene expression distributions in the merged MDD dataset before batch effect correction. (B) Boxplot of gene expression distributions in the merged MDD dataset after batch effect correction. (C) Principal component analysis (PCA) plot of the MDD dataset before correction. (D) PCA plot of the MDD dataset after correction.

Differential expression analysis of the MDD dataset identified 2424 differentially expressed genes (DEGs) based on |log2FC| > 0 and P < 0.05 criteria, including 1115 upregulated and 1309 downregulated genes in the MDD group relative to controls. Results were visualized using a volcano plot and a ranking plot generated with the R package ggplot2 (Figures 3A, B). Intersection of DEGs with MEMRGs yielded 40 MEMRDEGs: AGK, AK3, ATG7, ATP7B, C1QBP, CASP9, CBS, CLOCK, COX5A, CR1, CRAT, EPO, FOXO1, FXN, GRIA1, GRIN1, HIBCH, HK2, HUWE1, IDH3A, KLK4, LIPT1, LRPPRC, MAP2K6, MAPK14, MFN2, NDUFS4, NEU1, OPA3, PPP3CC, RAC1, RYR1, SIRT3, SOD1, SSBP1, TIMMDC1, TLR2, TNNT1, TPK1, and UCP2 (Figure 3C).

Figure 3
A series of six graphs and charts analyzing gene expression. A: Volcano plot displaying log2 fold change vs. -log10 p-value, marking upregulated, downregulated, and nonsignificant genes. B: Plot ranking differentially expressed genes with annotations on specific genes. C: Venn diagram showing overlapping genes between MEMRGs and DEGs. D: Box plots comparing gene expression in normal and MDD conditions for various genes. E: Correlation heatmap of gene interactions, with significance and correlation values. F: Circular genome map highlighting specific genes.

Figure 3. Differential expression analysis in the merged MDD dataset. (A) Volcano plot of differentially expressed genes (DEGs) between the normal and MDD groups. (B) Ranking plot of DEGs, with the x-axis representing the position after sorting by log2 fold change (log2FC) and the y-axis representing log2FC values. (C) Venn diagram illustrating the overlap between DEGs and MEMRGs in the merged dataset. (D) Boxplot comparing expression levels of MEMRDEGs between groups. (E) Correlation heatmap of MEMRDEGs. (F) Chromosomal localization of MEMRDEGs. Significance levels are indicated by asterisks (see Methods for details).

To ensure robustness, we re-verified the differential expression status of these 40 candidates within our final merged dataset. This confirmation step revealed that 34 of the genes robustly met the significance threshold (P < 0.05), while six (COX5A, HUWE1, KLK4, LRPPRC, TPK1, and RYR1) did not (Figure 3D). Consequently, these 34 confirmed MEMRDEGs were selected for all subsequent analyses. Correlations among the 34 MEMRDEGs were illustrated in a heatmap (Figure 3E), revealing an equal number of positive and negative correlations. Chromosomal locations of the MEMRDEGs were annotated using the RCircos package (Figure 3F), with five genes localized to chromosome 7 (AGK, EPO, and others as shown).

3.3 LASSO, SVM, logistics screening genes

To ascertain the diagnostic significance of MEMRDEGs within the MDD dataset, LASSO regression analysis was utilized to construct a diagnostic model (Figure 4A). The LASSO regression results, including the variable trajectory diagram, are visualized (Figure 4B). This analysis identified an optimal signature consisting of 11 MEMRDEGs, which yielded the highest diagnostic accuracy (Figure 4C) and the lowest cross-validation error rate (Figure 4D). The results indicate that the SVM model achieves maximum accuracy with 11 genes: ATP7B, CRAT, GRIA1, HK2, LIPT1, MAP2K6, NDUFS4, NEU1, SOD1, TNNT1, and UCP2. The genes identified by both algorithms were intersected, as demonstrated in a Venn diagram (Figure 4E), yielding 11 intersecting MEMRDEGs: ATP7B, CRAT, GRIA1, HK2, LIPT1, MAP2K6, NDUFS4, NEU1, SOD1, TNNT1, and UCP2. Logistic regression analysis was conducted on these 11 MEMRDEGs to build a logistic regression model. Through this process, the 11 MEMRDEGs were confirmed as key genes (Table 4). The findings from the univariate logistic regression analysis were organized and presented in a forest plot (Figures 4F, G).

Figure 4
A series of scientific charts and graphs analyzing various data sets. Panel A shows a plot of binomial deviance against log lambda. Panel B displays coefficients against log lambda with multiple lines. Panel C and D depict cross-validation accuracy and error against the number of features. Panel E presents a Venn diagram comparing LASSO and SVM selections. Panel F provides a table with univariate analysis results for different characteristics. Panel G illustrates a nomogram for predicted points. Panel H includes a calibration curve comparing actual and predicted probability. Panel I shows a decision curve analysis. Panel J reveals a series of box plots for different variables.

Figure 4. Construction of the MEMRDEGs diagnostic model. (A) LASSO regression model for MEMRDEGs. (B) Variable trajectory plot from the LASSO regression model. (C) Number of genes yielding the highest accuracy via the SVM algorithm. (D) Number of genes yielding the lowest error rate via the SVM algorithm. (E) Venn diagram showing the intersection of genes screened by LASSO regression and SVM algorithms. (F) Forest plot of logistic regression analysis results. (G) Nomogram derived from the logistic regression model. (H) Calibration curve for the logistic predictive model, assessing prediction accuracy. (I) DCA plot, with threshold probability on the x-axis and net benefit on the y-axis. (J) Functional similarity analysis of key genes.

Table 4
www.frontiersin.org

Table 4. Description of key genes.

Calibration analysis was performed to assess the accuracy and resolution of the diagnostic model, resulting in a calibration curve (Figure 4H). The curve illustrates the alignment between the optimal theoretical probability (solid line) and the model-predicted probability (dashed line), evaluating the model’s predictive accuracy for actual outcomes. DCA was implemented to validate the model’s clinical utility, with results presented (Figure 4I). The DCA indicates that the model provides a higher net benefit than ‘All positive’ and ‘All negative’ strategies within a specific range, demonstrating superior performance. As shown (Figures 4H, I), the constructed model exhibits high accuracy in diagnosing MDD. Functional similarity analysis was conducted on the 11 key genes to elucidate their relationships, with results visualized in a cloud and rain map (Figure 4J). The analysis reveals that SOD1 exhibits the highest functional similarity value compared to the other key genes.

3.4 Diagnostic ROC analysis

ROC curves were constructed using linear predictors from the logistic regression model to evaluate its diagnostic performance in the MDD dataset. The combined signature demonstrated good diagnostic accuracy in distinguishing MDD cases from controls, achieving AUC of 0.852 (Figure 5A). In contrast, the diagnostic utility of individual genes was limited, with AUCs ranging from 0.581 to 0.684 (Figures 5B–L).

Figure 5
Twelve ROC curve graphs labeled A to L depict the diagnostic performance of various biomarkers, including ATP7B, CRAT, GRIA1, HK2, LIPT1, MAP2K6, NDUFS4, NEU1, SOD1, TNNT1, and UCP2. Each graph displays sensitivity versus specificity with an annotated AUC score. Graph A, representing linear predictors, shows the highest AUC of 0.852. Other graphs demonstrate varying AUC scores, indicating different levels of predictive accuracy.

Figure 5. ROC curve analysis of the diagnostic model and key genes in the MDD dataset. (A). ROC curve of the logistic regression model linear predictors. (B–L). ROC curves for the 11 key genes (ATP7B, CRAT, GRIA1, HK2, LIPT1, MAP2K6, NDUFS4, NEU1, SOD1, TNNT1, and UCP2), with MDD and normal groups as outcome variables. The area under the curve (AUC) ranges generally between 0.5 and 1; values closer to 1 indicate better diagnostic performance. AUC values of 0.5–0.7 suggest low accuracy, 0.7–0.9 indicate moderate accuracy, and >0.9 reflect high accuracy.

3.5 ssGSEA immune infiltration analysis between MDD and normal groups in the MDD dataset

Immune cell infiltration was evaluated using the ssGSEA algorithm to examine differences between MDD and normal controls in the MDD dataset. Infiltration levels of 28 immune cell types were assessed, and the Mann-Whitney U test was applied to compare these levels between groups. The results are presented in a comparative chart (Figure 6A). Statistically significant differences (P < 0.05) were observed in the infiltration levels of ten immune cell types: activated CD4 T cells, central memory CD8 T cells, activated CD8 T cells, activated B cells, effector memory CD8 T cells, memory B cells, type 1 T helper cells, activated dendritic cells, macrophages, and natural killer cells. Correlation analysis among these 10 dysregulated cell types revealed a strong positive association between effector memory and activated CD8+ T cells and a strong negative association between activated CD8+ T cells and natural killer cells (Figure 6B).

Figure 6
Panel A shows a boxplot comparing infiltration abundance of various immune cell types between normal and MDD groups. Panel B presents a correlation heatmap of different immune cell types, with values ranging from -1 to 1. Panel C displays a bubble plot indicating gene expression correlations with cell types, with bubble size representing the significance level.

Figure 6. ssGSEA immune infiltration analysis in the MDD dataset. (A) Boxplot comparing ssGSEA immune infiltration scores between MDD and normal groupsin the MDD dataset. (B) Correlation heatmap of immune cell infiltration abundance in the MDD dataset. (C) Correlation plot between immune cellsand key genes in the MDD dataset. ns, not significant (P ≥ 0.05); *P < 0.05; **P < 0.01; ***P < 0.001.

Next, we investigated the relationship between the 11 key genes and these 10 immune cell populations (Figure 6C). Several strong positive correlations emerged. Specifically, HK2 expression correlated with macrophages and natural killer cells. SOD1 and UCP2 expression were associated with multiple activated adaptive immune cells, including activated CD4+/CD8+ T cells and B cells. Furthermore, NEU1 expression correlated positively with macrophages and activated dendritic cells, while NDUFS4 was associated with effector memory CD8+ T cells (all P < 0.05). These correlations suggest that MEMRGs may modulate immune infiltration patterns in MDD, laying the groundwork for causal investigations.

3.6 CIBERSORT immune infiltration analysis between MDD and normal groups in the MDD dataset

To complement the ssGSEA findings and validate the altered immune landscape using an alternative deconvolution algorithm, we next employed CIBERSORT. This analysis confirmed significant heterogeneity in immune profiles (Figure 7A) and revealed that MDD patients had significantly higher proportions of four immune cell types: M0 macrophages (P < 0.001), neutrophils, resting NK cells, and CD8+ T cells (all P < 0.01) (Figure 7B). Among these dysregulated cells, M0 macrophages and resting NK cells were strongly positively correlated, while neutrophils showed a strong negative correlation with resting NK cells (Figure 7C).

Figure 7
A composite image showing four graphs related to immune cell analysis:  A) A stacked bar chart depicting the relative percentage of various immune cells in normal and MDD groups, with a color-coded legend for cell types.  B) A box plot comparing the infiltrate abundance of different immune cells between normal (blue) and MDD (red) groups, including statistical significance annotations.  C) A correlation heatmap with pie charts highlighting relationships between various immune cells, showing positive and negative correlations in red and blue.  D) A bubble plot illustrating the correlation of different genes with immune cells, using color intensity and bubble size to indicate correlation and p-value significance.

Figure 7. CIBERSORT immune cell infiltration analysis in the MDD dataset. (A) Stacked histogram showing the relative proportions of immune cell infiltration inMDD and normal groups. (B) Group comparison chart of immune cell infiltration levels between MDD and normal groups. (C) Correlation analysis ofimmune cell infiltration abundance. (D) Correlation diagram between immune cell infiltration levels and expression of key genes. ns, not significant (P ≥ 0.05); **P < 0.01; ***P < 0.001.

We then correlated the proportions of these four cell types with the expression of our 11 key genes (Figure 7D). Both SOD1 and UCP2 expression were positively associated with CD8+ T cell abundance. NEU1 and HK2 expression also showed positive correlations with M0 macrophage levels. These findings align with ssGSEA results, reinforcing the link between MEMRG expression and altered immune profiles in MDD, and highlighting potential pathways for immune-mediated disease mechanisms.

3.7 Causal effects of gene expression on MDD risk assessed by MR

The preceding analyses revealed significant correlations between the expression of 11 key genes and the infiltration levels of various immune cells (Sections 3.5 & 3.6). To investigate whether these associations reflect a causal relationship between gene expression within specific immune cell types and MDD pathogenesis, we conducted a two-sample MR analysis.

Our MR analysis provided genetic evidence for a causal link between the expression of five key genes in distinct immune populations and the risk of MDD (Figure 8).

Figure 8
A forest plot showing the odds ratios (OR) and 95% confidence intervals (CI) for various genes and cell types. The x-axis represents the OR, with a dashed red line at 1.0. Each gene-cell pair is listed alongside its OR, CI, and p-value. The plot illustrates that most relationships have ORs greater than 1, with significant p-values below 0.05, indicating notable associations.

Figure 8. Mendelian randomization analysis of the causal effects of gene expression on MDD risk. The forest plot illustrates the causal estimates from the two-sample MR analysis. The analysis assessed the effect of the expression of five key genes (HK2, NDUFS4, NEU1, SOD1, and UCP2) in various immune cell types on the risk of MDD. Each point estimate represents the Odds Ratio (OR), and the horizontal lines indicate the 95% Confidence Intervals (CI). The vertical dashed line at OR = 1 represents the null effect. ORs, 95% CIs, and corresponding P-values are listed on the right.

Protective Causal Effects: We identified several genes whose elevated expression was causally associated with a reduced risk of MDD. Specifically, increased expression of HK2 in monocytes (OR = 0.96, 95% CI: 0.93-1.00, P = 0.029), NDUFS4 in both CD4+ naive T cells (OR = 0.98, 95% CI: 0.96-0.99, P = 0.002) and natural killer cells (OR = 0.97, 95% CI: 0.95-0.99, P = 0.016), and NEU1 in CD4+ naive T cells (OR = 0.86, 95% CI: 0.81-0.91, P < 0.001) were identified as protective factors against MDD.

Risk-Conferring Causal Effects: Conversely, the analysis revealed that elevated expression of SOD1 in mononuclear cells was causally linked to an increased risk of MDD (OR = 1.03, 95% CI: 1.02-1.04, P < 0.001). More strikingly, increased expression of UCP2 was consistently associated with a higher risk of MDD across a broad spectrum of immune cells. These included B cells (OR = 1.03, 95% CI: 1.02-1.05, P < 0.001), multiple T cell subsets, monocytes, dendritic cells, and natural killer cells (all P < 0.01). The strongest risk effects for UCP2 were observed in B cells and S100B-expressing CD8+ T cells (OR = 1.03, 95% CI: 1.02-1.04, P < 0.001).

The validity of these causal estimates was confirmed through a battery of sensitivity analyses. We found no evidence of significant heterogeneity (Cochran’s Q test, all P > 0.05) or directional pleiotropy, as indicated by non-significant MR-Egger regression intercepts (P > 0.05) and visually symmetrical funnel plots. Additionally, MR-PRESSO analyses did not detect any outlier genetic variants, and leave-one-out analyses demonstrated that the results were not driven by any single instrumental variable. Detailed results of all sensitivity analyses are provided in the Supplementary Materials. Collectively, these rigorous checks affirm the robustness of our findings.

Collectively, these MR findings provide robust genetic evidence supporting a causal role for altered gene expression within specific immune cell populations in the etiology of MDD, highlighting both protective and risk-conferring molecular pathways that warrant further investigation.

3.8 In vivo validation and characterization of key gene expression in a CUMS model

To investigate the in vivo relevance of our primary findings, we established a CUMS rat model to mimic depressive-like behaviors (experimental schematic in Figure 9A). The successful induction of a depressive phenotype was first confirmed through behavioral and physiological assessments. Rats subjected to CUMS exhibited significantly lower body weight gain compared to the control group (P < 0.05, Figure 9B). Furthermore, the CUMS group displayed a marked reduction in sucrose preference, a core indicator of anhedonia (P < 0.05, Figure 9C). In the open field test, CUMS-exposed rats showed significantly decreased total distance traveled and time spent in the center zone, indicative of locomotor inhibition and anxiety-like behavior (P < 0.05, Figures 9D–F).

Figure 9
Diagram of an experiment involving normal and CUMS-treated mice. Part A shows an experiment timeline with behavioral tests and sample collection. Part B is a graph showing body weight changes over three weeks, with CUMS mice gaining less weight than normal mice. Parts C to E are bar charts showing sucrose consumption, total distance, and center distance, all lower in CUMS mice. Part F illustrates movement paths of mice in an open field, showing more activity in normal mice compared to CUMS mice.

Figure 9. Behavioral validation of the CUMS model in rats. (A) Timeline of the experimental design. (B) Body weight monitoring indicated suppressed weight gain in CUMS rats. (C) Sucrose preference test results showing anhedonia-like behavior. (D–F). Locomotor and anxiety-like behaviors assessed by the open field test. Panels show total distance (D), center zone distance (E), and representative tracking paths (F). Data are presented as mean ± SD (n = 6 per group). **P < 0.01 vs. NOR group.

Next, we assessed neuropathological changes in the hippocampus. Histopathological examination using H&E staining revealed significant neuronal injury in the CUMS group. In contrast to the neatly arranged and densely packed neurons in the CA3 and dentate gyrus (DG) regions of control animals, the CUMS group exhibited cellular disarray, reduced neuronal density, and enlarged intercellular spaces (Figure 10A). At higher magnification, neurons from CUMS rats displayed hallmarks of cellular stress, including chromatolysis (loss of Nissl bodies) and pyknosis (nuclear condensation and shrinkage), whereas control neurons appeared healthy with abundant Nissl substance and intact nuclei.

Figure 10
Panel A shows histological images comparing hippocampal regions CA3 and DG under normal (NOR) and chronic unpredictable mild stress (CUMS) conditions, using H&E and Nissl staining. Panel B displays bar graphs depicting SOD1 and UCP2 mRNA expression levels, with significant differences between NOR and CUMS groups. Panel C presents Western blots for SOD1, UCP2, and GAPDH proteins, highlighting expression differences under NOR and CUMS conditions, supported by accompanying bar graphs showing quantification relative to GAPDH. Statistical significance is indicated with asterisks.

Figure 10. Validation of histological injury and risk gene expression in the hippocampus. (A) Representative images of HE (top) and Nissl (bottom) staining in the CA3 and DG subregions. The CUMS group exhibited neuronal loss and nuclear pyknosis compared to controls. Scale bar = 50 μm. (B) Relative mRNA levels of SOD1 and UCP2 analyzed by RT-qPCR. (C) Western blot analysis and densitometric quantification of SOD1 and UCP2 protein expression normalized to GAPDH. Data are presented as mean ± SD. Unpaired Student’s t-test. *P < 0.05, **P < 0.01 vs. NOR group.

Finally, we sought to determine whether the expression of key genes identified by our MR analysis was altered in the hippocampus of the CUMS model. We first examined UCP2, which the MR analysis implicated as a risk factor for MDD. Consistent with the human genetic data, both UCP2 mRNA and protein levels were significantly upregulated in the hippocampus of CUMS rats relative to controls (P < 0.05, Figures 10B, C). Next, we assessed the expression of SOD1. Intriguingly, in contrast to the MR finding where higher SOD1 expression in mononuclear cells was associated with increased MDD risk, we observed a significant downregulation of both SOD1 mRNA and protein levels in the hippocampus of the CUMS model (P < 0.05, Figures 10B, C). Collectively, these in vivo findings validate the role of hippocampal UCP2 upregulation in depressive-like pathology. However, the opposing expression pattern of SOD1 between the CUMS model and human genetic data suggests a complex, potentially context-dependent role for this gene in MDD pathophysiology that warrants further investigation.

4 Discussion

In this study, we integrated multi-omics data with Mendelian randomization to dissect the causal role of mitochondrial immunometabolism in MDD. The primary contributions of our work are fourfold. First, we developed a novel integrative framework that uses machine learning and single-cell genomics to prioritize causally-relevant genes, moving beyond simple correlation. Second, we provide the first genetic evidence for a causal link between the expression of five specific genes (HK2, NDUFS4, NEU1, SOD1, and UCP2) in distinct immune cell types and the risk of MDD, identifying both risk and protective profiles. Third, we established the clinical potential of this five-gene signature as a diagnostic biomarker panel, demonstrating good performance in an independent cohort. Finally, we experimentally corroborated our genetic findings by observing the upregulation of the principal risk gene, UCP2, in the hippocampus of a stress-induced rat model. Collectively, these findings illuminate a novel immune-metabolic axis in depression and highlight genetically prioritized targets for future therapeutic development.

Our single-cell eQTL-based MR analysis provided granular insights into the cell-type-specific causal architecture of these genes. The protective gene HK2, whose higher expression in monocytes was associated with reduced MDD risk, demonstrated fair diagnostic utility (AUC = 0.684). Specifically, our MR results confirmed that genetically predicted upregulation of HK2 in monocytes exerts a robust protective effect against MDD. This finding presents an intriguing paradox. While literature suggests its upregulation often drives pro-inflammatory aerobic glycolysis (the Warburg effect) (35, 36), our data suggest its role in monocytes is protective. This could imply that a baseline level of glycolytic readiness is essential for proper monocyte function, and that genetic variants promoting this optimal state are protective, whereas dysregulated glycolysis contributes to pathology. Similarly, we MR results identified protective roles for NDUFS4 (AUC = 0.647) in CD4+ naive T cells and natural killer cells, and NEU1 (AUC = 0.602) in CD4+ naive T cells were all causally associated with a reduced risk of MDD. NDUFS4 is vital for mitochondrial Complex I efficiency (37) and NEU1 is a key lysosomal enzyme (38), underscoring that preserving fundamental energy and cellular maintenance processes within the immune system is a key factor in mitigating MDD risk. We hypothesize that NEU1 may confer protection by regulating the sialylation of key surface receptors on T cells, thereby modulating activation, differentiation, and migration processes (39). Dysregulation of these processes has been implicated in neuroinflammation, suggesting that NEU1 may help maintain immune homeostasis and prevent T cell-mediated pathology relevant to MDD (40, 41).

In stark contrast, our findings nominate UCP2 as a central, multi-lineage driver of MDD pathology. Increased UCP2 expression emerged as a consistent causal risk factor across a broad spectrum of immune cells and showed fair diagnostic potential (AUC = 0.635). This aligns perfectly with the concept of maladaptive immunometabolic reprogramming in chronic inflammatory states linked to MDD (9, 42). As a key modulator of mitochondrial function and ROS production (43, 44), UCP2 upregulation in immune cells likely signifies a state of chronic metabolic stress that fuels the low-grade neuroinflammation characteristic of depression (45). Our in vivo results provide experimental support for this hypothesis: we observed significantly elevated mRNA and protein levels of UCP2 in the hippocampus of CUMS rats. This finding is consistent with previous studies linking UCP2 to neurological disorders and depressive-like phenotypes (46). The convergence of human genetic evidence and in vivo data reinforces UCP2 as a promising therapeutic target for MDD.

Our findings on the context-dependent role of SOD1 provide compelling evidence to reconcile conflicting reports in the literature regarding its involvement in MDD. It is well-established that SOD1 exerts a neuroprotective role in the central nervous system (CNS) (47). Previous studies have reported lower SOD1 levels in MDD patients and have shown that SOD1 overexpression can protect rodents from stress-induced depressive-like behaviors, likely by mitigating oxidative stress (48, 49). Consistent with this neuroprotective paradigm, our CUMS model revealed a significant downregulation of SOD1 in the hippocampus, suggesting an exhaustion of the brain’s antioxidant defenses under chronic stress. However, our MR analysis unveils a potentially paradoxical role for SOD1 in the peripheral immune system. We found that a genetic predisposition for higher SOD1 expression in immune cells may confer a risk factor for MDD. This seemingly paradoxical finding may reflect distinct context-dependent roles of SOD1. This leads us to hypothesize a dual, tissue-specific role for SOD1. In the CNS, its primary function is antioxidant defense, where higher levels are protective. In contrast, within immune cells, SOD1 is a critical modulator of redox signaling, which governs inflammatory responses. Collectively, these observations highlight the complexity of targeting SOD1 and underscore the imperative for tissue-specific therapeutic strategies in depression management.

Crucially, while our MR analysis primarily focused on peripheral immune cells, it is important to acknowledge that mitochondrial dysfunction is not confined to the periphery but may also manifest in CNS glial cells, particularly microglia and astrocytes (50, 51). These glial cells are pivotal for maintaining neuronal homeostasis and regulating neuroinflammation, processes that are fundamentally dependent on mitochondrial integrity. We propose that aberrant expression of mitochondrial metabolic genes in peripheral immune cells could propagate signals (via cytokines or extracellular vesicles) to the CNS, thereby influencing glial metabolism and activity. This communication suggests a potential “peripheral-central immune-glial mitochondrial axis” contributing to the immunometabolic imbalance in MDD. In the current study, we primarily validated the pathological alterations of UCP2 and SOD1 in the hippocampal tissue of CUMS rats. Future investigations utilizing single-cell transcriptomics are warranted to specifically delineate the cell-type-specific roles of these genes within CNS glial populations, further refining the immunometabolic model of MDD.

The six genes that showed initial correlation but lacked causal evidence in our MR analysis (ATP7B, CRAT, GRIA1, LIPT1, MAP2K6, TNNT1) are also informative. Their associations may be secondary to the disease process, influenced by confounding variables, or represent pathways that are not central to disease etiology. For instance, while altered copper metabolism (ATP7B) has been linked to depression (52, 53), our results suggest it may not be a primary causal driver. This distinction powerfully illustrates the discriminatory power of MR in prioritizing targets with genuine causal impact from a pool of observational associations, saving resources for downstream functional studies (54). These results align with broader MDD research, including Psychiatric Genomics Consortium GWAS implicating immune and neuronal genes (55). Our study adds a layer of causal evidence, suggesting that genetic variants affecting the expression levels of specific genes in immune cells are key drivers of this risk. This high-precision causal mapping offers a new level of clarity for prioritizing genetically-validated targets for future mechanism-based immunomodulatory therapies.

Despite the robustness of our multi-pronged approach, several limitations should be acknowledged. First, although sensitivity analyses suggest robust causal inference, the inherent possibility of horizontal pleiotropy in MR cannot be definitively excluded. Second, the predominance of European ancestry in the GWAS datasets may limit generalizability, necessitating verification in diverse populations. Finally, our current in vivo validation relies on bulk tissue analysis, which confirms overall dysregulation but lacks the resolution to distinguish specific immune or glial sources. Future work utilizing spatial transcriptomics or multiplex imaging is warranted to spatially map these cell-type-specific effects and elucidate their downstream mechanisms.

5 Conclusions

In conclusion, this study provides compelling, multi-faceted evidence that the expression of five key mitochondria-related genes in specific immune cells exerts a causal effect on MDD risk. We established their potential as clinically relevant biomarkers, verified the upregulation of the risk gene UCP2 in a stress-induced animal model, and uncovered a complex, context-dependent role for SOD1. These findings move beyond correlation to illuminate a distinct immune-metabolic axis in depression. We propose the identified pathways as genetically supported candidates for the development of novel mechanism-based diagnostics and immunomodulatory therapies for MDD.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics statement

The animal study was approved by the Research Ethics Committee of Fujian University of Traditional Chinese Medicine (approval no. 2019-FUCM-2022104). The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

YZ: Data curation, Investigation, Methodology, Software, Writing – original draft. FZ: Formal Analysis, Validation, Writing – original draft. XS: Writing – original draft. MN: Validation, Visualization, Writing – original draft. JZ: Validation, Writing – original draft. SZ: Validation, Writing – original draft. JL: Validation, Writing – original draft. ZY: Funding acquisition, Writing – review & editing. HL: Funding acquisition, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the National Natural Science Foundation of China (81973751); Foreign Cooperation Project of Fujian Science and Technology Program (No.: 2021I0018); High-level Talent Program of Fujian University of Traditional Chinese Medicine (No.: X2020008-Talent); The Second Youth Talent Promotion Project of Fujian Association for Science and Technology (no project number); and the Traditional Chinese Medicine Diagnostics and Health Status Identification Inheritance and Innovation Team, National Traditional Chinese Medicine Inheritance and Innovation Team, State Administration of Traditional Chinese Medicine (Grant No. ZYYCXTD-C-202408).

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1700604/full#supplementary-material

References

1. Feng Y, Xiao L, Wang WW, Ungvari GS, Ng CH, Wang G, et al. Guidelines for the diagnosis and treatment of depressive disorders in China: The second edition. J Affect Disord. (2019) 253:352–6. doi: 10.1016/j.jad.2019.04.104

PubMed Abstract | Crossref Full Text | Google Scholar

2. Herrman H, Kieling C, McGorry P, Horton R, Sargent J, and Patel V. Reducing the global burden of depression: a Lancet-World Psychiatric Association Commission. Lancet. (2019) 393:e42–3. doi: 10.1016/S0140-6736(18)32408-5

PubMed Abstract | Crossref Full Text | Google Scholar

3. Athira KV, Bandopadhyay S, Samudrala PK, Naidu VGM, Lahkar M, and Chakravarty S. An overview of the heterogeneity of major depressive disorder: current knowledge and future prospective. Curr Neuropharmacol. (2020) 18:168–87. doi: 10.2174/1570159X17666191001142934

PubMed Abstract | Crossref Full Text | Google Scholar

4. Malhi GS and Mann JJ. Depression. Lancet. (2018) 392:2299–312. doi: 10.1016/S0140-6736(18)31948-2

PubMed Abstract | Crossref Full Text | Google Scholar

5. Young JJ, Silber T, Bruno D, Galatzer-Levy IR, Pomara N, and Marmar CR. Is there progress? An overview of selecting biomarker candidates for major depressive disorder. Front Psychiatry. (2016) 7:72. doi: 10.3389/fpsyt.2016.00072

PubMed Abstract | Crossref Full Text | Google Scholar

6. Otte C, Gold SM, Penninx BW, Pariante CM, Etkin A, Fava M, et al. Major depressive disorder. Nat Rev Dis Primers. (2016) 2:16065. doi: 10.1038/nrdp.2016.65

PubMed Abstract | Crossref Full Text | Google Scholar

7. Heyat MBB, Akhtar F, Munir F, Sultana A, Muaad AY, Gul I, et al. Unravelling the complexities of depression with medical intelligence: exploring the interplay of genetics, hormones, and brain function. Complex Intelligent Syst. (2024) 10:4–10. doi: 10.1007/s40747-024-01346-x

Crossref Full Text | Google Scholar

8. Heyat MBB, Adhikari D, Akhtar F, Parveen S, Zeeshan HM, Ullah H, et al. Intelligent internet of medical things for depression: current advancements, challenges, and trends. Int J Intelligent Syst. (2025) 2025:6801530. doi: 10.1155/int/6801530

Crossref Full Text | Google Scholar

9. Beurel E, Toups M, and Nemeroff CB. The bidirectional relationship of depression and inflammation: double trouble. Neuron. (2020) 107:234–56. doi: 10.1016/j.neuron.2020.06.002

PubMed Abstract | Crossref Full Text | Google Scholar

10. Gislaine Z R, Luana M M, João Q, and André F C. Major depressive disorder as a neuro-immune disorder: Origin, mechanisms, and therapeutic opportunities. Neurosci Biobehav Rev. (2023) 155:105425. doi: 10.1016/j.neubiorev.2023.105425

PubMed Abstract | Crossref Full Text | Google Scholar

11. Mills EL, Kelly B, and O’Neill LAJ. Mitochondria are the powerhouses of immunity. Nat Immunol. (2017) 18:488–98. doi: 10.1038/ni.3704

PubMed Abstract | Crossref Full Text | Google Scholar

12. Angelika S R and Erika L P. Mitochondrial dynamics at the interface of immune cell metabolism and function. Trends Immunol. (2017) 39:6–18. doi: 10.1016/j.it.2017.08.006

PubMed Abstract | Crossref Full Text | Google Scholar

13. Scaini G, Mason BL, Diaz AP, Jha MK, Soares JC, Trivedi MH, et al. Dysregulation of mitochondrial dynamics, mitophagy and apoptosis in major depressive disorder: Does inflammation play a role? Mol Psychiatry. (2022) 27:1095–102. doi: 10.1038/s41380-021-01312-w

PubMed Abstract | Crossref Full Text | Google Scholar

14. Jelena B, Stefanie G, Christian O, Yuri M, Anna S M, Martin P, et al. Cellular specificity of mitochondrial and immunometabolic features in major depression. Mol Psychiatry. (2022) 27:2370–1. doi: 10.1038/s41380-022-01473-2

PubMed Abstract | Crossref Full Text | Google Scholar

15. Leday GGR, Vertes PE, Richardson S, Greene JR, Regan T, Khan S, et al. Replicable and coupled changes in innate and adaptive immune gene expression in two case-control studies of blood microarrays in major depressive disorder. Biol Psychiatry. (2018) 83:70–80. doi: 10.1016/j.biopsych.2017.01.021

PubMed Abstract | Crossref Full Text | Google Scholar

16. Miyata S, Kurachi M, Okano Y, Sakurai N, Kobayashi A, Harada K, et al. Blood transcriptomic markers in patients with late-onset major depressive disorder. PloS One. (2016) 11:e0150262. doi: 10.1371/journal.pone.0150262

PubMed Abstract | Crossref Full Text | Google Scholar

17. Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, et al. mining tens of millions of expression profiles–database and tools update. Nucleic Acids Res. (2007) 35:D760–5. doi: 10.1093/nar/gkl887

PubMed Abstract | Crossref Full Text | Google Scholar

18. Davis S and Meltzer PS. GEOquery: a bridge between the gene expression omnibus (GEO) and bioConductor. Bioinformatics. (2007) 23:1846–7. doi: 10.1093/bioinformatics/btm254

PubMed Abstract | Crossref Full Text | Google Scholar

19. Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, et al. The geneCards suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinf. (2016) 54:1 30 1–1 30 33. doi: 10.1002/cpbi.5

PubMed Abstract | Crossref Full Text | Google Scholar

20. Liberzon A, Birger C, Thorvaldsdottir H, Ghandi M, Mesirov JP, and Tamayo P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. (2015) 1:417–25. doi: 10.1016/j.cels.2015.12.004

PubMed Abstract | Crossref Full Text | Google Scholar

21. Leek JT, Johnson WE, Parker HS, Jaffe AE, and Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. (2012) 28:882–3. doi: 10.1093/bioinformatics/bts034

PubMed Abstract | Crossref Full Text | Google Scholar

22. Cai W and van der Laan M. Nonparametric bootstrap inference for the targeted highly adaptive least absolute shrinkage and selection operator (LASSO) estimator. Int J Biostat. (2020) 16:20170070. doi: 10.1515/ijb-2017-0070

PubMed Abstract | Crossref Full Text | Google Scholar

23. Engebretsen S and Bohlin J. Statistical predictions with glmnet. Clin Epigenet. (2019) 11:123. doi: 10.1186/s13148-019-0730-1

PubMed Abstract | Crossref Full Text | Google Scholar

24. Sanz H, Valim C, Vegas E, Oller JM, and Reverter F. SVM-RFE: selection and visualization of the most relevant features through non-linear kernels. BMC Bioinf. (2018) 19:432. doi: 10.1186/s12859-018-2451-4

PubMed Abstract | Crossref Full Text | Google Scholar

25. Tataranni T and Piccoli C. Dichloroacetate (DCA) and cancer: an overview towards clinical applications. Oxid Med Cell Longev. (2019) 2019:8201079. doi: 10.1155/2019/8201079

PubMed Abstract | Crossref Full Text | Google Scholar

26. Mandrekar JN. Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol. (2010) 5:1315–6. doi: 10.1097/JTO.0b013e3181ec173d

PubMed Abstract | Crossref Full Text | Google Scholar

27. Hanzelmann S, Castelo R, and Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinf. (2013) 14:7. doi: 10.1186/1471-2105-14-7

PubMed Abstract | Crossref Full Text | Google Scholar

28. Chen B, Khodadoust MS, Liu CL, Newman AM, and Alizadeh AA. Profiling tumor infiltrating immune cells with CIBERSORT. Methods Mol Biol. (2018) 1711:243–59. doi: 10.1007/978-1-4939-7493-1_12

PubMed Abstract | Crossref Full Text | Google Scholar

29. Yazar S, Alquicira-Hernandez J, Wing K, Senabouth A, Gordon MG, Andersen S, et al. Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease. Sci (New York N.Y.). (2022) 376:eabf3041. doi: 10.1126/science.abf3041

PubMed Abstract | Crossref Full Text | Google Scholar

30. Nagel M, Jansen PR, Stringer S, Watanabe K, de Leeuw CA, Bryois J, et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat Genet. (2018) 50:920–7. doi: 10.1038/s41588-018-0151-7

PubMed Abstract | Crossref Full Text | Google Scholar

31. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife. (2018) 7:e34408. doi: 10.7554/eLife.34408

PubMed Abstract | Crossref Full Text | Google Scholar

32. Kilkenny C, Browne WJ, Cuthill IC, Emerson M, and Altman DG. Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. PloS Biol. (2010) 8:e1000412. doi: 10.1371/journal.pbio.1000412

PubMed Abstract | Crossref Full Text | Google Scholar

33. Zhang S, Zhang J, Zhang F, Ni M, Zhong L, Chen M, et al. Protective effect and mechanism of fruit extracts of Taxus chinensis on hippocampus of chronic unpredicta ble mild stress rats. Chin Herbal Medicines. (2025). doi: 10.1016/j.chmed.2025.02.008

Crossref Full Text | Google Scholar

34. Antoniuk S, Bijata M, Ponimaskin E, and Wlodarczyk J. Chronic unpredicta ble mild stress for modeling depression in rodents: Meta-analysis of model reliability. Neurosci Biobehav Rev. (2019) 99:101–16. doi: 10.1016/j.neubiorev.2018.12.002

PubMed Abstract | Crossref Full Text | Google Scholar

35. Hu Y, Cao K, Wang F, Wu W, Mai W, Qiu L, et al. Dual roles of hexokinase 2 in shaping microglial function by gating glycolytic flux and mitochondrial activity. Nat Metab. (2022) 4:1756–74. doi: 10.1038/s42255-022-00707-5

PubMed Abstract | Crossref Full Text | Google Scholar

36. Fang J, Luo S, and Lu Z. HK2: Gatekeeping microglial activity by tuning glucose metabolism and mitochondrial functions. Mol Cell. (2023) 83:829–31. doi: 10.1016/j.molcel.2023.02.022

PubMed Abstract | Crossref Full Text | Google Scholar

37. Shil SK, Kagawa Y, Umaru BA, Nanto-Hara F, Miyazaki H, Yamamoto Y, et al. Ndufs4 ablation decreases synaptophysin expression in hippocampus. Sci Rep. (2021) 11:10969. doi: 10.1038/s41598-021-90127-4

PubMed Abstract | Crossref Full Text | Google Scholar

38. Khan A and Sergi CM. NEU1-A unique therapeutic target for alzheimer’s disease. Front Pharmacol. (2022) 13:902259. doi: 10.3389/fphar.2022.902259

PubMed Abstract | Crossref Full Text | Google Scholar

39. Aljohani MA, Sasaki H, and Sun XL. Cellular translocation and secretion of sialidases. J Biol Chem. (2024) 300:107671. doi: 10.1016/j.jbc.2024.107671

PubMed Abstract | Crossref Full Text | Google Scholar

40. Howlader MA, Demina EP, Samarani S, Guo T, Caillon A, Ahmad A, et al. The Janus-like role of neuraminidase isoenzymes in inflammation. FASEB J. (2022) 36:e22285. doi: 10.1096/fj.202101218R

PubMed Abstract | Crossref Full Text | Google Scholar

41. Fremuth LE, Hu H, Vlekkert D, Annunziata I, Weesner JA, and Alessandra D.A. Neuraminidase 1 regulates neuropathogenesis by governing the cellular state of microglia via modulation of Trem2 sialylation. Cell Rep. (2025) 44:115204 doi: 10.1016/j.celrep.2024.115204

PubMed Abstract | Crossref Full Text | Google Scholar

42. Pearce EL and Pearce EJ. Metabolic pathways in immune cell activation and quiescence. Immunity. (2013) 38:633–43. doi: 10.1016/j.immuni.2013.04.005

PubMed Abstract | Crossref Full Text | Google Scholar

43. Vozza A, Parisi G, De Leonardis F, Lasorsa FM, Castegna A, Amorese D, et al. UCP2 transports C4 metabolites out of mitochondria, regulating glucose and glutamine oxidation. Proc Natl Acad Sci United States America. (2014) 111:960–5. doi: 10.1073/pnas.1317400111

PubMed Abstract | Crossref Full Text | Google Scholar

44. Sun XL, Liu Y, Dai T, Ding JH, and Hu G. Uncoupling protein 2 knockout exacerbates depression-like behaviors in mice via enhancing inflammatory response. Neuroscience. (2011) 192:507–14. doi: 10.1016/j.neuroscience.2011.05.047

PubMed Abstract | Crossref Full Text | Google Scholar

45. Miller AH and Raison CL. The role of inflammation in depression: from evolutionary imperative to modern treatment target. Nat Rev Immunol. (2016) 16:22–34. doi: 10.1038/nri.2015.5

PubMed Abstract | Crossref Full Text | Google Scholar

46. Du RH, Wu FF, Lu M, Shu XD, Ding JH, Wu G, et al. Uncoupling protein 2 modulation of the NLRP3 inflammasome in astrocytes and its implications in depression. Redox Biol. (2016) 9:178–87. doi: 10.1016/j.redox.2016.08.006

PubMed Abstract | Crossref Full Text | Google Scholar

47. Trist BG, Hilton JB, Hare DJ, Crouch PJ, and Double KL. Superoxide dismutase 1 in health and disease: how a frontline antioxidant becomes neurotoxic. Angew Chem Int Ed Engl. (2021) 60:9215–46. doi: 10.1002/anie.202000451

PubMed Abstract | Crossref Full Text | Google Scholar

48. Rybka J, Kedziora-Kornatowska K, Banas-Lezanska P, Majsterek I, Carvalho LA, Cattaneo A, et al. Interplay between the pro-oxidant and antioxidant systems and proinflammatory cytokine levels, in relation to iron metabolism and the erythron in depression. Free Radic Biol Med. (2013) 63:187–94. doi: 10.1016/j.freeradbiomed.2013.05.019

PubMed Abstract | Crossref Full Text | Google Scholar

49. Uchihara Y, Tanaka K, Asano T, Tamura F, and Mizushima T. Superoxide dismutase overexpression protects against glucocorticoid-induced depressive-like behavioral phenotypes in mice. Biochem Biophys Res Commun. (2016) 469:873–7. doi: 10.1016/j.bbrc.2015.12.085

PubMed Abstract | Crossref Full Text | Google Scholar

50. Peruzzotti-Jametti L, Willis CM, Krzak G, Hamel R, Pirvan L, Ionescu RB, et al. Mitochondrial complex I activity in microglia sustains neuroinflammation. Nature. (2024) 628:195–203. doi: 10.1038/s41586-024-07167-9

PubMed Abstract | Crossref Full Text | Google Scholar

51. Ye J, Duan C, Han J, Chen J, Sun N, Li Y, et al. Peripheral mitochondrial DNA as a neuroinflammatory biomarker for major depressive disorder. Neural Regener Res. (2025) 20:1541–54. doi: 10.4103/NRR.NRR-D-23-01878

PubMed Abstract | Crossref Full Text | Google Scholar

52. Ni M, You Y, Chen J, and Zhang L. Copper in depressive disorder: A systematic review and meta-analysis of observational studies. Psychiatry Res. (2018) 267:506–15. doi: 10.1016/j.psychres.2018.05.049

PubMed Abstract | Crossref Full Text | Google Scholar

53. Liu X, Lin C, Wang S, Yu X, Jia Y, and Chen J. Effects of high levels of copper on the depression-related memory disorders. J Gerontol A Biol Sci Med Sci. (2023) 78:611–8. doi: 10.1093/gerona/glac222

PubMed Abstract | Crossref Full Text | Google Scholar

54. Davey Smith G and Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet. (2014) 23:R89–98. doi: 10.1093/hmg/ddu328

PubMed Abstract | Crossref Full Text | Google Scholar

55. Howard DM, Adams MJ, Clarke TK, Hafferty JD, Gibson J, Shirali M, et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci. (2019) 22:343–52. doi: 10.1038/s41593-018-0326-7

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: immune cell eQTL, machine learning, Major Depressive Disorder, Mendelian randomization, mitochondrial energy metabolism

Citation: Zhang Y, Zhang F, Shen X, Ni M, Zhang J, Zhang S, Lin J, Yang Z and Lei H (2026) Single-cell transcriptome-wide Mendelian randomization identifies mitochondrial targets in immune cells for Major Depressive Disorder. Front. Immunol. 16:1700604. doi: 10.3389/fimmu.2025.1700604

Received: 07 September 2025; Accepted: 15 December 2025; Revised: 12 December 2025;
Published: 12 January 2026.

Edited by:

Taolin Chen, Sichuan University, China

Reviewed by:

Jing Du, Capital Medical University, China
Wei Fu, Air Force Medical University, China
Shen He, Shanghai Mental Health Center, China

Copyright © 2026 Zhang, Zhang, Shen, Ni, Zhang, Zhang, Lin, Yang and Lei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhaoyang Yang, eXp5ODEzQDEyNi5jb20=; Huangwei Lei, bGVpaHVhbmd3ZWkxMjEyQDEyNi5jb20=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.