- 1Department of Dermatology, The First Affiliated Hospital (Southwest Hospital), Army Medical University, Chongqing, China
- 2Chongqing Key Laboratory for the Mechanism and Intervention of Cancer Metastasis, Chongqing University Cancer Hospital, Chongqing University, Chongqing, China
- 3School of Engineering, Huaqiao University, Quanzhou, China
- 4Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing, China
Alopecia areata (AA) and androgenetic alopecia (AGA) both present with hair loss but require different therapies, and reliable biomarkers to guide treatment remain lacking. We integrated bulk and single-cell RNA-seq to compare JAK–STAT signaling in AA versus AGA. In AA, 257 immune-enriched differentially expressed genes (DEGs) were identified; WGCNA and consensus machine learning (LASSO, SVM-RFE, random forest) yielded six candidate hub genes, and external validation narrowed these to four key genes–granzyme A (GZMA), interleukin-2 receptor
1 Introduction
AA and AGA are the two most common hair loss disorders and have markedly different clinical features and epidemiological impacts (Zhou et al., 2021). AA presents as sudden-onset, non-scarring hair loss caused by breakdown of the hair follicle’s immune privilege and a cytotoxic T-cell-mediated autoimmune attack (Dai et al., 2021); its lifetime prevalence is 0.1%–2%, with considerable psychosocial burden (Strazzulla et al., 2018). By contrast, AGA is driven by androgen-dependent miniaturization of scalp hair follicles (especially in frontal and vertex regions) via androgen receptor signaling in dermal papilla cells, with only limited inflammatory involvement (Anzai et al., 2019). AA can progress to alopecia totalis or universalis, whereas AGA usually arises in adulthood and is epidemiologically linked to cardiometabolic comorbidities (Gonul et al., 2018). These distinct clinical courses and systemic associations underscore the need for disease-specific biomarkers and targeted therapies.
The pathophysiology of AA is driven by dysregulated activation of the JAK/STAT (Janus kinase/Signal Transducer and Activator of Transcription) signaling pathway (Dai et al., 2021; Haughton et al., 2023). A key downstream element is the IFN-
To date, AA research has focused mainly on STAT-mediated cytokine signaling and related interventions, whereas AGA studies have largely been limited to androgen pathways or a few isolated molecular markers (Haughton et al., 2023; Wada-Irimada et al., 2025; Sardana et al., 2018). Significant knowledge gaps remain. First, there have been no comprehensive, side-by-side comparisons of the JAK-STAT signaling networks in AA versus AGA, leaving the immunological similarities and differences between the two diseases incompletely explained—AA’s pathology features excessive T cell-driven JAK-STAT activation, whereas AGA studies have historically under emphasized inflammatory pathways. Second, cross-disease multi-omics studies are scarce, impeding the identification of common versus disease-specific hub genes and hindering the development of precise diagnostics and targeted therapies (Ocampo-Garza et al., 2019; Sardana et al., 2018). Finally, in clinical translation, reliance on linear models (e.g., logistic regression nomograms) persists, yet such models perform poorly in small-cohort, high-dimensional transcriptomic settings–precisely where robust, mechanism-anchored representations and sample-efficient learning are most needed (Clough and Barrett, 2016; Ritchie et al., 2015). These limitations underscore the need for advanced machine learning frameworks that can operate on limited data while maintaining transparency. In other words, an explainable AI approach is required to meet these challenges in a modern Healthcare 4.0 context.
Clarifying these mechanistic differences is critical for developing precise interventions: for AA, immune-targeted therapies (e.g., JAK inhibition) should be prioritized, whereas for AGA, treatments should focus on androgen-dependent signaling. Clearly distinguishing the molecular features of the two disorders will not only accelerate the discovery of non-invasive biomarkers but also lay the groundwork for innovative, mechanism-based treatments. Accordingly, in this study we performed a multi-omics integrative analysis to systematically compare the JAK-STAT networks in AA and AGA, identify disease-specific hub genes, and evaluate their potential utility in early diagnosis and personalized treatment.
To meet these challenges, we integrated bulk and single-cell RNA-seq data to map JAK-STAT biology in AA and AGA, while retaining a biologically grounded feature-selection pipeline (DEGs
Specifically, the primary contributions of this work are:
1. Prior-driven, interpretable representations for small cohorts. Bulk gene expression profiles are projected onto pathway- and cell type–aligned latent variables using a frozen MultiPLIER prior, and then these latent dimensions are re-weighted according to the expression of the consensus hub genes. This approach produces a mechanism-anchored embedding that reduces data requirements while preserving biological interpretability and context relevance.
2. Relation-style set-to-set comparison as a nomogram alternative. Instead of using a linear nomogram, our method performs full support–query matching via a Hadamard similarity map with permutation-invariant aggregation, followed by a shallow classifier (i.e., a Relation Network–style few-shot comparator tailored to transcriptomic data). This design is both parameter-efficient and sample-efficient for small cohorts, and its case-based matching mechanism offers an intuitive interpretation of predictions.
3. End-to-end, biology-grounded pipeline with multi-omics and experimental validation. We maintain the robust feature-selection backbone and couple it to the few-shot classifier, then validate the pipeline across bulk and single-cell data and further confirm AA-specific activation of the IL2RB/IL2RG–EOMES–GZMA axis at both mRNA and protein levels in independent scalp samples. This approach localizes the
2 Methods
2.1 Data sources
Transcriptomic sequencing data for AA and AGA were obtained from the Gene Expression Omnibus (GEO) database (Langfelder and Horvath, 2008). Specifically, the training set data for AA came from the GSE68801 dataset (60 AA samples and 36 normal samples), while the validation set data came from the GSE45512 dataset (five AA samples and five normal samples). Additionally, transcriptome data for AGA were obtained from the GSE36169 dataset (five AGA samples and five normal samples), and the single-cell RNA sequencing (scRNA-seq) data of AA was from GSE212447, which included five AA samples for subsequent analysis.
2.2 Data preprocessing
For GSE68801 (AA, GPL570), GSE45512 (AA validation, GPL570), and GSE36169 (AGA, GPL96), we obtained either raw CEL files or processed series matrix files from GEO. When CEL files were available, microarrays were processed using the RMA pipeline (background correction, quantile normalization, and probe summarization). For series matrices, we verified that expression values were already
2.3 Key gene selection workflow
2.3.1 Differential expression analysis
Differential Expression Analysis: We used the limma package (v3.54.2) in R to identify differentially expressed genes (DEGs) between AA lesional and control samples in GSE68801 (Li et al., 2024). Genes with
2.3.2 Weighted gene co-expression network analysis (WGCNA)
Weighted Gene Co-expression Network Analysis (WGCNA): We applied WGCNA (v1.72.1) Szklarczyk et al. (2021); Shannon et al. (2003) on the GSE68801 expression matrix (all samples) to identify gene co-expression modules, using disease status (AA vs. control) as the trait. To ensure reliable results, we first performed sample clustering to check data correlations and remove any outlier samples. We then chose a soft-threshold power based on the scale-free topology criterion to optimize the network’s fit, and used it to construct a gene co-expression network with hierarchical clustering, yielding distinct gene modules. Next, Pearson correlation analysis was conducted between each module’s eigenexpression and the disease trait, and the modules with the strongest positive and negative correlations with AA were identified. The genes from these highly associated modules were selected for further investigation.
2.3.3 Functional enrichment analysis
Functional Enrichment Analysis: We performed functional enrichment analysis on the intersecting gene set using the R package clusterProfiler (v4.6.2) (Bodinier et al., 2023; Narala et al., 2021). This included Gene Ontology (GO) enrichment—covering the three GO categories of cellular component (CC), biological process (BP), and molecular function (MF)—as well as Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The enrichment results for GO and KEGG were visualized with appropriate plots.
2.3.4 Constructing PPI network based on the correlation analysis of the JAK family genes
First, correlation analysis was performed between specific gene expression profiles and JAK family genes, with Pearson correlation coefficients and p values used to assess linear relationships. Genes meeting the screening criteria (
2.3.5 Consensus machine-learning feature selection
To enhance the accuracy and reproducibility of high-dimensional transcriptomic feature selection in this study, three classical machine learning algorithms least absolute shrinkage and selection operator(LASSO) regression, support vector machine recursive feature elimination (SVM-RFE), and random forest (RF) were simultaneously applied on the basis of 20 candidate genes to precisely identify the core biomarkers. LASSO was implemented using glmnet (V 4.1-8) (Bodinier et al., 2023), which automatically optimized the penalty coefficient through 10-fold cross-validation. L1 regularization was employed to compress the coefficients of non-critical features to zero, facilitating gene screening (Narala et al., 2021); SVM-RFE was constructed using e1071 (V 1.7-13) (Lin et al., 2012), eliminating the least contributing features after each round of training, and retaining the gene subset with the strongest predictive ability.RF generates 500 decision trees using randomForest (V 4.7-1.2) (Guyon et al., 2002), quantifies feature importance through out-of-bag error evaluation and Gini gain ranking, and effectively suppresses overfitting through an ensemble voting mechanism (Hu and Szymczak, 2023). Finally, we performed a Venn intersection of the feature gene lists, selecting the overlapping genes as core candidates. Details of the three machine learning algorithms are as follows:
2.3.5.1 LASSO (L1-regularized logistic regression)
In a binary (AA vs. control) logit model, incorporating L1 regularization simultaneously achieves coefficient shrinkage and variable selection (shrinking the regression coefficients of unimportant features to 0) (Tibshirani, 1996).
Model and Objective Function formula is given in Equation 1 (Binary Classification,
where
2.3.5.2 SVM-RFE (support vector machine recursive feature elimination)
In linearly separable or approximately linearly separable settings, the weight vector
Linear SVM (Soft Margin) Objective formula is given in Equation 2:
where
In each round, several features with the smallest
2.3.5.3 Random forest (RF)
RF generates an ensemble of trees through bootstrapping (bagging) and random subset feature splitting, which is suitable for nonlinear and high-order interaction scenarios; two types of importance metrics can be used to screen features (Breiman, 2001). Let the training set be
Node splitting (Gini criterion). Let the empirical class proportions at node
The impurity decrease for the candidate split
Select the split with the largest
Out-of-Bag (OOB) Error and Evaluation. If sample
For all nodes
MDI can quickly provide global rankings, but may produce bias when dealing with “different value ranges/number of categories” or highly correlated features.
First, obtain the original score
the larger
2.3.6 Identification of gene expression levels between normal and patient samples
We next evaluated how well the candidate feature genes distinguish AA from normal samples. We examined each feature gene’s expression in both the training and validation sets and plotted receiver operating characteristic (ROC) curves using the pROC package (v1.18.4) (Zhang et al., 2025). Genes that were differentially expressed in AA vs. controls with consistent direction in both sets and had an area-under-curve (AUC)
2.3.7 Expression of key genes in AGA
In order to systematically investigate their expression patterns, a Comprehensive differential expression analysis of key genes was conducted in the AGA dataset. Particular emphasis was placed on the JAK family genes (JAK1, JAK2, JAK3, and TYK2), then the differential expression of these genes in AGA was analyzed to clarify their potential mechanisms in this disease.
2.4 Prior-informed few-shot diagnostic model
We replaced the conventional nomogram with a Relation Network–style few-shot classifier leveraging MultiPLIER priors (Figure 1). In our model, the bulk expression matrix
Step 1: Projection onto MultiPLIER Latent Space. We project the gene expression matrix
Where,
Step 2: Key-gene-guided channel re-weighting.
Let
where
Step 3: Two-Branch Encoder & Hadamard Similarity Mapping. We use a lightweight neural encoder
where
Step 4: Permutation-Invariant Set Aggregation. We aggregate the query–support similarity vectors across the entire support set in a permutation-invariant way (insensitive to the order of support samples). In practice, we use a Deep Sets–style relation module (The formula is shown below) that transforms and sums the similarity vectors for each class:
where
Step 5: Training and Inference Procedure. We train the model in an episodic 2-way
where
We used a MultiPLIER loading matrix with L = 987 latent variables. The encoder
Instead of fitting a traditional logistic-regression nomogram, we trained our prior-informed few-shot model and took its raw output logit
2.5 Immune infiltration analysis
Single-sample gene set enrichment analysis (ssGSEA) was employed to quantitatively assess the abundance of 28 immune cell types between the disease group and the control group in the training set. Then the infiltration levels of the two groups were compared using the two-sided Wilcoxon rank sum test. Cell types with p
2.6 Gene set variation analysis and GSEA analyses
To delineate differences in immune response pathway activities among distinct biological groups, we first obtained 153 immune-related gene sets with complete functional annotations from the ImmPort database. Using these gene sets, we performed gene set variation analysis (GSVA) on the training set data via the R package “GSVA” (v1.46.0) to evaluate the enrichment levels of immune response pathways across samples (Hänzelmann et al., 2013). GSVA is a nonparametric algorithm that transforms a gene expression matrix into a gene set activity score matrix, enabling quantitative assessment of pathway enrichment at the single-sample level. In this way, GSVA scores reflect the enrichment status of immune-related pathways for each sample and allow comparison of pathway activities between AA and control groups.
For mechanistic insights into the roles of key hub genes in disease pathogenesis, we additionally carried out gene set enrichment analysis (GSEA) based on Hallmark gene sets. The R package “msigdbr” (v10.0.1) was employed to retrieve Hallmark reference gene sets (Sennett et al., 2024). For each hub gene, samples were stratified into high- and low-expression groups according to the median expression of that gene, and classical GSEA was applied to identify differentially regulated pathways between the two groups and to dissect the associated signaling programs (Zhang et al., 2025). GSEA evaluates whether predefined gene sets exhibit statistically significant and concordant differences between two biological states and, compared with conventional single-gene analyses, is able to detect both subtle coordinated changes across many genes and pronounced alterations within smaller gene subsets, thereby enhancing biological interpretation.
2.7 Identification of drug candidates
GeneCards (Stelzer et al., 2016), a comprehensive gene database integrating genomic, transcriptomic, proteomic, genetic, and functional data from approximately 150 sources, served as a key resource for exploring gene-disease associations and drug interactions. Critical gene-drug relationships based on hub genes were extracted from GeneCards, with approved drugs screened and visualized using the R package “ggalluvial” (V 0.12.5) (Brunson, 2020).
2.8 Single-cell sequencing analysis
Single-cell data were preprocessed and normalized using the R package “Seurat” (V 5.1.0) (Butler et al., 2018) with parameters set to min.cells = 3 and min.features = 200, retaining only genes expressed in
Quality control excluded cells with
2.9 Multi-database functional annotation of hub genes
Cross-referencing the four hub genes and their upstream cytokines across several independent functional–annotation and signaling databases provided external validation of our computational findings. Public resources consistently annotate granzyme A (GZMA) as a cytotoxic serine protease expressed in activated
Together, this multi-database functional annotation converges on a coherent picture: the IL2RB/IL2RG–EOMES–GZMA axis represents a well-established immune effector module, and our multi-omics pipeline has rediscovered this module as the core signature distinguishing AA from AGA. This convergence increases confidence in the biological plausibility and translational relevance of the identified targets, rather than indicating a model driven by spurious associations.
2.10 Biological validation in independent scalp samples
2.10.1 Human scalp samples and ethics
This study enrolled three groups of participants: (i) alopecia areata (AA) lesional scalp
2.10.2 Western blot (WB) analysis of hub proteins
Total protein was extracted from scalp tissues homogenized on ice using RIPA lysis buffer supplemented with PMSF (Beyotime, China). Protein concentrations were determined with a BCA assay kit (Beyotime, China). Equal amounts of protein were resolved by SDS–PAGE and transferred to PVDF membranes at 200 mA for 2 h. Membranes were blocked in 5% non-fat milk at room temperature for 2 h and incubated overnight at
2.10.3 Real-time quantitative polymerase chain reaction (RT-qPCR) analysis of hub genes
Total RNA was isolated using the RNASimple Total RNA Kit (TIANGEN, China). First-strand cDNA was synthesized with the MightyScript First-Strand cDNA Synthesis Mix (Sangon, China). qPCR was performed with 2
3 Result
During differential expression analysis of the AA training set, a total of 257 differentially expressed genes (DEGs) were identified. Among these, 91 DEGs exhibited upregulated expression, while 166 DEGs showed downregulated expression. Based on these results, a volcano plot was generated to visualize the changes in gene expression, as shown in Figure 2a. Furthermore, the top 10 most DEGs from both the upregulated and downregulated groups were extracted, and a heatmap was plotted to examine their distribution patterns across samples, as shown in Figure 2b. Additionally, WGCNA was performed using the training set data, with disease status encoded as the phenotypic trait.
Figure 2. Screening of differentially expressed genes (DEGs) and identification of module genes in alopecia areata (AA). (a) Volcano plot and (b) Heatmap of DEGs across different groups (Control and AA).
During this process, six outlier samples (GSM1681988, GSM1681991, GSM1682055, GSM1682056, GSM1682057, and GSM1682101) were identified and excluded to ensure the accuracy and reliability of the analysis, as shown in Figure 3a. By setting a scale-free network assessment coefficient (
Figure 3. Screening of differentially expressed genes (DEGs) and identification of module genes in alopecia areata (AA). (a) Sample clustering tree diagram and correlation heatmap and (b) Scale independence and average connectivity of different soft threshold powers used in Weighted Gene Co-expression Network Analysis (WGCNA).
Figure 4. Screening of differentially expressed genes (DEGs) and identification of module genes in alopecia areata (AA). (a) Cluster tree diagram generated by WGCNA and (b) Module-trait relationship heatmap showing the correlation coefficients and significance values between modules and traits of interest.
By intersecting the DEGs with genes from the significant WGCNA modules, we obtained 228 overlapping genes (Figure 5a). We performed GO enrichment for these genes and visualized the top five categories in each GO domain (Figure 5b).
Figure 5. Functional enrichment and gene expression analysis. (a) Venn diagram showing the overlap between genes identified by DEGs and WGCNA and (b) Dot plot of Gene Ontology (GO) terms for cellular component (CC), biological process (BP), and molecular function (MF) enriched among DEGs.
For Biological Process (BP), the most enriched terms were intermediate filament organization, intermediate filament-based process, skin development, and epidermal development. For Cellular Component (CC), top terms included molting cycle, intermediate filaments (and intermediate filament cytoskeleton), keratin filaments, and external side of plasma membrane. For Molecular Function (MF), top terms included keratinization, structural constituent of skin epidermis, chemoattractant activity, chemoattractant receptor binding, cytokine activity, and CCR chemokine receptor binding. Additionally, KEGG pathway analysis identified nine significantly enriched pathways (Figure 6), including Staphylococcus aureus infection, viral protein–cytokine/chemokine interactions, cytokine–cytokine receptor interaction, estrogen signaling, chemokine signaling, cell adhesion molecules (CAMs), hematopoietic cell lineage, Th1 and Th2 cell differentiation, and primary immunodeficiency. These enrichment results provide a context-sensitive interpretation of the overlapping genes, suggesting they cluster in pathways relevant to AA pathology.
First, Pearson correlation analysis was used to pair all DEGs with the expression levels of JAK1, JAK2, JAK3, and TYK2 one by one and calculate the correlation coefficients. Subsequently, genes with statistically significant correlations with JAK family genes were screened for further study. The results showed 55 genes were positively or negatively correlated with at least one member of the JAK axis at a moderate-intensity level. Thus, they were included in the candidate list for subsequent functional verification. The correlation results are presented as a heatmap: colors range from light to dark, corresponding to correlation coefficients from weak to strong, and symbols in the squares indicate statistical significance, as shown in Figure 7.
Figure 7. Functional enrichment and gene expression analysis. (a) Box plots comparing the expression levels of Janus kinase (JAK) family genes (JAK1, JAK2, JAK3, TYK2) between control and AA groups and (b) Heatmap of the correlation matrix of key genes, with Pearson’s correlation coefficients shown. Nodes and edges highlight the relationships among JAK family genes and others.
Genes found to be strongly correlated with the JAK family underwent PPI network analysis utilizing STRING, with visualization achieved through Cytoscape. This network-based approach facilitated the identification of 10 core hub genes: cluster of differentiation 3D (CD3D), granzyme A (GZMA), interleukin 2 receptor subunit beta (IL2RB), interleukin 2 receptor subunit gamma (IL2RG), inducible T-cell costimulator (ICOS), STAT4, interleukin 10 receptor subunit alpha (IL10RA), eomesodermin (EOMES), interleukin-2-inducible T-cell kinase (ITK), and integrin subunit alpha L (ITGAL), as shown in Figures 8, 9.
Figure 8. Identification hub genes in AA. (a,b) PPI network showing the complex interactions between key genes.
Figure 9. Upset plot of 10 intersecting genes obtained from five topology analysis algorithms (Maximum Clique Centrality (MCC), Maximum Neighborhood Component (MNC), Degree, Density of Maximum Neighborhood Component (DMNC), Edge Percolated Component (EPC)) in the cytoHubba plugin.
To rigorously refine the selection of key molecular drivers, three complementary machine learning algorithms were implemented on the training dataset. The LASSO regression model first highlighted six pivotal genes, namely, GZMA, IL2RB, IL2RG, STAT4, IL10RA, and EOMES, as shown in Figure 11a. Following this, SVM-RFE analysis delineated nine influential genes, including IL2RG, IL10RA, GZMA, ITGAL, CD3D, STAT4, EOMES, IL2RB, and ITK, as shown in Figure 10a. dditionally, the RF classifier, applying a MeanDecreaseGini threshold exceeding 3, pinpointed seven key genes: IL2RG, ITGAL, STAT4, IL2RB, EOMES, GZMA, and IL10RA, as shown in Figure 10b. By intersecting the outputs derived from these three distinct analytical frameworks, a robust consensus set of six feature genes: GZMA, IL2RB, IL2RG, STAT4, IL10RA, and EOMES was ultimately determined, as shown in Figure 11b. This integrative approach reinforced the credibility of the selected genes as pivotal components within the JAK-related molecular landscape.
Figure 10. Identification hub genes in AA. (a) The Support Vector Machine (SVM) line chart shows that when the model reaches its highest accuracy, nine genes are selected. (b) Bar chart of variable importance based on MeanDecreaseGini from Random Forest analysis, illustrating the most critical genes.
Figure 11. Identification hub genes in AA. (a) Least absolute shrinkage and selection operator (LASSO) regression plot displaying binomial deviance versus log lambda values to determine optimal gene selection. (b) Venn diagram representing the overlap of identified genes from LASSO, Random Forest (RF), and SVM-Recursive Feature Elimination (RFE).
Based on these six feature genes, an external validation cohort was further introduced for expression validation. Based on these six feature genes, an external validation cohort was further introduced for expression validation. Of the six candidate biomarkers identified by our machine-learning selection, four genes—GZMA, IL2RB, IL2RG, and EOMES—showed significant differential expression in an independent AA cohort, with consistent up/downregulation trends (Figure 12). Each of these four genes achieved an AUC
Figure 12. Validation and predictive performance of feature genes in AA. (a) Differential expression of six core genes in the AA and control groups of the training set. (b) Differential expression of six core genes in the AA and control groups of the test set.
Figure 13. Validation and predictive performance of feature genes in AA. (a) Receiver operating characteristic (ROC) curve of six core genes in the AA and control groups of the training set. (b) ROC curve of six core genes in the AA and control groups of the test set.
Figure 14. (a) Decision curve analysis illustrating the net benefit of the predictive model across different risk thresholds. (b) ROC curve for the model, with the area under the curve (AUC) indicating the model’s overall predictive power.
We analyzed immune cell infiltration in AA lesions versus controls. Out of 28 cell types, 22 showed significantly different abundance between AA and healthy scalps (
Figure 15. Immune cell infiltration analysis. Box plots comparing the infiltration levels of 28 immune cell types between the control and AA groups.
Figure 16. Immune cell infiltration analysis. Correlation heatmap showing Pearson’s correlation coefficients between the infiltration levels of different immune cell types and hub genes. Orange lines denote significant positive correlations, while blue lines indicate significant negative correlations.
Gene set variation analysis (GSVA) of the training set revealed widespread immune pathway perturbations in AA. We identified 115 immune-related pathways with significant activity differences relative to controls
Figure 17. Bar plot showing the differential analysis results of gene set variation analysis (GSVA).
Together, these concordant annotations converge on a coherent picture: the IL2RB/IL2RG–EOMES–GZMA axis represents a well-established immune effector module rather than a spurious statistical pattern, and our multi-omics pipeline has rediscovered this module as the core signature distinguishing AA from AGA. This convergence increases confidence in the biological plausibility and translational relevance of the identified targets and supports their prioritization for pathway-guided immunomodulatory therapies.
Using single-cell RNA-seq from AA lesions (5 samples, 25,875 cells), we identified 12 distinct cell clusters (Figure 18). The hub genes were predominantly expressed in immune cell populations. In AA, GZMA and EOMES were enriched in
Figure 18. Single-cell analysis. (a) Uniform Manifold Approximation and Projection (UMAP) displaying the clustering of various cell types. (b) Dot plot illustrating the expression levels of distinct marker genes across identified cell types.
Figure 20. Violin plots depicting the distribution and expression levels of hub genes in each cell type.
These findings outline a localized immune circuit in AA lesions: the hub genes are active in cytotoxic T cells and proliferating lymphocytes, suggesting a central cytotoxic T-cell loop driving AA pathology. By contrast, in AGA scalps the expression of these hub genes was minimal or unchanged (Figure 21). Neither EOMES nor GZMA was detected in the AGA single-cell data, and JAK–STAT pathway genes (JAK1/2/3, TYK2) showed no significant up- or downregulation. Thus, unlike AA, AGA did not exhibit a notable JAK–STAT immune activation signature at the single-cell level.
Figure 21. The expression of key genes and JAK family in AGA. (a) Box plots showing the expression levels of key genes between the Control and AGA groups (EOMES and GZMA were not exist in the AGA dataset). (b) Box plots depicting the expression levels of JAK1, JAK2, and TYK2 genes in control versus AGA groups (JAK3 transcripts were not detected in the AGA dataset).
To determine whether this cytotoxic/JAK axis is recapitulated in independent patient tissues, we next examined its expression in AA, AGA, and healthy scalp specimens. Compared with healthy controls, mRNA levels of EOMES, GZMA, IL2RB, and IL2RG were significantly upregulated in alopecia areata (AA) lesional scalp (all
Figure 22. RT-qPCR validation of the IL2RB/IL2RG–EOMES–GZMA cytotoxic/JAK axis in alopecia areata (AA). Relative mRNA levels of EOMES, GZMA, IL2RB, and IL2RG are significantly elevated in AA lesional scalp compared with healthy controls, whereas androgenetic alopecia (AGA) samples are comparable to controls, indicating that this cytotoxic/JAK module is transcriptionally active in AA but largely quiescent in AGA. Data are shown as mean
4 Discussion
The infiltration levels of 28 immune cell types in the training set were analyzed using the ssGSEA algorithm to compare differences between the disease group and the control group. It is important to note that the AGA dataset (GSE36169) comprised only five AGA samples and five controls, which substantially limits statistical power. This small sample size may lead to an underestimation of subtle AGA-specific transcriptomic changes, including immune or inflammatory signals that fall below our detection threshold. Accordingly, our conclusion that AGA lacks a comparable inflammatory JAK–STAT signature should be interpreted with caution and validated in larger, independent cohorts.
A significance threshold of normalized enrichment score (NES)
Four key genes identified as related to drugs was selected from the GeneCards database, their connections to therapeutic drugs were visualized using a Sankey diagram, as shown in Figure 24. The EOMES gene was found to have a significant association with glatiramer, while GZMA was closely linked to cyclosporine. In addition, IL2RB and IL2RG were found to correlate with four clinically significant drugs, namely, Basiliximab, Denileukin diftitox, Aldesleukin, and Nogapendekin alfa (Anktiva). In summary, these drugs play an important role in clinical practice, highlighting their therapeutic significance.
Beyond gene-level associations and drug–gene networks, we next sought to determine whether the key components of this cytotoxic/JAK axis are consistently annotated as immune effector molecules in independent functional-annotation resources. Cross-referencing the four hub genes and their upstream cytokines across several databases provided external validation of our computational findings. Public resources consistently annotate granzyme A (GZMA) as a cytotoxic serine protease expressed in activated
Protein expression changes paralleled the qPCR results: AA samples displayed markedly higher levels of EOMES, GZMA, IL2RB, and IL2RG proteins than healthy controls
Figure 25. Western blot validation of the IL2RB/IL2RG–EOMES–GZMA cytotoxic/JAK axis at the protein level. Protein expression of EOMES, GZMA, IL2RB, and IL2RG is markedly upregulated in AA lesional scalp compared with healthy controls, concordant with the RT-qPCR results, whereas AGA-affected scalp shows no significant difference relative to controls. Representative immunoblots and densitometric quantification (normalized to
Despite both causing hair loss, AA and AGA are driven by fundamentally different molecular mechanisms. Our multi-omics analysis, single-cell analysis, and wet-lab validation together delineated a clear split: AA lesions are transcriptionally co-dominated by hyperactive JAK–STAT signaling and a granzyme-rich cytotoxic
Unlike the conventional logistic nomogram, our diagnostic model uses a prior-informed few-shot learning approach. It first embeds each sample’s expression into pathway/cell-type–aligned latent variables via fixed MultiPLIER loadings (ensuring portability and interpretability in small-sample settings). It then performs a full support–query comparison using Hadamard-product similarity and permutation-invariant aggregation to learn a class relation score (“learning to compare”), and finally outputs the AA vs. control prediction through a shallow classifier. Our framework embodies three key advantages: (1) By leveraging a pathway-aligned latent space (MultiPLIER priors), the model remains robust in small-sample settings and its features have direct biological meaning, enhancing interpretability. (2) The few-shot, relation-based approach (“learning to compare”) is well-suited to situations with limited cohorts yet a need to generalize, aligning with Healthcare 4.0’s emphasis on data-efficient AI. (3) The model’s risk output is easily translated into a familiar nomogram-style score (after calibration), with each gene’s contribution explained via additive SHAP values. This transparency ensures the predictions are clinically interpretable, facilitating trust and communication between AI, clinicians, and patients. Collectively, these features illustrate a paradigm of XAI 2.0: our framework balances high accuracy with transparent, context-sensitive reasoning, thereby fostering greater trust in AI-driven clinical decisions. Additionally, we evaluated our model with ROC curves and decision curve analysis to facilitate fair comparison with traditional prediction models, and it demonstrated superior performance and net benefit under these metrics.
Multi-perspective explanations for different stakeholders: Our framework’s interpretability can be tailored to various end-users in the clinical setting. For clinicians, the nomogram-style scorecard provides a transparent decision aid, linking gene expression to risk on an intuitive 0–100 scale. For patients, results can be translated into clear terms (e.g., “high immune activity gene levels indicate active disease, suggesting JAK inhibitor therapy may be effective”), enhancing understanding and shared decision-making. For research scientists, the model’s latent features and SHAP attributions supply mechanistic insights by mapping gene contributions to pathways and cell types. Notably, the MultiPLIER latent variables with the highest re-weighting coefficients correspond to the key pathways identified in AA: for example, one top latent factor is annotated with a cytotoxic T-cell signature (enriched in genes such as GZMA and EOMES), and another is associated with cytokine/JAK–STAT signaling (enriched in IL2RB, IL2RG, etc.). These highly weighted latent dimensions align with the JAK–STAT hyperactivation and cytotoxic T-cell activity driving AA, demonstrating that our model’s predictions are grounded in biologically interpretable features. This multi-role adaptability ensures the AI’s explanations are context-sensitive—providing the right depth of information to the right audience—thereby building trust and accountability in a Healthcare 4.0 environment.
At the mechanistic level, our findings identify JAK-STAT activity and cytotoxic T cell signaling as the two pathways most enriched in AA, positioning them within the same EOMES+ CD8+ cell cluster. This supports the functional axis “JAK-STAT
5 Conclusion
In summary, our multi-layered analyses identified a four-gene cytotoxic loop (GZMA, IL2RB, IL2RG, EOMES) driving pathological JAK–STAT hyperactivation in AA, whereas AGA remains governed by androgenic signaling with minimal JAK–STAT involvement. Multi-database functional annotation and independent RT-qPCR and Western blot assays convergently confirmed that this IL2RB/IL2RG–EOMES–GZMA axis is selectively activated in AA at both the transcript and protein levels, providing orthogonal support for its use as a mechanistic biomarker and therapeutic target. This mechanistic split sharpens the molecular boundary between AA and AGA and maps naturally onto therapeutic strategy: AA benefits from pathway-directed immunomodulation, now clinically validated with JAK inhibitors such as baricitinib and ritlecitinib, and suggests upstream (IL-2 receptor) or downstream (cytotoxic effector) nodes as complementary targets; AGA, by contrast, remains best addressed with androgen suppression (finasteride) and minoxidil, with Wnt/
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Ethics statement
The studies involving humans were approved by Ethics Committee of the First Affiliated Hospital of Army Medical University of the Chinese People’s Liberation Army. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
NY: Writing – review and editing, Writing – original draft, Funding acquisition, Conceptualization, Validation, Methodology. LR: Investigation, Formal Analysis, Data curation, Writing – review and editing. XG: Software, Writing – review and editing, Formal Analysis, Methodology. JT: Investigation, Data curation, Visualization, Writing – review and editing. SL: Project administration, Supervision, Writing – review and editing, Conceptualization. ZS: Supervision, Conceptualization, Writing – review and editing, Project administration.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the National Nature Science Foundation of China (Grant no. 82103761).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2025.1753206/full#supplementary-material
References
Anzai, A., Wang, E. H. C., Lee, E. Y., Aoki, V., and Christiano, A. M. (2019). Pathomechanisms of immune-mediated alopecia. Int. Immunol. 31, 439–447. doi:10.1093/intimm/dxz039
Aran, D., Looney, A. P., Liu, L., Wu, E., Fong, V., Hsu, A., et al. (2019). Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat. Immunol. 20, 163–172. doi:10.1038/s41590-018-0276-y
Bodinier, B., Filippi, S., Nøst, T. H., Chiquet, J., and Chadeau-Hyam, M. (2023). Automated calibration for stability selection in penalised regression and graphical models. J. R. Stat. Soc. Ser. C Appl. Statistics 72, 1375–1393. doi:10.1093/jrsssc/qlad058
Brunson, J. C. (2020). Ggalluvial: layered grammar for alluvial plots. J. Open Source Softw. 5, 2017. doi:10.21105/joss.02017
Butler, A., Hoffman, P., Smibert, P., Papalexi, E., and Satija, R. (2018). Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420. doi:10.1038/nbt.4096
Clough, E., and Barrett, T. (2016). “The gene expression omnibus database,” in Statistical genomics: methods and protocols (Springer), 93–110.
Dai, Z., Chen, J., Chang, Y., and Christiano, A. M. (2021). Selective inhibition of jak3 signaling is sufficient to reverse alopecia areata. JCI Insight 6, e142205. doi:10.1172/jci.insight.142205
Gonul, M., Cemil, B. C., Ayvaz, H. H., Cankurtaran, E., Ergin, C., and Gurel, M. S. (2018). Comparison of quality of life in patients with androgenetic alopecia and alopecia areata. An. Bras. Dermatol. 93, 651–658. doi:10.1590/abd1806-4841.20186131
Guo, Y., Luo, L., Zhu, J., and Li, C. (2023). Multi-omics research strategies for psoriasis and atopic dermatitis. Int. J. Mol. Sci. 24, 8018. doi:10.3390/ijms24098018
Guyon, I., Weston, J., Barnhill, S., and Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422. doi:10.1023/a:1012487302797
Hänzelmann, S., Castelo, R., and Guinney, J. (2013). Gsva: gene set variation analysis for microarray and rna-seq data. BMC Bioinform. 14 (7), 7. doi:10.1186/1471-2105-14-7
Hassan, A. S., Al-Dhoun, M. Q., Shaker, O. G., and AlOrbani, A. M. (2022). Expression of signal transducer and activator of transcription-3 in androgenetic alopecia: a case-control study. Skin Pharmacol. Physiol. 35, 278–281. doi:10.1159/000525532
Haughton, R. D., Herbert, S. M., Ji-Xu, A., Downing, L., Raychaudhuri, S. P., and Maverakis, E. (2023). Janus kinase inhibitors for alopecia areata: a narrative review. Ind. J. Dermatol. Venereol. Leprol. 89, 799–806. doi:10.25259/IJDVL_1093_2022
Hu, J., and Szymczak, S. (2023). A review on longitudinal data analysis with random forest. Briefings Bioinform. 24, bbad002. doi:10.1093/bib/bbad002
Langfelder, P., and Horvath, S. (2008). Wgcna: an r package for weighted correlation network analysis. BMC Bioinform. 9, 559. doi:10.1186/1471-2105-9-559
Li, L., Lu, J., Liu, J., Wu, J., Zhang, X., Meng, Y., et al. (2024). Immune cells in the epithelial immune microenvironment of psoriasis: emerging therapeutic targets. Front. Immunol. 14, 1340677. doi:10.3389/fimmu.2023.1340677
Lin, X., Yang, F., Zhou, L., Yin, P., Kong, H., Xing, W., et al. (2012). A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information. J. Chromatography B 910, 149–155. doi:10.1016/j.jchromb.2012.05.020
Narala, S., Li, S. Q., Klimas, N. K., and Patel, A. B. (2021). Application of least absolute shrinkage and selection operator logistic regression for the histopathological comparison of chondrodermatitis nodularis helicis and hyperplastic actinic keratosis. J. Cutaneous Pathol. 48, 739–744. doi:10.1111/cup.13931
Ocampo-Garza, J., Griggs, J., and Tosti, A. (2019). New drugs under investigation for the treatment of alopecias. Expert Opin. Investig. Drugs 28, 275–284. doi:10.1080/13543784.2019.1568989
Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). limma powers differential expression analyses for rna-sequencing and microarray studies. Nucleic Acids Res. 43, e47. doi:10.1093/nar/gkv007
Sardana, K., Gupta, A., and Gautam, R. K. (2018). Recalcitrant alopecia areata responsive to leflunomide and anthralin—potentially undiscovered jak/stat inhibitors? Pediatr. Dermatol. 35, 856–858. doi:10.1111/pde.13688
Sennett, M. L., Agak, G. W., Thiboutot, D. M., and Nelson, A. M. (2024). Transcriptomic analyses predict enhanced metabolic activity and therapeutic potential of mtor inhibitors in acne-prone skin. JID Innov. 4, 100306. doi:10.1016/j.xjidi.2024.100306
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi:10.1101/gr.1239303
Stelzer, G., Rosen, N., Plaschkes, I., Zimmerman, S., Twik, M., Fishilevich, S., et al. (2016). The genecards suite: from gene data mining to disease genome sequence analyses. Curr. Protocols Bioinform. 54, 1–30. doi:10.1002/cpbi.5
Strazzulla, L. C., Wang, E. H. C., Avila, L., Sicco, K. L., Brinster, N., Christiano, A. M., et al. (2018). Alopecia areata: disease characteristics, clinical evaluation, and new perspectives on pathogenesis. J. Am. Acad. Dermatol. 78, 1–12. doi:10.1016/j.jaad.2017.04.1141
Szklarczyk, D., Gable, A. L., Nastou, K. C., Lyon, D., Kirsch, R., Pyysalo, S., et al. (2021). The string database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49, D605–D612. doi:10.1093/nar/gkaa1074
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 58, 267–288. doi:10.1111/j.2517-6161.1996.tb02080.x
Wada-Irimada, M., Takahashi, T., Sekine, M., Okazaki, T., Takahashi, T., Chiba, T., et al. (2025). Predictive factors for treatment responses to baricitinib in severe alopecia areata: a retrospective, multivariate analysis of 70 cases from a single center. J. Dermatol. 52, 701–711. doi:10.1111/1346-8138.17641
Zhang, X., Lan, Y., Xu, J., Quan, F., Zhao, E., Deng, C., et al. (2019). Cellmarker: a manually curated resource of cell markers in human and mouse. Nucleic Acids Res. 47, D721–D728. doi:10.1093/nar/gky900
Zhang, R., Jin, W., and Wang, K. (2025). Glycolysis-driven prognostic model for acute myeloid leukemia: insights into the immune landscape and drug sensitivity. Biomedicines 13, 834. doi:10.3390/biomedicines13040834
Keywords: alopecia areata, few-shot learning, gene expression, MultiPLIER, permutation-invariant aggregation, relation network, transfer learning
Citation: Yu N, Ran L, Gong X, Teng J, Liu S and Song Z (2026) JAK-centric explainable few-shot gene-expression diagnosis framework for alopecia via MultiPLIER priors and relation-style set-to-set comparison. Front. Mol. Biosci. 12:1753206. doi: 10.3389/fmolb.2025.1753206
Received: 24 November 2025; Accepted: 16 December 2025;
Published: 12 January 2026.
Edited by:
Feng Gao, The Sixth Affiliated Hospital of Sun Yat-sen University, ChinaReviewed by:
Lantian Yao, Xiamen University, ChinaXiangzheng Fu, The Hong Kong Hainan Commercial Association, Hong Kong, SAR China
Copyright © 2026 Yu, Ran, Gong, Teng, Liu and Song. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shulei Liu, bGl1c2h1bGVpQHRtbXUuZWR1LmNu; Zhiqiang Song, ZHJzb25nenFAdG1tdS5lZHUuY24=
Junfei Teng4