Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Mol. Biosci., 12 January 2026

Sec. Molecular Diagnostics and Therapeutics

Volume 12 - 2025 | https://doi.org/10.3389/fmolb.2025.1753206

This article is part of the Research TopicMedical Knowledge-Assisted Machine Learning Technologies in Individualized Medicine Volume IIIView all 5 articles

JAK-centric explainable few-shot gene-expression diagnosis framework for alopecia via MultiPLIER priors and relation-style set-to-set comparison

Nanlan YuNanlan Yu1Ling RanLing Ran2Xinrong GongXinrong Gong3Junfei TengJunfei Teng4Shulei Liu
Shulei Liu4*Zhiqiang Song
Zhiqiang Song1*
  • 1Department of Dermatology, The First Affiliated Hospital (Southwest Hospital), Army Medical University, Chongqing, China
  • 2Chongqing Key Laboratory for the Mechanism and Intervention of Cancer Metastasis, Chongqing University Cancer Hospital, Chongqing University, Chongqing, China
  • 3School of Engineering, Huaqiao University, Quanzhou, China
  • 4Department of Dermatology of Jiangbei Campus, The First Affiliated Hospital of Army Medical University, Chongqing, China

Alopecia areata (AA) and androgenetic alopecia (AGA) both present with hair loss but require different therapies, and reliable biomarkers to guide treatment remain lacking. We integrated bulk and single-cell RNA-seq to compare JAK–STAT signaling in AA versus AGA. In AA, 257 immune-enriched differentially expressed genes (DEGs) were identified; WGCNA and consensus machine learning (LASSO, SVM-RFE, random forest) yielded six candidate hub genes, and external validation narrowed these to four key genes–granzyme A (GZMA), interleukin-2 receptor β (IL2RB) and γ (IL2RG) chains, and eomesodermin (EOMES). Building on these biology-anchored features, we introduced an interpretable few-shot deep learning classifier as an explainable AI alternative to a nomogram: bulk expression profiles are projected onto pathway/cell-type–aligned MultiPLIER latent variables (a frozen prior), the latent channels are re-weighted via element-wise multiplication with the expression levels of the key hub genes, and a Relation-style set-to-set comparator then aggregates support–query similarities (Hadamard mapping + permutation-invariant aggregation) before a shallow head predicts AA versus control. This prior-informed approach enables robust discrimination under limited sample conditions while retaining mechanistic interpretability, thereby exemplifying a next-generation XAI solution for small-cohort genomic diagnosis. Cross-database functional annotation and wet-lab validation (RT-qPCR and Western blot) in independent AA/AGA/healthy scalp samples confirmed that the IL2RB/IL2RG–EOMES–GZMA axis is selectively activated at both mRNA and protein levels in AA. Single-cell analysis localized GZMA to cytotoxic T cells and IL2RG to proliferating lymphocytes, outlining an EOMES+ CD8+ T-cell GZMA–IL2RB/IL2RG cytotoxic loop driving JAK–STAT hyperactivation in AA. Drug–gene network analysis linked these targets to JAK inhibitors and cyclosporine. AGA showed no comparable JAK–STAT perturbation, consistent with its androgen-centric biology. In summary, this four-gene loop provides a non-invasive AA biomarker and a tractable target for precision JAK blockade, while the proposed few-shot framework offers a general, prior-driven alternative to nomograms for transcriptomic diagnosis in small cohorts, illustrating an XAI-driven diagnostic approach for precision medicine.

1 Introduction

AA and AGA are the two most common hair loss disorders and have markedly different clinical features and epidemiological impacts (Zhou et al., 2021). AA presents as sudden-onset, non-scarring hair loss caused by breakdown of the hair follicle’s immune privilege and a cytotoxic T-cell-mediated autoimmune attack (Dai et al., 2021); its lifetime prevalence is 0.1%–2%, with considerable psychosocial burden (Strazzulla et al., 2018). By contrast, AGA is driven by androgen-dependent miniaturization of scalp hair follicles (especially in frontal and vertex regions) via androgen receptor signaling in dermal papilla cells, with only limited inflammatory involvement (Anzai et al., 2019). AA can progress to alopecia totalis or universalis, whereas AGA usually arises in adulthood and is epidemiologically linked to cardiometabolic comorbidities (Gonul et al., 2018). These distinct clinical courses and systemic associations underscore the need for disease-specific biomarkers and targeted therapies.

The pathophysiology of AA is driven by dysregulated activation of the JAK/STAT (Janus kinase/Signal Transducer and Activator of Transcription) signaling pathway (Dai et al., 2021; Haughton et al., 2023). A key downstream element is the IFN-γ/IL-15 (interferon-γ/interleukin-15) axis, which persistently recruits and sustains CD8+NKG2D+ cytotoxic T cells around anagen hair follicles, breaking immune privilege and inducing keratinocyte apoptosis, ultimately causing hair loss (Dai et al., 2021; Haughton et al., 2023). JAK inhibitors (e.g., baricitinib, tofacitinib) have shown significant hair regrowth in AA clinical trials by blocking STAT1/STAT3 phosphorylation and inhibiting effector T cell function (Wada-Irimada et al., 2025). In AGA, by contrast, pathogenesis is primarily driven by the dihydrotestosterone (DHT)–androgen receptor (AR) axis, which increases NADPH oxidase (NOX)-mediated oxidative stress and pro-apoptotic molecules (e.g., BAX), leading to follicular miniaturization and premature catagen entry, with far less inflammatory infiltration than AA. Although studies have noted elevated STAT3 in AGA scalps, it shows no strong correlation with clinical severity, and immune-targeted therapies have limited efficacy (Hassan et al., 2022). Thus, first-line AGA treatments remain anti-androgenic drugs (finasteride, minoxidil), underscoring fundamental differences in molecular pathophysiology and treatment strategy between AA and AGA (Ocampo-Garza et al., 2019).

To date, AA research has focused mainly on STAT-mediated cytokine signaling and related interventions, whereas AGA studies have largely been limited to androgen pathways or a few isolated molecular markers (Haughton et al., 2023; Wada-Irimada et al., 2025; Sardana et al., 2018). Significant knowledge gaps remain. First, there have been no comprehensive, side-by-side comparisons of the JAK-STAT signaling networks in AA versus AGA, leaving the immunological similarities and differences between the two diseases incompletely explained—AA’s pathology features excessive T cell-driven JAK-STAT activation, whereas AGA studies have historically under emphasized inflammatory pathways. Second, cross-disease multi-omics studies are scarce, impeding the identification of common versus disease-specific hub genes and hindering the development of precise diagnostics and targeted therapies (Ocampo-Garza et al., 2019; Sardana et al., 2018). Finally, in clinical translation, reliance on linear models (e.g., logistic regression nomograms) persists, yet such models perform poorly in small-cohort, high-dimensional transcriptomic settings–precisely where robust, mechanism-anchored representations and sample-efficient learning are most needed (Clough and Barrett, 2016; Ritchie et al., 2015). These limitations underscore the need for advanced machine learning frameworks that can operate on limited data while maintaining transparency. In other words, an explainable AI approach is required to meet these challenges in a modern Healthcare 4.0 context.

Clarifying these mechanistic differences is critical for developing precise interventions: for AA, immune-targeted therapies (e.g., JAK inhibition) should be prioritized, whereas for AGA, treatments should focus on androgen-dependent signaling. Clearly distinguishing the molecular features of the two disorders will not only accelerate the discovery of non-invasive biomarkers but also lay the groundwork for innovative, mechanism-based treatments. Accordingly, in this study we performed a multi-omics integrative analysis to systematically compare the JAK-STAT networks in AA and AGA, identify disease-specific hub genes, and evaluate their potential utility in early diagnosis and personalized treatment.

To meet these challenges, we integrated bulk and single-cell RNA-seq data to map JAK-STAT biology in AA and AGA, while retaining a biologically grounded feature-selection pipeline (DEGs WGCNA PPI/network prioritization consensus machine learning). Building on this foundation, we replaced the traditional nomogram with an interpretable few-shot deep learning framework, embodying an explainable AI approach, that (i) projects each bulk expression profile onto pathway/cell-type–aligned latent variables using a frozen MultiPLIER prior (Guo et al., 2023), (ii) amplifies AA-relevant signals via weighting latent channels by the hub gene set, and (iii) conducts a Relation Network–style set-to-set comparison between support and query samples (Hadamard similarity mapping with permutation-invariant aggregation) followed by a shallow classifier. This prior-driven, small-sample–oriented design preserves mechanistic interpretability while improving classification performance under limited sample conditions, effectively addressing the performance–interpretability trade-off emphasized in XAI 2.0 for Healthcare 4.0. Finally, we situate the AA-specific hub genes within an immune-cytotoxic JAK-STAT loop and contrast this mechanism against AGA’s androgen-centric, low-inflammation landscape, highlighting distinct pathogenic circuits in each disease.

Specifically, the primary contributions of this work are:

1. Prior-driven, interpretable representations for small cohorts. Bulk gene expression profiles are projected onto pathway- and cell type–aligned latent variables using a frozen MultiPLIER prior, and then these latent dimensions are re-weighted according to the expression of the consensus hub genes. This approach produces a mechanism-anchored embedding that reduces data requirements while preserving biological interpretability and context relevance.

2. Relation-style set-to-set comparison as a nomogram alternative. Instead of using a linear nomogram, our method performs full support–query matching via a Hadamard similarity map with permutation-invariant aggregation, followed by a shallow classifier (i.e., a Relation Network–style few-shot comparator tailored to transcriptomic data). This design is both parameter-efficient and sample-efficient for small cohorts, and its case-based matching mechanism offers an intuitive interpretation of predictions.

3. End-to-end, biology-grounded pipeline with multi-omics and experimental validation. We maintain the robust feature-selection backbone and couple it to the few-shot classifier, then validate the pipeline across bulk and single-cell data and further confirm AA-specific activation of the IL2RB/IL2RG–EOMES–GZMA axis at both mRNA and protein levels in independent scalp samples. This approach localizes the EOMES+ CD8+ cytotoxic loop driving JAK–STAT hyperactivation in AA, in contrast to AGA’s androgen-centric, low-inflammation landscape. In doing so, it “closes the loop” from mechanistic insight to diagnosis and provides a prior-informed explainable alternative to linear models for limited-sample settings.

2 Methods

2.1 Data sources

Transcriptomic sequencing data for AA and AGA were obtained from the Gene Expression Omnibus (GEO) database (Langfelder and Horvath, 2008). Specifically, the training set data for AA came from the GSE68801 dataset (60 AA samples and 36 normal samples), while the validation set data came from the GSE45512 dataset (five AA samples and five normal samples). Additionally, transcriptome data for AGA were obtained from the GSE36169 dataset (five AGA samples and five normal samples), and the single-cell RNA sequencing (scRNA-seq) data of AA was from GSE212447, which included five AA samples for subsequent analysis.

2.2 Data preprocessing

For GSE68801 (AA, GPL570), GSE45512 (AA validation, GPL570), and GSE36169 (AGA, GPL96), we obtained either raw CEL files or processed series matrix files from GEO. When CEL files were available, microarrays were processed using the RMA pipeline (background correction, quantile normalization, and probe summarization). For series matrices, we verified that expression values were already log2-transformed and quantile-normalized according to the original submitter’s methods. Probe IDs were mapped to HGNC gene symbols using the platform annotation, and if multiple probes mapped to the same gene, their values were collapsed by taking the median. To avoid label leakage, only lesional AA samples and healthy controls from GSE68801 were used for training (non-lesional AA samples were excluded).

2.3 Key gene selection workflow

2.3.1 Differential expression analysis

Differential Expression Analysis: We used the limma package (v3.54.2) in R to identify differentially expressed genes (DEGs) between AA lesional and control samples in GSE68801 (Li et al., 2024). Genes with p<0.05 and |log2 fold change|>0.5 were considered significantly differentially expressed. A volcano plot and heatmap were then generated to visualize the DEGs’ distribution patterns.

2.3.2 Weighted gene co-expression network analysis (WGCNA)

Weighted Gene Co-expression Network Analysis (WGCNA): We applied WGCNA (v1.72.1) Szklarczyk et al. (2021); Shannon et al. (2003) on the GSE68801 expression matrix (all samples) to identify gene co-expression modules, using disease status (AA vs. control) as the trait. To ensure reliable results, we first performed sample clustering to check data correlations and remove any outlier samples. We then chose a soft-threshold power based on the scale-free topology criterion to optimize the network’s fit, and used it to construct a gene co-expression network with hierarchical clustering, yielding distinct gene modules. Next, Pearson correlation analysis was conducted between each module’s eigenexpression and the disease trait, and the modules with the strongest positive and negative correlations with AA were identified. The genes from these highly associated modules were selected for further investigation.

2.3.3 Functional enrichment analysis

Functional Enrichment Analysis: We performed functional enrichment analysis on the intersecting gene set using the R package clusterProfiler (v4.6.2) (Bodinier et al., 2023; Narala et al., 2021). This included Gene Ontology (GO) enrichment—covering the three GO categories of cellular component (CC), biological process (BP), and molecular function (MF)—as well as Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The enrichment results for GO and KEGG were visualized with appropriate plots.

2.3.4 Constructing PPI network based on the correlation analysis of the JAK family genes

First, correlation analysis was performed between specific gene expression profiles and JAK family genes, with Pearson correlation coefficients and p values used to assess linear relationships. Genes meeting the screening criteria (|correlation coefficient|> 0.6 and p < 0.05) were retained. A protein-protein interaction (PPI) network was then constructed using the STRING database and visualized with Cytoscape (V 3.9.1) (Szklarczyk et al., 2021; Shannon et al., 2003). The top 20 hub genes were predicted using five topological analysis algorithms from the cytoHubba plugin, and the intersection analysis results were presented using an UpSet plot.

2.3.5 Consensus machine-learning feature selection

To enhance the accuracy and reproducibility of high-dimensional transcriptomic feature selection in this study, three classical machine learning algorithms least absolute shrinkage and selection operator(LASSO) regression, support vector machine recursive feature elimination (SVM-RFE), and random forest (RF) were simultaneously applied on the basis of 20 candidate genes to precisely identify the core biomarkers. LASSO was implemented using glmnet (V 4.1-8) (Bodinier et al., 2023), which automatically optimized the penalty coefficient through 10-fold cross-validation. L1 regularization was employed to compress the coefficients of non-critical features to zero, facilitating gene screening (Narala et al., 2021); SVM-RFE was constructed using e1071 (V 1.7-13) (Lin et al., 2012), eliminating the least contributing features after each round of training, and retaining the gene subset with the strongest predictive ability.RF generates 500 decision trees using randomForest (V 4.7-1.2) (Guyon et al., 2002), quantifies feature importance through out-of-bag error evaluation and Gini gain ranking, and effectively suppresses overfitting through an ensemble voting mechanism (Hu and Szymczak, 2023). Finally, we performed a Venn intersection of the feature gene lists, selecting the overlapping genes as core candidates. Details of the three machine learning algorithms are as follows:

2.3.5.1 LASSO (L1-regularized logistic regression)

In a binary (AA vs. control) logit model, incorporating L1 regularization simultaneously achieves coefficient shrinkage and variable selection (shrinking the regression coefficients of unimportant features to 0) (Tibshirani, 1996).

Model and Objective Function formula is given in Equation 1 (Binary Classification, yi{0,1}):

β̂=argminβRpi=1nyilogσxiβ+1yilog1σxiβ+λβ1,σz=11+ez(1)

where λ0 is a sparsity control parameter; the larger λ is, the fewer features are retained. The optimization uses coordinate descent and solves along the regularization path, and λ is selected by K-fold cross-validation (such as λmin or λ1se).

2.3.5.2 SVM-RFE (support vector machine recursive feature elimination)

In linearly separable or approximately linearly separable settings, the weight vector w of linear SVM directly characterizes the contribution of each feature to the classification hyperplane; RFE iteratively eliminates the feature with the smallest weight to obtain a compact and discriminative subset (Guyon et al., 2002).

Linear SVM (Soft Margin) Objective formula is given in Equation 2:

minw,b,ξ12w22+Ci=1nξi s.t. yiwxi+b1ξi,ξi0(2)

where yi{1,+1}. After training, the weight can be written as w=iαiyixi (αi is the dual variable). The RFE ranking criterion adopts the square of the per-dimensional weight shown in Equation 3:

rankj=wj2(3)

In each round, several features with the smallest rank(j) are removed, and the SVM is retrained until the preset number of features is reached; the optimal subset size and penalty parameter C are selected through nested cross-validation.

2.3.5.3 Random forest (RF)

RF generates an ensemble of trees through bootstrapping (bagging) and random subset feature splitting, which is suitable for nonlinear and high-order interaction scenarios; two types of importance metrics can be used to screen features (Breiman, 2001). Let the training set be {(xi,yi)}i=1n, where xiRp is a gene expression vector (with genes as features), and yi{0,1} represents AA(1) vs. healthy(0). RF consists of T randomized CART trees: each tree is trained on a bootstrapped sample S(t), and each node only selects the optimal split within a random subset of features (with size mtry); classification predictions are aggregated by voting/probability averaging presentes in Equation 4:

p̂1x=1Tt=1Tp̂t,1x,ŷx=1p̂1x0.5(4)

Node splitting (Gini criterion). Let the empirical class proportions at node t be pk,t(k{0,1}), and the Gini impurity is given in Equation 5:

iGt=1pk,t2(5)

The impurity decrease for the candidate split s:t{tL,tR} is shown as follows Equation 6

Δis,t=iGtntLntiGtLntRntiGtR(6)

Select the split with the largest Δi (entropy can also be used as an alternative).

Out-of-Bag (OOB) Error and Evaluation. If sample i is not drawn into the t-th tree, denote tTioob. Its OOB prediction and OOB error rate are given in Equation 7

ŷioob=argmaxk1TioobtTioob1ŷtxi=k, OOB-Err =1ni=1n1ŷioobyi(7)

For all nodes u split by gene j, accumulate the impurity decrease and weight it by the sample weight p(u)=nu/n of samples reaching that node; then take the average over the forest is given in Equation 8:

MDIj=1Tt=1Tut split on jpuΔiu(8)

MDI can quickly provide global rankings, but may produce bias when dealing with “different value ranges/number of categories” or highly correlated features.

First, obtain the original score eorig (such as accuracy/error rate) on the OOB samples. Then, randomly permute the values of the j-th gene to get eperm,j, and define as follows Equation 9:

FIj=eperm ,jeorig FIj=eperm ,jeorig 1(9)

the larger FIj is, the more crucial gene j is; conditional permutation can also be used to alleviate correlation bias.

2.3.6 Identification of gene expression levels between normal and patient samples

We next evaluated how well the candidate feature genes distinguish AA from normal samples. We examined each feature gene’s expression in both the training and validation sets and plotted receiver operating characteristic (ROC) curves using the pROC package (v1.18.4) (Zhang et al., 2025). Genes that were differentially expressed in AA vs. controls with consistent direction in both sets and had an area-under-curve (AUC) >0.7 were deemed key diagnostic genes for subsequent analysis.

2.3.7 Expression of key genes in AGA

In order to systematically investigate their expression patterns, a Comprehensive differential expression analysis of key genes was conducted in the AGA dataset. Particular emphasis was placed on the JAK family genes (JAK1, JAK2, JAK3, and TYK2), then the differential expression of these genes in AGA was analyzed to clarify their potential mechanisms in this disease.

2.4 Prior-informed few-shot diagnostic model

We replaced the conventional nomogram with a Relation Network–style few-shot classifier leveraging MultiPLIER priors (Figure 1). In our model, the bulk expression matrix X (genes × samples) is first projected onto latent variables (LVs) aligned with known pathways and cell types using a fixed MultiPLIER loading matrix. The LVs corresponding to AA-relevant signals are then re-weighted based on the expression of the hub genes identified by our consensus pipeline. Next, a two-branch (support and query) encoder computes pairwise Hadamard product similarity vectors between each query and all support samples; these similarity vectors are fed into a permutation-invariant relation module and aggregated, and finally a shallow linear classifier outputs the prediction (AA or control). This design embodies the “learning to compare” concept of Relation Networks while enforcing set-level (support set) invariance. Crucially, by using a fixed biologically-informed latent space, our architecture keeps the model’s decisions transparent and mechanism-grounded–aligning with emerging XAI 2.0 principles in healthcare that demand context-rich explanations without sacrificing performance.

Figure 1
A flowchart of a machine learning model for gene prediction includes support and query sets, embedded into prior knowledge. A similarity function directs to an attention matrix featuring encoding of a gene load matrix and key gene. This process influences weighted predictions through a network culminating in a softmax layer for final prediction output.

Figure 1. Prior-informed relation-style few-shot diagnostic model for AA.

Step 1: Projection onto MultiPLIER Latent Space. We project the gene expression matrix X into a lower-dimensional latent space defined by a public MultiPLIER gene-by-LV loading matrix ZRG×L (pre-trained on >70,000 human RNA-seq samples). Specifically in formula Equation 10, we solve a ridge regression to estimate the latent variable activity matrix BRL×N:

B*=argminBXZBF2+λBF2=ZZ+λI1ZX.(10)

Where, Z is kept fixed (no fine-tuning) to preserve its biological alignment, which improves interpretability and prevents overfitting in small-sample settings.

Step 2: Key-gene-guided channel re-weighting.

Let G* be the final hub-gene set from the consensus feature selection. We compute an LV-wise weight vector αRL from the MultiPLIER loadings equation is given in Equation 11:

α=1|G*|gG*|Zg|,B̃=diagα̂B(11)

where α̂=α/max(α) (or α/α1) normalizes magnitudes. This channel gating prioritizes LVs heavily loaded by hub genes while retaining the full prior space; B̃ is forwarded to the encoder. (The use of a fixed prior Z keeps interpretability at the pathway/cell-type level).

Step 3: Two-Branch Encoder & Hadamard Similarity Mapping. We use a lightweight neural encoder fθ:RLRd to embed each latent variable vector into a d-dimensional feature space. This encoder is shared between the support set S (AA and control reference samples) and the query sample(s). Denote the encoded embeddings as hs=fθ(b̃s) for each support sS, and hq=fθ(b̃q) for a query qQ. For a given query q, we then compute its similarity to each support sample s via the Hadamard product (element-wise multiplication) of their embeddings shown in Equation 12:

mq,s=hqhsRd,(12)

where denotes element-wise multiplication. This simple yet expressive operation produces a d-dimensional similarity vector mq,s for each query–support pair, capturing pairwise feature interactions and serving as a learnable matching mechanism in few-shot frameworks.

Step 4: Permutation-Invariant Set Aggregation. We aggregate the query–support similarity vectors across the entire support set in a permutation-invariant way (insensitive to the order of support samples). In practice, we use a Deep Sets–style relation module (The formula is shown below) that transforms and sums the similarity vectors for each class:

rq,c=ρψsScϕϕmq,s,c0,1,

where ϕϕ and ρψ are small neural networks (e.g., multi-layer perceptrons or 1× 1 convolutions). This defines a symmetric set function over the support samples and serves as a simple Relation Network–style comparator. (An attention-based Set Transformer could be used in place of the sum aggregator.) The resulting class scores rq,c are passed into a linear output layer with softmax to produce the final AA vs. control prediction.

Step 5: Training and Inference Procedure. We train the model in an episodic 2-way K-shot manner (two classes, K support samples per class, and a set of query samples per episode). During training, the model’s parameters are learned by minimizing the cross-entropy loss on the query predictions. For each episode, with true query label indicators yq,c, the loss formula is given in Equation 13:

L=qQc0,1yq,clogexprq,c/τcexprq,c/τ,(13)

where τ is an optional temperature hyperparameter. We apply standard L2 weight decay and dropout regularization in the encoder fθ and relation module (ϕϕ,ρψ). The model is optimized using the Adam optimizer, with early stopping based on validation AUC. Inference: To make a prediction on a new sample, we form a support reference set from the training data and compute rq,c for the query; the query is then assigned to the class with the higher score (argmaxcrq,c).

We used a MultiPLIER loading matrix with L = 987 latent variables. The encoder fθ was a 2-layer fully connected network with 128 and 64 hidden units respectively, ReLU activation, and a dropout rate of 0.3. During training, we employed a 2-way 5-shot episodic design: each episode included 5 support samples per class (AA vs. control) and 5 query samples. The model was trained for 1,000 episodes with early stopping based on validation AUC stagnation for 10 episodes. For calibration, we compared Platt scaling and isotonic regression and adopted isotonic regression due to a lower cross-validated Brier score.

Instead of fitting a traditional logistic-regression nomogram, we trained our prior-informed few-shot model and took its raw output logit s(x) (pre-softmax score) as a risk score for AA. We then calibrated these scores to probabilistic risk estimates by learning a monotonic mapping on the training data—comparing Platt scaling (sigmoid calibration) and isotonic regression—and choosing the method with lower cross-validated Brier score. Using the selected calibration, we constructed a nomogram-style score chart: we computed the contribution of each latent feature to s(x) using SHAP (SHapley Additive exPlanation) values with respect to the post-weighting LV inputs, aggregated these contributions for each of the four hub genes via the MultiPLIER loadings, and rescaled the gene contribution totals to a 0–100 point scale. The sum of points (“Total Points”) is converted to an AA risk probability by the calibrated mapping p̂(x)=Calibs(x). (Notably, SHAP values are additive on the log-odds scale and sum up to s(x).) We evaluated the model’s discrimination using ROC curves (AUC) in pROC, to allow direct comparison with prior analyses, and assessed clinical utility via decision curve analysis (DCA) using the rmda package (comparing the net benefit of our model against “treat-all” and “treat-none” strategies).

2.5 Immune infiltration analysis

Single-sample gene set enrichment analysis (ssGSEA) was employed to quantitatively assess the abundance of 28 immune cell types between the disease group and the control group in the training set. Then the infiltration levels of the two groups were compared using the two-sided Wilcoxon rank sum test. Cell types with p < 0.05 after FDR correction were considered significantly different. To investigate potential associations between immune cell populations and the candidate key genes, he Pearson correlation coefficient between the ssGSEA score and the expression level of core genes was computed within the training set.

2.6 Gene set variation analysis and GSEA analyses

To delineate differences in immune response pathway activities among distinct biological groups, we first obtained 153 immune-related gene sets with complete functional annotations from the ImmPort database. Using these gene sets, we performed gene set variation analysis (GSVA) on the training set data via the R package “GSVA” (v1.46.0) to evaluate the enrichment levels of immune response pathways across samples (Hänzelmann et al., 2013). GSVA is a nonparametric algorithm that transforms a gene expression matrix into a gene set activity score matrix, enabling quantitative assessment of pathway enrichment at the single-sample level. In this way, GSVA scores reflect the enrichment status of immune-related pathways for each sample and allow comparison of pathway activities between AA and control groups.

For mechanistic insights into the roles of key hub genes in disease pathogenesis, we additionally carried out gene set enrichment analysis (GSEA) based on Hallmark gene sets. The R package “msigdbr” (v10.0.1) was employed to retrieve Hallmark reference gene sets (Sennett et al., 2024). For each hub gene, samples were stratified into high- and low-expression groups according to the median expression of that gene, and classical GSEA was applied to identify differentially regulated pathways between the two groups and to dissect the associated signaling programs (Zhang et al., 2025). GSEA evaluates whether predefined gene sets exhibit statistically significant and concordant differences between two biological states and, compared with conventional single-gene analyses, is able to detect both subtle coordinated changes across many genes and pronounced alterations within smaller gene subsets, thereby enhancing biological interpretation.

2.7 Identification of drug candidates

GeneCards (Stelzer et al., 2016), a comprehensive gene database integrating genomic, transcriptomic, proteomic, genetic, and functional data from approximately 150 sources, served as a key resource for exploring gene-disease associations and drug interactions. Critical gene-drug relationships based on hub genes were extracted from GeneCards, with approved drugs screened and visualized using the R package “ggalluvial” (V 0.12.5) (Brunson, 2020).

2.8 Single-cell sequencing analysis

Single-cell data were preprocessed and normalized using the R package “Seurat” (V 5.1.0) (Butler et al., 2018) with parameters set to min.cells = 3 and min.features = 200, retaining only genes expressed in 3 cells and cells with 200 detected genes to construct the Seurat object for subsequent analysis. Quality control filtering ensured that each cell contained 200-7,500 detected genes (nFeature), < 25,000 total RNA molecules (nCount), and < 15% mitochondrial gene content (percent.mt). Dimensionality reduction was performed via the RunPCA function, followed by cell clustering using FindNeighbors and FindClusters. Cell cluster annotation integrated multiple approaches: (1) Cluster-specific marker genes identified by FindAllMarkers were cross-referenced with the CellMarker database (Zhang et al., 2019); (2) Predictive validation was conducted using the automated single-cell annotation tool SingleR (V 2.0.0) (Aran et al., 2019); (3) Final annotations were visualized via UMAP. Expression patterns of model genes across distinct cell populations were further analyzed.

Quality control excluded cells with <200 or >7500 detected genes, total RNA count >25,000, or mitochondrial content >15%. Doublet detection was performed using DoubletFinder, and flagged cells were removed. Batch correction across 5 AA patient samples was conducted using Seurat v5 integration with SCTransform normalization. PCA was run with 50 components, and the top 20 PCs were used for clustering (resolution =0.8) and UMAP embedding (n_neighbors=30, min_dist=0.3). Marker gene identification used the Wilcoxon rank-sum test with log2FC>0.25 and adjusted p<0.05.

2.9 Multi-database functional annotation of hub genes

Cross-referencing the four hub genes and their upstream cytokines across several independent functional–annotation and signaling databases provided external validation of our computational findings. Public resources consistently annotate granzyme A (GZMA) as a cytotoxic serine protease expressed in activated CD8+ T cells and natural killer cells, mediating granule-dependent target cell death and shaping inflammatory responses. Likewise, interleukin-2 (IL-2) and its receptor components are repeatedly described as central regulators of T-cell proliferation, survival, and differentiation, with established roles in driving effector and memory programs. In these databases, IL2RB and IL2RG encode the β and common γ chains shared by the IL-2/IL-15 receptor complexes, respectively, and are linked to JAK1/3 activation and downstream STAT signaling. EOMES is annotated as a T-box transcription factor that programs a cytotoxic transcriptional state in CD8+ T cells, characterized by high interferon-γ and granzyme expression.

Together, this multi-database functional annotation converges on a coherent picture: the IL2RB/IL2RG–EOMES–GZMA axis represents a well-established immune effector module, and our multi-omics pipeline has rediscovered this module as the core signature distinguishing AA from AGA. This convergence increases confidence in the biological plausibility and translational relevance of the identified targets, rather than indicating a model driven by spurious associations.

2.10 Biological validation in independent scalp samples

2.10.1 Human scalp samples and ethics

This study enrolled three groups of participants: (i) alopecia areata (AA) lesional scalp (n=10); (ii) androgenetic alopecia (AGA) affected scalp (n=10); and (iii) healthy control scalp (n=10). The protocol was approved by the Ethics Committee of the First Affiliated Hospital of Army Medical University (approval No. ky201977). All procedures were conducted in accordance with relevant guidelines and regulations. Written informed consent was obtained from all patients/participants prior to enrollment. Scalp tissues were obtained as residual specimens from the Department of Dermatology after therapeutic excision of perilesional scalp trauma, melanocytic nevi, or lipomas. Each specimen measured approximately 0.5×0.5cm to 0.5×1.0cm. Immediately after collection, samples were snap-frozen in liquid nitrogen and stored at 80°C until analysis.

2.10.2 Western blot (WB) analysis of hub proteins

Total protein was extracted from scalp tissues homogenized on ice using RIPA lysis buffer supplemented with PMSF (Beyotime, China). Protein concentrations were determined with a BCA assay kit (Beyotime, China). Equal amounts of protein were resolved by SDS–PAGE and transferred to PVDF membranes at 200 mA for 2 h. Membranes were blocked in 5% non-fat milk at room temperature for 2 h and incubated overnight at 4°C with primary antibodies against β-actin (1:4000; 20536-1-AP, Proteintech, China), EOMES (1:5000; 83945-5-RR, Proteintech, China), GZMA (1:500; 11288-1-AP, Proteintech, China), IL2RB (1:5000; 13602-1-AP, Proteintech, China), and IL2RG (1:500; 11409-1-AP, Proteintech, China). After TBST washes, membranes were incubated with HRP-conjugated secondary antibodies (Beyotime, China) for 2 h at room temperature and washed again with TBST. Signals were visualized by enhanced chemiluminescence.

2.10.3 Real-time quantitative polymerase chain reaction (RT-qPCR) analysis of hub genes

Total RNA was isolated using the RNASimple Total RNA Kit (TIANGEN, China). First-strand cDNA was synthesized with the MightyScript First-Strand cDNA Synthesis Mix (Sangon, China). qPCR was performed with 2× SG Fast qPCR Master Mix (Sangon, China) on a Bio-Rad CFX96 real-time PCR system (Bio-Rad, United States) under the following conditions: initial denaturation at 95°C for 3 min; 40 cycles of 95°C for 3 s and 60°C for 20 s for annealing/extension/data acquisition. Primer sequences are listed in Table 1.

Table 1
www.frontiersin.org

Table 1. The sequences of primers used for qRT-PCR detection.

3 Result

During differential expression analysis of the AA training set, a total of 257 differentially expressed genes (DEGs) were identified. Among these, 91 DEGs exhibited upregulated expression, while 166 DEGs showed downregulated expression. Based on these results, a volcano plot was generated to visualize the changes in gene expression, as shown in Figure 2a. Furthermore, the top 10 most DEGs from both the upregulated and downregulated groups were extracted, and a heatmap was plotted to examine their distribution patterns across samples, as shown in Figure 2b. Additionally, WGCNA was performed using the training set data, with disease status encoded as the phenotypic trait.

Figure 2
A composite image with two panels: (a) A volcano plot displaying gene expression data with labeled genes, indicating downregulated (green) and upregulated (red) genes against their p-values. (b) A heatmap depicting differential expression of genes, color-coded from red to green, with a density plot above showing distribution by group (AA and control). A color key indicates expression levels.

Figure 2. Screening of differentially expressed genes (DEGs) and identification of module genes in alopecia areata (AA). (a) Volcano plot and (b) Heatmap of DEGs across different groups (Control and AA).

During this process, six outlier samples (GSM1681988, GSM1681991, GSM1682055, GSM1682056, GSM1682057, and GSM1682101) were identified and excluded to ensure the accuracy and reliability of the analysis, as shown in Figure 3a. By setting a scale-free network assessment coefficient (R2 = 0.85) and an optimal soft threshold (β = 8), 12 distinct gene co-expression modules were successfully identified, as shown in Figures 3b, 4. Subsequently, based on a Pearson correlation threshold (p < 0.05; |cor|> 0.6), the modules exhibiting the most significant positive (red module) and negative (pink module) correlations with phenotypic traits were selected, from which 1,667 associated genes were determined, as shown in Figure 4b. These genes provide profound basic data for subsequent research.

Figure 3
Two diagrams depict sample clustering. The left diagram shows a dendrogram labeled

Figure 3. Screening of differentially expressed genes (DEGs) and identification of module genes in alopecia areata (AA). (a) Sample clustering tree diagram and correlation heatmap and (b) Scale independence and average connectivity of different soft threshold powers used in Weighted Gene Co-expression Network Analysis (WGCNA).

Figure 4
Cluster Tree Diagram and Module-trait Relationship Heatmap. The left graph shows scale independence and mean connectivity plotted against the soft threshold power. The right heatmap displays relationships between modules and traits, with color intensity indicating strength and direction of correlation, ranging from red to blue.

Figure 4. Screening of differentially expressed genes (DEGs) and identification of module genes in alopecia areata (AA). (a) Cluster tree diagram generated by WGCNA and (b) Module-trait relationship heatmap showing the correlation coefficients and significance values between modules and traits of interest.

By intersecting the DEGs with genes from the significant WGCNA modules, we obtained 228 overlapping genes (Figure 5a). We performed GO enrichment for these genes and visualized the top five categories in each GO domain (Figure 5b).

Figure 5
Two-part image: (a) A Venn diagram with two overlapping circles labeled

Figure 5. Functional enrichment and gene expression analysis. (a) Venn diagram showing the overlap between genes identified by DEGs and WGCNA and (b) Dot plot of Gene Ontology (GO) terms for cellular component (CC), biological process (BP), and molecular function (MF) enriched among DEGs.

For Biological Process (BP), the most enriched terms were intermediate filament organization, intermediate filament-based process, skin development, and epidermal development. For Cellular Component (CC), top terms included molting cycle, intermediate filaments (and intermediate filament cytoskeleton), keratin filaments, and external side of plasma membrane. For Molecular Function (MF), top terms included keratinization, structural constituent of skin epidermis, chemoattractant activity, chemoattractant receptor binding, cytokine activity, and CCR chemokine receptor binding. Additionally, KEGG pathway analysis identified nine significantly enriched pathways (Figure 6), including Staphylococcus aureus infection, viral protein–cytokine/chemokine interactions, cytokine–cytokine receptor interaction, estrogen signaling, chemokine signaling, cell adhesion molecules (CAMs), hematopoietic cell lineage, Th1 and Th2 cell differentiation, and primary immunodeficiency. These enrichment results provide a context-sensitive interpretation of the overlapping genes, suggesting they cluster in pathways relevant to AA pathology.

Figure 6
Scatter plot showing ten pathways on the y-axis against GeneRatio on the x-axis. Dot size represents count, ranging from four to sixteen. Color intensity, from blue to red, indicates the level of p.adjust, with blue being higher. The cytokine-cytokine receptor interaction pathway has the highest GeneRatio and count.

Figure 6. Dot plot of KEGG enrichment analysis pathway.

First, Pearson correlation analysis was used to pair all DEGs with the expression levels of JAK1, JAK2, JAK3, and TYK2 one by one and calculate the correlation coefficients. Subsequently, genes with statistically significant correlations with JAK family genes were screened for further study. The results showed 55 genes were positively or negatively correlated with at least one member of the JAK axis at a moderate-intensity level. Thus, they were included in the candidate list for subsequent functional verification. The correlation results are presented as a heatmap: colors range from light to dark, corresponding to correlation coefficients from weak to strong, and symbols in the squares indicate statistical significance, as shown in Figure 7.

Figure 7
Box plots on the left compare the expression levels of JAK1, JAK2, JAK3, and TYK2 between Control (blue) and AA (red) groups, with significance marked (ns, **, ****, *). A heatmap on the right shows the correlation of key genes, with color intensity representing Pearson's correlation coefficients.

Figure 7. Functional enrichment and gene expression analysis. (a) Box plots comparing the expression levels of Janus kinase (JAK) family genes (JAK1, JAK2, JAK3, TYK2) between control and AA groups and (b) Heatmap of the correlation matrix of key genes, with Pearson’s correlation coefficients shown. Nodes and edges highlight the relationships among JAK family genes and others.

Genes found to be strongly correlated with the JAK family underwent PPI network analysis utilizing STRING, with visualization achieved through Cytoscape. This network-based approach facilitated the identification of 10 core hub genes: cluster of differentiation 3D (CD3D), granzyme A (GZMA), interleukin 2 receptor subunit beta (IL2RB), interleukin 2 receptor subunit gamma (IL2RG), inducible T-cell costimulator (ICOS), STAT4, interleukin 10 receptor subunit alpha (IL10RA), eomesodermin (EOMES), interleukin-2-inducible T-cell kinase (ITK), and integrin subunit alpha L (ITGAL), as shown in Figures 8, 9.

Figure 8
Two protein-protein interaction (PPI) networks are shown. Network #1 displays a dense web of multicolored nodes and connecting lines, indicating numerous interactions. Network #2 features a structured arrangement of yellow and orange nodes with interconnecting lines, suggesting interaction variations.

Figure 8. Identification hub genes in AA. (a,b) PPI network showing the complex interactions between key genes.

Figure 9
Bar chart depicting intersection sizes of five sets labeled MCC, MNC, Degree, DMNC, and EPC. Bars show intersection sizes: 10, 8, 6, and three bars of size 1. Colored sections represent set sizes.

Figure 9. Upset plot of 10 intersecting genes obtained from five topology analysis algorithms (Maximum Clique Centrality (MCC), Maximum Neighborhood Component (MNC), Degree, Density of Maximum Neighborhood Component (DMNC), Edge Percolated Component (EPC)) in the cytoHubba plugin.

To rigorously refine the selection of key molecular drivers, three complementary machine learning algorithms were implemented on the training dataset. The LASSO regression model first highlighted six pivotal genes, namely, GZMA, IL2RB, IL2RG, STAT4, IL10RA, and EOMES, as shown in Figure 11a. Following this, SVM-RFE analysis delineated nine influential genes, including IL2RG, IL10RA, GZMA, ITGAL, CD3D, STAT4, EOMES, IL2RB, and ITK, as shown in Figure 10a. dditionally, the RF classifier, applying a MeanDecreaseGini threshold exceeding 3, pinpointed seven key genes: IL2RG, ITGAL, STAT4, IL2RB, EOMES, GZMA, and IL10RA, as shown in Figure 10b. By intersecting the outputs derived from these three distinct analytical frameworks, a robust consensus set of six feature genes: GZMA, IL2RB, IL2RG, STAT4, IL10RA, and EOMES was ultimately determined, as shown in Figure 11b. This integrative approach reinforced the credibility of the selected genes as pivotal components within the JAK-related molecular landscape.

Figure 10
Line chart (a) displays SVM cross-validation accuracy for different variables, showing a rising trend. Bar chart (b) illustrates variable importance with IL2RG ranked highest in Mean Decrease Gini.

Figure 10. Identification hub genes in AA. (a) The Support Vector Machine (SVM) line chart shows that when the model reaches its highest accuracy, nine genes are selected. (b) Bar chart of variable importance based on MeanDecreaseGini from Random Forest analysis, illustrating the most critical genes.

Figure 11
LASSO regression plot with two charts: the top displays binomial deviance versus log lambda, featuring a U-shaped curve with error bars. The lower chart shows coefficients versus log lambda, with lines in various colors. Adjacent is a Venn diagram with three overlapping circles labeled LASSO (red), RF (blue), and SVM-RFE (orange). The intersection shows shared elements, with numbers denoting quantities and percentages.

Figure 11. Identification hub genes in AA. (a) Least absolute shrinkage and selection operator (LASSO) regression plot displaying binomial deviance versus log lambda values to determine optimal gene selection. (b) Venn diagram representing the overlap of identified genes from LASSO, Random Forest (RF), and SVM-Recursive Feature Elimination (RFE).

Based on these six feature genes, an external validation cohort was further introduced for expression validation. Based on these six feature genes, an external validation cohort was further introduced for expression validation. Of the six candidate biomarkers identified by our machine-learning selection, four genes—GZMA, IL2RB, IL2RG, and EOMES—showed significant differential expression in an independent AA cohort, with consistent up/downregulation trends (Figure 12). Each of these four genes achieved an AUC >0.70 for distinguishing AA from controls (Figure 13).We therefore designated these four as the core biomarkers for our diagnostic model. Using the training set, we trained the few-shot model on these biomarkers and evaluated its performance. The model achieved high discriminatory power (AUC =0.945) as shown by ROC analysis, and decision curve analysis (Figure 14) indicated improved clinical net benefit of the model across a range of risk thresholds (outperforming “treat-all” and “treat-none” strategies).

Figure 12
Two box plot panels compare the differential expression of genes between control and AA groups. Panel (a) shows EOMES, GZMA, IL10RA, IL2RB, IL2RG, and STAT4 for the train set, with significant differences marked by asterisks. Panel (b) shows the same genes for the test set, with varying significance levels. Both plots use blue for control and red for AA groups.

Figure 12. Validation and predictive performance of feature genes in AA. (a) Differential expression of six core genes in the AA and control groups of the training set. (b) Differential expression of six core genes in the AA and control groups of the test set.

Figure 13
Two panels showing ROC curves for train and test sets. Panel (a) depicts the ROC curve for the train set with AUC values for EOMES (0.853), GZMA (0.889), IL2RB (0.85), and IL2RG (0.913). Panel (b) shows the ROC curve for the test set with AUC values for EOMES (0.92), GZMA (0.96), IL2RB (1.0), and IL2RG (0.96). Both graphs plot sensitivity against specificity with a diagonal reference line.

Figure 13. Validation and predictive performance of feature genes in AA. (a) Receiver operating characteristic (ROC) curve of six core genes in the AA and control groups of the training set. (b) ROC curve of six core genes in the AA and control groups of the test set.

Figure 14
Decision and ROC curves for model evaluation. The decision curve (left) shows net benefit versus high-risk threshold for models: model, EOMES, GZMA, IL2RB, IL2RG, All, and None. The ROC curve (right) illustrates sensitivity versus one minus specificity, with the model's AUC at 0.945, indicating high accuracy.

Figure 14. (a) Decision curve analysis illustrating the net benefit of the predictive model across different risk thresholds. (b) ROC curve for the model, with the area under the curve (AUC) indicating the model’s overall predictive power.

We analyzed immune cell infiltration in AA lesions versus controls. Out of 28 cell types, 22 showed significantly different abundance between AA and healthy scalps (p<0.05, Figure 15). In particular, AA lesions had elevated levels of multiple T cell subsets (e.g., activated CD4+ T, activated CD8+ T, effector memory T, regulatory T cells) and other immune cells (e.g., activated B cells, dendritic cells, macrophages, NK cells, MDSCs), compared to controls. Furthermore, the four hub genes were each strongly positively correlated (Pearson r, p<0.05) with a subset of these infiltrating immune cells. For instance, higher GZMA expression correlated with increased activated CD8+ T cells and effector memory CD8+ T cells; IL2RB and IL2RG levels correlated with T cell and NK cell subsets; and EOMES correlated with cytotoxic T cell levels (Figure 16). These findings suggest that AA is characterized by a broad immune cell infiltration, tightly linked to the identified hub genes.

Figure 15
Box plot comparing immune cell values between control (blue) and AA (red) groups. Each category of immune cells is represented on the x-axis, with values on the y-axis, showing statistical significance with asterisks for specific comparisons.

Figure 15. Immune cell infiltration analysis. Box plots comparing the infiltration levels of 28 immune cell types between the control and AA groups.

Figure 16
Correlation matrix illustrating cell type associations with gene expressions EOMES, GZMA, IL2RB, and IL2RG. Square color intensity represents the Pearson correlation coefficient, with orange lines indicating significant correlations and blue lines for negative correlations. A color scale is provided for Pearson's r, ranging between negative one and one.

Figure 16. Immune cell infiltration analysis. Correlation heatmap showing Pearson’s correlation coefficients between the infiltration levels of different immune cell types and hub genes. Orange lines denote significant positive correlations, while blue lines indicate significant negative correlations.

Gene set variation analysis (GSVA) of the training set revealed widespread immune pathway perturbations in AA. We identified 115 immune-related pathways with significant activity differences relative to controls (p<0.05): 109 pathways showed higher activity in AA, while 6 were lower. Figure 17 visualizes the five most extreme pathways in each category (upregulated, downregulated, and unchanged). Notably, AA lesions showed elevated activity in pathways related to innate and adaptive immunity—e.g., antibacterial innate immune response, leukocyte chemotaxis, T cell–mediated cytotoxicity, immune interactions between lymphoid and non-lymphoid cells, and adaptive immune responses involving antigen receptor recombination. In contrast, a few pathways were suppressed in AA, such as primary adaptive immune response, negative regulation of germinal center formation, MHC class II antigen presentation, advanced glycation end-product receptor signaling, and antiviral interferon-stimulated gene pathways. GSEA was performed using the HALLMARK gene set as the reference.

Figure 17
Bar graph illustrating immune system pathways with t-values of GSVA scores. Red bars indicate pathways with upregulation, green bars indicate downregulation, and gray bars show no significant change. A legend on the right specifies the color codes for up, down, and no significant change.

Figure 17. Bar plot showing the differential analysis results of gene set variation analysis (GSVA).

Together, these concordant annotations converge on a coherent picture: the IL2RB/IL2RG–EOMES–GZMA axis represents a well-established immune effector module rather than a spurious statistical pattern, and our multi-omics pipeline has rediscovered this module as the core signature distinguishing AA from AGA. This convergence increases confidence in the biological plausibility and translational relevance of the identified targets and supports their prioritization for pathway-guided immunomodulatory therapies.

Using single-cell RNA-seq from AA lesions (5 samples, 25,875 cells), we identified 12 distinct cell clusters (Figure 18). The hub genes were predominantly expressed in immune cell populations. In AA, GZMA and EOMES were enriched in CD8+ T cells, and IL2RB and IL2RG were notably expressed in proliferating T cells and other lymphocytes (Figures 19, 20).

Figure 18
Two-panel UMAP plots showing cell type distribution and expression levels. (a) Clustering of cell types with distinct colors and labels from 0 to 13. (b) Expression levels with labeled cell types like B cells, T cells, and macrophages. Both plots have axes labeled

Figure 18. Single-cell analysis. (a) Uniform Manifold Approximation and Projection (UMAP) displaying the clustering of various cell types. (b) Dot plot illustrating the expression levels of distinct marker genes across identified cell types.

Figure 19
Four UMAP scatter plots visualize gene expression levels for EOMES, GZMA, IL2RB, and IL2RG. Each plot shows data points in purple hues on a gray background, with intensity indicating gene expression levels, as per the color scales ranging from zero to four for GZMA and from zero to three for the others.

Figure 19. UMAP feature plots showing the expression of hub genes across different cell clusters.

Figure 20
Four violin plots depict gene expression levels for EOMES, GZMA, IL2RB, and IL2RG across various cell types including B cells, T cells, and macrophages. The expression level is shown on the y-axis and cell identity on the x-axis. Each plot highlights different patterns of gene expression, with notable peaks in certain cell types for IL2RB and IL2RG.

Figure 20. Violin plots depicting the distribution and expression levels of hub genes in each cell type.

These findings outline a localized immune circuit in AA lesions: the hub genes are active in cytotoxic T cells and proliferating lymphocytes, suggesting a central cytotoxic T-cell loop driving AA pathology. By contrast, in AGA scalps the expression of these hub genes was minimal or unchanged (Figure 21). Neither EOMES nor GZMA was detected in the AGA single-cell data, and JAK–STAT pathway genes (JAK1/2/3, TYK2) showed no significant up- or downregulation. Thus, unlike AA, AGA did not exhibit a notable JAK–STAT immune activation signature at the single-cell level.

Figure 21
Box plots comparing gene expression between Control and AGA groups. Panel (a) shows key genes IL2RB and IL2RG, with IL2RG having higher expression in AGA. Panel (b) shows other genes JAK1, JAK2, and TYK2, with no significant differences.

Figure 21. The expression of key genes and JAK family in AGA. (a) Box plots showing the expression levels of key genes between the Control and AGA groups (EOMES and GZMA were not exist in the AGA dataset). (b) Box plots depicting the expression levels of JAK1, JAK2, and TYK2 genes in control versus AGA groups (JAK3 transcripts were not detected in the AGA dataset).

To determine whether this cytotoxic/JAK axis is recapitulated in independent patient tissues, we next examined its expression in AA, AGA, and healthy scalp specimens. Compared with healthy controls, mRNA levels of EOMES, GZMA, IL2RB, and IL2RG were significantly upregulated in alopecia areata (AA) lesional scalp (all p<0.01), whereas androgenetic alopecia (AGA) scalp showed no significant increase relative to controls (all p>0.05). In AA, the transcription factor EOMES and the cytotoxic effector GZMA exhibited the most pronounced elevations; IL2RB/IL2RG were also consistently increased, indicating activation of the IL-2/IL-15–JAK1/3 axis (Figure 22). These AA-specific mRNA elevations are fully consistent with our microarray-based differential expression, external validation cohorts, and immune-infiltration analyses, and they further underscore that the IL2RB/IL2RG–EOMES–GZMA axis is transcriptionally active in AA but quiescent in AGA.

Figure 22
Bar graphs showing relative mRNA expression levels for EOMES, GZMA, IL2RB, and IL2RG, comparing control, AA, and AGA groups. AA shows significantly higher expression in EOMES and GZMA, while IL2RB and IL2RG display less differentiation. Statistical significance is indicated by asterisks, with

Figure 22. RT-qPCR validation of the IL2RB/IL2RG–EOMES–GZMA cytotoxic/JAK axis in alopecia areata (AA). Relative mRNA levels of EOMES, GZMA, IL2RB, and IL2RG are significantly elevated in AA lesional scalp compared with healthy controls, whereas androgenetic alopecia (AGA) samples are comparable to controls, indicating that this cytotoxic/JAK module is transcriptionally active in AA but largely quiescent in AGA. Data are shown as mean ± SD. Statistical annotations: ** p<0.01 vs. control; n.s., not significant (AGA vs. control).

4 Discussion

The infiltration levels of 28 immune cell types in the training set were analyzed using the ssGSEA algorithm to compare differences between the disease group and the control group. It is important to note that the AGA dataset (GSE36169) comprised only five AGA samples and five controls, which substantially limits statistical power. This small sample size may lead to an underestimation of subtle AGA-specific transcriptomic changes, including immune or inflammatory signals that fall below our detection threshold. Accordingly, our conclusion that AGA lacks a comparable inflammatory JAK–STAT signature should be interpreted with caution and validated in larger, independent cohorts.

A significance threshold of normalized enrichment score (NES) > 1 and p < 0.05 was applied, and the top five significantly enriched pathways were visualized. We also performed single-gene gene set enrichment analysis (GSEA) for each hub gene using Hallmark gene sets. The top enriched pathways linked to the hub genes included graft rejection, IL6–JAK–STAT3 signaling, inflammatory response, interferon alpha response, and interferon gamma response (Figure 23). These enrichment patterns further underscore the central role of JAK–STAT signaling and broad immune activation in AA’s gene expression profile.

Figure 23
Four line graphs labeled (a) to (d) show enrichment scores versus rank in an ordered dataset. Each graph includes lines for five hallmark responses: allograft rejection, IL-2 STAT5 signaling, inflammatory response, interferon alpha, and gamma responses. Each line displays a distinct color with a consistent pattern across the graphs. The plots show a peak followed by a gradual decline, and below each graph are colorful markers representing residuals.

Figure 23. GSEA plots for key genes. (a) EOMES. (b) GZMA. (c) IL2RB. (d) IL2RG.

Four key genes identified as related to drugs was selected from the GeneCards database, their connections to therapeutic drugs were visualized using a Sankey diagram, as shown in Figure 24. The EOMES gene was found to have a significant association with glatiramer, while GZMA was closely linked to cyclosporine. In addition, IL2RB and IL2RG were found to correlate with four clinically significant drugs, namely, Basiliximab, Denileukin diftitox, Aldesleukin, and Nogapendekin alfa (Anktiva). In summary, these drugs play an important role in clinical practice, highlighting their therapeutic significance.

Figure 24
Sankey diagram illustrating connections between genes and drugs. On the left, genes EOMES, GZMA, IL2RB, and IL2RG are listed; on the right, drugs Aldesleukin, Basiliximab, Cyclosporine, Denileukin diftitox, Glatiramer, and Nogapendekin alfa are shown. Colored flows represent associations between them.

Figure 24. Sankey diagram of hub genes and corresponding drugs.

Beyond gene-level associations and drug–gene networks, we next sought to determine whether the key components of this cytotoxic/JAK axis are consistently annotated as immune effector molecules in independent functional-annotation resources. Cross-referencing the four hub genes and their upstream cytokines across several databases provided external validation of our computational findings. Public resources consistently annotate granzyme A (GZMA) as a cytotoxic serine protease expressed in activated CD8+ T cells and natural killer cells, mediating granule-dependent target cell death and shaping inflammatory responses. Likewise, interleukin-2 (IL-2) and its receptor components are repeatedly described as central regulators of T-cell proliferation, survival, and differentiation, with established roles in driving effector and memory programs. In these databases, IL2RB and IL2RG encode the β and common γ chains shared by the IL-2/IL-15 receptor complexes, respectively, and are linked to JAK1/3 activation and downstream STAT signaling. EOMES is annotated as a T-box transcription factor that programs a cytotoxic transcriptional state in CD8+ T cells, characterized by high interferon-γ and granzyme expression.

Protein expression changes paralleled the qPCR results: AA samples displayed markedly higher levels of EOMES, GZMA, IL2RB, and IL2RG proteins than healthy controls (p<0.01), while AGA did not differ significantly from controls. In AGA, none of the targets exceeded control levels; in some cohorts, EOMES/GZMA signals approached or fell below the detection threshold (Figure 25), arguing against an AA-like cytotoxic–JAK loop in AGA. Taken together, the concordant mRNA and protein-level changes in AA, contrasted with the flat expression profiles in AGA, provide orthogonal wet-lab validation that the cytotoxic/JAK axis identified in silico is a genuine, disease-specific effector module rather than a computational artifact.

Figure 25
Western blot and bar graphs show protein expression levels of EOMES, GZMA, IL2RB, and IL2RG across different sample groups: Control, AA, and AGA. β-actin is used as a loading control. The bar charts indicate protein fold changes, with significant differences marked by asterisks, and

Figure 25. Western blot validation of the IL2RB/IL2RG–EOMES–GZMA cytotoxic/JAK axis at the protein level. Protein expression of EOMES, GZMA, IL2RB, and IL2RG is markedly upregulated in AA lesional scalp compared with healthy controls, concordant with the RT-qPCR results, whereas AGA-affected scalp shows no significant difference relative to controls. Representative immunoblots and densitometric quantification (normalized to β-actin) are shown. Data are presented as mean ± SD. Statistical annotations: ** p<0.01 vs. control; n.s., not significant (AGA vs. control).

Despite both causing hair loss, AA and AGA are driven by fundamentally different molecular mechanisms. Our multi-omics analysis, single-cell analysis, and wet-lab validation together delineated a clear split: AA lesions are transcriptionally co-dominated by hyperactive JAK–STAT signaling and a granzyme-rich cytotoxic CD8+ T-cell infiltrate, whereas AGA lesions retain an androgen-centric, low-inflammation profile with minimal JAK–STAT engagement. This dichotomy aligns with clinical observations—AA is a cytotoxic T-cell–driven disease reversible by JAK inhibitors, while AGA is not—and it provides a unified framework to tailor mechanism-based therapies. Building on these insights, we pinpointed four hub genes in AA (GZMA, IL2RB, IL2RG, EOMES) using network analysis and machine learning, and confirmed their importance in independent cohorts. Notably, EOMES, a transcription factor guiding CD8+ T-cell effector/memory differentiation, links the presence of perifollicular EOMES+ CD8+ T cells in AA to the observed cytotoxic immune phenotype. RT-qPCR and Western blot analyses in independent AA/AGA/healthy scalp samples confirmed that EOMES, GZMA, IL2RB and IL2RG are robustly upregulated at both mRNA and protein levels in AA but not in AGA, mirroring the transcriptomic and single-cell patterns.

Unlike the conventional logistic nomogram, our diagnostic model uses a prior-informed few-shot learning approach. It first embeds each sample’s expression into pathway/cell-type–aligned latent variables via fixed MultiPLIER loadings (ensuring portability and interpretability in small-sample settings). It then performs a full support–query comparison using Hadamard-product similarity and permutation-invariant aggregation to learn a class relation score (“learning to compare”), and finally outputs the AA vs. control prediction through a shallow classifier. Our framework embodies three key advantages: (1) By leveraging a pathway-aligned latent space (MultiPLIER priors), the model remains robust in small-sample settings and its features have direct biological meaning, enhancing interpretability. (2) The few-shot, relation-based approach (“learning to compare”) is well-suited to situations with limited cohorts yet a need to generalize, aligning with Healthcare 4.0’s emphasis on data-efficient AI. (3) The model’s risk output is easily translated into a familiar nomogram-style score (after calibration), with each gene’s contribution explained via additive SHAP values. This transparency ensures the predictions are clinically interpretable, facilitating trust and communication between AI, clinicians, and patients. Collectively, these features illustrate a paradigm of XAI 2.0: our framework balances high accuracy with transparent, context-sensitive reasoning, thereby fostering greater trust in AI-driven clinical decisions. Additionally, we evaluated our model with ROC curves and decision curve analysis to facilitate fair comparison with traditional prediction models, and it demonstrated superior performance and net benefit under these metrics.

Multi-perspective explanations for different stakeholders: Our framework’s interpretability can be tailored to various end-users in the clinical setting. For clinicians, the nomogram-style scorecard provides a transparent decision aid, linking gene expression to risk on an intuitive 0–100 scale. For patients, results can be translated into clear terms (e.g., “high immune activity gene levels indicate active disease, suggesting JAK inhibitor therapy may be effective”), enhancing understanding and shared decision-making. For research scientists, the model’s latent features and SHAP attributions supply mechanistic insights by mapping gene contributions to pathways and cell types. Notably, the MultiPLIER latent variables with the highest re-weighting coefficients correspond to the key pathways identified in AA: for example, one top latent factor is annotated with a cytotoxic T-cell signature (enriched in genes such as GZMA and EOMES), and another is associated with cytokine/JAK–STAT signaling (enriched in IL2RB, IL2RG, etc.). These highly weighted latent dimensions align with the JAK–STAT hyperactivation and cytotoxic T-cell activity driving AA, demonstrating that our model’s predictions are grounded in biologically interpretable features. This multi-role adaptability ensures the AI’s explanations are context-sensitive—providing the right depth of information to the right audience—thereby building trust and accountability in a Healthcare 4.0 environment.

At the mechanistic level, our findings identify JAK-STAT activity and cytotoxic T cell signaling as the two pathways most enriched in AA, positioning them within the same EOMES+ CD8+ cell cluster. This supports the functional axis “JAK-STAT cytotoxic T cells” as the pathological core of AA. This unified model explains the reversal effect of broad-spectrum JAK inhibitors in AA, clarifies the limited efficacy of single-cytokine therapies, and provides rationale for next-generation combination interventions combining “JAK blockade + direct regulation of cytotoxic T cell function.” Notably, baricitinib (2022) was approved in 2022 for severe adult AA, followed by oral JAK inhibitors like ritlecitinib (2023), further clinically validating this pathway’s pathogenic role. The four pivotal genes exhibit distinct roles within this axis: IL2RB/IL2RG promotes clonal expansion and maintains effector programs via JAK1/3 coupling; EOMES locks cells into a cytotoxic transcriptional state characterized by high IFN-γ/high granzyme expression; GZMA executes effector killing. Consistent with multi-database annotations cataloguing IL-2 and granzyme A as canonical immunomodulatory and cytotoxic mediators, this IL2RB/IL2RG–EOMES–GZMA circuit represents not only a transcriptional signature but a biologically grounded effector axis with established roles in T-cell activation and target-cell damage. Pathway enrichment and protein interaction networks situate them within broader immune programs (cytokine receptor interactions, Th1 differentiation, granzyme-mediated cytotoxicity, etc.), suggesting synergistic effects at both upstream amplification and downstream effector levels beyond follicular attack. Integrating public drug perturbation resources, this “IL2RB/IL2RG-EOMES-GZMA” circuit shows the strongest “inverse” correlation with JAK inhibitors at the transcriptional level, providing a basis for prioritizing evaluation of strategies such as IL-2 pathway modulation and granzyme targeting. In essence, by leveraging domain-specific knowledge, our model not only predicts outcomes but also yields mechanistic explanations that clinicians can readily interpret—exemplifying the value of domain-grounded explainability in XAI 2.0 for healthcare.

5 Conclusion

In summary, our multi-layered analyses identified a four-gene cytotoxic loop (GZMA, IL2RB, IL2RG, EOMES) driving pathological JAK–STAT hyperactivation in AA, whereas AGA remains governed by androgenic signaling with minimal JAK–STAT involvement. Multi-database functional annotation and independent RT-qPCR and Western blot assays convergently confirmed that this IL2RB/IL2RG–EOMES–GZMA axis is selectively activated in AA at both the transcript and protein levels, providing orthogonal support for its use as a mechanistic biomarker and therapeutic target. This mechanistic split sharpens the molecular boundary between AA and AGA and maps naturally onto therapeutic strategy: AA benefits from pathway-directed immunomodulation, now clinically validated with JAK inhibitors such as baricitinib and ritlecitinib, and suggests upstream (IL-2 receptor) or downstream (cytotoxic effector) nodes as complementary targets; AGA, by contrast, remains best addressed with androgen suppression (finasteride) and minoxidil, with Wnt/β-catenin modulation representing an active investigational avenue. Beyond these biological insights, our prior-informed few-shot classifier offers a practical, explainable diagnostic tool that leverages pathway- and cell-type–aware features, making it well-suited for small-cohort precision medicine. This approach can facilitate biomarker-guided, personalized management of hair loss disorders by providing robust and transparent predictions even when sample sizes are limited. In summary, integrating biological priors with few-shot learning may serve as a generalizable blueprint for other transcriptomic diagnostic applications facing small sample constraints. This work exemplifies how coupling domain knowledge with few-shot learning advanced XAI 2.0 in Healthcare 4.0, paving the way for trustworthy, context-sensitive AI diagnostics in future precision medicine.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics statement

The studies involving humans were approved by Ethics Committee of the First Affiliated Hospital of Army Medical University of the Chinese People’s Liberation Army. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

NY: Writing – review and editing, Writing – original draft, Funding acquisition, Conceptualization, Validation, Methodology. LR: Investigation, Formal Analysis, Data curation, Writing – review and editing. XG: Software, Writing – review and editing, Formal Analysis, Methodology. JT: Investigation, Data curation, Visualization, Writing – review and editing. SL: Project administration, Supervision, Writing – review and editing, Conceptualization. ZS: Supervision, Conceptualization, Writing – review and editing, Project administration.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the National Nature Science Foundation of China (Grant no. 82103761).

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2025.1753206/full#supplementary-material

References

Anzai, A., Wang, E. H. C., Lee, E. Y., Aoki, V., and Christiano, A. M. (2019). Pathomechanisms of immune-mediated alopecia. Int. Immunol. 31, 439–447. doi:10.1093/intimm/dxz039

PubMed Abstract | CrossRef Full Text | Google Scholar

Aran, D., Looney, A. P., Liu, L., Wu, E., Fong, V., Hsu, A., et al. (2019). Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat. Immunol. 20, 163–172. doi:10.1038/s41590-018-0276-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Bodinier, B., Filippi, S., Nøst, T. H., Chiquet, J., and Chadeau-Hyam, M. (2023). Automated calibration for stability selection in penalised regression and graphical models. J. R. Stat. Soc. Ser. C Appl. Statistics 72, 1375–1393. doi:10.1093/jrsssc/qlad058

PubMed Abstract | CrossRef Full Text | Google Scholar

Breiman, L. (2001). Random forests. Mach. Learn. 45, 5–32. doi:10.1023/a:1010933404324

CrossRef Full Text | Google Scholar

Brunson, J. C. (2020). Ggalluvial: layered grammar for alluvial plots. J. Open Source Softw. 5, 2017. doi:10.21105/joss.02017

PubMed Abstract | CrossRef Full Text | Google Scholar

Butler, A., Hoffman, P., Smibert, P., Papalexi, E., and Satija, R. (2018). Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420. doi:10.1038/nbt.4096

PubMed Abstract | CrossRef Full Text | Google Scholar

Clough, E., and Barrett, T. (2016). “The gene expression omnibus database,” in Statistical genomics: methods and protocols (Springer), 93–110.

Google Scholar

Dai, Z., Chen, J., Chang, Y., and Christiano, A. M. (2021). Selective inhibition of jak3 signaling is sufficient to reverse alopecia areata. JCI Insight 6, e142205. doi:10.1172/jci.insight.142205

PubMed Abstract | CrossRef Full Text | Google Scholar

Gonul, M., Cemil, B. C., Ayvaz, H. H., Cankurtaran, E., Ergin, C., and Gurel, M. S. (2018). Comparison of quality of life in patients with androgenetic alopecia and alopecia areata. An. Bras. Dermatol. 93, 651–658. doi:10.1590/abd1806-4841.20186131

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, Y., Luo, L., Zhu, J., and Li, C. (2023). Multi-omics research strategies for psoriasis and atopic dermatitis. Int. J. Mol. Sci. 24, 8018. doi:10.3390/ijms24098018

PubMed Abstract | CrossRef Full Text | Google Scholar

Guyon, I., Weston, J., Barnhill, S., and Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422. doi:10.1023/a:1012487302797

CrossRef Full Text | Google Scholar

Hänzelmann, S., Castelo, R., and Guinney, J. (2013). Gsva: gene set variation analysis for microarray and rna-seq data. BMC Bioinform. 14 (7), 7. doi:10.1186/1471-2105-14-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Hassan, A. S., Al-Dhoun, M. Q., Shaker, O. G., and AlOrbani, A. M. (2022). Expression of signal transducer and activator of transcription-3 in androgenetic alopecia: a case-control study. Skin Pharmacol. Physiol. 35, 278–281. doi:10.1159/000525532

PubMed Abstract | CrossRef Full Text | Google Scholar

Haughton, R. D., Herbert, S. M., Ji-Xu, A., Downing, L., Raychaudhuri, S. P., and Maverakis, E. (2023). Janus kinase inhibitors for alopecia areata: a narrative review. Ind. J. Dermatol. Venereol. Leprol. 89, 799–806. doi:10.25259/IJDVL_1093_2022

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, J., and Szymczak, S. (2023). A review on longitudinal data analysis with random forest. Briefings Bioinform. 24, bbad002. doi:10.1093/bib/bbad002

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., and Horvath, S. (2008). Wgcna: an r package for weighted correlation network analysis. BMC Bioinform. 9, 559. doi:10.1186/1471-2105-9-559

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, L., Lu, J., Liu, J., Wu, J., Zhang, X., Meng, Y., et al. (2024). Immune cells in the epithelial immune microenvironment of psoriasis: emerging therapeutic targets. Front. Immunol. 14, 1340677. doi:10.3389/fimmu.2023.1340677

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, X., Yang, F., Zhou, L., Yin, P., Kong, H., Xing, W., et al. (2012). A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information. J. Chromatography B 910, 149–155. doi:10.1016/j.jchromb.2012.05.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Narala, S., Li, S. Q., Klimas, N. K., and Patel, A. B. (2021). Application of least absolute shrinkage and selection operator logistic regression for the histopathological comparison of chondrodermatitis nodularis helicis and hyperplastic actinic keratosis. J. Cutaneous Pathol. 48, 739–744. doi:10.1111/cup.13931

PubMed Abstract | CrossRef Full Text | Google Scholar

Ocampo-Garza, J., Griggs, J., and Tosti, A. (2019). New drugs under investigation for the treatment of alopecias. Expert Opin. Investig. Drugs 28, 275–284. doi:10.1080/13543784.2019.1568989

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). limma powers differential expression analyses for rna-sequencing and microarray studies. Nucleic Acids Res. 43, e47. doi:10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Sardana, K., Gupta, A., and Gautam, R. K. (2018). Recalcitrant alopecia areata responsive to leflunomide and anthralin—potentially undiscovered jak/stat inhibitors? Pediatr. Dermatol. 35, 856–858. doi:10.1111/pde.13688

PubMed Abstract | CrossRef Full Text | Google Scholar

Sennett, M. L., Agak, G. W., Thiboutot, D. M., and Nelson, A. M. (2024). Transcriptomic analyses predict enhanced metabolic activity and therapeutic potential of mtor inhibitors in acne-prone skin. JID Innov. 4, 100306. doi:10.1016/j.xjidi.2024.100306

PubMed Abstract | CrossRef Full Text | Google Scholar

Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi:10.1101/gr.1239303

PubMed Abstract | CrossRef Full Text | Google Scholar

Stelzer, G., Rosen, N., Plaschkes, I., Zimmerman, S., Twik, M., Fishilevich, S., et al. (2016). The genecards suite: from gene data mining to disease genome sequence analyses. Curr. Protocols Bioinform. 54, 1–30. doi:10.1002/cpbi.5

PubMed Abstract | CrossRef Full Text | Google Scholar

Strazzulla, L. C., Wang, E. H. C., Avila, L., Sicco, K. L., Brinster, N., Christiano, A. M., et al. (2018). Alopecia areata: disease characteristics, clinical evaluation, and new perspectives on pathogenesis. J. Am. Acad. Dermatol. 78, 1–12. doi:10.1016/j.jaad.2017.04.1141

PubMed Abstract | CrossRef Full Text | Google Scholar

Szklarczyk, D., Gable, A. L., Nastou, K. C., Lyon, D., Kirsch, R., Pyysalo, S., et al. (2021). The string database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49, D605–D612. doi:10.1093/nar/gkaa1074

PubMed Abstract | CrossRef Full Text | Google Scholar

Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 58, 267–288. doi:10.1111/j.2517-6161.1996.tb02080.x

CrossRef Full Text | Google Scholar

Wada-Irimada, M., Takahashi, T., Sekine, M., Okazaki, T., Takahashi, T., Chiba, T., et al. (2025). Predictive factors for treatment responses to baricitinib in severe alopecia areata: a retrospective, multivariate analysis of 70 cases from a single center. J. Dermatol. 52, 701–711. doi:10.1111/1346-8138.17641

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Lan, Y., Xu, J., Quan, F., Zhao, E., Deng, C., et al. (2019). Cellmarker: a manually curated resource of cell markers in human and mouse. Nucleic Acids Res. 47, D721–D728. doi:10.1093/nar/gky900

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, R., Jin, W., and Wang, K. (2025). Glycolysis-driven prognostic model for acute myeloid leukemia: insights into the immune landscape and drug sensitivity. Biomedicines 13, 834. doi:10.3390/biomedicines13040834

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, C., Li, X., Wang, C., and Zhang, J. (2021). Alopecia areata: an update on etiopathogenesis, diagnosis, and management. Clin. Rev. Allergy Immunol. 61, 403–423. doi:10.1007/s12016-021-08883-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: alopecia areata, few-shot learning, gene expression, MultiPLIER, permutation-invariant aggregation, relation network, transfer learning

Citation: Yu N, Ran L, Gong X, Teng J, Liu S and Song Z (2026) JAK-centric explainable few-shot gene-expression diagnosis framework for alopecia via MultiPLIER priors and relation-style set-to-set comparison. Front. Mol. Biosci. 12:1753206. doi: 10.3389/fmolb.2025.1753206

Received: 24 November 2025; Accepted: 16 December 2025;
Published: 12 January 2026.

Edited by:

Feng Gao, The Sixth Affiliated Hospital of Sun Yat-sen University, China

Reviewed by:

Lantian Yao, Xiamen University, China
Xiangzheng Fu, The Hong Kong Hainan Commercial Association, Hong Kong, SAR China

Copyright © 2026 Yu, Ran, Gong, Teng, Liu and Song. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shulei Liu, bGl1c2h1bGVpQHRtbXUuZWR1LmNu; Zhiqiang Song, ZHJzb25nenFAdG1tdS5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.