Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Genet., 15 January 2026

Sec. RNA

Volume 16 - 2025 | https://doi.org/10.3389/fgene.2025.1753636

This article is part of the Research TopicFunctions of RNA and RNA-Binding Proteins in DiseaseView all articles

DPCDI: an artificial intelligent-derived indicator interpreting the diagnostic, stratification, and therapeutic implications of druggability programmed cell death in heart failure

Lili Zhang&#x;Lili Zhang1Yihao Zhu&#x;Yihao Zhu2Yuan FangYuan Fang1Yanping YangYanping Yang1Yin YuYin Yu1Hanshi WangHanshi Wang1Xiyue JiangXiyue Jiang1Xue ZhangXue Zhang1Dong Huang
Dong Huang1*
  • 1Department of Cardiology, Shanghai Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
  • 2Center for Translational Medicine, Naval Medical University, Shanghai, China

Programmed cell death (PCD) pathways with druggable potential represent a promising but still underexplored frontier in heart failure (HF) research for diagnosis, prognosis, and therapy. To address this gap, we developed a Druggable Programmed Cell Death Index (DPCDI) through an integrative machine learning framework. An optimal combination of Lasso and Random Forest algorithms identified 15 pivotal genes (CALCOCO2, VPS13D, CLU, STAT3, OPTN, UBB, CXCL12, PPP1R15A, ATF4, IVNS1ABP, HMGB2, JAK2, EXOC7, ENO1, TPCN1) for DPCDI construction. Non-negative matrix factorization (NMF) analysis stratified HF patients into two distinct subtypes, with Subtype 2 exhibiting elevated apoptosis and mitophagy activity. Single-cell RNA sequencing revealed dynamic JAK2 and IVNS1ABP expression during cardiomyocyte state transitions, while CXCL12 showed stage-specific regulation in endothelial cells. Mendelian randomization analysis indicated that genetic predisposition to elevated JAK2 and STAT3 expression was associated with reduced HF risk, whereas CXCL12 overexpression increased susceptibility. Experimental validation in HF mouse models confirmed increased Cxcl12 and Jak2 expression and decreased Stat3 levels. Furthermore, knockout of Cxcl12, Jak2, and Stat3 induced HF phenotypes. Molecular docking identified pifithrin-α as a potent ligand for CXCL12 and strophanthidin for STAT3. Collectively, DPCDI provides a comprehensive framework for HF diagnosis, risk stratification, and targeted therapeutic development.

1 Introduction

Despite decades of laboratory research and clinical advances in heart failure (HF) treatment, global HF-related mortality and morbidity continue to rise (Heidenreich et al., 2022). According to the Global Burden of Disease Study 2017, an estimated 64.3 million people worldwide suffer from HF (GBD 2017 Disease and Injury Incidence and Prevalence Collaborators, 2018), imposing persistent medical and financial burdens. HF is a complex multifactorial clinical syndrome characterized by structural or functional impairment of ventricular filling and/or blood ejection (Bozkurt et al., 2021), leading to heart dysfunction. Its etiology involves diverse factors, notably ischemic heart disease, hypertension, and dilated cardiomyopathy (Savarese et al., 2023), making it impossible to attribute HF to a single cause. Given this high prevalence and etiological heterogeneity, efforts to improve HF management remain imperative. Consequently, novel biomarkers and therapeutic targets have garnered increasing interest for their potential to enhance preventive and therapeutic strategies (Shu et al., 2022).

Programmed cell death (PCD), an evolutionarily conserved process, plays a critical role in the pathogenesis of cardiovascular diseases including HF (Del Re et al., 2019; Zhou et al., 2021). Accumulating evidence indicates that dynamic alterations in druggable PCD pathways—such as apoptosis, necrosis, ferroptosis, and autophagy—drive HF progression (Zhou et al., 2021; Narula et al., 2006), which indicates an attractive therapeutic manipulation by targeting PCD. For example, administration of empagliflozin and sacubitril has been found to reduce apoptosis in rat models with HF symptoms, which demonstrates protective effects on cardiac functions (Jankowski et al., 2024). Additionally, different types of PCD signaling are interconnected at multiple levels in HF progression (Zhou et al., 2021), underscoring the need to focus on the interactions between various PCD patterns rather than an individual PCD. Over the past years, numerous efforts have been made to establish HF-predictive signatures using an individual PCD pattern (Jiang et al., 2022; Gu et al., 2023), and moderate diagnostic performances were observed in small-scale or different specimen-source cohorts. As such, leveraging multiple PCD forms to construct a signature based on single-source specimen cohorts may better represent the characteristic of HF than using a single PCD information. Also of note is that the biological mechanisms underlying most of these published signatures in HF were poorly understood, which greatly limits their future applications. A more profound comprehension of the mechanisms governing druggable PCD patterns in the context of HF, particularly their complex interactions, is crucial for advancing our current knowledge. However, to date, this in-depth understanding remains elusive.

To address these shortcomings, we aimed to comprehensively investigate the diagnosis, stratification, and therapeutic significance of four druggable PCD patterns (apoptosis, necrosis, ferroptosis, and autophagy), as well as elucidate the latent biological mechanisms in HF. Herein, we first designed an integrative machine learning (ML) framework to discover a Druggable Programmed Cell Death Indicator (DPCDI) on the basis of large expression profiling cohorts, which was used to identify and stratify patients with HF. Then, we implemented the single-cell RNA sequencing to expound the biological mechanisms underlying DPCDI in the context of the HF cell atlas. Using Mendelian randomization (MR) analysis, in-silicon drug prediction, and molecular docking, we also investigated the therapeutic applications of DPCDI in the treatment of HF. Altogether, we proposed a DPCDI with a potential implication in diagnosing, stratifying, and medicating patients with HF.

2 Methods

2.1 Data collection and processing of HF cohorts

Gene expression profiling of left ventricle (LV) from control and HF cases was downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo). Herein, we collected a total of ten independent cohorts, including GSE141910 (Tan et al., 2020; Flam et al., 2022), GSE116250 (Yamaguchi et al., 2020; Sweet et al., 2018), GSE135055 (Hua et al., 2020), GSE16499 (Kong et al., 2010), GSE5406 (Hannenhalli et al., 2006), GSE57338 (Liu et al., 2015), GSE79962 (Matkovich et al., 2017), GSE42955, GSE52601, and GSE21610. Regarding the high throughput sequencing cohorts (GSE141910, GSE116250, GSE135055), the raw count value was converted to log2 (TPM+1) value. For the array cohorts (GSE16499, GSE5406, GSE57338, GSE79962, GSE42955, GSE52601, and GSE21610), the raw signal value was processed through quantile normalization and log2 transformation. We also enrolled GSE121893, a single-cell RNA sequencing data on LV and left atrium (LA) samples collected from four HF patients (Wang et al., 2020). Information on HF cohorts collected in this study can be found in Table 1. For detailed processing of GSE121893, see Methods-Expression patterns and dynamics of DPCDI at single-cell resolution.

Table 1
www.frontiersin.org

Table 1. Information on HF cohorts collected in this study.

2.2 Development and external validation of DPCDI model

GSE141910, which contains the largest case numbers in our enrolled cohorts, was used as a training cohort for discovering a DPCDI model. And the other five cohorts (GSE116250, GSE135055, GSE16499, GSE5406, GSE57338, and GSE79962) were selected as testing cohorts for DPCDI validation. Before establishing DPCDI, we collected 1,066 druggability PCD-related genes (Supplementary Table S1), which encompass apoptosis, ferroptosis, autophagy, and necroptosis-related genes. Subsequently, we applied Self-Organizing Maps (SOM) to cluster the identified PCD-related genes based on their expression patterns in the GSE141910 dataset. The SOM grid dimensions were set to 10 × 10. Training was performed for 1,000 iterations, and the optimal number of clusters (k = 15) was determined by minimizing the within-cluster sum of squares (WCSS). This analysis yielded a distinct cluster of genes exhibiting high expression in HF. Here we designed an integrative ML framework, which was composed of 113 combinations derived from 12 ML algorithms (Zhu et al., 2024; Liu et al., 2022), to generate a DPCDI model for HF recognition. In this framework, least absolute shrinkage and selection operator (Lasso), random forest (RF), stepwise generalized linear model (Stepglm), and generalized linear model by likelihood-based boosting (glmBoost) (Zhu et al., 2024), were initially used to select features from SOM-identified PCD genes. Next, the other eight algorithms, including Ridge, elastic network (Enet), support vector machine (SVM), linear discriminant analysis (LDA), partial least squares regression for generalized linear models (plsRglm), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), and NaiveBayes (Zhu et al., 2024), were implemented to establish prediction models on the selected features. AUC (Area Under the Receiver Operating Characteristic Curve) is an important indicator for evaluating the performance of binary classification models. It comprehensively reflects the model’s ability to distinguish between positive and negative samples by quantifying the area under the ROC curve. In the case of imbalanced sample classes, AUC can objectively reflect the model’s performance. In total, 113 combinations were generated after parameter tuning and ten-fold cross-validation, of which the best combination with the highest average AUC was regarded optimal. This optimal combination was termed DPCDI. We also assessed the AUC performance of DPCDI in four external validation cohorts (GSE79962, GSE42955, GSE52601, and GSE21610). To ensure productivity of the integrative ML framework and our DPCDI, the source code, scripts, processed datasets, and instructions have been archived in the Zenodo repository (https://doi.org/10.5281/zenodo.17918314).

2.3 Comparison between DPCDI and other published signatures

To compare the predictive power of DPCDI with other signatures, we gathered 51 mRNA signatures for HF prediction that were published in the last 5 years, which was summarized in Supplementary Table S2. These mRNA signatures were established by diverse algorithms, such as RF, Lasso, Boruta, and SVM (Guo and Xu, 2023; Bian et al., 2022). Additionally, these signatures were derived from different biological processes, such as cellular senescence (Guo and Xu, 2023), immune microenvironment (Wang et al., 2024), and N7-methylguanosine modification (Ma et al., 2023). Model 50 (NPPA and NPPB) and model 51 (TNNT2 and TNNI3) integrate the genes corresponding to the two categories of traditional cardiac markers, respectively. For equitable comparability, we filtered out the signatures with more than 30% of genes not matched in cohorts for DPCDI training and testing. We calculated the AUC performance for each signature in all cohorts.

2.4 Construction of DPCDI-derived subtypes

We initially applied Rank-In (Tang et al., 2021), a well-established approach to combine the high throughput sequencing and array data, to integrate all cohorts into a meta-cohort for HF subtype discovery. We extracted 657 HF cases from the meta-cohort for subtype analysis. According to the expression profiling of DPCDI genes from these HF cases, the Non-negative Matrix Factorization (NMF) technique (Gaujoux and Seoighe, 2010) was performed to partition the HF patients into different subtypes. To ensure a stable identification, we used the brunet approach, with a setting of 100 iterations, to execute the NMF process. The cophenetic coefficient was used to determine the optimal rank for clustering (Gaujoux and Seoighe, 2010), and the optimal rank (=2) was selected to construct two subtypes. Furthermore, the principal components analysis (PCA) plot was used to assess the dispersion of the identified two subtypes. We also investigated the expressions of DPCDI genes and HF markers between the two subtypes.

2.5 Biological mechanisms and enrichment analysis of DPCDI

To understand the detailed biological peculiarities of DPCDI, we performed Gene Ontology (GO)-Biological Process (BP) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses using the clusterProfiler package (Yu et al., 2012). The GO-BP and KEGG enrichment terms with P value <0.05 and PCD correlation were retained. Subsequently, we used the clusterProfiler package to implement Gene Set Enrichment Analysis (GSEA), to evaluate the enrichment distribution of the DPCDI-related terms across the two HF subtypes. The single-sample Gene Set Enrichment Analysis (ssGSEA) was also conducted using the GSVA package (Hänzelmann et al., 2013), to calculate the enrichment scores of these DPCDI-related terms. To delve into the biological peculiarities underlying DPCDI in HF progression and two HF subtypes, the KEGG database (https://www.genome.jp/kegg/) was acquired to depict the cross-talk between DPCDI genes and their enrichment terms.

2.6 Expression patterns and dynamics of DPCDI at single-cell resolution

We used GSE121893, which contains single-cell RNA sequencing information on LA and LV samples from four HF patients (Wang et al., 2020). Consistent with our previous analysis of LV cohorts, we retained LV samples for subsequent exploration. Herein, the single-cell RNA sequencing analysis on GSE121893 was accomplished by the Seurat package (Hao et al., 2021). Initially, we eliminated the poor-quality cells with gene count (less than 200 or larger than 5, 000) and mitochondrial proportion larger than 20%, and a total of 1,668 cells were retained. Next, we carried out a log transformation to normalize gene expression and identified the top 2,000 highly variable genes for downstream reduction. We then conducted a linear transformation to scale the gene expression, which assigns each gene the same weight. According to the scaling data, PCA was used based on the top 2,000 highly variable genes. Batch correction was performed using the harmony package with the following parameter settings (reduction = “pca”, dims = 1:15, max.iter = 20). The optimal number of PC (=15) was determined and subjected to Uniform Manifold Approximation and Projection (UMAP) clustering. Afterward, we obtained five different clusters based on the parameters setting (dimension = 1:15, resolution = 0.5). The identities of clusters were annotated using the well-known gene markers of heart cells (Wang et al., 2020; Hu et al., 2023). Thus, 5 cell populations were annotated: (1) Cardiomyocyte (CM), as reflected with higher expressions and proportions of MYL2, MYH7, FHL2, TTN, TNNT1, and TNNT2; (2) Endothelial cell (EC), as reflected with higher expressions and proportions of VWF, PECAM1, CDH5, and IFI27; (3) Fibroblast (FB), as reflected with higher expressions and proportions of ACTA2, CALD1, and MYH11; (4) Smooth muscle cell (SMC), as reflected with higher expressions and proportions of DCN, GSN, C7, LUM, FBLN1, and COL1A2; (5) Macrophage (MP), as reflected with higher expressions and proportions of PTPRC, CD163, CCL4, CXCL8, and LAPTM5. We next explored the expression patterns of DPCDI genes in these 5 cell populations. Additionally, we used the CellChat package (Jin et al., 2021) to quantitatively infer and analyze cell-cell communication, thereby understanding the intricate cell atlas in the context of HF.

Regarding the DPCDI genes-enriched cell population (CM and EC), we subsequently annotated their sub-populations. Next, we investigated the expression patterns of DPCDI genes in these defined sub-populations of CM and EC. Using the monocle3 package (Cao et al., 2019), we proposed a pseudo-time analysis to infer the transition trajectory across the CM or EC sub-populations, so as to expound the dynamics of DPCDI genes. The inferred trajectory was then projected to a UMAP plot for visualization. To gain insight into a comprehensive molecular program underlying the transition, we analyzed the Differential Expressed Genes (DEGs) that significantly changed among the pseudo-time under the threshold of Moran’s I > 0.05 and Q-value <0.05. The ClusterGVis package (Zhang, 2022) was used to divide these DEGs into distinct patterns using K-means clustering. GO and KEGG enrichment analyses were implemented for each pattern using the clusterProfiler package (Yu et al., 2012), and the enriched terms with P < 0.05 were regarded as significant.

2.7 Mendelian randomization of the causal-effect between DPCDI expression and HF risk

Two-sample Mendelian randomization (2SMR) analysis (Hemani et al., 2018) was used to assess the causal association between the genetic predisposition of three DPCDI genes (CXCL12, JAK2, STAT3) and HF. This data was retrieved from the IEU Open GWAS (https://gwas.mrcieu.ac.uk/) and deCODE (https://www.decode.com/summarydata/) (Ferkingstad et al., 2021; Elsworth et al., 2020). The population source of the database is Europe, with samples derived from peripheral blood. The exposure IDs were summarized as follows: JAK2-eQTL (eqtl-a-ENSG00000096968 in IEU Open GWAS); STAT3-eQTL (eqtl-a-ENSG00000168610 in IEU Open GWAS); CXCL12-pQTL (3516_60_CXCL12_SDF in deCODE); JAK2-pQTL (11816_84_JAK2_JAK2 in deCODE); STAT3-pQTL (10346_5_STAT3_STAT3 in deCODE); HF (ebi-a-GCST009541 in IEU Open GWAS). Following the 2SMR assumption, the Single Nucleotide Polymorphisms (SNPs) closely related to DPCID genes (P-value <5 × 10–8) but not with HF (P-value >0.05) were obtained. To avoid linkage disequilibrium (LD), we excluded the SNPs with LD-R2 greater than 0.01 within a cropping range of 5,000 Kb. Eventually, the assumption-compliant SNPs, with an F-statistic greater than 100, were retained as strong Instrumental Variables (IVs) for MR analysis. The inverse variance weighted (IVW) method, with the highest statistical power in 2SMR, was used to expound the causality between DPCDI genes and HF risk. Additionally, the Bayesian weighted Mendelian randomization (BWMR) (Zhao et al., 2020) was used to validate the results of 2SMR. To further explore the biological significance of these three DPCDI genes, the Mouse Genome Informatics (MGI) database (https://www.informatics.jax.org/) was accessed to show the cardiovascular phenotypes of CXCL12-, JAK2-, and STAT3-knockout mouses.

2.8 Heart failure mouse model and echocardiographic assessment

The animal experiments were approved by the Institutional Animal Ethics Committee of Shanghai Jiao Tong University (Approval number: 2025-0821), and complied with NIH guidelines. Male and female C57BL/6 mice (8 weeks old) were housed under standardized conditions (24 °C ± 2 °C, 40% ± 5% humidity, 12-h light/dark cycle). HF was induced by Transverse Aortic Constriction (TAC). Sham-operated mice underwent identical procedures without aortic ligation. Ejection fraction (EF) and fractional shortening (FS) were assessed by echocardiography 24 h after surgery. Heart tissues were harvested at 1 month post-surgery for analysis. Biometric data (EF, FS, body weight, and heart weight) are presented in Supplementary Table S3.

2.9 Hematoxylin and eosin (H&E) and masson staining

Heart tissues were fixed, paraffin-embedded, and cut into sections. Tissue morphology and fibrosis were assessed by hematoxylin and eosin (H&E) and Masson’s trichrome staining according to standard protocols. The infarct size and fibrosis area were quantified using ImageJ software.

2.10 RNA isolation and quantitative real-time PCR (qPCR) analysis

Total RNA from mouse heart tissue was isolated via an EZ-press RNA Purification Kit (EZBioscience). Reverse transcription to cDNA was performed with an RT Kit (EZBioscience). Quantitative PCR was performed via a Roche480 LightCycler® 96 real-time PCR system with 2×SYBR Green qPCR Master Mix (EZBioscience). The primer sequences are listed in Supplementary Table S4. An unpaired t-test was applied to assess the differences in gene expression between the control and HF groups. Pearson’s correlation analysis was used to examine the relationship between gene expression levels and echocardiographic indicators (EF and FS).

2.11 Drug prediction and molecular docking of DPCDI

The L1000 fireworks display (L1000FWD) database (https://maayanlab.cloud/l1000fwd/) (Wang et al., 2018) was queried to identify the small-molecule drugs targeting CXCL12 and STAT3, which displayed an opposite relationship with the expressions of CXCL12 and STAT3 in HF. According to the information on toxicity and availability of predicted drugs, we selected pifithrin and strophanthidin as the candidate drugs with potential utility in treating HF. Next, we conducted molecular docking to validate the reliability of the drug-target interaction based on the binding affinity and pattern (Yu et al., 2024a). We first downloaded the three-dimensional structures of pifithrin (PubChem CID: 4817) and strophanthidin (PubChem CID: 6185) from the PubChem website (https://pubchem.ncbi.nlm.nih.gov/), which was subjected to energy minizine using the ChemBio3 software. Next, we gained the three-dimensional structures of CXCL12 (ID: 3HP3) and STAT3 (ID: 6NJS) from the Protein Data Bank (https://www.rcsb.org/). The source of these protein structures was all X-ray crystal detection and derived from Homo sapiens. PyMOL software was used to preprocess the protein structures, including the removal of solvents, ligands, and hydrogens. To determine the active pockets of CXCL12 and STAT3 for docking drugs, we employed the AutoDock Vina software to perform polar hydrogenation and optimize the docking site. AutoDock Vina software was then utilized to dock the target proteins and drugs, including the interactions of CXCL12-pifithrin, CXCL12-strophanthidin, STAT3-pifithrin, and STAT3-strophanthidin. Subsequently, the affinity was computed to appraise the binding efficiency for each drug-target interaction, with an affinity lower than 5.0 kcal/mol indicating a superior bind efficiency. We visualized the binding pattern of each drug-target interaction using the PyMOL software.

3 Results

3.1 DPCDI derived from an integrative ML framework in multi-cohorts enables an accurate HF diagnosis

A flowchart outlining our study is delineated in Figure 1. SOM was initially performed in the training cohort, revealing a total of 251 DPCD-related genes that were actively expressed in HF (Figure 2A). DPCD-related genes assigned in each SOM-identified cluster were summarized in Supplementary Table S5. Subsequently, the expression profile of 251 DPCD-related genes from the training cohort was subjected to an integrative ML framework to develop a DPCDI, which was depicted in Figure 2B. In this computational framework, we established 113 kinds of algorithmic combinations under the ten-fold cross-validation and calculated the AUC score of each combination in all cohorts to assess their predictive performances. As shown in these 113 combinations, we noted that the best-performing combination, consisting of Lasso and RF algorithms, achieved the highest mean AUC (0.9815) across the training and five independent testing cohorts (Figure 2C). The AUC performances of all combinations can be found in Supplementary Data 2. Lasso identified the most valuable 36 DPCD-related genes with non-zero coefficients via the optimal lambda (Figure 2D), with a regression binominal deviation reaching minimum (Figure 2E). These 36 DPCD genes were then subjected to an RF algorithm using 1,000 trees (Figure 2F), and a final set of 15 genes to build DPCDI was identified, including CALCOCO2, VPS13D, CLU, STAT3, OPTN, UBB, CXCL12, PPP1R15A, ATF4, IVNS1ABP, HMGB2, JAK2, EXOC7, ENO1, and TPCN1 (Figure 2G). Moreover, we displayed the case distribution and confusion matrix of DPCDI in each cohort (Figure 2H). Four indicators for evaluating DPCDI prediction (accuracy, sensitivity, specificity, and F1 score) were then calculated in each cohort. Remarkably, we observed a relatively higher sensitivity of DPCDI, above 0.9 in all cohorts, suggesting a potential capability of DPCDI to accurately recognize HF. We also verified the robust predictive power of DPCDI in four external cohorts (GSE79962, GSE21610, GSE42955, and GSE52601), as demonstrated in Supplementary Figure S1. Overall, we provided a 15 gene signature, termed DPCDI fitted by Lasso and RF, as an attractive panel to predict HF occurrence.

Figure 1
Infographic with four sections labeled A, B, C, D, surrounding a central circular diagram with

Figure 1. Overview of this study. (A) DPCDI modeling. (B) Biological mechanisms underlying DPCDI. (C) Pathogenic potential of DPCDI. (D) Therapeutic applications of DPCDI.

Figure 2
Diagram illustrating machine learning analysis for gene expression and heart failure prediction. Panel A shows a training cohort using a self-organizing map (SOM) for clustering gene expression. Panel B outlines model construction with 113 combinations and 10-fold cross-validation, leading to an optimal model. Panel C presents a heatmap of AUC scores across different models and datasets. Panel D and E depict lambda optimization for regression coefficients and deviance. Panel F shows error rates for different models over trees. Panel G highlights variable importance for outcomes. Panel H includes pie charts and evaluation metrics for training and testing datasets, showing accuracy, sensitivity, specificity, and F1 scores.

Figure 2. Establishment of an HF-predictive DPCDI using an integrative ML framework. (A) SOM identification of three sets of HF-susceptible genes in the training cohort. (B) Design of the integrative ML framework. (C) The AUC performance of 113 ML combinations for distinguishing HF from control across the training and five independent testing cohorts was sorted, and top 30 combinations was visualized as a heatmap. The best combination (Lasso and RF) with the highest average AUC was marked with a red star. (D) Lasso coefficients of 176 DPCD-related genes under optimal λ construction. A total of 36 genes with non-zero coefficients were retained (labeling on the right panel). (E) The minimum deviance was reached under the 36 Lasso-selected genes. (F) Changes in error rates using different tree numbers in RF. (G) Feature importances of 15 DPCD-related genes screened from 36 Las-so-selected genes via RF. (H) Confusion matrix and diagnostic assessment of DPCDI within the training and five independent testing cohorts. Each cohort was displayed respectively.

3.2 Comparison of diagnostic performance of DPCDI and published HF-predictive signatures

We analyzed the expression landscape of altered DPCDI genes in each cohort, as displayed in Figure 3A. Expression changes in these genes were consistent in all cohorts, in which ATF4, ENO1, CALCOCO2, OPTN, VPS13D, and STAT3 were downregulated in HF. The log2fold change (log2FC), p-value and False Discovery Rate (FDR) of these genes between control and HF cases across cohorts were summarized in Supplementary Table S6. Whereas, CXCL12, HMGB2, IVNS1ABP, PPP1R15A, UBB, EXOC7, TPCN1, CLU, and JAK2 showed up-regulations in HF. These DPCDI genes were largely related to apoptosis or autophagy. Interestingly, these genes are expressed differently in HF, suggesting their intricate regulation in apoptosis and autophagy. To enable an equitable comparison of our DPCDI and other HF-predictive signatures, we systemically incorporated 51 gene signatures published within the past 5 years (see Methods). We filtered out signatures with larger than 30% missing genes in our enrolled cohorts, and a final set of 35 signatures was retained for comparison. These signatures were different functional gene panels, such as cellular senescence, immune microenvironment, and N7-methylguanosine. Notably, our DPCDI demonstrated better AUC performance than almost most signatures in the training (GSE141910), three testing (GSE116250, GSE135055, GSE57388), and meta-cohort (Figure 3B). Regarding the other two testing cohorts (GSE16499 and GSE5406), the AUC performance of DPCDI remained relatively robust (larger than 0.85) even though it was not the top-ranked model. Also worthy of note is that some signatures possessed better performance in their discovery cohort but were weak in other cohorts, which may arise from an over-fitting troublesome. For instance, the AUC of model three ranked second in the GSE16499 cohort, but relatively weakened within GSE116250 (ranked 27/35) and GSE5406 (ranked 21/35). In sum, our DPCDI showed a superior predictive performance than almost most published signatures, suggesting its extrapolation possibility with better generalization.

Figure 3
Heatmap and dot plot showing gene expression data across six cohorts. Panel A depicts a heatmap with genes like ATF4, ENO1, and CXCL12, distinguishing between heart failure (HF) and control groups. Panel B is a dot plot comparing models across cohorts GSE141910 to GSE57338, highlighting DPCD as significant. Different colors mark varying experimental sets, such as autophagy and apoptosis.

Figure 3. Superior predictive capability of DPCDI. (A) Heatmap displaying the expressions of DPCDI genes between control and HF cases within the training and five independent testing cohorts. (B) Comparison of diagnostic performance of DPCDI and other 50 published signatures. The AUC performance of DPCDI in each cohort was marked in a red bracket respectively. *, P < 0.05; **, P < 0.01; ***, P < 0.001; ****, P < 0.001.

3.3 Partitioning of DPCDI-derived molecular subtypes for HF

To identify the HF molecular subtypes with different degrees, we performed NMF clustering to explore the molecular features of these DPCDI genes in HF. Based on the NMF metrics of cophenetic, residuals, RSS, and silhouette (Figure 4A), the suitable cluster number (rank = 2) was determined to subtyping. Accordingly, the HF cases merged in meta-cohort (N = 657) were successfully classified into two distinct clusters (Figure 4B), and this apparent dispersion was also demonstrated in the PCA plot (Figure 4C). As shown in Figure 4D, we also measured the expression landscape of DPCDI genes and HF markers among the C1 (N = 297) and C2 clusters (N = 360). Among the DPCDI genes, the significant downregulation of HMGB2, IVNS1ABP, CALCOCO2, and STAT3 (P-value <0.001) were observed in C2, while PPP1R15A and JAK2 showed the elevated pattern (P-value <0.001). We also found that TNINT2 and EGFR were significantly upregulated in C2.

Figure 4
A complex scientific diagram presenting various analyses related to heart failure (HF) markers and DPCDI gene interactions. Panels include graphs, heat maps, and pathway illustrations, displaying data on gene expression, enrichment terms, and clustering. Colors indicate regulation patterns: blue for DPCDI, green for regulators, red for upregulation, blue shades for downregulation. Pathways involve mitochondrial apoptosis and mitophagy, with gene names like HMGB2, RELA, and ATF4 prominently featured. Visual elements depict flow processes and cellular components involved in intrinsic apoptosis, emphasizing the roles of specific genes and regulatory mechanisms.

Figure 4. Construction of a DPCDI-derived HF subtype and biological peculiarities of DPCDI. (A) The distribution of cophenetic, residuals, RSS, and silhouette through ranks from 2 to 10. (B) Consensus map of NMF clustering when rank equals 2. (C) The three-dimensional PCA plot shows the distinct distribution of two clusters. (D) The expressions of DPCDI genes and HF biomarkers between control and HF. (E) Enriched GO and KEGG terms of DPCDI genes. (F) GSEA enrichment plot of DPCDI-related terms. (G) ssGSEA enrichment plot of DPCDI-related terms. (H) Schematic chart illustrating the differences in enrichment of DPCDI-related terms between HF and control cases. (I) Overview depicting the cross-talk between DPCDI genes and regulators.

To delve into the different molecular characteristics of DPCDI between these two clusters, we initially implemented the GO and KEGG enrichment analyses on the DPCDI genes. Detailed information on these analyses can be found in Supplementary Table S7. We showed that the intrinsic apoptotic signaling pathway, regulation of apoptotic signaling pathway, and regulation of autophagy represent the GO functional terms (Figure 4E). Additionally, we found that mitophagy, autophagy, and chemokine signaling pathway were the significant KEGG pathway terms, of which mitophagy integrates ATF4, UBB, CALCOCO2, and OPTN, and chemokine signaling pathway links CXCL12, JAK2, and STAT3. GSEA was then performed to measure the enrichment distribution of these functional and pathway terms between the C1 and C2 clusters. As shown in Figure 4F, we noticed that the intrinsic apoptotic signaling pathway, regulation of apoptotic signaling pathway, mitophagy, autophagy, and chemokine signaling pathway were highly enriched in C2 (NES >1; Supplementary Table S8), but regulation of autophagy was lowly enrichment (NES <1; Supplementary Table S8). Using the ssGSEA approach, we also observed the same tendencies in these DPCDI-enrichment terms between C1 and C2 (Figure 4G). Consequently, we observed that patients in C2 exhibited enhanced activity in apoptosis and autophagy-related pathways, specifically intrinsic apoptosis and mitophagy, while demonstrating reduced regulation of autophagy initiation (Figure 4H). We next investigated the interplays between the DPCDI genes in these biological processes, as depicted in Supplementary Figure S2. A pathway overview of crosstalk between the DPCDI genes and their regulators was summarized in Figure 4I. Interestingly, we noted that CALCOCO2 and TPCN1, two important regulators of autophagy cargo recognition, were decreased in C2, which partly explained the inhibited autophagy regulation in C2. More intriguingly, we found that the downstream STAT3 in the chemokine signaling pathway, which is supposed to be activated by CXCL12 and JAK2, was significantly downregulated in C2 patients. This data suggests that the C2 subtype exhibits more active apoptotic and autophagic processes, but the regulation of autophagy is impeded. Notably, the C1 and C2 subtypes were identified using transcriptomic data, in the absence of correlated clinical parameters such as baseline cardiac function or comorbidity profiles. Consequently, this observed differences between subtypes must be interpreted cautiously and require validation through future studies.

3.4 Expression patterns of DPCDI in HF cell atlas at single-cell resolution

We next questioned whether the expression patterns of DPCDI genes differ by the cellular diversity of LV. We applied scRNA-seq analysis of individual cells isolated from LV tissues of four HF patients. The workflow was shown in Figure 5A. A total of 1,668 high-quality cells were gained after data trimming and filtering (Supplementary Figure S3). Then, we performed unsupervised clustering to partition 1,668 cells that were identified as five distinct clusters using UMAP. Five main cell populations, including cardiomyocyte (CM), endothelial cell (EC), fibroblast (FB), smooth muscle cell (SMC), and macrophage (MP), were subsequently annotated based on their respective markers (Figures 5B,C). Next, we visualized the global expression of DPCDI genes among these 5 cell populations (Figure 5D). Worthy of noting is that the IVNS1ABP (Figure 5E), JAK2 (Figure 5F), and OPTN (Figure 5G) were highly expressed in the CM population. Nonetheless, STAT3 was almost absent in CM compared to other populations, suggesting its dysfunction despite JAK2 being actively expressed. Moreover, we noticed that CXCL12 expression was enriched in EC and SMC populations (Figure 5H). Interestingly, the CM population was only enriched for JAK2 activation but lacked CXCL12 and STAT3, yet the close relationship of the CXCL12-JAK2-STAT3 axis in chemokine signaling was reported. Thus, it is believed that CM and NCM populations interact dynamically in the context of HF. Given this specific DPCDI gene expression in CM and EC, we selected these cell types for further investigation.

Figure 5
Illustration showcasing various analytical techniques applied to scRNA-seq data. Panel A outlines the experimental workflow involving cell data analysis methods like Cell Chat and Monocle. Panel B presents a UMAP plot categorizing cell types with an accompanying bar graph showing cell counts. Panel C features a heat map visualizing gene expression across conditions. Panels D to H display density plots and violin plots for gene expression, highlighting JAK2, CXCL12, and other genes. Panels I and K show network diagrams of cell interactions and strengths, while Panels M and N depict CXCL and VISFATIN signaling pathway networks with interaction roles indicated.

Figure 5. Exploration of DPCDI expression in HF patients at single-cell resolution. (A) Schematic illustration of this flow. (B) UMAP visualization and numbers of five annotated cell populations isolated from LV of HF patients. (C) Heatmap displaying marker genes for annotating each cell population. (D) Expression of DPCDI genes among the cell populations. Expression distribution of IVNS1ABP (E), JAK2 (F), OPTN (G), and CXCL12 (H) among the cell populations. (I) The number of interactions of the five populations. (J) The number of interactions of each cell population. (K) The interaction weights/strength of the 5 cell populations. (L) The interaction weights/strength of each cell population. The line color and width represent cell type and interaction pairs, respectively. (M) Identification of CXCL signaling pathway network between EC, SMC, and MP. Left and right portions show the interactions and signaling roles in CXCL signaling pathway, respectively. (N) Identification of VISFATIN signaling pathway network between MP and CM. The left and right portions show the interactions and signaling roles in the VISFATIN signaling pathway, respectively.

CellChat, based on the information on ligands, receptors, and cofactors from scRNA-seq data, was implemented to quantitatively infer and analyze intercellular communication. The cell-to-cell interaction among CM, EC, FB, SMC, and MP was respectively visualized as circular plots by interaction number (Figure 5I) and interaction strength (Figure 5J). The cell-to-cell interaction for each population was respectively exhibited by interaction number (Figure 5K) and interaction strength (Figure 5L). Using cell centrality analysis, we next extrapolated the roles of cell populations in signaling pathways. Figure 5M shows the inferred intercellular communication network for CXCL12 signaling. Of note, EC and SMC are dominant senders, and MP is the receiver of CXCL12 signaling. Specifically in the most important contribution for ligand-receptor, EC and SMC highly expressed CXCL12 ligand, which acts as a sender to active CXCR4 receptor in MP (Supplementary Figure S4A). We also showcased an inferred intercellular communication network for VISFATIN signaling (Figure 5N), consisting of MP as a sender and CM as a receiver. Interestingly, both MPs and CMs are predicted to be influencers, suggesting their roles as interactive controls in VISFATIN signaling. Given that MPs are activated via CXCL signaling, we speculated that these pro-inflammatory MPs show the activity of the NAMPT ligand and subsequently activate the INSR ligand of CMs (Supplementary Figure S4B), which may drive the regulation of JAK2 in CMs.

3.5 Dynamics of DPCDI was observed during the transition of sub-populations of CM and EC

To elucidate the dynamics of DPCDI genes, we defined the sub-populations of CM and EC (Supplementary Figure S5; Supplementary Figure S6), representing two major DPCDI-enriched groups in the LV landscape of HF. The Two sub-populations of CM were identified (Figure 6A), including CM1 and CM2. The expressions of three CM-enriched DPCDI genes (IVNS1ABP, JAK2, and OPTN) were depicted in Figure 6B, showing that CM2 expressed higher levels of IVNS1ABP and JAK2. Notably, CM1 expressed increased levels of mitochondrial oxidative stress-related genes (Figure 6C), such as NDUFA4 and C14ORF2; and CM2 demonstrated greater expressions of HF cardiac genes (Figure 6C), such as NPPA and NPPB. Therefore, CM2 may represent an end-stage CM population, suggesting a vibrant activity of apoptosis. We further performed pseudo-time analysis and the trajectory of two CM sub-populations was constructed (Figure 6D). Interestingly, we found that CM1 was expressed at a relatively earlier pseudo-time, with two branches significantly differentiating from nodes across the pseudo-time. Early branch was largely dominated by CM1, whereas the late branch was mainly composed of CM2. Next, we examined the dynamics of cardiac genes and DPCDI genes along the pseudo-time. Considering these cardiac genes, we showed that MYL4, MYL9, NPPA NPPB, and TNNT1 progressively increased (Figure 6E), representing a shift towards CM1 to CM2. As shown in Figure 6F, IVNS1ABP and JAK2 were gradually decreased along the pseudo-time, suggesting their potential in mediating CM differentiation. Additionally, STAT3 expressed relatively lower expression in CM2 and almost absent in CM1. Next, we partitioned genes that were significantly expressed along the pseudo-time trajectory into four different clusters (Figure 6G). During the CM differentiation, genes in cluster 4 (C4), such as IVNS1ABP and JAK2, were progressively decreased from an intermate state to an end state, and significantly enriched in terms related to the cytoskeleton. Additionally, cluster 1 (C1) including HF cardiac genes, were progressively elevated, and mostly enriched for terms related to muscle contraction and cytoskeleton. We also identified two sub-populations of EC (Figure 6H), including EC1 and EC2. As shown in Figures 6I,J, CXCL12 (an EC-enriched DPCID gene) was mostly expressed in the EC2 sub-population. Notably, EC1 exhibited an increased expression of Ca2+ ATPase-related genes (Figure 6K), such as CSRP3, PLN, and ATP2A2. EC1 also highly expressed ANKRD1, an EC activation factor contributing to repressing cardiac gene expression, which promotes cardiac modeling. As depicted in Figure 6K, EC2 was characterized by cell migration-related genes (CX3CL1 and CXCL12) and vessel morphogenesis-related genes (COL4A1 and NR2F2). Pseudo-time analysis was then performed to construct the trajectory of two EC sub-populations (Figure 6L). Notably, the early branch with lower pseudo-time was largely dominated by EC2, whereas the late branch was mainly composed of EC1. We next examined the dynamics of EC1 and EC2 marker genes along the pseudo-time, as depicted in Figures 6M,N. We showed that EC2 marker genes, especially CXCL12, progressively decreased during the state transformation. Next, the differential expressed genes across the pseudo-time were partitioned into four distinct clusters, as shown in Figure 6O. EC2 maker genes were allocated to C2, demonstrating a stronger involvement in ribosome-related functions. EC1 marker genes were clustered to C4 that were progressively elevated, which displayed an enrichment in muscle contraction-related terms.

Figure 6
Scatter plots, heatmaps, and violin plots illustrate gene expression analyses between two cell types: CM and EC. Different panels show density plots, UMAP visualizations, pseudotime trajectories, and expression levels for genes such as JAK2, IVNS1ABP, CXCL12, and others. Clusters are indicated as CM1, CM2, EC1, and EC2, highlighting differences in gene expression patterns and dynamics. Statistical data is visualized alongside graphical representations to emphasize gene significance across clusters.

Figure 6. In-depth exploration of DPCDI dynamics in sub-populations of CM and EC during HF progression. (A) UMAP visualization and numbers of CM population. (B) Global expression of JAK2, OPTN, and IVNS1ABP among the CM population. (C) Heatmap displaying marker genes for CM1 and CM2 sub-populations. (D) Inference of pseudo-time trajectories of CM population. The left and right portions represent the branch trajectories with cells colored by pseudo-time and branch trajectories with cells colored by sub-population, respectively. (E) Expression dynamic of marker genes (MYL4, MYL9, NPPA, NPPB, TNNI1, TNNT1) across the pseudo-time. (F) Expression dynamic of IVNS1ABP, JAK2, OPTN, and STAT3 across the pseudo-time. (G) Gene dynamic analysis of CM population transition. The left part was a heatmap displaying the expression dynamics of DEGs with distinct patterns along the pseudo-time. The right part was GO and KEGG enrichment terms of each cluster. (H) UMAP visualization and numbers of EC population. (I) Global expression of CXCL12 among the EC population. (J) Expression distribution of CXCL12 between EC1 and EC2 sub-populations. (K) Heatmap displaying marker genes for EC1 and EC2 sub-populations. (L) Inference of pseudo-time trajectories of EC population. The left and right portions represent the branch trajectories with cells colored by pseudo-time and branch trajectories with cells colored by sub-population, respectively. (M) Expression dynamic of EC1-marker genes (ANKRD1, ATP2A2, CSRP3, PLN) across the pseudo-time. (N) Expression dynamic of EC2-marker genes (COL14A1, CX3CL1, CXCL12, NR2F2) across the pseudo-time. (O) Gene dynamic analysis of EC population transition. The left part was a heatmap displaying the expression dynamics of DEGs with distinct patterns along the pseudo-time. The right part was GO and KEGG enrichment terms of each cluster.

3.6 Genetic predisposition towards CXCL12, JAK2, and STAT3 were found to casually associated with HF

To investigate the causal relationship between genetically proxied DPCDI expression and HF risk, we systematically applied 2SMR and BWMR. Among the DPCDI genes, the results of CXCL12 (pQTL), JAK2 (eQTL and pQTL), and STAT3 (eQTL and pQTL) were demonstrated to be statistically significant, suggesting high-support evidence for a genetic association linked to HF. Using 2SMR, the effect sizes of the SNP characteristics of eQTL expressions (JAK2, STAT3) and HF risk were negatively related, which were depicted in Figures 7A,D. The forest plot, leave-one-out plot, and funnel plot of SNP characteristics can be found in Supplementary Figure S7 and Supplementary Figure S8. The negative relationship was also demonstrated using the BWMR method (Figures 7B,E). Using the IVW approach, with the highest statistical power, we found that genetic predisposition toward mRNA expressions of JAK2 and STAT3 significantly decreased the risk of HF (Figures 7C,F; OR <1, P < 0.05). Additionally, the direction of effect estimate through IVW was consistent with the other five approaches. Horizontal heterogeneity and pleiotropy were not observed (Supplementary Table S9; Supplementary Table S10), which guarantees the analytic reliability.

Figure 7
Mendelian randomization analysis graphs of eQTLs and pQTLs for JAK2, STAT3, and CXCL12. Panels A to N depict scatter plots and forest plots for statistical analysis, comparing SNP effects on gene expression with odds ratios for various methods. Each panel includes plots and tables detailing odds ratios, confidence intervals, and p-values, with significant results highlighted. Panel P illustrates a genetic pathway diagram showing SNP influence on exposure and outcomes, with connections and confounders marked. Panel Q lists gene-related biological processes and phenotypes, highlighting abnormalities and associated diseases.

Figure 7. Mendelian randomization assessing the causal relationship between DPCDI genes and HF. (A–C) MR analysis of JAK2 (eQTL), in which A and B represent the correlation between the SNP effect of JAK2-eQTL (x-axis) and HF (y-axis) based on 2SMR and BWMR respectively, C was a forest plot summarizing the OR effects and P-values of 2SMR and BWMR on the causal relation-ship between JAK2-eQTL and HF. (D–F) MR analysis of STAT3 (eQTL), in which D and E represent the correlation between the SNP effect of STAT3-eQTL (x-axis) and HF (y-axis) based on 2SMR and BWMR respectively, F was a forest plot summarizing the OR effects and P-values of 2SMR and BWMR on the causal relationship between STAT3-eQTL and HF. (G–I) MR analysis of CXCL12 (pQTL), in which G and H represent the correlation between the SNP effect of CXCL12-pQTL (x-axis) and HF (y-axis) based on 2SMR and BWMR respectively, I was a forest plot summarizing the OR effects and P-values of 2SMR and BWMR on the causal relationship between CXCL12-pQTL and HF. (J–L) MR analysis of JAK2 (pQTL), in which J and K represent the correlation between the SNP effect of JAK2-pQTL (x-axis) and HF (y-axis) based on 2SMR and BWMR respectively, L was a forest plot summarizing the OR effects and P-values of 2SMR and BWMR on the causal relationship between JAK2-pQTL and HF. (M–O) MR analysis of STAT3 (pQTL), in which M and N represent the correlation between the SNP effect of STAT3-pQTL (x-axis) and HF (y-axis) based on 2SMR and BWMR respectively, O was a forest plot summarizing the OR effects and P-values of 2SMR and BWMR on the causal relationship between STAT3-pQTL and HF. (P) Schematic illustration of this flow. (Q) Cardiovascular phenotypes of CXCL12, JAK2, STAT3 knockout from the MGI database. Red and blue stand for a risk and protective factor of HF, respectively.

With regard to the pQTL data on CXCL12, JAK2, and STAT3, we found that the effect size of the SNP characteristics of CXCL12 was positively associated with HF (Figure 7G), whereas the SNPs of JAK2 (Figure 7J) and STAT3 (Figure 7M) were negatively correlated to HF. The forest plot, leave-one-out plot, and funnel plot of these SNP characteristics can be found in Supplementary Figure S9 (pQTL of CXCL12), Supplementary Figure S10 (pQTL of JAK2), and Supplementary Figure S11 (pQTL of STAT3). Consentient with 2SMR, the relationships of CXCL12, JAK2, and STAT3 were observed in BWMR, which were respectively shown in Figures 7H,K,N. Using the IVW approach, we found that genetic predisposition toward protein expressions of CXCL12 significantly increased the HF risk (Figure 7I; OR > 1, P < 0.05). Though the estimated direction of JAK2-pQTL was consistent with the eQTL data, no statistical significance was shown (Figure 7L; OR < 1, P > 0.05). Interestingly, we showed that genetically proxied higher protein expression of STAT3 also significantly decreased the HF risk (Figure 7O; OR < 1, P < 0.05), which was corroborate with the eQTL data. Also, we did not note heterogeneity and pleiotropy in the analyses of pQTL data on CXCL12, JAK2, and STAT3 (Supplementary Table S11; Supplementary Table S12; Supplementary Table S13). The workflow was shown in Figure 7P. Furthermore, the MGI database was queried to identify the cardiovascular phenotypes relevant to the knock-out of CXCL12, JAK2, and STAT3 genes in mouse models (Figure 7Q). Abnormal angiogenesis and poor arterial differentiation were induced after the knockout of CXCL12 and JAK2. More important traits, such as dilated heart and cardiomyopathy, occurs in STAT3-knockout mice, suggesting its crucial role in HF progression. Altogether, our data indicates that CXCL12 plays a risk factor contributing to HF, but JAK2 and STAT3 act protective factors against HF.

3.7 Mouse model validation of Cxcl12, Jak2, and Stat3 expression

To investigate the expression profile of DPCDI-core genes (CXCL12, JAK2, STAT3) in HF, we established TAC-induced HF mouse models. The experimental flow is outlined in Figure 8A. Echocardiography confirmed significantly impaired cardiac function in the TAC group compared to sham-operated controls, as evidenced by reduced ejection fraction (EF) and fractional shortening (FS) (Figures 8B,C; P < 0.05). Histological analysis revealed marked cardiomyocyte hypertrophy and inflammatory cell infiltration by H&E staining (Figure 8D), while Masson’s trichrome staining demonstrated extensive interstitial fibrosis (Figure 8E). Furthermore, qPCR analysis of cardiac tissue showed significant upregulation of Jak2 and Cxcl12 mRNA levels and downregulation of Stat3 mRNA levels (Figure 8F; all P < 0.001). As depicted in Figure 8G, correlation analysis revealed negative correlations between the expression levels of Jak2 and Cxcl12 and cardiac function parameters (EF and FS), whereas a positive correlation was observed between Stat3 expression and cardiac function (EF and FS). Taken together, these findings suggest that elevated Jak2 and Cxcl12 expression may promote HF progression, whereas Stat3 expression may exert a protective effect against HF.

Figure 8
Illustration of a scientific study involving mice, showcasing different experimental methods including echocardiography, staining, and RNA isolation. Panel B displays echocardiography results for sham and heart failure (HF) models. Panel C contains box plots comparing ejection fraction (EF) and fractional shortening (FS) percentages between control and HF groups. Panels D and E depict histological images of cardiac tissues stained with hematoxylin and eosin (HE) and Masson’s trichrome for sham and HF conditions. Panel F shows gene expression box plots for Cxcl12, Jak2, and Stat3. Panel G demonstrates scatter plots correlating gene expression with EF and FS percentages, featuring correlation coefficients and p-values.

Figure 8. Animal Validation of Cxcl12, Jak2, and Stat3 expressions in the HF context. (A) Study flow. (B,C) Echocardiography of the sham and HF mouse models. (D) HE staining of the sham and HF mouse models. (E) Masso staining of the sham and HF mouse models. (F) Expressions of Cxcl12, Jak2, and Stat3 in the sham and HF mouse models. (G) Correlations between gene expressions and echocardiography indicators. *, P < 0.05; **, P < 0.01; ***, P < 0.001.

3.8 Therapeutic implications of CXCL12 and STAT3 in mitigating HF

Given the risk role of CXCL12 and the protective role of STAT3 against HF, we next sought to explore their therapeutic values for HF treatment. The L1000 FWD database was employed to predict the potential small-molecule drugs with opposite patterns to reverse the upregulation of CXCL12 and downregulation of STAT3 in HF (Figure 9A). The detailed information on the top 20 predicted drugs, including the similarity score, P-value, Q-value, Z-score, and combined score, was summarized in Supplementary Table S14. Considering the toxicity and availability of these drugs, we selected pifithrin and strophanthidin as the candidate drugs. The two-dimensional structures of pifithrin and strophanthidin can be found in Figures 9B,C, respectively. Next, we performed molecular docking to validate the predicted drug-gene interactions. The binding patterns of pifithrin-CXCL12 (Figure 9D, left panel), pifithrin-STAT3 (Figure 9E, left panel), strophanthidin-CXCL12 (Figure 9F, left panel), and strophanthidin-STAT3 (Figure 9G, left panel) were visualized through molecular docking. Importantly, the binding affinities of these dockings were all lower than −5 kcal/mol, indicating reliable interactions. Specifically, pifithrin formed two hydrogen bonds with CXCL12 (Figure 9D, right panel): with ARG-41 and ASN-46 residues at the distance of 2.7 Å and 3.3 Å. Pifithrin formed two hydrogen bonds with STAT3 (Figure 9E, right panel): with ALA-250 and GLN-326 residues at the distance of 2.0 Å and 3.3 Å. Moreover, strophanthidin only formed one hydrogen bond with CXCL12 (Figure 9F, right panel): with GLU-63 residue at a distance of 3.1 Å. Lastly, strophanthidin formed two hydrogen bonds with STAT3 (Figure 9G, right panel): with ILE-258 and GLN-326 residues at the distance of 2.4 Å and 3.0 Å. Thus, pifithrin and strophanthidin may serve as a potential therapeutic candidate for further investigation, repressing the CXCL12 elevation and STAT3 decrease in HF.

Figure 9
A scientific illustration presenting molecular interactions and structures. Panel A shows a funnel diagram with compounds ranked by combined score, highlighting pifithrin and strophanthidin in red. Panel B depicts the chemical structure of pifithrin, while Panel C shows the structure of strophanthidin. Panels D and F illustrate the molecular docking of pifithrin and strophanthidin with CXCL12, with binding energies of -5.4 and -5.5 kcal/mol, respectively. Panels E and G display their interactions with STAT3, with energies of -5.8 and -7.0 kcal/mol. Insets detail specific amino acid interactions.

Figure 9. Therapeutic value of DPCDI genes using small-molecule drug prediction and molecular docking. (A) The top 20 most significant small-molecule drugs with signature similarity of STAT3 upregulation and CXCL12 downregulation by the L1000fwd database. Pifithrin and strophanthidin were selected. (B) Chemical structure of pifithrin. (C) Chemical structure of strophanthidin. (D) Molecular docking between pifithrin and CXCL12. (E) Molecular docking between pifithrin and STAT3. (F) Molecular docking between strophanthidin and CXCL12. (G) Molecular docking between strophanthidin and STAT3.

4 Discussion

HF, a complex clinical syndrome of global epidemic proportions, poses a significant health burden due to its high prevalence and heterogeneous etiology. While management strategies have advanced considerably in recent decades (Bozkurt, 2024), our understanding of its molecular mechanisms remains limited by cardiac complexity (Zhang et al., 2024), hindering precise diagnosis and effective treatment. This underscores the urgent need for novel predictive and therapeutic targets.

Advances in sequencing and bioinformatics have enabled the identification of molecular signatures for disease prediction from omics data (Pilarczyk et al., 2022). Consequently, numerous mRNA signatures derived from transcriptomic assays have been proposed for HF prediction. However, signatures derived from disparate specimen types (such as LV tissues and myocytes isolated from LV (Fan and Hu, 2022)) may lack diagnostic specificity due to cellular heterogeneity during disease progression, limiting their reliability. To address this, we exclusively curated LV-derived gene expression profiles.

Given the established role of programmed cell death (PCD) in cardiovascular disorders (Del Re et al., 2019) and its potential significance in HF, we hypothesized that PCD-related genes could aid in HF identification, stratification, and therapy. We focused on four druggable PCD forms: apoptosis, necrosis, ferroptosis, and autophagy. Ten independent HF cohorts (429 controls, 694 HF cases) from GEO were designated as training, testing, and external validation sets to develop a DPCDI. In the existing studies, the modeling of signature mostly depended on personal preferences (Wang et al., 2022). To avoid this bias, we collected 12 prevalent ML algorithms to construct a predictive signature. Upon packing and benchmarking 12 ML approaches into 113 algorithmic combinations, we successfully designed an integrative computational framework, as our previous studies of building signatures to identifying patients with acute myocardial infarction (Zhu et al., 2024). Ultimately, the optimal combination of Lasso and RF algorithms achieved the highest average AUC score (=0.9815) across the training and testing cohorts, and also performed excellent external validation cohorts. We termed this optimal combination DPCDI, which was fitted on the expressions of 15 DPCD-related genes (CALCOCO2, VPS13D, CLU, STAT3, OPTN, UBB, CXCL12, PPP1R15A, ATF4, IVNS1ABP, HMGB2, JAK2, EXOC7, ENO1, and TPCN1). Interestingly, we found these 15 DPCDI genes were differentially expressed in HF, and largely associated with apoptosis and autophagy. This data underscores the intricate plays between these DPCDI genes in apoptosis and autophagy that contribute to HF.

Overfitting often occurs when establishing predictive signatures, reflected in the high predictive accuracy of the discovery cohort but generalized poor encountering external data (Deo, 2015). Using an integration of Lasso and RF, we minimize the abundant information and identify the most significant genes to generate DPCDI. Benchmarking against 50 published HF signatures revealed DPCDI’s superior generalizability (AUC >0.85 across all cohorts). Notably, while model four model 4 (a 14-aging gene signature by Yu et al.) (Yu et al., 2024b) and model 7 (a 17-gene signature by Portokallidou et al.) (Portokallidou et al., 2023) ranked highly, their external validation performance was suboptimal, likely due to overfitting. In contrast, our DPCID also presents a good predictive performance regarding the external validation. DPCDI’s robustness stems from Lasso’s L1 regularization (reducing dimensionality, preventing overfitting) (Li et al., 2022) and RF’s handling of unbalanced cohorts (Biau and Scornet, 2016). The combined approach outperformed either algorithm alone (Lasso with AUC = 0.9299; RF with AUC = 0.9599).

The molecular subtype of HF remains incompletely explored. We investigated whether DPCDI contributes to HF molecular subtyping. NMF partitioned 657 HF patients into two divergent subtypes (C1, N = 297; C2, N = 360). DPCDI genes were differentially expressed between subtypes, with two HF markers (NINT2 and EGFR) significantly elevated in C2 versus C1. Given DPCDI’s role in apoptosis and autophagy, we used GSEA and ssGSEA to compare molecular characteristics between DPCDI-derived subtypes. C2 showed greater enrichment in apoptosis, autophagy, mitophagy, and chemokine signaling, but less enrichment in autophagy regulation. Conversely, C1 exhibited the opposite pattern. Studies have showed that apoptosis and autophagy cooperate or antagonize, controlling the cell fate (Noguchi et al., 2020; Nikoletopoulou et al., 2013). Mitophagy, central to mitochondria quality control (Wanderoy et al., 2020), balances apoptosis and autophagy. Based on this evidence, we speculated that the dysregulated mitophagy in C2 exacerbates apoptosis.

Integrating KEGG and literature evidence, we outlined a mechanistic overview of DPCDI gene crosstalk with enriched pathways. HMGB2, IVNS1ABP, ATF4, and PPP1R15A contribute to intrinsic apoptosis. Downregulated ATF4 relieves BCL2L1 inhibition of pro-apoptotic factors BAX and BAK, stimulating mitochondrial outer membrane permeabilization (MOMP) (Yamazaki and Galluzzi, 2022). The upregulation of HMGB2 and PPP1R15A aggravates intrinsic apoptosis through stimulating RELA and BID (Yagi et al., 2003), respectively. INVS1ABP has been found to stabilize actin to inhibit intrinsic apoptosis (Hotter et al., 2020). In C2 patients, the decreasing of INVS1ABP may lose its protective function, and upregulated PPP1R15A may induce apoptosis. Also, we depicted the CXCL12-JAK2-STAT3 axis in the chemokine signaling activates intrinsic apoptosis. Intriguingly, despite activation of CXCL12 and JAK2, STAT3 was inhibited, suggesting a dysphosphorylation of STAT3. STAT3 exerts cytoprotective effects through anti-apoptosis, but its dysregulation induces dilated cardiomyopathy and adverse remodeling post-myocardial infarction (Haghikia et al., 2011; Haghikia et al., 2014). We speculated PTPN2 and SOCS3 inhibit the JAK2-STAT3 activities (Zhang et al., 2018; Carow and Rottenberg, 2014). Regarding C2 patients, we found that CXCL12 and JAK2 increase significantly compared to C1, but STAT3 is downregulated. In this regard, a lower level of STAT3 attenuates its inhibition of apoptosis and thus exacerbates apoptosis.

Additionally, we demonstrated that ENO1, CLU, and UBB, which belong to the SUPER family as apoptotic cell surface markers, were differentially expressed in HF to induce ubiquitination and apoptosis (Díaz-Ramos et al., 2012). Particularly, UBB was significantly elevated in HF, suggesting a higher ubiquitination. VPS13D, a ubiquitin-binding gene in mitochondrial clearance (Anding et al., 2018), was decreased in HF. EXO7 serves as a key component of the mitophagy scaffold (Farré and Subramani, 2011), and was found to be upregulated in HF. Except for the decrease in ENO1, there were no significant changes regarding the mitophagy-upstream regulators (CLU, UBB, VPS13D, and EXO7) in C2 compared to C1. Furthermore, we showed that two cargo receptors for mitophagosome recognition, CALCOCO2 and OPTN (Adriaenssens et al., 2024), were decreased in HF. Despite the activation of ubiquitination to mitophagosome formation, the inhibition of two cargo receptors (CALCOCO2 and OPTN) suggests that the impairment cargo of mitophagosome. TPCN1, which controls Ca2+ channels to regulate autophagy, shows an upregulation pattern in HF (Xiong and Zhu, 2016). More importantly, we noted that CALCOCO2 and TPCN1 were significantly decreased in C2. Therefore, we concluded that the recruitment and ubiquitination of mitophagosomes continue in C2 patients. However, the cargo regulation is chaotic because of the impaired expressions of CALCOCO2 and TPCN1. Also, the impeded mitophagy regulation might enhance apoptosis (Noguchi et al., 2020). This may explain the low enrichment of mitophagy but the high enrichment of mitophagy and apoptosis in C2.

We next explored DPCDI expression patterns at single-cell resolution to investigate detailed molecular mechanism. Using scRNA-seq, we plotted the LV cell atlas from four HF patients, including CM, EC, FB, SMC, and MP populations. Among DPCDI genes, JAK2, IVNS1ABP, and OPTN showed higher expression in CM, while CXCL12 was enriched in EC and SMC. JAK2 was highly expressed in CM, but its upstream CXCL12 and downstream STAT3 were not correspondingly enriched, prompting investigation into the CXCL12-JAK2-STAT3 axis across populations. JAK-STAT signaling contributes to HF progression (Pang et al., 2023), but cell-to-cell mechanisms are poorly explained. Using CellChat, we analyzed cell interactions. CXCL12-expressing ECs and SMCs bind CXCR4 receptors on MPs, leading to CXCL signaling that promotes MP pro-inflammation (Kim et al., 2014). Pro-inflammatory MPs exhibit NAMPT activity and bind INSR ligands on CMs. INSR activation promotes JAK2-STAT3 signaling (Salminen et al., 2021). We propose that CXCL12-expressing ECs/SMCs stimulate MP pro-inflammation via CXCL12-CXCR4; pro-inflammatory MPs then interact with CMs via NAMPT-INSR to activate JAK2-STAT3 signaling. Pseudo-time analysis inferred trajectories for CM and EC populations. Two CM-expressed DPCDI genes (IVNS1ABP and JAK2) decreased dynamically during CM transition from oxidative stress to failure. CXCL12, an EC-expressed DPCDI gene, declined during EC transformation from migration and vessel morphogenesis. This indicates roles for IVNS1ABP and JAK2 in CM differentiation, and CXCL12 in EC differentiation.

To assess DPCDI druggability, we performed two large population-based MR studies at mRNA and protein levels. Using eQTL datasets, we identified negative causal associations between JAK2/STAT3 mRNA expression and HF risk. pQTL datasets validated protective roles of JAK2/STAT3 proteins against HF. CXCL12 protein level positively correlated with HF prevalence. Knockout of CXCL12, JAK2, and STAT3 in mice caused cardiovascular phenotypes like abnormal angiogenesis, poor arterial differentiation, and dilated heart, underscoring their pivotal roles in cardiovascular development and validating our MR results. CXCL12 level associates with increased MACEs in CAD patients (Zhang et al., 2022), indicating high risk. HF patients show JAK2 activation but severe STAT3 reduction (Cambi et al., 2012). STAT3 downregulation relates to end-stage HF (Hilfiker-Kleiner et al., 2005; Harhous et al., 2019), highlighting its protective role. Our findings support CXCL12 as a high-risk factor and JAK2/STAT3 as protective factors, suggesting therapeutic targets.

We predicted 20 small-molecule drugs reversing CXCL12 upregulation and STAT3 downregulation in HF. Pifithrin and strophanthidin were promising candidates, showing close affinities with CXCL12 and STAT3 through molecular docking. Pifithrin, a p53 inhibitor, protects against doxorubicin-induced apoptosis and attenuates myocardial ultrastructural alterations (Liu et al., 2004). Strophanthidin, a Na+/K+-ATPase inhibitor, exerts positive inotropic effects on failing human myocardium concentration-dependently (von Lewinski et al., 2007) and enhances anti-tumor activity via STAT3 in HepG2 cells (Reddy et al., 2019). Although predicted and validated in silico to interact with CXCL12 and STAT3, their cardio-protective mechanisms through reversing CXCL12/STAT3 expression remain unclear. Detailed investigations into pifithrin and strophanthidin effects on CXCL12 inhibition and STAT3 activation for HF alleviation are needed.

Our study has limitations. First, cohorts were retrospective. Further DPCDI investigation requires larger, multi-ethnic cohorts and clinical correlation. Second, DPCDI gene crosstalk in HF is incompletely explored, and more functional experiments are needed. Third, the therapeutic significance of CXCL12 and STAT3, particularly interactions with pifithrin and strophanthidin, requires detailed pharmacological study.

5 Conclusion

In summary, an optimal combination of Lasso and RF that contribute to a 15-gene signature (CALCOCO2, VPS13D, CLU, STAT3, OPTN, UBB, CXCL12, PPP1R15A, ATF4, IVNS1ABP, HMGB2, JAK2, EXOC7, ENO1, and TPCN1) was developed and validated to accurately predict HF, termed as DPCDI. We next leveraged DPCDI to partition HF into two subtypes, in which C2 presents a higher degree of apoptosis and mitophagy, while C1 shows the opposite. At the single-cell resolution, we found the dynamic expression of JAK2 and OPTN among the CM transition to failure status; and CXCL12 in EC transition to vessel morphogenesis. Moreover, we used the MR analysis on the large-scale eQTL and pQTL to elaborate on the causality between DPCDI and HF, with JAK2 and STAT3 as protective factors and CXCL12 as a high-risk factor. Using molecular docking, two agents including pifithrin and strophanthidin were predicted to closely interact with CXCL12 and STAT3. Repressing CXCL12 and stimulating STAT3 through the medication of pifithrin or strophanthidin may represent new therapeutic strategies for HF. Collectively, DPCDI provides a translatable framework for HF diagnosis, molecular subtyping, and precision therapeutics.

Data availability statement

The GEO database (http://www.ncbi.nlm.nih.gov/geo) contains datasets used in this study, including GSE141910, GSE116250, GSE135055, GSE16499, GSE5406, GSE57338, GSE79962, GSE42955, GSE52601, GSE21610, and GSE121893. To ensure productivity of the integrative ML framework and our DPCDI, the source code, scripts, processed datasets, and instructions have been archived in the Zenodo repository (https://doi.org/10.5281/zenodo.17918314).

Ethics statement

The animal study was approved by Institutional Animal Ethics Committee of Shanghai Jiao Tong University. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

LZ: Investigation, Conceptualization, Methodology, Funding acquisition, Writing – review and editing, Writing – original draft, Formal Analysis, Visualization, Data curation, Validation. YZ: Visualization, Methodology, Data curation, Validation, Conceptualization, Writing – original draft, Software, Investigation, Resources, Formal Analysis, Writing – review and editing. YF: Writing – review and editing, Data curation, Visualization. YaY: Writing – review and editing, Data curation. YiY: Data curation, Writing – review and editing. HW: Writing – review and editing, Data curation. XJ: Data curation, Writing – review and editing. XZ: Writing – review and editing, Data curation. DH: Funding acquisition, Writing – review and editing, Project administration, Supervision.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by grants from the National Natural Science Foundation of China (82570384 and 82172068) to DH. The Science and Technology Development Fund of Shanghai Pudong New Area (PKJ2022-Y07) and Hospital Basic Scientific Research Fund of the Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine (ynqn202316) to LZ.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1753636/full#supplementary-material

References

Adriaenssens, E., Nguyen, T. N., Sawa-Makarska, J., Khuu, G., Schuschnig, M., Shoebridge, S., et al. (2024). Control of mitophagy initiation and progression by the TBK1 adaptors NAP1 and SINTBAD. Nat. Struct. Mol. Biol. 31, 1717–1731. doi:10.1038/s41594-024-01338-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Anding, A. L., Wang, C., Chang, T. K., Sliter, D. A., Powers, C. M., Hofmann, K., et al. (2018). Vps13D encodes a Ubiq-uitin-Binding protein that is required for the regulation of mitochondrial size and clearance. Curr. Biol. 28 (2), 287–295.e286. doi:10.1016/j.cub.2017.11.064

PubMed Abstract | CrossRef Full Text | Google Scholar

Bian, R., Xu, X., and Li, W. (2022). Uncovering the molecular mechanisms between heart failure and end-stage renal disease via a bio-informatics study. Front. Genet. 13, 1037520. doi:10.3389/fgene.2022.1037520

PubMed Abstract | CrossRef Full Text | Google Scholar

Biau, G., and Scornet, E. (2016). A random forest guided tour. Test 25, 197–227. doi:10.1007/s11749-016-0481-7

CrossRef Full Text | Google Scholar

Bozkurt, B. (2024). Contemporary pharmacological treatment and management of heart failure. Nat. Rev. Cardiol. 21 (8), 545–555. doi:10.1038/s41569-024-00997-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Bozkurt, B., Coats, A. J. S., Tsutsui, H., Abdelhamid, C. M., Adamopoulos, S., Albert, N., et al. (2021). Universal definition and classification of heart failure: a report of the heart failure society of America, heart failure association of the european society of cardiology, Japanese heart failure society and writing committee of the uni-versal definition of heart failure: endorsed by the Canadian heart failure society, heart failure association of India, cardiac society of Australia and New Zealand, and Chinese heart failure association. Eur. J. Heart Fail 23 (3), 352–380. doi:10.1002/ejhf.2115

PubMed Abstract | CrossRef Full Text | Google Scholar

Cambi, G. E., Lucchese, G., Djeokeng, M. M., Modesti, A., Fiaschi, T., Faggian, G., et al. (2012). Impaired JAK2-induced activation of STAT3 in failing human myocytes. Mol. Biosyst. 8 (9), 2351–2359. doi:10.1039/c2mb25120e

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, J., Spielmann, M., Qiu, X., Huang, X., Ibrahim, D. M., Hill, A. J., et al. (2019). The single-cell transcriptional landscape of mammalian organogenesis. Nature 566 (7745), 496–502. doi:10.1038/s41586-019-0969-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Carow, B., and Rottenberg, M. E. (2014). SOCS3, a major regulator of infection and inflammation. Front. Immunol. 5, 58. doi:10.3389/fimmu.2014.00058

PubMed Abstract | CrossRef Full Text | Google Scholar

Del Re, D. P., Amgalan, D., Linkermann, A., Liu, Q., and Kitsis, R. N. (2019). Fundamental mechanisms of regulated cell death and impli-cations for heart disease. Physiol. Rev. 99 (4), 1765–1817. doi:10.1152/physrev.00022.2018

PubMed Abstract | CrossRef Full Text | Google Scholar

Deo, R. C. (2015). Machine learning in medicine. Circulation 132 (20), 1920–1930. doi:10.1161/CIRCULATIONAHA.115.001593

PubMed Abstract | CrossRef Full Text | Google Scholar

Díaz-Ramos, A., Roig-Borrellas, A., García-Melero, A., and López-Alemany, R. (2012). α-Enolase, a multifunctional protein: its role on pathophysiological situations. J. Biomed. Biotechnol. 2012, 156795. doi:10.1155/2012/156795

PubMed Abstract | CrossRef Full Text | Google Scholar

Elsworth, B., Lyon, M., Alexander, T., Liu, Y., Matthews, P., Hallett, J., et al. (2020). The MRC IEU OpenGWAS data infrastructure. BioRxiv 2020. doi:10.1101/2020.08.10.244293

CrossRef Full Text | Google Scholar

Fan, S., and Hu, Y. (2022). Integrative analyses of biomarkers and pathways for heart failure. BMC Med. Genomics 15 (1), 72. doi:10.1186/s12920-022-01221-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Farré, J. C., and Subramani, S. (2011). Rallying the exocyst as an autophagy scaffold. Cell 144 (2), 172–174. doi:10.1016/j.cell.2011.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferkingstad, E., Sulem, P., Atlason, B. A., Sveinbjornsson, G., Magnusson, M. I., Styrmisdottir, E. L., et al. (2021). Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 53 (12), 1712–1721. doi:10.1038/s41588-021-00978-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Flam, E., Jang, C., Murashige, D., Yang, Y., Morley, M. P., Jung, S., et al. (2022). Inte-grated landscape of cardiac metabolism in end-stage human nonischemic dilated cardiomyopathy. Nat. Cardiovasc Res. 1 (9), 817–829. doi:10.1038/s44161-022-00117-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Gaujoux, R., and Seoighe, C. (2010). A flexible R package for nonnegative matrix factorization. BMC Bioinforma. 11, 367. doi:10.1186/1471-2105-11-367

PubMed Abstract | CrossRef Full Text | Google Scholar

GBD 2017 Disease and Injury Incidence and Prevalence Collaborators (2018). Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990-2017: a systematic analysis for the global Burden of Disease Study 2017. Lancet 392 (10159), 1789–1858. doi:10.1016/S0140-6736(18)32279-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Gu, J. J., Du, T. J., Zhang, L. N., Zhou, J., Gu, X., and Zhu, Y. (2023). Identification of ferroptosis-related genes in heart failure induced by transverse aortic constriction. J. Inflamm. Res. 16, 4899–4912. doi:10.2147/JIR.S433387

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, L., and Xu, C. E. (2023). Integrated bioinformatics and machine learning algorithms reveal the critical cellular senes-cence-associated genes and immune infiltration in heart failure due to ischemic cardiomyopathy. Front. Immunol. 14, 1150304. doi:10.3389/fimmu.2023.1150304

PubMed Abstract | CrossRef Full Text | Google Scholar

Haghikia, A., Stapel, B., Hoch, M., and Hilfiker-Kleiner, D. (2011). STAT3 and cardiac remodeling. Heart Fail Rev. 16 (1), 35–47. doi:10.1007/s10741-010-9170-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Haghikia, A., Ricke-Hoch, M., Stapel, B., Gorst, I., and Hilfiker-Kleiner, D. (2014). STAT3, a key regulator of cell-to-cell communication in the heart. Cardiovasc Res. 102 (2), 281–289. doi:10.1093/cvr/cvu034

PubMed Abstract | CrossRef Full Text | Google Scholar

Hannenhalli, S., Putt, M. E., Gilmore, J. M., Wang, J., Parmacek, M. S., Epstein, J. A., et al. (2006). Tran-scriptional genomics associates FOX transcription factors with human heart failure. Circulation 114 (12), 1269–1276. doi:10.1161/CIRCULATIONAHA.106.632430

PubMed Abstract | CrossRef Full Text | Google Scholar

Hänzelmann, S., Castelo, R., and Guinney, J. (2013). GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioin-formatics 14, 7. doi:10.1186/1471-2105-14-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Hao, Y., Hao, S., Andersen-Nissen, E., Mauck, W. M., 3rd, Zheng, S., Butler, A., et al. (2021). Inte-grated analysis of multimodal single-cell data. Cell 184 (13), 3573–3587.e3529. doi:10.1016/j.cell.2021.04.048

PubMed Abstract | CrossRef Full Text | Google Scholar

Harhous, Z., Booz, G. W., Ovize, M., Bidaux, G., and Kurdi, M. (2019). An update on the multifaceted roles of STAT3 in the heart. Front. Cardiovasc Med. 6, 150. doi:10.3389/fcvm.2019.00150

PubMed Abstract | CrossRef Full Text | Google Scholar

Heidenreich, P. A., Bozkurt, B., Aguilar, D., Allen, L. A., Byun, J. J., Colvin, M. M., et al. (2022). AHA/ACC/HFSA Guideline for the management of heart failure: a report of the American college of Cardi-ology/American heart Association Joint Committee on clinical practice Guidelines. Circulation 145 (18), e895–e1032. doi:10.1161/CIR.0000000000001063

PubMed Abstract | CrossRef Full Text | Google Scholar

Hemani, G., Zheng, J., Elsworth, B., Wade, K. H., Haberland, V., Baird, D., et al. (2018). The MR-Base platform supports systematic causal inference across the human phenome. Elife 7, e34408. doi:10.7554/eLife.34408

PubMed Abstract | CrossRef Full Text | Google Scholar

Hilfiker-Kleiner, D., Hilfiker, A., and Drexler, H. (2005). Many good reasons to have STAT3 in the heart. Pharmacol. Ther. 107 (1), 131–137. doi:10.1016/j.pharmthera.2005.02.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Hotter, G., Mastora, C., Jung, M., Brüne, B., Carbonell, T., Josa, C., et al. (2020). The influenza virus NS1A binding protein gene modulates macrophages response to cytokines and phagocytic potential in inflammation. Sci. Rep. 10 (1), 15302. doi:10.1038/s41598-020-72342-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, C., Li, T., Xu, Y., Zhang, X., Li, F., Bai, J., et al. (2023). CellMarker 2.0: an updated database of man-ually curated cell markers in human/mouse and web tools based on scRNA-seq data. Nucleic Acids Res. 51 (D1), D870–d876. doi:10.1093/nar/gkac947

PubMed Abstract | CrossRef Full Text | Google Scholar

Hua, X., Wang, Y. Y., Jia, P., Xiong, Q., Hu, Y., Chang, Y., et al. (2020). Multi-level transcriptome sequencing identifies COL1A1 as a candidate marker in human heart failure progression. BMC Med. 18 (1), 2. doi:10.1186/s12916-019-1469-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Jankowski, J., Kozub, K. O., Kleibert, M., Camlet, K., Kleibert, K., and Cudnoch-Jędrzejewska, A. (2024). The role of programmed types of cell death in pathogenesis of heart failure with preserved ejection fraction. Int. J. Mol. Sci. 25 (18), 9921. doi:10.3390/ijms25189921

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, Y., Chen, L., Chao, Z., Chen, T., and Zhou, Y. (2022). Ferroptosis related genes in ischemic and idiopathic cardiomyopathy: screening for potential pharmacological targets. Front. Cell Dev. Biol. 10, 817819. doi:10.3389/fcell.2022.817819

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, S., Guerrero-Juarez, C. F., Zhang, L., Chang, I., Ramos, R., Kuan, C. H., et al. (2021). Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 12 (1), 1088. doi:10.1038/s41467-021-21246-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, D., Kim, J., Yoon, J. H., Ghim, J., Yea, K., Song, P., et al. (2014). CXCL12 secreted from adipose tissue recruits macrophages and induces insulin resistance in mice. Diabetologia 57 (7), 1456–1465. doi:10.1007/s00125-014-3237-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Kong, S. W., Hu, Y. W., Ho, J. W., Ikeda, S., Polster, S., John, R., et al. (2010). Heart fail-ure-associated changes in RNA splicing of sarcomere genes. Circ. Cardiovasc Genet. 3 (2), 138–146. doi:10.1161/CIRCGENETICS.109.904698

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Lu, F., and Yin, Y. (2022). Applying logistic LASSO regression for the diagnosis of atypical Crohn's disease. Sci. Rep. 12 (1), 11340. doi:10.1038/s41598-022-15609-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, X., Chua, C. C., Gao, J., Chen, Z., Landy, C. L., Hamdy, R., et al. (2004). Pifithrin-alpha protects against doxorubicin-induced apoptosis and acute cardiotoxicity in mice. Am. J. Physiol. Heart Circ. Physiol. 286 (3), H933–H939. doi:10.1152/ajpheart.00759.2003

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y., Morley, M., Brandimarto, J., Hannenhalli, S., Hu, Y., Ashley, E. A., et al. (2015). RNA-Seq identifies novel myocardial gene expression signatures of heart failure. Genomics 105 (2), 83–89. doi:10.1016/j.ygeno.2014.12.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Z., Liu, L., Weng, S., Guo, C., Dang, Q., Xu, H., et al. (2022). Machine learning-based integration develops an immune-derived lncRNA signature for improving outcomes in colorectal cancer. Nat. Commun. 13 (1), 816. doi:10.1038/s41467-022-28421-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, C., Tu, D., Xu, Q., Wu, Y., Song, X., Guo, Z., et al. (2023). Identification of m(7)G regulator-mediated RNA methylation modifi-cation patterns and related immune microenvironment regulation characteristics in heart failure. Clin. Epigenetics 15 (1), 22. doi:10.1186/s13148-023-01439-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Matkovich, S. J., Al, K. B., Efimov, I. R., Evans, S., Vader, J., Jain, A., et al. (2017). Widespread down-regulation of cardiac mitochondrial and sarcomeric genes in patients with sepsis. Crit. Care Med. 45 (3), 407–414. doi:10.1097/CCM.0000000000002207

PubMed Abstract | CrossRef Full Text | Google Scholar

Nikoletopoulou, V., Markaki, M., Palikaras, K., and Tavernarakis, N. (2013). Crosstalk between apoptosis, necrosis and autophagy. Bio-chim Biophys. Acta 1833 (12), 3448–3459. doi:10.1016/j.bbamcr.2013.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Narula, J., Haider, N., Arbustini, E., and Chandrashekhar, Y. (2006). Mechanisms of disease: apoptosis in heart failure--seeing hope in death. Nat. Clin. Pract. Cardiovasc Med. 3 (12), 681–688. doi:10.1038/ncpcardio0710

PubMed Abstract | CrossRef Full Text | Google Scholar

Noguchi, M., Hirata, N., Tanaka, T., Suizu, F., Nakajima, H., and Chiorini, J. A. (2020). Autophagy as a modulator of cell death machinery. Cell Death Dis. 11 (7), 517. doi:10.1038/s41419-020-2724-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Pang, Q., You, L., Meng, X., Li, Y., Deng, T., Li, D., et al. (2023). Regulation of the JAK/STAT signaling pathway: the promising targets for cardiovascular disease. Biochem. Pharmacol. 213, 115587. doi:10.1016/j.bcp.2023.115587

PubMed Abstract | CrossRef Full Text | Google Scholar

Pilarczyk, M., Fazel-Najafabadi, M., Kouril, M., Shamsaei, B., Vasiliauskas, J., Niu, W., et al. (2022). Connecting omics signatures and revealing biological mechanisms with iLINCS. Nat. Commun. 13 (1), 4678. doi:10.1038/s41467-022-32205-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Portokallidou, K., Dovrolis, N., Ragia, G., Atzemian, N., Kolios, G., and Manolopoulos, V. G. (2023). Multi-omics integration to identify the genetic expression and protein signature of dilated and ischemic cardiomyopathy. Front. Cardiovasc Med. 10, 1115623. doi:10.3389/fcvm.2023.1115623

PubMed Abstract | CrossRef Full Text | Google Scholar

Reddy, D., Ghosh, P., and Kumavath, R. (2019). Strophanthidin Attenuates MAPK, PI3K/AKT/mTOR, and Wnt/β-Catenin signaling pathways in human cancers. Front. Oncol. 9, 1469. doi:10.3389/fonc.2019.01469

PubMed Abstract | CrossRef Full Text | Google Scholar

Salminen, A., Kaarniranta, K., and Kauppinen, A. (2021). Insulin/IGF-1 signaling promotes immunosuppression via the STAT3 pathway: impact on the aging process and age-related diseases. Inflamm. Res. 70 (10-12), 1043–1061. doi:10.1007/s00011-021-01498-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Savarese, G., Becher, P. M., Lund, L. H., Seferovic, P., Rosano, G. M. C., and Coats, A. J. S. (2023). Global burden of heart failure: a comprehensive and updated review of epidemiology. Cardiovasc Res. 118 (17), 3272–3287. doi:10.1093/cvr/cvac013

PubMed Abstract | CrossRef Full Text | Google Scholar

Shu, S., Fu, M., Chen, X., Zhang, N., Zhao, R., Chang, Y., et al. (2022). Cellular landscapes of nondiseased human cardiac valves from end-stage heart failure-explanted heart. Arterioscler. Thromb. Vasc. Biol. 42 (12), 1429–1446. doi:10.1161/ATVBAHA.122.318314

PubMed Abstract | CrossRef Full Text | Google Scholar

Sweet, M. E., Cocciolo, A., Slavov, D., Jones, K. L., Sweet, J. R., Graw, S. L., et al. (2018). Transcriptome analysis of human heart failure reveals dysregulated cell adhesion in dilated cardiomyopathy and acti-vated immune pathways in ischemic heart failure. BMC Genomics 19 (1), 812. doi:10.1186/s12864-018-5213-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Tan, W. L. W., Anene-Nzelu, C. G., Wong, E., Lee, C. J. M., Tan, H. S., Tang, S. J., et al. (2020). Epige-nomes of human hearts reveal new genetic variants relevant for cardiac disease and phenotype. Circ. Res. 127 (6), 761–777. doi:10.1161/CIRCRESAHA.120.317254

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, K., Ji, X., Zhou, M., Deng, Z., Huang, Y., Zheng, G., et al. (2021). Rank-in: enabling integrative analysis across microarray and RNA-seq for cancer. Nucleic Acids Res. 49 (17), e99. doi:10.1093/nar/gkab554

PubMed Abstract | CrossRef Full Text | Google Scholar

von Lewinski, D., Bisping, E., Elgner, A., Kockskämper, J., and Pieske, B. (2007). Mechanistic insight into the functional and toxic effects of Strophanthidin in the failing human myocardium. Eur. J. Heart Fail 9 (11), 1086–1094. doi:10.1016/j.ejheart.2007.08.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Wanderoy, S., Hees, J. T., Klesse, R., Edlich, F., and Harbauer, A. B. (2020). Kill one or kill the many: interplay between mitophagy and apoptosis. Biol. Chem. 402 (1), 73–88. doi:10.1515/hsz-2020-0231

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Z., Lachmann, A., and Keenan, A. B. (2018). Ma'ayan A: L1000FWD: fireworks visualization of drug-induced transcriptomic sig-natures. Bioinformatics 34 (12), 2150–2152. doi:10.1093/bioinformatics/bty060

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Yu, P., Zhou, B., Song, J., Li, Z., Zhang, M., et al. (2020). Single-cell reconstruction of the adult human heart during heart failure and recovery reveals the cellular landscape underlying cardiac function. Nat. Cell Biol. 22 (1), 108–119. doi:10.1038/s41556-019-0446-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Liu, Z., Liang, R., Wang, W., Zhu, R., Li, J., et al. (2022). Comprehensive machine-learning survival framework develops a consensus model in large-scale multicenter cohorts for pancreatic cancer. Elife 11, e80150. doi:10.7554/eLife.80150

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Rao, J., Chen, X., Wang, Z., and Zhang, Y. (2024). Identification of shared signature genes and immune microenvironment subtypes for heart failure and chronic kidney disease based on machine learning. J. Inflamm. Res. 17, 1873–1895. doi:10.2147/JIR.S450736

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiong, J., and Zhu, M. X. (2016). Regulation of lysosomal ion homeostasis by channels and transporters. Sci. China Life Sci. 59 (8), 777–791. doi:10.1007/s11427-016-5090-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Yagi, A., Hasegawa, Y., Xiao, H., Haneda, M., Kojima, E., Nishikimi, A., et al. (2003). GADD34 induces p53 phosphorylation and p21/WAF1 transcription. J. Cell Biochem. 90 (6), 1242–1249. doi:10.1002/jcb.10711

PubMed Abstract | CrossRef Full Text | Google Scholar

Yamaguchi, T., Sumida, T. S., Nomura, S., Satoh, M., Higo, T., Ito, M., et al. (2020). Cardiac dopamine D1 receptor triggers ventricular arrhythmia in chronic heart failure. Nat. Commun. 11 (1), 4364. doi:10.1038/s41467-020-18128-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Yamazaki, T., and Galluzzi, L. (2022). BAX and BAK dynamics control mitochondrial DNA release during apoptosis. Cell Death Differ. 29 (6), 1296–1298. doi:10.1038/s41418-022-00985-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L. G., Han, Y., and He, Q. Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. Omics 16 (5), 284–287. doi:10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Y., Li, L., Luo, B., Chen, D., Yin, C., Jian, C., et al. (2024a). Predicting potential therapeutic targets and small molecule drugs for early-stage lung adenocarcinoma. Biomed. Pharmacother. 174, 116528. doi:10.1016/j.biopha.2024.116528

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Y., Wang, L., Hou, W., Xue, Y., Liu, X., and Li, Y. (2024b). Identification and validation of aging-related genes in heart failure based on multiple machine learning algorithms. Front. Immunol. 15, 1367235. doi:10.3389/fimmu.2024.1367235

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J. (2022). ClusterGVis: one-step to cluster and visualize gene expression matrix.

Google Scholar

Zhang, Y., Ding, H., Wang, X., and Ye, S. D. (2018). Modulation of STAT3 phosphorylation by PTPN2 inhibits naïve pluripotency of em-bryonic stem cells. FEBS Lett. 592 (13), 2227–2237. doi:10.1002/1873-3468.13112

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, S., Ding, Y., Feng, F., and Gao, Y. (2022). The role of blood CXCL12 level in prognosis of coronary artery disease: a meta-analysis. Front. Cardiovasc Med. 9, 938540. doi:10.3389/fcvm.2022.938540

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, D., Wen, Q., Zhang, R., Kou, K., Lin, M., Zhang, S., et al. (2024). From cell to gene: deciphering the mechanism of heart failure with single-cell sequencing. Adv. Sci. (Weinh) 11 (39), e2308900. doi:10.1002/advs.202308900

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, J., Ming, J., Hu, X., Chen, G., Liu, J., and Yang, C. (2020). Bayesian weighted Mendelian randomization for causal inference based on summary statistics. Bioinformatics 36 (5), 1501–1508. doi:10.1093/bioinformatics/btz749

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, L., Sun, J., Gu, L., Wang, S., Yang, T., Wei, T., et al. (2021). Programmed cell death: complex regulatory networks in cardiovascular disease. Front. Cell Dev. Biol. 9, 794879. doi:10.3389/fcell.2021.794879

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, Y., Chen, Y., and Zu, Y. (2024). Leveraging a neutrophil-derived PCD signature to predict and stratify patients with acute myo-cardial infarction: from AI prediction to biological interpretation. J. Transl. Med. 22 (1), 612. doi:10.1186/s12967-024-05415-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: bioinformatics, cell death, heart failure, machine learning, multi-omics

Citation: Zhang L, Zhu Y, Fang Y, Yang Y, Yu Y, Wang H, Jiang X, Zhang X and Huang D (2026) DPCDI: an artificial intelligent-derived indicator interpreting the diagnostic, stratification, and therapeutic implications of druggability programmed cell death in heart failure. Front. Genet. 16:1753636. doi: 10.3389/fgene.2025.1753636

Received: 25 November 2025; Accepted: 19 December 2025;
Published: 15 January 2026.

Edited by:

Enkai Li, Washington University in St. Louis, United States

Reviewed by:

Chunhua Liu, Shanghai Ocean University, China
Baihe Sun, Tongji University, China

Copyright © 2026 Zhang, Zhu, Fang, Yang, Yu, Wang, Jiang, Zhang and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dong Huang, aHVhbmdkb25nMTAwNEBzanR1LmVkdS5jbg==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.