BRIEF RESEARCH REPORT article

Front. Pediatr., 16 May 2025

Sec. General Pediatrics and Pediatric Emergency Care

Volume 13 - 2025 | https://doi.org/10.3389/fped.2025.1518908

Identification of the key genes in children with sepsis by WGCNA in multiple GEO datasets

  • Yue-chuan Shen 1,2

  • Dao-jun Yu 3

  • Ze Yu 4*

  • Xue Zhao 5*

  • 1. Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China

  • 2. Department of Emergency, Zhoushan Hospital, Wenzhou Medical University, Zhoushan, China

  • 3. Department of Laboratory Medicine, Hangzhou First People's Hospital, the Fourth School of Clinical Medicine, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China

  • 4. The Laboratory of Cytobiology and Molecular Biology, Zhoushan Hospital, Wenzhou Medical University, Zhoushan, China

  • 5. Department of Emergency, Affiliated Hangzhou First People's Hospital, School of Medicine, Westlake University, Hangzhou, Zhejiang, China

Abstract

Pediatric sepsis is a serious condition causing organ failure owing to immune dysregulation, linked to high morbidity and mortality, highlighting the need for quick detection and treatment. This study aims to identify key genes involved in pediatric sepsis using three gene expression datasets from the Gene Expression Omnibus. We first identified differentially expressed genes (DEGs) with R, then conducted a gene set enrichment analysis, and integrated DEGs with important module genes from weighted gene coexpression network analysis. We also screened adult sepsis datasets to find genes specific to pediatric cases, ultimately validating XCL1 as a key gene. This study suggests that XCL1 is crucial in understanding pediatric sepsis etiology and its molecular mechanisms.

1 Introduction

Sepsis is a life-threatening condition affecting all ages, marked by abnormal immune responses and organ dysfunction, posing a significant public health challenge. Globally, pediatric sepsis occurs in 22 cases per 100,000 person-years, and neonatal sepsis at 2,202 cases per 100,000 live births, totaling around 1.2 million cases annually. The pediatric sepsis mortality rate is 25%, mainly due to refractory shock or organ dysfunction, with many deaths occurring within the first 48–72 h. Timely detection, appropriate resuscitation, and meticulous care are essential for optimizing the prognosis of children with sepsis.

Historically, sepsis was believed to result primarily from a sustained inflammatory response to infection. However, clinical research aimed at treating sepsis by targeting key inflammatory molecules, either selectively or non-selectively, has not achieved significant progress. Most research has revealed that sepsis development involves not only a prolonged and intense inflammatory response but also immunosuppression. This process involves a complex molecular network formed by interactions among cytokines, chemokines, and neuroendocrine factors.

In recent years, advanced analytical methods that leverage biological networks have emerged to extract key information from a broad range of histological data and to uncover the interactions present within this information. The main types are gene regulation, protein interactions, and coexpression networks. Weighted gene coexpression network analysis (WGCNA) helps reveal connections between gene clusters with similar expression in transcriptomic data and disease phenotypes, aiding in identifying molecular markers or therapeutic targets in complex diseases.

This study established a WGCNA network utilizing data from the Gene Expression Omnibus (GEO), encompassing peripheral whole blood samples from children with sepsis and healthy controls. Through the application of coexpression networks and diverse bioinformatics methodologies, this research elucidated modules and hub genes correlated with the prognosis of pediatric sepsis, with the objective of identifying potential biomarkers closely associated with clinical outcomes.

2 Materials and methods

2.1 Data sources and gene expression profiles

We searched the GEO database for high-throughput functional genomics studies on pediatric sepsis and selected microarray datasets from patients with sepsis and healthy controls: GSE26378, GSE26440, GSE13904, and GSE131761. The limma package was used for statistical analysis, error detection, data cleaning, and organization. Data were normalized with the robust multi-array average (RMA) method, and limma identified DEGs with p < 0.05 and |log2 fold-change| ≥ 1.
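
The DEG cutoffs described above can be illustrated with a minimal sketch. It is written in Python rather than R and is not the authors' limma pipeline; the gene names, fold changes, and p-values are made up, and `select_degs` is a hypothetical helper that only applies the two thresholds to a limma-style results table.

```python
# Hypothetical limma-style result rows: (gene, log2 fold-change, p-value).
# All values are invented for illustration only.
results = [
    ("GENE_A", 1.8, 0.001),
    ("GENE_B", -1.3, 0.020),
    ("GENE_C", 0.4, 0.003),   # significant, but fold change too small
    ("GENE_D", 2.1, 0.300),   # large fold change, but not significant
]

def select_degs(rows, p_cutoff=0.05, lfc_cutoff=1.0):
    """Split genes passing both cutoffs into up- and downregulated lists."""
    up, down = [], []
    for gene, log2fc, p_value in rows:
        if p_value < p_cutoff and abs(log2fc) >= lfc_cutoff:
            (up if log2fc > 0 else down).append(gene)
    return up, down

up_genes, down_genes = select_degs(results)
print("up:", up_genes, "down:", down_genes)
```

Taking the absolute value of the fold change keeps both upregulated and downregulated genes, which matches the way the adult dataset is later summarized as separate up- and downregulated counts.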

GSE26378, GSE26440, and GSE13904 are pediatric sepsis datasets, whereas GSE131761 is an adult sepsis dataset. Clinical details for all four datasets are given in Supplementary Table S1. Sepsis-related genes in children were identified from the three pediatric datasets; GSE131761 was excluded from this step.

2.2 WGCNA analysis and module identification

WGCNA was performed with the WGCNA R package to construct gene coexpression networks. Sample cluster analysis was used to identify outliers, and coexpression networks were then built with an automated procedure. Modules were defined by hierarchical clustering and dynamic tree cutting, and Module Membership (MM) and Gene Significance (GS) were evaluated for associations with clinical traits. Central modules were those with a high MM correlation and a p-value below 0.05, with MM over 0.80 and GS above 0.1 indicating strong connectivity and significance. Gene information for these modules was retained for further analysis.

Genes within a clinically significant module with a GS value exceeding 0.2 and an MM value greater than 0.80 were classified as hub genes. Genes that overlapped across analyses were chosen as candidates for pivotal roles. The Venn R package was used to draw Venn diagrams of the overlapping genes.
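
As a rough illustration of the hub-gene rule (GS above 0.2 and MM above 0.80), the following Python sketch filters a small table of per-gene statistics. The gene names and values are hypothetical, `hub_genes` is an illustrative helper rather than a WGCNA function, and the sketch assumes the thresholds are applied to the raw MM and GS values as stated.

```python
# Hypothetical per-gene statistics for one module: gene -> (MM, GS).
module_stats = {
    "GENE_A": (0.91, 0.35),
    "GENE_B": (0.85, 0.15),  # well connected, but GS too low
    "GENE_C": (0.62, 0.40),  # trait-associated, but MM too low
    "GENE_D": (0.88, 0.27),
}

def hub_genes(stats, mm_cutoff=0.80, gs_cutoff=0.2):
    """Return genes whose MM and GS both exceed the stated cutoffs."""
    return sorted(
        gene for gene, (mm, gs) in stats.items()
        if mm > mm_cutoff and gs > gs_cutoff
    )

hubs = hub_genes(module_stats)
print(hubs)
```

Requiring both conditions keeps only genes that are central to their module and also associated with the trait of interest.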

2.3 Supplementary method

Supplementary data provide the details on the methods for identifying DEGs, conducting functional enrichment analysis, and performing GeneMANIA analysis.

3 Results

3.1 Identification of DEGs in these datasets

DEGs were identified in the GSE26378, GSE26440, and GSE13904 datasets using p < 0.05 and |log2 (fold-change)| > 1 (Figure 1A). These DEGs were involved in immune responses, especially neutrophil activation and cytokine production (Supplementary Figures S1–S3). A Venn diagram of the three DEG lists identified 357 common genes (Figure 1B). GO and KEGG analyses of these overlapping genes showed that they were primarily enriched in biological processes associated with T-cell differentiation and PD-L1 expression (Figure 1C, Supplementary Figure S4).
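
The Venn overlap of the three DEG lists amounts to a set intersection. The Python sketch below shows the idea with invented gene names; only the dataset accession numbers come from the text, and the tiny sets stand in for the real DEG lists.

```python
# Hypothetical DEG sets per dataset; real lists would come from the limma step.
degs = {
    "GSE26378": {"GENE_A", "GENE_B", "GENE_C"},
    "GSE26440": {"GENE_B", "GENE_C", "GENE_D"},
    "GSE13904": {"GENE_B", "GENE_C", "GENE_E"},
}

# Genes differentially expressed in every dataset (the center of the Venn diagram).
common_degs = set.intersection(*degs.values())
print(sorted(common_degs))
```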

Figure 1

3.2 Identification of coexpression gene modules in pediatric sepsis

We used WGCNA to find coexpression gene modules in the pediatric sepsis dataset GSE26378, selecting a soft-threshold power of β = 16 to obtain a scale-free network with a scale independence value above 0.85 (Figures 2A,B). Samples from the GSE26378 dataset were classified into the pediatric sepsis group and the control group, with no outliers detected (Figure 2C). Hierarchical clustering and dynamic branch cutting of the gene dendrogram identified 17 modules (Figures 2D,E). The heatmap displays the topological overlap matrix (TOM) of the analyzed genes and shows a high degree of independence among the modules (Figure 2E). The brown module (positively correlated) and the coral2 module (negatively correlated) were significantly correlated with pediatric sepsis and were selected for further examination (Figure 2F). Based on module membership, these two modules contain 35 genes significantly linked to pediatric sepsis.
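
To show what the soft-threshold power does, here is a minimal Python sketch of the standard WGCNA adjacency, in which each pairwise correlation is raised to the power β. It assumes an unsigned network (absolute Pearson correlation), which the text does not state, and uses invented expression values; `pearson` and `adjacency` are illustrative helpers, not functions from the WGCNA package.

```python
import math

def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def adjacency(expr, beta=16):
    """Unsigned soft-threshold adjacency: a_ij = |cor(i, j)| ** beta."""
    genes = list(expr)
    return {
        (g, h): abs(pearson(expr[g], expr[h])) ** beta
        for g in genes for h in genes if g != h
    }

# Hypothetical expression values across five samples.
expr = {
    "GENE_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "GENE_B": [2.0, 4.0, 6.0, 8.0, 10.0],  # perfectly correlated with GENE_A
    "GENE_C": [3.0, 1.0, 4.0, 1.0, 5.0],   # weakly correlated with GENE_A
}
adj = adjacency(expr, beta=16)
print(adj[("GENE_A", "GENE_B")], adj[("GENE_A", "GENE_C")])
```

A high power such as 16 pushes weak correlations toward zero while leaving strong ones close to one, which is what moves the network toward a scale-free topology.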

Figure 2

We applied the same method to identify hub modules and genes in the two other datasets (GSE26440 and GSE13904). As shown in Supplementary Figures S5, S6, the hub modules lightpink4 and darkorange2 in the GSE26440 dataset contained a total of 345 genes, and the hub modules antiquewhite4 and palevioletred3 in the GSE13904 dataset contained a total of 287 genes. A Venn analysis of the 357 DEGs obtained previously and the hub genes from the three datasets revealed 16 intersecting genes, including CD160, XCL1, and CLIC3 (Figure 2G).

3.3 Identification of pediatric sepsis–related hub genes

Furthermore, to examine the specificity of these 16 genes in pediatric sepsis, we also used another adult sepsis dataset (GSE131761), whose patient clinical information was also presented in Supplementary Table S1. In the GSE131761 dataset, 559 genes were upregulated and 628 were downregulated in adult sepsis (Supplementary Figure S7).

An intersection analysis of the 1,187 adult DEGs and the 16 pediatric hub genes yielded five hub genes specific to pediatric sepsis, namely XCL1, CD160, KLRC3, TGFBR3, and PYHIN1 (Figure 2H). A second adult sepsis dataset (GSE46955) was then used to analyze these hub genes. Differential expression analysis of the five core genes showed that only XCL1 and CD160 did not differ significantly between adults with sepsis and healthy controls, so XCL1 and CD160 were retained as candidate hub genes in pediatric sepsis (Supplementary Figure S8). Finally, a GeneMANIA analysis revealed that XCL1 had the most interactions among the five genes and was involved in a variety of cell signal transduction functions, including cytokine receptor binding, leukocyte migration, cytokine activity, and cellular chemotaxis (Figure 2I).

4 Discussion

Sepsis is a dysregulated response to infection and is common in children with various illnesses. Pediatric sepsis has a better prognosis than adult sepsis, but risks remain. Children undergo rapid growth and immune changes from birth to adolescence, which affect their responses to respiratory infections. Treatment guidelines for sepsis underscore the need for prompt administration of antibiotics in children with a high suspicion of sepsis to improve prognosis. Identifying diagnostic markers and immune cell patterns in pediatric sepsis is therefore crucial for optimizing prognosis and for understanding the impact of the disease on the immune system.

WGCNA identifies genes with similar expression profiles and organizes them into modules, suggesting interconnected functions and shared signaling pathways. WGCNA improves coexpression analysis by removing strict thresholds, preserving key biological information.

We used blood gene expression data linked to pediatric sepsis to create a coexpression network, identifying modules associated with sepsis and highlighting the gene XCL1, which was significantly elevated in affected children. This discovery not only offers new insights into the pathogenesis of pediatric sepsis but also establishes a foundation for future investigations into related diagnostic and therapeutic strategies.

XCL1 is produced by T, NK, and NKT cells during infections and inflammation, plays a key role in these processes, and is linked to diseases such as infections, autoimmune disorders, and tumors. Some studies show that XCL1 expression increases in various infections, notably in activated CD8+ T cells during chronic tuberculosis in mice, indicating a link to pathogenesis. In a model of experimental pneumococcal meningitis, XCL1 and other cytokines have been detected during the acute phase of infection. Furthermore, XCL1 expression is similarly elevated in mice with chronic infections such as cytomegalovirus and herpes simplex virus. In autoimmune diseases, XCL1 expression is also heightened. It can be identified in the synovial tissue of patients with rheumatoid arthritis, with elevated levels observed in tissue samples from sarcoidosis and Crohn's disease. The increased expression of XCL1 is crucial to the development and pathogenesis of inflammatory neurological diseases, including multiple sclerosis and HTLV-1-associated myelopathy (HAM). These findings underscore the significant role of XCL1 in a range of infections and autoimmune diseases.

This study analyzed the three datasets independently. Integrating multiple datasets requires addressing batch effects to avoid biased results; this can be done with methods such as ComBat or Harmony, with the change in batch effects visualized through principal component analysis (PCA) or t-distributed stochastic neighbor embedding (t-SNE). Because the present conclusions were drawn without batch correction, they may be affected by technical variation. Future research should validate the key findings in independent cohorts or through sensitivity analysis.
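
The intuition behind batch correction can be sketched with per-batch mean-centering of a single gene. This is a much simpler stand-in than ComBat or Harmony and is not what either method implements; the expression values are invented, the batch labels merely reuse two dataset accessions from the text, and `center_by_batch` is a hypothetical helper.

```python
from collections import defaultdict

def center_by_batch(values, batches):
    """Subtract each batch's mean from its own samples for a single gene."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for value, batch in zip(values, batches):
        totals[batch] += value
        counts[batch] += 1
    means = {batch: totals[batch] / counts[batch] for batch in totals}
    return [value - means[batch] for value, batch in zip(values, batches)]

# Hypothetical expression of one gene in four samples from two datasets.
expression = [5.0, 7.0, 9.0, 11.0]
batch_labels = ["GSE26378", "GSE26378", "GSE26440", "GSE26440"]

centered = center_by_batch(expression, batch_labels)
print(centered)
```

Centering removes the offset between datasets, but it would also remove real biological differences if disease status were unevenly distributed across batches, which is one reason model-based methods are preferred in practice.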

Our findings suggest that XCL1 is the hub gene in pediatric sepsis. This study is not without limitations: the conclusions have not yet been validated in animal models or clinical samples. In addition, they are based on a retrospective analysis of public datasets and have not been validated in an independent cohort, particularly a prospective cohort of children with sepsis, and may be limited by the heterogeneity of the original data (e.g., treatment regimens, ethnic differences) and by technical bias (e.g., interplatform batch effects). We hope to obtain a multicenter pediatric sepsis cohort through international cooperation (such as the European Sepsis database and the American PHIS database). Subsequent validation experiments should cover different age stages (neonates/children), pathogen types (bacteria/viruses), and sepsis phenotypes (shock/non-shock) to assess the broad applicability of the markers.

In the future, we will undertake systematic investigations, including the assessment of XCL1 expression levels in children with sepsis and the correlation with clinical characteristics (e.g., disease severity), thereby exploring the function of XCL1 in pediatric sepsis. By employing in vitro cell models and small animal models, we will comprehensively explore the biological roles of XCL1 in pediatric sepsis, analyzing how XCL1 modulates the immune response and pathological progression of sepsis through the activation and chemotaxis of immune cells, including T cells, NK cells, and macrophages.

5 Conclusions

The pathogenesis of sepsis is intricate and not yet completely elucidated; however, it is marked by a sustained, excessive inflammatory response and a disturbance in intraorganismal homeostasis that is difficult to restore. To identify molecular targets reflective of pediatric sepsis pathology, it is crucial to reevaluate our research methodologies and promote interdisciplinary collaboration, particularly between medicine and fields such as computer science, to advance innovation in the diagnosis and prevention of pediatric sepsis.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding authors.

Author contributions

YS: Conceptualization, Data curation, Methodology, Project administration, Writing – original draft. DY: Resources, Supervision, Validation, Visualization, Writing – original draft. ZY: Conceptualization, Data curation, Formal analysis, Resources, Writing – review & editing. XZ: Investigation, Methodology, Project administration, Software, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This project is supported by a special fund of the Emergency Department of Zhoushan Hospital.

Acknowledgments

The authors acknowledge the doctors in the Emergency Department of Zhoushan Hospital for their help during the study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2025.1518908/full#supplementary-material

References

  • 1. Vincent JL. Current sepsis therapeutics. EBioMedicine. (2022) 86:104318. doi: 10.1016/j.ebiom.2022.104318

  • 2. Schlapbach LJ, Watson RS, Sorce LR, Argent AC, Menon K, Hall MW, et al. International consensus criteria for pediatric sepsis and septic shock. JAMA. (2024) 331:665–74. doi: 10.1001/jama.2024.0179

  • 3. Sanchez-Pinto LN, Bennett TD, DeWitt PE, Russell S, Rebull MN, Martin B, et al. Development and validation of the Phoenix criteria for pediatric sepsis and septic shock. JAMA. (2024) 331:675–86. doi: 10.1001/jama.2024.0196

  • 4. Thadani S, Goldstein S, Conroy AL. Phoenix criteria for pediatric sepsis and septic shock. JAMA. (2024) 331:2049–50. doi: 10.1001/jama.2024.8199

  • 5. Xiang L, Ren H, Wang Y, Zhang J, Qian J, Li B, et al. Clinical value of pediatric sepsis-induced coagulopathy score in diagnosis of sepsis-induced coagulopathy and prognosis in children. J Thromb Haemost. (2021) 19:2930–7. doi: 10.1111/jth.15500

  • 6. Gu Y, Li Z, Li H, Yi X, Liu X, Zhang Y, et al. Exploring the efficacious constituents and underlying mechanisms of sini decoction for sepsis treatment through network pharmacology and multi-omics. Phytomedicine. (2024) 123:155212. doi: 10.1016/j.phymed.2023.155212

  • 7. Leise BS, Fugler LA. Laminitis updates: sepsis/systemic inflammatory response syndrome-associated laminitis. Vet Clin North Am Equine Pract. (2021) 37:639–56. doi: 10.1016/j.cveq.2021.08.003

  • 8. Gupta OP, Deshmukh R, Kumar A, Singh SK, Sharma P, Ram S, et al. From gene to biomolecular networks: a review of evidences for understanding complex biological function in plants. Curr Opin Biotechnol. (2022) 74:66–74. doi: 10.1016/j.copbio.2021.10.023

  • 9. Lin W, Wang Y, Chen Y, Wang Q, Gu Z, Zhu Y. Role of calcium signaling pathway-related gene regulatory networks in ischemic stroke based on multiple WGCNA and single-cell analysis. Oxid Med Cell Longev. (2021) 2021:8060477. doi: 10.1155/2021/8060477

  • 10. Wong HR, Cvijanovich NZ, Allen GL, Thomas NJ, Freishtat RJ, Anas N, et al. Corticosteroids are associated with repression of adaptive immunity gene programs in pediatric septic shock. Am J Respir Crit Care Med. (2014) 189:940–6. doi: 10.1164/rccm.201401-0171OC

  • 11. Wong HR, Cvijanovich N, Lin R, Allen GL, Thomas NJ, Willson DF, et al. Identification of pediatric septic shock subclasses based on genome-wide expression profiling. BMC Med. (2009) 7:34. doi: 10.1186/1741-7015-7-34

  • 12. Wong HR, Cvijanovich N, Allen GL, Lin R, Anas N, Meyer K, et al. Genomic expression profiling across the pediatric systemic inflammatory response syndrome, sepsis, and septic shock spectrum. Crit Care Med. (2009) 37:1558–66. doi: 10.1097/CCM.0b013e31819fcc08

  • 13. Martínez-Paz P, Aragón-Camino M, Gómez-Sánchez E, Lorenzo-López M, Gómez-Pesquera E, Fadrique-Fuentes A, et al. Distinguishing septic shock from non-septic shock in postsurgical patients using gene expression. J Infect. (2021) 83:147–55. doi: 10.1016/j.jinf.2021.05.039

  • 14. Franz M, Rodriguez H, Lopes C, Zuberi K, Montojo J, Bader GD, et al. GeneMANIA update 2018. Nucleic Acids Res. (2018) 46:W60–4. doi: 10.1093/nar/gky311

  • 15. Souza DC, Jaramillo-Bustamante JC, Cespedes-Lesczinsky M, Quintero EMC, Jimenez HJ, Jabornisky R, et al. Challenges and health-care priorities for reducing the burden of paediatric sepsis in Latin America: a call to action. Lancet Child Adolesc Health. (2022) 6:129–36. doi: 10.1016/S2352-4642(21)00341-2

  • 16. Molloy EJ, Bearer CF. Paediatric and neonatal sepsis and inflammation. Pediatr Res. (2022) 91:267–9. doi: 10.1038/s41390-021-01918-4

  • 17. Wong HR. Pediatric sepsis biomarkers for prognostic and predictive enrichment. Pediatr Res. (2022) 91:283–8. doi: 10.1038/s41390-021-01620-5

  • 18. Al-Jumayli M, Brown SL, Chetty IJ, Extermann M, Movsas B. The biological process of aging and the impact of ionizing radiation. Semin Radiat Oncol. (2022) 32:172–8. doi: 10.1016/j.semradonc.2021.11.011

  • 19. Wan Q, Tang J, Han Y, Wang D. Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma. Exp Eye Res. (2018) 166:13–20. doi: 10.1016/j.exer.2017.10.007

  • 20. Lei Y, Takahama Y. XCL1 and XCR1 in the immune system. Microbes Infect. (2012) 14:262–7. doi: 10.1016/j.micinf.2011.10.003

  • 21. Ordway D, Higgins DM, Sanchez-Campillo J, Spencer JS, Henao-Tamayo M, Harton M, et al. XCL1 (lymphotactin) chemokine produced by activated CD8 T cells during the chronic stage of infection with Mycobacterium tuberculosis negatively affects production of IFN-gamma by CD4 T cells and participates in granuloma stability. J Leukoc Biol. (2007) 82:1221–9. doi: 10.1189/jlb.0607426

  • 22. Rosas-Taraco AG, Higgins DM, Sánchez-Campillo J, Lee EJ, Orme IM, González-Juarrero M. Intrapulmonary delivery of XCL1-targeting small interfering RNA in mice chronically infected with Mycobacterium tuberculosis. Am J Respir Cell Mol Biol. (2009) 41:136–45. doi: 10.1165/rcmb.2008-0363OC

  • 23. Klein M, Paul R, Angele B, Popp B, Pfister HW, Koedel U. Protein expression pattern in experimental pneumococcal meningitis. Microbes Infect. (2006) 8:974–83. doi: 10.1016/j.micinf.2005.10.013

  • 24. Dorner BG, Smith HR, French AR, Kim S, Poursine-Laurent J, Beckman DL, et al. Coordinate expression of cytokines and chemokines by NK cells during murine cytomegalovirus infection. J Immunol Baltim. (2004) 172:3119–31. doi: 10.4049/jimmunol.172.5.3119

  • 25. Araki-Sasaki K, Tanaka T, Ebisuno Y, Kanda H, Umemoto E, Hayashi K, et al. Dynamic expression of chemokines and the infiltration of inflammatory cells in the HSV-infected cornea and its associated tissues. Ocul Immunol Inflamm. (2006) 14:257–66. doi: 10.1080/09273940600943581

  • 26. Middel P, Thelen P, Blaschke S, Polzien F, Reich K, Blaschke V, et al. Expression of the T-cell chemoattractant chemokine lymphotactin in Crohn’s disease. Am J Pathol. (2001) 159:1751–61. doi: 10.1016/S0002-9440(10)63022-2

  • 27. Wang C-R, Liu M-F, Huang Y-H, Chen H-C. Up-regulation of XCR1 expression in rheumatoid joints. Rheumatol Oxf Engl. (2004) 43:569–73. doi: 10.1093/rheumatology/keh147

  • 28. Petrek M, Kolek V, Szotkowská J, du Bois RM. CC and C chemokine expression in pulmonary sarcoidosis. Eur Respir J. (2002) 20:1206–12. doi: 10.1183/09031936.02.00289902

Keywords

pediatric sepsis, WGCNA, GEO datasets, KEGG analysis, molecular mechanisms

Citation

Shen Y, Yu D, Yu Z and Zhao X (2025) Identification of the key genes in children with sepsis by WGCNA in multiple GEO datasets. Front. Pediatr. 13:1518908. doi: 10.3389/fped.2025.1518908

Received

29 October 2024

Accepted

28 April 2025

Edited by

Francesca Conti, University of Bologna, Italy

Reviewed by

Angelo Mazza, Papa Giovanni XXIII Hospital, Italy

Rama Shankar, Michigan State University, United States

Rana Hossam Elden, Helwan University, Egypt

*Correspondence: Xue Zhao; Ze Yu
