Abstract
Innate lymphoid cells (ILCs), as the innate counterpart of CD4+ T helper (Th) cells, play crucial roles in maintaining tissue homeostasis. While the ILC subsets and their corresponding Th subsets demonstrate significant similarities in core programming related to effector function and regulatory mechanisms, their principal distinctions, given their innate and adaptive lymphocyte nature, remain largely unknown. In this study, we have employed an integrative analysis of 294 bulk RNA-sequencing results across all ILC and Th subsets, using scRNA-seq algorithms. Consequently, we identify two genesets that predominantly differentiate ILCs from Th cells, as well as three genesets that distinguish various immune responses. Furthermore, through chromatin accessibility analysis, we find that the ILC geneset tends to rely on specific transcriptional regulation at promoter regions compared with the Th geneset. Additionally, we observe that ILCs and Th cells are under differential transcriptional regulation. For example, ILCs are under stronger regulation by multiple transcription factors, including RORα, GATA3, and NF-κB. Otherwise, Th cells are under stronger regulation by AP-1. Thus, our findings suggest that, despite the acknowledged similarities in effector functions between ILC subsets and corresponding Th subsets, the underlying regulatory machineries still exhibit substantial distinctions. These insights provide a comprehensive understanding of the unique roles played by each cell type during immune responses.
Introduction
In recent years, innate lymphoid cells (ILCs) have garnered increasing attention owing to their functional parallels with CD4+ T helper (Th) cells of the adaptive immune system (1–3). Both ILCs and Th cells play pivotal roles by secreting effector cytokines, sharing similarities in their effector functions and regulatory mechanisms. These shared characteristics lead to the categorization of both cell types into distinct subsets based on the expression of signature effector cytokines and master transcription factors (4–7). For instance, type 1 ILCs (ILC1s) and Th1 cells mainly produce IFN-γ and TNF-α, regulated by the master transcription factor T-bet (8, 9). Likewise, ILC2s and Th2 cells predominantly express IL-5 and IL-13, controlled by the transcription factor GATA3 (10–12). Furthermore, ILC3s, encompassing natural killer receptor expressing (NCR+) ILC3s, double-negative (DN) ILC3s, and lymphoid tissue-inducer (LTi) cells, mirror Th17 cells as they primarily secrete IL-22 and IL-17, under the regulation of the transcription factor RORγt (13, 14).
Although the ILC subsets and their corresponding Th subsets share functional and regulatory similarities, it remains imperative to acknowledge their fundamental differences as innate and adaptive lymphocytes, respectively. One notable distinction lies in the role of T-cell receptor (TCR) signaling, which is crucial for the activation, differentiation, and effector functions of Th cells (15). TCR signaling triggers downstream transcription factors like AP-1, NF-AT, and NF-κB (16). In contrast, ILCs, lacking TCR expression, rely on cytokines, neuropeptides, eicosanoids, and other environmental signal for activation (17–21). Additionally, ILCs are characterized by their tissue residency, meaning that their activation and functions are regulated by local environmental cues and they exert crucial regulatory roles in maintaining tissue homeostasis (22, 23). Th cells, however, are primarily circulatory, undergoing initially activated and differentiated in secondary lymphoid organs before migrating to exert effector functions in peripheral tissues (24). This distinction underscores that despite parallels, there exist key functional and regulatory differences between ILCs and Th cells intrinsic to their identities as innate versus adaptive lymphocytes.
The transcriptomic differences between ILC subsets and their corresponding Th cells should, to varying degrees, reflect their functional and regulatory differences. Previous studies have attempted transcriptional comparison between these cell types (25–27). For instance, in a study examining Nippostrongylus brasiliensis infection, lung ILC2s and Th2 cells isolated on day 14 were subjected to bulk RNA-sequencing (RNA-seq) analysis. The resulting differentially expressed genes between the two cell types were indeed substantial (25). However, the core programing of lung ILC2 and Th2 cells, including cell-surface receptors, cytokines, and transcription factors, exhibited significant shared properties (26, 28). Furthermore, other studies have demonstrated that differentiated Th2 cells and memory Th2 cells can produce effector cytokines independently of TCR signaling, similar to innate lymphocytes (29, 30). These findings may partially explain the resemblance between lung Th2 cells and ILC2s after N. brasiliensis infection. Nevertheless, it remains unclear whether all ILC subsets and their corresponding Th cells exhibit such comparable features under different conditions. Moreover, ILCs primarily reside in local tissues, where they participate in maintaining tissue homeostasis. Conversely, Th cells typically undergo activation and differentiation in secondary lymphoid organs, specializing in cytokine production to facilitate immune responses. Therefore, further investigation into the fundamental difference between ILCs and Th cells is still imperative.
Results
Integrated transcriptome analysis uncovers distinct gene expression signatures between ILC and Th subsets
ILCs and Th cells exhibited parallel effector functions in innate and adaptive immunity, respectively (31). However, the transcriptomic differences between these cell types had remained largely unclear. To comprehensively compare their transcriptomes and minimize variations from a single data source, we performed an integrative analysis leveraging publicly available bulk RNA-sequencing (RNA-seq) datasets. In total, we collected bulk RNA-seq datasets from 52 published studies, comprising 294 samples, including 21 ILC1, 64 ILC2, 41 ILC3, 61 Th1 cell, 49 Th2 cell, and 58 Th17 cell samples (Figure 1A, Supplementary Table 1). Subsequently, we applied a series of data processing steps to the bulk RNA-seq datasets, including normalization of data, filtration of genes with low expression, and removal of batch effect (Figure 1B, Supplementary Figure 1A). Through these steps, we identified 8,393 genes with significantly high expression levels (transcripts per million or TPM >10) in at least one ILC or Th subset (expressed in more than 80% of the samples for each respective subset) (Figure 1C, Supplementary Figure 1B). Following the removal of batch effects, the RNA-seq samples underwent UMAP classification, a commonly used algorithm frequently used for single-cell RNA-seq (scRNA-seq) analysis, based on the gene expression profiles of the samples (Figure 1D, Supplementary Figure 1C). The UMAP analysis accurately assigned the samples to their respective cell type categories, indicating that the gene expression profiles effectively captured the cellular heterogeneity within the dataset. To further validate the accuracy of the RNA-seq dataset processing, we examined the expression of expected signature genes in ILC and Th subsets. Specifically, all three ILC subsets exhibited enriched expression of the ILC marker Kit, whereas the three Th subsets showed specific expression of the T-cell marker Cd3e (Figure 1E). Moreover, the master transcriptional factors Tbx21, Gata3, and Rorc, associated with different immune cell types, were accurately and highly expressed by their respective ILC or Th subsets (Figure 1E).
Figure 1
The successful classification allowed us to further investigate the distinct transcriptional features present in each ILC and Th subset by employing algorithms specifically designed for scRNA-seq analysis. Despite sharing similarities in effector functions, we anticipated significant transcriptional differences between ILC subsets and their corresponding Th subsets, reflecting their inherent differentiation as innate and adaptive immune cells, respectively. To elucidate such transcriptional differences, we conducted paired transcriptome comparisons between ILC subsets and their parallel Th subsets (ILC1 versus Th1, ILC2 versus Th2, and ILC3 versus Th17). This analysis revealed 258 ILC-specific genes and 434 Th-specific genes, referred to as the ILC geneset and Th geneset, respectively (Figure 1F, Supplementary Table 2). To further verify the reliability of the geneset and eliminate the interference of the tissue environment, we conducted a comparison of the expression levels of the ILC geneset and the Th geneset between ILC and Th subsets originating from the same source (25, 32, 33). As we anticipated, most genes in the ILC geneset exhibit a higher expression level in ILC subsets, whereas most genes in the Th geneset exhibit a higher expression level in Th subsets from the same source (Supplementary Figure 2). Overall, the ILC geneset and Th geneset represent fundamental distinctions between ILC and Th cells and are minimally affected by environmental factors.
The different subsets of ILCs, along with their corresponding Th subsets, play distinct roles in immune responses. Therefore, we next conducted a screening process to identify genes that underlay specific immune responses, exhibiting conserved, subset-specific expression patterns in individual ILC subsets and their corresponding Th counterparts, aiming to identify genes underlying specific immune responses. As a result, we defined 167 genes as the type 1 geneset, which included Ifng and Tbx21, specifically expressed in both ILC1s and Th1 cells. We also defined 98 genes as the type 2 geneset, such as Il4, Il5, Il13, and Gata3, which were preferentially upregulated in both ILC2s and Th2 cells. Additionally, we defined 148 genes as the type 3 geneset, like Il17a, Il17f, Il22, and Rorc, exhibiting significant increases in both ILC3s and Th17 cells (Figure 1G, Supplementary Table 3).
Collectively, by conducting an integrated analysis of 294 bulk RNA-seq datasets across ILC and Th subsets using scRNA-seq algorithms, we have successfully identified genesets that highlight the key transcriptional differences between ILCs and Th cells, as well as the variations in immune response programs.
GO enrichment analysis reveals fundamental functional and regulatory disparities across ILC and Th subsets
These genesets identified in our analysis provided valuable insights into the fundamental transcriptional characteristics underlying the functional properties of the ILC and Th subsets. To further elucidate the functional differences between ILCs and Th cells, as well as between distinct immune response programs, we performed gene ontology (GO) enrichment analysis on these genesets. The analysis revealed the top 100 enriched GO terms in the ILC and Th genesets, highlighting the significant overrepresentation of pathways related to “lymphocyte activation and chemotaxis” in both ILCs and Th cells. This finding underscored the critical role of these processes in shaping the functional properties of ILCs and Th cells in the immune system (Figure 2A, Supplementary Tables 4, 5). To validate this observation, we examined the expression profile of representative genes within these GO terms across different ILC and Th subsets (Supplementary Figure 3A). Additionally, GO terms associated with pathways of “cell migration”, “epithelium development”, “mesenchyme development”, “blood vessel development”, and “nervous system process” were specifically enriched in the ILC geneset (Figure 2A, Supplementary Table 4), consistent with the known role of ILCs as tissue-resident immune cells involved in maintaining tissue homeostasis (22, 23). Specifically, GO terms related to “cytokine production”, “MAPK-ERK pathway” and “SMAD pathway” were also enriched in the ILC geneset, suggesting distinct regulation machineries underlying the immune effects of ILCs compared with Th cells (Figure 2A). In contrast, GO terms associated with “cell cycle” were highly enriched in the Th geneset, indicating fundamental disparities between Th cells and ILCs in terms of proliferation (Figure 2A, Supplementary Table 5). This observation aligned with the knowledge that T cells underwent clonal expansion during activation and differentiation, resulting in a significant increase in cell number (34). Furthermore, the expression profiles of representative genes within the specifically enriched GO terms in the ILC and Th genesets confirmed the functional distinctions between ILCs and Th cells (Supplementary Figures 3B–D).
Figure 2
In addition, we examined the enriched GO terms in the three immune response-related genesets. Although there were discernible differences between these immune response programs, it was noted that GO terms associated with cytokine production, defense response, and inflammatory response were consistently enriched across all three genesets (Figure 2B, Supplementary Figures 3E–G, Supplementary Tables 6–8). This suggested a similarity in their helper-like functionalities and their contribution to immune responses.
Together, the combined GO enrichment analysis of the predefined ILC and Th genesets, as well as the immune response-related genesets, provides a comprehensive understanding of the functional and regulatory similarities and differences between the different ILC and Th subsets.
Expression-concordant opening chromatin regions associated with the ILC geneset tend to distribute in close proximity to the transcription start sites
Gene expression levels often exhibited correlation with the accessibility of corresponding chromatin loci (35, 36). To further understand the transcriptional disparities between ILCs and Th cells, we compared the chromatin accessibility across the ILC and Th genesets using publicly available sequencing of DNase I hypersensitive sites (DNase-seq) data (GSE172358) (37) (Figure 3A). Through this analysis, we identified 3,292 opening chromatin regions (OCRs) across the ILC genesets and 4,293 OCRs across the Th geneset (within 50 kb to the transcriptional start site or TSS). The accessibility of these OCRs was then compared between ILCs and Th cells. OCRs that exhibited accessibility changes consistent with the expression changes of their associated genes when comparing ILCs and Th cells were referred to as expression-concordant OCRs (Figures 3A, B). Conversely, OCRs showing discord accessibility and expression changes between ILC and Th subsets were categorized as expression-non-concordant OCRs. As a result, we identified 1,022 concordant OCRs across the ILC genesets, which we termed as ILC concordant OCRs (Figure 3C). These OCRs corresponded to majority (79.5%) of the genes in the ILC geneset (Figure 3D). Additionally, we identified 2,270 ILC non-concordant OCRs, encompassing 94.2% of the genes in the ILC geneset (Supplementary Figures 4A, B). Similarly, we discovered 1,559 concordant OCRs across the Th genesets, named as Th concordant OCRs (Figure 3E). These OCRs accounted for 72.4% in the genes in the Th geneset (Figure 3F). We also identified 2,734 Th non-concordant OCRs, corresponding to 92.6% of the genes in the Th geneset (Supplementary Figure 4C, D). Similar to RNA-seq, Th subsets for DNase-seq are differentiated in vitro whereas ILC subsets are isolated in vivo. For eliminating the interference of the tissue environment and differentiation method, we validated chromatin accessibility of ILC concordant OCRs and Th concordant OCRs in ATAC-seq data of ILC and Th subsets in vivo (25). As we anticipated, most of ILC concordant OCRs are specifically opened in ILC subsets, whereas most of Th concordant OCRs are specifically opened in Th subsets in vivo (Supplementary Figure 5). This confirmed that the specific regulatory regions of ILC geneset and Th geneset are not affected by environmental factors.
Figure 3
Furthermore, we performed a detailed characterization of the ILC and Th concordant and non-concordant OCRs associated with the ILC and Th genesets. Notably, we observed that the peak widths of the ILC concordant OCRs in all ILC subsets were broader compared with the peak widths of the Th concordant OCRs in the Th subsets (Figures 3C, E). To statistically confirm this difference, we quantified the peak widths of the ILC and Th concordant and non-concordant OCRs (Figure 3G). The ILC concordant OCRs exhibited a significant increase in width compared with the Th concordant OCRs (Figure 3G). Conversely, the width of the ILC non-concordant OCRs showed some variation and even reduction when compared with the width of the Th non-concordant OCRs (Supplementary Figure 4E). Previous studies had suggested that wider OCRs might result from the merging of multiple accessible regions and were more likely to be located in the promoter and super-enhancer regions of the associated genes (38). Therefore, we also investigated the genomic distribution of these ILC and Th concordant and non-concordant OCRs. Consistent with the previous studies (39–41), ILC concordant OCRs demonstrated a preferential distribution within 1 kilobase pair (kb) around TSSs compared with the Th concordant OCRs (Figure 3H). However, the distribution of ILC non-concordant OCRs did not show such a tendency (Supplementary Figure 4F). Overall, these findings suggest that the transcriptional regulation of the ILC geneset tends to heavily rely on promoter regions compared with the transcriptional regulation of the Th geneset.
Different transcription factors are involved in distinguishing the functionalities of ILCs from Th cells
The observed differences in chromatin landscapes between the ILC and Th genesets suggested that they might be regulated by distinct transcriptional control mechanisms mediated by unique regulators. To explore this further, we conducted an analysis to identify enriched transcription factor binding motifs within the ILC and Th concordant OCRs. Indeed, we found that the ILC concordant OCRs displayed enriched binding motifs for transcription factors including RORα, GATA3, GABPA, and c-Rel (Figure 4A), whereas differentially enriched binding motifs were observed for transcription factors such as BATF, AP-1, and ELK4 in the Th concordant OCRs (Figure 4A). Subsequently, we considered the average expression levels of these predicted transcription factors across ILC or Th subsets, providing further evidence of their potential roles in driving the divergence of gene expression programs between ILCs and Th cells (Figure 4B). Importantly, the motifs for BATF and AP-1, both belonging to the bZIP family, were notably enriched in the Th concordant OCRs (Figures 4A, B). This observation aligned with the notion that these transcription factors could form the AP-1-BATF transcriptional complex, exerting crucial regulatory functions downstream of TCR signaling to promote the activation and function of Th cells (16, 42). In contrast, the transcriptional regulation of the ILC geneset appeared to rely on a distinct set of transcription factors, primarily belonging to different transcription factor families rather than the bZIP family (Figure 4B). On the other hand, a diverse range of transcription factors were enriched in the expression-non-concordant OCRs in both ILCs and Th cells, indicating their unlikely involvement in regulating the gene expression in the ILC or Th geneset (Supplementary Figures 6A, B).
Figure 4
In our analysis of the ILC concordant OCRs, we identified 10 potential transcription factors with highly enriched binding motifs and significant expression levels, namely, RORα, GATA3, RXRβ, c-REL, REL-B, REL-A, ERRα, ZBTB33, NR4A2, and RUNX3. Notably, RORα, a member of the nuclear receptor family, displayed the highest enrichment in the ILC concordant OCRs. This finding aligned with previous studies that highlighted the crucial roles of RORα in the development, maintenance, and immune activation of ILC2s and ILC3s (43–45). Similarly, we observed significant enrichment of GATA3 binding motifs, supporting the notion that GATA3 played a regulatory role in the development, expansion, and activation of both ILC2s and ILC3s in our previous studies (12, 46, 47). Thus, these results validated the accuracy of our analysis. Furthermore, we observed substantial enrichment of binding motifs for REL-A, REL-B, and c-Rel in the ILC concordant OCRs. Their transcription factors, belonging to the RHD family, formed complexes with the NF-κB subunits (p50 for REL-A and c-Rel, and p52 for REL-B) (48). This suggested a potential involvement of NF-κB in the transcriptional regulation of the ILC geneset. In contrast, in the Th concordant OCRs, all of the top 10 enriched potential transcription factors with significant expression levels belonged to the bZIP family. This highlighted the critical role of AP-1 in regulating T-cell function under TCR signaling (49). Additionally, we discovered substantial enrichment of potential transcription factors from the ETS family in both the ILC and Th concordant OCRs. For example, ETS1 was identified as occupying a significant number of expression-concordant OCRs in both cell types. Based on literature, ETS1 had been reported to play crucial regulatory roles in both the expansion of ILC2s and the activation of Th cells (50, 51). Overall, our analysis identified the top 10 highly enriched potential transcription factors in ILCs or Th cells based on their expression-concordant OCRs, as well as five potential transcription factors that were commonly enriched in both cell types.
Subsequently, we investigated the frequencies of binding motifs associated with these potential transcription factors in the expression-concordant OCRs of ILCs and Th cells. The top 10 potential transcription factors specific to ILCs displayed enhanced regulation of the ILC geneset in ILCs compared with the Th geneset in Th cells, indicating their importance in distinguishing the functionalities of ILC and Th cells (Figure 4C). Moreover, the top 10 Th-specific transcription factors from the bZIP family exhibited significant regulation of the Th geneset specifically in Th cells, whereas their impact on the ILC geneset in ILCs was relatively limited (Figure 4D). This underscored the essential role of AP-1 in distinguishing the functionalities of Th cells from ILCs. Additionally, the five potential transcription factors commonly enriched in the expression-concordant OCRs of ILCs and Th cells demonstrated comparable regulation of the ILC geneset in ILCs and the Th geneset in Th cells (Figure 4E). However, these transcription factors regulated a relatively high frequency of genes within both the ILC and Th genesets, suggesting their critical roles. In contrast, the distribution patterns of these potential transcription factors in the expression-non-concordant OCRs of ILCs and Th cells did not align with the gene expression patterns (Supplementary Figures 6C–E). Collectively, these findings indicate that distinct functionalities of ILCs and Th cells are regulated by different transcription factors.
Similar effector roles of ILCs and Th cells are operated by distinct regulatory machineries
Given the presence of distinct regulatory machineries for the ILC and Th genesets, we further wondered whether the immune response-related genesets, which exhibited similar expression in both ILC subsets and their corresponding Th subsets, were differentially regulated. Remarkably, we found that chromatin regions of signature effector genes associated with different immune responses, such as the Ifng locus in ILC1s and Th1 cells, the Il4, Il5, and Il13 loci in ILC2s and Th2 cells, and the Il17a and Il17f loci in ILC3s and Th17 cells, all displayed variant accessibility between ILC subsets and their corresponding Th subsets (Figure 5A). Therefore, we defined the ILC and Th subset-specific OCRs based on their differential chromatin accessibility between ILC subsets and their corresponding Th subsets (Figure 5B, Supplementary Figure 7A). ILC3- and Th17-specific OCRs displayed exclusive chromatin accessibility in ILC3 and Th17 subsets in vivo, respectively (Supplementary Figure 7B). In addition, ILC2- and Th2-specific OCRs displayed higher accessibility in ILC2 and Th2 subsets in vivo, respectively (Supplementary Figure 7C). Notably, the overall width of ILC subset-specific OCRs exceeded that of the corresponding Th subset-specific OCRs, with the former preferentially located in close proximity to TSSs, suggesting distinct regulation of these genesets between the two cell types (Figures 5C, D, Supplementary Figures 7D, E). Consequently, we conducted an analysis of potential transcription factors inferred from these subset-specific OCRs in ILCs and Th cells. As expected, different sets of potential transcription factors with significant expression were enriched by the subset-specific OCRs in ILC subsets and the corresponding Th subsets, respectively (Figure 5E). Interestingly, these potential transcription factors exhibited considerable consistency with those we inferred from the expression-concordant OCRs to the ILC or Th genesets, indicating the utilization of the same regulatory machinery to govern effector functions in each subset. Notably, the master transcription factors Tbx21, Gata3, and Rorc displayed the highest enrichment by the ILC subset-specific OCRs, underscoring their indispensable regulatory roles in the effector functions of ILC subsets. Additionally, upon assessing the distribution patterns of binding motifs linked to these potential transcription factors in the ILC or Th subset-specific OCRs, we confirmed the preferential binding of potential transcription factors specific to ILC subsets to the ILC subset-specific OCRs, whereas those specific to Th subsets tended to interact with the Th subset-specific OCRs (Figure 5F). Therefore, despite the striking similarity in effector roles between ILC subsets and their corresponding Th subsets, the underlying regulatory mechanisms remain distinctive.
Figure 5
Discussion
ILCs are often regarded as the innate counterparts to Th cells in the adaptive immune system due to their shared functionalities and regulatory mechanisms (3, 31). However, it is important to recognize the fundamental differences between these two cell types as innate and adaptive lymphocytes, respectively. In this study, we have demonstrated the presence of significant distinctions in functionalities and underlying regulatory mechanisms between ILCs and Th cells by conducting integrative transcriptome and chromatin landscape analyses.
Bulk RNA-seq has been widely used as a mature approach for transcriptome analysis for many years. More recently, single-cell transcriptome analysis, or scRNA-seq, has emerged as a highly valuable tool in the field (52, 53). While scRNA-seq offers several advantages over bulk RNA-seq, it also comes with certain limitations (54). One noticeable drawback is the higher cost associated with scRNA-seq, resulting in smaller sample sizes for each cell type. In contrast, bulk RNA-seq is comparatively more cost-effective, and thus researchers can easily access numerous publicly available datasets, particularly for the ILC and Th transcriptomes that are relevant to our study (55). Furthermore, due to technological differences, scRNA-seq typically captures fewer genes compared with bulk RNA-seq (52). In our study, we sought to harness the advantages of both technologies. Therefore, we compiled and processed a collection of 294 publicly available bulk RNA-seq datasets for ILCs and Th cells, which were subsequently analyzed using algorithms designed for scRNA-seq. Through this integrated approach, we successfully identified two genesets specific to all ILCs or Th cells, as well as three genesets specific to different types of immune responses. Although for bulk RNA-seq data, most of Th cell datasets are derived from in vitro differentiated lineages, and most ILCs are isolated from mucosal tissue in vivo, we observed that our defined ILC geneset and Th geneset still represent difference between ILC and Th subsets in vivo, suggesting that these genesets represent fundamental distinctions between ILC and Th cells.
The accuracy and reliability of the genesets we defined are ensured by their generation from a large number of bulk RNA-seq datasets derived from various experiments conducted in different laboratories. This comprehensive approach guarantees that the genesets should accurately capture the fundamental differences between ILCs and Th cells, as well as between different types of immune responses. Consequently, we performed GO enrichment analysis to gain further insights into the functional disparities underlying the transcriptional differences. While there are some shared features between the ILC and Th genesets, they are predominantly enriched in distinct pathways. Consistent with our understanding of T-cell expansion following activation, we observed a preferential enrichment of cell cycle-related pathways in the Th geneset. On the other hand, the ILC geneset exhibited specific enrichment in pathways associated with their role in maintaining tissue homeostasis. Moreover, despite ILC and Th subsets being involved in diverse immune responses, we identified a similarity in pathway enrichment for immune response-related genesets, indicating the existence of certain fundamental similarities in their T helper cell functions through cytokine secretion.
The differential enrichment of pathways in the ILC and Th genesets suggests that the functionalities of these two cell types may be regulated by distinct mechanisms. Notably, the expression-concordant OCRs associated with the ILC geneset are generally broader in width compared with those associated with the Th genesets. ILC concordant regions preferentially localize around TSSs, which correspond to the promoter regions of genes. Interestingly, the OCRs specific to the immune response-related genesets in ILCs also exhibit a similar distribution pattern as compared with distribution of the Th-specific OCRs in Th cells. This characteristic may confer an advantage for multiple transcription factors to bind to the promoter regions in ILCs, thereby facilitating rapid transcription initiation upon cell activation.
We have conducted further analysis on the expression-associated OCRs for the ILC and Th genesets, as well as on the ILC- and Th-specific OCRs for the immune response-related genesets, to infer the potential involvement of transcription factors. It is noteworthy that Th cells exhibit a heightened tendency to specifically utilize AP-1-mediated regulation, which occurs downstream of TCR signaling. In contrast, ILCs do not favor AP-1 and exhibit enhanced utilization of master transcription factors specific to their respective subsets. Additionally, our findings indicate a preferential reliance on NF-κB for regulatory processes in ILCs. Furthermore, despite the similar effector functions of ILC subsets and their corresponding Th subsets, the underlying regulatory mechanisms are also largely distinct, aligning with their innate and adaptive lymphocyte properties, respectively (56). Consequently, our study provides valuable insights into the functional and regulatory differences between ILCs and Th cells, contributing to a comprehensive understanding of their unique roles during immune responses.
Materials and methods
Data acquisition
Raw data for gene expression profiles (RNA-seq) of ILC and Th subsets were retrieved from Sequence Read Archive (SRA); accession numbers are given in Figure 1A and Supplementary Table 1. Raw data for RNA-seq of ILC and Th subsets in vivo were retrieved from SRA under accession numbers SRP060453, SRP069783, and SRP337230. Chromatin accessibility profiles (DNase-seq) of ILC and Th subsets were retrieved from SRP315389, and ATAC-seq of ILC and Th subsets in vivo were retrieved from SRP069783.
RNA-seq data processing
The RNA-seq reads were aligned to the GRCm38/mm10 assembly of mouse genome using HISAT2 (v 2.2.1), and quantified by featureCounts. The gene expression level was counted by featureCounts (v 2.0.3) against mouse GRCm38 genome assembly (v 94). Transcripts per million mapped reads (TPM) were calculated using R package scuttle (v1.4.0). Genes with significantly high expression levels were filtered by TPM >10 in at least one ILC or Th subset. Batch effect removal was performed by the “removeBatchEffect” function in the limma package (v3.50.3) (57). Dimension reduction of the gene expression matrix before or after batch effect removal by the Uniform Manifold Approximation and Projection (UMAP) algorithm was performed by R package Seurat (v4.3.0) (58). “ScaleData”, “RunPCA”, and “RunUMAP” were sequentially executed, and top20 principle components (PCs) were used for UMAP analysis. GSEA was performed by GSEA software (v3.0).
Differential expression analysis and gene ontology enrichment
For differential expression analysis between ILCs and Th cells and among three types of immune responses, the FindConservedMarkers function in the Seurat package was performed (minimum of log2 fold change >0.25 and maximum of P. value <0.01). The FindMarkers function was used in differential expression analysis in each ILC and Th subsets (log2 fold-change >0.25 and P value < 0.01). GO analysis was performed by overrepresentation test with R package clusterProfiler (v4.2.2). Function “compareCluster” was performed for GO enrichment of ILC - and Th-specific genesets (q value < 0.05), and the top 100 GO terms in q value are shown in enrichment map by the “emapplot” function in the “enrichplot” package. Function enrichGO was performed for GO enrichment of immune response-specific genesets.
DNase-seq data processing
DNase-seq reads were mapped to the mm10 genome with Bowtie2 (v2.4.4). Non-redundant reads with MAPQ ≥10 were remained. The remaining reads were sorted using Samtools (v 1.13). DNase-seq peaks (OCRs) were called by MACS2 (v2.2.7.1) with settings of –nomodel –extsize 75, based on a q-value threshold of 0.01. DNase-seq reads in each OCR were quantified using bedtools (v2.27.1). Bins per million mapped reads (BPM) values of OCRs were calculated with R-package scuttle as TPM in RNA-seq data.
Peak annotation and differential OCR analysis
Annotation of OCRs to their neighboring genes were performed by the annotatePeak function in R package ChIPseeker (v1.30.3). OCRs within 50 kb to TSS of neighboring genes were defined as OCRs related to these genes. For OCRs related to genes in the ILC and Th genesets, concordant OCRs were defined as OCRs exhibiting concordant accessibility changes with the expression changes of their related genes; the residual OCRs were defined as non-concordant OCRs. For example, in OCRs related to the ILC geneset, the ILC concordant OCRs were defined as OCRs with a fold change of ILC/Th >1.5, minimum BPM in both repeats of each ILC subset > maximum BPM in both repeats of each Th subset, and BPM of both repeats in all ILC subsets >0.5. The residual OCRs related to the ILC geneset are defined as ILC non-concordant OCRs. Similar criteria are used to define Th concordant and non-concordant OCRs.
For OCRs related to immune response-related genesets, ILC-specific OCRs are defined as OCRs with fold change between ILC subsets and the corresponding Th subsets >1.5, minimum BPM in both repeats of the ILC subset > maximum BPM in both repeats of the corresponding Th subset, and BPM of both repeats in the ILC subset >0.5. A similar criterion is used to define Th concordant OCRs.
Motif enrichment
Transcription factor motif enrichment was performed by the findMotifsGenome function in Homer software (v4.10), using the HOCOMOCO database and the database included in HOMER. For each OCR, transcription factor binding sites were annotated by FIMO software in MEME Suite (v5.0.5), using a p-value threshold of 0.0001.
Data visualization and statistics
Data were analyzed by R version 4.1.2. Bar charts, pie charts, box plots, scatter plots, and histograms were operated by ggplot2 (v3.4.2). Heatmaps in schematic illustration were performed by pheatmap (v1.0.12). Heatmaps of OCRs in DNase-seq were visualized by deepTools (v3.5.1). DNase-Seq tracks were visualized using UCSC Genome Browser. The statistical significance of GO enrichment and motif enrichment were calculated by a two-sided hypergeometric test. P values above 0.05 were considered not significant, *P < 0.05 **P < 0.01, ***P < 0.001.
Statements
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
YZ: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – original draft. LH: Data curation, Formal Analysis, Methodology, Software, Visualization, Writing – original draft. GR: Data curation, Methodology, Writing – original draft. YYZ: Data curation, Methodology, Writing – original draft. XZ: Data curation, Methodology, Writing – original draft. CZ: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. The work was supported by the National Key R&D program of China (2022YFA1103602 and 2022YFA0806400 to CZ), the National Natural Science Foundation of China (32170896, 31770957 and 91842102 to CZ), the Shenzhen Innovation Committee of Science and Technology (JCYJ20220818100401003 to CZ), and the Natural Science Foundation of Beijing (18G10645 to CZ).
Acknowledgments
We thank all the members of the Zhong laboratory for their valuable assistance.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1271879/full#supplementary-material
References
1
SpitsHArtisDColonnaMDiefenbachADi SantoJPEberlGet al. Innate lymphoid cells–a proposal for uniform nomenclature. Nat Rev Immunol (2013) 13:145–9. doi: 10.1038/nri3365
2
ArtisDSpitsH. The biology of innate lymphoid cells. Nature (2015) 517:293–301. doi: 10.1038/nature14189
3
VivierEArtisDColonnaMDiefenbachADi SantoJPEberlGet al. Innate lymphoid cells: 10 years on. Cell (2018) 174:1054–66. doi: 10.1016/j.cell.2018.07.017
4
GuoLJunttilaISPaulWE. Cytokine-induced cytokine production by conventional and innate lymphoid cells. Trends Immunol (2012) 33:598–606. doi: 10.1016/j.it.2012.07.006
5
FangDZhuJ. Dynamic balance between master transcription factors determines the fates and functions of CD4 T cell and innate lymphoid cell subsets. J Exp Med (2017) 214:1861–76. doi: 10.1084/jem.20170494
6
ZhongCZhuJ. Transcriptional regulatory network for the development of innate lymphoid cells. Mediators Inflammation (2015) 2015:264502. doi: 10.1155/2015/264502
7
SpinnerCALazarevicV. Transcriptional regulation of adaptive and innate lymphoid lineage specification. Immunol Rev (2020) 300:65–81. doi: 10.1111/imr.12935
8
FuchsAVermiWLeeJSLonardiSGilfillanSNewberryRDet al. Intraepithelial type 1 innate lymphoid cells are a unique subset of IL-12- and IL-15-responsive IFN-gamma-producing cells. Immunity (2013) 38:769–81. doi: 10.1016/j.immuni.2013.02.010
9
BerninkJHPetersCPMunnekeMte VeldeAAMeijerSLWeijerKet al. Human type 1 innate lymphoid cells accumulate in inflamed mucosal tissues. Nat Immunol (2013) 14:221–9. doi: 10.1038/ni.2534
10
HoylerTKloseCSSouabniATurqueti-NevesAPfeiferDRawlinsELet al. The transcription factor GATA-3 controls cell fate and maintenance of type 2 innate lymphoid cells. Immunity (2012) 37:634–48. doi: 10.1016/j.immuni.2012.06.020
11
LiangHEReinhardtRLBandoJKSullivanBMHoICLocksleyRM. Divergent expression patterns of IL-4 and IL-13 define unique functions in allergic immunity. Nat Immunol (2011) 13:58–66. doi: 10.1038/ni.2182
12
YagiRZhongCNorthrupDLYuFBouladouxNSpencerSet al. The transcription factor GATA3 is critical for the development of all IL-7Rα-expressing innate lymphoid cells. Immunity (2014) 40:378–88. doi: 10.1016/j.immuni.2014.01.012
13
CicciaFGugginoGRizzoASaievaLPeraltaSGiardinaAet al. Type 3 innate lymphoid cells producing IL-17 and IL-22 are expanded in the gut, in the peripheral blood, synovial fluid and bone marrow of patients with ankylosing spondylitis. Ann Rheum Dis (2015) 74:1739–47. doi: 10.1136/annrheumdis-2014-206323
14
SawaSCherrierMLochnerMSatoh-TakayamaNFehlingHJLangaFet al. Lineage relationship analysis of RORgammat+ innate lymphoid cells. Science (2010) 330:665–9. doi: 10.1126/science.1194597
15
LiMORudenskyAY. T cell receptor signalling in the control of regulatory T cell differentiation and function. Nat Rev Immunol (2016) 16:220–33. doi: 10.1038/nri.2016.26
16
GaudGLesourneRLovePE. Regulatory mechanisms in T cell receptor signalling. Nat Rev Immunol (2018) 18:485–97. doi: 10.1038/s41577-018-0020-8
17
McGintyJWvon MoltkeJ. A three course menu for ILC and bystander T cell activation. Curr Opin Immunol (2020) 62:15–21. doi: 10.1016/j.coi.2019.11.005
18
KloseCSNMahlakõivTMoellerJBRankinLCFlamarALKabataHet al. The neuropeptide neuromedin U stimulates innate lymphoid cells and type 2 inflammation. Nature (2017) 549:282–6. doi: 10.1038/nature23676
19
XueLSalimiMPanseIMjösbergJMMcKenzieANSpitsHet al. Prostaglandin D2 activates group 2 innate lymphoid cells through chemoattractant receptor-homologous molecule expressed on TH2 cells. J Allergy Clin Immunol (2014) 133:1184–94. doi: 10.1016/j.jaci.2013.10.056
20
WuDHuLHanMDengYZhangYRenGet al. PD-1 signaling facilitates activation of lymphoid tissue inducer cells by restraining fatty acid oxidation. Nat Metab (2022) 4:867–82. doi: 10.1038/s42255-022-00595-9
21
WuDLiZZhangYZhangYRenGZengYet al. Proline uptake promotes activation of lymphoid tissue inducer cells to maintain gut homeostasis. Nat Metab (2023) 5:1953–68. doi: 10.1038/s42255-023-00908-6
22
GasteigerGFanXDikiySLeeSYRudenskyAY. Tissue residency of innate lymphoid cells in lymphoid and nonlymphoid organs. Science (2015) 350:981–5. doi: 10.1126/science.aac9593
23
MeiningerICarrascoARaoASoiniTKokkinouEMjosbergJ. Tissue-specific features of innate lymphoid cells. Trends Immunol (2020) 41:902–17. doi: 10.1016/j.it.2020.08.009
24
SchenkelJMPaukenKE. Localization, tissue biology and T cell state - implications for cancer immunotherapy. Nat Rev Immunol (2023) 23:807–23. doi: 10.1038/s41577-023-00884-8
25
ShihHYSciumeGMikamiYGuoLSunHWBrooksSRet al. Developmental acquisition of regulomes underlies innate lymphoid cell functionality. Cell (2016) 165:1120–33. doi: 10.1016/j.cell.2016.04.029
26
NagashimaHMahlakoivTShihHYDavisFPMeylanFHuangYet al. Neuropeptide CGRP limits group 2 innate lymphoid cell responses and constrains type 2 inflammation. Immunity (2019) 51:682–695 e6. doi: 10.1016/j.immuni.2019.06.009
27
KouesOICollinsPLCellaMRobinetteMLPorterSIPyfromSCet al. Distinct gene regulatory pathways for human innate versus adaptive lymphoid cells. Cell (2016) 165:1134–46. doi: 10.1016/j.cell.2016.04.014
28
Van DykenSJNussbaumJCLeeJMolofskyABLiangHEPollackJLet al. A tissue checkpoint regulates type 2 immunity. Nat Immunol (2016) 17:1381–7. doi: 10.1038/ni.3582
29
RahimiRANepalKCetinbasMSadreyevRILusterAD. Distinct functions of tissue-resident and circulating memory Th2 cells in allergic airway disease. J Exp Med (2020) 217(9):e20190865. doi: 10.1164/ajrccm-conference.2020.201.1_MeetingAbstracts.A1303
30
SteinfelderSRauschSMichaelDKuhlAAHartmannS. Intestinal helminth infection induces highly functional resident memory CD4(+) T cells in mice. Eur J Immunol (2017) 47:353–63. doi: 10.1002/eji.201646575
31
BandoJKColonnaM. Innate lymphoid cell function in the context of adaptive immunity. Nat Immunol (2016) 17:783–9. doi: 10.1038/ni.3484
32
SanoTHuangWHallJAYangYChenAGavzySJet al. An IL-23R/IL-22 circuit regulates epithelial serum amyloid A to promote local effector th17 responses. Cell (2015) 163:381–93. doi: 10.1016/j.cell.2015.08.061
33
LyuMSuzukiHKangLGaspalFZhouWGocJet al. ILC3s select microbiota-specific regulatory T cells to establish tolerance in the gut. Nature (2022) 610:744–51. doi: 10.1038/s41586-022-05141-x
34
MayerAZhangYPerelsonASWingreenNS. Regulation of T cell expansion by antigen presentation dynamics. Proc Natl Acad Sci U.S.A. (2019) 116:5914–9. doi: 10.1073/pnas.1812800116.
35
CaoJCusanovichDARamaniVAghamirzaieDPlinerHAHillAJet al. Joint profiling of chromatin accessibility and gene expression in thousands of single cells. Science (2018) 361:1380–5. doi: 10.1126/science.aau0730
36
KlemmSLShiponyZGreenleafWJ. Chromatin accessibility and the regulatory epigenome. Nat Rev Genet (2019) 20:207–20. doi: 10.1038/s41576-018-0089-8
37
FangDCuiKCaoYZhengMKawabeTHuGet al. Differential regulation of transcription factor T-bet induction during NK cell development and T helper-1 cell differentiation. Immunity (2022) 55:639–655 e7. doi: 10.1016/j.immuni.2022.03.005
38
PottSLiebJD. What are super-enhancers? Nat Genet (2015) 47:8–12. doi: 10.1038/ng.3167
39
BoyleAPDavisSShulhaHPMeltzerPMarguliesEHWengZet al. High-resolution mapping and characterization of open chromatin across the genome. Cell (2008) 132:311–22. doi: 10.1016/j.cell.2007.12.014
40
KellisMWoldBSnyderMPBernsteinBEKundajeAMarinovGKet al. Defining functional DNA elements in the human genome. Proc Natl Acad Sci U.S.A. (2014) 111:6131–8. doi: 10.1073/pnas.1318948111
41
HuLZhaoXLiPZengYZhangYShenYet al. Proximal and distal regions of pathogenic th17 related chromatin loci are sequentially accessible during pathogenicity of th17. Front Immunol (2022) 13:864314. doi: 10.3389/fimmu.2022.864314
42
LiPSpolskiRLiaoWWangLMurphyTLMurphyKMet al. BATF-JUN is critical for IRF4-mediated transcription in T cells. Nature (2012) 490:543–6. doi: 10.1038/nature11530
43
HalimTYMacLarenARomanishMTGoldMJMcNagnyKMTakeiF. Retinoic-acid-receptor-related orphan nuclear receptor alpha is required for natural helper cell development and allergic inflammation. Immunity (2012) 37:463–74. doi: 10.1016/j.immuni.2012.06.012
44
FiancetteRFinlayCMWillisCBevingtonSLSoleyJNgSTHet al. Reciprocal transcription factor networks govern tissue-resident ILC3 subset function and identity. Nat Immunol (2021) 22:1245–55. doi: 10.1038/s41590-021-01024-x
45
LoBCCanals HernaezDScottRWHughesMRShinSBUnderhillTMet al. The transcription factor RORα Preserves ILC3 lineage identity and function during chronic intestinal infection. J Immunol (2019) 203:3209–15. doi: 10.4049/jimmunol.1900781
46
ZhongCCuiKWilhelmCHuGMaoKBelkaidYet al. Group 3 innate lymphoid cells continuously require the transcription factor GATA-3 after commitment. Nat Immunol (2016) 17:169–78. doi: 10.1038/ni.3318
47
ZhongCZhengMCuiKMartinsAJHuGLiDet al. Differential expression of the transcription factor GATA3 specifies lineage and functions of innate lymphoid cells. Immunity (2020) 52:83–95 e4. doi: 10.1016/j.immuni.2019.12.001
48
DolcetXLlobetDPallaresJMatias-GuiuX. NF-kB in development and progression of human cancer. Virchows Arch (2005) 446:475–82. doi: 10.1007/s00428-005-1264-9
49
YukawaMJagannathanSVallabhSKartashovAVChenXWeirauchMTet al. AP-1 activity induced by co-stimulation is required for chromatin opening during T cell activation. J Exp Med (2020) 217(1):e20182009. doi: 10.1084/jem.20182009
50
ZookECRamirezKGuoXvan der VoortGSigvardssonMSvenssonECet al. The ETS1 transcription factor is required for the development and cytokine-induced expansion of ILC2. J Exp Med (2016) 213:687–96. doi: 10.1084/jem.20150851
51
ZhongYWalkerSKPritykinYLeslieCSRudenskyAYvan der VeekenJ. Hierarchical regulation of the resting and activated T cell epigenome by major transcription factor families. Nat Immunol (2022) 23:122–34. doi: 10.1038/s41590-021-01086-x
52
ChenGNingBShiT. Single-cell RNA-seq technologies and related computational data analysis. Front Genet (2019) 10:317. doi: 10.3389/fgene.2019.00317
53
SalibaAEWestermannAJGorskiSAVogelJ. Single-cell RNA-seq: advances and future challenges. Nucleic Acids Res (2014) 42:8845–60. doi: 10.1093/nar/gku555
54
SvenssonVVento-TormoRTeichmannSA. Exponential scaling of single-cell RNA-seq in the past decade. Nat Protoc (2018) 13:599–604. doi: 10.1038/nprot.2017.149
55
ThindASMongaIThakurPKKumariPDindhoriaKKrzakMet al. Demystifying emerging bulk RNA-Seq applications: the application and utility of bioinformatic methodology. Brief Bioinform (2021) 22(6):bbab259. doi: 10.1093/bib/bbab259
56
FangDHealyAZhuJ. Differential regulation of lineage-determining transcription factor expression in innate lymphoid cell and adaptive T helper cell subsets. Front Immunol (2022) 13:1081153. doi: 10.3389/fimmu.2022.1081153
57
RitchieMEPhipsonBWuDHuYLawCWShiWet al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res (2015) 43:e47. doi: 10.1093/nar/gkv007
58
HaoYHaoSAndersen-NissenEMauckWM3rdZhengSButlerAet al. Integrated analysis of multimodal single-cell data. Cell (2021) 184:3573–3587 e29. doi: 10.1016/j.cell.2021.04.048
Summary
Keywords
innate lymphoid cells, T helper cells, transcriptome, chromatin accessibility, transcription factor
Citation
Zhang Y, Hu L, Ren G, Zeng Y, Zhao X and Zhong C (2023) Distinct regulatory machineries underlying divergent chromatin landscapes distinguish innate lymphoid cells from T helper cells. Front. Immunol. 14:1271879. doi: 10.3389/fimmu.2023.1271879
Received
03 August 2023
Accepted
13 November 2023
Published
01 December 2023
Volume
14 - 2023
Edited by
Carolina Jancic, National Scientific and Technical Research Council (CONICET), Argentina
Reviewed by
Han-Yu Shih, National Eye Institute (NIH), United States; Jan-Hendrik Schroeder, King’s College London, United Kingdom
Updates
Copyright
© 2023 Zhang, Hu, Ren, Zeng, Zhao and Zhong.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chao Zhong, zhongc@pku.edu.cn
†These authors have contributed equally to this work
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.