Virus infection pattern imprinted and diversified the differentiation of T-cell memory in transcription and function

Introduction Memory T (Tm) cells are a subpopulation of immune cells with great heterogeneity. Part of this diversity came from T cells that were primed with different viruses. Understanding the differences among different viral-specific Tms will help develop new therapeutic strategies for viral infections. Methods In this study, we compared the transcriptome of Tm cells that primed with CMV, EBV and SARS-CoV-2 with single-cell sequencing and studied the similarities and differences in terms of subpopulation composition, activation, metabolism and transcriptional regulation. Results We found that CMV is marked by plentiful cytotoxic Temra cells, while EBV is more abundant in functional Tem cells. More importantly, we found that CD28 and CTLA4 can be used as continuous indicators to interrogate the antiviral ability of T cells. Furthermore, we proposed that REL is a main regulatory factor for CMV-specific T cells producing cytokines and plays an antiviral role. Discussion Our data gives deep insight into molecular characteristics of Tm subsets from different viral infection, which is important to understand T cell immunization. Furthermore, our results provide basic background knowledges for T cell based vaccine development in future.


Introduction
A remarkable feature of adaptive immunity is "immunological memory" (1), which is established during primary antigen encounters and can be persistent without residual antigens.The "memory" confers on the host a rapid, vigorous immune response to reinfection of given pathogens and provides long-term protection (2).Memory T cells are an important component of immunological memory that is highly specific and diverse.It is well known that memory T cells consist of many subpopulations that are highly heterogeneous (3,4), but how these heterogeneous subpopulations of memory T cells are defined and how they differentiate into different phenotypes and diverse functions are not well known.
Very few phenotype markers have been discovered to define the differentiation and heterogenicity of memory T cells.Lanzavecchia first defined "central memory" T cells and "effector memory" T cells by chemokine receptor CCR7 expression (5), although people found that it is not always appropriate in all cases.On the other hand, the location of residential memory T cells from different tissues will influence the phenotype and function (6)(7)(8)(9).For example, memory T cells located in bone marrow have a higher proliferation rate than those in lung tissue (10)(11)(12).Furthermore, the antigen dose, antigen exposure time, and duration of acute or persistent inflammation will also influence the differentiation, phenotype, and function of memory T cells.This heterogeneity of the memory T-cell repertoire is more pronounced when it is primed with different viruses.The extent and frequency of primed antigen determine the phenotype and function of virus-specific memory T cells (13,14).For instance, CMV and EBV are viruses that both belong to Herpesviridae and are classified as chronic persistent infection viruses.However, CMV-specific Tm is remarkable for its particular CD8+ T-cell inflation (15), whereas EBV-specific T cells typically contract with the decrease in viral load.Meanwhile, SARS-CoV-2 can cause acute infection and may induce another T-cell memory differentiation pattern.Previous definitions of memory T cells are more derived from acute influenza infection and chronic viral infection, such as LCMV.However, it is not yet clear what the heterogeneity between the same cell subpopulations induced by different viruses is.What markers can be used to clearly describe the differentiation landscape of memory T cells primed in different viral infections is also not clear.
In this study, we used CMV-, EBV-, and SARS-CoV-2-specific Tm cells as examples to investigate the differences among Tm cells primed with different viral infections.A comprehensive comparison was made on these Tms, from subpopulation composition, cell activation, and metabolism to specific gene expression and transcriptional regulation.We found that CMV and EBV are different in subpopulation composition and that the same subpopulation of Tm cells has a viral-specific expression pattern.All these basic differences contribute to the functional diversity of different viral-specific Tms.

Sample declaration and treatment
Blood samples used for EBV single-cell data analysis were obtained from one healthy adult donor with plentiful IFNg + T cells after EBV peptide pool stimulation.It was approved by the Guangzhou Blood Center, China.The use of the blood samples was approved by the Health Commission of Guangdong Province and Guangzhou Institute of Respiratory Disease.Blood samples used in CMV and SARS-CoV-2 single-cell data analysis belong to a sample set of our previous study (16), which contained eight convalescents infected with CMV and SARS-CoV-2.Another five convalescents samples were also collected for further validation experiments.The convalescents in this study were defined as people that recovered (diagnosed with clinical nucleic acid testing and CT results) for more than 2 months.All information of samples used in this study is summarized in Supplementary Table 1.The study is approved by the Ethics Commission of the First Affiliated Hospital of Guangzhou Medical University (No. 2020-51).The signed consent forms from all the participants were obtained.

Peptide pool construction
The peptide pool construction was consistent with our previous published methods (15,16).SARS-CoV-2-specific peptides were designed and synthesized against the following proteins: the entire antigen region of spike (S), nucleocapsid (N), membrane (M), and envelope (E).A total of 487 15-mer SARS-CoV-2 epitopes (overlapping by 11 amino acids) were generated with an online peptide generator (Peptide 2.0) and were synthesized by GL Biochem Corporation (Shanghai) with a purity of over 80%.
There were 383 15-mer (overlapping by 11 amino acids) CMVspecific peptides designed and synthesized to span the entire proteins of the HCMV pp65, IE-1, and IE-2.The EBV peptide pool contains 1,686 peptides that cover the EBV protein of BZLF1, BRLF1, BMRF1, BMLF1, EBNA1, EBNA2, EBNA3A, EBNA3B, EBNA3C, LMP1, LMP2A, and LMP2B.The length of each EBVspecific peptide was also 15-mer with 11 overlapping residues between neighboring peptides.The design and generation of these peptides were all with the online peptide generator (Peptide 2.0) and synthesized by GL Biochem Corporation (Shanghai) with a purity of >80%.
Each peptide was dissolved in DMSO with a concentration of 20 mM to form a stock.Full information of peptides used for EBV-, CMV-, and SARS-CoV-2-specific T-cell stimulation can be found in Supplementary Table 2.

scRNA-seq library construction and sequencing
For EBV samples, single cells were encapsulated in droplets using 10× Genomics GemCode Technology and processed according to the manufacturer's instructions.In brief, every single cell and every transcript were barcoded with a sample index and unique molecular identifier.Libraries were generated and sequenced from cDNAs using the Chromium Next GEM Single Cell 5′ Reagent Kits v1.1.The Single Cell 5′ Protocol was used to produce Illumina-ready sequencing libraries.
For CMV and SARS-CoV-2 samples, sample tag labeling was performed using a BD Human Single-Cell Multiplexing Kit (Cat.No. 633781) and processed on a BD Rhapsody ™ Cartridge Reagent Kit (Cat.No. 633731) following the user's manual.Single-cell mRNA, AbSeq barcodes, and SampleTag barcodes were all captured by BD Rhapsody beads coated with poly(T) oligonucleotide, with a unique cell barcode and molecular barcode on each bead.Single-cell cDNA synthesis and library amplification were performed following the manufacturer's protocol by using a BD Rhapsody ™ cDNA Kit (Cat.No. 633773) and BD Rhapsody ™ WTA Amplification Kit (Cat.No. 633801).The cDNA library was constructed through two rounds of PCR for whole-transcriptome analysis and single-cell multiplexing analysis.Finally, eight cycles of PCR were performed for all elements following the manufacturer's instructions.All PCR libraries were quantified using an Agilent Bioanalyzer 2100 and pooled.Pooled libraries were sequenced on NovaSeq with PE150.

scRNA-seq data alignment and analysis
The scRNA-seq of EBV-stimulated T cells was aligned to reference GRCH38 and quantified with Cell Ranger software.The scRNA-seq of CMV-and SARS-CoV-2-stimulated T cells was also aligned to reference GRCH38 but with the BD Rhapsody WTA pipeline in version 1.9.1.Both data from 10x and BD were further analyzed with the Seurat R package.Cells with more than 25% mitochondrial RNA expression were filtered out.Additionally, cells with fewer than 500 RNA molecules or fewer than 250 expressed features were discarded.Because the ranges of the RNA molecule count and feature count were different between 10x and BD sequencing, we used 1.5-fold IQR as the threshold to remove cells with outlier values.

Batch effect correction
To integrate the data sequenced with different platforms, in addition to SCT normalization, Harmony was also performed to correct the batch effect.As default, integration was performed with the top 3,000 highly variable genes.Furthermore, because the total read count is quite different between 10x and BD, the gene expression matrix was normalized with the iSMNN method (17), and the corrected matrix was used for further analysis.

Dimensionality reduction and cluster annotation
Dimensionality reduction was performed with Harmony after SCT transformation, and cell clustering was carried out with 50 PCAs and with a resolution of 0.5.Then, UMAP was conducted on the Harmony result.To annotated the cell clusters, cell type-related markers were collected from different literatures (18-21) and R&D system websites (https://www.rndsystems.com/cn/resources/cellmarkers/immune-cells/helper-t-cells).Th0 was defined as CD4 + T cells with CCR7 and SELL expression to keep consistency with the naming of other defined T helper cells, such as Th1 and Th17.

Validation datasets
In case the results from a single donor of EBV is individualspecific, a validation dataset was collected from Zendo6952657.To keep the same status with our own sample, donor4 with a CMVand EBV-negative status was selected as convalescent.In this dataset, viral-specific memory T cells were captured with the peptide-MHC method.The same data processing pipeline and cell-type gene markers were used as our own data in this validation dataset.

Differential gene expression analysis and functional annotation
The differentially expressed genes were identified with the Seurat function "FindAllMarkers" or "FindMarkers".The gene functional annotation for given gene sets was performed with the "clusterProfiler" R package.

Transcription factor regulatory network inference
To identify the viral-specific active TF, "SCENIC" was carried out on our single-cell data with the default parameters.The motif information was obtained from "https://resources.aertslab.org/cistarget/databases/homo_sapiens/hg38/refseq_r80/mc9nr/ gene_based/".The correlation method was specified to "spearman."Further network visualization of the inferred TF regulatory network was performed by "Cytoscape".

Correlation analysis
To identify the genes related to the switch between CD28 and CTLA4 expression in T cells, cells with both CD28 and CTLA4 expression were selected.The ratio of CD28 to CTLA4 in each cell was taken as an index to indicate cell activity.Then, Pearson correlation was performed between each gene in our expression matrix (except CD28 and CTLA4).All the positively correlated genes with FDR less than 0.05, which were calculated with the Benjamini and Hochberg method, were used to perform further gene functional annotation analysis.

Pathway score
To quantify the activity of each cell in the NF-kB pathway, first, the genes in this pathway were acquired by the R package "KEGGREST" with KEGGID "hsa04064."Then, the pathway score was calculated with the "AddModuleScore" function in Seurat.The plot was created by RidgePlot according to the added score.

The same subsets of memory CD8+ T cells in acute/chronic viral infection differ greatly at the transcriptome level
To characterize the T-cell repertoires derived from different viral infections, CMV and EBV were used as chronic infection viral infection and SARS-CoV-2 was used as an acute infection.PBMCs were collected from convalescents that had ever been infected with EBV or CMV and SARS-CoV-2.Viral-specific CD4 + and CD8 + T cells were obtained by viral peptide stimulation followed by cell sorting (see Methods for details) (Figure 1A).Then, single-cell sequencing was performed with the 10x platform for EBV and BD Rhapsody for CMV and SARS-CoV-2."Harmony" was used to integrate the data from two different platforms, and the performance was very satisfied (Supplementary Figures 1A, B).Virus-specific CD4 + and CD8 + T cells were separated based on the protein expression levels of CD4 and CD8.
As expected, different subsets of memory CD8 + T cells from three viral infections differed greatly in proportion (Figures 1B, C) and three effector T-cell subsets (cluster0,2,6, Tem), four terminally differentiated effector T-cell subsets (cluster1,5,7,8, Temra), one central memory T-cell subset (cluster3, Tcm), and one stem-like central memory T-cell subset (cluster4, Tscm) were defined according to expression of marker genes that were defined by previous studies (18-21) (Figure 1D).Temra, as a hallmark of CMV-specific T cells (22), accounted for the largest proportion (55.8%) of the CMV T-cell repertoire (Figures 1C, E).There is also a certain proportion of Temra in the EBV-specific T-cell repertoire (20%), although it is not the main group.This result is consistent with previous studies showing that the Temra phenotype is not the unique characteristic of CMV, and other chronic viruses can also elicit Temra with similar differentiation progression (22).The high frequency of Temra in CMV may be due to the "smoldering" character of CMV infection (23), which is reflected in the tiny CMV-specific CD8 + T-cell reactivation and antigen being rapidly extinguished.It made us to speculate that a high level of Temra in CMV may be caused by persistent stimulation, which can also lead to high expression of coinhibitory molecules that are associated with T-cell exhaustion (Supplementary Figure 1C).The difference in proportion of Temra and Tem between CMV and EBV was confirmed in an independent validation dataset.The same cell type criterion determination was performed as our own dataset (Supplementary Figure 1E) and the proportion of Temra in CMV was still dominated, whereas Tem is more in EBV (Supplementary Figure 1D).Additional experiment validation was also conducted in CMV and EBV, as well as SARS-CoV-2 specific T cells in another five convalescents donors.For all samples, the proportion of Temra in CMV was higher than that in EBV (Supplementary Figure 1F).These results indicate that although EBV and CMV are both laten viruses from Herpesviridae, the composition of T-cell repertoires are quite different.
Interestingly, in addition to the difference in the proportion of CD8 subsets, the same subset was also characterized by its own expression pattern in each virus.By taking all the Temra cells together, we found that CMV Temra cells highly expressed MATR3, ZFP36L1, MT-ND5, SLAMF7, and KLF6 than EBV (Figure 1F), within which SLAMF7 is documented to be upregulated in CMVspecific CD8 + T cells and downregulated in EBV-specific cells (24).EBV Temra is higher in FABP5, MIF, and MALAT1.These The landscape of CMV-, EBV-, and SARS-CoV-2-specific CD8+ T cells.differentially expressed genes suggest that although Temra cells can be found in all three viral-specific T-cell repertoires, different viruses have their own characteristics on gene expression pattern.Unlike CMV and EBV, SARS-CoV-2 has few Temra cells (Figures 1C, E).The main group of T cells was Tem in SARS-CoV-2 (54.8%) and EBV (59.2%).However, only 21.6% of CMVspecific CD8 + T cells were Tem cells.As can be noted in Figure 1C, the main Tem subgroups are different among CMV, EBV, and SARS-CoV-2.The major Tem subgroup is cluster 0 for EBV, and it is cluster 2 for CMV and SARS-CoV-2.By taking all the Tems (C0_Tem and C2_Tem and C6_Tem) together and comparing among them, we found that the Tems of CMV specifically overexpressed the chemokine CCL4L2, whereas those of EBV expressed CCL20 (Figure 1G).IRF4 is more active in both CMV and EBV than in SARS-CoV-2, which is consistent with a previous report that IRF4 can promote CD8 + T-cell exhaustion in chronic infection (25).HERC5 is demonstrated to be functional in SARS-CoV-2 infection and interferes with viral protein activity by modulating ISG15 signaling (26).Here, we found that HERC5 is highly expressed in SARS-CoV-2 Tem cells (Figure 1G).EBV Tem cells were peculiarly higher in MIF expression (Figure 1G), which has also been suggested by other studies for its function in EBV infection (27,28).
We found, for CMV, that the main cell type that expressed cytokine-related genes was Temra, which was enriched with CCL3/4, CCL4L2, TNF, IL6ST, IL21R, FASLG, and TNFRSF9 expression (Figure 2A).In contrast to CMV, Temra in EBV exclusively highly expressed XCL2 and TGFB1 whereas SARS-CoV-2 Temra highly expressed CXCL8, IL1RN, IL1B, and CCR5.Our results also showed that SARS-CoV-2 displayed extreme IL-26 and CCR1 gene expression activity in Tscm cells (Figure 2A).CCR1 was reported to be highly expressed in DCs in SARS-CoV-2 patients and is considered a potential therapeutic target clinically (29).Here, we provided further evidence implying that CCR1 is also highly expressed in SARS-CoV-2 Tscm cells.The distinction of functions for different T cells can, to a certain extent, be reflected on the expression of cytokine secretionrelated genes.Taken together, our results showed that CD8 + T cells derived from different viruses can form T-cell subgroups with specific transcription characteristics.However, whether different transcriptome patterns in the Tm of CMV, EBV, and SARS-CoV-2 indicate that they use different metabolism modes or have different functions is less defined.

Metabolism and function are divergent for T cells from different viruses
It is generally considered that T cells undergo glycolysis during the "effector" phase, whereas naïve and memory T cells are inclined to undergo oxidative phosphorylation (30).Therefore, to further study the function and metabolism model of T cells from different viral infection patterns, function-related genes were collected from previous studies (31).In our results, EBV-specific CD8 + T cells were more prone to highly express glycolysis genes.In contrast, oxidative phosphorylation activation gene expression is much stronger in SARS-CoV-2.Unlike the other two viruses, CMV has gene expression on both metabolism pathways but with mild intensity compared with EBV and SARS-CoV-2 (Figure 2B).The cell metabolic state dynamically changes with T-cell function.SARS-CoV-2 and EBV are all enriched with the Tem subgroup; however, the high expression level of oxidative phosphorylation process in SARS-CoV-2 Tem led us to query whether they are still in the effector phase.Indeed, we found that cytotoxicity and general cytokine genes, such as IFNG and TNF, were all expressed at low levels in SARS-CoV-2 (Figure 2B).However, the expression of IFN response genes was highest in SARS-CoV-2 and lowest in EBV (Figure 2B).In addition, the genes in the NF-kB signaling pathway and Jun/Fos pathway are inactive in SARS-CoV-2, which further implies attenuation of effector function (Figure 2B).Additionally, the number of expressed cytokines was lowest in SARS-CoV-2 among the three viruses (Figure 2C).The reduction in cytokines can trigger the cell apoptosis process, and the downregulation of BCL2 and upregulation of BAX can also be seen in SARS-CoV-2 when compared with the other two viruses (Figure 2D).Therefore, all these results taken together, we proposed that the Tem in the SARS-CoV-2 stimulation model is likely undergoing a functional fading and memory phase in convalescents.

CD4 + T cells are more concordant than CD8 + cells among different viral infections
CD4 + T cells are another crucial cell type in antiviral processes.However, unlike that in CD8 + T cells, the population of CD4 + T cells did not present much of a difference among the three viruses (Figures 3A, B, D).CD4 + T cells were classified as Th0, Th1, Th2, Th17, Tfh, or Treg cells based on marker genes (Figure 3C), and we only found some differences in Th17 cells, which showed a larger proportion than the other two viruses (18.6% in EBV, 9% and 8.6% in CMV and SARS-CoV-2) (Figure 3D).Cytotoxic and cytokine genes were hardly expressed in all CD4 + T cells (Figure 3E), which could be because the main cells responding to ex vivo peptide stimulation are CD8 + in CMV and EBV.Although SARS-CoV-2 CD4 + cells have a certain response to peptide stimulation, the production of cytokines was low in most studies.In line with CD8 + T cells, EBV CD4 + T cells show high expression in glycolysis genes, whereas CMV and SARS-CoV-2 are high in oxidative phosphorylation (Figure 3E).Most of the top differentially expressed genes between different viruses were related to metabolism, such as mitochondrial genes and ribosome genes (Supplementary Figure 2).In addition, we found that MIF and FABP5 were exclusively expressed in EBV.FABP5 is reported to be positively related to Th17 differentiation (32); therefore, we suggested that the high proportion of Th17 in EBV may be due to the high expression of FABP5.

CD28 to CTLA4 transition regulates T-cell antiviral function
CTLA4 transcription was immediately initiated when the T cell was activated.Intracellular CTLA4 accumulates on the surface of T cells by externalization and competes with CD28 binding to ligands and negatively regulates T-cell function (33).We studied the expression patterns of CD28 and CTLA4 at single-cell resolution to characterize the dynamic changes in T cells from activation to functional inhibition, which is also called "T-cell exhaustion."In a total of 14,596 CD8 + T cells, only 29% expressed either CD28 or CTLA4 and 2.7% expressed both (Supplementary Figure 3A).CD28 was found to be hardly expressed in CMV when taking all the virus-specific CD8 + T cells into account, but a more detailed distinction shows that some CMVspecific Tcm cells still expressed CD28 (Figure 4A, upper panel).We found that regardless of CMV, EBV, or SARS-CoV-2 infection, cells with further differentiation potential presented relatively high CD28 expression, such as Tcm and Tscm.Temra shows the lowest expression of CD28 in CMV and EBV.In contrast, CTLA4 is extremely highly expressed in CMV Temra cells, which further indicates the mutually exclusive correlation between CD28 and CTLA4 due to the competitive binding of ligands (Figure 4A, lower panel).
We considered cells with only CD28 expression as T cells in the "activation phase," cells with only CTLA4 expression as T cells in the "activity inhibition phase," and cells with both gene expression as the "transformation phase".Interestingly, the cytotoxic genes were highly expressed in the "inhibition phase," which is inconsistent with our expected "function inhibition" (Figure 4B).However, it seems that the expression of cytotoxic genes, such as GZBM and IFNG, is commonly observed in exhausted T cells reported by other studies (34,35).Most of the cells belonging to the "activation phase" in CMV and SARS-CoV-2 are Tcm (Supplementary Figure 3B), which is not the major cell type responsible for cytokine secretion (Figure 2A).Therefore, in CMV and SARS-CoV-2, cytokine and cytotoxic genes cannot be expressed as widely as in EBV (Figure 4B).
In the transformation phase, we hypothesize that the cells undergo a process in which the ratio of CD28/CTLA4 expression is gradually attenuated.To decipher the details of the alteration in molecular function during this process, a correlation analysis was carried out within cells in the transformation phase (see Methods for details).The genes that were positively correlated with the ratio of CD28/CTLA4 were found to be related to the "responding to virus" and "T cell receptor signaling pathway" (Figure 4C), which indicates that a higher CD28/CTLA4 ratio elicits a stronger T-cell response.For further validation, two public scRNA-seq datasets for PBMC samples from hospitalized COVID-19 patients were collected.The same pipeline and criteria were applied, and they all indicated the ratio of CD28/CTLA4 expression related to T-cell function (Supplementary Figure 3C).

TF regulates viral-specific cytokine secretion and cell function
It is generally accepted that transcription factors play a vital role in T-cell activation, reaction, and differentiation (36)(37)(38).In the first part of our results, we denoted that the expression of cytokines is viral specific.To examine the rationale behind this expression heterogeneity, a regulatory network inference analysis was performed with the SCENIC pipeline (39) on our single-cell data (see Methods for details).The analysis takes both gene coexpression and binding motifs into account to recognize the TF targets and takes "regulon" as the unit to estimate the activity of TF in each cell.According to the deduced regulatory results, the 23 viral-specific cytokines of interest (in Figure 2A) and their 76 TFs built up a network with a maximum cytokine degree of 28 for IL6ST and a maximum TF degree of 12 for GABPB1 (Figure 5A; Supplementary Figures 4A-C).In the CMV subnetwork (Supplementary Figure 4A), we found that eight out of nine CMV-specific cytokines were targeted by REL, which indicates its key function in CMV CD8 + Tm cells.REL is a subunit of the NF-kB family, and many studies suggest that the activation of CMV is related to the activity of REL (40, 41).In total, there were 3,100 CD8 + Tm cells denoted as "REL regulon" active in our data according to the AUCell score (with 0.14 as cutoff) (Figure 5B), and most of them were CMV-specific Temra cells (Supplementary Figure 4D), which is in line with cytokine expression in our previous results (Figure 5B).However, the expression of REL is not CMV specific.Tem in EBV also displayed high REL expression but little REL regulon activity (Figure 5C).This result further indicates that it is the 'regulatory event' rather than TF expression that is responsible for cytokine secretion.Furthermore, we also found that the activity of the NF-kB signaling pathway was higher in CMV than in EBV and SARS-CoV-2 (Figure 5D), which further suggests the potential roles of REL in both signal transduction and cytokine secretion regulation in CMV-specific Tm cells.SCENIC identified 182 TF regulons with condition-specific activity (Supplementary Figure 5), such as the Tcm-specific TF "LEF1," EBV-specific TF "IRF8," "RELB," and "HMGN3" and CMV/EBV Temra-specific TF "TGIF1" and "HIVEP3" (Figure 5E).All these TFs resulted in distinguished downstream gene expression and further distinctions in cell function for different viral infections.

Discussion
It is generally agreed that the Tm repertoire primed by different viruses is divergent.However, it is still unclear how and to what extent they are different.Our results point out that in CMV it is mainly the Temra subpopulation that is dominant, whereas EBV and SARS-CoV-2 are the Tem subpopulations that take advantage.Our results also indicate that they are different in cytokine secretion, cell metabolism, and cell activation regulation.In our study, viral-specific Tm cells were activated by the viral peptide pool, which can capture all T cells primed by a given virus under current circumstances and are not limited by MHC restriction.However, the difference caused by different circumstances from different samples is inevitable.Further studies working in the same environmental system are still necessary.
The enrichment of Temra T cells is one of the characters of CMV (42).However, as another persistent virus, EBV comprises Tem rather than Temra cells.Our results suggest that the main functional cells in CMV are those with the Temra phenotype instead of the Tem phenotype.As our results indicated, SARS-CoV-2-specific Tm cells show high gene expression on oxidative processes.This result is reasonable because SARS-CoV-2 is an acute infectious virus, and most effector T cells enter the "memory phase" in convalescence.EBV is a persistent infectious virus, and most of the time, reactivation is asymptomatic, but EBV-specific Tm cells are active in the glycolysis process, which implies the "effector phase" is going through.CMV is quite different from EBV and SARS-CoV-2, not only in the phenotype of Tm but also in the metabolism process, which may be due to the accumulation of viralspecific T cells caused by inflation.
Costimulation is a secondary signal that is required for T-cell activation (43).In contrast, coinhibitors calm down the effective T cells and maintain the equilibrium of the immune system.Chronic and persistent infection results in a high level of coinhibitor expression; however, many studies have found that exhausted T cells expressed a high level of cytokines (34).We speculate that these results indicate that there is a "transition phase," in which T cells have inadequate function and express both costimulators and coinhibitors.Indeed, our results demonstrate that T-cell activation decreases as the ratio of CD28/CTLA-4 expression decreases.This study provides a feasible prospect for antiviral therapy by retrieving costimulator expression in virus-specific T-cell populations in the future.
How cells transmit external stimuli into the nucleus and respond appropriately is another attractive field.By studying the transcription factor regulatory network, we found that the TF REL is more active in CMV than in EBV and SARS-CoV-2.In addition, as a key subunit of NF-kB, high activity of REL can be related with the increased signaling of the NF-kB pathway in CMV.Therefore, we suggest that CMV Tm cells transmit external stimuli through the NF-kB pathway and initiate the transcription of cytokines in the nucleus, such as CCL3, CCL4, and TNF.We also listed several possible TFs for EBV and SARS-CoV-2 Tms, such as IKZF2 in SARS-CoV-2 and IRF8 in EBV.In brief, our study provides comprehensive evidence that Tm primed by different viruses are quite different in phenotype, function, metabolism, transcription profile, and TF-mediated signal transduction.
(A) The pipeline for sample collection, treatment, and single-cell RNAsequencing data preparation.(B, C) The UMAP figure of all CD8 cells from three different viruses; merged together in b and shown separately in c.The total CD8+ cell number for each of viruses was shown above each panel.The color denotes different cell clusters as the legend shows.(D) The mean expression of each gene in each cell cluster is presented in different colors, as shown in the legend color bar.The values were row scaled.The marker genes for different cell types were listed on the left.(E) The proportion of different cell clusters in the T-cell repertoire primed with different viruses is displayed in a stacked bar plot.The different colors represent different cell clusters the same as in figure (B, C).The pattern of the bar indicates the cell types listed in the legend.(F) The top 20 DEGs between CMV and EBV Temra found by the "findMarker' program are shown with the heatmap.Each row represents one gene, and each column represents one cell.The color is the expression value of a given gene in the given cell.(G) The top DEGs among CMV, EBV, and SARS-CoV-2 Tems.The expression for each gene in each cell is present with the violin plot.The red one is cells from CMV-primed, the green one from SARS-CoV-2 primed, and the blue one from EBV-primed.

2
FIGURE 2 Distinct functions of different viral-specific T cells.(A) The cytokines with differential gene expression among different cell clusters.For each dot, the size represents the percentage of cells with non-zero expression of given genes (row) in given cell clusters (column).The color is the average expression as denoted in the legend.The virus-specific cytokines are listed on the left.(B) The expression for the marker genes of different functions is shown in the dot plot.The meaning for size and color is the same as that in figure (A).(C) The number of cytokines with non-zero expression in each cell is plotted with the violin plot for each virus-specific T-cell repertoire.The quantile of the numbers is labeled with the boxplot.(D) The expression of gene BCL2 and BAX in each cell is plotted with the violin plot.The red violin indicates CMV-primed T cells, the blue one for EBVprimed cells, and the orange one for SARS-CoV-2-primed ones.

3
FIGURE 3 The landscape of CMV-, EBV-, and SARS-CoV-2-specific CD4+ T cells.(A, B) The UMAP figure of all CD4 cells from three different viruses, taken together in (B) and shown separately in (C).The color denotes different cell clusters as the legend shows.(C) The mean expression of each gene in each cell cluster is presented in different colors as shown in the legend color bar.The values were row scaled.The marker genes for different cell types were separated with gap rows.(D) The proportion of different cell clusters in the T-cell repertoire primed with different viruses is displayed in a stacked barplot.The different color represents different cell clusters the same as in a-b.The pattern of the bar indicates the cell types listed in the legend.(E) The expression for the marker genes of different functions is shown in the dot plot.The size represents the percentage of cells with nonzero expression in given genes (row) and given cell clusters (column).The color is the average expression, as denoted in the legend.

4
FIGURE 4The ratio of CD28 to CTLA4 related with T-cell activity.(A) The average expression of CD28 and CTLA4 for all cells in given the virus-specific T-cell repertoire is shown in the dot plot.The size is the percentage of cells with non-zero value, and the color is the expression value as denoted in the legend color bars.The heatmap separates the T-cell repertoire into different cell types and with the color represents the average expression in given cell types for each virus.The value is row scaled.(B) Each dot indicates the average expression of a given gene (row) in a given cell cluster (column).The meaning of the dot size is similar with that in (A).(C) The genes positively correlated with the CD28/CTLA-4 ratio was collected for functional enrichment analysis.The top most significantly enriched functions are listed in the barplot.The length of the bar represents the number of genes belonging to given functions.The color is an adjusted p-value.

5 REL
FIGURE 5 REL regulates cytokine expression in CMV-specific T cells.(A) The network constructed with cytokines in Figure 2A and their SCENIC imputed transcription factors.The nodes with different colors indicate CMV-(red), EBV-(green), and SARS-CoV-2-specific (gold) cytokines.Blue nodes are deduced TFs, and the lines represent the regulation relationship between cytokine and TF.(B) The top-left panel shows the REL AUC distribution, which is calculated with SCENIC.The x-axis is the AUC value, and the y-axis is the frequency of cells with a given AUC value.The dashed line labels the cutoff used to defined active regulon.The remaining three panels show the AUC value of each virus-specific T cell with UMAP plot.(C) Similar with that in figure (B), with REL gene expression instead of AUC value.(D) The Ridge plot for the pathway score of the NF-kB signaling pathway.The a-axis is the pathway score, and the y-axis is the density of cells with a given score.(E) The violin plot shows the RSS score (regulon specificity score from SCENIC) (y-axis) for each TF in each virus-specific T-cell cluster (column).