- 1Commissariat à l’Energie Atomique et aux Energies Alternatives (CEA), Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE), Département Médicaments et Technologies pour la Santé, Service d’Ingénierie Moléculaire pour la Santé (SIMoS), Université de Paris-Saclay, Gif-sur-Yvette, France
- 2CEA, Institut de Biologie François Jacob Centre National de Recherche en Gé nomique Humaine, Evry, France
Introduction: This study provides an in-depth analysis of the diversity of the CD4 TCR CDR3β repertoire specific to influenza A (HA) and Epstein-Barr virus (EBNA) epitopes.
Methods: Epitope-specific CD4 T cells from 13 healthy donors were enriched using a short-term culture step, isolated based on activation markers, and sequenced for their TCR CDR3β region using high-throughput sequencing. The frequency of each clonotype was then identified within the complete circulating CD4 T-cell CDR3β repertoire.
Results: For both epitopes, the clonotype distribution was markedly skewed, with a small number of highly expanded clones comprising approximately 60% of the repertoire, alongside numerous low-frequency clonotypes. VJ gene usage and motif preferences differed between the two peptides, highlighting epitope-specific TCR selectivity. The response was predominantly composed of private T-cell clonotypes. The proportion of public clonotypes can increase among donors sharing HLA class II molecules and reveals in HLA-unrelated donors the level of TCR promiscuity.
Discussion: Overall, our data demonstrate that CD4 T-cell responses to these viral epitopes are polyclonal and highly personalized. The modest overlap of clonotypes between donors, coupled with a long tail of low-frequency clones, suggests that the full diversity of the epitope-specific T-cell repertoire is likely broader than previously estimated.
Introduction
B and T lymphocytes play a crucial role in combating various diseases, including bacterial and viral infections and cancer (1, 2). They exert their power by selectively recognizing the antigens, thereby preserving the self-components from deleterious effects. This selectivity is made possible by combining different steps of differentiation and selection of the adaptive immune cells. Antigen recognition occurs through a re-arranged receptor, which is encoded by V D J gene segments that might undergo somatic recombination (3). This unique process ensures that the immune system can respond to an almost limitless variety of pathogens. After a phase of selection by self-antigens in the bone marrow or the thymus, naïve immune cells circulate in the blood and the secondary lymphoid organs. Upon encountering their specific antigen, these cells proliferate and differentiate into effector cells that directly combat pathogens, or into memory cells that provide long-lasting immunity (4). This sophisticated system of diversity generation and selective cell selection enables the adaptive immune system to provide targeted and effective protection against a wide range of diseases.
Owing to the crucial role that the size of the T-cell repertoire plays in generating efficient T cells, T-cell diversity has been assessed using various approaches. Distribution of CDR3 length variants was initially used to evaluate the extent of diversity within Vβ subsets (5). Later, TCR gene amplification and sequencing revealed that there are about 10 million different β chains circulating in the blood (6). More recent studies using high-throughput sequencing indicated similar (7, 8) or greater diversity (9). However, a recent study highlighted the limitation of current methods of estimation and proposed that the size of the T-cell repertoire is likely underestimated (10). The composition and evolution of the TCR repertoire were also analyzed to provide insights into the diversity and dynamics of T cells involved in cancer (11, 12) and infectious diseases (13, 14). In addition to studies on the full T-cell repertoire, deep sequencing has also been used to explore the diversity of antigen-specific T cells (15–18). Since these antigen-specific cells are a small minority among circulating cells, distinguishing them from nonspecific cells remains a significant challenge, but is essential for characterizing their T-cell receptors (TCRs). HLA multimer technology allows tracking and sorting of antigen-specific T cells and can be advantageously combined with sequencing (17, 19). However, this approach has been more successful for the investigation of CD8 cells, rather than CD4 cells (20, 21). The peptide binding register for HLA class II molecules, along with the TCR avidity of CD4 T cells and their frequency, is often suboptimal, resulting in the capture of only a subset of the specific CD4 T cells. (20, 21). Alternatively, antigen-specific CD4 T cells can be characterized after in vitro expansion and cell sorting or cloning (15, 16, 18, 22). These steps advantageously increase the T-cell detectability but alter the original T-cell repertoire as the expansion process might not equally favor all T-cell clones, leading to an overrepresentation of certain clones. Although no appropriate control can fully correct for this bias, it needs to be overcome by appropriate approaches to evaluate the clonal frequency within the circulating T-cell repertoire.
In this study, we employed a three-step approach to examine in depth the CD4 T-cell repertoire of viral epitopes. Our analysis focused on the CDR3β, as the TCR β locus contains more V, D, and J segments than the α locus, which lacks D segments, leading to a larger combinatorial and junctional diversity. First, CDR3β sequences from circulating CD4 T cells were sequenced. Next, peripheral blood mononuclear cells (PBMCs) were cultured with the viral epitope to enrich for epitope-specific T cells, which were subsequently sorted and sequenced. Finally, the sequences of these specific T cells were compared to the entire T-cell repertoire to determine their original frequency. By applying this methodology to study CD4 T-cell responses to influenza A (HA) and Epstein-Barr virus (EBNA) epitopes, we aimed to shed light on the diversity and specificity of the CD4 T-cell response. Our results indicate that each individual of the 13 different donors exhibited unique signatures, with highly diverse repertoires composed of dominant clonotypes alongside a large number of unexpanded clonotypes.
Materials and methods
Peptides
The influenza hemagglutinin (HA) peptide HA306-318 (PRYVKQNTLKLAT) and the Epstein-Barr virus nuclear antigen (EBNA) peptide EBNA515-528 (TSLYNLRRGTALAI) were from ProteoGenix (France).
Peripheral blood mononuclear cell samples and HLA genotyping
Blood samples were obtained from healthy donors at the Etablissement Francçais du Sang (EFS), Rungis, France, after informed written consent was obtained from the donors and in accordance with EFS guidelines approved by the Comiteí d’eíthique et de deíontologie de l’EFS. PBMCs were isolated from total blood using Ficoll/Lymphoprep separation (GE Healthcare) and cryopreserved in liquid nitrogen. A portion of the PBMCs was used to purify CD4 T cells using magnetic separation with CD4 microbeads and LS columns (Miltenyi Biotec). Ten million CD4 T cells were frozen (circulating CD4 T cells). For each donor, a minimum of five million PBMCs were used to extract DNA with the Machery Nagel NucleoSpin Blood QuickPure kit for HLA typing by NGS (DKMS, Berlin, Germany) (Supplementary Table S2).
T-cell expansion and activation-induced cell marker assay
Thawed PBMCs were seeded at 2x106 cells/mL in a 6-well plate in the presence of the peptide HA or EBNA (10 μg/mL peptide) in RPMI 1640 medium supplemented with 5% human AB serum (Sigma-Aldrich), 60 U/mL IL-2 (R&D), 100 U/mL IL-4 (R&D) and 1 μg/mL of anti-CD28 (clone 15E8, Miltenyi Biotec). The medium was changed every 2–3 days with the same medium supplemented with IL-2 and IL-4 and, at day 8, the medium was changed for the same medium without cytokines. At day 10, cells were washed and incubated with and without 10 μg/mL peptide and 24 hours later cells were stained with the following antibodies: anti-CD4-VioBlue (clone REA623, Miltenyi Biotec), anti-CD134-PE/cyanine 7 (clone ACT35, Biolegend) and anti-CD137-APC (clone 4B4-1, Biolegend). CD4+ CD134+ CD137+ cells were sorted on a FACS ARIA III cytometer (Becton Dickinson, San Jose, CA, USA) and frozen as a dry pellet (expanded CD4 T-cells). No Fc-receptor blocking step was applied before staining, as cultures contained negligible Fc-receptor-expressing cells; non-specific binding was monitored using isotype controls.
T-cell receptor repertoire library preparation and sequencing
The sequencing library method was adapted from a published efficient bulk multiplex method (23). Briefly, RNA was extracted from pellets of expanded CD4 cells and circulating CD4 cells using the RNeasy PLUS microkit (Qiagen) or minikit (Qiagen) and quantified with a Qubit 4 Fluorometer (Thermo Fisher Scientific) and RNA HS assay kit (Thermo Fisher Scientific). RNA was reverse transcribed using a TCRB chain constant region reverse primer with the Superscript IV First-Strand Synthesis System (Invitrogen). cDNA was column purified with the Oligo Clean Concentrator kit (Zymo Research). 100 ng (expanded CD4 T cells) or 1000 ng (circulating CD4 T-cells) of TCRB chain cDNA was used as a template for PCR with KAPA2G Fast Multiplex Mix (Roche). Forward TCRB primers were used with a single nested TCRB constant region reverse primer at 0.2 µM per primer. Samples underwent 20 cycles for NGS library preparation for PCR1. For PCR2, 2 µL of PCR1 was added to 18 µL of PCR2 master mix (0.25 µM each i5 and i7 sample barcoding primers) and underwent 20 cycles with the use of Herculase II fusion DNA polymerase (Agilent). PCR2 product was then run on agarose gel (2%) and the desired bands were purified using a GeneJet PCR Purification kit (Thermo Fisher Scientific). Purified libraries were quantified with a Qubit 4 Fluorometer (Thermo Fisher Scientific) and Qubit dsDNA HS assay kit (Thermo Fisher Scientific). Paired-end sequencing was performed on an Illumina Novaseq 6000 for the circulating CD4 T cells and Illumina iSeq 100 for the expanded CD4 T cells with 15% of PhiX control v3 (Illumina).
Bioinformatic analysis
After sequencing quality filtering, the clean data was aligned against the international ImMunoGeneTics library (https://github.com/repseqio/library-imgt/) using the MiXCR v4.2.0 tool (https://github.com/milaboratory/mixcr). The high-quality reads were further assembled into clonotypes, correcting for PCR and sequencing errors. The downstream analysis was performed with MiXCR, the “immunarch” R package and in-house R script (https://github.com/Gaut1993).
Statistical analysis
Data and statistical analyses were done in FlowJo 10 and GraphPad Prism 8.3.0. The corresponding figure legends provide the statistical details of the experiments. Spearman correlation analyses were used for comparisons, and Mann-Whitney or Wilcoxon tests were used for comparisons of unpaired or paired data, respectively.
Graphical illustrations
Certain graphical illustrations were made with BioRender(biorender.com).
Results
Setting up a sequencing approach to have in-depth access to T-cell repertoires specific for two viral epitopes
In order to describe in depth the CD4 T-cell repertoire specific to two viral epitopes, we set up an approach combining in vitro expansion of antigen-specific CD4 T cells with sequencing of the CDR3β region of their T-cell receptor and the entire T-cell repertoire. This approach comprises three different steps (Figure 1). First, CDR3β TCR sequences from circulating CD4 T cells were sequenced. Second, PBMCs of the same donors were cultured for 10 days with each viral epitope to enrich the cell population with epitope-specific T cells. Specific CD4 T cells were then sorted by flow cytometry using the CD134 and CD137 activation markers and their CDR3β TCR region was sequenced. Third, sequences of the specific T-cell repertoire were retrieved from the circulating CD4 T-cell repertoire to obtain their original frequency. We applied this approach to two viral CD4 T-cell epitopes from influenza A (HA306-318 (HA)) and Epstein-Barr (EBNA515-528 (EBNA)) viruses selected from the literature to leverage natural immunity against these common viruses. The HA peptide binds to multiple HLA-DR molecules (24) and participates in the influenza-specific CD4 T-cell response in multiple donors (25) to its HLA-DR promiscuity (24). EBNA515–528 was found to contribute to the T-cell response to Epstein-Barr virus in multiple donors (26). We selected 13 different donors with various HLA typing to gain a broad view of the diversity of the T-cell response to these frequent epitopes. They were retained for the study because they responded to both peptides in IFN-γ ELISPOT, in contrast to other screened donors, who showed no response to at least one of them (not shown).
Figure 1. Experimental set-up to capture the antigenic TCR repertoire. Circulating CD4 T cells were purified from PBMCs of each donor. PBMCs were also cultured with peptide HA 306–318 or EBNA 515-528. After 10 days, cultured cells were stimulated with the appropriate peptide and activated T cells were sorted after 24 hours on CD4+/CD134+/CD137+ expression. CDR3β regions of activated T cells (in green) and of circulating CD4 T cells were sequenced. The frequency of the antigen-specific T-cell repertoire was retrieved from the circulating repertoire.
The circulating CDR3β TCR repertoire is highly diverse and mainly composed of private clonotypes
We analyzed the CDR3β T-cell repertoire of the 13 healthy donors by deep sequencing from 10 million circulating CD4 T cells per donor. Overall 4,643,632 circulating CDR3β clonotypes were retrieved after data filtration, varying between 134,960 and 667,858 clonotypes (459,527 ± 160,469) per donor. The top 100 most frequent clonotypes represented between 2.3% and 24.8% of the repertoire for each donor (Figure 2A). The top 100,000 clonotypes captured 50% of the total repertoire for 12 of the 13 donors. Almost 50% of the repertoire was composed of very low-frequency clones. A higher number of frequent clonotypes dominated the repertoire of donor #1198 compared to the others. We then explored the VJ gene usage association within the circulating repertoire (Figure 2B). We identified a total of 31 V-gene usages and 13 J-gene usages with no statistically significant association. The CDR3β length of the circulating repertoire was also highly diverse, with clonotypes ranging from 6 to 32 amino acids in length, centered around 14–15 amino acids (Figure 2C). More than 75% of the circulating clonotypes were between 13 and 16 CDR3β amino acids in length. We also analyzed the frequency of public TCRβ by overlapping every circulating T-cell repertoire between each pair of the 13 donors (Figure 2C). Public clonotypes were found in every circulating repertoires, but with minimal overlap of approximately 3 to 6% (Figure 2D). It also appears that donors sharing HLA-DRB1 molecules (see Supplementary Table S1) do not present a stronger repertoire overlap than other donors.
Figure 2. Composition of the circulating CD4 TCRβ repertoire in peripheral blood samples of healthy donors. CDR3β regions of circulating CD4 T cells were sequenced. (A) Distribution of CDR3β clonotypes as a function of their relative abundance for each donor. (B) Co-occurrence map of TRBV-TRBJ gene usage. (C) Amino acid length distribution of the TCR CDR3β. (D) Overlap of the 13 healthy donors circulating repertoire (Morisita’s overlap index).
Sequencing of peptide-activated T cells reveals a large population of specific T cells, dominated by highly enriched clonotypes
To enhance the sensitivity of peptide-specific T-cell detection, PBMCs from the 13 donors were cultured with the HA and EBNA peptides for 10 days under culture conditions that were optimized in our previous study (18). This expansion step appeared necessary as CD134 and CD137 expression was inconsistent with freshly isolated PBMCs from two donors showing IFN-γ ELISPOT responses to the HA peptide (not shown). After several washings, cultured cells were stimulated with or without the corresponding peptide and, after 24 hours, expression of activation-induced marker (AIM) (CD4+CD134+CD137+) was assessed by flow cytometry (Figure 3A). All donors exhibited AIM T cells upon peptide stimulation (Figure 3B). The peptide-stimulated cell population was significantly larger than the unstimulated one for both peptides (p<0.0001), the activation magnitude of the HA-specific CD4 T cells being stronger than that of the EBNA peptide.
Figure 3. Antigen-specific CD4 T-cell responses of healthy donors. After 10 days of culture of the PBMCs with peptide HA 306–318 or EBNA 515-528, cells were stimulated with the appropriate peptide or non-stimulated and activated T cells were sorted after 24 hours on CD4+/CD134+/CD137+ expression. (A) FACS plot examples. (B) percentages of activation-induced markers (CD134+ CD137+) CD4+ T cells after stimulation of PBMCs with peptide HA306–318 or EBNA515-528. Statistical comparisons between unstimulated and stimulated groups were performed with the Wilcoxon test (pairwise comparisons). ****p<0.0001.
Afterwards we sequenced the CDR3β TCR repertoire of the sorted CD4+CD134+CD137+ T cells of the 13 healthy donors and evaluated the sequence distribution (Figure 4). A total of 18,144 clonotypes were retrieved for the HA repertoire (between 744 and 2911 clonotypes per donor) and 18,291 for the EBNA repertoire (between 1046 and 2971 clonotypes per donor), indicating a similar response size for both peptides. Clonotype proportion showed that the repertoire was dominated by highly expanded clonotypes (Figure 4A). A single clonotype dominated each repertoire with a high frequency ranging between 3.9% and 21.2% for HA (mean: 11.2% ± 5.0%) and between 1.9% and 45.8% for EBNA (mean: 17.8% ± 15.4%). The top 50 clonotypes accounted for an average of 63% and 60% of the HA- and EBNA-specific T-cell repertoires, respectively, indicating that the remaining clonotypes, while more numerous, are significantly underrepresented. The sequences were then recovered from the entire T-cell repertoire previously characterized for each donor (see above). The cumulative frequency of specific clonotypes within the total repertoire averaged 0.4% for the HA peptide and 0.6% for the EBNA peptide (Figure 4B). This proportion exceeded 1% for one donor with the HA peptide and for two donors with the EBNA peptide. Therefore, the size of the peptide-specific repertoire within the total repertoire appeared significant and comparable for both peptides. A comparative analysis of CDR3β length revealed similarities between the two peptide-specific T-cell repertoires. The length distribution of TCR CDR3β in the peptide-specific repertoire mainly ranged between 13 and 16 amino acids (>10%), 14 amino acids being the most prevalent length (Figure 4C). In contrast to CDR3β length, VJ gene usage of the peptide-specific T-cell repertoires showed different preferences (Figure 4D). Among all functional human Vβ genes, only the following were detected in the HA-specific repertoire: TRBV4-1, TRBV5-1, TRBV6-1, TRBV6-5, TRBV6-6, TRBV7-2, TRBV7-3, TRBV9, TRBV11-3, TRBV12-3, TRBV19, TRBV20-1, TRBV24-1, TRBV25-1, TRBV28, and TRBV30. Of these, TRBV28 was the most prevalent. Regarding the 13 TRBJ genes, TRBJ2–7 and TRBJ1–1 were the most highly expressed, with the strongest association observed between TRBV28 and TRBJ2–7 in the HA-specific repertoire. In the EBNA-specific repertoire, the following TRBV genes were identified: TRBV2, TRBV3-1, TRBV4-1, TRBV7-2, TRBV7-4, TRBV9, TRBV10-1, TRBV10-2, TRBV12-3, TRBV12-4, TRBV18, TRBV20-1, TRBV27, and TRBV29-1. Among these, TRBV2 and TRBV4–1 were the most prominent. Similar to the HA-specific repertoire, all TRBJ genes were present, with TRBJ1–2 and TRBJ2–7 being the most represented. The major VJ association differed from that of HA and consisted of TRBV4–1 and TRBJ2-1. Both peptide-specific repertoires showed strong VJ preferences, in contrast to the whole T-cell repertoire. Finally, we characterized the clonotype motifs of the two CDR3β T-cell repertoires for the 14-amino-acid CDR3, as it was the most represented in both repertoires (Figure 4E). Both motifs exhibited low amino acid diversity per position and shared little similarity, consistent with the strong sequence differences between the two repertoires. The HA motif displayed a higher proportion of hydrophobic amino acids at positions 8 and 9 than the rest of the CDR3β, which contains mainly polar residues. The EBNA motif was characterized by hydrophobic amino acids at position 5 and polar amino acids at position 9. Analysis of CDR3β motifs revealed distinct patterns for each peptide, reflecting strong VJ gene preferences and specific sequence characteristics in each repertoire.
Figure 4. Composition of the epitope-specific CD4 TCRβ repertoires. Analysis of the CDR3β T-cell repertoires was presented for the peptides HA 306-318 (green) and EBNA 515-528 (blue). (A) Distribution of CDR3β clonotypes as a function of their relative abundance for each donor. (B) Proportion of the epitope-specific repertoire among the total repertoire for each peptide. (C) Amino acid length distribution of the antigen-specific TCR CDR3β. (D) Co-occurrence maps of TRBV-TRBJ gene usage in both specific antigen repertoires (HA in green and EBNA in blue). (E) Sequence logo analysis of the 14-amino-acid CDR3β sequences for the HA and EBNA antigen-specific repertoires.
Peptide-specific repertoires show overlaps between donors but are essentially private
We also compared the HA- and EBNA-specific CDR3β T-cell repertoires across 13 donors by pairwise overlap analysis (Figure 5A). Almost all pairwise comparisons of T-cell repertoires yielded less than 5% overlap and, in many cases, less than 1%, demonstrating that the private repertoire largely dominates for both peptides. Public repertoires were rare. However, donors #1237 and #1239 shared 12% of their HA-specific T-cell repertoire and both possessed the HLA-DRB1*0101 allotype (Supplementary Table S1). Donors #1198 and #1206, who also had the HLA-DRB1*0101 allotype, shared 7% of their HA-specific repertoire, but did not share a substantial common HA-specific repertoire with donors #1237 and #1239. None of these donors exhibited significant overlaps across the EBNA-specific repertoires. Notably donors #1205 and #1206, despite having completely different HLA typings, shared 8.7% and 20% of their T-cell repertoires specific for HA and EBNA, respectively. Clonotype sharing is not exclusive to donors with a common HLA-DR molecule, suggesting that some TCRs are promiscuous (27, 28). It was statistically more frequent among donors with at least one shared HLA-DR molecule for the HA repertoire, but not for the EBNA repertoire (Figure 5B).
Figure 5. Analysis of the public antigen-specific TCRβ repertoire across the donors. (A) Overlap of the antigen-specific repertoire (Morisita’s overlap index) in green for HA and blue for EBNA. (B) Number of shared public clonotypes in the antigen-specific TCRβ repertoire with at least one shared or no shared HLA DRB1 allotype.
Discussion
This study provides a comprehensive analysis of the CD4 T-cell response to viral epitopes, employing an innovative approach to explore the diversity of the TCR repertoire of antigen-specific CD4 T cells. This sensitive approach reveals a notably unbalanced, polyclonal and personalized epitope-specific T-cell repertoire.
The methodological set-up allows for a detailed examination of the diversity of the CD4 TCR CDR3β repertoire raised against multiple kinds of antigens. By culturing PBMCs with specific viral epitopes and sorting the cells based on expression of AIMs, this approach selectively enriches for the peptide-specific CD4 T cells, allowing the detection of rare antigen-specific T cells and appears to be highly sensitive. Indeed, we identified between 744 and 2911 HA-specific clonotypes per donor and between 1046 and 2971 EBNA-specific clonotypes per donor, an estimate which is greater than those of previous studies for the HA epitope (29–31). Although PCR amplification and sequencing can lead to multiple sequence errors, the specificity of the sequences is ensured by double sequencing, owing to the low probability that the same error appears in two sequencing runs. This method was applied to a large number of sorted cells and hence provides a detailed view of the diversity compared to single-cell sequencing approaches, which can be used with a limited number of cells (32, 33). Single-cell sequencing of TCR is especially useful to link the epitope specificity to cell phenotype (32, 33) and to preserve the pairing of TCR α and β chains, but it is not appropriate for in-depth investigation of T-cell repertoires. We therefore focused only on the CDR3β sequences, which present a wider combinatorial and junctional diversity than CDR3α.
We first sequenced the CDR3β T-cell repertoire of 10 million circulating CD4 T cells for each donor by dedicating at least 50 million sequencing reads to each sample. This analysis was generally performed with a lower number of cells (7–9), leading necessarily to a lower number of identified sequences than the 134,960 to 667,858 clonotypes we identified per donor. The whole T-cell repertoire appears vast and highly diverse. However, 10 million cells constitute a small sampling to account for the whole theoretical diversity of the 1012 circulating immune cells (10). Although the frequencies vary across the different studies, a limited number of clones are much more frequent than the majority of clones, which are unexpanded (7–9, 34). In our study, the top 100 clonotypes represent between 2.3% and 24.8% of the repertoire for each donor, showing considerable variation across the donors. No particular association between V, D, and J segments emerges, while the CDR3β length appears canonical. As already shown (8, 35), the number of common clonotypes across the 13 donors remains low, although we sequenced CDR3β from a large set of 10 million CD4 T cells. This observation suggests that the real diversity might exceed previous estimates (6, 9), as suggested recently (10). In fact, if the total number of clonotypes is limited, by increasing the size of the sampling, the proportion of public clonotypes should increase, which is not the case. Furthermore, we also observed a long tail of individual sequences at very low frequency (clonotype indices from 100,000 to 5,000,000), which together covered up to 50% of the total repertoire. At a scale of an entire human body, they might be present at a substantial number and contribute to immune responses. We therefore suggest that the number of sequenced T cells per individual should be increased to measure with greater accuracy the size of the whole T-cell repertoire.
The main objective of this study was the characterization of the diversity of the epitope-specific CDR3β T-cell repertoire. In contrast to the whole T-cell repertoire, which contains every stage of differentiated T cells, the epitope-specific T-cell repertoire likely corresponded to memory or effector CD4 T cells. It is therefore expected to be more narrowly focused and to show a higher degree of clonotype expansion than the whole repertoire. Indeed, for both the HA and EBNA peptides, the response was characterized by the presence of highly expanded clonotypes. Each repertoire was dominated by a single clonotype, while the top 50 clonotypes covered a mean of approximately 60% of the peptide-specific T-cell repertoires. The reason for this unbalanced distribution is unknown, but it might reflect the predominant expansion of clonotypes with high TCR avidity for the epitope. Nevertheless, a multitude of very low-frequency clonotypes contribute to approximately 40% of the epitope-specific repertoire. Previous studies dedicated to clonality of the specific T-cell repertoire revealed similar curves of distribution, but with a lower number of identified clonotypes, qualifying the T-cell response as oligoclonal (29, 31). Based on our observations, we would prefer to describe it as polyclonal.
We amplified in vitro the peptide-specific T cells to characterize them and then calculated clonotype frequencies among the entire T-cell repertoire. We found that the cumulative frequency accounted for approximately 0.5% of the total repertoire for both peptides. This appeared slightly higher than previous estimates (36, 37). Comparison with cell frequency evaluated by tetramer staining (36, 37) should be considered with caution as mRNA content might differ from one cell to another, especially if the cells are differently activated. In the context of non-activated PBMCs, we found that the cumulative frequency accounted for approximately 0.5% of the total repertoire for both peptides. This appeared slightly higher than previous estimates (36, 37). Analysis of VJ gene usage and motif preferences among the T cells responding to both peptides revealed specific patterns consistent with the selectivity of the TCR and the lack of cross-reactivity between the two peptides.
The predominance of private T-cell responses across the donors illustrates the highly personalized nature of immune responses. This has already been documented in multiple studies (38, 39) including in congenic mice (40). Each donor uses a very small part of the large universe of possible recombined TCRs, so it is relatively unlikely that they share the same set of TCRs with another donor. The T-cell privacy therefore confirms the polyclonality of the antigen-specific T-cell response and the difficulty of capturing a large part of the real TCR repertoire with too small blood samples. We acknowledge that our analysis was limited to 13 donors, owing to the experimental design that required deep sequencing of both circulating and antigen-expanded repertoires. Despite this limited cohort size, the observed patterns were consistent across donors, supporting the robustness of our conclusions. Public CDR3β sequences account for a small percentage of the HA- and EBNA-specific T cells according to several studies reporting shared CDR3 sequences (30, 31, 41). Common HLA-DRB1 allotypes were found in our study to increase significantly the overlaps of the TCR repertoire for HA but not for EBNA. Owing to the role of HLA molecules in shaping the T-cell repertoire in the thymus and in driving antigen-specific T-cell activation in the periphery, it would have been expected to be more important. Meanwhile, the substantial occurrence of shared clonotypes among donors without identical HLA-DR molecules hints at the complexity of TCR recognition and its capacity for cross-reactivity (27, 28). It is already known that some TCRs are promiscuous and recognize the same epitope presented by different HLA molecules (27, 28). TCR recognition of peptides presented by HLA molecules has been described as highly selective. TCR promiscuity highlights that HLA sequence variations do not necessarily prevent peptide recognition by T cells, thus revealing different modalities of interaction between HLA complexes and TCR (27). Further, cross-reactivity across completely different peptides (42) or allotypic molecules (43) has also been reported.
Exploration of the HA- and EBNA-specific T-cell repertoire inside the whole T-cell repertoire sheds light on the polyclonal nature of epitope-specific responses, with a small number of highly expanded clonotypes dominating the response and a multitude of low-frequency clonotypes. This wide diversity is also illustrated by the minimal overlap of the T-cell repertoire across the donors. This study also highlights the power of high-throughput sequencing, which allows for in-depth investigation of T-cell repertoire diversity due to its ability to process millions of sequences simultaneously.
Data availability statement
The data presented in this study are available in the GEO repository under accession number GSE309566.
Ethics statement
The studies involving humans were approved by Comiteí d’eíthique et de deíontologie de l’Etablissement Francçais du Sang. The studies were conducted in accordance with the local legislation and institutional requirements. Blood samples were obtained from healthy donors at Etablissement Francçais du Sang (EFS) after written informed consent for participation was provided.
Author contributions
GL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. RG: Formal analysis, Investigation, Software, Writing – review & editing. VP: Formal analysis, Investigation, Writing – review & editing. EC: Investigation, Resources, Writing – review & editing. RO: Investigation, Writing – review & editing. SH: Data curation, Formal analysis, Writing – review & editing. BM: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research and/or publication of this article. This study received funding from the exploratory CEA program. RG received a PhD scholarship from Fondation de la Recherché Médicale (FRM) and VP from the European Union’s Horizon 2020 research and innovation program under the Marie Slodowska-Curie grant agreement No 859974.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that Generative AI was used in the creation of this manuscript. To improve the English language of the manuscript. The text was subsequently reviewed and revised by a native English speaker with a scientific training (PhD in Biophysics from King’s College London).
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1713225/full#supplementary-material
References
1. Doherty PC, Allan W, Eichelberger M, and Carding SR. Roles of alpha beta and gamma delta T cell subsets in viral immunity. Annu Rev Immunol. (1992) 10:123–51. doi: 10.1146/annurev.iy.10.040192.001011
2. Restifo NP, Dudley ME, and Rosenberg SA. Adoptive immunotherapy for cancer: harnessing the T cell response. Nat Rev Immunol. (2012) 12:269–81. doi: 10.1038/nri3191
3. Davis MM. T cell receptor gene diversity and selection. Annu Rev Biochem. (1990) 59:475–96. doi: 10.1146/annurev.bi.59.070190.002355
4. von Boehmer H, Karjalainen K, Pelkonen J, Borgulya P, and Rammensee HG. The T-cell receptor for antigen in T-cell development and repertoire selection. Immunol Rev. (1988) 101:21–37. doi: 10.1111/j.1600-065x.1988.tb00731.x
5. Pannetier C, Delassus S, Darche S, Saucier C, and Kourilsky P. Quantitative titration of nucleic acids by enzymatic amplification reactions run to saturation. Nucleic Acids Res. (1993) 21:577–83. doi: 10.1093/nar/21.3.577
6. Arstila TP, Casrouge A, Baron V, Even J, Kanellopoulos J, and Kourilsky P. A direct estimate of the human alphabeta T cell receptor diversity. Science. (1999) 286:958–61. doi: 10.1126/science.286.5441.958
7. Robins HS, Campregher PV, Srivastava SK, Wacher A, Turtle CJ, Kahsai O, et al. Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells. Blood. (2009) 114:4099–107. doi: 10.1182/blood-2009-04-217604
8. Warren RL, Freeman JD, Zeng T, Choe G, Munro S, Moore R, et al. Exhaustive T-cell repertoire sequencing of human peripheral blood samples reveals signatures of antigen selection and a directly measured repertoire size of at least 1 million clonotypes. Genome Res. (2011) 21:790–7. doi: 10.1101/gr.115428.110
9. Qi Q, Liu Y, Cheng Y, Glanville J, Zhang D, Lee JY, et al. Diversity and clonal selection in the human T-cell repertoire. Proc Natl Acad Sci U S A. (2014) 111:13139–44. doi: 10.1073/pnas.1409155111
10. Mora TW AM. How many different clonotypes do immune repertoires contain? Curr Opin Syst Biol. (2019) 18:104–10.
11. Ma J, Sun G, Zhu P, Liu S, Ou M, Chen Z, et al. Determination of the complexity and diversity of the TCR beta-chain CDR3 repertoire in bladder cancer using high-throughput sequencing. Oncol letters. (2019) 17:3808–16.
12. Clemente MJ, Przychodzen B, Jerez A, Dienes BE, Afable MG, Husseinzadeh H, et al. Deep sequencing of the T-cell receptor repertoire in CD8+ T-large granular lymphocyte leukemia identifies signature landscapes. Blood. (2013) 122:4077–85. doi: 10.1182/blood-2013-05-506386
13. Emerson RO, DeWitt WS, Vignali M, Gravley J, Hu JK, Osborne EJ, et al. Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire. Nat Genet. (2017) 49:659–65. doi: 10.1038/ng.3822
14. Minervina AA, Komech EA, Titov A, Bensouda Koraichi M, Rosati E, Mamedov IZ, et al. Longitudinal high-throughput TCR repertoire profiling reveals the dynamics of T-cell memory formation after mild COVID-19 infection. eLife. (2021) 10. doi: 10.7554/eLife.63502
15. Eugster A, Lindner A, Heninger AK, Wilhelm C, Dietz S, Catani M, et al. Measuring T cell receptor and T cell gene expression diversity in antigen-responsive human CD4+ T cells. J Immunol Methods. (2013) 400-401:13–22. doi: 10.1016/j.jim.2013.11.003
16. Eugster A, Lindner A, Catani M, Heninger AK, Dahl A, Klemroth S, et al. High diversity in the TCR repertoire of GAD65 autoantigen-specific human CD4+ T cells. J Immunol. (2015) 194:2531–8. doi: 10.4049/jimmunol.1403031
17. Risnes LF, Christophersen A, Dahal-Koirala S, Neumann RS, Sandve GK, Sarna VK, et al. Disease-driving CD4+ T cell clonotypes persist for decades in celiac disease. J Clin Invest. (2018) 128:2642–50. doi: 10.1172/JCI98819
18. Pollastro S, de Bourayne M, Balzaretti G, Jongejan A, van Schaik BDC, Niewold ITG, et al. Characterization and monitoring of antigen-responsive T cell clones using T cell receptor gene expression analysis. Front Immunol. (2020) 11:609624. doi: 10.3389/fimmu.2020.609624
19. Rowntree LC, Nguyen THO, Kedzierski L, Neeland MR, Petersen J, Crawford JC, et al. SARS-CoV-2-specific T cell memory with common TCRalphabeta motifs is established in unvaccinated children who seroconvert after infection. Immunity. (2022) 55:1299–315.e4. doi: 10.1016/j.immuni.2022.06.003
20. Cecconi V, Moro M, Del Mare S, Sidney J, Bachi A, Longhi R, et al. The CD4+ T-cell epitope-binding register is a critical parameter when generating functional HLA-DR tetramers with promiscuous peptides. Eur J Immunol. (2010) 40:1603–16. doi: 10.1002/eji.200940123
21. Novak EJ, Masewicz SA, Liu AW, Lernmark A, Kwok WW, and Nepom GT. Activated human epitope-specific T cells identified by class II tetramers reside within a CD4high, proliferating subset. Int Immunol. (2001) 13:799–806. doi: 10.1093/intimm/13.6.799
22. Ausubel LJ, O’Connor KC, Baecher-Allen C, Trollmo C, Kessler B, Hekking B, et al. Characterization of in vivo expanded OspA-specific human T-cell clones. Clin Immunol. (2005) 115:313–22. doi: 10.1016/j.clim.2005.02.015
23. Montagne JM, Zheng XA, Pinal-Fernandez I, Milisenda JC, Christopher-Stine L, Lloyd TE, et al. Ultra-efficient sequencing of T Cell receptor repertoires reveals shared responses in muscle from patients with Myositis. EBioMedicine. (2020) 59:102972. doi: 10.1016/j.ebiom.2020.102972
24. Marshall KW, Liu AF, Canales J, Perahia B, Jorgensen B, Gantzos RD, et al. Role of the polymorphic residues in HLA-DR molecules in allele-specific binding of peptide ligands. J Immunol. (1994) 152:4946–57. doi: 10.4049/jimmunol.152.10.4946
25. Gelder CM, Welsh KI, Faith A, Lamb JR, and Askonas BA. Human CD4+ T-cell repertoire of responses to influenza A virus hemagglutinin after recent natural infection. J Virol. (1995) 69:7497–506. doi: 10.1128/jvi.69.12.7497-7506.1995
26. Ye M, Kasey S, Khurana S, Nguyen NT, Schubert S, Nugent CT, et al. MHC class II tetramers containing influenza hemagglutinin and EBV EBNA1 epitopes detect reliably specific CD4(+) T cells in healthy volunteers. Hum Immunol. (2004) 65:507–13. doi: 10.1016/j.humimm.2004.02.019
27. Hennecke J and Wiley DC. Structure of a complex of the human alpha/beta T cell receptor (TCR) HA1.7, influenza hemagglutinin peptide, and major histocompatibility complex class II molecule, HLA-DR4 (DRA*0101 and DRB1*0401): insight into TCR cross-restriction and alloreactivity. J Exp Med. (2002) 195:571–81. doi: 10.1084/jem.20011194
28. Mandic M, Castelli F, Janjic B, Almunia C, Andrade P, Gillet D, et al. One NY-ESO-1-derived epitope that promiscuously binds to multiple HLA-DR and HLA-DP4 molecules and stimulates autologous CD4+ T cells from patients with NY-ESO-1-expressing melanoma. J Immunol. (2005) 174:1751–9. doi: 10.4049/jimmunol.174.3.1751
29. Prevost-Blondel A, Chassin D, Zeliszewski D, Dorval I, Sterkers G, Pannetier C, et al. Preferential usage of the T-cell receptor by influenza virus hemagglutinin-specific human CD4+ T lymphocytes: in vitro life span of clonotypic T cells. J Virol. (1995) 69:8046–50. doi: 10.1128/jvi.69.12.8046-8050.1995
30. Herati RS, Muselman A, Vella L, Bengsch B, Parkhouse K, Del Alcazar D, et al. Successive annual influenza vaccination induces a recurrent oligoclonotypic memory response in circulating T follicular helper cells. Sci Immunol. (2017) 2. doi: 10.1126/sciimmunol.aag2152
31. Glanville J, Huang H, Nau A, Hatton O, Wagar LE, Rubelt F, et al. Identifying specificity groups in the T cell receptor repertoire. Nature. (2017) 547:94–8. doi: 10.1038/nature22976
32. Khatun A, Kasmani MY, Zander R, Schauder DM, Snook JP, Shen J, et al. Single-cell lineage mapping of a diverse virus-specific naive CD4 T cell repertoire. J Exp Med. (2021) 218. doi: 10.1084/jem.20200650
33. Lyu M, Wang S, Gao K, Wang L, Zhu X, Liu Y, et al. Dissecting the landscape of activated CMV-stimulated CD4+ T cells in humans by linking single-cell RNA-seq with T-cell receptor sequencing. Front Immunol. (2021) 12:779961. doi: 10.3389/fimmu.2021.779961
34. Oakes T, Heather JM, Best K, Byng-Maddick R, Husovsky C, Ismail M, et al. Quantitative characterization of the T cell receptor repertoire of naive and memory subsets using an integrated experimental and computational pipeline which is robust, economical, and versatile. Front Immunol. (2017) 8:1267. doi: 10.3389/fimmu.2017.01267
35. Zhuo Y, Yang X, Shuai P, Yang L, Wen X, Zhong X, et al. Evaluation and comparison of adaptive immunity through analyzing the diversities and clonalities of T-cell receptor repertoires in the peripheral blood. Front Immunol. (2022) 13:916430. doi: 10.3389/fimmu.2022.916430
36. Novak EJ, Liu AW, Nepom GT, and Kwok WW. MHC class II tetramers identify peptide-specific human CD4(+) T cells proliferating in response to influenza A antigen. J Clin Invest. (1999) 104:R63–7. doi: 10.1172/JCI8476
37. Danke NA and Kwok WW. HLA class II-restricted CD4+ T cell responses directed against influenza viral antigens postinfluenza vaccination. J Immunol. (2003) 171:3163–9. doi: 10.4049/jimmunol.171.6.3163
38. Porcheddu V, Lhomme G, Giraudet R, Correia E, and Maillere B. The self-reactive FVIII T cell repertoire in healthy individuals relies on a short set of epitopes and public clonotypes. Front Immunol. (2024) 15:1345195. doi: 10.3389/fimmu.2024.1345195
39. Ciacchi L, van de Garde MDB, Ladell K, Farenc C, Poelen MCM, Miners KL, et al. CD4(+) T cell-mediated recognition of a conserved cholesterol-dependent cytolysin epitope generates broad antibacterial immunity. Immunity. (2023) 56:1082–97.e6. doi: 10.1016/j.immuni.2023.03.020
40. Cibotti R, Cabaniols JP, Pannetier C, Delarbre C, Vergnon I, Kanellopoulos JM, et al. Public and private V beta T cell receptor repertoires against hen egg white lysozyme (HEL) in nontransgenic versus HEL transgenic mice. J Exp Med. (1994) 180:861–72. doi: 10.1084/jem.180.3.861
41. Greenshields-Watson A, Attaf M, MacLachlan BJ, Whalley T, Rius C, Wall A, et al. CD4(+) T cells recognize conserved influenza A epitopes through shared patterns of V-gene usage and complementary biochemical features. Cell Rep. (2020) 32:107885. doi: 10.1016/j.celrep.2020.107885
42. Lang HL, Jacobsen H, Ikemizu S, Andersson C, Harlos K, Madsen L, et al. A functional and structural basis for TCR cross-reactivity in multiple sclerosis. Nat Immunol. (2002) 3:940–3. doi: 10.1038/ni835
Keywords: TCR, clonotypes, repertoire, viral, epitopes, HLA class II, NGS, CD4 T cells
Citation: Lhomme G, Giraudet R, Porcheddu V, Correia E, Olaso R, Hua S and Maillere B (2025) Deep exploration of the TCR CDR3β repertoire specific for viral CD4 T-cell epitopes inside the circulating T-cell repertoire. Front. Immunol. 16:1713225. doi: 10.3389/fimmu.2025.1713225
Received: 25 September 2025; Accepted: 30 October 2025;
Published: 26 November 2025.
Edited by:
Varun Sasidharan Nair, Italian Institute of Technology (IIT), ItalyReviewed by:
Caio Santos Bonilha, University of Glasgow, United KingdomAlexandra Vujkovic, Institute of Tropical Medicine Antwerp, Belgium
Copyright © 2025 Lhomme, Giraudet, Porcheddu, Correia, Olaso, Hua and Maillere. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bernard Maillere, YmVybmFyZC5tYWlsbGVyZUBjZWEuZnI=
Rémi Giraudet1