ORIGINAL RESEARCH article

Front. Immunol., 08 March 2018

Sec. B Cell Biology

Volume 9 - 2018 | https://doi.org/10.3389/fimmu.2018.00395

Many Routes to an Antibody Heavy-Chain CDR3: Necessary, Yet Insufficient, for Specific Binding

    SD

    Sara D’Angelo 1

    FF

    Fortunato Ferrara 1

    LN

    Leslie Naranjo 1

    MF

    M. Frank Erasmus 1

    PH

    Peter Hraber 2

    AR

    Andrew R. M. Bradbury 1*

  • 1. Specifica Inc., Santa Fe, NM, United States

  • 2. Los Alamos National Laboratory, Los Alamos, NM, United States

Article metrics

View details

95

Citations

24,2k

Views

9,1k

Downloads

Abstract

Because of its great potential for diversity, the immunoglobulin heavy-chain complementarity-determining region 3 (HCDR3) is taken as an antibody molecule’s most important component in conferring binding activity and specificity. For this reason, HCDR3s have been used as unique identifiers to investigate adaptive immune responses in vivo and to characterize in vitro selection outputs where display systems were employed. Here, we show that many different HCDR3s can be identified within a target-specific antibody population after in vitro selection. For each identified HCDR3, a number of different antibodies bearing differences elsewhere can be found. In such selected populations, all antibodies with the same HCDR3 recognize the target, albeit at different affinities. In contrast, within unselected populations, the majority of antibodies with the same HCDR3 sequence do not bind the target. In one HCDR3 examined in depth, all target-specific antibodies were derived from the same VDJ rearrangement, while non-binding antibodies with the same HCDR3 were derived from many different V and D gene rearrangements. Careful examination of previously published in vivo datasets reveals that HCDR3s shared between, and within, different individuals can also originate from rearrangements of different V and D genes, with up to 26 different rearrangements yielding the same identical HCDR3 sequence. On the basis of these observations, we conclude that the same HCDR3 can be generated by many different rearrangements, but that specific target binding is an outcome of unique rearrangements and VL pairing: the HCDR3 is necessary, albeit insufficient, for specific antibody binding.

Introduction

Antibodies bind their targets using diversified loops, termed complementarity-determining regions (CDRs), with three in each rearranged VH and VL gene. CDRs 1 and 2 are encoded by germline V genes, while CDR3s in both VH and VL are the product of gene recombination. Compared to other CDRs, the varied length and biochemical properties of heavy-chain complementarity-determining region 3 (HCDR3) contribute to enhanced sequence diversity (1). It has been estimated (2) that the theoretical HCDR3 diversity exceeds 1015 variants, generated from fixed genomic sequences by combinatorial and junctional diversification mechanisms. This underlies the vast diversity of the human antibody repertoire. The fully assembled V(D)J gene and its incorporated HCDR3 are derived from the sequential random assembly of 56 VH, 23 DH, and 6 JH genes (36). While both VH and JH contribute to the HCDR3, the DH forms the central core. Although DH genes are predominantly read in one frame, all three frames can be used (7, 8), further increasing potential diversity. It was initially thought that D genes could also be inverted and duplicated (9, 10); however, recent deep sequencing results indicate that this is unlikely (7). Diversity is further increased by P-nucleotide-mediated (11) or N-nucleotide-mediated (12, 13) addition, or exonuclease-mediated loss (11, 14), of nucleotides between the VH/DH and DH/JH segments. Recombination between VH genes after rearrangement provides further diversification although it remains unclear how much this contributes overall (15).

There is much evidence that the HCDR3 is the major determinant of antibody-binding specificity. Specific antibodies have been selected from synthetic antibody libraries where diversity is restricted to the HCDR3 (1618). It has been shown that a greater number of antibodies were selected from a synthetic library containing only HCDR3 diversity than when the same library was combined with LCDR3 diversity (18). However, due to the random diversity in both CDR3s, this lower performance may have been due to a reduced fitness profile, caused by either the higher mutational load or potential inter-CDR structural clashes. In addition to display libraries, transgenic mice with antibody diversity restricted to the HCDR3 were able to generate high-affinity responses in vivo (19). HCDR3’s themselves have also been harvested as diversity elements (2022), and low-affinity binders have been selected from fluorescent scaffold libraries in which they provide the only diversity (23). Further evidence that HCDR3s are the major determinants of antibody-binding specificity arises from the observation that peptides derived from HCDR3 structures can show biological activity similar to the antibodies from which they were derived (24, 25), in one case even demonstrating in vivo viral neutralization (25). Furthermore, peptide libraries generated from naïve IgM HCDR3s produce specific binders against targets (21, 26), often more efficiently than synthetic peptide libraries. HCDR3s have also been transplanted from antibodies to other proteins, conferring the expected binding activity upon those non-antibody scaffolds. These include the HCDR3 from different antibodies transplanted into neocarzinostatin (27), sfGFP (23), or an epidermal growth factor-like module of human tissue-type plasminogen activator (28). In each of these cases, the grafted HCDR3 recapitulated the antibody-binding activity. Although it is evident that the HCDR3 is critical in antigen binding, diversity confined to the LCDR3 can still generate specific antibodies (29), and it is known from affinity maturation experiments that the affinities of antibodies with identical HCDR3s may differ by up to 100-fold (30, 31).

Although the other antibody-binding loops have defined canonical structures (3235), the prediction of the HCDR3 conformation is not trivial and has been found to have a wide variety of different possible configurations (35). In addition to the structural variability of HCDR3s with different sequences, the same HCDR3 can adopt different conformations within the same antibody when bound to different targets (36) or in uncomplexed antibodies with different VH/VL frameworks (37). This reflects the important role that HCDR3 plays in target recognition by antibodies (10, 19, 38) and likely shows that HCDR3 conformational flexibility is an additional diversity mechanism employed by the immune system.

The diverse nature of the HCDR3 has led to its use as a fingerprint both in vivo (3943) and in vitro (4450). In this article, we have assessed the diversity of HCDR3 sequences in an in vitro selected antibody population. We found that in vitro selection elicits hundreds of different target-specific HCDR3s, but that only within the context of a target-specific antibody population, antibodies with the same HCDR3 recognize the target. In an unselected population, we were unable to identify any sequenced antibodies with the same HCDR3 that was target specific. We conclude that the HCDR3 is necessary, but insufficient, for specific antibody binding.

Results

Selection of Anti-CDK2 Antibodies from a Naïve Human Recombinant Library

We selected antibodies against CDK2, a human cyclin-dependent kinase, provided by the Structural Genomic Consortium (SGC; Toronto), from a well-validated (44, 45, 5159) large naïve phage antibody library in the single-chain Fv (scFv) format, created by site-specific recombination (59, 60). This library was previously used to develop a combined phage and yeast display approach (45, 53), which has the advantage that many more antibodies can be identified than by regular phage display, particularly when combined with next-generation sequencing (NGS) (44). After two rounds of phage selection (using biotinylated CDK2 antigen and streptavidin-coated magnetic beads) and two rounds of yeast sorting (at 100 nM antigen concentration), almost all yeast displaying antibodies recognized the target, as shown in Figure 1A.

Figure 1

Figure 1

Yeast display sorting and analysis of CDK2-specific antibodies. (A) After two rounds of phage selection, the antibody population was displayed on yeast cells and enriched for CDK2-specific binders by two rounds of flow cytometry-assisted sorting performed at 100 nM. (B) Abundance distribution of the selected anti-CDK2 clones as identified by their heavy-chain complementarity-determining region 3 (HCDR3s) by next-generation sequencing.

The HCDR3s of the final sorted population were sequenced using IonTorrent. 32,138 total HCDR3 sequences were obtained and analyzed with the Antibody Mining Toolbox (61). 535 different HCDR3s aa sequences made up 98% of all the sequences analyzed. The remaining 2% of sequences mainly comprised HCDR3 represented by one or two sequences and were ignored for the purposes of this study as the result of possible sequencing errors. The 535 HCDR3 sequences were ranked by abundance, and their distribution is presented in Figure 1B. The majority of HCDRs were represented by limited numbers of clones.

In a previous publication (44), we described the identification and affinity determination of representative clones of 8 of the 10 most abundant HCDR3 clones. All isolated clones bound CDK2, and for each HCDR3 sequence identified, the affinity of one representative clone was assessed directly on the yeast surface (62). The affinities of these eight most abundant identified clones were found to range from 2 to 75 nM, as previously described (44).

HCDR3-Based Rescue of Anti-CDK2 Antibodies and Characterization

To assess whether an identified HCDR3 corresponded to a single clone, or a sublibrary of clones, we used an inverse PCR approach anchored within the most abundant unique HCDR3 sequence (50) to isolate all clones containing this HCDR3 within the sorting-enriched population.

When 24 random clones from this sublibrary were Sanger sequenced, eight different antibodies, all containing the same HCDR3, were identified. Additional mutations were identified in the rest of the VH, and significantly more in the VL. In fact, for the VL, four different CDR1, two different CDR2 and two different CDR3 sequences, were identified (Figure 2A). However, the different clones shared 91.6–97.8% homology, and the same rearrangements were responsible for VH (5-51, D2-08, and J3) and VL (IGLV3-21 and IGLJ1). Finally, when the affinities of the eight expressing and binding clones were calculated, they spanned a 10-fold range (from 30.1 to 352.5 nM) (Figure 2B), reflecting the 100 nM target concentration used for sorting.

Figure 2

Figure 2

Sequences and affinities of single-chain Fvs with an identical heavy-chain complementarity-determining region 3 (HCDR3). Inverse PCR of the most abundant clone of the selected population was analyzed by Sanger sequencing. (A) Sanger sequence analysis of the eight unique clones with identical HCDR3 obtained by inverse PCR. (B) VDJ gene usage and affinity values of the eight different clones.

HCDR3-Based Rescue of Non-Selected Antibodies and Characterization

Given this broad variation in affinity, and the known importance of the HCDR3 in antibody binding specificity, we applied the inverse PCR technique to the original naïve, unselected library to assess the relative abundance of clones containing the HCDR3 sequence of interest. The inverse PCR reaction, using primers specific for the top-ranked clone, was performed using a plasmid preparation of the naïve library as a template. The obtained mini-library was transformed into yeast-competent cells, and, upon induction, the cells were sorted for well-displayed antibodies (Figure 3, left and middle panels). Of note, after sorting for expression and analyzing such population for binding to CDK2 (Figure 3, third panel), only 0.15% of the clones showed binding for the cognate antigen. Ninety six clones of the population sorted for expression were sequenced. We identified 55 different scFvs containing the same HCDR3 with a far greater variation in both VH and VL than seen in the clones isolated from the selected population: 37.7–73.2% homology to the selected clones. Figure 4A shows the alignment of the VH regions of the sequenced clones. The VL families, not being under any particular selective pressure, were very diverse, derived from 12 VL-kappa and 10 VL-lambda germline genes, with 4 JL-kappa genes and 3 JL-lambda genes represented. The VH genes, on the other hand, having all been selected to contain the same HCDR3 sequence, were, not surprisingly, all found to have the same JH and DH genes (with one exception). More surprising was the diversity of the VH germline genes, which comprised 5 VH families derived from 19 different germline VH genes (Figure 4B; Table 1).

Figure 3

Figure 3

Analysis of the naïve library for single-chain Fvs (scFvs) with identical heavy-chain complementarity-determining region 3 (HCDR3s). Yeast clones displaying scFvs from the HCDR3-specific mini-library obtained from the naïve library via inverse PCR were analyzed by flow cytometry. Cells expressing scFvs on the surface and detected by anti-SV5-PE antibody were gated and sorted (left panel). Once grown and analyzed, they showed an increase in the expressing population (middle panel) with a limited, but detectable, binding for biotinylated CDK2 detected by streptavidin-APC conjugation (right panel).

Figure 4

Figure 4

Analysis of antibody sequences obtained by inverse PCR. (A) The alignment of antibody sequences with the same heavy-chain complementarity-determining region 3 (HCDR3) from naïve and selected libraries shows differences in VH gene sequences. This representation, a so-called highlighter plot, indicates amino acid mutations in the sites that differ from the first sequence. The alignment was generated with the Highlighter Tool (hiv.lanl.gov/content/sequence/HIGHLIGHT/highlighter_top.html). (B) The sequences of the antibodies sharing the same HCDR3 obtained by inverse PCR from the population enriched for specific antigen binders were phylogenetically clustered to show relations among variants that share VH alleles. (C) Single-chain Fvs (scFvs) sharing the same HCDR3 but obtained from either the selected or the naïve population were displayed on yeast and their binding activity was measured by flow cytometry and reported in the histogram as percentage of binding. None of the scFvs isolated from the naïve library bound CDK2, as opposed to the ones from the selected population.

Table 1

Clone IDVH geneDH geneJH geneHCDR3VL geneJL geneLCDR3
C7IGHV1-2IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2-28IGKJ3CMQTLQTPFTF
G12IGHV1-2IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV2-14IGLJ1CSSYTSVSTYVF
E2IGHV1-2IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV7-46IGLJ2CLLDYTDARVF
D7IGHV1-3IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-39IGKJ1CQQSYSTPWTF
A6IGHV1-46IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2-28IGKJ4CMQSLQTPLTF
C9IGHV1-46IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV2-8IGLJ2CSSYAGSNNVVF
D2IGHV1-46IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV2-14IGLJ1CCSYGGPYVF
H11IGHV1-46IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-5IGKJ4CQQYYSTPLTF
A12IGHV1-69IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV1-40IGLJ1CQSYDSSLSGYVF
H10IGHV1-69IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV2-18IGLJ1CSSYTSSSTYVF
B2IGHV3-21IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-12IGKJ3CQQTNSFPFTF
D9IGHV3-21IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-9IGKJ5CHQTDTLPITF
A11IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-5IGKJ2CQQYDTLPRTF
B5IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-39IGKJ4CQQSYSTPPTF
G4IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV4-1IGKJ2CQQYHSTPYTF
A2IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-5IGKJ2CQQYVECSF
A5IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2-28IGKJ2CMQALQSPRTF
A8IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3-11IGKJ2CQQYNNWPPYTF
B4IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-17IGKJ4CLQHNLYPRTF
D1IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3D-15IGKJ4CQQYNNWPPLTF
D4IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3D-20IGKJ2CQQFGGSPKCSF
E1IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-17IGKJ4CLQHNTYPLTF
F2IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2-28IGKJ1CMQALQTPWTF
G2IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV3-1IGLJ2CQVWDSNSHVVF
G5IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3D-20IGKJ5CQQRSNWPLTF
B3IGHV3-23IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV2-14IGLJ2CAAWDSSLSAVVF
A1IGHV3-30IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-33IGKJ5CQQYDKLPLTF
B1IGHV3-30IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-12IGKJ2CQQGYSFPRTF
D12IGHV3-53IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV3-21IGLJ2CQAWDTHDDPWGVF
E5IGHV3-64IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-33IGKJ2CVQHRGYPRYTF
C10IGHV3-64DIGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3-11IGKJ3CQQRINRVTF
A3IGHV3-7IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3D-20IGKJ3CQQYSLYPLSF
G9IGHV3-7IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV1-44IGLJ1CQAWDSRTAVF
B8IGHV3-72IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-9IGKJ4CQQLNSYPLAF
G3IGHV3-9IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3-11IGKJ5CQQRGNWPPGATF
G1IGHV4-31IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV3-1IGLJ2CQAWDSGTVVF
B6IGHV4-31IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2-28IGKJ2CMQALQSPRTF
H1IGHV4-31IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV3-1IGLJ2CQAWDSSTAVF
F1IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-17IGKJ1CLQHNNYPRTF
G8IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2D-29IGKJ2CMQGTHWPRTF
H5IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV4-1IGKJ4CQQYYSTPLTF
A10IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-17IGKJ1CLQHNNYPRTF
E3IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV2-28IGKJ1CMQALQAPWTL
F4IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV3-21IGLJ2CQVWDSRDQHVAF
A7IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-33IGKJ2CQQYDNLRYSF
E12IGHV4-34IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV3-11IGKJ2CQQRSNSPPFTF
C4IGHV4-4IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV1-51IGLJ3CGTWDSSLSAGVF
B7IGHV4-59IGHD3-10IGHJ3CAKGFRAGDAFDIWIGKV4-1IGKJ3CQQFYSTPPLFTF
C8IGHV4-61IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV2-8IGLJ2CSSYTGSNNWRVVF
A9IGHV5-51IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-5IGKJ3CQQSYSTPLFTF
C6IGHV5-51IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-5IGKJ4CLQHDEYPLTF
E6IGHV5-51IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-5IGKJ4CQQADSVPLTF
F7IGHV5-51IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV4-1IGKJ3CQQYSSIPFTF
F8IGHV5-51IGHD2-8IGHJ3CAKGFRAGDAFDIWIGLV7-43IGLJ3CLLYYGGAQLGVF
G11IGHV5-51IGHD2-8IGHJ3CAKGFRAGDAFDIWIGKV1-39IGKJ1SQQSYDSPPMTF

Analyses of the gene usage from the heavy-chain complementarity-determining region 3 (HCDR3)-specific non-selected population.

Testing of the scFvs by flow cytometry from the selected and naïve populations revealed that none of the scFvs containing this HCDR3 isolated from the naïve library were able to bind CDK2, while all those from the selected population bound CDK2 (Figure 4C).

We were surprised that the same HCDR3 could be assembled from so many different germline VH genes in the naïve unselected library. As the library we used was originally created by cloning the rearranged VH and VL genes of peripheral blood lymphocytes from 40 donors, this convergent HCDR3 assembly may be a normal consequence of the generation of antibody diversity, or it could be a result of the various PCR reactions we performed to create the library, as well as the final inverse PCR anchored within the HCDR3.

In Vivo HCDR3 Generation

In order to assess the prevalence of identical HCDR3s derived from different germline genes in vivo, we analyzed two publicly available datasets of naïve B cell sequences (63, 64), referred to respectively as the “DeKosky” and the “DeWitt” datasets. In the first, ~55,000 naïve VH sequences from three donors were obtained by paired end MiSeq reads, and 23 HCDR3 sequence pairs were found to be shared between two of the three donors in the naïve repertoires. This represents a frequency of 0.083% shared HCDR3s. No HCDR3 was found to be shared among all three donors. Interestingly, all of these 23 HCDR3 pairs were discordant for identified VH germline genes, and seven were also discordant for the identified DH gene (Table 2A). However, all pairs share the same JH gene. In the second analysis (64), based on a dataset of 8,596,145 productive MiSeq reads comprising 7,984,053 unique HCDR3s from the naïve B cells of three donors, we identified 568 identical HCDR3s (0.007% of the total unique HCDR3s) generated by different VDJ recombinations (as determined by IMGT). These were generated using from two to 26 different VDJ combinations (see Table S1 in Supplementary Material), and 176 of these rearrangements were found in all three donors. Two of the HCDR3s with the greatest number of rearrangements are illustrated in Table 2B. The first, CARDRGDYW, was generated from 14 different VH genes (from five different VH gene families), five different DH genes and one JH gene in 26 different combinations. Five of these combinations were found in two donors, and two were found in all three donors, the remaining were unique combinations found in individual donors. The second, CARDSSGWYYFDYW, was a longer HCDR3 and was generated from 20 different VH genes (from all seven different VH gene families) and only one DH and one JH gene. Six of the combinations were found in two donors and four in all three donors.

Table 2

(A)

HCDR3 aa sequenceLengthVH familyVH geneDH geneJH gene

CAKDGYW5IGHV01IGHV1-46IGHJ4
CAKDGYW5IGHV03IGHV3-23IGHJ4
CARADDAFDIW9IGHV01IGHV1-69IGHJ3
CARADDAFDIW9IGHV04IGHV4-34IGHD1-1IGHJ3
CARALYYFDYW9IGHV01IGHV1-46IGHD1-7IGHJ4
CARALYYFDYW9IGHV01IGHV1-2IGHD3-16IGHJ4
CARDKYYFDYW9IGHV01IGHV1-69IGHD1-14IGHJ4
CARDKYYFDYW9IGHV04IGHV4-59IGHJ4
CARDLDYW6IGHV03IGHV3-11IGHJ4
CARDLDYW6IGHV03IGHV3-33IGHJ4
CARDPFDYW7IGHV01IGHV1-69IGHJ4
CARDPFDYW7IGHV01IGHV1-46IGHJ4
CARDPGPW6IGHV01IGHV1-69IGHD1-14IGHJ5
CARDPGPW6IGHV03IGHV3-33IGHD1-14IGHJ5
CARDRSSSFDYW10IGHV03IGHV3-33IGHD6-13IGHJ4
CARDRSSSFDYW10IGHV03IGHV3-74IGHD6-13IGHJ4
CARDSGNDYW8IGHV07IGHV7-4IGHD6-13IGHJ4
CARDSGNDYW8IGHV01IGHV1-8IGHD4-23IGHJ4
CARDSSGYFDYW10IGHV01IGHV1-46IGHD3-22IGHJ4
CARDSSGYFDYW10IGHV01IGHV1-18IGHD3-22IGHJ4
CARDYCSGGSCYFDYW14IGHV04IGHV4-31IGHD2-15IGHJ4
CARDYCSGGSCYFDYW14IGHV03IGHV3-21IGHD2-15IGHJ4
CARGAAPDYW8IGHV01IGHV1-46IGHD5-12IGHJ4
CARGAAPDYW8IGHV03IGHV3-53IGHD2-15IGHJ4
CARGAYYFDYW9IGHV04IGHV4-59IGHD3-16IGHJ4
CARGAYYFDYW9IGHV03IGHV3-33IGHJ4
CARGGNWFDPW9IGHV04IGHV4-34IGHD3-10IGHJ5
CARGGNWFDPW9IGHV04IGHV4-30IGHD2-15IGHJ5
CARGGYGDYVDYW11IGHV01IGHV1-46IGHD4-17IGHJ4
CARGGYGDYVDYW11IGHV01IGHV1-18IGHD4-17IGHJ4
CARGIAAADYW9IGHV03IGHV3-48IGHD6-13IGHJ4
CARGIAAADYW9IGHV01IGHV1-69IGHD6-13IGHJ4
CARGRVFDYW8IGHV04IGHV4-34IGHD3-16IGHJ4
CARGRVFDYW8IGHV02IGHV2-5IGHD1-26IGHJ4
CARGSSFDYW8IGHV04IGHV4-59IGHD3-10IGHJ4
CARGSSFDYW8IGHV03IGHV3-53IGHD6-6IGHJ4
CARGVAARDYW9IGHV04IGHV4-59IGHD6-6IGHJ4
CARGVAARDYW9IGHV03IGHV3-48IGHD6-6IGHJ4
CARRFDPW6IGHV03IGHV3-21IGHJ5
CARRFDPW6IGHV04IGHV4-34IGHJ5
CARRLGNWYFDLW11IGHV03IGHV3-11IGHD3-10IGHJ2
CARRLGNWYFDLW11IGHV04IGHV4-61IGHD7-27IGHJ2
CARVGSGWYFDYW11IGHV03IGHV3-66IGHD6-19IGHJ4
CARVGSGWYFDYW11IGHV03IGHV3-7IGHD6-19IGHJ4
CASNDAFDIW8IGHV01IGHV1-46IGHJ3
CASNDAFDIW8IGHV05IGHV5-10IGHJ3

(B)

HCDR3 aa Sequence# rearrangementsVH familyVH geneDH geneJH geneDonor representation

CARDRGDYW26IGHV01IGHV01-02IGHD03-10IGHJ04-01donor1
CARDRGDYW26IGHV01IGHV01-02IGHD05-24IGHJ04-01donor3
CARDRGDYW26IGHV01IGHV01-03IGHD01-26IGHJ04-01donor3
CARDRGDYW26IGHV01IGHV01-03IGHD03-10IGHJ04-01donor3
CARDRGDYW26IGHV01IGHV01-03IGHD06-25IGHJ04-01donor1
CARDRGDYW26IGHV01IGHV01-18IGHD03-10IGHJ04-01donor3
CARDRGDYW26IGHV01IGHV01-18IGHD03-16IGHJ04-01donor1, donor3
CARDRGDYW26IGHV01IGHV01-18IGHD05-24IGHJ04-01donor2
CARDRGDYW26IGHV01IGHV01-18IGHD06-25IGHJ04-01donor3
CARDRGDYW26IGHV01IGHV01-46IGHD03-10IGHJ04-01donor1, donor2, donor3
CARDRGDYW26IGHV01IGHV01-46IGHD03-16IGHJ04-01donor2
CARDRGDYW26IGHV01IGHV01-69IGHD03-10IGHJ04-01donor1, donor2, donor3
CARDRGDYW26IGHV01IGHV01-69IGHD03-16IGHJ04-01donor2
CARDRGDYW26IGHV03IGHV03-11IGHD03-10IGHJ04-01donor1
CARDRGDYW26IGHV03IGHV03-11IGHD03-16IGHJ04-01donor1
CARDRGDYW26IGHV03IGHV03-13IGHD03-10IGHJ04-01donor1
CARDRGDYW26IGHV03IGHV03-48IGHD03-10IGHJ04-01donor3
CARDRGDYW26IGHV03IGHV03-53IGHD03-10IGHJ04-01donor1, donor3
CARDRGDYW26IGHV03IGHV03-53IGHD03-16IGHJ04-01donor3
CARDRGDYW26IGHV03IGHV03-53IGHD05-24IGHJ04-01donor1
CARDRGDYW26IGHV03IGHV03-64IGHD03-10IGHJ04-01donor1, donor3
CARDRGDYW26IGHV03IGHV03-66IGHD03-10IGHJ04-01donor2, donor3
CARDRGDYW26IGHV04IGHV04-39IGHD03-10IGHJ04-01donor1, donor3
CARDRGDYW26IGHV04IGHV04-39IGHD03-16IGHJ04-01donor3
CARDRGDYW26IGHV05IGHV05-51IGHD03-16IGHJ04-01donor2
CARDRGDYW26IGHV07IGHV07-04IGHD03-10IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV01IGHV01-02IGHD06-19IGHJ04-01donor1, donor2, donor3
CARDSSGWYYFDYW20IGHV01IGHV01-03IGHD06-19IGHJ04-01donor1, donor2, donor3
CARDSSGWYYFDYW20IGHV01IGHV01-08IGHD06-19IGHJ04-01donor1
CARDSSGWYYFDYW20IGHV01IGHV01-18IGHD06-19IGHJ04-01donor1, donor2, donor3
CARDSSGWYYFDYW20IGHV01IGHV01-46IGHD06-19IGHJ04-01donor1, donor3
CARDSSGWYYFDYW20IGHV01IGHV01-69IGHD06-19IGHJ04-01donor2, donor3
CARDSSGWYYFDYW20IGHV02IGHV02-70IGHD06-19IGHJ04-01donor1, donor3
CARDSSGWYYFDYW20IGHV03IGHV03-11IGHD06-19IGHJ04-01donor1
CARDSSGWYYFDYW20IGHV03IGHV03-20IGHD06-19IGHJ04-01donor1
CARDSSGWYYFDYW20IGHV03IGHV03-23IGHD06-19IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV03IGHV03-48IGHD06-19IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV03IGHV03-53IGHD06-19IGHJ04-01donor1, donor2, donor3
CARDSSGWYYFDYW20IGHV03IGHV03-64IGHD06-19IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV03IGHV03-66IGHD06-19IGHJ04-01donor2, donor3
CARDSSGWYYFDYW20IGHV03IGHV03-72IGHD06-19IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV03IGHV03-74IGHD06-19IGHJ04-01donor1, donor2
CARDSSGWYYFDYW20IGHV04IGHV04-39IGHD06-19IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV05IGHV05-51IGHD06-19IGHJ04-01donor3
CARDSSGWYYFDYW20IGHV06IGHV06-01IGHD06-19IGHJ04-01donor1, donor2
CARDSSGWYYFDYW20IGHV07IGHV07-04IGHD06-19IGHJ04-01donor3

Analyses of (A) DeKosky and (B) DeWitt data for identical HCDR3s.

Discussion

Next-generation sequencing has been widely applied to many areas of human immunology, helping, for instance, to increase understanding of antibody repertoires (6471), VH/VL pairing (39, 67), humoral responses to pathogens (7274), vaccination (41, 73, 75, 76), and the role of antibodies in autoimmune conditions (77) and cancer (78). In addition to its role in understanding natural in vivo humoral responses, NGS has also been used in the practice of in vitro antibody selection, including in the sequencing of antibodies selected by phage (4749) and yeast (44, 50) display, as well as in the analysis of naïve antibody libraries (61, 79, 80). We have previously shown (61) that the number of specific antibody HCDR3s that can be identified using NGS after a combined phage/yeast selection protocol far exceeds the number that can be isolated using standard low-throughput analysis and sequencing methods. After in vitro selections, we routinely use the HCDR3s as unique identifiers to rank antibody abundance. Identified clones can then be isolated by inverse PCR (50).

In the work presented here, we show that 535 different HCDR3s are identified by NGS of a yeast displayed population that is positive for binding to CDK2 (Figure 1A). This mirrors previous work in which we showed that hundreds of different HCDR3s were able to mediate specific binding against a number of different targets (44). When antibodies containing the most abundant HCDR3 were isolated from the selected pool using specific inverse primers, a single scFV gene was not obtained, but an “oligoclonal” population of specific binders, comprising at least eight different antibody sequences. These are all very similar to one another (91.6–97.8% homology), with most of the variation occurring in the VL but with the germline VH, DH, JH, VL, and JL genes identified as being identical. Analysis of these clones revealed a 10-fold difference in affinity (Kd), confirming the importance of additional antibody structure beyond the HCDR3 in modulating binding activity (30, 31) and indicating that the true diversity of anti-CDK2 antibodies could be significantly higher than 535, when variability in VL and HCDR1 and HCDR2 is also taken into account.

Given the identification of different antibodies with identical HCDR3s, all of which bound the target in the selected population, we turned to the naïve library to assess whether the same HCDR3 within the context of different antibodies would also be able to bind the target. By using the same inverse PCR approach, a far more diverse collection of antibodies, all of which contained the same HCDR3, was isolated. However, none of these were able to bind the target, and analysis of the aligned sequences revealed that apart from the identical HCDR3’s, these antibodies comprised very different VL genes. This was not surprising since VLs were randomly recombined and not under selective pressure. More surprising was the finding that the frameworks and CDR1 and CDR2 of the VHs were largely diverse, corresponding to 19 different germline VH genes. When this population was tested for binding to CDK2 by flow cytometry, only 0.15% of displayed antibodies with the identical HCDR3 bound the target. On the basis of these findings, we conclude that a specific HCDR3 will only define a particular binding specificity within a very narrow structurally appropriate context: i.e., HCDR3 is necessary, but is insufficient to define specific antibody-binding properties unless combined with appropriate VL and VH germline genes. This is perhaps not surprising given a recent report in which structural analysis of the same HCDR3 sequence placed within the context of different VH and VL genes shows significant conformational diversity (37). Those results, along with those presented here, suggest that the conformations of HCDR3 conformations are modified not only by their sequences but also by the structural environment in which they are found: in particular their VH and VL pairing.

It is remarkable that so many different rearranged VH genes, derived from 19 germline genes, were found to contain the same HCDR3. This begs the question as to whether the generation of identical HCDR3s from different germline genes is biological in nature, or a result of the molecular biological manipulations we had undertaken in these experiments. In a couple of published NGS analyses of in vivo naïve B cell HCDR3 repertoires (63, 64), 0.04–0.08% of HCDR3s were found to be shared between any pairs of donors. Further analysis of the sequences described in the study by DeKosky et al. (63) (Table 2A) reveals that all these so-called public HCDR3s were derived from different germline VH (and in some cases DH) genes, suggesting that the generation of identical HCDR3 sequences is stochastic and usually occurs using different germline VH and DH combinations. This conclusion was confirmed and extended by a much larger second dataset (64), which was generated by sequencing the naïve B cell repertoire of three individuals at far greater depth (>8.5 M productive reads total). Different rearrangements encoding identical HCDR3s were found both within and between donors. Altogether, 568 different HCDR3s generated with from 2 to 26 different rearrangements were identified (see Table S1 in Supplementary Material). Of these, 176 rearrangements, comprising 155 different HCDR3s, were found in all 3 donors. In a particularly notable example, the same four rearrangements (using four different VH genes and the same DH and JH genes) were found in all three donors for two of these HCDR3s (CARGYSSGWYYFDYW and CARDSSGWYYFDYW) (see Table S1 in Supplementary Material; Table 2B). These results demonstrate that the creation of identical HCDR3s from different VH or DH germline genes is a regular, albeit rare, occurrence in vivo and that the sequences of the HCDR3s, as well as the rearrangements used to create them, are shared among different individuals. The observed in vitro HCDR3 rearrangement diversity, therefore, more likely reflects the original in vivo recombination, rather than the consequence of molecular biological manipulation. This is further confirmed by in vitro selection experiments from natural naïve libraries (81), in which it was found that antibodies with the same HCDR3 sequence were derived using different VH genes.

The library used here (59) was created from the rearranged V genes of 40 donors and is estimated to comprise approximately 3.3 × 106 different HCDR3s (61). The in vivo data described above suggest that most, if not all, of the identical HCDR3s identified in the naïve library were stochastically derived from different germline VH, DH, and JH gene rearrangements in the original donors. However, it cannot be excluded that this natural diversity was supplemented by some of the in vitro molecular biological manipulations we have carried out. In particular, the inverse PCR primers we used to isolate all identical HCDR3s may be “correcting” slightly different sequences to the desired HCDR3, even if, given the primer lengths (18–23 bases), they would have to be extremely similar to be able to do this. Furthermore, inadvertent PCR errors may have increased the apparent diversity of the surrounding VH gene despite the use of a proofreading polymerase. One surprising result was the low percentage (0.15%) of CDK2-binding clones containing the identified HCDR3 in the naïve library.

The earliest naïve in vitro antibody libraries (16, 82, 83) had claimed diversities of ~5 × 107, and an average of ~4 antibodies were selected per target. A smaller subset (107) of a much larger library yielded ~1 antibody per target (84). Assuming the diversity estimates for the sizes of these (sub)libraries is correct, these results suggest that one should expect one positive antibody per ~107 different antibodies, consistent with theoretical analyses of library size (85). However, as library size has scaled upward (to claimed diversities of >1011), the number of antibodies selected against individual targets has generally remained below 100 in the absence of heroic efforts (81). The use of deep sequencing described here, and elsewhere (44, 4750), indicates that the gap between the potential diversity of selectable antibodies, and the significantly lower number usually analyzed is predominantly a sampling problem, which can be overcome with ongoing improvements in sequencing technology. This will allow the calculation, rather than the estimation, of the true diversity of antibody repertoires and antigen-specific populations selected from them.

Materials and Methods

Bacterial and Yeast Strains

  • DH5aF′: F′/endA1 hsdR17(rKmK+) supE44 thi-1 recA1 gyrA (Na1r) relA1 D(lacZYAargF) U169 (m80lacZDM15)

  • Omnimax (Life Technologies): F′ {proAB lacIq lacZM15 Tn10(TetR) (ccdAB)} mcrA (mrr hsdRMS-mcrBC) 80(lacZ)M15 (lacZYAargF)U169 endA1 recA1 supE44 thi-1 gyrA96 relA1 tonA panD

  • EBY100 (kindly provided by Prof. Dane Wittrup): MATa AGA::GAL1-AGA1::URA3 ura3-52 trp1 leu2-delta200 his3-delta200 pep4::HIS3 prb11.6R can1 GAL

scFv Antibody Selections

In vivo biotinylated His-tagged CDK2 protein (NP_001789.2), produced by the SGC (Toronto) was used for the scFv phage display selections. The naïve scFv library described in the study by Sblattero and Bradbury (59) was used for two rounds of phage display against the antigen with streptavidin magnetic beads. Two additional rounds of yeast display sorting were performed using 100 nM of antigen. The detailed protocol for antibody selections against biotinylated proteins is described in the study by Ferrara et al. (45).

For the selection of clones sharing same HCDR3 derived from the naïve library and cloned into yeast display vector (see below), cells were induced and labeled with anti-SV5-PE to assess the scFv display level.

All the flow cytometry-assisted sorting experiments were performed using the FACSAria (Becton Dickinson) sorter and analyzed using FlowJo software (FlowJo LLC).

When single clones were analyzed for their specificity, they were stained with CDK2, unrelated antigen, and conjugated streptavidin, as negative controls. All experiments with single clones were performed in a 96-well format using the LSRII (Becton Dickinson) flow cytometer.

Next-Generation Sequencing

The plasmid DNA of the anti-CDK2 second sort output was used as a template for the PCR targeting the HCDR3 region of the scFvs. A set of forward primers mapping to the framework region upstream of the HCDR3 and carrying one of the Ion Torrent sequencing adaptors were used in combination with a barcoded reverse primer mapping to the common SV5 tag region of the yeast display vector and carrying the second adaptor required for sequencing. The primer sequences and method are described in detail by D’Angelo et al. (61). Once amplified with the proofreading Phusion polymerase (NEB), gel extracted, and quantified (Q-bit, HS-DNA kit, Invitrogen), the amplicon libraries were processed using the Ion Xpress Amplicon library protocol and then prepared for sequencing on the Ion 316 Chip (Life Technologies). The sequences analysis was performed using the AbMining Toolbox as described by D’Angelo et al. (61).

Primer Design and Inverse PCR

The inverse PCR strategy is described in the study by D’Angelo et al. (50). Briefly, primers were designed on the DNA consensus sequence for the HCDR3 of the top-ranked clone as back to back primers directed outward from the middle of the HCDR3, with a 5′ phosphorylated forward primer. The inverse PCR was carried out using a high-fidelity polymerase with proofreading activity (Phusion High Fidelity Polymerase, NEB) and either 0.03 fmol of plasmid DNA obtained from the yeast sorted population (1,000–10,000 times the diversity of the sorting output) or 0.3 fmol of the original phage naïve library were used as a template.

After amplification, the PCR product was gel extracted and purified (Qiaquick Gel extraction kit, Qiagen) to avoid contamination from the original plasmid template. The purified products were ligated with T4 ligase and transformed into DH5aF′ bacterial cells.

For the clones obtained from the second yeast sort enriched for CDK2 binders, single clones were analyzed by Sanger sequencing to confirm the presence of the correct HCDR3 and obtain the sequence of the full-length scFv, before carrying out binding assays. The sequenced plasmid clones were then retransformed into the EBY100 yeast display strain (Yeast transformation kit, Sigma) for testing by flow cytometry.

When the entire unselected naïve library was used as a template to isolate clones sharing the same HCDR3 by inverse PCR, ligation and bacteria transformation were performed. A plasmid preparation was obtained and transformed into the EBY100 yeast display cells. The final product was a sublibrary of scFvs, with 106 clones sharing the same HCDR3.

Affinity Measurement

The affinity of the selected clones was determined by yeast display using the equilibrium binding titration curve to extrapolate the equilibrium dissociation constant (KD), as described by Boder and Wittrup (86). Briefly, the induced, monoclonal populations of yeast displaying scFvs were incubated with eight concentrations of biotinylated antigen, ranging from 1 to 500 nM for 30 min at room temperature to allow the binding reaction to reach equilibrium followed by 5 min on ice for 5 min to reduce the off-rate. After washing, the yeast cells were incubated with the secondary reagents (streptavidin-Alexa633, to detect antigen binding to the displayed scFvs, and anti-SV5-PE, to detect the yeast displayed scFvs) on ice for 30 min. After the final washes, the samples were analyzed by flow cytometry on the BDAriaIII (BD Biosciences). The mean fluorescence intensities of the gated binding/displaying populations were plotted against the antigen concentration, and a non-linear least-squares analysis was used to fit the curve and obtain the KD values for each scFv.

In Vivo Database Analysis

In the first database (63), the identity of VH, DH, and JH genes making up each individual HCDR3 were determined on the basis of the nucleotide sequences using IMGT (87) and NCBI IgBlast software (88), along with a CDR3 motif identification algorithm (89). The final data set comprising HCDR3 sequences found in three individuals was kindly provided by DeKosky and Georgiou. The second database (64) used a scored alignment across a definition list of all the known VH, DH, and JH genes in IMGT (90). The naïve data sets for three individual donors were downloaded from the public repository found at http://adaptivebiotech.com/pub/robins-bcell-2016.1 Data were filtered through RStudio software package [RStudio Team (2015) RStudio: Integrated Development for R. RStudio, Inc. (Boston, MA, USA)],2 whereby 7.4 × 106, 6.0 × 106, and 8.4 × 106 individual sequences were processed for the three separate donors, respectively. In some cases, the families could be identified, but not the individual germline VH, DH, or JH genes. Therefore, this initial data set was processed further to include only complete data sets (e.g., HCDR3, VH, DH, JH designations). Any sequence that contained multiple gene or family designations and/or stop codons within HCDR3 was excluded. The final curated set consisted of 2.6 × 106, 2.4 × 106, and 3.6 × 106 in the three respective donors, of which the unique HCDR3 sequences and VH, DH, and JH gene recombination were tabulated.

Statements

Author contributions

SD and FF equally contributed to this work. SD, FF, and AB contributed to research design. SD, FF, and LN conducted experiments; SD, FF, PH, and ME performed data analysis. SD, FF, ME, PH, and AB wrote the manuscript.

Funding

This work was supported by the National Institutes of Health (1-U54-DK093500-01 to AB).

Acknowledgments

We thank Dr. Brandon DeKosky and Dr. George Georgiou for kindly sharing their antibody data set and Marissa Vignali for help accessing the DeWitt database.

Conflict of interest

SD, FF, LN, FE, and AB are employees of Specifica Inc. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at http://www.frontiersin.org/articles/10.3389/fimmu.2018.00395/full#supplementary-material.

References

  • 1

    ZemlinMKlingerMLinkJZemlinCBauerKEnglerJAet alExpressed murine and human CDR-H3 intervals of equal length exhibit distinct repertoires that differ in their amino acid composition and predicted range of structures. J Mol Biol (2003) 334(4):73349.10.1016/j.jmb.2003.10.007

  • 2

    ElhanatiYSethnaZMarcouQCallanCGJrMoraTWalczakAM. Inferring processes underlying B-cell repertoire diversity. Philos Trans R Soc Lond B Biol Sci (2015) 370(1676):20140243.10.1098/rstb.2014.0243

  • 3

    JungDGiallourakisCMostoslavskyRAltFW. Mechanism and control of V(D)J recombination at the immunoglobulin heavy chain locus. Annu Rev Immunol (2006) 24:54170.10.1146/annurev.immunol.23.021704.115830

  • 4

    BassingCHSwatWAltFW. The mechanism and regulation of chromosomal V(D)J recombination. Cell (2002) 109(Suppl):S4555.10.1016/S0092-8674(02)00675-X

  • 5

    MatsudaFShinEKHirabayashiYNagaokaHYoshidaMCZongSQet alOrganization of variable region segments of the human immunoglobulin heavy chain: duplication of the D5 cluster within the locus and interchromosomal translocation of variable region segments. EMBO J (1990) 9(8):25016.

  • 6

    LefrancMPGiudicelliVGinestouxCJabado-MichaloudJFolchGBellahceneFet alIMGT, the international ImMunoGeneTics information system. Nucleic Acids Res (2009) 37(Database issue):D100612.10.1093/nar/gkn838

  • 7

    BenichouJGlanvilleJPrakETAzranRKuoTCPonsJet alThe restricted DH gene reading frame usage in the expressed human antibody repertoire is selected based upon its amino acid content. J Immunol (2013) 190(11):556777.10.4049/jimmunol.1201929

  • 8

    CorbettSJTomlinsonIMSonnhammerELBuckDWinterG. Sequence of the human immunoglobulin diversity (D) segment locus: a systematic analysis provides no evidence for the use of DIR segments, inverted D segments, “minor” D segments or D-D recombination. J Mol Biol (1997) 270(4):58797.10.1006/jmbi.1997.1141

  • 9

    SanzI. Multiple mechanisms participate in the generation of diversity of human H chain CDR3 regions. J Immunol (1991) 147(5):17209.

  • 10

    VanDykLMeekK. Assembly of IgH CDR3: mechanism, regulation, and influence on antibody diversity. Int Rev Immunol (1992) 8(2–3):12333.10.3109/08830189209055568

  • 11

    FeeneyAJVictorKDVuKNadelBChukwuochaRU. Influence of the V(D)J recombination mechanism on the formation of the primary T and B cell repertoires. Semin Immunol (1994) 6(3):15563.10.1006/smim.1994.1021

  • 12

    DesiderioSVYancopoulosGDPaskindMThomasEBossMALandauNet alInsertion of N regions into heavy-chain genes is correlated with expression of terminal deoxytransferase in B cells. Nature (1984) 311(5988):7525.10.1038/311752a0

  • 13

    BenedictCLGilfillanSThaiTHKearneyJF. Terminal deoxynucleotidyl transferase and repertoire development. Immunol Rev (2000) 175:1507.10.1111/j.1600-065X.2000.imr017518.x

  • 14

    GaussGHLieberMR. Mechanistic constraints on diversity in human V(D)J recombination. Mol Cell Biol (1996) 16(1):25869.10.1128/MCB.16.1.258

  • 15

    WilsonPCWilsonKLiuYJBanchereauJPascualVCapraJD. Receptor revision of immunoglobulin heavy chain variable region genes in normal human B lymphocytes. J Exp Med (2000) 191(11):188194.10.1084/jem.191.11.1881

  • 16

    BarbasCFIIIBainJDHoekstraDMLernerRA. Semisynthetic combinatorial antibody libraries: a chemical solution to the diversity problem. Proc Natl Acad Sci U S A (1992) 89(10):445761.10.1073/pnas.89.10.4457

  • 17

    BraunagelMLittleM. Construction of a semisynthetic antibody library using trinucleotide oligos. Nucleic Acids Res (1997) 25(22):46901.10.1093/nar/25.22.4690

  • 18

    MahonCMLambertMAGlanvilleJWadeJMFennellBJKrebsMRet alComprehensive interrogation of a minimalist synthetic CDR-H3 library and its ability to generate antibodies with therapeutic potential. J Mol Biol (2013) 425(10):171230.10.1016/j.jmb.2013.02.015

  • 19

    XuJLDavisMM. Diversity in the CDR3 region of V(H) is sufficient for most antibody specificities. Immunity (2000) 13(1):3745.10.1016/S1074-7613(00)00006-6

  • 20

    KissCFisherHPesaventoEDaiMValeroROveckaMet alAntibody binding loop insertions as diversity elements. Nucleic Acids Res (2006) 34(19):e132.10.1093/nar/gkl681

  • 21

    DerooSFischerABeaupainNCounsonMBoutonnetNPletinckxJet alNon-immunized natural human heavy chain CDR3 repertoires allow the isolation of high affinity peptides mimicking a human influenza hemagglutinin epitope. Mol Immunol (2008) 45(5):136673.10.1016/j.molimm.2007.09.001

  • 22

    VenetSRavnUBuatoisVGueneauFCalloudSKosco-VilboisMet alTransferring the characteristics of naturally occurring and biased antibody repertoires to human antibody libraries by trapping CDRH3 sequences. PLoS One (2012) 7(8):e43471.10.1371/journal.pone.0043471

  • 23

    DaiMTemirovJPesaventoEKissCVelappanNPavlikPet alUsing T7 phage display to select GFP-based binders. Protein Eng Des Sel (2008) 21(7):41324.10.1093/protein/gzn016

  • 24

    LeviMSallbergMRudenUHerlynDMaruyamaHWigzellHet alA complementarity-determining region synthetic peptide acts as a miniantibody and neutralizes human immunodeficiency virus type 1 in vitro. Proc Natl Acad Sci U S A (1993) 90(10):43748.10.1073/pnas.90.10.4374

  • 25

    BourgeoisCBourJBAhoLSPothierP. Prophylactic administration of a complementarity-determining region derived from a neutralizing monoclonal antibody is effective against respiratory syncytial virus infection in BALB/c mice. J Virol (1998) 72(1):80710.

  • 26

    ChevigneAFischerAMathuJCounsonMBeaupainNPlesseriaJMet alSelection of a CXCR4 antagonist from a human heavy chain CDR3-derived phage library. FEBS J (2011) 278(16):286778.10.1111/j.1742-4658.2011.08208.x

  • 27

    NicaiseMValerio-LepiniecMMinardPDesmadrilM. Affinity transfer by CDR grafting on a nonimmunoglobulin scaffold. Protein Sci (2004) 13(7):188291.10.1110/ps.03540504

  • 28

    SmithJWTachiasKMadisonEL. Protein loop grafting to construct a variant of tissue-type plasminogen activator that binds platelet integrin alpha IIb beta 3. J Biol Chem (1995) 270(51):3048690.10.1074/jbc.270.51.30486

  • 29

    PerssonHYeWWernimontAAdamsJJKoideAKoideSet alCDR-H3 diversity is not required for antigen recognition by synthetic antibodies. J Mol Biol (2013) 425(4):80311.10.1016/j.jmb.2012.11.037

  • 30

    YangWPGreenKPinz-SweeneySBrionesATBurtonDRBarbasCFIII. CDR walking mutagenesis for the affinity maturation of a potent human anti-HIV-1 antibody into the picomolar range. J Mol Biol (1995) 254(3):392403.10.1006/jmbi.1995.0626

  • 31

    SchierRMcCallAAdamsGPMarshallKWMerrittHYimMet alIsolation of picomolar affinity anti-c-erbB-2 single-chain Fv by molecular evolution of the complementarity determining regions in the center of the antibody binding site. J Mol Biol (1996) 263(4):55167.10.1006/jmbi.1996.0598

  • 32

    ChothiaCLeskAM. Canonical structures for the hypervariable regions of immunoglobulins. J Mol Biol (1987) 196(4):90117.10.1016/0022-2836(87)90412-8

  • 33

    ChothiaCLeskAMGherardiETomlinsonIMWalterGMarksJDet alStructural repertoire of the human VH segments. J Mol Biol (1992) 227(3):799817.10.1016/0022-2836(92)90224-8

  • 34

    ChothiaCLeskAMTramontanoALevittMSmith-GillSJAirGet alConformations of immunoglobulin hypervariable regions. Nature (1989) 342(6252):87783.10.1038/342877a0

  • 35

    NorthBLehmannADunbrackRLJr. A new clustering of antibody CDR loop conformations. J Mol Biol (2011) 406(2):22856.10.1016/j.jmb.2010.10.030

  • 36

    JamesLCRoversiPTawfikDS. Antibody multispecificity mediated by conformational diversity. Science (2003) 299(5611):13627.10.1126/science.1079731

  • 37

    TeplyakovAObmolovaGMaliaTJLuoJMuzammilSSweetRet alStructural diversity in a human antibody germline library. MAbs (2016) 8(6):104563.10.1080/19420862.2016.1190060

  • 38

    KabatEAWuTT. Identical V region amino acid sequences and segments of sequences in antibodies of different specificities. Relative contributions of VH and VL genes, minigenes, and complementarity-determining regions to binding of antibody-combining sites. J Immunol (1991) 147(5):170919.

  • 39

    DeKoskyBJKojimaTRodinACharabWIppolitoGCEllingtonADet alIn-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire. Nat Med (2014) 21(1):8691.10.1038/nm.3743

  • 40

    ParameswaranPLiuYRoskinKMJacksonKKDixitVPLeeJYet alConvergent antibody signatures in human dengue. Cell Host Microbe (2013) 13(6):691700.10.1016/j.chom.2013.05.008

  • 41

    LasersonUVigneaultFGadala-MariaDYaariGUdumanMVander HeidenJAet alHigh-resolution antibody dynamics of vaccine-induced immune responses. Proc Natl Acad Sci U S A (2014) 111(13):492833.10.1073/pnas.1323862111

  • 42

    LuDRTanYCKongpachithSCaiXSteinEALindstromTMet alIdentifying functional anti-Staphylococcus aureus antibodies by sequencing antibody repertoires of patient plasmablasts. Clin Immunol (2014) 152(1–2):7789.10.1016/j.clim.2014.02.010

  • 43

    TanYCBlumLKKongpachithSJuCHCaiXLindstromTMet alHigh-throughput sequencing of natively paired antibody chains provides evidence for original antigenic sin shaping the antibody response to influenza vaccination. Clin Immunol (2014) 151(1):5565.10.1016/j.clim.2013.12.008

  • 44

    GlanvilleJD’AngeloSKhanTAReddySTNaranjoLFerraraFet alDeep sequencing in library selection projects: what insight does it bring?Curr Opin Struct Biol (2015) 33:14660.10.1016/j.sbi.2015.09.001

  • 45

    FerraraFD’AngeloSGaiottoTNaranjoLTianHGraslundSet alRecombinant renewable polyclonal antibodies. MAbs (2015) 7(1):3241.10.4161/19420862.2015.989047

  • 46

    FerraraFNaranjoLAD’AngeloSKissCBradburyAR. Specific binder for lightning-link(R) biotinylated proteins from an antibody phage library. J Immunol Methods (2013) 395(1–2):837.10.1016/j.jim.2013.06.010

  • 47

    RavnUDidelotGVenetSNgKTGueneauFRousseauFet alDeep sequencing of phage display libraries to support antibody discovery. Methods (2013) 60(1):99110.10.1016/j.ymeth.2013.03.001

  • 48

    RavnUGueneauFBaerlocherLOsterasMDesmursMMalingePet alBy-passing in vitro screening – next generation sequencing technologies applied to antibody display and in silico candidate selection. Nucleic Acids Res (2010) 38(21):e193.10.1093/nar/gkq789

  • 49

    LovgrenJPursiheimoJPPyykkoMSalmiJLamminmakiU. Next generation sequencing of all variable loops of synthetic single framework scFv-application in anti-HDL antibody selections. N Biotechnol (2016) 33(6):7906.10.1016/j.nbt.2016.07.009

  • 50

    D’AngeloSKumarSNaranjoLFerraraFKissCBradburyAR. From deep sequencing to actual clones. Protein Eng Des Sel (2014) 27(10):3017.10.1093/protein/gzu032

  • 51

    FerraraFKimCYNaranjoLABradburyAR. Large scale production of phage antibody libraries using a bioreactor. MAbs (2015) 7(1):2631.10.4161/19420862.2015.989034

  • 52

    CloseDWFerraraFDichosaAEKumarSDaughtonARDaligaultHEet alUsing phage display selected antibodies to dissect microbiomes for complete de novo genome sequencing of low abundance microbes. BMC Microbiol (2013) 13:270.10.1186/1471-2180-13-270

  • 53

    FerraraFNaranjoLAKumarSGaiottoTMukundanHSwansonBet alUsing phage and yeast display to select hundreds of monoclonal antibodies: application to antigen 85, a tuberculosis biomarker. PLoS One (2012) 7(11):e49535.10.1371/journal.pone.0049535

  • 54

    LilloAMAyrissJEShouYGravesSWBradburyARPavlikP. Development of phage-based single chain Fv antibody reagents for detection of Yersinia pestis. PLoS One (2011) 6(12):e27756.10.1371/journal.pone.0027756

  • 55

    VelappanNMartinezJSValeroRChasteenLPonceLBondu-HawkinsVet alSelection and characterization of scFv antibodies against the Sin Nombre hantavirus nucleocapsid protein. J Immunol Methods (2007) 321(1–2):609.10.1016/j.jim.2007.01.011

  • 56

    AyrissJWoodsTBradburyAPavlikP. High-throughput screening of single-chain antibodies using multiplexed flow cytometry. J Proteome Res (2007) 6(3):107282.10.1021/pr0604108

  • 57

    KehoeJWVelappanNWalboltMRasmussenJKingDLouJet alUsing phage display to select antibodies recognizing post-translational modifications independently of sequence context. Mol Cell Proteomics (2006) 5(12):235063.10.1074/mcp.M600314-MCP200

  • 58

    LouJMarzariRVerzilloVFerreroFPakDShengMet alAntibodies in haystacks: how selection strategy influences the outcome of selection from molecular diversity libraries. J Immunol Methods (2001) 253(1–2):23342.10.1016/S0022-1759(01)00385-4

  • 59

    SblatteroDBradburyA. Exploiting recombination in single bacteria to make large phage antibody libraries. Nat Biotechnol (2000) 18(1):7580.10.1038/71958

  • 60

    SblatteroDLouJMarzariRBradburyA. In vivo recombination as a tool to generate molecular diversity in phage antibody libraries. J Biotechnol (2001) 74(4):30315.10.1016/S1389-0352(01)00022-8

  • 61

    D’AngeloSGlanvilleJFerraraFNaranjoLGleasnerCDShenXet alThe antibody mining toolbox: an open source tool for the rapid analysis of antibody repertoires. MAbs (2014) 6(1):16072.10.4161/mabs.27105

  • 62

    ColbyDWKelloggBAGraffCPYeungYASwersJSWittrupKD. Engineering antibody affinity by yeast surface display. Methods Enzymol (2004) 388:34858.10.1016/S0076-6879(04)88027-3

  • 63

    DeKoskyBJLunguOIParkDJohnsonELCharabWChrysostomouCet alLarge-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires. Proc Natl Acad Sci U S A (2016) 113(19):E263645.10.1073/pnas.1525510113

  • 64

    DeWittWSLindauPSnyderTMSherwoodAMVignaliMCarlsonCSet alA public database of memory and naive B-cell receptor sequences. PLoS One (2016) 11(8):e0160853.10.1371/journal.pone.0160853

  • 65

    KhanTAFriedensohnSde VriesARStraszewskiJRuscheweyhHJReddyST. Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting. Sci Adv (2016) 2(3):e1501371.10.1126/sciadv.1501371

  • 66

    LavinderJJHortonAPGeorgiouGIppolitoGC. Next-generation sequencing and protein mass spectrometry for the comprehensive analysis of human cellular and serum antibody repertoires. Curr Opin Chem Biol (2015) 24:11220.10.1016/j.cbpa.2014.11.007

  • 67

    BusseCECzogielIBraunPArndtPFWardemannH. Single-cell based high-throughput sequencing of full-length immunoglobulin heavy and light chain genes. Eur J Immunol (2014) 44(2):597603.10.1002/eji.201343917

  • 68

    MathonetPUllmanCG. The application of next generation sequencing to the understanding of antibody repertoires. Front Immunol (2013) 4:265.10.3389/fimmu.2013.00265

  • 69

    KaplinskyJLiASunACoffreMKoralovSBArnaoutR. Antibody repertoire deep sequencing reveals antigen-independent selection in maturing B cells. Proc Natl Acad Sci U S A (2014) 111(25):E26229.10.1073/pnas.1403278111

  • 70

    KaplinskyJArnaoutR. Robust estimates of overall immune-repertoire diversity from high-throughput measurements on samples. Nat Commun (2016) 7:11881.10.1038/ncomms11881

  • 71

    ArnaoutRLeeWCahillPHonanTSparrowTWeiandMet alHigh-resolution description of antibody heavy-chain repertoires in humans. PLoS One (2011) 6(8):e22365.10.1371/journal.pone.0022365

  • 72

    TsiorisKGuptaNTOgunniyiAOZimniskyRMQianFYaoYet alNeutralizing antibodies against West Nile virus identified directly from human B cells by single-cell analysis and next generation sequencing. Integr Biol (Camb) (2015) 7(12):158797.10.1039/c5ib00169b

  • 73

    JacksonKJLiuYRoskinKMGlanvilleJHohRASeoKet alHuman responses to influenza vaccination show seroconversion signatures and convergent antibody rearrangements. Cell Host Microbe (2014) 16(1):10514.10.1016/j.chom.2014.05.013

  • 74

    JiangNHeJWeinsteinJAPenlandLSasakiSHeXSet alLineage structure of the human antibody repertoire in response to influenza vaccination. Sci Transl Med (2013) 5(171):171ra19.10.1126/scitranslmed.3004794

  • 75

    LavinderJJWineYGieseckeCIppolitoGCHortonAPLunguOIet alIdentification and characterization of the constituent human serum antibodies elicited by vaccination. Proc Natl Acad Sci U S A (2014) 111(6):225964.10.1073/pnas.1317793111

  • 76

    GreiffVMenzelUHaesslerUCookSCFriedensohnSKhanTAet alQuantitative assessment of the robustness of next-generation sequencing of antibody variable gene repertoires from immunized mice. BMC Immunol (2014) 15:40.10.1186/s12865-014-0040-5

  • 77

    SnirOMesinLGidoniMLundinKEYaariGSollidLM. Analysis of celiac disease autoreactive gut plasma cells and their corresponding memory compartment in peripheral blood using high-throughput sequencing. J Immunol (2015) 194(12):570312.10.4049/jimmunol.1402611

  • 78

    DeFalcoJHarbellMManning-BogABaiaGScholzAMillareBet alNon-progressing cancer patients have persistent B cell responses expressing shared antibody paratopes that target public tumor antigens. Clin Immunol (2017).10.1016/j.clim.2017.10.002

  • 79

    GlanvilleJZhaiWBerkaJTelmanDHuertaGMehtaGRet alPrecise determination of the diversity of a combinatorial antibody library gives insight into the human immunoglobulin repertoire. Proc Natl Acad Sci U S A (2009) 106(48):2021621.10.1073/pnas.0909775106

  • 80

    FantiniMPandolfiniLLisiSChirichellaMArisiITerrignoMet alAssessment of antibody library diversity through next generation sequencing and technical error compensation. PLoS One (2017) 12(5):e0177574.10.1371/journal.pone.0177574

  • 81

    EdwardsBMBarashSCMainSHChoiGHMinterRUllrichSet alThe remarkable flexibility of the human antibody repertoire; isolation of over one thousand different antibodies to a single protein, BLyS. J Mol Biol (2003) 334(1):10318.10.1016/j.jmb.2003.09.054

  • 82

    MarksJDHoogenboomHRBonnertTPMcCaffertyJGriffithsADWinterG. By-passing immunization. Human antibodies from V-gene libraries displayed on phage. J Mol Biol (1991) 222(3):58197.10.1016/0022-2836(91)90498-U

  • 83

    GramHMarconiLABarbasCFIIIColletTALernerRAKangAS. In vitro selection and affinity maturation of antibodies from a naive combinatorial immunoglobulin library. Proc Natl Acad Sci U S A (1992) 89(8):357680.10.1073/pnas.89.8.3576

  • 84

    GriffithsADWilliamsSCHartleyOTomlinsonIMWaterhousePCrosbyWLet alIsolation of high affinity human antibodies directly from large synthetic repertoires. EMBO J (1994) 13(14):324560.

  • 85

    PerelsonASOsterGF. Theoretical studies of clonal selection: minimal antibody repertoire size and reliability of self-non-self discrimination. J Theor Biol (1979) 81(4):64570.10.1016/0022-5193(79)90275-3

  • 86

    BoderETWittrupKD. Yeast surface display for directed evolution of protein expression, affinity, and stability. Methods Enzymol (2000) 328:43044.10.1016/S0076-6879(00)28410-3

  • 87

    BrochetXLefrancMPGiudicelliV. IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis. Nucleic Acids Res (2008) 36(Web Server issue):W5038.10.1093/nar/gkn316

  • 88

    YeJMaNMaddenTLOstellJM. IgBLAST: an immunoglobulin variable domain sequence analysis tool. Nucleic Acids Res (2013) 41(Web Server issue):W3440.10.1093/nar/gkt382

  • 89

    IppolitoGCHoiKHReddySTCarrollSMGeXRogoschTet alAntibody repertoires in humanized NOD-scid-IL2Rgamma(null) mice and human B cells reveals human-like diversification and tolerance checkpoints in the mouse. PLoS One (2012) 7(4):e35497.10.1371/journal.pone.0035497

  • 90

    GiudicelliVDurouxPGinestouxCFolchGJabado-MichaloudJChaumeDet alIMGT/LIGM-DB, the IMGT comprehensive database of immunoglobulin and T cell receptor nucleotide sequences. Nucleic Acids Res (2006) 34(Database issue):D7814.10.1093/nar/gkj088

Summary

Keywords

heavy-chain complementarity-determining region 3, single-chain Fv display, binding specificity, rearrangement, inverse PCR

Citation

D’Angelo S, Ferrara F, Naranjo L, Erasmus MF, Hraber P and Bradbury ARM (2018) Many Routes to an Antibody Heavy-Chain CDR3: Necessary, Yet Insufficient, for Specific Binding. Front. Immunol. 9:395. doi: 10.3389/fimmu.2018.00395

Received

01 December 2017

Accepted

13 February 2018

Published

08 March 2018

Volume

9 - 2018

Edited by

Jacob Glanville, Distributed Bio, United States

Reviewed by

Mepur Hanumantha-Rao Ravindranath, Terasaki Foundation, United States; Ramya Yarlagadda, Intrexon, United States

Updates

Copyright

*Correspondence: Andrew R. M. Bradbury,

These authors have contributed equally to this work.

Specialty section: This article was submitted to B Cell Biology, a section of the journal Frontiers in Immunology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics