Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 09 January 2026

Sec. Comparative Immunology

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1706720

LILRB1 and LILRB2 genomics and transcriptomics in macaque and baboon species: polymorphism, diversification, and extensive alternative splicing

Natasja G. de Groot*Natasja G. de Groot*Nanine de GrootNanine de GrootCorrine M. C. HeijmansCorrine M. C. HeijmansAnnemiek J. M. de Vos-RouwelerAnnemiek J. M. de Vos-RouwelerMarit K. H. van der WielMarit K. H. van der WielJesse BruijnesteijnJesse Bruijnesteijn
  • Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, Netherlands

Introduction: Inhibitory receptors play a pivotal role in fine-tuning immune responses. The leukocyte receptor complex (LRC) encodes multiple receptor families, including the leukocyte immunoglobulin-like receptor (LILR) family, which next to activating receptors involves several inhibitory receptors. The LILRB1 and LILRB2 receptors are considered immune checkpoint inhibitors, which may interact with MHC class I molecules, and are expressed mainly on monocytes, B- and T-cells.

Method: In this study, we characterized LILRB1 and LILRB2 at the genomic and transcriptomic level in three Old World monkey species, namely rhesus and long-tailed macaques and Hamadryas baboon, using SMRT sequencing on PacBio platforms.

Result and discussion: We describe 71 LILRB1 and 58 LILRB2 alleles in the two macaque species, of which only one allele was previously published. In contrast, less polymorphism is observed in the Hamadryas baboon, with only six LILRB1 and seven LILRB2 alleles characterized. Phylogenetic analysis, including known human data, revealed extensive diversification of the LILRB1 and LILRB2 in macaques, with allelic variation clustering into nine and twelve distinct lineages, respectively. This contrasts with the more conserved repertoires observed in humans and Hamadryas baboons. Compared with our experience analyzing MHC and KIR transcriptome data, the LILRB1 and LILRB2 transcriptomes were dominated by alternatively spliced isoforms. Alternative 3’ splice sites near exons 10 and 15 and/or skipping of exon 15, were encountered for most LILRB1 alleles. In LILRB2, the deletion of exon 9 is the most prominent event, next to deletion of exon 10 and the use of alternative 3’ splice sites near exons 10 and 15. The exons that encode the extracellular domains remain largely intact, suggesting that alternative splicing predominantly affects the stem region and the signaling capacity of the LILRB1 and LILRB2 receptors.

Introduction

The leukocyte immunoglobulin-like receptor (LILR) gene family is located on chromosome 19 within the leucocyte receptor complex (LRC), adjacent to the killer-cell immunoglobulin-like (KIR) region (1). The LILR gene cluster is organized into a centromeric and telomeric part separated by a highly conserved central region (1, 2). In humans, the LILR family comprises eleven functional genes, which encode for five activating (LILRA) and five inhibitory receptors (LILRB), and a soluble receptor, LILRA3 (3). In addition, two pseudogenes are present in this region. LILR receptors are involved in an array of functions, such as maintaining immune tolerance, inflammation, hematopoietic differentiation, and neural processes (3).

The LILR gene system appears to be an old entity, predating the emergence and radiation of the mammalian lineage roughly 100 million years ago (4, 5). In addition to humans, this system has been extensively studied in mice, where it is referred to as the paired immunoglobulin-like receptor (PIR) (68). In non-human primates (NHP), however, only limited data are available on LILR gene characterization. One of the better studied species is the chimpanzee (Pan troglodytes), which shared a common ancestor with humans approximately 5 to 6 million years ago. Nine different genes have been identified in chimpanzees, four of which are orthologous to human LILR genes, including LILRB1 and LILRB2 (9). A study in the rhesus macaque (Macaca mulatta, Mamu), which shared a common ancestor with humans approximately 25 to 33 million years ago (10, 11), highlighted that this species expresses various LILR receptors with similar structural features to their human counterparts (12). Previously, we published a thorough comparative analysis of the genomic LILR region across humans and different NHP species using available reference genome assemblies. This study demonstrated that the region has a highly conserved organization and revealed the presence of orthologs of the human LILRB1 and LILRB2 genes in apes, as well as in Old and New World monkey species (5).

The human LILRB1 and LILRB2 gene products, also known as LIR-1/ILT2/MIR-7/CD85j and LIR-2/ILT4/MIR-10/CD85d, respectively, each comprise of four Ig-like domains (D1-D4), stabilized by conserved disulfide bonds. Both proteins also contain a transmembrane region (TM) and a cytoplasmic tail (Cyt) that carries three to four immunoreceptor tyrosine-based inhibitory motifs (ITIM) (13). These ITIMs can recruit Src homology region 2 domain-containing phosphatase-1 (SHP-1), which is essential for inhibitory downstream signaling (1416). The extracellular Ig-like domains can recognize human leucocyte antigen (HLA) class I molecules through the simultaneous interaction of domains D1 and D2 with the highly conserved α3 domain of HLA class I and the conserved β2-microglobulin (β2m) subunit, respectively (17). LILRB2 gene products can also interact with β2m-free HLA molecules (18). Similarly, in mice, PIR gene products are shown to interact with major histocompatibility complex (MHC) class I molecules (68). Beyond HLA interaction, LILRB1 may engage with human cytomegalovirus HLA class I homolog (UL18) (14, 19), dengue virus (20), the damage-associated molecular pattern protein S100A9 (21), and plasmodium falciparum-expressed RIFIN proteins (22). LILRB2, on the other hand, can bind to CD1d (19), β-amyloid (23), and RIFIN (24). For example, the interaction between LILRB2 and β-amyloid may enhance the development of Alzheimer’s disease (AD) (23) and has been explored as a potential therapeutic target for treating AD (25, 26). Collectively, the diverse ligand binding profiles of LILRB1 and LILRB2 underlie their broad immunomodulatory functions.

Macaque species, such as the rhesus and long-tailed macaque (Macaca fascicularis, Mafa), are widely applied as model in translational research to study various aspects of human infectious diseases, including AIDS, COVID-19, Malaria and tuberculosis (2730) as well as neurodegenerative diseases like AD and Parkinson’s disease (31, 32). Furthermore, naturally occurring cancers in macaques may provide valuable models for the development of human cancer immunotherapies (33). The interpretation of findings from such studies would benefit from detailed knowledge of the immune system of the model species. Given the roles of LILRB1 and LILRB2 as immune checkpoint receptors and as important ligands for HLA class I molecules, a comprehensive characterization of these genes in species applied in pre-clinical research is essential. Here, we present an in-depth characterization of LILRB1 and LILRB2 gene polymorphism at both the genomic and transcriptomic levels in rhesus (n=78) and long-tailed macaques (n=70). A smaller cohort of Hamadryas baboons (Papio hamadryas, Paha, n=29), was included for comparison. Like macaques, baboons are Old World monkeys (OWM), but they are native to Africa. They are also widely applied as models in various types of biomedical studies (3436). For the analysis, we employed Single-Molecule Real-Time (SMRT) sequencing on Pacific Bioscience’s (PacBio) platforms. Previously, this method has been successfully applied by our team to characterize KIR transcriptional profiles in humans, chimpanzees, rhesus and long-tailed macaque families, as well as to investigate MHC allelic variation in macaques (3741). In addition to uncovering extensive, previously unrecognized polymorphisms, we observed broad diversification of LILRB1 and LILRB2 alleles in rhesus and long-tailed macaques. Hamadryas baboons, however, exhibited a more conserved repertoire, similar to that observed in humans. LILRB1 and LILRB2 transcripts also display alternative splicing across all three non-human primate species, primarily affecting the exons encoding the stem and the signaling domains.

Materials and methods

Sample collection

The AAALAC-accredited facilities of the Biomedical Primate Research Centre (BPRC) house self-sustaining, pedigreed breeding colonies of rhesus and long-tailed macaques. These colonies are organized into various naturalistic social groups, typically comprising multiple matrilines, with adult females and their offspring, accompanied by a single non-natal adult male. From the population of Indian-origin rhesus macaques, 44 individuals were included in the study, comprising 37 animals from four families and seven unrelated males and females (Supplementary Table 1, Supplementary Figure 1). In addition, samples from 17 Burmese-origin rhesus macaques (three families and three unrelated males and females) and 17 Chinese-origin (three families and seven unrelated individuals) from the BPRC biobank were selected for inclusion (Supplementary Table 1, Supplementary Figure 1). The BPRCs long-tailed macaque colony represents a mixed population based on geographic origin, comprising animals from regions both north and south of the isthmus of Kra, and from the Indonesian and Malaysian islands (40, 42). From this colony, 70 animals were selected, comprising eight families and 9 unrelated males and females (Supplementary Table 1, Supplementary Figure 1).

Twenty-nine Hamadryas baboon samples, comprising 24 individuals from five families and five unrelated individuals, were selected from our biobank for inclusion in the study (Supplementary Table 1, Supplementary Figure 1). These samples originate from a cohort of Hamadryas baboons living at WILDLANDS Adventure Zoo in Emmen, The Netherlands.

Four anonymized, family-related human PBMC samples collected in the early 1980s, originating from a hematologic patient whose family had been typed, were kindly provided by the Immunohematology and Blood Transfusion department of the Leiden University Medical Centre. These samples were used to characterize LILRB1 transcriptome polymorphism.

Genomic DNA isolation, RNA extraction and cDNA synthesis

During regular health checks, heparin and/or ethylenediaminetetraacetic acid (EDTA) samples were obtained from the BPRC-housed rhesus and long-tailed macaques and served as a source for DNA and RNA. Genomic DNA (gDNA) was extracted from EDTA blood or isolated PBMCs using a standard salting-out procedure. RNA was extracted from EDTA blood using the RNeasy mini kit (Qiagen, Hilden, DE) and was used as input for the synthesis of cDNA by the RevertAid first strand cDNA synthesis kit (Invitrogen, Carlsbad (CA), USA) according to manufacturer’s instructions.

LILRB1 and LILRB2 transcriptome amplification and SMRT sequencing on PacBio platform

To obtain LILRB1 and LILRB2 amplicons, gene-specific primers were designed (Supplementary Table 2). The primer set Mamu-LILRB1-Fw1-Rv1 was used for all rhesus macaque samples, and the products amplified with this set resulted in LILRB1 transcripts of 2077 base pair (bp) in length, spanning from exon 4 to the 3’UTR. The primer set Mamu-LILRB1-Fw2-Rv2 was used for a subset of rhesus macaque samples to generate LILRB1 transcripts of 2533 bp in length, spanning from exon 3 to the 3’UTR. For the long-tailed macaques, one forward (Fw) and one reverse (Rv) primer were designed, both located within the transcript (Fw in exon 3 and Rv in exon 16), resulting in the amplification of LILRB1 from exon 4 to partially exon 16 (108 out of the 147 bp of exon 16) and a total product size of 1761 bp. This latter primer set was also used to amplify LILRB1 for the Hamadryas baboon samples. To amplify LILRB1 transcripts in the human samples, the primer set Hosa-LILRB1-Fw-Rv was used, generating a product of 2405 bp. A generic primer set was designed to amplify LILRB2 transcripts for rhesus macaque, long-tailed macaque, and Hamadryas baboon samples, spanning 1788 bp from exon 4 to the 3’UTR. The PCR reactions (50 μl) contained 5 μl of cDNA, 1x Phusion HF buffer, 0.2 mM dNTPs, 0.5 μM of the forward and reverse primer, 3% DMSO, and 0.02 U/μl Phusion hot start II DNA polymerase (Thermo Fisher Scientific, Waltham (MA), USA). Each primer was tagged at the 5’end with a unique 16 bp barcode (www.pacb.com) to allow identification of the pooled amplicons from the different samples after SMRT sequencing on a PacBio platform. For amplification of LILRB1 and LILRB2, thermal cycling conditions started with a denaturation step at 98°C for 2 min, followed by 35 cycles of 98°C for 10 s, 60°C or 62°C for 40 s (annealing temperature applied is specified in Supplementary Table 2), 72°C for 40 s, and a final extension at 72°C for 10 min. The amplification of human LILRB1 started with a denaturation step at 98°C for 2 min, followed by 35 cycles of 98°C for 10 s, 60°C for 40 s, 72°C for 180 s, and a final extension at 72°C for 10 min. Gel electrophoresis was used to size select PCR products (Supplementary Figure 2), and amplicons were purified using GeneJet gel extraction kit (Invitrogen, Carlsbad (CA), USA) according to manufacturer’s instructions. DNA concentrations of individual amplicons were quantified using the Qubit dsDNA HS assay kit and Qubit 2.0 Fluorometer (Thermo Fisher Scientific, Waltham (MA), USA). Amplicons were then pooled in equimolar amounts to a total yield of 3 μg of DNA. The pooled samples were purified twice using AMPure XP beads (Beckman Coulter, Brea (CA), USA) at a 1:1 bead to DNA volume ratio according to manufacturer’s instructions. After purification, the DNA of the pooled samples was re-measured to ensure a total yield of >1 μg total DNA. The PacBio SMRTbell libraries were generated according to Pacific Biosciences “Procedure & Checklist – Amplicon Template Preparation and Sequencing” and sequencing was performed at the Leiden Genome Technology Centre using a PacBio Sequel I (P6-C4 sequencing chemistry) or Sequel II (sequencing kit versions 2.0 and 2.1) system (39). Sequence data collection was performed with a 10-hour (Sequel I) or 20-24-hour (Sequel II) movie time window to obtain sufficient yields of high-quality circular consensus reads. In total, ten runs were performed on a Sequel I and eleven on a Sequel II system to characterize and confirm the various LILRB1 and LILRB2 alleles at the transcription level in the human, two macaque, and Hamadryas baboon samples (Supplementary Table 3). The average number of reads per sample varied from 2736 to 29543. Two Sequel I runs contained only four samples, and the average read count was 28074 and 34581 (Supplementary Table 3).

LILRB1 and LILRB2 transcriptome data analysis

Circular consensus sequences (CCS) were filtered for high read quality (value of 0.99 or higher) and demultiplexed based on unique barcodes. The “Map to reference” function (setting Geneious mapper) in the Geneious Prime version 2025.0.3 was used to align the CCS reads to either a LILRB1 or LILRB2 reference library. Reads with a 100% match were identified using the following settings: fine tuning iterate up to 5 times, minimum overlap = 400, maximum mismatches per read 0, 100% minimum overlap identity, maximum ambiguity = 1. Reference libraries were established per species. The LILRB1 reference libraries consisted of previously reported transcripts for humans and rhesus macaques (12, 43), supplemented with directly submitted as well as predicted reference LILRB1 sequences for humans, rhesus and long-tailed macaques (Supplementary Table 4). The construction of the LILRB2 reference library followed a different strategy. Through comparative analyses of genomic LILR organizations (5), we identified LOC102143922, designated Mafa-LILRA3 in the Gene database (https://www.ncbi.nlm.nih.gov/genbank/), as Mafa-LILRB2. In addition, among the reads amplified with the LILRB1 primers, a group of deviating alleles was identified, which appeared to cluster phylogenetically with LOC102143922. These alleles, along with the sequence of LOC102143922, were used as the reference library to further characterize LILRB2 alleles in macaque species. For baboons, comparative analyses showed that LOC101022334 (Paan-LILRA3 in the Gene database) and LOC116271753 (Paan-LILRB1 in the Gene database) are identified as Paan-LILRB1 and Paan-LILRB2, respectively. Following the initial analyses, additional reference libraries were constructed for human, rhesus and long-tailed macaques, and Hamadryas baboons incorporating alternatively spliced LILRB1 and LILRB2 transcripts based on data generated in this study. Unused reads were grouped into contigs based on similarity using Geneious software, and the consensus sequence of each contig was compared with the LILRB1 and LILRB2 transcript reference library and analyzed through phylogenetic analysis. The novel sequences/alleles (cutoff value of three reads per contig) were confirmed by identification in two independent PCRs and PacBio runs and/or identified in at least two individuals.

LILRB1 and LILRB2 gDNA amplification, ONT and PacBio SMRT sequencing, and data analysis

Representative rhesus and long-tailed macaque families (14 and 7 individuals, respectively, see Supplementary Table 1), which were also included in the transcriptome analysis, were selected for LILRB1 characterization at the gDNA level using long-read sequencing on an ONT platform. The PCR reactions (50 μl) to amplify genomic LILRB1 was performed with 250 ng gDNA (50 ng/μl) as input, 1x Phusion HF buffer, 0.2 mM dNTPs, 0.5 μM of each forward and reverse primer (Supplementary Table 2), 3% DMSO, and 0.02 U/μl Phusion hot start II DNA polymerase. This primer set (Fw-Rv) amplified a 5787 bp product that spans from intron 3 to part of exon 16 (107 out of the 147 bp of exon 16). The thermal cycling conditions consisted of a denaturation step at 98°C for 2 min, followed by 35 cycles of 98°C for 10 s, 62°C for 30 s, 72°C for 4 min, and a final extension at 72°C for 10 min. Gel electrophoresis and extraction were performed as described above. Individual amplicons were barcoded using the ONT Native Barcoding Expansion kit (EXP-NBD103, Oxford Nanopore Technologies Ltd, Oxford, UK), and pooled equimolar (12 samples per pool). ONT sequencing adapters were subsequently ligated to the pooled barcoded amplicons using the Ligation Sequencing kit (SQK-LSK109, Oxford Nanopore Technologies Ltd) according to the ONT Amplicons by Ligation protocol (version protocol NBA_9093_v109_revF_12Nov2019; ligation kit LQK-LSK109). Library clean-up was performed with AMPure XP beads and ONT Long Fragment Buffer to remove ligation enzymes and to enrich for >3 kb transcripts. A MinION R9.4.1 flow cell (Oxford Nanopore Technologies Ltd) was primed using the Flow Cell Priming kit (EXP-FLP002) prior to sequencing. Flow cells were run for up to 24 hours. The reads were demultiplexed and base called using Guppy V3.4.1 software and analyzed with Geneious.

In addition, gDNA sequences of the LILRB1 and LILRB2 alleles were characterized using SMRT sequencing on a PacBio platform. PCR reactions (50 μl) were performed using 250 ng of gDNA (50 ng/μl) as input, 1x Phusion HF buffer, 0.2 mM dNTPs, 0.5 μM of each forward and reverse primer (Supplementary Table 2), 3% DMSO, and 0.02 U/μl Phusion hot start II DNA polymerase. For LILRB1, two sets of primers were used, the first set amplified a 5787 bp product as described above and the second set (Fw3-Rv3) a 5739 bp product that spans intron 3 through part of exon 16 (45 out of the 147 bp of exon 16). For LILRB2, the primers amplified a 5887 bp product that spans from the start codon located at the end of exon 3 to the 3’UTR. The primer sets were barcoded as described for the cDNA amplification. The thermal cycling conditions consisted of a denaturation step at 98°C for 2 min, followed by 35 cycles of 98°C for 10 s, 60 or 62°C for 40 s (Supplementary Table 2), 72°C for 6 min, and a final extension at 72°C for 10 min. Gel electrophoresis and amplicon extraction, pooling and sequencing of the samples were performed similar as described for the cDNA samples. Twelve runs were conducted on the Sequel II system to characterize and confirm the LILRB1 and LILRB2 alleles at the gDNA level (Supplementary Table 3). Data analyses were performed as previously detailed, using reference libraries supplemented with gDNA sequences extracted from the reference genomes of rhesus and long tailed macaque and subsequently expanded with the newly characterized alleles. Novel sequences/alleles were confirmed by identification in two independent PCRs and PacBio runs and/or by their detection in at least two individuals. All genomic LILRB1 and LILRB2 sequences characterized in this study have been submitted to the ENA-EMBL database under project number PRJEB86133 and received individual accession numbers (Supplementary Table 5). In addition, seven alleles were identified only at the transcriptional level and were submitted under the same project number.

Confirmation of 21bp insert exon 9 by Sanger sequencing

PCR (50 μl) amplification to confirm the presence of the 21 bp insert in specific alleles/lineages was performed on cDNA (5 μl) with the primer set LILR-ex7-Fw and LILR-ex7-Rv (Supplementary Table 2) and using Phusion hot start II DNA polymerase. The thermal cycling conditions consisted of a denaturation step at 98°C for 2 min, followed by 32 cycles of 98°C for 10 s, 62°C for 15 s, 72°C for 2 min, and a final extension at 72°C for 10 min. Gel electrophoresis and amplicon extraction were performed as described above. Samples were directly sequenced in house on an ABI 3130xl genetic analyzer (Applied Biosystems, Foster City, CA) using the ABI Prism BigDye Terminator v3.1 Cycle sequencing mixture (Applied Biosystems). The data were analyzed using MacVector version 18.7.6 (MacVector, Inc Cambridge, UK).

Phylogenetic analysis

Phylogenetic analysis was performed in Geneious prime (version 2025.0.3) using the neighbor-joining method (44), and two models for computing the evolutionary distances: Jukes-Cantor (45) and Tamura-Nei (46). Both genetic distance models produced trees with similar topologies. Subsequently, a neighbor-joining tree was constructed with MEGA 11 application (47) and the evolutionary distances were computed using the Nei-Gojobori (Jukes-Cantor) method (48) and are in the units of the number of nonsynonymous substitutions per nonsynonymous site. Bootstrap values were calculated based on 1,000 replicates (49).

Notably, in rhesus macaques, LILRB1- and LILRB2-like molecules, designated as Mm-LILRBa and MmLILRBb, respectively, were identified based on the screening of cDNA libraries derived from different tissues (12). Phylogenetic comparisons of the sequences of Mm-LILRBa (DQ155431) and MmLILRBb (DQ155432) with the current dataset showed that both Mm-LILRBa and MmLILRBb cluster within the LILRB1 group, with the sequence of Mm-LILRBa being identical to Mamu-LILRB1*001:01, and MmLILRBb clustering with macaque LILRB1*002 lineage alleles.

LILRB1 and LILRB2 allele nomenclature in OWM species

Currently, no official allele nomenclature exists for LILRB1 and LILRB2. Therefore, we propose a LILRB1 and LILRB2 allele nomenclature system for OWM species, developed based on the framework established for MHC nomenclature in primate species (5052). To illustrate, we use the designation of the rhesus macaque allele Mamu-LILRB1*001:01:01:01 as an example. The four-letter prefix (in this case Mamu) defines the taxonomic group and is derived from the species’ scientific name: the first two letters from the genus (Macaca) and the last two letters from the species (mulatta). This is followed by a hyphen, the gene name (LILRB1 in this example), and an asterisk. After the asterisk, the first field consists of three digits and represents the lineage, defined as a group of alleles clustering together in phylogenetic analyses. The second, third, and fourth fields, each composed of two digits, denote the allelic variation, synonymous mutations within the coding region, and differences in the non-coding region, respectively. Colons are used to separate these fields. The suffix “Sp” is added when the allele described is a splice variant.

Results

Characterization of the LILRB1 and LILRB2 transcriptomes in rhesus macaques, long-tailed macaques, and Hamadryas baboons

The LILRB1 gene consists of 16 exons, and the corresponding mature mRNA is encoded by 14 of these exons. The start codon is located in exon 3, and the stop codon at the beginning of exon 16, after 147 bp (Figure 1) (53). For LILRB2, our sequencing results indicate a comparable exon-intron organization across the coding region of the gene. The four Ig-like domains (D1 to D4) of LILRB1 and LILRB2 are encoded by exons 5-8, while exons 9 and 10 encode the stem (spacer) region, exon 11 encodes the transmembrane segment, and exons 12–16 encode the cytoplasmatic tail.

Figure 1
Diagram showing gene structure and mature mRNA transcripts for LILRB1 and LILRB2. Both genes have exons labeled one through sixteen. The mature mRNA transcript includes domains labeled D1 to D4, Stem, TM, and Cyt. Arrows indicate the transcription process from DNA to mRNA.

Figure 1. Schematic organization of the LILRB1 and LILRB2 gene. The upper panels illustrate the exon-intron structure of each gene, with exons represented as numbered blocks and introns as horizontal lines. In LILRB1, the break marked by two vertical lines represents an approximately 13 kb stretch of DNA (53). In the publicly available genome assemblies, exon 1 and the adjacent 13 kb intron 1 region present in LILRB1 could not be clearly identified for LILRB2, and therefore only exon 2 through the 3’UTR region is shown. Exons representing the mature mRNA LILRB1 and LILRB2 coding sequences are indicated by gray-colored blocks. The structural features of the LILRB1 and LILRB2 molecules encoded by the various exons are labeled, with D1 to D4 representing the four extracellular domains, “TM” referring to the transmembrane region, and “Cyt.” referring to the cytoplasmic tail.

To characterize the polymorphism of LILRB1 and LILRB2 genes in macaques, samples were selected from both related and unrelated individuals within the pedigreed macaque cohorts housed at the BPRC (Supplementary Figure 1). Additional rhesus macaque samples of Burmese and Chinese origin were included. In total, 74 rhesus macaques (44 Indian, 15 Burmese, 15 Chinese), along with 62 long-tailed macaques, and 29 Hamadryas baboon samples were examined. Transcriptome analysis of the LILRB1 and LILRB2 revealed that most reads represented alternatively spliced transcripts, while only a minority corresponded to the mature mRNA. Overall, the transcriptome analyses resulted in the identification of 93 LILRB1 and LILRB2 alleles across the three species (Table 1, Supplementary Table 5). Of these, only one allele, Mamu-LILRB1*001:01, had been previously reported (12). Notably, all four baboon LILRB1 alleles, and several macaque LILRB1 alleles predominantly lacked exon 15 in their transcripts, most likely due to alternative splicing, and are therefore designated with the abbreviation Sp (Supplementary Table 5, alleles marked with a $ sign). Similarly, some LILRB2 alleles predominantly lacked exon 9 at the transcriptome level (Supplementary Table 5, alleles marked with a # sign). One baboon allele, Paha-LILRB2*013:01Sp, was identified as splice variant containing a three base pair insertion along with the deletion of exon 9. These findings highlight the challenges of fully characterizing LILRB1 and LILRB2 polymorphisms using transcriptomic data alone. To address this, selected animals from each cohort were further analyzed at the gDNA level.

Table 1
www.frontiersin.org

Table 1. Overview of the number of LILRB1 and LILRB2 alleles characterized in the rhesus macaque, long-tailed macaque, and Hamadryas baboon cohorts.

Genomic characterization of LILRB1 and LILRB2 in macaques and Hamadryas baboons: phylogeny and lineage diversification

To characterize the LILRB1 and LILRB2 allelic polymorphisms present at the genomic level in our three cohorts of OWM, we selected 30 Indian, 13 Burmese, and 16 Chinese rhesus macaques, 54 long-tailed macaques, and 14 Hamadryas baboons (Supplementary Table 1). Most of these samples were sequenced using the PacBio platform, however, a small subset was sequenced using the ONT workflow. The analyses resulted in the identification of 135 LILRB1 and LILRB2 alleles across the three species (Table 1, Supplementary Table 5). This total included 86 alleles previously identified at the transcription level, now extended with intronic sequence information, and 49 newly detected alleles.

Subsequently, we performed phylogenetic analysis to explore the evolutionary relationships among LILRB1 and LILRB2 alleles in the three species and in humans (Supplementary Figure 3). This analysis showed that human LILRB1 and LILRB2 sequences cluster separately from those of OWM. Alleles from the two macaque species intermingled across several distinct clusters, while the Hamadryas baboon sequences formed species-specific branches. Based on sequence alignments and phylogenetic clustering patterns, we defined 10 LILRB1 and 15 LILRB2 lineages to describe the sequence variation observed across the three OWM species (Figure 2A, Supplementary Figure 3). For naming the OWM LILRB1 and LILRB2 alleles (Supplementary Table 5), we applied the nomenclature proposed in the Materials & Methods section.

Figure 2
Tables displaying allele distributions for LILRB1 and LILRB2 genes across species and lineages. Panel A compares alleles and D1-D2 counts in species Mamu, Mafa, and Paha. Panel B shows allele counts for LILRB1 and LILRB2 in Mamu across Indian, Burmese, and Chinese lineages. Each table includes total counts with certain values in parentheses.

Figure 2. Overview of the LILRB1 and LILRB2 lineages in rhesus (Mamu) and long-tailed (Mafa) macaques and Hamadryas baboon (Paha). (A) For each species, the number of alleles identified per lineage is shown, with the number of alleles exhibiting intron-level variation indicated in parentheses. The number of distinct D1-D2 amino acid sequences translated from these alleles is also provided (additional supporting data is provided in Supplementary Table 7). (B) Lineage distribution across the rhesus macaque populations, categorized by geographic origin (Indian, Burmese, and Chinese), with the number of alleles per lineage indicated.

All LILRB1 lineages are supported by four or more alleles. For LILRB2, 11 lineages are supported by three to ten alleles, while four lineages currently contain a single allele (Figure 2A). Most LILRB1 lineages are shared between the two macaque species, except for *005 and *007, which were unique to long-tailed macaques. In contrast, less sharing is observed for LILRB2 lineages (only 4 out of 12), with three and five distinct lineages specific to rhesus and long-tailed macaques, respectively (Figure 2A). In Hamadryas baboons, lineage diversity was limited, with alleles clustering into one LILRB1 and three LILRB2 lineages (Figure 2A, Supplementary Figure 3).

For the rhesus macaque, we distinguished animals of Indian, Burmese, and Chinese origin. Several lineages were shared across these geographical populations (Figure 2B). Furthermore, sharing between Indian and Burmese, Indian and Chinese, and Burmese and Chinese animals were observed for LILRB1*002, LILRB1*008, and LILRB2*005, respectively (Figure 2B). The remaining lineages were restricted to a single population. A few alleles were pairwise shared between the different rhesus macaque populations, whereas only Mamu-LILRB1*001:04:01:01 was common to all three rhesus macaque populations (Supplementary Table 5). These results indicate that most rhesus macaque LILRB1 and LILRB2 alleles are population-specific, likely reflecting historical geographic barriers and limited gene flow. Furthermore, no allelic sharing of LILRB1 and LILRB2 was observed between rhesus and long-tailed macaques, suggesting further diversification following speciation.

Sequence characteristics and polymorphic sites of LILRB1 and LILRB2

Alignments of the LILRB1 and LILRB2 coding sequences from rhesus macaques, long-tailed macaques, and Hamadryas baboons revealed both nucleotide polymorphisms and structural variations (Supplementary Figure 4). For example, insertions were identified in both LILRB1 and LILRB2. In LILRB1, a 21 bp insertion mapping to exon 9 was observed in all currently described Mafa-LILRB1*005 alleles, as well as in Mafa-LILRB1*004:01:01:01, *007:03:01:01, and *007:04:01:01. This insertion results in an extension of the stem region by seven amino acids, and its presence was confirmed by Sanger sequencing. In LILRB2, a three bp insertion at positions 82–84 characterizes the macaque lineages *001, *002, *003, and *010. Another structural variation we observed involves the absence of exon 9 in certain LILRB2 alleles, resulting in a shortened stem region. This likely represents alternative spliced variants, as in case of Mafa-LILRB2*002:02 and *008:04 exon 9 could be identified at the gDNA level (Supplementary Table 5).

Next, the deduced amino acid sequences of the reported human LILRB1 alleles (Supplementary Table 4) were compared with those of the rhesus macaque, long-tailed macaque, and Hamadryas baboon LILRB1 alleles characterized in this study. Variability plots showed that non-synonymous polymorphisms, resulting in amino acid changes, are distributed throughout the coding sequence in all four species (Figure 3). In the Hamadryas baboon population, four LILRB1 alleles were identified with a high degree of sequence similarity, exhibiting only two amino acid-changing sites, one in the D3 and one in the cytoplasmic tail (Figure 3). In contrast, rhesus and long-tailed macaques exhibit more diversity, with 27 and 37 LILRB1 alleles, respectively, and an extensive number of non-synonymous changes, particularly within the exons that encode the Ig-like domains (D1-D4). Normalization to exon length confirmed elevated non-synonymous mutation density in these domains in macaques as compared to humans and Hamadryas baboon (Figure 4, Supplementary Table 6). Exons 4 and 10 of macaque LILRB1 also exhibited increased non-synonymous variation.

Figure 3
Comparison charts show unique amino acid count along sequences of LILRB1 and LILRB2 for humans, Mamu, Mafa, and Paha. Each chart highlights domains D1 to D4, Stem, and TM + Cyt. Blue lines represent LILRB1, orange lines represent LILRB2, indicating variability across species and domains.

Figure 3. Variability plots of LILRB1 and LILRB2 of human, rhesus (Mamu) and long-tailed (Mafa) macaques, and Hamadryas baboon (Paha). The x-axis sequentially numbers the amino acids positions in the sequence alignment. The y-axis indicates the total number of unique amino acids (AA) encountered at a certain position. Source information for the human sequences used for comparison is provided in Supplementary Table 4.

Figure 4
Two tables display genetic data for LILRB1 and LILRB2 across different species: Human, Mamu, Mafa, and Paha. Each table shows the number of samples (n), exons, and data for synonymous (S) and non-synonymous (NS) positions. Specific exon regions and domains, such as D1, D2, D3, D4, Stem, TM, and Cyt, are included with varying values. Shaded areas highlight certain exon regions.

Figure 4. Overview of the synonymous (S) and non-synonymous (NS) mutations in LILRB1 and LILRB2 alleles from rhesus (Mamu) and long-tailed macaques (Mafa), and Hamadryas baboons (Paha). Mutation counts are normalized to exon length, with exon lengths and raw S/NS counts provided in Supplementary Table 6. For comparison, data from human LILRB1 and LILRB2 alleles are included. These human alleles correspond to those listed in Supplementary Table 4. “NA” indicates not applicable and is used when the corresponding part of the sequence was not characterized or when the exon was not detected/characterized. Distinct grey backgrounds highlight the data corresponding to the four domain (D) regions and the stem region. “TM” refers to the transmembrane region, and “Cyt.” to cytoplasmic tail region.

For LILRB2, macaques displayed a similar pattern to that observed for LILRB1, with polymorphisms concentrated in the domain-encoding exons (Figures 3, 4). In Hamadryas baboons, greater LILRB2 diversity relative to LILRB1 was observed. Notably, only two human LILRB2 sequences are currently available (Supplementary Table 4), limiting comparisons across species.

Sequence sharing in the D1 and D2 domains of LILRB1 or LILRB2 between rhesus and long-tailed macaques

Mature mRNA LILRB1 or LILRB2 coding sequences are not shared between rhesus and long-tailed macaques. However, within lineages, macaque LILRB1 and LILRB2 alleles can exhibit high sequence similarity. To assess the potential impact of the polymorphisms encountered in LILRB1 and LILRB2 on MHC class I recognition, we compared the deduced amino acid sequences of exons 5 (D1) and 6 (D2) in the two macaque species and Hamadryas baboons.

Alleles within a specific lineage may share the same D1-D2 amino acid sequence, and this was observed for all three species (Figure 2A, Supplementary Table 7). The greatest D1-D2 sequence variation was observed in long-tailed macaques, where the characterized nucleotide sequence diversity translated into 25 distinct D1-D2 amino acid sequences for LILRB1 and 21 for LILRB2 (Figure 2A). Across species, sharing of D1, D2, and combined D1-D2 amino acid sequence was observed among specific rhesus and long-tailed macaque LILRB1 and LILRB2 lineage alleles (Figure 5). Notably, D1 sequence sharing was detected among alleles of the Mamu-LILRB2*011 lineage and the Mafa-LILRB2*004:02 and *004:04 alleles. Although these alleles cluster closely in phylogenetic analyses (Supplementary Figure 3), we classified them into distinct lineages based on sequence differences in the D3, D4, stem, and cytoplasmic tail regions. Furthermore, sequence sharing of the D1 domain and the combined D1-D2 domains was identified in certain LILRB2 lineages currently detected only in long-tailed macaques (Figure 5). Overall, these findings indicate that although an increased number of non-synonymous mutations is present in the LILRB1 and LILRB2 extracellular domain encoding regions (Figure 4), the D1 and D2 encoding regions −involved in MHC class I interaction− remain highly conserved within a lineage, and conservation for these domain encoding regions is even observed among lineages shared between rhesus and long-tailed macaque.

Figure 5
Table displaying LILRB1 and LILRB2 alleles associated with identical D1 and D2 domains. LILRB1 section lists various combinations like Mamu-LILRB1*001 and Mafa-LILRB1*001. LILRB2 section includes alleles such as Mamu-LILRB2*008 and Mafa-LILRB2*008. Each domain is highlighted with D1 and D2 labels and specific allele numbers.

Figure 5. Rhesus (Mamu) and long-tailed (Mafa) macaque LILRB1 and LILRB2 alleles that share D1, D2, or combined D1-D2 amino acid sequences. Schematic illustrations on the left side of the figure represent the D1, D2, and combined D1-D2 domains, with adjacent each colored block (blue for LILRB1, light orange for LILRB2) indicating the combination of alleles sharing the same sequence for exon 5 (D1), exon 6 (D2), or exons 5 and 6 (D1-D2). For lineages in which all currently known alleles share the same corresponding domain sequence, the total number of alleles is shown in parentheses. When only a subset of alleles within a lineage shares the indicated domain sequence, the identifying digits of those alleles are listed next to the first allele name, separated by commas.

Polymorphism of the MHC class I alpha-3 domain and its association with LILRB1 and LILRB2 D1 variation

The crystal structure of the HLA-LILRB1 complex shows strong binding between β2M and the LILRB1 D2 domain, in concert with a more flexible interaction between the HLA-α3 domain and LILRB1 D1 (17, 18, 54). LILRB2, in contrast, has been shown to bind HLA-G in a β2M-independent manner (54). In-vitro studies demonstrated that LILRB1 binds HLA-C with significantly lower affinity compared to HLA-A and -B, and that polymorphisms within the α3 domain can modulate this interaction, likely through electrostatic effects (53). To assess the extent of the α3 domain diversity among human, macaque, and baboon Mhc class I alleles and its potential impact on the interaction with the D1 domains of LILRB1 and LILRB2, we compared the deduced amino acid sequences of exon 4 from HLA-A, -B, -C and -G alleles (IPD-IMGT/HLA Database release 3.60), as well as from Mamu-A1 and -B, Mafa-A1 and -B, and Paha-A and -B alleles (IPD-MHC NHP Database release 3.14.0.0) (Supplementary Table 8A). Notably, unlike humans, macaques and baboons lack an HLA-C equivalent and possess expanded and variable Mhc-A and -B gene repertoires per haplotype (50, 5559). The comparison revealed relatively greater protein variation in the macaque and baboon MHC-A1 and -B α3 domains compared to the HLA-A, -B, -C, and -G α3 domains (Supplementary Table 8A). In HLA, six amino acid positions were associated with significant differences in binding affinity for LILRB1 D1-D2 variants (53) (Figure 6). At these positions, in macaque and baboon MHC-A and -B allotypes the predominant residues were 183D, 189V, 268E −although a higher frequency of 268K was observed in Mamu- and Mafa-A1−, 194V, and 207G. At position 253, most macaque and baboon MHC-B allotypes contained E, whereas MHC-A allotypes displayed either E or K (Figure 6). Furthermore, we identified six α3 domain residues −187T, 198E, 202R, 203C, 208F, and 210P− that are conserved across macaque and baboon MHC-A and -B allotypes (Supplementary Table 8B) and are located near the positions highlighted in Figure 6.

Figure 6
Bar chart showing allele distributions across positions 183, 189, 268, 194, 207, and 253 of MHC gene variants. Each position displays bars for different allele types, with color coding correlating to a pie chart legend on the right. The chart is divided by percentage, with distinct colors representing amino acid substitutions: R, V, A, D, E, F, G, I, K, M, N, P, Q. A numbered key for MHC variants and counts is included.

Figure 6. Stacked bar plots illustrating the distribution of amino acids at MHC alpha 3 domain positions associated with significant binding differences to LILRB1 in humans. Above each stacked bar plot the amino acid position (Pos.), the HLA type involved, and the amino acids associated with increased binding affinity in the HLA-LILRB1 study are shown (53). For human (HLA), rhesus macaque (Mamu), long-tailed macaque (Mafa), and Hamadryas baboon (Paha) the number (N) of complete exon 4 sequences analyzed is indicated. Distinct amino acids are represented using the conventional one-letter symbols. Additional supporting data are provided in Supplementary Table 8.

From the perspective of LILRs, studies in humans have shown that positively charged residues at positions 72 (R72Q) and 103 (H103D) in the LILRB1 D1 domain enhance binding to HLA class I molecules (53). In macaque and baboons, the deduced amino acid sequences of LILRB1 and LILRB2 revealed that these positions are also predominantly occupied by positively charged residues. An exception is position 72 in LILRB2 D1, which is not positively charged but instead contains the neutral amino acids serine (S) and leucine (L) (Supplementary Table 7).

Extensive alternative splicing characterizes the LILRB1 and LILRB2 transcriptomes of rhesus and long-tailed macaques, and Hamadryas baboons

Alternative splicing is a form of post-transcriptional modification that occurs in approximately 95% of the human multi-exon genes (60, 61). The introduction of next-generation sequencing techniques has enabled and enhanced large-scale characterization of alternatively spliced transcripts, as demonstrated, for example, in the KIR gene cluster (38, 41, 62).

The transcriptome analysis of LILRB1 and LILRB2 in rhesus and long-tailed macaques and Hamadryas baboons revealed that transcripts were alternatively spliced (Supplementary Tables 9A-D). In LILRB1, the majority of the splice variants arose from a complex combination of alternative splicing events, predominantly involving exon skipping and the use of alternative 3’-splice site (ss) (Supplementary Table 10A). To compare splicing patterns across species, we also characterized the LILRB1 transcriptome in four human samples (Supplementary Figure 1). In total, we identified 48 splicing events shared by at least two individuals/animals and present in two or more species (Supplementary Table 10A). Across all four species, alternative splicing affected exons 10 and 15, encoding the stem region and part of the cytoplasmic tail, respectively. These events were observed in the majority of splice variants (Figure 7) and involved different alleles (Supplementary Table 10A). One conserved event concerns an alternative 3’-ss in exon 10, resulting in a 3 bp insertion, leading to the addition of an alanine (A) residue in the stem region (Figure 8). This splicing event occurred in most, if not all, LILRB1 alleles across all four species and is observed as independent event or in combination with other splicing events (Supplementary Table 10A). For exon 15, three common events were recorded. One involved an alternative 3’-ss near exon 15, resulting in a 3 bp insertion encoding a glutamine (Q) residue in the cytoplasmic tail region, an event shared by humans and macaques (Figure 8). The other two events included partial intron 15 retentions (typically around 30 bp, producing an exon 15 of 156 bp in length) and complete exon 15 skipping, the latter resulting in the loss of one of the four ITIM motifs present in the cytoplasmic tail of LILRB1 molecules (Figure 8). These events were observed in macaques and in both macaques and baboons, respectively. Exon 15 skipping is most likely explained by a GT-to-GC splice site mutation detected in all four Hamadryas baboon and most macaque LILRB1 alleles (Supplementary Table 5). However, the presence of this splice site mutation does not invariably result in exon 15 exclusion. Also, some alleles contain an alternative splice site that may produce transcripts with an exon 15 of 126 bp in length. Additionally, small insertions of one or two base pairs at the start of exon 10, introducing premature stop codons, were detected across all four species for many different alleles (Supplementary Table 10A).

Figure 7
Bar charts comparing the total number of alternative splicing events for LILRB1 and LILRB2 across four species: human, rhesus macaque, long-tailed macaque, and Hamadryas baboon. The x-axis represents different data points labeled from 3 to 16, and the y-axis shows the total number of events ranging from 0 to 100. Each species is represented by a different color within the stacked bars.

Figure 7. Stacked bar charts showing the absolute number of alternative splicing events involving the corresponding exon for LILRB1 and LILRB2 in humans, rhesus and long-tailed macaques, and Hamadryas baboons. Only alternative splicing events shared by at least two individuals/animals and present in two or more species were considered (Supplementary Table 10). On the x-axis, exons are numbered sequentially, as shown in Figure 1. D1-D4 indicate the four extracellular domains, “TM” denotes the transmembrane section, and “Cyt.” refers to the cytoplasmic tail.

Figure 8
Diagram showing LILRB1 and LILRB2 structures and alternativelyspliced isoforms in humans, rhesus and long-tailed macaques, and hamadryas baboons. It illustrates transmembrane domains, ITIMs, extracellular domains, and stems.

Figure 8. Schematic representation of the rhesus macaque, long-tailed macaque, and Hamadryas baboon LILRB1 and LILRB2 canonical (C) and alternatively spliced isoforms (numbered 1-5), as deduced from transcript sequences. For both LILRB1 and LILRB2, the deduced isoforms shown are the mature mRNA (C) or the result of the different splicing events discussed in the text (see Supplementary Table 10; events indicated in bold). These events may occur individually or in combination with other alternative splicing events (Supplementary Table 10). For human LILRB2, no transcriptome data were available. LILRB1: 1. 3 bp insertion in exon 10 (3’ss); 2. 3 bp insertion in exon 15 (3’ss); 3. 3 bp insertion in exons 10 and 15 (3’ss); 4. deletion of exon 15 (exon skipping); 5. 3bp insertion in exon 10 (3’ss) and deletion of exon 15 (exon skipping). LILRB2: 1. deletion of exon 9 (exon skipping); 2. deletion of exons 9 and 10 (exon skipping); 3. 3bp insertion in exon 10 (3’ss); 4. deletion of exon 9 (exon skipping) and 3bp insertion in exon 10 (3’ss); 5. deletion of exon 9 (exon skipping) and 3bp insertion in exon 15 (3’ss).

Next, the alternatively spliced LILRB2 transcripts in rhesus and long-tailed macaques and Hamadryas baboons were investigated. Twenty splicing events were identified in at least two animals and across two species (Supplementary Table 10B). In many of these events (Figure 7) and in nearly all alleles (Supplementary Table 10B), exons 9, 10, and 15 were affected by alternative splicing. Deletion of exon 9, which may result in a shortened stem of LILRB2 (Figure 8), was commonly observed in all three non-human primate species, either as a single event or in combination with other splicing events (Supplementary Table 10B). One such combined event involved the deletion of exon 10, that may lead to the complete loss of the stem region in the LILRB2 structure (Figure 8). Other events involved the use of alternative 3’-ss near exons 10 and 15, resulting in the insertion of 3 bp encoding glutamine (Figure 8). These events were also detected either alone or in combination with other splicing events in numerous alleles across the three species (Supplementary Table 10B).

Discussion

The immunomodulatory roles of LILRB1 and LILRB2 receptors make them promising targets for therapies in various types of cancer (6366). In addition, infectious diseases such as tuberculosis and HIV, autoimmune diseases including multiple sclerosis and rheumatoid arthritis, and neurodegenerative conditions like Alzheimer’s disease may benefit from development of monoclonal antibodies that block the immune effector functions of these receptors (65). Although LILRB1 and LILRB2 have been studied in humans and, indirectly, in mice (which lack direct homologues of LILR), a detailed characterization in non-human primate species commonly used in preclinical research has been lacking. Here, we provide an in-depth genomic and transcriptomic analysis of LILRB1 and LILRB2 in rhesus and long-tailed macaques and Hamadryas baboons. Our results show that these genes are highly polymorphic and diversified in the macaque species but, similar to humans, are more conserved in Hamadryas baboons (Figure 2). Most polymorphisms are in the exons encoding the four extracellular domains (Figures 3, 4). A detailed investigation of the deduced D1 and D2 amino acid sequences showed sharing of D1, D2, and combined D1-D2 sequences between the two macaque species (Figure 5), suggesting conserved ligand binding potential. Furthermore, we observed alternative splicing in LILRB1 and LILRB2 transcripts, involving most of the characterized alleles, with the majority of the splicing events affecting the exons encoding the stem and intracellular parts of the molecules (Figure 7). Several alternative splicing events are conserved across species, suggesting a possible role of these isoforms in regulating immunomodulatory responses.

The rather conserved nature of LILRB1 and LILRB2 in Hamadryas baboons, but also in humans, may indicate functional specialization. In contrast, the diversification of LILRB1 and LILRB2 in macaque species may reflect co-evolution with their highly diverse and expanded MHC class I system (67). Sequence comparison of the D1 and D2 domains, which are involved in binding the MHC class I α3 domain and β2M, between rhesus and long-tailed macaques revealed several identical and highly similar structures, suggesting conserved ligand binding in these species (Figure 5). However, the D1 and D2 domains may also contain species- and lineage-specific polymorphisms that are likely important for ligand interactions and the functional diversity of LILRB1 and LILRB2 in macaques. Analysis of the deduced amino acid sequences of the MHC-I α3 domain and the D1 domains of LILRB1 and LILRB2 showed that key residues involved in human LILRB1-MHC binding are also present in the three studied OWM species, suggesting potential for comparable interactions (Figure 6). At position 253 of macaque and baboon MHC-A allotypes, however, we observed oppositely charged residues, E (negatively charged) and K (positively charged), whose potential impact on LILRB1 and LILRB2 binding remains to be determined. Furthermore, within the α3 domain of the macaque and baboon MHC-A and -B allotypes, six conserved residues were identified located near key interaction sites (Supplementary Table 8B). Their conserved nature may indicate functional importance, for instance, in supporting stable binding to LILRB1 and LILRB2 in these species. Nonetheless, additional diversification at other positions within the macaque MHC-I α3 domain (Supplementary Table 8B) and within LILRB1 and LILRB2 (Supplementary Table 7) may influence binding characteristics and signaling potential, which effects need to be addressed in future functional studies.

Rhesus and long-tailed macaques are native to the Asian continent, whereas baboons are native to sub-Saharan Africa, where six distinct species are recognized (68). Among these, the Hamadryas baboon represents one of the smallest baboon populations and inhabits the Horn of Africa and the southwestern Arabian Peninsula. The condensed LILRB1 and LILRB2 repertoire that we identified in the Hamadryas baboons may be influenced by their limited population size. Therefore, additional research on LILRB1 and LILRB2 polymorphism in other baboon species is needed to determine whether the conserved nature of these genes is a general phenomenon in baboons. Alternatively, the differences in LILRB1 and LILRB2 diversity identified between the two macaque species and Hamadryas baboon may have been driven by their distinct habitats and the type of pathogens they encounter. For instance, macaques may have evolved LILRB1 and LILRB2 alleles that are specialized in recognizing decoy MHC class I molecules that are encoded by viruses, similarly to UL18 encoded by human cytomegalovirus (14, 19).

Within the LILRB1 and LILRB2 allelic repertoire, we identified alleles/lineages with in-frame insertions and deletions, mostly 3 bp events (Supplementary Figure 4). However, in case of the Mafa-LILRB1*005 lineage alleles, along with Mafa-LILRB1*004:01:01:01, *007:03:01:01, and *007:04:01:01, a 21 bp insertion in exon 9 was observed, extending the stem region with seven amino acids (PTTGPTS). The biochemical properties of these residues may influence the folding and stability of the LILRB1 molecule. For instance, threonine (T) and serine (S) can introduce hydrogen bonds, potentially stabilizing the LILRB1 structure, while glycine (G) −the smallest amino acid− may enhance flexibility, and proline (P) might introduce rigidity (69). The 21 bp insertion occurred in ~20% of the analyzed long-tailed macaque animals, including three homozygotes (Supplementary Table 1). Whether this insertion impairs receptor folding or instead enhances receptor function is yet undetermined.

Previously, our team reported on alternative splicing in the human and macaque KIR transcriptome (38). In humans, approximately 53% of KIR2DL4 and 4% of KIR2D/3D transcripts were alternatively spliced, whereas in rhesus macaques, the corresponding proportions were approximately 13% and 24%, respectively. The LILRB1 and LILRB2 transcriptomes in rhesus and long-tailed macaques, and Hamadryas baboons exceed these levels, with the majority of reads representing alternatively spliced transcripts. For both LILRB1 and LILRB2, alternative splicing primarily affected the stem region and part of the cytoplasmic tail involved in signal transduction (Figure 7). Skipping of exon 15 was observed in transcripts of Mamu-, Mafa-, and all Paha-LILRB1 alleles. This event reduces the number of ITIM motifs from four to three and is likely associated with a GT-to-GC splice-site mutation. Although functional studies are required to assess the binding capacity and signal potential of isoforms, the abundance and conservation of specific alternative splicing events across species suggest that some may play an important role in modulating immune responses.

In humans, alternatively spliced soluble LILR isoforms have been described that are generated by the transcription of a cryptic stop codon located three nucleotides downstream of the donor splice site in the intron following the stem region (70). Human soluble LILRB1 has been shown to function as a negative regulator of the membrane bound counterpart, and a similar function is expected for other soluble LILRs. In macaque and Hamadryas baboon LILRB1 and LILRB2 alleles, a cryptic stop codon in the intron following the stem region was detected only in Mafa-LILRB1*009:02:01:01, Mafa-LILRB2*004:01:01:01, and the Mamu- and Mafa-LILRB2*008 lineage alleles. This suggests that the mechanism generating soluble isoforms through transcription of a cryptic stop codon, as seen in human LILR, may also occur in macaque LILR. However, most macaque and Hamadryas baboon LILRB1 and LILRB2 alleles lack a cryptic stop codon in this region. Instead, various 3’ splice site insertions of one to two base pairs immediately near exon 10 are found in macaque and baboon LILRB1 and LILRB2 transcripts. These insertions introduce premature stop codons and might result in the production of soluble isoforms.

In the HLA-LILRB1 complex it is demonstrated that binding between β2M and the D2 domain is crucial (17, 54). In case of the HLA-LILRB2 complex, β2M-independent binding has been observed (54), highlighting the importance of the interaction between the HLA class I α3 domain and the D1 domain of LILRB2. In none of the four species studied alternative splicing events were identified that affect the LILRB1 D2 domain (Figure 7). In case of LILRB2, the D1, D2, and D3 domains were exclusively constitutively spliced (Figure 7). These observations highlight the essential structural and functional roles of these domains in humans, both macaque species, and Hamadryas baboons.

This is the first comprehensive characterization of the LILRB1 and LILRB2 repertoires in rhesus and long-tailed macaques and Hamadryas baboons. Previously we demonstrated that the primate LILR region has a highly conserved organization (5). Here we show that regarding LILRB1 and LILRB2 gene polymorphism and diversity, species specific differences exist. Given the role of macaque and baboon species as models for human diseases (2729, 71), our findings provide a valuable foundation for advancing the interpretation and translation of immune modulatory responses. Future functional studies on LILRB1 and LILRB2 will be essential to fully address the implications of the polymorphisms and alternatively spliced variants identified in macaques and baboons.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ebi.ac.uk/ena, OZ237016 till OZ237135, OZ237575 till OZ237580, OZ243064 till OZ243077, OZ243094, OZ243095.

Ethics statement

Human PBMC samples were provided by the Immunohematology and Blood transfusion department of the Leiden University Medical Centre. The studies were conducted in accordance with the local legislation and institutional requirements. The human samples used in this study were acquired from gifted from another research group. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements. Ethical approval was not required for the study involving animals in accordance with the local legislation and institutional requirements because DNA and RNA was extracted from EDTA blood taken during regular health checks and initially used for parentage analysis. Remaining DNA and RNA is stored in the BPRC-biobank and available for other research projects.

Author contributions

NGG: Conceptualization, Data curation, Investigation, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. NG: Data curation, Formal Analysis, Validation, Writing – review & editing. CH: Formal Analysis, Writing – review & editing. AV-R: Formal Analysis, Writing – review & editing. MW: Methodology, Writing – review & editing. JB: Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgments

We are grateful to Drs. J. Stumpel of WILDLANDS Adventure Zoo for providing informed consent to include the results from the Hamadryas baboon samples in this report. We like to thank R.F. Acar, Drs. L. Storm and N. Otting, and Dr. R.E. Bontrop for their contributions to the manuscript, and F. van Hassel for designing the figures.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1706720/full#supplementary-material

Supplementary Figure 1 | Pedigrees of rhesus macaque (A), long-tailed macaque (B), and Hamadryas baboon (C) families selected for LILRB1 and LILRB2 characterization, and a human family (D) for LILRB1 characterization. In these pedigrees, males are represented by squares and females by ovals. For the Indian rhesus macaque, one male appears multiple times in the pedigree and is indicated by a square with a light-gray background. Similarly, for the long-tailed macaque, one female appears in multiple pedigrees and is represented by an oval with a light-gray background. In two Chinese rhesus macaque families, the mother’s identity is unknown, which is denoted as “not determined” (N.D.). In cases where the sex of the offspring is unknown, only the animal’s ID is provided. On two occasions, the segregation of LILRB1 alleles could not be confirmed (Supplementary Table 1); in these instances, the specific animal IDs are underlined. J10006, J16012, J15021, J16019.

| Representative image of an agarose gel with LILRB1 amplicons size selected for downstream sequencing analysis. The FastRuler Middle Range DNA Ladder (L) of Thermo Fisher Scientific was used for size selection, with the length of the five purified DNA fragment illustrate the range of the selected products. Results on LILRB1 characterization for J10006 (lane 2), J16012 (lane 3), J15021 (lane 4), and J16019 (lane 5) are depicted in Supplementary Tables 1 and 9B (animal IDs highlighted in red). The blanco (no-template control, lane 6) demonstrates the absence of DNA contamination.

Supplementary Figure 3 | Phylogenetic analysis of LILRB1 and LILRB2 alleles from rhesus macaque (Mamu, blue), long-tailed macaque (Mafa, black), and Hamadryas baboon (Paha, green). The coding DNA sequences (CDS) of all LILRB1 and LILRB2 alleles identified at gDNA level, together with seven alleles detected only at the transcription level (Supplementary Table 5), were subjected to phylogenetic analysis. The Neighbor-joining tree was constructed using the Nei-Gojobori (Jukes-Cantor) method for evolutionary distances. Human LILRB1 and LILRB2 sequences (IDs shown in red and orange, respectively) were included in the analysis for comparison, and source information is provided in Supplementary Table 4. Relevant bootstrap values, based on 1,000 replicates, are indicated.

Supplementary Figure 4 | Selected regions of the sequence alignments for LILRB1 and LILRB2 alleles, highlighting segments containing insertions and deletions. Human LILRB alleles (labeled Hosa, HM, AF283984, and AF283985) are shown at the top, followed by alleles from rhesus (Mamu) and long-tailed (Mafa) macaques, and Hamadryas baboon (Paha). “del.” denotes a deletion, “bp” indicates base pair, and “pos.” refers to position.

Supplementary Table 1 | LILRB1 and LILRB2 allelic variation characterized in the rhesus macaque, long-tailed macaque and Hamadryas baboon animals. The data are presented by family (see also Supplementary Figure 1) and include allele characterization at both the cDNA and gDNA levels. Unrelated individuals are also included in the analyses for all species. If an allele could not be identified, it is denoted as “not determined” (N.D.). The underlined rhesus and long-tailed macaque IDs indicate the individuals for whom LILRB1 characterization at the gDNA level was performed using Nanopore sequencing. An “L” is included in the allele name when only Nanopore sequencing was conducted for the corresponding animal, as it was unable to discriminate allelic variation. #Occasions in which the segregation of a LILRB1 allele could not be confirmed (see also Supplementary Figure 1), with the alleles involved underlined. In two Chinese rhesus families, alleles are printed in red to indicate that an intron mutation occurred when inherited from parent to child. As a result, these alleles received a different designation. Where known, males are depicted with a light-blue background, and females with a pink background. Animal IDs highlighted in red identify the animals whose LILRB1 amplicons are depicted in Supplementary Figure 2.

Supplementary Table 2 | Overview of primers designed for the amplification of LILRB1 and LILRB2 in rhesus (Mamu) and long-tailed macaque (Mafa) and Hamadryas baboon (Paha) at the transcription (cDNA) and genomic DNA (gDNA) level. The abbreviations Fw stands for forward, Rv for reverse. The location, the different annealing temperatures applied for each primer set and the expected product size (base pair, bp) amplified by the primers is indicated. The primers designed for LILRB1 amplification in human (Hosa) samples are also included.

Supplementary Table 3 | Information on the PacBio runs conducted to characterize allelic variation in LILRB1 and LILRB2 at both the transcription (cDNA) and genomic DNA (gDNA) levels. For the three species analyzed, the platform used, total number (#) of reads retrieved, number of samples, and the average number of reads per sample are provided. *Samples indicated for this sequel run were pooled together with other type of samples. The sequel I run, highlighted with a gray background, contained only four samples to ensure a higher number of reads per sample.

Supplementary Table 4 | Overview of published, directly submitted and reference or predicted reference sequences included in the LILRB1 and LILRB2 reference library and phylogenetic analysis. ^The long-tailed macaque LILRB2, LOC102120967, is assigned as Mafa-LILRB1 (5). #LOC102143922, LOC101022334, and LOC116271753, are identified as Mafa-LILRB2, Paan-LILRB1, and Paan-LILRB2, respectively, based on comparative analyses with the genomic organizations published by Storm et al., 2021.

Supplementary Table 5 | Meta-data associated with the LILRB1 and LILRB2 alleles identified in rhesus and long-tailed macaques, and Hamadryas baboons. Bolded allele designations refer to alleles identified only at the transcriptional level. ENA-EMBL accession numbers for the alleles are provided. The number (#) of individuals in which each allele is detected is indicated (numbers extracted from Supplementary Table 1). If an allele is detected in a single individual, it is confirmed by an independent second PCR and an independent sequencing run (indicated as 2xPCR). For rhesus macaque alleles, the population in which the allele is detected (India, Burma, and China) is specified. A 2-field allele name is provided when alleles are detected at the transcriptional level. The suffix Sp indicates that the allele identified is an alternative splice variant. $Alleles lacking exon 15 at the transcription level. #Alleles lacking exon 9 at the transcription level. @Mamu-LILRB2*005:01:01:02 is identified in one individual with 56 reads from a single PCR, and further confirmation is required. ^Mamu-LILRB2*006:01:01:02 and *006:01:01:03, and Mafa-LILRB2*004:02:01:01, *004:04:01:01, *008:01:01:01, and *010:01:01:01 are identified in single individuals at both the cDNA and gDNA levels, with read numbers ranging from 51 to 1031 base pairs, confirming their coding sequences. However, intron sequences for these alleles require further confirmation. &Paha-LILRB2*013:02Sp and *014:02 are identified in single individuals with read numbers ranging from 120 to 535 base pairs and require further confirmation. For each allele, the two nucleotides forming the splice site between exon 15-intron 15 (156 bp) are indicated. In addition, for the LILRB1 alleles the two nucleotides forming the alternative splice site between exon 15-intron15 (126 bp) are provided. “N.D.” means not determined.

Supplementary Table 6 | Numbers of synonymous (S) and non-synonymous (NS) mutations detected in LILRB1 and LILRB2 alleles from the studied rhesus macaque, long-tailed macaque and Hamadryas baboon populations. For comparison, the numbers on S and NS mutations detected in human LILRB1 and LILRB2 alleles are included. These human alleles correspond to those listed in Supplementary Table 4. The number (n) of macaque and baboon alleles included in the analysis are indicated. The length of each exon (in base pairs, bp) is provided and used to normalize the data. NA indicates “not applicable” and is used when the corresponding part of the sequence was not characterized or when the exon in question was not detected.

Supplementary Table 7 | Polymorphic amino acid positions based on the deduced amino acid sequences of the rhesus (Mamu) and long-tailed (Mafa) macaque and Hamadryas baboon (Paha) LILRB1 (A) and LILRB2 (B) alleles for the D1 and D2 domain structures. The consensus sequences of human (Hosa) LILRB1 and LILRB2 are shown at the top and used as references. Identity with the reference is indicated by a dash (“-”), amino acid substitutions are represented by the standard one-letter code, and a deletion is indicated by an “x”. When polymorphisms are present within the Hosa-LILRB1 and Hosa-LILRB2 sequences, they are indicated using a slash between the relevant one-letter codes. The first row lists the allele names. When multiple alleles of a lineage share the same amino acid sequence, the digits identifying those alleles are listed next the first allele name, separated by a comma. The D1 domain is from position 25-121, the D2 domain from 122-221. For the rhesus macaque alleles, the geographic origin of the population in which they were detected is indicated. Position 72 and 103 in the human LILRB1 D1 domain, as discussed by Liu et al. (53), correspond to positions 73 and 104 in the alignment presented here, due to a insertion/deletion present at position 28.

Supplementary Table 8 | Overview of the total number of human (HLA), rhesus macaque (Mamu), long-tailed macaque (Mafa), and Hamadryas baboon (Paha) MHC class I sequences and complete exon 4 sequences used for alpha 3 domain analysis (A) and the polymorphisms detected at different alpha 3 domain positions in the macaque and baboon species (B). (A) The data were obtained from IPD-IMGT/HLA database release 3.60 and IPD-MHC NHP release 3.14.0.0. ^For the Hamadryas baboon, additional data from a manuscript currently in preparation are included. Sequences containing a premature stop codon were excluded from further analysis in Supplementary Table 8B and Figure 6. (B) Amino acid (Aa) positions discussed in the main text are highlighted with distinct background coloring. Identity with the consensus is indicated by a dash (“-”), amino acid substitutions are represented by the standard one-letter code, and a deletion is indicated by an asterisk (“*”). Polymorphisms within the sequences are denoted by a slash separating the relevant one-letter codes, with the number of sequences containing each polymorphism given in parentheses for positions included in Figure 6, as well as for other positions when the polymorphism is represented by only a small number of sequences.

Supplementary Table 9 | Pivot tables showing the typing results of the rhesus macaque (Mamu) and long-tailed macaque (Mafa) LILRB1 (A-B) and LILRB2 (C-D) PacBio cDNA amplicons. Column A lists the different LILRB1 or LILRB2 variants. The animal IDs corresponding to each set of results are shown at the top of each column. Within each column, the distribution of reads retrieved per amplicon is indicated and corresponds to the allele or splice-variant designations listed in Column A. Some animal IDs appear two or three times, indicating that the same animal was sequenced in duplicate or triplicate. Animal IDs highlighted in red (Supplementary Table 9B) identify the animals whose LILRB1 amplicons are depicted in Supplementary Figure 2.

Supplementary Table 10 | Overview of alternative splicing events observed in the LILRB1 (A) and LILRB2 (B) transcriptomes of human (Hosa), rhesus macaque (Mamu), long-tailed macaque (Mafa), and Hamadryas baboon (Paha). Only splicing events detected in at least two individuals/animals and across two or more species are considered. The numbers indicate the distinct LILRB1 and LILRB2 alleles in which each alternative splicing event was identified. Different types of alternative splicing events within an array are separated by a period. Within each array, “bp” denotes base pair, “del” deletion, “ins” insertion, “ex” exon, and “in” intron. The mechanism of each splicing event is indicated in brackets. Alternative splicing events shown in Figure 8 are indicated in bold.

References

1. Volz A, Wende H, Laun K, and Ziegler A. Genesis of the ILT/LIR/MIR clusters within the human leukocyte receptor complex. Immunol Rev. (2001) 181:39–51. doi: 10.1034/j.1600-065x.2001.1810103.x

Crossref Full Text | Google Scholar

2. Wende H, Colonna M, Ziegler A, and Volz A. Organization of the leukocyte receptor cluster (LRC) on human chromosome 19q13.4. Mamm Genome. (1999) 10:154–60. doi: 10.1007/s003359900961

Crossref Full Text | Google Scholar

3. Hirayasu K and Arase H. Functional and genetic diversity of leukocyte immunoglobulin-like receptor and implication for disease associations. J Hum Genet. (2015) 60:703–8. doi: 10.1038/jhg.2015.64

Crossref Full Text | Google Scholar

4. Kumar S and Hedges SB. A molecular timescale for vertebrate evolution. Nature. (1998) 392:917–20. doi: 10.1038/31927

PubMed Abstract | Crossref Full Text | Google Scholar

5. Storm L, Bruijnesteijn J, de Groot NG, and Bontrop RE. The genomic organization of the LILR region remained largely conserved throughout primate evolution: implications for health and disease. Front Immunol. (2021) 12:716289. doi: 10.3389/fimmu.2021.716289

PubMed Abstract | Crossref Full Text | Google Scholar

6. Takai T. A novel recognition system for MHC class I molecules constituted by PIR. Adv Immunol. (2005) 88:161–92. doi: 10.1016/S0065-2776(05)88005-8

PubMed Abstract | Crossref Full Text | Google Scholar

7. Takai T. Paired immunoglobulin-like receptors and their MHC class I recognition. Immunology. (2005) 115:433–40. doi: 10.1111/j.1365-2567.2005.02177.x

PubMed Abstract | Crossref Full Text | Google Scholar

8. Takai T and Ono M. Activating and inhibitory nature of the murine paired immunoglobulin-like receptor family. Immunol Rev. (2001) 181:215–22. doi: 10.1034/j.1600-065x.2001.1810118.x

PubMed Abstract | Crossref Full Text | Google Scholar

9. Canavez F, Young NT, Guethlein LA, Rajalingam R, Khakoo SI, Shum BP, et al. Comparison of chimpanzee and human leukocyte Ig-like receptor genes reveals framework and rapidly evolving genes. J Immunol. (2001) 167:5786–94. doi: 10.4049/jimmunol.167.10.5786

PubMed Abstract | Crossref Full Text | Google Scholar

10. Glazko GV and Nei M. Estimation of divergence times for major lineages of primate species. Mol Biol Evol. (2003) 20:424–34. doi: 10.1093/molbev/msg050

PubMed Abstract | Crossref Full Text | Google Scholar

11. Perelman P, Johnson WE, Roos C, Seuanez HN, Horvath JE, Moreira MA, et al. A molecular phylogeny of living primates. PloS Genet. (2011) 7:e1001342. doi: 10.1371/journal.pgen.1001342

PubMed Abstract | Crossref Full Text | Google Scholar

12. Slukvin II, Grendell RL, Rao DS, Hughes AL, and Golos TG. Cloning of rhesus monkey LILRs. Tissue Antigens. (2006) 67:331–7. doi: 10.1111/j.1399-0039.2006.00579.x

PubMed Abstract | Crossref Full Text | Google Scholar

13. Colonna M, Nakajima H, and Cella M. Inhibitory and activating receptors involved in immune surveillance by human NK and myeloid cells. J Leukoc Biol. (1999) 66:718–22. doi: 10.1002/jlb.66.5.718

PubMed Abstract | Crossref Full Text | Google Scholar

14. Cosman D, Fanger N, Borges L, Kubin M, Chin W, Peterson L, et al. A novel immunoglobulin superfamily receptor for cellular and viral MHC class I molecules. Immunity. (1997) 7:273–82. doi: 10.1016/s1074-7613(00)80529-4

PubMed Abstract | Crossref Full Text | Google Scholar

15. Fanger NA, Cosman D, Peterson L, Braddy SC, Maliszewski CR, and Borges L. The MHC class I binding proteins LIR-1 and LIR-2 inhibit Fc receptor-mediated signaling in monocytes. Eur J Immunol. (1998) 28:3423–34. doi: 10.1002/(SICI)1521-4141(199811)28:11<3423::AID-IMMU3423>3.0.CO;2-2

PubMed Abstract | Crossref Full Text | Google Scholar

16. Willcox BE, Thomas LM, Chapman TL, Heikema AP, West AP Jr., and Bjorkman PJ. Crystal structure of LIR-2 (ILT4) at 1.8 A: differences from LIR-1 (ILT2) in regions implicated in the binding of the Human Cytomegalovirus class I MHC homolog UL18. BMC Struct Biol. (2002) 2:6. doi: 10.1186/1472-6807-2-6

PubMed Abstract | Crossref Full Text | Google Scholar

17. Willcox BE, Thomas LM, and Bjorkman PJ. Crystal structure of HLA-A2 bound to LIR-1, a host and viral major histocompatibility complex receptor. Nat Immunol. (2003) 4:913–9. doi: 10.1038/ni961

PubMed Abstract | Crossref Full Text | Google Scholar

18. Shiroishi M, Kuroki K, Rasubala L, Tsumoto K, Kumagai I, Kurimoto E, et al. Structural basis for recognition of the nonclassical MHC molecule HLA-G by the leukocyte Ig-like receptor B2 (LILRB2/LIR2/ILT4/CD85d). Proc Natl Acad Sci U S A. (2006) 103:16412–7. doi: 10.1073/pnas.0605228103

PubMed Abstract | Crossref Full Text | Google Scholar

19. Burshtyn DN and Morcos C. The expanding spectrum of ligands for leukocyte ig-like receptors. J Immunol. (2016) 196:947–55. doi: 10.4049/jimmunol.1501937

PubMed Abstract | Crossref Full Text | Google Scholar

20. Chan KR, Ong EZ, Tan HC, Zhang SL, Zhang Q, Tang KF, et al. Leukocyte immunoglobulin-like receptor B1 is critical for antibody-dependent dengue. Proc Natl Acad Sci U S A. (2014) 111:2722–7. doi: 10.1073/pnas.1317454111

PubMed Abstract | Crossref Full Text | Google Scholar

21. Arnold V, Cummings JS, Moreno-Nieves UY, Didier C, Gilbert A, Barre-Sinoussi F, et al. S100A9 protein is a novel ligand for the CD85j receptor and its interaction is implicated in the control of HIV-1 replication by NK cells. Retrovirology. (2013) 10:122. doi: 10.1186/1742-4690-10-122

PubMed Abstract | Crossref Full Text | Google Scholar

22. Harrison TE, Morch AM, Felce JH, Sakoguchi A, Reid AJ, Arase H, et al. Structural basis for RIFIN-mediated activation of LILRB1 in malaria. Nature. (2020) 587:309–12. doi: 10.1038/s41586-020-2530-3

PubMed Abstract | Crossref Full Text | Google Scholar

23. Kim T, Vidal GS, Djurisic M, William CM, Birnbaum ME, Garcia KC, et al. Human LilrB2 is a beta-amyloid receptor and its murine homolog PirB regulates synaptic plasticity in an Alzheimer’s model. Science. (2013) 341:1399–404. doi: 10.1126/science.1242077

PubMed Abstract | Crossref Full Text | Google Scholar

24. Sakoguchi A, Saito F, Hirayasu K, Shida K, Matsuoka S, Itagaki S, et al. Plasmodium falciparum RIFIN is a novel ligand for inhibitory immune receptor LILRB2. Biochem Biophys Res Commun. (2021) 548:167–73. doi: 10.1016/j.bbrc.2021.02.033

PubMed Abstract | Crossref Full Text | Google Scholar

25. Cao Q, Shin WS, Chan H, Vuong CK, Dubois B, Li B, et al. Inhibiting amyloid-beta cytotoxicity through its interaction with the cell surface receptor LilrB2 by structure-based design. Nat Chem. (2018) 10:1213–21. doi: 10.1038/s41557-018-0147-z

PubMed Abstract | Crossref Full Text | Google Scholar

26. Lao K, Zhang R, Dai Y, Luan J, Guo N, Xu X, et al. Identification of novel Abeta-LilrB2 inhibitors as potential therapeutic agents for Alzheimer’s disease. Mol Cell Neurosci. (2021) 114:103630. doi: 10.1016/j.mcn.2021.103630

PubMed Abstract | Crossref Full Text | Google Scholar

27. Almond N, Berry N, Stebbings R, Preston M, Ham C, Page M, et al. Vaccination of macaques with DNA followed by adenoviral vectors encoding simian immunodeficiency virus (SIV) gag alone delays infection by repeated mucosal challenge with SIV. J Virol. (2019) 93. doi: 10.1128/JVI.00606-19

PubMed Abstract | Crossref Full Text | Google Scholar

28. Boszormenyi KP, Stammes MA, Fagrouch ZC, Kiemenyi-Kayere G, Niphuis H, Mortier D, et al. The post-acute phase of SARS-coV-2 infection in two macaque species is associated with signs of ongoing virus replication and pathology in pulmonary and extrapulmonary tissues. Viruses. (2021) 13. doi: 10.3390/v13081673

PubMed Abstract | Crossref Full Text | Google Scholar

29. Dijkman K, Vervenne RAW, Sombroek CC, Boot C, Hofman SO, van Meijgaarden KE, et al. Disparate tuberculosis disease development in macaque species is associated with innate immunity. Front Immunol. (2019) 10:2479. doi: 10.3389/fimmu.2019.02479

PubMed Abstract | Crossref Full Text | Google Scholar

30. Gardner MB and Luciw PA. Macaque models of human infectious disease. ILAR J. (2008) 49:220–55. doi: 10.1093/ilar.49.2.220

PubMed Abstract | Crossref Full Text | Google Scholar

31. Freire-Cobo C, Edler MK, Varghese M, Munger E, Laffey J, Raia S, et al. Comparative neuropathology in aging primates: A perspective. Am J Primatol. (2021) 83:e23299. doi: 10.1002/ajp.23299

PubMed Abstract | Crossref Full Text | Google Scholar

32. Stonebarger GA, Bimonte-Nelson HA, and Urbanski HF. The rhesus macaque as a translational model for neurodegeneration and alzheimer’s disease. Front Aging Neurosci. (2021) 13:734173. doi: 10.3389/fnagi.2021.734173

PubMed Abstract | Crossref Full Text | Google Scholar

33. Deycmar S, Gomes B, Charo J, Ceppi M, and Cline JM. Spontaneous, naturally occurring cancers in non-human primates as a translational model for cancer immunotherapy. J Immunother Cancer. (2023) 11. doi: 10.1136/jitc-2022-005514

PubMed Abstract | Crossref Full Text | Google Scholar

34. Havel PJ, Kievit P, Comuzzie AG, and Bremer AA. Use and importance of nonhuman primates in metabolic disease research: current state of the field. ILAR J. (2017) 58:251–68. doi: 10.1093/ilar/ilx031

PubMed Abstract | Crossref Full Text | Google Scholar

35. VandeBerg J, Williams-Blangero S, and Tardif S. The baboon in biomedical research. New York: Springer (2009).

Google Scholar

36. Li C, Jenkins S, Considine MM, Cox LA, Gerow KG, Huber HF, et al. Effect of maternal obesity on fetal and postnatal baboon (Papio species) early life phenotype. J Med Primatol. (2019) 48:90–8. doi: 10.1111/jmp.12396

PubMed Abstract | Crossref Full Text | Google Scholar

37. Bruijnesteijn J, de Groot N, van der Wiel MKH, Otting N, de Vos-Rouweler AJM, de Groot NG, et al. Unparalleled rapid evolution of KIR genes in rhesus and cynomolgus macaque populations. J Immunol. (2020) 204:1770–86. doi: 10.4049/jimmunol.1901140

PubMed Abstract | Crossref Full Text | Google Scholar

38. Bruijnesteijn J, van der Wiel MKH, de Groot N, Otting N, de Vos-Rouweler AJM, Lardy NM, et al. Extensive alternative splicing of KIR transcripts. Front Immunol. (2018) 9:2846. doi: 10.3389/fimmu.2018.02846

PubMed Abstract | Crossref Full Text | Google Scholar

39. Bruijnesteijn J, van der Wiel MKH, Swelsen WTN, Otting N, de Vos-Rouweler AJM, Elferink D, et al. Human and rhesus macaque KIR haplotypes defined by their transcriptomes. J Immunol. (2018) 200:1692–701. doi: 10.4049/jimmunol.1701480

PubMed Abstract | Crossref Full Text | Google Scholar

40. de Groot NG, de Groot N, de Vos-Rouweler AJM, Louwerse A, Bruijnesteijn J, and Bontrop RE. Dynamic evolution of Mhc haplotypes in cynomolgus macaques of different geographic origins. Immunogenetics. (2022) 74:409–29. doi: 10.1007/s00251-021-01249-y

PubMed Abstract | Crossref Full Text | Google Scholar

41. de Groot NG, Heijmans CMC, van der Wiel MKH, Bruijnesteijn J, and Bontrop RE. The KIR repertoire of a West African chimpanzee population is characterized by limited gene, allele, and haplotype variation. Front Immunol. (2023) 14:1308316. doi: 10.3389/fimmu.2023.1308316

PubMed Abstract | Crossref Full Text | Google Scholar

42. Doxiadis GG, de Groot N, de Groot NG, Rotmans G, de Vos-Rouweler AJ, and Bontrop RE. Extensive DRB region diversity in cynomolgus macaques: recombination as a driving force. Immunogenetics. (2010) 62:137–47. doi: 10.1007/s00251-010-0422-7

PubMed Abstract | Crossref Full Text | Google Scholar

43. Davidson CL, Li NL, and Burshtyn DN. LILRB1 polymorphism and surface phenotypes of natural killer cells. Hum Immunol. (2010) 71:942–9. doi: 10.1016/j.humimm.2010.06.015

PubMed Abstract | Crossref Full Text | Google Scholar

44. Saitou N and Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. (1987) 4:406–25. doi: 10.1093/oxfordjournals.molbev.a040454

PubMed Abstract | Crossref Full Text | Google Scholar

45. Jukes TH and Cantor CR. Evolution of protein molecules. In: HN IM, editor. Mammalian protein metabolism. Academic Press, New York (1969). p. 21–132.

Google Scholar

46. Tamura K and Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. (1993) 10:512–26. doi: 10.1093/oxfordjournals.molbev.a040023

PubMed Abstract | Crossref Full Text | Google Scholar

47. Tamura K, Stecher G, and Kumar S. MEGA11: molecular evolutionary genetics analysis version 11. Mol Biol Evol. (2021) 38:3022–7. doi: 10.1093/molbev/msab120

PubMed Abstract | Crossref Full Text | Google Scholar

48. Nei M and Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. (1986) 3:418–26. doi: 10.1093/oxfordjournals.molbev.a040410

PubMed Abstract | Crossref Full Text | Google Scholar

49. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. (1985) 39:783–91. doi: 10.1111/j.1558-5646.1985.tb00420.x

PubMed Abstract | Crossref Full Text | Google Scholar

50. de Groot NG, Otting N, Maccari G, Robinson J, Hammond JA, Blancher A, et al. Nomenclature report 2019: major histocompatibility complex genes and alleles of Great and Small Ape and Old and New World monkey species. Immunogenetics. (2020) 72:25–36. doi: 10.1007/s00251-019-01132-x

PubMed Abstract | Crossref Full Text | Google Scholar

51. Klein J, Bontrop RE, Dawkins RL, Erlich HA, Gyllensten UB, Heise ER, et al. Nomenclature for the major histocompatibility complexes of different species: a proposal. Immunogenetics. (1990) 31:217–9. doi: 10.1007/BF00204890

PubMed Abstract | Crossref Full Text | Google Scholar

52. Maccari G, Robinson J, Bontrop RE, Otting N, de Groot NG, Ho CS, et al. IPD-MHC: nomenclature requirements for the non-human major histocompatibility complex in the next-generation sequencing era. Immunogenetics. (2018) 70:619–23. doi: 10.1007/s00251-018-1072-4

PubMed Abstract | Crossref Full Text | Google Scholar

53. Liu F, Cocker ATH, Pugh JL, Djaoud Z, Parham P, and Guethlein LA. Natural LILRB1 D1-D2 variants show frequency differences in populations and bind to HLA class I with various avidities. Immunogenetics. (2022) 74:513–25. doi: 10.1007/s00251-022-01264-7

PubMed Abstract | Crossref Full Text | Google Scholar

54. Kuroki K, Matsubara H, Kanda R, Miyashita N, Shiroishi M, Fukunaga Y, et al. Structural and functional basis for LILRB immune checkpoint receptor recognition of HLA-G isoforms. J Immunol. (2019) 203:3386–94. doi: 10.4049/jimmunol.1900562

PubMed Abstract | Crossref Full Text | Google Scholar

55. Boyson JE, Shufflebotham C, Cadavid LF, Urvater JA, Knapp LA, Hughes AL, et al. The MHC class I genes of the rhesus monkey. Different evolutionary histories of MHC class I and II genes in primates. J Immunol. (1996) 156:4656–65.

PubMed Abstract | Google Scholar

56. Daza-Vamenta R, Glusman G, Rowen L, Guthrie B, and Geraghty DE. Genetic divergence of the rhesus macaque major histocompatibility complex. Genome Res. (2004) 14:1501–15. doi: 10.1101/gr.2134504

PubMed Abstract | Crossref Full Text | Google Scholar

57. Kulski JK, Anzai T, Shiina T, and Inoko H. Rhesus macaque class I duplicon structures, organization, and evolution within the alpha block of the major histocompatibility complex. Mol Biol Evol. (2004) 21:2079–91. doi: 10.1093/molbev/msh216

PubMed Abstract | Crossref Full Text | Google Scholar

58. Otting N, Heijmans CM, Noort RC, de Groot NG, Doxiadis GG, van Rood JJ, et al. Unparalleled complexity of the MHC class I region in rhesus macaques. Proc Natl Acad Sci U S A. (2005) 102:1626–31. doi: 10.1073/pnas.0409084102

PubMed Abstract | Crossref Full Text | Google Scholar

59. van der Wiel MKH, Doxiadis GGM, de Groot N, Otting N, de Groot NG, Poirier N, et al. MHC class I diversity of olive baboons (Papio anubis) unravelled by next-generation sequencing. Immunogenetics. (2018) 70:439–48. doi: 10.1007/s00251-018-1053-7

PubMed Abstract | Crossref Full Text | Google Scholar

60. Blencowe BJ. Alternative splicing: new insights from global analyses. Cell. (2006) 126:37–47. doi: 10.1016/j.cell.2006.06.023

PubMed Abstract | Crossref Full Text | Google Scholar

61. Nilsen TW and Graveley BR. Expansion of the eukaryotic proteome by alternative splicing. Nature. (2010) 463:457–63. doi: 10.1038/nature08909

PubMed Abstract | Crossref Full Text | Google Scholar

62. Bruijnesteijn J, de Groot NG, and Bontrop RE. The genetic mechanisms driving diversification of the KIR gene cluster in primates. Front Immunol. (2020) 11:582804. doi: 10.3389/fimmu.2020.582804

PubMed Abstract | Crossref Full Text | Google Scholar

63. Chen HM, van der Touw W, Wang YS, Kang K, Mai S, Zhang J, et al. Blocking immunoinhibitory receptor LILRB2 reprograms tumor-associated myeloid cells and promotes antitumor immunity. J Clin Invest. (2018) 128:5647–62. doi: 10.1172/JCI97570

PubMed Abstract | Crossref Full Text | Google Scholar

64. Jiang Z, Huang Q, Chang Y, Qiu Y, Cheng H, Yang M, et al. LILRB2 promotes immune escape in breast cancer cells via enhanced HLA-A degradation. Cell Oncol (Dordr). (2024) 47:1679–96. doi: 10.1007/s13402-024-00947-5

PubMed Abstract | Crossref Full Text | Google Scholar

65. Redondo-Garcia S, Barritt C, Papagregoriou C, Yeboah M, Frendeus B, Cragg MS, et al. Human leukocyte immunoglobulin-like receptors in health and disease. Front Immunol. (2023) 14:1282874. doi: 10.3389/fimmu.2023.1282874

PubMed Abstract | Crossref Full Text | Google Scholar

66. Zeller T, Munnich IA, Windisch R, Hilger P, Schewe DM, Humpe A, et al. Perspectives of targeting LILRB1 in innate and adaptive immune checkpoint therapy of cancer. Front Immunol. (2023) 14:1240275. doi: 10.3389/fimmu.2023.1240275

PubMed Abstract | Crossref Full Text | Google Scholar

67. de Groot NG, Blokhuis JH, Otting N, Doxiadis GG, and Bontrop RE. Co-evolution of the MHC class I and KIR gene families in rhesus macaques: ancestry and plasticity. Immunol Rev. (2015) 267:228–45. doi: 10.1111/imr.12313

PubMed Abstract | Crossref Full Text | Google Scholar

68. Rogers J, Raveendran M, Harris RA, Mailund T, Leppala K, Athanasiadis G, et al. The comparative genomics and complex population history of Papio baboons. Sci Adv. (2019) 5:eaau6947. doi: 10.1126/sciadv.aau6947

PubMed Abstract | Crossref Full Text | Google Scholar

69. Umumararungu T, Gahamanyi N, Mukiza J, Habarurema G, Katandula J, Rugamba A, et al. Proline, a unique amino acid whose polymer, polyproline II helix, and its analogues are involved in many biological processes: a review. Amino Acids. (2024) 56:50. doi: 10.1007/s00726-024-03410-9

PubMed Abstract | Crossref Full Text | Google Scholar

70. Jones DC, Roghanian A, Brown DP, Chang C, Allen RL, Trowsdale J, et al. Alternative mRNA splicing creates transcripts encoding soluble proteins from most LILR genes. Eur J Immunol. (2009) 39:3195–206. doi: 10.1002/eji.200839080

PubMed Abstract | Crossref Full Text | Google Scholar

71. Emborg ME. Nonhuman primate models of neurodegenerative disorders. ILAR J. (2017) 58:190–201. doi: 10.1093/ilar/ilx021

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: alternative splicing, evolution, human, leucocyte immunoglobulin-like receptor (LILR), leucocyte receptor complex (LRC), long-tailed macaque, non-human primates (NHP), rhesus macaque

Citation: de Groot NG, de Groot N, Heijmans CMC, de Vos-Rouweler AJM, van der Wiel MKH and Bruijnesteijn J (2026) LILRB1 and LILRB2 genomics and transcriptomics in macaque and baboon species: polymorphism, diversification, and extensive alternative splicing. Front. Immunol. 16:1706720. doi: 10.3389/fimmu.2025.1706720

Received: 16 September 2025; Accepted: 10 December 2025; Revised: 08 December 2025;
Published: 09 January 2026.

Edited by:

Johannes M. Dijkstra, Fujita Health University, Japan

Reviewed by:

Kouyuki Hirayasu, Kanazawa University, Japan
Xueling Wu, Boston Medical Center, United States

Copyright © 2026 de Groot, de Groot, Heijmans, de Vos-Rouweler, van der Wiel and Bruijnesteijn. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Natasja G. de Groot, Z3Jvb3RAYnByYy5ubA==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.