ORIGINAL RESEARCH article

Front. Immunol., 28 May 2026

Sec. Viral Immunology

Volume 17 - 2026 | https://doi.org/10.3389/fimmu.2026.1791604

Statistical association of complete PYHIN gene family loss with flight and inverted roosting in bats

  • 1. School of Biological Sciences, University of Nebraska, Lincoln, NE, United States

  • 2. Nebraska Center for Virology, University of Nebraska, Lincoln, NE, United States

Abstract

The PYHIN (pyrin and HIN domain–containing) gene family encodes central cytosolic DNA sensors, including AIM2 and IFI16, that activate inflammasome and type I interferon pathways during infection. While these pathways are critical for antiviral and antimicrobial defense, how ecological pressures shape their evolution remain unclear. The PYHIN gene family varies across mammals, and bats have completely lost all PYHIN genes. Here, we integrate phylogenomic, comparative genomic, and phylogenetically controlled analyses across more than 150 mammalian species to investigate PYHIN evolutionary dynamics. We show that PYHIN genes form a tightly linked genomic cassette within a conserved chromosomal interval flanked by SPTA1 and CADM3, a pattern we describe as an Anchored Gene Cluster Pulsation mechanism characterized by coordinated expansion, contraction, and loss. Across mammals, the genomic distance between these anchor genes correlates with PYHIN copy number. In bats, phylogenetic logistic regression identifies powered flight and inverted roosting as traits statistically associated with PYHIN loss, whereas echolocation and hibernation show no association. Olfactory receptor genes within the same region are retained, indicating targeted loss of DNA-sensing immune genes rather than generalized genome contraction. These findings support a model in which bat immunity is associated with ecological specialization favoring immune tolerance and provide a framework for understanding immune gene family evolution as integrated genomic modules.

Introduction

The PYHIN (pyrin and HIN domain-containing) gene family regulates innate immunity and inflammation by sensing cytosolic DNA and activating inflammasomes (). Five members in humans, Absent in Melanoma 2 (AIM2), Interferon Gamma Inducible Protein 16 (IFI16), Myeloid Cell Nuclear Differentiation Antigen (MNDA), Pyrin and HIN Domain Family Member 1 (PYHIN1), and Pyrin Only Protein 3(POP3), coordinate responses to infection and stress via conserved PYD and HIN domains (). Other animals, especially rodents, have additional PYHIN members, such as Interferon-inducible protein p203 (IFI203) (, ). AIM2 and IFI16 detect pathogenic or mislocalized DNA, triggering inflammasome or interferon responses, while POP3 modulates these pathways through PYD interactions (). Despite their immune centrality, PYHIN genes show enigmatic evolutionary patterns. Bats uniquely lost PYHIN genes among mammals, while birds never acquired this mammalian innovation ().

Despite their well-established immunological roles, PYHIN genes show striking and poorly understood variation across vertebrates (, ). Among mammals, the complete absence of PYHIN genes in bats is particularly notable given the critical role of these genes in inflammatory and antiviral defense in other species. This observation raises a fundamental immunological question: how can bats tolerate the loss of a major DNA-sensing pathway, and what evolutionary forces drove this extreme form of immune remodeling?

Understanding PYHIN evolution is especially relevant in the context of bat immunity. Bats can tolerate infections by certain viruses that are highly pathogenic in humans, a phenomenon that has led to their immune systems being described as exhibiting “super-immunity.” Powered flight has long been proposed as a key driver of bat immune specialization, as it imposes exceptional metabolic demands and increases the risk of inflammation-associated tissue damage. Other bat-associated ecological traits, such as inverted roosting and echolocation, have received far less attention. Importantly, the evolutionary relationships between these ecological traits and specific immune gene losses have not been systematically tested.

Here, we address these questions through comprehensive phylogenomic and comparative genomic analyses spanning more than 150 species of mammals, birds, reptiles, and fishes. We show that the PYHIN gene family emerged in therian mammals approximately 160–180 million years ago and subsequently underwent lineage-specific expansion, contraction, and complete loss. Importantly, the absence of PYHIN genes in monotremes reflects a different evolutionary scenario than that observed in bats. Because the PYHIN gene family originated after the divergence of monotremes, these species never acquired PYHIN genes. In contrast, bats inherited PYHIN genes from a therian ancestor and subsequently lost the entire gene family. This distinction highlights that absence of PYHIN genes across lineages can arise through fundamentally different evolutionary processes—failure to acquire versus secondary loss. We propose the Anchored Gene Cluster Pulsation Hypothesis, in which PYHIN genes evolve as a coordinated genomic cassette embedded within a conserved chromosomal framework, undergoing episodic gain and loss rather than gradual, gene-by-gene turnover. Importantly, phylogenetically controlled analyses identify powered flight and inverted roosting as the ecological traits most strongly associated with complete PYHIN loss in bats. Together, our findings provide an immunologically grounded framework for understanding how ecological pressures can drive large-scale remodeling of innate immune sensing pathways, resulting in immune systems that are uniquely adapted—though not universally enhanced—to specific physiological and environmental demands.

Results

Evolutionary origin and distribution of the PYHIN family

We systematically searched for PYHIN protein coding sequences, identifying members through sequence similarity and phylogenetic analysis (see Supplementary Table 1). The distribution of PYHIN genes varies across mammalian lineages, with distinct presence/absence patterns characterizing different clades. Notably, as shown in Figures 1A–F, none of the five PYHIN family members (AIM2, IFI16, MNDA, PYHIN1, and POP3) were detected in monotremes (0% presence). In addition, AIM2 exhibits the broadest distribution across placental mammals, with 100% presence in Xenarthra (2/2), Afrotheria (7/7), and Marsupialia (9/9) and a high representation in Euarchontoglires (89%; 75/84). In contrast, POP3 has the most restricted distribution, being completely absent in most mammalian lineages except for Euarchontoglires (29%; 24/84). Our analysis revealed that the PYHIN gene family is a placental mammal-specific innovation; its distribution varies significantly across major mammalian clades.

Figure 1

). The dashed lines represent not present in the described clades. Line thickness represents the percentage of species within each clade that contain the respective gene. Numbers in parentheses indicate the number of species with the gene relative to the total number of species examined in each clade, along with the exact percentage. Color coding is consistent across all panels: AIM2 (orange), IFI16 (purple), MNDA (red), PYHIN1 (yellow), and POP3 (pink).

Hominoids retain a five-gene PYHIN repertoire

Our analysis of PYHIN gene distribution across mammals revealed the differences among taxonomic groups (Figure 2A). Primates showed conservation of all five PYHIN genes: AIM2 (100%), IFI16 (94%), MNDA (91%), PYHIN1 (94%), and POP3 (71%), distinguishing them from other Euarchontoglires, which exhibited greater variability and a complete absence of POP3. In contrast, bats (Chiroptera) lost all PYHIN genes, with none detected across 24 species as reported () (Figure 2A, right panel).

Figure 2

).

The POP3 protein contains the Pyrin domain but lacks the HIN domain, also called Pyrin Domain Containing 5 (PYDC5). It is sometimes considered a variant within the broader family context. POP3 distribution followed a phylogenetic gradient (Figure 2B). It was absent in prosimians (0%, 0/5) and tarsiers (0%, 0/1) but appeared frequently in anthropoids: New World monkeys (80%, 4/5) and Old World monkeys (78.6%, 11/14). Hominoids retained POP3 universally (100%, 9/9). Furthermore, phylogenetic analysis reveals it is indeed a hominoid-specific factor (Supplementary Figure 1). In addition, all hominoids have four other genes in the PYHIN family (Supplementary Table 1).

Our syntenic analysis further revealed distinct PYHIN gene arrangements across primates (Figure 3, lanes 1–5). Humans and rhesus macaques (lanes 1–2) maintained conserved structures, while prosimians showed greater variation: ring-tailed lemurs (lane 3) retained the core but duplicated PYHIN1 and lacked POP3; slow lorises (lane 4) had multiple IFI16 duplications and two PYHIN1-like genes; and the small-eared galago (lane 5) exhibited the most divergence, lacking MNDA, PYHIN1, and POP3, but retaining duplicated IFI16-like genes and AIM2. These findings highlight the dynamic evolution of PYHIN genes in primates.

Figure 3

The SPTA1-CADM3 region as an evolutionary hotspot for PYHIN dynamics

Spectrin alpha chain, erythrocytic 1 (SPTA1) maintains the biconcave shape of red blood cells, while Cell Adhesion Molecule 3 (CADM3) facilitates cell-cell adhesion and neural development. They are located on the same chromosome and all PYHIN family genes are located in the region (Figure 3).

Gene deletion is the most striking phenomenon in the region. Bats have lost all PYHIN genes while retaining the anchor genes (lane 16), whereas cetaceans (e.g., harbor porpoise, bottlenose dolphin) retain only one or two IFI203-like genes (lanes 13–14).

Gene duplication is most pronounced in mice, with Mus musculus expanding to at least 13 PYHIN members (Figure 3, lane 6), including lineage-specific paralogs like IFI202b, IFI204, and IFI207-IFI214—far exceeding the five found in humans (, ).

Pseudogenization also happens during PYHIN evolution. MNDA is a pseudogene in the African savanna elephant (lane 11), while AIM2 is pseudogenized in the hippopotamus (lane 15), and IFI203 in the mouse (lane 6).

Lineage-specific diversification further drives PYHIN evolution. MNDA-like expansions occur in Florida manatees and elephants (lanes 10-12), while PYHIN1 diversification is prominent in the ring-tailed lemur (lane 3). Novel IFI203-like genes in cetaceans exhibit modified domain structures, diverging from ancestral forms (lanes 13-14).

Even closely related species show marked variation. House and Ryukyu mice (lanes 6-7) and brown and black rats (lanes 8-9) have distinct PYHIN repertoire despite recent divergence. Similarly, African and Indian elephants (lanes 11-12) exhibit subtle but unique PYHIN arrangements.

These patterns of gene birth, expansion, contraction, and loss underscore the SPTA1-CADM3 region as an evolutionary hub for PYHIN innovation, with genomic plasticity constrained only by its conserved anchor genes.

Establishment and conservation of the SPTA1-CADM3 genomic framework

To explore the genomic context of PYHIN gene evolution, we examined the evolutionary history of its flanking genes, SPTA1 and CADM3. SPTA1 has a homology to SPTAN1 (Spectrin Alpha, Non-Erythrocytic), while CADM3 belongs to the immunoglobulin superfamily.

Our phylogenetic analysis (Figure 4A) shows that SPTA1 emerged in monotremes (e.g., Ornithorhynchus anatinus, platypus) and has persisted across all examined mammals. In contrast, CADM3 originated at the split between bony fish and cartilaginous fish (Figure 4B). The phylogenetic tree reveals a clear separation between the cartilaginous fish CADM protein coding sequences (e.g., epaulette shark and lesser devil ray) and the CADM3 orthologs found in fish and tetrapods (mammals, reptiles, and amphibians). We could not locate CADM3-like sequences in non-vertebrate lineages (data not shown). This topology, supported by high bootstrap values (>90%), indicates that CADM3 likely represents a vertebrate’s evolutionary innovation (Figure 4B).

Figure 4

While SPTA1 and CADM3 evolved along distinct paths, our data suggest they have remained on the same chromosome throughout evolution (Figure 3; Supplementary Tables 1, 2). The presence of linked SPTA1-CADM3 genes in monotremes (which lack PYHIN genes) and the complete PYHIN deletion in bats (while retaining SPTA1-CADM3 linkage) demonstrates that this genomic framework was established prior to PYHIN emergence. A few exceptions exist (Supplementary Table 2; bottom section); however, these are based on preliminary sequence data and do not have verified chromosome assignments. Based on data in Figures 3, 4, SPTA1 and CADM3 were established in proximity on the same chromosome before the divergence of Prototheria (monotremes) and Theria (marsupials and placental mammals).

Genomic distance and PYHIN retention in the SPTA1-CADM3 region

The distance between SPTA1 and CADM3 varies systematically across mammals and apparently correlates with PYHIN gene retention and deletion (Figure 3). To address this issue further, three groups of animals (rodents, cetaceans, and non-hominoid primates) were analyzed, and their genomes showed a strong correlation between SPTA1-CADM3 distance and PYHIN gene count (r = 0.89, p < 0.00001) (Figure 5A, black line). In addition, rodents, cetaceans, and non-hominoid primates also show positive correlation among themselves (Figure 5A, colored inserts). Hominoid data were omitted from the correlation analysis because they exhibit no variation in PYHIN gene counts, which would not contribute to assessing a correlative relationship with the SPTA1-CADM3 genomic distance. Monotremes, which lack PYHIN genes, have shorter SPTA1-CADM3 distances (~110kb vs. ~495kb in marsupials, p = 0.018). However, since sequence data are available for only two monotreme species, this conclusion should be interpreted with caution. Despite having distances similar to those of hominoids, marsupials retain only AIM2 (Figure 5B). Also, bats exhibit even shorter distances, reflecting complete PYHIN loss (Figure 5B). Overall, SPTA1-CADM3 distance correlates strongly with PYHIN content across mammals.

Figure 5

SPTA1-CADM3 genomic region in mammals exhibits parallel evolution of olfactory receptor (OR) gene families, reflecting species-specific ecological adaptations. (Figure 5C). These patterns underscore the evolutionary interplay between sensory and immune functions shaped by ecological pressures.

SPTA1-CADM3 is a minimal genomic boundary for co-evolving sensory and immune genes

CADM3 is frequently found adjacent to AIM2 across various species, with examples such as the southern two-toed sloth (Choloepus didactylus) showing overlapping transcripts (Gene ID: 119526496; updated 19-Nov-2025). Similarly, SPTA1 is intimately associated with OR genes. In pigs (Sus scrofa), compelling evidence of this integration is seen through overlapping transcripts between PYHIN and OR genes (Gene ID: 100154852, updated 1-Mar-2025), and also between some OR and SPTA1 transcripts (Gene ID: 100152068, updated 1-Mar-2025). All those observations support that the SPTA1-CADM3 genomic region serves as a crucial and minimal boundary encompassing the dynamic co-evolution of both PYHIN and OR genes.

Flight and inverted roosting strongly associated with lineage-specific loss of PYHIN genes

Flight has long been proposed as a major determinant of bat immune biology; however, this idea has not been subjected to rigorous statistical testing, particularly in a phylogenetic context that accounts for shared evolutionary history. In addition, inverted roosting, a near-universal behavior among bats, has received little attention as a potential immunological selective pressure.

To formally evaluate the contributions of these traits, we implemented a phylogenetically controlled comparative framework that jointly analyzes flight, inverted roosting, and other bat-associated ecological traits across mammals (Supplementary Table 4). By explicitly accounting for phylogenetic non-independence among species, we test whether these traits are statistically associated with variation in immune gene repertoire size.

We identify a robust and highly specific association between the evolution of flight and inverted roosting behavior and complete loss of the PYHIN immune gene family (Figure 6A). Both traits show strong positive associations with PYHIN deletion (slope = 8.4, adjusted p = 0.0002). In contrast, other ecological traits commonly associated with bats, including echolocation and hibernation, show no detectable relationship with PYHIN dynamics (echolocation: slope = 0.000334, adjusted p = 0.9998; hibernation: slope = 0.000040, adjusted p = 0.9998). Parallel analyses of OR gene loss within the same genomic region likewise show no association with flight or inverted roosting (both slope = 0.951, adjusted p = 0.627; Figure 6B). Echolocation, however, exhibits a statistically significant association with OR reduction (slope = 3.454, adjusted p = 0.0460), consistent with established sensory trade-offs in echolocating mammals (Supplementary Table 5). Together, our phylogenetic logistic regression identifies a strong statistical association between the evolution of flight and inverted roosting and complete loss of the PYHIN gene family; however, because these traits are largely restricted to the Chiroptera lineage, the individual contributions of trait-specific versus lineage-specific effects cannot be fully separated (Supplementary Tables 4, 5).

Figure 6

Discussion

Flight, inverted roosting, and DNA-sensing immune gene loss

Fossil evidence indicates that both flight and inverted roosting arose early in chiropteran evolution and have been maintained as an integrated biomechanical system (). Our phylogenetically controlled analyses identify flight and inverted roosting as the strongest statistical association factors of complete PYHIN loss across mammals (Figure 6). Other traits commonly invoked in discussions of bat immunity—such as echolocation or hibernation—show no association with PYHIN dynamics. Because all extant bats fly and almost all roost inverted, and because genomic data are lacking for the few exceptional bat species, our analyses cannot statistically disentangle the individual contributions of these two traits. Nonetheless, their tight evolutionary and functional coupling is consistent with the hypothesis that the ecological transition to powered flight, together with its associated inverted roosting, may have influenced the evolution of DNA-sensing immune genes.

From an immunological standpoint, chronic or inappropriate activation of PYHIN-dependent pathways under conditions of frequent DNA damage would be maladaptive. Persistent inflammasome activation can drive tissue injury, systemic inflammation, and reduced fitness. Complete deletion of the PYHIN gene family eliminates not only functional signaling but also the risk of leaky or dysregulated activation that can arise from partially functional or pseudogenized genes. Thus, PYHIN loss in bats may represent a genomic pattern consistent with reduced inflammatory liability under flight and inverted roosting-associated physiological constraints.

Loss of DNA-sensing immune genes in bats suggesting potential immune tolerance

Bats are divided into two major suborders, Yinpterochiroptera and Yangochiroptera, which diverged early in chiropteran evolution. The absence of PYHIN genes in both lineages indicates that this loss occurred in their common ancestor prior to this split, supporting the view that bat immune system evolution reflects a shared ecological adaptation rather than enhanced antiviral specialization. The loss of PYHIN genes provides a mechanistic explanation for several hallmark features of bat immunity. Bats are well known for their ability to tolerate viral infections that are highly pathogenic in other mammals, including humans (). Reduced DNA-triggered inflammasome signaling may contribute to the observed viral tolerance by dampening excessive inflammation and immunopathology. Importantly, this does not imply superior or broadly enhanced immunity. Rather, bats appear to have evolved a genomic immune profile consistent with minimizing inflammatory self-damage in the context of extreme metabolic and biomechanical demands.

This specialization entails trade-offs. While reduced inflammasome activity may favor viral tolerance, it may also compromise defenses against certain pathogens. The high susceptibility of bats to fungal pathogens such as Pseudogymnoascus destructans, the causative agent of white-nose syndrome, is consistent with this view (, ). Thus, bat immunity is neither “super” nor deficient; it is uniquely tuned to the ecological realities of flight and inverted roosting.

Anchored pulsatile evolution as a framework for immune gene remodeling

Having established the immunological and ecological context, our data support a broader evolutionary framework to explain how such radical immune remodeling can occur. The PYHIN genes are embedded within a conserved genomic interval flanked by SPTA1 and CADM3, which remains intact across mammals regardless of PYHIN content. Within this stable framework, the PYHIN family undergoes coordinated expansion, contraction, and deletion—a pattern we describe as the Anchored Gene Cluster Pulsation Hypothesis (Figure 7).

Figure 7

).

Multiple independent lines of evidence support this hypothesis. First, unlike classical birth–death models, which assume continuous, gene-by-gene duplication and loss across the genome, PYHIN evolution is constrained within a stable SPTA1–CADM3 genomic framework that persists across mammals. This interval shows stepwise assembly—CADM3 arising early in vertebrates (420–450 MYA), SPTA1 emerging later in mammals (166–190 MYA), and the full PYHIN cassette appearing in therians (160–180 MYA)—followed by coordinated expansion and contraction (Figure 7). Second, across multiple mammalian lineages, including rodents, cetaceans, and non-hominoid primates, the physical distance between SPTA1 and CADM3 strongly correlates with PYHIN gene copy number, consistent with cassette-level gain and loss rather than scattered gene turnover (Figure 5). Third, lineage comparisons further support discrete pulses of evolution. Monotremes exhibit a compact interval lacking PYHIN genes; however, this absence reflects their divergence prior to the origin of the PYHIN gene family and therefore represents a failure to acquire, rather than gene loss. In contrast, bats inherited PYHIN genes from a therian ancestor and subsequently lost the entire gene family, representing a true case of complete secondary loss. This distinction is critical for interpreting evolutionary patterns across mammals, as similar absence states can arise through fundamentally different processes—ancestral absence versus lineage-specific deletion (Figure 7). Fourth and finally, modularity within the region—exemplified by independent retention or loss of PYHIN and olfactory receptor genes—demonstrates that this anchored architecture permits selective, pulse-like remodeling of functionally related gene families without destabilizing neighboring loci (Figures 3, 5).

Although PYHIN genes show dynamic gain and loss across mammals, the observed patterns are not fully explained by a simple birth–death model. The strict confinement of PYHIN genes within the SPTA1–CADM3 interval, the strong correlation between interval size and gene number, and the coordinated, complete deletion in bats collectively indicate cassette-level remodeling. We therefore propose that the Anchored Gene Cluster Pulsation model provides the most parsimonious explanation for these patterns, while incorporating local stochastic processes within a constrained genomic framework.

Importantly, this framework explains how bats could completely delete a major immune-sensing pathway while retaining neighboring gene families, including olfactory receptors. The specificity of PYHIN loss argues against generalized genome contraction and instead highlights targeted immune remodeling driven by selection against inflammatory cost rather than loss of immune competence.

Implications for comparative and evolutionary immunology

Together, our findings place bat immunity within a broader immunological principle: immune systems evolve under competing pressures of protection and self-damage. In bats, the balance has shifted decisively toward tolerance, facilitated by the complete removal of DNA-sensing inflammasome pathways. The Anchored Gene Cluster Pulsation Hypothesis provides a mechanistic explanation for how such shifts can occur rapidly and coordinately at the genomic level, in which genes undergo episodic expansion and contraction within fixed genomic boundaries. This organization allows immune gene content to change rapidly in response to ecological pressures while preserving overall genomic structure (Figures 3, 7). Our research underscores the importance of ecological context in interpreting immune gene repertoires. Apparent immune “losses” may represent adaptive refinements rather than deficiencies. More broadly, our study highlights how genome architecture can enable large-scale immune innovation, allowing mammals to explore diverse ecological niches while maintaining overall genomic stability.

Materials and methods

Searching of PYHIN, CADM3, and SPTA1 Protein-Coding Sequences

The coding sequences of the human PYHINs, CADM3, and SPTA1 genes were used as the queries (see Supplementary Table 1 for the accession numbers). BLASTP protein similarity search es were performed against the nonredundant protein database at the National Center for Biotechnology Information (NCBI) with default options. When several isoforms were available for a gene, one isoform whose protein-coding sequence was most like the human ortholog was selected. Protein sequences that were identical or partial (too short) were excluded. Subfamily classification of proteins is done based on the phylogenetic placement regardless of the annotation given in the NCBI database. In case of ambiguity, a reciprocal BLASTP similarity search was performed further.

Phylogenetic analysis and classification of protein sequences

Multiple sequence alignment of the protein sequences was performed using MAFFT (ver. 7.526) with the E‐INS‐i iterative refinement method. The maximum likelihood (ML) phylogenetic analysis was performed using IQ‐TREE (ver. 1.6.12) with the options for automatic model selection and ultrafast bootstrap analysis and SH‐aLRT branch test for branch support analysis (both with 1000 replicates) (, ). The visualization of the phylogenies was performed using the Interactive Tree of Life website (ver. 6.9.1) ().

Comparative genomic analysis of the SPTA1-CADM3 genomic region

Genomic sequences containing SPTA1 and CADM3 genes were collected from various vertebrate species. Complete genome assemblies were accessed from NCBI public databases. SPTA1 and CADM3 genes were identified as conserved anchor points across all examined species. All genes located between these two anchors were systematically identified based on NCBI annotation.

PYHIN family genes (AIM2, IFI16, MNDA, PYHIN1, POP3) and their homologs were characterized through sequence similarity searches and phylogenetic analysis. Pseudogenes were identified and labeled (marked with “ps” in the figure). Genes were classified by family relationships (indicated by color coding in Figure 3). Orthologous relationships between genes were established using sequence similarity. Paralogs specific to certain lineages (particularly in rodents) were identified through comparative genomic analysis.

The orientation of each gene was determined using NCBI transcription information (shown by arrows in Figure 3). The relative positions of genes were mapped within the SPTA1-CADM3 region for each species. The genomic distance between SPTA1 and CADM3 was calculated for each species (shown in Mb in the right column of Figure 3). The OR genes are analyzed with the same procedure.

Ecological classification

Mammalian lineages were classified into major ecological niches (aquatic, semi-aquatic, terrestrial/arboreal, aerial) based on habitat use, locomotor adaptations, and behavioral data compiled from published natural history references and primary literature sources. Aquatic species (e.g., cetaceans) are fully water-bound; semi-aquatic species (e.g., pinnipeds and monotremes) divide their time between aquatic and terrestrial environments; terrestrial/arboreal species inhabit land or trees; and aerial species (e.g., bats) are adapted for powered flight.

Ecological and behavioral trait data used in downstream analyses—including flight, echolocation, hibernation, and inverted roosting—were systematically compiled from the literature and are fully documented with source references in Supplementary Table 4. For each species, trait assignments were based on consensus descriptions from authoritative sources, including classic mammalian compendia () and additional primary studies where necessary.

Gene-specific data, including PYHIN gene presence/absence and annotation, were obtained from publicly available genome assemblies and curated databases (NCBI RefSeq and GenBank). All data sources for both ecological traits and genomic annotations are explicitly listed in Supplementary Table 4 to ensure transparency and reproducibility.

Mammalian lineages were classified into major ecological niches (aquatic, semi-aquatic, terrestrial/arboreal, aerial) based on habitat use, locomotor adaptations, and behavioral data from published sources (). Aquatic species (e.g., cetaceans) are fully water-bound; semi-aquatic species (e.g., some pinnipeds, monotremes) split time between land and water; terrestrial/arboreal species inhabit land or trees; and aerial species (e.g., bats) are adapted for flight. Social complexity was categorized using published data on group size, social structure, and interaction rates. Lineages with diverse ecological traits (e.g., rodents) were labeled as occupying “varied niches.”

Statistical calculation

Pearson correlation coefficients were calculated with Microsoft Excel, and the statistical significance (p-values) of these coefficients was determined using online calculators for Pearson’s correlation, such as the one available at https://www.socscistatistics.com/pvalues/pearsondistribution.aspx. For analyzing differences between two groups, the Mann-Whitney U Test Calculator (https://www.socscistatistics.com/tests/mannwhitney/default2.aspx) was used for two tailed hypothesis. Additionally, permutation tests were conducted using the perm.test function in R. The standard threshold of p < 0.05 was considered statistically significant for all analyses. We used the Phylogenetic Generalized Least Squares (PGLS) framework in ape and nlme packages in R to include phylogenetic dependence in the correlation tests and Mann-Whitney U tests via the species tree obtained through the Open Tree of Life ().

A reconciled gene and species tree was created using GeneRax (version 2.1.3, default parameters). Seventy-one species were selected for the gene tree and pruned from the mammalian species tree of Upham et al. () accessed through the Open Tree of Life.

Binary response variables of PYHIN gene loss or OR gene loss were tested against behavioral traits of flight, echolocation, hibernation, and head-down roosting. Trait and gene-loss datasets were analyzed using Firth’s phylogenetic logistic regression, using the phyloglm() function in the phylolm R package in R (version 2.6.5) under default parameters with 1000 bootstrap replicates. Each trait was tested in a separate model for each response variable. Adjusted p-values were computed using the Benjamini-Hochberg (BH) false discovery rate (FDR) method in the p.adjust() function. All sequence-based analyses in this study were conducted at the amino acid level unless otherwise specified.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

CZi: Investigation, Visualization, Validation, Formal analysis, Data curation, Writing – review & editing. MS: Data curation, Visualization, Formal analysis, Validation, Writing – review & editing, Investigation. AS: Writing – review & editing, Investigation, Formal analysis, Data curation, Validation. CZh: Formal analysis, Validation, Writing – review & editing. LZ: Resources, Validation, Funding acquisition, Formal analysis, Supervision, Project administration, Data curation, Visualization, Investigation, Conceptualization, Writing – review & editing, Writing – original draft, Methodology.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgments

CZ was partially supported by the Mariska Family Foundation. MS and AS were supported by USERVE. We would like to thank UNL Bioinformatics Core Facility for phylogenomic calculations, and Greta Heit, Daniella Zastrow, and Amina Teri for their assistance in preparing and verifying the data.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2026.1791604/full#supplementary-material

Supplementary Figure 1

Phylogenetic relationships among POP3/PYDC5 protein families. POP3 nomenclature is used for primate PYDC5 proteins due to its high relation to the Human POP3 protein. The maximum likelihood phylogeny was reconstructed using protein sequences from representative vertebrate species. Internal nodes supported by 80% or higher in both ultrafast bootstrap and SH-aLRT branch tests are marked by gray dots, with dot size corresponding to ultrafast bootstrap values (80%−100%). The tree scale represents 0.1 substitutions per site. The sequences used to construct the tree are listed in Supplemental Supplementary Table 2.

References

  • 1

    SchattgenSAFitzgeraldKA. The PYHIN protein family as mediators of host defenses. Immunol Rev. (2011) 243:109–18. doi: 10.1111/j.1600-065x.2011.01053.x. PMID:

  • 2

    FanXJiaoLJinT. Activation and immune regulation mechanisms of PYHIN family during microbial infection. Front Microbiol. (2021) 12:809412. doi: 10.3389/fmicb.2021.809412. PMID:

  • 3

    KhareSRatsimandresyRAde AlmeidaLCudaCMRellickSLMisharinAVet al. The PYRIN domain–only protein POP3 inhibits ALR inflammasomes and regulates responses to infection with DNA viruses. Nat Immunol. (2014) 15:343–53. doi: 10.1038/ni.2829. PMID:

  • 4

    ChoubeyDDuanXDickersonEPonomarevaLPanchanathanRShenHet al. Interferon-inducible p200-family proteins as novel sensors of cytoplasmic DNA: Role in inflammation and autoimmunity. J Interferon Cytokine Res. (2010) 30:371–80. doi: 10.1089/jir.2009.0096. PMID:

  • 5

    ZhangYTianQDuYCaoHLengyelPKongW. Multiple splicing results in at least two p203 proteins that are expressed in the liver and down-regulated during liver regeneration. FBL. (2008) 13:2444–51. doi: 10.2741/2857. PMID:

  • 6

    AhnMCuiJIrvingATWangL-F. Unique loss of the PYHIN gene family in bats amongst mammals: Implications for inflammasome sensing. Sci Rep. (2016) 6:21722. doi: 10.1038/srep21722. PMID:

  • 7

    CridlandJACurleyEZWykesMNSchroderKSweetMJRobertsTLet al. The mammalian PYHIN gene family: Phylogeny, evolution and expression. BMC Evol Biol. (2012) 12:140. doi: 10.1186/1471-2148-12-140. PMID:

  • 8

    CarterAMMessAM. The evolution of fetal membranes and placentation in carnivores and ungulates (Ferungulata). Anim Reprod. (2017) 14:124–35. doi: 10.21451/1984-3143-ar903

  • 9

    DouadyCJChatelierPIMadsenOde JongWWCatzeflisFSpringerMSet al. Molecular phylogenetic evidence confirming the Eulipotyphla concept and in support of hedgehogs as the sister group to shrews. Mol Phylogenet. Evol. (2002) 25:200–9. doi: 10.1016/s1055-7903(02)00232-4. PMID:

  • 10

    FoleyNMSpringerMSTeelingEC. Mammal madness: Is the mammal tree of life not yet resolved? Philos Trans R Soc B. Biol Sci. (2016) 371:20150140. doi: 10.1098/rstb.2015.0140. PMID:

  • 11

    MessACarterAM. Evolutionary transformations of fetal membrane characters in Eutheria with special reference to Afrotheria. J Exp Zool. Part B. (2006) 306B:140–63. doi: 10.1002/jez.b.21079. PMID:

  • 12

    XiaBZhangWZhaoGZhangXBaiJBroshRet al. On the genetic basis of tail-loss evolution in humans and apes. Nature. (2024) 626:1042–8. doi: 10.1038/s41586-024-07095-8. PMID:

  • 13

    Erdemci-EvinSBossoMKrchlikovaVBayerWRegensburgerKMayerMet al. A variety of mouse PYHIN proteins restrict murine and human retroviruses. Viruses. (2024) 16:493. doi: 10.3390/v16040493. PMID:

  • 14

    BoermaDBSwartzSM. Roosting ecology drives the evolution of diverse bat landing maneuvers. iScience. (2024) 27:110381. doi: 10.1016/j.isci.2024.110381. PMID:

  • 15

    OrkneyABoermaDBHedrickBP. Evolutionary integration of forelimb and hindlimb proportions within the bat wing membrane inhibits ecological adaptation. Nat Ecol Evol. (2025) 9:111–23. doi: 10.1038/s41559-024-02572-9. PMID:

  • 16

    SimmonsNGunnellG. Fossil evidence and the origin of bats. J Mamm. Evol. (2005) 12:209–46. doi: 10.1007/s10914-005-6945-2. PMID:

  • 17

    SimmonsNBSeymourKLHabersetzerJGunnellGF. Primitive early Eocene bat from Wyoming and the evolution of flight and echolocation. Nature. (2008) 451:818–21. doi: 10.1038/nature06549. PMID:

  • 18

    ChandraAKesavardhanaS. PANoptosis regulation in reservoir hosts of zoonotic viruses. Viruses. (2024) 16. doi: 10.3390/v16111733. PMID:

  • 19

    DasSJainDChaudharyPQuintela-TizonRMBanerjeeAKesavardhanaS. Bat adaptations in inflammation and cell death regulation contribute to viral tolerance. mBio. (2025) 16:e0320423. doi: 10.1128/mbio.03204-23. PMID:

  • 20

    DemianWLCormierOMossmanK. Immunological features of bats: Resistance and tolerance to emerging viruses. Trends Immunol. (2024) 45:198210. doi: 10.1016/j.it.2024.01.008. PMID:

  • 21

    IrvingATAhnMGohGAndersonDEWangLF. Lessons from the host defences of bats, a unique viral reservoir. Nature. (2021) 589:363–70. doi: 10.1038/s41586-020-03128-0. PMID:

  • 22

    CampanaMGKurataNPFosterJTHelgenLEReederDMFleischerRCet al. White-nose syndrome fungus in a 1918 bat specimen from France. Emerg Infect Dis. (2017) 23:1611–2. doi: 10.3201/eid2309.170875. PMID:

  • 23

    ZhelyazkovaVLFischerNMPuechmailleSJ. Bat white-nose disease fungus diversity in time and space. Biodivers. Data J. (2024) 12:e109848. doi: 10.3897/bdj.12.e109848. PMID:

  • 24

    JonesJNowakRParadisoJ. Walker's mammals of the world. J Mammal. (1984) 65:171.

  • 25

    SolariSBakerRJ. Mammal species of the world: A taxonomic and geographic reference by D. E. Wilson; D. M. Reeder. J Mammal. (2007) 88:824–30. doi: 10.1644/06-mamm-r-422.1. PMID:

  • 26

    SullivanJ. Mammalogy. 6th ed. J Mammal. (2015) 96:246–8. doi: 10.1093/jmammal/gyu018

  • 27

    KalyaanamoorthySMinhBQWongTKFvon HaeselerAJermiinLS. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods. (2017) 14:587–9. doi: 10.1038/nmeth.4285. PMID:

  • 28

    NguyenL-TSchmidtHAvon HaeselerAMinhBQ. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. (2014) 32:268–74. doi: 10.1093/molbev/msu300. PMID:

  • 29

    LetunicIBorkP. Interactive Tree of Life (iTOL) v6: Recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. (2024) 52:W78–82. doi: 10.1093/nar/gkae268. PMID:

  • 30

    UphamNSEsselstynJAJetzW. Inferring the mammal tree: Species-level sets of phylogenies for questions in ecology, evolution, and conservation. PloS Biol. (2019) 17:e3000494. doi: 10.1371/journal.pbio.3000494. PMID:

Summary

Keywords

AIM2, bats, flight, IFI16, innate immunity, inverted roosting, PYHIN

Citation

Ziemann C, Streicher M, Sicks A, Zhang C and Zhang L (2026) Statistical association of complete PYHIN gene family loss with flight and inverted roosting in bats. Front. Immunol. 17:1791604. doi: 10.3389/fimmu.2026.1791604

Received

19 January 2026

Revised

11 May 2026

Accepted

11 May 2026

Published

28 May 2026

Volume

17 - 2026

Edited by

Anna J Jasinska, University of Pittsburgh, United States

Reviewed by

Daniel Becker, University of Oklahoma, United States

Gunnaporn Suriyaphol, Chulalongkorn University, Thailand

Updates

Copyright

*Correspondence: Luwen Zhang, ;,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics