Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Bioinform., 07 August 2025

Sec. Evolutionary Bioinformatics

Volume 5 - 2025 | https://doi.org/10.3389/fbinf.2025.1563786

This article is part of the Research TopicCompleting the Timetree of LifeView all 6 articles

Internal fossil constraints have more effect on the age estimates of crown Palaeognathae than different phylogenomic data type

  • 1Department of Zoology, Biology Institute, Rio de Janeiro State University - UERJ, Rio de Janeiro, Brazil
  • 2Laboratory of Life Sciences, Faculty of Medicine, Kagawa University, Kagawa, Japan

Palaeognathae is an ancient bird lineage that includes the volant tinamous and six flightless lineages: ostrich, rhea, cassowary, emu, kiwi (extant) and moa, elephant bird (extinct). Over the past decade, a consensus has emerged on the relationships within the group. In this consensus, the ostrich branch splits first, followed by rheas, a clade containing tinamou and moa and a clade with the emu and cassowary sister to the kiwi and elephant bird. However, the timing of the origin of these major clades remains uncertain. In phylogenomic studies, the origin of the crown Palaeognathae is typically dated to the K–Pg boundary (∼66 Ma), though one study suggested a younger Early Eocene age (∼51 Ma). This discrepancy might result from the number and position of fossil priors (calibration strategies) or by differences in genomic regions sampled (data types). We investigated the impact of calibration strategies and data types on the timing of the Palaeognathae root using genomic sequences from nuclear (noncoding [CNEE and UCE] and coding [first and second codon positions]) and mitogenomic datasets. The nuclear dataset included 14 Palaeognathae species (13 extant and the extinct moa), while the mitogenomic included 31 species, covering all extant and extinct lineages. The datasets were analyzed with and without internal calibrations. The age estimates were more influenced by calibration strategy than data type, although some nuclear data (CNEE) produced substantially younger ages except for the Casuariiformes node, whilst another dataset (PRM) from a previous study estimated younger ages for Casuariiformes compared to the other datasets. Nevertheless, our results consistently placed the origin of crown Palaeognathae around the K–Pg boundary (62–68 Ma), even when using the original dataset that produced the Eocene age. These findings demonstrate that multiple internal calibrations yield consistent results across different sequence types and taxon schemes, providing robust estimates of the crown Palaeognathae age. This improved timing enhances our understanding of the early evolutionary history of this clade, particularly regarding the placement of enigmatic Paleocene fossils, such as Lithornithidae and Diogenornis, which in this timeframe can be assigned to internal branches within the crown Palaeognathae.

1 Introduction

Geological time forms the backdrop of biological diversification in Evolutionary Theory. Until 60 years ago, the temporal study of biodiversity was limited to fossils, but with the introduction of the molecular clock hypothesis, the concept of an evolutionary timescale was expanded to encompass both hereditary material and the peptide products it encodes (Kumar, 2005). This enabled the reconstruction of the tree of life with unprecedented detail, as living branches became sources of chronological data. As one of the most studied animal groups, birds significantly contribute to our understanding of biodiversity, including genomics, ecology, and biogeography (Jarvis et al., 2014; Zhang et al., 2014b; 2014a; Claramunt and Cracraft, 2015; Stiller et al., 2024). With over ten thousand species, living birds (Neornithes) combine a rich genetic diversity with a significant fossil record that extends to the Late Cretaceous (Mayr, 2016a), making them a celebrated branch in the efforts to assemble the time tree of life.

Morphological and genomic data split the Neornithes between the Neognathae and the Palaeognathae clades (Livezey and Zusi, 2007; Prum et al., 2015). Although palaeognaths comprise less than 1% of neornithine species, they include the largest and heaviest birds to ever live (Crouch and Clarke, 2019). Featuring gigantism, palaeognaths encompass several flightless, continentally endemic lineages from the Southern Hemisphere. The unique phylogenetic, morphological, and biogeographic traits make palaeognaths a key group for understanding early neornithine evolution in space, form, and time (Harshman et al., 2008; Widrig and Field, 2022). Most nuclear and mitogenomic data support an early divergence of ostriches (Struthioniformes) followed by rheas (Rheiformes). The tinamous (Tinamiformes) plus the recently extinct moas (Dinornithiformes) split next and the remaining palaeognaths form a clade comprising emus and cassowaries (Casuariiformes) sister to kiwis (Apterygiformes) and the recently extinct elephant birds (Aepyornithiformes) (Figure 1a; Mitchell et al., 2014; Yonezawa et al., 2017; Takezaki, 2023).

Figure 1
Phylogenetic diagrams illustrating bird diversification. Panel (a) shows a cladogram of Palaeognathae with species silhouettes and a timeline from the Cretaceous to Neogene periods. Panel (b) presents a timeline with studies on bird evolution, highlighting calibration data from Cretaceous to Neogene, with references to various studies and calibration types indicated. Geological time scales are color-coded, showing the transitions between the Cretaceous, Paleogene, and Neogene periods.

Figure 1. Divergence times estimated for the crown Palaeognathae root using phylogenomic data. (a) Blue line indicates the mean ages distributed along a unified confidence interval for the Palaeognathae root (see also Table 1). The gray color on the vertical line at the Neornithine root highlights the age uncertainty for that node. Branch lengths within Palaeognathae display relationships among the major clades and do not reflect divergence times. Abbreviations indicate data type: mtg, mitogenomic; nu, our main nuclear dataset; prm, nuclear dataset that estimated the Eocene age for crown Palaeognaths. Symbols are as follows: *, this study; †, PRM dataset without ingroup Palaeognathae fossil calibrations. Geologic Time units in Million years ago (Ma). (b) Confidence intervals of each individual study for the estimated age of the crown Palaeognathae root for details, see Table 1.

Yet, a clear-cut picture on palaeognaths early evolution remains tentative as phylogenomic studies disagree on the age of the most recent common ancestor of extant (crown group) Palaeognathae. The most common method to estimate the divergence times between lineages using molecular data is the Bayesian relaxed clock (dos Reis et al., 2015). This method adequately accommodates the uncertainty in both evolutionary rates and fossil dates, which are incorporated as prior probability distributions [calibration priors; see Glossary]. The information in the observed data and the priors are integrated to convert the sequence differences into absolute geological times (Yang, 2006; Ho and Phillips, 2009). Hence, the selection of fossils as calibration priors must follow rigorous criteria (Parham et al., 2012) so the resulting time tree is robust and consistent across datasets. Virtually all phylogenomic studies with different gene and taxon coverage agree that the crown Palaeognathae split occurred between the Late Cretaceous and the Earliest Paleogene (K–Pg age) (Figure 1b; Jarvis et al., 2014; Mitchell et al., 2014; Claramunt and Cracraft, 2015; Grealy et al., 2017; Yonezawa et al., 2017; Stiller et al., 2024; Wu et al., 2024).

However, one study (Prum et al., 2015) deviates markedly by suggesting an Early Eocene age for crown palaeognaths around 51 Ma (Figure 1). Although that study followed rigorous criteria for fossil prior selection, a striking pattern is observed in the number and position of fossil priors [calibration strategies] (Table 1). In the calibration strategy that resulted in the Eocene age, all fossil-based priors were restricted to the Neognathae clade (Prum et al., 2015). In contrast, the K–Pg age is consistently recovered in studies that included at least a fossil-based calibration at the neornithine root, and most studies included at least one calibration within Palaeognathae, whereas none was used to generate the Eocene age (Table 1). The impact of calibration strategies in Bayesian molecular dating has been extensively evaluated. Simulated and empirical data show that calibration priors at or near the root produce low error and high precision (Duchêne et al., 2014; Mello and Schrago, 2014; Carruthers et al., 2020). Those studies also demonstrated that the lack of fossil priors on deep nodes produces undesirable effects such as inconsistent and unrealistic age estimates. The Eocene age was obtained with no fossil priors to any of the three deepest avian nodes, namely, Neornithes, Galloanseres and Neoaves. Thus, as the common ancestor of Palaeognathae represents a deep node, the exclusion of time priors for this clade and for the neornithine root may have biased divergence time estimates, potentially leading to an underestimation of its divergence time. Alternatively, as taxonomic and marker sampling varied extensively across previous studies (Table 1), the resulting discrepancies in divergence time estimates may reflect differences in evolutionary rates across lineages, genes, or genomic regions.

Table 1
www.frontiersin.org

Table 1. Phylogenomic studies that estimated the divergence times of Palaognathae lineages. Number of loci are given for mitochondrial individual genes (mt), complete or nearly complete mitogenomes (mtg) and nuclear (nu).

We investigated if the conflict between the K–Pg and the Eocene age of crown Palaeognathae was caused by differences in fossil calibration strategies or phylogenomic data type. We used complete mitogenomes that included living and extinct species, resulting in the most comprehensive mitogenomic time tree estimated for Palaeognaths to date (Figure 1; Table 1). The nuclear dataset comprises over 10 million base pairs (bp) of non-recombinant loci from various genomic regions across all extant lineages and the extinct moa (Takezaki, 2023), and is used here for the first time to estimate divergence times. We tested whether a dataset tailored to resolve palaeognath relationships could also produce robust divergence time estimates under varying loci and fossil calibration strategies. We also reanalyzed the dataset of Prum et al. (2015) using different fossil calibration strategies and discussed how these affect our main results.

2 Methodological details

2.1 Molecular data assembly

We assembled a wide diversity of molecular markers from previously published studies. All species names from each study were updated according to the International Ornithological Committee v. 14.2 (Gill et al., 2024). The markers represent four distinct classes of genomic markers, namely, conserved non-exonic elements, coding sequences, ultraconserved sequences, and mitochondrial genomes. Each dataset assembly is detailed next and summarized in Table 1.

First, we created a mitogenomic dataset [MTG] that consisted of complete or nearly complete mitochondrial genomes available at NCBI (https://www.ncbi.nlm.nih.gov/genbank/). Priority was given for RefSeq sequences and the longest sequence with the least number of undetermined bases. We extracted and concatenated all 13 protein coding genes and the ribosomal 12S and 16S genes, resulting in 31 Palaeognathae species and 14,307 bp. The species and accession numbers of all mitogenomes used in this study were Aepyornis hildebrandti [KJ749824], Aepyornis maximus [OP413809], Anomalopteryx didiformis [MK778441], Apteryx australis [MN356385], Apteryx haastii [NC_002782], Apteryx mantelli [AY016010], Apteryx owenii [GU071052], Apteryx rowi [MN998652], Casuarius bennetti [AY016011], Casuarius casuarius [MN356153], Crypturellus cinnamomeus [NC_052825], Crypturellus soui [MN356154], Crypturellus tataupa [AY016012], Crypturellus undulatus [NC_052774], Dinornis giganteus [AY016013], Dromaius baudinianus [NC_045365], Dromaius novaehollandiae [MN356172], Emeus crassus [AY016015], Eudromia elegans [AF338710], Mullerornis agilis [KJ749825], Mullerornis modestus [OP413795], Nothocercus julius [MN356379], Nothocercus nigrocapillus [MN356380], Nothoprocta ornata [MN356381], Nothoprocta pentlandii [MN356382], Nothoprocta perdicaria [MN356428], Rhea americana [NC_000846], Rhea pennata [NC_002783], Struthio camelus [AF338715], Tinamus guttatus [MN356150], Tinamus major [NC_002781] (Härlid et al., 1998; Cooper et al., 2001; Haddrath and Baker, 2001; Phillips et al., 2010; Mitchell et al., 2014; Cibois et al., 2020; Feng et al., 2020; Lan and Xu, 2020; Grealy et al., 2023; Edwards et al., 2024). The RefSeq mitogenome of the chicken Gallus gallus [NC_040902] (Miao et al., 2013) was used as outgroup.

We used a nuclear dataset from Takezaki (2023) [TKZ]. This dataset includes all extant (ostrich, rhea, cassowary, emu, kiwi and tinamou) and one extinct (moa) lineages and comprises the noncoding data originally from Cloutier et al. (2019) and the coding data from Sackton et al. (2019). However, extensive gene tree heterogeneity was observed in the original datasets (Cloutier et al., 2019; Sackton et al., 2019), suggesting potential bias from estimation errors or other misrepresentative evolutionary signals. To mitigate this, loci likely to introduce such biases were identified and filtered following the criteria outlined below (Takezaki, 2023). First, loci were excluded if the branch lengths within palaeognaths in their gene trees were more than five times longer than those in the concatenated sequence tree. Loci showing significant evidence of positive selection or recombination between palaeognath lineages and Gallus were also removed. Importantly, during the filtering process, no species were excluded, and no tip pruning was applied, ensuring all species were retained as in the original datasets. By retaining all species and focusing on molecular data types, we ensured that the filtered loci had more homogeneous composition and substitution rate variation, thus low probability of saturation, while preserving all palaeognath nodes necessary for reliable divergence time estimation. The filtered datasets yielded a robust topology with significantly reduced sequence heterogeneity and were used as the primary nuclear genomic source for estimating divergence times in palaeognaths. Noncoding data (conserved non-exonic elements: CNEEs, 12,561 loci and ultraconserved elements: UCEs 1,363 loci) contained 14 Palaeognathae species and coding data (the first and second codon positions: C12, 5,374 loci) for 13 Palaeognathae species were used. The third codon positions and intron data were excluded from the analyses as they have the highest GC content and long branch lengths compared to other data (Takezaki, 2023). All 19,298 loci were concatenated into a supermatrix of 15,610,067 bp and cleaned of poorly aligned sites and highly divergent regions using Gblocks (Castresana, 2000), resulting in our final TKZ matrix with 11,187,881 bp. Finally, we reanalyzed the original data from Prum et al. (2015) [PRM], which consisted of nine Palaeognathae species and 259 coding and non-coding regions (around 400 Kbp) obtained from anchored hybrid enrichment (Lemmon et al., 2012). The alignments generated in this study (MTG and TKZ) including all species sampled for each dataset are available as Supplementary Material S1.

2.2 Fossil specimens

We selected five fossil specimens that rigorously fit the criteria to be used as calibration priors. We opted for the vouchered oldest specimens with unambiguous locality and stratigraphy, and an apomorphy-based diagnosis that is consistent between morphological and molecular datasets in phylogenetic context (Parham et al., 2012). Data for all five fossil specimens, including stratigraphic age, phylogenetic placement justification, and references, are detailed in Table 2 and summarized below.

Table 2
www.frontiersin.org

Table 2. Fossil specimens with phylogenetic justifications used for estimation of divergence times. Age and age uncertainty probabilities (minimum and maximum soft bounds) and prior configuration as used in MCMCTree.

The stem-galloanserine Asteriornis maastrichtensis from the Latest Cretaceous Europe (Field et al., 2020) calibrated the root of our tree. As the skull and legs of A. maastrichtensis share exclusive derived characters with Galliformes and other Neognathae (Crane et al., 2024), it provides a minimum age for the split Neognathae and Palaeognathae split around 66 Ma. The soft upper bound was set at 86.5 Ma, which represents rich avian fossil deposits that completely lack neornithine representatives (Prum et al., 2015).

The remaining four fossil specimens calibrated nodes within Palaeognathae. Palaeophasianus meleagroides from Eocene North America (Wetmore, 1933) is among the oldest representative of Geranoididae, a family of long-legged birds sharing derived characters with the younger European Palaeotididae and the Asian Eogruidae and Ergilornithidae (Mayr, 2016b; 2019; 2022). As those three families are consistently placed on the ostrich stem, Palaeophasianus meleagroides provides a minimum age for the Palaeognathae root. The only known older stem palaeognath is Diogenornis fragilis, a specimen from Paleocene South America with uncertain affinities (Noriega et al., 2017; Mayr, 2022).

The second Palaeognathae internal calibration was provided by Opisthodactylus horacioperezi from Miocene South America (Agnolin and Chafrat, 2015). This oldest crown Rheidae genus shares with the extant Pterocnemia a derived hindlimb morphology (Agnolin and Chafrat, 2015; Noriega et al., 2017). This fossil prior was set between the two extant Rheidae genera Rhea and Pterocnemia.

The third internal palaeognath calibration was the fragmentary coracoid MACN-SC-3610 from Miocene South America representing the oldest unambiguous crown Tinamidae. Despite its fragmentary condition, cladistic analyses confidently place the specimen within Tinamidae as sister to the genus Crypturellus. This placement is supported by unique derived characters (Bertelli, 2017; Almeida et al., 2021). Other specimens found in the same age and locality have an unstable position (Bertelli et al., 2014). We used MACN-SC-3610 to calibrate the split between Crypturellus and its sister genus Tinamus.

Finally, the last internal palaeognath calibration was Emuarius gidju from Miocene Australia (Boles, 1992; 2001). Cladistic analyses using 25 ingroup taxa and 179 osteological characters firmly placed E. gidju within crown Casuariiformes supported by 13 unambiguous apomorphies distributed in several skeletal elements including the skull, sternum, synsacrum and femur (Worthy et al., 2014).

2.3 Divergence time estimates and fossil calibration strategies

The divergence times were estimated separately for the MTG, TKZ and PRM datasets using the PAML 4.10 package (Yang, 2007). The mean substitution rate estimated with BASEML was set as the Dirichlet-gamma prior. The absolute divergence times were estimated with the Bayesian program MCMCTree using a time unit of 1 million years and the uncorrelated relaxed clock under the GTR model with alpha = 0.5 and five Gamma categories. The parameters for birth, death, and sampling were 0.1. The σ2 prior was set to G (1, 10) for the MTG dataset, indicating moderate violation of the clock, whereas it was G (1, 0.1) for all the nuclear datasets indicating a higher violation of the clock as expected with thousands of independently evolving markers. Branch lengths and model parameters were estimated with the approximate likelihood method, designed to handle large phylogenomic data (Reis and Yang, 2011).

Substitution saturation in the MTG alignment was assessed using DAMBE7 (Xia, 2018). The proportion of invariant sites under a Poisson + invariant distribution model was estimated and added as a parameter of the substitution saturation test for all fully resolved sites (Xia et al., 2003; Xia and Lemey, 2009). Under default settings, saturation is likely if the observed index of substitution saturation (Iss) exceeds the critical threshold (Iss.c). For a symmetrical tree topology, such as the consensus phylogeny of Palaeognathae recovered from multiple genomic studies (see Introduction and Figure 1), the observed Iss was 0,4539, well below the Iss.c of 0,8155, suggesting minimal saturation of the MTG dataset. The observed Iss was significantly smaller even considering an for an extremely asymmetrical (and generally very unlikely) tree (Iss.c = 0.5709). To further validate this result, we plotted the uncorrected pairwise genetic distances (p-distances) against the model-corrected distances (JC69) for each mitochondrial partition using R (R Core Team, 2024) and the ape (Paradis and Schliep, 2019) and phangorn (Schliep, 2011) packages. The plots showed strong linearity with coefficients of determination (R2) greater than 0.967 which, taken together with the substitution saturation test, provide robust evidence for minimal saturation in our MTG dataset.

The MTG topology was produced using a constrained backbone (order-level), and terminal relationships, branch lengths, partition scheme and substitution models were estimated with IQTREE 2.1.4 (Minh et al., 2020). The topology and the best partition scheme with ten partitions were used for molecular dating. The tree topologies for the nuclear datasets were constrained as well following the consensus backbone (Figure 1). The TKZ dataset was partitioned by molecular type (CNEE, C12 and UCE), and in addition each datatype was analyzed separately. Lastly, the PRM dataset was analyzed with 75 partitions as in the original study (Table 1).

The MCMC chain length, sampling frequency, and burn-in were optimized for each dataset to meet the statistical requirements of the data size. For the MTG dataset, the chain length was set to 20,000 iterations, sampling every 100, and a burn-in 10,000. For the TKZ dataset, the chain length was set to 100,000 iterations, sampling every 5,000, and a burn-in of 5,000,000. For the PRM dataset, the chain length was set to 20,000 generations, sampling every 100, and a burn-in 10,000. For the separate nuclear C12 we ran 50,000 iterations, sampling every 500, and set the burn-in to 10,000; for CNEE we ran 80,000 iterations, sampled every 2,000, and set the burn-in to 800,000 and for the UCE data we ran 50,000 iterations, sampled every 1,000, and burn-in 20,000. For the PRM dataset we ran 2,000 iterations, sampling every 500, and set the burn-in to 1,000. Two independent MCMC runs were performed for each dataset, with convergence verified in both. The ESS parameters were verified in Tracer 1.7.2 (Rambaut et al., 2018) and all values reached >200.

To assess whether age estimates of crown Palaeognathae are more influenced by internal fossil constraints or phylogenomic data composition, we developed two calibration strategies. The first included all the fossil priors that were available for all datasets (Table 2). All fossil calibrations were applied to all datasets, except for the prior for crown Rheidae in the PRM dataset as this node was represented by only a single species. The probability distribution for the fossil age uncertainty followed a skew-normal curve calculated with the SN package (Azzalini, 2023) for R (R Core Team, 2024). To accommodate the uncertainty of the Neornithes root age, the maximum age of all internal fossil priors was extended to 70 Ma. In the second strategy, all Palaeognathae fossils were removed, leaving only the Asteriornis maastrichtensis prior at the root (Table 3). All the time trees generated in this study, namely, MTG, TKZ, C12, CNEE, UCE, and the PRM with all fossil priors and root prior only are available as Supplementary Material S2. Furthermore, the differences in divergence time estimates and taxon sampling between the MTG and TKZ topologies are displayed in pdf format as Supplementary Material S3.

Table 3
www.frontiersin.org

Table 3. Comparison of divergence times calculated with different calibration strategies for each dataset. Full Set strategy includes all fossils whereas Root Only excludes all ingroup (Palaeognathae) fossils. Specimen, age and phylogenetic placement of each prior as in Table 2. Time units are million years ago (Ma) and Confidence Interval is the 95% Highest Posterior Distribution (HPD) are rounded up to the next decimal place.

3 Results

We investigated the impact of internal calibration priors on divergence time estimates for the Palaeognathae radiation. Specifically, we assembled a diverse phylogenomic dataset to test whether the Eocene age (Prum et al., 2015) was caused by data type or a calibration strategy that lacked fossil priors outside Neognathae. Our resulting matrices (Table 1) comprised mitogenomes (MTG, 31 species, 14,307 bp, 10 partitions) and distinct nuclear genomic regions from different studies (TKZ, 14 species, 11,187,881 bp, 3 partitions; PRM, 9 species, 394,684 bp, 75 partitions). Each partition from the TKZ dataset was also analyzed separately, resulting in CNEE with 14 species, 4,504,498 bp, 1 partition; C12, 13 species, 4,797,876 bp, 1 partition; UCE, 14 species, 1,885,507 bp, 1 partition. Our posterior probability estimates with MCMCTree indicated that the different cellular compartments and genomic regions sampled in our datasets have markedly different substitution rates. Specifically, the mean substitution rate averaged for the ten partitions of MTG was the highest (µ = 0.01107/million years [Myr]), followed by the rate averaged for the 75 partitions of PRM (µ = 0.00091/Myr) and finally for the rate averaged for the three partitions of TKZ (µ = 0.00053/million years). The divergence time estimates for each dataset using our full calibration set are provided in Table 4, which also summarizes estimates from major previous studies focusing on either Palaeognathae or, more broadly, Neornithes. The confidence intervals from Tables 3 and 4 were used to generate Figures 2, 3, respectively.

Table 4
www.frontiersin.org

Table 4. Mean ages and confidence interval (95% HPD) of the nodes estimated in this study and in other recent phylogenomic studies.

Figure 2
Phylogenetic tree diagram illustrating bird orders with chronological bars representing various research studies. Orders are depicted with silhouettes, including Dinornithiformes and Neognathae. The timeline includes Cretaceous to Quaternary periods with studies labeled, positioned according to time, from Mitchell et al. 2014 to Selvatti and Takezaki 2025.

Figure 2. Differences in the confidence intervals of the ages estimated for each major Palaeognath clade and the Neornithine root using the full calibration set in this study and other major works (Table 4). The colors in the node circles match the colors on the horizontal bars, namely, red for Neornithes; green for Palaeognathae; blue for Notopalaeognathae; orange for non-rheid notopalaeognaths; pink for Casuariiformes + Apterygidae and Aepyornithidae (sampled only in the MTG dataset); coral for Casuariidae; and grey for Tinamiformes + Dinornithiformes. Each bar corresponds to one dataset or previous study.

Figure 3
Phylogenetic timeline of various bird orders, including silhouettes for Dinornithiformes, Tinamiformes, Casuariiformes, Apterygiformes, Aepyornithiformes, Rheiformes, and Struthioniformes. Colored bars represent different evolutionary timelines, labeled with

Figure 3. Results from varying the calibration schemes found in this study. The colors refer to the same nodes as in Figure 2. Light colors reflect the first calibration strategy (all fossils, as in Figure 2), whereas dark hues reflect the second calibration strategy (root only). Aepyornithidae was sampled only in the MTG dataset. The confidence intervals were extracted from Table 3.

The divergence times estimated separately for each partition of the TKZ dataset remained very similar to the estimates for the concatenated matrix, indicating the consistency of our results (Figures 2, 3; Table 1). However, the separate analyses of the CNEE dataset produced short internodes in the palaeognath backbone, leading to extremely similar age estimates for two nodes. The first was between Struthioniformes and the remaining palaeognaths [Notopalaeognathae], and the second for the non-Rheiformes Notopalaeognathae [Tinamiformes, Apterygiformes and Casuariiformes] (Figure 2). The CNEE estimates were also in conflict with the other datasets within palaeognathae. For example, the estimates for the non-Rheiformes Notopalaeognathae, the split between Apterygiformes and Casuariiformes and between Dinornithiformes and Tinamiformes were younger (39–30 Ma) compared with the estimates from the remaining datasets (61–45 Ma), with little to no overlap between the confidence intervals (Figure 2; Table 4). The single exception was the emu-cassowary node (Casuariiformes), which estimates with the CNEE dataset remained similar with greatly overlapping confidence intervals with the other datasets (Figure 2; Table 4).

The phylogenomic datasets were used to estimate the divergence times with five fossil priors (one outgroup and four ingroup; Table 2) under two distinct calibration strategies. The first included all fossil priors, while the second used only the root fossil prior, testing the effect of excluding ingroup fossil calibrations across different data types. The results are summarized in Figure 3 and Table 3 and detailed as follows. In the first calibration strategy, the mean divergence time estimates remained consistent, with less than 10 Myr difference across datasets, though estimates for MTG tend to be slightly larger than those for nuclear TKZ and PRM and the difference between estimates for Casuariiformes were larger (Figure 3; Table 3, light colors). The oldest mean estimates for the MRCA of crown Palaeognathae was obtained by the MTG dataset at 68.1 Ma (95% Highest Posterior Distribution [HPD] = 62–74.2 Ma), followed by the very similar ages of the TKZ at 62.4 (56.4–68.3 Ma) and PRM at 62.4 (56.7–67.2 Ma) datasets. Furthermore, the confidence intervals overlapped greatly with an average variation of 12 Myr between minimum and maximum for each estimate. In the second calibration strategy, however, the results differed markedly (Figure 3; Table 3, dark colors). Mean ages were older in the TKZ (12 Myr) and MTG (6 Myr) datasets compared to the first strategy. In contrast, the PRM dataset yielded a much younger mean age (−20 Myr), strikingly deviating from the estimates of the first strategy. Furthermore, the mean ages varied extensively among datasets. The oldest mean age was produced by the TKZ dataset (80 Ma), followed by the MTG (74 Ma) and PRM datasets (42 Ma). In the second calibration strategy the confidence intervals were large and varied considerably across datasets, averaging 18 Myr and with notable gaps between estimates denoting conflicting results (Figure 3). For example, although the intervals between the MTG and TKZ estimates showed substantial overlap indicating a higher degree of agreement, they did not overlap with the confidence interval of the significantly younger PRM estimate (Figure 3).

The divergence times of the internal palaeognath clades remained consistent across different datasets in our results, with the already mentioned younger estimates within Notopalaeognathae in the CNEE dataset alone and in the PRM dataset (Figure 2; Table 4). The mean age of the MRCA of the Notopalaeognathae was dated between 56 and 63 Ma, and the clade that contains the non-Rheidae Notopalaeognathae was dated around 55–61 Ma (39 Ma in CNEE). The estimates for the MRCA of Casuariiformes and Apterygiformes (54–56 Ma; 39 Ma in CNEE) were similar to the estimates for the Tinamiformes and Dinortnithiformes MRCA (47–57 Ma; 34 Ma in CNEE). However, the estimates using the PRM dataset were consistently younger for the Casuariiformes. Although this node was dated between 26 and 33 Ma with the MTG, TKZ, C12, CNEE, and UCE datasets, the estimate with the PRM dataset was 13 Ma, which is similar to the age estimated in the original study (9 Ma; Prum et al., 2015). However, in that study the confidence interval was substantial (1–22 Ma), whereas it was significantly smaller in our results (11–15 Ma; Figure 2). In the second calibration strategy, the results showed significant disparities, especially for the deep nodes in the nuclear dataset. Under this strategy, for instance, the estimates from TKZ produced virtually no overlap with the confidence intervals of the same dataset when all calibration were included (Figure 3; Table 3). That same disparity between the calibration strategies were present but less pronounced in the MTG estimates. Nevertheless, regardless of calibration strategy, the confidence intervals of estimates with TKG and MTG overlap consistently. However, a remarkable trend was observed regarding the PRM dataset. In the second calibration strategy, the estimates for all the nodes except Casuariiformes produced older age estimates in the MTG, TKZ, C12, CNEE, and UCE dataset, however, the PRM dataset always resulted in younger estimates compared with the first strategy (Figure 3; Table 3).

4 Discussion

Palaeognaths are one of the most peculiar avian lineages, with an enduring enigmatic evolutionary history and biogeography (Mayr, 2011; Widrig and Field, 2022). Recently, thorough scrutiny of phylogenomic data advanced our knowledge on the branching order in palaeognaths, reducing some of the conflicts regarding tree topology (Yonezawa et al., 2017; Takezaki, 2023). Nevertheless, crown Palaeognathae age estimates have varied significantly over the last decade (Figure 1; Table 1), from the Middle Cretaceous (97 Ma; Haddrath and Baker, 2012) to the Eocene (51 Ma; Prum et al., 2015). As more loci, lineages and fossils were analyzed, most phylogenomic data converged on a crown Palaeognathae age around the K–Pg boundary (66 Ma), but no Eocene age was recovered a second time by independent data. That disparity might result from different genomic sampling and fossil calibration strategies, including number, position, and age prior probability, which hinder direct comparison among those studies. Consequently, pinpointing the cause for such conflict remained inconclusive, leaving the age of the crown group Palaeognathae one of the most elusive aspects of the Palaeognathae complex evolution.

We analyzed a large diversity of phylogenomic data, including mitogenomes and nuclear coding and non-coding loci, and the original data that recovered the Eocene age for crown palaeognaths. Importantly, the taxon sampling varied considerably between datasets, with the MTG scoring the highest number of tips (31 species), and represents the largest mitogenomic dataset to date for which divergence times were estimated. This dataset nearly doubles the sample size of Grealy et al. (2017) and exceeds that of Grealy et al. (2023) by eight species. Divergence times were also estimated for nuclear genomic data from 14 species, thoroughly curated to minimize saturation (Takezaki, 2023). Thus, our sampling encompassed taxonomic diversity and millions of base pairs across distinct genomic regions and cellular compartments. In addition to controlling for taxonomic and genetic diversity, we tested whether conflicts in the Palaeognathae root age might result from incomplete placement of fossil calibrations, particularly within the ingroup. We applied two calibration strategies, the first using all available fossils (Table 2) and the second excluding ingroup calibrations.

Our results show remarkable consistency across different datasets despite varying taxon sampling and molecular data type, especially for the root age of crown Palaeognathae, which was the focus of this study. The estimates of most datasets also remained consistent for the estimates for the internal nodes except for the CNEE data analyzed separately. The estimates from this dataset produced very short internal branches in the Palaeognathae backbone, and the estimates for the non-Rheidae Notopalaeognathae nodes were significantly younger than our estimates and compared to other studies (Table 4). Those results might be explained by the notably slow evolution of CNEEs, which is below the neutral rate (Edwards et al., 2017). Previous research shows that even after removing loci with evidence for phylogenetic noise (e.g., recombination and long branches), the CNEE tree produced very short internodes in the palaeognath stem (Takezaki, 2023), which coincide with the collapsed branches in our CNEE time tree. These results suggest that this particular CNEE dataset is challenging to estimate the details of the divergence times for nodes between the root and family-level Palaeognathae clades. In contrast, however, the estimates for the Casuariiformes node remained consistent across datasets including the CNEE except for the PRM which produced a much younger estimate (Table 4). This discrepancy of the PRM may be due to the differences in taxon sampling or that it consists mostly of coding sequence including third codon positions. Interestingly, this node received an explicit fossil-based calibration, which might consist of an exceptionally informative prior for this particular node. Future studies focusing on the CNEE power to estimate the divergence times with confidence for Palaeognathae and other avian lienages will be of great interest to clarify our findings.

Our main result is that divergence time estimates in Palaeognathae are profoundly influenced by calibration strategy (Tables 1 and 3). Empirical studies have demonstrated that genomic regions evolve at different rates. For example, genes that protein coding, sex-related, or mediate defense response have very high substitution rates (Moutinho et al., 2020). Thus, as different genomic regions and cellular compartments evolve at distinct rates (Mendes and Hahn, 2016), we expected that the observed conflicting estimates were caused by studies sampling different genomic regions. Due to the heterogeneous nature of the sampled genomic regions and their differing evolutionary patterns, we anticipated variation in substitution rates, which was confirmed by our findings (see Results). However, despite substitution rate heterogeneity, all phylogenomic datasets converged to similar ages and confidence intervals when the full set of fossil priors were used (Table 2). In contrast, when no ingroup calibration was used, time estimates varied immensely, often with irreconcilable confidence intervals (Figure 3; Table 3). This is somewhat surprising as empirical and simulation studies showing that calibrations set near or at the root yield precise and accurate divergence time estimates (Duchêne et al., 2014; Mello and Schrago, 2014).

Our results indicate that root calibration alone may be insufficient for groups with complex evolutionary history such as Palaeognathae. Empirical data and simulations in silico show that multiple ingroup calibrations help reduce disparities caused by differences in genetic distance between nodes with and without calibrations (Rutschmann et al., 2007; Marshall, 2008; Reis et al., 2018). Our results showed that conflicting ages and confidence intervals emerged from the same dataset with different calibration strategies, while adding more internal calibrations improved the precision of the estimates. Furthermore, previous studies have shown that multiple calibrations improve age estimation accuracy (Linder et al., 2005; Villaverde et al., 2021), a trend we also observed, where the use of multiple fossils resulted in more consistent divergence time estimates.

Our findings further reinforce prior evidence that the use of a single fossil calibration at the root of a tree is insufficient for reliable divergence time estimation within the ingroup, in cases when a long, uncalibrated stem branch separates the root from the first ingroup split. In the case of palaeognaths, the deep divergence between Neornithes and the crown palaeognath ancestor spans tens of millions of years, making it difficult for a relaxed clock model to accommodate rate heterogeneity across such a wide temporal range. In this context, a single calibration might fail to adequately constrain the rate estimates within the ingroup, leading to inflated or biased node age estimates. In a broad analysis of Bovidae, multiple fossil calibrations distributed across the tree consistently improve temporal accuracy and reduce uncertainty (Bibi, 2013). Furthermore, recent simulation-based assessments demonstrate that when evolutionary rates vary substantially across lineages, commonly used clock models including uncorrelated and autocorrelated relaxed clocks may perform poorly, particularly in the absence of internal calibrations (Hagemann et al., 2023). Although local clock models may be better suited to handle such heterogeneity, they are not implemented in MCMCTree and remain difficult to apply to genomic-scale datasets due to model complexity and computational limitations. Nonetheless, our results highlight the practical importance of using multiple, well-placed fossil calibrations to overcome limitations associated with model rigidity and clock non-uniformity.

Previous studies that used multiple calibration priors overlap partially with our set of fossils. Specimens of the genus Emuarius and the specimen Opisthodactylus horacioperezi have been used as minimum dates for the Casuariiformes and Rheidae clades, respectively (Table 1). As those fossils have clear apomorphies shared with their respective clades and their phylogenetic position have been corroborated by formal cladistic analyses, they indeed satisfy the recommended best practices for molecular dating prior design based fossil specimens (Parham et al., 2012). However, as the following three fossil specimens satisfy only some of the best practices, they were not to used in the present study. The Brazilian fossil Diogenornis fragilis was originally assigned to a rheiform assembly from the Miocene (Alvarenga, 1983; Mayr, 2016a). However, those similarities might reflect plesiomorphies and its phylogenetic position remains poorly resolved, and Diogenornis is considered a stem rheiform at best (Mayr, 2016a; Widrig and Field, 2022). Because stem fossils share only some characteristics with the extant (crown) clade, they provide little information for the split that originated the crown clade and thus should be avoided as calibration priors. This justification is extended to the kiwi-like fossil Proapteryx (Worthy et al., 2013). As it shares only some of the derived characters that define the extant Apterygiformes, it is considered part of the kiwi stem Widrig and Field, 2022) and thus unsuitable to calibrate the crown Apterygiformes. Finally, the remaining differences in our prior choice regards the Tinamiformes fossil record. In the context of molecular dating, fossil ages provide a minimum date for the clade which it belongs with confidence (Ho and Phillips, 2009). Therefore, priority should be given for the specimens with the oldest geological age and with unambiguous phylogenetic placement (Ho and Phillips, 2009; Parham et al., 2012). Although the specimen MLP 87-XI-20-3 from the La Pampa Province (Argentina) is undoubtedly part of the Tinamiformes, it does not represent the oldest record in that clade (Bertelli et al., 2014). That status belongs to the MACN-SC-1399 and AMNH FAM 9151 specimens from the early Miocene Argentinian Formations of Pinturas and Santa Cruz, respectively. Nevertheless, those fragmentary specimens have a highly unstable position in cladistic studies (Bertelli, 2017; Almeida et al., 2021). The specimen MACN-SC-3610 is from the same age and locality of the specimen MACN-SC-1399, but with much higher confidence in its phylogenetic placement (Bertelli et al., 2014), thus justifying our choice as our fossil prior within Tinamiformes.

Estimating the divergence time of the crown Palaeognathae ancestor with both consistency and precision is crucial for resolving conflicting estimates and ensuring that hypotheses for early Palaeognathae evolution are developed reliably. Three competing diversification hypotheses for the group are currently supported. The first assumes that the crown Palaeognathae originated as an ancient lineage that existed between 83 and 72 Ma in the Upper Cretaceous (Haddrath and Baker, 2012; Jarvis et al., 2014; Yonezawa et al., 2017). Those estimates predate all known definitive neornithine fossils (Mayr, 2016a), and that ancestral stock with several large-bodied descendants would have left no trace in the fossil record, which is rich for many other non-noernithine avian clades (Mayr, 2016a). In the studies that support the first hypothesis, the antiquity of the palaeognath root also reverberates in the internal nodes (Table 4), implying a burst of palaeognathae diversification prior to the K–Pg. In this context, the Upper Cretaceous hypothesis favors the classic view in which cladogenesis within palaeognathae is linked with the fragmentation history of Gondwana, where flightlessness evolved once in the ratite (Palaeognathae except Tinamiformes) ancestor before the fragmentation of the supercontinent, and modern ratite lineages are the product of the subsequent continental breakup (Cracraft, 1973). At that time, the fragments of the supercontinent remained in close proximity (McLoughlin, 2001; Chatterjee et al., 2012; Ezcurra and Agnolin, 2012), potentially facilitating terrestrial biotic interchanges that would have included flightless ratites. However, this classic hypothesis also hinges on ratite monophyly, which is rejected by molecular and embriological data (Widrig and Field, 2022). Furthermore, our results suggest that this hypothesis may be influenced by limitations in the number and position of the fossil calibration priors. For instance, in our study, all ages that are older than our oldest estimate (74 Ma maximum 95% HPD of the MTG dataset) used one or zero fossil priors for the ingroup palaeognath clade (Tables 1 and 4).

In the second hypothesis, the crown group palaeognaths would be considerably younger and long after Gondwana break up (Mitchell et al., 2014; Claramunt and Cracraft, 2015; Grealy et al., 2017; 2023; Stiller et al., 2024; Wu et al., 2024). In this scenario, the crown Palaeognathae ancestor split into the Struthioniformes and Notopalaeognathae around the K–Pg boundary. After the divergence of Struthioniformes ancestor from the Palaeognathae root, the splits that originated the Notopalaeognathae orders occurred within the Paleocene and the Early Eocene. During that time, many continents in the Southern Hemisphere were still partially connected, such as Australia, Antarctica, South America and Zealandia, allowing the ancestral stocks of the clades that are endemic to those specific landmasses to disperse and achieve their present-day distributions (Claramunt and Cracraft, 2015; Yonezawa et al., 2017; Takezaki, 2023).

The third hypothesis suggests an Eocene age for crown Palaeognathae (Prum et al., 2015). In that timeframe, the extant clades would have split near the transition to the Oligocene or even later (Tables 3 and 4), a period by which the Southern landmasses were already far apart. The Eocene age hypothesis demands extremely long-distance dispersals by volant ancestors within each palaeognath lineage, which is currently unsupported by the fossil record and the biology of the extant species. For instance, although tinamous are volant, they are essentially ground-dwelling birds that only fly when needed, and their flight is a short-distance burst unlikely to sustain overwater dispersal (Widrig and Field, 2022). The only candidate palaeognaths with potent flight capabilities are the Lithornithidae, a Paleocene group with some derived traits shared with extant palaeognaths (Widrig and Field, 2022). In the Eocene age hypothesis, all Paleocene Lithornithidae clearly fall outside the crown Palaeognathae, thus ruling out the possibility of placing certain specimens within internal branches such as Tinamidae, as suggested by some morphological characters and character weighting (Worthy et al., 2017).

Our results support the second hypothesis and consistently reject the other two (Figures 2, 3; Tables 3 and 4). We provide empirical evidence that robust divergence time estimates for Palaeognathae are achieved by combining broad taxonomic and genomic diversity with multiple fossil calibrations, including both the root and ingroup. This strategy yielded consistent and precise results, which improved reliability across different data types and calibration strategies. Furthermore, the variation present in the different molecular data types reflected a broad range of evolutionary rates (see Results). The PRM dataset consists mostly of (80%) coding regions (Reddy et al., 2017), which have shown in previous studies to be notoriously challenging to extract phylogenetic signal, producing unstable, weakly supported and incongruent results (Chen et al., 2017; Reddy et al., 2017). Therefore, although the PRM dataset provides a comprehensive sample of neornithine clades, our findings suggest that the genomic regions sampled might not offer sufficient or reliable signal for estimating the split times in Palaeognathae without calibration priors on the internal nodes. Even with our full calibration set, while the PRM dataset estimates aligned more closely with those of other datasets, younger ages continued to be observed for shallow nodes like Casuariiformes (Figure 2). Specifically, while other datasets estimated the age of Casuariiformes to be in the Oligocene (22–43 Ma), the PRM-based estimates remained within the Miocene (15–11 Ma), albeit with a narrower confidence interval compared to the results of the calibration strategy used in the original study (Figure 2; Table 4). As the same fossil calibration strategy was applied to different datasets converged on consistent estimates, we conclude that the age of crown Palaeognathae is likely more influenced by the number and position of fossil constraints than by molecular data type, or a combination of both. Previous studies show that variations in age estimates in other clades can significantly influence biogeographic and ancestral trait interpretations in major diversification events in passerines (Selvatti et al., 2015) and in turtles (Selvatti et al., 2023). Our results also align with recent genome-wide time estimates for avian evolution and neornithine diversification (Stiller et al., 2024), attesting the robustness of our time trees.

Our results indicate that molecular data type alone cannot explain the reported Eocene age for crown palaeognaths (Prum et al., 2015). Our re-analyses with the same dataset using the original fossil calibration strategy (without ingroup fossil calibrations) produced identical results. This contrasts sharply with estimates from the other datasets, which were older in the absence of ingroup fossil constraints. However, when multiple internal fossil calibrations were applied to the original PRM dataset, the results closely aligned with those from the other genomic datasets sampled here and in previous studies (Table 1). The study by Prum et al. (2015) is a landmark in avian phylogenomics, being the first to include a broad taxon sampling, more than three times that of the pioneering work by Jarvis et al. (2014). However, the exclusion of fossil priors for Neognathae nodes may have influenced the divergence time estimates for the Palaeognathae root. Consequently, we propose that the Eocene age for Palaeognathae could reflect the combined effect of the absence of fossil calibrations at and near the neornithine root, as well as within the Palaeognath ingroup.

Our time-calibrated phylogeny allows for the possibility that some of the oldest Palaeognathae fossils may indeed belong within the crown Palaeognathae clade, whereas this possibility is excluded under the Eocene divergence hypothesis. For instance, in the K–Pg age hypothesis supported here, at least some Lithornithidae might be assigned to internal branches, providing explicit support even for hypotheses based on long-distance dispersals. A palaeognath radiation around the K–Pg requires at least some degree of overseas dispersals, thus compatible with a volant ancestor that might have included the Lithornithidae. The same principle applies to another key Paleocene fossil, D. fragilis. This ancient, flightless South American bird is one of the oldest named Palaeognathae (Widrig and Field, 2022). Despite ongoing debate regarding its affinity with either Casuariiformes or Rheiformes, assuming an Eocene origin for crown Palaeognathae precludes Diogenornis from being considered part of the crown group, necessitating its placement as a Palaeognath stem—an assignment for which there is no morphological support. In contrast, if the crown Palaeognathae originated around the K–Pg boundary, as suggested here, the affinities of Diogenornis clearly align with the crown Palaeognathae, thereby reducing the uncertainty surrounding its phylogenetic placement.

Although our results emphasize the critical role of internal fossil constraints in stabilizing divergence time estimates, we acknowledge that deep branches, such as the root of Palaeognathae and Neornithes, may remain sensitive to unmodeled rate variation or substitutional saturation. While saturation tests and data filtering indicated no saturation, and reanalysis of the Prum et al. (2015) dataset showed that replacing a broad neognath outgroup with Gallus alone had no effect on age estimates within Palaeognathae, we acknowledge that outgroup configuration could still influence branch length modeling under different data and age constraints. Future work should systematically assess the impact of alternative or expanded outgroup sampling on rate estimation and divergence dating while still verifying the effects of ingroup fossil constraints, particularly in datasets spanning deep evolutionary timescales.

5 Conclusion

Time is a fundamental aspect of evolution, as diversification hypotheses depend on knowing when lineages originated within specific geological contexts. In summary, our results demonstrate that multiple internal fossil constraints have a greater impact on crown Palaeognathae age estimates than different phylogenomic data types. The age of the crown Palaeognathae was consistently estimated to fall between the Latest Cretaceous and Earliest Paleogene (68–62 Ma), supporting the hypothesis that the group originated around the K–Pg boundary rather than in the Eocene. This reduces uncertainties surrounding the origin of the extant lineages and aligns with the phylogenetic placement of Paleogene fossils, such as Diogenornis (which was not included in our calibration), within the crown Palaeognathae. Our time tree also provides a clearer understanding of the relationships between crown Palaeognaths and the extinct Lithornithidae. Although some Lithornithidae species have been considered stem members of Palaeognathae (Mayr, 2022; Widrig and Field, 2022), their phylogenetic placement depends on the estimated age of the crown group. Assuming an Eocene origin for the crown Palaeognathae (Prum et al., 2015), these fossils would be placed outside the crown. If the crown Palaeognathae split is dated to the K–Pg boundary, some of these fossils could be positioned more internally within the Palaeognathae tree as stem members of major clades (Widrig and Field, 2022), shedding light on the intricate evolutionary history of this remarkable avian lineage, and highlighting the profound role of geological time in shaping biodiversity.

Data availability statement

Alignments generated in this study (MTG and TKZ) are available as Supplementary Material S1 deposited in Figshare (https://figshare.com/projects/Data_from_Internal_fossil_constraints_have_more_effect_on_the_age_estimates_of_crown_Palaeognathae_than_different_phylogenomic_data_type_/245348). The time trees (MTG, TKZ, C12, CNEE, UCE, and PRM with all fossil priors and root prior only) are available (newick format) as Supplementary Material S2. The differences in divergence time estimates and taxon sampling between the MTG and TKZ topologies are provided in pdf format in Supplementary Material S3. The original nuclear datasets analyzed in this study from Takezaki (2023) can be found at https://datadryad.org/stash/dataset/doi:10.5061/dryad.1jwstqjzs, and the original Prum et al. (2015) can be found at https://zenodo.org/records/28343.

Author contributions

AS: Conceptualization, Formal Analysis, Investigation, Methodology, Resources, Validation, Visualization, Writing – original draft, Writing – review and editing. NT: Data curation, Investigation, Methodology, Validation, Visualization, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was funded by Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro (FAPERJ) Edital E_21/2024, Processo 306643 granted to AS.

Acknowledgments

We would like to thank Claudia A. M. Russo for her invaluable early ideas and for fostering the professional collaboration between the authors of this study. We thank the three reviewers for their valuable comments and insightful suggestions, which have significantly contributed to the improvement of this manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that Generative AI was used in the creation of this manuscript. Generative AI tools were used to assist English language review and editing for grammar, clarity, and overall writing quality of the final manuscript. It was not used to generate or interpret any scientific content in this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbinf.2025.1563786/full#supplementary-material

References

Agnolin, F. L., and Chafrat, P. (2015). New fossil bird remains from the chichinales formation (early miocene) of Northern Patagonia, Argentina. Ann. Paléontologie 101, 87–94. doi:10.1016/j.annpal.2015.02.001

CrossRef Full Text | Google Scholar

Almeida, F. C., Porzecanski, A. L., Cracraft, J. L., and Bertelli, S. (2021). The evolution of tinamous (Palaeognathae: Tinamidae) in light of molecular and combined analyses. Zoöl. J. Linn. Soc. 195, 106–124. doi:10.1093/zoolinnean/zlab080

CrossRef Full Text | Google Scholar

Alvarenga, H. M. F. (1983). Uma ave ratitae do paleoceno brasileiro: bacia calcária de itaboraí, estado do Rio de Janeiro, Brasil. Bol mus nac. Geol 41, 1–8.

Google Scholar

Azzalini, A. (2023). The R package “sn”: the Skew-normal and related distributions such as the Skew-t and the SUN (version 2.1.1). Available online at: https://cran.r-project.org/package=sn (Accessed April 6, 2024).

Google Scholar

Bertelli, S. (2017). Advances on tinamou phylogeny: an assembled cladistic study of the volant palaeognathous birds. Cladistics 33, 351–374. doi:10.1111/cla.12172

PubMed Abstract | CrossRef Full Text | Google Scholar

Bertelli, S., and Chiappe, L. (2005). Earliest tinamous (Aves: palaeognathae) from the Miocene of Argentina and their phylogenetic position. Contributions Sci. Nat. Hist. Mus. 502, 1–20. doi:10.5962/p.226787

CrossRef Full Text | Google Scholar

Bertelli, S., Chiappe, L. M., and Mayr, G. (2014). Phylogenetic interrelationships of living and extinct tinamidae, volant palaeognathous birds from the New World. Zoological J. Linn. Soc. 172 (1), 145–184. doi:10.1111/zoj.12156

CrossRef Full Text | Google Scholar

Bibi, F. (2013). A multi-calibrated mitochondrial phylogeny of extant bovidae (artiodactyla, ruminantia) and the importance of the fossil record to systematics. BMC Evol. Biol. 13, 166. doi:10.1186/1471-2148-13-166

PubMed Abstract | CrossRef Full Text | Google Scholar

Boles, W. (1992). Revision of dromaius gidju patterson and rich, 1987 from riversleigh, northwestern queensland, Australia, with a reassessment of its generic position. Los Angel. Cty. Mus. Sci. Ser. 36, 195–208.

Google Scholar

Boles, W. E. (2001). A new emu (dromaiinae) from the late oligocene etadunna formation. Emu 101, 317–321. doi:10.1071/mu00052

CrossRef Full Text | Google Scholar

Carruthers, T., Sanderson, M. J., and Scotland, R. W. (2020). The implications of lineage-specific rates for divergence time estimation. Syst. Biol. 69, 660–670. doi:10.1093/sysbio/syz080

PubMed Abstract | CrossRef Full Text | Google Scholar

Castresana, J. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis, Mol. Biol. Evol. 17, 540–552. doi:10.1093/oxfordjournals.molbev.a026334

PubMed Abstract | CrossRef Full Text | Google Scholar

Chatterjee, S., Goswami, A., and Scotese, C. R. (2012). The longest voyage: tectonic, magmatic, and paleoclimatic evolution of the Indian plate during its northward flight from Gondwana to Asia. Gondwana Res. 23, 238–267. doi:10.1016/j.gr.2012.07.001

CrossRef Full Text | Google Scholar

Chen, M.-Y., Liang, D., and Zhang, P. (2017). Phylogenomic resolution of the phylogeny of laurasiatherian mammals: exploring phylogenetic signals within coding and noncoding sequences. Genome Biol. Evol. 9, 1998–2012. doi:10.1093/gbe/evx147

PubMed Abstract | CrossRef Full Text | Google Scholar

Cibois, A., Vallotton, L., Ericson, P. G. P., Blom, M. P. K., and Irestedt, M. (2020). Genetic and radiographic insights into the only known mounted specimen of Kangaroo Island Emu. Rev. Suisse Zool. 126, 209–217. doi:10.5281/zenodo.3463451

CrossRef Full Text | Google Scholar

Claramunt, S., and Cracraft, J. (2015). A new time tree reveals Earth historys imprint on the evolution of modern birds. Sci. Adv. 1, e1501005–e1501013. doi:10.1126/sciadv.1501005

PubMed Abstract | CrossRef Full Text | Google Scholar

Cloutier, A., Sackton, T. B., Grayson, P., Clamp, M., Baker, A. J., and Edwards, S. V. (2019). Whole-genome analyses resolve the phylogeny of flightless birds (palaeognathae) in the presence of an empirical anomaly zone. Syst. Biol. 68, 937–955. doi:10.1093/sysbio/syz019

PubMed Abstract | CrossRef Full Text | Google Scholar

Cooper, A., Lalueza-Fox, C., Anderson, S., Rambaut, A., Austin, J., and Ward, R. (2001). Complete mitochondrial genome sequences of two extinct moas clarify ratite evolution. Nature 409, 704–707. doi:10.1038/35055536

PubMed Abstract | CrossRef Full Text | Google Scholar

Cracraft, J. (1973). Continental drift, paleoclimatology, and the evolution and biogeography of birds. J. Zool. 169, 455–543. doi:10.1111/j.1469-7998.1973.tb03122.x

CrossRef Full Text | Google Scholar

Crane, A., Benito, J., Chen, A., Musser, G., Torres, C. R., Clarke, J. A., et al. (2024). Taphonomic damage obfuscates interpretation of the retroarticular region of the Asteriornis mandible. Geobios 90, 31–43. doi:10.1016/j.geobios.2024.03.003

CrossRef Full Text | Google Scholar

Crouch, N. M. A., and Clarke, J. A. (2019). Body size evolution in palaeognath birds is consistent with Neogene cooling-linked gigantism. Palaeogeogr. Palaeoclimatol. Palaeoecol. 532, 109224. doi:10.1016/j.palaeo.2019.05.046

CrossRef Full Text | Google Scholar

dos Reis, M., Thawornwattana, Y., Angelis, K., Telford, M. J., Donoghue, P. C. J., and Yang, Z. (2015). Uncertainty in the timing of origin of animals and the limits of precision in molecular timescales. Curr. Biol. 25, 2939–2950. doi:10.1016/j.cub.2015.09.066

PubMed Abstract | CrossRef Full Text | Google Scholar

Duchêne, S., Lanfear, R., and Ho, S. Y. W. (2014). The impact of calibration and clock-model choice on molecular estimates of divergence times. Mol. Phylogenet. Evol. 78, 277–289. doi:10.1016/j.ympev.2014.05.032

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, S. V., Cloutier, A., and Baker, A. J. (2017). Conserved nonexonic elements: a novel class of marker for phylogenomics. Syst. Biol. 66, 1028–1044. doi:10.1093/sysbio/syx058

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, S. V., Cloutier, A., Cockburn, G., Driver, R., Grayson, P., Katoh, K., et al. (2024). A nuclear genome assembly of an extinct flightless bird, the little bush moa. Sci. Adv. 10, eadj6823. doi:10.1126/sciadv.adj6823

PubMed Abstract | CrossRef Full Text | Google Scholar

Ezcurra, M. D., and Agnolin, F. L. (2012). A new global palaeobiogeographical model for the late mesozoic and early tertiary. Syst. Biol. 61, 553–566. doi:10.1093/sysbio/syr115

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, S., Stiller, J., Deng, Y., Armstrong, J., Fang, Q., Reeve, A. H., et al. (2020). Dense sampling of bird diversity increases power of comparative genomics. Nature 587, 252–257. doi:10.1038/s41586-020-2873-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Field, D. J., Benito, J., Chen, A., Jagt, J. W. M., and Ksepka, D. T. (2020). Late cretaceous neornithine from Europe illuminates the origins of crown birds. Nature 579, 397–401. doi:10.1038/s41586-020-2096-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Gill, F., Donsker, D., and Rasmussen, P. (2024). IOC world bird list (v 14.2). doi:10.14344/ioc.ml.14.1

CrossRef Full Text | Google Scholar

Grealy, A., Miller, G. H., Phillips, M. J., Clarke, S. J., Fogel, M., Patalwala, D., et al. (2023). Molecular exploration of fossil eggshell uncovers hidden lineage of giant extinct bird. Nat. Commun. 14, 914. doi:10.1038/s41467-023-36405-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Grealy, A., Phillips, M., Miller, G., Gilbert, M. T. P., Rouillard, J.-M., Lambert, D., et al. (2017). Eggshell palaeogenomics: palaeognath evolutionary history revealed through ancient nuclear and mitochondrial DNA from Madagascan elephant bird (Aepyornis sp.) eggshell. Mol. Phylogenetics Evol. 109, 151–163. doi:10.1016/j.ympev.2017.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Haddrath, O., and Baker, A. J. (2001). Complete mitochondrial DNA geonome sequences of extinct birds: ratite phylogenetics and the vicariance biogeography hypothesis. Proc. R. Soc. B-Biological Sci. 268, 939–945. doi:10.1098/rspb.2001.1587

PubMed Abstract | CrossRef Full Text | Google Scholar

Haddrath, O., and Baker, A. J. (2012). Multiple nuclear genes and retroposons support vicariance and dispersal of the palaeognaths, and an early Cretaceous origin of modern birds. Proc. R. Soc. B-Biological Sci. 279, 4617–4625. doi:10.1098/rspb.2012.1630

PubMed Abstract | CrossRef Full Text | Google Scholar

Hagemann, A., Sánchez-Villagra, M. R., and Hautier, L. (2023). Is it inappropriate to ask for your age? Evaluating parameter impact on tree dating in a challenging clade (Macroscelidea). Syst. Biol. 72 (1), 161–176. doi:10.1093/sysbio/syac058

CrossRef Full Text | Google Scholar

Härlid, A., Janke, A., and Arnason, U. (1998). The complete mitochondrial genome of rhea Americana and early avian divergences. J. Mol. Evol. 46, 669–679. doi:10.1007/pl00006347

PubMed Abstract | CrossRef Full Text | Google Scholar

Harshman, J., Braun, E. L., Braun, M. J., Huddleston, C. J., Bowie, R. C. K., Chojnowski, J. L., et al. (2008). Phylogenomic evidence for multiple losses of flight in ratite birds. PNAS 105, 13462–13467. doi:10.1073/pnas.0803242105

PubMed Abstract | CrossRef Full Text | Google Scholar

Ho, S. Y. W., and Phillips, M. J. (2009). Accounting for calibration uncertainty in phylogenetic estimation of evolutionary divergence times. Syst. Biol. 58, 367–380. doi:10.1093/sysbio/syp035

PubMed Abstract | CrossRef Full Text | Google Scholar

Jarvis, E. D., Mirarab, S., Aberer, A. J., Li, B., Houde, P., Li, C., et al. (2014). Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331. doi:10.1126/science.1253451

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuhl, H., Frankl-Vilches, C., Bakker, A., Mayr, G., Nikolaus, G., Boerno, S. T., et al. (2020). An unbiased molecular approach using 3’UTRs resolves the avian family-level tree of life. Mol. Biol. Evol. 38, 108–127. doi:10.1093/molbev/msaa191

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S. (2005). Molecular clocks: four decades of evolution. Nat. Rev. Genet. 6, 654–662. doi:10.1038/nrg1659

PubMed Abstract | CrossRef Full Text | Google Scholar

Lan, Y., and Xu, H. (2020). Characterization of the complete mitochondrial genome and phylogenetic analysis of Apteryx rowi (Apterygidae: Apteryx). Mitochondrial DNA Part B 5, 1213–1214. doi:10.1080/23802359.2020.1730723

CrossRef Full Text | Google Scholar

Lemmon, A. R., Emme, S. A., and Lemmon, E. M.(2012). Anchored Hybrid Enrichment for Massively High-Throughput Phylogenomics. Syst. Biol. 61, 727–744. doi:10.1093/sysbio/sys049

PubMed Abstract | CrossRef Full Text | Google Scholar

Linder, H. P., Hardy, C. R., and Rutschmann, F. (2005). Taxon sampling effects in molecular clock dating: an example from the African Restionaceae. Mol. Phylogenetics Evol. 35, 569–582. doi:10.1016/j.ympev.2004.12.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Livezey, B. C., and Zusi, R. L. (2007). Higher-order phylogeny of modern birds (Theropoda, Aves: Neornithes) based on comparative anatomy. II. Analysis and discussion. Zoological J. Linn. Soc. 149, 1–95. doi:10.1111/j.1096-3642.2006.00293.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Marshall, C. R. (2008). A simple method for bracketing absolute divergence times on molecular phylogenies using multiple fossil calibration points. Am. Nat. 171, 726–742. doi:10.1086/587523

PubMed Abstract | CrossRef Full Text | Google Scholar

Mayr, G. (2011). Metaves, Mirandornithes, Strisores and other novelties - a critical review of the higher-level phylogeny of neornithine birds. J. Zoological Syst. Evol. Res. 49, 58–76. doi:10.1111/j.1439-0469.2010.00586.x

CrossRef Full Text | Google Scholar

Mayr, G. (2016a). Avian evolution: the fossil record of birds and its paleobiological significance. Chichester, UK: Wiley.

CrossRef Full Text | Google Scholar

Mayr, G. (2016b). On the taxonomy and osteology of the early Eocene North American Geranoididae (Aves, gruoidea). Swiss J. Palaeontol. 135, 315–325. doi:10.1007/s13358-016-0117-2

CrossRef Full Text | Google Scholar

Mayr, G. (2019). Hindlimb morphology of palaeotis suggest palaeognathous affinities of the geranoididae and other “crane-like” birds from the Eocene of the Northern Hemisphere. Acta Palaeontol. Pol. 64. doi:10.4202/app.00650.2019

CrossRef Full Text | Google Scholar

Mayr, G. (2022). Paleogene fossil birds. Cham, Switzerland: Springer Nature.

Google Scholar

McLoughlin, S. (2001). The breakup history of Gondwana and its impact on pre-cenozoic floristic provincialism. Aust. J. Bot. 49, 271–300. doi:10.1071/bt00023

CrossRef Full Text | Google Scholar

Mello, B., and Schrago, C. (2014). Assignment of calibration information to deeper phylogenetic nodes is more effective in obtaining precise and accurate divergence time estimates. Evol. Bioinforma. 79, 79–85. doi:10.4137/ebo.s13908

PubMed Abstract | CrossRef Full Text | Google Scholar

Mendes, F. K., and Hahn, M. W. (2016). Gene tree discordance causes apparent substitution rate variation. Syst. Biol. 65, 711–721. doi:10.1093/sysbio/syw018

PubMed Abstract | CrossRef Full Text | Google Scholar

Miao, Y.-W., Peng, M.-S., Wu, G.-S., Ouyang, Y.-N., Yang, Z.-Y., Yu, N., et al. (2013). Chicken domestication: an updated perspective based on mitochondrial genomes. Heredity 110, 277–282. doi:10.1038/hdy.2012.83

PubMed Abstract | CrossRef Full Text | Google Scholar

Minh, B. Q., Schmidt, H. A., Chernomor, O., Schrempf, D., Woodhams, M. D., Haeseler, A. von, et al. (2020). IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534. doi:10.1093/molbev/msaa015

PubMed Abstract | CrossRef Full Text | Google Scholar

Mitchell, K. J., Llamas, B., Soubrier, J., Rawlence, N. J., Worthy, T. H., Wood, J., et al. (2014). Ancient DNA reveals elephant birds and kiwi are sister taxa and clarifies ratite bird evolution. Science 344, 898–900. doi:10.1126/science.1251981

PubMed Abstract | CrossRef Full Text | Google Scholar

Moutinho, A. F., Bataillon, T., and Dutheil, J. Y. (2020). Variation of the adaptive substitution rate between species and within genomes. Evol. Ecol. 34, 315–338. doi:10.1007/s10682-019-10026-z

CrossRef Full Text | Google Scholar

Noriega, J. I., Jordan, E. A., Vezzosi, R. I., and Areta, J. I. (2017). A new species of Opisthodactylus Ameghino, 1891 (Aves, Rheidae), from the late Miocene of northwestern Argentina, with implications for the paleobiogeography and phylogeny of rheas. J. Vertebr. Paléontol. 37, e1278005. doi:10.1080/02724634.2017.1278005

CrossRef Full Text | Google Scholar

Paradis, E., and Schliep, K. (2019). Ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35 (3), 526–528. doi:10.1093/bioinformatics/bty633

PubMed Abstract | CrossRef Full Text | Google Scholar

Parham, J. F., Donoghue, P. C. J., Bell, C. J., Calway, T. D., Head, J. J., Holroyd, P. A., et al. (2012). Best practices for justifying fossil calibrations. Syst. Biol. 61, 346–359. doi:10.1093/sysbio/syr107

PubMed Abstract | CrossRef Full Text | Google Scholar

Phillips, M. J., Gibb, G. C., Crimp, E. A., and Penny, D. (2010). Tinamous and moa flock together: mitochondrial genome sequence analysis reveals independent losses of flight among ratites. Syst. Biol. 59, 90–107. doi:10.1093/sysbio/syp079

PubMed Abstract | CrossRef Full Text | Google Scholar

Picasso, M. B. J., Hospitaleche, C. A., and Mosto, M. C. (2022). An overview and update of South American and Antarctic fossil rheidae and putative ratitae (Aves, Palaeognathae). J. S. Am. Earth Sci. 115, 103731. doi:10.1016/j.jsames.2022.103731

CrossRef Full Text | Google Scholar

Prum, R. O., Berv, J. S., Dornburg, A., Field, D. J., Townsend, J. P., Lemmon, E. M., et al. (2015). A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing. Nature 526, 569–573. doi:10.1038/nature15697

PubMed Abstract | CrossRef Full Text | Google Scholar

Rambaut, A., Drummond, A. J., Xie, D., Baele, G., and Suchard, M. A. (2018). Posterior summarization in bayesian phylogenetics using tracer 1.7. Syst. Biol. 67, 901–904. doi:10.1093/sysbio/syy032

PubMed Abstract | CrossRef Full Text | Google Scholar

R Core Team (2024). R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Available online at: http://books.google.com.br/books?id=K5sBXe1qPt8C (Accessed April 6, 2024).

CrossRef Full Text | Google Scholar

Reddy, S., Kimball, R. T., Pandey, A., Hosner, P. A., Braun, M. J., Hackett, S. J., et al. (2017). Why do phylogenomic data sets yield conflicting trees? Data type influences the avian tree of life more than taxon sampling. Syst. Biol. 66, 857–879. doi:10.1093/sysbio/syx041

PubMed Abstract | CrossRef Full Text | Google Scholar

Reis, M. dos, Donoghue, P. C. J., and Yang, Z. (2016). Bayesian molecular clock dating of species divergences in the genomics era. Nat. Rev. Genet. 17, 71–80. doi:10.1038/nrg.2015.8

PubMed Abstract | CrossRef Full Text | Google Scholar

Reis, M. dos, Gunnell, G. F., Barba-Montoya, J., Wilkins, A., Yang, Z., and Yoder, A. D. (2018). Using phylogenomic data to explore the effects of relaxed clocks and calibration strategies on divergence time estimation: primates as a test case. Syst. Biol. 67, 594–615. doi:10.1093/sysbio/syy001

PubMed Abstract | CrossRef Full Text | Google Scholar

Reis, dos M., and Yang, Z. (2011). Approximate likelihood calculation on a phylogeny for bayesian estimation of divergence times. Mol. Biol. Evol. 28, 2161–2172. doi:10.1093/molbev/msr045

PubMed Abstract | CrossRef Full Text | Google Scholar

Rutschmann, F., Eriksson, T., Salim, K. A., and Conti, E. (2007). Assessing calibration uncertainty in molecular dating: the assignment of fossils to alternative calibration points. Syst. Biol. 56, 591–608. doi:10.1080/10635150701491156

PubMed Abstract | CrossRef Full Text | Google Scholar

Sackton, T. B., Grayson, P., Cloutier, A., Hu, Z., Liu, J. S., Wheeler, N. E., et al. (2019). Convergent regulatory evolution and loss of flight in paleognathous birds. Science 364, 74–78. doi:10.1126/science.aat7244

PubMed Abstract | CrossRef Full Text | Google Scholar

Schliep, K. P. (2011). Phangorn: phylogenetic analysis in R. Bioinformatics 27 (4), 592–593. doi:10.1093/bioinformatics/btq706

PubMed Abstract | CrossRef Full Text | Google Scholar

Selvatti, A. P., Gonzaga, L. P., and Russo, C. A. de M. (2015). A Paleogene origin for crown passerines and the diversification of the Oscines in the New World. Mol. Phylogenetics Evol. 88, 1–15. doi:10.1016/j.ympev.2015.03.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Selvatti, A. P., Moreira, F. R. R., Carvalho, D. C., Prosdocimi, F., Russo, C. A. de M., and Junqueira, A. C. M. (2023). Phylogenomics reconciles molecular data with the rich fossil record on the origin of living turtles. Mol. Phylogenetics Evol. 183, 107773. doi:10.1016/j.ympev.2023.107773

PubMed Abstract | CrossRef Full Text | Google Scholar

Stiller, J., Feng, S., Chowdhury, A.-A., Rivas-González, I., Duchêne, D. A., Fang, Q., et al. (2024). Complexity of avian evolution revealed by family-level genomes. Nature 629, 851–860. doi:10.1038/s41586-024-07323-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Takezaki, N. (2023). Effect of different types of sequence data on palaeognath phylogeny. Genome Biol. Evol. 15, evad092. doi:10.1093/gbe/evad092

PubMed Abstract | CrossRef Full Text | Google Scholar

Villaverde, T., Maguilla, E., Luceño, M., and Hipp, A. L. (2021). Assessing the sensitivity of divergence time estimates to locus sampling, calibration points, and model priors in a RAD-seq phylogeny of Carex section Schoenoxiphium. J. Syst. Evol. 59, 687–697. doi:10.1111/jse.12724

CrossRef Full Text | Google Scholar

Wang, Z., Zhang, J., Xu, X., Witt, C., Deng, Y., Chen, G., et al. (2022). Phylogeny and sex chromosome evolution of Palaeognathae. J. Genet. Genom. 49, 109–119. doi:10.1016/j.jgg.2021.06.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Wetmore, A. (1933). Fossil bird remains from the Eocene of Wyoming. Condor 35, 115–118. doi:10.2307/1363436

CrossRef Full Text | Google Scholar

Widrig, K., and Field, D. J. (2022). The evolution and fossil record of palaeognathous birds (Neornithes: palaeognathae). Diversity 14, 105. doi:10.3390/d14020105

CrossRef Full Text | Google Scholar

Worthy, T. H., Degrange, F. J., Handley, W. D., and Lee, M. S. Y. (2017). The evolution of giant flightless birds and novel phylogenetic relationships for extinct fowl (Aves, Galloanseres). R. Soc. Open Sci. 4, 170975. doi:10.1098/rsos.170975

PubMed Abstract | CrossRef Full Text | Google Scholar

Worthy, T. H., Hand, S. J., and Archer, M. (2014). Phylogenetic relationships of the Australian oligo–miocene ratite Emuarius gidju Casuariidae. Integr. Zoöl. 9, 148–166. doi:10.1111/1749-4877.12050

PubMed Abstract | CrossRef Full Text | Google Scholar

Worthy, T. H., Worthy, J. P., Tennyson, A. J., Salisbury, S. W., Hand, S. J., and Scofield, R. P. (2013). “Miocene fossils show that kiwi (Apteryx, Apterygidae) are probably not phyletic dwarves,” in Paleornithological research 2013—proceedings of the 8th international meeting of the society of avian paleontology and evolution (Vienna: Natural History Museum Vienna), 63–80.

Google Scholar

Wu, S., Rheindt, F. E., Zhang, J., Wang, J., Zhang, L., Quan, C., et al. (2024). Genomes, fossils, and the concurrent rise of modern birds and flowering plants in the Late Cretaceous. Proc. Natl. Acad. Sci. U. S. A. 121, e2319696121. doi:10.1073/pnas.2319696121

PubMed Abstract | CrossRef Full Text | Google Scholar

Xia, X. (2018). DAMBE7: new and improved tools for data analysis in molecular biology and evolution. Mol. Biol. Evol. 35, 1550–1552. doi:10.1093/molbev/msy073

PubMed Abstract | CrossRef Full Text | Google Scholar

Xia, X., and Lemey, P. (2009). “Assessing substitution saturation with DAMBE,” in The phylogenetic handbook: a practical approach to DNA and protein phylogeny. Editors P. Lemey, M. Salemi, and A.-M. Vandamme 2nd ed. (Cambridge: Cambridge University Press), 615–630.

CrossRef Full Text | Google Scholar

Xia, X., Xie, Z., Salemi, M., Chen, L., and Wang, Y. (2003). An index of substitution saturation and its application. Mol. Phylogenet Evol. 26, 1–7. doi:10.1016/S1055-7903(02)00326-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Z. (2007). PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591. doi:10.1093/molbev/msm088

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Z. H. (2006). Computational molecular evolution. Oxford: Oxford University Press.

Google Scholar

Yonezawa, T., Segawa, T., Mori, H., Campos, P. F., Hongoh, Y., Endo, H., et al. (2017). Phylogenomics and morphology of extinct paleognaths reveal the origin and evolution of the ratites. Geol. Soc. Lond., Spéc. Publ. 27, 68–77. doi:10.1016/j.cub.2016.10.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, G., Jarvis, E. D., and Gilbert, M. T. P. (2014a). A flock of genomes. Science 346, 1308–1309. doi:10.1126/science.346.6215.1308

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, G., Li, C., Li, Q., Li, B., Larkin, D. M., Lee, C., et al. (2014b). Comparative genomics reveals insights into avian genome evolution and adaptation. Science 346, 1311–1320. doi:10.1126/science.1251385

PubMed Abstract | CrossRef Full Text | Google Scholar

Glossary

Bayesian Relaxed Clock A molecular dating method that estimates divergence times between lineages by allowing evolutionary rates to vary across branches and incorporating uncertainty in both rates and fossil calibrations.

Calibration Prior A constraint based on (preferably) fossil or geological data used to convert the genetic differences into absolute (real) divergence ages.

Crown Group The most recent common ancestor of all living members of a group and all its descendants.

Divergence Time The point in evolutionary history when two species or lineages split from a common ancestor.

Fossil Constraint A fossil used to inform the minimum (and sometimes maximum) age of a node in a phylogenetic tree, often with a probabilistic prior distribution.

Ingroup/Outgroup In phylogenetics, the ingroup is the primary set of species under study, while the outgroup is a related lineage used for rooting the tree.

Mitogenome The complete mitochondrial genome of an organism, often used in phylogenetic analyses for its maternal inheritance and relatively rapid evolution.

Molecular Clock The classic 1965 hypothesis that predicted that genetic mutations accumulate at a constant rate, allowing estimates of divergence times based on genetic distance. The idea that substitution rates vary within and among genes, chromosomes, genetic compartments, and clades fostered the development of methods that take such variation and uncertainty into account.

Neornithes The clade comprising all modern birds, including both Palaeognathae and Neognathae.

Node (in a phylogenetic tree) A point on a tree that represents a divergence event where one ancestral lineage splits into two or more descendants.

Notopalaeognathae A clade within Palaeognathae that includes all members except the ostrich (Struthioniformes), thus comprising Rheiformes, Tinamiformes, Casuariiformes, Apterygiformes, and extinct relatives.

Partition Scheme A strategy where sequence alignments containing different data subsets (e.g., gene regions or codon positions) are grouped based on common patterns such as base composition and rate heterogeneity. The best evolutionary model is then selected for each partition and proceed to phylogenetic tree inference.

Phylogenomics The use of genome-scale data to reconstruct evolutionary relationships and histories.

Posterior Probability A statistical measure in Bayesian analysis indicating the likelihood of a hypothesis (e.g., a divergence time) given the data and model.

Sequence Alignment The process of arranging DNA, RNA, or protein sequences to identify regions of similarity that may indicate functional, structural, or evolutionary relationships.

Stratigraphy The study of rock layers (strata) and their use in dating fossils and reconstructing geological history.

Substitution Rate The speed at which genetic mutations become fixed in a lineage over time; a key parameter in tree reconstruction and molecular dating.

Keywords: aves, time tree, fossil priors, internal constraints, relaxed molecular clock

Citation: Selvatti AP and Takezaki N (2025) Internal fossil constraints have more effect on the age estimates of crown Palaeognathae than different phylogenomic data type. Front. Bioinform. 5:1563786. doi: 10.3389/fbinf.2025.1563786

Received: 20 January 2025; Accepted: 02 July 2025;
Published: 07 August 2025.

Edited by:

Jack M Craig, Temple University, United States

Reviewed by:

Marcel Van Tuinen, University of Groningen, Netherlands
Toni Gossmann, Technical University Dortmund, Germany
Patrick Arnold, University of Potsdam, Germany

Copyright © 2025 Selvatti and Takezaki. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Alexandre Pedro Selvatti, YWxleGFuZHJlLnBlZHJvLm51bmVzQHVlcmouYnI=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.