HYPOTHESIS AND THEORY article

Front. Genet.

Sec. Evolutionary and Population Genetics

Volume 16 - 2025 | doi: 10.3389/fgene.2025.1610942

Slavs in the Closet: Computational Genomic Analysis Reveals Cryptic Slavic Signatures in the Avar Khaganate and Their Contribution to Medieval Croatian Population Formation

Provisionally accepted
Svetoslav Stamov1*, Todor Chobanov2*
  • 1Regional Historical Museum, Burgas, Bulgaria
  • 2Bulgarian Academy of Sciences (BAS), Sofia, Sofia City, Bulgaria

The final, formatted version of the article will be published soon.

Our study applies a systematic computational genomic approach to investigate the complex population dynamics of Southern Slavs in the Hungarian Plain and the Avar Khaganate, and their subsequent role in forming the medieval Croatian population. Using a quality-controlled dataset of 1,800 ancient DNA samples, we implemented an analytical framework centered on systematic screening of marginal principal components (PCs) to detect cryptic Slavic genetic signatures. This approach addresses a well-documented analytical challenge: Germanic and Slavic populations remain indistinguishable in conventional PC1-2 analysis because of shared Baltic Bronze Age ancestry. Through systematic evaluation of all principal components (PC1-20), we identified PC9 as a reliable indicator of Slavic ancestry within European ancient DNA samples when combined with PC4 and PC3. This approach revealed substantial Baltic genetic components in early Slavic populations (57% in Slovakia/Slovenia), decreasing to 39-51% in medieval Croatian samples. Statistical modeling indicates that contemporary Croatian populations formed through three distinct migration waves, with 50-60% total Slavic ancestry and 20-25% pre-Slavic Balkan continuity. Significantly, we identified individuals with Slavic genetic profiles in prestigious Avar burial contexts, which calls into question the established understanding of social hierarchies within the Khaganate. The genomic evidence indicates that key aspects of South Slavic genetic structure emerged through interactions within the Carpathian Basin rather than after arrival in the Balkans. Our findings indicate that Croatian ethnogenesis involved gradual integration rather than population replacement, with the Avar Khaganate serving as a crucial demographic interface.
Our approach addresses longstanding historical questions regarding Croatian ethnogenesis by identifying specific genetic signatures and quantifying their population-level contributions. It also demonstrates how the application of computational genomics provides unprecedented resolution in studying complex population transformations when traditional historical and archaeological approaches reach their interpretive limits.
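
The idea of screening marginal principal components can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the data are synthetic, the function names `pca_scores` and `rank_components` are hypothetical, and the ranking criterion (an absolute standardized mean difference between two labeled groups) is an assumption chosen for illustration, not a statistic named in the article. The sketch plants a group difference on a low-variance axis, so that a plot of PC1 against PC2 would show little separation while a later component carries the signal.

```python
import numpy as np


def pca_scores(X, n_components=20):
    """Return sample scores on the leading principal components (via SVD)."""
    Xc = X - X.mean(axis=0)                      # center each feature
    U, S, _ = np.linalg.svd(Xc, full_matrices=False)
    k = min(n_components, S.size)
    return U[:, :k] * S[:k]                      # rows: samples, cols: PC1..PCk


def rank_components(scores, labels, group_a, group_b):
    """Rank PCs by absolute standardized mean difference between two groups."""
    a = scores[labels == group_a]
    b = scores[labels == group_b]
    pooled_sd = np.sqrt((a.var(axis=0, ddof=1) + b.var(axis=0, ddof=1)) / 2)
    d = np.abs(a.mean(axis=0) - b.mean(axis=0)) / pooled_sd
    order = np.argsort(d)[::-1]                  # best-separating PC first
    return [(int(i) + 1, float(d[i])) for i in order]


# Synthetic stand-in for a genotype matrix (assumption): 400 samples x 50 features.
rng = np.random.default_rng(0)
sd = np.ones(50)
sd[:9] = [20, 18, 16, 14, 12, 10, 8, 6, 3]      # shared high-variance structure
X = rng.normal(size=(400, 50)) * sd
labels = np.array(["A"] * 200 + ["B"] * 200)
X[labels == "B", 8] += 4.0                       # group signal on a low-variance axis

ranking = rank_components(pca_scores(X), labels, "A", "B")
for pc, d in ranking[:3]:
    print(f"PC{pc}: standardized separation = {d:.2f}")
```

In this toy setup the two groups share all of the high-variance structure, so the leading components would not be expected to separate them. Ranking every component up to PC20, rather than inspecting only PC1-2, is what surfaces the axis that does.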

Keywords: Ancient DNA, Slavic migrations, Avar Khaganate, Croatian ethnogenesis, Computational genomics, Principal Component Analysis (PCA)

Received: 17 Apr 2025; Accepted: 18 Aug 2025.

Copyright: © 2025 Stamov and Chobanov. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Svetoslav Stamov, Regional Historical Museum, Burgas, Bulgaria
Todor Chobanov, Bulgarian Academy of Sciences (BAS), Sofia, 1040, Sofia City, Bulgaria

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.