ORIGINAL RESEARCH article

Front. Bioinform.

Sec. Genomic Analysis

Volume 5 - 2025 | doi: 10.3389/fbinf.2025.1550730

Utility of Regional STR Marker Variations in Tunisian and sub-Saharan Populations: Insights into Forensic and Population Genetics  

Provisionally accepted
  • 1Laboratory of Human Genome and Multifactorial Diseases, Faculty of Pharmacy, University of Monastir, Monastir 5000, Tunisia, Monastir, Tunisia
  • 2Department of Genetics, Farhat Hached University Hospital,, Sousse, Tunisia
  • 3Laboratory of Cytogenetics, Molecular Genetics and Reproductive Biology (LR03SP02), Farhat Hached University Hospital,, Sousse, Tunisia
  • 4Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand,, Johannesburg, South Africa
  • 5Higher Institute of Biotechnology of Monastir, Monastir, Tunisia

The final, formatted version of the article will be published soon.

This study investigates the genetic variability and forensic applicability of Short Tandem Repeat (STR) loci including autosomal, X and Y-STR markers, across distinct Tunisian regions and among sub-Saharan African populations. Our objectives were to examine the regional allelic diversity of STR markers in Tunisia, and to assess the utility of these markers for forensic differentiation between Tunisian and sub-Saharan African. Twenty two STRs were genotyped in 500 Tunisian individuals and 501 sub-Saharan corpses by capillary electrophoresis using commercial system kits.A Chi-square test for homogeneity was applied to assess allele distribution and Principal Component Analysis to assess geographical allele variations. Bioinformatic methods in R packages were used, such as Logistic Regression Model to predict geographic group membership and Random Forest models to evaluate the discriminative power of the analyzed STRs. Statistical analyses revealed significant allelic variability between Northern, Central, and Southern Tunisia for markers such as D1S1656, D8S1179, and CSF1PO. PCA illustrated a clear genetic distinction between Tunisian and sub-Saharan populations, largely attributable to geographical and historical gene flow barriers. LRM achieved high accuracy (95.96%) in predicting geographic affiliation. RF analysis identified DYS391 as highly discriminative in population differentiation. Our findings align with prior research on Tunisian genetic diversity and extend this knowledge by illustrating allelic frequency variations in order to establish region-specific databases. This study contributes valuable insights into the genetic structure of Tunisian and sub-Saharan populations, emphasizing tailored approaches in forensic practices.

Keywords: Allele frequency, Genotype, Power of discrimination, random forest, Y-STR

Received: 11 Jan 2025; Accepted: 27 May 2025.

Copyright: © 2025 ATTAOUI, FODDHA, OTHMAN, BEN ABDENNEBI and HAJ KHELIL. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Amel HAJ KHELIL, Higher Institute of Biotechnology of Monastir, Monastir, 5000, Tunisia

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.