AUTHOR=Tay Guan K. , Henschel Andreas , Daw Elbait Gihan , Al Safar Habiba S. TITLE=Genetic Diversity and Low Stratification of the Population of the United Arab Emirates JOURNAL=Frontiers in Genetics VOLUME=Volume 11 - 2020 YEAR=2020 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2020.00608 DOI=10.3389/fgene.2020.00608 ISSN=1664-8021 ABSTRACT=With high consanguinity rates on the Arabian Peninsula, it would not have been unexpected if the population of the UAE was shown to be relatively homogenous. However, this study of 1,000 UAE nationals, provided a contrasting perspective, one of a relatively heterogeneous population. Located at the apex of Europe, Asia and Africa, the observed diversity could be explained by a plethora of migration patterns since the first Out-of-Africa movement. A strategy to explore the extent of genetic variation of the population of the UAE is presented. The first step involved a comprehensive population stratification study that was instructive for subsequent Whole Genome Sequencing (WGS) of suitable representatives (which is described elsewhere). When this UAE data was compared to previous smaller studies from the region, the findings were consistent with a population that is a diverse and admixed group of people. However, rather than sharp and distinctive clusters, a continuous genotype distribution was observed. The analysis showed a continuous gradient of ancestral populations, suggesting that admixture on the south eastern tip of Arabian Peninsula occurred gradually. When visualized using a unique technique that combined admixture ratios and Principal Component Analysis (PCA), unappreciated diversity was revealed while mitigating projection bias of conventional PCA. The analytical strategy used here highlights the complementary nature of data from genotype array and WGS for anthropological studies. Specifically, genotype array data was instructive to select representative subjects for WGS. Furthermore, from the 2.3 million allele frequencies obtained from genotype arrays, we identified 46,481 loci with allele frequencies that were significantly different with respect to other world populations. This comparison of allele frequencies facilitates variant prioritization in common diseases. In addition, these loci bear great potential as biomarkers in anthropological and forensic studies.