- 1Department of Biology, University of Puerto Rico – Rio Piedras Campus, San Juan, PR, United States
- 2Departmenf of Genetics, Stanford University, Stanford, CA, United States
- 3Department of Translational Neuroscience, Wake Forest University School of Medicine, Winston-Salem, NC, United States
- 4Department of Microbiology, Committee on Microbiology, University of Chicago, Chicago, IL, United States
- 5Department of Medicine, University of Puerto Rico – Medical Sciences Campus, San Juan, PR, United States
- 6Department of Medicine, Central University of the Caribbean, Bayamón, PR, United States
- 7Department of Genetics, Washington University in St. Louis School of Medicine, St. Louis, MO, United States
Cardiovascular diseases (CVDs) are the leading cause of death worldwide, claiming millions of lives each year. Genome-wide association studies (GWASs) have identified thousands of CVD-associated variants and have created the foundation for risk assessment and prevention through genetic testing. However, despite all the progress in understanding cardiovascular genomics, our genetic research and findings are overwhelmingly skewed towards individuals of European ancestry. This fact has limited our understanding and effectiveness for the diagnosis and treatment of CVDs in underrepresented populations, such as individuals of African ancestry. This gap is especially consequential because African ancestry populations harbor the greatest global genetic diversity, with variant frequencies and haplotypes that are often poorly captured by current reference datasets. In this review, we highlight recent efforts to understand the effectiveness of current tools in accurately diagnosing and treating CVDs in individuals of African ancestry compared to other populations. Additionally, we also performed a multi-database analysis to explore the persistent diversity gap in cardiovascular genetics. In doing so, we aim to raise awareness about the ancestry gaps faced in disease genomic research, supported by recent findings and the current landscape of our genetic databases.
Introduction
Cardiovascular diseases (CVDs) remain the leading cause of mortality worldwide and are projected to increase by 90% over the next 25 years, with an estimated 35.6 million CVD-related deaths anticipated by 2050 (Li et al., 2023; Amini et al., 2021; Gaidai et al., 2023). In response to this growing burden, prevention strategies have increasingly emphasized the integration of genetic information into risk assessment tools. However, prediction of CVD risk remains imprecise due to the remaining ancestry gaps in biobanks and genome-wide association studies (GWASs) (Fernández-Rhodes, 2023). Despite global emphasis on diversity, over 90% of GWAS as of 2023 still focus on individuals of European descent, leaving other ancestry groups, especially African populations, significantly underrepresented (Zhang et al., 2023; Buniello et al., 2019). This imbalance highlights a systemic Eurocentric bias in genomic research; resources like the United Kingdom Biobank are widely used simply because large-scale genomic datasets for African and other underrepresented populations have not been developed to the same extent (Fatumo et al., 2022). As a result, polygenic risk scores (PRSs), which are derived mostly from these Eurocentric datasets, perform poorly in African individuals, whose unique genetic architecture is insufficiently captured (Ndong Sima et al., 2024; Adam et al., 2022). Paradoxically, Africa harbors the greatest genetic diversity in the world, yet many disease-relevant variants remain undiscovered due to limited inclusion in genetic research (Oni-Orisan et al., 2021; Tishkoff and Verrelli, 2003). This lack of representation reduces the clinical accuracy of PRSs and similar tools for African populations, hindering effective CVD risk prediction and widening disparities in precision medicine. These gaps are especially concerning given that African populations bear the highest global burden of cardiovascular disease and non-communicable disease mortality (Li et al., 2023; O’Sullivan et al., 2022; Minja et al., 2022).
A growing body of literature has examined the consequences of ancestral underrepresentation in genomic research, particularly related to CVD and African ancestry populations. Several reviews have already documented how Eurocentric study designs perpetuate disparities in both variant discovery and clinical translation. Bentley et al. (2020), for instance, highlighted the shortcomings of excluding African genomes, emphasizing that failure to include the world’s most genetically diverse populations limits the discovery of causal variants and reduces the applicability of genomic medicine (Bentley et al., 2020). Clarke et al. (2021) expanded on this point by demonstrating how racial disparities propagate across the cardiovascular genomics pipeline from genome assembly to clinical interpretation, ultimately limiting the utility of tools such as PRSs in non-European groups (Clarke et al., 2021). Earlier systematic evaluations, such as Kaufman et al. (2015), highlighted that genomic research had contributed relatively little to understanding the drivers of racial disparities in CVD, largely because key studies lacked adequate African ancestry representation (Kaufman et al., 2015). More recent reviews indicate that despite growing awareness, this issue persists. Ojewunmi and Fatumo (2025) argued that African genomic data remain essential for achieving equity in precision medicine and for building reference panels capable of capturing Africa’s enormous genetic diversity (Ojewunmi and Fatumo, 2025). Similarly, Jurado Vélez et al. (2025) emphasized that equitable cardiovascular genomics requires not only diverse sampling but also African-led research infrastructures, ethical frameworks, and sustainable capacity building (Jurado Vélez et al., 2025). Beyond broad assessments, several focused analyses have addressed specific CVD traits. For example, Bose et al. (2023) highlighted how genetic studies have struggled to explain racial disparities in CVD due to unbalanced sampling and the absence of fine mapping in African ancestry cohorts (Bose et al., 2023). Singh et al. (2021) further explored this challenge in their review of blood pressure and hypertension GWASs in African ancestry populations. They show that only a small fraction of known loci identified in European cohorts replicated in African cohorts, highlighting substantial population-specific biology (Singh et al., 2021).
Collectively, prior research underscores a consistent conclusion: African ancestry populations remain critically underrepresented across cardiovascular genomics research. This review builds on prior work by integrating cross-population GWAS findings with a multi-database quantitative analysis of cardiovascular genetic diversity, while also examining recent data on PRS performance, regulatory variation, and epigenetic differences across ancestries. We highlight the unique genetic architecture of African populations, the scientific and clinical consequences of their exclusion, and the structural, ethical, and logistical barriers that hinder their participation in global genomics research. Finally, this review outlines current initiatives addressing these challenges and proposes future strategies to enhance inclusion, foster equitable research, and improve precision medicine outcomes for African populations.
The current state of genetic research in CVD
GWASs are a fundamental tool used globally to identify and annotate genetic variants associated with complex human traits and diseases (Buniello et al., 2019). The National Human Genome Research Institute–European Bioinformatics Institute (NHGRI-EBI) GWAS Catalog currently includes data from over 7,000 published studies, covering more than 5,000 diseases and traits. Increasing recognition of the value of GWASs has led to substantial growth in study sample sizes, resulting in more statistically significant associations across a wide range of phenotypes (Mills and Rahal, 2020; Abdellaoui et al., 2023). However, despite initiatives such as Human Heredity and Health in Africa (H3Africa) (Ramsay et al., 2016) and the African Genome Variation Project (Gurdasani et al., 2015) aimed at enhancing representation, GWASs remain predominantly Eurocentric (Figure 1A). As of mid-2025, individuals of European ancestry comprise 90.53% of all GWAS participants, while the representation of African ancestry populations has shown minimal progress over the past decade (Mills and Rahal, 2020). In over 15 years, the percentage of GWASs conducted primarily on individuals with European ancestry has only dropped 5% since the first quantitative review of ancestral diversity in GWAS by Need et al. in 2009 (Figure 1B) (Need and Goldstein, 2009; Corpas et al., 2025). This lack of diversity is not just limited to the broader landscape of genetic research, but is particularly pronounced in cardiovascular GWASs, where the underrepresentation of African and other non-European populations remains a persistent issue (Figures 1C,D; Supplementary Figure S1).
Figure 1. The state of cardiovascular genomic research. (A) Number of participants in GWASs from different regions worldwide. Note: Participants may be counted multiple times if their genome was used in more than one GWAS. The data used to generate the figure was downloaded from https://gwasdiversitymonitor.com. Legend is represented on a logarithmic scale. (B) Number of individual genomes sequenced per database. A fraction of African ancestry genomes is marked in blue, and percentages are at the top of each bar. (C) Number of participants from each ancestry in cardiovascular GWAS from 2008 to 2023. (D) Percentage of participants from each ancestry in cardiovascular GWAS from 2008 to 2023. Full information and details from (A–D) are available in Supplementary Material 1. (E) Total number of participants in GWASs for cardiovascular diseases and measurements to date. The color scheme for each population is the same as (D). The source data was extracted from the GWAS diversity monitor database and is provided in the Supplementary Material 1. Note: The African ancestries displayed are a combination of sub-Saharan, African American, and Afro-Caribbean.
In 2021, Clarke et al. noted the lack of replication studies in coronary artery diseases (CAD), the largest studied cardiovascular phenotype, involving individuals of African and Latin American ancestry (Clarke et al., 2021; Singh et al., 2023; Matarin et al., 2008; Helgadottir et al., 2008; Cluett et al., 2009; Barad et al., 2024). Four years later, this observation largely remains accurate, as many genetic studies on CAD continue to rely exclusively on cohorts of European ancestry. For instance, Jain et al. (2024) investigated the role of sex hormones in CAD using only participants from the United Kingdom Biobank, all European descent. Similarly, Trenkwalder et al. (2025) explored the genetic overlap between CAD and aortic stenosis using data from sources like the Estonian Biobank and the FinnGen project, all limited to European ancestry (Trenkwalder et al., 2025). Although some researchers aim to include individuals of African ancestry, these efforts are hindered by small sample sizes, often limiting the statistical power and depth of analyses compared to those conducted in European populations (Sapkota et al., 2022; Rocheleau et al., 2024). Many studies investigating stroke, hypertrophic cardiomyopathy (HCM) and dilated cardiomyopathy (DCM) have also largely excluded African, Latin/Hispanic, or admixed populations (Traylor et al., 2017; Silva et al., 2023; Doumatey et al., 2023), reflecting the remaining ubiquitousness of underrepresentation across GWASs despite the growing burden of these diseases in African populations (Figure 1E) (Minja et al., 2022; Jordan et al., 2023; Keates et al., 2017).
The impact of the diversity gap in cardiovascular genomics
The Eurocentric bias in GWASs has carried over into recent advances in cardiovascular genomics, particularly the development of polygenic risk scores (PRSs). PRSs estimate an individual’s genetic predisposition to disease by aggregating the weighted effects of multiple variants, offering clinical value for early prevention and intervention strategies (Bentley et al., 2020; Singh et al., 2023). However, because virtually all PRSs depend on GWASs to derive variant effect sizes (Xiang et al., 2024; Zhao et al., 2024), their performance is often compromised in non-European populations. This is due to the overwhelming predominance of European ancestry data in GWAS, which limits the applicability of PRSs in more genetically diverse groups, such as those of African ancestry (Fahed et al., 2021; Dikilitas et al., 2022). For example, a 2024 study by Topriceanu et al. found that polygenic scores for cardiovascular diseases performed significantly better (p-value < 0.025) in White European (AUC = 0.77, 95% CI [0.76, 0.78]) and South Asian (AUC = 0.74, 95% CI [0.70, 0.77]) individuals than in those of African ancestry (AUC = 0.66, 95% CI [0.61, 0.72]). Odds ratios (OR) were also greater for White Europeans (OR = 1.61, 95% CI [1.56, 1.66]) than for individuals of African ancestry (OR = 1.20, 95% CI [1.09, 1.32]) (Topriceanu et al., 2024). Similarly, Smith et al. (2024) reported that even when using summary statistics from the largest congenital heart disease (CHD) meta-analysis to date and the most ancestrally diverse cohort available, PRS performance for individuals of African ancestry still fell behind that of other groups (Smith et al., 2024). Hence, multi-ancestry PRSs developed using pruning and thresholding methods showed the strongest association with CHDs in individuals of European (OR = 1.65 95%, CI [1.59–1.72]) or South Asian ancestry (OR = 2.75, 95% CI [2.41–3.14]), while the lowest was identified in African ancestry (OR = 1.16, 95% CI [1.11–1.21]) (Smith et al., 2024).
Beyond differences in allele frequencies and effect sizes, a growing body of work across large biobanks has shown that PRS transferability is strongly constrained by cohort-specific LD structure and ancestry composition. Multiple biobank evaluations consistently demonstrate that PRS trained in predominantly European datasets lose substantial predictive accuracy when applied to African ancestry cohorts in resources such as United Kingdom Biobank, the Million Veteran Program, and other population-specific datasets (Martin et al., 2019; Duncan et al., 2019; Ding et al., 2023). By comparing ancestry representation in 733 polygenic scoring studies with global population estimates and evaluating cross-ancestry PRS performance, a study by Duncan et al. (2019) demonstrates both extreme European over-representation (∼460%) and substantial African and Latino under-representation, alongside reduced PRS accuracy in non-European groups (Duncan et al., 2019). Ding et al. (2023) similarly reported that PRS accuracy declines with increasing genetic distance in the ATLAS precision health biobank. Genetic distance was defined as the Euclidean distance between a target individual and the center of the training data in the principal component space. They found a strong negative correlation between genetic distance and PRS accuracy in African American individuals (R = −0.88). Moreover, those in the most genetically distant decile exhibited approximately four-fold lower PRS accuracy compared to individuals genetically closest to the training population, highlighting poor PRS transferability across ancestries (Ding et al., 2023). Importantly, reduced performance is not solely attributable to continental ancestry differences since even within the same ancestry label, PRS accuracy can vary considerably between biobanks due to differences in factors like sampling frameworks and environmental covariates (Mostafavi et al., 2020). Together, these observations reinforce that the diversity gap documented throughout this review is not merely a matter of unequal representation but also a mechanistic barrier that limits the portability of genetic risk models. They also highlight the need for either larger GWAS in underrepresented populations or methodological approaches that leverage multi-ancestry training and recalibration in target cohorts to ensure equitable risk prediction (Kurniansyah et al., 2023; Kachuri et al., 2024).
In parallel with limitations of PRS transferability, widely used clinical risk scores for CVD also demonstrate racial miscalibration, potentially aggravating inequities when combined with genomic predictors. Multiple evaluations of the Pooled Cohort Equations (PCE) and Framingham models document systematic over- or under-estimation of 10-year atherosclerotic CVD (ASCVD) risk in African ancestry individuals, with miscalibration errors exceeding ∼20–30% in some settings (Yadlowsky et al., 2018; DeFilippis et al., 2015; Khan et al., 2024). For example, Yadlowsky et al. found that PCE substantially misestimated 10-year ASCVD risk, with large errors for Black adults, a third of whom received extreme risk estimates that were less than 70% or more than 250% of those for White adults with matching risk profiles (Yadlowsky et al., 2018; DeFilippis et al., 2015). More recently, newer tools like the American Heart Association’s PREVENT equations and the UK’s QR4 algorithm have improved performance in their derivation cohorts. PREVENT, which is race-free, achieved strong discrimination and calibration in U.S. validation cohorts (Ade et al., 2024). QR4, optimized in very large United Kingdom primary-care records, also outperforms legacy United Kingdom risk scores (Hippisley-Cox et al., 2024). However, critical gaps remain: PREVENT’s external validation is based on U.S. data and aggregated racial categories that may not reflect the genetic and social diversity of continental African populations (Ade et al., 2024; Hippisley-Cox et al., 2024). Additionally, QR4 was developed and validated exclusively on United Kingdom data. While the authors highlight its superior performance compared to international risk scores, a lack of validation in non-UK populations suggests the need for external validation before broader international application (Hippisley-Cox et al., 2024). Because these clinical tools are increasingly used alongside PRS to guide preventive therapy, these limitations risk compounding bias in risk stratification, highlighting an urgent need for model recalibration in diverse cohorts and for ancestry-aware combination of clinical and genomic risk estimation (Khan et al., 2024).
The underrepresentation of African ancestry and other minoritized populations in genomic datasets not only hampers predictive accuracy but also reduces the clinical utility of these tools for identifying clinically significant variants, particularly those not shared across ancestries (Bentley et al., 2020; Jurado Vélez et al., 2025; Manrai et al., 2016). For instance, Landry and Rehm (2018) demonstrated that genetic testing for cardiomyopathy yielded a higher detection rate of pathogenic or likely pathogenic variants in White patients (29.0%; 1,314/4,539) compared to individuals from underrepresented minority (URM) groups—defined as African, Hispanic, Native American, and Pacific Islander ancestries—who had a detection rate of only 18.4% (155/842; χ2 = 39.8; p-value <0.001). Additionally, inconclusive results were significantly more frequent among URM patients (39.8%) than White patients (24.6%) (Landry and Rehm, 2018; Huang et al., 2024). More recently, Jordan et al. (2024) reported that patients of African ancestry had a markedly lower proportion of clinically actionable variants for dilated cardiomyopathy (8.2%, 95% CI, [5.2%–11.1%]) compared to patients of European ancestry (25.5%, 95% CI [21.3%–29.6%]), further underscoring the clinical consequences of ancestry-biased data representation.
Recognizing these limitations, the Polygenic Risk Methods in Diverse Populations (PRIMED) Consortium, an NIH-funded effort, was launched in 2024 to improve PRS performance across ancestries. PRIMED develops and benchmarks methods that account for cross-population differences in allele frequencies and LD structure, supported by a coordinated framework for harmonizing multi-ancestry datasets on cloud platforms such as AnVIL (Kullo et al., 2024). Recent work from the PRIMED Methods Working Group shows that multi-ancestry PRS models and ancestry-informed recalibration strategies improve prediction accuracy in underrepresented populations, including African ancestry cohorts (Kachuri et al., 2024). Through shared data resources and systematic evaluation across global populations, PRIMED remains one of the most active initiatives driving improvements in PRS transferability.
Unique genetic architecture of African populations
Current precision medicine approaches for disease risk prediction and detection are significantly constrained by their inability to account for the extensive genetic heterogeneity observed in African ancestry populations. African genomes, on average, harbor approximately one million more genetic variants than non-African genomes (Sibomana, 2024; Zhang et al., 2022; Kamp et al., 2024). They also have lower linkage disequilibrium (LD), elevated allele frequency distributions for risk-associated variants, as well as the highest predicted 3D genome diversity (Kamp et al., 2024; Gilbertson et al., 2024). A study examining allele frequencies at 3,036 GWAS loci revealed that African populations have significantly higher frequencies of risk alleles compared to non-African populations (mean difference +1.15%, p-value = 0.0213, paired Wilcoxon signed-rank test) (Kim et al., 2018). We observed the same trend after performing a cardiac expression quantitative trait loci (eQTL) analysis of CVD-risk variants in African LD blocks (Figure 2A).
Figure 2. Ancestry-specific genetic architecture and regulatory mechanisms. (A) Effect allele frequency per population from expression quantitative trait loci (eQTL) in cardiac tissue (heart left ventricle and atrial appendage). The R2 value in each population is calculated based on the correlation with the African frequency. (B) Effect size of all SNPs in cardiac tissue in African (top) and European (bottom) ancestries. Genes in eQTL with an effect size >1.0 are colored orange and labeled. Genes unique for each ancestry LD block are labeled in red. (C) Number of unique cardiac GWAS single nucleotide polymorphisms (SNPs) per population. (D) Effect size of cardiac eQTL genes of African-specific SNPs. SNP-GENE combinations are labeled, with duplicate combinations representing different cardiac tissues. (E) Effect size of African-specific genes in cardiac eQTL. The data used to generate this figure was downloaded from Peña-Martínez et al. (2023) (Peña-Martínez et al., 2023; Peña-Martínez et al., 2025). A total of 16,026 CVD-risk variants that occur in at least one ancestry LD block were used to analyze differences in allelic frequencies and effect size across all five populations (see full list in Supplementary Material 3). Lead SNPs were identified from the GWAS catalog, followed by an LD expansion performed independently in each ancestry (R2 ≥ 0.80) using LDlinkR (Myers et al., 2020) and a cardiac eQTL analysis extracted from the GTEx database (Lonsdale et al., 2013). Output from LDlinkR includes the source population, allelic frequencies, and the effect size of each allele in different cardiac tissues.
Differences between African and non-African populations extend beyond genetic sequences and variants, also encompassing regulatory elements and molecular interactions that govern gene expression and biological function. For example, cis-regulatory elements, such as enhancers, can exhibit population-specific activity. A study by Pettie et al. (2024) found that transcription factors (TFs) like NF-κB, JunD, and PU.1 showed signs of lineage-specific selection in enhancers but not in promoters (Pettie et al., 2024). This indicates that enhancers may play a key role in the differential regulation of gene expression across ancestries. The study underscores the utility of chromatin-level data in revealing hidden patterns of genetic variation shaped by evolutionary pressures. In many cases, such divergence is driven by variants that alter TF binding, pointing to population-specific regulatory mechanisms (Pettie et al., 2024; Tian et al., 2018; Husquin et al., 2018). Additional evidence comes from Patel et al. (2022), who analyzed data from African American and European American individuals in the Multi-Ethnic Study of Atherosclerosis (MESA). They reported significantly smaller gene expression effect sizes from trait-associated SNPs in African ancestry individuals compared to Europeans (slope = 0.85, 95% CI [0.81–0.89]) (Patel et al., 2022). This likely reflects differing patterns of gene-by-environment and gene-by-gene interactions, reinforcing the importance of incorporating diverse populations to better capture the full spectrum of regulatory variation in human genomics.
Epigenetic studies further reveal profound ancestry-specific differences that are often overlooked in Eurocentric research. Epigenetic influences shaped by environment and lifestyle, such as DNA methylation and non-coding RNAs, also appear to affect disease risk differently across ancestries. A review by Opare-Addo et al. (2024) found that these mechanisms may modulate stroke susceptibility in African populations, especially in response to psychosocial stressors (Opare-Addo et al., 2024). Despite this, only six studies have investigated the role of epigenetic regulation in stroke risk among individuals of African ancestry, revealing a stark gap in our understanding of ancestry-specific disease etiology. Additionally, broader evidence suggests that chromatin architecture and activity may differ by ancestry (Gilbertson et al., 2024), indicating that higher-order gene regulatory processes vary across populations in ways we are only beginning to understand.
Despite the rich genetic diversity originating from the African continent, GWASs have largely failed to capture this variation. We accessed cardiac tissue eQTL data on CVD GWAS variants and observed differences in effect size across ancestry LD blocks (Figure 2B; Supplementary Figure S2). Notably, while European ancestry populations lead in the discovery of unique single nucleotide polymorphisms (SNPs), African populations display a more than 40-fold lower count of registered cardiovascular GWAS SNPs, (Figures 2C,D). Conversely, cardiac eQTL analysis in African LD block also showed unique genes with genotype-dependent activity in cardiac tissue (Figure 2E) (Peña-Martínez et al., 2023; Peña-Martínez et al., 2025). The significant underrepresentation of African genomic variation helps explain the limited applicability of many existing PRSs in individuals of African descent, as trait-associated variants that are common in African populations may be rare or entirely absent in European-based GWAS cohorts.
Recent methodological work has focused on improving the transferability and fairness of PRS for underrepresented and admixed populations. Two papers by Martin and colleagues highlight the scope of this problem. Using 1000 Genomes data and GWAS summary statistics for multiple traits, Martin et al. (2017) demonstrated that PRS derived from single-ancestry GWAS are biased by demographic history and genetic drift, leading to poor performance across populations (Martin et al., 2017). Martin et al. (2019) further showed that current PRS are far more accurate in individuals of European ancestry, a consequence of Eurocentric GWAS sampling that may worsen health disparities if these scores are used clinically (Martin et al., 2019). Building on these observations, several methods have been recently developed to account for ancestry-related bias and heterogeneity. For example, PRS-CSx integrates GWAS summary statistics from multiple populations and integrates effect-size estimates through a Bayesian modeling framework, leveraging linkage disequilibrium (LD) diversity to improve prediction across ancestries (Ruan et al., 2022). Another method, GAUDI (Genetic Ancestry Utilization in polygenic risk scores for aDmixed Individuals), takes a different approach by using penalized regression to model ancestry-specific genetic effects while borrowing information across ancestral populations (Sun et al., 2024). Applied to data from the Women’s Health Initiative, GAUDI improved prediction for traits such as white blood cell count and C-reactive protein by over 60% in African-American samples, outperforming several existing methods, including PRS-CSx in some settings (Sun et al., 2024). FairPRS addresses the same challenge through invariant risk minimization (IRM). It uses an autoencoder-based framework that combines empirical and invariant risk objectives to generate ancestry-fair PRS or to debias existing scores. Evaluations on synthetic and United Kingdom Biobank data showed that FairPRS improves phenotype prediction while reducing ancestry-related bias (MacHado Reyes et al., 2023). Together, these studies illustrate complementary strategies that directly address the transferability and equity challenges described by Martin et al. They underscore the importance of incorporating ancestry-aware or multi-population models when deploying PRS in diverse clinical and research settings.
Immigration, genomic research, and clinical outcomes
As of 2024, the number of international migrants has almost doubled since 1990, reflecting an increase from 2.9% to 3.7% of the world population (United Nations International Migrant Stock, 2024). Europe and North America are the regions receiving the most international migrants, and as the wealthiest areas of the world, they also have the most access to genetic testing and precision medicine. However, as previously discussed, the genetic databases upon which this healthcare depends do not equally benefit people of all ancestries.
Recent migration and diaspora movements have significantly shaped the distribution of African ancestry in global genomic cohorts, such that both historical forced migrations (e.g., the trans-Atlantic slave trade) and more recent voluntary mobility produce measurable admixture patterns and shifts in relatedness that directly affect association testing and ancestry-aware analyses in genomics. Understanding these patterns is necessary to contextualize the underrepresentation of African ancestry in genomic research. For example, analyses of African-American cohorts reveal that twentieth-century internal migrations within the United States (“Great Migration”) altered regional ancestry proportions and increased long-range genetic relatedness, showing how recent demographic mobility is reflected in genomic structures used for disease mapping and population studies (Baharian et al., 2016). Broader genomic investigations highlight that the underrepresentation of individuals with diverse ancestry in genomic research limits both discovery and translational impact, and that greater inclusion of such populations can advance understanding of genetic variation and health disparities (Fernández-Rhodes, 2023; Bentley et al., 2020). Moreover, considering African populations themselves is equally important, as their rich genetic diversity further informs global reference datasets. Within Africa itself, widespread genetic diversity shaped by ancient and recent movements and admixture underscores that both local and long-distance population flows contribute to the genetic landscape underpinning reference panels and risk prediction models (Busby et al., 2016). Finally, because sub-Saharan Africa has a comparatively young, working-age population and continued population growth relative to Europe and North America, demographic momentum will likely make African sources increasingly important for future international migration patterns, reinforcing the need for migration-informed genomic sampling strategies in global research (Mueller et al., 2019; Cottier, 2024).
Taken together, these demographic and genetic factors highlight why precision medicine must integrate population history to assess treatment effectiveness across ancestries accurately. Precision medicine aims to address variation in treatment effectiveness of diseases, and those of African descent can respond differently to CVD treatments compared to other ethnicities. Differences in effectiveness have been shown in the use of diuretics, calcium channel blockers, warfarin, and statins (including lovastatin and simvastatin) (Brewster et al., 2004; Dang et al., 2005; Shear et al., 1992; Krauss et al., 2008). Adverse reactions to angiotensin-converting enzyme (ACE) inhibitors have been demonstrably more prevalent in African Americans compared to white Americans, and the risk of hemorrhagic complications in patients with intracranial hemorrhage treated with thrombolytics shows a similar relationship (Wright et al., 2005; Mehta et al., 2014).
An unknown portion of this treatment discrepancy can be attributed to genetic variation. This challenge is exemplified in the pharmacogenomics of warfarin, where genes such as CYP2C9 and VKORC1, often harboring unique mutations in African populations, significantly affect dosage requirements and risk of adverse drug reactions (Sibomana, 2024; Tata et al., 2020; Asiimwe et al., 2024). The CYP2D6 gene, which affects the metabolism of hypertensive medications such as β-blockers, varies in frequency across the world (Lefebvre et al., 2007; Hamadeh et al., 2014). In fact, the allele frequency of CYP2D6 does not only vary across global populations, but also within continental groups, with pronounced differences even among African lineages, including between native African populations and individuals of African descent in the American continent (Pettie et al., 2024). Similarly, for clopidogrel, a widely used anticoagulant, common variants in the CYP2C19 gene among African individuals can reduce drug metabolism efficiency, thereby elevating the risk of cardiovascular ischemic events and mortality in comparison to non-African individuals (Tata et al., 2020; Asiimwe et al., 2024; Shuldiner et al., 2009). To further highlight this disparity, a study of heart failure (HF) in adults by Smith et al. (2010) found that the genome-wide significant SNP rs10519210, identified in individuals of European ancestry, was not associated with HF in those of African ancestry (Smith et al., 2010). Lastly, another study by the Consotium of Minority Population GWAS of Stroke (COMPASS), demonstrated how stroke disproportionately affects people of African ancestry, who face nearly twice the incidence and almost 3-fold higher mortality than European Americans. COMPASS addressed this gap by analyzing >22,000 African-ancestry individuals, identifying a genome-wide significant variant near HNF1A and validating most loci across independent cohorts, highlighting both shared and potentially novel stroke-risk regions (Keene et al., 2020). All of these differences, and others like it, are necessary to acknowledge for accurate identification of risk factors for various treatments. Due to the reliance of genetic medicine on large-scale databases to effectively and accurately identify risk-increasing variants, it is vital that more participants of non-European descent are included in genetic studies (Smith et al., 2016).
Clearly, statistics are worrisome, but the concern disproportionately weighs on the consciousness of those whose current efforts fail to properly serve. In the United States, for instance, African-Americans are more likely to experience uncertainty regarding personalized medicine and genetic testing outcomes compared to European-Americans (Singer et al., 2004; Suther and Kiros, 2009; Zimmerman et al., 2006; Bloss et al., 2010; Diaz et al., 2014). To truly fulfill the promise of precision medicine, clinical applications should be able to treat everyone, regardless of their ethnicity. On the contrary, failing to address the existing inequality of representation in genetic data will likely worsen already-present disparities. With the growing rates of immigration, the world is becoming increasingly transient, and populations are becoming less stratified. It is more important than ever to circulate cross-ancestry genomic data and improve clinical outcomes to maximize the potential of precision medicine. All in all, these findings stress the importance of diversifying genomic data sets to achieve a robust interpretation of genome-wide variation when generalizing findings and observations across ancestral populations.
Diversification of GWAS with African ancestry data
The imperative to increase diversity in genomic research extends beyond addressing health disparities in African populations as it also presents a strategic opportunity to enhance the resolution and generalizability of genomic studies worldwide. The current state of research in CVDs is heavily skewed towards European ancestries (Figures 3A,B). The number of publications on CVD GWAS on individuals of European ancestry is significantly higher than African (14-fold) and Hispanic (25-fold) ancestries. For PRS research, the magnitudes are 10-fold for African ancestry and 17-fold for Hispanic ancestry. Owing to their deep evolutionary history, African populations possess the most genetically diverse and ancient lineages among all human groups (Sibomana, 2024; Gomez et al., 2014). As such, greater inclusion of African ancestry individuals in genomic datasets is crucial for enriching the global catalog of human genetic variation. A striking example of this potential emerged in 2020 through the H3Africa initiative, which identified over 3.4 million genetic variants from whole-genome sequencing of just 426 individuals of African descent (Pereira et al., 2021; Choudhury et al., 2020). Despite this, African ancestry individuals represented only 2.4% of GWAS participants at the time yet contributed to 7% of disease-related genetic associations, underscoring their disproportionate value in variant discovery (Pereira et al., 2021). For some types of CVDs, there is a lack of genomic research, which compounds how our understanding is not only limited, but also heavily skewed towards European ancestries (Figure 3C). More recently, the inclusion of African ancestry populations has led to the identification of novel variants associated with a range of cardiovascular traits and diseases, including blood pressure regulation (Udosen et al., 2024), rheumatic heart disease (Machipisa et al., 2021), and cardiometabolic conditions (Choudhury et al., 2022), further highlighting the global scientific and clinical gains of diversifying genomics research.
Figure 3. Racial disparities in GWAS and PRS research. (A) Number of GWAS (left) and PRS (right) publications registered in PubMed. The X-axis contains the terms used for the command-line-based query. The full list of publications and search terms is in Supplementary Material 4. (B) Venn diagram of PubMed IDs extracted from (A) showing GWAS (left) and PRS (right) publications of cross-ancestry analysis. (C) Number of GWAS (left) and PRS (right) publications on specific CVDs.
The power of incorporating African ancestries is not just limited to increasing the diversity of datasets, but it’s also evident in the active improvement on the resolution and translational value of genetic studies for worldwide populations. As highlighted earlier, shorter LD blocks and greater haplotype diversity, characteristic of African genomes, offer a powerful advantage for fine-mapping disease-associated loci in GWASs (Kamp et al., 2024). This genomic architecture enables researchers to more precisely localize causal variants by minimizing the confounding effects of extended LD that often obscure association signals in European datasets. In a multi-ancestry GWAS of coronary artery calcification (CAC), the inclusion of African ancestry participants facilitated the discovery of eight novel loci and improved the resolution of known signals (Kavousi et al., 2023).
Barriers to inclusion, ongoing initiatives, and solutions
The underrepresentation of individuals of African ancestry in genomics studies is not merely a statistical issue, but a reflection of deep-rooted challenges that hinder the development of genomic capacity across the African continent (Table 1). One of the most pressing barriers is inadequate funding in Africa, which represents the primary source for African ancestry data. Although sequencing costs have declined significantly since the Human Genome Project, many African researchers still cannot afford these services in regions where most graduate research is self-funded and economic hardship is widespread (Omotoso et al., 2022). As a result, the necessary research infrastructure, such as next-generation sequencing (NGS) and genotyping technologies, remains largely inaccessible across much of Africa (Bentley et al., 2020; Omotoso et al., 2022; Onywera et al., 2024; Olono et al., 2024). This financial limitation also contributes to a shortage of trained medical genetics professionals, further emphasizing the scarcity of local expertise (Onywera et al., 2024). Without adequate personnel, it becomes challenging to build a healthcare system that can support genomics-based research and clinical care (Kamp et al., 2021). Educational barriers add another layer to the problem since limited public awareness about the benefits of genomics contributes to low participation in research efforts and slows the integration of African ancestry data into broader genomic datasets (Kamp et al., 2021; Kamga et al., 2024). Ethical and cultural considerations present additional challenges. Concerns about informed consent, data ownership, and privacy must be addressed thoughtfully and transparently. Historical exploitation in medical research involving African populations has led to widespread mistrust and skepticism toward current initiatives, which continues to undermine public engagement (Kamga et al., 2024; Jackson et al., 2023; Amoakoh-Coleman et al., 2023).
Table 1. Summary of issues behind racial disparities in cardiovascular genomics and strategies to address them.
Overcoming barriers to African representation in genomic research is essential for advancing global equity and scientific innovation. In recent years, several initiatives have emerged to promote inclusion, build research capacity, and embed ethical practices. The H3Africa initiative, launched in 2012, recognizes Africa’s vast genetic diversity as crucial to understanding human biology and addressing health disparities (Lumaka et al., 2022). It supports African-led projects, invests in sustainable infrastructure, and offers training in areas such as bioinformatics, grant writing, and ethical governance, while fostering locally informed and equitable research practices (Lumaka et al., 2022; Bentley et al., 2019). The 10-year H3Africa program, although coming to an end in 2022, supported three major biorepositories in Uganda, Nigeria, and South Africa, coordinating genomic and biospecimen collections across 30 African countries and maintaining a catalog that, as of June 2025, includes 19 studies, 37 datasets, over 23,000 biospecimens, and ∼120,000 participants (https://catalog.h3africa.org) (Abimiku et al., 2017). Its initiatives span a wide range of diseases, including cardiovascular, neurological, immune, reproductive, cancer, and infectious conditions, and have produced landmark resources such as the whole-genome sequencing of 426 individuals from 50 ethnolinguistic groups, which identified more than 3 million previously undescribed variants and 62 loci under strong selection (Choudhury et al., 2020). Building on this work, the planned 3 Million African Genomes (3MAG) project aims to generate a representative pan-African reference genome and a continent-wide clinical biobank, scaling efforts comparable to the United Kingdom Biobank’s 500,000-genome sequencing initiative (Wonkam, 2021). Similarly, the Harnessing Data Science for Health Discovery and Innovation in Africa (DS-I Africa) program, funded by the NIH Common Fund, also builds on the infrastructure and data-sharing foundations established by H3Africa to expand data science–driven health research across the African continent. It strengthens local capacity by supporting interdisciplinary research hubs, as well as open data platforms that enable African-led solutions to public health challenges (Adebamowo et al., 2023).
Similarly, the All of Us Research Program, led by the U.S. National Institutes of Health, enhances African ancestry representation by collecting diverse whole-genome sequences, thereby improving the accuracy of PRSs for underrepresented populations (Jurado Vélez et al., 2025; Gouveia et al., 2025; Tsuo et al., 2024). Likewise, the African Genomic Medicine Portal (AGMP) curates African-specific data to improve population sampling and geographic granularity in genomic medicine research. The Africa Pathogen Genomics Initiative (PGI), established by the Africa CDC in 2020, strengthens lab and bioinformatics capacity through an African-owned data platform (Othman et al., 2022). The Together for CHANGE (T4C) initiative, launched in 2023, promotes inclusive innovation through community engagement, public-private partnerships, and equitable healthcare development (Hildreth and Shanker, 2024). Finally, as briefly mentioned earlier, efforts like COMPASS further expand representation by coordinating stroke GWAS to provide ancestry-informed insights into cerebrovascular risk (Keene et al., 2020). Collectively, these efforts encourage investigators working with African genomic data to share their datasets through public repositories while also educating research participants about the importance of sharing aggregate frequency data and increasing funding to support large-scale generation of African genomic data.
Conclusion
Equitable representation in cardiovascular genomics is not only a matter of scientific rigor but also of global health justice. African populations, while bearing a disproportionate burden of CVD and possessing unparalleled genetic diversity, remain vastly underrepresented in GWASs and related studies. This lack of inclusion limits the clinical validity of tools like PRSs and contributes to persistent disparities in disease prediction, prevention, and treatment. Enhancing diversity in genomics, particularly through the inclusion of African ancestry data, holds immense potential to improve the resolution and generalizability of genetic discoveries for all populations. Addressing the structural, economic, and ethical barriers to African participation is critical to achieving these goals. Initiatives like H3Africa, AGMP, and Together for CHANGE represent important strides toward building capacity, fostering ethical engagement, and promoting sustainable, African-led research. Moving forward, a concerted global effort is needed to support infrastructure development, facilitate data sharing, and educate communities on the value of genomics. In the US, many of these efforts and solutions are only possible through the continued support of federal funding agencies (NIH, NSF), as well as other global charitable foundations, such as Wellcome Trust. Only by closing the diversity gap can we realize the full promise of precision medicine and ensure that its benefits extend to all people, regardless of ancestry.
Author contributions
DP-M: Writing – review and editing, Writing – original draft, Conceptualization. ML: Writing – original draft, Writing – review and editing. AR-M: Data curation, Writing – review and editing, Software, Formal Analysis. OE-B: Data curation, Writing – review and editing. MÁ-C: Writing – review and editing, Data curation. SH-Q: Data curation, Writing – review and editing. GR-S: Data curation, Writing – review and editing. RV-N: Data curation, Writing – review and editing. EP-M: Writing – review and editing, Supervision, Writing – original draft, Software, Investigation, Data curation, Project administration, Visualization, Conceptualization.
Funding
The author(s) declared that financial support was received for this work and/or its publication. DAPM was funded by NSF GRFP (DGE-2146755). OLEB was funded by NSF PR-LSAMP Bridge to the Doctorate Fellowship Grant (2306079). EGPM was funded by NIH RISE Fellowship (5R25GM061151–20).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was used in the creation of this manuscript. The use of artificial intelligence models was used in this work solely for the purpose of revising grammar.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1647176/full#supplementary-material
References
Abdellaoui, A., Yengo, L., Verweij, K. J. H., and Visscher, P. M. (2023). 15 years of GWAS discovery: realizing the promise. Nat. Genet. Prepr. 110, 179–194. doi:10.1016/j.ajhg.2022.12.011
Abimiku, A., Mayne, E. S., Joloba, M., Beiswanger, C. M., Troyer, J., Wideroff, L., et al. (2017). H3Africa biorepository program: supporting genomics research on African populations by sharing high-quality biospecimens. Biopreserv. Biobank 15, 99–102. doi:10.1089/bio.2017.0005
Adam, Y., Sadeeq, S., Kumuthini, J., Ajayi, O., Wells, G., Solomon, R., et al. (2022). Polygenic risk score in African populations: progress and challenges. F1000Res 11, 175. doi:10.12688/f1000research.76218.1
Ade, C., Scheuermann, B., Brown, A., Colburn, T., Hakeem, H., and Chow, C. H. (2024). External validation of the American heart association PREVENT cardiovascular disease risk equations. JAMA Netw. Open 7, e2438311. doi:10.1001/jamanetworkopen.2024.38311
Adebamowo, C. A., Callier, S., Akintola, S., Maduka, O., Jegede, A., Arima, C., et al. (2023). The promise of data science for health research in Africa. Nat. Commun. Prepr. 14 (1), 6084. doi:10.1038/s41467-023-41809-2
Amini, M., Zayeri, F., and Salehi, M. (2021). Trend analysis of cardiovascular disease mortality, incidence, and mortality-to-incidence ratio: results from global burden of disease study 2017. BMC Public Health 21, 401. doi:10.1186/s12889-021-10429-0
Amoakoh-Coleman, M., Vieira, D., and Abugri, J. (2023). Ethical considerations for biobanking and use of genomics data in Africa: a narrative review. BMC Med. Ethics 24, 108. doi:10.1186/s12910-023-00985-y
Asiimwe, I. G., Blockman, M., Cavallari, L. H., Cohen, K., Cupido, C., Dandara, C., et al. (2024). Meta-analysis of genome-wide association studies of stable warfarin dose in patients of African ancestry. Blood Adv. 8, 5248–5261. doi:10.1182/bloodadvances.2024014227
Baharian, S., Barakatt, M., Gignoux, C. R., Shringarpure, S., Errington, J., Blot, W. J., et al. (2016). The great migration and African-American genomic diversity. PLoS Genet. 12, e1006059. doi:10.1371/journal.pgen.1006059
Barad, A., Clark, A. G., Pressman, E. K., and O’Brien, K. O. (2024). Associations between genetically predicted iron status and cardiovascular disease risk: a mendelian randomization study. J. Am. Heart Assoc. 13, e034991. doi:10.1161/JAHA.124.034991
Bentley, A. R., Callier, S., and Rotimi, C. (2019). The emergence of genomic research in Africa and new frameworks for equity in biomedical research. Ethn. Dis. 29, 179–186. doi:10.18865/ed.29.S1.179
Bentley, A. R., Callier, S. L., and Rotimi, C. N. (2020). Evaluating the promise of inclusion of African ancestry populations in genomics. NPJ Genom. Med. 5, 5. doi:10.1038/s41525-019-0111-x
Bloss, C. S., Ornowski, L., Silver, E., Cargill, M., Vanier, V., Schork, N. J., et al. (2010). Consumer perceptions of direct-to-consumer personalized genomic risk assessments. Genet. Med. 12, 556–566. doi:10.1097/GIM.0b013e3181eb51c6
Bose, A., Platt, D. E., Kartoun, U., Ng, K., and Parida, L. (2023). Role of genetics in capturing racial disparities in cardiovascular disease. medRxiv Prepr. doi:10.1101/2023.02.10.23285769
Brewster, L. M., Van Montfrans, G. A., and Kleijnen, J. (2004). Systematic review: antihypertensive drug therapy in black patients. Ann. Intern. Med. Prepr. 141, 614–627. doi:10.7326/0003-4819-141-8-200410190-00009
Buniello, A., Macarthur, J. A. L., Cerezo, M., Harris, L. W., Hayhurst, J., Malangone, C., et al. (2019). The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012. doi:10.1093/nar/gky1120
Busby, G. B. J., Band, G., Le, Q. S., Jallow, M., Bougama, E., Mangano, V. D., et al. (2016). Admixture into and within Sub-Saharan Africa. Elife 5, e15266. doi:10.7554/elife.15266
Choudhury, A., Aron, S., Botigué, L. R., Sengupta, D., Botha, G., Bensellak, T., et al. (2020). High-depth African genomes inform human migration and health. Nature 586, 741–748. doi:10.1038/s41586-020-2859-7
Choudhury, A., Brandenburg, J. T., Chikowore, T., Sengupta, D., Boua, P. R., Crowther, N. J., et al. (2022). Author correction: meta-analysis of sub-saharan African studies provides insights into genetic architecture of lipid traits. Nat. Commun. 13 (1), 2578. doi:10.1038/s41467-022-30098-w
Clarke, S. L., Assimes, T. L., and Tcheandjieu, C. (2021). The propagation of racial disparities in cardiovascular genomics research. Circ. Genom. Precis. Med. 14 (5) e003178. doi:10.1161/CIRCGEN.121.003178
Cluett, C., McDermott, M. M. G., Guralnik, J., Ferrucci, L., Bandinelli, S., Miljkovic, I., et al. (2009). The 9p21 myocardial infarction risk allele increases risk of peripheral artery disease in older people. Circ. Cardiovasc. Genet. 2, 347–353. doi:10.1161/CIRCGENETICS.108.825935
Corpas, M., Pius, M., Poburennaya, M., Guio, H., Dwek, M., Nagaraj, S., et al. (2025). Bridging genomics’ greatest challenge: the diversity gap. Cell Genom. 5 (1) 100724. doi:10.1016/j.xgen.2024.100724
Cottier, F. (2024). Projecting future migration with Bayesian hierarchical gravity models of migration: an application to Africa. Front. Clim. 6, 1384295. doi:10.3389/fclim.2024.1384295
Dang, M. T. N., Hambleton, J., and Kayser, S. R. (2005). The influence of ethnicity on warfarin dosage requirement. Ann. Pharmacother. 39, 1008–1012. doi:10.1345/aph.1E566
DeFilippis, A. P., Young, R., Carrubba, C. J., McEvoy, J. W., Budoff, M. J., Blumenthal, R. S., et al. (2015). An analysis of calibration and discrimination among multiple cardiovascular risk scores in a modern multiethnic cohort. Ann. Intern. Med. 162, 266–275. doi:10.7326/M14-1281
Diaz, V. A., Mainous, A. G., Gavin, J. K., and Wilson, D. (2014). Racial differences in attitudes toward personalized medicine. Public Health Genomics 17, 1–6. doi:10.1159/000354785
Dikilitas, O., Schaid, D. J., Tcheandjieu, C., Clarke, S. L., Assimes, T. L., and Kullo, I. J. (2022). Use of polygenic risk scores for coronary heart disease in ancestrally diverse populations. Curr. Cardiol. Rep. Prepr. 24, 1169–1177. doi:10.1007/s11886-022-01734-0
Ding, Y., Hou, K., Xu, Z., Pimplaskar, A., Petter, E., Boulier, K., et al. (2023). Polygenic scoring accuracy varies across the genetic ancestry continuum. Nature 618, 774–781. doi:10.1038/s41586-023-06079-4
Doumatey, A. P., Bentley, A. R., Akinyemi, R., Olanrewaju, T. O., Adeyemo, A., and Rotimi, C. (2023). Genes, environment, and African ancestry in cardiometabolic disorders. Trends Endocrinol. Metab. Prepr. 34, 601–621. doi:10.1016/j.tem.2023.07.007
Duncan, L., Shen, H., Gelaye, B., Meijsen, J., Ressler, K., Feldman, M., et al. (2019). Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 10, 3328. doi:10.1038/s41467-019-11112-0
Fahed, A. C., Aragam, K. G., Hindy, G., Chen, Y. D. I., Chaudhary, K., Dobbyn, A., et al. (2021). Transethnic transferability of a genome-wide polygenic score for coronary artery disease. Circ. Genom. Precis. Med. 14, e003092. doi:10.1161/CIRCGEN.120.003092
Fatumo, S., Chikowore, T., Choudhury, A., Ayub, M., Martin, A. R., and Kuchenbaecker, K. (2022). A roadmap to increase diversity in genomic studies. Nat. Med. Prepr. doi:10.1038/s41591-021-01672-4
Fernández-Rhodes, L. (2023).Beyond borders: a commentary on the benefit of promoting immigrant populations in genome-wide association studies. HGG Adv. 4 (3) 100205. doi:10.1016/j.xhgg.2023.100205
Gaidai, O., Cao, Y., and Loginov, S. (2023). Global cardiovascular diseases death rate prediction. Curr. Probl. Cardiol. Prepr. 48, 101622. doi:10.1016/j.cpcardiol.2023.101622
Gilbertson, E. N., Brand, C. M., McArthur, E., Rinker, D. C., Kuang, S., Pollard, K. S., et al. (2024). Machine learning reveals the diversity of human 3D chromatin contact patterns. Mol. Biol. Evol. 41, msae209. doi:10.1093/molbev/msae209
Gomez, F., Hirbo, J., and Tishkoff, S. A. (2014). Genetic variation and adaptation in Africa: implications for human evolution and disease. Cold Spring Harb. Perspect. Biol. 6, a008524. doi:10.1101/cshperspect.a008524
Gouveia, M. H., Meeks, K. A. C., Borda, V., Leal, T. P., Kehdy, F. S. G., Mogire, R., et al. (2025). Subcontinental genetic diversity in the all of Us research program: implications for biomedical research. bioRxiv Prepr., 2025.01.09.632250. doi:10.1101/2025.01.09.632250
GTEx Consortium, , Aguet, F., Barbeira, A. N., Bonazzola, R., Brown, A., Castel, S. E., et al. (2020). The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369 (6509), 1318–1330. doi:10.1126/science.aaz1776
Gurdasani, D., Carstensen, T., Tekola-Ayele, F., Pagani, L., Tachmazidou, I., Hatzikotoulas, K., et al. (2015). The African genome variation project shapes medical genetics in Africa. Nature 517, 327–332. doi:10.1038/nature13997
Hamadeh, I. S., Langaee, T. Y., Dwivedi, R., Garcia, S., Burkley, B. M., Skaar, T. C., et al. (2014). Impact of CYP2D6 polymorphisms on clinical efficacy and tolerability of metoprolol tartrate. Clin. Pharmacol. Ther. 96, 175–181. doi:10.1038/clpt.2014.62
Helgadottir, A., Thorleifsson, G., Magnusson, K. P., Grétarsdottir, S., Steinthorsdottir, V., Manolescu, A., et al. (2008). The same sequence variant on 9p21 associates with myocardial infarction, abdominal aortic aneurysm and intracranial aneurysm. Nat. Genet. 40, 217–224. doi:10.1038/ng.72
Hildreth, J. E. K., and Shanker, A. (2024). The Diaspora human genomics institute launches the together for change initiative: a transformative, historic partnership to ensure health equity in a time of unprecedented technological advancements. J. Health Care Poor Underserved Prepr. 35. doi:10.1353/hpu.2024.a919801
Hippisley-Cox, J., Coupland, C. A. C., Bafadhel, M., Russell, R. E. K., Sheikh, A., Brindle, P., et al. (2024). Development and validation of a new algorithm for improved cardiovascular risk prediction. Nat. Med. 30, 1440–1447. doi:10.1038/s41591-024-02905-y
Huang, H., Verma, J., Mok, V., Bharadwaj, H. R., Alrawashdeh, M. M., Aratikatla, A., et al. (2024). Exploring health care disparities in genetic testing and research for hereditary cardiomyopathy: current state and future perspectives. Glob. Med. Genet. 11, 36–47. doi:10.1055/s-0044-1779469
Husquin, L. T., Rotival, M., Fagny, M., Quach, H., Zidane, N., McEwen, L. M., et al. (2018). Exploring the genetic basis of human population differences in DNA methylation and their causal impact on immune gene regulation 06 biological sciences 0604 genetics. Genome Biol. 19, 222. doi:10.1186/s13059-018-1601-3
Jackson, F., Clinton, C., and Caldwell, J. (2023). Core issues, case studies, and the need for expanded legacy African American genomics. Front. Genet. 14, 843209. doi:10.3389/fgene.2023.843209
Jain, V., Dabbs-Brown, A., Liu, C., Hui, Q., Mehta, A., Wilson, P. W. F., et al. (2024). Genome-wide European ancestry study identifies coronary artery disease–associated loci through gene–sex hormone interaction. J. Am. Heart Assoc. 13, e034132. doi:10.1161/JAHA.123.034132
Jordan, E., Kinnamon, D. D., Haas, G. J., Hofmeyer, M., Kransdorf, E., Ewald, G. A., et al. (2023). Genetic architecture of dilated cardiomyopathy in individuals of African and European ancestry. JAMA 330, 432–441. doi:10.1001/jama.2023.11970
Jurado Vélez, J., Anderson, N., Datcher, I., Foster, C., Jackson, P., and Hidalgo, B. (2025). Striving towards equity in cardiovascular genomics research. Curr Atheroscler. Rep. 27 (1) 34. doi:10.1007/s11883-025-01277-z
Kachuri, L., Chatterjee, N., Hirbo, J., Schaid, D. J., Martin, I., Kullo, I. J., et al. (2024). Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. Prepr. 25, 8–25. doi:10.1038/s41576-023-00637-2
Kamga, K. K., Fonkam, M. P., Nguefack, S., and Wonkam, A. (2024). Navigating the genetic frontier for the integration of genetic services into African healthcare systems: a scoping review. Res. Sq. doi:10.21203/rs.3.rs-3978686/v1
Kamp, M., Krause, A., and Ramsay, M. (2021). Has translational genomics come of age in Africa? Preprint. Hum. Mol. Genet. 30, R164–R173. doi:10.1093/hmg/ddab180
Kamp, M., Pain, O., Lewis, C. M., and Ramsay, M. (2024). Ancestry-aligned polygenic scores combined with conventional risk factors improve prediction of cardiometabolic outcomes in African populations. Genome Med. 16, 106. doi:10.1186/s13073-024-01377-6
Kaufman, J. S., Dolman, L., Rushani, D., and Cooper, R. S. (2015). The contribution of genomic research to explaining racial disparities in cardiovascular disease: a systematic review. Am. J. Epidemiol. 181, 464–472. doi:10.1093/aje/kwu319
Kavousi, M., Bos, M. M., Barnes, H. J., Lino Cardenas, C. L., Wong, D., Lu, H., et al. (2023). Multi-ancestry genome-wide study identifies effector genes and druggable pathways for coronary artery calcification. Nat. Genet. 55, 1651–1664. doi:10.1038/s41588-023-01518-4
Keates, A. K., Mocumbi, A. O., Ntsekhe, M., Sliwa, K., and Stewart, S. (2017). Cardiovascular disease in Africa: epidemiological profile and challenges. Nat. Rev. Cardiol. Prepr. 14. 273–293. doi:10.1038/nrcardio.2017.19
Keene, K. L., Hyacinth, H. I., Bis, J. C., Kittner, S. J., Mitchell, B. D., Cheng, Y. C., et al. (2020). Genome-wide association study meta-analysis of stroke in 22 000 individuals of African descent identifies novel associations with stroke. Stroke 51, 2454–2463. doi:10.1161/STROKEAHA.120.029123
Khan, S. S., Matsushita, K., Sang, Y., Ballew, S. H., Grams, M. E., Surapaneni, A., et al. (2024). Development and validation of the American heart association’s PREVENT equations. Circulation 149, 430–449. doi:10.1161/CIRCULATIONAHA.123.067626
Kim, M. S., Patel, K. P., Teng, A. K., Berens, A. J., and Lachance, J. (2018). Genetic disease risks can be misestimated across global populations 06 biological sciences 0604 genetics. Genome Biol. 19, 179. doi:10.1186/s13059-018-1561-7
Knoppers, B. M., and Beauvais, M. J. S. (2021). Three decades of genetic privacy: a metaphoric journey. Hum. Mol. Genet. 30 (R2), R156–R160. doi:10.1093/hmg/ddab164
Krauss, R. M., Mangravite, L. M., Smith, J. D., Medina, M. W., Wang, D., Guo, X., et al. (2008). Variation in the 3-hydroxyl-3-methylglutaryl coenzyme A reductase gene is associated with racial differences in low-density lipoprotein cholesterol response to simvastatin treatment. Circulation 117, 1537–1544. doi:10.1161/CIRCULATIONAHA.107.708388
Kullo, I. J., Conomos, M. P., Nelson, S. C., Adebamowo, S. N., Choudhury, A., Conti, D., et al. (2024). The PRIMED consortium: reducing disparities in polygenic risk assessment. Am. J. Hum. Genet. Prepr. 111, 2594–2606. doi:10.1016/j.ajhg.2024.10.010
Kurniansyah, N., Goodman, M. O., Khan, A. T., Wang, J., Feofanova, E., Bis, J. C., et al. (2023). Evaluating the use of blood pressure polygenic risk scores across race/ethnic background groups. Nat. Commun. 14, 3202. doi:10.1038/s41467-023-38990-9
Landry, L. G., and Rehm, H. L. (2018). Association of racial/ethnic categories with the ability of genetic tests to detect a cause of cardiomyopathy. JAMA Cardiol. 3, 341–345. doi:10.1001/jamacardio.2017.5333
Lefebvre, J., Poirier, L., Poirier, P., Turgeon, J., and Lacourciere, Y. (2007). The influence of CYP2D6 phenotype on the clinical response of nebivolol in patients with essential hypertension. Br. J. Clin. Pharmacol. 63, 575–582. doi:10.1111/j.1365-2125.2006.02796.x
Li, Y., Cao, G. Y., Jing, W. Z., Liu, J., and Liu, M. (2023). Global trends and regional differences in incidence and mortality of cardiovascular disease, 1990−2019: findings from 2019 global burden of disease study. Eur. J. Prev. Cardiol. 30, 276–286. doi:10.1093/eurjpc/zwac285
Lonsdale, J., Thomas, J., Salvatore, M., Phillips, R., Lo, E., Shad, S., et al. (2013). The genotype-tissue expression (GTEx) project. Nat. Genet. 45, 580–585. doi:10.1038/ng.2653
Lumaka, A., Carstens, N., Devriendt, K., Krause, A., Kulohoma, B., Kumuthini, J., et al. (2022). Increasing African genomic data generation and sharing to resolve rare and undiagnosed diseases in Africa: a call-to-action by the H3Africa rare diseases working group. Orphanet J. Rare Dis. 17, 230. doi:10.1186/s13023-022-02391-w
MacHado Reyes, D., Bose, A., Karavani, E., and Parida, L. (2023). “FairPRS: adjusting for admixed populations in polygenic risk scores using invariant risk minimization,” in Pacific symposium on biocomputing. doi:10.1142/9789811270611_0019
Machipisa, T., Chong, M., Muhamed, B., Chishala, C., Shaboodien, G., Pandie, S., et al. (2021). Association of novel locus with rheumatic heart disease in black African individuals: findings from the RHDGen study. JAMA Cardiol. 6, 1000–1011. doi:10.1001/jamacardio.2021.1627
Manrai, A. K., Funke, B. H., Rehm, H. L., Olesen, M. S., Maron, B. A., Szolovits, P., et al. (2016). Genetic misdiagnoses and the potential for health disparities. N. Engl. J. Med. 375, 655–665. doi:10.1056/nejmsa1507092
Maples, B. K., Gravel, S., Kenny, E. E., and Bustamante, C. D. (2013). RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference. Am. J. Hum. Genet. 93 (2), 278–288. doi:10.1016/j.ajhg.2013.06.020
Martin, A. R., Gignoux, C. R., Walters, R. K., Wojcik, G. L., Neale, B. M., Gravel, S., et al. (2017). Human demographic history impacts genetic risk prediction across diverse populations. Am. J. Hum. Genet. 100, 635–649. doi:10.1016/j.ajhg.2017.03.004
Martin, A. R., Kanai, M., Kamatani, Y., Okada, Y., Neale, B. M., and Daly, M. J. (2019). Clinical use of current polygenic risk scores May exacerbate health disparities. Nat. Genet. 51, 584–591. doi:10.1038/s41588-019-0379-x
Matarin, M., Brown, W. M., Singleton, A., Hardy, J. A., and Meschia, J. F., and ISGS investigators (2008). Whole genome analyses suggest ischemic stroke and heart disease share an association with polymorphisms on chromosome 9p21. Stroke 39, 1586–1589. doi:10.1161/STROKEAHA.107.502963
Mehta, R. H., Cox, M., Smith, E. E., Xian, Y., Bhatt, D. L., Fonarow, G. C., et al. (2014). Race/ethnic differences in the risk of hemorrhagic complications among patients with ischemic stroke receiving thrombolytic therapy. Stroke 45, 2263–2269. doi:10.1161/STROKEAHA.114.005019
Mills, M. C., and Rahal, C. (2020). The GWAS diversity monitor tracks diversity by disease in real time. Preprint. doi:10.1038/s41588-020-0580-y
Minja, N. M., Nakagaayi, D., Aliku, T., Zhang, W., Ssinabulya, I., Nabaale, J., et al. (2022). Cardiovascular diseases in Africa in the twenty-first century: gaps and priorities going forward. Front. Cardiovasc. Med. doi:10.3389/fcvm.2022.1008335
Mostafavi, H., Harpak, A., Agarwal, I., Conley, D., Pritchard, J. K., and Przeworski, M. (2020). Variable prediction accuracy of polygenic scores within an ancestry group. Elife 9, e48376. doi:10.7554/eLife.48376
Mueller, V., Thurlow, J., Rosenbach, G., and Masias, I. (2019). “Africa’s rural youth in the global context,” in Youth and jobs in rural Africa. doi:10.1093/oso/9780198848059.003.0001
Myers, T. A., Chanock, S. J., and Machiela, M. J. (2020). LDlinkR: an R package for rapidly calculating linkage disequilibrium statistics in diverse populations. Front. Genet. 11, 157. doi:10.3389/fgene.2020.00157
Ndong Sima, C. A. A., Step, K., Swart, Y., Schurz, H., Uren, C., and Möller, M. (2024). Methodologies underpinning polygenic risk scores estimation: a comprehensive overview. Hum. Genet. 143, 1265–1280. doi:10.1007/s00439-024-02710-0
Need, A. C., and Goldstein, D. B. (2009). Next generation disparities in human genomics: concerns and remedies. Trends Genet. 25, 489–494. doi:10.1016/j.tig.2009.09.012
Ojewunmi, O. O., and Fatumo, S. (2025). Driving global health equity and precision medicine through African genomic data. Hum. Mol. Genet., ddaf025. doi:10.1093/hmg/ddaf025
Olono, A., Mitesser, V., Happi, A., and Happi, C. (2024). Building genomic capacity for precision health in Africa. Prepr. A. T. Nat. Res. 30, 1856–1864. doi:10.1038/s41591-024-03081-9
Omotoso, O. E., Teibo, J. O., Atiba, F. A., Oladimeji, T., Adebesin, A. O., and Babalghith, A. O. (2022). Bridging the genomic data gap in Africa: implications for global disease burdens. Glob. Health Prepr. 18, 103. doi:10.1186/s12992-022-00898-2
Oni-Orisan, A., Mavura, Y., Banda, Y., Thornton, T. A., and Sebro, R. (2021). Embracing genetic diversity to improve black health. N. Engl. J. Med. 384, 1163–1167. doi:10.1056/nejmms2031080
Onywera, H., Ondoa, P., Nfii, F., Ogwell, A., Kebede, Y., Christoffels, A., et al. (2024). Boosting pathogen genomics and bioinformatics workforce in Africa. Lancet Infect. Dis. Prepr. 24, e106–e112. doi:10.1016/S1473-3099(23)00394-8
Opare-Addo, P. A., Sarfo, F. S., Aikins, M., Bediako, S. A., and Ovbiagele, B. (2024). Epigenetics as a target to mitigate excess stroke risk in people of African ancestry: a scoping review. J. Stroke Cerebrovasc. Dis. 33, 107585. doi:10.1016/j.jstrokecerebrovasdis.2024.107585
Othman, H., Zass, L., da Rocha, J. E. B., Radouani, F., Samtal, C., Benamri, I., et al. (2022). African genomic medicine portal: a web portal for biomedical applications. J. Pers. Med. 12, 265. doi:10.3390/jpm12020265
O’Sullivan, J. W., Raghavan, S., Marquez-Luna, C., Luzum, J. A., Damrauer, S. M., Ashley, E. A., et al. (2022). Polygenic risk scores for cardiovascular disease: a scientific statement from the American heart association. Prepr. A. T. Lippincott Williams Wilkins 146, e93–e118. doi:10.1161/CIR.0000000000001077
Patel, R. A., Musharoff, S. A., Spence, J. P., Pimentel, H., Tcheandjieu, C., Mostafavi, H., et al. (2022). Genetic interactions drive heterogeneity in causal variant effect sizes for gene expression and complex traits. Am. J. Hum. Genet. 109, 1286–1297. doi:10.1016/j.ajhg.2022.05.014
Peña-Martínez, E. G., Pomales-Matos, D. A., Rivera-Madera, A., Messon-Bird, J. L., Medina-Feliciano, J. G., Sanabria-Alberto, L., et al. (2023). Prioritizing cardiovascular disease-associated variants altering NKX2-5 and TBX5 binding through an integrative computational approach. J. Biol. Chem. 299, 105423. doi:10.1016/j.jbc.2023.105423
Peña-Martínez, E. G., Messon-Bird, J. L., Rodríguez-Ríos, J. M., Velázquez-Roig, R., Pomales-Matos, D. A., Rivera-Madera, A., et al. (2025). Cardiovascular disease-associated non-coding variants disrupt GATA4-DNA binding and regulatory functions. Hum. Genet. Genomics Adv. 6, 100415. doi:10.1016/j.xhgg.2025.100415
Pereira, L., Mutesa, L., Tindana, P., and Ramsay, M. (2021). African genetic diversity and adaptation inform a precision medicine agenda. Nat. Rev. Genet. Prepr. 22, 284–306. doi:10.1038/s41576-020-00306-8
Pettie, K. P., Mumbach, M., Lea, A. J., Ayroles, J., Chang, H. Y., Kasowski, M., et al. (2024). Chromatin activity identifies differential gene regulation across human ancestries. Genome Biol. 25, 21. doi:10.1186/s13059-024-03165-2
Popejoy, A. B., and Fullerton, S. M. (2016). Genomics is failing on diversity. Nature 538 (7624), 161–164. doi:10.1038/538161a
Ramsay, M., Crowther, N., Tambo, E., Agongo, G., Baloyi, V., Dikotope, S., et al. (2016). H3Africa AWI-gen collaborative centre: a resource to study the interplay between genomic and environmental risk factors for cardiometabolic diseases in four sub-saharan African countries. Glob. Health Epidemiol. Genom 1, e20. doi:10.1017/gheg.2016.17
Rocheleau, G., Clarke, S. L., Auguste, G., Hasbani, N. R., Morrison, A. C., Heath, A. S., et al. (2024). Rare variant contribution to the heritability of coronary artery disease. Nat. Commun. 15, 8741. doi:10.1038/s41467-024-52939-6
Ruan, Y., Lin, Y. F., Feng, Y. C. A., Chen, C. Y., Lam, M., Guo, Z., et al. (2022). Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573–580. doi:10.1038/s41588-022-01054-7
Sapkota, Y., Liu, Q., Li, N., Bhatt, N. S., Ehrhardt, M. J., Wilson, C. L., et al. (2022). Contribution of genome-wide polygenic score to risk of coronary artery disease in childhood cancer survivors. JACC CardioOncol. 4, 258–267. doi:10.1016/j.jaccao.2022.04.003
Shear, C. L., Franklin, F. A., Stinnett, S., Hurley, D. P., Bradford, R. H., Chremos, A. N., et al. (1992). Expanded clinical evaluation of lovastatin (EXCEL) study results: effect of patient characteristics on lovastatin-induced changes in plasma concentrations of lipids and lipoproteins. Circulation 85, 1293–1303. doi:10.1161/01.CIR.85.4.1293
Shuldiner, A. R., O’Connell, J. R., Bliden, K. P., Gandhi, A., Ryan, K., Horenstein, R. B., et al. (2009). Association of cytochrome P450 2C19 genotype with the antiplatelet effect and clinical efficacy of clopidogrel therapy. JAMA 302, 849–857. doi:10.1001/jama.2009.1232
Sibomana, O. (2024). Genetic diversity landscape in African population: a review of implications for personalized and precision medicine. Pharmgenomics Pers. Med. 17, 487–496. doi:10.2147/PGPM.S485452
Silva, S., Nitsch, D., and Fatumo, S. (2023). Genome-wide association studies on coronary artery disease: a systematic review and implications for populations of different ancestries. PLoS One 18, e0294341. doi:10.1371/journal.pone.0294341
Singer, E., Antonucci, T., and Van Hoewyk, J. (2004). Racial and ethnic variations in knowledge and attitudes about genetic testing. Genet. Test. 8, 31–43. doi:10.1089/109065704323016012
Singh, S., Brandenburg, J. T., Choudhury, A., Gómez-Olivé, F. X., and Ramsay, M. (2021). Systematic review of genomic associations with blood pressure and hypertension in populations with african-ancestry. Front. Genet. Prepr. 12, 699445. doi:10.3389/fgene.2021.699445
Singh, S., Choudhury, A., Hazelhurst, S., Crowther, N. J., Boua, P. R., Sorgho, H., et al. (2023). Genome-wide association study meta-analysis of blood pressure traits and hypertension in sub-saharan African populations: an AWI-gen study. Nat. Commun. 14, 8376. doi:10.1038/s41467-023-44079-0
Smith, N. L., Felix, J. F., Morrison, A. C., Demissie, S., Glazer, N. L., Loehr, L. R., et al. (2010). Association of genome-wide variation with the risk of incident heart failure in adults of European and African ancestry: a prospective meta-analysis from the cohorts for heart and aging research in genomic epidemiology (CHARGE) consortium. Circ. Cardiovasc. Genet. 3, 256–266. doi:10.1161/CIRCGENETICS.109.895763
Smith, C. E., Fullerton, S. M., Dookeran, K. A., Hampel, H., Tin, A., Maruthur, N. M., et al. (2016). Using genetic technologies to reduce, rather than widen, health disparities. Health Aff. 35, 1367–1373. doi:10.1377/hlthaff.2015.1476
Smith, J. L., Tcheandjieu, C., Dikilitas, O., Iyer, K., Miyazawa, K., Hilliard, A., et al. (2024). Multi-ancestry polygenic risk score for coronary heart disease based on an ancestrally diverse genome-wide association study and population-specific optimization. Circ. Genom. Precis. Med. 17, e004272. doi:10.1161/CIRCGEN.123.004272
Sun, Q., Rowland, B. T., Chen, J., Mikhaylova, A. V., Avery, C., Peters, U., et al. (2024). Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI. Nat. Commun. 15, 1016. doi:10.1038/s41467-024-45135-z
Suther, S., and Kiros, G. E. (2009). Barriers to the use of genetic testing: a study of racial and ethnic disparities. Genet. Med. 11, 655–662. doi:10.1097/GIM.0b013e3181ab22aa
Tata, E. B., Ambele, M. A., and Pepper, M. S. (2020). Barriers to implementing clinical pharmacogenetics testing in Sub-Saharan Africa. A Critical Review, Prepr. doi:10.3390/pharmaceutics12090809
Taliun, D., Harris, D. N., Kessler, M. D., Carlson, J., Szpiech, Z. A., Torres, R., et al. (2021). Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590 (7845), 290–299. doi:10.1038/s41586-021-03205-y
Tian, L., Khan, A., Ning, Z., Yuan, K., Zhang, C., Lou, H., et al. (2018). Genome-wide comparison of allele-specific gene expression between African and European populations. Hum. Mol. Genet. 27, 1067–1077. doi:10.1093/hmg/ddy027
Tishkoff, S. A., and Verrelli, B. C. (2003). Patterns of human genetic diversity: implications for human evolutionary history and disease. Annu. Rev. Genomics Hum. Genet. Prepr. 4, 293–340. doi:10.1146/annurev.genom.4.070802.110226
Topriceanu, C. C., Chaturvedi, N., Mathur, R., and Garfield, V. (2024). Validity of European-centric cardiometabolic polygenic scores in multi-ancestry populations. Eur. J. Hum. Genet. 32, 697–707. doi:10.1038/s41431-023-01517-3
Traylor, M., Rutten-Jacobs, L., Curtis, C., Patel, H., Breen, G., Newhouse, S., et al. (2017). Genetics of stroke in a UK African ancestry case-control study: south London ethnicity and stroke study. Neurol. Genet. 3, e142. doi:10.1212/NXG.0000000000000142
Trenkwalder, T., Maj, C., Al-Kassou, B., Debiec, R., Doppler, S. A., Musameh, M. D., et al. (2025). Distinct genetic risk profile in aortic stenosis compared with coronary artery disease. JAMA Cardiol. 10, 145–154. doi:10.1001/jamacardio.2024.3738
Tsuo, K., Shi, Z., Ge, T., Mandla, R., Hou, K., Ding, Y., et al. (2024). All of Us diversity and scale improve polygenic prediction contextually with greatest improvements for under-represented populations. bioRxiv Prepr., 2024.08.06.606846. doi:10.1101/2024.08.06.606846
Udosen, B., Soremekun, O., Kamiza, A., Machipisa, T., Cheickna, C., Omotuyi, O., et al. (2024). Correction to: meta-analysis and multivariate GWAS analyses in 80,950 individuals of African ancestry identify novel variants associated with blood pressure traits. Int. J. Mol. Sci. 24 (3), 2164. doi:10.3390/ijms24032164
United Nations International Migrant Stock. (2024). International Migrant Stock 2024. Available online at: https://www.un.org/development/desa/pd/sites/www.un.org.development.desa.pd/files/undesa_pd_2025_intlmigstock_2024_key_facts_and_figures_advance-unedited.pdf.
Wonkam, A. (2021). Sequence three million genomes across Africa. Nature 590, 209–211. doi:10.1038/d41586-021-00313-7
Wright, J. T., Dunn, J. K., Cutler, J. A., Davis, B. R., Cushman, W. C., Ford, C. E., et al. (2005). Outcomes in hypertensive black and nonblack patients treated with chlorthalidone, amlodipine, and lisinopril. JAMA 293, 1595–1608. doi:10.1001/jama.293.13.1595
Xiang, R., Kelemen, M., Xu, Y., Harris, L. W., Parkinson, H., Inouye, M., et al. (2024). Recent advances in polygenic scores: translation, equitability, methods and FAIR tools. Genome Med. Prepr. 16, 33. doi:10.1186/s13073-024-01304-9
Yadlowsky, S., Hayward, R. A., Sussman, J. B., McClelland, R. L., Min, Y. I., and Basu, S. (2018). Clinical implications of revised pooled cohort equations for estimating atherosclerotic cardiovascular disease risk. Ann. Intern. Med. 169, 20–29. doi:10.7326/M17-3011
Zhang, C., Hansen, M. E. B., and Tishkoff, S. A. (2022). Advances in integrative African genomics. Trends Genet. Prepr. 38, 152–168. doi:10.1016/j.tig.2021.09.013
Zhang, J., Zhang, S., Qiao, J., Wang, T., and Zeng, P. (2023). Similarity and diversity of genetic architecture for complex traits between East Asian and European populations. BMC Genomics 24, 314. doi:10.1186/s12864-023-09434-x
Zhao, Z., Gruenloh, T., Yan, M., Wu, Y., Sun, Z., Miao, J., et al. (2024). Optimizing and benchmarking polygenic risk scores with GWAS summary statistics. Genome Biol. 25, 260. doi:10.1186/s13059-024-03400-w
Keywords: African ancestries, cardiovascular diseases, genetic testing, genome-wide association studies, polygenic risk scores, population genomics
Citation: Pomales-Matos DA, Lyerly M, Rivera-Madera A, Echevarría-Bonilla OL, Álvarez-Cortés M, Henriquez-Quiñones SE, Reyes-Sosa GM, Villanueva-Nogueras RA and Peña-Martínez EG (2026) Ancestry gaps in cardiovascular GWAS: a multi-database review of African representation in genomic studies. Front. Genet. 16:1647176. doi: 10.3389/fgene.2025.1647176
Received: 14 June 2025; Accepted: 19 December 2025;
Published: 14 January 2026.
Edited by:
Lindsay Fernandez-Rhodes, The Pennsylvania State University (PSU), United StatesReviewed by:
Ryan Irvin, University of Alabama at Birmingham, United StatesYasmina Jaufeerally Fakim, University of Mauritius, Mauritius
Aritra Bose, IBM Research - Yorktown Heights, United States
Copyright © 2026 Pomales-Matos, Lyerly, Rivera-Madera, Echevarría-Bonilla, Álvarez-Cortés, Henriquez-Quiñones, Reyes-Sosa, Villanueva-Nogueras and Peña-Martínez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Edwin G. Peña-Martínez, ZXBlYUB3dXN0bC5lZHU=; Diego A. Pomales-Matos, ZHBvbWFsZXNAc3RhbmZvcmQuZWR1