Validation of a genome-wide polygenic score for body mass index in South Asians

Menon, Ramesh; Khan, Nikhat; Charugulla, Sandeep; Bassi, Akshi; Dangare, Pooja; Dedaniya, Akshay; Pant, Aakanksha; Anjanappa, Rammurthy; Samson, Praveena L.; Satagopan, Uthra; Murugan, Sakthivel; Naikawadi, Amol; Ramprasad, Vedam L.; Gupta, Ravi

doi:10.3389/fgene.2025.1603542

ORIGINAL RESEARCH article

Front. Genet., 03 September 2025

Sec. Genetics of Common and Rare Diseases

Volume 16 - 2025 | https://doi.org/10.3389/fgene.2025.1603542

This article is part of the Research Topic: Insights in Genetics of Common and Rare Diseases 2024

Ramesh Menon¹, Nikhat Khan², Sandeep Charugulla¹, Akshi Bassi¹, Pooja Dangare¹, Akshay Dedaniya², Aakanksha Pant², Rammurthy Anjanappa¹, Praveena L. Samson¹, Uthra Satagopan¹, Sakthivel Murugan¹, Amol Naikawadi², Vedam L. Ramprasad¹, Ravi Gupta¹*

¹MedGenome Labs Pvt. Ltd., Bengaluru, Karnataka, India
²Genetics Department, Indus Health Plus, Pune, India

Obesity is a complex disorder that arises from the interaction of inherited and environmental factors and is modulated by a person’s lifestyle habits. India has witnessed more than a two-fold increase in the number of overweight adults in the last 30 years. The polygenic risk score (PRS) quantitatively measures an individual’s genetic risk for common diseases. PRSs for obesity have been validated in the Caucasian population but not in the South Asian (SAS) population. In this study, we benchmarked and validated an existing genome-wide PRS model of obesity with 2.1 million variants in the SAS population. We analyzed a total of 14,247 individuals from three different South Asian cohorts. We compared the risk score with the body mass index (BMI) categories (underweight, normal weight, overweight, and obese) in all three cohorts. High PRS was associated with increased BMI in all three cohorts. This study also compared the validation results with those from another population-specific PRS model for BMI. We conclude that high PRS is associated with high BMI in South Asians. Our study suggests that the PRS may serve as an early predictor of overweight and obesity in the South Asian population.

Introduction

Obesity is a lifestyle disorder that has seen an alarming rise in recent times. The prevalence of obesity has tripled since 1975, with approximately 650 million adults older than 18 years and nearly 340 million children in the age group 5–18 years having an unhealthy body mass index (BMI) (Jih et al., 2014). It is one of the key factors that increase the risk for diabetes apart from other chronic diseases such as hypertension, heart disease, cancers, and arthritis (Forouzanfar et al., 2016). Notably, a population-based longitudinal study showed that the lifetime diabetes risk was as high as 86% in obese South Asian individuals (Deepa et al., 2015).

Genetic susceptibility to morbid obesity has often been attributed to monogenic mutations in genes such as FTO, LEPR, MC4R, and PCSK1 (Clément et al., 1998; Fawcett and Barroso, 2010; Yeo et al., 1998; Jackson et al., 1997). Unlike monogenic obesity, polygenic obesity is caused by small individual impacts of several genetic variations across the genome. The polygenic risk score (PRS) is by far the most reliable approach to evaluate an individual’s risk for complex diseases and traits. PRS has been successfully demonstrated in cardiovascular diseases, metabolic disorders, neurologic disorders, and various cancer types (Nikpay et al., 2015; Zeinomar and Chung, 2020).

The economic burden that can arise from obesity and its related health implications is enormous. Hence, identification of the genetic risk for obesity is of profound importance for the prevention and efficient management of the associated conditions. The majority of PRS-based studies on obesity have been conducted in the European population, especially with samples derived from the UK Biobank. However, the impact of obesity polygenic risk markers in the South Asian (SAS) population is still not known. Previously, we reported the validation of the PRS for coronary artery disease (CAD) in SAS cohorts (Wang et al., 2020). In this study, we report the validation of a genome-wide polygenic risk score for BMI in South Asian cohorts.

Results

South Asian validation cohort and ethnicity assessment

A total of 14,247 of 15,503 samples were included in the study after removing the samples that failed various QC steps (see Materials and methods) (Table 1). Principal component analysis (PCA) was performed for the three cohorts, with the GenomeAsia global populations, including the South Asian samples, as the reference (Wall et al., 2023; Supplementary Figure S1). We performed PCA separately for the UK Biobank dataset because its genotyping platform (Affymetrix Axiom) differs from that of the other two cohorts (BMI.SAS.1 and BMI.SAS.2), which were genotyped on the Illumina Infinium GSA version 3. The BMI.SAS.1 (pink) and BMI.SAS.2 (green) samples are shown in Supplementary Figure S1A; these samples overlap with the South Asian samples of the GenomeAsia cohort. Similarly, as expected, the BMI.UKB.SAS cohort samples (yellow) overlap with the SAS samples from the GenomeAsia cohort (Supplementary Figure S1B).

Table 1. Sample summary with sample numbers, the median age with standard deviation, male percentage, and median BMI with standard deviation of the three cohorts included in the study.

In the study cohort, the median BMI increased from 23 to 26 until the age of 38 years (N = 2072, male% = 63.9), stayed more or less the same until the age of 68 years (N = 11,727, male% = 61.6), gradually decreased to 24 in the 68–78-year age group (N = 425, male% = 72), and then finally was 22 in the 78–98-year age group (N = 23, male% = 60.8; Supplementary Figure S2).

Benchmarking of polygenic risk scores

We benchmarked the performance of two large published BMI–PRS models, from Khera et al. (2019) and Yengo et al. (2018), in South Asian samples. The age- and gender-adjusted BMI–PRS obtained from the models of Yengo et al. and Khera et al. showed Pearson’s correlations with measured BMI of 0.14 and 0.16, respectively (Supplementary Figure S3). The model of Khera et al. also performed better in each of the three South Asian cohorts (BMI.SAS.1, BMI.SAS.2, and BMI.UKB.SAS) analyzed separately. We therefore used the model of Khera et al. for the subsequent analyses in this study.
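
The benchmarking step can be illustrated with a minimal sketch that computes Pearson's correlation between each model's score and measured BMI and keeps the model with the higher value. The function name, the variable names, and the toy data are hypothetical and are not taken from the study; the scores are assumed to be already adjusted for age and gender.

```python
import math

def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (>= 2)")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Toy data (hypothetical): measured BMI and two candidate adjusted scores.
bmi = [21.0, 24.5, 27.0, 23.0, 30.5, 26.0]
prs_model_a = [-0.8, 0.1, 0.6, -0.3, 1.2, 0.2]
prs_model_b = [0.3, -0.5, 0.9, 0.1, 0.4, -0.7]

r_a = pearson_r(prs_model_a, bmi)
r_b = pearson_r(prs_model_b, bmi)
# Keep the model whose score tracks measured BMI more closely.
best = "model_a" if r_a > r_b else "model_b"
```

With real cohorts, the same comparison would be repeated per cohort and for the pooled samples.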

Validation of the polygenic risk score for BMI in South Asian cohorts

The age- and gender-adjusted BMI polygenic risk score was compared across the BMI categories (UW = underweight, NW = normal weight, OW = overweight, and OB = obese; Figure 1A). The median BMI–PRS increased from the underweight category to the obese category in all three cohorts (Figure 1A). We further divided the individuals of each cohort into quintiles (five equal bins) based on the normalized PRS. The median BMI increased steadily from Q1 to Q5, and this finding was consistent across all three cohorts (Figure 1B). The measured BMI showed a modest correlation with BMI–PRS in all groups, including the pooled cohort. A Q-statistic test of heterogeneity was performed for the pooled cohort, and the heterogeneity was not significant (p > 0.1).
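
The quintile comparison can be sketched as follows, assuming that samples are ranked by normalized PRS, split into five bins of (near-)equal size, and summarized by the median BMI per bin. The function names and the rank-based binning rule are illustrative assumptions, not the study's pipeline.

```python
from statistics import median

def assign_quintiles(scores):
    """Return a quintile label (1..5) per sample; 1 holds the lowest scores."""
    n = len(scores)
    order = sorted(range(n), key=lambda i: scores[i])
    labels = [0] * n
    for rank, idx in enumerate(order):
        # rank runs from 0 to n-1, so rank * 5 // n runs from 0 to 4
        labels[idx] = rank * 5 // n + 1
    return labels

def median_bmi_by_quintile(scores, bmi):
    """Median BMI of the samples falling in each PRS quintile."""
    groups = {q: [] for q in range(1, 6)}
    for q, value in zip(assign_quintiles(scores), bmi):
        groups[q].append(value)
    return {q: median(values) for q, values in groups.items() if values}
```

A steady rise of the returned medians from quintile 1 to quintile 5 corresponds to the trend described for Figure 1B.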

(A) Box plots show the distribution of BMI-PRS across cohorts BMI.SAS.1, BMI.SAS.2, and BMI.UKB.SAS, categorized by BMI groups: underweight (UW), normal weight (NW), overweight (OW), and obese (OB). (B) Line graph illustrates the BMI means across quintiles Q1 to Q5 for cohorts BMI.SAS.1, BMI.SAS.2, BMI.UKB.SAS, and Pooled SAS cohort, with corresponding Pearson's correlation values. Tables provide data counts for each BMI category and quintile per cohort.

Figure 1. BMI–PRS in various categories and the comparison with measured BMI. (A) BMI–PRS distribution in BMI categories in the three cohorts. The number of samples in each BMI category is given in the lower panel (UW = underweight, NW = normal weight, OW = overweight, and OB = obese) for the three cohorts (B) Measured BMI versus BMI–PRS in quintiles for the three cohorts and the pooled cohort (black color) with Pearson’s correlation of BMI and BMI–PRS for each cohort. The lower panel contains the number of samples in each PRS quintile bins for the three cohorts and the pooled cohort.

We observed a slightly higher BMI for the samples from the BMI.UKB.SAS cohort (Table 1). The increasing trend in BMI from Q1 to Q5 of the BMI–PRS was consistent across all three cohorts. Furthermore, we stratified the samples into three groups based on the BMI–PRS, namely, the lower-risk bin (Q1), the medium-risk bins (Q2, Q3, and Q4), and the higher-risk bin (Q5). We then examined the distribution of these three groups across the measured BMI categories defined by the WHO guidelines for Asian populations, namely, underweight, normal weight, overweight, and obese. More than 80% of the samples in the higher-risk bin were either overweight or obese. This was consistently observed in all three cohorts (Figure 2).
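
A minimal sketch of this stratification is given below. It assumes that each sample already carries a PRS quintile (1 to 5) and a BMI category label (UW, NW, OW, or OB), and it reports the percentage of each category within each risk bin. The function names are hypothetical.

```python
from collections import Counter

def risk_bin(quintile):
    """Map a PRS quintile (1..5) to the lower, medium, or higher risk bin."""
    if quintile == 1:
        return "lower"
    if quintile == 5:
        return "higher"
    if quintile in (2, 3, 4):
        return "medium"
    raise ValueError("quintile must be an integer from 1 to 5")

def category_percentages(quintiles, categories):
    """Percentage of samples in each BMI category, computed within each risk bin."""
    counts = {}
    for q, category in zip(quintiles, categories):
        counts.setdefault(risk_bin(q), Counter())[category] += 1
    return {
        bin_name: {cat: 100.0 * k / sum(c.values()) for cat, k in c.items()}
        for bin_name, c in counts.items()
    }
```

Summing the OW and OB percentages of the "higher" bin gives the quantity that the text reports as exceeding 80%.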

Three bar charts illustrate the percentage of samples in different BMI categories across three cohorts: BMI.SAS.1, BMI.SAS.2, and BMI.UKB.SAS. Each chart shows percentages for underweight, normal weight, overweight, and obese categories, split into lower, medium, and higher bins. Chart A shows higher percentages in the overweight bin; Chart B shows significant values in the obese category; Chart C highlights the overweight category.

Figure 2. Samples were stratified into three categories, namely, the bottom quintile (Q1, white color), medium quintiles (Q2–Q4, gray color), and the top quintile (Q5, black color), based on the increasing order of the polygenic risk score plotted in the BMI categories for all the three cohorts, where the panels were as follows: (A) = BMI.SAS.1; (B) = BMI.SAS.2; (C) = BMI.UKB.SAS.

We then derived the odds ratios for the obese BMI category (BMI >27.5) and the highly obese category (BMI >35) with respect to the middle quintile (Q3). For BMI >27.5, samples in Q5 were associated with 1.67-, 2.35-, and 1.65-fold higher odds in the BMI.SAS.1, BMI.SAS.2, and BMI.UKB.SAS cohorts, respectively (Figure 3). For BMI >35, the corresponding odds ratios were 1.95, 3.95, and 2.22.
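
For illustration, an odds ratio of this kind can be computed from a 2×2 table of counts, with Q3 as the reference quintile. The text does not state how the odds ratios or their confidence intervals were estimated, so the unadjusted estimate and the Wald (Woolf) 95% interval used here are assumptions, as are the function name and the counts in the usage note.

```python
import math

def odds_ratio(cases_q5, controls_q5, cases_q3, controls_q3, z=1.96):
    """Odds ratio of Q5 versus Q3 with a Wald (Woolf) confidence interval.

    cases:    samples above the BMI threshold (for example BMI > 27.5)
    controls: samples at or below the threshold
    """
    counts = (cases_q5, controls_q5, cases_q3, controls_q3)
    if min(counts) <= 0:
        raise ValueError("all four cell counts must be positive")
    estimate = (cases_q5 * controls_q3) / (controls_q5 * cases_q3)
    # Standard error of log(OR) from the four cell counts
    se_log = math.sqrt(sum(1.0 / c for c in counts))
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper
```

For example, with hypothetical counts of 40 cases and 60 controls in Q5 and 25 cases and 75 controls in Q3, `odds_ratio(40, 60, 25, 75)` gives an odds ratio of 2.0.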

Forest plot comparing odds ratios with 95% confidence intervals for cohorts BMI.SAS.1, BMI.SAS.2, and BMI.UKB.SAS across different quartiles and quintiles. The y-axis labels BMI categories over 27.5 and 35, while the x-axis represents odds ratios from 0 to 6. A red dashed line marks the odds ratio of 1.

Figure 3. Samples were stratified into quintiles of BMI–PRS scores in the three cohorts (blue color for BMI.SAS.1, green color for BMI.SAS.2, and orange color for BMI.UKB.SAS) for two BMI categories, BMI >27.5 and BMI >35. The odds ratios are given in the x-axis. The dotted vertical line indicates the odds ratio of 1 for middle quintile (Q3), which was taken as the reference for calculating the odds ratios.

Discussion

Our findings are consistent with the association of high PRS with elevated BMI reported in other BMI–PRS studies, suggesting that the combined effects of many genetic variants drive the tendency to gain weight (Khera et al., 2019; Dashti et al., 2022; Hüls et al., 2021). Because there is a large gap in multi-ethnic validation, especially in Asian sub-populations (Choe et al., 2022), we report here an association between increased BMI and high BMI–PRS in 14,247 samples belonging to three distinct South Asian cohorts. Using a BMI–PRS model optimized for South Asians, we observed odds ratios of 1.95 to 3.95 for very high BMI (>35) in the top PRS quintile relative to the middle quintile. Furthermore, a modest correlation was observed between PRS and the measured BMI. The trend of associations was concordant across the three independent South Asian cohorts. This aligns with previous studies that showed the feasibility of applying BMI polygenic scores across ethnicities with ancestry correction and genotype imputation using large ethnicity-matched reference panels (Wang et al., 2020; Wall et al., 2023). The odds ratio obtained for samples with BMI >35 in the BMI.SAS.2 cohort (OR = 3.95) was comparable with that reported in the European population (OR = 4.22), which was observed for BMI >40 (Khera et al., 2019).

We observed a two-fold increase in the median BMI from Q1 to Q5 of the BMI–PRS, which was consistent in all three cohorts. Overweight and obese individuals comprised approximately 80%–90% of the Q5 bin in each of the three study cohorts. Overall, the BMI–PRS was significantly higher in overweight and obese individuals than in normal-weight individuals in all three SAS cohorts, which aligns with the previous findings in the European cohort (Khera et al., 2019). An additional observation was the slightly higher median BMI in the SAS samples from the UK Biobank than in the other two cohorts.

The rising prevalence of obesity in the South Asian population and the resultant rise in comorbidities, especially cardiovascular diseases, make it imperative to understand the risk factors that result in obesity (Siegel et al., 2014). Coronary artery disease is one of the well-studied examples of the utility of PRSs, where the clinical guidelines recommend ancestry-specific validation before using PRS as a screening tool (Phulka et al., 2023; O’Sullivan et al., 2022; Abu-El-Haija et al., 2023). A previous study successfully validated a PRS for CAD and reported odds ratios slightly lower than those in the European population (Wang et al., 2020).

Overall, our study validates and reports the association of high PRS with high BMI, showing the applicability of high PRS as a screening tool for obesity or high BMI in the South Asian population, which can facilitate preventive measures and risk management.

Materials and methods

Sample collection

For our BMI study, we obtained data from 15,503 individuals through three different South Asian cohorts (Table 1). The first cohort (BMI.SAS.1) consists of individuals from our previously published study (Wang et al., 2020). The second cohort (BMI.SAS.2) consists of individuals from Indus Health Plus Pvt. Ltd., a preventive healthcare provider located in Pune, India. The consenting individuals completed a health risk assessment questionnaire administered by telephone by qualified clinicians at Indus Health Plus Pvt. Ltd., in which participants were asked their current height and weight. Samples with ambiguous or unavailable data were excluded. Furthermore, for a subset of 500 samples, height and weight were measured and compared with the self-reported data. A very high concordance rate was observed between the measured and self-reported data.

The third cohort (BMI.UKB.SAS) was obtained from UK Biobank queried through the ukbREST server (Milton and Hae Kyung, 2019). Individuals with South Asian ancestry from UK Biobank were selected for this study. Additionally, individuals that were distant from the SAS cluster in the ancestry analysis were removed from the respective cohorts (Supplementary Figure S1A, B). Furthermore, only the individuals aged above 18 years were included in this study.

For the BMI.SAS.1 and BMI.SAS.2 cohorts, blood samples (3–5 mL) were collected in EDTA tubes from individuals with informed consent, as per the accepted clinical guidelines and in accordance with the applicable laws from the respective center, and registered with unique identification numbers at MedGenome Labs Ltd., Bangalore. The study using the BMI.SAS.1 cohort was approved by the institutional review boards at each of the recruitment sites. Informed consent with signed forms was collected through the ethics committee for the BMI.SAS.2 cohort.

The BMI.UKB.SAS genotype data were obtained from UK Biobank (UK Biobank ethnicity codes 3001, 3002, and 3003), where BMI and age data were available.

DNA extraction and genotyping and data quality control

Extraction of genomic DNA from the samples was performed by magnetic separation using the QIASymphony SP system (QIAGEN, Valencia, CA) following the manufacturer’s protocol. The Qubit® dsDNA BR (broad-range) Assay Kit (Thermo Fisher Scientific) was used to quantify the DNA. QIAxpert (QIAGEN, Valencia, CA) and agarose gel were used to assess the quality of DNA. Genotyping was performed using Infinium™ Global Screening Array-24 v3 BeadChip (Illumina, California, United States), which consists of 654,027 genome-wide markers, according to the manufacturer’s protocol (Illumina, California, United States). In this method, 200 ng of genomic DNA was isothermally amplified at 37 °C for 20 h–24 h, enzymatically fragmented, precipitated, resuspended, and loaded onto the BeadChip, which was incubated at 48 °C for 16 h–24 h. Subsequently, the BeadChip was washed and prepared for single-base extension. The BeadChip was scanned on the Illumina iScan System array scanner, as per the protocol given by the manufacturer.

The exported GSA data with phenotypic information such as height, weight, and gender were used to perform quality check through a custom bioinformatics pipeline (MG-ArrayQC tool) using VCFtools version 0.1.14, R package (R version 3.3.2), gdsfmt 1.1.0, and SNPRelate 1.16.0 to generate the call rate, heterozygosity rate, and principal component analysis plots, among others.

Quality control

Quality assessment was performed on the genotyped samples of the BMI.SAS.1, BMI.SAS.2, and BMI.UKB.SAS cohorts. Samples and sites with a missing rate >5% (i.e., a genotyping rate <95%) were removed from the study. Markers with a Hardy–Weinberg equilibrium p-value <1e-10 or MAF <0.001 were removed. This was followed by LD pruning for estimating genetic relatedness and for PCA. A kinship cutoff of 0.088 was used. PCA was carried out, and the genotyped samples were projected onto the PCA space.
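
The missingness and allele-frequency filters can be sketched on a small genotype matrix, assuming that samples and markers with a missing rate above 5% and markers with MAF below 0.001 are dropped. This is an illustration only: the Hardy–Weinberg, LD-pruning, and kinship steps are omitted, and the function names and data layout (rows are samples, columns are markers, genotypes coded 0, 1, or 2 with `None` for a missing call) are assumptions rather than the MG-ArrayQC implementation.

```python
def missing_rate(calls):
    """Fraction of missing genotype calls (None) in a row or column."""
    return sum(g is None for g in calls) / len(calls)

def minor_allele_frequency(calls):
    """Minor allele frequency from 0/1/2 genotype codes, ignoring missing calls."""
    observed = [g for g in calls if g is not None]
    if not observed:
        return 0.0
    alt_freq = sum(observed) / (2 * len(observed))
    return min(alt_freq, 1.0 - alt_freq)

def qc_filter(matrix, max_missing=0.05, min_maf=0.001):
    """Return (kept sample indices, kept marker indices).

    Samples are filtered first; marker statistics are then computed on the
    samples that remain.
    """
    keep_samples = [i for i, row in enumerate(matrix)
                    if missing_rate(row) <= max_missing]
    kept_rows = [matrix[i] for i in keep_samples]
    n_markers = len(matrix[0]) if matrix else 0
    keep_markers = []
    for j in range(n_markers):
        column = [row[j] for row in kept_rows]
        if (column and missing_rate(column) <= max_missing
                and minor_allele_frequency(column) >= min_maf):
            keep_markers.append(j)
    return keep_samples, keep_markers
```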

PRS generation

QC-passed individuals and markers were subjected to genotype imputation using Beagle v5.0 (Browning et al., 2018) with the GenomeAsia phase 2 (GAv2) imputation reference panel, which consists of 6,461 samples, predominantly SAS samples (Wall et al., 2023). The 24,687,484 imputed sites were compared with the BMI GWAS summary markers reported in the previous study (Khera et al., 2019).

We obtained the marker weights from the PGS Catalog model (PGS ID: PGS000027), which was pre-calibrated (LDpred, ρ = 0.03) and comprises 2,100,302 genome-wide markers from the published study in individuals of European ancestry. The variants were either directly genotyped or imputed using the GenomeAsia reference panel. Of the 2,100,302 markers in the model, the imputed data covered 1,946,327 (approximately 93%). PRS was generated using the MedGenome pipeline, which uses PLINK v2.0 (Chang et al., 2015) for scoring. Since the summary statistics were derived from the European population, the raw PRS was normalized for SAS ancestry against GAv2 WGS-500 SAS samples using PCA (PC1–PC5) with the FlashPCA R package version 2.6. The PCA cut-offs are provided in Supplementary Table S3.
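
Conceptually, a raw polygenic score is a weighted sum of effect-allele dosages over the model markers that are present in the genotype data. The sketch below shows that sum together with the fraction of model markers covered. It is not the MedGenome pipeline and does not reproduce PLINK's scoring options; the function name and the variant identifiers are hypothetical.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages over markers shared with the model.

    dosages: dict mapping variant ID to effect-allele dosage (0.0 to 2.0)
    weights: dict mapping variant ID to the model's effect weight
    Returns (raw score, number of model markers used, fraction of model covered).
    """
    shared = dosages.keys() & weights.keys()
    score = sum(weights[v] * dosages[v] for v in shared)
    coverage = len(shared) / len(weights) if weights else 0.0
    return score, len(shared), coverage

# Toy example with hypothetical variant IDs and weights.
sample_dosages = {"var1": 2.0, "var2": 1.0, "var4": 0.0}
model_weights = {"var1": 0.5, "var2": -0.2, "var3": 0.1}
raw_score, n_used, coverage = polygenic_score(sample_dosages, model_weights)
```

Markers absent from the genotype data (here `var3`) simply do not contribute, which is why coverage of the model is worth reporting alongside the score.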

BMI–PRS correlation analysis

Pearson’s correlation analysis was performed between the normalized PRS and the BMI categories. The normalized PRS and clinical metadata were used for the analysis. Each sample was categorized as underweight (UW), BMI <18.5; normal weight (NW), BMI between 18.5 and 23; overweight (OW), BMI between 23 and 27.5; or obese (OB), BMI >27.5, according to the WHO recommended guidelines for the Asian population (Jih et al., 2014). All the samples in the cohort were sorted in ascending order of the normalized PRS value and then divided into five equal bins. Bin 1 was considered the lower bin, bin 5 the higher bin, and bins 2, 3, and 4 the medium bins.
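
The categorization rule can be written as a small function using the cut-offs given above. The text does not specify how values falling exactly on a cut-off are handled, so the boundary treatment here (23 counted as overweight, 27.5 counted as overweight because obese is defined as BMI >27.5) is an assumption.

```python
def asian_bmi_category(bmi):
    """Assign a BMI category using the Asian cut-offs 18.5, 23, and 27.5."""
    if bmi <= 0:
        raise ValueError("BMI must be positive")
    if bmi < 18.5:
        return "UW"  # underweight
    if bmi < 23:
        return "NW"  # normal weight
    if bmi <= 27.5:
        return "OW"  # overweight
    return "OB"      # obese (BMI > 27.5)
```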

Statistical analysis

The age- and gender-adjusted BMI–PRS was generated using the generalized linear model function in the stats R package version 3.6.2. The Nagelkerke R² estimate of the variance explained by the BMI–PRS after covariate adjustment was calculated for each cohort and the pooled cohort using the fmsb R package version 0.7.6. The Q-statistic was estimated using the “gamlss” R package version 5.4.

PRS validation framework

We followed our earlier framework for the validation of the PRS (Supplementary Figure S4). The upper panel describes the process by which validated PRS models for BMI are deposited in the PGS Catalog database (Lambert et al., 2021). The lower panel shows the process for the South Asian population-specific validation of the raw PRS, which was generated from a model whose effect weights were derived predominantly from BMI studies conducted in populations of European ancestry. The recruited samples were processed, and the missing genotypes were imputed using the GenomeAsia phase 2 reference data comprising 6,461 predominantly South Asian samples (Wall et al., 2023). Then, PCA was performed with the South Asian reference samples. The PC residuals were used to adjust for the ancestry of the South Asian PRS.
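
The residual-based adjustment can be sketched as an ordinary least-squares regression of the raw PRS on the principal components, followed by standardization of the residuals. This is a simplified, in-sample illustration: the study normalizes against GAv2 reference samples, whereas this sketch fits and applies the regression on the same samples, and all function names are hypothetical. The same routine could be applied with age and gender as covariates.

```python
def ols_residuals(y, covariates):
    """Residuals of y after least-squares regression on covariates plus an intercept.

    y:          list of raw scores, one per sample
    covariates: list of rows, each holding one sample's covariates (e.g. PC1..PC5)
    """
    n = len(y)
    X = [[1.0] + list(row) for row in covariates]
    p = len(X[0])
    # Augmented normal equations: [X^T X | X^T y]
    A = [[sum(X[k][i] * X[k][j] for k in range(n)) for j in range(p)]
         + [sum(X[k][i] * y[k] for k in range(n))] for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("covariates are collinear")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    beta = [A[i][p] / A[i][i] for i in range(p)]
    return [y[k] - sum(b * x for b, x in zip(beta, X[k])) for k in range(n)]

def standardize(values):
    """Scale values to zero mean and unit sample standard deviation."""
    n = len(values)
    mean = sum(values) / n
    sd = (sum((v - mean) ** 2 for v in values) / (n - 1)) ** 0.5
    return [(v - mean) / sd for v in values]
```

A normalized score would then be obtained as `standardize(ols_residuals(raw_prs, pcs))`, where `raw_prs` and `pcs` are placeholders for the raw scores and the per-sample PC values.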

Data availability statement

The original contributions presented in the study are publicly available. These data can be found at https://ega-archive.org/ under accession number EGAS00001008309.

Ethics statement

The study was conducted in accordance with the local legislation and institutional requirements of MedGenome Labs Ltd, Bangalore, India. The participants provided their written informed consent to participate in this study.

Author contributions

RM: Investigation, Methodology, Formal Analysis, Writing – original draft, Writing – review and editing. NK: Data curation, Resources, Writing – review and editing. SC: Data curation, Formal Analysis, Writing – original draft. AB: Data curation, Formal Analysis, Writing – original draft. PD: Data curation, Formal Analysis, Writing – original draft. AD: Data curation, Resources, Writing – review and editing. AP: Data curation, Resources, Writing – review and editing. RA: Resources, Writing – original draft. PLS: Resources, Writing – original draft. US: Writing – original draft. SM: Resources, Writing – original draft. AN: Conceptualization, Project administration, Investigation, Writing – review and editing. VLR: Conceptualization, Project administration, Investigation, Writing – review and editing. RG: Conceptualization, Project administration, Investigation, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. The study is funded by MedGenome Labs Ltd, Bangalore, India.

Acknowledgments

This research has been conducted using the UK Biobank Resource under Application Number 108924.

Conflict of interest

Authors RM, SC, AB, PD, RA, PS, US, SM, VR, and RG were employed and/or have equity in MedGenome Labs Pvt. Ltd.

Authors NK, AP, AD, AN were employed and/or have equity in Indus Health Plus Pvt. Ltd.

Correction note

A correction has been made to this article. Details can be found at: 10.3389/fgene.2025.1708353.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1603542/full#supplementary-material

References

Abu-El-Haija, A., Reddi, H. V., Wand, H., Rose, N. C., Mori, M., Qian, E., et al. (2023). The clinical application of polygenic risk scores: a points to consider statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 25, 100803. doi:10.1016/j.gim.2023.100803

Browning, B. L., Zhou, Y., and Browning, S. R. (2018). A one-penny imputed genome from next-generation reference panels. Am. J. Hum. Genet. 103, 338–348. doi:10.1016/j.ajhg.2018.07.015

Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4, 7. doi:10.1186/s13742-015-0047-8

Chikowore, T., Läll, K., Micklesfield, L. K., Lombard, Z., Goedecke, J. H., Fatumo, S., et al. (2024). Variability of polygenic prediction for body mass index in Africa. Genome Med. 16, 74. doi:10.1186/s13073-024-01348-x

Clément, K., Vaisse, C., Lahlou, N., Cabrol, S., Pelloux, V., Cassuto, D., et al. (1998). A mutation in the human leptin receptor gene causes obesity and pituitary dysfunction. Nature 392, 398–401. doi:10.1038/32911

Choe, E. K., Shivakumar, M., Lee, S. M., Verma, A., and Kim, D. (2022). Dissecting the clinical relevance of polygenic risk score for obesity-a cross-sectional, longitudinal analysis. Int. J. Obes. 46, 1686–1693. doi:10.1038/s41366-022-01168-2

Dashti, H. S., Miranda, N., Cade, B. E., Huang, T., Redline, S., Karlson, E. W., et al. (2022). Interaction of obesity polygenic score with lifestyle risk factors in an electronic health record biobank. BMC Med. 20, 5. doi:10.1186/s12916-021-02198-9

Deepa, M., Grace, M., Binukumar, B., Pradeepa, R., Roopa, S., Khan, H. M., et al. (2015). High burden of prediabetes and diabetes in three large cities in South Asia: the Center for cArdio-metabolic Risk Reduction in South Asia (CARRS) study. Diabetes Res. Clin. Pract. 110, 172–182. doi:10.1016/j.diabres.2015.09.005

Fawcett, K. A., and Barroso, I. (2010). The genetics of obesity: FTO leads the way. Trends Genet. 26, 266–274. doi:10.1016/j.tig.2010.02.006

Forouzanfar, M. H., Afshin, A., Alexander, L. T., Anderson, H. R., Bhutta, Z. A., Biryukov, S., et al. (2016). Global, regional, and national comparative risk assessment of 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks, 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet 388, 1659–1724. doi:10.1016/S0140-6736(16)31679-8

Hüls, A., Wright, M. N., Bogl, L. H., Kaprio, J., Lissner, L., Molnár, D., et al. (2021). Polygenic risk for obesity and its interaction with lifestyle and sociodemographic factors in European children and adolescents. Int. J. Obes. 45, 1321–1330. doi:10.1038/s41366-021-00795-5

Jackson, R. S., Creemers, J. W., Ohagi, S., Raffin-Sanson, M. L., Sanders, L., Montague, C. T., et al. (1997). Obesity and impaired prohormone processing associated with mutations in the human prohormone convertase 1 gene. Nat. Genet. 16, 303–306. doi:10.1038/ng0797-303

Jih, J., Mukherjea, A., Vittinghoff, E., Nguyen, T. T., Tsoh, J. Y., Fukuoka, Y., et al. (2014). Using appropriate body mass index cut points for overweight and obesity among Asian Americans. Prev. Med. 65, 1–6. doi:10.1016/j.ypmed.2014.04.010

Khera, A. V., Chaffin, M., Wade, K. H., Zahid, S., Brancale, J., Xia, R., et al. (2019). Polygenic prediction of weight and obesity trajectories from birth to adulthood. Cell 177, 587–596.e9. doi:10.1016/j.cell.2019.03.028

Lambert, S. A., Gil, L., Jupp, S., Ritchie, S. C., Xu, Y., Buniello, A., et al. (2021). The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 53, 420–425. doi:10.1038/s41588-021-00783-5

Milton, P., and Hae Kyung, I. (2019). ukbREST: efficient and streamlined data access for reproducible research in large biobanks. Bioinformatics 35, 1971–1973. doi:10.1093/bioinformatics/bty925

Nara, Y., and Yoon Shin, C. (2023). Development of a polygenic risk score for BMI to assess the genetic susceptibility to obesity and related diseases in the Korean population. Int. J. Mol. Sci. 24, 11560. doi:10.3390/ijms241411560

Nikpay, M., Goel, A., Won, H-H., Hall, L. M., Willenborg, C., Kanoni, S., et al. (2015). A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130. doi:10.1038/ng.3396

O’Sullivan, J. W., Raghavan, S., Marquez-Luna, C., Luzum, J. A., Damrauer, S. M., Ashley, E. A., et al. (2022). Polygenic risk scores for cardiovascular disease: a scientific statement from the American Heart Association. Circulation 146, e93–e118. doi:10.1161/CIR.0000000000001077

Phulka, J. S., Ashraf, M., Bajwa, B. K., Pare, G., and Laksman, Z. (2023). Current state and future of polygenic risk scores in cardiometabolic disease: a scoping review. Circ. Genomic Precis. Med. 16, 286–313. doi:10.1161/CIRCGEN.122.003834

Siegel, K. R., Patel, S. A., and Ali, M. K. (2014). Non-communicable diseases in South Asia: contemporary perspectives. Br. Med. Bull. 111, 31–44. doi:10.1093/bmb/ldu018

Wall, J. D., Sathirapongsasuti, J. F., Gupta, R., Rasheed, A., Venkatesan, R., Belsare, S., et al. (2023). South Asian medical cohorts reveal strong founder effects and high rates of homozygosity. Nat. Commun. 14, 3377. doi:10.1038/s41467-023-38766-1

Wang, M., Menon, R., Mishra, S., Patel, A. P., Chaffin, M., Tanneeru, D., et al. (2020). Validation of a genome-wide polygenic score for coronary artery disease in South Asians. J. Am. Coll. Cardiol. 76, 703–714. doi:10.1016/j.jacc.2020.06.024

Yeo, G. S., Farooqi, I. S., Aminian, S., Halsall, D. J., Stanhope, R. G., and O’Rahilly, S. (1998). A frameshift mutation in MC4R associated with dominantly inherited human obesity. Nat. Genet. 20, 111–112. doi:10.1038/2404

Yengo, L., Sidorenko, J., Kemper, K. E., Zheng, Z., Wood, A. R., Weedon, M. N., et al. (2018). Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum. Mol. Genet. 27, 3641–3649. doi:10.1093/hmg/ddy271

Zeinomar, N., and Chung, W. K. (2020). Cases in precision medicine: the role of polygenic risk scores in breast cancer risk assessment. Ann. Intern. Med. 174, 408–412. doi:10.7326/M20-5874

Keywords: polygenic risk score, obesity, South Asian, genetic screening, validation studies

Citation: Menon R, Khan N, Charugulla S, Bassi A, Dangare P, Dedaniya A, Pant A, Anjanappa R, Samson PL, Satagopan U, Murugan S, Naikawadi A, Ramprasad VL and Gupta R (2025) Validation of a genome-wide polygenic score for body mass index in South Asians. Front. Genet. 16:1603542. doi: 10.3389/fgene.2025.1603542

Received: 31 March 2025; Accepted: 29 July 2025;
Published: 03 September 2025; Corrected: 27 November 2025.

Edited by:

Mara Marongiu, National Research Council (CNR), Italy

Reviewed by:

Yoichi Sutoh, Iwate Medical University, Japan
Lide Han, Vanderbilt University Medical Center, United States

Copyright © 2025 Menon, Khan, Charugulla, Bassi, Dangare, Dedaniya, Pant, Anjanappa, Samson, Satagopan, Murugan, Naikawadi, Ramprasad and Gupta. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ravi Gupta