- 1The Kids Research Institute Australia, Perth, WA, Australia
- 2National Centre for Indigenous Genomics, Australian National University, Canberra, ACT, Australia
Type-2 diabetes is a systemic condition with rising global prevalence, disproportionately affecting Indigenous communities worldwide. Recent advances in epigenomics methods, particularly in DNA methylation detection, have enabled the discovery of associations between epigenetic changes and Type-2 diabetes. In this review, we summarise DNA methylation profiling methods, and discuss how these technologies can facilitate the discovery of epigenomic biomarkers for Type-2 diabetes. We critically evaluate previous DNA methylation biomarker studies, particularly those using microarray platforms, and advocate for a shift towards sequencing-based approaches to improve genome-wide coverage. Furthermore, we emphasise the need for biomarker studies that include genetically diverse populations, especially Indigenous communities who are significantly impacted by Type-2 diabetes. We discuss research approaches and ethical considerations that can better facilitate Type-2 diabetes biomarker development to ensure that future genomics-based precision medicine efforts deliver equitable health outcomes. We propose that by addressing these gaps, future research can better capture the genetic and environmental complexities of Type-2 diabetes among populations at disproportionate levels of risk, ultimately leading to more effective diagnostic and therapeutic strategies.
1 Introduction
Type 2 diabetes (T2D) is a systemic cardiometabolic condition of increasing global significance. It is estimated that 6.1% of the global population (529 million people) live with diabetes mellitus, with 96% of cases attributed to T2D (GBD, 2021 Diabetes Collaborators, 2023). Increasing incidence of T2D has led to a near doubling of age-standardised prevalence estimates since 1990 (3.1%), with an anticipated future age-standardised prevalence of 9.8% expected in 2050 (GBD, 2021 Diabetes Collaborators, 2023). The global surge in T2D is driven by a transition to higher calorie diets combined with increasingly sedentary lifestyles (Chatterjee et al., 2017). Following a T2D diagnosis, if blood glucose levels are not appropriately controlled, secondary complications can develop (DeFronzo et al., 2015). These complications can be classified into macrovascular (e.g., cardiovascular disease) or microvascular (e.g., retinopathy, nephropathy and neuropathy), which significantly increases the risk of morbidity and mortality for people with T2D (Zheng et al., 2018). Of particular concern is the global increase in T2D and associated complications amongst young people, where individuals typically experience more severe clinical presentations, and have limited approved treatment options (Viner et al., 2017; Bjornstad et al., 2023). Additionally, emerging evidence indicates that young people whose mothers were living with diabetes during pregnancy have an increased risk of developing T2D and associated complications (Dabelea et al., 2008; Wicklow et al., 2018). Given the complexity of T2D aetiology and the broad range of risk factors and potential complications, T2D is a condition that would benefit from robust biomarkers that enable a precision medicine driven approach across the continuum of care.
T2D develops when insulin regulation processes break down, often following a period of partial dysfunction known as pre-diabetes, during which many patients remain asymptomatic (DeFronzo et al., 2015). This breakdown reduces the body’s ability to manage excess carbohydrate, leading to hyperinsulinaemia followed by hyperglycaemia (Roden and Shulman, 2019). To diagnose T2D, a blood screening test for either plasma glucose concentration or Haemoglobin A1C (HbA1c) levels is required (American Diabetes Association Professional Practice Committee, 2024). Due to the insidious nature of T2D, management of risks and early identification of individuals on the trajectory of T2D and complications development is vital. When people with T2D or prediabetes are identified early, lifestyle modifications through dietary and exercise changes can contribute to re-establishing blood glucose control and preventing or delaying the trajectory of T2D and associated complications for many people (Magkos et al., 2020; Siopis et al., 2021; Hocking et al., 2024). Given the effectiveness of early intervention, the development of new biomarkers that better predict T2D risk could improve early diagnosis and implementation of personalised treatment strategies.
T2D aetiology and associated complications is complex, with multiple genetic and environmental risk factors identified (Zheng et al., 2018). The 2021 Global Burden of Disease Study highlighted the impact of several factors on the overall burden of T2D, with risk factors accounting for 76.5% (95% Uncertainty Interval (UI) 58.0–87.5) of Disability Adjusted Life Years (DALYs) and of these, high body-mass index (BMI) was responsible for over half of the DALYs attributed to T2D (GBD, 2021 Diabetes Collaborators, 2023). Additionally, dietary factors played a crucial role (25.7%, 95% UI 8.6–40.7), significantly influencing the condition’s burden (GBD, 2021 Diabetes Collaborators, 2023). Environmental and occupational exposures also contributed substantially to DALYs (19.6%, 95% UI 12.7–26.5), underscoring the multifaceted nature of T2D risk (GBD, 2021 Diabetes Collaborators, 2023). Indigenous communities, particularly those impacted by settler colonialism, experience a disproportionate burden of T2D and associated complications (Harris et al., 2017). Such disparities are also observed with young people. For example, American-Indian young people (10–19 years) experience more than double the burden of T2D than American young people overall, with such disparity persisting between measurements from 2002–03 (22.6 vs. 9.0 per 100,000) and 2017-18 (46.0 vs. 17.9 per 100,000) (Wagenknecht et al., 2023). Some of the highest rates globally have been reported amongst Aboriginal and Torres Strait Islander young people (≤24 years) from northern Australia where in 2016–17, 6.7 T2D cases per 1,000 were described (Titmuss et al., 2022). Alarmingly, there are reports of Aboriginal and Torres Strait Islander children as young as 4 years old diagnosed with T2D (Titmuss et al., 2022). This younger onset T2D highlights a potential intergenerational risk, with a Canadian study observing higher incidence of T2D for people ≤30 years born to First Nations mothers experiencing T2D during pregnancy (5.63 per 1,000 person-years), compared to First Nations mothers with gestational diabetes (1.67 per 1,000 person-years) or no diabetes (0.83 per 1,000 person-years) during pregnancy (Wicklow et al., 2018). Such striking disparities and the intergenerational risk of T2D highlight an urgent need for further research into the causes and impacts of T2D in Indigenous communities. Addressing these health inequities requires targeted studies that provide the evidence needed to develop effective strategies and close the gap in T2D outcomes for Indigenous Peoples.
A meta-analysis of genomes from European (60.3%), East Asian (19.8%), ancestrally diverse African American (10.5%), ancestrally diverse Hispanic (5.9%), South Asian (3.3%) and South African (0.2%) populations revealed 611 loci associated with T2D (Suzuki et al., 2024). While a person’s genome remains largely unchanged throughout life, a multitude of environmental factors (e.g., diet, chemical exposure, chronic stress) have the potential to impact the epigenome. A cell’s epigenome comprises the modifiable biochemical molecules that interact with the genome to regulate gene expression (Portela and Esteller, 2010; Wu et al., 2023). The epigenome plays key roles in cell differentiation, development, and it influences gene expression patterns that are crucial for maintaining cellular identity and function across different tissues (Jaenisch and Bird, 2003). Epigenomics is emerging as a promising field in T2D research with multiple studies identifying epigenetic associations with T2D when profiling blood and pancreatic β-cells, as well as other tissues of relevance in T2D aetiology (Muka et al., 2016; Walaszczyk et al., 2018; Ling et al., 2022). As DNA methylation is the most widely studied epigenetic modification, this review will focus on the development of DNA methylation detection technologies and their application to T2D biomarker development. We refer to (Bannister and Kouzarides, 2011; Holoch and Moazed, 2015) for further discussion on the mechanisms of histone modifications and non-coding RNAs respectively, and (Kumar et al., 2024; Chi et al., 2021) for discussion of these epigenetic modifications in the context of T2D.
DNA methylation 5-methylcytosine (5mC), is a state where a methyl group is bound to the fifth carbon position of a cytosine base in DNA. DNA methylation commonly occurs in vertebrates at cytosines that precede a guanine base, a context known as CpG methylation (Li and Zhang, 2014). Non-CpG methylation, whilst common in plants, only typically occurs in mammals at appreciable levels in pluripotent stem cells and mature neurons (Lister et al., 2009; Luo et al., 2018; de Mendoza et al., 2021; Buckberry et al., 2023). While the intermediate demethylation product 5-hydroxymethylcytosine (5hmC) may be involved with DNA replication, transcription, cell differentiation and human disease (Kriukienė et al., 2024), discussion of 5hmC in the context of T2D biomarkers is beyond the scope of this review. Therefore, when discussing DNA methylation herein, we are referring to CpG methylation.
DNA methylation is a reversible epigenetic modification typically associated with gene repression, but is also associated with actively transcribed gene bodies and, in some contexts, with gene activation (Schübeler, 2015; Greenberg and Bourc’his, 2019). In human somatic cells, ∼70–80% of CpGs are methylated (Greenberg and Bourc’his, 2019), with significant variation between different cell types (Loyfer et al., 2023). While some transcription factors are sensitive to CpG methylation (Yin et al., 2017), which can impact gene expression, knowledge of precisely how DNA methylation regulates gene expression remains incomplete, likely with distinct functions in different genomic regions and cell types (Zhu et al., 2016; de Mendoza et al., 2022). DNA methylation is a dynamic process with methyl groups added to cytosines by DNA methyltransferase (DNMT) enzymes and removed either passively during incomplete transfer of methylation patterns during cell replication or actively by ten-eleven translocation (TET) proteins (Greenberg and Bourc’his, 2019). Through these mechanisms, it has been postulated that the dynamic nature of DNA methylation allows the methylome and therefore gene expression to respond to environmental stimuli (Martin and Fry, 2018; Cavalli and Heard, 2019; Law and Holland, 2019).
Identifying DNA methylation patterns that correlate with environmental factors is of particular interest in the development of biomarkers for chronic diseases, as many of these conditions develop due to an interplay between genetic and environmental influences (Boye et al., 2024). While genetic variants can be useful for identifying people at increased risk of developing a condition, they give little indication of current health states or reflect response to therapy. In contrast to genetics, a person’s environment can change throughout life with a myriad of exposures from diet to heavy metals, pollution, tobacco smoke and other toxins known to associate with DNA methylation levels in individual genes or larger global patterns of change (Martin and Fry, 2018; Yousefi et al., 2022). While changes to the epigenome that impact cancer susceptibility are well reported, emerging evidence indicates a role for the epigenome in T2D (Feinberg, 2018). Given the recent global surge in T2D prevalence, it is highly likely that the environmental factors that heavily contribute to T2D aetiology, likely also influence DNA methylation in some tissues. Therefore, identifying DNA methylation patterns that associate with T2D incidence and progression could contribute to the development of new predictive tools that improve our ability to identify at-risk individuals, thereby reducing the burden of this condition.
2 Technologies for identifying DNA methylation changes associated with T2D
The study of DNA methylation has advanced exponentially since 1975 when it was first postulated that DNA methylation contributed to gene regulation (Holliday and Pugh, 1975). Since then, a plethora of indirect and direct sequencing techniques have been developed to profile genome-wide DNA methylation patterns. This review will focus on the most commonly utilised strategies; candidate gene analysis, microarray based technologies, and next-generation sequencing (short read sequencing), as well as the emerging potential of long-read sequencing. For discussion on liquid chromatography, mass spectrometry, methylation sensitive restriction enzymes, affinity enrichment and biosensing, see (Zhang et al., 2024).
2.1 Candidate gene analyses
Candidate gene analyses are the screening of individual CpG sites for methylation within small regions of interest that are selected a priori, such as a gene promoter, typically by bisulfite PCR sequencing (Frommer et al., 1992). This technique uses sodium bisulfite to chemically convert unmethylated cytosines to uracil, leaving methylated cytosines as cytosine (Frommer et al., 1992). Multiple factors can influence the reliability of bisulfite PCR candidate gene analysis including, but not limited to, inefficient bisulfite conversion leading to an overestimation of DNA methylation (Krueger et al., 2012). While bisulfite PCR sequencing remains a popular validation method, several other methodologies have been developed, including bisulfite pyrosequencing (Pajares et al., 2021). Like bisulfite PCR, pyrosequencing utilises bisulfite conversion as the differentiating mechanism for DNA methylation, however it differs from sequencing in that differential methylation is detected via a real time luciferase reaction (Colella et al., 2003). While enabling the discovery of DNA methylation signatures in selected regions, candidate gene analyses are limited in analytical potential as they can not assess global DNA methylation patterns (Shabalin et al., 2015). However, candidate gene analysis is a cost-effective method widely used to validate microarray data.
2.2 DNA methylation microarrays
Microarray technologies capitalise on single stranded DNA’s affinity for hybridisation to a complementary strand, and underwent extensive development in the 1990s (Southern et al., 1999). One of the most widely used technologies today is the Illumina BeadChip, which uses complementary strand probes and fluorescent nucleotides to bind 50 base-pair (bp) lengths of DNA commencing at a CpG site of interest (Bibikova et al., 2009). Differential fluorescence levels then allow determination of the presence of DNA methylation in that to 50-bp lengths sequence (Bibikova et al., 2009). The CpG sites selected for screening with microarray represent only a fraction of CpG sites within the genome (Figure 1), with sequences for probes determined from the human reference genome (Noguera-Castells et al., 2023). The first large-scale DNA methylation microarray from Illumina, the 27 K BeadChip, was created in 2008 to target sequences within proximal promoter regions (Bibikova et al., 2009), but was replaced in 2011 by the 450 K microarray which expanded the scope by adding probes for a wide range of sites including but not limited to CpG islands, RefSeq gene regions, and a selection of enhancers (Bibikova et al., 2011). Microarray capacity was further expanded in 2016 with the release of the 850 K microarray, which further extended the range of enhancer sites (Pidsley et al., 2016). The latest Illumina EPIC DNA methylation microarray (900 K) includes the addition of probes to study open chromatin as well as additional enhancer locations (Noguera-Castells et al., 2023). Microarrays are widely used, particularly in large population studies, due to their lower costs and ease of use (Noguera-Castells et al., 2023). However, the limited proportion of CpG sites selected to be included on the microarray platform restricts capacity to screen for biomarkers to pre-selected regions. A recent whole genome bisulfite (WGBS) study of 205 healthy tissues representing 39 cell-types reported that the widely-used Illumina 450 K microarray and EPIC microarray only included 14% and 24% respectively of cell-type-specific differentially methylated blocks (defined as five or more CpG sites) (Loyfer et al., 2023). Microarrays also have reduced efficacy due to cross-reactivity or genomic sequence differences (Pidsley et al., 2016). As microarray development is heavily influenced by the human reference genome, current platforms under-represent global human diversity, due to the majority of the human reference genome sequence arising from one individual (Schneider et al., 2017). Thus, the suitability of microarray for measuring DNA methylation levels in individuals and populations whose genomes differ substantially from the human reference genome, is unknown.

Figure 1. Comparison of candidate gene studies, microarrays, and methylome sequencing approaches for DNA methylation biomarker studies. Genome CpG coverage varies significantly across methods. Candidate gene approaches can typically assess tens to thousands of CpG sites, microarrays (such as Illumina 450 k and 850 k platforms) cover hundreds of thousands of CpG sites, and methylome-wide approaches capture nearly all 29.4 million CpG sites within the human genome (hg38, autosomes, X and Y). Base- and strand-level resolution highlights the ability to measure methylation at single-CpG resolution on individual DNA strands, and is only practically feasible with methylome sequencing. Microarrays provide site-level resolution based on reference genome sequence, but do not typically distinguish between strands, while candidate gene approaches are limited to specific loci typically without strand information. The potential for participant re-identification increases with the scale and resolution of the data. Sequencing-based methods pose a higher risk due to the comprehensive and individual-specific nature of the genomic data, requiring robust data governance and privacy protections. Microarrays and candidate gene studies present lower re-identification risks, as they capture less data and provide limited genomic context. Relative cost per sample reflects the resources needed for data production and analysis. Candidate gene approaches are the most cost-effective, while microarrays offer a balance of affordability, with methylome sequencing being the most costly due to sequencing and computational demands. Raw data sizes illustrate the storage demands of each method. Candidate gene studies generate minimal data (<10 MB per sample), while microarrays produce 16–20 MB per sample. In contrast, methylomes at 30x coverage produce approximately 110 GB of raw data per sample (compressed FASTQ format) and about 62 GB of mapped data (CRAM format), accounting for ∼10% data loss through PCR duplicates and read filtering, however this can be highly variable. The computational resources required increase with data complexity. Candidate gene and microarray studies can typically be processed on desktop computers or small servers, while methylome analyses often require servers or high-performance computing (HPC) environments. The shift to advanced computing infrastructure is driven by the large datasets and computationally intensive analyses associated with sequencing-based studies.
2.3 Whole-genome DNA methylation profiling
With improving technology and corresponding cost reductions, sequence-based whole genome DNA methylation profiling is becoming an increasingly viable option for profiling the methylome. WGBS is a widely used method that detects methylated cytosines by fragmenting DNA, treating it with sodium bisulfite to convert unmethylated cytosines to uracil (read as thymine after PCR), and next-generation sequencing of the fragments. Methylation is determined by aligning reads to a reference genome and analysing the proportions of cytosine and thymine at CpG sites (Lister et al., 2009). The WGBS technique can produce base-level resolution coverage of ∼94% of CpG sites within the genome (Lister et al., 2009), leading to it quickly becoming the gold standard. Whilst many related enrichment-based methods coupled with sequencing exist, such as reduced representation bisulfite sequencing, they all feature biases towards different genome features or sequence contexts [reviewed in Plongthongkum et al. (2014)]. However, WGBS also features some limitations, including higher costs and the requirement of ∼200–500 ng of DNA (Li and Tollefsbol, 2021). A recently published technique that is capable of utilising existing WGBS data analysis methods is enzymatic DNA methylation sequencing (EM-Seq), which employs TET2, T4-BGT and APOBEC3A enzymes to transform methylated cytosines to uracil prior to sequencing (Vaisvila et al., 2021). EM-Seq has improved coverage of CpG rich-regions (such as promoters and CG islands), requires as little as 100 pg of DNA and gives coverage of ∼96% of 5mC bases (Vaisvila et al., 2021). Overall, the use of sequencing-based techniques capable of measuring genome-wide base-level DNA methylation levels can provide deeper insights into how DNA methylation is associated with different health states, especially when population genetic differences with respect to reference genomes may need to be accounted for.
However, the short read lengths of next-generation sequencing have a correspondingly low mapping accuracy with highly repetitive sequences (Treangen and Salzberg, 2011). These limitations can be overcome by the use of long read sequencing techniques, such as PacBio HiFi and the Oxford Nanopore sequencing, which can measure DNA methylation for long sequences of several kilobases directly from DNA, without the need for bisulfite or enzymatic conversion (Searle et al., 2023). However, these long-read technologies are not yet widely used in biomarker discovery due to the inability to rival the cost, throughput, and input DNA requirement of short read technologies (Chen et al., 2023). However, it is anticipated that long read sequencing technologies will dominate DNA methylation studies in years to come.
3 Improvements in DNA methylation detection technologies have accelerated the development of T2D biomarkers
3.1 The current state of T2D DNA methylation biomarkers
The major attraction of epigenetic biomarkers is the potential for translation to preventative and precision medicine, where improving the prediction of the likelihood of a condition would enable earlier intervention and targeted mitigation strategies (Skinner, 2024). Currently, the diagnosis of many chronic health conditions occurs at presentation of symptoms, with treatments instigated in response to diagnosis. With T2D, this occurs when the body is no longer able to cope with sustained excess glucose, with unmanaged hyperglycaemia resulting in macro and microvascular damage that can cause life threatening complications (American Diabetes Association Professional Practice Committee, 2024). If T2D is identified early, lifestyle changes (such as improving diet and exercise) and appropriate treatment, can prevent or delay T2D development (DeFronzo et al., 2015). Emerging evidence indicates DNA methylation biomarkers may contribute to improving risk stratification for T2D, with a recent longitudinal study with the Generation Scotland cohort reporting improvements in 10-year incident risk prediction scores with the addition of DNA methylation data (Cheng et al., 2023). In the following sections, we discuss the current knowledge, limitations and future opportunities in the identification of DNA methylation biomarkers for T2D.
3.2 Candidate gene studies
Several findings to emerge from candidate gene analyses are the identification of differential DNA methylation within the PPARGC1A, insulin (INS) and PDX-1 gene promoters within pancreatic islets. One case-control study identified differential DNA methylation in four CpG sites in the PPARGC1A promoter between 10 participants with T2D (50.0% male; mean age: 65.1 years with SEM ± 2.6 years) and nine without T2D (77.8% male; mean age: 54.2 years with SEM ± 3.5 years) where the group with T2D had a higher average DNA methylation of 10.5 ± 2.7% compared to those without T2D 4.7 ± 0.9% (p < 0.04) (Ling et al., 2008). This finding extended to reduced PPARGC1A gene expression levels within participants with T2D (p = 0.002) (Ling et al., 2008). The study further found that knockout of PPARGC1A in human islets resulted in a 41% decrease (p ≤ 0.01) in insulin secretion (Ling et al., 2008). Another study investigating 25 CpG sites within the INS gene promoter of nine participants with T2D (55.6% male; mean age: 57.0 years with SD ± 13.1 years) and 48 participants without T2D (54.2% male; mean age: 56.7 years with SD ± 10.1 years) observed four CpG sites that had significantly increased DNA methylation in participants with T2D (Yang et al., 2011). These findings were associated with a 58% reduction (p = 0.002) in INS gene expression, 57% reduction (p = 0.004) in insulin content and a 26% reduction (p = 0.04) in glucose-stimulated insulin secretion (Yang et al., 2011). While investigation of the PDX-1 gene from nine participants with T2D (55.6% male; mean age: 57.0 years with SD ± 13.1 years) and 55 participants without T2D (52.7% male; mean age: 56.7 years with SD ± 9.8 years) revealed 10 CpG sites within the distal promoter and enhancer regions that were hypermethylated (Yang et al., 2012). Expression analysis revealed PDX-1 mRNA expression had significant positive correlation with insulin mRNA expression and glucose-stimulated insulin secretion, and negative correlation with HbA1c levels and BMI (Yang et al., 2012). Participants with T2D were observed to have significantly reduced expression of PDX-1 (0.40 ± 0.076 compared to 1.29 ± 0.15; p = 2 × 10−4 for participants without T2D) (Yang et al., 2012). Further analysis within the above studies suggested that PPARGC1A, INS, and PDX-1 have an important role in insulin secretion and that gene expression differences influenced by methylation patterns of these genes in T2D is associated with regulation of insulin secretion and insulin content. Moreover, preliminary data from blood samples indicate associations with reduced DNA methylation levels and increased expression for genes in insulin signalling and metabolism with T2D in chronic kidney disease (Khurana et al., 2023). Together, these studies demonstrate the ability of candidate gene analysis to detect differential DNA methylation associated with T2D for relevant genes selected a priori, however candidate gene analysis is typically now only utilised as a validation tool.
3.3 Microarray-based studies
DNA methylation microarrays have become widely used for T2D biomarker development due to their cost effective ability to screen orders of magnitude more CpG sites than candidate approaches. This has resulted in differential DNA methylation signatures being identified in clinically significant tissues for T2D (adipose, skeletal muscle, liver and pancreas), however the predominant tissue utilised for screening thus far remains blood (Muka et al., 2016; Walaszczyk et al., 2018; Willmer et al., 2018). In adipose tissue, a study which included 28 participants with T2D (53.6% male; mean age: 74.5 years with SD ± 4.2 years), age and sex matched to 28 participants without T2D, observed 15,627 differentially methylated loci within 7,046 genes with the Illumina 450 K Beadchip microarray (Nilsson et al., 2014). A KEGG pathway analysis of these genes showed significant enrichment in pathways including inflammation and glycan metabolism (Nilsson et al., 2014). While a study of skeletal muscle and subcutaneous adipose tissue using the Illumina 27 K microarray with 12 monozygotic twin pairs (50% male; mean age: 68.3 years with SD ± 7.7 years) discordant for T2D from Denmark, found that within the 11 pairs that provided skeletal muscle, CpG sites linked with IL8 were significantly different, and within the 5 pairs that provided subcutaneous adipose tissue, CpG sites from ZNF668, HSPA2, C8orf31, CD320, SFT2D3, TWIST1, and MYo5A showed statistically significant differential DNA methylation (Ribel-Madsen et al., 2012). In a study of liver tissue using the Illumina 450 K Beadchip, significant hypomethylation at a CpG site within PDGFA was identified in a European ancestry cohort of 96 women with obesity and T2D (mean age: 48.2 years with SD ± 6.34 years) and 96 age and BMI matched women with obesity and without T2D (41.3% methylation for participants with T2D versus 60.3% methylation for participants without T2D) (Abderrahmani et al., 2018). These findings were replicated in a German cohort of 12 participants with T2D and 53 participants without T2D (Abderrahmani et al., 2018). Increased expression of PDGFA was observed to correlate with hepatic fibrosis risk, hyperinsulineamia and insulin resistance, with levels of CpG methylation at this loci also having an inverse correlation with PDGFA expression (Abderrahmani et al., 2018). An investigation of pancreatic islet cells from people with and without T2D identified 5,584 differentially methylated CpG sites by EPIC Beadchips that were also associated with HbA1c (Rönn et al., 2023). Intriguingly, when profiling islets from individuals not previously diagnosed with T2D, the results indicate that HbA1c-associated CpG loci are predictive of future T2D (Rönn et al., 2023). Gene expression analysis further revealed 65 differentially expressed genes that were linked to 113 CpG sites associated with T2D and HbA1c (Rönn et al., 2023). Further analysis of blood samples (collected prior to T2D diagnosis) in a longitudinal cohort revealed four sites within NKX6.2, SYNPO, RHOT1, and CABLES1 that were differentially methylated (Rönn et al., 2023). Through the use of siRNA gene silencing in pancreatic islet cells, FOXP1, TBC1D4, RHOT1 and CABLES1 were observed to have functional involvement in glucose-stimulated insulin secretion (Rönn et al., 2023). Further analysis in Rhot1 knock-out rat β-cells observed Rhot1 was integral for glucose-stimulated insulin secretion and mitochondrial function (Rönn et al., 2023). Although some evidence of differential DNA methylation has been observed in these tissues, a common limitation of many tissue-based studies is their relatively small sample size compared to other biomarker discovery studies. This limits statistical power and increases the risks of false-negative findings. Additionally, while detectable statistical differences in DNA methylation may offer mechanistic insights for tissue-based studies, the need for tissue sampling complicates the translation of these findings into clinical biomarkers. Aside from the blood sample sub-analysis undertaken by Rönn et al. (2023), all of the above described studies have used a retrospective case-control design. Retrospective case-control studies are quicker and cheaper to undertake comparative to longitudinal studies investigating incident cases, however a major limitation in T2D biomarker research is that it is practically impossible to determine if the identified biological variation pre-dated the onset of T2D or was a consequence of T2D. By conducting longitudinal studies where samples are available prior to clinically identifiable T2D, for participants who later develop T2D, it can be possible to identify potential DNA methylation biomarkers predictive of future T2D development. See Table 1 for studies reviewed herein that have longitudinal designs.
Differential DNA methylation signatures for T2D have been identified in blood samples using microarray platforms in several large population studies, as well as two meta-analyses. In one study investigating the incidence of T2D in people of Indian Asian ancestry (discovery cohort; 1,074 with T2D and 1,590 without T2D) and people of European ancestry (replication cohort; 377 with T2D and 764 without T2D), five CpG sites (TXNIP, ABCG1, PHOSPHO1, SOCS3, and SREBF1) were identified by Illumina 450 K microarray as having a statistically significant association with T2D after replication (Table 1) (Chambers et al., 2015). While the mechanisms for this variation are unknown, the authors postulate that future functional gene analysis of these genes will reveal involvement in T2D development due to the association of these sites with genes involved in metabolic pathways (Chambers et al., 2015). While in another Illumina 450 K microarray study, 18 CpG loci were significant in a cohort of European ancestry people (563 with T2D and 701 without T2D) and 14 and 16 of these showed the same directional change (p < 0.05) in separate cohorts (Table 1) (Cardona et al., 2019). Gene set enrichment analysis of these sites indicated enrichment in pathways involved in cholesterol biosynthesis, carnitine metabolism and AMPK signalling (Cardona et al., 2019). A more recent study investigated incident (534 with T2D and 13,437 without T2D) and prevalent (348 with T2D and 14,002 without T2D) T2D on a platform screening 752,722 CpG sites within the Generation Scotland cohort (Hillary et al., 2023). This study reported 58 significant CpG sites within the incidence investigation and 52 significant CpG sites within the prevalence investigation, 17 of which overlapped between investigations (Table 1) (Hillary et al., 2023). Here, several significant pathways including cholesterol biosynthesis and cholesterol metabolism were associated with T2D for the prevalence cohort (Hillary et al., 2023). Further, a meta-analysis of European ancestry populations (340 with T2D and 3,428 without T2D; Illumina 450 K Beadchip) revealed 6 CpG sites (TXNIP, ABCG1, CPT1A, HDAC4, SYNM, and MIR23A) as well as 77 differentially methylated regions that were associated with T2D (Juvinao-Quintero et al., 2021). Analysis on KEGG pathways and GO terms failed to identify any enriched pathways for these genes (Juvinao-Quintero et al., 2021). The authors however caution of correlation with cell-type proportions, with all 6 CpG sites identified associating with white blood cell-type, highlighting the confounding effects of cell-type heterogeneity in blood-based studies (Juvinao-Quintero et al., 2021). This further highlights the need for adjusting for cell-type heterogeneity or cell-type deconvolution in biomarker discovery (Teschendorff and Zheng, 2017; Titus et al., 2017; De Ridder et al., 2024). In a meta-analysis of five prospective studies with people of European descent (1,250 with T2D and 1,950 without T2D; measured on Illumina Methylation EPIC chips or 450 K microarrays: 416,716 - 470,870 probes retained for analysis) 76 CpG sites were associated with T2D after accounting for age, sex, cell-type composition and batch, with four remaining significant after adjustment for BMI (Table 1) (Fraszczyk et al., 2022a). The 76 CpGs were further assessed in a separate replication cohort with people of Indian Asian descent (1,072 with T2D and 1,587 without T2D), where 64 of the CpG sites were directionally consistent (p < 0.05) on a model accounting for age, sex, cell-type composition and batch (Fraszczyk et al., 2022a). Pathway analysis by GO terms and Reactome enrichment analysis revealed several enriched pathways including phospholipid metabolism (Fraszczyk et al., 2022a). While in a smaller cohort (218 with T2D and 77 without T2D) of Indigenous (85% of cohort) and non-Indigenous young people (mean age 15 years with SD ± 3.0 years; 64% female) from Canada, the Illumina EPIC Beadchip was used to identify 3,830 CpG sites (3,725 of which were novel) with ≥1% DNA methylation difference between participants with T2D and those without (Salama et al., 2024). Furthermore, three of these CpG sites, all within the PFKFB3 gene, were also associated with maternal diabetes exposure during gestation (Salama et al., 2024). These findings indicate that DNA methylation differences between people with T2D and people without T2D are detectable in youth-onset T2D, highlighting the potential for DNA methylation biomarkers unique to this risk group.
The transition to microarray-based screening technology has enabled the identification of a broader range of T2D-associated differential DNA methylation patterns across the genome, far surpassing the capabilities of candidate gene analysis. However, the commonly used microarray platforms such as the Illumina HumanMethylation450 and the MethylationEPIC platforms only enable profiling of ∼413 k and ∼850 k CpG sites respectively (Pidsley et al., 2016), and do not provide base-level resolution of DNA methylation (Figure 1). Whereas, on 10 ng of DNA input, EM-Seq can detect 53.7 million CpG sites and WGBS can detect 36.0 million CpG sites, with 35.8 million CpG sites correlating between detection methods (Vaisvila et al., 2021). Thus, current microarray platforms can, at best, only assess ∼1.5% of the CpG sites that can be profiled by the latest short-read sequencing-based methods.
3.4 Sequencing based studies
Whole-genome DNA methylation profiling techniques such as WGBS and EM-Seq far surpass microarray detection techniques due to their unbiased, base-level resolution and the high proportions of CpG sites covered. To the best of our knowledge, there are only three reported applications of whole-genome DNA methylation profiling in the study of T2D, with all utilising case-control designs and WGBS (Table 2). The first study identified 25,820 differentially methylated regions (DMRs) (13,696 hypermethylated and 12,124 hypomethylated) in pancreatic islet cells from six participants with T2D and eight participants without T2D, of which 159 DMRs were assigned to 43 of 65 previously identified candidate genes (Volkov et al., 2017). The second, used WGBS data for a targeted DNA methylation analysis of ± 5 kbs of a CpG site (ch17: 55484635) within the MSI2 gene in pancreatic islets, revealing 39 differentially methylated positions, 36 of which were hypomethylated (Jeon et al., 2017). This region was selected for investigation after a blood-based 450 K microarray study identified a neighbouring CpG site (chr17:55484600) as differentially methylated and blood-based pyrosequencing within this site confirmed this differential DNA methylation (Jeon et al., 2017). While acknowledging different tissues were screened, the use of WGBS revealed finer-grained detail about the DNA methylation patterns of the MSI2 gene than either the 450 K microarray or pyrosequencing. The third study, reported 9,025 DMRs (3,269 hypermethylated and 5,756 hypomethylated) mapping to 2,019 differentially methylated genes (DMGs), 77 of which were in previously identified candidate genes, from the spermatozoa from eight men with T2D and nine without (Chen et al., 2020). As demonstrated by these studies, the correlation of multiple DMRs with previously identified candidate genes demonstrates that whole-genome technologies produce concordant findings with earlier DNA methylation detection technologies. Furthermore, the generation of large numbers of DMRs highlights the potential of whole genome technologies to reveal previously unknown relationships between the DNA methylome and T2D, which could reveal new mechanistic insights into the pathogenesis of T2D. Given T2D is a complex and phenotypically heterogeneous condition, future T2D biomarker discovery studies should ideally consider whole-genome profiling techniques to enable a larger proportion of the methylome to be profiled, as many genomic regions of potential change are not assessable by microarray platforms.

Table 2. Features of Type-2 diabetes studies that have used whole genome sequencing methods to measure DNA methylation.
3.5 The majority of reproducible DNA methylation signatures for T2D have been identified from blood samples
Blood is the most commonly assayed tissue in DNA methylation screening, particularly within studies of large populations. Blood samples are ideal for biomarker development because they are less invasive to collect than the tissues of direct clinical significance, and phlebotomy is already a routine clinical procedure. Importantly, when comparing differential DNA methylation findings for T2D from blood and tissue samples of significance to T2D from the same individual, several CpG marks have been shown to correlate between blood and tissue samples. For example, paired blood and liver samples from 175 people were used as a validation assessment for five blood-identified differentially methylated CpG sites (TXNIP [cg19693,031], ABCG1 [cg06500161], PHOSPHO1 [cg02650017], SOCS3 [cg18181703] and SREBF1 [cg11024682]), with TXNIP (p = 0.02) and SOCS3 (p = 5.3 × 10−5) loci identified to correlate (Chambers et al., 2015). These findings were replicated using matched blood, skeletal tissue and adipose tissue from nine monozygotic twin pairs discordant for T2D, with correlations found between blood and adipose tissue for SOCS3 and SREBF1 sites (Regression coefficient = 0.31 and 0.40, p = 0.010 and 0.052, n = 28) (Dayeh et al., 2016). Despite low sample numbers in matched analysis, several studies assaying blood and other tissues in unmatched samples have demonstrated that some tissue-based DNA methylation patterns are reflected in blood. For example, 57.7% of age-related epigenetic changes identified in pancreatic islet cells from 87 participants without T2D were reflected in the blood of a second cohort of 421 participants without T2D (Bacos et al., 2016). Additionally, 67.8% of age-related epigenetic changes in livers of 95 people (35 of which with T2D, but sub-analysis by T2D status was not presented) undergoing Roux-en-Y gastric bypass were also identified in white blood cells of a separate cohort of 421 participants (13,022/13,631 were also concordant with DNA methylation direction) (Bysani et al., 2017). While biomarkers from tissues directly involved in T2D pathology would be ideal, blood appears to be a suitable proxy given its practicalities and the emerging evidence of correlations with tissues of significance. Further studies with matched tissue and blood samples that investigate the extent of the DNA methylation correlation between blood cells and the tissues involved with T2D would further aid in identifying accurate tissue correlated proxy loci that can be assessed by blood cell profiling.
3.6 Reproducibility and functions of differentially methylated genes
Several genes associated with differential DNA methylation sites consistent with T2D have been identified by multiple studies and systematic reviews containing case-control data (Table 1) (Muka et al., 2016; Walaszczyk et al., 2018; Willmer et al., 2018). Genes with frequently documented differential DNA methylation in blood samples include TXNIP, ABCG1, SREBF1 and CPT1A in the context of both incidence and prevalence studies. When searching the GWAS catalogue (Sollis et al., 2023), the genes ABCG1 (Mansour Aly et al., 2021) and SREBF1 (Ray and Chatterjee, 2020; Vujkovic et al., 2020) have been identified as being significantly associated with T2D. This provides independent lines of evidence that epigenetic changes linked with T2D at these loci are occurring in regions of the genome known to be associated with T2D genetic risk.
When reviewing the biological pathways and molecular functions of these genes, there is either a direct link with cellular mechanisms involved with T2D pathogenesis, or associated cardiometabolic phenotypes. The TXNIP gene, which codes for the Thioredoxin-interacting protein, is involved in several inflammatory and redox biochemical pathways, with inflammation as a known byproduct/precipitating factor of T2D development (Choi and Park, 2023). Of particular interest, a murine model has demonstrated that TXNIP causes apoptosis in mouse pancreatic β-cells when upregulated in response to high glucose levels (Chen et al., 2008). Moreover, the expression of the ABCG1 gene has been shown in mice to be integral to cellular efflux of cholesterol and prevention of atherosclerosis, a common comorbidity of T2D (Wang et al., 2004; Kennedy et al., 2005; Out et al., 2007). Thus, it has been proposed that increasing ABCG1 expression may have protective effects against atherosclerosis development (Frambach et al., 2020; Matsuo, 2022). With respect to SREBF1 (also known as SREBP1), a murine model engineered to not express SREBP1 showed that in the refeed state following a fast, mice without SREBP1 failed to induce liver lipogenesis (Shimano et al., 1999), an important biochemical pathway for converting excess carbohydrate to lipids. Similarly, the CPT1A gene involved in fatty acid oxidation in the mitochondria (Schlaepfer and Joshi, 2020), has been observed to complex with acyl-CoA synthetase and voltage-dependent anion channels in the mitochondrial outer membrane to facilitate uptake of fatty acids (Lee et al., 2011), which is important for maintaining metabolic equilibrium. Whilst detecting correlations of blood-based DNA methylation measures with T2D does not imply the same epigenetic differences exist in the tissue or organs underpinning the condition, such as the pancreas, kidney, and muscle glucose uptake, the direct implication of these genes with strong T2D relevant pathways does indicate that blood cell DNA methylation patterns may reflect changes at the tissue level, or wider systemic epigenetic change as a result of the condition.
4 The potential to incorporate epigenetic age for biomarker discovery
Age is a risk factor for many chronic conditions, including T2D, and can be defined as the deterioration of cellular processes that contribute to aberrant health outcomes (López-Otín et al., 2013). A person’s rate of ageing is variable, and can be influenced by multiple internal (intrinsic) and environmental (extrinsic) factors. Epigenomic alterations have been classified as one of 12 hallmarks of ageing, with DNA methylation changes observed to correlate with advancing age (López-Otín et al., 2023). This observation has led to the development of ‘epigenetic clocks’, which utilise machine learning models and regression statistics to estimate the epigenetic age of an individual based on methylation levels at specific CpG sites (Horvath and Raj, 2018). The calculated epigenetic age can then be compared to years since birth age (chronological age) to determine if epigenetic ageing is in acceleration or deceleration (Horvath and Raj, 2018).
While multiple epigenetic clocks have been developed, they can be broadly classified into three generations. First generation clocks use chronological age metrics as the primary source of data in their calibration, with Horvath’s (Horvath, 2013) and Hannum’s (Hannum et al., 2013) clocks the most popular from this generation. Second generation clocks include measures of health and lifestyle factors (e.g., smoking) in their design to better predict morbidity and mortality, including the GrimAge (Lu et al., 2019) and PhenoAge (Levine et al., 2018) clocks. Third generation clocks have transitioned to investigating rates of ageing, which include the DunedinPoAm (Belsky et al., 2020) and its updated version the DunedinPACE clock (Belsky et al., 2022). Of note, all six clocks have been developed for microarray data, highlighting the wide application of microarray technology in biomarker investigation. In addition to this, with the exception of Horvath’s, which used data from 51 healthy tissues, blood was the tissue utilised in development; further highlighting the utility of blood as a suitable tissue for developing epigenetic age estimates.
In addition to being considered as a condition of interest in the development of several clocks (PhenoAge, GrimAge), several studies have investigated the utility of epigenetic age in predicting T2D. For instance, HorvathAge, HannumAge, PhenoAge, GrimAge, Telomere Length, and DunedinPoAm clocks were tested on multiple morbidity and mortality conditions within the Generation Scotland cohort, and several were found to significantly associate with incident T2D (GrimAge Hazard Ratio (HR) = 1.52, 95% CI = 1.20-1.90; PhenoAge HR = 1.54, 95% CI = 1.21–1.97) (Hillary et al., 2020). However, in an investigation with Horvath, Hannum, GrimAge and PhenoAge clocks in a nested case-control T2D study with people from the prospective Doetinchem Cohort Study in the Netherlands, no significant difference was observed with epigenetic ages, and when assessing age acceleration, only the Horvath clock was significant (Fraszczyk et al., 2022b). An investigation of PAI-1, Telomere Length, DunedinPACE, PCHorvath1, PCHorvath2, PCHannum, PCPhenoAge and PCGrimAge clocks was undertaken with participants with and without diabetes from the Swedish Adoption/Twin Study of Aging (study did not distinguish type of diabetes, but assumed 94% of participants had T2D), where it was observed that on smoothened average curves within the 60–70 years age range, DunedinPACE and PAI-1 measures for the participants with diabetes were significantly higher (Wikström Shemer et al., 2024). However when all clocks were assessed by Cox Proportional Hazard modelling no significance was observed (Wikström Shemer et al., 2024). While an investigation of the relationship between DNA methylation age and mortality in a cohort with T2D from Italy found that, after adjusting for risk factors, accelerated epigenetic age was associated with increased mortality (PhenoAge HR = 1.16, 95% CI 1.05-1.28 and DunedinPoAm HR = 3.65, 95% CI 1.43–9.35) (Sabbatinelli et al., 2024). Although it is promising that several epigenetic clocks have observed correlations between ageing rate and T2D, non-significant results with several clocks suggests further refinement of epigenetic clocks for the study of T2D risk is required. One suggested approach for improving the utility of epigenetic age measurements for T2D would be to generate T2D-specific biomarker clocks (Wikström Shemer et al., 2024). Given the above described epigenetic clocks rely on microarray data, a promising method for the identification of novel CpG sites suitable for a T2D specific biomarker is whole genome DNA methylation profiling, with several techniques capable of screening ∼96% of CpG sites. Given that microarrays assess less than one million CpG sites, epigenetic clocks generated from whole genome DNA methylation profiling data have the potential to have greater applicability for T2D prediction and monitoring.
5 Future directions
In order to ensure equity in T2D biomarker development, researchers should seek to diversify their study populations to ensure that the translation of biomarkers to the clinic provides equitable representation for populations of a broader set of ancestral backgrounds. The majority of T2D biomarker discovery has occurred within people of European ancestry, biassing findings toward genetic and environmental factors present with people of European origins. It has been demonstrated that the precision of polygenic risk scores for common conditions decays when applied to different ancestry groups (Kachuri et al., 2023; Moreno-Grau et al., 2024). However, the addition of DNA methylation data could assist with improving transferability of polygenic risk scores (McCartney et al., 2018), with some preliminary evidence indicating reduced impact of ancestry on DNA methylation association with medically-relevant phenotypes (Thompson et al., 2022). Whilst it is unclear if the addition of DNA methylation data will improve polygenic risk prediction for people of non-European ancestry, broadening the population diversity in biomarker development will likely have the greatest impact in improving predictive accuracy across diverse ancestries. This approach could mitigate current barriers to diagnosis and management of chronic conditions, ensuring that epigenetic biomarker development delivers more reliable health predictions for underrepresented populations. This need for greater diversity within DNA methylation biomarker development has been demonstrated by a comparison of microarray generated DNA methylation data from blood of healthy people from European ancestry and South Asian ancestry, which observed 16,433 differentially methylated sites; the majority (76%) of which could be ascribed to different cell-type compositions within ancestry groups (Elliott et al., 2022).
T2D is a condition of global significance that disproportionately affects Indigenous Peoples who are impacted by settler colonialism (Harris et al., 2017). While it is encouraging to see studies such as the Strong Heart Study among American Indian communities (Domingo-Relloso et al., 2022), and the iCARE study working with young First Nations people in Canada (Salama et al., 2024), it is imperative further efforts ensure more Indigenous communities have equal opportunity to benefit from biomarker development. Of note, the iCARE study consisted of 85% Indigenous participants (n = 251) and 15% non-Indigenous participants (n = 44), including a subanalysis comparing data between Indigenous and non-Indigenous participants and observed that five of the 3,830 identified differentially methylated CpG sites had differing directions of methylation (Salama et al., 2024). While the high concordance between differential DNA methylation findings between the Indigenous subset and the wider cohort demonstrates the robustness of DNA methylation as a biomarker, the observation of discordant sites suggests a need for greater representation of Indigenous Peoples within T2D biomarker discovery. Furthermore, while not in the context of T2D, a study investigating epigenetic ageing of Native Hawaiian, Japanese American and White participants from Hawaii observed Native Hawaiian people to have a significantly accelerated ageing rate comparative to the other participants (Maunakea et al., 2024). The authors identified several socio-economic and health factors that were protective, as well as several that increased risk, highlighting the importance of the environment in understanding DNA methylation (Maunakea et al., 2024). Given the great sociological and cultural diversity that exists amongst Indigenous Peoples globally, different Indigenous communities likely have different environmental influences that could impact DNA methylation. This highlights the importance of ethical inclusion and equitable representation of Indigenous communities within DNA methylation biomarker research to ensure T2D biomarker development contributes to equitable health outcomes.
Future DNA methylation studies should aim to capture a wide range of age groups, as many prior T2D DNA methylation studies are composed of individuals who are middle aged or older (Tables 1 and 2). Given that DNA methylation patterns change throughout life (Jones et al., 2015), a skew in T2D biomarker studies towards older people risks T2D epigenetic biomarker development becoming further biassed towards older populations. This is of concern for younger people at risk of T2D due to increasing global incidence of younger onset T2D which is often accompanied by more severe phenotypes (Bjornstad et al., 2023). In light of this, future DNA methylation investigations should endeavour to recruit a wide range of age groups to ensure T2D DNA methylation biomarkers enable accurate prediction of condition risk for younger populations.
6 Ensuring ethical genomics and equity for Indigenous Peoples in T2D biomarker development
The application of genomics for human health over the past decades has not been equitable, as biases in studies have arguably limited the benefits to specific populations, often to the disadvantage of Indigenous Peoples (Mills and Rahal, 2019; Fatumo et al., 2022). While it could be argued that providing equitable benefits to Indigenous people can be achieved by diversifying recruitment (see discussion below), it could also be argued that true equity can not be achieved while lack of diversity also exists within the human reference genome, genomic functional annotations and databases that catalogue genetic variation. This is of particular importance in the context of T2D biomarker research with Indigenous communities, due to the workflow reliance of popular DNA methylation detection technologies on the human reference genome and functional annotation databases. Currently, the most commonly used human reference genome is GRCh38, which derives ∼70% of its sequence from a single individual of African/European ancestry (Schneider et al., 2017). While recent efforts to improve the human reference genome has seen the release of T2T-CHM13, which improves GRCh38 by filling heterochromatin gaps, the cell lineage utilised is still of majority European ancestry (Nurk et al., 2022). While it is encouraging that improvements to the human reference genome are occurring (Liao et al., 2023), lack of diversity can introduce unknown biases into epigenetic biomarker development. For example, recent whole-genome sequencing of genomes from Indigenous Australians found that while T2T-Ch13 was a more accurate reference than GRCh38, large numbers of unannotated structural variations were identified (Reis et al., 2023). Moreover, 3.4 million single nucleotide variants identified among Indigenous Australians are not present in either the 1,000 Genomes or Human Genome Diversity projects (Silcocks et al., 2023). The lack of diversity in current reference genomes could impact the accuracy of DNA methylation quantification as the human reference genome is typically required as a reference for sequence read mapping and with microarray probe design. If any discordance surrounding CpG loci differences between populations are not accounted for, it is plausible that results may have population-specific biases. Until human reference genomes and analysis methods are able to reflect global human diversity, there is a risk that any T2D DNA methylation biomarkers developed without Indigenous populations may have lower efficacy for the populations with some of the highest burden of T2D.
Rates of T2D in Indigenous communities impacted by colonisation often exceed that of non-Indigenous people of the same country or region (Harris et al., 2017). While the interactions between individual Indigenous communities and the colonial powers (and their established medical and research institutions) on their lands may be unique, many burdens resulting from colonialism are shared. These include, racism, trauma, disruptions to family structures, and shifts towards a Western diet and lifestyle (Wolfe, 2006; Glenn, 2015; Paradies, 2016). Additionally, many Indigenous communities have experienced exploitation of their bodies, bioresources and knowledge by research practices that sought to document, catalogue and exploit Indigenous Peoples and their ways of being (Turnbull, 2007; Reid et al., 2019). In addition to experiencing disproportionate rates of T2D, Indigenous people can experience disadvantages that can arise due to epistemic racism and a failure to consider historical contexts surrounding how Indigenous people have experienced Western healthcare (Sinclaire et al., 2023). For many Indigenous communities, these experiences can carry through generations as intergenerational trauma and have profound effects on health and wellbeing (Griffiths et al., 2016). In this domain, there are concerns that researchers investigating epigenomics of intergenerational trauma with Indigenous communities impacted by colonisation may unintentionally further perpetuate trauma (Saulnier et al., 2022). Thus, it is important that when undertaking genomics studies with Indigenous communities impacted by colonisation, researchers develop an understanding of a community’s individual experiences with colonialism, and how that can impact both their health as well as their relationship with biomedical research.
Attempting to increase diversity in genomics-based biomarker development requires an understanding of what has led to the current lack of diversity. Underrepresentation of Indigenous Peoples within genomics datasets can be partially attributed to self-exclusion due to previous experiences with exploitative Western research practices, lack of consultation with Indigenous Peoples in the design of projects, and the requirements for open-access data (Sherwood, 2013; Garrison et al., 2019; Hudson et al., 2020; Mc Cartney et al., 2022). Sharing data and open access data under the ethos of the Findable, Accessible, Interoperable and Reusable (FAIR) data principles (Wilkinson et al., 2016) is well established in human genomic research. However, open access data is of concern for many Indigenous communities (Hudson et al., 2020). Indigenous Peoples may not receive the same benefits from the use of their genomics data, as communities have raised concerns that they may not have control over who uses their data and what it is used for, with the risk of data being used in ways contrary to cultural protocols (Hudson et al., 2020; Mc Cartney et al., 2022). To mitigate risks from open access data, the Collective benefit, Authority to control, Responsibility and Ethics (CARE) principles were developed (Carroll et al., 2020). The CARE Principles are designed to complement the FAIR principles by providing a framework for Indigenous people to self-determine research objectives and have governance over studies and their associated data (Carroll et al., 2020). If Indigenous Peoples have no control over their data, then the scientific community risks perpetuating the same situations that lead many Indigenous communities to disengage from research in the first place, thereby jeopardising equitable T2D biomarker development. This is critically important if future T2D biomarkers are to be developed with the populations most severely impacted by the condition.
Extensive qualitative work undertaken with Australian Indigenous communities has demonstrated that respectful and ongoing consultation is critical to rebuilding trust in the field (Hermes et al., 2021). In addition to reconciling historical damages, researchers should strive to work in equal partnership with Indigenous Peoples to ensure their research is respectful of Indigenous data sovereignty and governance (Garrison et al., 2019). Researchers and institutes can achieve this through respecting and including Indigenous Knowledge, working in partnership with Indigenous Peoples throughout the project lifecourse (Claw et al., 2018), abiding by CARE principles for data management (Carroll et al., 2020), and developing policies and procedures that are protective of Indigenous biosamples and intellectual property (Garrison et al., 2019). A mechanism through which this can be operationalised is by the creation of project Indigenous governance committees (Hudson et al., 2020) which oversee study operations, sample and data use, and reporting. By empowering Indigenous communities to have authority and ownership over genomics research conducted with them, researchers and institutes can begin rebuilding trust in the genomics field (Hudson et al., 2020). As demonstrated by work undertaken with Indigenous communities in Chile (Arango-Isaza et al., 2023), engagement with Indigenous communities during data interpretation not only returns results to the community and is respectful, but it also provides a richness to the findings that only those with lived experiences can deliver. Finally, when completing the research cycle with reporting, it is important to consider a ‘strengths-based’ as opposed to ‘deficit’ narrative in reporting (Hyett et al., 2019). The above-listed references are a selection of the resources available to the genomics community for enacting Indigenous data sovereignty, for more information see Supplementary Table 1 of Mc Cartney et al. (2022).
There is an urgent need for the research community to build trust by ensuring ethical community engagement, and the inclusion of and governance by Indigenous Peoples, their knowledge, and sovereign rights. Indigenous communities globally are rich and diverse in culture and community protocols, and we provide the following general advice for commencing engagement and partnership with Indigenous communities. We advise learning more about the history and culture of the Peoples you wish to partner with, for example, by undertaking cultural awareness training delivered by an Elder, or community leader. Echoing the advice of Hudson et al. (2020), we recommend seeking out and following local guidelines (co-created with local Indigenous Peoples) for how to respectfully undertake research in partnership with local Indigenous communities. For example, the work we currently undertake is guided by The South Australian Aboriginal Health Research Accord (Morey et al., 2023). Inline with local guideline documents, we recommend initiating discussions with community representatives or spokespeople and discuss what expertise and resources the research group has and whether there is a project the community would like to partner on. If the community wishes to co-develop a project, consider collaborating with community members in the project’s design and governance, adhere to CARE principles, and implement local Indigenous data sovereignty procedures throughout the project life cycle (Claw et al., 2018; Carroll et al., 2020; Hudson et al., 2020; Griffiths et al., 2021). Furthermore, it is important for research teams and institutes to support the learning of Indigenous Peoples in how genomics data is generated, managed and analysed (Hudson et al., 2020; Waanders et al., 2023). By enhancing the genomics knowledge of Indigenous Peoples, communities will have a greater capacity to be involved with research, which will contribute toward building trust within the genomics field; thus facilitating better access to the benefits of genomics for Indigenous communities. Without such a considered approach, the T2D health disparity gap experienced by many Indigenous Peoples is at great risk of increasing.
7 Challenges of developing DNA methylation biomarkers
7.1 Reproducibility challenges
The identification of multiple differential DNA methylation patterns prior to a diagnosis of T2D, as well as post-T2D diagnosis, demonstrates the promise of DNA methylation as a potential biomarker for the identification and monitoring of T2D. However, one major limitation is that few differential DNA methylation patterns have been replicated in more than one cohort. For example, in an examination of seven EWAS studies, only 6.5% (12/185) of differentially methylated sites were identified in more than two studies (Hillary et al., 2023). There are multiple possible explanations for this, including: differing statistical methodologies; different sample sizes and study designs; variation in selection of features corrected for (e.g., age, sex and BMI); definition of T2D (self report versus clinically identified); different technologies used to assess DNA methylation; and variations in population genetics and ancestry (Hillary et al., 2023). Awareness and where possible, controlling for these variations, will improve replicability.
As DNA methylation datasets grow in depth and complexity, the analytical and statistical demands increase accordingly. Researchers in the field of epigenomics have increasingly employed machine learning (ML) strategies (Rauschert et al., 2020), however, several challenges can exist with ML. These include input variable selection, the assessment of model performance, data leakage, model performance metrics, how generalisable the developed model is to the population of interest, and sufficient sample size for analysis (Yousefi et al., 2022). In addition, the way the data is distributed between training, test and prediction data sets; how dependencies are managed within it; confounding variables; data leakage within the analysis pipeline; and balancing of the classes between training, test and prediction dataset, can all influence replicability of ML findings (Whalen et al., 2022). For extensive discussion and best practice mitigation advice on the above listed challenges, we refer the reader to the comprehensive reviews of Whalen et al. (2022) and Yousefi et al., 2022. While ML is delivering detailed findings, DNA methylation is highly variable, nuanced and diverse across both the body and time (Greenberg and Bourc’his, 2019), and ML linear regression based models may not appropriately capture this. To overcome this, a transition towards large language models has been proposed, with recent publication of DNA methylation analytical programs such as MethylGPT (Ying et al., 2024) and CpGPT (de Lima Camillo et al., 2024). These models show great potential due to their network approach, however they have been trained on microarray data, so further development may be required for use with whole genome sequencing data. The increase in dataset complexity combined with the rapid improvements in artificial intelligence facilitated statistics gives great potential to deliver improved biomarkers for T2D prediction and progression.
In addition to statistical considerations, study design could influence replicability. Initial T2D candidate gene studies typically utilised retrospective case-control cohorts, while studies with microarrays used either retrospective case-control cohorts or longitudinal cohorts investigating incident T2D. Many of the above described studies using retrospective case-control designs did not list for participants with T2D, how long they had T2D, or any T2D associated complications. So, for these studies, it is not possible to determine at what stage of T2D progression the described biomarkers correspond to. Furthermore, for the studies investigating incidence of T2D, described DNA methylation differences would be those that presented early in the trajectory of T2D development. Thus, given that T2D is a condition that exhibits a spectrum of clinical presentations (DeFronzo et al., 2015), and that DNA methylation patterns can change in response to health and environmental stimuli (Yousefi et al., 2022), it is possible that studies are identifying DNA methylation patterns that shift as the condition progresses. To improve replicability, future biomarker studies should include, if possible, descriptors of T2D duration and presence/absence of any T2D associated co-conditions to account for the phenotypical heterogeneity that occurs with this condition.
Study sample size can also influence replicability. Many candidate gene T2D DNA methylation studies had small sample sizes (often only in the 10s), which greatly increases the possibility of type II errors. The transition towards microarray technology has been accompanied by larger sample sizes, with hundreds to thousands of samples being common (Table 1). However, a major limitation of microarrays is they are limited to ∼900 k CpGs or less; over an order of magnitude less than the ∼30 million CpG sites possible with methylome sequencing (Vaisvila et al., 2021). Thus, a transition to sequencing based methods, will enable the relationship of DNA methylation with T2D to be studied at genome-wide scale. However, to support this, larger sample sizes will be required to avoid type II errors. As DNA methylation detection technologies improve and become cheaper, more replicates will facilitate further understanding of the relationship between DNA methylation and T2D.
Cell-type heterogeneity can also influence DNA methylation measures as different cell-type feature distinct DNA methylation patterns (Loyfer et al., 2023). As discussed above, blood is the most common tissue in T2D DNA methylation studies, however potential confounding can occur due to blood being composed of multiple different cell-type that can exhibit variation in their proportionality. When investigating DNA methylation from blood, cell-type heterogeneity should be accounted for. One method to control for cell-type composition is fluorescence-activated cell sorting (FACS) (Bonner et al., 1972), however this process requires specialised equipment, can be costly and must occur before DNA extraction. To overcome the infeasibility of FACS in large studies, several computational based methods have been developed to account for cell-type heterogeneity directly from DNA methylation data, with tools such as EpiDish performing particularly well for blood with both microarray and methylome sequencing data (Houseman et al., 2012; Teschendorff et al., 2017; De Ridder et al., 2024) This is particularly promising for future T2D biomarker studies, as cell-type variation can be accounted for in analysis and thus bypasses the need for physical sorting of cells prior to analysis.
7.2 Practical limitations of whole genome sequencing
While whole genome sequencing based DNA methylation approaches deliver the highest resolution datasets, several challenges present with methylome sequencing. Firstly, comparative to other techniques, sequencing approaches require more specialised equipment and are more costly to generate data. Secondly, the per-sample sequence data are exponentially larger than from candidate gene studies or microarrays, and often require high-performance computing (HPC) for data management and analysis; both of which can be costly (Figure 1). Moreover, the increasingly complex genomics datasets that are being generated require advanced data analysis expertise (Krishna and Elisseev, 2021). In the realm of biomarker development, the complexity of these datasets, combined with the need to use more sophisticated analytical methods introduces new challenges. One such challenge is the ‘curse of dimensionality’, where the large number of features vastly exceeds the sample size, making it difficult to identify meaningful patterns and increasing the risk of overfitting, thereby complicating statistical evaluation across all dimensions. This added complexity, correspondingly increases the cost and resource requirements of methylome sequencing based studies, thus making it more challenging for research teams that have a limited capacity to handle and analyse large volumes of complex sequence data.
7.3 Ethical and privacy considerations
Progress in biomarker development towards sequencing-based methylome profiling introduces additional participant privacy considerations. When considering sharing DNA methylation data obtained from sequencing, researchers should be aware that it may be possible to re-identify people and their families, and in the future, potentially deduce other information regarding health and lifestyle (Santaló and Berdasco, 2022). While the level of risk to participant privacy with data sharing is still to be determined, it cannot be ignored, due to the potential harms that can arise from exploitation of this information (Oestreich et al., 2021). There are several strategies that can be adopted to protect participant privacy, including keeping data secured through mechanisms such as restricted access and encryption, and anonymising identifiable information prior to public release (Bonomi et al., 2020). Furthermore, researchers should also be aware that epigenomic data may not receive the same level of legal protection as genomics data, which can have privacy implications, particularly when non-medical industries, such as insurance and forensics, express interest in this data (Dupras et al., 2018). For further discussion on ethico-legal considerations of epigenetic data and consent management, see (Dupras et al., 2019; 2020; Oliva et al., 2024). Given that some populations continue to experience discrimination and prejudice based on aspects of their identity, such as their ethnicity, health status and gender, we advise that the technique selected to protect privacy be informed by consultation with the study population(s). Although focusing on populations comfortable with open access data may offer easier data access and management, investing in the creation of secure data environments for all populations would have wide-ranging benefits.
8 Discussion
Our review of the microarray and sequencing-based studies above reveals some common biological pathways and functions associated with differentially methylated CpG sites, particularly in cholesterol biosynthesis, insulin signalling, and metabolism, all of which are linked to the pathogenesis of T2D and its associated complications. However, it is important to acknowledge that there are significant challenges in linking differentially methylated CpGs to precise biological functions. Firstly, CpG sites are often associated with nearby genes based solely on linear genome proximity, which may fail to reflect the complex regulatory interactions between DNA methylation at distal regulatory elements as well as gene expression (Taher and Ovcharenko, 2009; McLean et al., 2010; Yao et al., 2015). Secondly, many studies focus on DNA methylation in white blood cells, which, while easily accessible via phlebotomy, are not the primary tissue involved in T2D pathogenesis. As a result, inferring gene regulatory changes in key tissues, such as the pancreas or liver, is problematic, and it is unrealistic to expect robust causal mechanistic insights from these data alone. Nonetheless, in the case of blood-based studies, it is plausible that T2D could induce epigenetic changes in white blood cells through systemic effects driven by T2D, which could explain the observed DNA methylation changes. While blood based studies exhibit these limitations, blood based biomarkers will likely have the greatest clinical translation potential, due to the comparative ease of phlebotomy over tissue biopsy. Moreover, the systemic nature of T2D and its complications may increase stochastic variation in the epigenome, potentially explaining why different studies detect varying epigenetic changes. From a biomarker development perspective, whilst linking any changes in DNA methylation to mechanisms linked with T2D pathogenesis would increase confidence in the results, it is important to recognise that robust biomarkers may still be developed based on consistent DNA methylation patterns, even without direct mechanistic insights, provided they reliably associate with health condition states or outcomes across independent cohorts. These biomarkers could offer valuable predictive or diagnostic potential, especially when used in conjunction with other molecular and clinical data.
We have highlighted the importance of ensuring genomic technologies are equitably and ethically accessible for Indigenous Peoples. This is of particular importance given the discussed potential for predictive biomarkers from DNA methylation data for T2D; a condition with disproportionate prevalence within many Indigenous communities (Harris et al., 2017). Our review highlights how genomics research poses barriers for Indigenous Peoples and offers guidance on how researchers can adapt their approaches to uphold the highest ethical standards when collaborating with Indigenous communities. We have attempted to discuss all these issues from a global perspective and to provide general Indigenous data sovereignty advice that can be used by all, but we acknowledge that we write from the perspective of Aboriginal (SM and AB) and non-Aboriginal (SB) Australians whose research work is predominantly in partnership with Australian Indigenous communities. We acknowledge the rich diversity that exists within Indigenous communities and appreciate that aspects of our advice may not apply in all contexts. Regardless, we implore that researchers who wish to undertake genomics work with Indigenous communities form genuine collaborations with Indigenous Peoples and be led by the directives of the communities they partner with.
In summary, based on the study sizes reviewed herein, the complex nature of T2D pathogenesis, and the observation that many Indigenous populations globally exhibit the highest prevalence rates, there are key limitations in previous biomarker studies, particularly the insufficient genome-wide coverage of microarray platforms. To advance T2D biomarker development, we see good evidence that future efforts should focus on: 1) large longitudinal cohorts to increase statistical power, 2) the inclusion of diverse populations, particularly Indigenous populations with higher T2D prevalence, and 3) sequencing-based DNA methylation profiling at the discovery stage, which allows for the incorporation of diverse genetic backgrounds and enables broader genome-wide coverage in analyses. By embracing these strategies, the field can move closer to developing robust, globally relevant biomarkers that not only advance our understanding of T2D but also pave the way for more equitable healthcare solutions for those most affected by this complex condition.
Author contributions
SM: Writing–review and editing, Conceptualization, Investigation, Methodology, Writing–original draft. AB: Conceptualization, Funding acquisition, Supervision, Writing–review and editing. SB: Conceptualization, Funding acquisition, Supervision, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. SB was supported by an NHMRC Investigator Grant (GNT2034779). SM was supported by an Australian Government Research Training Program (AGRTP) Fee Offset Scholarship, a Thermo Fisher Scientific Scholarship for Indigenous Genomics Program scholarship, an ANU Indigenous Australian Graduate Scholarship, and a ANU Postgraduate Research Scholarship. This work was also supported by the Australian Medical Research Future Fund (MRF2017210, MRF2016124) and NHMRC (GNT2011277).
Acknowledgments
Figure 1 was created in BioRender. Buckberry (2025) https://BioRender.com/j05r706.
Conflict of interest
SB is an inventor on a pending patent (PCT/AU2019/051296) filed by the University of Western Australia for erasing epigenetic memory in induced pluripotent stem cells.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Abderrahmani, A., Yengo, L., Caiazzo, R., Canouil, M., Cauchi, S., Raverdy, V., et al. (2018). Increased hepatic PDGF-AA signaling mediates liver insulin resistance in obesity-associated type 2 diabetes. Diabetes 67, 1310–1321. doi:10.2337/db17-1539
American Diabetes Association Professional Practice Committee (2024). 2. Diagnosis and classification of diabetes: standards of care in diabetes-2024. Diabetes Care 47, S20–S42. doi:10.2337/dc24-S002
Arango-Isaza, E., Aninao, M. J., Campbell, R., Martínez, F. I., Shimizu, K. K., and Barbieri, C. (2023). Bridging the gap: returning genetic results to indigenous communities in Latin America. Front. Genet. 14, 1304974. doi:10.3389/fgene.2023.1304974
Bacos, K., Gillberg, L., Volkov, P., Olsson, A. H., Hansen, T., Pedersen, O., et al. (2016). Blood-based biomarkers of age-associated epigenetic changes in human islets associate with insulin secretion and diabetes. Nat. Commun. 7, 11089. doi:10.1038/ncomms11089
Bannister, A. J., and Kouzarides, T. (2011). Regulation of chromatin by histone modifications. Cell Res. 21, 381–395. doi:10.1038/cr.2011.22
Belsky, D. W., Caspi, A., Arseneault, L., Baccarelli, A., Corcoran, D. L., Gao, X., et al. (2020). Quantification of the pace of biological aging in humans through a blood test, the DunedinPoAm DNA methylation algorithm. Elife 9, e54870. doi:10.7554/eLife.54870
Belsky, D. W., Caspi, A., Corcoran, D. L., Sugden, K., Poulton, R., Arseneault, L., et al. (2022). DunedinPACE, a DNA methylation biomarker of the pace of aging. Elife 11, e73420. doi:10.7554/eLife.73420
Bibikova, M., Barnes, B., Tsan, C., Ho, V., Klotzle, B., Le, J. M., et al. (2011). High density DNA methylation array with single CpG site resolution. Genomics 98, 288–295. doi:10.1016/j.ygeno.2011.07.007
Bibikova, M., Le, J., Barnes, B., Saedinia-Melnyk, S., Zhou, L., Shen, R., et al. (2009). Genome-wide DNA methylation profiling using Infinium® assay. Epigenomics 1, 177–200. doi:10.2217/epi.09.14
Bjornstad, P., Chao, L. C., Cree-Green, M., Dart, A. B., King, M., Looker, H. C., et al. (2023). Youth-onset type 2 diabetes mellitus: an urgent challenge. Nat. Rev. Nephrol. 19, 168–184. doi:10.1038/s41581-022-00645-1
Bonner, W. A., Hulett, H. R., Sweet, R. G., and Herzenberg, L. A. (1972). Fluorescence activated cell sorting. Rev. Sci. Instrum. 43, 404–409. doi:10.1063/1.1685647
Bonomi, L., Huang, Y., and Ohno-Machado, L. (2020). Privacy challenges and research opportunities for genomic data sharing. Nat. Genet. 52, 646–654. doi:10.1038/s41588-020-0651-0
Boye, C., Nirmalan, S., Ranjbaran, A., and Luca, F. (2024). Genotype × environment interactions in gene regulation and complex traits. Nat. Genet. 56, 1057–1068. doi:10.1038/s41588-024-01776-w
Buckberry, S. (2025). Fig 1: Comparison of candidate gene studies, microarrays, and methylome sequencing approaches for DNA methylation biomarker studies. BioRender. Available online at: https://BioRender.com/j05r706.
Buckberry, S., Liu, X., Poppe, D., Tan, J. P., Sun, G., Chen, J., et al. (2023). Transient naive reprogramming corrects hiPS cells functionally and epigenetically. Nature 620, 863–872. doi:10.1038/s41586-023-06424-7
Bysani, M., Perfilyev, A., de Mello, V. D., Rönn, T., Nilsson, E., Pihlajamäki, J., et al. (2017). Epigenetic alterations in blood mirror age-associated DNA methylation and gene expression changes in human liver. Epigenomics 9, 105–122. doi:10.2217/epi-2016-0087
Cardona, A., Day, F. R., Perry, J. R. B., Loh, M., Chu, A. Y., Lehne, B., et al. (2019). Epigenome-wide association study of incident type 2 diabetes in a British population: EPIC-norfolk study. Diabetes 68, 2315–2326. doi:10.2337/db18-0290
Carroll, S. R., Garba, I., Figueroa-Rodríguez, O. L., Holbrook, J., Lovett, R., Materechera, S., et al. (2020). The CARE principles for indigenous data governance. Data Sci. J. 19. doi:10.5334/dsj-2020-043
Cavalli, G., and Heard, E. (2019). Advances in epigenetics link genetics to the environment and disease. Nature 571, 489–499. doi:10.1038/s41586-019-1411-0
Chambers, J. C., Loh, M., Lehne, B., Drong, A., Kriebel, J., Motta, V., et al. (2015). Epigenome-wide association of DNA methylation markers in peripheral blood from Indian Asians and Europeans with incident type 2 diabetes: a nested case-control study. Lancet Diabetes Endocrinol. 3, 526–534. doi:10.1016/S2213-8587(15)00127-8
Chatterjee, S., Khunti, K., and Davies, M. J. (2017). Type 2 diabetes. Lancet 389, 2239–2251. doi:10.1016/S0140-6736(17)30058-2
Chen, J., Saxena, G., Mungrue, I. N., Lusis, A. J., and Shalev, A. (2008). Thioredoxin-interacting protein: a critical link between glucose toxicity and beta-cell apoptosis. Diabetes 57, 938–944. doi:10.2337/db07-0715
Chen, X., Lin, Q., Wen, J., Lin, W., Liang, J., Huang, H., et al. (2020). Whole genome bisulfite sequencing of human spermatozoa reveals differentially methylated patterns from type 2 diabetic patients. J. Diabetes Investig. 11, 856–864. doi:10.1111/jdi.13201
Chen, X., Xu, H., Shu, X., and Song, C.-X. (2023). Mapping epigenetic modifications by sequencing technologies. Cell Death Differ. 32, 56–65. doi:10.1038/s41418-023-01213-1
Cheng, Y., Gadd, D. A., Gieger, C., Monterrubio-Gómez, K., Zhang, Y., Berta, I., et al. (2023). Development and validation of DNA methylation scores in two European cohorts augment 10-year risk prediction of type 2 diabetes. Nat. Aging 3, 450–458. doi:10.1038/s43587-023-00391-4
Chi, T., Lin, J., Wang, M., Zhao, Y., Liao, Z., and Wei, P. (2021). Non-coding RNA as biomarkers for type 2 diabetes development and clinical management. Front. Endocrinol. (Lausanne) 12, 630032. doi:10.3389/fendo.2021.630032
Choi, E.-H., and Park, S.-J. (2023). TXNIP: a key protein in the cellular stress response pathway and a potential therapeutic target. Exp. Mol. Med. 55, 1348–1356. doi:10.1038/s12276-023-01019-8
Claw, K. G., Anderson, M. Z., Begay, R. L., Tsosie, K. S., Fox, K., Garrison, N. A., et al. (2018). A framework for enhancing ethical genomic research with Indigenous communities. Nat. Commun. 9, 2957. doi:10.1038/s41467-018-05188-3
Colella, S., Shen, L., Baggerly, K. A., Issa, J. P., and Krahe, R. (2003). Sensitive and quantitative universal Pyrosequencing methylation analysis of CpG sites. Biotechniques 35, 146–150. doi:10.2144/03351md01
Dabelea, D., Mayer-Davis, E. J., Lamichhane, A. P., D’Agostino, R. B., Liese, A. D., Vehik, K. S., et al. (2008). Association of intrauterine exposure to maternal diabetes and obesity with type 2 diabetes in youth: the SEARCH Case-Control Study. Diabetes Care 31, 1422–1426. doi:10.2337/dc07-2417
Dayeh, T., Tuomi, T., Almgren, P., Perfilyev, A., Jansson, P.-A., de Mello, V. D., et al. (2016). DNA methylation of loci within ABCG1 and PHOSPHO1 in blood DNA is associated with future type 2 diabetes risk. Epigenetics 11, 482–488. doi:10.1080/15592294.2016.1178418
Dayeh, T., Volkov, P., Salö, S., Hall, E., Nilsson, E., Olsson, A. H., et al. (2014). Genome-wide DNA methylation analysis of human pancreatic islets from type 2 diabetic and non-diabetic donors identifies candidate genes that influence insulin secretion. PLoS Genet. 10, e1004160. doi:10.1371/journal.pgen.1004160
DeFronzo, R. A., Ferrannini, E., Groop, L., Henry, R. R., Herman, W. H., Holst, J. J., et al. (2015). Type 2 diabetes mellitus. Nat. Rev. Dis. Prim. 1, 15019. doi:10.1038/nrdp.2015.19
de Lima Camillo, L. P., Sehgal, R., Armstrong, J., Higgins-Chen, A. T., Horvath, S., and Wang, B. (2024). CpGPT: a foundation model for DNA methylation. bioRxiv. doi:10.1101/2024.10.24.619766
de Mendoza, A., Nguyen, T. V., Ford, E., Poppe, D., Buckberry, S., Pflueger, J., et al. (2022). Large-scale manipulation of promoter DNA methylation reveals context-specific transcriptional responses and stability. Genome Biol. 23, 163. doi:10.1186/s13059-022-02728-5
de Mendoza, A., Poppe, D., Buckberry, S., Pflueger, J., Albertin, C. B., Daish, T., et al. (2021). The emergence of the brain non-CpG methylation system in vertebrates. Nat. Ecol. Evol. 5, 369–378. doi:10.1038/s41559-020-01371-2
De Ridder, K., Che, H., Leroy, K., and Thienpont, B. (2024). Benchmarking of methods for DNA methylome deconvolution. Nat. Commun. 15, 4134. doi:10.1038/s41467-024-48466-z
Domingo-Relloso, A., Gribble, M. O., Riffo-Campos, A. L., Haack, K., Cole, S. A., Tellez-Plaza, M., et al. (2022). Epigenetics of type 2 diabetes and diabetes-related outcomes in the Strong Heart Study. Clin. Epigenetics 14, 177. doi:10.1186/s13148-022-01392-7
Dupras, C., Beck, S., and Rothstein, M. A. (2019). Potential (mis) use of epigenetic age estimators by private companies and public agencies: human rights law should provide ethical guidance. Epigenetics. doi:10.1093/eep/dvz018
Dupras, C., Joly, Y., and Rial-Sebbag, E. (2020). Human rights in the postgenomic era: challenges and opportunities arising with epigenetics. Soc. Sci. Inf. 59, 12–34. doi:10.1177/0539018419900139
Dupras, C., Song, L., Saulnier, K. M., and Joly, Y. (2018). Epigenetic discrimination: emerging applications of epigenetics pointing to the limitations of policies against genetic discrimination. Front. Genet. 9, 202. doi:10.3389/fgene.2018.00202
Elliott, H. R., Burrows, K., Min, J. L., Tillin, T., Mason, D., Wright, J., et al. (2022). Characterisation of ethnic differences in DNA methylation between UK-resident South Asians and Europeans. Clin. Epigenetics 14, 130. doi:10.1186/s13148-022-01351-2
Fatumo, S., Chikowore, T., Choudhury, A., Ayub, M., Martin, A. R., and Kuchenbaecker, K. (2022). A roadmap to increase diversity in genomic studies. Nat. Med. 28, 243–250. doi:10.1038/s41591-021-01672-4
Feinberg, A. P. (2018). The key role of epigenetics in human disease prevention and mitigation. N. Engl. J. Med. 378, 1323–1334. doi:10.1056/NEJMra1402513
Frambach, S. J. C. M., de Haas, R., Smeitink, J. A. M., Rongen, G. A., Russel, F. G. M., and Schirris, T. J. J. (2020). Brothers in arms: ABCA1-and ABCG1-mediated cholesterol efflux as promising targets in cardiovascular disease treatment. Pharmacol. Rev. 72, 152–190. doi:10.1124/pr.119.017897
Fraszczyk, E., Spijkerman, A. M. W., Zhang, Y., Brandmaier, S., Day, F. R., Zhou, L., et al. (2022a). Epigenome-wide association study of incident type 2 diabetes: a meta-analysis of five prospective European cohorts. Diabetologia 65, 763–776. doi:10.1007/s00125-022-05652-2
Fraszczyk, E., Thio, C. H. L., Wackers, P., Dollé, M. E. T., Bloks, V. W., Hodemaekers, H., et al. (2022b). DNA methylation trajectories and accelerated epigenetic aging in incident type 2 diabetes. Geroscience 44, 2671–2684. doi:10.1007/s11357-022-00626-z
Frommer, M., McDonald, L. E., Millar, D. S., Collis, C. M., Watt, F., Grigg, G. W., et al. (1992). A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands. Proc. Natl. Acad. Sci. U. S. A. 89, 1827–1831. doi:10.1073/pnas.89.5.1827
Garrison, N. A., Hudson, M., Ballantyne, L. L., Garba, I., Martinez, A., Taualii, M., et al. (2019). Genomic research through an indigenous lens: understanding the expectations. Annu. Rev. Genomics Hum. Genet. 20, 495–517. doi:10.1146/annurev-genom-083118-015434
GBD 2021 Diabetes Collaborators (2023). Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet 402, 203–234. doi:10.1016/S0140-6736(23)01301-6
Glenn, E. N. (2015). Settler colonialism as structure: a framework for comparative studies of U.s. race and gender formation. Sociol. Race Ethn. (Thousand Oaks) 1, 52–72. doi:10.1177/2332649214560440
Greenberg, M. V. C., and Bourc’his, D. (2019). The diverse roles of DNA methylation in mammalian development and disease. Nat. Rev. Mol. Cell Biol. 20, 590–607. doi:10.1038/s41580-019-0159-6
Griffiths, K., Coleman, C., Lee, V., and Madden, R. (2016). How colonisation determines social justice and Indigenous health—a review of the literature. J. Popul. Res. (Canberra) 33, 9–30. doi:10.1007/s12546-016-9164-1
Griffiths, K., Johnston, M., and Bowman-Derrick, S. (2021). Indigenous data sovereignty: readiness assessment and evaluation toolkit. Available online at: https://researchers.cdu.edu.au/en/publications/indigenous-data-sovereignty-readiness-assessment-and-evaluation-t.
Hannum, G., Guinney, J., Zhao, L., Zhang, L., Hughes, G., Sadda, S., et al. (2013). Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell 49, 359–367. doi:10.1016/j.molcel.2012.10.016
Harris, S. B., Tompkins, J. W., and TeHiwi, B. (2017). Call to action: a new path for improving diabetes care for Indigenous peoples, a global review. Diabetes Res. Clin. Pract. 123, 120–133. doi:10.1016/j.diabres.2016.11.022
Hermes, A., Wiersma, M., Kerridge, I., Easteal, S., Light, E., Dive, L., et al. (2021). Beyond platitudes: a qualitative study of Australian Aboriginal people’s perspectives on biobanking. Intern. Med. J. 51, 1426–1432. doi:10.1111/imj.15223
Hillary, R. F., McCartney, D. L., Smith, H. M., Bernabeu, E., Gadd, D. A., Chybowska, A. D., et al. (2023). Blood-based epigenome-wide analyses of 19 common disease states: a longitudinal, population-based linked cohort study of 18,413 Scottish individuals. PLoS Med. 20, e1004247. doi:10.1371/journal.pmed.1004247
Hillary, R. F., Stevenson, A. J., McCartney, D. L., Campbell, A., Walker, R. M., Howard, D. M., et al. (2020). Epigenetic measures of ageing predict the prevalence and incidence of leading causes of death and disease burden. Clin. Epigenetics 12, 115. doi:10.1186/s13148-020-00905-6
Hocking, S. L., Markovic, T. P., Lee, C. M. Y., Picone, T. J., Gudorf, K. E., and Colagiuri, S. (2024). Intensive lifestyle intervention for remission of early type 2 diabetes in primary care in Australia: DiRECT-aus. Diabetes Care 47, 66–70. doi:10.2337/dc23-0781
Holliday, R., and Pugh, J. (1975). DNA modification mechanisms and gene activity during development. Science 187, 226–232. doi:10.1126/science.187.4173.226
Holoch, D., and Moazed, D. (2015). RNA-mediated epigenetic regulation of gene expression. Nat. Rev. Genet. 16, 71–84. doi:10.1038/nrg3863
Horvath, S. (2013). DNA methylation age of human tissues and cell types. Genome Biol. 14, R115. doi:10.1186/gb-2013-14-10-r115
Horvath, S., and Raj, K. (2018). DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat. Rev. Genet. 19, 371–384. doi:10.1038/s41576-018-0004-3
Houseman, E. A., Accomando, W. P., Koestler, D. C., Christensen, B. C., Marsit, C. J., Nelson, H. H., et al. (2012). DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinforma. 13, 86. doi:10.1186/1471-2105-13-86
Huang, S., Qin, P., Chen, Q., Zhang, D., Cheng, C., Guo, C., et al. (2021). Association of FTO gene methylation with incident type 2 diabetes mellitus: a nested case-control study. Gene 786, 145585. doi:10.1016/j.gene.2021.145585
Hudson, M., Garrison, N. A., Sterling, R., Caron, N. R., Fox, K., Yracheta, J., et al. (2020). Rights, interests and expectations: indigenous perspectives on unrestricted access to genomic data. Nat. Rev. Genet. 21, 377–384. doi:10.1038/s41576-020-0228-x
Hyett, S., Gabel, C., Marjerrison, S., and Schwartz, L. (2019). Deficit-based indigenous health research and the stereotyping of indigenous peoples. Can. J. Bioeth. 2, 102–109. doi:10.7202/1065690ar
Jaenisch, R., and Bird, A. (2003). Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat. Genet. 33 (Suppl. l), 245–254. doi:10.1038/ng1089
Jeon, J.-P., Koh, I.-U., Choi, N.-H., Kim, B.-J., Han, B.-G., and Lee, S. (2017). Differential DNA methylation of MSI2 and its correlation with diabetic traits. PLoS One 12, e0177406. doi:10.1371/journal.pone.0177406
Jones, M. J., Goodman, S. J., and Kobor, M. S. (2015). DNA methylation and healthy human aging. Aging Cell 14, 924–932. doi:10.1111/acel.12349
Juvinao-Quintero, D. L., Marioni, R. E., Ochoa-Rosales, C., Russ, T. C., Deary, I. J., van Meurs, J. B. J., et al. (2021). DNA methylation of blood cells is associated with prevalent type 2 diabetes in a meta-analysis of four European cohorts. Clin. Epigenetics 13, 40. doi:10.1186/s13148-021-01027-3
Kachuri, L., Chatterjee, N., Hirbo, J., Schaid, D. J., Martin, I., Kullo, I. J., et al. (2023). Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. 25, 8–25. doi:10.1038/s41576-023-00637-2
Kennedy, M. A., Barrera, G. C., Nakamura, K., Baldán, A., Tarr, P., Fishbein, M. C., et al. (2005). ABCG1 has a critical role in mediating cholesterol efflux to HDL and preventing cellular lipid accumulation. Cell Metab. 1, 121–131. doi:10.1016/j.cmet.2005.01.002
Khurana, I., Howard, N. J., Maxwell, S., Du Preez, A., Kaipananickal, H., Breen, J., et al. (2023). Circulating epigenomic biomarkers correspond with kidney disease susceptibility in high-risk populations with type 2 diabetes mellitus. Diabetes Res. Clin. Pract. 204, 110918. doi:10.1016/j.diabres.2023.110918
Krishna, R., and Elisseev, V. (2021). User-centric genomics infrastructure: trends and technologies. Genome 64, 467–475. doi:10.1139/gen-2020-0096
Kriukienė, E., Tomkuvienė, M., and Klimašauskas, S. (2024). 5-Hydroxymethylcytosine: the many faces of the sixth base of mammalian DNA. Chem. Soc. Rev. 53, 2264–2283. doi:10.1039/d3cs00858d
Krueger, F., Kreck, B., Franke, A., and Andrews, S. R. (2012). DNA methylome analysis using short bisulfite sequencing data. Nat. Methods 9, 145–151. doi:10.1038/nmeth.1828
Kumar, K. K., Aburawi, E. H., Ljubisavljevic, M., Leow, M. K. S., Feng, X., Ansari, S. A., et al. (2024). Exploring histone deacetylases in type 2 diabetes mellitus: pathophysiological insights and therapeutic avenues. Clin. Epigenetics 16, 78. doi:10.1186/s13148-024-01692-0
Law, P.-P., and Holland, M. L. (2019). DNA methylation at the crossroads of gene and environment interactions. Essays Biochem. 63, 717–726. doi:10.1042/EBC20190031
Lee, K., Kerner, J., and Hoppel, C. L. (2011). Mitochondrial carnitine palmitoyltransferase 1a (CPT1a) is part of an outer membrane fatty acid transfer complex. J. Biol. Chem. 286, 25655–25662. doi:10.1074/jbc.M111.228692
Levine, M. E., Lu, A. T., Quach, A., Chen, B. H., Assimes, T. L., Bandinelli, S., et al. (2018). An epigenetic biomarker of aging for lifespan and healthspan. Aging (Albany NY) 10, 573–591. doi:10.18632/aging.101414
Li, E., and Zhang, Y. (2014). DNA methylation in mammals. Cold Spring Harb. Perspect. Biol. 6, a019133. doi:10.1101/cshperspect.a019133
Li, S., and Tollefsbol, T. O. (2021). DNA methylation methods: global DNA methylation and methylomic analyses. Methods 187, 28–43. doi:10.1016/j.ymeth.2020.10.002
Liao, W.-W., Asri, M., Ebler, J., Doerr, D., Haukness, M., Hickey, G., et al. (2023). A draft human pangenome reference. Nature 617, 312–324. doi:10.1038/s41586-023-05896-x
Ling, C., Bacos, K., and Rönn, T. (2022). Epigenetics of type 2 diabetes mellitus and weight change — a tool for precision medicine? Nat. Rev. Endocrinol. 18, 433–448. doi:10.1038/s41574-022-00671-w
Ling, C., Del Guerra, S., Lupi, R., Rönn, T., Granhall, C., Luthman, H., et al. (2008). Epigenetic regulation of PPARGC1A in human type 2 diabetic islets and effect on insulin secretion. Diabetologia 51, 615–622. doi:10.1007/s00125-007-0916-5
Lister, R., Pelizzola, M., Dowen, R. H., Hawkins, R. D., Hon, G., Tonti-Filippini, J., et al. (2009). Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462, 315–322. doi:10.1038/nature08514
López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M., and Kroemer, G. (2013). The hallmarks of aging. Cell 153, 1194–1217. doi:10.1016/j.cell.2013.05.039
López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M., and Kroemer, G. (2023). Hallmarks of aging: an expanding universe. Cell 186, 243–278. doi:10.1016/j.cell.2022.11.001
Loyfer, N., Magenheim, J., Peretz, A., Cann, G., Bredno, J., Klochendler, A., et al. (2023). A DNA methylation atlas of normal human cell types. Nature 613, 355–364. doi:10.1038/s41586-022-05580-6
Lu, A. T., Quach, A., Wilson, J. G., Reiner, A. P., Aviv, A., Raj, K., et al. (2019). DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging 11, 303–327. doi:10.18632/aging.101684
Luo, C., Hajkova, P., and Ecker, J. R. (2018). Dynamic DNA methylation: in the right place at the right time. Science 361, 1336–1340. doi:10.1126/science.aat6806
Magkos, F., Hjorth, M. F., and Astrup, A. (2020). Diet and exercise in the prevention and treatment of type 2 diabetes mellitus. Nat. Rev. Endocrinol. 16, 545–555. doi:10.1038/s41574-020-0381-5
Mansour Aly, D., Dwivedi, O. P., Prasad, R. B., Käräjämäki, A., Hjort, R., Thangam, M., et al. (2021). Genome-wide association analyses highlight etiological differences underlying newly defined subtypes of diabetes. Nat. Genet. 53, 1534–1542. doi:10.1038/s41588-021-00948-2
Martin, E. M., and Fry, R. C. (2018). Environmental influences on the epigenome: exposure- associated DNA methylation in human populations. Annu. Rev. Public Health 39, 309–333. doi:10.1146/annurev-publhealth-040617-014629
Matsuo, M. (2022). ABCA1 and ABCG1 as potential therapeutic targets for the prevention of atherosclerosis. J. Pharmacol. Sci. 148, 197–203. doi:10.1016/j.jphs.2021.11.005
Maunakea, A. K., Phankitnirundorn, K., Peres, R., Dye, C., Juarez, R., Walsh, C., et al. (2024). Socioeconomic status, lifestyle, and DNA methylation age among racially and ethnically diverse adults: NIMHD social epigenomics program. JAMA Netw. Open 7, e2421889. doi:10.1001/jamanetworkopen.2024.21889
Mc Cartney, A. M., Anderson, J., Liggins, L., Hudson, M. L., Anderson, M. Z., TeAika, B., et al. (2022). Balancing openness with Indigenous data sovereignty: an opportunity to leave no one behind in the journey to sequence all of life. Proc. Natl. Acad. Sci. U. S. A. 119, e2115860119. doi:10.1073/pnas.2115860119
McCartney, D. L., Hillary, R. F., Stevenson, A. J., Ritchie, S. J., Walker, R. M., Zhang, Q., et al. (2018). Epigenetic prediction of complex traits and death. Genome Biol. 19, 136. doi:10.1186/s13059-018-1514-1
McLean, C. Y., Bristor, D., Hiller, M., Clarke, S. L., Schaar, B. T., Lowe, C. B., et al. (2010). GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501. doi:10.1038/nbt.1630
Mills, M. C., and Rahal, C. (2019). A scientometric review of genome-wide association studies. Commun. Biol. 2, 9. doi:10.1038/s42003-018-0261-x
Moreno-Grau, S., Vernekar, M., Lopez-Pineda, A., Mas-Montserrat, D., Barrabés, M., Quinto-Cortés, C. D., et al. (2024). Polygenic risk score portability for common diseases across genetically diverse populations. Hum. Genomics 18, 93. doi:10.1186/s40246-024-00664-y
Morey, K., Franks, C., Pearson, O., Glover, K., and Brown, A. (2023). Research ACCORDing to whom? Developing a south Australian aboriginal and Torres strait islander health research accord. First Nations Health Wellbeing - Lowitja J. 1, 100003. doi:10.1016/j.fnhli.2023.100003
Muka, T., Nano, J., Voortman, T., Braun, K. V. E., Ligthart, S., Stranges, S., et al. (2016). The role of global and regional DNA methylation and histone modifications in glycemic traits and type 2 diabetes: a systematic review. Nutr. Metab. Cardiovasc. Dis. 26, 553–566. doi:10.1016/j.numecd.2016.04.002
Nilsson, E., Jansson, P. A., Perfilyev, A., Volkov, P., Pedersen, M., Svensson, M. K., et al. (2014). Altered DNA methylation and differential expression of genes influencing metabolism and inflammation in adipose tissue from subjects with type 2 diabetes. Diabetes 63, 2962–2976. doi:10.2337/db13-1459
Noguera-Castells, A., García-Prieto, C. A., Álvarez-Errico, D., and Esteller, M. (2023). Validation of the new EPIC DNA methylation microarray (900K EPIC v2) for high-throughput profiling of the human DNA methylome. Epigenetics 18, 2185742. doi:10.1080/15592294.2023.2185742
Nurk, S., Koren, S., Rhie, A., Rautiainen, M., Bzikadze, A. V., Mikheenko, A., et al. (2022). The complete sequence of a human genome. Science 376, 44–53. doi:10.1126/science.abj6987
Oestreich, M., Chen, D., Schultze, J. L., Fritz, M., and Becker, M. (2021). Privacy considerations for sharing genomics data. EXCLI J. 20, 1243–1260. doi:10.17179/excli2021-4002
Oliva, A., Kaphle, A., Reguant, R., Sng, L. M. F., Twine, N. A., Malakar, Y., et al. (2024). Future-proofing genomic data and consent management: a comprehensive review of technology innovations. Gigascience 13, giae021. doi:10.1093/gigascience/giae021
Out, R., Hoekstra, M., Meurs, I., de Vos, P., Kuiper, J., Van Eck, M., et al. (2007). Total body ABCG1 expression protects against early atherosclerotic lesion development in mice. Arterioscler. Thromb. Vasc. Biol. 27, 594–599. doi:10.1161/01.ATV.0000257136.24308.0c
Pajares, M. J., Palanca-Ballester, C., Urtasun, R., Alemany-Cosme, E., Lahoz, A., and Sandoval, J. (2021). Methods for analysis of specific DNA methylation status. Methods 187, 3–12. doi:10.1016/j.ymeth.2020.06.021
Paradies, Y. (2016). Colonisation, racism and indigenous health. J. Popul. Res. 33, 83–96. doi:10.1007/s12546-016-9159-y
Pidsley, R., Zotenko, E., Peters, T. J., Lawrence, M. G., Risbridger, G. P., Molloy, P., et al. (2016). Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biol. 17, 208. doi:10.1186/s13059-016-1066-1
Plongthongkum, N., Diep, D. H., and Zhang, K. (2014). Advances in the profiling of DNA modifications: cytosine methylation and beyond. Nat. Rev. Genet. 15, 647–661. doi:10.1038/nrg3772
Portela, A., and Esteller, M. (2010). Epigenetic modifications and human disease. Nat. Biotechnol. 28, 1057–1068. doi:10.1038/nbt.1685
Qie, R., Chen, Q., Wang, T., Chen, X., Wang, J., Cheng, R., et al. (2021). Association of ABCG1 gene methylation and its dynamic change status with incident type 2 diabetes mellitus: the Rural Chinese Cohort Study. J. Hum. Genet. 66, 347–357. doi:10.1038/s10038-020-00848-z
Ray, D., and Chatterjee, N. (2020). A powerful method for pleiotropic analysis under composite null hypothesis identifies novel shared loci between Type 2 Diabetes and Prostate Cancer. PLoS Genet. 16, e1009218. doi:10.1371/journal.pgen.1009218
Rauschert, S., Raubenheimer, K., Melton, P. E., and Huang, R. C. (2020). Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification. Clin. Epigenetics 12, 51. doi:10.1186/s13148-020-00842-4
Reid, P., Cormack, D., and Paine, S.-J. (2019). Colonial histories, racism and health—the experience of Māori and Indigenous peoples. Public Health 172, 119–124. doi:10.1016/j.puhe.2019.03.027
Reis, A. L. M., Rapadas, M., Hammond, J. M., Gamaarachchi, H., Stevanovski, I., Ayuputeri Kumaheri, M., et al. (2023). The landscape of genomic structural variation in Indigenous Australians. Nature 624, 602–610. doi:10.1038/s41586-023-06842-7
Ribel-Madsen, R., Fraga, M. F., Jacobsen, S., Bork-Jensen, J., Lara, E., Calvanese, V., et al. (2012). Genome-wide analysis of DNA methylation differences in muscle and fat from monozygotic twins discordant for type 2 diabetes. PLoS One 7, e51302. doi:10.1371/journal.pone.0051302
Roden, M., and Shulman, G. I. (2019). The integrative biology of type 2 diabetes. Nature 576, 51–60. doi:10.1038/s41586-019-1797-8
Rönn, T., Ofori, J. K., Perfilyev, A., Hamilton, A., Pircs, K., Eichelmann, F., et al. (2023). Genes with epigenetic alterations in human pancreatic islets impact mitochondrial function, insulin secretion, and type 2 diabetes. Nat. Commun. 14, 8040. doi:10.1038/s41467-023-43719-9
Sabbatinelli, J., Giuliani, A., Kwiatkowska, K. M., Matacchione, G., Belloni, A., Ramini, D., et al. (2024). DNA Methylation-derived biological age and long-term mortality risk in subjects with type 2 diabetes. Cardiovasc. Diabetol. 23, 250. doi:10.1186/s12933-024-02351-7
Salama, O. E., Hizon, N., Del Vecchio, M., Kolsun, K., Fonseca, M. A., Lin, D. T. S., et al. (2024). DNA methylation signatures of youth-onset type 2 diabetes and exposure to maternal diabetes. Clin. Epigenetics 16, 65. doi:10.1186/s13148-024-01675-1
Santaló, J., and Berdasco, M. (2022). Ethical implications of epigenetics in the era of personalized medicine. Clin. Epigenetics 14, 44. doi:10.1186/s13148-022-01263-1
Saulnier, K., Berner, A., Liosi, S., Earp, B., Berrios, C., Dyke, S. O. M., et al. (2022). Studying vulnerable populations through an epigenetics lens: proceed with caution. Can. J. Bioeth. 5, 68–78. doi:10.7202/1087205ar
Schlaepfer, I. R., and Joshi, M. (2020). CPT1A-mediated fat oxidation, mechanisms, and therapeutic potential. Endocrinology 161, bqz046. doi:10.1210/endocr/bqz046
Schneider, V. A., Graves-Lindsay, T., Howe, K., Bouk, N., Chen, H.-C., Kitts, P. A., et al. (2017). Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res. 27, 849–864. doi:10.1101/gr.213611.116
Schübeler, D. (2015). Function and information content of DNA methylation. Nature 517, 321–326. doi:10.1038/nature14192
Searle, B., Müller, M., Carell, T., and Kellett, A. (2023). Third-generation sequencing of epigenetic DNA. Angew. Chem. Weinh. Bergstr. Ger. 135. doi:10.1002/ange.202215704
Shabalin, A. A., Aberg, K. A., and van den Oord, E. J. C. G. (2015). Candidate gene methylation studies are at high risk of erroneous conclusions. Epigenomics 7, 13–15. doi:10.2217/epi.14.70
Sherwood, J. (2013). Colonisation–It’s bad for your health: the context of Aboriginal health. Contemp. Nurse 46, 28–40. doi:10.5172/conu.2013.46.1.28
Shimano, H., Yahagi, N., Amemiya-Kudo, M., Hasty, A. H., Osuga, J., Tamura, Y., et al. (1999). Sterol regulatory element-binding protein-1 as a key transcription factor for nutritional induction of lipogenic enzyme genes. J. Biol. Chem. 274, 35832–35839. doi:10.1074/jbc.274.50.35832
Silcocks, M., Farlow, A., Hermes, A., Tsambos, G., Patel, H. R., Huebner, S., et al. (2023). Indigenous Australian genomes show deep structure and rich novel variation. Nature 624, 593–601. doi:10.1038/s41586-023-06831-w
Sinclaire, M., Lavallee, B., Cyr, M., and Schultz, A. (2023). Indigenous peoples and type 2 diabetes: a discussion of colonial wounds and epistemic racism. Can. J. Diabetes 47, 451–454. doi:10.1016/j.jcjd.2023.01.008
Siopis, G., Colagiuri, S., and Allman-Farinelli, M. (2021). Effectiveness of dietetic intervention for people with type 2 diabetes: a meta-analysis. Clin. Nutr. 40, 3114–3122. doi:10.1016/j.clnu.2020.12.009
Skinner, M. K. (2024). Epigenetic biomarkers for disease susceptibility and preventative medicine. Cell Metab. 36, 263–277. doi:10.1016/j.cmet.2023.11.015
Sollis, E., Mosaku, A., Abid, A., Buniello, A., Cerezo, M., Gil, L., et al. (2023). The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985. doi:10.1093/nar/gkac1010
Southern, E., Mir, K., and Shchepinov, M. (1999). Molecular interactions on microarrays. Nat. Genet. 21, 5–9. doi:10.1038/4429
Suzuki, K., Hatzikotoulas, K., Southam, L., Taylor, H. J., Yin, X., Lorenz, K. M., et al. (2024). Genetic drivers of heterogeneity in type 2 diabetes pathophysiology. Nature 627, 347–357. doi:10.1038/s41586-024-07019-6
Taher, L., and Ovcharenko, I. (2009). Variable locus length in the human genome leads to ascertainment bias in functional inference for non-coding elements. Bioinformatics 25, 578–584. doi:10.1093/bioinformatics/btp043
Teschendorff, A. E., Breeze, C. E., Zheng, S. C., and Beck, S. (2017). A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies. BMC Bioinforma. 18, 105. doi:10.1186/s12859-017-1511-5
Teschendorff, A. E., and Zheng, S. C. (2017). Cell-type deconvolution in epigenome-wide association studies: a review and recommendations. Epigenomics 9, 757–768. doi:10.2217/epi-2016-0153
Thompson, M., Hill, B. L., Rakocz, N., Chiang, J. N., Geschwind, D., Sankararaman, S., et al. (2022). Methylation risk scores are associated with a collection of phenotypes within electronic health record systems. NPJ Genom Med. 7, 50. doi:10.1038/s41525-022-00320-1
Titmuss, A., Davis, E. A., O’Donnell, V., Wenitong, M., Maple-Brown, L. J., Haynes, A., et al. (2022). Youth-onset type 2 diabetes among First Nations young people in northern Australia: a retrospective, cross-sectional study. Lancet Diabetes Endocrinol. 10, 11–13. doi:10.1016/S2213-8587(21)00286-2
Titus, A. J., Gallimore, R. M., Salas, L. A., and Christensen, B. C. (2017). Cell-type deconvolution from DNA methylation: a review of recent applications. Hum. Mol. Genet. 26, R216-R224–R224. doi:10.1093/hmg/ddx275
Treangen, T. J., and Salzberg, S. L. (2011). Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat. Rev. Genet. 13, 36–46. doi:10.1038/nrg3117
Turnbull, P. (2007). Scientific theft of remains in colonial Australia. Aust. Indig. law Rev. 11, 92–104.
Vaisvila, R., Ponnaluri, V. K. C., Sun, Z., Langhorst, B. W., Saleh, L., Guan, S., et al. (2021). Enzymatic methyl sequencing detects DNA methylation at single-base resolution from picograms of DNA. Genome Res. 31, 1280–1289. doi:10.1101/gr.266551.120
Viner, R., White, B., and Christie, D. (2017). Type 2 diabetes in adolescents: a severe phenotype posing major clinical challenges and public health burden. Lancet 389, 2252–2260. doi:10.1016/S0140-6736(17)31371-5
Volkov, P., Bacos, K., Ofori, J. K., Esguerra, J. L. S., Eliasson, L., Rönn, T., et al. (2017). Whole-genome bisulfite sequencing of human pancreatic islets reveals novel differentially methylated regions in type 2 diabetes pathogenesis. Diabetes 66, 1074–1085. doi:10.2337/db16-0996
Vujkovic, M., Keaton, J. M., Lynch, J. A., Miller, D. R., Zhou, J., Tcheandjieu, C., et al. (2020). Discovery of 318 new risk loci for type 2 diabetes and related vascular outcomes among 1.4 million participants in a multi-ancestry meta-analysis. Nat. Genet. 52, 680–691. doi:10.1038/s41588-020-0637-y
Waanders, A., Brown, A., Caron, N. R., Plisiewicz, A., McHugh, S. T., Nguyen, T. Q., et al. (2023). Indigenous peoples and inclusion in clinical and genomic research: understanding the history and navigating contemporary engagement. Neoplasia 37, 100879. doi:10.1016/j.neo.2023.100879
Wagenknecht, L. E., Lawrence, J. M., Isom, S., Jensen, E. T., Dabelea, D., Liese, A. D., et al. (2023). Trends in incidence of youth-onset type 1 and type 2 diabetes in the USA, 2002-18: results from the population-based SEARCH for Diabetes in Youth study. Lancet Diabetes Endocrinol. 11, 242–250. doi:10.1016/S2213-8587(23)00025-6
Walaszczyk, E., Luijten, M., Spijkerman, A. M. W., Bonder, M. J., Lutgers, H. L., Snieder, H., et al. (2018). DNA methylation markers associated with type 2 diabetes, fasting glucose and HbA1c levels: a systematic review and replication in a case–control sample of the Lifelines study. Diabetologia 61, 354–368. doi:10.1007/s00125-017-4497-7
Wang, N., Lan, D., Chen, W., Matsuura, F., and Tall, A. R. (2004). ATP-binding cassette transporters G1 and G4 mediate cellular cholesterol efflux to high-density lipoproteins. Proc. Natl. Acad. Sci. U. S. A. 101, 9774–9779. doi:10.1073/pnas.0403506101
Whalen, S., Schreiber, J., Noble, W. S., and Pollard, K. S. (2022). Navigating the pitfalls of applying machine learning in genomics. Nat. Rev. Genet. 23, 169–181. doi:10.1038/s41576-021-00434-9
Wicklow, B. A., Sellers, E. A. C., Sharma, A. K., Kroeker, K., Nickel, N. C., Philips-Beck, W., et al. (2018). Association of gestational diabetes and type 2 diabetes exposure in utero with the development of type 2 diabetes in first nations and non–first nations offspring. JAMA Pediatr. 172, 724–731. doi:10.1001/jamapediatrics.2018.1201
Wikström Shemer, D., Mostafaei, S., Tang, B., Pedersen, N. L., Karlsson, I. K., Fall, T., et al. (2024). Associations between epigenetic aging and diabetes mellitus in a Swedish longitudinal study. Geroscience 46, 5003–5014. doi:10.1007/s11357-024-01252-7
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J. J., Appleton, G., Axton, M., Baak, A., et al. (2016). The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3, 160018. doi:10.1038/sdata.2016.18
Willmer, T., Johnson, R., Louw, J., and Pheiffer, C. (2018). Blood-based DNA methylation biomarkers for type 2 diabetes: potential for clinical applications. Front. Endocrinol. 9, 744. doi:10.3389/fendo.2018.00744
Wittenbecher, C., Ouni, M., Kuxhaus, O., Jähnert, M., Gottmann, P., Teichmann, A., et al. (2019). Insulin-like growth factor binding protein 2 (IGFBP-2) and the risk of developing type 2 diabetes. Diabetes 68, 188–197. doi:10.2337/db18-0620
Wolfe, P. (2006). Settler colonialism and the elimination of the native. J. Genocide Res. 8, 387–409. doi:10.1080/14623520601056240
Wu, H., Eckhardt, C. M., and Baccarelli, A. A. (2023). Molecular mechanisms of environmental exposures and human disease. Nat. Rev. Genet. 24, 332–344. doi:10.1038/s41576-022-00569-3
Yang, B. T., Dayeh, T. A., Kirkpatrick, C. L., Taneera, J., Kumar, R., Groop, L., et al. (2011). Insulin promoter DNA methylation correlates negatively with insulin gene expression and positively with HbA1c levels in human pancreatic islets. Diabetologia 54, 360–367. doi:10.1007/s00125-010-1967-6
Yang, B. T., Dayeh, T. A., Volkov, P. A., Kirkpatrick, C. L., Malmgren, S., Jing, X., et al. (2012). Increased DNA methylation and decreased expression of PDX-1 in pancreatic islets from patients with type 2 diabetes. Mol. Endocrinol. 26, 1203–1212. doi:10.1210/me.2012-1004
Yao, L., Berman, B. P., and Farnham, P. J. (2015). Demystifying the secret mission of enhancers: linking distal regulatory elements to target genes. Crit. Rev. Biochem. Mol. Biol. 50, 550–573. doi:10.3109/10409238.2015.1087961
Yin, Y., Morgunova, E., Jolma, A., Kaasinen, E., Sahu, B., Khund-Sayeed, S., et al. (2017). Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science 356, eaaj2239. doi:10.1126/science.aaj2239
Ying, K., Song, J., Cui, H., Zhang, Y., Li, S., Chen, X., et al. (2024). MethylGPT: a foundation model for the DNA methylome. bioRxivorg. doi:10.1101/2024.10.30.621013
Yousefi, P. D., Suderman, M., Langdon, R., Whitehurst, O., Davey Smith, G., and Relton, C. L. (2022). DNA methylation-based predictors of health: applications and statistical considerations. Nat. Rev. Genet. 23, 369–383. doi:10.1038/s41576-022-00465-w
Zhang, H., Liu, L., and Li, M. (2024). Mini-review of DNA methylation detection techniques and their potential applications in disease diagnosis, prognosis, and treatment. ACS Sens. 9, 1089–1103. doi:10.1021/acssensors.3c02328
Zheng, Y., Ley, S. H., and Hu, F. B. (2018). Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat. Rev. Endocrinol. 14, 88–98. doi:10.1038/nrendo.2017.151
Keywords: type-2 diabetes, cardiometabolic disease, DNA methylation, biomarkers, epigenetic clock, Indigenous health, Indigenous data sovereignty
Citation: Munns S, Brown A and Buckberry S (2025) Type-2 diabetes epigenetic biomarkers: present status and future directions for global and Indigenous health. Front. Mol. Biosci. 12:1502640. doi: 10.3389/fmolb.2025.1502640
Received: 27 September 2024; Accepted: 03 February 2025;
Published: 28 April 2025.
Edited by:
Simone L. Cree, University of Otago, New ZealandReviewed by:
Shashwat Deepali Nagar, CRISPR Therapeutics, Inc., United StatesBoxun Li, Duke University, United States
Copyright © 2025 Munns, Brown and Buckberry. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sam Buckberry, c2FtLmJ1Y2tiZXJyeUBhbnUuZWR1LmF1