ORIGINAL RESEARCH article
Front. Genet.
Sec. Epigenomics and Epigenetics
Volume 16 - 2025 | doi: 10.3389/fgene.2025.1637186
This article is part of the Research TopicVariability of Epigenetic Clocks and their Mechanisms in Association with Culture, Lifestyle, Aging and DiseaseView all articles
Using buccal methylomic data to create explainable aging clocks as well as classifiers and regressors for lifestyle and demographic factors
Provisionally accepted- Tally Health, New York, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
In human blood, it has been demonstrated that methylomic information can be used to predict smoking status, alcohol intake, and chronological age. While it is possible to robustly predict chronological age using DNA methylation information derived from buccal tissue, it remains to be determined if other variables can be directly predicted in cheek swabs. Here, we demonstrate that classifiers for smoking status and race/ethnicity can be built in a buccal methylomic dataset derived from 8,045 adults spanning an age range of 18 – 93 years. Furthermore, we build novel regressors for body mass index, alcohol intake, and chronological age. For each of these models, we identify the 1,000 most important CpGs and perform enrichment analyses on them to expose associated biological pathways and transcription factor targets. We additionally explore how the architecture of an epigenetic aging clock – specifically how many hidden layers are present – influences model accuracy. Finally, we build a proof-of-concept, explainable deep learning models that connect DNA methylation sites annotated to genes to Reactome pathways or to transcription factors. These pathways and target sets are then used to estimate age, a feature that provides interpretability. All together, these findings further emphasize the usability of buccal data for epigenetic predictions.
Keywords: smoking status, Body Mass Index, alcohol intake, chronological age, aging clocks, DNA Methylation, Aging biomarkers, random forest
Received: 28 May 2025; Accepted: 29 Aug 2025.
Copyright: © 2025 Shokhirev and Johnson. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Maxim Shokhirev, Tally Health, New York, United States
Adiv Johnson, Tally Health, New York, United States
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.