AUTHOR=Jiang Minyun , Cai Na , Hu Juan , Han Lei , Xu Fanwei , Zhu Baoli , Wang Boshen TITLE=Genomic and algorithm-based predictive risk assessment models for benzene exposure JOURNAL=Frontiers in Public Health VOLUME=Volume 12 - 2024 YEAR=2025 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2024.1419361 DOI=10.3389/fpubh.2024.1419361 ISSN=2296-2565 ABSTRACT=AimIn this research, we leveraged bioinformatics and machine learning to pinpoint key risk genes associated with occupational benzene exposure and to construct genomic and algorithm-based predictive risk assessment models.Subject and methodsWe sourced GSE9569 and GSE21862 microarray data from the Gene Expression Omnibus. Utilizing R software, we performed an initial screen for differentially expressed genes (DEGs), which was followed by the enrichment analyses to elucidate the affected functions and pathways. Subsequent steps included the application of three machine learning algorithms for key gene identification, and the validation of these genes within both a cohort exposed to benzene and a benzene-exposed mice model. We then conducted a functional prediction analysis on these genes using four machine learning models, complemented by GSVA enrichment analysis.ResultsOut of the data, 40 DEGs were identified, primarily linked to cytokine signaling, lipopolysaccharide response, and chemokine pathways. NFKB1, PHACTR1, PTGS2, and PTX3 were pinpointed as significant through machine learning. Validation confirmed substantial changes in NFKB1 and PTX3 following exposure, with PTX3 emerging as paramount, suggesting its utility as a diagnostic biomarker for benzene damage.ConclusionRisk assessment models, informed by oxidative stress markers, successfully discriminated between benzene-injured patients and controls.