AUTHOR=Anaadumba Raphael , Bozkurt Yigit , Sullivan Connor , Pagare Madhavi , Kurup Pradeep , Liu Benyuan , Alam Mohammad Arif Ul TITLE=Graph neural network-based water contamination detection from community housing information JOURNAL=Frontiers in Environmental Engineering VOLUME=Volume 4 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/environmental-engineering/articles/10.3389/fenve.2025.1488965 DOI=10.3389/fenve.2025.1488965 ISSN=2813-5067 ABSTRACT=Introduction: Detecting water contamination in community housing is crucial for protecting public health. Early detection enables timely action to prevent waterborne diseases and ensures equitable access to safe drinking water. Traditional methods recommended by the Environmental Protection Agency (EPA) rely on collecting water samples and conducting lab tests, which can be both time-consuming and costly.Methods: To address these limitations, this study introduces a Graph Attention Network (GAT) to predict lead contamination in drinking water. The GAT model leverages publicly available municipal records and housing information to model interactions between homes and identify contamination patterns. Each house is represented as a node, and relationships between nodes are analyzed to provide a clearer understanding of contamination risks within the community.Results: Using data from Flint, Michigan, the model demonstrated higher performance compared to traditional methods. Specifically, the GAT achieved an accuracy of 0.80, precision of 0.71, and recall of 0.93, outperforming XGBoost, a classical machine learning algorithm, which had an accuracy of 0.70, precision of 0.66, and recall of 0.67.Discussion: In addition to its predictive capabilities, the GAT model identifies key factors contributing to lead contamination, enabling more precise targeting of at-risk areas. This approach offers a practical tool for policymakers and public health officials to assess and mitigate contamination risks, ultimately improving community health and safety.