ORIGINAL RESEARCH article
Front. Mar. Sci.
Sec. Ocean Observation
Satellite-Based Machine Learning Models for Chlorophyll-a and TSS Retrieval in Abu Dhabi's Coastal Waters
Ali Ibrahim 1,2
Noura Alkarbi 1
Meera Alsaadi 1
Alyazyeh Almazrouei 1
Sara Alshamsi 1
Mohamed Abdelmoghny Hamouda 1,3
1. College of Engineering, United Arab Emirates University, AlAin, United Arab Emirates
2. College of Engineering, Zagazig University, Zagazig, Egypt
3. United Arab Emirates University National Water and Energy Center, Al Ain, United Arab Emirates
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
Effective estimation of water quality parameters is essential for the sustainability of the coastal ecosystem. This research examines the potential use of Sentinel-2 Satellite images with machine learning models to measure the concentrations of Cholorophyll-a (Chl-a) and Total Suspended Solids (TSS) in the coastal area of Abu Dhabi. Google Earth Engine (GEE) was utilized to obtain Sentinel-2 Level-2A surface reflectance values, which are collocated with the in situ data. Field measurements were obtained from various locations, with 365 and 196 available samples for Chl-a and TSS, respectively. The former had 165 collocated points, whereas the latter had only 77 points. For feature engineering, two strategies were compared: spectral indices from the literature and Principal Component Analysis (PCA) with raw bands. Four machine learning algorithms were examined to find the optimal model for each parameter by using 5-fold cross-validated hyperparameter tuning. The selected models are Random Forest Regression (RFR), Support Vector Regression (SVR), Extreme Gradient Boosting (XGB), and Partial Least Squares (PLS) Regression. For Chl-a, the analysis showed that a general model was limited by localized bloom events near coastal outlets. Creating a specialized "Ambient-Conditions" model by excluding these outliers greatly improved performance. The optimal Chl-a model (XGB with PCA on six bands) achieved the highest accuracy with Test R2 = 0.7 and Test RMSE of 1.62 µg/L, representing an 80% improvement in precision compared to the general model trained on the full dataset (Test R² = 0.65, RMSE = 8.21 µg/L). PCA + Random Forest (on 10 bands) was the optimal model for TSS, with R2 = 0.61, despite the small dataset size. The results demonstrated that merging machine learning and remote sensing is effective for retrieving Chl-a and TSS in challenging marine waters.
Summary
Keywords
chlorophyll-a, coastal water quality monitoring, machine learning, Principalcomponent analysis, Sentinel-2, total suspended solids
Received
14 January 2026
Accepted
16 February 2026
Copyright
© 2026 Ibrahim, Alkarbi, Alsaadi, Almazrouei, Alshamsi and Hamouda. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mohamed Abdelmoghny Hamouda
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.