You're viewing our updated article page. If you need more time to adjust, you can return to the old layout.

ORIGINAL RESEARCH article

Front. Mar. Sci.

Sec. Ocean Observation

Satellite-Based Machine Learning Models for Chlorophyll-a and TSS Retrieval in Abu Dhabi's Coastal Waters

  • 1. College of Engineering, United Arab Emirates University, AlAin, United Arab Emirates

  • 2. College of Engineering, Zagazig University, Zagazig, Egypt

  • 3. United Arab Emirates University National Water and Energy Center, Al Ain, United Arab Emirates

The final, formatted version of the article will be published soon.

Abstract

Effective estimation of water quality parameters is essential for the sustainability of the coastal ecosystem. This research examines the potential use of Sentinel-2 Satellite images with machine learning models to measure the concentrations of Cholorophyll-a (Chl-a) and Total Suspended Solids (TSS) in the coastal area of Abu Dhabi. Google Earth Engine (GEE) was utilized to obtain Sentinel-2 Level-2A surface reflectance values, which are collocated with the in situ data. Field measurements were obtained from various locations, with 365 and 196 available samples for Chl-a and TSS, respectively. The former had 165 collocated points, whereas the latter had only 77 points. For feature engineering, two strategies were compared: spectral indices from the literature and Principal Component Analysis (PCA) with raw bands. Four machine learning algorithms were examined to find the optimal model for each parameter by using 5-fold cross-validated hyperparameter tuning. The selected models are Random Forest Regression (RFR), Support Vector Regression (SVR), Extreme Gradient Boosting (XGB), and Partial Least Squares (PLS) Regression. For Chl-a, the analysis showed that a general model was limited by localized bloom events near coastal outlets. Creating a specialized "Ambient-Conditions" model by excluding these outliers greatly improved performance. The optimal Chl-a model (XGB with PCA on six bands) achieved the highest accuracy with Test R2 = 0.7 and Test RMSE of 1.62 µg/L, representing an 80% improvement in precision compared to the general model trained on the full dataset (Test R² = 0.65, RMSE = 8.21 µg/L). PCA + Random Forest (on 10 bands) was the optimal model for TSS, with R2 = 0.61, despite the small dataset size. The results demonstrated that merging machine learning and remote sensing is effective for retrieving Chl-a and TSS in challenging marine waters.

Summary

Keywords

chlorophyll-a, coastal water quality monitoring, machine learning, Principalcomponent analysis, Sentinel-2, total suspended solids

Received

14 January 2026

Accepted

16 February 2026

Copyright

© 2026 Ibrahim, Alkarbi, Alsaadi, Almazrouei, Alshamsi and Hamouda. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mohamed Abdelmoghny Hamouda

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Share article

Article metrics