Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Machine Learning and Artificial Intelligence

This article is part of the Research TopicApplication of AI/ML for Predictive Modeling in Waste-based Binder Performance in sustainable concreteView all articles

A Hybrid Deep Boosting Framework with Adaptive Label Stabilization for SEM-Based Porosity Estimation in Fly-Ash Cement Mortar

Provisionally accepted
Abdullah  abdullahAbdullah abdullahMuhammad Ateeb  AtherMuhammad Ateeb AtherJosé Luis  Oropeza RodríguezJosé Luis Oropeza Rodríguez*Carlos Guzmán  Sánchez-MejoradaCarlos Guzmán Sánchez-Mejorada*
  • Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC-IPN), Mexico City, Mexico

The final, formatted version of the article will be published soon.

Accurate Accurate measurements of porosity of cementitious matrices are critical in predicting mechanical behavior, durability, and transportation processes. Traditional methods based on SEM, such as manual thresholding, a simple binarization method, and end-to-end convolutional neural network (CNN) regressors, are, however, highly affected by image contrast variation, polishing quality, magnification, and operator bias. To address these limitations, the current article develops a hybrid deep-boosting framework for fully automatic porosity estimation directly from raw backscattered-electron SEM images of fly-ash cement mortar. The key novelty of the proposed approach lies in the adaptive stabilization of porosity labels and the hybrid fusion of deep semantic and handcrafted texture features, which together improve robustness to imaging artifacts, boundary ambiguity, and overfitting. Annotation ground-truth porosity is optimized using an Adaptive Porosity Label Stabilizer (APLS) that successively improves Otsu threshold masks, first using entropy measures and morphological consistency measures to reduce label noise. Multiscale semantic representations are learned on a ResNet-18 backbone, which is trained with SimCLR on SEM data, while local statistical texture is captured using handcrafted gray-level co-occurrence map (GLCM) features. The resulting mismatched set of features is combined with a learnable Hybrid Feature Refinement Block (HFRB) together with a Feature-Interaction Attention (FIA) block, which explicitly characterizes inter-scale relationships among convolutional channels and texture regressors. The latent representation is then condensed and regressed using a weighted ensemble including CatBoost, XGBoost, and LightGBM learners. The proposed methodology achieves R² = 0.9816, RMSE = 0.0236, and MAE = 0.00875 on a rigorously held-out test set, outperforming baseline methods that rely exclusively on CNN features, handcrafted descriptors, or naïve hybrid combinations. The validity, stability, and physical plausibility of the model are ensured through a comprehensive assessment, including ablation studies, domain-shift experiments, uncertainty and stability calibration, and a hybrid explainability framework (Grad-CAM++, SHAP). The architecture does not require any manual segmentation, generalizes across magnifications and imaging conditions, and provides transparent, domain-consistent explanatory visualizations. Overall, the proposed framework represents an important step toward fast, reliable, and scalable SEM-based porosity estimation in cementitious systems.

Keywords: Cementitious materials, Feature–interaction attention, Gradient boosting ensemble, Hybrid deep learning, materials informatics, Porosity estimation, Self-supervised learning, SEM image analysis

Received: 12 Dec 2025; Accepted: 29 Jan 2026.

Copyright: © 2026 abdullah, Ather, Rodríguez and Sánchez-Mejorada. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
José Luis Oropeza Rodríguez
Carlos Guzmán Sánchez-Mejorada

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.