Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Earth Sci.

Sec. Interdisciplinary Climate Studies

MULTIMODAL FUSIONS FOR DEFECT DETECTION OF PHOTOVOLTAIC PANELS BY MASK R-CNN AND HAWKFISH OPTIMIZATION ALGORITHM

Provisionally accepted
  • Altinbas Universitesi Mahmutbey Teknoloji Yerleskesi, Istanbul, Türkiye

The final, formatted version of the article will be published soon.

Accurate detection of photovoltaic (PV) module defects requires integrating information from multiple sensing modalities, as single-image approaches often fail under real-world variability. This paper proposes a multimodal segmentation framework that fuses RGB, infrared (IR), and electroluminescence (EL) imagery within a modified Mask R-CNN architecture. RGB provides structural and surface context, EL captures subsurface micro-cracks, and IR contributes complementary thermal signatures associated with hotspots, cell mismatch, and localized heating fault characteristics that cannot be observed in RGB or EL alone. A dedicated alignment pipeline based on homography and ECC refinement ensures geometric consistency across modalities, while a feature-level Fusion Attention Block adaptively combines modality-specific features. Hyperparameters and fusion weights are automatically tuned using the HawkFish Optimization Algorithm (HFOA), improving stability and segmentation accuracy. Experiments on statistically paired RGB–EL–IR datasets demonstrate that incorporating IR imagery significantly enhances the detection of thermally driven defects and reduces false negatives in low-contrast regions. The proposed framework achieves state-of-the-art performance, confirming the value of thermal information in multimodal PV inspection.

Keywords: Electroluminescence imaging, HawkFish Optimization Algorithm (HFOA), Hyperparameter optimization, infrared thermography, Instance segmentation, Mask R-CNN, multimodal fusion, Photovoltaic defect detection

Received: 30 Sep 2025; Accepted: 12 Dec 2025.

Copyright: © 2025 Almukhtar and Kurnaz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Nazar Almukhtar

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.