ORIGINAL RESEARCH article

Front. Mar. Sci.

Sec. Marine Ecosystem Ecology

Volume 12 - 2025 | doi: 10.3389/fmars.2025.1613565

This article is part of the Research TopicIntelligent Multi-scale Big Data Mapping of Coastal HabitatsView all 4 articles

Enhanced Hyperspectral Image Classification for Coastal Wetlands Using a Hybrid CNN-Transformer Approach with Cross-Attention Mechanism

Provisionally accepted
Zhongmei  LiZhongmei Li1,2Tang  LiuTang Liu1*Yuxiang  LuYuxiang Lu1Jing  TianJing Tian2Meng  ZhangMeng Zhang2Chenghu  ZhouChenghu Zhou1
  • 1Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences (CAS), Beijing, China
  • 2Beijing Institute of Remote Sensing Information, Beijing, China

The final, formatted version of the article will be published soon.

Coastal wetlands play a vital role in shoreline protection, material cycling, and biodiversity conservation. Utilizing hyperspectral remote sensing technology for wetland monitoring can enhance scientific management of these ecosystems. However, the complex water-land interactions and vegetation mixtures in wetlands often lead to significant spectral confusion and complicated spatial structures, posing challenges for fine classification. This paper proposes a novel hyperspectral image classification method that combines the strengths of Convolutional Neural Networks (CNNs) for local feature extraction and Transformers for modeling long-range dependencies. The method utilizes both 3D and 2D convolution operations to effectively capture spectral and spatial features of coastal wetlands. Additionally, dual-branch Transformers equipped with cross-attention mechanisms are employed to explore deep features from multiple perspectives and model the interrelationships between various characteristics. Comprehensive experiments conducted on two typical coastal wetland hyperspectral datasets demonstrate that the proposed method achieves an overall accuracy (OA) of 96.52% and 85.72%, surpassing other benchmarks by 1.0-8.64%. Notably, challenging categories such as mudflats and mixed vegetation area benefit significantly. This research provides valuable insights for the application of hyperspectral imagery in coastal wetland classification.

Keywords: Convolutional Neural Network, transformer, Cross Attention Mechanism, hyperspectral image classification, Coastal wetland classification

Received: 17 Apr 2025; Accepted: 09 Jun 2025.

Copyright: © 2025 Li, Liu, Lu, Tian, Zhang and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Tang Liu, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences (CAS), Beijing, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.