ORIGINAL RESEARCH article

Front. Remote Sens.

Sec. Image Analysis and Classification

Volume 6 - 2025 | doi: 10.3389/frsen.2025.1599099

Efficient Vision Transformers with Edge Enhancement for Robust Small Target Detection in Drone-Based Remote Sensing

Provisionally accepted
Xuguang  ZhuXuguang Zhu1Zhizhao  ZhangZhizhao Zhang2*
  • 1College of Innovation and Practice, Liaoning Technical University, Fuxin, China
  • 2School of Software, Liaoning Technical University, Huludao, China

The final, formatted version of the article will be published soon.

Small object detection in UAV remote sensing imagery faces significant challenges due to scale variations, background clutter, and real-time processing requirements. This study proposes a lightweight transformer-based detector, MLD-DETR, which enhances detection performance in complex scenarios through multi-scale edge enhancement and hierarchical attention mechanisms. First, a Multi-Scale Edge Enhancement Fusion (MSEEF) module is designed, integrating adaptive pooling and edge-aware convolution to preserve target boundary details while enabling cross-scale feature interaction. Second, a Layered Attention Fusion (LAF) mechanism is developed, leveraging spatial depth-wise convolution and omnidirectional kernel feature fusion to improve hierarchical localization capability for densely occluded targets. Furthermore, a Dynamic Positional Encoding (DPE) module replaces traditional fixed positional embeddings, enhancing spatial perception accuracy under complex geometric perspectives through learnable spatial adapters. Combined with an Inner Generalized Intersection-over-Union (Inner-GIoU) loss function to optimize bounding box geometric consistency, MLD-DETR achieves 36.7% AP50 and 14.5% APs on the VisDrone2019 dataset, outperforming the baseline RT-DETR by 3.2% and 1.8% in accuracy while achieving 20% parameter reduction and maintaining computational efficiency suitable for UAV platforms equipped with modern edge computing hardware. Experimental results demonstrate the algorithm's superior performance in UAV remote sensing applications such as crop disease monitoring and traffic congestion detection, offering an efficient solution for real-time edge-device deployment.

Keywords: UAV, Drone-based remote sensing, RT-DETR, Small object detection, Multi-scale edge enhancement

Received: 24 Mar 2025; Accepted: 03 Jul 2025.

Copyright: © 2025 Zhu and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Zhizhao Zhang, School of Software, Liaoning Technical University, Huludao, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.