ORIGINAL RESEARCH article

Front. Mar. Sci.

Sec. Ocean Observation

Volume 12 - 2025 | doi: 10.3389/fmars.2025.1524134

This article is part of the Research TopicAdvanced Monitoring, Modelling, and Analysis of Coastal Environments and EcosystemsView all 29 articles

AviaryMOT: Aviary Attention-Based Adaptive Multi-Object Tracking of Cranes and Storks in Wetlands

Provisionally accepted
Chang  LiuChang Liu1*Xuran  MaXuran Ma2Jiahong  ZhouJiahong Zhou2Nini  SunNini Sun2Hengming  LiuHengming Liu1
  • 1Beijing Information Science and Technology University, Beijing, China
  • 2Shandong Changdao National Nature Reserve, Shandong, China

The final, formatted version of the article will be published soon.

This study focuses on tracking cranes and storks to aid in wetland ecological protection.Multi-target tracking of these birds presents challenges such as frequent occlusions, sudden appearances, and disappearances. To tackle these issues, we propose a novel multi-target tracking algorithm, AviaryMOT, which utilizes a fusion technique that combines shallow and deep features to enhance tracking accuracy and effectiveness. We construct a dataset, BirdTrack, for cranes and storks tracking. In the detecting stage, we proposed Aviary Attention to effectively capture the features of birds, by integrating the Coordinate Attention into the YOLOv8 framework and applying Soft-NMS to improve detection in occluded scenarios. In the tracking stage, the BYTE data association method effectively utilizes similarities between low-score detection boxes and tracking trajectories, enabling the identification of true objects and filtering out background noise. Experimental results show that our method outperforms the state-of-art models, maintaining stable target trajectories while ensuring high-quality detection.

Keywords: multiple object tracking, Aviary Attention, YOLOv framework, ByteTrack, Wetlands protection

Received: 07 Nov 2024; Accepted: 06 May 2025.

Copyright: © 2025 Liu, Ma, Zhou, Sun and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Chang Liu, Beijing Information Science and Technology University, Beijing, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.