Skip to main content

ORIGINAL RESEARCH article

Front. Remote Sens.
Sec. Acoustic Remote Sensing
Volume 5 - 2024 | doi: 10.3389/frsen.2024.1384562

Revised clusters of annotated unknown sounds in the Belgian part of the North Sea Provisionally Accepted

  • 1Flanders Marine Institute, Belgium
  • 2Ghent University, Belgium
  • 3Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research (AWI), Germany

The final, formatted version of the article will be published soon.

Receive an email when it is updated
You just subscribed to receive the final version of the article

Acoustic signals, especially those of biological source, remain unexplored in the Belgian part of the North Sea (BPNS). The BPNS, although dominated by anthrophony (sounds from human activities), is expected to be acoustically diverse given the presence of biodiverse sandbanks, gravel beds and artificial hard structures. Under the framework of the LifeWatch Broadband Acoustic Network, sound data have been collected since the spring of 2020. These recordings, encompassing both biophony, geophony and anthrophony, have been listened to and annotated for unknown, acoustically salient sounds. To obtain the acoustic features of these annotations, we used two existing automatic feature extractions: the Animal Vocalization Encoder based on Self-Supervision (AVES) and a convolutional autoencoder network (CAE) retrained on the data from this study. An unsupervised density-based clustering algorithm (HDBSCAN) was applied to predict clusters. We coded a grid search function to reduce the dimensionality of the feature sets and to adjusttune the hyperparameters of HDBSCAN. We searched the hyperparameter space for the most optimized combination of parameter values based on two selected clustering evaluation measures: the homogeneity and the density-based clustering validation (DBCV) scores. Although both feature sets produced meaningful clusters, AVES feature sets resulted in more solid, homogeneous clusters with relatively lower intra-cluster distances, appearing to be more advantageous for the purpose and dataset of this study. The 26 final clusters we obtained were revised by a bioacoustics expert. , of which wWe were able to name and describe 10 unique sounds, but only clusters named as 'Jackhammer' and 'Tick' can be interpreted as biological with certainty. Although unsupervised clustering is conventional in ecological research, we highlight its practical use in revising clusters of annotated unknown sounds. The revised clusters we detailed in this study already define a few groups of distinct and recurring sounds that could serve as a preliminary component of a valid annotated training dataset potentially feeding supervised machine learning and classifier models.

Keywords: unsupervised, Training dataset, unknown soundscape, Aves, Autoencoder, Grid search, annotation, Bioacoustic

Received: 09 Feb 2024; Accepted: 22 Apr 2024.

Copyright: © 2024 Calonge, Parcerisas, Schall and Debusschere. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Ms. Arienne Calonge, Flanders Marine Institute, Ostend, Belgium