ORIGINAL RESEARCH article

Front. Mar. Sci.

Sec. Marine Megafauna

Volume 12 - 2025 | doi: 10.3389/fmars.2025.1232022

Machine learning to predict killer whale (Orcinus orca) behaviors using partially labeled vocalization data

Provisionally accepted
  • Shady Side Academy, Pittsburgh, United States

The final, formatted version of the article will be published soon.

Orcinus orca (killer whales) exhibit complex calls. In a call, an orca typically varies the frequencies, varies the length, varies the temporal patterns, varies their volumes, and can use multiple frequencies simultaneously. Behavior data is hard to obtain because orcas live under water and travel quickly. Sound data is relatively easy to capture. This paper studies whether machine learning can predict behavior from vocalizations. Such prediction would help scientific research and have safety applications because one would like to predict behavior while only having to capture sound. A significant challenge in this process is lack of labeled data. This paper works with recent recordings of McMurdo Sound orcas [Wellard et al., 2020a] where each recording is labeled with the behaviors observed during the recording. This yields a dataset where sound segments-continuous vocalizations that can be thought of as call sequences or more general structures-within the recordings are labeled with potentially superfluous behaviors. This is because in a given segment, an orca may not be exhibiting all of the behaviors that were observed during the recording from which the segment was taken. Despite that, with a careful combination of recent machine learning techniques, including a ResNet-34 convolutional neural network and a custom loss function designed for partially labeled learning, a 96.1% general behavior label classification accuracy on previously unheard segments is achieved. This is promising for future research on orca behavior as well as language and safety applications.

Keywords: ORCA, vocalization, Calls, Behavior prediction, machine learning, partially labeled learning, Language, semantics

Received: 31 May 2023; Accepted: 20 May 2025.

Copyright: © 2025 Sandholm. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Sophia Sandholm, Shady Side Academy, Pittsburgh, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.