Vocal behavior of false killer whale (Pseudorca crassidens) acoustic subgroups

Understanding the vocal behavior of cetaceans is an important component of many passive acoustic applications. This study quantifies the vocal behavior of acoustic subgroups of false killer whales (Pseudorca crassidens) from the Hawaiian Archipelago. The acoustic subgroups (N = 523) exhibit diverse vocal behavior that varies between encounters. Overall, 29% of acoustic subgroups only echolocate, 16% only whistle, and 55% emit both types of vocalizations. These results contribute important information for developing automated passive acoustic cetacean tracking, localization, and classification techniques, and thus, support future cetacean monitoring and assessment efforts.


Introduction
Passive acoustic monitoring has many applications in marine mammal research, including density estimation for management and conservation purposes (Van Parijs et al., 2009;Thomas and Marques, 2012;Hildebrand et al., 2019;Martin et al., 2020;Barlow et al., 2022). Although density estimation for marine mammals is primarily performed using visual distance sampling or capture-recapture methods, it is often advantageous to use acoustic data to estimate densities, especially with elusive species (Moretti et al., 2010;Martin et al., 2013). A variety of approaches exist to estimate animal density from passive acoustic data (see Ref (Thomas and Marques, 2012). and references within). Variables quantified to reach estimates are both study-and species-dependent since types and rates of vocalizations in marine mammal species vary as a function of the animal behavior, group size, time of day, season, year, and location (Henderson et al., 2012;Thomas and Marques, 2012). For many species, there is limited knowledge of the basic vocal behavior that is necessary for estimating vocalization rates, thus impeding the use of passive acoustic data for density estimation (Thomas and Marques, 2012;Bradford et al., 2020).
The goal of this study is to report the first insights into the vocal behavior of false killer whales (Pseudorca crassidens) from around the Hawaiian Archipelago. False killer whales are highly vocal and produce echolocation clicks, whistles, and burst pulses (Au et al., 1995;Murray et al., 1998;Oswald et al., 2007;Baumann-Pickering et al., 2015). Echolocation clicks and whistles are the predominant types of vocalizations, and burst pulses rarely occur (McCullough et al., 2021). Although the acoustic characteristics of these signals have been described (Au et al., 1995;Murray et al., 1998;Oswald et al., 2007;Baumann-Pickering et al., 2015) and classification schemes proposed (Barkley et al., 2019;McCullough et al., 2021), the vocal behavior and rate of production of these signals in wild populations are unknown.
Three distinct populations of false killer whales are found in the Hawaiian Archipelago, with one currently listed as endangered under the U.S. Endangered Species Act (Baird, 2009;Oleson et al., 2010;Bradford et al., 2020). These delphinids are known to interact with fisheries throughout the archipelago leading to whale mortality or injury (Baird, 2009;Oleson et al., 2010) at sometimes unsustainable rates. Thus, these populations have been a focus of extensive survey efforts to collect visual data for abundance estimation citepBradford2020. However, their elusive nature, dispersed aggregations of subgroups, and potential attraction to the research vessel introduce biases to visual-based abundance estimates (Bradford et al., 2014). This study aims to improve the understanding of false killer whale behavior by reporting the first analysis of the vocal behavior from false killer whale acoustic subgroups. An acoustic subgroup refers to a small acoustically tracked group of false killer whales swimming closely together within a larger, spatially dispersed group. Since acoustic subgroups are defined using localization data from towed hydrophone arrays, they collectively capture the overall group's relative behavior through space and time. Unlike visually sighted subgroups (Bradford et al., 2014), the number of individuals within an acoustic subgroup could not be accurately estimated in the current study due to the small spacing between sensors in the hydrophone array and a lack of data on false killer whale vocalization statistics such as rate of vocalization per animal. However, quantifying acoustic subgroup vocal behavior provides additional information about the overall group behavior to complement the visual data, contribute knowledge to address biases, and support future passive acoustic density estimation efforts.

Methods
The data were collected during a joint visual and acoustic shipbased survey of Hawaiian waters (Hawaiian Islands Cetacean and Ecosystem Assessment Survey-HICEAS) from July to December of 2017, using two research vessels (Yano et al., 2018). Recordings were collected using a multi-channel towed hydrophone array that sampled at 500 kHz. A detailed description of the survey and the equipment can be found in Ref (Yano et al., 2018)., and a portion of the acoustic recordings are publicly available at Ref (NOAA Pacific Islands Fisheries Science Center, 2022).. Briefly, the Pacific Island Fisheries Science Center's protocol for surveying false killer whales consists of two phases (Yano et al., 2018). Phase I is an on-effort trackline passing mode where a vessel travels in a straight line, and Phase II is an off-effort mode where a vessel does not travel in a straight line (Yano et al., 2018). During Phase II, boat movements are erratic preventing consistent acoustic tracking with the hydrophone array. Therefore, only acoustic recordings from Phase I were used in the present study.
Recordings from the first and the last hydrophone of the towed array system were used for the analysis in order to maximize the sensor separation and thus the spatial resolution (Gruden et al., 2021). Sensor separation for each encounter is listed in Table  reftable: encounters. The data were pre-processed as described in Ref (Gruden et al., 2021). to obtain cross-correlograms and examples can be seen in Supplementary Material 1 . Crosscorrelograms allow multiple spatially separated sources to be tracked and localized by tracking the changes in time-differenceof-arrival (TDOA) of an emitted signal between two sensors for each source (or a group of closely spaced sources, i.e., acoustic subgroup). Briefly, two types of cross-correlograms were created: one based on clicks and the other based on whistles. Burst pulses were not considered separately in this study since they occur rarely (McCullough et al., 2021) and have typically low amplitude. The burst pulses that had amplitudes comparable to echolocation clicks were considered in the click-based cross-correlograms due to their impulsive nature.
Prior to constructing cross-correlograms for this study, the data were pre-processed as follows. To construct a click-based crosscorrelogram, the data were first bandpass filtered using a 4th-order Butterworth filter with cut-off frequencies of 8 and 30 kHz (Gruden et al., 2021). To construct a whistle-based cross-correlogram, the data were first processed with a three-stage click removal process which included a 4th-order Butterworth bandpass filter with cut-off frequencies of 2.5 and 12 kHz (Gruden et al., 2021). Crosscorrelograms in both cases were then computed based on the generalized cross-correlation smoothed coherence transform (GCC-SCOT) with a 1 s long sliding window and 50% overlap (Gruden et al., 2021). Cross-correlograms were normalized and measurements were extracted by applying a threshold of 3.7 (Gruden et al., 2021). Measurements consisted of TDOA and amplitude of the cross-correlation function information. After measurement extraction, the measurements from both types of cross-correlograms were combined so that analysis was performed on joint information.
From the combined measurements, the experienced manual analyst created tracks of individual acoustic subgroups of false killer whales by linking together slowly changing consecutive TDOAs that had a high amplitude of the cross-correlation function. From here on, a track is thus defined as a set of TDOAs for each acoustic subgroup connected into a trajectory. This resulted in three types of tracks: tracks based on whistles, echolocation clicks, and both whistles and clicks. Note that one track can be a result of one or multiple animals as it is not possible to differentiate due to limited spatial resolution of the measurements (Gruden et al., 2021). Moreover, tracks include periods of silence (animals are present but not vocalizing and move during this time),and these were investigated in terms of vocal activity as follows.
Vocal activity of acoustic subgroups was investigated by examining vocal and silent periods within each track (an example is shown in Supplementary Material 2 ). A vocal period is defined as a period during which two consecutive TDOAs in the same track are separated by less than 21 s. This threshold was determined by examining the manually annotated tracks, and 98% of all consecutive TDOAs within a given track were separated by less than that value. A silent period is any period in a given track in which consecutive TDOAs are separated by more than 21 s. Further, for each track, two states were defined: vocal and silent. Vocal state is a sum of all vocal periods in a given track divided by the track duration, and silent state is a sum of all silent periods in a given track divided by the track duration.

Results
Nine visually confirmed, single species acoustic encounters of false killer whales, recorded on eight separate days, were investigated with TDOA tracks identified (Table 1, map of the encounters is shown in Supplementary Material 3 ). Two of the encounters occurred on the same day and time (Encounters 1 and 9, Table 1), but they were collected at two different locations (see map in the Supplementary Material), by concurrent research vessels, and are thus considered independent. Overall, 523 tracks were identified in 749 min (total duration of all encounters), with median overall track duration of 3.9 min (interquartile range 9.3 min). Of these tracks, 29% consisted of only echolocation clicks, 16% only whistles, and 55% both clicks and whistles. While there was a variation in the proportion of a given track type between encounters, tracks consisting of both clicks and whistles occurred significantly more frequent in all encounters (Kruskall-Wallis test, p <0.001, Figure 1A).
The proportion of a given track type varied between encounters ( Figure 1B). Track type consisting of both clicks and whistles were predominant in 7 out of 9 encounters, but the proportion of the other two track types varied between encounters ( Figure 1B).
Track duration varied between encounters (Table 1). Although the overall duration of encounters was long, the individual acoustic subgroup tracks were short (median overall track duration of 3.9 min). Track duration also varied between different track types within the same encounter ( Figure 2). The track type consisting of both clicks and whistles was significantly longer compared to the other two track types in most encounters (Kruskall-Wallis test, p ≤ 0.01 for encounters 2-4, 6-9, and "All").
Further, the overall vocal activity of each encounter was investigated. False killer whale subgroups spent significantly more time being vocal than silent for all encounters except Encounter 1 (Kruskall-Wallis test, p <0.001, Figure 3A). Vocal activity was also investigated for each track type overall, and subgroups spent significantly more time being vocal than silent for all track types (Kruskall-Wallis test, p <0.001, Figure 3B).

Discussion
False killer whales were shown to exhibit diverse vocal behavior within the same encounter and between encounters. Acoustic subgroup tracks that contained both whistles and clicks were significantly more common than tracks based on either clicks or whistles alone (Figure 1). Traditionally, tracking of groups of animals from ship-based surveys is carried out separately for clicks and whistles, but the results of this study show the need to track on joint information from clicks and whistles to obtain more accurate information about an encounter.
Overall, tracks containing both clicks and whistles were significantly longer compared to the other two track types (Figure 2), which could give the subgroups of animals that exhibit such behavior a better chance of being detected, tracked, and localized (Gruden et al., 2021). This should be taken into consideration for applications that infer parameters based on acoustically localized subgroups, such as abundance and density estimation, and for applications where detecting all subgroups of animals is important. For example, if this variation in detectability (and thus availability) is persistent among subgroups, any resulting data set violates assumptions of independence and could lead to large biases in density and abundance estimates if left unaccounted for in the modeling process (Carothers, 1973;Wilbur and Landwehr, 1974). Note that there were outliers in track duration for all track types in many encounters, which could be due to the variability at the individual level ( Figure 2).
The proportions of track types within a given encounter varied likely due to the acoustic subgroup's behavioral state and the total number of animals present along with factors affecting signal propagation (such as distance and sea state). Moreover, the differences in the beam pattern between clicks and whistles (Au et al., 1995), could also have affected detection ranges and contributed to differences in the proportions. Data on the behavioral state of the acoustic subgroups (e.g., feeding, traveling, socializing) are not available, since the majority of the visual encounters occur in Phase II of the survey, which often takes place hours after the acoustic tracking conducted in Phase I of the survey. Due to elusive surface behavior of this species, large proportions of the acoustic subgroups tend to be missed by visual observers in the Phase I of the survey (Yano et al., 2018). Thus, linking the subgroups from acoustically tracked subgroups in Phase I to visually detected subgroups in Phase II is not reliable.
However, since the data were collected across multiple months and over a large geographic area, the data likely capture a representative collection of different behavioral states. While the number of animals in each acoustic subgroup are also not available, for the reasons discussed above, track density (the number of tracks per encounter, or the number of tracks per unit time) could be used as an approximation of the overall group size in the encounter. Future studies should investigate these issues further.
When vocal activity was investigated for each encounter, it was observed that the animals spent more time in vocal state than silent in the majority of encounters ( Figure 3A). This was true also for the individual track types ( Figure 3B). This is a rough estimate, since vocal states still include periods of silence between consecutive TDOAs, albeit shorter than 21 s. The periods of silence within each vocal state can still cause the automated methods to detect tracks in segments (track fragmentation) (Gruden et al., 2021). In the present study, tracks were manually annotated to remove the fragmentation resulting from automated methods, to investigate the proportion of time subgroups spent vocalizing. However, as in any manual approach, this leads to a certain degree of subjectivity in track assignments. In order to make results more objective and processing time more efficient, future efforts will focus on adapting the methods in Ref (Gruden et al., 2021). to obtain these behavioral parameters automatically.
Understanding the vocal behavior (track type composition, duration, and vocal state) of a given species is important not only for detection, tracking, and localization, but can also contribute to classification efforts. Adding contextual information, such as timing between calls or spatial positioning, has previously improved classification performance (Roch et al., 2018;Madhusudhana et al., 2021). The Hawaiian waters are home to other cetacean species with acoustic characteristics similar to false killer whales, such as short-finned pilot whales (Globicephala macrorhynchus) Date and time indicate start of the encounter reported in GMT. Encounter duration is the duration of the analysed encounter and corresponds to Phase I of the survey protocol (Yano et al., 2018). Track duration is reported as a median per encounter with 25th and 75th percentiles in the brackets.

FIGURE 1
Proportion of a given track type (Clicks/Whistles/Both) in all encounters collectively (A) and in each individual encounter (B). On each box in (A) the central line indicates the median, the bottom and top edges of the box indicate 25th and 75th percentiles, respectively. The whiskers in (A) extend to most extreme data points not considered outliers. Statistically significant results are denoted with an asterisk. The numbers above each bar in (B) denote the number of a given track type in that encounter. (McCullough et al., 2021). Future studies could utilize the methods outlined in this paper to quantify the vocal behavior of these species and identify important features that may improve acoustic classification between species. Similarly, the proposed methods may be used to investigate the vocal behavior of the three distinct populations of false killer whales. Previous classification work did not find reliable acoustic features of their whistles to distinguish between the populations (Barkley et al., 2019), but including aspects of their vocal behavior as an additional feature may improve classification models and thus, aid conservation efforts. This study showed that subgroups of false killer whales varied in their vocal behavior by examining time-difference-of-arrivals of the vocalizations between sensors in a towed hydrophone array. Most subgroups produced both whistles and echolocation clicks, although some subgroups only produced a single vocalization type. The tracks from subgroups emitting both clicks and whistles were longer in duration compared to the other two track types, increasing the availability of such subgroups for detection and tracking. Methods presented in this study may be applied to additional survey data sets collected in Hawaiian waters to increase the sample size and allow for further consideration of factors affecting vocal behavior, such as spatiotemporal variables. The findings of this study contribute to tracking, localization, classification efforts, and acoustic cue rate studies, and thus A B

FIGURE 3
Track proportion spent in a given state, vocal or silent, (A) for each encounter and for all encounters collectively, (B) for each track type collectively. On each box the central line indicates the median, the bottom and top edges of the box indicate 25th and 75th percentiles, respectively. The whiskers extend to most extreme data points not considered outliers, and outliers are indicated as black dots. Statistically significant results are denoted with an asterisk. Track duration per encounter and for all encounters collectively. On each box the central line indicates the median, the bottom and top edges of the box indicate 25th and 75th percentiles, respectively. The whiskers extend to most extreme data points not considered outliers, and outliers are indicated as black dots. Note, some of the longer duration outliers are omitted for easier readability. Statistically significant results are denoted with an asterisk.
provide data to inform future species monitoring and assessment efforts.

Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.