ORIGINAL RESEARCH article
A Novel Framework to Predict Relative Habitat Selection in Aquatic Systems: Applying Machine Learning and Resource Selection Functions to Acoustic Telemetry Data From Multiple Shark Species
- 1Department of Environmental Conservation, University of Massachusetts Amherst, Amherst, MA, United States
- 2Wetland and Aquatic Research Center, United States Geological Survey, Davie, FL, United States
- 3The Nature Conservancy, Narragansett, RI, United States
- 4National Park Service, Key West, FL, United States
- 5Office of Habitat Conservation, NOAA Fisheries, Silver Spring, MD, United States
- 6National Park Service, Christiansted, US Virgin Islands
- 7Massachusetts Division of Marine Fisheries, New Bedford, MA, United States
Resource selection functions (RSFs) have been widely applied to animal tracking data to examine relative habitat selection and to help guide management and conservation strategies. While readily used in terrestrial ecology, RSFs have yet to be extensively used within marine systems. As acoustic telemetry continues to be a pervasive approach within marine environments, incorporation of RSFs can provide new insights to help prioritize habitat protection and restoration to meet conservation goals. To overcome statistical hurdles and achieve high prediction accuracy, machine learning algorithms could be paired with RSFs to predict relative habitat selection for a species within and even outside the monitoring range of acoustic receiver arrays, making this a valuable tool for marine ecologists and resource managers. Here, we apply RSFs using machine learning to an acoustic telemetry dataset of four shark species to explore and predict species-specific habitat selection within a marine protected area. In addition, we also apply this RSF-machine learning approach to investigate predator-prey relationships by comparing and averaging tiger shark relative selection values with the relative selection values derived for eight potential prey-species. We provide methodological considerations along with a framework and flexible approach to apply RSFs with machine learning algorithms to acoustic telemetry data and suggest marine ecologists and resource managers consider adopting such tools to help guide both conservation and management strategies.
Habitat loss and degradation are two of the largest drivers of loss in global biodiversity (Hoekstra et al., 2005), making identifying important habitats critical for resource managers to prioritize habitat protection for species of concern (Morris and Kingston, 2002; Chetkiewicz and Boyce, 2009; Heinrichs et al., 2017). Habitat selection is driven by the physical, chemical, and biological composition and condition of an area that is occupied by a given animal (Block and Brennan, 1993). Thus, behavioral choices related to selection are ultimately determined by a wide range of coupled and uncoupled abiotic and biotic factors, such as energetic demands and tradeoffs from foraging opportunities, predation risk, and competition (Rosenzweig, 1974; Craig and Crowder, 2002). Understanding how species select habitats across heterogeneous landscapes provides key information regarding occupancy patterns that contribute to survival and reproductive success (Kramer et al., 1997; McGarigal et al., 2016). Such information could then be used to identify, protect, and restore specific ecologically valuable habitats and corridors (Kramer and Chapman, 1999; Beier et al., 2008; Fraschetti et al., 2009; Zeller et al., 2017).
Resource selection functions (RSFs), defined as a function that produces values that are proportional to the probability of use by an animal (Manly et al., 2007), are a popular method to determine and predict relative habitat selection by animals (e.g., Nielsen et al., 2003; Johnson et al., 2004; Ciarniello et al., 2007). These functions evaluate the relationships between resource use (i.e., the units of area selected by an animal) and the environmental characteristics associated with each unit of area (Boyce et al., 2002). Animal spatial data, from sources such as telemetry, can be incorporated into RSFs to define the relative habitat selection strengths among animal space use and a given set of environmental covariates, such as habitat type, substrate, elevation, or water depth (Boyce and McDonald, 1999). When the true absences are unknown, as generated by presence only data derived from sources such as telemetry approaches, RSFs are implemented within a use/availability framework where known presences (1) are compared with a random sample across ‘available’ resource units, also known as pseudo-absences or background points (0) (Boyce, 2006; Pearce and Boyce, 2006). Alternative to use/availability (e.g., from telemetry), data from observations collected from survey methods, often without timestamps, are typically referred to as presence-background and are fitted as species distribution models (Fieberg et al., 2018). Using RSFs to derive the relative probability of selection, rather than the absolute probability (see Lele et al., 2013; Avgar et al., 2017), telemetry data are then typically fitted using logistic regression models (Johnson et al., 2006; Manly et al., 2007) or, as of more recently, with machine learning algorithms [e.g., random forest (RF), boosted regression trees] (Shoemaker et al., 2018; Heffelfinger et al., 2020).
While RSFs have been largely applied in terrestrial ecology, such as with wolves (Ordiz et al., 2020), birds (Meager et al., 2012), grizzly bears (McLoughlin et al., 2002), and deer (Godvik et al., 2009), the application of RSFs within aquatic environments has been limited comparatively, likely due to technological challenges related to continuously tracking animals through water (Hussey et al., 2015). Today, passive acoustic telemetry has become one of the most common practices to quantify aquatic animal space use (Cooke et al., 2004; Donaldson et al., 2014; Hussey et al., 2015). This technique involves tagging an animal with an acoustic transmitter that periodically emits an ultrasonic ping with a unique identification number (ID code). When in range and with sufficient detection efficiency the ping is detected by an acoustic receiver that registers both the unique ID code and the time the transmitter was detected (Hussey et al., 2015). Depending on the scope and extent of both research questions and available funding, acoustic receivers are strategically arranged in fixed locations with either non-overlapping detection ranges (Heupel et al., 2006; Brownscombe et al., 2019b), or with overlapping detection ranges that can produce high resolution positioning estimates of space use (Espinoza et al., 2011). While both methods are limited to the available detection coverage (presence only data), the former is often used to examine space use across a given study area at much larger spatial extents (Carlisle et al., 2019) and, thus, is well catered to exploring relative habitat selection.
Although the application of RSFs in combination with acoustic telemetry has been limited (see Freitas et al., 2016; Harrison et al., 2016; Gutowsky et al., 2017; Selby et al., 2019; Griffin et al., 2020), much needed information on animal habitat selection in the marine environment can be derived. For example, Selby et al. (2019) and Griffin et al. (2020) applied RSFs to acoustic telemetry data for hawksbill (Eretmochelys imbricata) and juvenile green turtle (Chelonia mydas), respectively, from St. Croix, United States Virgin Islands, and determined that the size and extent of a marine protected area (MPA) being used by these sea turtles was sufficient to meet conservation goals. In addition to providing insights on potential drivers of relative habitat selection, RSFs were also extended to predict movements in areas that did not have acoustic receivers to provide potential locations where fine-scale habitat protection may be further prioritized (Griffin et al., 2020). Such results have important implications within marine environments especially for resource managers seeking to incorporate animal movement data to generate effective conservation strategies (Cooke, 2008; Knip et al., 2012; Allen and Singh, 2016; Hays et al., 2016, 2019; Lea et al., 2016). Since management and conservation efforts often rely on spatial management techniques (Peel and Lloyd, 2004; Sequeira et al., 2019), including MPAs (Gell and Roberts, 2003; Lubchenco et al., 2003; Gleason et al., 2010; Lubchenco and Grorud-Colvert, 2015; Weeks et al., 2017; Feeley et al., 2018; Keller et al., 2020; Gallagher et al., 2021), habitat selection predictions should help managers meet conservation endpoints and play a role in evaluating management alternative strategies for both species and for the habitats on which they rely on.
In this study, we provide a framework to implement RSFs using machine learning algorithms to examine and accurately predict relative habitat selection for tracking data collected using acoustic telemetry. Specifically, we apply RSFs to evaluate the relative habitat and resource selection of four shark species: Caribbean reef (Carcharhinus perezi), lemon (Negaprion brevirostris), nurse (Ginglymostoma cirratum), and tiger (Galeocerdo cuvier) sharks within a Caribbean MPA. In the Caribbean Sea, these four species occupy a wide range of environments from nearshore reef and seagrass habitats to offshore pelagic habitats (Pikitch et al., 2005; Legare et al., 2015; Pickard et al., 2016; Casselberry et al., 2020; Gallagher et al., 2021). Considering Caribbean reef sharks are listed as “Endangered” by the IUCN Red List (Carlson et al., 2021a), lemon and nurse sharks are listed as “Vulnerable” (Carlson et al., 2021b, c), and tiger sharks are listed as “Near Threatened” (Ferreira and Simpfendorfer, 2019), all with decreasing population trends, conservation and management efforts would benefit from understanding and incorporating findings surrounding their spatial ecology.
Successful management is especially needed since it has been suggested losses in shark abundance may disrupt food web dynamics that would lead to reduced ecosystem health (Baum and Worm, 2009; Ferretti et al., 2010; Heupel et al., 2014; Hammerschlag et al., 2019). Indeed, food web simulations for Caribbean coral reefs show sharks, as top predators, are members of strongly interacting tri-trophic food chains whose loss could result in trophic cascades (Bascompte et al., 2005). This is supported by in situ studies of mesopredator fish populations in Australia that found shark-depleted coral reefs have reduced fish diversity, species abundance, and biomass, with individual species showing changes in diet and body condition when compared to reefs with healthy shark populations (Barley et al., 2017a, b). Information about shark habitat use and selection could lead to proactive management strategies to mitigate non-sustainable or illegal harvest (White et al., 2017; Jacoby et al., 2020) and/or protect and restore important habitat (Speed et al., 2016; Daly et al., 2018). Because spatial management techniques, such as MPAs, can provide protection for multiple species across a variety of life stages, understanding resource selection across species should help to tailor effective conservation strategies and, specifically, ensure adequate coverage of ecologically vital habitats and areas (Lea et al., 2016).
Considering MPA design may benefit from the inclusion and understanding of predator-prey dynamics (Micheli et al., 2004; Cashion et al., 2020), we also demonstrate how RSFs can be extended to examine spatially explicit relationships between marine predators and their prey. This was accomplished by deriving and averaging overlapping selection values from tiger sharks and from their potential prey, including juvenile green turtles, juvenile Caribbean reef sharks, juvenile lemon sharks, great barracuda (Sphyraena barracuda), horse-eye jack (Caranx latus), yellowtail snapper (Ocyurus chrysurus), and mutton snapper (Lutjanus analis) (Lowe et al., 1996; Simpfendorfer et al., 2001; O’Shea et al., 2015; Aines et al., 2018; Gallagher et al., 2021). Herein, we provide a framework for studies wishing to investigate animal relative habitat selection and predator-prey relationships with acoustic telemetry in marine environments.
Ultimately, these collective RSF findings provide insights into shark spatial ecology and is useful for the conservation of Caribbean reef, lemon, nurse, and tiger sharks and their habitats. In addition, we have included an R code vignette (Appendix A), to improve accessibility and application of RSFs and machine learning.
Materials and Methods
Study Area and Field Data Collection
Buck Island Reef National Monument (BIRNM), a 77 km2 no-take MPA, is located on the northeast shelf of St. Croix, United States Virgin Islands (Pittman et al., 2008). Buck Island is an uninhabited, 0.7 km2 island that is situated in the middle of the MPA, and 2.5 km northeast of St. Croix. This MPA ranges from shallow-water habitats (<10 m) near the island to deep-water habitats (>1,000 m) off the continental shelf. Generally, benthic habitats range from lagoon habitat (50–150 m wide, around the island excluding the west and southwest sides of the island), linear reef (south side of island and wrapping toward northwest corner), patch reef systems (northwest and north of the island, and south of the southern linear reef), seagrass patches (Thalassia sp., Syringodium sp., and Halophila sp.) and sand flats (south and southwest) (Pittman et al., 2008; Costa et al., 2012).
Between 2011 and 2019, a total of 147 VEMCO VR2W receivers (Innovasea Systems Inc., Nova Scotia, Canada) were deployed as a passive acoustic receiver array within BIRNM to study multiple species (Becker et al., 2016, 2020; Bryan et al., 2019; Selby et al., 2019; Casselberry et al., 2020; Griffin et al., 2020; Novak et al., 2020a, b) (Supplementary). Receivers were deployed, in depths ranging from 2 to 40 m, either on sand screws or cement block anchors around the island with receiver downloads occurring twice a year (see Becker et al., 2016; Selby et al., 2016; Casselberry et al., 2020 for mooring details). Among years, the receiver array design changed in extent through the addition of new receiver stations or decommissioning old stations, due to the availability of receivers and evolving project goals, while maintaining a core set of receiver stations through the duration of the project. The array began with 17 receivers in 2011 and reached its greatest coverage with 147 receivers in 2017. For this study, we collected and analyzed acoustic telemetry data from only 2013–2019 when detection coverage and tag deployment was most substantial across BIRNM. Animal tracking data were collected from surgically implanted V13 or V16 transmitters (delay 60–180 s, battery life 360–3,217 days, Innovasea Systems Inc., Nova Scotia, Canada) in 14 Caribbean reef sharks (between 2013 and 2019), 10 lemon sharks (between 2013 and 2019), 11 nurse sharks (between 2013 and 2019), and eight tiger sharks (between 2015 and 2019). In addition, to examine prey habitat selection in relation to tiger shark selection, data were also collected from 58 juvenile green turtles (between 2013 and 2014), 25 great barracuda (hereinafter referred to as barracuda) (between 2014 and 2015), five horse-eye jack (between 2016 and 2017), eight yellowtail snapper (between 2015 and 2017), and four mutton snapper (between 2015 and 2016). All detection data were reviewed and filtered to remove false detections (Simpfendorfer et al., 2015), including detections that occurred within 60 s of each other for a given individual, singular detections occurring within 12 h, and detections that indicated unrealistic movements (>3 m per second). Tagging locations and methods, including additional specific detection filtering processes, can be found for sharks in Casselberry et al. (2020), barracuda in Becker et al. (2016), horse-eye jack in Novak et al. (2020a), yellowtail snapper in Novak et al. (2020b), and for green turtles in Griffin et al. (2020).
Framework to Apply RSFs Using Acoustic Telemetry and Machine Learning
To derive and predict relative selection values of each species, we describe four important components that include defining available resource units, aggregating habitat information, implementing, evaluating, and interpreting RSFs with machine learning algorithms, and, ultimately, predicting habitat selection for sharks across BIRNM. All analyses were conducted in R version 3.6.2 (R Core Team, 2019). We describe each step in detail below and in the included R code vignette (Appendix A).
Defining Available Resource Units and Presence/Background Points
To estimate fine resolution space use away from the exact location of receivers, detection data was first converted into short-term centers of activity (COAs) using the mean position algorithm (Simpfendorfer et al., 2002). To disaggregate detection data from receiver locations, this method, using the detections across multiple receivers, provides position estimates that are based on the weighted means of the number of detections among each receiver during a specified time window (Simpfendorfer et al., 2002). Here, using the VTRACK package (Campbell et al., 2012), this algorithm was implemented with 90-min time bins to provide animal positioning data across BIRNM (Selby et al., 2019; Griffin et al., 2020). In addition to disaggregating data from receiver locations, constructed COAs provide an approach to potentially reduce issues with autocorrelation by subsampling data into defined time steps (e.g., 90-min bins) (Matley et al., 2017). Autocorrelation, an inherent problem with tracking data, occurs when sequential locations are obtained from the same individual and can lead to biased parameter estimates of animal habitat/space use (Legendre, 1993; Johnson et al., 2013; Fleming et al., 2015).
Consistent with Selby et al. (2019) and Griffin et al. (2020), we defined our available resource units by deriving 400 m buffers around each receiver for each year it was deployed (i.e., 2013–2019) (Supplementary Figure 1). While detection range was variable across BIRNM habitats with an average of 58.2% (95% confidence interval: 44–73% CI) probability of detection at 100 m distance from a receiver (Selby et al., 2016), we decided to extend the buffer size to 400 m since COAs are able to provide approximate positioning estimates even outside of receiver detection range.
To implement RSFs within a use/availability framework and to account for variable receiver coverage across years, we restricted both the COAs (presences) and the randomly distributed background points (pseudo-absences) to our defined available resource units (400 m receiver buffers at the year level) only. Background points were randomly distributed equal to the number of observed COAs (see Barbet-Massin et al., 2012) per individual, diel period (night vs. day), and year across all available resource units (Figure 1). Diel period was calculated using the maptools package (Bivand and Lewin-Koh, 2013). Only using COAs and background points that were within the 400 m buffer from any receiver, they were then collapsed into 200 m × 200 m raster cells.
Figure 1. Conceptual diagram to implement resource selection functions with acoustic telemetry and machine learning algorithms; from generated centers of activities and background points, deriving available resource units, model development, model performance and interpretation, and, ultimately, predicting and extrapolating relative selection.
Aggregating Habitat Information
Using habitat mapping data provided by the National Oceanic and Atmospheric Administration (NOAA) (Costa et al., 2012), we converted available and relevant shapefile data into 200 m × 200 m raster cells using the raster (Hijmans et al., 2015) and the sp (Pebesma et al., 2012) packages. Derived habitat raster files included classifications aggregated by zone (fore reef, reef flat, lagoon, etc.) (Supplementary Figure 2A), fine-scale structure (aggregate reef, sand, pavement with sand channels, etc.) (Supplementary Figure 2B), fine-scale cover [seagrass patchy (10%–<50%), seagrass patchy (50–<90%), seagrass continuous (90–100%), etc.], broad-scale cover (algae, live coral, seagrass, etc.), and percent coral cover (i.e., 0–<10%, 10–<50%). In addition, we generated two relevant habitat raster files including distance to land (m) (Buck Island) and depth (m) (Supplementary Figure 3).
Subsequently, corresponding habitat and depth information were extracted from each raster cell and assigned to each COA and background point using the rasterize function from the raster package (Hijmans et al., 2015) (see Appendix A). Habitat information was converted into factors with depth (m) and distance to land (m) remaining as continuous variables.
Applying RSFs With Machine Learning
Resource selection functions were applied using RF models, a commonly utilized machine learning algorithm, to evaluate the relative habitat selection of each species within BIRNM. RF, using binary recursive partitioning to fit multiple data trees with randomly selected predictor subsets (Breiman, 2001), effectively reduces variance and model overfitting while optimizing predictive accuracy (James et al., 2013; Hengl et al., 2018; Schratz et al., 2018). To increase the prediction of the response variable (presences/background points), RF models were fit for each species with 500 trees, replacement, and with 60% of the data. The 40% remainder of each dataset, known as the holdout dataset, were then used to test model performance. RF models were implemented in the ranger (Wright and Ziegler, 2015) and mlr (Bischl et al., 2016) packages.
Characteristic of RF models, a user may tune how trees are generated and fitted from the data. These controls, referred to as hyperparameters, are set prior to fitting an RF by running multiple iterations of values (see Probst et al., 2019). While default values for hyperparameters lead to relatively high performance alone, tuning can often lead to overall model improvement (Lovelace et al., 2019; Probst et al., 2019). Ultimately, hyperparameter settings control the degree of randomness across trees and may include the number of predictors that should be used in each tree (mtry), the fraction of observations to be used in each tree (sample.fraction) with lower fractions leading to lower correlation across trees, and the number of observations a terminal node (within a tree) should at least have (min.node.size) (Lovelace et al., 2019). To find the optimal hyperparameter values, we first partitioned the training dataset into five distinct geographic sections and then for each partition, we generated 50 random combinations of hyperparameters and subsequently chose the optimal combination (see Lovelace et al., 2019) using the tuneParams function from the mlr package (Bischl et al., 2016). Thus, 50 iterations of random hyperparameter values across each of the five partitions resulted in 250 models in total. Subsequently, the optimal hyperparameter combinations were used to tune and train the final RF for each species, using the 60% training dataset.
Performance, Interpretation, and Prediction
With the trained model, we predicted across the 40% holdout dataset to evaluate performance. The functions calculateConfusionMatrix and calculateROCMeasures from the mlr package (Bischl et al., 2016) were used to determine overall and class (present versus background point) accuracy, error rate, and performance of each RF. Specifically, performance measures were derived from the confusion matrix table that compared the true observations versus the model predictions, these metrics included overall accuracy, i.e., correct number of predictions/total number of predictions, sensitivity (true positive rate), specificity (true negative rate), fall-out (false positive rate), miss rate (false negative rate), and precision (positive predictive value). Predictor, also known as feature, importance was also assessed using the permutation importance method (mean decrease in accuracy) where predictors were evaluated based on the increase or decrease in prediction error after permutation (Breiman, 2001). For interpretation across RF models, all importance values were normalized (min-max normalization). To identify which variables generated the greatest two-way interaction strengths, we derived the H-statistic (Friedman and Popescu, 2008) using the Interaction function from the iml package (Molnar et al., 2018). This calculation, which can be extremely computationally intensive when examining every possible interaction, was implemented after rerunning each RF model but with only 25% of the training dataset. After identifying the top three variables that led to the greatest interaction strengths, this function was used again to assess those other variables with which each top variable interacted. However, this was performed using the original model, with the entire training dataset, since computation times were greatly reduced when examining single two-way interactions as opposed to each possible interaction.
To assess the marginal effect of covariates on the predicted outcome (ŷ), i.e., predictor probabilities for each RF, we constructed partial dependency plots, using the pdp package (Greenwell, 2017), for the most important feature as identified by the mean decrease in accuracy approach. Partial dependency plots were also generated for the top three two-way interactions as identified from the calculated H-statistic values. Discrete and continuous predictors were shown with 95% confidence intervals. To visualize marginal effect variation within each continuous predictor, we used a generalized additive model smoother via the ggplot2 package (Wickham, 2011). All partial dependency plots were restricted to depths of 50 m or less so to avoid extrapolating outside the shelf of BIRNM where no receivers were located. Finally, the trained RFs were then used to predict relative habitat selection at the species level in BIRNM. Model extrapolation across the MPA was constrained to the maximum depth observed for the given species based on acoustic detections.
RSFs and Predator-Prey Relationships
Resource selection functions were also extended to explore selection overlap values between large juvenile and mature tiger sharks (n = 8, >200 cm FL) and their potential prey species, including juvenile green turtles, juvenile Caribbean reef sharks (n = 12, <120 cm FL), juvenile lemon sharks (8, <120 cm FL), barracuda, horse-eye jacks, yellowtail snapper, and mutton snapper. First, relative habitat selection values were calculated and extrapolated across BIRNM for each potential prey species following the steps outlined in Sections “Defining Available Resource Units and Presence/Background Points,” “Aggregating Habitat Information,” “Applying RSFs With Machine Learning,” and “Performance, Interpretation, and Prediction.” Second, to explore areas of potential predator-prey overlap, relative habitat selection values across BIRNM were averaged between tiger sharks and each potential prey species. Lastly, by removing raster cells where relative selection values of potential prey were <0.5, we examined specific high overlap areas between tiger sharks and each potential prey species.
Kernel Density Estimates
To compare the predicted relative habitat selection values to observed animal space use, we fit kernel utilization distributions to the COAs at the species level. Each species’ kernel utilization distribution, representing a bivariate probability density function of animal use (Worton, 1989; Lichti and Swihart, 2011), was then used to extract the 50 and 95% kernel density estimates to produce space use estimates. Kernel utilization distributions and subsequent kernel density estimates were constructed using the adehabitatHR package (Calenge, 2006) with 200 m smoothing parameters. While species level kernel density estimates were plotted along with all predicted relative habitat selection values, it should be noted these estimates were used for broad comparison since they are likely biased to some extent due to unequal sample sizes across individuals.
Additional Methodological Considerations
To explore model sensitivity to varying parameter inputs, we also implemented RF models using COA data binned at 60-min timesteps. These models and their outputs were derived from using the same procedures as outlined in Sections “Defining Available Resource Units and Presence/Background Points,” “Aggregating Habitat Information,” “Applying RSFs With Machine Learning,” and “Performance, Interpretation, and Prediction.” To examine how models performed under different available habitat extents, we again ran RF models but with available habitat defined using either 200 or 600 m buffers. Hyperparameter inputs were kept consistent with original respective models. Further, to avoid extrapolating predictive models beyond the range of our measured data (Mesgaran et al., 2014), we explored and mapped extrapolation reliability in BIRNM using the dsmextra package (Bouchet et al., 2020). Using the presence/background locations and their depth (m) and distance to land (m) values, the compute_extrapolation function evaluated areas across BIRNM that fell within the sampled covariate space (Mesgaran et al., 2014; Bouchet et al., 2019, 2020). This multivariate statistical tool highlighted areas of univariate extrapolation (when predictions are considered outside the range of covariates), combinational extrapolation (predications made within range of covariates but in novel combinations), and areas of geographical interpolation [predications made within our range of covariates and in analogous conditional space (see Mesgaran et al., 2014)]. Subsequently, using the map_extrapolation function (Bouchet et al., 2020), we visually assessed extrapolation reliability within BIRNM (see Appendix A).
Lastly, we examined individual shark space use variation by constructing Brownian bridge movement models (BBMMs) (Horne et al., 2007) and by implementing RSFs using generalized linear mixed models (GLMMs) with individual as the random effect. BBMMs incorporate movement paths into the modeling process (Horne et al., 2007) and has been recommended when evaluating individual space use with COAs since it can better account for temporal autocorrelation (Udyawer et al., 2018). Here, using all available detection data, we constructed individual BBMMs and plotted individual space use within BIRNM.
While BBMMs highlight variation in space use across individuals, RF models are currently unable to easily incorporate random effects to account for individual level effects, thus, model outputs are potentially biased to some extent. Alternatively, RSFs used in-combination with mixed effect models, can include a random effect for individual ID to explicitly account for individual variability and, in turn, provide measures of inference for the entire population (Gillies et al., 2006; Aarts et al., 2008; Hebblewhite and Merrill, 2008). Here, we implemented RSF GLMMs with individual ID as the random effect for each shark species using the top three most important variables as fixed effects that were identified by RF models. All variables were examined for correlation issues using variance inflation scores and continuous variables were standardized. As a simplified approach and for the purpose of examining the relative contribution of individual ID on each model, no temporal or spatial autocorrelation dependency structures, interaction terms, or non-linear relationships were included. All models were implemented and assessed using 60% of the dataset via the glmmTMB (Magnusson et al., 2017) and performance (Lüdecke et al., 2019) packages. Two goodness-of-fit metrics, marginal R2 and conditional R2, were calculated for each model. While marginal R2 evaluates the variance explained by fixed effects, conditional R2 evaluates the variance explained by both fixed and random effects, allowing us to assess the relative contribution of the random effect on each model (Nakagawa and Schielzeth, 2013). In addition to both goodness-of-fit metrics, we also used the 40% holdout datasets to test GLMM performance, as was done with the RF models above.
Using the converted COA tracking data (Table 1), RF model accuracy and model performance varied across species with overall accuracy ranging from 80 to 95% and sensitivity (true positive rate) from 85 to 98% (Table 2). Predictor importance and rank varied across shark species, with depth (m) as either the most important or within the top two most important predictors for all four species (Figure 2). Overall, ŷ values generally decreased as depth increased for all sharks (Figure 3). While ŷ values decreased for Caribbean reef, nurse, and tiger sharks in areas >3 km from Buck Island, tiger sharks appeared to have higher ŷ values farther away from the island at distances approximately between 500 and 2,000 m. Lemon shark ŷ values decreased rapidly as distance from land increased, with lowest values occurring >1,000 m.
Table 1. Tagging and detection data for shark species and the potential prey species for tiger sharks, including a subset of juvenile Caribbean reef and lemon sharks, monitored within Buck Island National Monument.
Table 2. Confusion matrix performance metrics derived from using the trained random forest model to predict across the 40% holdout dataset.
Figure 2. Feature importance, indicating how important each predictor variable is within each shark random forest model for (A) Caribbean reef sharks, (B) lemon sharks, (C) nurse sharks, and (D) tiger sharks. Higher values and darker colors indicate greater relative importance. Values were calculated using the mean decrease in accuracy method and, subsequently, all importance values were normalized from 0 to 1 for comparison across species.
Figure 3. Random forest partial dependency (marginal effects, ŷ) plots for the top three most important (ordered from top to bottom) predictors for (A) Caribbean reef sharks, (B) lemon sharks, (C) nurse sharks, and (D) tiger sharks. The marginal effects indicate the relative selection strength for the given variable with greater ŷ values indicating high relative selection probabilities and lesser values indicating low relative selection probabilities. Discrete predictors are displayed with bar plots and continuous predictors are displayed using a generalized additive model smoother with 95% confidence intervals. Partial dependency plots involving depth were restricted to 50 m depth for each species.
Caribbean reef sharks were more likely to select for coral or coral-containing habitats with higher ŷ values observed within coral habitats (sand with scattered coral and rock, aggregate reef, and aggregated patch reefs) (Figure 3A). Caribbean reef shark two-way predictor interactions highlighted relatively high ŷ values in depths of 20–30 m, areas <2 km away from land, and in areas of sand with scattered coral and rock (Figure 4A). Lemon sharks were more likely to select for shallow areas directly adjacent to land, specifically in shallow (0–5 m) habitats classified as channel, lagoon, and reef crest (Figures 3B, 4B). While nurse sharks followed a similar pattern, ŷ values indicated they were more likely to select for habitats between 0 and 2,000 m away from land but within <15 and 25–30 m of depth. In addition, ŷ values were higher in areas of sand with scattered coral and rock located within bank/shelf, bank/shelf escarpment, fore reef, and reef crest zones (Figures 3C, 4C). Lastly, tiger sharks exhibited the greatest ŷ values away from land (500–2,000 m), in <30 m depth, and in aggregate reef, sand, sand with scattered coral and rock, and pavement habitats (Figures 3D, 4D).
Figure 4. Top three two-way interactions (ordered from top to bottom) displayed and extracted from random forest models for (A) Caribbean reef sharks, (B) lemon sharks, (C) nurse sharks, and (D) tiger sharks. Mean marginal effects (ŷ) are shown in each two-way interaction partial dependency plot with colors indicating a continuum from high (blue) to low (white) probabilities of relative selection. Partial dependency plots involving depth were restricted to 50 m depth for each species.
Extrapolated relative habitat selection values across BIRNM, as computed from the trained RF models, followed similar patterns to kernel density estimates (Figure 5). Specifically, 50% kernel density estimates largely overlapped with extrapolated areas of high relative selection. However, for Caribbean reef, nurse, and tiger sharks, levels of high extrapolated relative selection also extended beyond 50% kernel density estimates along the western shelf and the eastern side of BIRNM, where receiver coverage was limited. While Caribbean reef, lemon, and tiger sharks exhibited more targeted habitat selection with greater affinities to specific areas and habitats, nurse sharks exhibited a more generalist approach to relative habitat selection across BIRNM (Figure 5). Caribbean reef sharks showed strong affinity to habitats with reef-containing structure, including areas of linear reef around the island and in the aggregated patch reef system that is characteristic north of the island (Figure 5A). In addition, Caribbean reef sharks exhibited higher relative selection values along the western shelf near adjacent deep water habitats (>50 m). Alternatively, lemon shark relative habitat selection values were tightly located around the island, within the reef sheltered lagoon, with lower values along the southwest side of the island where less lagoon and structure habitat exist (Figure 5B). Nurse shark extrapolated relative selection values were wide ranging with the densest cluster of higher values surrounding the island (reef habitats), to the southwest of the island along the bank (sand and seagrass habitats), and to the far eastern side of BIRNM (reef, pavement, and sand habitats) (Figure 5D). Lastly, similar to nurse sharks, tiger sharks primarily have highest selection values extrapolated south of the island along banks containing both seagrass and sand habitats, leading to the continental shelf break in the west. Relative selection values were also expected to be high along the western shelf and in some locations around the north/northwest shelf. While low relative selection values were expected for tiger sharks in the network of highly rugose patch reefs north of the island, higher selections values existed on eastern side of BIRNM habitats containing mainly reef, pavement, and sand (Figure 5D).
Figure 5. Derived from random forest models, the predicted and extrapolated probability of relative selection across Buck Island National Reef Monument (designated by red dashed line) for (A) Caribbean reef sharks, (B) lemon sharks, (C) nurse sharks, and (D) tiger sharks. Higher and darker values indicate higher relative selection values. Model extrapolation across the study areas restricted to the maximum observed depth for each species. 50% (green) and 95% (blue) kernel density estimates shown in the upper right corner of each plot.
Resource selection functions were also extended to examine potential areas of relative selection overlap between tiger sharks and their potential prey sources. Depending on the species, overlap selections varied in location and intensity. For example, juvenile green turtles (Figure 6A) and tiger sharks were most likely to overlap in selection south of the island where seagrass beds along bank habitats were most abundant. For juvenile Caribbean reef sharks and tiger sharks, averaged overlap selection values were greatest along the western shelf and in the southwest portion of BIRNM (Figure 6B). Juvenile Caribbean reef sharks also had high relative selection values north of the island, averaged overlap was comparatively lower in this area to the western shelf due to reduced tiger shark selection values (Figure 5D). The averaged overlap selection values between barracuda and tiger sharks followed a similar pattern with higher barracuda relative selection values north of the island but averages reduced due to lower tiger shark relative selection values (Figure 6C). Horse-eye jacks, with the most similar relative selection values of tiger sharks, had the greatest averaged overlap values along the western shelf, south of the island, and in the southeastern portion of BIRNM (Figure 6D). When tiger shark relative selection values were averaged across the other three species, including juvenile lemon sharks, mutton snapper, and yellowtail snapper, they followed similar patterns with higher averaged relative selection overlap values where the potential prey species had higher selection values unless it was directly north of the island where patch reef systems exist (Supplementary Figure 4).
Figure 6. (Left) Derived from random forest models, the predicted and extrapolated probability of relative selection across Buck Island National Reef Monument (designated by red dashed line) for (A) juvenile green turtles (B) juvenile Caribbean reef sharks, (C) great barracuda, and (D) horse-eye jacks. 50% (green) and 95% (blue) kernel density estimates shown in the upper right corner of each plot. (Middle) Relative selection values averaged between tiger sharks and each corresponding species on the left. (Right) Relative selection values averaged between tiger sharks and each corresponding species on the left after the removal of cells where potential prey relative selection values were <.5. In all plots, higher and darker values indicate higher relative selection values or their averages.
Additional Methodological Considerations
The RF models using COA data of 60-min bins produced similar results to models that used COA data of 90-min bins (Appendix B). The top two most important variables remained unchanged for all shark species (Appendix Figure B1) and pdps and associated ŷ values only changed slightly (Appendix Figure B2). Most notably, the new 60-min binned RF models indicated ŷ values generally increased (rather than decreased) with depth for Caribbean reef sharks and ŷ values for aggregate reef were lower for tiger sharks (Appendix Figures B2, B3). However, for Caribbean reef sharks, ŷ values related to depth remained similar across the interaction of depth and distance to land.
While accuracy metrics were similar for all species across the 60- and 90-min RF models (∼1–2% differences), some varied substantially (e.g., 5–7%) (Table 2 and Appendix Table B1). The use of 60-min time bins led to a decrease in overall accuracy for nurse sharks (85–80%) but an increase in overall accuracy for barracuda (87–92%), horse-eye jack (80–90%), and mutton snapper (88–96%). Subsequently, model predictions and extrapolations within BIRNM reflected these discrepancies with lower accuracy scores producing more homogenous and generalized relative habitat selection patterns than when models produced higher predictive accuracies (Figures 5, 6 and Appendix Figures B4, B5).
RF models using 200 m buffers for available habitat construction scored lower accuracy measurements and also predicted higher relative selection homogeneously across BIRNM (Supplementary Figure 5 and Supplementary Table 1). Alternatively, models using 600 m buffers for available habitat construction produced similar accuracy measures and predictions across BIRNM as compared to the original models (Supplementary Figure 5 and Supplementary Table 1). Interestingly, relative selection predictions for tiger sharks were higher along the northeastern shelf edge (Supplementary Figure 5) than in the original model (Figure 5), matching Casselberry et al. (2020) findings.
When assessing extrapolation reliability across BIRNM, extrapolation space became unreliable (univariate extrapolation) in areas off the shelf in deeper and further areas from land (Appendix A). However, areas within the MPA that remained in shallower water (<50 m) were analogous to the range of our covariates as measured by depth (m) and distance to land (m), confirming our approach to limiting extrapolations to the maximum observed depth was warranted.
While BBMMs highlighted individual level variation in space use across BIRNM (see examples, Supplementary Figure 6), GLMMs and respective marginal and conditional R2 values indicated variance was largely explained by the fixed effects (marginal R2) alone (Supplementary Table 2). However, the GLMM involving lemon sharks appeared to have substantial variance explained by both the fixed and random effects combined (conditional R2), suggesting individual variation may be higher within this species dataset. Interestingly, GLMM accuracy for lemon sharks was also nearly as accurate as the RF model (92% versus 94%, respectively). Accuracy metrics for the other GLMMs were substantially lower than respective RF models (Table 2 and Supplementary Table 2).
The Approach and Ecological Implications
Using acoustic telemetry data for four shark species, we demonstrate the utility of RSFs with machine learning to accurately predict and understand complex environmental drivers of marine species. Across species, we found variable patterns of relative habitat selection within the MPA, ranging from habitat specialists to generalists. Overall, as depth increased, relative selection decreased for all shark species. While relative selection probabilities for Caribbean reef, lemon, and nurse sharks decreased as distances from land increased, tiger sharks showed highest affinities for areas between 500 and 2,000 m away from land. Top interactions along with predicted relative selection values highlighted the differences and preferences across species in terms of habitat types, structures, and depths. Finally, using the relative selection values of tiger sharks and their potential prey, we highlight the ability of this framework to generate multiple species selection values that could provide insights into predator-prey relationships when averaged and overlaid with one another.
The results for shark habitat selection presented herein are largely consistent with those presented by Casselberry et al. (2020) that used GLMMs and detection data from fixed receiver locations to model presence in BIRNM habitats, but with improved model accuracy (83–94%) and additional covariates. While GLMMs in Casselberry et al. (2020) were limited by unequal receiver distribution, requiring aggregation across habitat types, the analyses presented here (use/availability framework with COAs and background points) were able to sample across multiple habitats and at finer scales. Further, while GLMMs were limited to a single generalized habitat covariate (factor levels including: unconsolidated sediments, submerged vegetation, and coral, rock, and colonized hardbottom) and depth, RSFs paired with machine learning algorithms were able to easily assess five separate habitat covariates that ranged from two and 10 factor levels each along with two additional continuous predictors. Ultimately, RSFs confirmed use of shallow water habitats near land for lemon sharks and the use of sand and coral associated habitats at mid-depths for nurse sharks, while highlighting tiger sharks’ affinity for the continental shelf break and southern sand and seagrass beds. The added analytical flexibility of RSFs and machine learning greatly improved predictions of habitat use for Caribbean reef sharks, whose space use changes dramatically with age across BIRNM’s varied landscape (Casselberry et al., 2020). Previous models showed low probability of presence in the acoustic array across habitats and depths compared to the other three shark species, while RSFs highlight specialized use of multiple highly rugose reef habitats at mid-depths. However, GLMMs produced in Casselberry et al. (2020) and the RSF models produced here differed when predicting tiger shark depth preference. Casselberry et al. (2020) showed probability of presence in the acoustic array increasing with depth across habitat types (coral, rock, and colonized hardbottom, submerged vegetation, and unconsolidated sediments), while ŷ values consistently decreased with depths beyond 30 m in RSF models.
Examining the tiger shark partial dependency plots reveals high interactions between depth and distance to land at depths between 10–15 and 25–30 m (Figure 4). These same depth bins (10–15 and 25–30 m) also had higher ŷ values when combined with aggregate reef, pavement, sand, and sand with scattered coral and rock habitats (Figure 4). These habitat types and distance to land achieved higher ŷ values alone than depth in tiger shark models indicating that these variables have a stronger influence on tiger shark habitat use (Figure 3). However, tiger sharks are known to use depths greater than 50 m in and around BIRNM that are beyond the depths of acoustic array coverage (Casselberry unpublished data). This, again, highlighted the need to assess extrapolation reliability (Mesgaran et al., 2014) of RSFs prior to model interpretation since they may have limited ability to extrapolate outside of observed conditions of a given array, for example in areas of BIRNM where depths were greater than 50 m.
When predictions were made within our range of covariates in analogous conditional space (less than 50 m depth), the application of RSFs, as opposed to more traditional use of COAs alone, kernel density estimates, or network analyses, highlighted potentially favorable habitats in BIRNM with limited receiver coverage. The eastern portion of BIRNM has had limited acoustic receiver coverage in part because of the complexity of the coral reef structure in the area. Receiver moorings were not established there in order to avoid damaging the protected reef structure. The RSFs show that favorable habitats exist in this low coverage region for nurse, Caribbean reef, and tiger sharks, particularly at intersections between reef, pavement, and sand habitats. This further highlights the suitability of this MPA for shark conservation and management in St. Croix (Figure 5; Casselberry et al., 2020).
Examining overlapping RSFs between tiger sharks and their potential prey highlights regions of potential foraging success for sharks, high predation vulnerability for prey, and areas of ecological importance for managers. Areas of high tiger shark-prey overlap coincide mainly with the seagrass beds south of Buck Island and the western continental shelf break, while many potential prey species also have high selection potential in areas north of Buck Island. This could be a reflection of tiger sharks selectively using areas with higher potential for foraging success (Heithaus et al., 2002). Areas north of the Buck Island are occupied by highly complex coral reef habitats, offering ample areas to refuge or escape from predators (Hixon and Beets, 1993), while habitats south and west of Buck Island are more open at depths of ∼12 m. These waters could be more maneuverable for large juvenile and adult tiger sharks when compared to more structurally complex environments (Fu et al., 2016), perhaps with an increased possibility of foraging success (Heithaus and Dill, 2002; Heithaus et al., 2007; Wirsing et al., 2007). Alternatively, these areas could be a reflection of similar habitat preferences and ecologies among apex and mesopredators in a tropical reef system (Ledee et al., 2016; Heupel et al., 2019). Regardless, areas of high averaged relative selection highlight important regions in BRINM that could be used to inform future habitat monitoring or restoration studies, particularly with the potential for habitat degradation as the climate changes (Graham et al., 2020; Hastings et al., 2020).
As technological tools continue to advance our ability to monitor aquatic animal space use, ecologists are beginning to answer some of the most pressing questions to help direct and prioritize resource management and conservation strategies. Habitat destruction remains on the forefront of decreases in biodiversity, from climate change (Pratchett et al., 2011; Descombes et al., 2015) to destructive landscape use (Rothschild et al., 1994; Coverdale et al., 2013). With calls to protect at least 30% of the ocean by 2030 through establishing MPAs (O’Leary et al., 2016; Sala et al., 2018), an accurate understanding of how marine animals use space and select habitats is increasingly imperative for well informed and effective marine spatial planning (Foley et al., 2010; Ogburn et al., 2017; Lowerre-Barbieri et al., 2019; Gallagher et al., 2021; Roberts et al., 2021). The RSF modeling framework provided here can produce high accuracy models of relative habitat selection for multiple species of differing ecologies and can be averaged across species to highlight overlapping potential space use or selection. These models can then be used to extrapolate to areas lacking acoustic receiver coverage, as long as within the original measured parameters, accounting for a common issue in acoustic telemetry with incomplete coverage of the study site due to logistic or budgetary limitations. Assuming a sufficient number of individuals are tagged for a given species and age class, the outputs of these models can produce easily interpretable maps for highlighting regions of importance and communicating results to stakeholders, which could result in greater acceptance of study findings given committed stakeholder engagement (Nguyen et al., 2019).
Benefits, Challenges, and Considerations
As technological advancements (e.g., from remote sensing to acoustic telemetry data) allow for high-resolution datasets, machine learning approaches have become increasingly adopted by ecologists because of their ability to handle large datasets and complex non-linear hierarchical relationships and statistical assumptions that are typically violated by conventional parametric approaches, e.g., multiple correlated predictors (Olden et al., 2008; Peters et al., 2014; Durden et al., 2017; Brownscombe et al., 2020). While RSFs have typically been applied within a classical statistical framework (e.g., logistic and linear models) (Johnson et al., 2006; Manly et al., 2007), machine learning does not require non-linear predictor relationships and their interactions to be specified prior to implementing. Thus, allowing for a flexible, realistic, and accessible application when applying RSFs to animal space use in relation to multiple and complex environmental gradients across a landscape (Shoemaker et al., 2018). Further, implementing machine learning with ecological data can also provide highly accurate predictive models (Cutler et al., 2007; Elith et al., 2008; Olden et al., 2008). For instance, Shoemaker et al. (2018) applying RSFs with mule deer (Odocoileus hemionus) telemetry data demonstrated machine learning algorithms outperformed the traditional approach of logistic regression with higher prediction accuracy. In another example, although not directly comparable, when implementing a RF using the juvenile green turtle data in this study, we found a higher accuracy compared to that as reported by Griffin et al. (2020) (0.95 versus 0.77, respectively), who also applied RSFs on juvenile green turtle acoustic telemetry data from BIRNM but were fitted with GLMMs and fewer predictor variables.
While machine learning algorithms offer some advantages as an accurate non-parametric technique, the difficulty to account for spatial-temporal autocorrelation and individual level effects presents additional challenges. Whereas RF models are unable to easily incorporate, RSF GLMMs can explicitly include individual ID as a random effect (Gillies et al., 2006). Further, generalized models can incorporate autocorrelation dependency structures (Zuur et al., 2017; Winton et al., 2018a; Griffin et al., 2019; Gutowsky et al., 2020), however, it is worth noting that defining the correct correlation structure still remains challenging within a use/availability (presences/pseudo-absences) sampling design (see Koper and Manseau, 2009; Fieberg et al., 2010). In this study, while BBMMs highlighted individual variation in space use, simplified GLMMs indicated including individual as a random effect contributed relatively less to explaining overall variance than the fixed effects alone. However, this was not the case for lemon sharks, suggesting larger potential differences in relative habitat selection across individuals. Confirmed by individual BBMMs and network analyses from Casselberry et al. (2020), some lemon sharks were consistently close to the island while others used areas farther away and at greater depths. While approaches are being developed to incorporate mixed effects into machine learning algorithms (Hajjem et al., 2014), it is still relatively inaccessible due to its complexity. Future studies using RSFs and machine learning algorithms should attempt to measure or address random variation across individuals and sample size biases either within the approach and/or with complimentary analyses. For example, using test datasets that contain individuals not used in the training dataset may better help to assess model performance and transferability (Buston and Elith, 2011; Raymond et al., 2015). Further, running models for each individual and, subsequently, collectively deriving the 95% confidence interval estimates across the computed marginal effects for all individuals may be a viable approach to assess population level effects. Alternatively, using both mixed effects models and machine learning approaches in tandem may be the most appropriate (see Shoemaker et al., 2018). With consideration to this caveat, machine learning algorithms provide useful and flexible advantages to deal with complex ecological datasets and to obtain accurate results.
Beyond the application of RSFs with machine learning algorithms, by design, passive acoustic telemetry arrays provide an intuitive approach for implementing RSFs since available resource units can easily be defined based upon receiver positioning. Constraining COAs and background points to the available resource units, defined by acoustic receiver location at the year level, allows for the incorporation or removal of additional receivers across a study period. This flexibility is ideal as arrays often change over time due to funding constraints or adapting research questions. However, to safeguard against biased relative selection estimates, it is important to ensure receiver arrays, including their modifications, are designed to capture space use that is representative of the habitat available (Selby et al., 2019; Griffin et al., 2020). For future studies aimed at examining relative selection, we suggest grid array designs (Heupel et al., 2006; Kraus et al., 2018) to achieve proportionally representative coverage of areas rather than deployments guided by a priori beliefs of animal space use (Brownscombe et al., 2019b). In addition, detection range and efficiency, should be considered during the array design (Brownscombe et al., 2019b), when constructing COAs (Winton et al., 2018b), when defining available resource units around receivers, or even explicitly in the modeling process (see Brownscombe et al., 2019a). Detection efficiency and range, often limited by physical structure, wind, currents, animal noise, or by human activities may vary greatly across a given study area (Gjelland and Hedger, 2013; Kessel et al., 2014).
Here, while potentially incorporating biases due to variable detection ranges (see Selby et al., 2016), we chose a 400 m buffer around each receiver to allow for COAs and associated background points to extend beyond observed detection ranges. However, we recommend testing a wide range of parameter inputs from COA time bin selection to available habitat buffer size. Such inputs should be guided by ecological knowledge, acoustic telemetry coverage, and model accuracy metrics. While COA time bins of 60-min provided more accurate measures for some species and refined predictions, we opted for 90-min bins for all species since this would potentially reduce issues with autocorrelation by subsampling further (Swihart and Slade, 1985). Further, COA time bin selection should consider both the programmed tag delay and the speed of tagged animals, with smaller time bins for faster moving species and larger time bins for slow moving animals. In this example, we found a smaller available habitat buffer produced lower accuracy metrics and led to unreliable predictions across BIRNM. Alternatively, applying a larger available habitat buffer provided similar results to the original models that used 400 m buffers and also captured relative selection for tiger sharks in areas (northeast shelf) where we expected higher values. While 400 m buffers were chosen for consistency to Selby et al. (2019) and Griffin et al. (2020), future researchers should explore and evaluate multiple extents for a given species, study area, and array. Along with variable detection range and efficiency, future RSF studies using acoustic telemetry should also investigate the role of spatial and/or temporal scales on selection modeling (McGarigal et al., 2016); this is especially relevant when collapsing habitat and presences/background points for model implementation.
In summary, we highlight the utility of combining acoustic telemetry, RSFs, and machine learning to understand and accurately predict the relative habitat selection of marine animals across both monitored and unmonitored areas. While RSFs have been used extensively within terrestrial environments, we suggest marine ecologists should also adopt these methods to improve resource management actions. Such applications could help to prioritize habitat protection and restoration in the face of continued anthropogenic threats (Millennium Ecosystem Assessment, 2003). This may have particular advantages centered around MPA design. Here, applied to four shark species within an MPA, we found accurate models that could extrapolate to areas where receiver coverage was limited. Further, when these RSF values were extended to examine predator-prey relationships, we found areas that varied in mutual selection, highlighting the potential overlap of predators and their prey.
Data Availability Statement
The datasets generated for this study are not readily available because studies for each individual species are currently ongoing. Data can be made available upon reasonable request through GC at firstname.lastname@example.org or directly to the data owners: GS (sharks), AJ (reef fish), and KH (sea turtles). Code for the resource selection function framework with a sample dataset can be found at https://github.com/lucaspgriffin.
The animal study was reviewed and approved by University of Massachusetts Amherst IACUC no. 2013-0031 and 2019-0043. Research conducted within BIRNM was approved by NPS under study no. BUIS-00058 and individual research collection permit nos. BUIS-2013-SCI_0003, BUIS-2014-SCI-0006, and BUIS-2019-0010. For green turtle fieldwork, permitting was under NMFS permits 16146 and 20315, issued to KH, National Park IACUC USGS-SESC2014-02 and USGS IACUC WARC\GNV 2017-04. Additional permits issued to KH BUIS-2011-SCI-0012; BUIS-2014-SCI-0009; BUIS-2016-SCI-0009. Acoustic receiver stations were deployed under the following permit numbers through the United States Army Corp of Engineers: SAJ-2014-01790, SAJ-2015-02061, SAJ-2015-02062, SAJ-2017-00622, and SAJ-2017-00624.
LG and GC conceived and led the study. GC, KH, AJ, SB, AN, BD, CP, IL, ZH-S, AD, and GS conducted the field work. LG analyzed the data. All authors interpreted the findings, wrote the manuscript, and approved the final version.
This work was supported by grants from the following funders: Puerto Rico Sea Grant (R-101-2-14), The New England Aquarium’s Marine Conservation Action Fund, The Atlantic White Shark Conservancy, National Geographic Society, The Allen Family Foundation, and the USGS Natural Resource Protection Program and USGS Ecosystems Program.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We acknowledge the USGS Natural Resource Protection Program (NRPP) for funding. AD was supported by the National Institute of Food and Agriculture, United States Department of Agriculture, the Massachusetts Agricultural Experiment Station, and the Department of Environmental Conservation. GC was supported by the NOAA ONMS Dr. Nancy Foster Scholarship. We thank the numerous National Park Service staff and interns for their work maintaining BIRNM receiver array throughout the years, especially Tessa Code and Nathaniel Hanna Holloway. We also thank Jake Brownscombe and Laura D’Acunto for review of an earlier draft of the manuscript. We thank Brace Thompson for providing his artistic talents to figure one. Finally, we thank the reviewers and associate editor for thorough and constructive feedback. Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the United States Government.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmars.2021.631262/full#supplementary-material
BIRNM, Buck Island Reef National Monument; BBMM, Brownian bridge movement model; COAs, centers of activity; GLMM, generalized linear mixed model; MPA, marine protected area; RF, random forest; RSF, resource selection function.
Aarts, G., MacKenzie, M., McConnell, B., Fedak, M., and Matthiopoulos, J. (2008). Estimating space−use and habitat preference from wildlife telemetry data. Ecography 31, 140–160. doi: 10.1111/j.2007.0906-7590.05236.x
Aines, A. C., Carlson, J. K., Boustany, A., Mathers, A., and Kohler, N. E. (2018). Feeding habits of the tiger shark, Galeocerdo cuvier, in the northwest Atlantic Ocean and Gulf of Mexico. Environ. Biol. Fishes 101, 403–415. doi: 10.1007/s10641-017-0706-y
Avgar, T., Lele, S. R., Keim, J. L., and Boyce, M. S. (2017). Relative selection strength: quantifying effect size in habitat−and step−selection inference. Ecol. Evol. 7, 5322–5330. doi: 10.1002/ece3.3122
Barbet-Massin, M., Jiguet, F., Albert, C. H., and Thuiller, W. (2012). Selecting pseudo-absences for species distribution models: how, where and how many? Methods Ecol. Evol. 3, 327–338. doi: 10.1111/j.2041-210x.2011.00172.x
Barley, S. C., Meekan, M. G., and Meeuwig, J. J. (2017b). Species diversity, abundance, biomass, size and trophic structure of fish on coral reefs in relation to shark abundance. Mar. Ecol. Prog. Ser. 565, 163–179. doi: 10.3354/meps11981
Bascompte, J., Melián, C. J., and Sala, E. (2005). Interaction strength combinations and the overfishing of a marine food web. Proc. Natl. Acad. Sci.U.S.A. 102, 5443–5447. doi: 10.1073/pnas.0501562102
Becker, S. L., Finn, J. T., Danylchuk, A. J., Pollock, C. G., Hillis-Starr, Z., Lundgren, I., et al. (2016). Influence of detection history and analytic tools on quantifying spatial ecology of a predatory fish in a marine protected area. Mar. Ecol. Prog. Ser. 562, 147–161. doi: 10.3354/meps11962
Becker, S. L., Finn, J. T., Novak, A. J., Danylchuk, A. J., Pollock, C. G., Hillis-Starr, Z., et al. (2020). Coarse-and fine-scale acoustic telemetry elucidates movement patterns and temporal variability in individual territories for a key coastal mesopredator. Environ. Biol. Fishes 103, 13–29. doi: 10.1007/s10641-019-00930-2
Bouchet, P., Miller, D. L., Roberts, J., Mannocci, L., Harris, C. M., and Thomas, L. (2019). From Here and Now to There and Then: Practical Recommendations for Extrapolating Cetacean Density Surface Models to Novel Conditions. (CREEM technical report No. 2019–01). St Andrews: University of St Andrews.
Bouchet, P. J., Miller, D. L., Roberts, J. J., Mannocci, L., Harris, C. M., and Thomas, L. (2020). dsmextra: extrapolation assessment tools for density surface models. Methods Ecol. Evolution. 11, 1464–1469. doi: 10.1111/2041-210x.13469
Brownscombe, J. W., Griffin, L. P., Chapman, J. M., Morley, D., Acosta, A., Crossin, G. T., et al. (2019a). A practical method to account for variation in detection range in acoustic telemetry arrays to accurately quantify the spatial ecology of aquatic animals. Methods Ecol. Evol. 11, 82–94. doi: 10.1111/2041-210X.13322
Brownscombe, J. W., Griffin, L. P., Morley, D., Acosta, A., Hunt, J., Lowerre-Barbieri, S. K., et al. (2020). Application of machine learning algorithms to identify cryptic reproductive habitats using diverse information sources. Oecologia 194, 283–298. doi: 10.1007/s00442-020-04753-2
Brownscombe, J. W., Lédée, E. J. I. I., Raby, G. D., Struthers, D. P., Gutowsky, L. F. G. G., Nguyen, V. M., et al. (2019b). Conducting and interpreting fish telemetry studies: considerations for researchers and resource managers. Rev. Fish Biol. Fish. 29, 369–400. doi: 10.1007/s11160-019-09560-4
Bryan, D. R., Feeley, M. W., Nemeth, R. S., Pollock, C., and Ault, J. S. (2019). Home range and spawning migration patterns of queen triggerfish Balistes vetula in St. Croix, US Virgin Islands. Mar. Ecol. Prog. Ser. 616, 123–139. doi: 10.3354/meps12944
Buston, P. M., and Elith, J. (2011). Determinants of reproductive success in dominant pairs of clownfish: a boosted regression tree analysis. J. Anim. Ecol. 80, 528–538. doi: 10.1111/j.1365-2656.2011.01803.x
Campbell, H. A., Watts, M. E., Dwyer, R. G., and Franklin, C. E. (2012). V-Track: software for analysing and visualising animal movement from acoustic telemetry detections. Mar. Freshw. Res. 63, 815–820. doi: 10.1071/MF12194
Carlisle, A. B., Tickler, D., Dale, J. J., Ferretti, F., Curnick, D. J., Chapple, T. K., et al. (2019). Estimating space use of mobile fishes in a large marine protected area with methodological considerations in acoustic array design. Front. Mar. Sci. 6:256. doi: 10.3389/fmars.2019.00256
Carlson, J., Charvet, P., Blanco-Parra, M. P, Briones Bell-lloch, A., Cardenosa, D., Derrick, D., et al. (2021a). Carcharhinus perezi. The IUCN Red List of Threatened Species 2021: e.T60217A3093780. doi: 10.2305/IUCN.UK.2021-1.RLTS.T60217A3093780.en
Carlson, J., Charvet, P., Ba, A., Bizzarro, J., Derrick, D., Espinoza, M., et al. (2021b). Negaprion Brevirostris. The IUCN Red List of Threatened Species 2021: e.T39380A2915472. doi: 10.2305/IUCN.UK.2021-1.RLTS.T39380A2915472.en
Carlson, J., Charvet, P., Blanco-Parra, M. P., Briones Bell-lloch, A., Cardenosa, D., Derrick, D., et al. (2021c). Ginglymostoma Cirratum. The IUCN Red List of Threatened Species 2021: e.T144141186A3095153. doi: 10.2305/IUCN.UK.2021-1.RLTS.T144141186A3095153.en
Cashion, T., Nguyen, T., ten Brink, T., Mook, A., Palacios-Abrantes, J., and Roberts, S. M. (2020). Shifting seas, shifting boundaries: dynamic marine protected area designs for a changing climate. PLoS One 15:e0241771. doi: 10.1371/journal.pone.0241771
Casselberry, G. A., Danylchuk, A. J., Finn, J. T., DeAngelis, B. M., Jordaan, A., Pollock, C. G., et al. (2020). Network analysis reveals multispecies spatial associations in the shark community of a Caribbean marine protected area. Mar. Ecol. Prog. Ser. 633, 105–126. doi: 10.3354/meps13158
Ciarniello, L. M., Boyce, M. S., Heard, D. C., and Seip, D. R. (2007). Components of grizzly bear habitat selection: density, habitats, roads, and mortality risk. J. Wildl. Manag. 71, 1446–1457. doi: 10.2193/2006-229
Cooke, S. J. (2008). Biotelemetry and biologging in endangered species research and animal conservation: relevance to regional, national, and IUCN Red List threat assessments. Endanger. Species Res. 4, 165–185. doi: 10.3354/esr00063
Cooke, S. J., Hinch, S. G., Wikelski, M., Andrews, R. D., Kuchel, L. J., Wolcott, T. G., et al. (2004). Biotelemetry: a mechanistic approach to ecology. Trends Ecol. Evol. 19, 334–343. doi: 10.1016/j.tree.2004.04.003
Costa, B. M., Tormey, S., and Battista, T. A. (2012). Benthic Habitats of Buck Island Reef National Monument. NOAA Technical Memorandum NOS NCCOS 142. Silver Spring, MD: NCCOS Center for Coastal Monitoring and Assessment Biogeography Branch, 64.
Coverdale, T. C., Herrmann, N. C., Altieri, A. H., and Bertness, M. D. (2013). Latent impacts: the role of historical human activity in coastal habitat loss. Front. Ecol. Environ. 11:69–74. doi: 10.1890/120130
Craig, J. K., and Crowder, L. B. (2002). “Factors influencing habitat selection in fishes with a review of marsh ecosystems,” in Concepts and Controversies in Tidal Marsh Ecology, eds M. P. Weinstein and D. A. Kreeger (Dordrecht: Springer), 241–266. doi: 10.1007/0-306-47534-0_12
Daly, R., Smale, M. J., Singh, S., Anders, D., Shivji, M. K., Daly, C. A., et al. (2018). Refuges and risks: evaluating the benefits of an expanded MPA network for mobile apex predators. Divers. Distrib. 24, 1217–1230. doi: 10.1111/ddi.12758
Descombes, P., Wisz, M. S., Leprieur, F., Parravicini, V., Heine, C., Olsen, S. M., et al. (2015). Forecasted coral reef decline in marine biodiversity hotspots under climate change. Glob. Change Biol. 21, 2479–2487. doi: 10.1111/gcb.12868
Donaldson, M. R., Hinch, S. G., Suski, C. D., Fisk, A. T., Heupel, M. R., and Cooke, S. J. (2014). Making connections in aquatic ecosystems with acoustic telemetry monitoring. Front. Ecol. Environ. 12:565–573. doi: 10.1890/130283
Durden, J. M., Luo, J. Y., Alexander, H., Flanagan, A. M., and Grossmann, L. (2017). Integrating “big data” into aquatic ecology: CHALLENGES and opportunities. Limnol. Oceanogr. Bull. 26, 101–108. doi: 10.1002/lob.10213
Espinoza, M., Farrugia, T. J., Webber, D. M., Smith, F., and Lowe, C. G. (2011). Testing a new acoustic telemetry technique to quantify long-term, fine-scale movements of aquatic animals. Fish. Res. 108, 364–371. doi: 10.1016/j.fishres.2011.01.011
Feeley, M. W., Morley, D., Acosta, A., Barbera, P., Hunt, J., Switzer, T., et al. (2018). Spawning migration movements of Mutton Snapper in Tortugas, Florida: spatial dynamics within a marine reserve network. Fish. Res. 204, 209–223. doi: 10.1016/j.fishres.2018.02.020
Fieberg, J., Matthiopoulos, J., Hebblewhite, M., Boyce, M. S., and Frair, J. L. (2010). Correlation and studies of habitat selection: problem, red herring or opportunity? Philos. Trans. R. Soc. B Biol. Sci. 365, 2233–2244. doi: 10.1098/rstb.2010.0079
Fieberg, J. R., Forester, J. D., Street, G. M., Johnson, D. H., ArchMiller, A. A., and Matthiopoulos, J. (2018). Used−habitat calibration plots: a new procedure for validating species distribution, resource selection, and step−selection models. Ecography 41, 737–752. doi: 10.1111/ecog.03123
Fleming, C. H., Fagan, W. F., Mueller, T., Olson, K. A., Leimgruber, P., and Calabrese, J. M. (2015). Rigorous home range estimation with movement data: a new autocorrelated kernel density estimator. Ecology 96, 1182–1188. doi: 10.1890/14-2010.1
Foley, M. M., Halpern, B. S., Micheli, F., Armsby, M. H., Caldwell, M. R., Crain, C. M., et al. (2010). Guiding ecological principles for marine spatial planning. Mar. Policy 34, 955–966. doi: 10.1016/j.marpol.2010.02.001
Fraschetti, S., D’Ambrosio, P., Micheli, F., Pizzolante, F., Bussotti, S., and Terlizzi, A. (2009). Design of marine protected areas in a human-dominated seascape. Mar. Ecol. Prog. Ser. 375, 13–24. doi: 10.3354/meps07781
Freitas, C., Olsen, E. M., Knutsen, H., Albretsen, J., and Moland, E. (2016). Temperature−associated habitat selection in a cold−water marine fish. J. Anim. Ecol. 85, 628–637. doi: 10.1111/1365-2656.12458
Fu, A. L., Hammerschlag, N., Lauder, G. V., Wilga, C. D., Kuo, C., and Irschick, D. J. (2016). Ontogeny of head and caudal fin shape of an apex marine predator: the tiger shark (Galeocerdo cuvier). J. Morphol. 277, 556–564. doi: 10.1002/jmor.20515
Gallagher, A. J., Shipley, O. N., van Zinnicq Bergmann, M. P., Brownscombe, J. W., Dahlgren, C. P., Frisk, M. G., et al. (2021). Spatial connectivity and drivers of shark habitat use within a large marine protected area in the caribbean, the bahamas shark sanctuary. Front. Mar. Sci. 7:1223.
Gillies, C. S., Hebblewhite, M., Nielsen, S. E., Krawchuk, M. A., Aldridge, C. L., Frair, J. L., et al. (2006). Application of random effects to the study of resource selection by animals. J. Anim. Ecol. 75, 887–898. doi: 10.1111/j.1365-2656.2006.01106.x
Gjelland, K. Ø, and Hedger, R. D. (2013). Environmental influence on transmitter detection probability in biotelemetry: developing a general model of acoustic transmission. Methods Ecol. Evol. 4, 665–674. doi: 10.1111/2041-210x.12057
Gleason, M., McCreary, S., Miller-Henson, M., Ugoretz, J., Fox, E., Merrifield, M., et al. (2010). Science-based and stakeholder-driven marine protected area network planning: a successful case study from north central California. Ocean Coast. Manag. 53, 52–68. doi: 10.1016/j.ocecoaman.2009.12.001
Godvik, I. M. R., Loe, L. E., Vik, J. O., Veiberg, V., Langvatn, R., and Mysterud, A. (2009). Temporal scales, trade−offs, and functional responses in red deer habitat selection. Ecology 90, 699–710. doi: 10.1890/08-0576.1
Griffin, L. L. P., Finn, J. T. J., Diez, C., and Danylchuk, A. A. J. (2019). Movements, connectivity, and space use of immature green turtles within coastal habitats of the Culebra Archipelago, Puerto Rico: implications for conservation. Endanger. Species Res. 40, 75–90. doi: 10.3354/esr00976
Griffin, L. P., Smith, B. J., Cherkiss, M. S., Crowder, A. G., Pollock, C. G., Starr, Z. H., et al. (2020). Space use and relative habitat selection for immature green turtles within a Caribbean marine protected area. Anim. Biotelem. 8, 1–13. doi: 10.1186/s40317-020-00209-9
Gutowsky, L. F. G., Harrison, P. M., Martins, E. G., Leake, A., Patterson, D. A., Zhu, D. Z., et al. (2017). Daily temperature experience and selection by adfluvial bull trout (Salvelinus confluentus). Environ. Biol. Fishes 100, 1167–1180. doi: 10.1007/s10641-017-0634-x
Gutowsky, L. F. G., Romine, J. G., Heredia, N. A., Bigelow, P. E., Parsley, M. J., Sandstrom, P. T., et al. (2020). Revealing migration and reproductive habitat of invasive fish under an active population suppression program. Conserv. Sci. Pract. 2: e119.
Hammerschlag, N., Schmitz, O. J., Flecker, A. S., Lafferty, K. D., Sih, A., Atwood, T. B., et al. (2019). Ecosystem function and services of aquatic predators in the anthropocene. Trends Ecol. Evol. 34, 369–383. doi: 10.1016/j.tree.2019.01.005
Harrison, P. M., Gutowsky, L. F. G., Martins, E. G., Patterson, D. A., Cooke, S. J., and Power, M. (2016). Temporal plasticity in thermal−habitat selection of burbot Lota lota a diel−migrating winter−specialist. J. Fish Biol. 88, 2111–2129. doi: 10.1111/jfb.12990
Hastings, R. A., Rutterford, L. A., Freer, J. J., Collins, R. A., Simpson, S. D., and Genner, M. J. (2020). Climate change drives poleward increases and equatorward declines in marine species. Curr. Biol. 30, 1572–1577.e2.
Hays, G. C., Bailey, H., Bograd, S. J., Bowen, W. D., Campagna, C., Carmichael, R. H., et al. (2019). Translating marine animal tracking data into conservation policy and management. Trends Ecol. Evol. 34, 459–473.
Hebblewhite, M., and Merrill, E. (2008). Modelling wildlife–human relationships for social species with mixed−effects resource selection models. J. Appl. Ecol. 45, 834–844. doi: 10.1111/j.1365-2664.2008.01466.x
Heffelfinger, L. J., Stewart, K. M., Shoemaker, K. T., Darby, N. W., and Bleich, V. C. (2020). Balancing current and future reproductive investment: variation in resource selection during stages of reproduction in a long-lived herbivore. Front. Ecol. Evol. 8:163. doi: 10.3389/fevo.2020.00163
Heinrichs, J. A., Aldridge, C. L., O’Donnell, M. S., and Schumaker, N. H. (2017). Using dynamic population simulations to extend resource selection analyses and prioritize habitats for conservation. Ecol. Model. 359, 449–459. doi: 10.1016/j.ecolmodel.2017.05.017
Heithaus, M. R., and Dill, L. M. (2002). Food availability and tiger shark predation risk influence bottlenose dolphin habitat use. Ecology 83, 480–491. doi: 10.1890/0012-9658(2002)083[0480:faatsp]2.0.co;2
Heithaus, M. R., Dill, L. M., Marshall, G. J., and Buhleier, B. (2002). Habitat use and foraging behavior of tiger sharks (Galeocerdo cuvier) in a seagrass ecosystem. Mar. Biol. 140, 237–248. doi: 10.1007/s00227-001-0711-7
Heithaus, M. R., Frid, A., Wirsing, A. J., Dill, L. M., Fourqurean, J. W., Burkholder, D., et al. (2007). State−dependent risk−taking by green sea turtles mediates top−down effects of tiger shark intimidation in a marine ecosystem. J. Anim. Ecol. 76, 837–844. doi: 10.1111/j.1365-2656.2007.01260.x
Hengl, T., Nussbaum, M., Wright, M. N., Heuvelink, G. B. M., and Gräler, B. (2018). Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables. PeerJ 6:e5518. doi: 10.7717/peerj.5518
Heupel, M. R., Munroe, S. E. M., Lédée, E. J. I., Chin, A., and Simpfendorfer, C. A. (2019). Interspecific interactions, movement patterns and habitat use in a diverse coastal shark assemblage. Mar. Biol. 166:68.
Heupel, M. R., Semmens, J. M., and Hobday, A. J. (2006). Automated acoustic tracking of aquatic animals: scales, design and deployment of listening station arrays. Mar. Freshw. Res. 57, 1–13. doi: 10.1071/mf05091
Hoekstra, J. M., Boucher, T. M., Ricketts, T. H., and Roberts, C. (2005). Confronting a biome crisis: global disparities of habitat loss and protection. Ecol. Lett. 8, 23–29. doi: 10.1111/j.1461-0248.2004.00686.x
Hussey, N. E., Kessel, S. T., Aarestrup, K., Cooke, S. J., Cowley, P. D., Fisk, A. T., et al. (2015). ECOLOGY. Aquatic animal telemetry: a panoramic window into the underwater world. Science (New York, N.Y.) 348:1255642. doi: 10.1126/science.1255642
Jacoby, D. M. P., Ferretti, F., Freeman, R., Carlisle, A. B., Chapple, T. K., Curnick, D. J., et al. (2020). Shark movement strategies influence poaching risk and can guide enforcement decisions in a large, remote marine protected area. J. Appl. Ecol. 57, 1782–1792. doi: 10.1111/1365-2664.13654
Johnson, C. J., Nielsen, S. E., Merrill, E. H., McDONALD, T. L., and Boyce, M. S. (2006). Resource selection functions based on use−availability data: theoretical motivation and evaluation methods. J. Wildl. Manag. 70, 347–357. doi: 10.2193/0022-541x(2006)70[347:rsfbou]2.0.co;2
Johnson, C. J., Seip, D. R., and Boyce, M. S. (2004). A quantitative approach to conservation planning: using resource selection functions to map the distribution of mountain caribou at multiple spatial scales. J. Appl. Ecol. 41, 238–251. doi: 10.1111/j.0021-8901.2004.00899.x
Keller, J. A., Herbig, J. L., Morley, D., Wile, A., Barbera, P., and Acosta, A. (2020). Grouper tales: use of acoustic telemetry to evaluate grouper movements at Western Dry Rocks in the Florida Keys. Mar. Coast. Fish. 12, 290–307. doi: 10.1002/mcf2.10109
Kessel, S. T., Cooke, S. J., Heupel, M. R., Hussey, N. E., Simpfendorfer, C. A., Vagle, S., et al. (2014). A review of detection range testing in aquatic passive acoustic telemetry studies. Rev. Fish Biol. Fish. 24, 199–218. doi: 10.1007/s11160-013-9328-4
Knip, D. M., Heupel, M. R., and Simpfendorfer, C. A. (2012). Evaluating marine protected areas for the conservation of tropical coastal sharks. Biol. Conserv. 148, 200–209. doi: 10.1016/j.biocon.2012.01.008
Koper, N., and Manseau, M. (2009). Generalized estimating equations and generalized linear mixed−effects models for modelling resource selection. J. Appl. Ecol. 46, 590–599. doi: 10.1111/j.1365-2664.2009.01642.x
Kramer, D. L., Rangeley, R. W., and Chapman, L. J. (1997). “Habitat selection: patterns of spatial distribution from behavioural decisions,” in Behavioural Ecology of Teleost Fishes, ed. J.-G. J. Godin (Oxford: Oxford University Press), 37–80.
Kraus, R. T., Holbrook, C. M., Vandergoot, C. S., Stewart, T. R., Faust, M. D., Watkinson, D. A., et al. (2018). Evaluation of acoustic telemetry grids for determining aquatic animal movement and survival. Methods Ecol. Evol. 9, 1489–1502. doi: 10.1111/2041-210x.12996
Lea, J. S. E. E., Humphries, N. E., von Brandis, R. G., Clarke, C. R., and Sims, D. W. (2016). Acoustic telemetry and network analysis reveal the space use of multiple reef predators and enhance marine protected area design. Proc. R. Soc. B Biol. Sci. 283:20160717. doi: 10.1098/rspb.2016.0717
Ledee, E. J. I., Heupel, M. R., Tobin, A. J., Mapleston, A., and Simpfendorfer, C. A. (2016). Movement patterns of two carangid species in inshore habitats characterised using network analysis. Mar. Ecol. Prog. Ser. 553, 219–232. doi: 10.3354/meps11777
Legare, B., Kneebone, J., DeAngelis, B., and Skomal, G. (2015). The spatiotemporal dynamics of habitat use by blacktip (Carcharhinus limbatus) and lemon (Negaprion brevirostris) sharks in nurseries of St. John, United States Virgin Islands. Mar. Biol. 162, 699–716. doi: 10.1007/s00227-015-2616-x
Lele, S. R., Merrill, E. H., Keim, J., and Boyce, M. S. (2013). Selection, use, choice and occupancy: clarifying concepts in resource selection studies. J. Anim. Ecol. 82, 1183–1191. doi: 10.1111/1365-2656.12141
Lowe, C. G., Wetherbee, B. M., Crow, G. L., and Tester, A. L. (1996). Ontogenetic dietary shifts and feeding behavior of the tiger shark, Galeocerdo cuvier, in Hawaiian waters. Environ. Biol. Fishes 47, 203–211. doi: 10.1007/bf00005044
Lowerre-Barbieri, S. K., Catalán, I. A., Frugård Opdal, A., and Jørgensen, C. (2019). Preparing for the future: integrating spatial ecology into ecosystem-based management. ICES J. Mar. Sci. 76, 467–476. doi: 10.1093/icesjms/fsy209
Lubchenco, J., Palumbi, S. R., Gaines, S. D., and Andelman, S. (2003). Plugging a hole in the ocean: the emerging science of marine reserves 1. Ecol. Appl. 13, 3–7. doi: 10.1890/1051-0761(2003)013[0003:pahito]2.0.co;2
Manly, B. F. L., McDonald, L., Thomas, D. L., McDonald, T. L., and Erickson, W. P. (2007). Resource Selection by Animals: Statistical Design and Analysis for Field Studies. Berlin: Springer Science & Business Media.
Matley, J. K., Heupel, M. R., Fisk, A. T., Simpfendorfer, C. A., and Tobin, A. J. (2017). Measuring niche overlap between co-occurring Plectropomus spp. using acoustic telemetry and stable isotopes. Mar. Freshw. Res. 68, 1468–1478. doi: 10.1071/mf16120
McGarigal, K., Wan, H. Y., Zeller, K. A., Timm, B. C., and Cushman, S. A. (2016). Multi-scale habitat selection modeling: a review and outlook. Landsc. Ecol. 31, 1161–1175. doi: 10.1007/s10980-016-0374-x
McLoughlin, P. D., Case, R. L., Gau, R. J., Cluff, D. H., Mulders, R., and Messier, F. (2002). Hierarchical habitat selection by barren-ground grizzly bears in the central Canadian Arctic. Oecologia 132, 102–108. doi: 10.1007/s00442-002-0941-5
Mesgaran, M. B., Cousens, R. D., and Webber, B. L. (2014). Here be dragons: a tool for quantifying novelty due to covariate range and correlation change when projecting species distribution models. Divers. Distrib. 20, 1147–1159. doi: 10.1111/ddi.12209
Micheli, F., Amarasekare, P., Bascompte, J., and Gerber, L. R. (2004). Including species interactions in the design and evaluation of marine reserves: some insights from a predator-prey model. Bull. Mar. Sci. 74, 653–669.
Nakagawa, S., and Schielzeth, H. (2013). A general and simple method for obtaining R2 from generalized linear mixed−effects models. Methods Ecol. Evol. 4, 133–142. doi: 10.1111/j.2041-210x.2012.00261.x
Nguyen, V. M., Young, N., Brownscombe, J. W., and Cooke, S. J. (2019). Collaboration and engagement produce more actionable science: quantitatively analyzing uptake of fish tracking studies. Ecol. Appl. 29: e01943.
Nielsen, S. E., Boyce, M. S., Stenhouse, G. B., and Munro, R. H. M. (2003). Development and testing of phenologically driven grizzly bear habitat models. Ecoscience 10, 1–10. doi: 10.1080/11956860.2003.11682743
Novak, A. J., Becker, S. L., Finn, J. T., Danylchuk, A. J., Pollock, C. G., Hillis-Starr, Z., et al. (2020a). Inferring residency and movement patterns of horse-eye jack Caranx latus in relation to a Caribbean marine protected area acoustic telemetry array. Anim. Biotelem. 8, 1–13.
Novak, A. J., Becker, S. L., Finn, J. T., Pollock, C. G., Hillis−Starr, Z., and Jordaan, A. (2020b). Scale of biotelemetry data influences ecological interpretations of space and habitat use in Yellowtail Snapper. Mar. Coast. Fish. 12, 364–377. doi: 10.1002/mcf2.10119
Ogburn, M. B., Harrison, A.-L., Whoriskey, F. G., Cooke, S. J., Mills Flemming, J. E., and Torres, L. G. (2017). Addressing challenges in the application of animal movement ecology to aquatic conservation and management. Front. Mar. Sci. 4:70. doi: 10.3389/fmars.2017.00070
O’Leary, B. C., Winther−Janson, M., Bainbridge, J. M., Aitken, J., Hawkins, J. P., and Roberts, C. M. (2016). Effective coverage targets for ocean protection. Conserv. Lett. 9, 398–404. doi: 10.1111/conl.12247
Ordiz, A., Uzal, A., Milleret, C., Sanz-Pérez, A., Zimmermann, B., Wikenros, C., et al. (2020). Wolf habitat selection when sympatric or allopatric with brown bears in Scandinavia. Sci. Rep. 10, 1–11.
O’Shea, O. R., Mandelman, J., Talwar, B., and Brooks, E. J. (2015). Novel observations of an opportunistic predation event by four apex predatory sharks. Mar. Freshw. Behav. Physiol. 48, 374–380. doi: 10.1080/10236244.2015.1054097
Peters, D. P. C., Havstad, K. M., Cushing, J., Tweedie, C., Fuentes, O., and Villanueva-Rosales, N. (2014). Harnessing the power of big data: infusing the scientific method with machine learning to transform ecology. Ecosphere 5, 1–15.
Pickard, A. E., Vaudo, J. J., Wetherbee, B. M., Nemeth, R. S., Blondeau, J. B., Kadison, E. A., et al. (2016). Comparative use of a Caribbean mesophotic coral ecosystem and association with fish spawning aggregations by three species of shark. PLoS One 11:e0151221. doi: 10.1371/journal.pone.0151221
Pikitch, E. K., Chapman, D. D., Babcock, E. A., and Shivji, M. S. (2005). Habitat use and demographic population structure of elasmobranchs at a Caribbean atoll (Glover’s Reef, Belize). Mar. Ecol. Prog. Ser. 302, 187–197. doi: 10.3354/meps302187ittman
Pittman, S. J., Hile, S. D., Jeffrey, C. F. G., Caldow, C., Kendall, M. S., Monaco, M. E., et al. (2008). Fish Assemblages and Benthic Habitats of Buck Island Reef National Monument (St. Croix, U.S. Virgin Islands) and the Surrounding Seascape: A Characterization of Spatial and Temporal Patterns. NOAA Tech Memo NOS NCCOS 71. Silver Spring, MD: NOAA.
Pratchett, M. S., Hoey, A. S., Wilson, S. K., Messmer, V., and Graham, N. A. J. (2011). Changes in biodiversity and functioning of reef fish assemblages following coral bleaching and coral loss. Diversity 3, 424–452. doi: 10.3390/d3030424
Raymond, B., Lea, M., Patterson, T., Andrews−Goff, V., Sharples, R., Charrassin, J., et al. (2015). Important marine habitat off east Antarctica revealed by two decades of multi−species predator tracking. Ecography 38, 121–129. doi: 10.1111/ecog.01021
Rosenzweig, M. L. (1974). “On the evolution of habitat selection,” in Proceedings of the First International Congress of Ecology, (Wageningen: Center for Agricultural Publishing and Documentation), 404.
Rothschild, B. J., Ault, J. S., Goulletquer, P., and Héral, M. (1994). Decline of the Chesapeake Bay oyster population: a century of habitat destruction and overfishing. Mar. Ecol. Prog. Ser. 111, 29–39. doi: 10.3354/meps111029
Sala, E., Lubchenco, J., Grorud-Colvert, K., Novelli, C., Roberts, C., and Sumaila, U. R. (2018). Assessing real progress towards effective ocean protection. Mar. Policy 91, 11–13. doi: 10.1016/j.marpol.2018.02.004
Schratz, P., Muenchow, J., Iturritxa, E., Richter, J., and Brenning, A. (2018). Performance evaluation and hyperparameter tuning of statistical and machine-learning models using spatial data. arXiv [Preprint] arXiv:1803.11266. doi: 10.1016/j.ecolmodel.2019.06.002
Selby, T. H., Hart, K. M., Fujisaki, I., Smith, B. J., Pollock, C. J., Hillis-Starr, Z., et al. (2016). Can you hear me now? Range-testing a submerged passive acoustic receiver array in a Caribbean coral reef habitat. Ecol. Evol. 6, 4823–4835. doi: 10.1002/ece3.2228
Selby, T. H., Hart, K. M. K., Smith, B. B. J., Pollock, C. G. C., Hillis-Starr, Z., and Oli, M. M. K. (2019). Juvenile hawksbill residency and habitat use within a Caribbean marine protected area. Endanger. Species Res. 40, 53–64. doi: 10.3354/esr00975
Sequeira, A. M. M., Hays, G. C., Sims, D. W., Eguíluz, V. M., Rodríguez, J. P., Heupel, M. R., et al. (2019). Overhauling ocean spatial planning to improve marine megafauna conservation. Front. Mar. Sci. 6:639. doi: 10.3389/fmars.2019.00639
Shoemaker, K. T., Heffelfinger, L. J., Jackson, N. J., Blum, M. E., Wasley, T., and Stewart, K. M. (2018). A machine−learning approach for extending classical wildlife resource selection analyses. Ecol. Evol. 8, 3556–3569. doi: 10.1002/ece3.3936
Simpfendorfer, C. A., Goodreid, A. B., and McAuley, R. B. (2001). Size, sex and geographic variation in the diet of the tiger shark, Galeocerdo cuvier, from Western Australian waters. Environ. Biol. Fishes 61, 37–46. doi: 10.1023/a:1011021710183
Simpfendorfer, C. A., Heupel, M. R., and Hueter, R. E. (2002). Estimation of short-term centers of activity from an array of omnidirectional hydrophones and its use in studying animal movements. Can. J. Fish. Aqua. Sci. 59, 23–32. doi: 10.1139/f01-191
Simpfendorfer, C. A., Huveneers, C., Steckenreuter, A., Tattersall, K., Hoenner, X., Harcourt, R., et al. (2015). Ghosts in the data: false detections in VEMCO pulse position modulation acoustic telemetry monitoring equipment. Anim. Biotelem. 3:55.
Speed, C. W., Meekan, M. G., Field, I. C., McMahon, C. R., Harcourt, R. G., Stevens, J. D., et al. (2016). Reef shark movements relative to a coastal marine protected area. Reg. Stud. Mar. Sci. 3, 58–66. doi: 10.1016/j.rsma.2015.05.002
Udyawer, V., Dwyer, R. G., Hoenner, X., Babcock, R. C., Brodie, S., Campbell, H. A., et al. (2018). A standardised framework for analysing animal detections from automated tracking arrays. Anim. Biotelem. 6, 1–14. doi: 10.1186/s40317-018-0162-2
White, T. D., Carlisle, A. B., Kroodsma, D. A., Block, B. A., Casagrandi, R., de Leo, G. A., et al. (2017). Assessing the effectiveness of a large marine protected area for reef shark conservation. Biol. Conserv. 207, 64–71. doi: 10.1016/j.biocon.2017.01.009
Winton, M. V., Fay, G., Haas, H. L., Arendt, M., Barco, S., James, M. C., et al. (2018a). Estimating the distribution and relative density of satellite-tagged loggerhead sea turtles using geostatistical mixed effects models. Mar. Ecol. Prog. Ser. 586, 217–232. doi: 10.3354/meps12396
Winton, M. V., Kneebone, J., Zemeckis, D. R., and Fay, G. (2018b). A spatial point process model to estimate individual centres of activity from passive acoustic telemetry data. Methods Ecol. Evol. 9, 2262–2272. doi: 10.1111/2041-210x.13080
Wirsing, A. J., Heithaus, M. R., and Dill, L. M. (2007). Living on the edge: dugongs prefer to forage in microhabitats that allow escape from rather than avoidance of predators. Anim. Behav. 74, 93–101. doi: 10.1016/j.anbehav.2006.11.016
Zeller, K. A., Vickers, T. W., Ernest, H. B., and Boyce, W. M. (2017). Multi-level, multi-scale resource selection functions and resistance surfaces for conservation planning: pumas as a case study. PLoS One 12:e0179570. doi: 10.1371/journal.pone.0179570
Zuur, A. F., Ieno, E. N., and Saveliev, A. A. (2017). Beginner’s Guide to Spatial, Temporal, and Spatial-temporal Ecological Data Analysis With R-INLA: Using GLM and GLMM, Vol. I. Newburgh: Highland Satistics LTd.
Keywords: resource selection, space use, acoustic telemetry, machine learning, random forest, marine protected area, sharks, predator-prey
Citation: Griffin LP, Casselberry GA, Hart KM, Jordaan A, Becker SL, Novak AJ, DeAngelis BM, Pollock CG, Lundgren I, Hillis-Starr Z, Danylchuk AJ and Skomal GB (2021) A Novel Framework to Predict Relative Habitat Selection in Aquatic Systems: Applying Machine Learning and Resource Selection Functions to Acoustic Telemetry Data From Multiple Shark Species. Front. Mar. Sci. 8:631262. doi: 10.3389/fmars.2021.631262
Received: 19 November 2020; Accepted: 19 March 2021;
Published: 29 April 2021.
Edited by:Mark J. Henderson, United States Geological Survey, United States
Reviewed by:Ross Dwyer, University of the Sunshine Coast, Australia
Vinay Udyawer, Australian Institute of Marine Science (AIMS), Australia
Copyright © 2021 Griffin, Casselberry, Hart, Jordaan, Becker, Novak, DeAngelis, Pollock, Lundgren, Hillis-Starr, Danylchuk and Skomal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lucas P. Griffin, email@example.com
†These authors have contributed equally to this work and share last authorship