A Scoping Review of Species Distribution Modeling Methods for Tick Vectors

Background Globally, tick-borne disease is a pervasive and worsening problem that impacts human and domestic animal health, livelihoods, and numerous economies. Species distribution models are useful tools to help address these issues, but many different modeling approaches and environmental data sources exist. Objective We conducted a scoping review that examined all available research employing species distribution models to predict occurrence and map tick species to understand the diversity of model strategies, environmental predictors, tick data sources, frequency of climate projects of tick ranges, and types of model validation methods. Design Following the PRISMA-ScR checklist, we searched scientific databases for eligible articles, their references, and explored related publications through a graphical tool (www.connectedpapers.com). Two independent reviewers performed article selection and characterization using a priori criteria. Results We describe data collected from 107 peer-reviewed articles that met our inclusion criteria. The literature reflects that tick species distributions have been modeled predominantly in North America and Europe and have mostly modeled the habitat suitability for Ixodes ricinus (n = 23; 21.5%). A wide range of bioclimatic databases and other environmental correlates were utilized among models, but the WorldClim database and its bioclimatic variables 1–19 appeared in 60 (56%) papers. The most frequently chosen modeling approach was MaxEnt, which also appeared in 60 (56%) of papers. Despite the importance of ensemble modeling to reduce bias, only 23 papers (21.5%) employed more than one algorithm, and just six (5.6%) used an ensemble approach that incorporated at least five different modeling methods for comparison. Area under the curve/receiver operating characteristic was the most frequently reported model validation method, utilized in nearly all (98.9%) included studies. Only 21% of papers used future climate scenarios to predict tick range expansion or contraction. Regardless of the representative concentration pathway, six of seven genera were expected to both expand and retract depending on location, while Ornithodoros was predicted to only expand beyond its current range. Conclusion Species distribution modeling techniques are useful and widely employed tools for predicting tick habitat suitability and range movement. However, the vast array of methods, data sources, and validation strategies within the SDM literature support the need for standardized protocols for species distribution and ecological niche modeling for tick vectors.


INTRODUCTION
An urgency in understanding the geographic occurrence and epidemiology of emerging zoonotic pathogens has led to the continued interest in describing the climate and habitat suitability for ticks and tick-borne diseases (Tkadlec et al., 2018;MacDonald et al., 2019). Due to their impact on human and veterinary health, livelihoods, and numerous global economies (Jongejan and Uilenberg, 2004), identifying and prioritizing interventions for surveillance, prevention, and control of ticks is of the utmost importance Zanet et al., 2020). Globally, ticks and their associated pathogens cost billions of dollars annually in control measures, lost revenue due to livestock infestations and infection, and medical care (Jongejan and Uilenberg, 2004). Therefore, developing tools that identify suitable habitat for ticks can assist with creating strategies to slow the spread of both the ticks and tick-borne diseases, and further the understanding of environmental factors necessary for tick survival and reproduction (Wilson, 1996;Hahn et al., 2016). Knowledge of how climatic and habitat characteristics determine patterns of tick presence can be accomplished using species distribution models (SDM, Estrada-Peña et al., 2016).
Species distribution models, also referred to as ecological niche models and habitat suitability models [despite differences in depth of focus on defining fundamental species niches among these approaches (Peterson and Soberón, 2012)], represent a suite of statistical and machine-learning tools for predicting suitable species habitat ranges and niches based on correlated environmental conditions (Guisan and Zimmermann, 2000;Franklin, 2010;Peterson et al., 2011). These strategies range from deterministic (e.g., logistic regression) to stochastic (e.g., Bayesian regression trees) approaches, and utilize numerous model validation techniques. With the introduction of the BIOCLIM bioclimatic dataset in the mid-1980s, SDMs were applied to an increasing number of organisms over vast geographies (Booth, 2018). Improvements in modeling power and complexity occurred over the next several decades, but noted limitations still exist in both inherent sampling bias (Phillips et al., 2009;Eisen and Eisen, 2021;Mader et al., 2021) and autocorrelation (Veloz, 2009) of data sources, model assumptions (Stockwell and Peterson, 2002;Lobo et al., 2008;Peterson et al., 2008), and a lack of standardized reporting approaches to modeling parameters to ensure reproducibility (Feng et al., 2019;Wunderlich et al., 2019). Additionally, performance comparisons demonstrate that not all models are equally robust or appropriate depending on the species and its ecology (Stockwell and Peterson, 2002;Qiao et al., 2015;Wunderlich et al., 2019), therefore documenting the variety of SDMs used to predict tick distributions is critical to more thoroughly understanding the extent of their abilities and their limitations.
Predicting the global distributions of tick species often involves the use of climate projections to account for forecasted changes in abiotic conditions which subsequently affect tick survival. To effectively incorporate abiotic variables, many models use bioclimatic variables that summarize raster data derived from mean monthly temperature and precipitation to estimate climate ranges meaningful for biological species ( 1 Fick and Hijmans, 2017). Of the 19 bioclimatic variables available, only 15 are now considered to be suitable for habitat suitability modeling due to presence of spatial artifacts among estimates of mean temperature of wettest and driest quarter  and precipitation of warmest and driest quarter (BIO 18-19;Escobar et al., 2014). In concert with spatial data on vegetation composition and/or elevation data, SDMs can be incorporated into climate projections using either Intergovernmental Panel on Climate Change (IPCC) climate scenarios or Representative Concentration Pathways (R) with or without the use of Global Circulation Models (GCMs). Global Circulation Models are mathematical representations of the physical processes in the atmosphere, ocean, cryosphere and land surface, where with the many GCMs available, it is crucial to distinguish between the nuances in how climate values are estimated and how these results impact spatial variability in species distributions (Guevara et al., 2019). In contrast, climate and RCP emission scenarios (e.g., A2, A2A, B2A, RCP 4.5, RCP 6.0, RCP 8.5) represent predictions of climate outcomes based on greenhouse gas and aerosol concentrations and land use change (Jubb et al., 2013). For example, RCP 4.5 is the lowest-emission scenario where atmospheric carbon is 650 parts per million (ppm) and stabilizes after the year 2100, whereas RCP 8.5 is the highest-emission scenario with > 1,370 ppm atmospheric carbon (Arora et al., 2011;Jubb et al., 2013). Combining both GCMs and climate projection scenarios allow for a greater understanding of how tick species distributions can potentially shift because of suitable habitat availability.
Species distribution models are relatively new in their application to tick vectors, and a standardized approach for reproducibility does not exist for data inclusion, model ensembles, and validation methods (Feng et al., 2019). Additionally, the focus on projecting distributions based on future climate scenarios for different species presents multiple possibilities for tick control and mitigation. These reasons make a scoping review a practical and useful approach to systematically examine, evaluate, and identify any gaps within the literature on this topic. A previous scoping review demonstrated the gaps in SDM studies involving ticks within the Amblyomma genus in conjunction with rickettsial bacteria, a major global pathogen of human and veterinary concern (Lippi et al., 2021a). Similarly, a recent systematic review sought to examine the past 20 years of distribution modeling on ticks and tick-borne pathogens, with a focused meta-analysis on 20 SDM studies conducted in Africa (Zannou et al., 2021). Our study sought to understand the state of SDM research surrounding all tick vectors since the beginning of SDM algorithms because differences in habitat requirements across species may impact the types of algorithms, environmental correlates, and modeling approaches chosen.
We conducted a scoping review examining all available peerreviewed literature that employed habitat suitability and species distribution models to predict occurrence of tick species for the purposes of documenting the diversity of model algorithms and strategies, abiotic and biotic environmental predictors, tick occurrence data sources, types of model validation methods, and frequency and source of climate projection data for forecasting of future tick occurrence.

MATERIALS AND METHODS
Following the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines for scoping reviews (Tricco et al., 2018), we conducted literature searches through August 2021, using the databases Web of Science 2 and PubMed 3 since evidence suggests their keyword search tools together offer the best article update frequency and longitudinal coverage (Falagas et al., 2008). Our search criteria for all databases included "ticks" [AND] "species distribution model, " [OR] "habitat suitability model, " [OR] "ecological niche model." We selected all resulting peer-reviewed articles published in English, or with English translations, with no publication date or study region exclusions. Secondary and tertiary literature searches for additional publications meeting these search criteria were conducted by reviewing references from the initial search results, and by entering the original search results into Connected Papers 4 , a graphical literature search tool that uses the concepts of co-citation and bibliographic coupling to compile relevant paper lists and graphics from Semantic Scholar Paper Corpus (Ammar et al., 2018). Duplicate papers were removed, and the final list was examined for eligibility by two independent reviewers.

RESULTS
Our literature search, after duplicate article removal and examination of references and related papers, revealed 138 peerreviewed articles on species distribution models and tick vectors that were assessed for eligibility (Figure 1). Fifteen studies were removed from consideration because they lacked a statistical algorithm that evaluated tick occurrence based on abiotic or biotic environmental correlates. Six papers were excluded from analysis because they were reviews, not a primary literature article, or were gray literature (i.e., not peer-reviewed), and 10 papers were removed from analysis because they focused the ecological niche analysis primarily on a tick-borne pathogen or a non-tick animal. A total of 107 full-text articles were included in the final analysis (Figure 1).

Tick Occurrence and Environmental Data Sources
Tick occurrence records were sourced from a variety of collections worldwide that included both active (n = 21) and passive (n = 16) surveillance methods, often both within a single paper (n = 106). The most frequently used source of tick location data was published literature records (n = 62), followed by field collections (n = 30) conducted by the authors (Figure 4). Four publicly accessible, internet-based databases of tick occurrence records were also observed in the literature, namely the Global Biodiversity Information Facility (GBIF) (n = 8), Walter Reed Biosystematics Unit (WBRU) (n = 7), VectorMap (n = 6), and Biodiversity Information Serving our Nation (BISON) (n = 1) (Figure 4). There were 60 different databases and sources of environmental and other correlates used across the included literature (Supplementary Table 1). The most frequently used database for abiotic variables was WorldClim, included in just over half (56%) of papers. Variables representing tick host presence were rarely included and appeared in the form of host availability in only two papers Miao et al., 2020).
The finest scale resolution observed for environmental correlates was 15 m 2 for elevation and land cover data extracted from the Southern Ontario Land Resource Information System (SOLRISv3.0) Slatculescu et al., 2020),  while the coarsest scale observed was 600 km 2 for rainfall, temperature, vegetation type, NDVI, and elevation from the Center for Resource and Environmental Studies (Cumming, 2002; Supplementary Table 1). Most papers examined tick distribution probabilities at a spatial resolution between 1 and 10 km 2 (n = 65) (Supplementary Table 1). Analyses in 12 papers were conducted at a spatial resolution less than 1 km 2 , and tick species distribution modeling was performed in 18 papers at spatial resolutions greater than 10 km 2 (Supplementary Table 1). Finally, 12 papers did not report the spatial resolution at which the analysis was performed.

Modeling and Validation Methods
Thirty-two different habitat distribution modeling methods were represented in this sample of publications. The most frequently chosen modeling approaches were maximum entropy, otherwise known as MaxEnt, which appeared in 56% of papers (n = 60), followed by generalized linear models (n = 24; 22.4%) ( Figure 5 and Table 2). The remaining methods were used nine times or fewer, each accounting for less than 10% of the remaining papers ( Figure 5 and Table 2). Only 23 papers (21.5%) employed more than one algorithm, and just six (5.6%) used an ensemble approach that incorporated at least five different modeling methods for comparison and weighted averaging across the interpolated surface ( Table 2).
The remaining validation methods were reported once or twice across all included literature. More than half of papers with validation methods reported (49; 60.6%) used two or more evaluation techniques for each model. Ten papers (roughly 9% of the total papers included in this review) either did not include a description of validation methods, or supplemental files with this information were not available.

Future Climate Projections
There were 23 studies that included future climate projections, representing seven tick genera and 27 species (Table 4). Of these, only 65% of studies (15/23) used climate projections with the most used climate projection being MIROC (8; 53%), followed by CCSM (6; 40%) and NCAR (4; 26%). Note that multiple climate projection scenarios were used in a study, often as a comparison, with an average of approximately four climate projections used per study. In contrast, 86% of studies (20/23) used AB (5; 25%) or Representative Concentration Pathways (RCP, 15; 75%) emission scenarios. For RCP emissions scenarios, the most used scenario was 8.5 (14; 93%), followed closely by RCP 4.5 (13; 86%), while the most used Special Report on Emissions Scenarios (SRES) scenario was A2 (4; 80%). Eleven studies used environmental data such as NDVI, vegetation cover and elevation to parameterize their models and 14/23 used BIOCLIM data.
For the seven tick genera, twenty studies used climate models to predict both global expansion and retraction of suitable habitat ( Table 5). All genera (6/7) were expected to both expand and retract, except for Ornithodoros where only expansion was predicted. Of these studies, Rhipicephalus habitat suitability was predicted by seven studies (35%), with the range of Ixodes predicted by six studies (30%), followed by Amblyomma which was found in five studies (25%). In contrast, the habitat suitability, and future species distributions of Dermancentor and Ornithodoros were only forecasted by one study each.

DISCUSSION
Our scoping review of tick vector species distribution modeling literature identified 107 papers that represent a wide array of tick occurrence record sources, environmental correlates, model validation strategies, and future climate-related scenarios for tick habitat suitability and distributions. However, we found that certain modeling strategies and data sources are more frequently used than others. MaxEnt, a machine-learning application, was the most used algorithm, appearing in 56% of papers, for predicting tick habitat suitability and distributions. This modeling approach is often favored because it only requires knowledge of species presence locations and generates pseudoabsences to complete the background locations to predict the distribution of a species (Phillips et al., 2006). It is reported to routinely perform better than other presence-only models (Elith et al., 2006;Merow et al., 2013) and is also highly accessible in both open-source R programming packages as well as its own software and is highly user-friendly (Phillips et al., 2006).
A recent comparison of presence-absence model performance found little difference in the general performance (accuracy, discrimination, calibration, and precision) of most of these other model strategies, however, the authors caution that because poorly performing models can generate overfit predictions and overconfident estimations, it is important to work within sets of models first that are complementary within their assumptions before creating multi-model ensembles . Overall, machine-learning approaches appear to be a popular choice for predicting tick distributions and habitat associations, though relatively few studies incorporated model ensembles despite the importance of controlling for individual model biases and ecological circumstances to reduce uncertainty (Marmion et al., 2009;Roura-Pascual et al., 2009;Qiao et al., 2015), or to simply highlighting model agreement and disagreement (Stohlgren et al., 2010).
Within this body of tick distribution modeling literature were dozens of different tick occurrence data sources that include both active and passive surveillance methods. Given the absence of standardized collection efforts and data recording and data availability across large spatial areas, many types and sources of occurrence records were combined without controlling for sampling biases. Tick occurrence data is inherently biased due  Messina et al., 2015;Springer et al., 2015;Hahn et al., 2016;Sun et al., 2017;Kessler et al., 2019b;Miao et al., 2020;Glass et al., 2021;Lippi et al., 2021b;Zhao et al., 2021 Classification and regression trees (CART) (n = 1) Frontiers in Ecology and Evolution | www.frontiersin.org to sampling regime and location uncertainty, and the inclusion of data sources must reflect this (Zizka et al., 2021). In addition, data collection methods vary widely across and within datasets, affecting both the completeness and the accuracy of reported data. Active surveillance for ticks can vary by collection method (dragging/flagging, small mammal trapping, CO2 trapping), by sampling intensity (time, distance/trap density), and by sampling based on habitat, (Rydzewski et al., 2011(Rydzewski et al., , 2012 leading to variation in the certainty of absence data (Lyons et al., 2021). This can result in models fitted to sampling effort and not true species distribution (Hendrickx et al., 2021). Passive surveillance varies by participation type and by the accuracy of data collected, from GPS location to self-reported, from tick submission to photo identification to self-report (Eisen and Eisen, 2021). A recent review of passive methods in mosquito surveillance found little coherence among programs, resulting in non-comparable data; similar issues likely exist in passive tick surveillance data (Sousa et al., 2022). Inclusion of these data sources without consideration of the potentially conflicting biases could lead to underestimation of model uncertainty (Kramer-Schadt et al., 2013). Particularly of importance with the most common modeling methods is certainty of location data. Positional uncertainty in data points reduces the prediction accuracy of the model (Naimi et al., 2014), but models may be built using historical databases and natural history collections with widely varying certainty due to changes in location methods (Gilliam et al., 2020), as well as misidentifications, changing taxonomical classifications, and unknown origin of the specimens (Graham et al., 2004). It should be noted, however, that these historical and museum-based records may include a substantial level of detail in the data collected, suggesting that they should not be ignored due to the potential complications in data collection; larger data sets can counteract the effect of positional uncertainty (Mitchell et al., 2017). The importance of positional uncertainty may also be mitigated by spatial autocorrelation among bioclimatic variables (Naimi et al., 2014), particularly if the range of the spatial autocorrelation is no more than three times the standard deviation of the positional error (Naimi et al., 2011). Positional uncertainty is also more important among species with narrow ranges than among widespread species (Soultan and Safi, 2017). For example, tick species (3-host) and life stages that parasitize hosts with small dispersal ranges (e.g., small rodents) require stricter thresholds and consideration of positional uncertainty error than those tick species (1-host) and life stages that rely on larger mammals that are capable of much broader dispersal. While difficult, it would be prudent for future tick SDMs to include metrics of host biology within model covariates. In addition, explanatory variables, such as landscape and climate, are frequently time-dependent; temporal alignment of observations and these time-dependent variables is essential to ensure model fitting based on circumstances at the time of observation and the climate niche (Estrada-Peña et al., 2013b).
SDMs also differ in how they handle various sample sizes (Wisz et al., 2008) and proportions of true presence/absence data versus presence/pseudo-absences (Wisz and Guisan, 2009), therefore choosing the proper tools to handle these biases is crucial to reliable prediction outcomes. For example, MaxEnt (Phillips et al., 2017) is a prime option for handling data that contains known presence only, and would thus be a good choice for data structures that contain museum records. Sampling biases across a dataset can also be investigated prior to building models with software tools like "sampbias" which quantifies the biasing effect of human accessibility to data collection sites, and is available as a R package (Zizka et al., 2021). Similar to the variety of tick occurrence records were the numerous sources of bioclimatic data correlates. We reported 60 different databases for environmental parameters used across modeling papers, however, bioclimatic variables from WorldClim were the most frequently modeled. Since 1984, WorldClim has existed as a database of interpolated global climate data derived from weather stations using thin-plate splines, now including up to 35 different measures of temperature and precipitation/moisture (Hijmans et al., 2005). Variables 1-19, drawing from the years 1970-2000, are the most frequently used in modern SDM models (Booth, 2018), and a new high-resolution (1-km 2 ) set of WorldClim monthly climate surfaces was released in 2017 (Fick and Hijmans, 2017). Many of these sources of environmental variables appear to be chosen due to ease of use and access, as well as due to the wide scale spatial coverage. For example, the WorldClim bioclimatic parameters are widely available via numerous spatial and targeted species distribution modeling R packages (e.g., SDM, krigR, envirem, etc.), and allow a user to apply them without having to upload and merge separate raster files (Naimi and Araújo, 2016;Title and Bemmels, 2018;Kusch and Davy, 2022). These data are also available in several resolutions and at a global scale, providing applicable climate resources regardless of the distribution locations chosen for model projections. Despite the common use and user-friendliness of these interpolated environmental variables to fit SDMs, it is important for modelers to recognize their limitations. Global bioclimatic datasets have not been validated on a smaller, local scale and could produce erroneous predictions in ecological niche modeling (Bedia et al., 2013). For this reason, it is important to also consider the use of satellite-derived remote sensing data for environmental parameters because they can provide more up-todate and more precise measures of local climate and landscape (Amiri et al., 2020). Many of the bioclimatic variables also exhibit multicollinearity and spatial autocorrelation problems, and tests to exclude these biases from the models are inconsistently performed across the SDM literature (Araújo and Guisan, 2006;Estrada-Peña et al., 2013b). We also tracked the types of species distribution model validation methods employed across the literature, with the understanding that there can be differences in how well validation applications can evaluate spatial models. The vast majority (98.9%) of papers used the receiver operating curve/area under the curve (ROC/AUC) metric to determine model fitness, and more than half of papers (60.6%) included two or more evaluation techniques. Nearly 10% of the body of literature reviewed did not report the validation method used to evaluate models, presenting challenges to the reproducibility of those modeling approaches. Similarly, the predominant use of WorldClim variables within this body of literature, there appears to be over-reliance on ROC/AUC to determine sensitivity and specificity of the model results (Lobo et al., 2008). Some researchers have cautioned that while ROC/AUC is helpful in that it is a threshold-independent measure and thereby more objective, it is a poor measure of model accuracy specifically in species distribution modeling. Lobo et al. (2008) report five issues with using ROC/AUC alone, including that ROC/AUC does not provide a spatial distribution of the model errors, and most critically, that the extent of the model can erroneously inflate the AUC score. Alternatives to basic ROC/AUC can include partial receiver operating curves (pROC), which avoids these criticisms and allows for differential weighting of omission and commission errors, as well as True Skill Statistic (TSS) (Lobo et al., 2008;Peterson et al., 2008). However, since many of the evaluation metrics can be biased by the proportions of occurrence presences to absences, strategies from other disciplines have also been suggested. Wunderlich et al. (2019) proposed the Odds Ratio Skill Score and the Symmetric Extremal Dependence Index (SEDI) to replace TSS in the evaluation of presence-background SDM methods, arguing that TSS can be biased depending on the number of true absences within a dataset. Overall, care must be taken in choosing evaluation methods that fit the type of species data and modeling algorithm. The importance of model evaluation and validation methodology should not be understated as species distribution models are increasingly being used in the context of future climate-related projections to inform public policy on ticks and TBDs and improve priorities associated with One Health (Semenza et al., 2012). Approximately twenty percent of the studies included in this review included climate projection, of which fifty-six percent were published within the last 5 years. This showcases not only the increased interest in using SDM as a tool to understand the current distribution of tick species but also future possibilities to better control and mitigate TBDs. However, it is crucial that before integrating climate projections within SDM, there needs to be improved understanding of climate models and the inherent uncertainty within (Harris et al., 2014). Estrada-Peña (2003a) was the first to predict habitat suitability 15 years into the future for four tick species in South Africa using Fourier series analysis using decadal abiotic data from 1983 to 2000. More recently, researchers are using newer tools such as climate and emissions scenarios as well as Global Circulation Models (GCMs). When developing a SDM for projecting future species distributions, there are several considerations, including not only spatial resolutions and type of baseline data but also which environmental or climate variables to use. In addition, to avoid irrelevant or correlated climate variables there is need to account for phylogenetic data as species traits determine their distribution (Soberón, 2007;Morales-Castilla et al., 2017). Using an integrated approach that combines climate and phylogeny can clarify whether the underlying mechanism for differences in species richness or distribution shifts across space is more closely related to time and diversification rates (Kozak and Wiens, 2012;Wang et al., 2019).  Olwoch et al., 2003;Estrada-Peña et al., 2005, 2006bWalter et al., 2016;Li et al., 2019;Clarke-Crespo et al., 2020 Boosted Regression Trees Akaike's Information Criterion Correlation coefficient Model deviance k-fold cross validation Kappa index of agreement Partial receiver operating characteristic (pROC) Receiver operating characteristic (ROC) Sensitivity and specificity True skill statistic Messina et al., 2015;Springer et al., 2015;Hahn et al., 2016;Sun et al., 2017;Kessler et al., 2019b;Miao et al., 2020;Glass et al., 2021;Lippi et al., 2021b;Zhao et al., 2021 Classification and regression trees Akaike's Information Criterion Receiver operating characteristic (ROC)  Cumming, 2000Cumming, , 2002Guerra et al., 2002;Brownstein et al., 2003;Estrada-Peña et al., 2004, 2006dBrown et al., 2011;Jore et al., 2014;De Clercq et al., 2015;Gabriele-Rivet et al., 2015;Springer et al., 2015;Hahn et al., 2016;Kessler et al., 2019a,b;Sungirai et al., 2018;Vajana et al., 2018;Clarke-Crespo et al., 2020;Glass et al., 2021;Lippi et al., 2021b;Namgyal et al., 2021 Genetic   Springer et al., 2015;Hahn et al., 2016;Kessler et al., 2019b;Clarke-Crespo et al., 2020;Glass et al., 2021 Negative binomial regression Correlation coefficient Ceballos et al., 2014 Occupancy  Springer et al., 2015;Hahn et al., 2016;Kessler et al., 2019b;Walter et al., 2020;Glass et al., 2021;Lippi et al., 2021b Spatial The potential options allow for multiple combinations and corresponding divergence in climate estimates as the patterns of temperature and precipitation differ between GCMs (Guevara et al., 2019). To compensate for these differences, those studies in our review that used GCMs tend to use multiple models when evaluating future distributions. For example, Minigan et al. (2018), used ten GCMs when projecting the future distribution of Dermacentor variabilis in North America to incorporate uncertainty of possible future distributions and be able to showcase potential minimum and maximum range shifts (Harris et al., 2014). For those studies that used only one or two GCMs, the spatial extent of the projected area was limited, such as Lieske and Lloyd (2018) predicting I. scapularis in the province of New Brunswick or de  predicting A. cajennense s.s. and A. sculptum in Brazil. Regardless of the number of GCMs incorporated within SDMs, it is important when drawing conclusions to consider that climate projections do not predict future climate, but instead provide possible futures under a given scenario (Rosentrater, 2010).
Emission and climate scenarios were included in most articles that included climate projections with less variation in which scenarios are included and compared. Most included a version of the Representative Concentration Pathways (RCPs) or the Intergovernmental Panel on Climate Change (IPCC) climate scenarios upon which the GCMs act to derive possible future climates. These climate and emission scenarios acted as possible futures with many including the recommended high and low emissions scenario to compare potential shifts in tick distributions under a changing climate. For example, at a simple level, Estrada-Peña (2003) found that increasing temperature by 2 • C would potentially result in reduction of tick habitat in South Africa for A. hebraeum, B. decoloratus, H. truncatum, and R. appendiculatus. For more complex models using IPCC climate such as A2 (equivalent of RCP 8.5), a high emissions scenario that describes rapidly rising temperatures, there tends to be a loss of habitat range for I. ricinus in the southern range of its present distribution range in Poland and Italy yet expansion in Norway and Sweden (Boeckmann and Joyner, 2014). In contrast, in the southern hemisphere, in Brazil, for the RCP 8.5 emissions scenario, there is retraction of A. cajennense and A. sculptum in northern Brazil and expansion of suitable habitat in southern Brazil . Alternative scenarios show similar expansion and retraction, however, the extent varies. For example, for low emission scenarios such as B1 (equivalent RCP 4.5) where there is a leveling of temperatures, there is increased loss of suitable habitat for Rhipicephalus Frontiers in Ecology and Evolution | www.frontiersin.org   Giles et al., 2014;Hadgu et al., 2019;Alkishe et al., 2020;Marques et al., 2020 sanguineus in South America as compared to the RCP 8.5 scenario . This suggests that moderate increases in temperature decreases habitat suitability without resulting in habitat changes elsewhere where tick species could find refuge. Overall, understanding how climate change could potentially influence tick distributions and range shifts will depend not only on the tick species, but also the region and climate model . We acknowledge several limitations to this review that also present opportunities for further research and investigation. Since we did not include gray literature or unpublished manuscripts, there are likely numerous applicable and appropriate analyses that are not represented in these data. The date cutoff for inclusion was August 2021, so there also are likely several relevant papers that have been released since that time that would have been included but are not. Within the assessment of validation methods, we did not include a question about training/testing dataset splitting, or in other words, how much of a dataset was used in prediction and forecasting. This is particularly important for ensuring reproducible projections of future climate-based tick habitat suitability. We also did not document whether researchers included location uncertainty thresholds to tick occurrence data. Some researchers recommend setting the limit of tick species observation position uncertainty to match that of the WorldClim data (i.e., 4,000 m) , and others  have set the limit to at a finer resolution (e.g., 1,000 m). Regardless, it is important for reproducibility of these analyses as well as accuracy of the model prediction for these thresholds to be set as well as reported in the literature. Researchers employing these modeling strategies for ticks or any species should consult checklists and guides on how to generate and report SDM studies in a reproducible manner (Araújo and Guisan, 2006;Austin and Van Niel, 2011;Feng et al., 2019).

CONCLUSION
Since the 1980s, species distribution modeling methods and data sources have greatly improved the ability to predict suitable tick habitat and likely distribution ranges. However, given the wide range of options that are used to estimate current and future tick habitat, it is critical to establish standardized methodologies for conducting, validating, and reporting species distribution model predictions. These parameters are necessary to create reproducible, comparable, and reliable guides to monitor tick vectors and their global risk to humans, domestic animals, and wildlife.

AUTHOR CONTRIBUTIONS
RS and HK: conception. HK and SH: data collection and analysis. HK, SH, and RS: writing. RS: funding. SH: supervision. All authors contributed to the article and approved the submitted version.

FUNDING
This research was supported in part by a grant from the Department of Defense #TB180052.