Direct Approach or Detour: A Comparative Model of Inhibition and Neural Ensemble Size in Behavior Selection

Tjøstheim, Trond A.; Johansson, Birger; Balkenius, Christian

doi:10.3389/fnsys.2021.752219

ORIGINAL RESEARCH article

Front. Syst. Neurosci., 09 November 2021

Volume 15 - 2021 | https://doi.org/10.3389/fnsys.2021.752219

This article is part of the Research TopicComparative Animal ConsciousnessView all 13 articles

Direct Approach or Detour: A Comparative Model of Inhibition and Neural Ensemble Size in Behavior Selection

Trond A. Tjøstheim^*

Birger Johansson

Christian Balkenius

Department of Philosophy, Lund University Cognitive Science, Lund, Sweden

Organisms must cope with different risk/reward landscapes in their ecological niche. Hence, species have evolved behavior and cognitive processes to optimally balance approach and avoidance. Navigation through space, including taking detours, appears also to be an essential element of consciousness. Such processes allow organisms to negotiate predation risk and natural geometry that obstruct foraging. One aspect of this is the ability to inhibit a direct approach toward a reward. Using an adaptation of the well-known detour paradigm in comparative psychology, but in a virtual world, we simulate how different neural configurations of inhibitive processes can yield behavior that approximates characteristics of different species. Results from simulations may help elucidate how evolutionary adaptation can shape inhibitive processing in particular and behavioral selection in general. More specifically, results indicate that both the level of inhibition that an organism can exert and the size of neural populations dedicated to inhibition contribute to successful detour navigation. According to our results, both factors help to facilitate detour behavior, but the latter (i.e., larger neural populations) appears to specifically reduce behavioral variation.

1. Introduction

Navigation through space, including taking detours, is an essential element of consciousness (Klein and Barron, 2016; Mallatt et al., 2021). Therefore, exploring the basic mechanisms of these behaviors contributes to the study of consciousness, even if the early steps in the evolution of animal navigation were algorithmic-like and lacking in subjective consciousness like in the model presented in this study. When an organism can no longer follow gradients but must use memory and map-like cognitive structures to cope with an environment, that organism comes closer to supporting a representation of space that is not centered on itself. That is, it supports allocentric representations in addition to self-centered, or egocentric representations. The former affords to see the self in relation to the environment, like being behind a tree or to the east of a river. The latter affords direct movement like going forward or turning to the right.

Natural environments may require a diverse number of behavioral strategies to yield optimal access to resources, while balancing safety and competition concerns. However, this variety can often be condensed into two major types mentioned above; allocentric map-based navigation or egocentric direct approach (Bottini and Doeller, 2020). The extent to which species are biased toward egocentric or allocentric navigation is typically dependent on ecological factors like food availability and the availability of sensory cues (Bruck et al., 2017). Much work has been done to compare species with regards to their ability to control the urge to directly approach salient targets like food, mates, or social groups, and be able to navigate around obstacles via detour paths (Kabadayi et al., 2018). In psychology and ethology, this kind of behavior is investigated using detour tasks. The idea of these experimental tasks is that an animal cannot directly approach a target, but must navigate or reach around a barrier first (As shown in Figure 1). In the case of navigation tasks, there is usually defined a barrier zone immediately in front of the barrier, and the time the animal spends in this zone can be used to operationalize an experimental measure of its inhibitory control, which is the ability to inhibit a futile direct approach and then take a detour.

FIGURE 1

Figure 1. Configuration of the detour task used in experiments, to scale. The barrier is semitransparent with vertical opaque stripes, and the agent is placed facing the apex of the barrier. The diagram shows the goal in red, the semitransparent barrier with transparent parts with dotted lines, the barrier zone in gray, the agent in pink, and the surrounding border.

Kabadayi et al. (2018) review how detour tasks are used in animal cognition. They enumerate the various configurations, measurements, and animal species that have so far been employed in this context. According to them, the behaviors of a wide variety of families of species have been measured, including apes (homo and hominoidae), monkeys (cercopithecoidae and platyrrhini), lemuriforms (strepsirrhini), canids (canidae), equids (equidae), birds (aves), reptiles (reptilia), amphibians (amphibia), fish (pisces), molluscs (mollusca), and spiders (salticidae). Detour tasks have also been used to elucidate the characteristics of several cognitive capacities that include inhibitory control, insight learning, memory, motor and cognitive development, functional generalization, and social learning.

As mentioned, Kabadayi et al. (2018) enumerate several configurations of the detour task. One of these is the V-shaped semitransparent configuration. This has been used to test social learning, problem solving, and inhibitory control in several canids such as dingos, dogs, and wolves, as well as mammals like mice, and goats, and reptilians like tortoises. For mice, the configuration is typically adapted to have a circular border and be filled with water, while the goal is a platform that allows subjects to escape from submersion. This is in contrast with e.g., canids, where the goal is a reward like food or social interaction. Subjects can either be placed inside the V barrier and having to move out of it (outward task), or outside it, having to move in (inward task). Refer to Figure 1 for an example of the inward task which is used in this study. The outward task is usually taken to be the more challenging one as it typically requires subjects to move in the opposite direction to the goal.

Focusing on inhibitory ability and behavioral control in the inward, semitransparent V configuration of the detour task, Marshall-Pescini et al. (2015) investigated how wolves (Canis lupus) and dogs (Canis lupus familiaris) differ in this configuration, seeking to test which species can exhibit better inhibitory control. They found that wolves showed shorter latency to reach the goal, and persevered for less time at the barrier. However, dogs had the upper hand in the so-called cylinder task where subjects are required to get at the reward by gaining access through the opening of a cylinder. It is notable that Bray et al. (2015) found that differences appear to exist between dogs with different levels of excitability, or temperament. Comparing calm and excitable dogs, their findings indicate that calm dogs improved their success rate and apparent inhibitory control with increasing arousal, while excitable dogs performed poorer. Juszczak and Miller (2016) employed the V-shaped detour task placed in shallow water to investigate detour behavior in mice. They measured time in the barrier zone in front of the barrier, for both transparent and semitransparent barriers. In their tests, the performance of the mice appeared to depend both on individual inhibitory skills and experience with the task. That is, they found that performance tended to improve over time, and the mice spent less time in the barrier zone as they gained experience.

The ability to change behavior and strategies for approach as presented above is referred to as behavioral flexibility (e.g., Coppens et al., 2010). As the animal studies explain, behavioral flexibility is thought to involve inhibitory activity to balance the influence both of learned behavior and approach motivation toward salient reward stimuli in the immediate environment. For humans, Uddin (2021) identified large-scale functional brain networks encompassing lateral and orbital frontoparietal, midcingulo-insular and frontostriatal regions that support flexibility across the lifespan.

Spiers and Gilbert (2015) propose a conceptual model in which the lateral prefrontal cortex (PFC) provides a prediction error signal about the change in the path, the frontopolar and superior PFC support the re-formulation of the route plan as a novel subgoal and the hippocampus (HC) simulates the new path. Similarly, the ventromedial (vm) PFC may mediate between the conflicting behavioral responses indicated by HC or caudate systems when active (Doeller et al., 2008). The caudate nucleus is involved in landmark-based, egocentric navigation, while HC is involved in boundary-based, allocentric navigation. According to Piray et al. (2016), the strength of the vmPFC projection to medial striatum including the caudate nucleus, biases toward model-centric choices. Model-centric strategies are typically associated with allocentric navigation (Doeller et al., 2008). These circuits for navigation present contingent behavioral sequences that can be activated. Which of them will be chosen at any given time is dependent on separate machinery, as explained below.

Neural competition is a cornerstone of many theories of brain function, particularly for processes involved in selection and decision making (Amari, 1977; Grossberg, 1978; Erlhagen and Schöner, 2002). Leaky competing integrator models incorporate aspects of both the psychological and neurophysiological models (Usher and McClelland, 2001, 2004; Johnson and Ratcliff, 2014). Relating to this, Smith (2015) shows that the precision of neural populations increases with the number of participating neural units. In the experiments presented in their study, they used units designed to behave according to an idealized Poisson process, having an exponentially decreasing probability of activity after a stimulus. In the context of visual short term memory, they showed in particular that the signal-to-noise ratio (i.e., the precision) increases proportionally to the square root of the neuronal population size. They also showed that normalization of inputs can be achieved by shunting inhibition, which in practice allows fractional scaling of inputs without losing temporal signatures of signals (Prescott and De Koninck, 2003). According to them, their population-size-dependent normalization model allows theoretical models of reaction time and decision accuracy to be reconciled with experimental data.

Earlier we focused on arousal levels in the context of the noradrenergic system (Balkenius et al., 2018), and found that neural gain in the form of noradrenergic activation may contribute to switching between explorative and exploitative behavioral strategies by e.g., varying the amount of noise present in the selection process. In this study, we concentrate on the effect of varying the size of neural populations, and how that affects precision and integration of sensory information. Additionally, we explore how inhibitive efficacy and precision individually and together can contribute to behavioral strategy selection. Finally, we compare our results with data from experiments on animal species, specifically mice, dogs, and wolves.

2. Method

In this section, we explain the rationale behind the model, its properties, and how in particular it is implemented.

2.1. Properties of the Model

To allow selection between the two strategies of egocentric direct approach and allocentric detour, we appropriated a hypothesized network proposed by Barker and Baier (2015). This was originally suggested as a model of approach and avoidance behavior in fish. But given appropriate input signals, it can be used as a winner-takes-all network to select between strategies for approach. In particular, we added one-way inhibition between barrier-collision signals to the neural units representing egocentric strategy. This modified network architecture (as shown in Figure 2 for a diagram) is informed by work on the spatial pathway from the parietal cortex to vmPFC (Kravitz et al., 2011) that includes boundary sensitive cells in the subiculum (Epstein et al., 2017), and projections from vmPFC to the subthalamic nucleus that can inhibit impulsive behavior (Eagle and Baunez, 2010). The variation of population size and inhibitive strength is likewise informed by Smith (2015) and Piray et al. (2016), respectively.

FIGURE 2

Figure 2. Diagram showing model of strategy selection. Green circular objects represent neural populations that receive signals from perceptual modules. Neural populations are simulated with different numbers of neural units, as described in the text. The red connection indicates inhibition, the level of which is varied between 0 and 1 in simulations. The “Barrier” population is excited by barriers or obstacles immediately in front of the agent, while the “Reward proximity” population is excited by the width of red-colored objects in the visual field.

The spiking model used for the neural elements is as defined by Izhikevich (2003):

\begin{array}{l} \frac{d v}{d t} = 0.04 v^{2} + 5 v + 140 - u + I & (1) \end{array}

\begin{array}{l} \frac{d u}{d t} = a (b v - u) & (2) \end{array}

\begin{array}{l} v = {\begin{cases} c, if v = 30 mV \\ v, otherwise \end{cases} & (3) \end{array}

\begin{array}{l} u = {\begin{cases} u + d, if v = 30 mV \\ u, otherwise \end{cases} & (4) \end{array}

In this study, Equations (1, 2) define pre-spike behavior, while Equations (3, 4) define the reset behavior after a spike. In Equation (1), I is for direct input current; v is the voltage potential of the unit, and u is a negative feedback variable to v accounting for positive ionic currents. Refer to Table 1 for parameter values for a, b, c, and d; these values are in accordance with “regular firing” units as defined in Izhikevich (2003).

TABLE 1

Table 1. Numerical values used for simulation.

The formula for the leaky integrator is given by:

\begin{array}{l} y_{t + 1} = e (x - (1 - l) y_{t}) & (5) \end{array}

where y is the value of the integrator, e is the growth or decay factor (as shown below), x is the input, and l is the leakage factor that affects accumulation. These are defined as follows:

\begin{array}{l} e = {\begin{cases} ω, if x < τ \\ ϵ, otherwise \end{cases} & (6) \end{array}

\begin{array}{l} l = {\begin{cases} 0, if x < τ \\ λ, otherwise \end{cases} & (7) \end{array}

Equations (6, 7) define the behavior of the integrator when the input is less than the decay threshold τ. At this point, the integrator begins leaking, or decaying in value, and the value of e changes from ϵ to ω. Refer to Table 1 for numerical values for these parameters.

2.2. Implementation

The neural simulation model was implemented using the Processing framework v.3.5.3 (Reas and Fry, 2007) with the pOSC library v.0.9.9, while the agent and environment were implemented in the Unity game engine v.3.5. Refer to Figure 1 for task configuration in Unity. The neural simulation and the agent world were connected using the Open Sound Control (OSC) protocol (Wright and Freed, 1997). In this way, the agent sends out sensory signals while the neural simulation processes these signals, and computes a motor response that is transmitted back and executed by the agent. This back-and-forth communication happens continuously and asynchronously. The set of signals is described in Table 2.

TABLE 2

Table 2. List of OSC messages used to communicate state of agent in simulated environment.

The simulation supports two-approach strategies; egocentric direct approach and allocentric approach using a map. The former is implemented by slicing a vector of pixels from the color channels of the cameras, then using pixels from the green and blue channels to remove anything but the purely red pixels in the vector from the red channel. The red pixels are counted, and their center point is calculated. Together, this yields a weighted homing signal that can be used for a direct approach such that the sensor information and the motor signals together form a feedback control circuit.

The allocentric map navigation is based on the classical wavefront algorithm (WFA) (Dijkstra, 1959). To facilitate the building of wavefront maps, the agent world sends bounding boxes of all necessary borders, obstacles, and goals, as well as the position of the agent itself. These bounding boxes are used to render a matrix of binary values, making up a map of the environment that can be used by the WFA. The WFA then calculates a gradient from the goal to the agent at every simulation step (to tell if it is getting closer), which gives the agent a direction to move in. This enables the agent to take detours around the obstacles.

As a source of bias for the allocentric strategy, we sliced a vector from the middle of the depth texture from the camera, and transformed it into a two-dimensional matrix. The four topmost rows of this matrix then represent obstacles at various distances from the agent. The rows were weighted and summed up, and the resulting sum was used as a direct input to the spiking population named “Barrier” in Figure 2. The spatial pixel density, thus, forms a kind of receptive field similar to those associated with boundary and obstacle cells in medial temporal areas (Epstein et al., 2017; Poulter et al., 2018). Similarly, the aforementioned sum of red pixels taken from the color camera was used as direct input to the parallel spiking population named “Reward proximity” in Figure 2. These populations were connected to populations representing either the allocentric strategy or the direct approach strategy, with the output of the obstacle bias also connected to the direct approach unit via an adjustable inhibitory weight. Again, refer to Figure 2 for a diagram of the network. The output of the two strategy units was connected to leaky integrator units to be able to transform the spiking trains to scalars suitable for identifying the index of the channel with the largest value (argmax selection). This index was then used to select the winning motor commands for transmission to the agent motor system.

During experiments, the level of inhibitory weight was controlled and set to progressively be from zero to one (refer to Table 3). The agent was given a starting point in view of the target (refer to Figure 1), then left to find its way. The maximum number of steps was set to 1,200, and the simulation was run at 10 Hz, giving a maximum time of 120 s. This makes it possible to compare times in seconds with animal experiments (120 s was also the maximum time limit used for dogs and wolves in Bray et al., 2015). A successful approach to the target was defined as the agent coming within a set radius (5 world units) of the center of the target. After reaching the goal, or the time limit being exceeded, the simulation was reset, parameters for the spiking units were slightly randomized (refer to Table 1), and the agent returned to its initial position. Fifteen trials like this were carried out for each inhibitory weight and neuron population size pair. The information gathered from each trial is given again in Table 3, and the data was then used to produce statistics.

TABLE 3

Table 3. Summary statistics for simulations with varying population size and inhibition level, listing summary statistics including mean with SD, median with interquartile range (IQR), as well as minimum and maximum values.

The statistics was done using Jupyter notebook software (Kluyver et al., 2016), the python programming language (Van Rossum and Drake, 1995), and the Pandas (McKinney, 2010), Seaborn (Waskom, 2021), numpy (Harris et al., 2020), and scipy (Virtanen et al., 2020) libraries.

To calculate the mean and SD of time in the barrier for the animals in Figure 4, we used published data from Marshall-Pescini et al. (2015) for dogs and wolves, and Juszczak and Miller (2016) for mice. Our model does not support learning, hence we calculated statistics only for the subset of data that was recorded at the first trial to minimize the effects of learning and experience. Where different barrier configurations were used, we chose only data from the inward-V configuration.

3. Results

In this section, we show results suggesting that increasing the population size of spiking neurons in the neural network generally reduces behavioral variability of the agent, while increasing the weight of inhibition tends to reduce waiting time in the barrier zone. Both of these factors work together to consistently favor the allocentric navigation strategy upon detection of a barrier.

Figure 3 shows barrier wait times for the simulated agents, grouped by inhibition level and the size of the involved neuronal populations. The general trend displayed by this figure is that time in the barrier reduces as the size of the neuronal population grows. Similarly, the variance as indicated by SD reduces. Within a group of the same population sizes, there is an analogous trend of barrier time reduction as inhibition level increases, going from a median of 12.4 (mean = 26.39, SD = 31.03) at zero inhibition and a single neuron per population, to a median of 4.5 (mean = 5.28, SD = 2.19) at inhibition level of one. At the other end of the scale, with 10 neurons per population, the median at zero inhibition is 7.2 (mean = 9.94, SD = 7.17), and 4.25 (mean = 3.96. SD = 1.12) at inhibition of one. It is also noticeable that between the extreme points, both barrier time and variation jump around somewhat for all population sizes except the maximum 10. In this study, the reduction in barrier time is monotonic (as shown in Table 3).

FIGURE 3

Figure 3. The plot of log10 median time with 95% confidence interval in barrier zone for different simulation configurations. Actual times are indicated by the pale blue and pink dots. (A) Simulated neuronal populations each consist of a single neuron. Zero inhibition level yields the highest variance and highest median time in the barrier zone, an while inhibition level of 1 gives the lowest median barrier time. (B) Neuronal populations consist of two neurons, (C) shows with five neurons, and (D) shows with 10 neurons per population. Barrier times and variation generally trend downwards with an increasing number of neurons. Note that median is used instead of mean in these graphs to better accommodate the asymmetric density of the recorded data.

Figure 4 shows a scatter plot of mean barrier time vs. SD (i.e., variability). Both animal and simulation data are shown, allowing the animal data to be related to the simulations. Qualitatively, mice spend the least time in the barrier zone and have the least variance, followed by wolves, and with dogs having both the longest time in the barrier, as well as the most variance. Dogs also differ most from the simulated data, spending longer time in the barrier.

FIGURE 4

Figure 4. Plot of mean vs. SD for time in barrier zone for different species and simulations. Both the simulation and the animal data appear to be approximately linear. For the three animals, dogs are at the top end of barrier time and variability, and mice at the other extreme. Reducing inhibition yields longer mean time in barrier, while reducing population size increases variability in the form of higher SD. Due to the somewhat stochastic nature of spiking networks, the simulation data naturally displays the noisiness of Figure 3.

4. Discussion

In this final section, we first look at possible explanations for the somewhat surprising position of mouse data on our comparative plot and identify stress as one plausible factor. After that, we turn to the role of inhibition in behavior selection, how the ability to make use of allocentric navigation strategies is an elemental part of consciousness, and how inhibition could be of different use to predator and prey species. We then move to some indications that neural population numbers might not automatically predict inhibitive capabilities and discuss how our results might inform findings from animal experiments.

Comparison of behavior between species requires careful controls to take into account differences in anatomy, body structure, and sensory adaptations. Larger bodies tend to require larger brains to control them, and hence direct comparisons of neural numbers are less useful than neural numbers relative to body volume or weight. Another difference between species that can confound comparisons is their dependence on chemical sensation or olfaction. Species for which olfaction is less important are termed microsmatic, while those that depend to a large degree on olfaction are termed macrosmatic (Santacà et al., 2019a,b). Mice, dogs, and wolves are, hence macrosmatic, while e.g., guppies are considered microsmatic (Santacà et al., 2019a).

One of the interesting inferences one might draw from our results is that mice appear to have more inhibitive powers and larger neuronal populations than do dogs and wolves. One could infer this because mice spend less time at the barrier and more time detouring, so in Figure 4 they group with the high-inhibition and large-population points. This inference, however, is unlikely to be the actual case. Instead, the reason why mice move out of the barrier zone quickly rather than staying like dogs and wolves could be due to the different experimental designs. Mice are averse to being immersed in water, which is a stressor, and they seek the relief of the above-water platform. This means that the mice engage in escape behavior, or avoidance from an aversive stimulus instead of an approach to a rewarding one, as do dogs and wolves. Furthermore, mice are typically aversive to moving into open spaces, which likely also contributes to them spending less time in the barrier zone (e.g., Bailey and Crawley, 2009). According to Schwabe et al. (2010), mice that were subjected to stress preferred an egocentric strategy more often than an allocentric one. Hence, it would be expected that once a goal is detected, they would engage in a direct approach to that goal and, thus, be likely to persevere at the barrier. But the submerged mice in the detour experiments used the allocentric strategy instead. This demands some further explanation: approach and avoidance activate different behavioral pathways in the brain (Namboodiri et al., 2016), where the avoidance pathways are typically less focused on one particular goal-site and instead result in a kind of “anywhere but here” escape behavior (Gray, 1982; Graeff, 1994). In such panic behavior, animals are even prone to crashing into obstacles in an effort to get away. Gray (1982) argues that the mammalian defense system is hierarchical, with the undirected escape system as the most basic one, and which is active at the most acute level of stress. At lower arousal levels with no stress or panic, the behavioral hierarchy allows goal-directed escape. Some support for this hypothesis might come from Juszczak and Stryjek (2019). They found that administering scopolamine to mice tended to increase perseverance behavior and time in the barrier zone. Given that scopolamine inhibits cholinergic activity by antagonistically binding to muscarinic receptors (Birdsall et al., 1978), and that the cholinergic system contributes to the level of arousal, e.g., in fight or flight behavior (Skinner et al., 2004), one interpretation is that the lowered arousal level induced by scopolamine reduces escape motivation enough that the water-stress configuration used for mice becomes more similar to the approach to reward configuration used for other species; i.e., allowing more decision time at the barrier and more time variance in making the decision to detour. Together these factors might explain the surprising position of mice in Figure 4.

Figure 4 shows an approximately linear relationship between mean barrier delay and its variance: more neurons correlate with more inhibition and less delay in successfully choosing to detour. This is in agreement with findings from the animal cognition literature that brain size and neuronal density tend to accompany success rate in tasks that require inhibition (Herculano-Houzel, 2017). Hence, biological neural population numbers can be compared at least relatively to simulated population sizes. This yields the prediction that unstressed mice should display more behavioral variability than dogs in an approach oriented version of the semitransparent V-shaped detour task (i.e., mice in a food-seeking version on dry ground).

Escape behaviors can be automatic, or stimulus-response processes in animals. Such processes are generally believed to be less reliant on consciousness than those necessary for making detours. Consciousness seems to depend on back-and-forth (recurrent) communication between neurons and on the resultant rhythmic synchronization and resonance (e.g., Engel and Singer, 2001; Meador et al., 2002; Engel and Fries, 2016). However, in our model, there are no recurrent connections, and neural populations are not synchronized with rhythmic inhibition. Additionally, as described above, the simulated populations have randomized parameters to explicitly increase activation variance. Hence, there is no direct correlation between neural population activity, and populations are not synchronized. Therefore, the model indicates that synchronizing populations is not necessary to achieve useful signal integration for behavioral strategy selection in navigation.

Behavioral selection without subjective consciousness also appears to be possible through subcortical pathways to the amygdala. These pathways are held to be evolutionarily older than cortical pathways and are found in both fish and reptiles, as well as mammals (McHaffie et al., 2005). For vision, one such pathway projects from the retina, via the brainstem superior colliculus and the thalamic pulvinar nucleus, to the amygdala. This pathway is generally assumed to be responsible for phenomena like blindsight, where people with cortical blindness can still guess the position of objects in their near environment. In particular, signals indicating dangerous stimuli, like the presence of snakes and spiders (and angry faces), are mediated via this pathway to the amygdala, which can then engage defensive behaviors. Furthermore, it appears that even routine, non-escape behavior like touching the position of a light signal may be supported by subcortical pathways, without requiring conscious perception. This is evidenced by studies on monkeys (Cowey and Stoerig, 1997).

How could we go from a simple, nonconscious allocentric navigation strategy (Figure 2) to one that uses consciousness? Merker (2007) argues that consciousness functionally can be understood via a “tripartite” division into (i) target selection (ii) action selection, and (iii) motivational ranking. Although these functions may operate on their own, they typically interact such that motivational ranking can influence target selection, which again can influence the selection of actions. Merker (2007) further argues that these functions need to operate in real time, and that they are integrated via a form of simulation. It is this simulation process that effectively constitutes conscious experience. Both target and action selection processes are related to spatial cognition and allow an animal to cope with spatially distributed resources, e.g., that shelter, food, and mates are not all found in the same place. As mentioned above, allocentric maps particularly support navigation to targets that are not directly approachable, or even in the direct vicinity. Hence, a system that allows an animal to be conscious of resource-place associations that are spread out potentially provides evolutionary benefits. Klein and Barron (2016) argue that insect brains may be capable of subjective consciousness since in the proposal of Merker (2007), this is mediated by evolutionary old, subcortical structures like the midbrain and the basal ganglia, and insects have structures that are functionally analogous to these. Similarly, the apparent lack of sufficient spatial perception or sensing in plants is used as a an argument by Mallatt et al. (2021) against plants having consciousness.

Carnivorous predator species and herbivorous prey species have adapted different usage for behavioral inhibition. Whereas, predators could benefit from inhibiting direct approach to prey to avoid detection (Hasson, 1991; Radford et al., 2020), a prey species may use inhibition to stop an approach to potential danger, as well as to “play dead” to reduce attack motivation in a predator (e.g., Gallup et al., 1971). In the case of predators, the perception of an eye pattern in the prey can indicate that the prey is turned in the direction of the predator; this can induce behavioral freeze and change the motivation from a direct approach to detour behavior. This would correspond to the perception of a barrier in our model, and the consequent switch to an allocentric navigation strategy. Similarly, the eyes of predators tend to be front-facing, which is useful for estimating distance (Detwiler, 1955). Prey species, on the other hand, often have side facing eyes since it facilitates surveying larger surrounding areas and hence the detection of potential predators. Although predator and prey species may use inhibition differently to adaptively control behavior, what exactly mediates inhibitory capability in different species is still not completely understood. We turn to this issue next. We have argued above that larger populations of neurons can confer increased precision, but that inhibitive efficacy is not fully dependent on population size. Kabadayi et al. (2017) explored the hypothesis that neuronal population size in the avian pallium might predict success rates on the cylinder task. Given that ravens are very adept at this task, and ravens have a densely populated pallium, they sought to investigate whether other birds with similarly high neural densities perform equally well. Parrots are birds that, like ravens, have comparatively dense palliums. Using parrots as subjects, they did not find evidence for a positive relationship between population size and success on the cylinder task. The parrots performed much poorer than did ravens. The authors interpret these results in two ways. Either that inhibition might not be correlated with pallial neuron count, or that the cylinder task does not measure motor inhibition. Our results lend support to the former of these interpretations (neuron number does not matter in this study) but with a slight twist, namely that there may be differences in inhibitive populations that are independent of total population size but that affect inhibitive efficacy.

Moving from birds to arthropods, Long (2021) compared brain sizes of different spider species and classified the spiders into four groups, where the first group had the smallest brain and the fourth group the largest. Interestingly, a species belonging to the first group, the spitting spider Scytodes pallidus, is hunted by a species of the fourth group, the jumping spider Portia labiata. Notably, P. labiata sometimes changes its hunting strategy depending on whether its prey is a male or female, and whether the female is carrying eggs (Jackson et al., 2002). An egg-carrying female is apparently less dangerous since it must drop its egg to spit. In this case, P. labiata makes use of faster, direct-approach strategies. But when hunting a female without eggs, P. labiata instead takes longer detours, to attack from behind. This more complex behavior might only be possible due to the larger and more complex brain of Portia.

In summary, we have presented a model of navigational strategy selection that shows how a direct approach vs. detour might be influenced by the interplay of both neuronal population size and inhibitive efficacy. The former appears to confer precision that improves signal integration, while the latter facilitates the suppression of direct approach strategies and the usage of allocentric navigation around obstacles. Together both processes contribute to behavioral flexibility in navigating complex environments. Comparing the results presented in this study with data from animal experiences may elucidate differences in inhibitive capabilities in various species.

The work presented in this study opens up several new avenues of exploration and complements earlier simulation work we have presented on awareness (Balkenius et al., 2018) and memory (Balkenius et al., 2020). Combining the present study with the former might further elucidate processes of arousal and how they might affect navigation and behavioral selection in the context of making detours. The latter work on episodic memory and decision making offer exciting opportunities for exploring path-learning and how an agent might react when such paths are changed. In the animal cognition literature, the mechanism by which animals are able to take advantage of shortcuts is an example of this that is of particular interest.

Data Availability Statement

The code for the simulations are publicly available. This data can be found here: https://github.com/trondarild/Tjostheim_et_al_direct_approach_inhibition.

Author Contributions

TT conducted simulations and wrote the manuscript in collaboration with and under the supervision of BJ and CB. All authors contributed to the article and approved the submitted version.

Funding

This study was partially supported by the Wallenberg AI, Autonomous Systems and Software Program–Humanities and Society (WASP-HS) and funded by the Marianne and Marcus Wallenberg Foundation and the Marcus and Amalia Wallenberg Foundation.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We thank Can Kabadayi for valuable input in planning experiments and writing, as well as the editor and two reviewers for their significant contribution in improving the structure and text of this article.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnsys.2021.752219/full#supplementary-material

References

Amari, S.-I. (1977). Dynamics of pattern formation in lateral-inhibition type neural fields. Biol. Cybern. 27, 77–87. doi: 10.1007/BF00337259

PubMed Abstract | CrossRef Full Text | Google Scholar

Bailey, K. R., and Crawley, J. N. (2009). “Anxiety-related behaviors in mice,” in Methods of Behavior Analysis in Neuroscience, 2nd Edn, ed J. J. Buccafusco (Boca Raton, FL: CRC Press).

Google Scholar

Balkenius, C., Tjøstheim, T. A., and Johansson, B. (2018). “Arousal and awareness in a humanoid robot,” in CEUR Workshop Proceedings, Vol. 2287 (CEUR Workshop Proceedings) (Stanford, CA).

Google Scholar

Balkenius, C., Tjøstheim, T. A., Johansson, B., Wallin, A., and Gärdenfors, P. (2020). The missing link between memory and reinforcement learning. Front. Psychol. 11:3446. doi: 10.3389/fpsyg.2020.560080

PubMed Abstract | CrossRef Full Text | Google Scholar

Barker, A. J., and Baier, H. (2015). Sensorimotor decision making in the zebrafish tectum. Curr. Biol. 25, 2804–2814. doi: 10.1016/j.cub.2015.09.055

PubMed Abstract | CrossRef Full Text | Google Scholar

Birdsall, N., Burgen, A., and Hulme, E. (1978). The binding of agonists to brain muscarinic receptors. Mol. Pharmacol. 14, 723–736.

PubMed Abstract | Google Scholar

Bottini, R., and Doeller, C. F. (2020). Knowledge across reference frames: cognitive maps and image spaces. Trends Cogn. Sci. 24, 606–619. doi: 10.1016/j.tics.2020.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Bray, E. E., MacLean, E. L., and Hare, B. A. (2015). Increasing arousal enhances inhibitory control in calm but not excitable dogs. Anim. Cogn. 18, 1317–1329. doi: 10.1007/s10071-015-0901-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Bruck, J. N., Allen, N. A., Brass, K. E., Horn, B. A., and Campbell, P. (2017). Species differences in egocentric navigation: the effect of burrowing ecology on a spatial cognitive trait in mice. Anim. Behav. 127, 67–73. doi: 10.1016/j.anbehav.2017.02.023

CrossRef Full Text | Google Scholar

Coppens, C. M., de Boer, S. F., and Koolhaas, J. M. (2010). Coping styles and behavioural flexibility: towards underlying mechanisms. Philos. Trans. R. Soc. B Biol. Sci. 365, 4021–4028. doi: 10.1098/rstb.2010.0217

PubMed Abstract | CrossRef Full Text | Google Scholar

Cowey, A., and Stoerig, P. (1997). Visual detection in monkeys with blindsight. Neuropsychologia 35, 929–939. doi: 10.1016/S0028-3932(97)00021-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Detwiler, S. R. (1955). The eye and its structural adaptations. Proc. Am. Philos. Soc. 99, 224–238.

Google Scholar

Dijkstra, E. W. (1959). A note on two problems in connexion with graphs. Numer. Math. 1, 269–271. doi: 10.1007/BF01386390

CrossRef Full Text | Google Scholar

Doeller, C. F., King, J. A., and Burgess, N. (2008). Parallel striatal and hippocampal systems for landmarks and boundaries in spatial memory. Proc. Natl. Acad. Sci. U.S.A. 105, 5915–5920. doi: 10.1073/pnas.0801489105

PubMed Abstract | CrossRef Full Text | Google Scholar

Eagle, D. M., and Baunez, C. (2010). Is there an inhibitory-response-control system in the rat? evidence from anatomical and pharmacological studies of behavioral inhibition. Neurosci. Biobehav. Rev. 34, 50–72. doi: 10.1016/j.neubiorev.2009.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Engel, A. K., and Fries, P. (2016). “Neuronal oscillations, coherence, and consciousness,” in The Neurology of Consciousness, eds S. Laureys, O. Gosseries, and G. Tononi (New York, NY: Elsevier), 49–60. doi: 10.1016/B978-0-12-800948-2.00003-0

CrossRef Full Text | Google Scholar

Engel, A. K., and Singer, W. (2001). Temporal binding and the neural correlates of sensory awareness. Trends Cogn. Sci. 5, 16–25. doi: 10.1016/S1364-6613(00)01568-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Epstein, R. A., Patai, E. Z., Julian, J. B., and Spiers, H. J. (2017). The cognitive map in humans: spatial navigation and beyond. Nat. Neurosci. 20, 1504–1513. doi: 10.1038/nn.4656

PubMed Abstract | CrossRef Full Text | Google Scholar

Erlhagen, W., and Schöner, G. (2002). Dynamic field theory of movement preparation. Psychol. Rev. 109, 545. doi: 10.1037/0033-295X.109.3.545

PubMed Abstract | CrossRef Full Text | Google Scholar

Gallup, G. G., Nash, R. F., and Ellison, A. L. (1971). Tonic immobility as a reaction to predation: artificial eyes as a fear stimulus for chickens. Psychon. Sci. 23, 79–80. doi: 10.3758/BF03336016

CrossRef Full Text | Google Scholar

Graeff, F. G. (1994). Neuroanatomy and neurotransmitter regulation of defensive behaviors and related emotions in mammals. Braz. J. Med. Biol. Res. 27, 811–829.

PubMed Abstract | Google Scholar

Gray, J. A. (1982). Précis of the neuropsychology of anxiety: an enquiry into the functions of the septo-hippocampal system. Behav. Brain Sci. 5, 469–484. doi: 10.1017/S0140525X00013066

CrossRef Full Text | Google Scholar

Grossberg, S. (1978). Competition, decision, and consensus. J. Math. Anal. Appl. 66, 470–493. doi: 10.1016/0022-247X(78)90249-4

CrossRef Full Text | Google Scholar

Harris, C. R., Millman, K. J., van der Walt, S. J., Gommers, R., Virtanen, P., Cournapeau, D., et al. (2020). Array programming with NumPy. Nature 585, 357–362. doi: 10.1038/s41586-020-2649-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Hasson, O. (1991). Pursuit-deterrent signals: communication between prey and predator. Trends Ecol. Evolut. 6, 325–329. doi: 10.1016/0169-5347(91)90040-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Herculano-Houzel, S. (2017). Numbers of neurons as biological correlates of cognitive capability. Curr. Opin. Behav. Sci. 16, 1–7. doi: 10.1016/j.cobeha.2017.02.004

CrossRef Full Text | Google Scholar

Izhikevich, E. M. (2003). Simple model of spiking neurons. IEEE Trans. Neural Netw. 14, 1569–1572. doi: 10.1109/TNN.2003.820440

PubMed Abstract | CrossRef Full Text | Google Scholar

Jackson, R. R., Pollard, S. D., Li, D., and Fijn, N. (2002). Interpopulation variation in the risk-related decisions of portia labiata, an araneophagic jumping spider (araneae, salticidae), during predatory sequences with spitting spiders. Anim. Cogn. 5, 215–223. doi: 10.1007/s10071-002-0150-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, E. J., and Ratcliff, R. (2014). “Computational and process models of decision-making in psychology and behavioral economics,” in Neuroeconomics: Decision Making and the Brain, 2nd Edn, eds P. W. Glimcher and E. Fehr (New York, NY: Academic Press).

Google Scholar

Juszczak, G. R., and Miller, M. (2016). Detour behavior of mice trained with transparent, semitransparent and opaque barriers. PLoS ONE 11:e0162018. doi: 10.1371/journal.pone.0162018

PubMed Abstract | CrossRef Full Text | Google Scholar

Juszczak, G. R., and Stryjek, R. (2019). Scopolamine increases perseveration in mice subjected to the detour test. Behav. Brain Res. 356, 71–77. doi: 10.1016/j.bbr.2018.07.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Kabadayi, C., Bobrowicz, K., and Osvath, M. (2018). The detour paradigm in animal cognition. Anim. Cogn. 21, 21–35. doi: 10.1007/s10071-017-1152-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Kabadayi, C., Krasheninnikova, A., O'neill, L., van de Weijer, J., Osvath, M., and von Bayern, A. M. (2017). Are parrots poor at motor self-regulation or is the cylinder task poor at measuring it? Anim. Cogn. 20, 1137–1146. doi: 10.1007/s10071-017-1131-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Klein, C., and Barron, A. B. (2016). Insects have the capacity for subjective experience. Anim. Sent. 1, 1. doi: 10.51291/2377-7478.1113

CrossRef Full Text | Google Scholar

Kluyver, T., Ragan-Kelley, B., Pérez, F., Granger, B., Bussonnier, M., Frederic, J., et al. (2016). “Jupyter Notebooks-a publishing format for reproducible computational workflows,” in Positioning and Power in Academic Publishing: Players, Agents and Agendas: Proceedings of the 20th International Conference on Electronic Publishing (Amsterdam: IOS Press), 87.

Google Scholar

Kravitz, D. J., Saleem, K. S., Baker, C. I., and Mishkin, M. (2011). A new neural framework for visuospatial processing. Nat. Rev. Neurosci. 12, 217–230. doi: 10.1038/nrn3008

PubMed Abstract | CrossRef Full Text | Google Scholar

Long, S. M. (2021). Variations on a theme: morphological variation in the secondary eye visual pathway across the order of araneae. J. Compar. Neurol. 529, 259–280. doi: 10.1002/cne.24945

PubMed Abstract | CrossRef Full Text | Google Scholar

Mallatt, J., Blatt, M. R., Draguhn, A., Robinson, D. G., and Taiz, L. (2021). Debunking a myth: plant consciousness. Protoplasma 258, 459–476. doi: 10.1007/s00709-020-01579-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Marshall-Pescini, S., Virányi, Z., and Range, F. (2015). The effect of domestication on inhibitory control: wolves and dogs compared. PLoS ONE 10:e0118469. doi: 10.1371/journal.pone.0118469

PubMed Abstract | CrossRef Full Text | Google Scholar

McHaffie, J. G., Stanford, T. R., Stein, B. E., Coizet, V., and Redgrave, P. (2005). Subcortical loops through the basal ganglia. Trends Neurosci. 28, 401–407. doi: 10.1016/j.tins.2005.06.006

PubMed Abstract | CrossRef Full Text | Google Scholar

McKinney, W. (2010). “Data structures for statistical computing in python,” in Proceedings of the 9th Python in Science Conference, Vol. 445, eds S. van der Walt and J. Millman (Austin, TX: SciPy), 51–56.

Google Scholar

Meador, K. J., Ray, P. G., Echauz, J. R., Loring, D. W., and Vachtsevanos, G. J. (2002). Gamma coherence and conscious perception. Neurology 59, 847–854. doi: 10.1212/WNL.59.6.847

PubMed Abstract | CrossRef Full Text | Google Scholar

Merker, B. (2007). Consciousness without a cerebral cortex: a challenge for neuroscience and medicine. Behav. Brain Sci. 30, 63–81. doi: 10.1017/S0140525X07000891

PubMed Abstract | CrossRef Full Text | Google Scholar

Namboodiri, V. M. K., Rodriguez-Romaguera, J., and Stuber, G. D. (2016). The habenula. Curr. Biol. 26, R873–R877. doi: 10.1016/j.cub.2016.08.051

PubMed Abstract | CrossRef Full Text | Google Scholar

Piray, P., Toni, I., and Cools, R. (2016). Human choice strategy varies with anatomical projections from ventromedial prefrontal cortex to medial striatum. J. Neurosci. 36, 2857–2867. doi: 10.1523/JNEUROSCI.2033-15.2016

PubMed Abstract | CrossRef Full Text | Google Scholar

Poulter, S., Hartley, T., and Lever, C. (2018). The neurobiology of mammalian navigation. Curr. Biol. 28, R1023–R1042. doi: 10.1016/j.cub.2018.05.050

PubMed Abstract | CrossRef Full Text | Google Scholar

Prescott, S. A., and De Koninck, Y. (2003). Gain control of firing rate by shunting inhibition: roles of synaptic noise and dendritic saturation. Proc. Natl. Acad. Sci. U.S.A. 100, 2076–2081. doi: 10.1073/pnas.0337591100

PubMed Abstract | CrossRef Full Text | Google Scholar

Radford, C., McNutt, J. W., Rogers, T., Maslen, B., and Jordan, N. (2020). Artificial eyespots on cattle reduce predation by large carnivores. Commun. Biol. 3, 1–8. doi: 10.1038/s42003-020-01156-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Reas, C., and Fry, B. (2007). Processing: a Programming Handbook for Visual Designers and Artists. Mit Press.

Google Scholar

Santacà, M., Busatta, M., Lucon-Xiccato, T., and Bisazza, A. (2019a). Sensory differences mediate species variation in detour task performance. Anim. Behav. 155:153–162. doi: 10.1016/j.anbehav.2019.05.022

CrossRef Full Text | Google Scholar

Santacà, M., Busatta, M., Savaşçı, B. B., Lucon-Xiccato, T., and Bisazza, A. (2019b). The effect of experience and olfactory cue in an inhibitory control task in guppies, poecilia reticulata. Anim. Behav. 151, 1–7. doi: 10.1016/j.anbehav.2019.03.003

CrossRef Full Text | Google Scholar

Schwabe, L., Schächinger, H., de Kloet, E. R., and Oitzl, M. S. (2010). Corticosteroids operate as a switch between memory systems. J. Cogn. Neurosci. 22, 1362–1372. doi: 10.1162/jocn.2009.21278

PubMed Abstract | CrossRef Full Text | Google Scholar

Skinner, R. D., Homma, Y., and Garcia-Rill, E. (2004). Arousal mechanisms related to posture and locomotion: 2. ascending modulation. Progr. Brain Res. 143, 291–298. doi: 10.1016/S0079-6123(03)43028-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, P. L. (2015). The poisson shot noise model of visual short-term memory and choice response time: normalized coding by neural population size. J. Math. Psychol. 66, 41–52. doi: 10.1016/j.jmp.2015.03.007

CrossRef Full Text | Google Scholar

Spiers, H. J., and Gilbert, S. J. (2015). Solving the detour problem in navigation: a model of prefrontal and hippocampal interactions. Front. Hum. Neurosci. 9:125. doi: 10.3389/fnhum.2015.00125

PubMed Abstract | CrossRef Full Text | Google Scholar

Uddin, L. Q. (2021). Cognitive and behavioural flexibility: neural mechanisms and clinical considerations. Nat. Rev. Neurosci. 22, 167–179. doi: 10.1038/s41583-021-00428-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Usher, M., and McClelland, J. L. (2001). The time course of perceptual choice: the leaky, competing accumulator model. Psychol. Rev. 108, 550. doi: 10.1037/0033-295X.108.3.550

PubMed Abstract | CrossRef Full Text | Google Scholar

Usher, M., and McClelland, J. L. (2004). Loss aversion and inhibition in dynamical models of multialternative choice. Psychol. Rev. 111, 757. doi: 10.1037/0033-295X.111.3.757

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Rossum, G., and Drake, F. L. Jr. (1995). Python Tutorial (Vol. 620). Amsterdam: Centrum voor Wiskunde en Informatica.

Google Scholar

Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., et al. (2020). SciPy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods 17, 261–272. doi: 10.1038/s41592-020-0772-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Waskom, M. L. (2021). seaborn: statistical data visualization. J. Open Source Softw. 6, 3021. doi: 10.21105/joss.03021

CrossRef Full Text | Google Scholar

Wright, M., and Freed, A. (1997). “Open Sound Control: A New Protocol for Communicating with Sound Synthesizers,” in Proceedings of the 1997 International Computer Music Conference (San Francisco, CA: International Computer Music Association), 101–104.

Google Scholar

Keywords: detour task, egocentric navigation, allocentric navigation, navigational strategy selection, consciousness, inhibition

Citation: Tjøstheim TA, Johansson B and Balkenius C (2021) Direct Approach or Detour: A Comparative Model of Inhibition and Neural Ensemble Size in Behavior Selection. Front. Syst. Neurosci. 15:752219. doi: 10.3389/fnsys.2021.752219

Received: 02 August 2021; Accepted: 29 September 2021;
Published: 09 November 2021.

Edited by:

Jon Mallatt, Washington State University, United States

Reviewed by:

Grzegorz Juszczak, Institute of Genetics and Animal Biotechnology, Polish Academy of Sciences (PAN), Poland
Maria Santacà, University of Padova, Italy

Copyright © 2021 Tjøstheim, Johansson and Balkenius. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Trond A. Tjøstheim, dHJvbmRfYXJpbGQudGpvc3RoZWltQGx1Y3MubHUuc2U=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.