HYPOTHESIS AND THEORY article

Front. Neuroanat., 17 August 2021

Volume 15 - 2021 | https://doi.org/10.3389/fnana.2021.693346

Five Breakthroughs: A First Approximation of Brain Evolution From Early Bilaterians to Humans

  • Independent Researcher, New York, NY, United States

Article metrics

View details

12

Citations

20,3k

Views

3k

Downloads

Abstract

Retracing the evolutionary steps by which human brains evolved can offer insights into the underlying mechanisms of human brain function as well as the phylogenetic origin of various features of human behavior. To this end, this article presents a model for interpreting the physical and behavioral modifications throughout major milestones in human brain evolution. This model introduces the concept of a “breakthrough” as a useful tool for interpreting suites of brain modifications and the various adaptive behaviors these modifications enabled. This offers a unique view into the ordered steps by which human brains evolved and suggests several unique hypotheses on the mechanisms of human brain function.

Introduction

Humans have an incredibly diverse suite of intellectual faculties as well as incredibly complicated brains. But all these varied faculties and brain structures are likely to have evolved from simpler prototypes in the simpler brains of our ancestors. This general idea of progressive complexification of behavior and brains from simpler roots has been elegantly articulated in Paul Cisek’s theory of “phylogenetic refinement,” whereby behaviors and brain structures are interpreted as the consequence of evolutionary refinement from more basic building blocks (Cisek, 2019). An essential aspect of this research paradigm is chronicling the specific brain modifications that occurred in the human lineage, and what specific behavioral modifications they enabled.

Much work has been done to chronicle the specific brain modifications that occurred along major milestones in the human lineage from early bilaterians to extant humans (Kaas, 2009; Striedter and Northcutt, 2020). Work has also been done to reconstruct the adaptive behavioralabilities that emerged along major milestones in the human lineage from early bilaterians to extant humans (Murray et al., 2017; Cisek, 2019; Ginsburg and Jablonka, 2019; Bennett, 2021). A challenge of this research paradigm is how numerous the brain and behavioral modifications have been throughout evolution. The aim of this article is to introduce the concept of a “breakthrough” and use this concept to offer a first approximation explanation of a multitude of both brain and behavioral modifications that occurred during major evolutionary milestones. The general structure of the argument herein is that at each major milestone in human brain evolution, many neural modifications can be explained as having enabled a new single breakthrough, which was thereby applied in many adaptive ways.

An Analogy to Technological Innovation: The Useful Concept of a “Breakthrough”

To elaborate on this idea, I will draw on an analogy to technological innovation. Consider the modifications and applications of a technological breakthrough such as the transition from gas-powered to electrically-powered households during the late 19th century. The physicalmodifications within a household were many: cables were laid, switches were added, circuit boards were installed. But all these new physical modifications can be reasonably understood through the lens of a single new breakthroughcapability that they enabled—namely the capability of using electricity for energy. And this breakthrough capability was thereby applied in many adaptive ways: lighting a home at night for reading, heating a house during the cold without fire, speaking to faraway family members (using a telephone). The value of the physical modifications is defined only in the context of these adaptive applications.

Now imagine you were an alien observer trying to “explain” the observed physical changes to the 19th century home, as well as the observed new adaptive abilities of 19th century homeowners. Without a notion of the underlying breakthrough (electricity), and instead only with a model of the adaptive abilities and the physical modifications themselves, explaining the transformation of the 19th century home would be more perplexing. If you tried to explain individual physical modifications through the adaptive value they provided, it would be unclear: what was the specific adaptive value of a single light switch or a single wire? Conversely, if you tried to find the “substrates” underlying a given new “ability,” such as reading at night, the answer would also be unclear: entire suites of physical modifications worked together to enable these abilities. There was no single substrate. Further, many of the substrates of seemingly completely different newfound abilities are highly overlapping: reading at night, heating your home during the cold, and communicating at a distance all used overlapping physical features within a home (all used common cords, switches, and circuit breakers). In contrast, armed with the concept of a breakthrough, both the new physical modifications and the seemingly different new abilities have a much more interpretable first approximation: the varied physical modifications together served the single purpose of using electricity for energy, and this single breakthrough was applied in many different ways, including reading at night and communicating at a distance.

An additional benefit of the concept of a breakthrough is that it incorporates prior constraints into subsequent physical modifications and newfound abilities. Consider a technological breakthrough that occurred after that of electricity: the television (TV). Can we understand the modifications and breakthroughs of TV without first understanding that of electricity? TV was only possible because electricity came first and the ways in which electricity worked imposed constraints on how TVs would have to work. For example, TVs had to work on the relatively low voltage available within the home at the time. As such, we can only “explain” the specific modifications of TVs when we understand the constraints of the breakthrough that came before.

In sum, there are two useful benefits of the concept of a breakthrough. The first is that it enables a useful first approximation of both physical modifications and the adaptive value they provide. The second is it provides a simple interpretation of ordered constraints, whereby prior breakthroughs imposed important constraints on future breakthroughs. The analogy is far from perfect. For example, numerous technological modifications to a home can be planned in advance, whereas evolution can only work via tinkering over each evolutionary iteration. Regardless, this analogy is illustrative—it offers an instructive lens for the concept of a breakthrough and the useful benefits it offers in explaining transformations (see Figure 1). This article is an attempt to apply this concept of ordered breakthroughs to brain evolution in order to offer explanations of the major brain and behavioral modifications throughout the evolutionary lineage from bilaterians to extant humans.

Figure 1

The Approach

In the context of brain evolution, I will define three key terms:

  • Physical Modifications: the actual physical changes in underlying neural structures.

  • Breakthrough: a new capability that these numerous physical changes enabled, and which had numerous adaptive behavioral applications.

  • Adaptive Applications: a new behavioral ability that offered survival and/or reproductive benefit to an animal and is one of many applications of an underlying breakthrough capability.

Below I will chronicle what I propose are the five major breakthroughs that occurred from the first bilaterians to the first human brains. There are undeniably many more modifications than will be described below, however, I argue that a remarkably broad set of brain functions and behaviors across taxa throughout the human lineage can be understood through the lens of only these five major modifications. I will connect these major modifications to likely behavioral abilities that emerged throughout our evolutionary timelines. This simplified model of brain evolution provides a useful “first approximation” of how and why brains evolved.

I intentionally call this a “model” and not a “theory.” I make the following distinction between the two: a model is a useful approximation of a set of phenomena, whereas a theory is a comprehensive explanation of a set of phenomena (Wunsch, 1994). Through this lens then, this article does not present a theory, as it is undeniably a simplification of the process of brain evolution. Instead, what it presents is a model—a useful first approximation that can be used to interpret and explain a multitude of observations.

Further, the scope of this article is intentionally anthropocentric—it seeks to chronicle the phylogenetic history of behavioral abilities in the human lineage between early bilaterians and extant humans. This requires an essential caveat to the hypotheses presented herein. Proposing a hypothesis regarding the emergence of abilities along the evolutionary lineage from early bilaterians to humans is not the same thing as proposing a hypothesis regarding a unique ability of humans relative to other extant animals alive today. For example, the hypothesis that episodic memory emerged in early mammals is not the same as a hypothesis that only mammals exhibit episodic memory. Convergent evolution is not the exception, but the rule.

I will start by presenting the model and then provide evidence for the model.

The Model

Breakthrough #1: Steering in Early Bilaterians

The hypothesis here is that the major neural modifications that emerged in early bilaterians facilitated the breakthrough of “steering,” which was thereby applied in multiple adaptive ways, such as in local area restricted search, avoiding predation, and maintaining homeostasis.

By “steering” I refer to the capability of categorizing external stimuli into two simple groups-positive valence (for approach) and negative valence (for escape). Agents can then turn towards directions where “positive valence” stimuli increase in potency (or “negative valence” stimuli decrease) and turn away from directions where “positive valence” stimuli decrease in potency (or “negative valence” stimuli increase). It is a breakthrough in the sense that an agent can navigate a complex environment remarkably well with only these simple categorizations and turning decisions. This navigational strategy is often called “taxis navigation,” whereby organisms move towards or away from specific stimuli and thereby climb up or down sensory gradients. Taxis navigation is present in single-celled organisms and almost definitely existed far before the first bilaterians. However, this model proposes that it was only in the first bilaterian animals where taxis navigation was implemented with neurons and muscles, and this unique implementation offered several additional benefits.

This model proposes that there were four physical modifications in early bilaterians: a bilateral body plan, valence sensory neurons that connected to global neural integration centers (the first “brain”), neurobiological mechanisms of associative plasticity, and neuromodulatory systems that generated persistent behavioral states. An interpretation of how these neural structures together implemented “steering” is as follows. A bilateral body plan reduced navigational decision making to simply forward or backward, and left or right (Holló and Novák, 2012). The sensory neurons of early bilaterians were evolutionarily hardwired to categorize specific external stimuli into those for “approaching” (forward) and others for “avoiding” (turn). These sensory neurons were directly sensitive to internal states. A chemosensory neuron in early Bilateria that was responsive to food cues would then also be directly modulated by hunger signals. In this way, the sensory apparatus of early bilaterians would have directly computed valence. Further, the global neural integration centers of early bilaterians would have been used to integrate competing input across valence neurons, which would have enabled the selection of a single cross-model decision in the presence of tradeoffs. For example, if an early bilaterian animal detected both the increase in a food cue (positive valence) and an increase in heat (negative valence), these different valence neurons would project to common interneurons where they could be integrated into a single decision of forward locomotion or turning depending on the relative strength of each valence response.

The mechanisms of associative plasticity, both postsynaptic and presynaptic, enabled early bilaterians to change the weights and hence valence of various stimuli. For example, experiencing pain in the presence of light could make light more aversive in the future. This enabled smarter steering decisions over time.

This model proposes that it was in these early bilaterians in whom neuromodulators such as dopamine and norepinephrine (and octopamine) were first used for valence-related signaling. Dopamine was released in the presence of food cues and persisted in extra synaptic space even after food cues disappeared. This enabled an animal to perform local area restricted search even in the absence of any specific food cues. Octopamine and norepinephrine did the same for negative valence stimuli, driving an animal to continue its relocation locomotion even after negative stimuli faded.

Together, these physical modifications enabled the breakthrough of “steering,” which enabled early bilaterians to optimally explore and exploit food patches while maintaining homeostasis. This implementation of taxis-navigation within neurons and muscles may have enabled comparably larger organisms to use taxis navigation than the prior taxis mechanisms of cellular cilia. Further, such a neuron implementation may have enabled more accurate sensitivity to internal states and cross-modal integration. This breakthrough was only possible because bilaterian brains were built on the foundation of neurons and muscles that evolved prior in eumetazoans.

Breakthrough #2: Reinforcing in Early Vertebrates

The hypothesis here is that the brain modifications that emerged in early vertebrates facilitated the singular breakthrough of “reinforcing,” which was thereby applied in multiple adaptive ways, such as in map-based navigation, interval timing, and omission learning.

By “reinforcing” I refer to “model-free reinforcement learning”. In reinforcement learning, a distinction is often made between “model-free” and “model-based” methods. In simple terms, model-based methods include the ability to “plan,” which requires an agent to “play out actions” before taking them and choosing the sequence of actions that has the best outcome. This “playing out” of actions thereby requires a “model” of state-transition probabilities. In contrast, model-free methods include only learning the direct association with the current state and the available actions. The hypothesis here is that this “model-free” method of learning first emerged in early vertebrates, while the “model-based” method emerged later with the first mammals.

Model-free reinforcement learning requires several features: recognition of states, predicting the magnitude reward, predicting the timing of reward, temporal difference error signal, and the use of this error signal to update reward predictions. Despite these shared features, there are still many different implementations and conceptualizations of model-free reinforcement learning and how it manifests in brains. As such, two clarifications must be made regarding the features of model-free reinforcement learning this hypothesis proposes emerged in early vertebrates.

The first clarification is regarding spatial maps. Model-based RL and spatial maps (sometimes called cognitive maps) are two concepts that sometimes get conflated—it has been suggested that evidence for the presence of a spatial map in an animal, as evaluated by various map-based navigational tests, is evidence for the presence of model-based reinforcement learning. The reasoning being that a spatial map requires a “model” of space. However, what makes an agent’s learning method model-based is not the presence of a spatial map but the use of that spatial map for the purpose of simulating future actions. A spatial map can still be used in the context of model-free learning without such simulation of future actions. For example, an agent’s current state can be defined as a location in space, and the actions it associates rewards with can be defined by the next target locations, which thereby would generate a homing vector from the current location to the next target location. This contains no “playing out” of state transition probabilities, but does have various adaptive properties, such as being robust to small changes in starting locations or paths (such as due to perturbations in water current). The hypothesis here proposes that spatial maps first emerged in vertebrates but they were not used for simulating future actions, and only used for learning associations between the current location and rewarding next target locations (see Figure 2).

Figure 2

The second clarification is regarding goal-directed vs. habitual behavior. Model-based learning and the ability to use reward identity in learning are sometimes conflated: it has been suggested that the presence of successful devaluation is evidence for the presence of model-based reinforcement learning. I will argue that this is not the case. A common experimental setup for such devaluation experiments is to allow a mouse to associate two levers each with a different type of food. Once this association is well learned, mice are allowed to eat one of those foods to satiation. When subsequently presented with each of these levers, mice will usually immediately favor the lever that produces the food that was not eaten to satiation (i.e., not devalued). Different experimental setups, as well as lesions of specific regions, can make mice insensitive to devaluation, whereby they will continue pushing the lever for the devalued food. Such behavior is typically considered to be “habitual” whereas that which is sensitive to devaluation is considered “goal-directed”. Habitual behavior at times is conflated with model-free learning: habitual behavior demonstrates direct stimulus-response associations, which can seem analogous to direct learning of rewarding actions from a given state in model-free learning. And similarly, goal-directed behavior clearly requires a “stimulus-stimulus” association, where each lever is associated with the food itself, and not just the original reward—this has been suggested to be evidence of model-based learning. However, neither is necessarily the case. If “states” include interoceptive information such as hunger level, then model-free learning can exhibit behavior that is sensitive to devaluation without any planning or playing out of future actions. Therefore, the proposal that early vertebrates were capable of model-free RL but not model-based RL does not suggest that all behavior of early vertebrates was habitual and insensitive to devaluation.

The main four new brain structures that emerged with early vertebrates were the pallium, the basal ganglia (BG), the tectum, and the cerebellum (Sugahara et al., 2017). This entire network of new brain structures of early vertebrates can be reasonably understood through the lens of the emergence of model-free reinforcement learning with spatial maps and timing. An interpretation of how these neural structures together implemented model-free RL is as follows (see Figure 3). Specific subregions of the pallium acquired the ability to represent allocentric representations of space (a “spatial map”), while others acquired the ability to recognize patterns of stimulus cues. Valence neurons in the evolutionarily older hypothalamus (inherited from the valence neurons of early bilaterians) became direct controllers of dopamine responses, whereby positive valence neurons stimulated dopamine and negative valence neurons inhibited dopamine. Activation of dopamine allowed long–term potentiation in the synapses between the pallium and the striatum of the BG. The BG then chunked together sequences of stimuli and places that tended to activate dopamine. The BG used these sequences to predict its own dopamine activations and used these predictions to inhibit dopamine neurons. This filtered out expected dopamine activations, thereby converting dopamine signals from a global valence signal to a reward prediction error (also called a “temporal difference learning signal”), which is an essential feature of model-free learning. The BG disinhibited neurons in the tectum, where allocentric representations from the pallium could be converted to egocentric movements, and hence drove movement towards specific “goal” locations activated in the BG. The pallial-BG system thereby learned sequences of allocentric goals that progressively climbed a dopamine gradient. This network generates “homing vectors” towards sequences of goal locations which are defined merely by climbing a dopamine gradient. Timing signals within the cerebellum, and potentially also in the pallium, enabled a representation of time that allowed animals to make choices not only based on where they are relative to an outcome, but also when they are relative to an outcome.

Figure 3

The breakthrough of model-free reinforcement learning would have offered early vertebrates numerous adaptive behavioral abilities. It would have enabled early vertebrates to navigate their environment not only with taxis-navigation, but also with map-based navigation—able to remember and navigate towards or away from specific locations in three-dimensional space. It would have enabled vertebrates to remember the specific timing between events, and thereby learn when to act. And it also would have allowed early vertebrates to not only learn from the presence of stimuli but also from the omission of stimuli. Such omission learning enabled vertebrates to perform much better on avoidance tasks.

Crucially, model-free learning would have only been possible in vertebrates because of the prior existence of valence neurons and neuromodulatory signals that evolved earlier in Bilateria.

Breakthrough #3: Simulating in Early Mammals

The hypothesis here is that the unique brain regions that emerged in early mammals facilitated the singular breakthrough of “simulating,” which was thereby applied in multiple adaptive ways, such as in vicarious trial and error (VTE), episodic memory, and counterfactual learning.

By “simulating” I simply refer to the ability to perform model-based reinforcement learning, whereby an animal can play out and simulate action sequences before taking an action. In early mammals, the dorsal pallium of the ancestral amniote transformed into the neocortex (Tosches et al., 2018). I propose that the unique capabilities offered by this new neocortex relative to the pallium were its ability to internally invoke simulated actions and stimuli. A mammal with such an ability can pause, simulate a reality, manipulate it, evaluate it, and then act accordingly. This ability can be applied in many ways, such as for VTE (simulating paths), episodic memory (simulating a past event), or counterfactual learning (simulating alternative choices).

An interpretation of how the neocortex enabled such model-based reinforcement learning is as follows. The neocortex seems to be made up of a repeated columnar microcircuit (the “neocortical column”; Mountcastle, 1978). There are many competing theories of the specific computations performed by this microcircuit—including predictive coding (Rao and Ballard, 1999; Bastos et al., 2012; Spratling, 2017; Keller and Mrsic-Flogel, 2018), adaptive resonance theory (Grossberg and Versace, 2008), and hierarchical temporal memory (George and Hawkins, 2009; Hawkins and Ahmad, 2016; Bennett, 2020). Despite differences in these interpretations, they all generally agree that the microcircuit builds a self-supervised “model” with the purpose of predicting the entirety of its bottoms-up input. The self-supervised nature of the neocortex shares many features with a class of machine learning models called “generative models” (Kersten et al., 2004; Knill and Pouget, 2004; Parr and Friston, 2018). A generative model learns a “latent representation” (also called a “model” or an “explanation”) of its input. A generative model has two modes—an “inference mode” where it picks a latent representation that best “explains” its bottom-up input and a “generative mode” where it generates its own training data given a specific latent representation. Learning occurs by comparing the match between the simulated data and the actual data. Learning is optimized to minimize such mismatches (i.e., minimize “prediction errors”). Hence these generative models are “self-supervised”—trained only by the degree with which their own model of reality has successfully predicted its own input.

The neocortex of early mammals had two broad sub-regions: the frontal cortex and the sensory cortex (see Figure 4). The frontal cortex in early mammals is believed to be homologous to the anterior cingulate cortex (ACC) of later mammals (Laubach et al., 2018; van Heukelum et al., 2020). The sensory cortex had several homologous subregions for different modalities—visual areas, somatosensory areas, and auditory areas. This model suggests that each subregion implemented a generative model of sensor data from a given modality, with the goal of explaining its own input. However, this model proposes that the ACC served a different function, albeit performing an identical computation. Instead of receiving input from external sensory, the ACC of all mammals receives input from the amygdala and hippocampus (Reppucci and Petrovich, 2015), and projects throughout the sensory cortex (Zhang et al., 2014; Goll et al., 2015; Atlan et al., 2018; White et al., 2018; reviewed in Kamigaki, 2019). I hypothesize that the ACC is building a generative model of “paths” from the hippocampus, given “goals” from the amygdala. The goal represented is not a complex representation of the actual objects or sensory stimulus, but rather the actual valence results in the amygdala. The ACC thereby tries to explain the sequence of places that will be taken given a latent representation of a “goal” from the amygdala. One interpretation of this is that the latent representation in ACC is a model of “intent”—it observes an animal’s path, place, and context from the hippocampus and attempts to predict why the animal is behaving the way it is. This is consistent with other conceptualizations of generative models in the context of movement, often referred to as “active inference” (Adams et al., 2012).

Figure 4

This model proposes that one function of this ACC representation of “intent” is its ability to trigger internally invoked simulations, which thereby allowed animals to engage in model-based learning (see Figure 5). When an animal reaches a “choice point” where the right answer is uncertain, this uncertainty is represented by multiple conflicting predictions from different columns of the ACC. This conflicting set of predictions triggers the animal to pause its movements. The neural substrate of this pausing may be the ACC direct projection to the subthalamic nucleus, which has been shown to be leveraged during top-down inhibition (Aron, 2006; Heikenfeld et al., 2020). The ACC can then trigger simulated paths through its loop with the hippocampus and can internally invoke the corresponding sensory representations of such paths through either its direct connections to the sensory cortex or through its indirect connections through the claustrum or hippocampus. During this “pause,” the generative model in the sensory neocortex shifts from being externally driven (“inference mode”) to internally driven (“generative mode”). The ACC will continue to explore “movements” consistent with its generative model of intent. These internally invoked representations of the world in the sensory neocortex can then be evaluated in the BG. When an imagined path finally achieves an outcome that leads the basal ganglia to release enough dopamine, it will trigger a “GO” response. This accomplishes two things. First, it immediately sensitizes the “imagined” path that the ACC triggered through the hippocampus, thereby biasing subsequent movements to be consistent with what was imagined. Second, it overcomes the ACC suppression of movement through the STN, enabling the evolutionarily older basal ganglia to take over behavior again.

Figure 5

During ongoing movement, when columns of ACC agree in their predictions of subsequent movement, the ACC can still exert some control over behavior by biasing the latent representations in the sensory neocortex to be consistent with the imagined path. This perhaps was the first version of “cognitive control” and “attention”. The ACC projects to sensory neocortex where it can perform “gain control” and bias sensory representations (Goll et al., 2015; Atlan et al., 2018; White et al., 2018).

Note that this “pause-simulate” behavior does not only apply to imagining action paths, but it also equally applies to imagining past events (episodic memory), imagining alternative choices to choices you already made (counterfactual learning), and perhaps even working memory (holding things “in mind”). The important feature is the ability of the ACC to trigger internally invoked simulations in the sensory cortex, which can be used to train the basal ganglia vicariously.

The motor cortex emerged in later mammals (placentals; Beck et al., 1996; Kaas, 2012). Motor cortex can also be interpreted through the lens of a frontal region that builds a model of “intent” and uses it to predict movement and trigger internally invoked simulation in the presence of uncertainty. In early mammals, the motor cortex was not required for moving in general but was required for movements that require preplanning. For example, movements where animals must grasp something they see or carefully step their feet on specific platforms requires simulating actions before moving—these fine movement skills are uniquely enabled by the motor cortex. The motor cortex can simulate these actions through its projections to the somatosensory cortex; the same way ACC can simulate paths through its projections to overall sensory cortex. The key difference between the motor cortex and the ACC is that the motor cortex gets its top-down input of “intent” from the ACC, and predicts specific body movements in the somatosensory cortex, while the ACC gets its top-down input of “intent” form the amygdala and predicts general navigational paths in the hippocampus. In this sense, the ACC relationship with the motor cortex was the first “motor hierarchy,” where goals flowed from the ACC to motor cortex. Therefore, the addition of the motor cortex can be viewed as an elaboration on the previous ACC-sensory network, which enabled the planning of fine motor movements.

The unique neocortical ability to trigger internally invoked simulations and use them for learning would not have been possible without two features inherited from earlier vertebrates. Firstly, spatial mapping in earlier vertebrates was repurposed in later mammals in order to explore environments vicariously. Without a spatial map, it would be impossible to simulate various movements and their consequences. Secondarily, internally invoked simulations work by training the basal ganglia vicariously—the basal ganglia does not have to tell the difference between an internally invoked or externally invoked sensory data from the sensory cortex, it merely learns what sequences of movements trigger dopamine release. This ability to learn vicariously was only possible because it was built on top of the foundation of the older basal ganglia.

Breakthrough #4: Mentalizing in Early Primates

The hypothesis here is that the unique brain regions that emerged in early primates facilitated the singular breakthrough of “mentalizing,” which was thereby applied in multiple adaptive ways, such as in anticipating future needs, theory of mind, and learning skills through observation.

By “mentalizing” I refer to the ability to construct a model of the mind, inclusive of an individual’s intent and knowledge. Such a model of mind can be applied in multiple ways—three such ways are “anticipating future needs,” “theory of mind” and “learning skills through observation.” All three of these can be seen merely as different applications of this singular ability of primates to engage in “mentalizing.” For example, mentalizing can be used to simulate a mind state of yourself that you do not have yet (imagining being hungry tomorrow if I do not gather food now, even though I am not hungry right now). It can be used to simulate the mind state of another conspecific you are observing (such as imagining how they must feel given their situation). It can be used to simulate the intentions and actions of others when you are watching them do various motor skills, which enables you to “learn by observation.” The behavioral manifestations are different, but the neural substrates are overlapping for a reason: the overall function is the same (simulating a mind state).

The primary two new brain structures that emerged in early primates were the granular prefrontal cortex (gPFC) and polysensory cortex (PSC; Kaas, 2009). An interpretation of how granular prefrontal cortex and PSC enabled such mentalizing is as follows (see Figure 6). gPFC gets its inputs from three primary sources: (1) it is bidirectionally connected with the ACC, both directly and through the mediodorsal thalamus (Kondo et al., 2004; Cera et al., 2019; Tang et al., 2019); (2) gPFC is interconnected with polysensory areas such as the superior temporal cortex and TPJ (Sanfey, 2003; Greene et al., 2004; Buchsbaum et al., 2005; Tei et al., 2017); and (3) it is interconnected with motor cortex both directly and indirectly through descending loops through the motor thalamus (Bosch-Bouju et al., 2013; Yokoi and Diedrichsen, 2019). I hypothesize that gPFC and PSC together implement a generative model of the ACC-sensory generative model itself. In other words, the gPFC-PSC generative model is constructed to “explain” the “intentions” from the ACC given “knowledge” from PSC. The emergent property of this is that it is effectively a generative model of one’s own “mind,” the use of which can be thought of as “mentalizing”.

Figure 6

When applying this gPFC-PSC network to “mentalizing about oneself,” an animal can anticipate a need it does not currently feel yet. One big difference between the ACC-sensory network and the gPFC-PSC network is that the ACC-sensory network is self-supervised to predict amygdala and hypothalamic activation, while the gPFC-PSC is self-supervised to predict latent representations within ACC-sensory network. The ACC-sensory network can predict paths that accomplish the needs currently experienced. On the other hand, gPFC-PSC network can simulate situations in the future and predict what intentions would be selected in the ACC-sensory network given such situations. The ACC-sensory network can’t do this because it is supervised to predict amygdala activation, which is therefore only sensitive to current needs. One reason this new network was adaptive was because it enabled early primates to anticipate future needs and hence plan much more flexibly than the mammals that came before. Practically, this enables primates such as humans to go to the grocery store and pick up food for the week even when they are not yet hungry—or to bring a jacket on a trip even though they are not yet cold. This ability to “anticipate future needs” was originally thought to be unique to humans, summarized as the Bischof-Kohler Hypothesis (Bischof-Köhler, 1985), but this model proposes that it in fact evolved in early primates, which as we will see in later sections is consistent with more recent tests in nonhuman primates and non-primate mammals.

Once a generative model of one’s own mind exists, it can be applied not only to “mentalizing about oneself” but also to “mentalizing about others”. Because the gPFC-PSC network is a model of what behaviors are generated from what intentions and knowledge, it can be applied to trying to identify what intentions and knowledge in others are consistent with their observed behaviors. The same way that the ACC-sensory system pauses and simulates objects and paths, the gPFC-PSC system can pause and simulate intentions and knowledge of others to generate a latent representation most consistent with the observed behavior. This mentalizing about others can manifest itself in several ways. First, it can be used in the basic theory of mind tasks to infer the knowledge of others given their perspective. And second, it can be used to learn motor skills through observation.

One attractive feature of this proposal of the gPFC-PSC network is that it fits within the conceptualization of the neocortical microcircuit. Given the observation that the neocortical column seems remarkably uniform throughout the neocortex (Mountcastle, 1978), any purported new “function” attributed to a new neocortical region should be primarily a consequence of unique inputs and outputs, as opposed to any changes in the underlying computations of the neocortical column. Consistent with this, the idea here is that the microcircuitry of the granular prefrontal cortex is the same as that of the ACC and sensory cortex: it is still implementing a generative model. The difference is merely where it receives input from. The ACC constructs a model getting bottom-up input from the hippocampus. The latent representation in the ACC is then a representation of an animal’s current intent (intent is an “explanation” of an observed path), which is used to make predictions of subsequent navigational paths. In contrast, the granular prefrontal cortex gets bottom-up input from the ACC. The latent representation in the granular prefrontal cortex is then a representation of a mind state (mind state is an “explanation of an intent”) which is then used to construct make predictions about one’s own or others’ intent.

Crucially, simulating mind states (mentalizing) in early primates was only possible because of the inherited ability to simulate world states that evolved in earlier mammals. Mentalizing was built on the foundation of simulating world states. The model here proposes that mentalizing network itself is a model of the mammalian ACC-sensory network, literally built on top of the evolutionarily older brain structures.

Breakthrough #5: Speaking in Early Humans

The hypothesis here is that the unique brain regions that emerged in early humans facilitated the singular breakthrough of “speaking,” which was thereby applied for language and music.

By “speaking” I do not refer only to vocal communication, but broadly to semantic rhythmic communication in general. Human brains, although bigger, are remarkably similar to the primate brains that came before. However, one fundamental difference that seemed to emerge was a modification to the arcuate fasciculus and its connectivity with the basal ganglia (“AF-BG network”). I hypothesize, as have others, that the neural innovations for language and music emerge from the unique connectivity of the arcuate fasciculus, as well as perhaps other additional connectivity with the striatum (Fujii et al., 2016). This has been the classic view of language since the time of Wernicke (Wernicke, 1874; Stookey, 1963; Berker et al., 1986; Anderson et al., 1999). More modern theories of language have criticized the simplicity of this original model and proposed additional structures (Rasmussen and Milner, 1977; Imaizumi et al., 1997; Rauschecker and Scott, 2009; Chang et al., 2015). I do not make any specific claim on the exact mechanisms of language production, merely that language and music were emergent properties only possible with the additional connectivity of the arcuate fasciculus.

The view here is that music and language are two sides of the same coin and emerge from the same neural innovations. There is an intuitive appeal to this hypothesis, as both music and language share many features. Both require rhythmic entertainment coordinated with other conspecifics. In other words, “beat perception” is necessary for both music and for taking turns appropriately in a conversation. Both are hierarchical and nested (Drake et al., 2000; Toiviainen and Snyder, 2003; McKinney and Moelants, 2006; Martens, 2011)—music contains beats within “bars” within “phrases,” and language contains phonemes within words within sentences. Both are highly “predictive”—when you hear an unfinished sentence such as “the ford mustang is my favorite …,” you cannot help but finish it. This is the same for music, when you hear an unresolved musical phrase.

In the context of this model, one explanation for the ordering of these breakthroughs is that such rhythmic semantic processing for communication was only possible after the breakthrough of “mentalizing”. Only with the ability to infer and understand the knowledge of others is one able to devise a reasonable communication to transfer information to someone. Consistent with this, the AF-BG network is very overlapping with the mentalizing regions that came before.

Model Summary

In Figure 7, you can see a summary of the proposed five breakthroughs, and the modifications and behaviors they explain. You can also see the homologous regions in human brains.

Figure 7

As previously mentioned, one attractive feature of this model of five breakthroughs is that it incorporates evolutionary constraints and thereby helps understand how neural innovations build on each other. Consider the following dependencies suggested by this model.

Steering in early bilaterians was only possible because of the neural building blocks that emerged earlier in eumetazoans, which provided all-or-nothing neurons with sensory cells and muscle cells. It provided inhibitory and excitatory neurons for the creation of neural circuits, and neuropeptides for the modulation of neural responses based on internal states.

Reinforcing (model-free reinforcement learning) in early vertebrates was only possible because of the features of steering in the ancestral bilaterian brain. The core mechanism for learning sequences of places is the temporal-difference learning mechanism which bootstraps these learned responses on hardcoded valence responses. Ancestral bilaterians had such valence responses that could stimulate the release of various neuromodulators. The pallial-BG-tectal circuit could then build plastic networks on top of these basic valence responses to enable learning more complex sequences of paths. This is consistent with other interpretations of behavior as evolving (through phylogeny and ontogeny) from the basic foundation of approach and withdrawal (Schneirla, 1959).

Simulating (model-based reinforcement learning) in early mammals was only possible because of model-free reinforcement learning in ancestral vertebrate brains. Simulating requires: (1) a “pointer” to a specific suite of stimuli; and (2) a mechanism for manipulating key features of the stimuli once invoked. The spatial map in the pallial hippocampus provided both of these features—internally invoking a “place” in the pallial hippocampus will reactivate a broad suite of learned responses to that place. Further, the ability to tether different places together and generate novel paths, enables the pallial hippocampus, if it were internally invoked, to imagine “novel” paths. It is then no surprise that the neocortex is, in some ways, built directly on top of the much older hippocampus, and uses the hippocampus for its simulating functions, such as VTE and episodic memory.

Mentalizing in early primates was only possible because simulating in ancestral mammal brains came first. In the model described here, mentalizing is computationally the same as simulating, the only difference being that mentalizing uses a generative model of the mind itself, requiring new neocortical regions and new connectivity between them. Mentalizing could only occur after there was already a generative model of intent on which to simulate possible mind states.

And lastly, language and music in early humans were only possible because of mentalizing in ancestral primate brains. There are several pieces of evidence for this. First, language and music are built directly on top of the neural structures of mentalizing: they emerge from uniquely primate mentalizing areas such as STS, TPJ, and the prefrontal cortex. Second, mirror neurons have been highly implicated in both theories of language origin as well as language function (Gallese et al., 1996; Rizzolatti et al., 1996; Rizzolatti and Arbib, 1998; Fogassi et al., 2005; Schooler et al., 2011; Vredeveldt et al., 2011; Yonemitsu et al., 2018). Third, people with disorders that disrupt mentalizing abilities, such as autists, also show language impairments (Mitchell et al., 2006; Luyster et al., 2008; Mody and Belliveau, 2013). And lastly, as an infant develops, the advanced mentalizing abilities emerge first in children before language emerges, with “intersubjectivity” and “joint attention” occurring before words (Bateson, 1975; Beebe et al., 1988; Carpenter et al., 1998; Meltzoff and Brooks, 2001; Bard et al., 2005). These “theory of mind” abilities are a requirement for the development of language abilities (de Villiers, 2007). The connection between mentalizing and language makes sense through the lens of various theories of the evolution of language. If the original evolutionary pressure for language was in fact for communicating the type and location of faraway food to conspecifics, as has been proposed (Monahan, 1996), then it makes sense that it would at first require mentalizing: the goal of the communication is to figure out what your communication partner does not know that you want them to know, and how to use shared understanding to transfer this knowledge.

Evidence for the Model

Three categories of evidence are presented in support of the model presented here. First, evidence is presented to support the proposed phylogenetic timing of specific brain modifications. Second, evidence is presented to support the proposed phylogenetic timing of specific behavioral abilities. And third, evidence is presented to support the proposed function of these brain modifications and how they enabled the specific behavioral abilities. For brevity, much of the evidence presented will direct readers to other work where the evidence for various claims is presented more thoroughly. Some of the claims are less controversial than others; extra attention will be spent on the claims for which there is more controversy.

Evidence For the Proposed Phylogenetic Timing of Brain Modifications

Evidence That a Global Neural Integration Center (The First “Brain”) Emerged in Early Bilaterians

The prevailing view is that the first neurons and nervous systems emerged in the common ancestor of eumetazoans (“real metazoans”) during the Ediacaran period around 600 million years ago (Peterson and Butterfield, 2005). Animals before Eumetazoa are believed to be devoid of nervous systems, akin to extant sea sponges (Bucher and Anderson, 2015). In these early Eumetazoans, nervous systems were not organized into any recognizable “brain,” but rather a diffuse “nerve net” (Holland, 2003; Lowe et al., 2003; Galliot and Quiquand, 2011; Arendt et al., 2015). Evidence for this comes from the nerve nets of cnidarians, which are some of the earliest diverging eumetazoans.

Despite a lack of a brain, nonbilaterian eumetazoans such as cnidarians have a surprisingly rich set of neural features, strongly suggesting that early eumetazoans already contained many of the building blocks of brains. For example, neurons in cnidarians communicate using “all-or-nothing” action potentials and form chemical synapses with each other (Satterlie, 2015). Cnidarian neurons contain many modern-day neurotransmitters, including glutamate, GABA, and acetylcholine (Kass-Simon and Pierobon, 2007; Marlow et al., 2009; Delgado et al., 2010; Pierobon, 2012). Cnidarians have also been shown to contain many of the same ionotropic and metabotropic receptors for glutamate (both AMPA and NMDA), GABA, acetylcholine, and even some monoamines (Anctil, 2009; Collin et al., 2013; Bosch et al., 2017).

Neural circuits within cnidarians, and hence also likely early eumetazoans, also came with several well-known features of neurons across taxa such as adaptation and interoception. Adaptation, whereby animals respond less to repeated stimuli, has been shown in various responses such as the tentacular responses in hydras and sea anemones (Parker, 1916; Pantin and Pantin, 1943; Batham and Pantin, 1950). Interoception, whereby animals modulate their responses based on internal cues for various need states, has also been shown in cnidaria. For example, hydras and sea anemones seem to be less sensitive to food cues (less likely to trigger the feeding response and less responsive to mechanical stimulation) when full than when hungry (Parker, 1916; Pantin, 1935; Batham and Pantin, 1950; Lenhoff and Loomis, 1963; Han et al., 2018).

In contrast, even very early diverging bilaterians, such as C. elegans and planarians, have global neural integration centers that can be thought of as a “brain” (Garrity et al., 2010). Although there is still controversy regarding whether the first bilaterians had a neural net or an actual brain (Hejnol and Martindale, 2008; Arendt et al., 2015), both interpretations agree that the first brain emerged within the bilaterian lineage.

Evidence That the Pallium, Basal Ganglia, Tectum, and Cerebellum Emerged in Early Vertebrates

Before the first vertebrates, brains are believed to have been made up of a homolog of a hypothalamus, midbrain, and hindbrain, as in early diverging chordates such as amphioxus (Nieuwenhuys, 1998; Gorbman et al., 1999; Uchida et al., 2003; Murakami et al., 2005). There is a general consensus that with vertebrates, at least five additional structures emerged: the pallium, the BG, the tectum, the cerebellum, and the thalamus (Sugahara et al., 2017). Evidence mostly comes from the fact that these structures, their microcircuits, and the canonical connectivity between them are highly shared amongst extant vertebrates. Even the lamprey, considered one of the earliest diverging vertebrates and considered a model organism for the early vertebrates, shares exactly this template (Grillner and Robertson, 2016). Evidence suggests the amygdala, hippocampus, and neocortex all evolved from the pallium: specific subregions of the pallium in early diverging non-mammal vertebrates such as fish seems to be a proto-hippocampus (Macphail, 1982; López et al., 2003), and the other subregions of the pallium a proto-amygdala or proto-olfactory cortex, performing similar functions and containing similar microcircuits.

Some protostomes, such as arthropods, have brain regions that are structurally and functionally similar to the vertebrate pallium and basal ganglia, such as the mushroom body (similar association properties as the pallium) and central complex (similar properties as the basal ganglia). Some have argued that these structures share homology with vertebrate structures, suggesting that precursors to them emerged early in Bilateria (Strausfeld and Hirth, 2013). However, others argue it is simply an example of convergent evolution (Northcutt, 2012; Farries, 2013). Given that these structures are entirely absent from most protostome taxa, including nematodes, flatworms, mollusks, and annelids, the principle of parsimony makes convergent evolution seem more likely. Further, modern views of the urbilaterian ancestor suggest that its brain was devoid of such structures and instead was made up of a simple sensory region and a motor region (Arendt et al., 2015). Either way, the protostome versions of these structures have many more differences from the vertebrate versions than the vertebrate versions have from each other. This implies that the modern form of these structures first emerged in early vertebrates.

Evidence That the Neocortex With ACC and Sensory Regions Emerged in Early Mammals

There is broad consensus that the six layered neocortex first evolved in early mammals, and was derived from the three layered dorsal pallium in earlier amniotes (Tosches et al., 2018). Along with this modification, the surrounding pallial structures are also believed to have become recognizable in their mammalian forms—the medial pallium became the mammalian form of the hippocampus, and the lateral pallium became the mammalian olfactory cortex and/or the amygdala (Kaas, 2009; Tosches et al., 2018). The basal ganglia, tectum, thalamus, hypothalamus, and other midbrain and hindbrain structures are remarkably similar across non-mammal vertebrates and mammals alike, strongly suggestive that they were left relatively unchanged in early mammals.

Although there is little controversy regarding the evolution of the neocortex in early mammals, the degree to which it is “different” from the structures that emerged earlier is somewhat controversial. For example, some evidence suggests that the dorsal ventricular ridge (DVR) of birds (containing the “nidopallium” and “mesopallium”) and the neocortex of mammals both derive from the pallium of their shared amniote ancestor (Karten, 1969, 1997; Reiner et al., 2004; Dugas-Ford et al., 2012). It has been shown that the DVR and neocortex share many features, including molecular properties and the subcortical structures they interact with. Hence, this might be used to call into Question the uniqueness of the mammalian neocortex, and perhaps suggests that the amniote common ancestor had neocortex-like structures. However, this interpretation is unconvincing for two reasons.

First, the brains of birds are a poor model organism for the amniote last common ancestor. It is not even clear that the DVR itself is homologous with the neocortex and instead might share homology with the mammalian amygdaloid complex (Jarvis et al., 2005; Striedter, 2005). Further, even if the DVR does share homology with the neocortex, the microcircuitry of the DVR is very different from that of the neocortex. The neocortex is organized into six layers, while the DVR is organized into clustered nuclei (Ulinski, 1983). The ontogeny of the DVR and the mammalian neocortex is different (Jones and Levi-Montalcini, 1958; Striedter and Keefer, 2000; Dugas-Ford et al., 2012). Additionally, the pallial homologs in non-bird and non-mammal extant amniotes such as reptiles, also have unique ontogeny and microcircuitry from both birds and mammals (Goffinet et al., 1986; Cheung et al., 2007). For example, turtles have a three layered pallium, instead of the clustered nuclei of the DVR or the six layered neocortex. The turtle cortex is more like the three layered pallium of other non-amniote vertebrates, such as fish (Mueller and Wullimann, 2009), than it is to the DVR of birds or the neocortex of mammals. This is more consistent with the proposal that early vertebrates had a three layered pallium, much like that of extant reptiles and teleosts, and that the pallial homologs in birds and mammals each underwent substantial independent modification since the amniote last common ancestor.

Although there is considerable debate regarding which neocortical regions emerged first, it is generally accepted that by the time the early placental mammals emerged around 65 million years ago the following neocortical areas existed: V1, S1, A1, M1, cingulate cortex, insular cortex, orbital frontal cortex (Kaas, 2012). Even in earlier diverging mammals such as marsupials, which diverged from our lineage over 150 million years ago, there are numerous neocortical areas, such as a cingulate, V1, S1, and A1 (Karlen and Krubitzer, 2007; Wong and Kaas, 2009). This is consistent with the model herein which proposes that the very early neocortex contained both frontal regions and sensory regions. I use the label ACC to refer to the entirety of the agranular frontal cortex in early diverging mammals, inclusive of the areas sometimes called prelimbic or infralimbic cortex. Consistent with this, evidence suggests the entire prefrontal cortex of rodents is homologous to the anterior cingulate of primates (Laubach et al., 2018; van Heukelum et al., 2020).

Evidence That the Major Brain Modification of Early Primates Was the Addition of the “gPFC-PSC Network”

Primate brains are bigger in size than earlier diverging mammals relative to the overall body, but for the most part, contain the same neural structures. Some relative differences within primate brains include a shrinking of the olfactory bulbs and a substantial expansion of the visual cortex, somatosensory cortex, and posterior parietal cortex. However, the four substantial differences in the brains of extant primates from earlier diverging mammals are: the addition of (1) granular prefrontal cortex (gPFC; premotor areas, dorsolateral prefrontal cortex, and frontopolar cortex; Semendeferi et al., 2001); (2) PSC (including the superior temporal sulcus and the temporoparietal junction); (3) the dorsal pulvinar (Preuss, 2006); and (4) a unique cortico-motoneuronal system, where corticospinal projections bypass older circuits and make direct connections with spinal motorneurons (Murabe et al., 2018; Lemon, 2019). Although some suggest such direct corticospinal projections also occur in rats (Elger et al., 1977; Carlin et al., 2000; Yang and Lemon, 2003; Alstermark et al., 2004; Maeda et al., 2015; Gu et al., 2017), new evidence shows that these direct projections disappear in adulthood (Murabe et al., 2018), unlike in primates (Armand et al., 1997; Eyre, 2007).

The predominant view is that these structures and modifications emerged in early primates as they do not seem to exist in non-primate mammals, while they do exist in most extant primates (Kaas, 2009). Further consistent with the proposed model, and the idea that these new structures served a single purpose, the dlPFC, temporal cortex, and parietal areas all make up their own interconnected network, through the uniquely primate dorsal pulvinar (also called medial pulvinar; Goldman-Rakic, 1988; Gutierrez et al., 2000).

Evidence That the Major Brain Modification of Early Humans Was a Modification to the Arcuate Fasciculus

There are two main differences in the connectivity of human brains relative to those of non-human primates. First, the arcuate fasciculus (AF), which is a network of cortico-cortical connections between areas in the prefrontal cortex (“broca’s area”) and areas in the posterior cortex (“Wernick’s area”), is massively expanded in humans (Aboitiz and Garciía V, 1997; Aboitiz et al., 2006, 2010; Rilling et al., 2008, 2012; Aboitiz, 2012; Catani and Bambini, 2014; Petrides, 2014; Rilling, 2014; Stout and Hecht, 2017) and contains unique connectivity (Petrides and Pandya, 1984; Catani et al., 2004; Schmahmann et al., 2007; Rilling et al., 2008; Thiebaut de Schotten et al., 2012). Second, in humans, there is a direct projection from the motor cortex to laryngeal motor neurons, which is not found in nonhuman primates (Fitch, 2018; Jarvis, 2019). Although Broca’s area and Wernicke’s area are highly implicated in unique human abilities such as language, homologous regions of both have been found in nonhuman primates (Cantalupo and Hopkins, 2001; Schenker et al., 2009; Spocter et al., 2010), and therefore do not seem to have uniquely emerged in early humans.

Evidence for Proposed Phylogenetic Timing of Behavioral Abilities

Elsewhere I have more thoroughly reviewed the evidence for the below hypotheses regarding the phylogenetic timing of behavioral abilities during brain evolution in the human lineage (Bennett, 2021). I will briefly review some of the evidence here.

Evidence That Taxis-Navigation and Associative Learning Emerged in Early Bilaterians

To clarify, the hypothesis here is that taxis navigation, implemented in neurons and muscles, first emerged in bilaterians, and not that taxis navigation in general first emerged in bilaterians. Because this article is interested in brain evolution, the functions and features of the first brains are the focus of this article. Therefore, taxis navigation that existed in non-neural substrates is not explored in detail. C. elegans and flatworms, generally considered model organisms for urbilateria, demonstrate clear taxis-based navigation (Pearl, 1903; Larsch et al., 2015). These model organisms are also able to perform cross-modal decision-making. They can integrate conflicting input from various modalities (thermosensation, mechanosensation, photosensation, and chemosensation) in order to make a single integrated navigational decision. For example, C. elegans will make different decisions about whether to cross a copper barrier (which is aversive) to get to the food on the other side depending on the strength of the food smell relative to the concentration of copper (Ishihara et al., 2002). Similarly, flatworms will navigate towards food despite an aversive light source, but as the light becomes brighter, they will not go as far towards the food. This type of integrated decision-making occurs across numerous modalities including mechanosensation and thermosensation (Inoue et al., 2015). The taxis navigation of these model organism also demonstrate persistent navigational states, which would have been useful for persisting navigational decisions even after sensory stimuli have faded. For example, the C. elegans demonstrates at least three behavioral states: roaming, dwelling, and sleep (Fujiwara et al., 2002). Roaming is categorized as primarily straight-line swimming with infrequent turns—enabling an animal to relocate. In contrast, dwelling is categorized by slow swimming and frequent turning, enabling an animal to “locally search” its general area.

On the other hand, evidence for taxis navigation in adult form non-bilaterian eumetazoans is sparse. The hunting strategy of sea anemones, believed to be a model organism for early cnidarians (Yuan et al., 2011), is to wait for food to come to them (Ruppert et al., 2004). Retraction reflexes in cnidarians don’t drive locomotion in a specific direction, and instead seem to simply globally increase arousal (Batham and Pantin, 1950). Even most medusae, a more complex adult cnidarian form that likely evolved after the cnidarian-bilaterian divergence, do not show taxis navigation towards food sources, and merely seem to orient themselves in direction of current (Fossette et al., 2015), and hunt by moving in a “levy walk” (Hays et al., 2011). There are admittedly some exceptions to this. Box jellyfish can use eye spots to avoid obstacles (Garm et al., 2007), but it is generally accepted that the eyes of box jellyfish evolved independently (Nilsson, 2013; Bosch et al., 2017). Sea anemones have been shown to move towards light sources (Parker, 1916), but this has been shown to be independent of their own visual apparatus and driven simply by nearby amoebae (Pearse, 1974; Foo et al., 2019).

While larvae of earlier diverging metazoans, such as sponges, show taxis navigation (Leys et al., 2002), their adult forms show no such behavior and the neural implementation is based on cilia and not neurons and muscles. Adult forms of ctenophores move through ciliated pumping which may be coordinated via neurons and therefore may represent an adult form metazoan with neuron-based taxis navigation. However, how well extant ctenophores represent early metazoans is unclear; much evidence favors the idea that ctenophores independently evolved many features of nervous systems (Ryan, 2014; Moroz, 2015; Moroz and Kohn, 2016; Liebeskind et al., 2017). This would suggest that ctenophore taxis navigation is not indicative of early metazoans before the cnidarian-bilaterian last common ancestor.

Most evidence also suggests that associative learning first emerged in early bilateria (Bennett, 2021; Ginsburg and Jablonka, 2021). Associative learning, including classical conditioning and instrumental conditioning, has been shown broadly across Bilateria, including even early diverging species such as aplysia (Hawkins et al., 1989), planarians (Prados et al., 2012), and C. elegans (Ardiel and Rankin, 2010). In contrast, attempts to find associative learning across cnidaria have shown primarily negative results (Rushforth, 1973; Torley, 2009). Admittedly there is a single report of associative learning in a sea anemone (Haralson, 1975)—however, this is inconsistent with most other studies. Similar to others, I conclude that cnidaria do not contain associative learning (Ginsburg and Jablonka, 2019), and at the very least, if they do, it evolved convergently.

It should be noted that there are different interpretations of associative learning. Some make a distinction between “alpha conditioning” and “beta conditioning” (Razran, 1971; Moore, 2004). “Alpha conditioning” is defined as that where a non-habituated CS elicits the same reflexive response as the US, but after pairing with the US the magnitude of the response elicited by the CS increases. In contrast, “beta conditioning” is considered “true associative learning” whereby the CS elicits no reflexive response before pairing. However, others disagree with this distinction (Hawkins and Kandel, 1984; Kandel, 2006), and instead argue that there is, in fact, no difference between alpha and beta conditioning—they argue that all wiring is pre-wired, the only difference is whether the wiring is strong enough to elicit a response before pairing. All these perspectives agree that there is a distinction between general sensitization, where the specific US globally sensitizes a suite of reflexive responses, and associative learning, whereby this sensitization is local and specific to the paired US.

Evidence That Map-Based Navigation, Interval Timing, and Omission Learning Emerged in Early Vertebrates

In the context of map-based navigation, diverse and early diverging vertebrates including fish (Burt de Perera et al., 2016), reptiles (Wilkinson and Huber, 2012; Broglio et al., 2015), turtles (López et al., 2001), amphibians (Phillips et al., 1995), and tortoises (Wilkinson et al., 2007) all show the ability to build spatial maps of their environment and flexibly generate novel navigational routes to known places (Rodríguez et al., 2002). Model organisms for early bilaterians, such as flatworms seem to navigate only with taxis and perhaps response-based learning (Pearl, 1903; Luersen et al., 2014; Larsch et al., 2015; Gourgou et al., 2021) and show no ability to remember specific un-cued locations. Further, the neural substrates of such map-based navigation in vertebrates are uniquely vertebrate structures, such as pallial homologs of the hippocampus, suggestive of vertebrate origins. Importantly, the map-based navigational tests that early diverging vertebrates, such as fish, pass do not require planning, but they do require a spatial map that can generate novel homing vectors to well-learned locations in three-dimensional space. There is indeed evidence of sophisticated 3-dimensional navigation in arthropods, however, the degree to which they truly represent map-based representations and the degree to which these abilities are representative of early bilaterians is unclear. Some evidence suggests that arthropods fail at map-based navigation tasks (Benhamou et al., 1990; Wehner et al., 1996, 2006; Walker, 1997), while other evidence suggests that they indeed can build map-like memories (Boles and Lohmann, 2003; Menzel et al., 2005, 2011). Further, many impressive abilities of arthropods emerge from mushroom bodies (Perry et al., 2013; Cope et al., 2018), which is a cortex-like structure believed to have evolved independently (Farris, 2008).

In the context of interval timing, diverse and early diverging vertebrates such as fish (Sumbre et al., 2008), birds (Bateson and Kacelnik, 1997; Ohyama et al., 1999; Buhusi et al., 2002), non-human primates (Gribova et al., 2002), and mice (Roberts and Church, 1978; Gallistel et al., 2004; Buhusi et al., 2005) all show the ability to remember the precise timing between two cues. In contrast, invertebrates show a weak perception of time, if one at all (reviewed in Abramson and Wells, 2018). Further, the neural substrates of interval timing in vertebrates seem to be uniquely vertebrate structures (Buhusi and Meck, 2005; Yin and Meck, 2014).

In the context of omission learning, vertebrates such as dogs (Cole and Wahlsten, 1968), mice (Kamin, 1957; Avcu et al., 2014), and fish (Woodard and Bitterman, 1973; Abramson et al., 1988; Portavella, 2004; Vindas et al., 2014) have demonstrated the ability to learn from omission. By omission learning, I refer to the ability to learn an association based on a predicted event not occurring, as opposed to learning from a stimulus presentation or offset. In contrast to vertebrates, invertebrates seem to fail on such omission learning studies, and only learn from stimulus offsets (Abramson et al., 1988; Wenner and Wells, 1990; Sanderson et al., 2013; Abramson and Wells, 2018).

Evidence That Vicarious Trial and Error, Episodic Memory, and Counterfactual Learning Emerged in Early Mammals

VTE is a behavior whereby an animal pauses at choice points and toggles its head back and forth, and “plays out” each option vicariously (reviewed in Redish, 2016). Recording studies have corroborated the hypothesis that animals are playing out these options vicariously, hippocampal place cells are shown to vicariously encode place sequences of each possible path (Johnson and Redish, 2007; Gupta et al., 2012). VTE has been shown across rodents, nonhuman primates, and humans (reviewed in Redish, 2016). Further, uniquely mammalian structures, such as the prefrontal cortex, are highly implicated in such VTE behavior. In contrast, I am not aware of any studies that have shown VTE behavior in non-mammals. Taken together, this is suggestive that VTE emerged in early mammals.

Counterfactual learning is when an animal learns from an alternative choice they could have made but did not actually make. Learning from counterfactuals has been shown in rodents (Lewis, 2014; Steiner and Redish, 2014), nonhuman primates (Abe and Lee, 2011), and humans (Zhang et al., 2015). Further, uniquely mammalian structures, such as the orbitofrontal cortex, are highly implicated in counterfactual learning (Gilovich and Medvec, 1995; Camille et al., 2004; Coricelli et al., 2005, 2007). In contrast, I am not aware of any studies that have demonstrated counterfactual learning in non-mammals. Taken together, this is suggestive that counterfactual learning emerged in early mammals.

A key test of the ability to engage in episodic memory is whether an animal can answer an unexpected Question about its own past. The ability to answer such unexpected Questions has been shown in mammals including dogs (Fugazza et al., 2020), rats (Crystal, 2013), and nonhuman primates (Menzel, 1999). The neural substrate of such episodic memory seems to be the hippocampus reactivating distributed representations across the neocortex (Eichenbaum et al., 2007). Such episodic memory has also been shown in pigeons (Zentall et al., 2008) and cephalopods (Billard, 2020). However, birds are poor model organisms for the amniote common ancestor, and cephalopods are a poor model organism for the bilaterian common ancestor; both seem to have independently evolved many unique brain structures (Roth, 2015). I am not aware of any tests of episodic memory, whereby animals answer unexpected Questions, in amniotes outside of birds and mammals. Taken together, this is suggestive, but far from conclusive, that this type of episodic memory emerged in early mammals.

One might think that vicarious trial and error is a requirement in order for “spatial maps” to be used by animals for map-based navigation, but this seems to not be the case. For example, fish do not show VTE behavior, but can remember locations in three-dimensional space (Karnik and Gerlai, 2012; Burt de Perera et al., 2016; Lucon-Xiccato and Bisazza, 2017; Wallach et al., 2018), and can generate novel paths to the goal, even if it requires losing sight of that goal and swimming further away from it at first (Gómez-Laplaza and Gerlai, 2010). This demonstrates some ability to generate spatial maps, some lightweight “working memory” (staying on task even though losing sight of a goal), and object permanence (Sovrano et al., 2018).

If much of the presumably intelligence spatial navigation tasks can be performed by animals without VTE, then what is the benefit of VTE? Some suggestive evidence can be seen in the superior performance of mammals on certain tasks requiring “hard choices.” For example, mammals tend to substantially outperform non-mammal vertebrates on detour tasks, where an animal has to make a roundabout path around a barrier in order to get to a goal (MacLean et al., 2014; Gatto et al., 2018; Macario et al., 2020). Mammals also seem to outperform non-mammals in delayed gratification tasks, whereby they have to resist choosing an immediate small reward in order to get a delayed larger reward (Stevens et al., 2010). Some evidence also suggests that early diverging vertebrates such as fish cannot learn to zero-shot update the reward value of places (Beyiuc, 1938).

Evidence That Anticipating Future Needs, Theory of Mind, and Learning Through Observation Emerged in Early Primates

The Bischof-Kohler hypothesis states that only humans can take actions to alleviate a need that they will have in the future, but do not currently feel (Bischof-Köhler, 1985). However, evidence now suggests that in addition to humans, many nonhuman primates are also capable of this ability (McKenzie et al., 2004; Mulcahy and Call, 2006; Naqshbandi and Roberts, 2006; Janmaat et al., 2014). In contrast, non-primate mammals such as rodents have been shown to be unable to anticipate future needs (Naqshbandi and Roberts, 2006). Consistent with this, the substrates of the ability to anticipate future need states and use them to change current decisions seem to be uniquely primate structures, such as the dorsolateral prefrontal cortex (McClure et al., 2004; Kim et al., 2008; Hare et al., 2009). This is suggestive that the ability to perform this task emerged in early primates.

Theory of Mind (ToM) refers to the ability to take the perspective of others and understand their intentions and knowledge. It is still controversial whether any animals outside of humans contain this ability. But the balance of evidence seems to favor the idea that many primates indeed have ToM, even if it is not as robust as that of humans. Diverse species of nonhuman primates pass “false belief tests” (Bräuer, 2014; Krupenye et al., 2016; Smith, 2016; Kano et al., 2019). Further, nonhuman primates can distinguish between accidental and intentional actions and can distinguish between someone “unwilling” to do an action and someone “unable” to do an action (Call and Tomasello, 1998; Tomasello et al., 2003; Call et al., 2004; Tomasello et al., 2005). Further, the two neural structures most implicated in the theory of mind, the superior temporal sulcus and the temporoparietal junction, seem to have uniquely emerged in early primates (Kaas, 2009). In contrast, the bulk of studies on non-primate mammals conclude that they do not have ToM (Byrne et al., 2001; Bräuer, 2014; Aldhous, 2015). There is some evidence of ToM in birds (Bugnyar et al., 2016), but as we have discussed, birds are a poor model organism for the amniote common ancestor. Taken together, this is suggestive that theory of mind, even in a primitive form, emerged in early primates.

Learning skills through observation has been demonstrated across diverse species of nonhuman primates (Tomasello et al., 1987; Meunier et al., 2007; Ferrucci et al., 2019). The neural substrate of learning through observation also seems to be uniquely primate structures, including the STS (Perrett et al., 1985; Puce and Perrett, 2003). A key feature of learning skills through observation in primates is the ability to infer the intention of movement, and not simply mirroring the movement. Consistent with this, the mirror neurons in the premotor cortex of nonhuman primates have been found to be selective for abstract goals (Rizzolatti et al., 2001; Fogassi et al., 2005). There is some evidence that fish and reptiles can learn paths through observation (Brown, 2015; Lindeyer and Reader, 2010; Wilkinson et al., 2010). However, this knowledge seems isolated to observing paths and is not readily passed down in generations (Lindeyer and Reader, 2010). Taken together, this is suggestive that learning motor skills through observation emerged in early primates.

It should be noted that there are many highly intelligent social behaviors that are observed across vertebrates well outside of the primate taxa. Empathy behaviors are seen in mice (Hofer, 1996; Bartal et al., 2011; Mogil, 2012; Rennie et al., 2013). Play has been observed in reptiles (Kramer and Burghardt, 2010), birds (Fagen, 1981), and mammals (Wojciech, 2009). Jealousy and fairness preferences have been observed in mice (Douglas, 2012). Reciprocity has been observed in fish (Brandl and Bellwood, 2015) and in mice (Viana et al., 2010). Complex understanding and implementation of social hierarchies have been observed in fish (Whoriskey, 1991; Grosenick et al., 2007; Reebs, 2010) and mice (Haller and Kruk, 2006). Kin recognition has been observed even in fish (Fricke, 1974; Spence et al., 2008; Reebs, 2010; Spence, 2011), reptiles, and non-primate mammals (Brennan and Kendrick, 2006). And movement imitation has been observed in reptiles (University of Lincoln, 2014) and rats (Seyfarth and Cheney, 2013). And even gaze following has been seen in reptiles (Simpson and O’Hara, 2018).

But despite the impressive complexity of these social behaviors, none of these in fact require the theory of mind or learning through observation, and all of them can be explained by the valence and mapping functionality of the vertebrate brain. Take empathy, for example, it merely requires cues of satisfaction or dissatisfaction of conspecifics to be “hardwired” to elicit similar valence responses in an individual the same way that predator smells are hardwired to elicit fear. Consistent with this, “pain” neurons in rats that get activated when in pain, also get activated when they see the pain in others (Netherlands Institute for Neuroscience—KNAW, 2019). Kin recognition, social hierarchies, jealousy, and play can all be explained in the same way—through cue recognition and hardwired valence. Gaze following and movement imitation can be explained as merely an impressively complex reflex.

Evidence That Language and Music Emerged in Early Humans

The key features of human language that differentiates it from other forms of animal communication seems to be the ability to “name” objects and organize these names with “grammar” (Berwick and Chomsky, 2017; Terrace, 2019). Despite many attempts, non-human primates have been shown to be unable to learn such human-like language (Terrace et al., 1979; Thompson and Church, 1980). A key feature in the ontogenetic development of language learning in humans seems to be “joint attention,” whereby a child and a parent can jointly attend to the same object, enabling the process of “naming” (Carpenter and Call, 2013). Attempts to demonstrate joint attention in nonhuman primates have shown negative results (Warneken and Tomasello, 2006; Warneken et al., 2006).

Additionally, music seems to be a human universal, shown across all human cultures (Brown and Jordania, 2011). A key feature of music is the ability to engage in “beat-based timing,” whereby a human can tap to a beat. Despite this being trivial in humans, nonhuman primates cannot learn to synchronize taps with an auditory or visual metronome, even after a year of training (Zarco et al., 2009; Honing et al., 2012). Nonhuman primates also struggle with relative pitch perception more than humans do, struggling to generalize melodies to transpositions (Hulse et al., 1984; D’Amato, 1988; Ralston and Herman, 1995; Ghazanfar, 2002).

Taken together, this is suggestive that language with words and grammar, along with key features of music, emerged in early humans.

Evidence for Proposed Function of Specific Brain Modifications

Evidence That the Brain of Early Bilaterians Contained Valence Neurons and Neuromodulators Which Together Enabled Taxis Navigation

Many of the sensory neurons within early diverging bilaterians such as C. elegans can be interpreted as valence neurons. Many sensory neurons in C. elegans are directly modulated by internal states and thereby seem to directly encode valence as opposed to raw sensory information: hunger peptides modulate responses of neurons sensitive to food smells, pain, and various aversive smells (Davis et al., 2017; Lau et al., 2017; Rhoades et al., 2019). Further, temperature-sensitive neurons do not actually reliably encode temperature, but rather activate when a temperature rises above a homeostatic baseline—in other words, it is encoding the “negative valence” of temperature, not temperature per se (Luo et al., 2014). Food smells that trigger approach when hungry often have no effect when C. elegans is satiated (Davis et al., 2017). In C. elegans, some cues like carbon dioxide, which can signal both food as well as predators, shift from attractive when hungry to aversive when well fed depending on hunger state (Rengarajan et al., 2019).

It is likely that the cnidarian-bilaterian common ancestor had neurons that also contained neuropeptides that signaled internal states (Jekely, 2013), but the use of these peptides for such coordinated navigational decisions (locally search area, or relocated entirely, or rest) seems to either be unique to or have been highly elaborated in Bilateria. In Cnidaria, such neuropeptides seem to have primarily modulated independent reflexes, such as the likelihood of nematocyst release (Kass-Simon and Pierobon, 2007) or the swallowing reflex (Barron et al., 2010).

In early bilaterians, neuromodulators seem to have played a large role in navigational decisions, especially when it came to persisting navigational states. For example, in C. elegans, dopamine seems to trigger dwelling behavior whereby an animal engages in local area restricted search (Sawin et al., 2000), while octopamine seems to trigger roaming behavior whereby an animal relocates entirely (Churgin et al., 2017). Throughout Bilateria, neuromodulators have a remarkably consistent template for triggering affective states (Fellous, 1999; Lövheim, 2012).

The neural mechanisms of associative learning are remarkably conserved across bilateria. Across bilateria, animals use cAMP as well as NMDA and AMPA receptors in learning processes (Kandel, 2001, 2006; Dubnau et al., 2002; Farooqui et al., 2003; Glanzman, 2010; Hawkins and Byrne, 2015). And further, “third factor” neuromodulators such as dopamine and serotonin are involved in gating both presynaptic and postsynaptic learning processes across both vertebrates and invertebrates including diverse invertebrates such as crickets (Hammer and Menzel, 1998; Farooqui et al., 2003; Vergoz et al., 2007), fruit flies (Burke et al., 2012; Liu et al., 2012), honeybees (Hammer and Menzel, 1998; Farooqui et al., 2003; Vergoz et al., 2007), and C. elegans (Kusayama and Watanabe, 2000; Qin and Wheeler, 2007).

Further consistent with shared mechanisms of learning and affect, even early diverging invertebrates, such as C. elegans and planarians, show addiction to similar chemicals that manipulate dopamine and other monoamines as vertebrates do (Kusayama and Watanabe, 2000; Barron et al., 2009, 2010; Devineni and Heberlein, 2009; Kaun et al., 2011; Søvik and Barron, 2013).

Evidence That the Pallial-BG-Tectal Network Performed the Function of Model-Free Reinforcement Learning in Early Vertebrates, and Was Applied to Enable Map-Based Navigation, Interval Timing, and Omission Learning

The proposed “breakthrough” of early vertebrates was the emergence of model-free RL of the kind which included spatial maps and sensitivity to interoceptive information. Studies of the function of the structures that emerged in early vertebrates corroborate this idea. Substantial evidence suggests that homologous regions of the pallium generate a spatial map across vertebrates. The ability to navigate spatial maps is abolished when hippocampus homologs (present in the pallium) are lesioned across many early diverging vertebrates (reviewed in Murray et al., 2018), including goldfish and turtles (López et al., 2003; Durán et al., 2010; Broglio et al., 2015). Further, border cells, head direction cells, place cells, and velocity cells have been observed in the hippocampus-like structures across vertebrates, including the lateral pallium of early diverging vertebrates such as teleost fish (Vinepinsky et al., 2018, 2020), which is what you would expect if the lateral pallium were the homolog of the hippocampus and the neural substrate of the “spatial map”. Further, lesions to hippocampal homologs that impair spatial navigation also impair time perception (Meck et al., 1984, 1987, 2012; Melgire et al., 2005; Balci et al., 2009; Yin and Meck, 2014; Lucon-Xiccato and Bisazza, 2017), consistent with the idea that the pallium incorporated representations of both time and space. Time cells have also been found in the hippocampal complex (Eichenbaum, 2014), and time has been shown to be represented in the hippocampus as a map, just as space is (Oprisan et al., 2018). The cerebellum, another uniquely vertebrate structure, is also shown to be critical to absolute timing tasks (reviewed in Breska and Ivry, 2016).

Temporal difference learning signals are also observed in common vertebrate structures such as midbrain dopamine neurons, lateral habenula, and the basal ganglia across vertebrates, including early diverging vertebrates such as teleost fish (Li, 2012; Cheng et al., 2014). Further, the specific circuitry of the basal ganglia, which is conserved even in the earliest diverging vertebrates such as the lamprey, has been shown to be entirely consistent with a class of model-free learning algorithms called “actor-critic” models (Grillner and Robertson, 2016).

The specific neural substrates of avoidance and omission learning also corroborate the idea that the pallial-BG-tectal implemented a model-free RL algorithm. In model-free RL, we would expect avoidance learning to work in the following way. When an animal experiences unexpected pain, there should be a decline in dopamine (negative reward prediction), and they are driven to escape via amygdala-brainstem circuitry. Whenever pain is offset, this should lead to rebound excitation of dopamine (positive reward prediction, or “relief”). If specific actions consistently precede the offset of pain, this dopamine burst will drive plasticity in striatal circuits, which learn the contingency between a CS predictive of pain and the dopamine reward of relief. If this happens enough times, the CS will cease to be scary (pavlovian responses fade) and will simply drive the avoidance behavior that has been reinforced. Such an interpretation of avoidance learning has been suggested by others (Oleson et al., 2012).

The evidence supports exactly the above mechanism for how avoidance learning works in vertebrates. First, as expected from the above model, it has been shown that the basal amygdala to basal ganglia circuitry is required for active avoidance but not escape (which requires a basal amygdala to central amygdala circuit; Bandura and Rosenthal, 1966; LeDoux et al., 2016). Second, it is shown that during initial learning dopamine declines in the striatum, but after avoidance is well learned, dopamine increases, and the increase in dopamine is predictive of the performance of avoidance (Oleson et al., 2012). Crucially, this dopamine does not increase in the striatum if the pain is always inescapable. It was even shown that stimulating dopamine neurons during aversive cue increases active avoidance performance (Wenzel et al., 2018) while inhibiting dopamine in the striatum prevents avoidance (Wenzel et al., 2018). Third, it has been shown that pain or other noxious stimuli offset drives dopamine bursts in the striatum (Navratilova et al., 2015). Fourth, it explains why pavlovian fear responses fade while avoidance can maintain itself (Annau and Kamin, 1961; Kamin et al., 1963; Blanchard and Blanchard, 1969; Starr and Mineka, 1977; Kapp et al., 1979; Mineka, 1979; Bolles and Fanselow, 1980). And fifth, it is shown that avoidance responses seem to show signs of being habitual (extensive avoidance training makes the amygdala not required for avoidance anymore (Lázaro-Muñoz et al., 2010), although still being necessary for the acquisition (Choi et al., 2010).

This interpretation of the pallial-BG-tectal system is consistent with others who similarly suggest these structures together implement a model-free learning algorithm (Joel et al., 2002; Stephenson-Jones et al., 2013; Grillner and Robertson, 2016).

Evidence That the Neocortex Served the Function of Enabling Internally Invoked Simulations in Early Mammals, Which Was Applied for VTE, Counterfactual Learning, and Episodic Memory

The specific proposed model of ACC-sensory function presented in this article is consistent with various observations. There is emerging consensus that the neocortex, especially the sensory cortex, implements some form of a generative model (reviewed in Kersten et al., 2004; Knill and Pouget, 2004; Parr and Friston, 2018). Consistent with such a generative model, imagining a stimulus activates the same exact representations in the sensory neocortex as the stimulus itself (O’Craven and Kanwisher, 2000; Doll et al., 2015; Pearson et al., 2015). Further, lesions to specific areas of the sensory cortex create impairments both in perception and imagination within the same modality (Bisiach and Luzzatti, 1978; Farah et al., 1992).

There is less consensus regarding the function of the prefrontal cortex, which is further confused by the fact that the nomenclature of prefrontal regions in rodents often confuses the homology of these regions between rodents and primates. Recent research suggests that PL, IL, cg1, and cg2 regions of the rat prefrontal cortex are all homologous with the anterior cingulate cortex and medial cingulate cortex of primates (Laubach et al., 2018; van Heukelum et al., 2020). Through this lens, emerging evidence is consistent with the proposal that the frontal cortex builds a model of intent, which is used to trigger internal simulations.

For example, lesion studies provide evidence that the prefrontal cortex triggers internally invoked simulations in rodents: frontal lesions create impairments in all three proposed forms of internally invoked simulations, including vicarious trial and error, episodic memory, and counterfactual learning. More specifically, rodents with lesioned or inactivated prefrontal areas reduce their vicarious trial and error behavior (Schmidt et al., 2019), no longer have goal representations in the hippocampus (Ito et al., 2015), and are impaired in episodic memory tasks (Frankland et al., 2004). Evidence also suggests rats with frontal lesions become impaired at causal reasoning tasks, consistent with an inability to engage in counterfactual learning (Jones et al., 2012). Frontal lesions also make rodents uniquely impaired at spatial navigation tasks that require preplanning (Granon and Poucet, 1995). Rodents with frontal lesions struggle to stay on task in an ongoing plan and often do actions out of sequence (Seamans et al., 1995). Lesions of some of these cingulate regions in rats impair their ability to incorporate “effort” into their decision-making, as if they are unable to actually imagine how hard taking an action would be (Walton et al., 2003; Schweimer et al., 2005; Hu et al., 2021). Disconnecting areas of the cingulate from the amygdala in rats has the same effect (Floresco and Ghods-Sharifi, 2006). Further, temporarily inhibiting the hippocampus during tests where mice are asked to recall episodic memories abolishes their ability to do so (Crystal, 2013), consistent with the model whereby the frontal cortex makes inquiries for an episodic memory through its projections to the hippocampus.

Recording studies within the prefrontal cortex of rats also show evidence of modeling “intent”. In complex tasks sequences, ensembles of neurons in the prefrontal areas of rats become highly selective for the specific places in the task sequence and reliably track progress towards imagined goals (Cowen and McNaughton, 2007; Fujisawa et al., 2008). Further, in working memory tasks when rats must do tasks from memory without the presence of any cues to follow, neurons in the prefrontal cortex show delay activity (Baeg et al., 2003). The observation that the prefrontal cortex of rodents, such as the ACC, becomes particularly activated by surprise (Bryden et al., 2011) and error (Totah et al., 2009) is consistent with the model whereby during hard choices, the ACC pauses behavior and triggers internally invoked simulations to play out possible futures and resolve any conflicts or errors.

Even primates, who have many more prefrontal regions, still have these more ancient agranular frontal regions such as the ACC, which is homologous to most of the prefrontal cortex of non-primate mammals (Laubach et al., 2018; van Heukelum et al., 2020) and share many of these functions. The ACC in humans also gets uniquely activated during “errors” (Dehaene et al., 1994) and conflict (Carter et al., 1998; Braver et al., 2001). The ACC in humans also seems to encode locations in task sequences (Koechlin et al., 2002). Single neuron recordings of ACC within nonhuman primates show single neuron level selectivity for different actions (Nakamura et al., 2005), as well as selectivity for the serial order of tasks irrelevant of the actual movements made (Procyk et al., 2000; Procyk and Joseph, 2001). In humans, hippocampal lesions also create severe impairments both in the ability to recall autobiographical events and imagining potential future events (Addis et al., 2007; Hassabis et al., 2007). ACC lesions in nonhuman primates also impair the ability to stay on task during delay periods (Rudebeck et al., 2014).

The supposed “default mode network,” consisting of the mPFC, ACC, hippocampus, and the posterior cingulate all become uniquely active both during the retrieval of autobiographical memories as well as during imagining potential futures (Hassabis and Maguire, 2009; Martin et al., 2011; Andrews-Hanna et al., 2014b). Although some areas of the default mode network in primates include uniquely primate areas (such as granular prefrontal cortex), the DMN in primates also includes more ancient areas such as the ACC, which is also part of the purported DMN in rodents (Lu et al., 2012; Stafford et al., 2014; Grandjean et al., 2020), suggestive that such a network was present even in the very early neocortex.

It is interesting to note how many behavioral abilities that are attributed to the neocortex are readily performed by many animals that do not have a neocortex. Take object recognition for example. Complex object recognition, even of human faces, has been shown across phyla, including fish (Newport et al., 2016; Schumacher et al., 2016). Object recognition that is insensitive to changes in rotation and transformation has also been shown across non-mammalian phyla, also including fish (Newport et al., 2018). Object identification despite occlusion, demonstrating inference, has also been shown across non-mammalian phyla, including fish (Sovrano and Bisazza, 2007). However, consistent with the proposed function of the neocortex, mammals seem to be unique in their use of “mental rotation” to solve object recognition tasks. Mental rotation has been suggested by studies in monkeys, humans, and sea lions (Mauck and Dehnhardt, 1997), and negative results have been found in pigeons (Hollard and Delius, 1982). This would again represent the usage of such “simulation” functionality of the neocortex.

If simulating is so adaptive, then why did it only evolve in mammals and not in non-mammal vertebrates? One possible explanation is the fact that early mammals were likely arboreal species (Fröbisch and Reisz, 2009). Navigating tree branches with far eyesight presents unique challenges and evolutionary pressures not previously experienced: namely, irreversible choices. As a small animal living in trees, they had to plan their route well in advance when navigating across branches. And it is likely they had to very regularly experience novel branches. This perhaps created pressure for “vicarious trial and error”. Consistent with this, computational models have found that the usefulness of “planning” is directly tied to visual range. Visual range in water is so poor that computational models suggest planning in water is barely useful at all (Mugan and MacIver, 2020). However, inconsistent with this proposal is the fact that there are plenty of invertebrate and reptilian arboreal animals without the neocortex. Another hypothesis, as suggested by Mugan and MacIver (2020), is that many intellectual abilities occurred only in mammals due to the unique evolution of endothermy, which enabled much faster neural processing than when ectothermic. The neocortical generative model perhaps came at a high computational cost, and as such, without endothermy, nonmammalian vertebrates were prevented from garnering this adaptation. Consistent with this, the non-mammal vertebrates that demonstrate the most impressive intellectual abilities, birds, also seem to have evolved both endothermy (Walter and Seebacher, 2009) and neocortical-like structures (Tosches, 2021) through their own convergent evolution.

Evidence That the gPFC-PSC Network Performed the Function of “Mentalizing” in Early Primates, and Was Similarly Applied for Anticipating Future Needs, Theory of Mind, and Learning Through Observation

When identifying the neural substrates of mentalizing, it is important to draw a distinction between emotional contagion (e.g., “emotional empathy”) and mentalizing (e.g., “theory of mind”; Shamay-Tsoory, 2011). Each is a different process, and evidence suggests they have separate neural substrates. Emotional contagion is the process of reflexively adopting the emotional state of a conspecific based on various cues that reveal their emotions. In contrast, mentalizing is the process of actively considering another’s perspective, knowledge, or emotions by imagining oneself in another’s shoes. Emotional contagion has been shown in non-primate mammals such as rodents (Langford, 2006) whereas, as discussed above, mentalizing seems to be unique to primates. Consistent with this, the key frontal substrate for emotional contagion seems to be the ACC, which is homologous to regions of the frontal cortex in early mammals. For example, certain areas of the ACC are activated both by one’s own experience of pain and by watching a conspecific experience pain (Derbyshire, 2000; Jackson et al., 2006). In contrast, the neocortical areas most commonly implicated in mentalizing and theory of mind are not the ACC but instead contained within the gPFC-PSC network. More specifically, metanalysis that has reviewed a multitude of imaging studies have primarily implicated the dmPFC (BA 8, 9), amPFC (BA 10), TPJ, and STS as areas that are uniquely activated by tasks that require the theory of mind (Carrington and Bailey, 2009; Van Overwalle and Baetens, 2009). The idea that emotional contagion evolved first, subserved by older frontal regions, and ToM then emerged later, subserved by newer primate frontal regions, has similarly been proposed by other (de Waal, 2008). Further consistent with this, damage to granular areas of mPFC leaves the ability to simulate past or future imagined scenes intact but impairs the ability to imagine yourself in that scene. In contrast, hippocampal lesions impair the ability to simulate complex scenes but leaves the ability to imagine yourself in those scenes intact (Kurczek et al., 2015).

Several studies have shown a linear relationship between the volume of orbital PFC, medial PFC, and social network size (Powell et al., 2010; Lewis et al., 2011; Powell et al., 2012). It has also been observed that gray matter specifically in mPFC increases when macaques move from smaller to larger social groups (Sallet et al., 2011). Further, one of the only frontal regions that is disproportionately larger in humans than other primates is BA 10 (Semendeferi et al., 2001). All of this is consistent with the concept that these areas of gPFC subserve mentalizing, and that mentalizing was a key feature in supporting larger social groups in early primates and even larger social groups in early humans.

The above evidence implicates only a relatively small portion of the granular prefrontal cortex (BA 8, 9, 10) is related to the process of mentalizing. This suggests that the general function of the granular prefrontal cortex is not mentalizing, but rather mentalizing is merely a function subserved by specific areas of the granular prefrontal cortex. However, a broader examination of the different types of tasks involved in mentalizing implicates a much broader suite of areas within gPFC. For example, within “ToM” it has been argued that there is a dissociation between “affective ToM” and “cognitive ToM” (Shamay-Tsoory and Aharon-Peretz, 2007). Affective ToM refers to the ability to take the emotional perspective of another, whereas cognitive ToM refers to the ability to take the physical sensory perspective of another. When examining the neural substrates of these different types of ToM, broader regions of gPFC get implicated in addition to just medial BA 8, 9, and 10. Specifically, evidence suggests that the vmPFC (BA 11, 12, 14) is specifically necessary for the affective aspects of ToM whereas dlPFC (BA 44, 45, 46) is specifically necessary for the cognitive aspects of ToM. vmPFC lesions selectively impair affective ToM and not cognitive ToM (Shamay-Tsoory and Aharon-Peretz, 2007)—although admittedly given that this was examined in humans, some of these lesions may have included areas of standard mentalizing mPFC (BA 8, 9, 10) as well as areas of the ACC (BA 24, 32). In contrast, dlPFC disruption seems to selectively impair cognitive ToM (visual perspective taking; Conson et al., 2015; Qureshi et al., 2020) but not affective ToM (Kalbe et al., 2010). Disrupting the right TPJ function also disrupts visual perspective taking, while disrupting dmPFC does not (Martin et al., 2020). Some of these dissociations may explain why there is evidence that some individuals with extensive lesions of ACC and mPFC can retain features of self-awareness (Philippi et al., 2012) and theory of mind (Bird et al., 2004).

Another dissociation may be in what specific type of inquiry an individual is making into another’s mind. fMRI evidence suggests that dmPFC and vlPFC get more activated when considering how to make an observed character feel better than when merely trying to identify the observed character’s emotion (Reniers et al., 2013). On the other hand, the same study found that the dlPFC gets more activated when a participant is asked to “imagine how theythemselves would feel” in an observed character’s situation than when merely trying to identify the observed character’s emotion.

There may be even more nuanced dissociations between the roles of various areas of the granular prefrontal cortex and various features of mentalizing. When comparing activation during first-person perspective taking to third-person perspective taking, areas of vlPFC (BA 44) and premotor cortex (BA 6) seem to be uniquely implicated in third-person perspective taking (David et al., 2006). Further, fMRI evidence suggests that amPFC (BA 10) builds models of person-specific theory of mind, while dmPFC (BA 8 and 9) is used for general processes of the theory of mind (Welborn and Lieberman, 2015). Other fMRI evidence suggests that amPFC is activated in any self-referential thinking, and only selectively during the considering of one’s own mental state (or presumably someone elses’), does dmPFC also become activated (Andrews-Hanna et al., 2010, 2014a). Left dlPFC may be specific for considering the mental state of others, while right dlPFC may be specific for considering the mental state of yourself (Otsuka et al., 2011). Yet another dissociation that has been proposed is that the TPJ is responsible for making mental inferences about others (such as about goals or beliefs), while the mPFC is responsible for attributing overall personality traits about yourself and others (Van Overwalle and Baetens, 2009).

Consistent with the idea that mentalizing about yourself is repurposed for the task of mentalizing about others, these same areas of the granular prefrontal cortex also get activated when considering your own mind state, including dmPFC (Gusnard et al., 2001; Gallagher and Frith, 2003; Amodio and Frith, 2006; Gilbert et al., 2006; Ochsner, 2008; Van Overwalle, 2009; Bzdok et al., 2012; Moran et al., 2012), amPFC/vmPFC (Lane et al., 1997; Phan et al., 2004; Lotze et al., 2007), dlPFC (Otsuka et al., 2011), and TPJ (Kelly et al., 2014). The general idea that there are common substrates, especially in mPFC, for imagining yourself in a future or past situation as well as for imagining being in the mind of another has been reviewed elsewhere (Buckner and Carroll, 2007; Jenkins and Mitchell, 2011). When evaluating your own personality traits or receiving evaluations of yourself by others, the same mentalizing network in mPFC activates (Ochsner et al., 2005; Gilbert et al., 2006). Further consistent with this idea that theory of mind of others is “bootstrapped” on a generative model of yourself, is the fact that the concept of “self” emerges first in child development before the theory of mind emerges (Rochat, 1998; Ritblatt, 2000; Keenan et al., 2005).

The argument in this article is that the overall function of mentalizing was not only applied to understanding the cognitive and emotional states of yourself and others (“theory of mind”), but also for anticipating future needs. The idea being that mentalizing about yourself in the future is the fundamental mechanism by which you can anticipate what your intentions and desires will be in that future situation, and thereby generate behaviors necessary for fulfilling those needs. A multitude of findings are consistent with the idea that the areas of the gPFC-PSC network specific for the cognitive theory of mind, such as the dlPFC, seem to also be key substrates for anticipating these future need states. Damage to the dorsolateral prefrontal cortex (dlPFC) dramatically impairs performance on tasks requiring abstract goal representations such A-not-B tasks (Diamond and Goldman-Rakic, 1989), consistent with the idea that dlPFC is the neural substrate of such goal representations and that it cascades these goals down the motor hierarchy where representations are progressively more “habitual”. dlPFC uniquely activates when considering future rewards but not present rewards (McClure, 2004; Tanaka et al., 2004; Berns et al., 2007; McClure et al., 2007; Kim et al., 2008). dlPFC is selectively activated when choosing a delayed reward over an immediate reward (McClure et al., 2004, 2007; Weber and Huettel, 2008). It is activated when avoiding temptation such as when dieters exert self-control (Hare et al., 2009). Damage to dlPFC leads to impairment in the ability to give up immediate rewards for future ones (Figner et al., 2010). And temporary inactivation of the dlPFC impairs people’s ability to forgo an exciting high-risk high reward option for a lower-reward lower-risk option (Knoch et al., 2006). Evidence also suggests that some types of “rule-based” categorization are unique to primates—consistent with the idea that gPFC enables the presentation of abstract goals and intentions (Soto and Wasserman, 2014).

This article also argues that the function of mentalizing in the gPFC-PSC network was also applied to enable learning through observation. There are several pieces of evidence for this. The same neurons in the premotor cortex activate for the “intentions” within yourself as when observing those same intentions in others (Amodio and Frith, 2006)—these have been called “mirror neurons.” Further, such mirror neurons have been identified in nonhuman primates (Gallese et al., 1996; Rizzolatti et al., 1996; Fogassi et al., 2005) while I am not aware of reports of mirror neurons in non-primate mammals. Although there is still controversy surrounding whether these mirror neurons are in fact modeling others’ movements (Hickok, 2009; Churchland, 2011).

Taken together, the above evidence suggests that a key, if not the primary, function of the gPFC-PSC network was to enable mentalizing—defined as building a generative model of one’s own mind state, inclusive of intentions, emotions, and knowledge. And this function was thereby applied in numerous adaptive ways, such as modeling the mind of others (theory of mind), anticipating future needs, and learning through observation.

Evidence That Modifications to the Arcuate Fasciculus Enabled Both Language and Music in Early Humans

There are compelling parallels between the connectivity of the AF-BG network and modern machine learning language models. The most popular language models, which have been shown to be capable of remarkably flexible sentence production and completion, are long-short-term memory (LSTM) models. LSTM models have “working memory” gates whereby the model can learn to maintain “context” from past words and sentences to the current word production (Mikolov et al., 2015; Dennis Singh and Lee, 2017; Ororbia et al., 2017). This is similar to the function proposed by frontal-cortex-basal-ganglia loops (O’Reilly and Frank, 2006). Further, these networks are recursive, whereby the next word is a function of the previous word, similar to the extensive recursive connectivity of both posterior and frontal cortex. The unique lateralization of language and music relative to other neocortical functions may be a requirement for the rapid sequence prediction in language and melody production.

Consistent with this interpretation of the AF-BG network, the arcuate fasciculus, and the basal ganglia are both highly implicated in language and music. Damage to AF impairs fluency, comprehension, and verbal working memory (Binder and Desai, 2011; Schomers et al., 2017). The development of the AF is correlated to the development of language abilities in children (Friederici, 2011; Yeatman et al., 2011; Skeide et al., 2016; Goucha et al., 2017; Schomers et al., 2017). The strength of connections in AF is correlated with word learning performance (Lopez-Barroso et al., 2013). It has long been known that language related tasks such as sentence generation activate and require a functioning Broca’s area, Wernick’s area, and the basal ganglia (Brown et al., 2006). However, less appreciated is the fact that music tasks such as melody generation activate the exact same network (Koelsch et al., 2002; Brown et al., 2006), although with a bias towards the right hemisphere, while language is biased towards the left hemisphere (Sammler et al., 2015). Consistent with a common neural implementation, children with language impairments also show musical impairments (Jentschke et al., 2008).

The direct projection from the motor cortex to laryngeal motoneurons, which is not found in nonhuman primates, is clearly the projection by which humans can control speech sounds (Simonyan and Horwitz, 2011). However, this was likely an addition after language had already emerged (Hewes et al., 1973; Corballis, 2002)—and not the fundamental modification that enabled language. Evidence for this can be seen in the fact that complex gestural languages (with words and grammar) are seen in deaf communities all over the world, none of which requires these descending laryngeal projections. Further, the neural substrates of such gestural language are highly overlapping with those of vocal-verbal language, implying a shared circuitry for “language,” independent of the modality (Neville et al., 1997). As such, I hypothesize that the big breakthrough that unlocked music and language was in fact the AF-BG network, and direct control of laryngeal motor neurons came later, enabling verbal language to replace the shorter-range gestural language.

Discussion

Here I have argued that the proposed ordered set of five breakthroughs provides a first approximate explanation of a diverse set of both brain and behavioral modifications through major milestones in human brain evolution. This model of brain evolution provides a useful simplification through which to interpret brain modifications and the progressive complexification of behavior through phylogenetic refinement (Cisek, 2019).

Caveats

By summarizing such a long history into only a handful of “breakthroughs,” I am undeniably simplifying the actual story. The objective is to provide a view of the “forest” of evolution at the cost of describing “the trees”. This approach will inevitably miss some important changes in brains and behavior. However, as argued above, a surprisingly broad set of brain structures and behaviors can be understood through a remarkably small number of “breakthroughs.” Perhaps this is not so surprising, given that perhaps brain evolution often occurred in fits and starts, where some adaptive structure was stumbled upon, rapidly elaborated on, and then brains remained relatively stable for a long period of time afterward. This can be seen simply in the transition from our primate ancestors to homo sapiens. For 30 million years primate brains remained mostly unchanged. And then, over the last 1 million years our ancestors’ brains expanded by a multiple of three. A million years ago humans could not speak flexible language, and 100,000 years ago they could—this is a split second from an evolutionary perspective.

It should also be noted that there is scant research with identically designed behavioral tests of animals across different phyla. This makes it both challenging and perilous to compare behavioral abilities across distant species. Many of the behavioral studies cited were not identically designed for each species tested nor evaluated using the same methods for different species, and hence such results must be interpreted with caution. For example, the experimental designs of asking a mouse to answer “unexpected Questions” from its own past are of course subtly different from asking the same Question of a cephalopod, who has a different sensory repertoire and is inevitably asked the Question in a different environment. As such, differences in results may be a consequence of the experimental design as opposed to differences in abilities. Further, comparative behavior research likely contains a positive results bias—whereby negative findings in animals are reported substantially less frequently than positive results (Fanelli, 2011). As such, there are scant reports of negative findings in general, and it is challenging to conclude the lack of ability in an extant animal simply because of a lack of reports of its presence—the absence of evidence is not the evidence of absence. As such, many pieces of this model are currently speculative and will require further studies comparing behavioral abilities across taxa.

Comparison to Other Work

This work is highly inspired by others who have proposed ordered modifications in the evolution of brains: Eva Jablonka and Simona Ginsburg’s retelling of the evolution of learning systems (Vredeveldt et al., 2011), Paul Cisek’s theory of phylogenetic refinement (Cisek, 2019), and Elisabeth Murray, Steven Wise, and Kim Garham’s work on the evolution of memory systems (Murray et al., 2017). All three of these take an evolutionary approach to understanding how brains work today, by virtue of retelling the evolutionary steps by which they came to be.

My goal in this article is to add to this corpus of work in four ways. First, to provide an initial template and “first approximation” of the entire evolutionary story of the human brain, from the first bilaterains to the first homo sapiens. Second, to simultaneously hypothesize the emergence of both brain regions and adaptive behaviors that these brain structures enabled. Third, to directly incorporate dependencies between each sequence of changes. And fourth, to attempt to explain broad behaviors using common neural innovations, as opposed to specifically focusing on a type of behavior (such as learning or memory). In this last sense, this article is in some ways an attempt to bring together the ideas of Jablonka and Ginsburg, Cisek, and Murray, Wise, and Garham with the evolutionary neuroscience work of specific brain structures from Kaas (2009) and Striedter and Northcutt (2020).

The most well-known model of brain evolution is Paul MacLean’s Triune Brain (MacLean, 1990), where he named the three brain systems that he argued evolved sequentially: the “Reptilian Complex,” the “Paleomammalian Complex,” and the “Neomammalian Complex”. Despite how well known the model is, it has been highly discredited and demonstrated to be completely wrong (Cesario et al., 2020). A key flaw in MacLean’s model is the conceptualization of extant animals such as reptiles or monkeys as somehow “lower” animals, and humans as “higher.” His idea that the human brain has a “reptile brain” and a “monkey brain” within it, incorrectly conceptualizes an extant monkey brain as being more primitive than a human brain. The truth is of course that all extant animals have gone through evolution for the same amount of time and evolved from common ancestors. In contrast, the model of five breakthroughs proposed here is crafted to discuss specifically the evolutionary process only within the human lineage and is not meant to be used to make comparisons to extant animals today.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Author contributions

MB conceived the overall theory and wrote the entire manuscript.

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    AbeH.LeeD. (2011). Distributed coding of actual and hypothetical outcomes in the orbital and Dorsolateral prefrontal cortex. Neuron70, 731741. 10.1016/j.neuron.2011.03.026

  • 2

    AboitizF. (2012). Gestures, vocalizations, and memory in language origins. Front. Evol. Neurosci.4:2. 10.3389/fnevo.2012.00002

  • 3

    AboitizF.Garciía VR. (1997). The evolutionary origin of the language areas in the human brain. A neuroanatomical perspective. Brain Res. Rev.25, 381396. 10.1016/s0165-0173(97)00053-2

  • 4

    AboitizF.AboitizS.GarcíaR. R. (2010). The phonological loop. Curr. Anthropol.51, S55S65. 10.1086/650525

  • 5

    AboitizF.GarcíaR. R.BosmanC.BrunettiE. (2006). Cortical memory mechanisms and language origins. Brain Lang.98, 4056. 10.1016/j.bandl.2006.01.006

  • 6

    AbramsonC. I.WellsH. (2018). An inconvenient truth: some neglected issues in invertebrate learning. Perspect. Behav. Sci.41, 395416. 10.1007/s40614-018-00178-8

  • 7

    AbramsonC. I.ArmstrongP. M.FeinmanR. A.FeinmanR. D. (1988). Signaled avoidance in the eye withdrawal reflex of the green crab. J. Exp. Anal. Behav.50, 483492. 10.1901/jeab.1988.50-483

  • 8

    AdamsR. A.ShippS.FristonK. J. (2012). Predictions not commands: active inference in the motor system. Brain Struct. Funct.218, 611643. 10.1007/s00429-012-0475-5

  • 9

    AddisD. R.MoscovitchM.McAndrewsM. P. (2007). Consequences of hippocampal damage across the autobiographical memory network in left temporal lobe epilepsy. Brain130, 23272342. 10.1093/brain/awm166

  • 10

    AldhousP. (2015). Wellcome. The Smartest Animal You’ve Never Heard Of. Available online at: https://mosaicscience.com/extra/smartest-animal-youve-never-heard/. Accessed March 23, 2021.

  • 11

    AlstermarkB.OgawaJ.IsaT. (2004). Lack of monosynaptic corticomotoneuronal EPSPs in rats: disynaptic EPSPs mediated via reticulospinal neurons and polysynaptic EPSPs via segmental interneurons. J. Neurophysiol.91, 18321839. 10.1152/jn.00820.2003

  • 12

    AmodioD. M.FrithC. D. (2006). Meeting of minds: the medial frontal cortex and social cognition. Nat. Rev. Neurosci.7, 268277. 10.1038/nrn1884

  • 13

    AnctilM. (2009). Chemical transmission in the sea anemone Nematostella vectensis: a genomic perspective. Comp. Biochem. Physiol. Part D Genomics Proteomics4, 268289. 10.1016/j.cbd.2009.07.001

  • 14

    AndersonJ. M.GilmoreR.RoperS.CrossonB.BauerR. M.NadeauS.et al. (1999). Conduction aphasia and the arcuate fasciculus: a reexamination of the wernicke-geschwind model. Brain Lang.70, 112. 10.1006/brln.1999.2135

  • 15

    Andrews-HannaJ. R.ReidlerJ. S.SepulcreJ.PoulinR.BucknerR. L. (2010). Functional-anatomic fractionation of the brain’s default network. Neuron65, 550562. 10.1016/j.neuron.2010.02.005

  • 16

    Andrews-HannaJ. R.SaxeR.YarkoniT. (2014a). Contributions of episodic retrieval and mentalizing to autobiographical thought: evidence from functional neuroimaging, resting-state connectivity, and fMRI meta-analyses. NeuroImage91, 324335. 10.1016/j.neuroimage.2014.01.032

  • 17

    Andrews-HannaJ. R.SmallwoodJ.SprengR. N. (2014b). The default network and self-generated thought: component processes, dynamic control, and clinical relevance. Ann. N Y Acad. Sci.1316, 2952. 10.1111/nyas.12360

  • 18

    AnnauZ.KaminL. J. (1961). The conditioned emotional response as a function of intensity of the US. J. Comp. Physiol. Psychol.54, 428432. 10.1037/h0042199

  • 19

    ArdielE. L.RankinC. H. (2010). An elegant mind: learning and memory in Caenorhabditis elegans. Learn. Mem.17, 191201. 10.1101/lm.960510

  • 20

    ArendtD.ToschesM. A.MarlowH. (2015). From nerve net to nerve ring, nerve cord and brain—evolution of the nervous system. Nat. Rev. Neurosci.17, 6172. 10.1038/nrn.2015.15

  • 21

    ArmandJ.OlivierE.EdgleyS. A.LemonR. N. (1997). Postnatal development of corticospinal projections from motor cortex to the cervical enlargement in the macaque monkey. J. Neurosci.17, 251266. 10.1523/JNEUROSCI.17-01-00251.1997

  • 22

    AronA. R. (2006). Cortical and subcortical contributions to stop signal response inhibition: role of the subthalamic nucleus. J. Neurosci.26, 24242433. 10.1523/JNEUROSCI.4682-05.2006

  • 23

    AtlanG.TeremA.Peretz-RivlinN.SehrawatK.GonzalesB. J.PoznerG.et al. (2018). The claustrum supports resilience to distraction. Curr. Biol.28, 2752.e72762.e7. 10.1016/j.cub.2018.06.068

  • 24

    AvcuP.JiaoX.MyersC. E.BeckK. D.PangK. C.ServatiusR. J. (2014). Avoidance as expectancy in rats: sex and strain differences in acquisition. Front. Behav. Neurosci.8:334. 10.3389/fnbeh.2014.00334

  • 25

    BaegE. H.KimY. B.HuhK.Mook-JungI.KimH. T.JungM. W. (2003). Dynamics of population code for working memory in the prefrontal cortex. Neuron40, 177188. 10.1016/s0896-6273(03)00597-x

  • 26

    BalciF.MeckW. H.MooreH.BrunnerD. (2009). Timing deficits in aging and neuropathology. Anim. Models Hum. Cogn. Aging1, 141. 10.1007/978-1-59745-422-3_8

  • 27

    BanduraA.RosenthalT. L. (1966). Vicarious classical conditioning as a function of arousal level. J. Pers. Soc. Psychol.3, 5462. 10.1037/h0022639

  • 28

    BardK. A.Myowa-YamakoshiM.TomonagaM.TanakaM.CostallA.MatsuzawaT. (2005). Group differences in the mutual gaze of chimpanzees (Pan troglodytes). Dev. Psychol.41, 616624. 10.1037/0012-1649.41.4.616

  • 29

    BarronA. B.MaleszkaR.HelliwellP. G.RobinsonG. E. (2009). Effects of cocaine on honey bee dance behaviour. J. Exp. Biol.212, 163168. 10.1242/jeb.025361

  • 30

    BarronA. B.SøvikE.CornishJ. L. (2010). The roles of dopamine and related compounds in reward-seeking behavior across animal phyla. Front. Behav. Neurosci.4:163. 10.3389/fnbeh.2010.00163

  • 31

    BartalI. B.-A.DecetyJ.MasonP. (2011). Empathy and pro-social behavior in rats. Science334, 14271430. 10.1126/science.1210789

  • 32

    BastosA. M.UsreyW. M.AdamsR. A.MangunG. R.FriesP.FristonK. J. (2012). Canonical microcircuits for predictive coding. Neuron76, 695711. 10.1016/j.neuron.2012.10.038

  • 33

    BatesonM. (1975). Mother-infant exchanges: the epigenesis of conversational interaction. Ann. N Y Acad. Sci.263, 101113. 10.1111/j.1749-6632.1975.tb41575.x

  • 34

    BatesonM.KacelnikA. (1997). Starlings’ preferences for predictable and unpredictable delays to food. Anim. Behav.53, 11291142. 10.1006/anbe.1996.0388

  • 35

    BathamE. J.PantinC. F. A. (1950). Phases of activity in the sea-anemone, Metridium senile (L.), and their relation to external stimuli. J. Exp. Biol.27, 377399. 10.1242/jeb.27.3.377

  • 36

    BeckP. D.PospichalM. W.KaasJ. H. (1996). Topography, architecture and connections of somatosensory cortex in opossums: evidence for five somatosensory areas. J. Comp. Neurol.366, 109133. 10.1002/(SICI)1096-9861(19960226)366:1<109::AID-CNE8>3.0.CO;2-7

  • 37

    BeebeB.AlsonD.JaffeJ.FeldsteinS.CrownC. (1988). Vocal congruence in mother-infant play. J. Psycholinguist. Res.17, 245259. 10.1007/BF01686358

  • 38

    BenhamouS.SauveJ. P.BovetP. (1990). Spatial memory in large scale movements: efficiency and limitation of the egocentric coding process. J. Theor. Biol.145, 112. 10.1016/s0022-5193(05)80531-4

  • 39

    BennettM. (2020). An attempt at a unified theory of the neocortical microcircuit in sensory cortex. Front. Neural Circuits14:40. 10.3389/fncir.2020.00040

  • 40

    BennettM. (2021). What behavioral abilities emerged at key milestones in human brain evolution? 13 hypotheses on the 600-million-year phylogenetic history of human intelligence. Front. Psychol. 12:685853. 10.3389/fpsyg.2021.685853

  • 41

    BerkerE. A.BerkerA.SmithA. (1986). Translation of broca’s 1865 report. Localization of speech in the third left frontal convolution. Arch. Neurol.43, 10651072. 10.1001/archneur.1986.00520100069017

  • 42

    BernsG. S.LaibsonD.LoewensteinG. (2007). Intertemporal choice—toward an integrative framework. Trends Cogn. Sci.11, 482488. 10.1016/j.tics.2007.08.011

  • 43

    BerwickR. C.ChomskyN. (2017). Why Only Us: Language and Evolution.Cambridge, MA: MIT Press.

  • 44

    BeyiucM. (1938). The Roundabout Path of the Fighting Fish.10.1111/j.1469-7998.1938.tb07904.x

  • 45

    BillardP. (2020). Comparative study of episodic memory in common cuttlefish (Sepia officinalis) and Eurasian jay (Garrulus glandarius). Available online at: https://hal.archives-ouvertes.fr/tel-03150039v2.

  • 46

    BinderJ. R.DesaiR. H. (2011). The neurobiology of semantic memory. Trends Cogn. Sci.15, 527536. 10.1016/j.tics.2011.10.001

  • 47

    BirdC. M.CastelliF.MalikO.FrithU.HusainM. (2004). The impact of extensive medial frontal lobe damage on ‘Theory of Mind’ and cognition. Brain127, 914928. 10.1093/brain/awh108

  • 48

    Bischof-KöhlerD. (1985). “Zur phylogenese menschticher motivation [On the phylogeny of human motivation],” in Emotion und Reflexivitat, eds EckensbergerL. H.LantermannE. D. (Vienna: Urban and Schwarzenberg), 347.

  • 49

    BisiachE.LuzzattiC. (1978). Unilateral neglect of representational space. Cortex14, 129133. 10.1016/s0010-9452(78)80016-1

  • 50

    BlanchardR. J.BlanchardD. C. (1969). Crouching as an index of fear. J. Comp. Physiol. Psychol.67, 370375. 10.1037/h0026779

  • 51

    BoschT.KlimovichA.Domazet-LosoT.GründerS.HolsteinT. W.JékelyG.et al. (2017). Back to the basics: cnidarians start to fire. Trends Neurosci.40, 92105. 10.1016/j.tins.2016.11.005

  • 52

    BolesL. C.LohmannK. J. (2003). True navigation and magnetic maps in spiny lobsters. Nature421, 6063. 10.1038/nature01226

  • 53

    BollesR. C.FanselowM. S. (1980). A perceptual-defensive-recuperative model of fear and pain. Behav. Brain Sci.3, 291323. 10.1017/s0140525x0000491x

  • 54

    BoschT. C. G.KlimovichA.Domazet-LosoT.GründerS.HolsteinT. W.JékelyG.et al. (2017). Back to the basics: cnidarians start to fire. Trends Neurosci.40, 92105. 10.1016/j.tins.2016.11.005

  • 55

    Bosch-BoujuC.HylandB. I.Parr-BrownlieL. C. (2013). Motor thalamus integration of cortical, cerebellar and basal ganglia information: implications for normal and parkinsonian conditions. Front. Comput. Neurosci.7:163. 10.3389/fncom.2013.00163

  • 56

    BrandlS.BellwoodD. (2015). Coordinated vigilance provides evidence for direct reciprocity in coral reef fishes. Sci. Rep.5:14556. 10.1038/srep14556

  • 57

    BräuerJ. (2014). “Chapter 10—what dogs understand about humans,” in The Social Dog: Behaviour and Cognition, eds KaminskiJ.Marshall-PesciniS. (Academic Press), 295317. Available online at: https://www.google.com/books/edition/The_Social_Dog/THRAAwAAQBAJ?hl=en.

  • 58

    BraverT. S.BarchD. M.GrayJ. R.MolfeseD. L.SnyderA. (2001). Anterior cingulate cortex and response conflict: effects of frequency, inhibition and errors. Cereb. Cortex11, 825836. 10.1093/cercor/11.9.825

  • 59

    BrennanP. A.KendrickK. M. (2006). Mammalian social odours: attraction and individual recognition. Philos. Trans. R. Soc. B Biol. Sci.361, 20612078. 10.1098/rstb.2006.1931

  • 60

    BreskaA.IvryR. B. (2016). Taxonomies of timing: where does the cerebellum fit in?Curr. Opin. Behav. Sci.8, 282288. 10.1016/j.cobeha.2016.02.034

  • 61

    BroglioC.Martín-MonzónI.OcañaF. M.GómezA.DuránE.SalasC.et al. (2015). Hippocampal pallium and map-like memories through vertebrate evolution. J. Behav. Brain Sci.05, 109120. 10.4236/jbbs.2015.53011

  • 62

    BrownC. (2015). Fish intelligence, sentience and ethics. Anim. Cogn.18, 117. 10.1007/s10071-014-0761-0

  • 63

    BrownS.JordaniaJ. (2011). Universals in the world’s musics. Psychol. Music41, 229248. 10.1177/0305735611425896

  • 64

    BrownS.MartinezM. J.ParsonsL. M. (2006). Music and language side by side in the brain: a PET study of the generation of melodies and sentences. Eur. J. Neurosci.23, 27912803. 10.1111/j.1460-9568.2006.04785.x

  • 65

    BrydenD. W.JohnsonE. E.TobiaS. C.KashtelyanV.RoeschM. R. (2011). Attention for learning signals in anterior cingulate cortex. J. Neurosci.31, 1826618274. 10.1523/JNEUROSCI.4715-11.2011

  • 66

    BucherD.AndersonP. A. (2015). Evolution of the first nervous systems—what can we surmise?J. Exp. Biol.218, 501503. 10.1242/jeb.111799

  • 67

    BuchsbaumB. R.GreerS.ChangW.-L.BermanK. F. (2005). Meta-analysis of neuroimaging studies of the Wisconsin Card-Sorting task and component processes. Hum. Brain Mapp.25, 3545. 10.1002/hbm.20128

  • 68

    BucknerR. L.CarrollD. C. (2007). Self-projection and the brain. Trends Cogn. Sci.11, 4957. 10.1016/j.tics.2006.11.004

  • 69

    BugnyarT.ReberS. A.BucknerC. (2016). Ravens attribute visual access to unseen competitors. Nat. Commun.7:10506. 10.1038/ncomms10506

  • 70

    BuhusiC. V.MeckW. H. (2005). What makes us tick? Functional and neural mechanisms of interval timing. Nat. Rev. Neurosci.6, 755765. 10.1038/nrn1764

  • 71

    BuhusiC. V.PereraD.MeckW. H. (2005). Memory for timing visual and auditory signals in albino and pigmented rats. J. Exp. Psychol. Anim. Behav. Process.31, 1830. 10.1037/0097-7403.31.1.18

  • 72

    BuhusiC. V.SasakiA.MeckW. H. (2002). Temporal integration as a function of signal and gap intensity in rats (Rattus norvegicus) and pigeons (Columba livia). J. Comp. Psychol.116, 381390. 10.1037/0735-7036.116.4.381

  • 73

    BurkeC. J.HuetterothW.OwaldD.PerisseE.KrashesM. J.DasG.et al. (2012). Layered reward signalling through octopamine and dopamine in Drosophila. Nature492, 433437. 10.1038/nature11614

  • 74

    Burt de PereraT.HolbrookR. I.DavisV. (2016). The representation of three-dimensional space in fish. Front. Behav. Neurosci.10:40. 10.3389/fnbeh.2016.00040

  • 75

    ByrneR.MendlM.DevereuxC.HeldS. (2001). Behaviour of domestic pigs in a visual perspective taking task. Behaviour138, 13371354. 10.1163/156853901317367627

  • 76

    BzdokD.LangnerR.HoffstaedterF.TuretskyB. I.ZillesK.EickhoffS. B. (2012). The modular neuroarchitecture of social judgments on faces. Cereb. Cortex22, 951961. 10.1093/cercor/bhr166

  • 77

    CallJ.HareB.CarpenterM.TomaselloM. (2004). ‘Unwilling’ versus ‘unable’: chimpanzees’ understanding of human intentional action. Dev. Sci.7, 488498. 10.1111/j.1467-7687.2004.00368.x

  • 78

    CallJ.TomaselloM. (1998). Distinguishing intentional from accidental actions in orangutans (Pongo pygmaeus), chimpanzees (Pan troglodytes), and human children (Homo sapiens). J. Comp. Psychol.112, 192206. 10.1037/0735-7036.112.2.192

  • 79

    CamilleN.CoricelliG.SalletJ.Pradat-DiehlP.DuhamelJ. R.SiriguA. (2004). The involvement of the orbitofrontal cortex in the experience of regret. Science304, 11671170. 10.1126/science.1094550

  • 80

    CantalupoC.HopkinsW. D. (2001). Asymmetric Broca’s area in great apes. Nature414, 505505. 10.1038/35107134

  • 81

    CarlinK. P.JiangZ.BrownstoneR. M. (2000). Characterization of calcium currents in functionally mature mouse spinal motoneurons. Eur. J. Neurosci.12, 16241634. 10.1046/j.1460-9568.2000.00050.x

  • 82

    CarpenterM.AkhtarN.TomaselloM. (1998). Fourteen- through 18-month-old infants differentially imitate intentional and accidental actions. Infant Behav. Dev.21, 315330. 10.1016/s0163-6383(98)90009-1

  • 83

    CarpenterM.CallJ. (2013). “How joint is the joint attention of apes and human infants?,” in Agency Joint Attention, eds MetcalfeJ.TerraceH. S. (Oxford: Oxford University Press), 4961. 10.1093/acprof:oso/9780199988341.003.0003

  • 84

    CarringtonS. J.BaileyA. J. (2009). Are there theory of mind regions in the brain? A review of the neuroimaging literature. Hum. Brain Mapp.30, 23132335. 10.1002/hbm.20671

  • 85

    CarterC. S.BraverT. S.BarchD. M.BotvinickM. M.NollD.CohenJ. D. (1998). Anterior cingulate cortex, error detection, and the online monitoring of performance. Science280, 747749. 10.1126/science.280.5364.747

  • 86

    CataniM.BambiniV. (2014). A model for social communication and language evolution and development (SCALED). Curr. Opin. Neurobiol.28, 165171. 10.1016/j.conb.2014.07.018

  • 87

    CataniM.JonesD. K.ffytcheD. H. (2004). Perisylvian language networks of the human brain. Ann. Neurol.57, 816. 10.1002/ana.20319

  • 88

    CeraN.EspositoR.CieriF.TartaroA. (2019). Altered cingulate cortex functional connectivity in normal aging and mild cognitive impairment. Front. Neurosci.13:857. 10.3389/fnins.2019.00857

  • 89

    CesarioJ.JohnsonD. J.EisthenH. L. (2020). Your brain is not an onion with a tiny reptile inside. Curr. Dir. Psychol. Sci.29, 255260. 10.1177/0963721420917687

  • 90

    ChangE. F.RaygorK. P.BergerM. S. (2015). Contemporary model of language organization: an overview for neurosurgeons. J. Neurosurg.122, 250261. 10.3171/2014.10.JNS132647

  • 91

    ChengR.-K.JesuthasanS. J.PenneyT. B. (2014). Zebrafish forebrain and temporal conditioning. Philos. Trans. R. Soc. B Biol. Sci.369:20120462. 10.1098/rstb.2012.0462

  • 92

    CheungA. F.PollenA. A.TavareA.DeProtoJ.MolnárZ. (2007). Comparative aspects of cortical neurogenesis in vertebrates. J. Anat.211, 164176. 10.1111/j.1469-7580.2007.00769.x

  • 93

    ChoiJ.-S.CainC. K.LeDouxJ. E. (2010). The role of amygdala nuclei in the expression of auditory signaled two-way active avoidance in rats. Learn. Mem.17, 139147. 10.1101/lm.1676610

  • 94

    ChurchlandP. (2011). Braintrust: What Neuroscience Tells Us about Morality.Princeton, NJ: Princeton University Press.

  • 95

    ChurginM. A.McCloskeyR. J.PetersE.Fang-YenC. (2017). Antagonistic serotonergic and octopaminergic neural circuits mediate food-dependent locomotory behavior in caenorhabditis elegans. J. Neurosci.37, 78117823. 10.1523/JNEUROSCI.2636-16.2017

  • 96

    CisekP. (2019). Resynthesizing behavior through phylogenetic refinement. Atten. Percept. Psychophys.81, 22652287. 10.3758/s13414-019-01760-1

  • 97

    ColeM.WahlstenD. (1968). Response-contingent CS termination as a factor in avoidance conditioning. Psychon. Sci.12, 1516. 10.3758/bf03331165

  • 98

    CollinC.HauserF.de ValdiviaE. G.LiS.ReisenbergerJ.CarlsenE. M.et al. (2013). Two types of muscarinic acetylcholine receptors in Drosophila and other arthropods. Cell. Mol. Life Sci.70, 32313242. 10.1007/s00018-013-1334-0

  • 99

    ConsonM.ErricoD.MazzarellaE.GiordanoM.GrossiD.TrojanoL. (2015). Transcranial electrical stimulation over dorsolateral prefrontal cortex modulates processing of social cognitive and affective information. PLoS One10:e0126448. 10.1371/journal.pone.0126448

  • 100

    CopeA. J.VasilakiE.MinorsD.SaboC.MarshallJ. A. R.BarronA. B. (2018). Abstract concept learning in a simple neural network inspired by the insect brain. PLoS Comput. Biol.14:e1006435. 10.1371/journal.pcbi.1006435

  • 101

    CorballisM. C. (2002). From Hand to Mouth: The Origins of Language.Princeton, New Jersey: Princeton University Press.

  • 102

    CoricelliG.CritchleyH. D.JoffilyM.O’DohertyJ. P.SiriguA.DolanR. J. (2005). Regret and its avoidance: a neuroimaging study of choice behavior. Nat. Neurosci.8, 12551262. 10.1038/nn1514

  • 103

    CoricelliG.DolanR. J.SiriguA. (2007). Brain, emotion and decision making: the paradigmatic example of regret. Trends Cogn. Sci.11, 258265. 10.1016/j.tics.2007.04.003

  • 104

    CowenS. L.McNaughtonB. L. (2007). Selective delay activity in the medial prefrontal cortex of the rat: contribution of sensorimotor information and contingency. J. Neurophysiol.98, 303316. 10.1152/jn.00150.2007

  • 105

    CrystalJ. D. (2013). Remembering the past and planning for the future in rats. Behav. Process.93, 3949. 10.1016/j.beproc.2012.11.014

  • 106

    D’AmatoM. R. (1988). A search for tonal pattern perception in cebus monkeys: why monkeys can’t hum a tune. Music Percept.5, 453480. 10.2307/40285410

  • 107

    DavidN.BewernickB. H.CohenM. X.NewenA.LuxS.FinkG. R.et al. (2006). Neural representations of self versus other: visual-spatial perspective taking and agency in a virtual ball-tossing game. J. Cogn. Neurosci.18, 898910. 10.1162/jocn.2006.18.6.898

  • 108

    DavisK.CheongM. C.ParkJ. S.YouY.-J. (2017). “Appetite control in C. elegans,” in Appetite and Food Intake. Central Control. 2nd Edition, ed R.Harris (Boca Raton, FL: CRC Press/Taylor and Francis), 116. 10.1201/9781315120171-1

  • 109

    de VilliersJ. (2007). The interface of language and theory of mind. Lingua117, 18581878. 10.1016/j.lingua.2006.11.006

  • 110

    de WaalF. B. (2008). Putting the altruism back into altruism: the evolution of empathy. Annu. Rev. Psychol.59, 279300. 10.1146/annurev.psych.59.103006.093625

  • 111

    DehaeneS.PosnerM. I.TuckerD. M. (1994). Localization of a neural system for error detection and compensation. Psychol. Sci.5, 303305. 10.1111/j.1467-9280.1994.tb00630.x

  • 112

    DelgadoL. M.CouveE.SchmachtenbergO. (2010). GABA and glutamate immunoreactivity in tentacles of the sea anemone Phymactis papillosa(LESSON 1830). J. Morphol.. 271, 845852. 10.1002/jmor.10838

  • 113

    Dennis SinghM.LeeM. (2017). “Temporal hierarchies in multilayer gated recurrent neural networks for language models,” in Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), (Anchorage, AK), 21522157. Available online at: https://ieeexplore.ieee.org/document/7965815

  • 114

    DerbyshireS. W. G. (2000). Exploring the pain “neuromatrix”. Curr. Rev. Pain4, 467477. 10.1007/s11916-000-0071-x

  • 115

    DevineniA. V.HeberleinU. (2009). Preferential ethanol consumption in drosophila models features of addiction. Curr. Biol.19, 21262132. 10.1016/j.cub.2009.10.070

  • 116

    DiamondA.Goldman-RakicP. S. (1989). Comparison of human infants and rhesus monkeys on Piaget’s AB task: evidence for dependence on dorsolateral prefrontal cortex. Exp. Brain Res.74, 2440. 10.1007/BF00248277

  • 117

    DollB. B.DuncanK. D.SimonD. A.ShohamyD.DawN. D. (2015). Model-based choices involve prospective neural activity. Nat. Neurosci.18, 767772. 10.1038/nn.3981

  • 118

    DouglasH. (2012). “Inequity Aversion: Do Rats Get Jealous?” Honors Projects. 63. Available online at: https://scholarworks.bgsu.edu/cgi/viewcontent.cgi?article=1064&context=honorsprojects.

  • 119

    DrakeC.JonesM. R.BaruchC. (2000). The development of rhythmic attending in auditory sequences: attunement, referent period, focal attending. Cognition77, 251288. 10.1016/s0010-0277(00)00106-2

  • 120

    DubnauJ.ChiangA.-S.TullyT. (2002). Neural substrates of memory: from synapse to system. J. Neurobiol.54, 238253. 10.1002/neu.10170

  • 121

    Dugas-FordJ.RowellJ. J.RagsdaleC. W. (2012). Cell-type homologies and the origins of the neocortex. Proc. Natl. Acad. Sci. U S A109, 1697416979. 10.1073/pnas.1204773109

  • 122

    DuránE.OcañaF. M.BroglioC.RodríguezF.SalasC. (2010). Lateral but not medial telencephalic pallium ablation impairs the use of goldfish spatial allocentric strategies in a “hole-board” task. Behav. Brain Res.214, 480487. 10.1016/j.bbr.2010.06.010

  • 123

    EichenbaumH. (2014). Time cells in the hippocampus: a new dimension for mapping memories. Nat. Rev. Neurosci.15, 732744. 10.1038/nrn3827

  • 124

    EichenbaumH.YonelinasA. P.RanganathC. (2007). The medial temporal lobe and recognition memory. Annu. Rev. Neurosci.30, 123152. 10.1146/annurev.neuro.30.051606.094328

  • 125

    ElgerC. E.SpeckmannE.-J.CaspersH.JanzenR. W. C. (1977). Cortico-spinal connections in the rat. I. Monosynaptic and polysynaptic responses of cervical motoneurons to epicortical stimulation. Exp. Brain Res.2828. 10.1007/BF00235718

  • 126

    EyreJ. (2007). Corticospinal tract development and its plasticity after perinatal injury. Neurosci. Biobehav. Rev.31, 11361149. 10.1016/j.neubiorev.2007.05.011

  • 127

    FagenR. (1981). Animal play behavior. Evolution35:1254. 10.2307/2408140

  • 128

    FanelliD. (2011). Negative results are disappearing from most disciplines and countries. Scientometrics90, 891904. 10.1007/s11192-011-0494-7

  • 129

    FarahM. J.SosoM. J.DasheiffR. M. (1992). Visual angle of the mind’s eye before and after unilateral occipital lobectomy. J. Exp. Psychol. Hum. Percept. Perform.18, 241246. 10.1037//0096-1523.18.1.241

  • 130

    FarooquiT.RobinsonK.VaessinH.SmithB. H. (2003). Modulation of early olfactory processing by an octopaminergic reinforcement pathway in the honeybee. J. Neurosci.23, 53705380. 10.1523/JNEUROSCI.23-12-05370.2003

  • 131

    FarriesM. A. (2013). How Basal’ are the basal ganglia?Brain Behav. Evol.82, 211214. 10.1159/000356101

  • 132

    FarrisS. M. (2008). Evolutionary convergence of higher brain centers spanning the protostome-deuterostome boundary. Brain Behav. Evol.72, 106122. 10.1159/000151471

  • 133

    FellousJ.-M. (1999). Neuromodulatory basis of emotion. Neuroscientist5, 283294. 10.1177/107385849900500514

  • 134

    FerrucciL.NougaretS.GenovesioA. (2019). Macaque monkeys learn by observation in the ghost display condition in the object-in-place task with differential reward to the observer. Sci. Rep.9:401. 10.1038/s41598-018-36803-4

  • 135

    FignerB.KnochD.JohnsonE. J.KroschA. R.LisanbyS. H.FehrE.et al. (2010). Lateral prefrontal cortex and self-control in intertemporal choice. Nat. Neurosci.13, 538539. 10.1038/nn.2516

  • 136

    FitchW. T. (2018). The biology and evolution of speech: a comparative analysis. Annu. Rev. Linguist.4, 255279. 10.1146/annurev-linguistics-011817-045748

  • 137

    FlorescoS. B.Ghods-SharifiS. (2006). Amygdala-prefrontal cortical circuitry regulates effort-based decision making. Cereb. Cortex17, 251260. 10.1093/cercor/bhj143

  • 138

    FogassiL.FerrariP. F.GesierichB.RozziS.ChersiF.RizzolattiG. (2005). Parietal lobe: from action organization to intention understanding. Science308, 662667. 10.1126/science.1106138

  • 139

    FooS. A.LiddellL.GrossmanA.CaldeiraK. (2019). Photo-movement in the sea anemone aiptasia influenced by light quality and symbiotic association. Coral Reefs39, 4754. 10.1007/s00338-019-01866-w

  • 140

    FossetteS.GleissA. C.ChalumeauJ.BastianT.ArmstrongC. D.VandenabeeleS.et al. (2015). Current-oriented swimming by jellyfish and its role in bloom maintenance. Curr. Biol.25, 342347. 10.1016/j.cub.2014.11.050

  • 141

    FranklandP. W.BontempiB.TaltonL. E.KaczmarekL.SilvaA. J. (2004). The involvement of the anterior cingulate cortex in remote contextual fear memory. Science304, 881883. 10.1126/science.1094804

  • 142

    FrickeH. W. (1974). Oko-Ethologie des monogamen Anemonenfisches Amphiprion bicintus (Freiwasseruntersuchung aus dem Roten Meer) [Eco-ethology of the monogamous anemone fish Amphiprion bicinctus (Field studies in the Red Sea)]. Z. Tierpsychol.36, 429512.

  • 143

    FriedericiA. D. (2011). The brain basis of language processing: from structure to function. Physiol. Rev.91, 13571392. 10.1152/physrev.00006.2011

  • 144

    FröbischJ.ReiszR. R. (2009). The Late Permian herbivore Suminia and the early evolution of arboreality in terrestrial vertebrate ecosystems. Proc. R. Soc. B Biol. Sci.276, 36113618. 10.1098/rspb.2009.0911

  • 145

    FugazzaC.PongráczP.PogányÁ.LenkeiR.MiklósiÁ. (2020). Mental representation and episodic-like memory of own actions in dogs. Sci. Rep.10:10449. 10.1038/s41598-020-67302-0

  • 146

    FujiiM.MaesawaS.IshiaiS.IwamiK.FutamuraM.SaitoK. (2016). Neural basis of language: an overview of an evolving model. Neurol. Med. Chir.56, 379386. 10.2176/nmc.ra.2016-0014

  • 147

    FujisawaS.AmarasinghamA.HarrisonM. T.BuzsákiG. (2008). Behavior-dependent short-term assembly dynamics in the medial prefrontal cortex. Nat. Neurosci.11, 823833. 10.1038/nn.2134

  • 148

    FujiwaraM.SenguptaP.McIntireS. L. (2002). Regulation of body size and behavioral state of C. elegans by sensory perception and the EGL-4 cGMP-dependent protein kinase. Neuron36, 10911102. 10.1016/s0896-6273(02)01093-0

  • 149

    GallagherH. L.FrithC. D. (2003). Functional imaging of ‘theory of mind’. Trends Cogn. Sci.7, 7783. 10.1016/s1364-6613(02)00025-6

  • 150

    GalleseV.FadigaL.FogassiL.RizzolattiG. (1996). Action recognition in the premotor cortex. Brain119, 593609. 10.1093/brain/119.2.593

  • 151

    GalliotB.QuiquandM. (2011). A two-step process in the emergence of neurogenesis. Eur. J. Neurosci.34, 847862. 10.1111/j.1460-9568.2011.07829.x

  • 152

    GallistelC. R.KingA.McDonaldR. (2004). Sources of variability and systematic error in mouse timing behavior. J. Exp. Psychol. Anim. Behav. Process.30, 316. 10.1037/0097-7403.30.1.3

  • 153

    GarmA.O’ConnorM.ParkefeltL.NilssonD.-E. (2007). Visually guided obstacle avoidance in the box jellyfish tripedalia cystophora and chiropsella bronzie. J. Exp. Biol.210, 36163623. 10.1242/jeb.004044

  • 154

    GarrityP. A.GoodmanM. B.SamuelA. D.SenguptaP. (2010). Running hot and cold: behavioral strategies, neural circuits and the molecular machinery for thermotaxis in C. elegans and Drosophila. Genes Dev.24, 23652382. 10.1101/gad.1953710

  • 155

    GlanzmanD. L. (2010). Common mechanisms of synaptic plasticity in vertebrates and invertebrates. Curr. Biol.20, R31R36. 10.1016/j.cub.2009.10.023

  • 156

    GattoE.Lucon-XiccatoT.BisazzaA. (2018). Factors affecting the measure of inhibitory control in a fish (Poecilia reticulata). Behav. Process.157, 1117. 10.1016/j.beproc.2018.08.003

  • 157

    GeorgeD.HawkinsJ. (2009). Towards a mathematical theory of cortical micro-circuits. PLoS Comput. Biol.5:e1000532. 10.1371/journal.pcbi.1000532

  • 158

    GhazanfarA. A. (2002). Primate Audition Behaviour and Neurobiology.Boca Raton, FL: CRC Press.

  • 159

    GilbertS. J.SpenglerS.SimonsJ. S.SteeleJ. D.LawrieS. M.FrithC. D.et al. (2006). Functional specialization within rostral prefrontal cortex (Area 10): a meta-analysis. J. Cogn. Neurosci.18, 932948. 10.1162/jocn.2006.18.6.932

  • 160

    GilovichT.MedvecV. H. (1995). The experience of regret: what, when, and why. Psychol. Rev.102, 379395. 10.1037/0033-295x.102.2.379

  • 161

    GinsburgS.JablonkaE. (2019). The Evolution of the Sensitive Soul: Learning and the Origins of Consciousness.Cambridge, MA: MIT Press.

  • 162

    GinsburgS.JablonkaE. (2021). Evolutionary transitions in learning and cognition. Philos. Trans. R. Soc. B Biol. Sci.376:20190766. 10.1098/rstb.2019.0766

  • 163

    GoffinetA. M.DaumerieC.LangerwerfB.PieauC. (1986). Neurogenesis in reptilian cortical structures: 3H-thymidine autoradiographic analysis. J. Comp. Neurol.243, 106116. 10.1002/cne.902430109

  • 164

    Goldman-RakicP. S. (1988). Topography of cognition: parallel distributed networks in primate association cortex. Annu. Rev. Neurosci.11, 137156. 10.1146/annurev.ne.11.030188.001033

  • 165

    GollY.AtlanG.CitriA. (2015). Attention: the claustrum. Trends Neurosci.38, 486495. 10.1016/j.tins.2015.05.006

  • 166

    Gómez-LaplazaL. M.GerlaiR. (2010). Latent learning in zebrafish (Danio rerio). Behav. Brain Res.208, 509515. 10.1016/j.bbr.2009.12.031

  • 167

    GorbmanA.NozakiM.KubokawaK. (1999). A brain-hatschek’s pit connection in amphioxus. Gen. Comp. Endocrinol.113, 251254. 10.1006/gcen.1998.7193

  • 168

    GouchaT.ZaccarellaE.FriedericiA. D. (2017). A revival of Homo loquens as a builder of labeled structures: neurocognitive considerations. Neurosci. Biobehav. Rev.81, 213224. 10.1016/j.neubiorev.2017.01.036

  • 169

    GourgouE.AdigaK.GoettemoellerA.ChenC.HsuA.-L. (2021). Caenorhabditis elegans learning in a structured maze is a multisensory behavior. iScience24:102284. 10.1016/j.isci.2021.102284

  • 170

    GrandjeanJ.CanellaC.AnckaertsC.AyrancG.BougachaS.BienertT.et al. (2020). Common functional networks in the mouse brain revealed by multi-centre resting-state fMRI analysis. NeuroImage205:116278. 10.1016/j.neuroimage.2019.116278

  • 171

    GranonS.PoucetB. (1995). Medial prefrontal lesions in the rat and spatial navigation: evidence for impaired planning. Behav. Neurosci.109, 474484. 10.1037//0735-7044.109.3.474

  • 172

    GreeneJ. D.NystromL. E.EngellA. D.DarleyJ. M.CohenJ. D. (2004). The neural bases of cognitive conflict and control in moral judgment. Neuron44, 389400. 10.1016/j.neuron.2004.09.027

  • 173

    GribovaA.DonchinO.BergmanH.VaadiaE.de OliveiraS. C. (2002). Timing of bimanual movements in human and nonhuman primates in relation to neuronal activity in primary motor cortex and supplementary motor area. Exp. Brain Res.146, 322335. 10.1007/s00221-002-1174-x

  • 174

    GrillnerS.RobertsonB. (2016). The basal ganglia over 500 million years. Curr. Biol.26, R1088R1100. 10.1016/j.cub.2016.06.041

  • 175

    GrosenickL.ClementT. S.FernaldR. D. (2007). Fish can infer social rank by observation alone. Nature445, 429432. 10.1038/nature05511

  • 176

    GrossbergS.VersaceM. (2008). Spikes, synchrony, and attentive learning by laminar thalamocortical circuits. Brain Res.1218, 278312. 10.1016/j.brainres.2008.04.024

  • 177

    GuZ.KalambogiasJ.YoshiokaS.HanW.LiZ.KawasawaY. I.et al. (2017). Control of species-dependent cortico-motoneuronal connections underlying manual dexterity. Science357, 400404. 10.1126/science.aan3721

  • 178

    GuptaA. S.van der MeerM. A.TouretzkyD. S.RedishA. D. (2012). Segmentation of spatial experience by hippocampal θ sequences. Nat. Neurosci.15, 10321039. 10.1038/nn.3138

  • 179

    GusnardD. A.AkbudakE.ShulmanG. L.RaichleM. E. (2001). Medial prefrontal cortex and self-referential mental activity: relation to a default mode of brain function. Proc. Natl. Acad. Sci. U S A98, 42594264. 10.1073/pnas.071043098

  • 180

    GutierrezC.ColaM. G.SeltzerB.CusickC. (2000). Neurochemical and connectional organization of the dorsal pulvinar complex in monkeys. J. Comp. Neurol.419, 6186. 10.1002/(sici)1096-9861(20000327)419:1<61::aid-cne4>3.0.co;2-i

  • 181

    HammerM.MenzelR. (1998). Multiple sites of associative odor learning as revealed by local brain microinjections of octopamine in honeybees. Learn Mem.5, 146156.

  • 182

    HallerJ.KrukM. R. (2006). Normal and abnormal aggression: human disorders and novel laboratory models. Neurosci. Biobehav. Rev.30, 292303. 10.1016/j.neubiorev.2005.01.005

  • 183

    HanS.TaralovaE.DupreC.YusteR. (2018). Comprehensive machine learning analysis of Hydra behavior reveals a stable basal behavioral repertoire. eLife7:e32605. 10.7554/eLife.32605

  • 184

    HaralsonJ. (1975). Classical conditioning in the sea anemone, cribrina xanthogrammica. Physiol. Behav.15, 455460. 10.1016/0031-9384(75)90214-0

  • 185

    HareT. A.CamererC. F.RangelA. (2009). Self-control in decision-making involves modulation of the vmPFC valuation system. Science324, 646648. 10.1126/science.1168450

  • 186

    HassabisD.KumaranD.VannS. D.MaguireE. A. (2007). Patients with hippocampal amnesia cannot imagine new experiences. Proc. Natl. Acad. Sci. U S A104, 17261731. 10.1073/pnas.0610561104

  • 187

    HassabisD.MaguireE. A. (2009). The construction system of the brain. Philos. Trans. R. Soc. B Biol. Sci.364, 12631271. 10.1098/rstb.2008.0296

  • 188

    HawkinsJ.AhmadS. (2016). Why neurons have thousands of synapses, a theory of sequence memory in neocortex. Front. Neural Circuits10:23. 10.3389/fncir.2016.00023

  • 189

    HawkinsR. D.ByrneJ. H. (2015). Associative learning in invertebrates. Cold Spring Harb. Perspect. Biol.7:a021709. 10.1101/cshperspect.a021709

  • 190

    HawkinsR. D.KandelE. R. (1984). Is there a cell-biological alphabet for simple forms of learning?Psychol. Rev.91, 375391. 10.1037/0033-295x.91.3.375

  • 191

    HawkinsR. D.LalevicN.ClarkG. A.KandelE. R. (1989). Classical conditioning of the aplysia siphon-withdrawal reflex exhibits response specificity. Proc. Natl. Acad. Sci. U S A86, 76207624. 10.1073/pnas.86.19.7620

  • 192

    HaysG. C.BastianT.DoyleT. K.FossetteS.GleissA. C.GravenorM. B.et al. (2011). High activity and Lévy searches: jellyfish can search the water column like fish. Proc. R. Soc. B Biol. Sci.279, 465473. 10.1098/rspb.2011.0978

  • 193

    HeikenfeldC.MederosS.ChenC.KorotkovaT.SchnitzlerA.PonomarenkoA. (2020). Prefrontal—subthalamic pathway supports action selection in a spatial working memory task. Sci. Rep.10:10497. 10.1038/s41598-020-67185-1

  • 194

    HejnolA.MartindaleM. Q. (2008). Acoel development supports a simple planula-like urbilaterian. Philos. Trans. R. Soc. B Biol. Sci.363, 14931501. 10.1098/rstb.2007.2239

  • 195

    HewesG.AndrewR.CariniL.ChoeH.GardnerR.KortlandtA.et al. (1973). Primate communication and the gestural origin of language. Curr. Anthropol.14, 524. Available online at: http://www.jstor.org/stable/2741093. Accessed April 7, 2021.

  • 196

    HickokG. (2009). Eight problems for the mirror neuron theory of action understanding in monkeys and humans. J. Cogn. Neurosci.21, 12291243. 10.1162/jocn.2009.21189

  • 197

    HoferM. A. (1996). Multiple regulators of ultrasonic vocalization in the infant rat. Psychoneuroendocrinology21, 203217. 10.1016/0306-4530(95)00042-9

  • 198

    HollandN. D. (2003). Early central nervous system evolution: an era of skin brains?Nat. Rev. Neurosci.4, 617627. 10.1038/nrn1175

  • 199

    HollardV. D.DeliusJ. D. (1982). Rotational invariance in visual pattern recognition by pigeons and humans. Science218, 804806. 10.1126/science.7134976

  • 200

    HollóG.NovákM. (2012). The manoeuvrability hypothesis to explain the maintenance of bilateral symmetry in animal evolution. Biol. Direct7:22. 10.1186/1745-6150-7-22

  • 201

    HoningH.MerchantH.HádenG. P.PradoL.BartoloR. (2012). Rhesus monkeys (Macaca mulatta) detect rhythmic groups in music, but not the beat. PLoS One7:e51369. 10.1371/journal.pone.0051369

  • 202

    HuY.van WingerdenM.SellittoM.SchäbleS.KalenscherT. (2021). Anterior cingulate cortex lesions abolish budget effects on effort-based decision-making in rat consumers. J. Neurosci.41, 44484460. 10.1523/JNEUROSCI.2541-20.2021

  • 203

    HulseS. H.CynxJ.HumpalJ. (1984). Absolute and relative pitch discrimination in serial pitch perception by birds. J. Exp. Psychol. General113, 3854. 10.1037/0096-3445.113.1.38

  • 204

    ImaizumiS.MoriK.KiritaniS.KawashimaR.SugiuraM.FukudaH.et al. (1997). Vocal identification of speaker and emotion activates differerent brain regions. Neuroreport8, 28092812. 10.1097/00001756-199708180-00031

  • 205

    InoueT.HoshinoH.YamashitaT.ShimoyamaS.AgataK. (2015). Planarian shows decision-making behavior in response to multiple stimuli by integrative brain function. Zoological Lett.1:7. 10.1186/s40851-014-0010-z

  • 206

    IshiharaT.IinoY.MohriA.MoriI.Gengyo-AndoK.MitaniS.et al. (2002). HEN-1, a secretory protein with an LDL receptor motif, regulates sensory integration and learning in Caenorhabditis elegans. Cell109, 639649. 10.1016/s0092-8674(02)00748-1

  • 207

    ItoH. T.ZhangS. J.WitterM. P.MoserE. I.MoserM. B. (2015). A prefrontal-thalamo-hippocampal circuit for goal-directed spatial navigation. Nature522, 5055. 10.1038/nature14396

  • 208

    JacksonP. L.RainvilleP.DecetyJ. (2006). To what extent do we share the pain of others? Insight from the neural bases of pain empathy. Pain125, 59. 10.1016/j.pain.2006.09.013

  • 209

    JanmaatK. R.PolanskyL.BanS. D.BoeschC. (2014). Wild chimpanzees plan their breakfast time, type and location. Proc. Natl. Acad. Sci. U S A111, 1634316348. 10.1073/pnas.1407524111

  • 210

    JarvisE. D. (2019). Evolution of vocal learning and spoken language. Science366, 5054. 10.1126/science.aax0287

  • 211

    JarvisE. D.GüntürkünO.BruceL.CsillagA.KartenH.KuenzelW.et al. (2005). Avian brains and a new understanding of vertebrate brain evolution. Nat. Rev. Neurosci.6, 151159. 10.1038/nrn1606

  • 212

    JekelyG. (2013). Global view of the evolution and diversity of metazoan neuropeptide signaling. Proc. Natl. Acad. Sci. U S A110, 87028707. 10.1073/pnas.1221833110

  • 213

    JenkinsA. C.MitchellJ. P. (2011). Medial prefrontal cortex subserves diverse forms of self-reflection. Soc. Neurosci.6, 211218. 10.1080/17470919.2010.507948

  • 214

    JentschkeS.KoelschS.SallatS.FriedericiA. D. (2008). Children with specific language impairment also show impairment of music-syntactic processing. J. Cogn. Neurosci.20, 19401951. 10.1162/jocn.2008.20135

  • 215

    JoelD.NivY.RuppinE. (2002). Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Neural Netw.15, 535547. 10.1016/s0893-6080(02)00047-3

  • 216

    JohnsonA.RedishA. D. (2007). Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point. J. Neurosci.27, 1217612189. 10.1523/JNEUROSCI.3761-07.2007

  • 217

    JonesJ. L.EsberG. R.McDannaldM. A.GruberA. J.HernandezA.MirenziA.et al. (2012). Orbitofrontal cortex supports behavior and learning using inferred but not cached values. Science338, 953956. 10.1126/science.1227489

  • 218

    JonesA. W.Levi-MontalciniR. (1958). Patterns of differentiation of the nerve centers and fiber tracts of the avian cerebral hemispheres. Arch. Ital. Biol.96, 231284.

  • 219

    KaasJ. H. (2009). Evolutionary Neuroscience.Cambridge, CA: Academic Press/Elsevier.

  • 220

    KaasJ. H. (2012). The evolution of brains from early mammals to humans. Wiley Interdiscip. Rev. Cogn. Sci.4, 3345. 10.1002/wcs.1206

  • 221

    KalbeE.SchlegelM.SackA. T.NowakD. A.DafotakisM.BangardC.et al. (2010). Dissociating cognitive from affective theory of mind: a TMS study. Cortex46, 769780. 10.1016/j.cortex.2009.07.010

  • 222

    KamigakiT. (2019). Prefrontal circuit organization for executive control. Neurosci. Res.140, 2336. 10.1016/j.neures.2018.08.017

  • 223

    KaminL. J. (1957). The effects of termination of the CS and avoidance of the US on avoidance learning: an extension. Can. J. Psychol.11, 4856. 10.1037/h0083690

  • 224

    KaminL. J.BrimerC. J.BlackA. H. (1963). Conditioned suppression as a monitor of fear of the CS in the course of avoidance training. J. Comp. Physiol. Psychol.56, 497501. 10.1037/h0047966

  • 225

    KandelE. R. (2001). The molecular biology of memory storage: a dialogue between genes and synapses. Science294, 10301038. 10.1126/science.1067020

  • 226

    KandelE. R. (2006). Search of Memory: The Emergence of a New Science of Mind.New York, NY: W.W. Norton and Company.

  • 227

    KanoF.KrupenyeC.HirataS.TomonagaM.CallJ. (2019). Great apes use self-experience to anticipate an agent’s action in a false-belief test. Proc. Natl. Acad. Sci. U S A116, 2090420909. 10.1073/pnas.1910095116

  • 228

    KappB. S.FrysingerR. C.GallagherM.HaseltonJ. R. (1979). Amygdala central nucleus lesions: effect on heart rate conditioning in the rabbit. Physiol. Behav.23, 11091117. 10.1016/0031-9384(79)90304-4

  • 229

    KarlenS. J.KrubitzerL. (2007). The functional and anatomical organization of marsupial neocortex: evidence for parallel evolution across mammals. Prog. Neurobiol.82, 122141. 10.1016/j.pneurobio.2007.03.003

  • 230

    KarnikI.GerlaiR. (2012). Can zebrafish learn spatial tasks? an empirical analysis of place and single CS-US associative learning. Behav. Brain Res.233, 415421. 10.1016/j.bbr.2012.05.024

  • 231

    KartenH. J. (1969). The organization of the avian telencephalon and some speculations on the phylogeny of the amniote telencephalon*. Ann. N Y Acad. Sci.167, 164179. 10.1111/j.1749-6632.1969.tb20442.x

  • 232

    KartenH. J. (1997). Evolutionary developmental biology meets the brain: the origins of mammalian cortex. Proc. Natl. Acad. Sci. U S A94, 28002804. 10.1073/pnas.94.7.2800

  • 233

    Kass-SimonG.PierobonP. (2007). Cnidarian chemical neurotransmission, an updated overview. Comp. Biochem. Physiol. A Mol. Integr. Physiol.146, 925. 10.1016/j.cbpa.2006.09.008

  • 234

    KaunK. R.AzanchiR.MaungZ.HirshJ.HeberleinU. (2011). A drosophila model for alcohol reward. Nat. Neurosci.14, 612619. 10.1038/nn.2805

  • 235

    KeenanJ. P.RubioJ.RacioppiC.JohnsonA.BarnaczA. (2005). The right hemisphere and the dark side of consciousness. Cortex41, 695704. 10.1016/s0010-9452(08)70286-7

  • 236

    KellerG. B.Mrsic-FlogelT. D. (2018). Predictive processing: a canonical cortical computation. Neuron100, 424435. 10.1016/j.neuron.2018.10.003

  • 237

    KellyY. T.WebbT. W.MeierJ. D.ArcaroM. J.GrazianoM. S. (2014). Attributing awareness to oneself and to others. Proc. Natl. Acad. Sci. U S A111, 50125017. 10.1073/pnas.1401201111

  • 238

    KerstenD.MamassianP.YuilleA. (2004). Object perception as bayesian inference. Annu. Rev. Psychol.55, 271304. 10.1146/annurev.psych.55.090902.142005

  • 239

    KimS.HwangJ.LeeD. (2008). Prefrontal coding of temporally discounted values during intertemporal choice. Neuron59, 161172. 10.1016/j.neuron.2008.05.010

  • 240

    KnillD. C.PougetA. (2004). The Bayesian brain: the role of uncertainty in neural coding and computation. Trends Neurosci.27, 712719. 10.1016/j.tins.2004.10.007

  • 241

    KnochD.Pascual-LeoneA.MeyerK.TreyerV.FehrE. (2006). Diminishing reciprocal fairness by disrupting the right prefrontal cortex. Science314, 829832. 10.1126/science.1129156

  • 242

    KoechlinE.DanekA.BurnodY.GrafmanJ. (2002). Medial prefrontal and subcortical mechanisms underlying the acquisition of motor and cognitive action sequences in humans. Neuron35, 371381. 10.1016/s0896-6273(02)00742-0

  • 243

    KoelschS.GunterT. C.V CramonD. Y.ZyssetS.LohmannG.FriedericiA. D. (2002). Bach speaks: a cortical “language-network” serves the processing of music. NeuroImage17, 956966. 10.1006/nimg.2002.1154

  • 244

    KondoH.OsakaN.OsakaM. (2004). Cooperation of the anterior cingulate cortex and dorsolateral prefrontal cortex for attention shifting. NeuroImage23, 670679. 10.1016/j.neuroimage.2004.06.014

  • 245

    KramerM.BurghardtG. M. (2010). Precocious courtship and play in emydid turtles. Ethology104, 3856. 10.1111/j.1439-0310.1998.tb00028.x

  • 246

    KrupenyeC.KanoF.HirataS.CallJ.TomaselloM. (2016). Great apes anticipate that other individuals will act according to false beliefs. Science354, 110114. 10.1126/science.aaf8110

  • 247

    KurczekJ.WechslerE.AhujaS.JensenU.CohenN. J.TranelD.et al. (2015). Differential contributions of hippocampus and medial prefrontal cortex to self-projection and self-referential processing. Neuropsychologia73, 116126. 10.1016/j.neuropsychologia.2015.05.002

  • 248

    KusayamaT.WatanabeS. (2000). Reinforcing effects of methamphetamine in planarians. Neuroreport11, 25112513. 10.1097/00001756-200008030-00033

  • 249

    LaneR. D.FinkG. R.ChauP. M.-L.DolanR. J. (1997). Neural activation during selective attention to subjective emotional responses. Neuroreport8, 39693972. 10.1097/00001756-199712220-00024

  • 250

    LangfordD. J. (2006). Social modulation of pain as evidence for empathy in mice. Science312, 19671970. 10.1126/science.1128322

  • 251

    LarschJ.FlavellS. W.LiuQ.GordusA.AlbrechtD. R.BargmannC. I. (2015). A circuit for gradient climbing in C. elegans chemotaxis. Cell Rep.12, 17481760. 10.1016/j.celrep.2015.08.032

  • 252

    LauH. E.CecereZ. T.LiuZ.YangC. J.SharpeeT. O.ChalasaniS. H. (2017). Neural mechanisms driving hunger-induced changes in sensory perception and behavior in Caenorhabditis elegans. bioRxiv [Preprint]. 10.1101/156109

  • 253

    LaubachM.AmaranteL. M.SwansonK.WhiteS. R. (2018). What, if anything, is rodent prefrontal cortex. eNeuro5:ENEURO.0315-18.2018. 10.1523/ENEURO.0315-18.2018

  • 254

    Lázaro-MuñozG.LeDouxJ. E.CainC. K. (2010). Sidman instrumental avoidance initially depends on lateral and basal amygdala and is constrained by central amygdala-mediated Pavlovian processes. Biol. Psychiatry67, 11201127. 10.1111/jam.15213

  • 255

    LeDouxJ. E.MoscarelloJ.SearsR.CampeseV. (2016). The birth, death and resurrection of avoidance: a reconceptualization of a troubled paradigm. Mol. Psychiatry22, 2436. 10.1038/mp.2016.166

  • 256

    LemonR. (2019). Recent advances in our understanding of the primate corticospinal system. F1000Research8:F1000 Faculty Rev-274. 10.12688/f1000research.17445.1

  • 257

    LenhoffH. M.LoomisW. F. (Eds.) (1963). “The biology of hydra and of some other coelenterates: 1961,” in Quarterly Review of Biology, vol. 38, 267268. Available online at: https://www.journals.uchicago.edu/doi/10.1086/403898

  • 258

    LewisP. A.RezaieR.BrownR.RobertsN.DunbarR. I. M. (2011). Ventromedial prefrontal volume predicts understanding of others and social network size. NeuroImage57, 16241629. 10.1016/j.neuroimage.2011.05.030

  • 259

    LewisT. (2014). Like humans, rats experience regret, study suggests. LiveScience. Available online at: https://www.livescience.com/46184-rats-experience-regret.html. Accessed March 22, 2021.

  • 260

    LeysS.CroninT.DegnanB.MarshallJ. (2002). Spectral sensitivity in a sponge larva. J. Comp. Physiol. A Neuroethol. Sens. Neural Behav. Physiol.188, 199202. 10.1007/s00359-002-0293-y

  • 261

    LiM. J. (2012). Identification of an Operant Learning Circuit by Whole Brain Functional Imaging in Larval Zebrafish. Doctoral dissertation. Harvard University. Available online at: https://dash.harvard.edu/handle/1/10974703.

  • 262

    LiebeskindB. J.HofmannH. A.HillisD. M.ZakonH. H. (2017). Evolution of animal neural systems. Ann. Rev. Ecol. Evol. Systematics48, 377398. 10.1146/annurev-ecolsys-110316-023048

  • 263

    LindeyerC. M.ReaderS. M. (2010). Social learning of escape routes in zebrafish and the stability of behavioural traditions. Animal Behav.79, 827834. 10.1016/j.anbehav.2009.12.024

  • 264

    LiuC.PlaçaisP.-Y.YamagataN.PfeifferB. D.AsoY.FriedrichA. B.et al. (2012). A subset of dopamine neurons signals reward for odour memory in Drosophila. Nature488, 512516. 10.1038/nature11304

  • 265

    LópezJ. C.GómezY.RodríguezF.BroglioC.VargasJ. P.SalasC. (2001). Spatial learning in turtles. Anim. Cogn.4, 4959. 10.1111/jam.15213

  • 266

    LópezJ. C.VargasJ. P.GómezY.SalasC. (2003). Spatial and non-spatial learning in turtles: the role of medial cortex. Behav. Brain Res.143, 109120. 10.1007/s100710100091

  • 267

    Lopez-BarrosoD.CataniM.RipollesP.Dell’AcquaF.Rodriguez-FornellsA.de Diego-BalaguerR. (2013). Word learning is mediated by the left arcuate fasciculus. Proc. Natl. Acad. Sci. U S A110, 1316813173. 10.1073/pnas.1301696110

  • 268

    LotzeM.VeitR.AndersS.BirbaumerN. (2007). Evidence for a different role of the ventral and dorsal medial prefrontal cortex for social reactive aggression: an interactive fMRI study. NeuroImage34, 470478. 10.1016/j.neuroimage.2006.09.028

  • 269

    LövheimH. (2012). A new three-dimensional model for emotions and monoamine neurotransmitters. Med. Hypotheses78, 341348. 10.1016/j.mehy.2011.11.016

  • 270

    LoweC. J.WuM.SalicA.EvansL.LanderE.Stange-ThomannN.et al. (2003). Anteroposterior patterning in hemichordates and the origins of the chordate nervous system. Cell113, 853865. 10.1016/s0092-8674(03)00469-0

  • 271

    LuH.ZouQ.GuH.RaichleM. E.SteinE. A.YangY. (2012). Rat brains also have a default mode network. Proc. Natl. Acad. Sci. U S A109, 39793984. 10.1073/pnas.1200506109

  • 272

    Lucon-XiccatoT.BisazzaA. (2017). Complex maze learning by fish. Anim. Behav.125, 6975. 10.1016/j.anbehav.2016.12.022

  • 273

    LuersenK.FaustU.GottschlingD.-C.DoringF. (2014). Gait-specific adaptation of locomotor activity in response to dietary restriction in Caenorhabditis elegans. J. Exp. Biol.217, 24802488. 10.1242/jeb.099382

  • 274

    LuoL.CookN.VenkatachalamV.Martinez-VelazquezL. A.ZhangX.CalvoA. C.et al. (2014). Bidirectional thermotaxis in Caenorhabditis elegans is mediated by distinct sensorimotor strategies driven by the AFD thermosensory neurons. Proc. Natl. Acad. Sci. U S A111, 27762781. 10.1073/pnas.1315205111

  • 275

    LuysterR. J.KadlecM. B.CarterA.Tager-FlusbergH. (2008). Language assessment and development in toddlers with autism spectrum disorders. J. Autism Dev. Disord.38, 14261438. 10.1007/s10803-007-0510-1

  • 276

    MacarioA.DardenS. K.VerbruggenF.CroftD. P. (2020). Intraspecific variation in inhibitory motor control in guppies, Poecilia reticulata. J. Fish Biol.98, 317328. 10.1111/jfb.14608

  • 277

    MacLeanP. D. (1990). The Triune Brain in Evolution: Role in Paleocerebral Functions.New York, NY: Plenum Press.

  • 278

    MacLeanE. L.HareB.NunnC. L.AddessiE.AmiciF.AndersonR. C.et al. (2014). The evolution of self-control. Proc. Natl. Acad. Sci. U S A111, E2140E2148. 10.1073/pnas.1323533111

  • 279

    MacphailE. M. (1982). Brain and Intelligence in Vertebrates.Oxford: Clarendon Press.

  • 280

    MaedaH.FukudaS.KamedaH.MurabeN.IsooN.MizukamiH.et al. (2015). Corticospinal axons make direct synaptic connections with spinal motoneurons innervating forearm muscles early during postnatal development in the rat. J. Physiol.594, 189205. 10.1113/JP270885

  • 281

    MarlowH. Q.SrivastavaM.MatusD. Q.RokhsarD.MartindaleM. Q. (2009). Anatomy and development of the nervous system of Nematostella vectensis, an anthozoan cnidarian. Dev. Neurobiol.69, 235254. 10.1002/dneu.20698

  • 282

    MartensP. A. (2011). The ambiguous tactus: tempo, subdivision benefit and three listener strategies. Music Percept.28, 433448. 10.1525/mp.2011.28.5.433

  • 283

    MartinA. K.KesslerK.CookeS.HuangJ.MeinzerM. (2020). The right temporoparietal junction is causally associated with embodied perspective-taking. J. Neurosci.40, 30893095. 10.1523/JNEUROSCI.2637-19.2020

  • 284

    MartinV. C.SchacterD. L.CorballisM. C.AddisD. R. (2011). A role for the hippocampus in encoding simulations of future events. Proc. Natl. Acad. Sci. U S A108, 1385813863. 10.1073/pnas.1105816108

  • 285

    MauckB.DehnhardtG. (1997). Mental rotation in a california sea lion (Zalophus californianus). J. Exp. Biol.200, 13091316.

  • 286

    McClureS. M. (2004). Separate neural systems value immediate and delayed monetary rewards. Science306, 503507. 10.1126/science.1100907

  • 287

    McClureS. M.EricsonK. M.LaibsonD. I.LoewensteinG.CohenJ. D. (2007). Time discounting for primary rewards. J. Neurosci.27, 57965804. 10.1523/JNEUROSCI.4246-06.2007

  • 288

    McClureS. M.LaibsonD. I.LoewensteinG.CohenJ. D. (2004). Separate neural systems value immediate and delayed monetary rewards. Science306, 503507. 10.1126/science.1100907

  • 289

    McKenzieT.ChermanT.BirdL. R.NaqshbandiM.RobertsW. A. (2004). Can squirrel monkeys (Saimiri sciureus) plan for the future? studies of temporal myopia in food choice. Learn. Behav.32, 377390. 10.3758/bf03196035

  • 290

    McKinneyM. F.MoelantsD. (2006). Ambiguity in tempo perception: what draws listeners to different metrical levels. Music Percept.24, 155166. 10.1525/mp.2006.24.2.155

  • 291

    MeckW. H.ChurchR. M.OltonD. S. (1984). Hippocampus, time and memory. Behav. Neurosci.98, 322. 10.1037//0735-7044.98.1.3

  • 292

    MeckW. H.ChurchR. M.WenkG. L.OltonD. S. (1987). Nucleus basalis magnocellularis and medial septal area lesions differentially impair temporal memory. J. Neurosci.7, 35053511. 10.1523/JNEUROSCI.07-11-03505.1987

  • 293

    MeckW. H.DoyèreV.GruartA. (2012). Interval Timing and Time-Based Decision Making.Google Books. Available online at: https://books.google.com/books?id=UqF8DwAAQBAJ. Accessed March 21, 2021.

  • 294

    MelgireM.RagotR.SamsonS.PenneyT. B.MeckW. H.PouthasV. (2005). Auditory/visual duration bisection in patients with left or right medial-temporal lobe resection. Brain Cogn.58, 119124. 10.1016/j.bandc.2004.09.013

  • 295

    MeltzoffA. N.BrooksR. (2001). ““Like me” as a building block for understanding other minds: bodily acts, attention and intention,” in Intentions and Intentionality: Foundations of Social Cognition, eds Dare A. Baldwin, Bertram F. Malle, Louis J. Moses (Cambridge, CA: MIT Press), 171191. 10.7551/mitpress/3838.003.0013

  • 296

    MenzelC. R. (1999). Unprompted recall and reporting of hidden objects by a chimpanzee (Pan troglodytes) after extended delays. J. Comp. Psychol.113, 426434. 10.1037/0735-7036.113.4.426

  • 297

    MenzelR.GreggersU.SmithA.BergerS.BrandtR.BrunkeS.et al. (2005). Honey bees navigate according to a map-like spatial memory. Proc. Natl. Acad. Sci. U S A102, 30403045. 10.1073/pnas.0408550102

  • 298

    MenzelR.KirbachA.HaassW. D.FischerB.FuchsJ.KoblofskyM.et al. (2011). A common frame of reference for learned and communicated vectors in honeybee navigation. Curr. Biol.21, 645650. 10.1016/j.cub.2011.02.039

  • 299

    MeunierM.MonfardiniE.BoussaoudD. (2007). Learning by observation in rhesus monkeys. Neurobiol. Learn. Mem.88, 243248. 10.1016/j.nlm.2007.04.015

  • 300

    MikolovT.JoulinA.ChopraS.MathieuM.RanzatoM. A. (2015). Learning longer memory in recurrent neural networks. arXiv.org [Preprint]. Available online at: https://arxiv.org/abs/1412.7753. Accessed April 5, 2021.

  • 301

    MinekaS. (1979). The role of fear in theories of avoidance learning, flooding and extinction. Psychol. Bull.86, 9851010.

  • 302

    MitchellS.BrianJ.ZwaigenbaumL.RobertsW.SzatmariP.SmithI.et al. (2006). Early language and communication development of infants later diagnosed with autism spectrum disorder. J. Dev. Behav. Pediatr.27, S69S78. 10.1097/00004703-200604002-00004

  • 303

    ModyM.BelliveauJ. W. (2013). Speech and language impairments in autism: insights from behavior and neuroimaging. N. Am. J. Med. Sci.5:157. 10.7156/v5i3p157

  • 304

    MogilJ. S. (2012). The surprising empathic abilities of rodents. Trends Cogn. Sci.16, 143144. 10.1016/j.tics.2011.12.012

  • 305

    MonahanC. (1996). New zooarchaeological data from Bed, I. I., Olduvai Gorge, Tanzania: implications for hominid behavior in the early Pleistocene. J. Hum. Evol.31, 93128. 10.1006/jhev.1996.0053

  • 306

    MooreB. R. (2004). The evolution of learning. Biol. Rev.79, 301335. 10.1017/s1464793103006225

  • 307

    MoranJ. M.JollyE.MitchellJ. P. (2012). Social-cognitive deficits in normal aging. J. Neurosci.32, 55535561. 10.1523/JNEUROSCI.5511-11.2012

  • 308

    MorozL. L. (2015). Convergent evolution of neural systems in ctenophores. J. Exp. Biol.218, 598611. 10.1242/jeb.110692

  • 309

    MorozL. L.KohnA. B. (2016). Independent origins of neurons and synapses: insights from ctenophores. Philos. Trans. R. Soc. Lond. B Biol. Sci.371:20150041. 10.1098/rstb.2015.0041

  • 310

    MountcastleV. B. (1978). “An organizing principle for cerebral function: the unit module and the distributed system,” in The Mindful Brain: Cortical Organization and the Group-Selective Theory of Higher Brain Function, eds EdelmanG. M.MountcastleV. V. (Cambridge, MA: MIT Press), 750. Available online at: https://psycnet.apa.org/record/1979-25355-000.

  • 311

    MuellerT.WullimannM. F. (2009). An evolutionary interpretation of teleostean forebrain anatomy. Brain Behav. Evol.74, 3042. 10.1159/000229011

  • 312

    MuganU.MacIverM. A. (2020). Spatial planning with long visual range benefits escape from visual predators in complex naturalistic environments. Nat. Commun.11:3057. 10.1038/s41467-020-16102-1

  • 313

    MulcahyN. J.CallJ. (2006). Apes save tools for future use. Science312, 10381040. 10.1126/science.1125456

  • 314

    MurabeN.MoriT.FukudaS.IsooN.OhnoT.MizukamiH.et al. (2018). Higher primate-like direct corticomotoneuronal connections are transiently formed in a juvenile subprimate mammal. Sci. Rep.8:16536. 10.1038/s41598-018-34961-z

  • 315

    MurakamiY.UchidaK.RijliF. M.KurataniS. (2005). Evolution of the brain developmental plan: insights from agnathans. Dev. Biol.280, 249259. 10.1016/j.ydbio.2005.02.008

  • 316

    MurrayE. A.WiseS. P.GrahamK. S. (2017). The Evolution of Memory Systems: Ancestors, Anatomy and Adaptations.Oxford, United Kingdom: Oxford University Press.

  • 317

    MurrayE. A.WiseS. P.GrahamK. S. (2018). Representational specializations of the hippocampus in phylogenetic perspective. Neurosci. Lett.680, 412. 10.1016/j.neulet.2017.04.065

  • 318

    NakamuraK.RoeschM. R.OlsonC. R. (2005). Neuronal activity in macaque SEF and ACC during performance of tasks involving conflict. J. Neurophysiol. 93, 884908. 10.1152/jn.00305.2004

  • 319

    NaqshbandiM.RobertsW. A. (2006). Anticipation of future events in squirrel monkeys (Saimiri sciureus) and rats (Rattus norvegicus): tests of the bischof-kohler hypothesis. J. Comp. Psychol.120, 345357. 10.1037/0735-7036.120.4.34

  • 320

    NavratilovaE.AtcherleyC. W.PorrecaF. (2015). Brain circuits encoding reward from pain relief. Trends Neurosci.38, 741750. 10.1016/j.tins.2015.09.003

  • 321

    Netherlands Institute for Neuroscience—KNAW. (2019). I feel you: emotional mirror neurons found in the rat. ScienceDaily. Available online at: http://www.sciencedaily.com/releases/2019/04/190411115239.htm. Accessed April 5, 2021.

  • 322

    NevilleH. J.CoffeyS. A.LawsonD. S.FischerA.EmmoreyK.BellugiU. (1997). Neural systems mediating american sign language: effects of sensory experience and age of acquisition. Brain Lang.57, 285308. 10.1006/brln.1997.1739

  • 323

    NewportC.WallisG.SiebeckU. E. (2018). Object recognition in fish: accurate discrimination across novel views of an unfamiliar object category (human faces). Animal Behav.145, 3949. 10.1016/j.anbehav.2018.09.002

  • 324

    NewportC.WallisG.ReshitnykY.SiebeckU. E. (2016). Discrimination of human faces by archerfish (Toxotes chatareus). Sci. Rep.6:27523. 10.1038/srep27523

  • 325

    NieuwenhuysR. (1998). “Amphioxus,” in The Central Nervous System of Vertebrates, (Berlin, Heidelberg: Springer), 365396. 10.1007/978-3-642-18262-4_9

  • 326

    NilssonD. E. (2013). Eye evolution and its functional basis. Vis. Neurosci.30, 520. 10.1017/S0952523813000035

  • 327

    NorthcuttR. G. (2012). Evolution of centralized nervous systems: two schools of evolutionary thought. Proc. Natl. Acad. Sci. U S A109, 1062610633. 10.1073/pnas.1201889109

  • 328

    OchsnerK. N. (2008). The social-emotional processing stream: five core constructs and their translational potential for schizophrenia and beyond. Biol. Psychiatry64, 4861. 10.1016/j.biopsych.2008.04.024

  • 329

    OchsnerK. N.BeerJ. S.RobertsonE. R.CooperJ. C.GabrieliJ. D. E.KihsltromJ. F.et al. (2005). The neural correlates of direct and reflected self-knowledge. NeuroImage28, 797814. 10.1016/j.neuroimage.2005.06.069

  • 330

    O’CravenK. M.KanwisherN. (2000). Mental imagery of faces and places activates corresponding stimulus-specific brain regions. J. Cogn. Neurosci.12, 10131023. 10.1162/08989290051137549

  • 331

    OhyamaT.GibbonJ.DeichJ. D.BalsamE. D. (1999). Temporal control during maintenance and extinction of conditioned keypecking in ring doves. Animal Learn. Behav.27, 8998. 10.3758/BF03199434

  • 332

    OlesonE. B.GentryR. N.ChiomaV. C.CheerJ. F. (2012). Subsecond dopamine release in the nucleus accumbens predicts conditioned punishment and its successful avoidance. J. Neurosci.32, 1480414808. 10.1523/JNEUROSCI.3087-12.2012

  • 333

    OprisanS. A.AftT.BuhusiM.BuhusiC. V. (2018). Scalar timing in memory: a temporal map in the hippocampus. J. Theor. Biol.438, 133142. 10.1016/j.jtbi.2017.11.012

  • 334

    O’ReillyR. C.FrankM. J. (2006). Making working memory work: a computational model of learning in the prefrontal cortex and basal ganglia. Neural Comput.18, 283328. 10.1162/089976606775093909

  • 335

    OrorbiaI. I. A. G.MikolovT.ReitterD. (2017). Learning simpler language models with the differential state framework. Neural Comput.29, 33273352. 10.1162/neco_a_01017

  • 336

    OtsukaY.OsakaN.YaoiK.OsakaM. (2011). First-person perspective effects on theory of mind without self-reference. PLoS One6:e19320. 10.1371/journal.pone.0019320

  • 337

    PantinC. F. A. (1935). The nerve net of the actinozoa: IV. facilitation and the “Staircase”. J. Exp. Biol.12, 389396. 10.1242/jeb.12.4.389

  • 338

    PantinC. F. A.PantinA. M. P. (1943). The stimulus to feeding in anemonia sulcata. J. Exp. Biol.20, 613. 10.1242/jeb.20.1.6

  • 339

    ParkerG. H. (1916). Locomotion of sea-anemones. Proc. Natl. Acad. Sci. U S A2, 449450. 10.1073/pnas.2.8.449

  • 340

    ParrT.FristonK. J. (2018). The anatomy of inference: generative models and brain structure. Front. Comput. Neurosci.12:90. 10.3389/fncom.2018.00090

  • 341

    PearlR. (1903). The movements and reactions of fresh-water planarians: a study in animal behaviour / by Raymond Pearl. J. Cell Sci.17, 509714. 10.1242/jcs.s2-46.184.509

  • 342

    PearseV. B. (1974). Modification of sea anemone behavior by symbiotic zooxanthellae: expansion and contraction. Biol. Bull.147, 641651. 10.2307/1540747

  • 343

    PearsonJ.NaselarisT.HolmesE. A.KosslynS. M. (2015). Mental imagery: functional mechanisms and clinical applications. Trends Cogn. Sci.19, 590602. 10.1016/j.tics.2015.08.003

  • 344

    PerrettD. I.SmithP. A. J.MistlinA. J.ChittyA. J.HeadA. S.PotterD. D.et al. (1985). Visual analysis of body movements by neurones in the temporal cortex of the macaque monkey: a preliminary report. Behav. Brain Res.16, 153170. 10.1016/0166-4328(85)90089-0

  • 345

    PerryC. J.BarronA. B.ChengK. (2013). Invertebrate learning and cognition: relating phenomena to neural substrate. Wiley Interdiscip. Rev. Cogn. Sci.4, 561582. 10.1002/wcs.1248

  • 346

    PetersonK. J.ButterfieldN. J. (2005). From the cover: origin of the eumetazoa: testing ecological predictions of molecular clocks against the proterozoic fossil record. Proc. Natl. Acad. Sci. U S A102, 95479552. 10.1073/pnas.0503660102

  • 347

    PetridesM. (2014). Neuroanatomy of Language Regions of the Human Brain.London: Academic Press.

  • 348

    PetridesM.PandyaD. N. (1984). Projections to the frontal cortex from the posterior parietal region in the rhesus monkey. J. Comp. Neurol.228, 105116. 10.1002/cne.902280110

  • 349

    PhanK. L.TaylorS. F.WelshR. C.HoS.-H.BrittonJ. C.LiberzonI. (2004). Neural correlates of individual ratings of emotional salience: a trial-related fMRI study. NeuroImage21, 768780. 10.1016/j.neuroimage.2003.09.072

  • 350

    PhilippiC. L.FeinsteinJ. S.KhalsaS. S.DamasioA.TranelD.LandiniG.et al. (2012). Preserved self-awareness following extensive bilateral brain damage to the insula, anterior cingulate and medial prefrontal cortices. PLoS One7:e38413. 10.1371/journal.pone.0038413

  • 351

    PhillipsJ. B.AdlerK.BorlandS. C. (1995). True navigation by an amphibian. Anim. Behav.50, 855858. 10.1016/0003-3472(95)80146-4

  • 352

    PierobonP. (2012). Coordinated modulation of cellular signaling through ligand-gated ion channels in Hydra vulgaris (Cnidaria, Hydrozoa). Int. J. Dev. Biol.56, 551565. 10.1387/ijdb.113464pp

  • 353

    PortavellaM. (2004). Avoidance response in goldfish: emotional and temporal involvement of medial and lateral telencephalic pallium. J. Neurosci.24, 23352342. 10.1523/JNEUROSCI.4930-03.2004

  • 354

    PowellJ. L.LewisP. A.DunbarR. I. M.Garcia-FinanaM.RobertsN. (2010). Orbital prefrontal cortex volume correlates with social cognitive competence. Neuropsychologia48, 35543562. 10.1016/j.neuropsychologia.2010.08.004

  • 355

    PowellJ. L.LewisP. A.RobertsN.Garcia-FinanaM.DunbarR. I. M. (2012). Orbital prefrontal cortex volume predicts social network size: an imaging study of individual differences in humans. Proc. R. Soc. B Biol. Sci.279, 21572162. 10.1098/rspb.2011.2574

  • 356

    PradosJ.AlvarezB.HowarthJ.StewartK.GibsonC. L.HutchinsonC. V.et al. (2012). Cue competition effects in the planarian. Anim. Cogn.16, 177186. 10.1007/s10071-012-0561-3

  • 357

    PreussT. M. (2006). “Evolutionary specializations of primate brain systems,” PRIMATE ORIGINS: Adaptations and Evolution, eds M. J. Ravosa, and M. Dagosto (Boston, MA: Springer), 625675. 10.1007/978-0-387-33507-0_18

  • 358

    ProcykE.JosephJ. P. (2001). Characterization of serial order encoding in the monkey anterior cingulate sulcus. Eur. J. Neurosci.14, 10411046. 10.1046/j.0953-816x.2001.01738.x

  • 359

    ProcykE.TanakaY. L.JosephJ. P. (2000). Anterior cingulate activity during routine and non-routine sequential behaviors in macaques. Nat. Neurosci.3, 502508. 10.1038/74880

  • 360

    PuceA.PerrettD. (2003). Electrophysiology and brain imaging of biological motion. Philos. Trans. R. Soc. Lond. B Biol. Sci.358, 435445. 10.1098/rstb.2002.1221

  • 361

    QinJ.WheelerA. R. (2007). Maze exploration and learning in C. elegans. Lab on a Chip7, 186192. 10.1039/b613414a

  • 362

    QureshiA. W.BrethertonL.MarshB.MonkR. L. (2020). Stimulation of the dorsolateral prefrontal cortex impacts conflict resolution in Level-1 visual perspective taking. Cogn. Affect. Behav. Neurosci.20, 565574. 10.3758/s13415-020-00786-5

  • 363

    RalstonJ. V.HermanL. M. (1995). Perception and generalization of frequency contours by a bottlenose dolphin (Tursiops truncatus). J. Comp. Psychol.109, 268277. 10.1037/0735-7036.109.3.268

  • 364

    RaoR. P.BallardD. H. (1999). Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat. Neurosci.2, 7987. 10.1038/4580

  • 365

    RasmussenT.MilnerB. (1977). The role of early left-brain injury in determining lateralization of cerebral speech functions. Ann. N. Y. Acad. Sci.299, 355369. 10.1111/j.1749-6632.1977.tb41921.x

  • 366

    RauscheckerJ. P.ScottS. K. (2009). Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci.12, 718724. 10.1038/nn.2331

  • 367

    RazranG. (1971). Mind in Evolution: An East-West Synthesis of Learned Behavior and Cognition.Boston, MA: Houghton Mifflin.

  • 368

    RedishA. D. (2016). Vicarious trial and error. Nat. Rev. Neurosci.17, 147159. 10.1038/nrn.2015.30

  • 369

    ReebsS. (2010). Social Intelligence in Fishes. Available online at: http://howfishbehave.ca/pdf/Social%20intelligence.pdf. Accessed April 5, 2021.

  • 370

    ReinerA.PerkelD. J.BruceL. L.ButlerA. B.CsillagA.KuenzelW.et al. (2004). Revised nomenclature for avian telencephalon and some related brainstem nuclei. J. Comp. Neurol.473, 377414. 10.1002/cne.20118

  • 371

    RengarajanS.YankuraK. A.GuillerminM. L.FungW.HallemE. A. (2019). Feeding state sculpts a circuit for sensory valence in Caenorhabditis elegans. Proc. Natl. Acad. Sci. U S A116, 17761781. 10.1073/pnas.1807454116

  • 372

    ReniersR. L.VöllmB. A.ElliottR.CorcoranR. (2013). Empathy, ToM and self-other differentiation: an fMRI study of internal states. Soc. Neurosci.9, 5062. 10.1080/17470919.2013.861360

  • 373

    RennieS. M.MoitaM. M.MainenZ. F. (2013). Social cognition in the rodent: nothing to be sniffed at. Trends Cogn. Sci.17, 306307. 10.1016/j.tics.2013.04.011

  • 374

    ReppucciC. J.PetrovichG. D. (2015). Organization of connections between the amygdala, medial prefrontal cortex and lateral hypothalamus: a single and double retrograde tracing study in rats. Brain Struct. Funct.221, 29372962. 10.1007/s00429-015-1081-0

  • 375

    RhoadesJ. L.NelsonJ. C.NwabudikeI.YuS. K.McLachlanI. G.MadanG. K.et al. (2019). ASICs mediate food responses in an enteric serotonergic neuron that controls foraging behaviors. Cell176, 8597. 10.1016/j.cell.2018.11.023

  • 376

    RillingJ. K. (2014). Comparative primate neurobiology and the evolution of brain language systems. Curr. Opin. Neurobiol.28, 1014. 10.1016/j.conb.2014.04.002

  • 377

    RillingJ. K.GlasserM. F.JbabdiS.AnderssonJ.PreussT. M. (2012). Continuity, divergence and the evolution of brain language pathways. Front. Evol. Neurosci.3:11. 10.3389/fnevo.2011.00011

  • 378

    RillingJ. K.GlasserM. F.PreussT. M.MaX.ZhaoT.HuX.et al. (2008). The evolution of the arcuate fasciculus revealed with comparative DTI. Nat. Neurosci.11, 426428. 10.1038/nn2072

  • 379

    RitblattS. N. (2000). Children’s level of participation in a false-belief task, age and theory of mind. J. Genet. Psychol.161, 5364. 10.1080/00221320009596694

  • 380

    RizzolattiG.ArbibM. A. (1998). Language within our grasp. Trends Neurosci.21, 188194. 10.1016/s0166-2236(98)01260-0

  • 381

    RizzolattiG.FadigaL.GalleseV.FogassiL. (1996). Premotor cortex and the recognition of motor actions. Cogn. Brain Res.3, 131141. 10.1016/0926-6410(95)00038-0

  • 382

    RizzolattiG.FogassiL.GalleseV. (2001). Neurophysiological mechanisms underlying the understanding and imitation of action. Nat. Rev. Neurosci.2, 661670. 10.1038/35090060

  • 383

    RobertsS.ChurchR. M. (1978). Control of an internal clock. J. Exp. Psychol. Anim. Behav. Processes4, 318337. 10.1037/0097-7403.4.4.318

  • 384

    RochatP. (1998). Self-perception and action in infancy. Exp. Brain Res.123, 102109. 10.1007/s002210050550

  • 385

    RodríguezF.LópezJ. C.VargasJ. P.BroglioC.GómezY.SalasC. (2002). Spatial memory and hippocampal pallium through vertebrate evolution: insights from reptiles and teleost fish. Brain Res. Bull.57, 499503. 10.1016/s0361-9230(01)00682-7

  • 386

    RothG. (2015). Convergent evolution of complex brains and high intelligence. Philos. Trans. R. Soc. Lond. B Biol. Sci.370:20150049. 10.1098/rstb.2015.0049

  • 387

    RudebeckP. H.PutnamP. T.DanielsT. E.YangT.MitzA. R.RhodesS. E.et al. (2014). A role for primate subgenual cingulate cortex in sustaining autonomic arousal. Proc. Natl. Acad. Sci. U S A111, 53915396. 10.1073/pnas.1317695111

  • 388

    RuppertE. E.FoxS. R.BarnesD. R. (2004). Ivertebrate Zoology, 7th Edn.Boston, MA: Cengage Learning.

  • 389

    RushforthN. B. (1973). Behavioral modifications in coelenterates. Invert. Learn.1, 123169.

  • 390

    RyanJ. F. (2014). Did the ctenophore nervous system evolve independently?Zoology117, 225226. 10.1016/j.zool.2014.06.001

  • 391

    SawinE. R.RanganathanR.HorvitzH. R. (2000). C. elegans locomotory rate is modulated by the environment through a dopaminergic pathway and by experience through a serotonergic pathway. Neuron26, 619631. 10.1016/s0896-6273(00)81199-x

  • 392

    SalletJ.MarsR. B.NoonanM. P.AnderssonJ. L.O’ReillyJ. X.JbabdiS.et al. (2011). Social network size affects neural circuits in macaques. Science334, 697700. 10.1126/science.1210027

  • 393

    SammlerD.GrosbrasM.-H.AnwanderA.BestelmeyerP. E. G.BelinP. (2015). Dorsal and ventral pathways for prosody. Curr. Biol.25, 30793085. 10.1016/j.cub.2015.10.009

  • 394

    SandersonC. E.CookP.HillP. S. M.OrozcoB. S.AbramsonC. I.WellsH. (2013). Nectar Quality perception by honey bees (Apis mellifera ligustica). J. Comp. Psychol.127, 341351. 10.1037/a0032613

  • 395

    SanfeyA. G. (2003). The neural basis of economic decision-making in the ultimatum game. Science300, 17551758. 10.1126/science.1082976

  • 396

    SatterlieR. A. (2015). Cnidarian nerve nets and neuromuscular efficiency. Int. Comp. Biol.55, 10501057. 10.1093/icb/icv067

  • 397

    SchenkerN. M.HopkinsW. D.SpocterM. A.GarrisonA. R.StimpsonC. D.ErwinJ. M.et al. (2009). Broca’s area homologue in chimpanzees (Pan troglodytes): probabilistic mapping, asymmetry and comparison to humans. Cereb. Cortex20, 730742. 10.1093/cercor/bhp138

  • 398

    SchmahmannJ. D.PandyaD. N.WangR.DaiG.D’ArceuilH. E.de CrespignyA. J.et al. (2007). Association fibre pathways of the brain: parallel observations from diffusion spectrum imaging and autoradiography. Brain130, 630653. 10.1093/brain/awl359

  • 399

    SchmidtB.DuinA. A.RedishA. D. (2019). Disrupting the medial prefrontal cortex alters hippocampal sequences during deliberative decision making. J. Neurophysiol.121, 19812000. 10.1152/jn.00793.2018

  • 400

    SchneirlaT. C. (1959). “An evolutionary and developmental theory of biphasic processes underlying approach and withdrawal,” in Nebraska Symposium on Motivation, ed JonesM. R. (Lincoln, NE: University of Nebraska Press), 142.

  • 401

    SchomersM. R.GaragnaniM.PulvermüllerF. (2017). Neurocomputational consequences of evolutionary connectivity changes in perisylvian language cortex. J. Neurosci.37, 30453055. 10.1523/JNEUROSCI.2693-16.2017

  • 402

    SchoolerJ. W.SmallwoodJ.ChristoffK.HandyT. C.ReichleE. D.SayetteM. A. (2011). Meta-awareness, perceptual decoupling and the wandering mind. Trends Cogn. Sci.15, 319326. 10.1016/j.tics.2011.05.006

  • 403

    SchumacherS.Burt de PereraT.ThenertJ.von der EmdeG. (2016). Cross-modal object recognition and dynamic weighting of sensory inputs in a fish. Proc. Natl. Acad. Sci. U S A113, 76387643. 10.1073/pnas.1603120113

  • 404

    SchweimerJ.SaftS.HauberW. (2005). Involvement of catecholamine neurotransmission in the rat anterior cingulate in effort-related decision making. Behav. Neurosci.119, 16871692. 10.1037/0735-7044.119.6.1687

  • 405

    SeamansJ. K.FlorescoS. B.PhillipsA. G. (1995). Functional differences between the prelimbic and anterior cingulate regions of the rat prefrontal cortex. Behav. Neurosci.109, 10631073. 10.1037//0735-7044.109.6.1063

  • 406

    SemendeferiK.ArmstrongE.SchleicherA.ZillesK.Van HoesenG. W. (2001). Prefrontal cortex in humans and apes: a comparative study of area 10. Am. J. Phys. Anthropol.114, 224241. 10.1002/1096-8644(200103)114:3<224::AID-AJPA1022>3.0.CO;2-I

  • 407

    SeyfarthR. M.CheneyD. L. (2013). Affiliation, empathy and the origins of theory of mind. Proc. Natl. Acad. Sci. U S A110, 1034910356. 10.1073/pnas.1301223110

  • 408

    Shamay-TsooryS. G. (2011). The neural bases for empathy. Neuroscientist17, 1824. 10.1177/1073858410379268

  • 409

    Shamay-TsooryS. G.Aharon-PeretzJ. (2007). Dissociable prefrontal networks for cognitive and affective theory of mind: a lesion study. Neuropsychologia45, 30543067. 10.1016/j.neuropsychologia.2007.05.021

  • 410

    SimonyanK.HorwitzB. (2011). Laryngeal motor cortex and control of speech in humans. Neuroscientist17, 197208. 10.1177/1073858410386727

  • 411

    SimpsonJ.O’HaraS. J. (2018). Gaze following in an asocial reptile (Eublepharis macularius). Anim. Cogn.22, 145152. 10.1007/s10071-018-1230-y

  • 412

    SkeideM. A.BrauerJ.FriedericiA. D. (2016). Brain functional and structural predictors of language performance. Cereb. Cortex26, 21272139. 10.1093/cercor/bhv042

  • 413

    SmithR. A. (2016). Apes understand that some things are all in your head. Duke Today. Available online at: https://today.duke.edu/2016/10/falsebeliefs. Accessed June 17, 2021.

  • 414

    SotoF. A.WassermanE. A. (2014). Mechanisms of object recognition: what we have learned from pigeons. Front. Neural Circuits8:122. 10.3389/fncir.2014.00122

  • 415

    SøvikE.BarronA. B. (2013). Invertebrate models in addiction research. Brain Behav. Evol.82, 153165. 10.1159/000355506

  • 416

    SovranoV. A.BarattiG.PotrichD. (2018). A detour task in four species of fishes. Front. Psychol.9:2341. 10.3389/fpsyg.2018.02341

  • 417

    SovranoV. A.BisazzaA. (2007). Recognition of partly occluded objects by fish. Anim. Cogn.11, 161166. 10.1007/s10071-007-0100-9

  • 418

    SpenceR. (2011). “Zebrafish ecology and behaviour,” in Zebrafish Models in Neurobehavioral Research, eds KalueffA. V.CachatJ. M. (Berlin: Springer Verlag), 146.

  • 419

    SpenceR.GerlachG.LawrenceC.SmithC. (2008). The behaviour and ecology of the zebrafish, Danio rerio. Biol. Rev.83, 1334. 10.1111/j.1469-185X.2007.00030.x

  • 420

    SpocterM. A.HopkinsW. D.GarrisonA. R.BauernfeindA. L.StimpsonC. D.HofP. R.et al. (2010). Wernicke’s area homologue in chimpanzees (Pan troglodytes) and its relation to the appearance of modern human language. Proc. R. Soc. B Biol. Sci.277, 21652174. 10.1098/rspb.2010.0011

  • 421

    SpratlingM. (2017). A review of predictive coding algorithms. Brain Cogn.112, 9297. 10.1016/j.bandc.2015.11.003

  • 422

    StaffordJ. M.JarrettB. R.Miranda-DominguezO.MillsB. D.CainN.MihalasS.et al. (2014). Large-scale topology and the default mode network in the mouse connectome. Proc. Natl. Acad. Sci. U S A111, 1874518750. 10.1073/pnas.1404346111

  • 423

    StarrM. D.MinekaS. (1977). Determinants of fear over the course of avoidance learning. Learn. Motiv.8, 332350. 10.1016/0023-9690(77)90056-x

  • 424

    SteinerA. P.RedishA. D. (2014). Behavioral and neurophysiological correlates of regret in rat decision-making on a neuroeconomic task. Nat. Neurosci.17, 9951002. 10.1038/nn.3740

  • 425

    Stephenson-JonesM.KardamakisA. A.RobertsonB.GrillnerS. (2013). Independent circuits in the basal ganglia for the evaluation and selection of actions. Proc. Natl. Acad. Sci. U S A110, E3670E3679. 10.1073/pnas.1314815110

  • 426

    StevensJ. R.StephensD. W.MaddenG. J. (2010). “The adaptive nature of impulsivity,” in Impulsivity: The Behavioral and Neurological Science of Discounting, eds MaddenG. J.BickelW. K. (Washington DC: American Psychological Association), 361387.

  • 427

    StookeyB. (1963). Jean-Baptiste Bouillaud and Ernest AUBURTIN. Early studies on cerebral localization and the speech center. JAMA184, 10241029. 10.1001/jama.1963.73700260007011

  • 428

    StoutD.HechtE. E. (2017). Evolutionary neuroscience of cumulative culture. Proc. Natl. Acad. Sci. U S A114, 78617868. 10.1073/pnas.1620738114

  • 429

    StrausfeldN. J.HirthF. (2013). Deep homology of arthropod central complex and vertebrate basal ganglia. Science340, 157161. 10.1126/science.1231828

  • 430

    StriedterG. F. (2005). Principles of Brain Evolution.Sunderland, MA: Sinauer Associates.

  • 431

    StriedterG. F.KeeferB. P. (2000). Cell migration and aggregation in the developing telencephalon: pulse-labeling chick embryos with bromodeoxyuridine. J. Neurosci.20, 80218030. 10.1523/JNEUROSCI.20-21-08021.2000

  • 432

    StriedterG. F.NorthcuttR. G. (2020). Brains Through Time: A Natural History of Vertebrates.Oxford: Oxford University Press.

  • 433

    SugaharaF.MurakamiY.Pascual-AnayaJ.KurataniS. (2017). Reconstructing the ancestral vertebrate brain. Dev. Growth Differ.59, 163174. 10.1111/dgd.12347

  • 434

    SumbreG.MutoA.BaierH.PooM.-M. (2008). Entrained rhythmic activities of neuronal ensembles as perceptual memory of time interval. Nature456, 102107. 10.1038/nature07351

  • 435

    TanakaS. C.DoyaK.OkadaG.UedaK.OkamotoY.YamawakiS. (2004). Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat. Neurosci.7, 887893. 10.1038/nn1279

  • 436

    TangW.JbabdiS.ZhuZ.CottaarM.GrisotG.LehmanJ. F.et al. (2019). A connectional hub in the rostral anterior cingulate cortex links areas of emotion and cognitive control. eLife8:e43761. 10.7554/eLife.43761

  • 437

    TeiS.FujinoJ.KawadaR.JankowskiK. F.KauppiJ.-P.van den BosW.et al. (2017). Collaborative roles of temporoparietal junction and dorsolateral prefrontal cortex in different types of behavioural flexibility. Sci. Rep.7:6415. 10.1038/s41598-017-06662-6

  • 438

    TerraceH. S. (2019). Why Chimpanzees Can’t Learn Language and Only Humans Can.New York, NY: Columbia University Press.

  • 439

    TerraceH. S.PetittoL. A.SandersR. J.BeverT. G. (1979). Can an ape create a sentence?Science206, 891902. 10.1126/science.504995

  • 440

    Thiebaut de SchottenM.Dell’AcquaF.ValabregueR.CataniM. (2012). Monkey to human comparative anatomy of the frontal lobe association tracts. Cortex48, 8296. 10.1016/j.cortex.2011.10.001

  • 441

    ThompsonC.ChurchR. (1980). An explanation of the language of a chimpanzee. Science208, 313314. 10.1126/science.7367862

  • 442

    ToiviainenP.SnyderJ. S. (2003). Tapping to bach: resonance-based modeling of pulse. Music Percept.21, 4380. 10.1525/mp.2003.21.1.43

  • 443

    TomaselloM.CallJ.HareB. (2003). Chimpanzees understand psychological states—the Question is which ones and to what extent. Trends Cogn. Sci.7, 153156. 10.1016/s1364-6613(03)00035-4

  • 444

    TomaselloM.CarpenterM.CallJ.BehneT.MollH. (2005). Understanding and sharing intentions: the origins of cultural cognition. Behav. Brain Sci.28, 675691. 10.1017/S0140525X05000129

  • 445

    TomaselloM.Davis-DasilvaM.CamakL.BardK. (1987). Observational learning of tool-use by young chimpanzees. Hum. Evol.2, 175183. 10.1007/bf02436405

  • 446

    TorleyV. J. (2009). The Anatomy of a Minimal Mind. PhD Thesis. Melbourne, Australia: University of Melbourne.

  • 447

    ToschesM. A. (2021). Different origins for similar brain circuits. Science371, 676677. 10.1126/science.abf9551

  • 448

    ToschesM. A.YamawakiT. M.NaumannR. K.JacobiA. A.TushevG.LaurentG. (2018). Evolution of pallium, hippocampus and cortical cell types revealed by single-cell transcriptomics in reptiles. Science360, 881888. 10.1126/science.aar4237

  • 449

    TotahN. K.KimY. B.HomayounH.MoghaddamB. (2009). Anterior cingulate neurons represent errors and preparatory attention within the same behavioral sequence. J. Neurosci.29, 64186426. 10.1523/JNEUROSCI.1142-09.2009

  • 450

    UchidaK.MurakamiY.KurakuS.HiranoS.KurataniS. (2003). Development of the adenohypophysis in the lamprey: evolution of epigenetic patterning programs in organogenesis. J. Exp. Zool.300B, 3247. 10.1002/jez.b.44

  • 451

    UlinskiP. S. (1983). Dorsal Ventricular Ridge: A Treatise on Forebrain Organization in Reptiles and Birds.New York, NY: Wiley.

  • 452

    University of Lincoln. (2014). First evidence that reptiles can learn through imitation. ScienceDaily. Available online at: https://www.sciencedaily.com/releases/2014/09/140930090443.htm. Accessed April 5, 2021.

  • 453

    van HeukelumS.MarsR. B.GuthrieM.BuitelaarJ. K.BeckmannC. F.TiesingaP. H. E.et al. (2020). Where is cingulate cortex? A cross-species view. Trends Neurosci.43, 285299. 10.1016/j.tins.2020.03.007

  • 454

    Van OverwalleF. (2009). Social cognition and the brain: a meta-analysis. Hum. Brain Mapp.30, 829858. 10.1002/hbm.20547

  • 455

    Van OverwalleF.BaetensK. (2009). Understanding others’ actions and goals by mirror and mentalizing systems: a meta-analysis. NeuroImage48, 564584. 10.1016/j.neuroimage.2009.06.009

  • 456

    VergozV.RousselE.SandozJ.-C.GiurfaM. (2007). Aversive learning in honeybees revealed by the olfactory conditioning of the sting extension reflex. PLoS One2:e288. 10.1371/journal.pone.0000288

  • 457

    VianaD. S.GordoI.SucenaÉ.MoitaM. A. (2010). Cognitive and motivational requirements for the emergence of cooperation in a rat social game. PLoS One5:e8483. 10.1371/journal.pone.0008483

  • 458

    VindasM. A.JohansenI. B.Vela-AvituaS.NørstrudK. S.AalgaardM.BraastadB. O.et al. (2014). Frustrative reward omission increases aggressive behaviour of inferior fighters. Proc. R. Soc. B Biol. Sci.281:20140300. 10.1098/rspb.2014.0300

  • 459

    VinepinskyE.CohenL.PerchikS.Ben-ShaharO.DonchinO.SegevR. (2018). Representation of borders and swimming kinematics in the brain of freely-navigating fish. bioRxiv [Preprint]. 10.1101/291013

  • 460

    VinepinskyE.CohenL.PerchikS.Ben-ShaharO.DonchinO.SegevR. (2020). Representation of edges, head direction, and swimming kinematics in the brain of freely-navigating fish. Sci. Rep.10:14762. 10.1038/s41598-020-71217-1

  • 461

    VredeveldtA.HitchG. J.BaddeleyA. D. (2011). Eyeclosure helps memory by reducing cognitive load and enhancing visualisation. Mem. Cogn.39, 12531263. 10.3758/s13421-011-0098-8

  • 462

    WalkerM. M. (1997). Magnetic orientation and the magnetic sense in arthropods. EXS84, 187213. 10.1007/978-3-0348-8878-3_7

  • 463

    WallachA.Harvey-GirardE.JunJ. J.LongtinA.MalerL. (2018). A time-stamp mechanism may provide temporal information necessary for egocentric to allocentric spatial transformations. eLife7:e36769. 10.7554/eLife.36769

  • 464

    WalterI.SeebacherF. (2009). Endothermy in birds: underlying molecular mechanisms. J. Exp. Biol.212, 23282336. 10.1242/jeb.029009

  • 465

    WaltonM. E.BannermanD. M.AlterescuK.RushworthM. F. (2003). Functional specialization within medial frontal cortex of the anterior cingulate for evaluating effort-related decisions. J. Neurosci.23, 64756479. 10.1523/JNEUROSCI.23-16-06475.2003

  • 466

    WarnekenF.ChenF.TomaselloM. (2006). Cooperative activities in young children and chimpanzees. Child Dev.77, 640663. 10.1111/j.1467-8624.2006.00895.x

  • 467

    WarnekenF.TomaselloM. (2006). Altruistic helping in human infants and young chimpanzees. Science311, 13011303. 10.1126/science.1121448

  • 468

    WeberB. J.HuettelS. A. (2008). The neural substrates of probabilistic and intertemporal decision making. Brain Res.1234, 104115. 10.1016/j.brainres.2008.07.105

  • 469

    WehnerR.BoyerM.LoertscherF.SommerS.MenziU. (2006). Ant navigation: one-way routes rather than maps. Curr. Biol.16, 7579. 10.1016/j.cub.2005.11.035

  • 470

    WehnerR.MichaelB.AntonsenP. (1996). Visual navigation in insects: coupling of egocentric and geocentric information. J. Exp. Biol.199, 129140. 10.1242/jeb.199.1.129

  • 471

    WelbornB. L.LiebermanM. D. (2015). Person-specific theory of mind in medial pFC. J. Cogn. Neurosci.27, 112. 10.1162/jocn_a_00700

  • 472

    WennerA. M.WellsP. H. (1990). An Anatomy of a Controversy.New York, NY: Columbia University Press.

  • 473

    WenzelJ. M.OlesonE. B.GoveW. N.ColeA. B.GyawaliU.DantrassyH. M.et al. (2018). Phasic dopamine signals in the nucleus accumbens that cause active avoidance require endocannabinoid mobilization in the midbrain. Curr. Biol.28, 1392.e51404.e5. 10.1016/j.cub.2018.03.037

  • 474

    WernickeC. (1874). Der aphasische Symptomencomplex: eine psychologische Studie auf anatomischer Basis: Wernicke, Carl, 1848–1905: Free Download, Borrow and Streaming. Internet Archive. Available online at: https://archive.org/details/b24763445. Accessed April 5, 2021.

  • 475

    WhiteM. G.PanickerM.MuC.CarterA. M.RobertsB. M.DharmasriP. A.et al. (2018). Anterior cingulate cortex input to the claustrum is required for top-down action control. Cell Rep.22, 8495. 10.1016/j.celrep.2017.12.023

  • 476

    WhoriskeyF. G. (1991). Stickleback distraction displays: sexual or foraging deception against egg cannibalism?Anim. Behav.41, 989995. 10.1016/s0003-3472(05)80637-2

  • 477

    WilkinsonA.ChanH.-M.HallG. (2007). Spatial learning and memory in the tortoise (Geochelone carbonaria). J. Comp. Psychol.121, 412418. 10.1037/0735-7036.121.4.412

  • 478

    WilkinsonA.HuberL. (2012). “Cold-blooded cognition: reptilian cognitive abilities,” in The Oxford Handbook of Comparative Evolutionary Psychology, eds VonkJ.ShackelfordT. K. (Oxford University Press), 129143. 10.1093/oxfordhb/9780199738182.013.0008

  • 479

    WilkinsonA.KuenstnerK.MuellerJ.HuberL. (2010). Social learning in a non-social reptile (Geochelone carbonaria). Biol. Lett.6, 614616. 10.1098/rsbl.2010.0092

  • 480

    WojciechP. (2009). Curiosity and Information Seeking in Animal and Human Behavior. Irvine, CA: BrownWalker Press. Available online at: https://www.researchgate.net/publication/264422629_Curiosity_and_Information_Seeking_in_Animal_and_Human_Behavior. Accessed April 5, 2021.

  • 481

    WongP.KaasJ. H. (2009). An architectonic study of the neocortex of the short-tailed opossum (Monodelphis domestica). Brain Behav. Evol.73, 206228. 10.1159/000225381

  • 482

    WoodardW. T.BittermanM. E. (1973). Pavlovian analysis of avoidance conditioning in the goldfish (Carassius auratus). J. Comp. Physiol. Psychol.82, 123129. 10.1037/h0033820

  • 483

    WunschG. (1994). Theories, models, and data. Demografie36, 2029.

  • 484

    YangH.-W.LemonR. N. (2003). An electron microscopic examination of the corticospinal projection to the cervical spinal cord in the rat: lack of evidence for cortico-motoneuronal synapses. Exp. Brain Res.149, 458469. 10.1007/s00221-003-1393-9

  • 485

    YeatmanJ. D.DoughertyR. F.RykhlevskaiaE.SherbondyA. J.DeutschG. K.WandellB. A.et al. (2011). Anatomical properties of the arcuate fasciculus predict phonological and reading skills in children. J. Cogn. Neurosci.23, 33043317. 10.1162/jocn_a_00061

  • 486

    YinB.MeckW. H. (2014). Comparison of interval timing behaviour in mice following dorsal or ventral hippocampal lesions with mice having δ-opioid receptor gene deletion. Philos. Trans. R. Soc. Lond. B Biol. Sci.369:20120466. 10.1098/rstb.2012.0466

  • 487

    YokoiA.DiedrichsenJ. (2019). Neural organization of hierarchical motor sequence representations in the human neocortex. Neuron103, 1178.e71190.e7. 10.1016/j.neuron.2019.06.017

  • 488

    YonemitsuF.SasakiK.GobaraA.KosugiK. E.YamadaY. (2018). Close and ye shall find: eye closure during thinking enhances creativity. arXiz [Preprint]. 10.31234/osf.io/uq6mg

  • 489

    YuanX.-L.ChenZ.XiaoS.-H.ZhouC.-M.HuaH. (2011). An early ediacaran assemblage of macroscopic and morphologically differentiated eukaryotes. Nature470, 390393. 10.1038/nature09810

  • 490

    ZarcoW.MerchantH.PradoL.MendezJ. C. (2009). Subsecond timing in primates: comparison of interval production between human subjects and rhesus monkeys. J. Neurophysiol.102, 31913202. 10.1152/jn.00066.2009

  • 491

    ZentallT. R.SingerR. A.StagnerJ. P. (2008). Episodic-like memory: pigeons can report location pecked when unexpectedly asked. Behav. Processes79, 9398. 10.1016/j.beproc.2008.05.003

  • 492

    ZhangY.PaikJ.PirolliP. (2015). Reinforcement learning and counterfactual reasoning explain adaptive behavior in a changing environment. Top. Cogn. Sci.7, 368381. 10.1111/tops.12143

  • 493

    ZhangS.XuM.KamigakiT.Hoang DoJ. P.ChangW.-C.JenvayS.et al. (2014). Long-range and local circuits for top-down modulation of visual cortex processing. Science345, 660665. 10.1126/science.1254126

Summary

Keywords

evolutionary neuroscience, evolutionary neuroanatomy, vertebrate brain evolution, mammalian brain evolution, primate brain evolution, bilaterian brain, brain evolution

Citation

Bennett MS (2021) Five Breakthroughs: A First Approximation of Brain Evolution From Early Bilaterians to Humans. Front. Neuroanat. 15:693346. doi: 10.3389/fnana.2021.693346

Received

10 April 2021

Accepted

13 July 2021

Published

17 August 2021

Volume

15 - 2021

Edited by

Juan F. Montiel, Diego Portales University, Chile

Reviewed by

Hans A. Hofmann, University of Texas at Austin, United States; Jon I. Arellano, Yale University, United States

Updates

Copyright

*Correspondence: Max S. Bennett

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics