Convergent and divergent neural circuit architectures that support acoustic communication

Kelley, Darcy B.

doi:10.3389/fncir.2022.976789

SYSTEMATIC REVIEW article

Front. Neural Circuits, 17 November 2022

Volume 16 - 2022 | https://doi.org/10.3389/fncir.2022.976789

Convergent and divergent neural circuit architectures that support acoustic communication

DB
Darcy B. Kelley ^*

Department of Biological Sciences, Columbia University, New York, NY, United States

Article metrics

View details

Citations

Views

1,4k

Downloads

Abstract

Vocal communication is used across extant vertebrates, is evolutionarily ancient, and been maintained, in many lineages. Here I review the neural circuit architectures that support intraspecific acoustic signaling in representative anuran, mammalian and avian species as well as two invertebrates, fruit flies and Hawaiian crickets. I focus on hindbrain motor control motifs and their ties to respiratory circuits, expression of receptors for gonadal steroids in motor, sensory, and limbic neurons as well as divergent modalities that evoke vocal responses. Hindbrain and limbic participants in acoustic communication are highly conserved, while forebrain participants have diverged between anurans and mammals, as well as songbirds and rodents. I discuss the roles of natural and sexual selection in driving speciation, as well as exaptation of circuit elements with ancestral roles in respiration, for producing sounds and driving rhythmic vocal features. Recent technical advances in whole brain fMRI across species will enable real time imaging of acoustic signaling partners, tying auditory perception to vocal production.

The evolution of vocal communication in tetrapod vertebrates; Introduction and overview

Acoustic communication plays an essential role in social behaviors of many species. In tetrapod vertebrates (Figure 1), both the cries of infants and the songs used in courtship are the result of neural circuit activity that drives muscles interposed between the lungs and the mouth. Sensory, CNS and motor systems that support innate, species-specific vocal communication reflect heritable genetic differences over evolutionary times scales. For example, crying is an innate behavior in infants (deaf babies cry), a key component of social interactions in our species. As human listeners misinterpret distress levels conveyed by tempo and pitch in cries from other primate species (bonobo and chimpanzee, Kelly et al., 2017), baby Homo neanderthalis and sapiens cries were probably species-specific. Courtship songs in other tetrapods are also typically innate, with only a few exceptions, most notably songbirds (Jarvis, 2019). Producing and recognizing different innate vocalizations (e.g., call types in birds; crying, sighing, laughter in H. sapiens) is essential for social communication (Simonyan and Horwitz, 2011; Rose et al., 2022).

Figure 1

Acoustic communication is ancient in tetrapods and was maintained over long periods: (Chen and Wiens, 2020; Jorgewich-Cohen et al., 2022). How did the neural circuit architectures that support the production and perception of songs evolve? Figure 1 illustrates extant species that serve as models for the neural bases of vocal communication and their evolutionary histories. This review aims to compare CNS circuits that produce and respond to innate vocalizations in these models to identify conserved and divergent features across evolutionary time scales.

Anurans (frogs and toads) are among the most ancient acoustic communicators, appearing in the fossil record ~270 mya (Figure 1). Within the Anura, the terrestrial Neobatrachians (e.g. Ranids) emerged from a world-wide extinction event and underwent massive radiations at the KT boundary (~68 mya; Feng et al., 2017). The Archebatrachians, in contrast, neither became extinct, nor radiated massively, but instead gave rise to aquatic anurans, the Xenopodinae, exceptionally well-represented in the fossil record (Cannatella, 2015) as well by 29 extant species (Evans et al., 2015). Xenopus laevis from South Africa was adopted by biologists in the 1800s for studies in experimental ethology, development, endocrinology, cell biology, and neurobiology (Wallingford, 2022). Males in all 29 species communicate vocally (Tobias et al., 2011), as do females, although one species group (A) has lost the female release call (Tobias et al., 2014).

Xenopus are secondarily aquatic (derived from terrestrial ancestors) and vocalizations are produced by a larynx modified for underwater sound production to produce sounds without airflow (Kwong-Brown et al., 2019). The neural circuits that support sex- specific acoustic communication have been elucidated and hindbrain neurons responsible for species-specific song rhythms identified [reviewed in Kelley et al. (2020)]. Identifying the genetic basis of the production and reception of species-specific vocal signals in Xenopus (clawed frogs) is our current research focus.

Placental mammals are also ancient acoustic communicators (~80 mya; Figure 1). The Rodentia—having diverged from langiomorphs (rabbits) ~85 mya—include many vocal genera (Mus, Scotinomys, Rattus, Heterocephalus). Mice (Mus) have dominated investigations of rodent ultrasonic vocalizations (USVs), due in part to the genetic advantages of specific laboratory strains. The Chiroptera (bats)—another highly vocal group—diverged from other mammals ~75 mya (Agnarsson et al., 2011). In some species, bats use vocalizations in both social communication and prey capture (echolocation). Among Hominids, the ancestor of our own species (H. sapiens) is recent (~200–250 kya; Vidal et al., 2022). Homo sapiens is the sole extant species and we do not know whether, for example, H. neanderthalensis spoke or sang. In all mammals, the larynx is the major organ of vocal expression. Sounds are powered by expiration and shaped acoustically by the vocal tract (Milsom et al., 2022).

Birds evolved from Archosaurs (dinosaurs) ~240 MYA and bird species radiated ~60 mya, again reflecting the worldwide extinction event at the K-T boundary. All extant birds communicate vocally (Figure 1), suggesting that ancestral dinosaurs sang as well. Avian behaviors were a focus of early ethologists (e.g., Lorentz and Tinbergen) and the discovery of geographical dialects in some species, provided experimental model systems for vocal motor learning. The zebra finch (Figure 1, lower left), Taeniopygia guttata, is currently the “lab rat” for the study of bird song neural circuits. While song control nuclei in the forebrain are not homologs of mammalian cortical areas that participate in acoustic communication, they exhibit convergent neural circuit architectures including, for example, a role for dopamine in song learning (Gadagkar et al., 2016).

I begin this review by examining neural circuit mechanisms that receive and generate species-specific Xenopus songs and then compare shared and divergent circuit motifs with other vocal vertebrates. As invertebrates are also prominent acoustic communicators. I conclude by comparing circuit motifs in tetrapods to acoustic communication in two invertebrates—fruit flies and Hawaiian crickets.

Phylogeny of vocal signaling in Xenopus

Understanding how the nervous system generates and responds to vocal signals and how circuit architectures diverge evolutionarily ideally requires a multispecies genus that communicates vocally, in which both the neural circuits that generate vocalizations—and those that respond to socially relevant sounds—can be mapped, characterized and compared electrophysiologically and anatomically, and the underlying genetic architectures of key neurons identified. Neural circuits are constructed developmentally and thus easy access to the nervous system at all developmental stages is advantageous. These features are all prominent in Xenopus, the focus of our experimental studies for many decades (see Kelley et al., 2020).

Each species of Xenopus can be identified definitively from the temporal and spectral features of male advertisement calls (Figure 2B; Tobias et al., 2011; Evans et al., 2015). A major experimental advantage of Xenopus are the ex vivo preparations: the vocal organ and the brain that “sing in the dish.” The ex vivo larynx generates fictive songs (sound pulse patterns) in response to stimulation of attached laryngeal nerves that mimics call patterns (Figure 2); spectral features are identical to actual calls (Tobias and Kelley, 1987). The ex vivo brain generates fictive songs, patterned laryngeal activity nerve that matches actual vocalizations, in response to application of the neuromodulator serotonin (Rhodes et al., 2007). Spectral features are thus created within the larynx without air flow while temporal features are generated by neural circuits within the CNS (Luksch et al., 1996; Kelley et al., 2020). Application of fluorescent dextran amines to the ex vivo brain allows visualization of neurons within specific brain nuclei that receive auditory input as well as the auditory vocal interface and components of the vocal motor pathways, including axonal trajectories (reviewed in Kelley et al., 2020). Sex-specific characteristics of these neural circuits and of the larynx are due to hormone-regulated development (Zornik and Kelley, 2011). These ex vivo preparations allow us to experimentally determine which hormonally regulated, sexually differentiated features are organ-autonomous and which reflect brain-larynx interactions.

Figure 2

How Xenopus communicate

Recordings from a South African pond across the breeding season together with laboratory studies (Tobias et al., 2004) reveal a rich vocal repertoire specific to social context and sex in Xenopus laevis, the most widely studied species (Figure 3).

Figure 3

How Xenopus make sounds

In most tetrapods, sounds are powered by expiration of air from the lungs driving vibrations of the vocal folds (Ghazanfar and Rendall, 2008; Fitch and Suthers, 2016) or—in birds—of the internal tympaniform membranes (Elemans, 2014). In mice, for example, ultrasonic vocalizations are produced by high velocity air jets that power vocal fold vibrations (Håkansson et al., 2022). CNS respiratory and vocal circuitry must thus be closely linked.

However, the ability of the ex vivo Xenopus larynx to create sounds in the absence of air flow from the lungs, together with observations that these sounds are not shifted in frequency by heliox (Yager, 1982; Kwong-Brown et al., 2019), suggested that Xenopus do not use air flow to power their underwater songs. Instead, sounds are created by rapid separation of intra-laryngeal arytenoid cartilage disks (Figure 4), creating vibrations of the entire body (Kwong-Brown et al., 2019). Sounds are propagated effectively underwater because of impedance matching; the body is mostly water and the medium is water. In vivo, vibrations can be recorded from the entire body, including a single digit. The frog's body thus serves as a “loudspeaker.” This novel mechanism for anuran vocal production allowed Xenopus to retain ancestral, terrestrial frog vocal signaling (Feng et al., 2017) during underwater social interactions.

Figure 4

The ability to evoke sex- and species-typical sounds from the ex vivo Xenopus larynx (Figure 4) reveals that, unlike mammals and birds, in which respiration paces sound production and the CNS controls sound frequencies via the vocal tract (Matzinger and Fitch, 2021), the spectral features of Xenopus vocalizations are intrinsic to the larynx.

When males sing, neural activity that closely corresponds to actual male and female calls is recorded en passant from the laryngeal motor nerve (Yamaguchi and Kelley, 2000; Figure 5). Tightly synchronized Compound Action Potentials (CAPs) recorded from the nerve match the temporal pattern of simultaneously recorded underwater songs across sexes and species (fictive singing: Leininger and Kelley, 2013; Barkan et al., 2018). The temporal features of Xenopus songs are generated within the CNS.

Figure 5

How Xenopus hear sounds

Underwater sound waves produce vibrations of the Xenopus tympanic disk (Christensen-Dalsgaard and Elepfandt, 1995). The disk is located just behind the eye, under the skin (Figure 6). The stapes (a middle ear bone) inserts into the disk proximally and abuts the oval window distally (Mason et al., 2009). An air-filled cavity connects the two tympanic disks which thus function together as a pressure receiver.

Figure 6

As in other anurans, the inner ear includes an amphibian and a basilar papilla innervated by fibers of the eighth cranial nerve that arise from neuronal cell bodies in the acoustic ganglion (Homma et al., 2022), and whose terminals innervate post-synaptic neurons in the dorsal medullary nucleus (DMN, Figure 7; Kelley, 1980; Paton et al., 1982). Within the inferior colliculus of the midbrain (ICo), neurons in the laminar nucleus respond to calls (Elliott et al., 2011) and are rate-tuned to temporal properties of specific calls, as is the case for other anurans (Edwards et al., 2007). Song playbacks also activate the nucleus of the lateral line (NLL) and the principal nucleus (P) of the inferior colliculus (Kelley, 1980). Both project to the central nucleus of the thalamus (CT) nucleus (also illustrated below in Figure 7), suggesting that underwater sound waves are also detectable by the lateral line system.

Figure 7

How the CNS generates Xenopus vocal patterns

When serotonin is applied to the isolated brain of males and females (Figure 7A), compound action potentials (CAPS) recorded from the laryngeal nerve (Figure 7A) match male- and female-specific vocal patterns (Rhodes et al., 2007). These patterns are called “fictive calling.” The fast trill portion of the fictive male advertisement call is driven by a rhythmic local field potential produced by neurons in the parabrachial nucleus (PB). The PB is a central pattern generator for advertisement calling.

Anterograde and retrograde mapping in ex vivo male and female brains—using fluorescent dextran amines thar travel both anterograde and retrograde—reveal components of the neural circuits that generate vocal patterns (Figure 8). Vocal motor neurons occupy caudal Nucleus Ambiguus, NA. Glottal motor neurons (the glottis is closed during calling), commissural interneurons, and neurons projecting bilaterally to the parabrachial nucleus (PB) occupy anterior Nucleus Ambiguus (antNA). Neurons in PB project throughout NA (shading in Figure 8B), both ipsilaterally and contralaterally, as well as reciprocally. Serotonergic neurons in the rostral Raphe, pars dorsalis (rRpd; Ra in Figure 8) project contralaterally to each other and ipsi- and contralaterally to vocal motor nuclei including the periaqueductal gray (PAG), PB, and NA (Brahic and Kelley, 2003; see also Figure 10C). Two forebrain nuclei (the Central nucleus of the Amygdala, CeA) and the Bed Nucleus of the Stria Terminalis (BNST) project to their contralateral counterparts as well as to Ra and PB. The resulting pattern of connectivity (Figure 8B) is highly recurrent and bilateral, insuring effective simultaneous contraction of laryngeal muscles required to produce a sound pulse (Figure 4).

Figure 8

Reproductive state; Hormones and behavior

In native ponds, the behaviors illustrated in Figure 3—calling and clasping—are seasonal and depend on reproductive state. A sexually reproductive state can be induced in the laboratory by injection of human chorionic gonadotropin (HCG). Embryos can thus be generated at any time of year, greatly facilitating discoveries in developmental, cell and molecular biology (reviewed in Wallingford, 2022) as well as powering discoveries in neurodevelopmental disorders (e.g. Willsey et al., 2021).

The behavioral effects of HCG on male calling are due to gonadotropin itself, to direct effects on neurons expressing gonadotropin receptors in the CeA (Yang et al., 2007), as well as to evoking increased synthesis and release into the circulatory system of gonadal steroids (androgens and estrogens) that activate neurons in the CNS. Sexually unreceptive or ovariectomized females respond to male clasping with leg extension and ticking (Figure 3F) while gonadotropin-injected intact females respond with leg flexion (Figure 3E). Castration abolishes male clasping and calling, behaviors reinstated by androgen treatment (Kelley and Pfaff, 1976; Wetzel and Kelley, 1983). Androgen effects on calling include activating vocal motor neurons as well as their inputs from the parabrachial nucleus. On the auditory side, gonadal hormones effects include direct action on androgen receptor expressing neurons in the acoustic ganglion in the periphery (Kelley, 1981), in the auditory midbrain (Kelley, 1980) and in the CeA of the ventral forebrain, where auditory input and pre-motor output intersect (Hall et al., 2013). Similar patterns of hormone receptor expression are found across other vertebrates (see Figure 10).

In summary, innate acoustic communication in Xenopus is characterized by species-specificity, a large vocal repertoire, pronounced sexual differences due to secretion of gonadal hormones and male/female, male/male duetting. These features reflect the preeminence of acoustic signals in turbid aquatic habitats over the evolutionary time scales (~ 170 mya, Feng et al., 2017) since the Pipoidae diverged from terrestrial anurans.

Neural circuit architecture underlying social vocalization in tetrapods

Across phyla (Figure 1), acoustic communication is closely associated with nocturnal species (Chen and Wiens, 2020). Bats, for example, are among the most vocal species, using sound both for locating prey (echolocation) and for social interactions within the roost (Kanwal, 2009). Male sac winged bats (Saccopteryx bilineta) produce complex courtship songs directed toward the females in their harems at dawn and dusk, sandwiched between territorial songs (Behr and von Helversen, 2004). The time of day that vocal species are active can vary. Within rodents (~40% of mammalian species), mice (Mus) and rats (Rattus) are nocturnal while other genera (e.g., Scotinomys Neotropical mice), are diurnal (see below). Vocal communication is also prominent in subterranean genera such as naked mole rats (Heterocephalus glaber, Credner et al., 1997). Anurans (frogs and toads) are also nocturnal and vocal.

In contrast, birds and humans—perhaps the most highly vocal groups—are predominantly diurnal. Most birds sing as the sun first rises and throughout the day. Even night songsters -i.e., nightingales—join in the dawn chorus (Amrhein et al., 2002). Though humans are diurnal, for most of our evolutionary history, visual cues were not available at night; essential social cues (e.g., your own baby's cry) were vocal. Reflecting diurnal activity, in birds and humans social communication is multimodal; visual cues can shape auditory perception. In humans, watching sound production changes what is heard (the “McGurk effect,” Alsius et al., 2018). In songbirds (zebra finches: Taeniopygia guttata), visual signals from conspecifics influence the activity of auditory neurons (George et al., 2011).

As for vocal behaviors, the neural circuits that support acoustic communication in tetrapods leave no trace in the fossil record. We can however compare circuit architectures across vocal vertebrates to determine which features are shared and which are specific to a particular group. For this comparison I've chosen three mammals: a bat (Pteronotus parnelli) and two rodents: Alston's singing mice (Scotinomys teguina) as well as mice (Mus musculis), vocal species with well-characterized repertoires and CNS vocal circuits. As for humans, features of acoustic communication in some species of birds are learned (Jarvis, 2019). Zebra finches and related finches that also learn their songs provide the opportunity to compare circuit motifs across wide phylogenetic distances (Figure 1) as well as providing insight into how acoustic experience and feedback can modify brain circuitry more generally.

Bats

Bats diverged from other Laurasiatherians ~70 mya (Doronina et al., 2017) and comprise ~20% of extant mammalian species. In mustached bats, Pteronotus parnelli, adults of both sexes, as well as pups, vocalize during social encounters. Nineteen syllable types are distinguishable acoustically and each is associated with specific social interactions. Ultrasonic vocalizations (USVs), used to locate prey, are also employed during social behaviors. Auditory cortex neurons tuned for echolocation contribute to recognition of USVs used to interact socially at the roost (Washington and Kanwal, 2008). CNS nuclei that support acoustic communication are depicted in Figure 9 (Kanwal et al., 2013; Kanwal, 2021).

Figure 9

CNS vocal circuitry: In P. parnelli, mid- and hindbrain neural regions involved in vocal communication include the nucleus ambiguus (laryngeal motor neurons), the reticular formation, the PAG and the PB. Stimulating the CeA evokes agonistic vocalizations (Ma and Kanwal, 2014) and social calls evoke neural activity (Naumann and Kanwal, 2011). Components of the neural circuitry supporting acoustic communication are also responsive to affective and reproductive states (Salles et al., 2019). The distribution of oxytocinergic and vasopressinergic neurons has been mapped (Rao and Kanwal, 2004) and includes forebrain nuclei, such as the CeA. Regions expressing receptors for gonadal hormones such as estrogens and androgens have not been mapped to date. Because expiration drives mammalian vocalizations, in bats that vocalize with open mouths, the activity of muscles such as the diaphragm, the jaw and the tongue must be coordinated, as in Scotinomys (see following section) but has not yet been described.

Rodents

Muroid rodents (rats and mice) comprise ~40% of extant mammalian species and diverged from a common ancestor with lagomorphs ~75 MYA (Churakov et al., 2010). Alston's singing mice, Scotinomys teguina, are neotropical, diurnal Cricitine rodents, whose evolutionary divergence was more recent (~7 MYA; Marshall, 1979). As for Pteronotus, in all three genera both pup and adult vocalizations are associated with social interactions. A dramatic example in adults is the post-ejaculatory 22 kHz song of male rats (Barfield and Geyer, 1972). Mouse pups produce USV vocalizations when away from the nest that elicit maternal retrieval and both sexes vocalize as the male chases the female before mating (Portfors and Perkel, 2014). In Scotinomys, both sexes also vocalize during social interactions associated with mate attraction and competition. Songs are acoustically indistinguishable, although male songs are longer (Banerjee et al., 2019). Pairs of males alternate their songs precisely: “turn taking” [see Vanderhoff and Bernal Hoverud (2022) for a discussion of duetting, turn taking, and antiphony]. When one male is introduced into the cage of another, vocalization is stimulated in both, but always ends with the more variable song of the introduced male (Banerjee et al., 2019).

CNS vocal circuitry: A recent approach to identify brain regions that participate in acoustic communication in mammals is injecting pseudorabies virus (PRV) into vocal muscles and then following transneuronal (retrograde) spread at successive intervals. This PRV approach identifies CNS nuclei that participate in vocal production (Figures 10A,B) and can be combined with monosynaptic anterograde or retrograde tracers to map connectivity (Figure 10A).

Figure 10

Mice Arriaga and Jarvis (2013) injected PRV into two laryngeal muscles (CT and CA) resulting (90 h post-PRV injection) in ipsilateral labeling of neurons in Amb (Figure 10A). Injecting BDA into regions of motor cortex—in which neurons express immediate early genes after mice produce USVs—reveals a sparse, apparently monosynaptic, input from M1 onto laryngeal motor neurons (back labeled with a retrograde tracer: cholera toxin). These observations suggest that mouse cortex can directly influence vocal motor neurons.

Scotinomys Alston's singing mice (Figure 10B) vocalize with open mouths; movements of jaw muscles must be coordinated with vocal circuits. Injecting PRV into both jaw and laryngeal muscles—and mapping virus-infected neurons up to 96 h post injection—outlines a set of CNS vocal nuclei (Figure 10B) that includes Amb, PB, PAG, CeA, and orofacial motor cortex (OMC). Stimulating OMC in a male during vocal turn taking with another male pauses his song sequence which then resumes at the pause point. Cooling the OMC elongates the song by adding additional notes, slowing song progression. The OMC appears to coordinate male/male singing rather than driving vocal motor production (Okobi Jr et al., 2019). The function of the sparse M1/M2 motor cortex projection to laryngeal motor neurons in mice is not known.

Anurans

Hindbrain components of CNS circuitry that drive vocal production in frogs (including the PB) were first identified by Schmidt (1976). In X. laevis, fluorescent dextran amines applied to the ex vivo brain travel both anterograde (labeled fibers and terminal fields) and retrograde (labeled neuronal cell bodies). We used this approach (originally described by Luksch et al., 1996) to identify a projection from the CeA to the pontine parabrachial nucleus (PB: Figure 10C) as well as input to the CeA from auditory thalamus (CT: Figure 7; Hall et al., 2013). In Xenopus neurons that drive laryngeal muscles occupy Amb which receives input from the periaqueductal gray (PAG), a brain region recently proposed as a key node for courtship displays across vertebrates (Schwark et al., 2022). The Xenopus PB is reciprocally connected to the PAG in the midbrain as well as to the CeA in the forebrain (Figure 10C).

Microstimulation of the CeA in the ex vivo brain evokes “fictive calling” in adult males (Hall et al., 2013) as well as females (Ballagh, 2014). The Xenopus PAG is reciprocally connected to the PB, identified as the central pattern generator (CPG) for the male advertisement call (Rhodes et al., 2007). PB neurons are intrinsically rhythmically active (Barkan et al., 2018). As the evolution of the cerebral cortex is evolutionarily recent (Striedter and Northcutt, 2019), a projection to the hindbrain vocal circuit is not expected in Xenopus. A projection from dorsal forebrain is present in songbirds (Figure 11).

Figure 11

Comparing neural circuit motifs in Pteronotus (Figure 9), Mus, Scotinomys, and Xenopus (Figure 10) reveals shared hindbrain, midbrain and forebrain components of vocal production circuitry, including the nucleus ambiguus, the parabrachial nucleus, the periaqueductal gray and the central nucleus of the amygdala. Conservation of these neural circuit motifs supports an ancient origin for tetrapod vocal circuits.

Birds

Many birds are accomplished songsters and—in some species—males and females duet (Kingsley et al., 2018; Riebel et al., 2019). Zebra finches (Taeniopygia guttata, Figure 1) are the most widely studied species; they are readily bred and maintained in the laboratory as are related species such as Bengalese finches. Both male and female zebra finches produce unlearned vocalizations—calls—to locate other adult conspecifics (e.g., the distance call, Elie and Theunissen, 2018). Within the nest, parent zebra finches employ soft calls to coordinate chick care (Elie et al., 2010). As in other tetrapods, song is powered by expiration (Suthers et al., 1999).

Two aspects of vocal production, however, are avian-specific, presumably reflecting Arcosaur ancestry (Figure 1). While birds have a larynx and vocal tract (including the tongue), the spectral features of their vocalization are shaped by the syrinx, the avian specific vocal organ (Kingsley et al., 2018), (see Albersheim-Carter et al., 2016). In contrast to calls, male courtship songs are learned by young birds in a sequence that resembles human speech acquisition (Thorpe, 1954; Doupe and Kuhl, 1999). Song learning is supported by specialized forebrain nuclei that shape vocal production to reflect auditory experience. Forebrain neural circuits, notably the auditory recipient nucleus (Field L and its subnuclei), as well as the vocal efferent nuclei (HVc and RA, Nif and Av; Figure 11) are not homologs of mammalian auditory and motor cortex (in humans, Wernicke and Broca's area; see Mooney, 2022) although neural circuit motifs do resemble those of mammals (Calabrese and Woolley, 2015). Motifs shared between songbirds with vocal learning and human primates can thus provide insight into convergent principles of neural circuit formation and modifications that support vocal learning.

The bird forebrain auditory-recipient nucleus (Field L in the neostriatum) consists of several interconnected sub-regions (L1-3) whose circuit architecture resembles processing in mammalian auditory cortex (Calabrese and Woolley, 2015), a striking example of evolutionary convergence. Neurons in Field L project to a strip of tissue, originally termed the “shelf” (Kelley and Nottebohm, 1979) immediately ventral to the neostriatal nucleus, HVC (higher vocal center). Axons of HVC neurons travel ventrally to nucleus robustus in the archistriatum (RA) forming an encircling cup (Nottebohm et al., 1982). RA neurons innervate motor neuron pools in the hindbrain that drive respiratory and syringeal muscles. Breaths and “mini-breaths” (brief inspiration bout within a sound-producing expiration) control patterns of vocal expression via expulsion of air from the lungs (Wild et al., 1998), much as in the patterning of cries in mouse pups described below.

Another conserved feature across species with vocal learning is the role of dopamine and the basal ganglia (LMAN and Area X). The young bird “evaluates” the match between a learned song and his own match to that song, linking motor output to its acoustic consequences (Gadagkar et al., 2016; Mooney, 2020). This match is mediated by convergence between inputs to HVc from Nif and midbrain dopaminergic inputs (Tanaka et al., 2018). Dopamine (and the basal ganglia more generally) is implicated in motor patterning across vertebrates (Grillner and Robertson, 2016; Suryanarayana et al., 2022). This widely conserved feature appears to have been exapted for both the modification of vocal circuits essential for language learning in humans and song learning in birds. A recent paper that employed cutaneous sensory stimuli to shape vocal production rather than auditory feedback, also identified a role for dopamine in vocal learning (McGregor et al., 2022). A role for dopamine in reproductive-state dependent odor preferences has recently been identified in Drosophila as well (Boehm et al., 2022). Taken together these observations suggest that the ability of dopamine to shape neural circuitry is highly conserved.

Central pattern generators and vocalization

As discussed below, a vocal CPG that patterns mouse pup cries has recently been identified in the inferior reticular formation (Wei et al., 2022). In Xenopus, the parabrachial nucleus (PB) is a CPG for the male advertrisement call [reviewed in Kelley et al. (2020)]. When the ex vivo Xenopus brain is exposed to serotonin, fictive advertisement calling CAPs recorded from the laryngeal nerve coincide with a pronounced local field potential recorded from the PB (Rhodes et al., 2007). Transection at various levels of the CNS as well as cooling studies confirm the role of the PB as a vocal CPG. PB neurons retain their intrinsic rhythmicity in the ex vivo brain even when isolated synaptically (Barkan and Zornik, 2019). A CPG that drives slow trill has not yet been identified but might correspond to PiCO, a proposed inspiratory CPG in mice (Anderson et al., 2016). While Xenopus vocal production is independent of respiration (Figure 4), a rhythmically active neural circuit element that functions to gate the inspiratory/expiratory transition in rats (Dutschmann and Herbert, 2006) functions in Xenopus as a CPG controlling vocal rhythms, suggesting exaptation of a respiratory circuit element present in the common ancestor of tetrapods. As discussed below, one class of neurons in the PB, the FTNs, differ intrinsically in rhythmicity across related species, opening a window into genetic divergence that supports speciation (Baker et al., 2019).

As bird songs are coordinated with respiration, one approach to finding a vocal CPG in birds is to identify the respiratory CPG. Wild (1997) described neurons in nucleus retroambiguus (Ram: Figure 11B) projecting to respiratory motor neurons in pigeon and songbirds. RAm efferents were also observed in the PB, rostroventral lateral medulla (RVL), caudal pons and in XIIts. Both RAm and PB have been considered candidate vocal pattern generating nuclei in songbirds. A recent review (Mooney, 2020) suggests instead that the songbird vocal CPG is located in a reticular nucleus, RVL. RVL drives activity of syringeal motor neurons but is gated by neurons in the caudolateral PAG. An alternative suggestion is that RVL coordinates activity of vocal motor neurons (as suggested for LRF in Scotinomys) while the homolog of PB contributes controls vocal patterning. If so, the origin of the PB as a vocal CPG could be evolutionarily ancient.

A recent study (Wei et al., 2022) sought to identify a vocal CPG in infant mouse pups by examining the neural circuits that generate USVs. In pups, a single large breath can be associated with either one or multiple cries. For multiples, each cry is accompanied by a smaller increases or decreases in airflow (resembling the “minibreaths” in canary and zebra finch songs, Hartley and Suthers, 1989; Wild et al., 1998). The authors predicted that this vocal pattern is generated by an intrinsically faster CPG that coordinates with the overall breathing pattern. Previous studies in mice have established that breathing is patterned by a inspiratory CPG that includes neurons in the preBotzinger nucleus (PBC). Blocking the activity of laryngeal TA and CT prevented cry production but not the minibreath pattern, suggesting separate CPGs for cry production and minibreaths. Interneurons innervating TA and CT motor neurons form three groups: rv-iRF (glutamatergic neurons), Botzinger and preBotzinger nuclei (gabaergic) and Nucleus Retroambiguus (mixed). Interneurons innervating both tongue motor neurons and TA motor neurons were also found in rv-iRF. Inactivating rv-iRF disrupted the interval between cry bouts as well as intervals within a bout, but not basal breathing. Brief optogentic stimulation of the rv-iRF produced cry bouts throughout the longer breath. Comparing activity patterns in brain slices that included the rv-iRF and the pre-Botzinger nucleus revealed a faster oscillation (every 6s as compared to 23s) in the former. These experiments provide strong evidence that the rv-iRF generates the pattern of pup cries. This vocal CPG provides input to preBotzinger neurons to drive inspiration (triggering minibreaths) and coordinates activity in the laryngeal TA and CT muscles that control glottal opening.

The IRO is also a candidate participant in patterning of the more complex courtship vocalizations of adult mice. At the behavioral level, the overall spectro-temporal features of male mouse USVs develop continuously from pup calls, stabilizing about 4 weeks later. A shared CPG might represent the “common biological mechanism” suggested by Castellucci et al. (2018). Regardless, the Wei et al. study provides an experimental blueprint for identifying candidate CPGs in adult mice as well as other rodents, such as rats and Scotinomys. Identification of IRO as a CPG for pup calls does not preclude the participation of other CPGs in patterning vocalizations. In cats, for example, neurons in the parabrachial nucleus (specifically the Kolliker-Fuse nucleus) are rhythmically active during inspiration, post-inspiration and expiration (Dick et al., 1993), providing a candidate vocal CPG (see also Hage, 2010 for discussion of primate vocal CPGs).

Reproductive state: Comparing CNS gonadal hormone receptor expression across vertebrates

Another highly conserved feature of CNS vocal circuitry is the expression of receptors for gonadal steroids (typically androgen in males and estrogen in females) in auditory and vocal neurons (shown for the androgen receptor in motor components of the vocal circuit in Scotinomys (nuclei in yellow: Figure 10B) and for Xenopus (nuclei in yellow and green, Figure 10C).

The capacity for synthesizing estrogen arose before the evolution of the ancestral ER (Eick and Thornton, 2011). Steroid hormone receptors are also evolutionarily ancient; derived from a single ancestral receptor that diverged from the nuclear receptor superfamily early in vertebrate evolution. These receptors diversified in the chordates; amphioxus has two: an ER and a member of the AR/PR/GR/MR family. The pipoidae (ancestral to modern pipids including Xenopus) emerged during the Jurassic ~170 mya and the genus Xenopus ~50 mya (Feng et al., 2017). The Rodentia diverged from a common ancestor with the Lagomorpa ~65 mya (Romanenko et al., 2012) and Scotinomys perhaps 7 mya (Fernández-Vargas et al., 2021). While characters related to reproductive signaling (such as sexual dimorphism) can be lost as well as gained evolutionarily (e.g., Leininger and Kelley, 2013), similarities in the distribution of androgen receptors in two evolutionarily very distant species (Xenopus and Scotinomys, see below) suggest an ancient role in the coordination of vocal signaling during reproduction.

Xenopus In X. laevis, androgen (acting synergistically with gonadotropin) controls male clasping (Kelley and Pfaff, 1976) and estrogen (acting synergistically with LHRH and gonadotropin) controls female receptivity (Kelley, 1982). Gonadectomy abolishes adult reproductive behaviors in both sexes. These hormones also participate in the control of vocal communication. Gonadal steroid receptors are expressed in brain regions implicated in acoustic communication; from the acoustic ganglion through to the larynx during both development and adulthood including laryngeal motor neurons (Kelley, 1980).

In females, but not in males, preferential auditory evoked potential responses to each species' dominant frequencies are abolished by ovariectomy and reinstated by androgen (Hall et al., 2013). Testosterone is the major circulating gonadal steroid in female Xenopus (Lutz et al., 2001) but can be converted to estrogen in situ by aromatase. Neurons within the CeA express estrogen (Figure 10C) and gonadotropin receptors (Yang et al., 2007). Gonadotropin synergizes with androgen to restore calling to castrated makes (Wetzel and Kelley, 1983). The CeA receives auditory input and is required for males to produce socially appropriate responses to female calls (Hall et al., 2013). Laryngeal motor neurons in NA express androgen receptor (Kelley, 1980). The vocal pattern generator (PB) includes neurons expressing androgen but not estrogen receptor (Figure 10C).

Bats as Kanwal points out: “There is a deep connection between hormones, the perception and production of social vocalizations, and behavior. Hormones-to-circuits-to-perception or production is a bi-directional process… hormones can modulate and set up either transient or long-lasting neural circuits for the processing, perception, and production of sounds, particularly those having social consequences” (Kanwal, 2021, p. 239). Neurons in the DSCF (Doppler-shifted constant frequency) region of P. parnelli respond both to echolocation and to social vocalizations (Washington and Kanwal, 2008) and processing is lateralized in males (but not females) with more responsive neurons in the left hemisphere (Kanwal, 2021), suggesting a sex difference likely to be driven by gonadal hormones. While the locations of gonadal hormone receptor expressing neurons have not yet been mapped, the bat CeA most likely shares this common vertebrate circuit motif. Current research on bat social communication is shifting to Carollia perspicillata, as this species is more readily maintained in breeding colonies and uses complex vocal interactions to communicate. Individual C. perspicillata have distinctive vocal signatures. Distress calls have been shown to activate neurons in the amygdala (Hechavarría et al., 2020). Mapping gonadal steroid hormone receptor distributions in C. perspicillata will be a useful test of evolutionary hypotheses.

Singing mice despite evolutionary divergence (Figure 1) circuit motifs for acoustic communication and AR expression in S. teguina share multiple features with other tetrapods, inclding Xenopus (Figure 11). Notably, in both species, vocal motor neurons in nucleus ambiguus and pre-motor neurons in the parabrachial nucleus express androgen receptor (yellow B; yellow and green, C). Neurons in the inferior colliculus of both species also express AR (not illustrated in B). While estrogen receptor expression has not been mapped in Scotinomys, ER is expressed in inferior colliculus and CeA of laboratory mice (Charitidi and Canlon, 2010) and is likely to also be expressed in auditory nuclei of singing mice.

Song birds As for anurans and rodents, androgen receptor expression is widespread in bird vocal control nuclei (Figure 11B); estrogen receptor however is limited to HVc (Frankl-Vilches and Gahr, 2018). In female white-crowned sparrows, circulating estrogen during the breeding season increases responses of auditory neurons in Field L (Figure 10A) (Caras et al., 2012) as well as immediate early gene expression in the social behavior network (Maney et al., 2008). Auditory responses to song have also been recorded in the bird homolog of the amygdala, nucleus taenia (Fujii et al., 2016). Hormonal regulation of both sensory and motor neural circuits that participate in vocal courtship in vertebrates appears ancient.

The CeA: A conserved node for vocal communication across vertebrates

The central nucleus of the amygdala (CeA) has been described as the “autonomic” amygdala because of its role in respiration and heart rate. Given the prominence of expiration for vocal expression across vertebrates, CeA involvement in vocal communication makes sense. In Pteronotus CeA stimulation evokes agonistic vocalizations (Ma and Kanwal, 2014). Neurons in the CeA also respond to social vocalizations, especially those associated with aggression. In primates, a baby's cry activates the parents' amygdala (Riem et al., 2021). Autonomic rhythms pace vocalizations of marmosets (Zhang and Ghazanfar, 2016). Output from the CeA (central-medial boundary) to the PAG transiently suppress vocalization in mouse pups (Tschida et al., 2019). In adult mice, activating neurons in the preoptic area of the hypothalamus (POA) that express estrogen receptor in adults inhibits inhibitory PAG neurons allowing USV expression as well as scaling the duration and persistence of bouts (Chen et al., 2021).

In Xenopus, a species that uncoupled breathing from calling many millions of years ago, the CeA matches acoustic stimuli to vocal expression. Lesions of the CeA in Xenopus result in socially inappropriate responses of males to song playbacks (Hall et al., 2013). Lesioned males respond to broadcasts of rapping and even an actual rapping female (Figure 3G) with prolonged vocal suppression (the response normally elicited by a vocally dominant male) rather than answer calling (the socially appropriate response; Figure 3). In Xenopus, the inhibitory output from CeA to a putative PAG homolog is conserved. However, unlike mammals, the Xenopus APOA does not project to PAG directly, instead innervating and receiving input from rRpd (Brahic and Kelley, 2003). rRpd, a serotonergic nucleus, is reciprocally connected to APOA, PB and NA and also projects to PAG. Thus, while in mice POA ER-expressing neurons have direct access to the PAG—inhibiting inhibitory neurons and promoting vocalization—in Xenopus, APOA neurons may influence vocalization via serotonergic innervation of PB and NA.

Neural circuit motifs that generate species-specific vocal rhythms; Genetic approaches in Xenopus

The persistence of species depends on successful reproduction: the production and survival of offspring that go on to reproduce and survive themselves (Darwin, 1872). Because hybrid offspring can be disadvantaged in development, survival and/or mating, identifying a potential mate of the same species is a primary imperative (Lemmon and Lemmon, 2010). As there are 29 extant species of Xenopus, in all of which males produce advertisement calls (Tobias et al., 2011), evolutionary conservation of neural circuitry across the genus can be explored. For example, the role of the PB in generating vocal patterns in Xenopus was evaluated recently by cross-species comparison between X. laevis and X. petersii, members of the L species subgroup that diverged ~8 mya (Figure 12A).

Figure 12

A specific class of rhythmically active neurons (Fast Trill Neurons or FTNs) in the PB was identified electrophysiologically in both species (Figure 12F). When synaptically- isolated by blocking sodium channels and stimulated by application of a glutamate agonist (NMDA), the membrane potential of FTNs oscillates at the species-specific rate and rhythm. This inter-species observation strengthens the identification of the PB as the vocal CPG (Rhodes et al., 2007) and implicates a specific class of rhythmically active neurons in divergence of vocal signaling across the L subgroup.

To drive the beginning phases of speciation that resulted, for example, in the different advertisement calls of the L subgroup, divergence in male courtship songs across populations must have co-evolved with female sensitivity to—or preference for -acoustic features of those songs (or vice versa). In Xenopus, each sound pulse includes two dominant frequencies (Figure 1) that differ across species. In the L subgroup, the DF2/DF1 ratio is 1.22, except for X. laevis in which the ratio is 1.14 (Kwong-Brown et al., 2019). Auditory evoked potentials reveal that females are preferentially acoustically sensitive to species-specific DFs at the species-specific ratio (Hall et al., 2013), suggesting that this spectral feature is salient for same species recognition by females in L subgroup species. Both temporal and spectral features of male and female calls determine vocal responses in X. laevis (Vignal and Kelley, 2007). As described above, in most vocal vertebrates the CNS controls both the temporal features of songs (via respiratory/vocal CPGs) and song spectral features (via hypoglossal control of the vocal tract). In Xenopus however the brain controls only the temporal features while spectral features are inherent to the larynx. This separation simplifies the genetic analysis of song divergence during speciation.

Within the L subgroup, advertisement calls are species-specific. What differences in gene expression between FTNs in the hindbrain and the vocal organ of different species contribute to species specific vocal signlling? Unusually, in Xenopus interspecific hybrids between extant species can produce fertile F1 and F2 offspring of both sexes (Evans, 2008). Genetic candidates for speciation-associated divergence in song temporal features are loci encoding or modulating FTN ion channels (Barkan et al., 2018). Candidates for species divergence in song spectral features are loci that contribute to the laryngeal cartilage components that support production of the two dominant frequencies (Kwong-Brown et al., 2019). To function in species divergence, female vocal perception and preference and diverging male songs must co-evolve. Recent research in two invertebrates, Hawaiian crickets and fruit flies, is revealing genetic architectures that support co-ordination of evolution of acoustic signaling in the sexes.

Neural circuit motifs that generate species-specific acoustic communication: Invertebrates

Crickets

Crickets use acoustic communication at a distance (far field) during courtship. Interestingly, female preferences co-evolve with acoustic features of male courtship songs in Hawaiian crickets (Laupala) (Xu and Shaw, 2021). Many small to moderate effect genetic loci are linked to species differences in male pulse rates. Fine mapping using high density SNP linkage maps has narrowed QTL confidence intervals and permitted annotation of genes within QTL peaks, highlighting candidate genes for linked production and preference. Comparison of species pairs from different islands revealed that, despite the many small to moderate effect sizes, multiple interspecific divergences of Laupala mating songs involve similar genetic architectures and share more QTL than were expected. Notably, pulse rate (male) and pulse preference (female) co-localize in the genome, raising the possibility that the linkage between male performance and female preference contributes to shared QTL.

QTL in Laupala are associated with genomic regions that—in the fruit fly Drosophila—are associated with neuronal development, rhythmic action and neuromodulators known to influence CPGs (Blankers et al., 2018). Combining the cellular approaches to production and perception of songs in field crickets (Grillus; Schöneich, 2020) with recent approaches to cricket genome editing and manipulation (Nakamura et al., 2022) should provide an experimental arena for testing candidate genes from QTL fine mapping.

Fruitflies

Fruitflies use acoustic communication during courtship (Murthy, 2010). Male songs are generated by wing vibration and neural circuitry supporting species-specific acoustic communication has been mapped in detail for D. melanogaster (reviewed in Sato et al., 2020). The Drosophila melanogaster subgroup includes 9 species with evolutionary divergence times (relative to melanogaster) ranging from ~5 (simulans) to ~13 mya (yakuba: Tamura et al., 2004). A recent study identified a homologous descending interneuron in D. melanogaster and D. yakuba (plP10) that is activated by similar social contexts, but drives different motor outputs (Ding et al., 2019). In both species, louder songs are used while chasing females (pulse for D. melanogaster and cluck for D. yakuba). D. melanogaster can produce clack songs, suggesting that circuitry for this song type is shared between species (Figure 13).

Figure 13

Comparing vertebrate and invertebrate sound communication

Because plP10 can drive both “clack-like” and “pulse” song; it can be considered a “multipurpose” interneuron, with access to at least two motor programs (Figure 13). Drosophila melanogaster and yakuba plP10 neurons are electrophysiologically similar (Ding et al., 2019). The species difference is the intensity with which a defined interneuron (plP10) drives downstream song production. Low levels of plP10 activity drive high amplitude songs and high levels of activity drive low amplitude songs (Figure 13). Inhibitory sound control circuitry must be interposed between plP10 and motor neurons involved in the wing vibrations that produce songs. Disinhibition is also a circuit motif in mice. USVs can be elicited by stimulating ER1+ neurons in the LPOA, relieving an inhibitory clamp in the PAG (Chen et al., 2021). Increasing the intensity of stimulation in mice scales USV intensity and duration, although in a direction opposite to D. melanogaster. In X. laevis, microstimulation of the CeA in the ex vivo brain drives fictive singing (Hall et al., 2013). The CeA is almost entirely GABA-ergic (Brox et al., 2003) thus, as in D. melanogaster and mice, disinhibition (gating) is a prominent circuit motif. In mouse pups, the output of the CeA to the PAG is also inhibitory and disinhibition gates production of their cries (Tschida et al., 2019). Disinhibition thus gates acoustic communication across phyla.

For Xenopus laevis and petersii, as in D. melanogaster and yakuba, the species difference is apparent in interneurons rather than, for example, sensory or motor neurons. Xenopus FTNs display species-specific electrophysiological properties: cell autonomous, species-specific membrane oscillation rhythms when stimulated with NMDA (Figure 12). PB neurons provide high fidelity, excitatory innervation directly to laryngeal motor neurons (Zornik and Kelley, 2008). In Xenopus, motor neurons modulate CPG activity (“feedback to the future”; Barkan and Zornik, 2019). Axon collaterals from laryngeal motor neurons synapse on inhibitory interneurons that control the precision (i.e., the interval between spikes that ride on each PB neuron oscillation) with which PB drives the vocal pattern. Whether this circuit motif for vocal precision occurs in other species remains to be determined. Retuning the PB CPG fast trill neurons across species might seem analogous to differences in the output of plP10. However, Xenopus FTNs provide monosynaptic excitatory input that drives vocal motor neurons while the Drosophila plP10 is an inhibitory synapse onto a downstream, interneuronal circuit motif.

Matching production and perception/preference

A still mysterious aspect of the divergence in vocal communication that accompanies speciation—in both vertebrates and invertebrates—is how perception or preference of the receiver for an acoustic signal—and the production of that signal—co-evolve (see Yeh, 2022, for a recent example in zebra finches). Matching production and perception during speciation is not confined to vocal signaling. Sensory stimuli associated with a non-reproductive benefit—such as a specific color that signals a desirable food—might be adopted to create or enhance attractive signaling: the “sensory trap” and “sensory exploitation” hypotheses [reviewed in Ryan (2021)]. However, neither hypothesis directly addresses how the production of communication signals and the acoustic recognition of those signals co-evolve as species diverge, at the level of underlying neural circuit functions.

In Xenopus, vocal production is supported by a dedicated CNS motor pathway and neuromuscular control of contractions of laryngeal muscles. Species-specificity reflects the intrinsic patterned activity of FTN neurons in the PB. Vocal perception is influenced both by detectability and recognition. Neurons in the acoustic ganglion of females support enhanced detectability of own-species sound pulse dyads (Hall et al., 2013). Neurons in the anuran auditory midbrain (ICo) are tuned to sound pulse rate, supporting recognition both for call type within a species (Figure 3) and potentially for recognizing conspecifics. Reproductive state gates vocal communication in both sexes via expression of receptors for gonadal hormones acting on the vocal communication system from the level of primary auditory neurons to vocal muscles.

Speciation in Xenopus follows two trajectories. One occurs in the L and M clades (tetraploid species with different call patterns: L; biphasic and burst; M: burst and click, Tobias et al., 2011) suggesting evolutionary divergence driven by sexual or natural selection, rather than genetic drift. Xenopus also speciate by hybridization, resulting in genome sizes ranging from tetraploid to dodecaploid (A species group). The A group is the most speciose in the genus and the female release call (ticking, Figure 3) is absent (Tobias et al., 2014) suggesting the possibility that loss of the female unreceptive call facilitated hybridization. As ticking can also be produced by the ex vivo brain of female X. laevis this preparation could be very useful in figuring out the basis—neural circuit and genetic architecture—for the loss of ticking in the A species group.

In other frogs, recognizing a heterospecific male is selected for in females because F1 hybrids are less fit. In Pseudacris, for example, the lifetime fitness of hybrid males, but not females, is reduced by 44% (Lemmon and Lemmon, 2010). The simplest hypothesis for co-evolution of vocal signaling in males, and preference in females, is overlapping gene networks in neurons that produce and respond to sounds. Lemmon and her colleagues (Ospina et al., 2021) compared divergence of gene networks in populations of P. ferriarum in sympatry or allopatry with P. negrita. They identified seven candidate synaptic transmission genes that have diverged between these populations, with more genes overall diverged between females than males. Neurons in the anuran inferior colliculus are selectively driven by interpulse interval (Edwards et al., 2007). Preliminary studies suggest differences in tuning of Pseudacris ICo neurons between sympatric and allopatric populations, providing a possible neural substrate for matching production and perception. Whether this difference is sex specific (reflecting greater genetic divergence in females) is not yet clear but if so, could be due to gonadal hormones.

Multimodal signaling, sex, speciation and language

Acoustic signaling is ancient and phylogenetically associated with extant species that are nocturnal or especially vocal at dawn or dusk (Figure 1, Chen and Wiens, 2020). Birds and humans are the major exceptions. Humans display the “McGurk effect” in which visual stimuli from the face of a speaker influence the acoustic identification of a syllable (McGurk and MacDonald, 1976). In starlings, conspecific visual stimuli also modify responses of neurons in the primary forebrain projection area—Field L—to familiar and unfamiliar songs: familiar songs suppress responses and unfamiliar songs enhance responses (George et al., 2011). Multimodal sensory integration is thus also likely to have shaped the evolution of vocal communication in diurnal species such as primates and birds.

Because speciation reflects both sexual selection (success in attracting mates) and natural selection (survival), sex and speciation are linked at many levels. In vertebrates, sexual differentiation is governed by pituitary and gonadal hormones. Patterns of AR and ER expression—from sensory receptors through to neural circuits for muscle effectors—are targets for evolutionary selection. Broder et al. (2021) argue that “it may be easier than assumed to evolve new sexual signals because sexual signals may be arbitrary, sexual conflict is common and receivers are capable of perceiving much more of the world than just existing sexual signals.” Part of this argument is based on the idea that an arbitrary sensory stimulus associated with a positive experience (for example a red food source) can be co-opted to shape behaviors, such as approach (sensory exploitation). However, such co-opted sensory stimuli are not necessarily useful finding or selecting reproductive partner of the same species for reproduction. This is particularly for true females whose gametes are finite and provide resources for the embryo, unlike those of males.

Robert and the late Dorothy Cheyney argued (Seyfarth and Cheney, 2014)—using multi-year field data on vocal communication in baboons—that the origin of human language might lie in social cognition. Baboons have a matrilineal dominance hierarchy; each female has a distinctive “grunt” vocalization. Seyfarth and Cheyney recorded vocalizations during social interactions between all female pairs in Year 1. In Year 2, they observed female A/female B interactions and then played back A's call to B to determine whether B's response to the playback reflected what had happened (grooming, for example, or biting) during that specific interaction. Did B stay put (grooming: positive interaction) or move away (biting: negative interaction)? They reported that B's response was triggered specifically by A's grunt and matched the social valence of their recent interaction. If indeed the substrate for language evolution, we have much more to learn about the neurobiology of vocal communication across species.

While humans do not actually bite each other during arguments (at least as adults), we do use biting language. We also devote considerable attention to decoding how people feel about us from cues in voice to construct a socially appropriate response. Areas of the human brain involved in language production and perception must (at the very least) access other areas that identify social context-driven voice cues regulated by the endocrine and neuromodulatory systems described in this review. Advances in fMRI now allow imaging of entire brains in response to conspecific and heterospecific vocal sounds in other animals (Van Ruijssevelt et al., 2013; Gábor et al., 2020). Imaging whole brain activity during vocal communication across species over the next few years will drive additional discoveries in this scientific arena.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

DBK wrote this manuscript and adapted from published work or created the figures.

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1
AgnarssonI.Zambrana-TorrelioC. M.Flores-SaldanaN. P.May-ColladoL. J. (2011). A time-calibrated species-level phylogeny of bats (Chiroptera, Mammalia). PLoS Curr.3:RRN1212. 10.1371/currents.RRN1212
2
Albersheim-CarterJ.BlubaumA.BallaghI. H.MissaghiK.SiudaE. R.McMurrayG.et al. (2016). Testing the evolutionary conservation of vocal motoneurons in vertebrates. Respirat. Physiol. Neurobiol.224, 2–10. 10.1016/j.resp.2015.06.010
3
AlsiusA.Par,éM.MunhallK. G. (2018). Forty years after hearing lips and seeing voices: the McGurk effect revisited. Multisensory Re.31, 111–144. 10.1163/22134808-00002565
4
AmrheinV.KornerP.NaguibM. (2002). Nocturnal and diurnal singing activity in the nightingale: correlations with mating status and breeding cycle. Animal Behav.64, 939–944. 10.1006/anbe.2002.1974
- CrossRef
- Google Scholar
5
AndersonT. M.GarciaA. J.BaertschN. A.PollakJ.BloomJ. C.WeiA. D.et al. (2016). A novel excitatory network for the control of breathing. Nature536, 76–80. 10.1038/nature18944
6
ArriagaG.JarvisE. D. (2013). Mouse vocal communication system: are ultrasounds learned or innate?Brain Lang.124, 96–116. 10.1016/j.bandl.2012.10.002
7
BakerC. A.ClemensJ.MurthyM. (2019). Acoustic pattern recognition and courtship songs: insights from insects. Annual Rev. Neurosci.42, 129–147. 10.1146/annurev-neuro-080317-061839
8
BallaghI. H. (2014). Sex Differences in the Structure, Function and Regulation of Vocal Circuits in Xenopus.Columbia University.
- Google Scholar
9
BanerjeeA.PhelpsS. M.LongM. A. (2019). Singing mice. Curr. Biol.29, R190–R191. 10.1016/j.cub.2018.11.048
10
BarfieldR. J.GeyerL. A. (1972). Sexual behavior: ultrasonic postejaculatory song of the male rat. Science176, 1349–1350. 10.1126/science.176.4041.1349
11
BarkanC. L.KelleyD. B.ZornikE. (2018). Premotor neuron divergence reflects vocal evolution. J. Neurosci.38, 5325–5337. 10.1523/JNEUROSCI.0089-18.2018
12
BarkanC. L.ZornikE. (2019). Feedback to the future: motor neuron contributions to central pattern generator function. J. Exp. Biol.222, 193318. 10.1242/jeb.193318
13
BehrO.von HelversenO. (2004). Bat serenades—complex courtship songs of the sac-winged bat (Saccopteryx bilineata). Behav. Ecol. Sociobiol.56, 106–115. 10.1007/s00265-004-0768-7
- CrossRef
- Google Scholar
14
BerwickR. C.BeckersG. J.OkanoyaK.BolhuisJ. J. (2012). A bird's eye view of human language evolution. Front. Evolut. Neurosci.4:5. 10.3389/fnevo.2012.00005
15
BlankersT.OhK. P.BombarelyA.ShawK. L. (2018). The genomic architecture of a rapid island radiation: recombination rate variation, chromosome structure, and genome assembly of the Hawaiian cricket Laupala. Genetics209, 1329–1344. 10.1534/genetics.118.300894
16
BoehmA. C.FriedrichA. B.HuntS.BandowP.SijuK. P.De BackerJ. F.et al. (2022). A dopamine-gated learning circuit underpins reproductive state-dependent odor preference in Drosophila females. Elife11:e7764310.7554/eLife.77643.sa2
17
BrahicC. J.KelleyD. B. (2003). Vocal circuitry in Xenopus laevis: telencephalon to laryngeal motor neurons. J. Compar. Neurol.464, 115–130. 10.1002/cne.10772
18
BroderE. D.EliasD. O.RodríguezR. L.RosenthalG. G.SeymoureB. M.TinghitellaR. M. (2021). Evolutionary novelty in communication between the sexes. Biol. Lett.17:20200733. 10.1098/rsbl.2020.0733
19
BroxA.PuellesL.FerreiroB.MedinaL. (2003). Expression of the genes GAD67 and Distal-less-4 in the forebrain of Xenopus laevis confirms a common pattern in tetrapods. J. Compar. Neurol.461, 370–393. 10.1002/cne.10688
20
CalabreseA.WoolleyS. M. (2015). Coding principles of the canonical cortical microcircuit in the avian brain. Proc. Natl. Acad. Sci. U.S.A.112, 3517–3522. 10.1073/pnas.1408545112
21
CannatellaD. (2015). Xenopus in space and time: fossils, node calibrations, tip-dating, and paleobiogeography. Cytogenet. Genome Res.145, 283–301. 10.1159/000438910
22
CarasM. L.O'BrienM.BrenowitzE. A.RubelE. W. (2012). Estradiol selectively enhances auditory function in avian forebrain neurons. J. Neurosci.32, 17597–17611. 10.1523/JNEUROSCI.3938-12.2012
23
CastellucciG. A.CalbickD.McCormickD. (2018). The temporal organization of mouse ultrasonic vocalizations. PLoS ONE13:e0199929. 10.1371/journal.pone.0199929
24
CharitidiK.CanlonB. (2010). Estrogen receptors in the central auditory system of male and female mice. Neuroscience165, 923–933. 10.1016/j.neuroscience.2009.11.020
25
ChenJ.MarkowitzJ. E.LilascharoenV.TaylorS.SheurpukdiP.KellerJ. A.et al. (2021). Flexible scaling and persistence of social vocal communication. Nature593, 108–113. 10.1038/s41586-021-03403-8
26
ChenZ.WiensJ. J. (2020). The origins of acoustic communication in vertebrates. Nat. Commun.11:369. 10.1038/s41467-020-14356-3
27
Christensen-DalsgaardJ.ElepfandtA. (1995). Biophysics of underwater hearing in the clawed frog, Xenopus laevis. J. Compar. Physiol.176, 317–324. 10.1007/BF00219057
28
ChurakovG.SadasivuniM. K.RosenbloomK. R.HuchonD.BrosiusJ.SchmitzJ. (2010). Rodent evolution: back to the root. Mol. Biol. Evol.27, 1315–1326. 10.1093/molbev/msq019
29
CrednerS.BurdaH.LudescherF. (1997). Acoustic communication underground: vocalization characteristics in subterranean social mole-rats (Cryptomys sp., Bathyergidae). J. Compar. Physiol. A180, 245–255. 10.1007/s003590050045
30
DarwinC. (1872). On the Origin of Species by Means of Natural Selection, 6th Edn.London: John Murray.
- Google Scholar
31
DickT. E.OkuY. O. S. H. I. T. A. K. A.RomaniukJ. R.CherniackN. S. (1993). Interaction between central pattern generators for breathing and swallowing in the cat. J. Physiol.465, 715–730. 10.1113/jphysiol.1993.sp019702
32
DingY.LillvisJ. L.CandeJ.BermanG. J.ArthurB. J.LongX.et al. (2019). Neural evolution of context-dependent fly song. Curr. Biol.29, 1089–109910.1016/j.cub.2019.02.019
33
DoroninaL.ChurakovG.KuritzinA.ShiJ.BaertschR.ClawsonH.et al. (2017). Speciation network in Laurasiatheria: retrophylogenomic signals. Genome Res.27, 997–1003. 10.1101/gr.210948.116
34
DoupeA. J.KuhlP. K. (1999). Birdsong and human speech: common themes and mechanisms. Annual Rev. Neurosci.22, 567–631. 10.1146/annurev.neuro.22.1.567
35
DutschmannM.HerbertH. (2006). The Kölliker-Fuse nucleus gates the postinspiratory phase of the respiratory cycle to control inspiratory off-switch and upper airway resistance in rat. Eur. J. Neurosci.24, 1071–1084. 10.1111/j.1460-9568.2006.04981.x
36
EdwardsC. J.LearyC. J.RoseG. J. (2007). Counting on inhibition and rate-dependent excitation in the auditory system. J. Neurosci.27, 13384–13392. 10.1523/JNEUROSCI.2816-07.2007
37
EickG. N.ThorntonJ. W. (2011). Evolution of steroid receptors from an estrogen-sensitive ancestral receptor. Mol. Cell. Endocrinol.334, 31–38. 10.1016/j.mce.2010.09.003
38
ElemansC. P. (2014). The singer and the song: the neuromechanics of avian sound production. Curr. Opin. Neurobiol.28, 172–178. 10.1016/j.conb.2014.07.022
39
ElieJ. E.MarietteM. M.SoulaH. A.GriffithS. C.MathevonN.VignalC. (2010). Vocal communication at the nest between mates in wild zebra finches: a private vocal duet?Animal Behav.80, 597–605. 10.1016/j.anbehav.2010.06.003
- CrossRef
- Google Scholar
40
ElieJ. E.TheunissenF. E. (2018). Zebra finches identify individuals using vocal signatures unique to each call type. Nat. Commun.9:4026. 10.1038/s41467-018-06394-9
41
ElliottT. M.Christensen-DalsgaardJ.KelleyD. B. (2007). Tone and call responses of units in the auditory nerve and dorsal medullary nucleus of Xenopus laevis. J. Compar. Physiol. A193, 1243–1257. 10.1007/s00359-007-0285-z
42
ElliottT. M.Christensen-DalsgaardJ.KelleyD. B. (2011). Temporally selective processing of communication signals by auditory midbrain neurons. J. Neurophysiol.105, 1620–163. 10.1152/jn.00261.2009
43
EvansB. J. (2008). Genome evolution and speciation genetics of clawed frogs (Xenopus and Silurana). Front. Biosci. Landmark13, 4687–4706. 10.2741/3033
44
EvansB. J.CarterT. F.GreenbaumE.GvoŽdíkV.KelleyD. B.McLaughlinP. J.et al. (2015). Genetics, morphology, advertisement calls, and historical records distinguish six new polyploid species of African clawed frog (Xenopus, Pipidae) from West and Central Africa. PLoS ONE1:e0142823. 10.1371/journal.pone.0142823
45
FengY. J.BlackburnD. C.LiangD.HillisD. M.WakeD. B.CannatellaD. C.et al. (2017). Phylogenomics reveals rapid, simultaneous diversification of three major clades of Gondwanan frogs at the Cretaceous–Paleogene boundary. Proc. Natl. Acad. Sci. U.S.A.114, E5864–E5870. 10.1073/pnas.1704632114
46
Fernández-VargasM.RiedeT.PaschB. (2021). Mechanisms and constraints underlying acoustic variation in rodents. Anim. Behav. 184, 135–137. 10.1016/j.anbehav.2021.07.011
- CrossRef
- Google Scholar
47
FitchW.SuthersR. A. (2016). Vertebrate vocal production: an introductory overview, in Vertebrate Sound Production and Acoustic Communication. Springer Handbook of Auditory Research, Vol. 53, eds SuthersR.FitchW.FayR.PopperA. (Cham: Springer). 10.1007/978-3-319-27721-9_1
- CrossRef
- Google Scholar
48
Frankl-VilchesC.GahrM. (2018). Androgen and estrogen sensitivity of bird song: a comparative view on gene regulatory levels. J. Comp. Physiol. A204, 113126. 10.1007/s00359-017-1236-y
49
FujiiT. G.IkebuchiM.OkanoyaK. (2016). Auditory responses to vocal sounds in the songbird nucleus taeniae of the amygdala and the adjacent arcopallium. Brain Behav. Evol.87, 275–289. 10.1159/000447233
50
GáborA.GácsiM.SzabóD.MiklósiÁ.KubinyiE.AndicsA. (2020). Multilevel fMRI adaptation for spoken word processing in the awake dog brain. Sci. Rep.10:11968. 10.1038/s41598-020-68821-6
51
GadagkarV.PuzereyP. A.ChenR.Baird-DanielE.FarhangA. R.GoldbergJ. H. (2016). Dopamine neurons encode performance error in singing birds. Science354, 1278–1282. 10.1126/science.aah683
52
GeorgeI.RichardJ. P.CousillasH.HausbergerM. (2011). No need to talk, I know you: familiarity influences early multisensory integration in a songbird's brain. Front. Behav. Neurosci.5:193. 10.3389/fnbeh.2010.00193
53
GhazanfarA. A.RendallD. (2008). Evolution of human vocal production. Curr. Biol.18, R457–R460. 10.1016/j.cub.2008.03.030
54
GrillnerS.RobertsonB. (2016). The basal ganglia over 500 million years. Curr. Biol.26, R1088–R1100. 10.1016/j.cub.2016.06.041
55
HåkanssonJ.JiangW.XueQ.ZhengX.DingM.AgarwalA. A.et al. (2022). Aerodynamics and motor control of ultrasonic vocalizations for social communication in mice and rats. BMC Biolo.20:3. 10.1186/s12915-021-01185-z
56
HageS. R. (2010). Localization of the central pattern generator for vocalization, in Handbook of Behavioral Neuroscience, Vol. 19 ed BrudzynskiS. M. (Elsevier), 329–337. 10.1016/B978-0-12-374593-4.00031-0
- CrossRef
- Google Scholar
57
HallI. C.BallaghI. H.KelleyD. B. (2013). The Xenopus amygdala mediates socially appropriate vocal communication signals. J. Neurosci.33, 14534–1454810.1523/JNEUROSCI.1190-13.2013
58
HartleyR. S.SuthersR. A. (1989). Airflow and pressure during canary song: direct evidence for mini-breaths. J. Comparat. Physiol. A165, 15–26. 10.1007/BF00613795
- CrossRef
- Google Scholar
59
HechavarríaJ. C.Jerome BeetzM.García-RosalesF.KösslM. (2020). Bats distress vocalizations carry fast amplitude modulations that could represent an acoustic correlate of roughness. Sci. Rep.10:7332. 10.1038/s41598-020-64323-7
60
HommaT.SohelM. S. H.OnouchiS.SaitoS. (2022). Morphometric study of the vestibuloauditory organ of the African clawed frog, Xenopus laevis. Anatomia Histol. Embryol.51, 514–523. 10.1111/ahe.12821
61
JarvisE. D. (2019). Evolution of vocal learning and spoken language. Science366, 50–54. 10.1126/science.aax0287
62
Jorgewich-CohenG.TownsendS. W.PadoveseL. R.KleinN.PraschagP.FerraraC. R.et al. (2022). Common evolutionary origin of acoustic communication in choanate vertebrates. Nat. Commun.13, 17. 10.1038/s41467-022-33741-8
63
KanwalJ. S. (2009). Audiovocal Communication in Bats. Encyclopedia of Neuroscience ed SquireL. R. (Oxford: Academic Press), 681–690. 10.1016/B978-008045046-9.01839-8
- CrossRef
- Google Scholar
64
KanwalJ. S. (2021). Sonic and ultrasonic communication in bats: acoustics, perception, and production, in Neuroendocrine Regulation of Animal Vocalization eds RosenfeldC. S.HoffmannF. (Academic Press), 239–265. 10.1016/B978-0-12-815160-0.00011-6
- CrossRef
- Google Scholar
65
KanwalJ. S.ZhangZ.FengJ. (2013). Decision-making and socioemotional vocal behavior in bats, in Bat Evolution, Ecology, and Conservation eds AdamaR. A.PetersenS. C. (New York, NY: Springer), 243–270.
- Google Scholar
66
KelleyD. B. (1980). Auditory and vocal nuclei in the frog brain concentrate sex hormones. Science207, 553–555. 10.1126/science.7352269
67
KelleyD. B. (1981). Locations of androgen-concentrating cells in the brain of Xenopus laevis: autoradiography with 3H-dihydrotestosterone. J. Comparative Neurol.199, 221–231. 10.1002/cne.901990206
68
KelleyD. B. (1982). Female sex behaviors in the South African clawed frog, Xenopus laevis: gonadotropin-releasing, gonadotropic, and steroid hormones. Hormones Behav.16, 158–174. 10.1016/0018-506X(82)90016-2
69
KelleyD. B.BallaghI. H.BarkanC. L.BendeskyA.ElliottT. M.EvansB. J.et al. (2020). Generation, coordination, and evolution of neural circuits for vocal communication. J. Neurosci.40, 22–36. 10.1523/JNEUROSCI.0736-19.2019
70
KelleyD. B.NottebohmF. (1979). Projections of a telencephalic auditory nucleus–field L–in the canary. J. Compar. Neurol.183, 455–469. 10.1002/cne.901830302
- CrossRef
- Google Scholar
71
KelleyD. B.PfaffD. W. (1976). Hormone effects on male sex behavior in adult South African clawed frogs, Xenopus laevis. Hormones Behav.7, 159–182. 10.1016/0018-506X(76)90045-3
72
KellyT.RebyD.LevréroF.KeenanS.GustafssonE.KoutseffA.et al. (2017). Adult human perception of distress in the cries of bonobo, chimpanzee, and human infants. Biol. J. Linnean Soc.120, 919–93010.1093/biolinnean/blw016
- CrossRef
- Google Scholar
73
KingsleyE. P.EliasonC. M.RiedeT.LiZ.HiscockT. W.FarnsworthM.et al. (2018). Identity and novelty in the avian syrinx. Proc. Natl. Acad. Sci. U.S.A.115, 10209–10217. 10.1073/pnas.1804586115
74
Kwong-BrownU.TobiasM. L.EliasD. O.HallI. C.ElemansC. P.KelleyD. B. (2019). The return to water in ancestral Xenopus was accompanied by a novel mechanism for producing and shaping vocal signals. ELife8:39946. 10.7554/eLife.39946
75
LeiningerE. C.KelleyD. B. (2013). Distinct neural and neuromuscular strategies underlie independent evolution of simplified advertisement calls. Proc. R. Soc. B280:20122639. 10.1098/rspb.2012.2639
76
LemmonE. M.LemmonA. R. (2010). Reinforcement in chorus frogs: lifetime fitness estimates including intrinsic natural selection and sexual selection against hybrids. Evolution64, 1748–1761. 10.1111/j.1558-5646.2010.00955.x
77
LukschH.WalkowiakW.MuñozA.HansJ. (1996). The use of in vitro preparations of the isolated amphibian central nervous system in neuroanatomy and electrophysiology. J. Neurosci. Methods70, 91–102. 10.1016/S0165-0270(96)00107-0
78
LutzL. B.ColeL. M.GuptaM. K.KwistK. W.AuchusR. J.HammesS. R. (2001). Evidence that androgens are the primary steroids produced by Xenopus laevis ovaries and may signal through the classical androgen receptor to promote oocyte maturation. Proc. Natl. Acad. Sci. U.S.A.98, 13728–13733. 10.1073/pnas.241471598
79
MaJ.KanwalJ. S. (2014). Stimulation of the basal and central amygdala in the mustached bat triggers echolocation and agonistic vocalizations within multimodal output. Front. Physiol.5:55. 10.3389/fphys.2014.00055
80
ManeyD. L.GoodeC. T.LangeH. S.SanfordS. E.SolomonB. L. (2008). Estradiol modulates neural responses to song in a seasonal songbird. J. Comp. Neurol.511, 173–186. 10.1002/cne.21830
81
MarshallL. G. (1979). A model for paleobiogeography of South American cricetine rodents. Paleobiology5, 126–132. 10.1017/S0094837300006412
- CrossRef
- Google Scholar
82
MasonM. J.WangM.NarinsP. M. (2009). Structure and function of the middle ear apparatus of the aquatic frog, Xenopus laevis. Proc. Institute Acoustics Institute Acoustics31:13.
- Pubmed Abstract
- Google Scholar
83
MatzingerT.FitchW. T. (2021). Voice modulatory cues to structure across languages and species. Philosophical Transact. R. Soc. B376:20200393. 10.1098/rstb.2020.0393
84
McGregorJ. N.GrasslerA. L.JaffeP. I.JacobA. L.BrainardM. S.SoberS. J. (2022). Shared mechanisms of auditory and non-auditory vocal learning in the songbird brain. Elife11, e75691. 10.7554/eLife.75691
85
McGurkH.MacDonaldJ. (1976). Hearing lips and seeing voices. Nature264, 746–748. 10.1038/264746a0
86
MilsomW. K.KinkeadR.HedrickM. S.GilmourK.PerryS.GargaglioniL.et al. (2022). Evolution of vertebrate respiratory central rhythm generators. Respiratory Physiol. Neurobiol.295:10378110.1016/j.resp.2021.103781
87
MooneyR. (2020). The neurobiology of innate and learned vocalizations in rodents and songbirds. Curr. Opin. Neurobiol.64, 24–3110.1016/j.conb.2020.01.004
88
MooneyR. (2022). Birdsong. Curr. Biol. 32, R1090–R1094. 10.1016/j.cub.2022.07.006
89
MurthyM. (2010). Unraveling the auditory system of Drosophila. Curr. Opin. Neurobiol.20, 281–287. 10.1016/j.conb.2010.02.016
90
NakamuraT.YllaG.ExtavourC. G. (2022). Genomics and genome editing techniques of crickets, an emerging model insect for biology and food science. Curr. Opin. Insect Sci.50:100881. 10.1016/j.cois.2022.100881
91
NaumannR. T.KanwalJ. S. (2011). Basolateral amygdala responds robustly to social calls: spiking characteristics of single unit activity. J. Neurophysiol.105, 2389–2404. 10.1152/jn.00580.2010
92
NottebohmF.PatonJ. A.KelleyD. B. (1982). Connections of vocal control nuclei in the canary telencephalon. J. Comparative Neurol.207, 344–357. 10.1002/cne.902070406
93
Okobi JrD. E.BanerjeeA.MathesonA. M.PhelpsS. M.LongM. A. (2019). Motor cortical control of vocal interaction in neotropical singing mice. Science363, 983–988. 10.1126/science.aau9480
94
OspinaO. E.LemmonA. R.DyeM.ZdyrskiC.HollandS.StriblingD.et al. (2021). Neurogenomic divergence during speciation by reinforcement of mating behaviors in chorus frogs (Pseudacris). BMC Genomics22, 1–23. 10.1186/s12864-021-07995-3
95
PatonJ. A.KelleyD. B.SejnowskiT. J.YodlowskiM. L. (1982). Mapping the auditory central nervous system of Xenopus laevis with 2-deoxyglucose autoradiography. Brain Res.249, 15–22. 10.1016/0006-8993(82)90164-0
96
PortforsC. V.PerkelD. J. (2014). The role of ultrasonic vocalizations in mouse communication. Curr. Opin. Neurobiol.28, 115–120. 10.1016/j.conb.2014.07.002
97
RaoP. P.KanwalJ. S. (2004). Oxytocin and vasopressin immunoreactivity within the forebrain and limbic-related areas in the mustached bat, Pteronotus parnellii. Brain Behav. Evolution63, 151–168. 10.1159/000076241
98
RhodesH. J.HeatherJ. Y.YamaguchiA. (2007). Xenopus vocalizations are controlled by a sexually differentiated hindbrain central pattern generator. J. Neurosci.27, 1485–1497. 10.1523/JNEUROSCI.4720-06.2007
99
RiebelK.OdomK. J.LangmoreN. E.HallM. L. (2019). New insights from female bird song: towards an integrated approach to studying male and female communication roles. Biol. Lett.15:20190059. 10.1098/rsbl.2019.0059
100
RiemM. M.LotzA. M.HorstmanL. I.CimaM.VerheesM. W.Alyousefi-van DijkK.et al. (2021). A soft baby carrier intervention enhances amygdala responses to infant crying in fathers: a randomized controlled trial. Psychoneuroendocrinology132:105380. 10.1016/j.psyneuen.2021.105380
101
RomanenkoS. A.PerelmanP. L.TrifonovV. A.GraphodatskyA. S. (2012). Chromosomal evolution in Rodentia. Heredity108, 4–16. 10.1038/hdy.2011.110
102
RoseE. M.PriorN. H.BallG. F. (2022). The singing question: re-conceptualizing birdsong. Biol. Rev.97, 326–342. 10.1111/brv.12800
103
RyanM. J. (2021). Darwin, sexual selection, and the brain. Proc. Natl. Acad. Sci. U.S.A.118:e2008194118. 10.1073/pnas.2008194118
104
SallesA.BohnK. M.MossC. F. (2019). Auditory communication processing in bats: what we know and where to go. Behav. Neurosci.133:305. 10.1037/bne0000308
105
SatoK.TanakaR.IshikawaY.YamamotoD. (2020). Behavioral evolution of Drosophila: unraveling the circuit basis. Genes11:157. 10.3390/genes11020157
106
SchmidtR. S. (1976). Neural correlates of frog calling. J. Comparative Physiol.108, 99–113. 10.1007/BF02169043
- CrossRef
- Google Scholar
107
SchöneichS. (2020). Neuroethology of acoustic communication in field crickets-from signal generation to song recognition in an insect brain. Progr. Neurobiol.194:101882. 10.1016/j.pneurobio.2020.101882
108
SchwarkR. W.FuxjagerM. J.SchmidtM. F. (2022). Proposing a neural framework for the evolution of elaborate courtship displays. eLife11:e74860. 10.7554/eLife.74860
109
SeyfarthR. M.CheneyD. L. (2014). The evolution of language from social cognition. Curr. Opin. Neurobiol.28, 5–9. 10.1016/j.conb.2014.04.003
110
SimonyanK.HorwitzB. (2011). Laryngeal motor cortex and control of speech in humans. Neuroscientist17, 197–208. 10.1177/1073858410386727
111
StriedterG. F.NorthcuttR. G. (2019). Brains Through Time: A Natural History of Vertebrates. Oxford: Oxford University Press. 10.1093/oso/9780195125689.001.0001
- CrossRef
- Google Scholar
112
SuryanarayanaS. M.RobertsonB.GrillnerS. (2022). The neural bases of vertebrate motor behaviour through the lens of evolution. Philosoph. Transact. R. Soc. B377:2020052110.1098/rstb.2020.0521
113
SuthersR.GollerF.PytteC. (1999). The neuromuscular control of birdsong. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci.354, 927–939.
- Google Scholar
114
TamuraK.SubramanianS.KumarS. (2004). Temporal patterns of fruit fly (Drosophila) evolution Vrevealed by mutation clocks. Mol. Biol. Evolution21, 36–44. 10.1093/molbev/msg236
115
TanakaM.SunF.LiY.MooneyR. (2018). A mesocortical dopamine circuit enables the cultural transmission of vocal behaviour. Nature563, 117–120. 10.1038/s41586-018-0636-7
116
ThorpeW. H. (1954). The process of song-learning in the chaffinch as studied by means of the sound spectrograph. Nature173, 465–469. 10.1038/173465a0
- CrossRef
- Google Scholar
117
TobiasM.EvansB. J.KelleyD. B. (2011). Evolution of advertisement calls in African clawed frogs. Behaviour148, 519–549. 10.1163/000579511X569435
118
TobiasM. L.BarnardC.O'HaganR.HorngS. H.RandM.KelleyD. B. (2004). Vocal communication between male Xenopus laevis. Animal Behav.67, 353–365. 10.1016/j.anbehav.2003.03.016
119
TobiasM. L.CorkeA.KorshJ.YinD.KelleyD. B. (2010). Vocal competition in male Xenopus laevis frogs. Behavioral ecology and sociobiology, 64, 1791–1803.
- Pubmed Abstract
- Google Scholar
120
TobiasM. L.KelleyD. B. (1987). Vocalizations by a sexually dimorphic isolated larynx: peripheral constraints on behavioral expression. J. Neurosci.7, 3191–3197. 10.1523/JNEUROSCI.07-10-03191.1987
121
TobiasM. L.KorshJ.KelleyD. B. (2014). Evolution of male and female release calls in African clawed frogs. Behaviour151, 1313–1334. 10.1163/1568539X-00003186
- CrossRef
- Google Scholar
122
TobiasM. L.ViswanathanS. S.KelleyD. B. (1998). Rapping, a female receptive call, initiates male–female duets in the South African clawed frog. Proc. Natl. Acad. Sci. U.S.A.95, 1870–1875. 10.1073/pnas.95.4.1870
123
TschidaK.MichaelV.TakatohJ.HanB. X.ZhaoS.SakuraiK.et al. (2019). A specialized neural circuit gates social vocalizations in the mouse. Neuron103, 459–44710.1016/j.neuron.2019.05.025
124
Van RuijsseveltL.Van der KantA.De GroofG.Van der LindenA. (2013). Current state-of-the-art of auditory functional MRI (fMRI) on zebra finches: technique and scientific achievements. J. Physiol. Paris107, 156–16910.1016/j.jphysparis.2012.08.005
125
VanderhoffE. N.Bernal HoverudN. (2022). Perspectives on antiphonal calling, duetting and counter-singing in non-primate mammals: an overview with notes on the coordinated vocalizations of bamboo rats (Dactylomys s, Rodentia: Echimyidae). Front. Ecol. Evolution10:906546. 10.3389/fevo.2022.906546
- CrossRef
- Google Scholar
126
VidalC. M.LaneC. S.AsratA.BarfodD. N.MarkD. F.TomlinsonE. L.et al. (2022). Age of the oldest known Homo sapiens from eastern Africa. Nature601, 579–583. 10.1038/s41586-021-04275-8
127
VignalC.KelleyD. (2007). Significance of temporal and spectral acoustic cues for sexual recognition in Xenopus laevis. Proc. R. Soc.274, 479–488. 10.1098/rspb.2006.3744
128
WallingfordJ. B. (2022). A quick history of xenopus: “The Humble Batrachian”, in Xenopus eds FeinsodA.MoodyS. (Boca Raton: CRC Press), 3–12. 10.1201/9781003050230-2
- CrossRef
- Google Scholar
129
WashingtonS. D.KanwalJ. S. (2008). DSCF neurons within the primary auditory cortex of the mustached bat process frequency modulations present within social calls. J. Neurophysiol.100, 3285–3304. 10.1152/jn.90442.2008
130
WeiX. P.CollieM.DempseyB.FortinG.YackleK. (2022). A novel reticular node in the brainstem synchronizes neonatal mouse crying with breathing. Neuron110, 644–65710.1016/j.neuron.2021.12.014
131
WetzelD. M.KelleyD. B. (1983). Androgen and gonadotropin effects on male mate calls in South African clawed frogs, Xenopus laevis. Hormones Behav.17, 388–40410.1016/0018-506X(83)90048-X
132
WildJ. M. (1997). Neural pathways for the control of birdsong production. J. Neurobiol.33, pp.653–670.
- Pubmed Abstract
- Google Scholar
133
WildJ. M.GollerF.SuthersR. A. (1998). Inspiratory muscle activity during bird song. J. Neurobiol.36, 441–453. 10.1002/(SICI)1097-4695(19980905)36:3<441::AID-NEU11>3.0.CO;2-E
134
WillseyH. R.ExnerC. R.XuY.EverittA.SunN.WangB.et al. (2021). Parallel in vivo analysis of large-effect autism genes implicates cortical neurogenesis and estrogen in risk and resilience. Neuron109, 788–804. 10.1016/j.neuron.2021.01.002
135
XuM.ShawK. L. (2021). Extensive linkage and genetic coupling of song and preference loci underlying rapid speciation in Laupala crickets. J. Heredity112, 204–213. 10.1093/jhered/esab001
136
YagerD. (1982). A novel mechanism for underwater sound production in Xenopus borealis. Am. Zool.22:887.
- Google Scholar
137
YamaguchiA.KelleyD. B. (2000). Generating sexually differentiated vocal patterns: laryngeal nerve and EMG recordings from vocalizing male and female African clawed frogs (Xenopus laevis). J. Neurosci.20, 1559–1567. 10.1523/JNEUROSCI.20-04-01559.2000
138
YangE. J.NasipakB. T.KelleyD. B. (2007). Direct action of gonadotropin in brain integrates behavioral and reproductive functions. Proc. Natl. Acad. Sci. U.S.A.104, 2477–2482. 10.1073/pnas.0608391104
139
YehY. T. (2022). Auditory tuning in vocal learning songbirds (Doctoral dissertation). Columbia University.
- Google Scholar
140
YIpP. K.SchmitzbergerM.Al-HasanM.GeorgeJ.TripolitiE.Michael-TitusA. T.et al. (2020). Serotonin expression in the song circuitry of adult male zebra finches. Neuroscience444, 170–182. 10.1016/j.neuroscience.2020.06.018
141
ZhangY.AsifS.GhazanfarA. (2022). Evolving alternative neural pathways for vocal dexterity. Proc. Natl. Acad. Sci. U.S.A.119:e2205899119. 10.1073/pnas.2205899119
142
ZhangY. S.GhazanfarA. A. (2016). Perinatally influenced autonomic system fluctuations drive infant vocal sequences. Curr. Biol.26, 1249–1260. 10.1016/j.cub.2016.03.023
143
ZhengD. J.Okobi JrD. E.ShuR.AgrawalR.SmithS. K.LongM. A.et al. (2022). Mapping the vocal circuitry of Alston's singing mouse with pseudorabies virus. J. Comparative Neurol.530, 2075–2099. 10.1002/cne.25321
144
ZhengD. J.SinghA.PhelpsS. M. (2021). Conservation and dimorphism in androgen receptor distribution in Alston's singing mouse (Scotinomys teguina). J. Compar. Neurol.529, 2539–255710.1002/cne.25108
145
ZornikE.KelleyD. B. (2008). Regulation of respiratory and vocal motor pools in the isolated brain of Xenopus laevis. J. Neurosci.28, 612–621. 10.1523/JNEUROSCI.4754-07.2008
146
ZornikE.KelleyD. B. (2011). A neuroendocrine basis for the hierarchical control of frog courtship vocalizations. Front. Neuroendocrinol.32, 353–366. 10.1016/j.yfrne.2010.12.006

Summary

Keywords

vocal, auditory, neural, circuit, communication, evolution, sex, hormones

Citation

Kelley DB (2022) Convergent and divergent neural circuit architectures that support acoustic communication. Front. Neural Circuits 16:976789. doi: 10.3389/fncir.2022.976789

Received

23 June 2022

Accepted

19 October 2022

Published

17 November 2022

Volume

16 - 2022

Edited by

Stefano Zucca, University of Turin, Italy

Reviewed by

Stefan Schöneich, Friedrich Schiller University Jena, Germany; Katherine Tschida, Cornell University, United States

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Darcy B. Kelley dbk3@columbia.edu

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

SYSTEMATIC REVIEW article

Convergent and divergent neural circuit architectures that support acoustic communication

Abstract

The evolution of vocal communication in tetrapod vertebrates; Introduction and overview

Phylogeny of vocal signaling in Xenopus

How Xenopus communicate

How Xenopus make sounds

How Xenopus hear sounds

How the CNS generates Xenopus vocal patterns

Reproductive state; Hormones and behavior

Neural circuit architecture underlying social vocalization in tetrapods