<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Artif. Intell.</journal-id>
<journal-title>Frontiers in Artificial Intelligence</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Artif. Intell.</abbrev-journal-title>
<issn pub-type="epub">2624-8212</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frai.2021.530937</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artificial Intelligence</subject>
<subj-group>
<subject>Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Neuronal Sequence Models for Bayesian Online Inference</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Fr&#x000F6;lich</surname> <given-names>Sascha</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/763659/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Markovi&#x00107;</surname> <given-names>Dimitrije</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/21826/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Kiebel</surname> <given-names>Stefan J.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/4308/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Psychology, Technische Universit&#x000E4;t Dresden</institution>, <addr-line>Dresden</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Bertram M&#x000FC;ller-Myhsok, Max Planck Institute of Psychiatry (MPI), Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Hazem Toutounji, University of Nottingham, United Kingdom; Philipp Georg S&#x000E4;mann, Max Planck Institute of Psychiatry, Germany</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Sascha Fr&#x000F6;lich <email>sascha.froelich&#x00040;tu-dresden.de</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Medicine and Public Health, a section of the journal Frontiers in Artificial Intelligence</p></fn></author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>05</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>4</volume>
<elocation-id>530937</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>01</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>13</day>
<month>04</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2021 Fr&#x000F6;lich, Markovi&#x00107; and Kiebel.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Fr&#x000F6;lich, Markovi&#x00107; and Kiebel</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Various imaging and electrophysiological studies in a number of different species and brain regions have revealed that neuronal dynamics associated with diverse behavioral patterns and cognitive tasks take on a sequence-like structure, even when encoding stationary concepts. These neuronal sequences are characterized by robust and reproducible spatiotemporal activation patterns. This suggests that the role of neuronal sequences may be much more fundamental for brain function than is commonly believed. Furthermore, the idea that the brain is not simply a passive observer but an active predictor of its sensory input, is supported by an enormous amount of evidence in fields as diverse as human ethology and physiology, besides neuroscience. Hence, a central aspect of this review is to illustrate how neuronal sequences can be understood as critical for probabilistic predictive information processing, and what dynamical principles can be used as generators of neuronal sequences. Moreover, since different lines of evidence from neuroscience and computational modeling suggest that the brain is organized in a functional hierarchy of time scales, we will also review how models based on sequence-generating principles can be embedded in such a hierarchy, to form a generative model for recognition and prediction of sensory input. We shortly introduce the Bayesian brain hypothesis as a prominent mathematical description of how online, i.e., fast, recognition, and predictions may be computed by the brain. Finally, we briefly discuss some recent advances in machine learning, where spatiotemporally structured methods (akin to neuronal sequences) and hierarchical networks have independently been developed for a wide range of tasks. We conclude that the investigation of specific dynamical and structural principles of sequential brain activity not only helps us understand how the brain processes information and generates predictions, but also informs us about neuroscientific principles potentially useful for designing more efficient artificial neuronal networks for machine learning tasks.</p></abstract>
<kwd-group>
<kwd>neuronal sequences</kwd>
<kwd>Bayesian inference</kwd>
<kwd>generative models</kwd>
<kwd>Bayesian brain hypothesis</kwd>
<kwd>predictive coding</kwd>
<kwd>hierarchy of time scales</kwd>
<kwd>recurrent neural networks</kwd>
<kwd>spatiotemporal trajectories</kwd>
</kwd-group>
<contract-num rid="cn001">EXC 2050/1, Project ID 390696704</contract-num>
<contract-num rid="cn001">SFB 940/2, A9</contract-num>
<contract-num rid="cn001">TRR 265/1, B09</contract-num>
<contract-sponsor id="cn001">Deutsche Forschungsgemeinschaft<named-content content-type="fundref-id">10.13039/501100001659</named-content></contract-sponsor>
<counts>
<fig-count count="7"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="191"/>
<page-count count="17"/>
<word-count count="14262"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>In the neurosciences, one important experimental and theoretical finding of recent years was that many brain functions can be described as predictive (Rao and Ballard, <xref ref-type="bibr" rid="B151">1999</xref>; Pastalkova et al., <xref ref-type="bibr" rid="B140">2008</xref>; Friston and Kiebel, <xref ref-type="bibr" rid="B69">2009</xref>; Aitchison and Lengyel, <xref ref-type="bibr" rid="B8">2017</xref>). This means that the brain not only represents current states of the environment but also potential states of the future to adaptively select its actions and behavior. For such predictions, one important feature of neuronal dynamics is their often-observed sequence-like structure. In this review, we will present evidence that sequence-like structure in neuronal dynamics is found over a wide range of different experiments and different species. In addition, we will also review models for such sequence-like neuronal dynamics, which can be used as generative models for Bayesian inference to compute predictions. To familiarize readers of different backgrounds with each of these topics, we first briefly give an overview of the topics of sequences, predictions, hierarchical structure, the so-called Bayesian brain hypothesis and provide a more precise definition of the kind of sequence-like neuronal dynamics that we consider in this review.</p>
<sec>
<title>1.1. Sequences in the Brain</title>
<p>The brain is constantly receiving spatiotemporally structured sensory input. This is most evident in the auditory domain where, when listening to human speech, the brain receives highly structured, sequential input in the form of phonemes, words, and sentences (Giraud and Poeppel, <xref ref-type="bibr" rid="B79">2012</xref>). Furthermore, even in situations which apparently provide only static sensory input, the brain relies on spatiotemporally structured coding. For example, when observing a static visual scene, the eyes constantly perform high-frequency micro-oscillations and exploratory saccades (Martinez-Conde et al., <xref ref-type="bibr" rid="B126">2004</xref>; Martinez-Conde, <xref ref-type="bibr" rid="B125">2006</xref>), which renders the visual input spatiotemporally structured, and yet the visual percepts appear stationary. Another example is olfaction, where in animal experiments, it has been shown that neurons in the olfactory system respond to a stationary odor with an elaborate temporal coding scheme (Bazhenov et al., <xref ref-type="bibr" rid="B19">2001</xref>; Jones et al., <xref ref-type="bibr" rid="B96">2007</xref>). In the state space of those neurons, their activity followed a robust and reproducible trajectory, a neuronal sequence (see <xref ref-type="table" rid="T1">Table 1</xref>), which was specific to the presented odor. Similarly, in a behavioral experiment with monkeys, spatial information of an object was encoded by a dynamical neural code, although the encoded relative location of the object remained unchanged (Crowe et al., <xref ref-type="bibr" rid="B45">2010</xref>). In other words, there is evidence that the brain recognizes both dynamic and static entities in our environment on the basis of sequence-like encoding.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Glossary.</p></caption>
<table frame="hsides" rules="groups">
<tbody><tr>
<td valign="top" align="left">Neuronal sequence</td>
<td valign="top" align="left">Spatiotemporal patterns of neuronal activity that encode stimulus properties, abstract concepts, or motion signals (see <xref ref-type="fig" rid="F1">Figure 1</xref>). Can be described by a specific, sequential trajectory in the so-called state space of the system, see also <xref ref-type="fig" rid="F3">Figure 3</xref> for an example.</td>
</tr>
<tr>
<td valign="top" align="left">State space/Phase space</td>
<td valign="top" align="left">A multidimensional space that encompasses all possible states a system can be in. Every possible state is defined by a unique point in the space.</td>
</tr>
<tr>
<td valign="top" align="left">Continuodiscrete dynamics/Trajectory</td>
<td valign="top" align="left">Reproducible spatiotemporal trajectories characterized by discrete points in state space (see <xref ref-type="fig" rid="F3">Figure 3</xref>).</td>
</tr>
<tr>
<td valign="top" align="left">Winnerless Competition (WLC)</td>
<td valign="top" align="left">Type of dynamic behavior of a system where the system shortly settles into a stable or metastable state before being forced away from it (by internal or external mechanisms) (see <xref ref-type="fig" rid="F3">Figures 3</xref>, <xref ref-type="fig" rid="F6">6</xref>).</td>
</tr>
<tr>
<td valign="top" align="left">Metastable state/Saddle state</td>
<td valign="top" align="left">A state in the state space of a dynamical system. A metastable state of a system is stable in some directions and unstable in others. A saddle point is a metastable point where the first derivative vanishes.</td>
</tr>
<tr>
<td valign="top" align="left">Stable heteroclinic channel (SHC)</td>
<td valign="top" align="left">Type of dynamic behavior of a system where the system goes through a succession of saddle points (metastable states) forming heteroclinic state-space trajectories (orbits). Importantly, small deviations from those trajectories will not diverge away from the heteroclinic orbit. See section 2.2.2.</td>
</tr>
<tr>
<td valign="top" align="left">Heteroclinic orbit/Trajectory</td>
<td valign="top" align="left">A path in the state space of a system that connects two equilibrium points.</td>
</tr>
<tr>
<td valign="top" align="left">Limit cycle</td>
<td valign="top" align="left">Attractor type occurring in some complex dynamical systems. Closed, continuous trajectory in state space with fixed period and amplitude. The regular firing behavior of neurons can be described by limit cycle behavior. See section 2.2.1.</td>
</tr>
<tr>
<td valign="top" align="left">Synfire chain</td>
<td valign="top" align="left">A feed-forward neuronal network architecture. See section 2.1.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Neuronal sequences have been reported in a wide range of experimental contexts. For example, in the hippocampus of mice and rats (MacDonald et al., <xref ref-type="bibr" rid="B123">2011</xref>; Pastalkova et al., <xref ref-type="bibr" rid="B140">2008</xref>; Bhalla, <xref ref-type="bibr" rid="B22">2019</xref>; Skaggs and McNaughton, <xref ref-type="bibr" rid="B160">1996</xref>; Dragoi and Tonegawa, <xref ref-type="bibr" rid="B54">2011</xref>), the visual cortex of cats and rats (Kenet et al., <xref ref-type="bibr" rid="B99">2003</xref>; Ji and Wilson, <xref ref-type="bibr" rid="B95">2007</xref>), the somatosensory cortex of mice (Laboy-Ju&#x000E1;rez et al., <xref ref-type="bibr" rid="B111">2019</xref>), the parietal cortex of monkeys and mice (Crowe et al., <xref ref-type="bibr" rid="B45">2010</xref>; Harvey et al., <xref ref-type="bibr" rid="B85">2012</xref>), the frontal cortex of monkeys (Seidemann et al., <xref ref-type="bibr" rid="B158">1996</xref>; Abeles et al., <xref ref-type="bibr" rid="B2">1995</xref>; Baeg et al., <xref ref-type="bibr" rid="B16">2003</xref>), the gustatory cortex of rats (Jones et al., <xref ref-type="bibr" rid="B96">2007</xref>), the locust antennal lobe (Bazhenov et al., <xref ref-type="bibr" rid="B19">2001</xref>), specific song-related areas in the brain of songbirds (Hahnloser et al., <xref ref-type="bibr" rid="B82">2002</xref>), and the amygdala of monkeys (Reitich-Stolero and Paz, <xref ref-type="bibr" rid="B153">2019</xref>), among others. Even at the cellular level, there is evidence of sequence-processing capacities of single neurons (Branco et al., <xref ref-type="bibr" rid="B31">2010</xref>). Neuronal sequences seem to serve a variety of different purposes. While sequences in specific brain regions drive the spatiotemporal motor patterns during behavior like birdsong rendition (Hahnloser et al., <xref ref-type="bibr" rid="B82">2002</xref>) (<xref ref-type="fig" rid="F1">Figure 1B</xref>), in other studies of different brain areas and different species, neuronal sequences were found to encode stationary stimuli (Seidemann et al., <xref ref-type="bibr" rid="B158">1996</xref>; Bazhenov et al., <xref ref-type="bibr" rid="B19">2001</xref>) and spatial information (Crowe et al., <xref ref-type="bibr" rid="B45">2010</xref>), to represent past experience (Skaggs and McNaughton, <xref ref-type="bibr" rid="B160">1996</xref>) (see also <xref ref-type="fig" rid="F1">Figure 1A</xref>), and to be involved with both working memory and memory consolidation (MacDonald et al., <xref ref-type="bibr" rid="B123">2011</xref>; Harvey et al., <xref ref-type="bibr" rid="B85">2012</xref>; Skaggs and McNaughton, <xref ref-type="bibr" rid="B160">1996</xref>). Behaviorally relevant neuronal sequences were reported to occur before the first execution of a task (Dragoi and Tonegawa, <xref ref-type="bibr" rid="B54">2011</xref>), and in some behavioral tasks sequences were found to be predictive of future behavior (Abeles et al., <xref ref-type="bibr" rid="B2">1995</xref>; Pastalkova et al., <xref ref-type="bibr" rid="B140">2008</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Four illustrative examples of sequential neuronal activity in different paradigms and experimental contexts. <bold>(A)</bold> Sequential activation of rat hippocampal cells are found during action and in rest phases after the behavioral tasks. The top plot shows the spiking histogram of 91 hippocampal cells during a rat&#x00027;s trip along a physical track. The bottom panel shows the rat&#x00027;s actual position on the track (blue line) against the position inferred from the spiking pattern of its hippocampal cells. After the traversal of the track, hippocampal cells &#x0201C;replayed&#x0201D; their activation sequence in reverse during a short ripple event (red box, enlarged in the box on the right). Figure adapted from Pfeiffer (<xref ref-type="bibr" rid="B143">2020</xref>) (Copyright 1999&#x02013;2019 John Wiley &#x00026; Sons, Inc.). <bold>(B)</bold> Zebra finches are songbirds whose songs consist of highly consistent so-called song motifs. Here, the activations of ten different HVC<sub>(RA)</sub> neurons and two HVC interneurons in the HVC nucleus of the zebra finch brain during ten renditions of the same song motif are shown. HVC<sub>(RA)</sub> project from the HVC nucleus to the RA nucleus in the birdbrain, and exhibit precise and reproducible firing sequences during the rendition of a song. Adapted from Hahnloser et al. Hahnloser et al. (<xref ref-type="bibr" rid="B82">2002</xref>) with permission from Springer Nature. <bold>(C)</bold> Firing patterns of neurons in the gustatory cortex of rats <italic>in vivo</italic> when presented with four different odors. The sequential switching of states of a hidden Markov model (HMM, see section 3.1) was characteristic of the presented aroma. For each of the four odors, the different color hues represent different HMM states that were inferred based on the data. Adapted from Jones et al. (<xref ref-type="bibr" rid="B96">2007</xref>) (Copyright 2007 National Academy of Sciences, U.S.A.). <bold>(D)</bold> Evidence for fast sequence representation in human participants during planning of a trajectory through task state space, see Kurth-Nelson et al. (<xref ref-type="bibr" rid="B110">2016</xref>) for details. The four examples, each for a different participant, show evidence of brain activity, as measured with magnetoencephalography (MEG), to quickly transition through task state space with roughly 40 ms duration for each sequence element. Figure taken from Kurth-Nelson et al. (<xref ref-type="bibr" rid="B110">2016</xref>).</p></caption>
<graphic xlink:href="frai-04-530937-g0001.tif"/>
</fig>
<p>As these findings show, neuronal sequences can be measured in different species, in different brain areas and at different levels of observation, where the expression of these sequences depends on the measurement and analysis method. A neuronal sequence can appear as the successive spiking of neurons (<xref ref-type="fig" rid="F1">Figures 1A,B</xref>), or the succession of more abstract compound states (<xref ref-type="fig" rid="F1">Figure 1C</xref>), or in yet different forms, depending on the experimental approach. For example, evidence for sequences can also be found with non-invasive cognitive neuroscience methods like magnetoencephalography (MEG) as shown in <xref ref-type="fig" rid="F1">Figure 1D</xref>. Given these very different appearances of experimentally observed neuronal sequences, it is clear that an answer to the question of &#x0201C;What is a neuronal sequence?&#x0201D; depends on the experimental setup. In the context of this article, we understand a &#x0201C;neuronal sequence&#x0201D; quite broadly as any kind of robust and reproducible spatiotemporal trajectory, where stimulus properties, abstract concepts, or motion signals are described by a specific trajectory in the state space of the system (see <xref ref-type="table" rid="T1">Table 1</xref>). The brain may use such trajectory representations, whose experimental expressions are measured as neuronal sequences, to form a basis for encoding the spatiotemporal structure of sensory stimuli (Buonomano and Maass, <xref ref-type="bibr" rid="B33">2009</xref>) and the statistical dependencies between past, present, and future (Friston and Buzs&#x000E1;ki, <xref ref-type="bibr" rid="B68">2016</xref>). Here, we will review evidence for this type of encoding and discuss some of the implications for our understanding of the brain&#x00027;s capacity to perform probabilistic inference, i.e., recognition based on spatiotemporally structured sensory input.</p></sec>
<sec>
<title>1.2. Hierarchies in the Brain</title>
<p>The brain&#x00027;s structure and function are often described with reference to a hierarchical organization, which we will cover in more detail in section 3.2. Human behavior can be described as a hierarchically structured process (Lashley and Jeffress, <xref ref-type="bibr" rid="B112">1951</xref>; Rosenbaum et al., <xref ref-type="bibr" rid="B155">2007</xref>; Dezfouli et al., <xref ref-type="bibr" rid="B50">2014</xref>), as can memory, where the grouping of information-carrying elements into chunks constitutes a hierarchical scheme (Bousfield, <xref ref-type="bibr" rid="B30">1953</xref>; Miller, <xref ref-type="bibr" rid="B133">1956</xref>; Fonollosa et al., <xref ref-type="bibr" rid="B65">2015</xref>). Similarly, the perception and recognition of spatiotemporally structured input can be regarded as a hierarchical process. For example, percepts, such as the observation of a walking person can be regarded as percepts of higher order (&#x0201C;walking person&#x0201D;), as they emerge from the combination of simpler, lower order percepts, e.g., a specific sequence of limb movements. Critically, the concept &#x0201C;someone walking&#x0201D; is represented at a slower time scale as compared to the faster movements of individual limbs that constitute the walking. There is emerging evidence that the brain is structured and organized hierarchically along the relevant time scales of neuronal sequences (e.g., Murray et al., <xref ref-type="bibr" rid="B135">2014</xref>; Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>; Cocchi et al., <xref ref-type="bibr" rid="B42">2016</xref>; Mattar et al., <xref ref-type="bibr" rid="B127">2016</xref>; Gauthier et al., <xref ref-type="bibr" rid="B75">2012</xref>; Kiebel et al., <xref ref-type="bibr" rid="B100">2008</xref>). Such a hierarchy allows the brain to model the causal structure of its sensory input and form predictions at slower time scales (&#x0201C;someone walking&#x0201D;) by representing trajectories capturing the dynamics of its expected spatiotemporal sensory input at different time scales, and by representing causal dependencies between time scales. This allows for inference about the causes of sensory input in the environment, as well as for inference of the brain&#x00027;s own control signals (e.g., motor actions). In this paper, we will review some of the experimental evidence and potential computational models for sequence generation and inference.</p>
<p>In the following section 1.3 we will first give a short introduction to the Bayesian brain hypothesis and the basic concept of the brain as a predictor of its environment. In section 1.4 we will go into more detail about the question &#x0201C;What is a sequence?&#x0201D; and will further discuss the trajectory representation. In section 2, we will provide an overview of several dynamical principles that might underlie the generation of neuronal trajectories in biological networks. Importantly, we are going to focus on general dynamical network principles that may underlie sequence generation, and which may differentiate types of sequence-generating networks. We are therefore not going to cover the vast field of sequence learning (e.g., Sussillo and Abbott, <xref ref-type="bibr" rid="B165">2009</xref>; Tully et al., <xref ref-type="bibr" rid="B170">2016</xref>; Lipton et al., <xref ref-type="bibr" rid="B116">2015</xref>; W&#x000F6;rg&#x000F6;tter and Porr, <xref ref-type="bibr" rid="B177">2005</xref>), which mainly investigates neurobiologically plausible learning rules and algorithms that can lead to neuronal sequences, and thus possibly to the network types discussed in this article. In section 3, we review some approaches in which sequences are used to model recognition of sensory input. To highlight the relevance of sequence generators to a large variety of problems, we will visit methods and advances in computer science and machine learning, where structured artificial recurrent neural networks (RNNs) that are able to generate spatiotemporal activity patterns are used to perform a range of different computational tasks. This will however only serve as a rough and incomplete overview over some common machine learning methods, and we will not cover methods like Markov Decision Processes (Feinberg and Shwartz, <xref ref-type="bibr" rid="B60">2012</xref>) and related approaches, as an overview of research on sequential decision making is beyond the scope of this review. Finally, we will briefly discuss functional hierarchies in the brain and in machine learning applications. A glossary of technical terms that we will use in the review can be found in <xref ref-type="table" rid="T1">Table 1</xref>.</p></sec>
<sec>
<title>1.3. The Bayesian Brain Hypothesis</title>
<p>Dating back to Hermann von Helmholtz in the 19th century, the idea that the brain performs statistical inference on its sensory input to infer the underlying probable causes of that same input (Helmholtz, <xref ref-type="bibr" rid="B88">1867</xref>), started gaining considerable traction toward the end of the 20th century and had a strong influence on both computer science and neuroscience (Hinton and Sejnowski, <xref ref-type="bibr" rid="B89">1983</xref>; Dayan et al., <xref ref-type="bibr" rid="B47">1995</xref>; Wolpert et al., <xref ref-type="bibr" rid="B176">1995</xref>; Friston, <xref ref-type="bibr" rid="B66">2005</xref>; Friston et al., <xref ref-type="bibr" rid="B70">2006</xref>; Beck et al., <xref ref-type="bibr" rid="B20">2008</xref>; see also Rao and Ballard, <xref ref-type="bibr" rid="B151">1999</xref>; Ernst and Banks, <xref ref-type="bibr" rid="B59">2002</xref>; K&#x000F6;rding and Wolpert, <xref ref-type="bibr" rid="B104">2004</xref>). In particular, research into this interpretation of brain function led to the formulation of the Bayesian brain hypothesis (Knill and Pouget, <xref ref-type="bibr" rid="B102">2004</xref>; Doya et al., <xref ref-type="bibr" rid="B53">2007</xref>; Friston, <xref ref-type="bibr" rid="B67">2010</xref>). The Bayesian brain hypothesis posits that aspects of brain function can be described as equivalent to Bayesian inference based on a causal generative model of the world, which models the statistical and causal regularities of the environment. In this framework, recognition is modeled as Bayesian inversion of the generative model, which assigns probabilities, that is, beliefs to different states of the world based on perceived sensory information. This process of Bayesian inference is hypothesized to be an appropriate basis for the mathematical description of most, if not all, brain functions (Friston, <xref ref-type="bibr" rid="B67">2010</xref>; Knill and Pouget, <xref ref-type="bibr" rid="B102">2004</xref>). Although the hypothesis that the brain is governed by Bayesian principles has met with criticism since human behavior does not always appear to be Bayes-optimal (Rahnev and Denison, <xref ref-type="bibr" rid="B149">2018</xref>; Soltani et al., <xref ref-type="bibr" rid="B161">2016</xref>), and because the definition of Bayes-optimality can be ambiguous (Colombo and Seri&#x000E8;s, <xref ref-type="bibr" rid="B43">2012</xref>), there is growing evidence that human behavior can indeed be explained by Bayesian principles (<xref ref-type="fig" rid="F2">Figure 2</xref>) (Ernst and Banks, <xref ref-type="bibr" rid="B59">2002</xref>; K&#x000F6;rding and Wolpert, <xref ref-type="bibr" rid="B104">2004</xref>; Weiss et al., <xref ref-type="bibr" rid="B175">2002</xref>; Feldman, <xref ref-type="bibr" rid="B61">2001</xref>), and that even phenomena like mental disorders might be explained by Bayesian mechanisms (Adams et al., <xref ref-type="bibr" rid="B5">2013</xref>; Leptourgos et al., <xref ref-type="bibr" rid="B114">2017</xref>; Fletcher and Frith, <xref ref-type="bibr" rid="B64">2009</xref>) (see Knill and Pouget, <xref ref-type="bibr" rid="B102">2004</xref> and Clark, <xref ref-type="bibr" rid="B41">2013</xref> for reviews on the Bayesian brain hypothesis). How Bayesian inference is achieved in the human brain is an ongoing debate, and it has been proposed that the corresponding probabilities are encoded on a population level (Zemel et al., <xref ref-type="bibr" rid="B187">1998</xref>; Beck et al., <xref ref-type="bibr" rid="B20">2008</xref>) or on single-neuron level (Deneve, <xref ref-type="bibr" rid="B48">2008</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Illustration of Bayesian Inference. The prior belief (blue) about a state is updated by sensory evidence (red) represented by the likelihood function. The updated belief is the posterior belief (turquoise), which will serve as the prior belief in the next updating step. Each row illustrates how the shape of the prior distribution and the likelihood influence the inference process. Both an increase in likelihood precision (inverse variance), and a decrease in prior precision result in a posterior belief which is more biased toward the sensory evidence. This is illustrated by a deviation of the posterior toward the sensory evidence and away from the prior belief (dashed line and arrows). In the Bayesian predictive coding framework (Friston and Kiebel, <xref ref-type="bibr" rid="B69">2009</xref>; Rao and Ballard, <xref ref-type="bibr" rid="B151">1999</xref>), inference naturally minimizes the prediction error, defined as the difference between expected and observed outcomes. Figure reprinted from Adams et al. (<xref ref-type="bibr" rid="B5">2013</xref>).</p></caption>
<graphic xlink:href="frai-04-530937-g0002.tif"/>
</fig>
<p>Under the Bayesian view, model inversion, i.e., recognition, satisfies Bayes&#x00027; theorem, which states that the optimal posterior belief about a state is proportional to the generative model&#x00027;s prior expectation about the state multiplied by the probability of the sensory evidence under the generative model. In Bayesian inference, prior expectation, posterior belief, and sensory evidence are represented as probability distributions and accordingly called <italic>prior distribution, posterior distribution</italic>, and <italic>likelihood</italic> (<xref ref-type="fig" rid="F2">Figure 2</xref>). The posterior can be regarded as an updated version of the prior distribution, and will act as the prior in the next inference step. Importantly, the prior is part of the generative model as different priors could lead to qualitatively different expectations (Gelman et al., <xref ref-type="bibr" rid="B76">2017</xref>).</p>
<p>The quality of the inference, that is, the quality of the belief about the hidden states of the world, is dependent on the quality of the agent&#x00027;s generative model, and the appropriateness of a tractable (approximate) inference scheme. In this review paper, we suggest that good generative models of our typical environment should generate, that is, expect sequences, and that such a sequence-like representation of environmental dynamics is used to robustly perform tractable inference on spatiotemporally structured sensory data.</p>
<p>The theory of predictive coding suggests that the equivalent of an inversion of the generative model in the cortex is achieved in a hierarchical manner by error-detecting neurons which encode the difference between top-down predictions and sensory input (Friston and Kiebel, <xref ref-type="bibr" rid="B69">2009</xref>; Rao and Ballard, <xref ref-type="bibr" rid="B151">1999</xref>; Aitchison and Lengyel, <xref ref-type="bibr" rid="B8">2017</xref>) (<xref ref-type="fig" rid="F2">Figure 2</xref>). The fact that sequences in specific contexts appear to have predictive properties (Abeles et al., <xref ref-type="bibr" rid="B2">1995</xref>; Pastalkova et al., <xref ref-type="bibr" rid="B140">2008</xref>) is interesting in light of possible combinations of the frameworks of predictive coding and the Bayesian brain hypothesis (Knill and Pouget, <xref ref-type="bibr" rid="B102">2004</xref>; Doya et al., <xref ref-type="bibr" rid="B53">2007</xref>; Friston, <xref ref-type="bibr" rid="B67">2010</xref>). One intriguing idea is that the brain&#x00027;s internal representations and predictions rely on sequences of neuronal activity (FitzGerald et al., <xref ref-type="bibr" rid="B63">2017</xref>; Kiebel et al., <xref ref-type="bibr" rid="B101">2009</xref>; Hawkins et al., <xref ref-type="bibr" rid="B87">2009</xref>). Importantly, empirical evidence suggests that these approximate representations are structured in temporal and functional hierarchies (see sections 1.2 and 3.2) (Koechlin et al., <xref ref-type="bibr" rid="B103">2003</xref>; Giese and Poggio, <xref ref-type="bibr" rid="B78">2003</xref>; Botvinick, <xref ref-type="bibr" rid="B27">2007</xref>; Badre, <xref ref-type="bibr" rid="B15">2008</xref>; Fuster, <xref ref-type="bibr" rid="B73">2004</xref>). Combining the Bayesian brain hypothesis with the hierarchical aspect of predictive coding provides a theoretical basis for computational mechanisms that drive a lifelong learning of the causal model of the world (Friston et al., <xref ref-type="bibr" rid="B72">2014</xref>). Examples for how these different frameworks can be combined can be found in Yildiz and Kiebel (<xref ref-type="bibr" rid="B182">2011</xref>) and Yildiz et al. (<xref ref-type="bibr" rid="B183">2013</xref>).</p>
<p>As an example of a tight connection between prediction and sequences, one study investigating the electrophysiological responses in the song nucleus HVC of bengalese finch (Bouchard and Brainard, <xref ref-type="bibr" rid="B29">2016</xref>) found evidence for an internal prediction of upcoming song syllables, based on sequential neuronal activity in HVC. As another example, a different study investigating single-cell recordings of neurons in the rat hippocampus found that sequences of neuronal activations during wheel-running between maze runs were predictive of the future behavior of the rats, including errors (Pastalkova et al., <xref ref-type="bibr" rid="B140">2008</xref>). This finding falls in line with other studies showing that hippocampal sequences can correlate with future behavior (Pfeiffer, <xref ref-type="bibr" rid="B143">2020</xref>).</p></sec>
<sec>
<title>1.4. What Are Sequences?</title>
<p>What does it mean to refer to neuronal activity as sequential? In the most common sense of the word, a sequence is usually understood as the serial succession of discrete elements or states. Likewise, when thinking of sequences, most people intuitively think of examples like &#x0201C;A, B, C,&#x02026;&#x0201D; or &#x0201C;1, 2, 3,&#x02026;.&#x0201D; However, when extending this discrete concept to neuronal sequences, there are only few compelling examples where spike activity is readily interpretable as a discrete sequence, like the &#x0201C;domino-chain&#x0201D; activation observed in the birdbrain nucleus HVC (Hahnloser et al., <xref ref-type="bibr" rid="B82">2002</xref>) (<xref ref-type="fig" rid="F1">Figure 1B</xref>). As mentioned before, we will use the word &#x0201C;sequence&#x0201D; to describe robust and reproducible spatiotemporal trajectories, which encode information to be processed or represented. Apart from the overwhelming body of literature reporting sequences in many different experimental settings (section 1.1), particularly interesting are the hippocampus (Bhalla, <xref ref-type="bibr" rid="B22">2019</xref>; Pfeiffer, <xref ref-type="bibr" rid="B143">2020</xref>) and entorhinal cortex (Zutshi et al., <xref ref-type="bibr" rid="B191">2017</xref>; O&#x00027;Neill et al., <xref ref-type="bibr" rid="B139">2017</xref>). Due to the strong involvement of the hippocampus and the entorhinal cortex with sequences, the idea that neuronal sequences are also used in brain areas directly connected to them is not too far-fetched. For example, hippocampal-cortical interactions are characterized by sharp wave ripples (Buzs&#x000E1;ki, <xref ref-type="bibr" rid="B34">2015</xref>), which are effectively compressed spike sequences. Recent findings suggest that other cortical areas connected to the hippocampus use grid-cell like representations similar to space representation in the entorhinal cortex (Constantinescu et al., <xref ref-type="bibr" rid="B44">2016</xref>; Stachenfeld et al., <xref ref-type="bibr" rid="B163">2017</xref>). This is noteworthy because grid cells have been linked to sequence-like information processing (Zutshi et al., <xref ref-type="bibr" rid="B191">2017</xref>; O&#x00027;Neill et al., <xref ref-type="bibr" rid="B139">2017</xref>). This suggests that at least areas connected to the hippocampus and entorhinal cortex are able to decode neuronal sequences.</p>
<p>The example of odor recognition shows that sequences are present even in circumstances where one intuitively would not expect them (<xref ref-type="fig" rid="F1">Figure 1C</xref>). This very example does also show an interesting gap between a continuous and a discrete type of representation: The spatiotemporal trajectory is of a continuous nature, while the representation of the odor identity is characterized by discrete states and at a slower time scale. This gap also presents itself on another level. While we understand the term &#x0201C;neuronal sequence&#x0201D; to refer to a robust and reproducible spatiotemporal trajectory, in many cases these continuous state-space trajectories appear as a succession of quasi-discrete states (Abeles et al., <xref ref-type="bibr" rid="B2">1995</xref>; Seidemann et al., <xref ref-type="bibr" rid="B158">1996</xref>; Mazor and Laurent, <xref ref-type="bibr" rid="B128">2005</xref>; Jones et al., <xref ref-type="bibr" rid="B96">2007</xref>). In order to emphasize this interplay between continuous dynamics and discrete points we will denote such dynamics as <italic>continuodiscrete</italic> (see <xref ref-type="table" rid="T1">Table 1</xref>). In continuodiscrete dynamics, robust, and reproducible spatiotemporal trajectories are characterized by discrete points in state-space. As an example, in <xref ref-type="fig" rid="F1">Figure 1C</xref> one can see the response of <italic>in vivo</italic> neurons in the gustatory cortex of rats, which is determined by the odor that is presented to the animal. The activity patterns of the neurons were analyzed with a hidden Markov model which revealed that the activity of the neuron ensemble can be described as a robust succession of discrete Markov states, where the system remains in a state for hundreds of milliseconds before quickly switching to another discrete state. These sequential visits to discrete states and the continuous expression of these states, specifically the switching between them, in terms of fast neuronal dynamics (here spiking neurons) is what we consider as continuodiscrete dynamics. Similar observations have been made in other experiments (Abeles et al., <xref ref-type="bibr" rid="B2">1995</xref>; Seidemann et al., <xref ref-type="bibr" rid="B158">1996</xref>; Mazor and Laurent, <xref ref-type="bibr" rid="B128">2005</xref>; Rabinovich et al., <xref ref-type="bibr" rid="B147">2001</xref>; Rivera et al., <xref ref-type="bibr" rid="B154">2015</xref>) (see also <xref ref-type="fig" rid="F3">Figure 3</xref>). The discrete states of a continuodiscrete sequence can be for example stable fixed points (Gros, <xref ref-type="bibr" rid="B81">2009</xref>), or saddle points (Rabinovich et al., <xref ref-type="bibr" rid="B148">2006</xref>, <xref ref-type="bibr" rid="B147">2001</xref>) of the system, or simply points along a limit cycle trajectory (Yildiz and Kiebel, <xref ref-type="bibr" rid="B182">2011</xref>; Yildiz et al., <xref ref-type="bibr" rid="B183">2013</xref>), depending on the modeling approach (see section 2). Depending on the dynamical model, the system might leave a fixed point due to autonomously induced destabilization (Gros, <xref ref-type="bibr" rid="B80">2007</xref>, <xref ref-type="bibr" rid="B81">2009</xref>), noise (Rabinovich et al., <xref ref-type="bibr" rid="B148">2006</xref>, <xref ref-type="bibr" rid="B147">2001</xref>), or external input (Kurikawa and Kaneko, <xref ref-type="bibr" rid="B109">2015</xref>; Toutounji and Pipa, <xref ref-type="bibr" rid="B169">2014</xref>; Rivera et al., <xref ref-type="bibr" rid="B154">2015</xref>; Hopfield, <xref ref-type="bibr" rid="B90">1982</xref>).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>(A)</bold> Illustration of continuodiscrete dynamics based on Stable Heteroclinic Channels (SHC, see section 2.2.2 and <xref ref-type="table" rid="T1">Table 1</xref>). The solid line represents a continuous heteroclinic trajectory in three-dimensional phase space and the dotted lines indicate invariant manifolds between saddle states (see <xref ref-type="table" rid="T1">Table 1</xref>). The green tube illustrates a Stable Heteroclinic Channel. All heteroclinic trajectories originating in the SHC will remain inside of it. This is a type of WLC dynamics. <bold>(B)</bold> Simulation of an SHC-trajectory based on Lotka-Volterra dynamics, where a point in phase space determines the firing rate of each neuron. <bold>(C)</bold> Neuronal responses to odor representation in the locust brain. <bold>(B,C)</bold> Are adapted from Rabinovich et al. (<xref ref-type="bibr" rid="B147">2001</xref>). Copyright (2001) by the American Physical Society.</p></caption>
<graphic xlink:href="frai-04-530937-g0003.tif"/>
</fig>
<p>Concepts similar to continuodiscrete trajectories have been introduced before. For example, in winner-less competition (WLC) (Rabinovich et al., <xref ref-type="bibr" rid="B146">2000</xref>; Afraimovich et al., <xref ref-type="bibr" rid="B7">2004b</xref>; Rabinovich et al., <xref ref-type="bibr" rid="B145">2008</xref>), a system moves from one discrete metastable fixed-point (see <xref ref-type="table" rid="T1">Table 1</xref>) of the state space to the next, never settling for any state, similar to the fluctuations in a Lotka-Volterra system (Rabinovich et al., <xref ref-type="bibr" rid="B147">2001</xref>) (see <xref ref-type="fig" rid="F3">Figure 3</xref>). In winner-take-all (WTA) dynamics, like during memory recall in a Hopfield network (Hopfield, <xref ref-type="bibr" rid="B90">1982</xref>), the system is attracted to one fixed point in which it will settle. Both WLC and WTA are thus examples of continuodiscrete dynamics. The concept of continuodiscrete dynamics also allows for dynamics which are characterized by an initial alteration between discrete states, before settling into a final state, as for example in Rivera et al. (<xref ref-type="bibr" rid="B154">2015</xref>). In section 2, we will look at different ways to model continuodiscrete neuronal dynamics.</p>
<p>For the brain, representing continuodiscrete trajectories seems to combine the best of two worlds: Firstly, the representation of discrete points forms the basis for the generalization and categorization of the sequence. For example, for the categorization of a specific movement sequence, it is not necessary to consider all the details of the sensory input, as it is sufficient to categorize the sequence type (dancing, walking, running) by recognizing the sequence of discrete points, as e.g., in Giese and Poggio (<xref ref-type="bibr" rid="B78">2003</xref>). Secondly, the brain requires a way of representing continuous dynamics to not miss important details. This is because key information can only be inferred by subtle variations within a sequence, as is often the case in our environment. For instance, when someone is talking, most of the speech content, i.e., what is being said, is represented by discrete points that describe a sequence of specific vocal tract postures. Additionally, there are subtle variations in the exact expression of these discrete points and the continuous dynamics connecting them, which let us infer about otherwise hidden states like the emotional state of the speaker (Birkholz et al., <xref ref-type="bibr" rid="B24">2010</xref>; Kotz et al., <xref ref-type="bibr" rid="B105">2003</xref>; Schmidt et al., <xref ref-type="bibr" rid="B157">2006</xref>). Some of these subtle variations in the sensory input may be of importance to the brain, while others are not. For example, when listening to someone speaking, slight variations in the speaker&#x00027;s talking speed or pitch of voice might give hints about her mood, state of health, or hidden intentions. In other words, representing sensory input as continuodiscrete trajectories enables the recognition of invariances of the underlying movements without losing details.</p>
<p>There is growing evidence that sequences with discrete states like fixed points are a fundamental feature of cognitive and perceptual representations (e.g., Abeles et al., <xref ref-type="bibr" rid="B2">1995</xref>; Seidemann et al., <xref ref-type="bibr" rid="B158">1996</xref>; Mazor and Laurent, <xref ref-type="bibr" rid="B128">2005</xref>; Jones et al., <xref ref-type="bibr" rid="B96">2007</xref>). This feature may be at the heart of several findings in the cognitive sciences which suggest that human perception is chunked into discrete states, see VanRullen and Koch (<xref ref-type="bibr" rid="B172">2003</xref>) for some insightful examples. Assuming that the brain uses some form of continuodiscrete dynamics to model sensory input, we will next consider neuronal sequence-generating mechanisms that may implement such dynamics and act as a generative model for recognition of sensory input. Importantly, as we are interested in generative models of sequential sensory input, we will only consider models that have the ability to autonomously generate sequential activity. Therefore, we are not going to discuss models where sequential activity is driven by sequential external input, as in models of non-autonomous neural networks (Toutounji and Pipa, <xref ref-type="bibr" rid="B169">2014</xref>), or in models where intrinsic sequential neural activity is disrupted by bifurcation-inducing external input (Kurikawa and Kaneko, <xref ref-type="bibr" rid="B109">2015</xref>).</p></sec></sec>
<sec id="s2">
<title>2. Neuronal Network Models as Sequence Generators</title>
<p>In order to explain sequential neuronal activity in networks of biological neurons, several models have been proposed, some of which we are going to review in the following sections. As this paper aims at a general overview of neuronal sequence-generating mechanisms and less at a detailed analysis, we will not cover the details and nuances of the presented dynamical models and refer the interested reader to the references given in the text.</p>
<sec>
<title>2.1. Synfire Chains</title>
<p>Synfire chains are concatenated groups of excitatory neurons with convergent-divergent feed-forward connectivity, as illustrated in <xref ref-type="fig" rid="F4">Figure 4A</xref> (Abeles, <xref ref-type="bibr" rid="B1">1991</xref>; Diesmann et al., <xref ref-type="bibr" rid="B51">1999</xref>). Synchronous activation of one group leads to the activation of the subsequent group in the chain after one synaptic delay (<xref ref-type="fig" rid="F4">Figure 4B</xref>). It has been shown that the only stable operating mode in synfire chains is the synchronous mode where all neurons of a group spike in synchrony (Litvak et al., <xref ref-type="bibr" rid="B117">2003</xref>). Synfire chains create sequences that are temporally highly precise (Abeles, <xref ref-type="bibr" rid="B1">1991</xref>; Diesmann et al., <xref ref-type="bibr" rid="B51">1999</xref>). Such temporally precise sequences have been observed in slices of the mouse primary visual cortex and in V1 of anaesthetized cats (Ikegaya et al., <xref ref-type="bibr" rid="B92">2004</xref>), as well as in the HVC nucleus of the bird brain during song production (Hahnloser et al., <xref ref-type="bibr" rid="B82">2002</xref>; Long et al., <xref ref-type="bibr" rid="B120">2010</xref>), and in the frontal cortex of behaving monkeys (Prut et al., <xref ref-type="bibr" rid="B144">1998</xref>; Abeles and Gat, <xref ref-type="bibr" rid="B3">2001</xref>). While synfire chains make predictions that agree well with these observations, a striking mismatch between synfire chains and neuronal networks in the brain is the absence of recurrent connections in the synfire chain&#x00027;s feed-forward architecture. Modeling studies have shown that sequential activation similar to synfire chain activity can be achieved by changing a small fraction of the connections in a random neural network (Rajan et al., <xref ref-type="bibr" rid="B150">2016</xref>; Chenkov et al., <xref ref-type="bibr" rid="B37">2017</xref>), and that synfire chains can emerge in self-organizing recurrent neural networks under the influence of multiple interacting plasticity mechanisms (Zheng and Triesch, <xref ref-type="bibr" rid="B190">2014</xref>). Such fractional changes of network connections were used to implement working memory (Rajan et al., <xref ref-type="bibr" rid="B150">2016</xref>) or give a possible explanation for the occurrence of memory replay after one-shot learning (Chenkov et al., <xref ref-type="bibr" rid="B37">2017</xref>). Such internally generated sequences have been proposed as a mechanism for memory consolidation, among other things (see Pezzulo et al., <xref ref-type="bibr" rid="B142">2014</xref> for a review).</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>(A)</bold> Illustration of a synfire chain between groups of neurons (filled circles). Arrows indicate excitatory connections. <bold>(B)</bold> Illustration of a spiking histogram of neurons in a synfire chain with 10 groups of 100 neurons each. The average time interval between the firing of two adjacent groups corresponds to one synaptic delay.</p></caption>
<graphic xlink:href="frai-04-530937-g0004.tif"/>
</fig></sec>
<sec>
<title>2.2. Attractor Networks</title>
<sec>
<title>2.2.1. Limit Cycles</title>
<p>Limit cycles are stable attractors in the phase space of a system, and they occur in practically every physical domain (Strogatz, <xref ref-type="bibr" rid="B164">2018</xref>). A limit cycle is a closed trajectory, with fixed period and amplitude (<xref ref-type="fig" rid="F5">Figure 5</xref>). Limit cycles occur frequently in biological and other dynamical systems, and the beating of the heart, or the periodic firing of a pacemaker neuron are examples of limit cycle behavior (Strogatz, <xref ref-type="bibr" rid="B164">2018</xref>). They are of great interest to theoretical neuroscience, as periodic spiking activity can be represented by limit cycles, both on single-cell level (Izhikevich, <xref ref-type="bibr" rid="B93">2007</xref>) and population level (Berry and Quoy, <xref ref-type="bibr" rid="B21">2006</xref>; Jouffroy, <xref ref-type="bibr" rid="B97">2007</xref>; Mi et al., <xref ref-type="bibr" rid="B131">2017</xref>). They also play an important role in the emulation of human motion in robotics. While there are numerous ways to model human motion, one interesting approach is that of <italic>dynamic motion primitives</italic> (DMPs) (Schaal et al., <xref ref-type="bibr" rid="B156">2007</xref>), which elegantly unifies the two different kinds of human motion, rhythmic and non-rhythmic motion, in one framework. The main idea of DMPs is that the limbs move as if they were pulled toward an attractor state. In the case of rhythmic motion, the attractor is given by a limit cycle, while in the case of motion strokes the attractor is a discrete point in space (Schaal et al., <xref ref-type="bibr" rid="B156">2007</xref>). In Kiebel et al. (<xref ref-type="bibr" rid="B101">2009</xref>), Yildiz and Kiebel (<xref ref-type="bibr" rid="B182">2011</xref>), and Yildiz et al. (<xref ref-type="bibr" rid="B183">2013</xref>), the authors used a hierarchical generative model of sequence-generators based on limit cycles to model the generation and perception of birdsong and human speech.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Two different representations of a limit cycle. <bold>(A)</bold> A Limit cycle in three-dimensional phase space. In the case of a neuronal network, the dimensions of the phase space can be interpreted as the firing rates of the neurons. <bold>(B)</bold> Representation of a six-dimensional limit cycle as alternating activations of six different neurons.</p></caption>
<graphic xlink:href="frai-04-530937-g0005.tif"/>
</fig></sec>
<sec>
<title>2.2.2. Heteroclinic Trajectories</title>
<p>Another approach to modeling continuodiscrete dynamics are heteroclinic networks (Ashwin and Timme, <xref ref-type="bibr" rid="B14">2005</xref>; Rabinovich et al., <xref ref-type="bibr" rid="B145">2008</xref>) (see also <xref ref-type="table" rid="T1">Table 1</xref>). A heteroclinic network is a dynamical system with semi-stable states (saddle points) which are connected by invariant manifolds, so-called heteroclinic connections. Networks of coupled oscillators have been shown to give rise to phenomena like heteroclinic cycles (Ashwin and Swift, <xref ref-type="bibr" rid="B13">1992</xref>; Ashwin et al., <xref ref-type="bibr" rid="B12">2007</xref>). It has therefore been proposed that neuronal networks exhibit such heteroclinic behavior as well, which has been verified using simulations of networks of globally coupled Hodgkin-Huxley neurons (Hansel et al., <xref ref-type="bibr" rid="B83">1993a</xref>,<xref ref-type="bibr" rid="B84">b</xref>; Ashwin and Borresen, <xref ref-type="bibr" rid="B10">2004</xref>). Interestingly, heteroclinic networks can be harnessed to perform computational tasks (Ashwin and Borresen, <xref ref-type="bibr" rid="B11">2005</xref>; Neves and Timme, <xref ref-type="bibr" rid="B137">2012</xref>), and it has been shown that it is possible to implement any logic operation within such a network (Neves and Timme, <xref ref-type="bibr" rid="B137">2012</xref>). Furthermore, the itinerancy in a heteroclinic network can be guided by external input, where the trajectory of fixed points discriminates between different inputs (Ashwin et al., <xref ref-type="bibr" rid="B12">2007</xref>; Neves and Timme, <xref ref-type="bibr" rid="B137">2012</xref>), which means that different inputs are encoded by different trajectories in phase space.</p>
<p>While theoretical neuroscience has progressed with research on heteroclinic behavior of coupled neural systems, concrete biological evidence is still sparse, as this requires a concrete and often complex mathematical model which is often beyond the more directly accessible research questions in biological science. Despite this, heteroclinic behavior has been shown to reproduce findings from single-cell recordings in insect olfaction (Rabinovich et al., <xref ref-type="bibr" rid="B147">2001</xref>; Rivera et al., <xref ref-type="bibr" rid="B154">2015</xref>) and olfactory bulb electroencephalography (EEG) in rabbits (Breakspear, <xref ref-type="bibr" rid="B32">2001</xref>). Another study replicated the chaotic hunting behavior of a marine mollusk based on an anatomically plausible neuronal model with heteroclinic winnerless competition (WLC) dynamics (Varona et al., <xref ref-type="bibr" rid="B173">2002</xref>), which is closely related to the dynamic alteration between states in a heteroclinic network (Rabinovich et al., <xref ref-type="bibr" rid="B146">2000</xref>; Afraimovich et al., <xref ref-type="bibr" rid="B7">2004b</xref>; Rabinovich et al., <xref ref-type="bibr" rid="B145">2008</xref>). WLC was proposed as a general information processing principle for dynamical networks and is characterized by dynamic switching between network states, where the switching behavior is based on external input (Afraimovich et al., <xref ref-type="bibr" rid="B7">2004b</xref>) (see <xref ref-type="table" rid="T1">Table 1</xref>). Importantly, the traveled trajectory identifies the received input, while any single state of the trajectory generally does not, see for example Neves and Timme (<xref ref-type="bibr" rid="B137">2012</xref>). In phase space representation, WLC can be achieved by open or closed sequences of heteroclinically concatenated saddle points. Such sequences are termed stable heteroclinic sequences (SHS) if the heteroclinic connections are dissipative, i.e., when a trajectory starting in a neighborhood close to the sequence remains close (Afraimovich et al., <xref ref-type="bibr" rid="B6">2004a</xref>). While perturbations and external forcing can destroy stable heteroclinic sequences, it can be shown that even under such adverse circumstances, in many neurobiologically relevant situations the general sequential behavior of the system is preserved (Rabinovich et al., <xref ref-type="bibr" rid="B148">2006</xref>). Such behavior is described by the concept of Stable Heteroclinic Channels (SHC) (see <xref ref-type="fig" rid="F3">Figure 3</xref> and <xref ref-type="table" rid="T1">Table 1</xref>) (Rabinovich et al., <xref ref-type="bibr" rid="B148">2006</xref>). A simple implementation of SHCs is based on the generalized Lotka-Volterra equations (Bick and Rabinovich, <xref ref-type="bibr" rid="B23">2010</xref>; Rabinovich et al., <xref ref-type="bibr" rid="B147">2001</xref>), which are a type of recurrent neural network implicitly implementing the WLC concept. The temporal precision of a system that evolves along an SHC is defined by the noise level as well as the eigenvalues of the invariant directions of the saddle points. Therefore, sequences along heteroclinic trajectories are reproducible although the exact timing of the sequence elements may be subject to fluctuation.</p>
<p>In a similar approach, recent theoretical work on the behavior of RNNs has introduced the concept of excitable network attractors, which are characterized by stable states of a system connected by excitable connections (Ceni et al., <xref ref-type="bibr" rid="B35">2019</xref>). The conceptual idea of orbits between fixed points may further be implemented in different ways. For instance, transient activation of neuronal clusters can be achieved by autonomously driven destabilization of stable fixed points (Gros, <xref ref-type="bibr" rid="B80">2007</xref>, <xref ref-type="bibr" rid="B81">2009</xref>).</p></sec></sec>
<sec>
<title>2.3. Hierarchical Sequence Generators</title>
<p>As briefly introduced in section 1.2, growing evidence suggests that the brain is organized into a hierarchy of different time scales, which enables the representation of different temporal features in its sensory input (e.g., Murray et al., <xref ref-type="bibr" rid="B135">2014</xref>; Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>; Cocchi et al., <xref ref-type="bibr" rid="B42">2016</xref>; Mattar et al., <xref ref-type="bibr" rid="B127">2016</xref>; Gauthier et al., <xref ref-type="bibr" rid="B75">2012</xref>). Here the idea is that lower levels represent dynamics at faster time scales, which are integrated at higher levels that represent slower time scales. For example, speech consists of phonemes (fast time scales), which are integrated into increasingly slower representations of syllables, words, sentences, and a conversation (Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>; Ding et al., <xref ref-type="bibr" rid="B52">2016</xref>; Boemio et al., <xref ref-type="bibr" rid="B26">2005</xref>). The combination of this hierarchical aspect of brain function with the Bayesian brain hypothesis and the concept of neuronal sequences suggests that the brain implicitly uses hierarchical continuodiscrete dynamical systems as generative models. One illustrative example of a hierarchical continuodiscrete process is given in <xref ref-type="fig" rid="F6">Figure 6</xref>. In this example, the dynamics of the 2nd and 3rd level of the hierarchy are modeled by limit cycles and govern the evolution of parameters of the sequence-generating mechanisms at the levels below. Such an approach for a generative model for prediction and recognition of sensory data has been used to model birdsong and human speech recognition (Yildiz and Kiebel, <xref ref-type="bibr" rid="B182">2011</xref>; Yildiz et al., <xref ref-type="bibr" rid="B183">2013</xref>; Kiebel et al., <xref ref-type="bibr" rid="B101">2009</xref>) (see <xref ref-type="fig" rid="F6">Figure 6</xref>). In Yildiz and Kiebel (<xref ref-type="bibr" rid="B182">2011</xref>), the 3rd level represented sequential neuronal activity in area HVC (proper name, see also <xref ref-type="fig" rid="F1">Figure 1B</xref>), and the 2nd level modeled activity in the robust nucleus of the arcopallium (RA). Similarly, in Rivera et al. (<xref ref-type="bibr" rid="B154">2015</xref>) the authors employed a hierarchical generative model with a heteroclinic sequence for a sequence-generating mechanism to model odor recognition in the insect brain. In a slightly different approach to hierarchical continuodiscrete modeling, hierarchical SHCs, implementing winnerless competition, were used to demonstrate how chunking of information can emerge, similar to memory representation in the brain (Fonollosa et al., <xref ref-type="bibr" rid="B65">2015</xref>). One computational study provided a proof of principle that complex behavior, like handwriting, can be decomposed into a hierarchical organization of stereotyped dynamical flows on manifolds of lower dimensions (Perdikis et al., <xref ref-type="bibr" rid="B141">2011</xref>). These stereotyped dynamics can be regarded as the discrete points in a continuodiscrete sequence, which gave rise to complex and flexible behavior.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Illustration of hierarchical continuodiscrete dynamics based on limit cycles. Slowly changing dynamics at the 3rd level parametrize the sequence of states of the faster changing 2nd-level dynamics <italic>z</italic><sup>(2)</sup>. As the dynamics of variables <inline-formula><mml:math id="M1"><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x02192;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula> and <inline-formula><mml:math id="M2"><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x02192;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula> change between the states &#x0201C;on&#x0201D; and &#x0201C;off,&#x0201D; their behavior constitutes continuodiscrete WLC dynamics. At around iteration step 600, the green unit at the 3rd level (element <inline-formula><mml:math id="M3"><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x02192;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math></inline-formula>) becomes active, which changes the 2nd-level sequential dynamics from red&#x02192;green&#x02192;orange&#x02192;blue&#x02192;red to green&#x02192;orange&#x02192;red&#x02192;blue&#x02192;green. This is achieved by a change of the 2nd-level connectivity matrix <italic>&#x003C1;</italic><sup>(2)</sup> which depends on the 3rd-level variable <inline-formula><mml:math id="M4"><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x02192;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula>. In this toy example, the 2nd-level dynamics model the evolution of the parameters of an Ornstein-Uhlenbeck process (black graph showing the evolution of variable <italic>x</italic><sup>(1)</sup>). In the framework of hierarchical generative modeling, the 1st level would correspond to an agent&#x00027;s predictions of its sensory input, while the higher levels are the hidden states of the agent&#x00027;s generative model. This hierarchical parametrization of sequences is similar to the approach in Kiebel et al. (<xref ref-type="bibr" rid="B101">2009</xref>). The dot product between vectors <bold><italic>b</italic></bold> &#x0003D; (0.6, 0, &#x02212;1, &#x02212;0.3)<sup><italic>T</italic></sup> and <inline-formula><mml:math id="M5"><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x02192;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula> determines the 1st-level attractor <italic>&#x003BC;</italic>. The rate parameter &#x00398; is parametrized by vector <bold><italic>a</italic></bold> &#x0003D; (1, 0.5, 1.2, 0.8)<sup><italic>T</italic></sup> and its dot product with <inline-formula><mml:math id="M6"><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x02192;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:math></inline-formula>. &#x003C3;(&#x000B7;) is the softmax function which is applied element-wise. <bold>1</bold> denotes a vector of ones. <italic>&#x003BA;</italic> &#x0003D; 2, &#x003BB; &#x0003D; 1/8. Gray vertical lines in the 1st level mark the time-points where states in the 2nd level change. This hierarchical parametrization of sequences is similar to the approach in Kiebel et al. (<xref ref-type="bibr" rid="B101">2009</xref>). Similar hierarchical autonomous models can be used as a generative model for Bayesian inference to achieve prediction and recognition of sequential data, as has for example been done in Yildiz and Kiebel (<xref ref-type="bibr" rid="B182">2011</xref>) and Yildiz et al. (<xref ref-type="bibr" rid="B183">2013</xref>).</p></caption>
<graphic xlink:href="frai-04-530937-g0006.tif"/>
</fig>
<p>In the following section, we will briefly review how sequential methods have been used for problems in neuroscience and especially AI. Afterwards, we will review evidence for the organization of neuronal sequences into a hierarchy of time scales.</p></sec></sec>
<sec id="s3">
<title>3. Recognition of Sequences</title>
<p>Although neuronal sequence models, such as the ones introduced in the preceding sections have been used to explain experimentally observed neuronal activity, these models by themselves do not explain how predictions are formed about the future trajectory of a sequence. To take the example of song production and recognition in songbirds, a sequence-generating model of birdsong generation is not sufficient to model or explain how a listening bird recognizes a song (Yildiz and Kiebel, <xref ref-type="bibr" rid="B182">2011</xref>). Given a generative model, recognition of a song corresponds to statistical model inversion (Watzenig, <xref ref-type="bibr" rid="B174">2007</xref>; Ulrych et al., <xref ref-type="bibr" rid="B171">2001</xref>). A simple example of such a scheme is provided in Bitzer and Kiebel (<xref ref-type="bibr" rid="B25">2012</xref>), where RNNs are used as a generative model such that model inversion provides for an online recognition model. As shown in Friston et al. (<xref ref-type="bibr" rid="B71">2011</xref>), one can also place such a generative model into the active inference framework to derive a model that not only recognizes sequential movements from visual input but also generates continuodiscrete movement patterns. Generative models are not only interesting from a cognitive neuroscience perspective but also point at a shared interest with the field of artificial intelligence and specifically machine learning, to find a mechanistic understanding of how spatiotemporally structured sensory input can be recognized by an artificial or a biological agent. In the following, we will discuss how both fields seem to converge on the conceptual idea that generative models should be spatiotemporally structured and hierarchical.</p>
<sec>
<title>3.1. Sequence Recognition in Machine Learning</title>
<p>The most widely-used models for discrete sequence generation are hidden Markov models (HMM) and their time-dependent generalisation, hidden semi-Markov models (HSMM) (Yu, <xref ref-type="bibr" rid="B184">2015</xref>). In particular, HMMs and HSMMs are standard tools in a wide range of applications concerned with e.g., speech recognition (Liu et al., <xref ref-type="bibr" rid="B118">2018</xref>; Zen et al., <xref ref-type="bibr" rid="B188">2004</xref>; Deng et al., <xref ref-type="bibr" rid="B49">2006</xref>) and activity recognition (Duong et al., <xref ref-type="bibr" rid="B55">2005</xref>). Furthermore, they have often been used for the analysis of neuronal activity (Tokdar et al., <xref ref-type="bibr" rid="B168">2010</xref>) and human behavior in general (Eldar et al., <xref ref-type="bibr" rid="B58">2011</xref>). Similar to HSMMs, artificial RNNs are used in machine learning for classifying and predicting time series data. When training a generic RNN for prediction and classification of time series data, one faces various challenges, most notably incorporating information about long-term dependencies in the data. To address these dependencies, specific RNN architectures have been proposed, such as <italic>long-short term memory</italic> (LSTM) networks (Gers et al., <xref ref-type="bibr" rid="B77">1999</xref>) and <italic>gate recurrent units</italic> (GRU) (Chung et al., <xref ref-type="bibr" rid="B39">2014</xref>). In a common LSTM network, additionally to the output variable, the network computes an internal memory variable. This endows the network with high flexibility. LSTM networks belong to the most successful and most widely applied RNN architectures, with applications in virtually every field involving time-series data, or any data structure with long-range dependencies (Yu et al., <xref ref-type="bibr" rid="B185">2019</xref>; LeCun et al., <xref ref-type="bibr" rid="B113">2015</xref>). Another RNN approach is <italic>reservoir computing</italic> (RC), which started with the development of echo-state networks and liquid state machines in the early 2000s (Luko&#x00161;evi&#x0010D;ius et al., <xref ref-type="bibr" rid="B121">2012</xref>; Jaeger, <xref ref-type="bibr" rid="B94">2001</xref>; Maass et al., <xref ref-type="bibr" rid="B122">2002</xref>). In RC, sequential input is fed to one or more input neurons. Those neurons are connected with a <italic>reservoir</italic> of randomly connected neurons, which in turn are connected to one or more output neurons. Connections in the reservoir are pseudo-randomized to elicit dynamics at the edge of chaos (Yildiz et al., <xref ref-type="bibr" rid="B181">2012</xref>), leading to a spatiotemporal network response in the form of reverberations over multiple time scales. RC networks have successfully been applied in almost every field of machine learning and data science, such as speech recognition, handwriting recognition, robot motor control, and financial forecasting (Luko&#x00161;evi&#x0010D;ius et al., <xref ref-type="bibr" rid="B121">2012</xref>; Tanaka et al., <xref ref-type="bibr" rid="B167">2019</xref>).</p>
<p>While there is a lot of research on neurobiologically plausible learning paradigms for RNNs (Sussillo and Abbott, <xref ref-type="bibr" rid="B165">2009</xref>; Miconi, <xref ref-type="bibr" rid="B132">2017</xref>; Taherkhani et al., <xref ref-type="bibr" rid="B166">2020</xref>), one possible approach for understanding the role of neuronal sequences is to use neurobiologically more plausible sequence generation models, which can act as generative models of the causal dynamic relationships in the environment. A natural application would be the development of recognition models based on Bayesian inference (Bitzer and Kiebel, <xref ref-type="bibr" rid="B25">2012</xref>), and more specifically in terms of variational inference (Friston et al., <xref ref-type="bibr" rid="B70">2006</xref>; Daunizeau et al., <xref ref-type="bibr" rid="B46">2009</xref>).</p></sec>
<sec>
<title>3.2. Biological and Artificial Inferential Hierarchies</title>
<p>In neuroscience and the cognitive sciences, the brain is often viewed as a hierarchical system, where a functional hierarchy can be mapped to the structural hierarchy of the cortex (Badre, <xref ref-type="bibr" rid="B15">2008</xref>; Koechlin et al., <xref ref-type="bibr" rid="B103">2003</xref>; Kiebel et al., <xref ref-type="bibr" rid="B100">2008</xref>). The best example of such a hierarchical organization is the visual system, for which the existence of both a functional and an equivalent structural hierarchy is established (Felleman and Van Essen, <xref ref-type="bibr" rid="B62">1991</xref>). Cells in lower levels of the hierarchy encode simple features and have smaller receptive fields than cells further up the hierarchy, which posses larger receptive fields and encode more complex patterns by integrating information from lower levels (Hubel and Wiesel, <xref ref-type="bibr" rid="B91">1959</xref>; Zeki and Shipp, <xref ref-type="bibr" rid="B186">1988</xref>; Giese and Poggio, <xref ref-type="bibr" rid="B78">2003</xref>). This functional hierarchy is mediated by an asymmetry of recurrent connectivity in the visual stream, where forward connections to higher layers are commonly found to have fast, excitatory effects on the post-synaptic neurons, while feedback connections act in a slower, modulatory manner (Zeki and Shipp, <xref ref-type="bibr" rid="B186">1988</xref>; Sherman and Guillery, <xref ref-type="bibr" rid="B159">1998</xref>). Moreover, neuroimaging studies have shown that the brain is generally organized into a modular hierarchical structure (Bassett et al., <xref ref-type="bibr" rid="B18">2010</xref>; Meunier et al., <xref ref-type="bibr" rid="B130">2009</xref>, <xref ref-type="bibr" rid="B129">2010</xref>). This is substantiated by other network-theoretical characteristics of the brain, like its scale-free property (Eguiluz et al., <xref ref-type="bibr" rid="B56">2005</xref>), which is a natural consequence of modular hierarchy (Ravasz and Barab&#x000E1;si, <xref ref-type="bibr" rid="B152">2003</xref>). Hierarchies also play an important role in cognitive neuroscience as most if not all types of behavior, as well as cognitive processes, can be described in a hierarchical fashion. For example, making a cup of tea can be considered a high-order goal in a hierarchy with subgoals that are less abstract and temporally less extended. In the example of making a cup of tea, these subgoals can be: (i) putting a teabag into a pot, (ii) pouring hot water into the pot, and (iii) pouring tea into a cup (example adopted from Botvinick, <xref ref-type="bibr" rid="B27">2007</xref>).</p>
<sec>
<title>3.2.1. A Hierarchy of Time Scales</title>
<p>Importantly, all theories of cortical hierarchies of function share the common assumption that primary sensory regions encode rather quickly changing dynamics representing the fast features of sensory input, and that those regions are at the bottom of the hierarchy, while temporally more extended or more abstract representations are located in higher order cortices. This principle has been conceptualized as a &#x0201C;hierarchy of time scales&#x0201D; (Kiebel et al., <xref ref-type="bibr" rid="B100">2008</xref>; Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>; Koechlin et al., <xref ref-type="bibr" rid="B103">2003</xref>; Badre, <xref ref-type="bibr" rid="B15">2008</xref>; Kaplan et al., <xref ref-type="bibr" rid="B98">2020</xref>). In this view, levels further up the hierarchy code for more general characteristics of the environment and inner cognitive processes, which generally change slowly (Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>; Koechlin et al., <xref ref-type="bibr" rid="B103">2003</xref>; Badre, <xref ref-type="bibr" rid="B15">2008</xref>). For example, although the visual hierarchy is typically understood as a spatial hierarchy, experimental evidence is emerging that it is also a hierarchy of time scales (Cocchi et al., <xref ref-type="bibr" rid="B42">2016</xref>; Gauthier et al., <xref ref-type="bibr" rid="B75">2012</xref>; Mattar et al., <xref ref-type="bibr" rid="B127">2016</xref>). Importantly, the information exchange in such a hierarchy is bidirectional. While top-down information can be regarded as the actions of a generative model trying to predict the sensory input (Dayan et al., <xref ref-type="bibr" rid="B47">1995</xref>; Friston, <xref ref-type="bibr" rid="B66">2005</xref>), recognition is achieved by bottom-up information that provides higher levels in the hierarchy with information about the sensory input, see also Yildiz and Kiebel (<xref ref-type="bibr" rid="B182">2011</xref>) and Yildiz et al. (<xref ref-type="bibr" rid="B183">2013</xref>) for illustrations of this concept. A related finding is an experimentally observed hierarchy of time scales with respect to the time lag of the autocorrelation of neuronal measurements (e.g., Murray et al., <xref ref-type="bibr" rid="B135">2014</xref>). Here, it was found that the decay of autocorrelation was fastest for sensory areas (&#x0003C;100 ms) but longest for prefrontal areas like ACC (&#x0003E;300 ms).</p>
<p>The importance of cognition based on spatiotemporal structure at multiple time scales is also illustrated by various computational modeling studies. In one study, robots were endowed with a neural network whose parameters were let free to evolve over time to optimize performance during a navigation task (Nolfi, <xref ref-type="bibr" rid="B138">2002</xref>). After some time, the robots had evolved neural assemblies with representations at clearly distinct time scales: one assembly had assumed a quickly changing, short time scale associated with immediate sensory input while another assembly had adopted a long time scale, associated with an integration of information over an extended period of time, which was necessary for succeeding at the task. Another modeling study showed that robots with neuronal populations of strongly differing time-constants performed their tasks significantly better than when endowed only with units of approximately identical time-constants (Yamashita and Tani, <xref ref-type="bibr" rid="B179">2008</xref>). In Botvinick (<xref ref-type="bibr" rid="B27">2007</xref>) it was shown that, after learning, a neural network with a structural hierarchy similar to the one proposed for the frontal cortex had organized in such a way that high-level units coded for temporal context while low-level units encoded fast responses similar to the role assigned to sensory and motor regions in theories of hierarchical cortical processing (Kiebel et al., <xref ref-type="bibr" rid="B100">2008</xref>; Alexander and Brown, <xref ref-type="bibr" rid="B9">2018</xref>; Rao and Ballard, <xref ref-type="bibr" rid="B151">1999</xref>; Botvinick, <xref ref-type="bibr" rid="B28">2008</xref>; Badre, <xref ref-type="bibr" rid="B15">2008</xref>; Koechlin et al., <xref ref-type="bibr" rid="B103">2003</xref>; Fuster, <xref ref-type="bibr" rid="B73">2004</xref>).</p>
<p>The principle of representing spatiotemporal dynamics at multiple time scales has also been used to model birdsong generation and inference in songbirds by combining a hierarchically structured RNN with a model of songbirds&#x00027; vocal tract dynamics (Yildiz and Kiebel, <xref ref-type="bibr" rid="B182">2011</xref>). The system consisted of three levels, each of which was governed by the sequential dynamics of an RNN following a limit cycle. The sequential dynamics were influenced both by top-down predictions, and bottom-up prediction errors. In another study, the same concept was applied to the recognition of human speech (Yildiz et al., <xref ref-type="bibr" rid="B183">2013</xref>). The resulting inference scheme was able to recognize spoken words, even under adversarial circumstances like accelerated speech, since it inferred and adapted parameters in an online fashion during the recognition process. The same principle can also be translated to very different types of input, see Rivera et al. (<xref ref-type="bibr" rid="B154">2015</xref>) for an example of insect olfaction.</p></sec>
<sec>
<title>3.2.2. A Hierarchy of Time Scales: Neuroimaging Evidence</title>
<p>Experimental evidence for the hypothesis of a hierarchy of time scales has been reported in several neuroimaging studies (Koechlin et al., <xref ref-type="bibr" rid="B103">2003</xref>; Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>; Lerner et al., <xref ref-type="bibr" rid="B115">2011</xref>; Gauthier et al., <xref ref-type="bibr" rid="B75">2012</xref>; Cocchi et al., <xref ref-type="bibr" rid="B42">2016</xref>; Mattar et al., <xref ref-type="bibr" rid="B127">2016</xref>; Baldassano et al., <xref ref-type="bibr" rid="B17">2017</xref>; Gao et al., <xref ref-type="bibr" rid="B74">2020</xref>), two of which we are going to briefly discuss in the following. One functional magnetic resonance imaging (fMRI) study investigated the temporal receptive windows (TRW) of several brain regions in the human brain (Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>). The TRW of an area is the time-interval over which the region &#x0201C;integrates&#x0201D; incoming information, in order to extract meaning over a specific temporal scale. It was found that regions, such as the primary visual cortex exhibited rather short TRW, while high order regions exhibited intermediate to long TRW (Hasson et al., <xref ref-type="bibr" rid="B86">2008</xref>). Similarly, in Lerner et al. (<xref ref-type="bibr" rid="B115">2011</xref>) the same principle was tested with temporally structured auditory input, i.e., speech. Using fMRI, the authors found evidence for a hierarchy of time scales in specific brain areas. The different time scales represented fast auditory input, words, sentences and paragraphs (see <xref ref-type="fig" rid="F7">Figure 7</xref>).</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p>Study by Lerner et al. (<xref ref-type="bibr" rid="B115">2011</xref>) as an example for representations in a hierarchy of time scales. Here, the authors used fMRI and a between-subject correlational analysis to categorize brain voxels according to four levels of representation. These four levels were fast dynamics of auditory input (red), words (yellow), sentences (green), and paragraphs (blue). Results are displayed on a so-called inflated cortical surface. Figure reprinted from Lerner et al. (<xref ref-type="bibr" rid="B115">2011</xref>).</p></caption>
<graphic xlink:href="frai-04-530937-g0007.tif"/>
</fig></sec>
<sec>
<title>3.2.3. A Hierarchy of Time Scales: Machine Learning</title>
<p>Not surprisingly, the importance of hierarchies of time scales is well-established within the machine learning community (El Hihi and Bengio, <xref ref-type="bibr" rid="B57">1996</xref>; Malhotra et al., <xref ref-type="bibr" rid="B124">2015</xref>). Current state-of-the-art RNN architectures used for prediction and classification of complex time series data are based on recurrent network units organized as temporal hierarchies. Notable examples are the clockwork RNN (Koutnik et al., <xref ref-type="bibr" rid="B106">2014</xref>), gated feedback RNN (Chung et al., <xref ref-type="bibr" rid="B40">2015</xref>), hierarchical multi-scale RNN (Chung et al., <xref ref-type="bibr" rid="B38">2016</xref>), fast-slow RNN (Mujika et al., <xref ref-type="bibr" rid="B134">2017</xref>), and higher order RNNs (HORNNs) (Soltani and Jiang, <xref ref-type="bibr" rid="B162">2016</xref>). These modern RNN architectures have found various applications in motion classification (Neverova et al., <xref ref-type="bibr" rid="B136">2016</xref>; Yan et al., <xref ref-type="bibr" rid="B180">2018</xref>), speech synthesis (Wu and King, <xref ref-type="bibr" rid="B178">2016</xref>; Achanta and Gangashetty, <xref ref-type="bibr" rid="B4">2017</xref>; Zhang and Woodland, <xref ref-type="bibr" rid="B189">2018</xref>), recognition (Chan et al., <xref ref-type="bibr" rid="B36">2016</xref>), and other related areas (Liu et al., <xref ref-type="bibr" rid="B119">2015</xref>; Krause et al., <xref ref-type="bibr" rid="B107">2017</xref>; Kurata et al., <xref ref-type="bibr" rid="B108">2017</xref>). These applications of hierarchical RNN architectures further confirm the relevance of hierarchically organized sequence generators for capturing complex dynamics in our everyday environments.</p></sec></sec></sec>
<sec sec-type="conclusions" id="s4">
<title>4. Conclusion</title>
<p>Here, we have reviewed the evidence that our brain senses its environment as sequential sensory input, and consequently, uses neuronal sequences for predicting future sensory input. Although the general idea that the brain is a prediction device has by now become a mainstream guiding principle in cognitive neuroscience, it is much less clear how exactly the brain computes these predictions. We have reviewed results from different areas of the neurosciences that the brain may achieve this by using a hierarchy of time scales, specifically a hierarchy of sequential dynamics. If this were the case, the question would be whether already known neuroscience results in specific areas can be re-interpreted as evidence for the brain&#x00027;s operations in such a hierarchy of time scales. Such an interpretation is quite natural for neuroscience fields like auditory processing, where such a temporal hierarchy is most evident. But it is much less evident for other areas, like for example decision-making. To further test this suggested theory of brain function, researchers need to design experimental paradigms which are specifically geared toward testing what probabilistic inference mechanisms the brain uses to predict its input at different time scales, and select its own actions. Importantly, hierarchical computational modeling approaches as reviewed here could be used to further provide theoretical evidence of the underlying multi-scale inference mechanism and generate new predictions that can be tested experimentally.</p>
<p>What we found telling is that recent advances in machine learning converge on similar ideas of representing multi scale dynamics in sensory data, although with a different motivation and different aims. The simple reason for this convergence may be that much of the sensory data that is input to machine learning implementations is similar to the kind of sensory input experienced by humans, as for example in videos and speech data. Therefore, we believe that as computational modeling in the neurosciences as reviewed here will gain traction, there will be useful translations form the neurosciences to machine learning applications.</p></sec>
<sec id="s5">
<title>Author Contributions</title>
<p>DM and SK contributed to the conception of the manuscript. SF wrote the manuscript, with contributions by DM and SK. All authors contributed to the article and approved the submitted version.</p></sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Abeles</surname> <given-names>M.</given-names></name></person-group> (<year>1991</year>). <source>Corticonics: Neural Circuits of the Cerebral Cortex</source>. <publisher-loc>Cambridge, UK</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/CBO9780511574566</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abeles</surname> <given-names>M.</given-names></name> <name><surname>Bergman</surname> <given-names>H.</given-names></name> <name><surname>Gat</surname> <given-names>I.</given-names></name> <name><surname>Meilijson</surname> <given-names>I.</given-names></name> <name><surname>Seidemann</surname> <given-names>E.</given-names></name> <name><surname>Tishby</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>1995</year>). <article-title>Cortical activity flips among quasi-stationary states</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>92</volume>, <fpage>8616</fpage>&#x02013;<lpage>8620</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.92.19.8616</pub-id><pub-id pub-id-type="pmid">7567985</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abeles</surname> <given-names>M.</given-names></name> <name><surname>Gat</surname> <given-names>I.</given-names></name></person-group> (<year>2001</year>). <article-title>Detecting precise firing sequences in experimental data</article-title>. <source>J. Neurosci. Methods</source> <volume>107</volume>, <fpage>141</fpage>&#x02013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1016/S0165-0270(01)00364-8</pub-id><pub-id pub-id-type="pmid">11389951</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Achanta</surname> <given-names>S.</given-names></name> <name><surname>Gangashetty</surname> <given-names>S. V.</given-names></name></person-group> (<year>2017</year>). <article-title>Deep elman recurrent neural networks for statistical parametric speech synthesis</article-title>. <source>Speech Commun</source>. <volume>93</volume>, <fpage>31</fpage>&#x02013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.1016/j.specom.2017.08.003</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adams</surname> <given-names>R. A.</given-names></name> <name><surname>Stephan</surname> <given-names>K. E.</given-names></name> <name><surname>Brown</surname> <given-names>H. R.</given-names></name> <name><surname>Frith</surname> <given-names>C. D.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name></person-group> (<year>2013</year>). <article-title>The computational anatomy of psychosis</article-title>. <source>Front. Psychiatry</source> <volume>4</volume>:<fpage>47</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyt.2013.00047</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afraimovich</surname> <given-names>V.</given-names></name> <name><surname>Zhigulin</surname> <given-names>V.</given-names></name> <name><surname>Rabinovich</surname> <given-names>M.</given-names></name></person-group> (<year>2004a</year>). <article-title>On the origin of reproducible sequential activity in neural circuits</article-title>. <source>Chaos</source> <volume>14</volume>, <fpage>1123</fpage>&#x02013;<lpage>1129</lpage>. <pub-id pub-id-type="doi">10.1063/1.1819625</pub-id><pub-id pub-id-type="pmid">15568926</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afraimovich</surname> <given-names>V. S.</given-names></name> <name><surname>Rabinovich</surname> <given-names>M. I.</given-names></name> <name><surname>Varona</surname> <given-names>P.</given-names></name></person-group> (<year>2004b</year>). <article-title>Heteroclinic contours in neural ensembles and the winnerless competition principle</article-title>. <source>Int. J. Bifurc. Chaos</source> <volume>14</volume>, <fpage>1195</fpage>&#x02013;<lpage>1208</lpage>. <pub-id pub-id-type="doi">10.1142/S0218127404009806</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aitchison</surname> <given-names>L.</given-names></name> <name><surname>Lengyel</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>With or without you: predictive coding and bayesian inference in the brain</article-title>. <source>Curr. Opin. Neurobiol</source>. <volume>46</volume>, <fpage>219</fpage>&#x02013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1016/j.conb.2017.08.010</pub-id><pub-id pub-id-type="pmid">28942084</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alexander</surname> <given-names>W. H.</given-names></name> <name><surname>Brown</surname> <given-names>J. W.</given-names></name></person-group> (<year>2018</year>). <article-title>Frontal cortex function as derived from hierarchical predictive coding</article-title>. <source>Sci. Rep</source>. <volume>8</volume>:<fpage>3843</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-018-21407-9</pub-id><pub-id pub-id-type="pmid">29497060</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashwin</surname> <given-names>P.</given-names></name> <name><surname>Borresen</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>Encoding via conjugate symmetries of slow oscillations for globally coupled oscillators</article-title>. <source>Phys. Rev. E</source> <volume>70</volume>:<fpage>026203</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevE.70.026203</pub-id><pub-id pub-id-type="pmid">15447561</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashwin</surname> <given-names>P.</given-names></name> <name><surname>Borresen</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Discrete computation using a perturbed heteroclinic network</article-title>. <source>Phys. Lett. A</source> <volume>347</volume>, <fpage>208</fpage>&#x02013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1016/j.physleta.2005.08.013</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashwin</surname> <given-names>P.</given-names></name> <name><surname>Orosz</surname> <given-names>G.</given-names></name> <name><surname>Wordsworth</surname> <given-names>J.</given-names></name> <name><surname>Townley</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Dynamics on networks of cluster states for globally coupled phase oscillators</article-title>. <source>SIAM J. Appl. Dyn. Syst</source>. <volume>6</volume>, <fpage>728</fpage>&#x02013;<lpage>758</lpage>. <pub-id pub-id-type="doi">10.1137/070683969</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashwin</surname> <given-names>P.</given-names></name> <name><surname>Swift</surname> <given-names>J. W.</given-names></name></person-group> (<year>1992</year>). <article-title>The dynamics of n weakly coupled identical oscillators</article-title>. <source>J. Nonlin. Sci</source>. <volume>2</volume>, <fpage>69</fpage>&#x02013;<lpage>108</lpage>. <pub-id pub-id-type="doi">10.1007/BF02429852</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashwin</surname> <given-names>P.</given-names></name> <name><surname>Timme</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Nonlinear dynamics: when instability makes sense</article-title>. <source>Nature</source> <volume>436</volume>:<fpage>36</fpage>. <pub-id pub-id-type="doi">10.1038/436036b</pub-id><pub-id pub-id-type="pmid">16001052</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Badre</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes</article-title>. <source>Trends Cogn. Sci</source>. <volume>12</volume>, <fpage>193</fpage>&#x02013;<lpage>200</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2008.02.004</pub-id><pub-id pub-id-type="pmid">18403252</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baeg</surname> <given-names>E.</given-names></name> <name><surname>Kim</surname> <given-names>Y.</given-names></name> <name><surname>Huh</surname> <given-names>K.</given-names></name> <name><surname>Mook-Jung</surname> <given-names>I.</given-names></name> <name><surname>Kim</surname> <given-names>H.</given-names></name> <name><surname>Jung</surname> <given-names>M.</given-names></name></person-group> (<year>2003</year>). <article-title>Dynamics of population code for working memory in the prefrontal cortex</article-title>. <source>Neuron</source> <volume>40</volume>, <fpage>177</fpage>&#x02013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.1016/S0896-6273(03)00597-X</pub-id><pub-id pub-id-type="pmid">14527442</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baldassano</surname> <given-names>C.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Zadbood</surname> <given-names>A.</given-names></name> <name><surname>Pillow</surname> <given-names>J. W.</given-names></name> <name><surname>Hasson</surname> <given-names>U.</given-names></name> <name><surname>Norman</surname> <given-names>K. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Discovering event structure in continuous narrative perception and memory</article-title>. <source>Neuron</source> <volume>95</volume>, <fpage>709</fpage>&#x02013;<lpage>721</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2017.06.041</pub-id><pub-id pub-id-type="pmid">28772125</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bassett</surname> <given-names>D. S.</given-names></name> <name><surname>Greenfield</surname> <given-names>D. L.</given-names></name> <name><surname>Meyer-Lindenberg</surname> <given-names>A.</given-names></name> <name><surname>Weinberger</surname> <given-names>D. R.</given-names></name> <name><surname>Moore</surname> <given-names>S. W.</given-names></name> <name><surname>Bullmore</surname> <given-names>E. T.</given-names></name></person-group> (<year>2010</year>). <article-title>Efficient physical embedding of topologically complex information processing networks in brains and computer circuits</article-title>. <source>PLoS Comput. Biol</source>. <volume>6</volume>:<fpage>e1000748</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1000748</pub-id><pub-id pub-id-type="pmid">20421990</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bazhenov</surname> <given-names>M.</given-names></name> <name><surname>Stopfer</surname> <given-names>M.</given-names></name> <name><surname>Rabinovich</surname> <given-names>M.</given-names></name> <name><surname>Abarbanel</surname> <given-names>H. D.</given-names></name> <name><surname>Sejnowski</surname> <given-names>T. J.</given-names></name> <name><surname>Laurent</surname> <given-names>G.</given-names></name></person-group> (<year>2001</year>). <article-title>Model of cellular and network mechanisms for odor-evoked temporal patterning in the locust antennal lobe</article-title>. <source>Neuron</source> <volume>30</volume>, <fpage>569</fpage>&#x02013;<lpage>581</lpage>. <pub-id pub-id-type="doi">10.1016/S0896-6273(01)00286-0</pub-id><pub-id pub-id-type="pmid">11395015</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beck</surname> <given-names>J. M.</given-names></name> <name><surname>Ma</surname> <given-names>W. J.</given-names></name> <name><surname>Kiani</surname> <given-names>R.</given-names></name> <name><surname>Hanks</surname> <given-names>T.</given-names></name> <name><surname>Churchland</surname> <given-names>A. K.</given-names></name> <name><surname>Roitman</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Probabilistic population codes for bayesian decision making</article-title>. <source>Neuron</source> <volume>60</volume>, <fpage>1142</fpage>&#x02013;<lpage>1152</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2008.09.021</pub-id><pub-id pub-id-type="pmid">19109917</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berry</surname> <given-names>H.</given-names></name> <name><surname>Quoy</surname> <given-names>M.</given-names></name></person-group> (<year>2006</year>). <article-title>Structure and dynamics of random recurrent neural networks</article-title>. <source>Adapt. Behav</source>. <volume>14</volume>, <fpage>129</fpage>&#x02013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.1177/105971230601400204</pub-id><pub-id pub-id-type="pmid">18624656</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhalla</surname> <given-names>U. S.</given-names></name></person-group> (<year>2019</year>). <article-title>Dendrites, deep learning, and sequences in the hippocampus</article-title>. <source>Hippocampus</source> <volume>29</volume>, <fpage>239</fpage>&#x02013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.1002/hipo.22806</pub-id><pub-id pub-id-type="pmid">29024221</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bick</surname> <given-names>C.</given-names></name> <name><surname>Rabinovich</surname> <given-names>M. I.</given-names></name></person-group> (<year>2010</year>). <article-title>On the occurrence of stable heteroclinic channels in lotka-volterra models</article-title>. <source>Dyn. Syst</source>. <volume>25</volume>, <fpage>97</fpage>&#x02013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1080/14689360903322227</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Birkholz</surname> <given-names>P.</given-names></name> <name><surname>Kroger</surname> <given-names>B. J.</given-names></name> <name><surname>Neuschaefer-Rube</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>Model-based reproduction of articulatory trajectories for consonant-vowel sequences</article-title>. <source>IEEE Trans. Audio Speech Lang. Process</source>. <volume>19</volume>, <fpage>1422</fpage>&#x02013;<lpage>1433</lpage>. <pub-id pub-id-type="doi">10.1109/TASL.2010.2091632</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bitzer</surname> <given-names>S.</given-names></name> <name><surname>Kiebel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Recognizing recurrent neural networks (RRNN): Bayesian inference for recurrent neural networks</article-title>. <source>Biol. Cybernet</source>. <volume>106</volume>, <fpage>201</fpage>&#x02013;<lpage>217</lpage>. <pub-id pub-id-type="doi">10.1007/s00422-012-0490-x</pub-id><pub-id pub-id-type="pmid">22581026</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boemio</surname> <given-names>A.</given-names></name> <name><surname>Fromm</surname> <given-names>S.</given-names></name> <name><surname>Braun</surname> <given-names>A.</given-names></name> <name><surname>Poeppel</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>Hierarchical and asymmetric temporal sensitivity in human auditory cortices</article-title>. <source>Nat. Neurosci</source>. <volume>8</volume>, <fpage>389</fpage>&#x02013;<lpage>395</lpage>. <pub-id pub-id-type="doi">10.1038/nn1409</pub-id><pub-id pub-id-type="pmid">15723061</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Botvinick</surname> <given-names>M. M.</given-names></name></person-group> (<year>2007</year>). <article-title>Multilevel structure in behaviour and in the brain: a model of Fuster&#x00027;s hierarchy</article-title>. <source>Philos. Trans. R. Soc. B Biol. Sci</source>. <volume>362</volume>, <fpage>1615</fpage>&#x02013;<lpage>1626</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2007.2056</pub-id><pub-id pub-id-type="pmid">17428777</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Botvinick</surname> <given-names>M. M.</given-names></name></person-group> (<year>2008</year>). <article-title>Hierarchical models of behavior and prefrontal function</article-title>. <source>Trends Cogn. Sci</source>. <volume>12</volume>, <fpage>201</fpage>&#x02013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2008.02.009</pub-id><pub-id pub-id-type="pmid">18420448</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bouchard</surname> <given-names>K. E.</given-names></name> <name><surname>Brainard</surname> <given-names>M. S.</given-names></name></person-group> (<year>2016</year>). <article-title>Auditory-induced neural dynamics in sensory-motor circuitry predict learned temporal and sequential statistics of birdsong</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>113</volume>, <fpage>9641</fpage>&#x02013;<lpage>9646</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1606725113</pub-id><pub-id pub-id-type="pmid">27506786</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bousfield</surname> <given-names>W. A.</given-names></name></person-group> (<year>1953</year>). <article-title>The occurrence of clustering in the recall of randomly arranged associates</article-title>. <source>J. Gen. Psychol</source>. <volume>49</volume>, <fpage>229</fpage>&#x02013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1080/00221309.1953.9710088</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Branco</surname> <given-names>T.</given-names></name> <name><surname>Clark</surname> <given-names>B. A.</given-names></name> <name><surname>H&#x000E4;usser</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Dendritic discrimination of temporal input sequences in cortical neurons</article-title>. <source>Science</source> <volume>329</volume>, <fpage>1671</fpage>&#x02013;<lpage>1675</lpage>. <pub-id pub-id-type="doi">10.1126/science.1189664</pub-id><pub-id pub-id-type="pmid">20705816</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breakspear</surname> <given-names>M.</given-names></name></person-group> (<year>2001</year>). <article-title>Perception of odors by a nonlinear model of the olfactory bulb</article-title>. <source>Int. J. Neural Syst</source>. <volume>11</volume>, <fpage>101</fpage>&#x02013;<lpage>124</lpage>. <pub-id pub-id-type="doi">10.1142/S0129065701000564</pub-id><pub-id pub-id-type="pmid">14632166</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buonomano</surname> <given-names>D. V.</given-names></name> <name><surname>Maass</surname> <given-names>W.</given-names></name></person-group> (<year>2009</year>). <article-title>State-dependent computations: spatiotemporal processing in cortical networks</article-title>. <source>Nat. Rev. Neurosci</source>. <volume>10</volume>:<fpage>113</fpage>. <pub-id pub-id-type="doi">10.1038/nrn2558</pub-id><pub-id pub-id-type="pmid">19145235</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buzs&#x000E1;ki</surname> <given-names>G.</given-names></name></person-group> (<year>2015</year>). <article-title>Hippocampal sharp wave-ripple: a cognitive biomarker for episodic memory and planning</article-title>. <source>Hippocampus</source> <volume>25</volume>, <fpage>1073</fpage>&#x02013;<lpage>1188</lpage>. <pub-id pub-id-type="doi">10.1002/hipo.22488</pub-id><pub-id pub-id-type="pmid">26135716</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ceni</surname> <given-names>A.</given-names></name> <name><surname>Ashwin</surname> <given-names>P.</given-names></name> <name><surname>Livi</surname> <given-names>L.</given-names></name></person-group> (<year>2019</year>). <article-title>Interpreting recurrent neural networks behaviour via excitable network attractors</article-title>. <source>Cogn. Comput</source>. <volume>12</volume>, <fpage>330</fpage>&#x02013;<lpage>356</lpage>. <pub-id pub-id-type="doi">10.1007/s12559-019-09634-2</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chan</surname> <given-names>W.</given-names></name> <name><surname>Jaitly</surname> <given-names>N.</given-names></name> <name><surname>Le</surname> <given-names>Q.</given-names></name> <name><surname>Vinyals</surname> <given-names>O.</given-names></name></person-group> (<year>2016</year>). <article-title>Listen, attend and spell: a neural network for large vocabulary conversational speech recognition,</article-title> in <source>2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</source> (<publisher-loc>Shanghai</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>4960</fpage>&#x02013;<lpage>4964</lpage>. <pub-id pub-id-type="doi">10.1109/ICASSP.2016.7472621</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chenkov</surname> <given-names>N.</given-names></name> <name><surname>Sprekeler</surname> <given-names>H.</given-names></name> <name><surname>Kempter</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Memory replay in balanced recurrent networks</article-title>. <source>PLoS Comput. Biol</source>. <volume>13</volume>:<fpage>e1005359</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005359</pub-id><pub-id pub-id-type="pmid">28135266</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>J.</given-names></name> <name><surname>Ahn</surname> <given-names>S.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>2016</year>). <article-title>Hierarchical multiscale recurrent neural networks</article-title>. <source>arXiv</source> arXiv:1609.01704.</citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>J.</given-names></name> <name><surname>Gulcehre</surname> <given-names>C.</given-names></name> <name><surname>Cho</surname> <given-names>K.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>2014</year>). <article-title>Empirical evaluation of gated recurrent neural networks on sequence modeling</article-title>. <source>arXiv</source> 1412.3555.</citation></ref>
<ref id="B40">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>J.</given-names></name> <name><surname>Gulcehre</surname> <given-names>C.</given-names></name> <name><surname>Cho</surname> <given-names>K.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>2015</year>). <article-title>Gated feedback recurrent neural networks,</article-title> in <source>International Conference on Machine Learning</source> (<publisher-loc>Lille</publisher-loc>), <fpage>2067</fpage>&#x02013;<lpage>2075</lpage>.</citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Whatever next? Predictive brains, situated agents, and the future of cognitive science</article-title>. <source>Behav. Brain Sci</source>. <volume>36</volume>, <fpage>181</fpage>&#x02013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X12000477</pub-id><pub-id pub-id-type="pmid">23663408</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cocchi</surname> <given-names>L.</given-names></name> <name><surname>Sale</surname> <given-names>M. V.</given-names></name> <name><surname>Gollo</surname> <given-names>L. L.</given-names></name> <name><surname>Bell</surname> <given-names>P. T.</given-names></name> <name><surname>Nguyen</surname> <given-names>V. T.</given-names></name> <name><surname>Zalesky</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>A hierarchy of timescales explains distinct effects of local inhibition of primary visual cortex and frontal eye fields</article-title>. <source>Elife</source> <volume>5</volume>:<fpage>e15252</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.15252</pub-id><pub-id pub-id-type="pmid">27596931</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Colombo</surname> <given-names>M.</given-names></name> <name><surname>Seri&#x000E8;s</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>Bayes in the brain&#x02014;on bayesian modelling in neuroscience</article-title>. <source>Br. J. Philos. Sci</source>. <volume>63</volume>, <fpage>697</fpage>&#x02013;<lpage>723</lpage>. <pub-id pub-id-type="doi">10.1093/bjps/axr043</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Constantinescu</surname> <given-names>A. O.</given-names></name> <name><surname>O&#x00027;Reilly</surname> <given-names>J. X.</given-names></name> <name><surname>Behrens</surname> <given-names>T. E.</given-names></name></person-group> (<year>2016</year>). <article-title>Organizing conceptual knowledge in humans with a gridlike code</article-title>. <source>Science</source> <volume>352</volume>, <fpage>1464</fpage>&#x02013;<lpage>1468</lpage>. <pub-id pub-id-type="doi">10.1126/science.aaf0941</pub-id><pub-id pub-id-type="pmid">27313047</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crowe</surname> <given-names>D. A.</given-names></name> <name><surname>Averbeck</surname> <given-names>B. B.</given-names></name> <name><surname>Chafee</surname> <given-names>M. V.</given-names></name></person-group> (<year>2010</year>). <article-title>Rapid sequences of population activity patterns dynamically encode task-critical spatial information in parietal cortex</article-title>. <source>J. Neurosci</source>. <volume>30</volume>, <fpage>11640</fpage>&#x02013;<lpage>11653</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.0954-10.2010</pub-id><pub-id pub-id-type="pmid">20810885</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daunizeau</surname> <given-names>J.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name> <name><surname>Kiebel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Variational bayesian identification and prediction of stochastic nonlinear dynamic causal models</article-title>. <source>Phys. D</source> <volume>238</volume>, <fpage>2089</fpage>&#x02013;<lpage>2118</lpage>. <pub-id pub-id-type="doi">10.1016/j.physd.2009.08.002</pub-id><pub-id pub-id-type="pmid">19862351</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dayan</surname> <given-names>P.</given-names></name> <name><surname>Hinton</surname> <given-names>G. E.</given-names></name> <name><surname>Neal</surname> <given-names>R. M.</given-names></name> <name><surname>Zemel</surname> <given-names>R. S.</given-names></name></person-group> (<year>1995</year>). <article-title>The Helmholtz machine</article-title>. <source>Neural Comput</source>. <volume>7</volume>, <fpage>889</fpage>&#x02013;<lpage>904</lpage>. <pub-id pub-id-type="doi">10.1162/neco.1995.7.5.889</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deneve</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Bayesian spiking neurons I: inference</article-title>. <source>Neural Comput</source>. <volume>20</volume>, <fpage>91</fpage>&#x02013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1162/neco.2008.20.1.91</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deng</surname> <given-names>L.</given-names></name> <name><surname>Yu</surname> <given-names>D.</given-names></name> <name><surname>Acero</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Structured speech modeling</article-title>. <source>IEEE Trans. Audio Speech Lang. Process</source>. <volume>14</volume>, <fpage>1492</fpage>&#x02013;<lpage>1504</lpage>. <pub-id pub-id-type="doi">10.1109/TASL.2006.878265</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dezfouli</surname> <given-names>A.</given-names></name> <name><surname>Lingawi</surname> <given-names>N. W.</given-names></name> <name><surname>Balleine</surname> <given-names>B. W.</given-names></name></person-group> (<year>2014</year>). <article-title>Habits as action sequences: hierarchical action control and changes in outcome value</article-title>. <source>Philos. Trans. R. Soc. B Biol. Sci</source>. <volume>369</volume>:<fpage>20130482</fpage>. <pub-id pub-id-type="doi">10.1098/rstb.2013.0482</pub-id><pub-id pub-id-type="pmid">25267824</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diesmann</surname> <given-names>M.</given-names></name> <name><surname>Gewaltig</surname> <given-names>M. O.</given-names></name> <name><surname>Aertsen</surname> <given-names>A.</given-names></name></person-group> (<year>1999</year>). <article-title>Stable propagation of synchronous spiking in cortical neural networks</article-title>. <source>Nature</source> <volume>402</volume>:<fpage>529</fpage>. <pub-id pub-id-type="doi">10.1038/990101</pub-id><pub-id pub-id-type="pmid">10591212</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>N.</given-names></name> <name><surname>Melloni</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Tian</surname> <given-names>X.</given-names></name> <name><surname>Poeppel</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>Cortical tracking of hierarchical linguistic structures in connected speech</article-title>. <source>Nat. Neurosci</source>. <volume>19</volume>, <fpage>158</fpage>&#x02013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1038/nn.4186</pub-id><pub-id pub-id-type="pmid">26642090</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Doya</surname> <given-names>K.</given-names></name> <name><surname>Ishii</surname> <given-names>S.</given-names></name> <name><surname>Pouget</surname> <given-names>A.</given-names></name> <name><surname>Rao</surname> <given-names>R. P.</given-names></name></person-group> (<year>2007</year>). <source>Bayesian Brain: Probabilistic Approaches to Neural Coding</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.7551/mitpress/9780262042383.001.0001</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dragoi</surname> <given-names>G.</given-names></name> <name><surname>Tonegawa</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>Preplay of future place cell sequences by hippocampal cellular assemblies</article-title>. <source>Nature</source> <volume>469</volume>:<fpage>397</fpage>. <pub-id pub-id-type="doi">10.1038/nature09633</pub-id><pub-id pub-id-type="pmid">21179088</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Duong</surname> <given-names>T. V.</given-names></name> <name><surname>Bui</surname> <given-names>H. H.</given-names></name> <name><surname>Phung</surname> <given-names>D. Q.</given-names></name> <name><surname>Venkatesh</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <article-title>Activity recognition and abnormality detection with the switching hidden semi-Markov model,</article-title> in <source>2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR&#x00027;05)</source>, Vol. <volume>1</volume> (<publisher-loc>San Diego, CA</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>838</fpage>&#x02013;<lpage>845</lpage>. <pub-id pub-id-type="doi">10.1109/CVPR.2005.61</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eguiluz</surname> <given-names>V. M.</given-names></name> <name><surname>Chialvo</surname> <given-names>D. R.</given-names></name> <name><surname>Cecchi</surname> <given-names>G. A.</given-names></name> <name><surname>Baliki</surname> <given-names>M.</given-names></name> <name><surname>Apkarian</surname> <given-names>A. V.</given-names></name></person-group> (<year>2005</year>). <article-title>Scale-free brain functional networks</article-title>. <source>Phys. Rev. Lett</source>. <volume>94</volume>:<fpage>018102</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevLett.94.018102</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>El Hihi</surname> <given-names>S.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>1996</year>). <article-title>Hierarchical recurrent neural networks for long-term dependencies,</article-title> in <source>Advances in Neural Information Processing Systems</source>, <fpage>493</fpage>&#x02013;<lpage>499</lpage>.</citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eldar</surname> <given-names>E.</given-names></name> <name><surname>Morris</surname> <given-names>G.</given-names></name> <name><surname>Niv</surname> <given-names>Y.</given-names></name></person-group> (<year>2011</year>). <article-title>The effects of motivation on response rate: a hidden semi-Markov model analysis of behavioral dynamics</article-title>. <source>J. Neurosci. Methods</source> <volume>201</volume>, <fpage>251</fpage>&#x02013;<lpage>261</lpage>. <pub-id pub-id-type="doi">10.1016/j.jneumeth.2011.06.028</pub-id><pub-id pub-id-type="pmid">21782849</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ernst</surname> <given-names>M. O.</given-names></name> <name><surname>Banks</surname> <given-names>M. S.</given-names></name></person-group> (<year>2002</year>). <article-title>Humans integrate visual and haptic information in a statistically optimal fashion</article-title>. <source>Nature</source> <volume>415</volume>:<fpage>429</fpage>. <pub-id pub-id-type="doi">10.1038/415429a</pub-id><pub-id pub-id-type="pmid">11807554</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Feinberg</surname> <given-names>E. A.</given-names></name> <name><surname>Shwartz</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <source>Handbook of Markov Decision Processes: Methods and Applications</source>, Vol. <volume>40</volume>. <publisher-loc>Boston, MA</publisher-loc>: <publisher-name>Springer Science &#x00026; Business Media</publisher-name>.</citation></ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feldman</surname> <given-names>J.</given-names></name></person-group> (<year>2001</year>). <article-title>Bayesian contour integration</article-title>. <source>Percept. Psychophys</source>. <volume>63</volume>, <fpage>1171</fpage>&#x02013;<lpage>1182</lpage>. <pub-id pub-id-type="doi">10.3758/BF03194532</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Felleman</surname> <given-names>D. J.</given-names></name> <name><surname>Van Essen</surname> <given-names>D.</given-names></name></person-group> (<year>1991</year>). <article-title>Distributed hierarchical processing in the primate cerebral cortex</article-title>. <source>Cereb. Cortex</source> <volume>1</volume>, <fpage>1</fpage>&#x02013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/1.1.1</pub-id><pub-id pub-id-type="pmid">1822724</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>FitzGerald</surname> <given-names>T. H.</given-names></name> <name><surname>H&#x000E4;mmerer</surname> <given-names>D.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name> <name><surname>Li</surname> <given-names>S. C.</given-names></name> <name><surname>Dolan</surname> <given-names>R. J.</given-names></name></person-group> (<year>2017</year>). <article-title>Sequential inference as a mode of cognition and its correlates in fronto-parietal and hippocampal brain regions</article-title>. <source>PLoS Comput. Biol</source>. <volume>13</volume>:<fpage>e1005418</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005418</pub-id><pub-id pub-id-type="pmid">29236695</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fletcher</surname> <given-names>P. C.</given-names></name> <name><surname>Frith</surname> <given-names>C. D.</given-names></name></person-group> (<year>2009</year>). <article-title>Perceiving is believing: a bayesian approach to explaining the positive symptoms of schizophrenia</article-title>. <source>Nat. Rev. Neurosci</source>. <volume>10</volume>, <fpage>48</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1038/nrn2536</pub-id><pub-id pub-id-type="pmid">19050712</pub-id></citation></ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fonollosa</surname> <given-names>J.</given-names></name> <name><surname>Neftci</surname> <given-names>E.</given-names></name> <name><surname>Rabinovich</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Learning of chunking sequences in cognition and behavior</article-title>. <source>PLoS Comput. Biol</source>. <volume>11</volume>:<fpage>e1004592</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004592</pub-id><pub-id pub-id-type="pmid">26584306</pub-id></citation></ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2005</year>). <article-title>A theory of cortical responses</article-title>. <source>Philos. Trans. R. Soc. B Biol. Sci</source>. <volume>360</volume>, <fpage>815</fpage>&#x02013;<lpage>836</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2005.1622</pub-id></citation></ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>The free-energy principle: a unified brain theory?</article-title> <source>Nat. Rev. Neurosci</source>. <volume>11</volume>, <fpage>127</fpage>&#x02013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1038/nrn2787</pub-id><pub-id pub-id-type="pmid">20068583</pub-id></citation></ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name> <name><surname>Buzs&#x000E1;ki</surname> <given-names>G.</given-names></name></person-group> (<year>2016</year>). <article-title>The functional anatomy of time: what and when in the brain</article-title>. <source>Trends Cogn. Sci</source>. <volume>20</volume>, <fpage>500</fpage>&#x02013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2016.05.001</pub-id><pub-id pub-id-type="pmid">27261057</pub-id></citation></ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name> <name><surname>Kiebel</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>Predictive coding under the free-energy principle</article-title>. <source>Philos. Trans. R. Soc. B Biol. Sci</source>. <volume>364</volume>, <fpage>1211</fpage>&#x02013;<lpage>1221</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2008.0300</pub-id><pub-id pub-id-type="pmid">19528002</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name> <name><surname>Kilner</surname> <given-names>J.</given-names></name> <name><surname>Harrison</surname> <given-names>L.</given-names></name></person-group> (<year>2006</year>). <article-title>A free energy principle for the brain</article-title>. <source>J. Physiol</source>. <volume>100</volume>, <fpage>70</fpage>&#x02013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1016/j.jphysparis.2006.10.001</pub-id></citation></ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K.</given-names></name> <name><surname>Mattout</surname> <given-names>J.</given-names></name> <name><surname>Kilner</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <article-title>Action understanding and active inference</article-title>. <source>Biol. Cybernet</source>. <volume>104</volume>, <fpage>137</fpage>&#x02013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1007/s00422-011-0424-z</pub-id></citation></ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friston</surname> <given-names>K. J.</given-names></name> <name><surname>Stephan</surname> <given-names>K. E.</given-names></name> <name><surname>Montague</surname> <given-names>R.</given-names></name> <name><surname>Dolan</surname> <given-names>R. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Computational psychiatry: the brain as a phantastic organ</article-title>. <source>Lancet Psychiatry</source> <volume>1</volume>, <fpage>148</fpage>&#x02013;<lpage>158</lpage>. <pub-id pub-id-type="doi">10.1016/S2215-0366(14)70275-5</pub-id><pub-id pub-id-type="pmid">26360579</pub-id></citation></ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fuster</surname> <given-names>J. M.</given-names></name></person-group> (<year>2004</year>). <article-title>Upper processing stages of the perception-action cycle</article-title>. <source>Trends Cogn. Sci</source>. <volume>8</volume>, <fpage>143</fpage>&#x02013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2004.02.004</pub-id><pub-id pub-id-type="pmid">15551481</pub-id></citation></ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname> <given-names>R.</given-names></name> <name><surname>van den Brink</surname> <given-names>R. L.</given-names></name> <name><surname>Pfeffer</surname> <given-names>T.</given-names></name> <name><surname>Voytek</surname> <given-names>B.</given-names></name></person-group> (<year>2020</year>). <article-title>Neuronal timescales are functionally dynamic and shaped by cortical microarchitecture</article-title>. <source>Elife</source> <volume>9</volume>:<fpage>e61277</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.61277</pub-id><pub-id pub-id-type="pmid">33226336</pub-id></citation></ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gauthier</surname> <given-names>B.</given-names></name> <name><surname>Eger</surname> <given-names>E.</given-names></name> <name><surname>Hesselmann</surname> <given-names>G.</given-names></name> <name><surname>Giraud</surname> <given-names>A. L.</given-names></name> <name><surname>Kleinschmidt</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Temporal tuning properties along the human ventral visual stream</article-title>. <source>J. Neurosci</source>. <volume>32</volume>, <fpage>14433</fpage>&#x02013;<lpage>14441</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.2467-12.2012</pub-id><pub-id pub-id-type="pmid">23055513</pub-id></citation></ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name> <name><surname>Simpson</surname> <given-names>D.</given-names></name> <name><surname>Betancourt</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>The prior can often only be understood in the context of the likelihood</article-title>. <source>Entropy</source> <volume>19</volume>:<fpage>555</fpage>. <pub-id pub-id-type="doi">10.3390/e19100555</pub-id><pub-id pub-id-type="pmid">26634229</pub-id></citation></ref>
<ref id="B77">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gers</surname> <given-names>F. A.</given-names></name> <name><surname>Schmidhuber</surname> <given-names>J.</given-names></name> <name><surname>Cummins</surname> <given-names>F.</given-names></name></person-group> (<year>1999</year>). <source>Learning to Forget: Continual Prediction With LSTM</source>. <publisher-loc>Stevenage</publisher-loc>: <publisher-name>Institution of Engineering and Technology</publisher-name>. <pub-id pub-id-type="doi">10.1049/cp:19991218</pub-id><pub-id pub-id-type="pmid">11032042</pub-id></citation></ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giese</surname> <given-names>M. A.</given-names></name> <name><surname>Poggio</surname> <given-names>T.</given-names></name></person-group> (<year>2003</year>). <article-title>Cognitive neuroscience: neural mechanisms for the recognition of biological movements</article-title>. <source>Nat. Rev. Neurosci</source>. <volume>4</volume>:<fpage>179</fpage>. <pub-id pub-id-type="doi">10.1038/nrn1057</pub-id></citation></ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giraud</surname> <given-names>A. L.</given-names></name> <name><surname>Poeppel</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Cortical oscillations and speech processing: emerging computational principles and operations</article-title>. <source>Nat. Neurosci</source>. <volume>15</volume>:<fpage>511</fpage>. <pub-id pub-id-type="doi">10.1038/nn.3063</pub-id><pub-id pub-id-type="pmid">22426255</pub-id></citation></ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gros</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>Neural networks with transient state dynamics</article-title>. <source>New J. Phys</source>. <volume>9</volume>:<fpage>109</fpage>. <pub-id pub-id-type="doi">10.1088/1367-2630/9/4/109</pub-id></citation></ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gros</surname> <given-names>C.</given-names></name></person-group> (<year>2009</year>). <article-title>Cognitive computation with autonomously active neural networks: an emerging field</article-title>. <source>Cogn. Comput</source>. <volume>1</volume>, <fpage>77</fpage>&#x02013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1007/s12559-008-9000-9</pub-id></citation></ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hahnloser</surname> <given-names>R. H.</given-names></name> <name><surname>Kozhevnikov</surname> <given-names>A. A.</given-names></name> <name><surname>Fee</surname> <given-names>M. S.</given-names></name></person-group> (<year>2002</year>). <article-title>An ultra-sparse code underliesthe generation of neural sequences in a songbird</article-title>. <source>Nature</source> <volume>419</volume>:<fpage>65</fpage>. <pub-id pub-id-type="doi">10.1038/nature00974</pub-id><pub-id pub-id-type="pmid">12214232</pub-id></citation></ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hansel</surname> <given-names>D.</given-names></name> <name><surname>Mato</surname> <given-names>G.</given-names></name> <name><surname>Meunier</surname> <given-names>C.</given-names></name></person-group> (<year>1993a</year>). <article-title>Clustering and slow switching in globally coupled phase oscillators</article-title>. <source>Phys. Rev. E</source> <volume>48</volume>:<fpage>3470</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevE.48.3470</pub-id><pub-id pub-id-type="pmid">9961005</pub-id></citation></ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hansel</surname> <given-names>D.</given-names></name> <name><surname>Mato</surname> <given-names>G.</given-names></name> <name><surname>Meunier</surname> <given-names>C.</given-names></name></person-group> (<year>1993b</year>). <article-title>Phase dynamics for weakly coupled hodgkin-huxley neurons</article-title>. <source>Europhys. Lett</source>. <volume>23</volume>:<fpage>367</fpage>. <pub-id pub-id-type="doi">10.1209/0295-5075/23/5/011</pub-id></citation></ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harvey</surname> <given-names>C. D.</given-names></name> <name><surname>Coen</surname> <given-names>P.</given-names></name> <name><surname>Tank</surname> <given-names>D. W.</given-names></name></person-group> (<year>2012</year>). <article-title>Choice-specific sequences in parietal cortex during a virtual-navigation decision task</article-title>. <source>Nature</source> <volume>484</volume>:<fpage>62</fpage>. <pub-id pub-id-type="doi">10.1038/nature10918</pub-id><pub-id pub-id-type="pmid">22419153</pub-id></citation></ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hasson</surname> <given-names>U.</given-names></name> <name><surname>Yang</surname> <given-names>E.</given-names></name> <name><surname>Vallines</surname> <given-names>I.</given-names></name> <name><surname>Heeger</surname> <given-names>D. J.</given-names></name> <name><surname>Rubin</surname> <given-names>N.</given-names></name></person-group> (<year>2008</year>). <article-title>A hierarchy of temporal receptive windows in human cortex</article-title>. <source>J. Neurosci</source>. <volume>28</volume>, <fpage>2539</fpage>&#x02013;<lpage>2550</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.5487-07.2008</pub-id><pub-id pub-id-type="pmid">18322098</pub-id></citation></ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hawkins</surname> <given-names>J.</given-names></name> <name><surname>George</surname> <given-names>D.</given-names></name> <name><surname>Niemasik</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Sequence memory for prediction, inference and behaviour</article-title>. <source>Philos. Trans. R. Soc. B Biol. Sci</source>. <volume>364</volume>, <fpage>1203</fpage>&#x02013;<lpage>1209</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2008.0322</pub-id><pub-id pub-id-type="pmid">19528001</pub-id></citation></ref>
<ref id="B88">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Helmholtz</surname> <given-names>H. V.</given-names></name></person-group> (<year>1867</year>). <source>Handbuch der Physiologischen Optik</source>. <publisher-loc>Leipzig</publisher-loc>: <publisher-name>Voss</publisher-name>.</citation></ref>
<ref id="B89">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hinton</surname> <given-names>G. E.</given-names></name> <name><surname>Sejnowski</surname> <given-names>T. J.</given-names></name></person-group> (<year>1983</year>). <article-title>Optimal perceptual inference,</article-title> in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source>, Vol. <volume>448</volume> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Citeseer</publisher-name>).</citation></ref>
<ref id="B90">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hopfield</surname> <given-names>J. J.</given-names></name></person-group> (<year>1982</year>). <article-title>Neural networks and physical systems with emergent collective computational abilities</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>79</volume>, <fpage>2554</fpage>&#x02013;<lpage>2558</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.79.8.2554</pub-id><pub-id pub-id-type="pmid">6953413</pub-id></citation></ref>
<ref id="B91">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hubel</surname> <given-names>D. H.</given-names></name> <name><surname>Wiesel</surname> <given-names>T. N.</given-names></name></person-group> (<year>1959</year>). <article-title>Receptive fields of single neurones in the cat&#x00027;s striate cortex</article-title>. <source>J. Physiol</source>. <volume>148</volume>, <fpage>574</fpage>&#x02013;<lpage>591</lpage>. <pub-id pub-id-type="doi">10.1113/jphysiol.1959.sp006308</pub-id><pub-id pub-id-type="pmid">19525558</pub-id></citation></ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ikegaya</surname> <given-names>Y.</given-names></name> <name><surname>Aaron</surname> <given-names>G.</given-names></name> <name><surname>Cossart</surname> <given-names>R.</given-names></name> <name><surname>Aronov</surname> <given-names>D.</given-names></name> <name><surname>Lampl</surname> <given-names>I.</given-names></name> <name><surname>Ferster</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Synfire chains and cortical songs: temporal modules of cortical activity</article-title>. <source>Science</source> <volume>304</volume>, <fpage>559</fpage>&#x02013;<lpage>564</lpage>. <pub-id pub-id-type="doi">10.1126/science.1093173</pub-id><pub-id pub-id-type="pmid">15105494</pub-id></citation></ref>
<ref id="B93">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Izhikevich</surname> <given-names>E. M.</given-names></name></person-group> (<year>2007</year>). <source>Dynamical Systems in Neuroscience</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.7551/mitpress/2526.001.0001</pub-id></citation></ref>
<ref id="B94">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jaeger</surname> <given-names>H.</given-names></name></person-group> (<year>2001</year>). <source>The &#x0201C;Echo State&#x0201D; Approach to Analysing and Training Recurrent Neural Networks-With an Erratum Note</source>. <publisher-loc>Bonn</publisher-loc>: <publisher-name>German National Research Center for Information Technology GMD Technical Report 148</publisher-name>.</citation></ref>
<ref id="B95">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ji</surname> <given-names>D.</given-names></name> <name><surname>Wilson</surname> <given-names>M. A.</given-names></name></person-group> (<year>2007</year>). <article-title>Coordinated memory replay in the visual cortex and hippocampus during sleep</article-title>. <source>Nat. Neurosci</source>. <volume>10</volume>, <fpage>100</fpage>&#x02013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.1038/nn1825</pub-id><pub-id pub-id-type="pmid">17173043</pub-id></citation></ref>
<ref id="B96">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>L. M.</given-names></name> <name><surname>Fontanini</surname> <given-names>A.</given-names></name> <name><surname>Sadacca</surname> <given-names>B. F.</given-names></name> <name><surname>Miller</surname> <given-names>P.</given-names></name> <name><surname>Katz</surname> <given-names>D. B.</given-names></name></person-group> (<year>2007</year>). <article-title>Natural stimuli evoke dynamic sequences of states in sensory cortical ensembles</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>104</volume>, <fpage>18772</fpage>&#x02013;<lpage>18777</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0705546104</pub-id><pub-id pub-id-type="pmid">18000059</pub-id></citation></ref>
<ref id="B97">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jouffroy</surname> <given-names>G.</given-names></name></person-group> (<year>2007</year>). <article-title>Design of simple limit cycles with recurrent neural networks for oscillatory control,</article-title> in <source>Sixth International Conference on Machine Learning and Applications (ICMLA 2007)</source> (<publisher-loc>Cincinnati, OH</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>50</fpage>&#x02013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1109/ICMLA.2007.99</pub-id></citation></ref>
<ref id="B98">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaplan</surname> <given-names>H. S.</given-names></name> <name><surname>Thula</surname> <given-names>O. S.</given-names></name> <name><surname>Khoss</surname> <given-names>N.</given-names></name> <name><surname>Zimmer</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Nested neuronal dynamics orchestrate a behavioral hierarchy across timescales</article-title>. <source>Neuron</source> <volume>105</volume>, <fpage>562</fpage>&#x02013;<lpage>576</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2019.10.037</pub-id><pub-id pub-id-type="pmid">31786012</pub-id></citation></ref>
<ref id="B99">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kenet</surname> <given-names>T.</given-names></name> <name><surname>Bibitchkov</surname> <given-names>D.</given-names></name> <name><surname>Tsodyks</surname> <given-names>M.</given-names></name> <name><surname>Grinvald</surname> <given-names>A.</given-names></name> <name><surname>Arieli</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>Spontaneously emerging cortical representations of visual attributes</article-title>. <source>Nature</source> <volume>425</volume>:<fpage>954</fpage>. <pub-id pub-id-type="doi">10.1038/nature02078</pub-id><pub-id pub-id-type="pmid">14586468</pub-id></citation></ref>
<ref id="B100">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiebel</surname> <given-names>S. J.</given-names></name> <name><surname>Daunizeau</surname> <given-names>J.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name></person-group> (<year>2008</year>). <article-title>A hierarchy of time-scales and the brain</article-title>. <source>PLoS Comput. Biol</source>. <volume>4</volume>:<fpage>e1000209</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1000209</pub-id></citation></ref>
<ref id="B101">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiebel</surname> <given-names>S. J.</given-names></name> <name><surname>Von Kriegstein</surname> <given-names>K.</given-names></name> <name><surname>Daunizeau</surname> <given-names>J.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Recognizing sequences of sequences</article-title>. <source>PLoS Comput. Biol</source>. <volume>5</volume>:<fpage>e1000464</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1000464</pub-id></citation></ref>
<ref id="B102">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Knill</surname> <given-names>D. C.</given-names></name> <name><surname>Pouget</surname> <given-names>A.</given-names></name></person-group> (<year>2004</year>). <article-title>The Bayesian brain: the role of uncertainty in neural coding and computation</article-title>. <source>Trends Neurosci</source>. <volume>27</volume>, <fpage>712</fpage>&#x02013;<lpage>719</lpage>. <pub-id pub-id-type="doi">10.1016/j.tins.2004.10.007</pub-id><pub-id pub-id-type="pmid">15541511</pub-id></citation></ref>
<ref id="B103">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koechlin</surname> <given-names>E.</given-names></name> <name><surname>Ody</surname> <given-names>C.</given-names></name> <name><surname>Kouneiher</surname> <given-names>F.</given-names></name></person-group> (<year>2003</year>). <article-title>The architecture of cognitive control in the human prefrontal cortex</article-title>. <source>Science</source> <volume>302</volume>, <fpage>1181</fpage>&#x02013;<lpage>1185</lpage>. <pub-id pub-id-type="doi">10.1126/science.1088545</pub-id><pub-id pub-id-type="pmid">14615530</pub-id></citation></ref>
<ref id="B104">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x000F6;rding</surname> <given-names>K. P.</given-names></name> <name><surname>Wolpert</surname> <given-names>D. M.</given-names></name></person-group> (<year>2004</year>). <article-title>Bayesian integration in sensorimotor learning</article-title>. <source>Nature</source> <volume>427</volume>:<fpage>244</fpage>. <pub-id pub-id-type="doi">10.1038/nature02169</pub-id></citation></ref>
<ref id="B105">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kotz</surname> <given-names>S. A.</given-names></name> <name><surname>Meyer</surname> <given-names>M.</given-names></name> <name><surname>Alter</surname> <given-names>K.</given-names></name> <name><surname>Besson</surname> <given-names>M.</given-names></name> <name><surname>von Cramon</surname> <given-names>D. Y.</given-names></name> <name><surname>Friederici</surname> <given-names>A. D.</given-names></name></person-group> (<year>2003</year>). <article-title>On the lateralization of emotional prosody: an event-related functional MR investigation</article-title>. <source>Brain Lang</source>. <volume>86</volume>, <fpage>366</fpage>&#x02013;<lpage>376</lpage>. <pub-id pub-id-type="doi">10.1016/S0093-934X(02)00532-1</pub-id><pub-id pub-id-type="pmid">12972367</pub-id></citation></ref>
<ref id="B106">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koutnik</surname> <given-names>J.</given-names></name> <name><surname>Greff</surname> <given-names>K.</given-names></name> <name><surname>Gomez</surname> <given-names>F.</given-names></name> <name><surname>Schmidhuber</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>A clockwork RNN</article-title>. arXiv 1402.3511.</citation></ref>
<ref id="B107">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Krause</surname> <given-names>J.</given-names></name> <name><surname>Johnson</surname> <given-names>J.</given-names></name> <name><surname>Krishna</surname> <given-names>R.</given-names></name> <name><surname>Fei-Fei</surname> <given-names>L.</given-names></name></person-group> (<year>2017</year>). <article-title>A hierarchical approach for generating descriptive image paragraphs,</article-title> in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source> (<publisher-loc>Honolulu, HI</publisher-loc>), <fpage>317</fpage>&#x02013;<lpage>325</lpage>. <pub-id pub-id-type="doi">10.1109/CVPR.2017.356</pub-id></citation></ref>
<ref id="B108">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kurata</surname> <given-names>G.</given-names></name> <name><surname>Ramabhadran</surname> <given-names>B.</given-names></name> <name><surname>Saon</surname> <given-names>G.</given-names></name> <name><surname>Sethy</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Lan&#x0201D; guage modeling with highway LSTM,</article-title> in <source>2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)</source> (<publisher-loc>Okinawa</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>244</fpage>&#x02013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.1109/ASRU.2017.8268942</pub-id></citation></ref>
<ref id="B109">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kurikawa</surname> <given-names>T.</given-names></name> <name><surname>Kaneko</surname> <given-names>K.</given-names></name></person-group> (<year>2015</year>). <article-title>Memories as bifurcations: realization by collective dynamics of spiking neurons under stochastic inputs</article-title>. <source>Neural Netw</source>. <volume>62</volume>, <fpage>25</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2014.07.005</pub-id><pub-id pub-id-type="pmid">25124069</pub-id></citation></ref>
<ref id="B110">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kurth-Nelson</surname> <given-names>Z.</given-names></name> <name><surname>Economides</surname> <given-names>M.</given-names></name> <name><surname>Dolan</surname> <given-names>R. J.</given-names></name> <name><surname>Dayan</surname> <given-names>P.</given-names></name></person-group> (<year>2016</year>). <article-title>Fast sequences of non-spatial state representations in humans</article-title>. <source>Neuron</source> <volume>91</volume>, <fpage>194</fpage>&#x02013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2016.05.028</pub-id><pub-id pub-id-type="pmid">27321922</pub-id></citation></ref>
<ref id="B111">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laboy-Ju&#x000E1;rez</surname> <given-names>K. J.</given-names></name> <name><surname>Langberg</surname> <given-names>T.</given-names></name> <name><surname>Ahn</surname> <given-names>S.</given-names></name> <name><surname>Feldman</surname> <given-names>D. E.</given-names></name></person-group> (<year>2019</year>). <article-title>Elementary motion sequence detectors in whisker somatosensory cortex</article-title>. <source>Nat. Neurosci</source>. <volume>22</volume>, <fpage>1438</fpage>&#x02013;<lpage>1449</lpage>. <pub-id pub-id-type="doi">10.1038/s41593-019-0448-6</pub-id><pub-id pub-id-type="pmid">31332375</pub-id></citation></ref>
<ref id="B112">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lashley</surname> <given-names>K. S.</given-names></name></person-group> (<year>1951</year>). <article-title>The problem of serial order in behavior,</article-title> in <source>Cerebral Mechanisms in Behavior; The Hixon Symposium</source>, ed <person-group person-group-type="editor"><name><surname>Jeffress</surname> <given-names>L. A.</given-names></name></person-group> (<publisher-name>Wiley</publisher-name>), <fpage>112</fpage>&#x02013;<lpage>146</lpage>.<pub-id pub-id-type="pmid">26209088</pub-id></citation></ref>
<ref id="B113">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>LeCun</surname> <given-names>Y.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Hinton</surname> <given-names>G.</given-names></name></person-group> (<year>2015</year>). <article-title>Deep learning</article-title>. <source>Nature</source> <volume>521</volume>, <fpage>436</fpage>&#x02013;<lpage>444</lpage>. <pub-id pub-id-type="doi">10.1038/nature14539</pub-id></citation></ref>
<ref id="B114">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leptourgos</surname> <given-names>P.</given-names></name> <name><surname>Den&#x000E8;ve</surname> <given-names>S.</given-names></name> <name><surname>Jardri</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Can circular inference relate the neuropathological and behavioral aspects of schizophrenia?</article-title> <source>Curr. Opin. Neurobiol</source>. <volume>46</volume>, <fpage>154</fpage>&#x02013;<lpage>161</lpage>. <pub-id pub-id-type="doi">10.1016/j.conb.2017.08.012</pub-id><pub-id pub-id-type="pmid">28915387</pub-id></citation></ref>
<ref id="B115">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lerner</surname> <given-names>Y.</given-names></name> <name><surname>Honey</surname> <given-names>C. J.</given-names></name> <name><surname>Silbert</surname> <given-names>L. J.</given-names></name> <name><surname>Hasson</surname> <given-names>U.</given-names></name></person-group> (<year>2011</year>). <article-title>Topographic mapping of a hierarchy of temporal receptive windows using a narrated story</article-title>. <source>J. Neurosci</source>. <volume>31</volume>, <fpage>2906</fpage>&#x02013;<lpage>2915</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.3684-10.2011</pub-id><pub-id pub-id-type="pmid">21414912</pub-id></citation></ref>
<ref id="B116">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lipton</surname> <given-names>Z. C.</given-names></name> <name><surname>Berkowitz</surname> <given-names>J.</given-names></name> <name><surname>Elkan</surname> <given-names>C.</given-names></name></person-group> (<year>2015</year>). <article-title>A critical review of recurrent neural networks for sequence learning</article-title>. arXiv 1506.00019.</citation></ref>
<ref id="B117">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Litvak</surname> <given-names>V.</given-names></name> <name><surname>Sompolinsky</surname> <given-names>H.</given-names></name> <name><surname>Segev</surname> <given-names>I.</given-names></name> <name><surname>Abeles</surname> <given-names>M.</given-names></name></person-group> (<year>2003</year>). <article-title>On the transmission of rate code in long feedforward networks with excitatory-inhibitory balance</article-title>. <source>J. Neurosci</source>. <volume>23</volume>, <fpage>3006</fpage>&#x02013;<lpage>3015</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.23-07-03006.2003</pub-id><pub-id pub-id-type="pmid">12684488</pub-id></citation></ref>
<ref id="B118">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>He</surname> <given-names>L.</given-names></name> <name><surname>Bai</surname> <given-names>H.</given-names></name> <name><surname>Dai</surname> <given-names>B.</given-names></name> <name><surname>Bai</surname> <given-names>K.</given-names></name> <name><surname>Xu</surname> <given-names>Z.</given-names></name></person-group> (<year>2018</year>). <article-title>Structured inference for recurrent hidden semi-Markov model,</article-title> in <source>IJCAI</source> (<publisher-loc>Stockholm</publisher-loc>), <fpage>2447</fpage>&#x02013;<lpage>2453</lpage>. <pub-id pub-id-type="doi">10.24963/ijcai.2018/339</pub-id></citation></ref>
<ref id="B119">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>P.</given-names></name> <name><surname>Qiu</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Wu</surname> <given-names>S.</given-names></name> <name><surname>Huang</surname> <given-names>X.</given-names></name></person-group> (<year>2015</year>). <article-title>Multi-timescale long short-term memory neural network for modelling sentences and documents,</article-title> in <source>Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source> (<publisher-loc>Okinawa</publisher-loc>), <fpage>2326</fpage>&#x02013;<lpage>2335</lpage>. <pub-id pub-id-type="doi">10.18653/v1/D15-1280</pub-id></citation></ref>
<ref id="B120">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Long</surname> <given-names>M. A.</given-names></name> <name><surname>Jin</surname> <given-names>D. Z.</given-names></name> <name><surname>Fee</surname> <given-names>M. S.</given-names></name></person-group> (<year>2010</year>). <article-title>Support for a synaptic chain model of neuronal sequence generation</article-title>. <source>Nature</source> <volume>468</volume>:<fpage>394</fpage>. <pub-id pub-id-type="doi">10.1038/nature09514</pub-id><pub-id pub-id-type="pmid">20972420</pub-id></citation></ref>
<ref id="B121">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luko&#x00161;evi&#x0010D;ius</surname> <given-names>M.</given-names></name> <name><surname>Jaeger</surname> <given-names>H.</given-names></name> <name><surname>Schrauwen</surname> <given-names>B.</given-names></name></person-group> (<year>2012</year>). <article-title>Reservoir computing trends</article-title>. <source>K&#x000FC;nstl. Intell</source>. <volume>26</volume>, <fpage>365</fpage>&#x02013;<lpage>371</lpage>. <pub-id pub-id-type="doi">10.1007/s13218-012-0204-5</pub-id></citation></ref>
<ref id="B122">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maass</surname> <given-names>W.</given-names></name> <name><surname>Natschl&#x000E4;ger</surname> <given-names>T.</given-names></name> <name><surname>Markram</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Real-time computing without stable states: a new framework for neural computation based on perturbations</article-title>. <source>Neural Comput</source>. <volume>14</volume>, <fpage>2531</fpage>&#x02013;<lpage>2560</lpage>. <pub-id pub-id-type="doi">10.1162/089976602760407955</pub-id><pub-id pub-id-type="pmid">12433288</pub-id></citation></ref>
<ref id="B123">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>MacDonald</surname> <given-names>C. J.</given-names></name> <name><surname>Lepage</surname> <given-names>K. Q.</given-names></name> <name><surname>Eden</surname> <given-names>U. T.</given-names></name> <name><surname>Eichenbaum</surname> <given-names>H.</given-names></name></person-group> (<year>2011</year>). <article-title>Hippocampal &#x0201C;time cells&#x0201D; bridge the gap in memory for discontiguous events</article-title>. <source>Neuron</source> <volume>71</volume>, <fpage>737</fpage>&#x02013;<lpage>749</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2011.07.012</pub-id><pub-id pub-id-type="pmid">21867888</pub-id></citation></ref>
<ref id="B124">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Malhotra</surname> <given-names>P.</given-names></name> <name><surname>Vig</surname> <given-names>L.</given-names></name> <name><surname>Shroff</surname> <given-names>G.</given-names></name> <name><surname>Agarwal</surname> <given-names>P.</given-names></name></person-group> (<year>2015</year>). <article-title>Long short term memory networks for anomaly detection in time series,</article-title> in <source>Proceedings</source> (<publisher-loc>Louvain-la-Neuve</publisher-loc>: <publisher-name>Presses Universitaires de Louvain</publisher-name>), <fpage>89</fpage>.</citation></ref>
<ref id="B125">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martinez-Conde</surname> <given-names>S.</given-names></name></person-group> (<year>2006</year>). <article-title>Fixational eye movements in normal and pathological vision</article-title>. <source>Prog. Brain Res</source>. <volume>154</volume>, <fpage>151</fpage>&#x02013;<lpage>176</lpage>. <pub-id pub-id-type="doi">10.1016/S0079-6123(06)54008-7</pub-id><pub-id pub-id-type="pmid">17010709</pub-id></citation></ref>
<ref id="B126">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martinez-Conde</surname> <given-names>S.</given-names></name> <name><surname>Macknik</surname> <given-names>S. L.</given-names></name> <name><surname>Hubel</surname> <given-names>D. H.</given-names></name></person-group> (<year>2004</year>). <article-title>The role of fixational eye movements in visual perception</article-title>. <source>Nat. Rev. Neurosci</source>. <volume>5</volume>, <fpage>229</fpage>&#x02013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1038/nrn1348</pub-id><pub-id pub-id-type="pmid">14976522</pub-id></citation></ref>
<ref id="B127">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mattar</surname> <given-names>M. G.</given-names></name> <name><surname>Kahn</surname> <given-names>D. A.</given-names></name> <name><surname>Thompson-Schill</surname> <given-names>S. L.</given-names></name> <name><surname>Aguirre</surname> <given-names>G. K.</given-names></name></person-group> (<year>2016</year>). <article-title>Varying timescales of stimulus integration unite neural adaptation and prototype formation</article-title>. <source>Curr. Biol</source>. <volume>26</volume>, <fpage>1669</fpage>&#x02013;<lpage>1676</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2016.04.065</pub-id><pub-id pub-id-type="pmid">27321999</pub-id></citation></ref>
<ref id="B128">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mazor</surname> <given-names>O.</given-names></name> <name><surname>Laurent</surname> <given-names>G.</given-names></name></person-group> (<year>2005</year>). <article-title>Transient dynamics versus fixed points in odor representations by locust antennal lobe projection neurons</article-title>. <source>Neuron</source> <volume>48</volume>, <fpage>661</fpage>&#x02013;<lpage>673</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2005.09.032</pub-id><pub-id pub-id-type="pmid">16301181</pub-id></citation></ref>
<ref id="B129">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meunier</surname> <given-names>D.</given-names></name> <name><surname>Lambiotte</surname> <given-names>R.</given-names></name> <name><surname>Bullmore</surname> <given-names>E. T.</given-names></name></person-group> (<year>2010</year>). <article-title>Modular and hierarchically modular organization of brain networks</article-title>. <source>Front. Neurosci</source>. <volume>4</volume>:<fpage>200</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2010.00200</pub-id><pub-id pub-id-type="pmid">21151783</pub-id></citation></ref>
<ref id="B130">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meunier</surname> <given-names>D.</given-names></name> <name><surname>Lambiotte</surname> <given-names>R.</given-names></name> <name><surname>Fornito</surname> <given-names>A.</given-names></name> <name><surname>Ersche</surname> <given-names>K.</given-names></name> <name><surname>Bullmore</surname> <given-names>E. T.</given-names></name></person-group> (<year>2009</year>). <article-title>Hierarchical modularity in human brain functional networks</article-title>. <source>Front. Neuroinform</source>. <volume>3</volume>:<fpage>37</fpage>. <pub-id pub-id-type="doi">10.3389/neuro.11.037.2009</pub-id><pub-id pub-id-type="pmid">19949480</pub-id></citation></ref>
<ref id="B131">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mi</surname> <given-names>Y.</given-names></name> <name><surname>Katkov</surname> <given-names>M.</given-names></name> <name><surname>Tsodyks</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Synaptic correlates of working memory capacity</article-title>. <source>Neuron</source> <volume>93</volume>, <fpage>323</fpage>&#x02013;<lpage>330</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2016.12.004</pub-id><pub-id pub-id-type="pmid">28041884</pub-id></citation></ref>
<ref id="B132">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miconi</surname> <given-names>T.</given-names></name></person-group> (<year>2017</year>). <article-title>Biologically plausible learning in recurrent neural networks reproduces neural dynamics observed during cognitive tasks</article-title>. <source>Elife</source> <volume>6</volume>:<fpage>e20899</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.20899</pub-id><pub-id pub-id-type="pmid">28230528</pub-id></citation></ref>
<ref id="B133">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>G. A.</given-names></name></person-group> (<year>1956</year>). <article-title>The magical number seven, plus or minus two: some limits on our capacity for processing information</article-title>. <source>Psychol. Rev</source>. <volume>63</volume>:<fpage>81</fpage>. <pub-id pub-id-type="doi">10.1037/h0043158</pub-id><pub-id pub-id-type="pmid">8022966</pub-id></citation></ref>
<ref id="B134">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mujika</surname> <given-names>A.</given-names></name> <name><surname>Meier</surname> <given-names>F.</given-names></name> <name><surname>Steger</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Fast-slow recurrent neural networks,</article-title> in <source>Advances in Neural Information Processing Systems</source>, <fpage>5915</fpage>&#x02013;<lpage>5924</lpage>.</citation></ref>
<ref id="B135">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murray</surname> <given-names>J. D.</given-names></name> <name><surname>Bernacchia</surname> <given-names>A.</given-names></name> <name><surname>Freedman</surname> <given-names>D. J.</given-names></name> <name><surname>Romo</surname> <given-names>R.</given-names></name> <name><surname>Wallis</surname> <given-names>J. D.</given-names></name> <name><surname>Cai</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>A hierarchy of intrinsic timescales across primate cortex</article-title>. <source>Nat. Neurosci</source>. <volume>17</volume>:<fpage>1661</fpage>. <pub-id pub-id-type="doi">10.1038/nn.3862</pub-id><pub-id pub-id-type="pmid">25383900</pub-id></citation></ref>
<ref id="B136">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neverova</surname> <given-names>N.</given-names></name> <name><surname>Wolf</surname> <given-names>C.</given-names></name> <name><surname>Lacey</surname> <given-names>G.</given-names></name> <name><surname>Fridman</surname> <given-names>L.</given-names></name> <name><surname>Chandra</surname> <given-names>D.</given-names></name> <name><surname>Barbello</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Learning human identity from motion patterns</article-title>. <source>IEEE Access</source> <volume>4</volume>, <fpage>1810</fpage>&#x02013;<lpage>1820</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2016.2557846</pub-id></citation></ref>
<ref id="B137">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neves</surname> <given-names>F. S.</given-names></name> <name><surname>Timme</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Computation by switching in complex networks of states</article-title>. <source>Phys. Rev. Lett</source>. <volume>109</volume>:<fpage>018701</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevLett.109.018701</pub-id><pub-id pub-id-type="pmid">23031136</pub-id></citation></ref>
<ref id="B138">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nolfi</surname> <given-names>S.</given-names></name></person-group> (<year>2002</year>). <article-title>Evolving robots able to self-localize in the environment: the importance of viewing cognition as the result of processes occurring at different time-scales</article-title>. <source>Connect. Sci</source>. <volume>14</volume>, <fpage>231</fpage>&#x02013;<lpage>244</lpage>. <pub-id pub-id-type="doi">10.1080/09540090208559329</pub-id></citation></ref>
<ref id="B139">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Neill</surname> <given-names>J.</given-names></name> <name><surname>Boccara</surname> <given-names>C.</given-names></name> <name><surname>Stella</surname> <given-names>F.</given-names></name> <name><surname>Sch&#x000F6;nenberger</surname> <given-names>P.</given-names></name> <name><surname>Csicsvari</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>Superficial layers of the medial entorhinal cortex replay independently of the hippocampus</article-title>. <source>Science</source> <volume>355</volume>, <fpage>184</fpage>&#x02013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.1126/science.aag2787</pub-id><pub-id pub-id-type="pmid">28082591</pub-id></citation></ref>
<ref id="B140">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pastalkova</surname> <given-names>E.</given-names></name> <name><surname>Itskov</surname> <given-names>V.</given-names></name> <name><surname>Amarasingham</surname> <given-names>A.</given-names></name> <name><surname>Buzs&#x000E1;ki</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <article-title>Internally generated cell assembly sequences in the rat hippocampus</article-title>. <source>Science</source> <volume>321</volume>, <fpage>1322</fpage>&#x02013;<lpage>1327</lpage>. <pub-id pub-id-type="doi">10.1126/science.1159775</pub-id><pub-id pub-id-type="pmid">18772431</pub-id></citation></ref>
<ref id="B141">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perdikis</surname> <given-names>D.</given-names></name> <name><surname>Huys</surname> <given-names>R.</given-names></name> <name><surname>Jirsa</surname> <given-names>V. K.</given-names></name></person-group> (<year>2011</year>). <article-title>Time scale hierarchies in the functional organization of complex behaviors</article-title>. <source>PLoS Comput. Biol</source>. <volume>7</volume>:<fpage>e1002198</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002198</pub-id><pub-id pub-id-type="pmid">21980278</pub-id></citation></ref>
<ref id="B142">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pezzulo</surname> <given-names>G.</given-names></name> <name><surname>van der Meer</surname> <given-names>M. A.</given-names></name> <name><surname>Lansink</surname> <given-names>C. S.</given-names></name> <name><surname>Pennartz</surname> <given-names>C. M.</given-names></name></person-group> (<year>2014</year>). <article-title>Internally generated sequences in learning and executing goal-directed behavior</article-title>. <source>Trends Cogn. Sci</source>. <volume>18</volume>, <fpage>647</fpage>&#x02013;<lpage>657</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2014.06.011</pub-id><pub-id pub-id-type="pmid">25156191</pub-id></citation></ref>
<ref id="B143">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pfeiffer</surname> <given-names>B. E.</given-names></name></person-group> (<year>2020</year>). <article-title>The content of hippocampal &#x0201C;replay</article-title>. <source>Hippocampus</source> <volume>30</volume>, <fpage>6</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1002/hipo.22824</pub-id></citation></ref>
<ref id="B144">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prut</surname> <given-names>Y.</given-names></name> <name><surname>Vaadia</surname> <given-names>E.</given-names></name> <name><surname>Bergman</surname> <given-names>H.</given-names></name> <name><surname>Haalman</surname> <given-names>I.</given-names></name> <name><surname>Slovin</surname> <given-names>H.</given-names></name> <name><surname>Abeles</surname> <given-names>M.</given-names></name></person-group> (<year>1998</year>). <article-title>Spatiotemporal structure of cortical activity: properties and behavioral relevance</article-title>. <source>J. Neurophysiol</source>. <volume>79</volume>, <fpage>2857</fpage>&#x02013;<lpage>2874</lpage>. <pub-id pub-id-type="doi">10.1152/jn.1998.79.6.2857</pub-id><pub-id pub-id-type="pmid">9636092</pub-id></citation></ref>
<ref id="B145">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabinovich</surname> <given-names>M.</given-names></name> <name><surname>Huerta</surname> <given-names>R.</given-names></name> <name><surname>Laurent</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <article-title>Transient dynamics for neural processing</article-title>. <source>Science</source> <volume>321</volume>, <fpage>48</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1126/science.1155564</pub-id></citation></ref>
<ref id="B146">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabinovich</surname> <given-names>M.</given-names></name> <name><surname>Huerta</surname> <given-names>R.</given-names></name> <name><surname>Volkovskii</surname> <given-names>A.</given-names></name> <name><surname>Abarbanel</surname> <given-names>H.</given-names></name> <name><surname>Stopfer</surname> <given-names>M.</given-names></name> <name><surname>Laurent</surname> <given-names>G.</given-names></name></person-group> (<year>2000</year>). <article-title>Dynamical coding of sensory information with competitive networks</article-title>. <source>J. Physiol</source>. <volume>94</volume>, <fpage>465</fpage>&#x02013;<lpage>471</lpage>. <pub-id pub-id-type="doi">10.1016/S0928-4257(00)01092-5</pub-id><pub-id pub-id-type="pmid">11165913</pub-id></citation></ref>
<ref id="B147">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabinovich</surname> <given-names>M.</given-names></name> <name><surname>Volkovskii</surname> <given-names>A.</given-names></name> <name><surname>Lecanda</surname> <given-names>P.</given-names></name> <name><surname>Huerta</surname> <given-names>R.</given-names></name> <name><surname>Abarbanel</surname> <given-names>H.</given-names></name> <name><surname>Laurent</surname> <given-names>G.</given-names></name></person-group> (<year>2001</year>). <article-title>Dynamical encoding by networks of competing neuron groups: winnerless competition</article-title>. <source>Phys. Rev. Lett</source>. <volume>87</volume>:<fpage>068102</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevLett.87.068102</pub-id><pub-id pub-id-type="pmid">11497865</pub-id></citation></ref>
<ref id="B148">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabinovich</surname> <given-names>M. I.</given-names></name> <name><surname>Huerta</surname> <given-names>R.</given-names></name> <name><surname>Varona</surname> <given-names>P.</given-names></name> <name><surname>Afraimovich</surname> <given-names>V. S.</given-names></name></person-group> (<year>2006</year>). <article-title>Generation and reshaping of sequences in neural systems</article-title>. <source>Biol. Cybernet</source>. <volume>95</volume>:<fpage>519</fpage>. <pub-id pub-id-type="doi">10.1007/s00422-006-0121-5</pub-id><pub-id pub-id-type="pmid">17136380</pub-id></citation></ref>
<ref id="B149">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rahnev</surname> <given-names>D.</given-names></name> <name><surname>Denison</surname> <given-names>R. N.</given-names></name></person-group> (<year>2018</year>). <article-title>Suboptimality in perceptual decision making</article-title>. <source>Behav. Brain Sci</source>. <volume>41</volume>, <fpage>1</fpage>&#x02013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X18000936</pub-id></citation></ref>
<ref id="B150">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rajan</surname> <given-names>K.</given-names></name> <name><surname>Harvey</surname> <given-names>C. D.</given-names></name> <name><surname>Tank</surname> <given-names>D. W.</given-names></name></person-group> (<year>2016</year>). <article-title>Recurrent network models of sequence generation and memory</article-title>. <source>Neuron</source> <volume>90</volume>, <fpage>128</fpage>&#x02013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2016.02.009</pub-id><pub-id pub-id-type="pmid">26971945</pub-id></citation></ref>
<ref id="B151">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rao</surname> <given-names>R. P.</given-names></name> <name><surname>Ballard</surname> <given-names>D. H.</given-names></name></person-group> (<year>1999</year>). <article-title>Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects</article-title>. <source>Nat. Neurosci</source>. <volume>2</volume>:<fpage>79</fpage>. <pub-id pub-id-type="doi">10.1038/4580</pub-id><pub-id pub-id-type="pmid">10195184</pub-id></citation></ref>
<ref id="B152">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ravasz</surname> <given-names>E.</given-names></name> <name><surname>Barab&#x000E1;si</surname> <given-names>A. L.</given-names></name></person-group> (<year>2003</year>). <article-title>Hierarchical organization in complex networks</article-title>. <source>Phys. Rev. E</source> <volume>67</volume>:<fpage>026112</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevE.67.026112</pub-id></citation></ref>
<ref id="B153">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reitich-Stolero</surname> <given-names>T.</given-names></name> <name><surname>Paz</surname> <given-names>R.</given-names></name></person-group> (<year>2019</year>). <article-title>Affective memory rehearsal with temporal sequences in amygdala neurons</article-title>. <source>Nat. Neurosci</source>. <volume>22</volume>, <fpage>2050</fpage>&#x02013;<lpage>2059</lpage>. <pub-id pub-id-type="doi">10.1038/s41593-019-0542-9</pub-id><pub-id pub-id-type="pmid">31768054</pub-id></citation></ref>
<ref id="B154">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rivera</surname> <given-names>D. C.</given-names></name> <name><surname>Bitzer</surname> <given-names>S.</given-names></name> <name><surname>Kiebel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2015</year>). <article-title>Modelling odor decoding in the antennal lobe by combining sequential firing rate models with Bayesian inference</article-title>. <source>PLoS Comput. Biol</source>. <volume>11</volume>:<fpage>e1004528</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004528</pub-id><pub-id pub-id-type="pmid">26451888</pub-id></citation></ref>
<ref id="B155">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosenbaum</surname> <given-names>D. A.</given-names></name> <name><surname>Cohen</surname> <given-names>R. G.</given-names></name> <name><surname>Jax</surname> <given-names>S. A.</given-names></name> <name><surname>Weiss</surname> <given-names>D. J.</given-names></name> <name><surname>Van Der Wel</surname> <given-names>R.</given-names></name></person-group> (<year>2007</year>). <article-title>The problem of serial order in behavior: Lashley&#x00027;s legacy</article-title>. <source>Hum. Mov. Sci</source>. <volume>26</volume>, <fpage>525</fpage>&#x02013;<lpage>554</lpage>. <pub-id pub-id-type="doi">10.1016/j.humov.2007.04.001</pub-id><pub-id pub-id-type="pmid">17698232</pub-id></citation></ref>
<ref id="B156">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schaal</surname> <given-names>S.</given-names></name> <name><surname>Mohajerian</surname> <given-names>P.</given-names></name> <name><surname>Ijspeert</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>Dynamics systems vs. optimal control&#x02014;a unifying view</article-title>. <source>Prog. Brain Res</source>. <volume>165</volume>, <fpage>425</fpage>&#x02013;<lpage>445</lpage>. <pub-id pub-id-type="doi">10.1016/S0079-6123(06)65027-9</pub-id><pub-id pub-id-type="pmid">17925262</pub-id></citation></ref>
<ref id="B157">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>K. L.</given-names></name> <name><surname>Ambadar</surname> <given-names>Z.</given-names></name> <name><surname>Cohn</surname> <given-names>J. F.</given-names></name> <name><surname>Reed</surname> <given-names>L. I.</given-names></name></person-group> (<year>2006</year>). <article-title>Movement differences between deliberate and spontaneous facial expressions: zygomaticus major action in smiling</article-title>. <source>J. Nonverb. Behav</source>. <volume>30</volume>, <fpage>37</fpage>&#x02013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1007/s10919-005-0003-x</pub-id><pub-id pub-id-type="pmid">19367343</pub-id></citation></ref>
<ref id="B158">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seidemann</surname> <given-names>E.</given-names></name> <name><surname>Meilijson</surname> <given-names>I.</given-names></name> <name><surname>Abeles</surname> <given-names>M.</given-names></name> <name><surname>Bergman</surname> <given-names>H.</given-names></name> <name><surname>Vaadia</surname> <given-names>E.</given-names></name></person-group> (<year>1996</year>). <article-title>Simultaneously recorded single units in the frontal cortex go through sequences of discrete and stable states in monkeys performing a delayed localization task</article-title>. <source>J. Neurosci</source>. <volume>16</volume>, <fpage>752</fpage>&#x02013;<lpage>768</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.16-02-00752.1996</pub-id><pub-id pub-id-type="pmid">8551358</pub-id></citation></ref>
<ref id="B159">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sherman</surname> <given-names>S. M.</given-names></name> <name><surname>Guillery</surname> <given-names>R.</given-names></name></person-group> (<year>1998</year>). <article-title>On the actions that one nerve cell can have on another: distinguishing &#x0201C;drivers&#x0201D; from &#x0201C;modulators</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>95</volume>, <fpage>7121</fpage>&#x02013;<lpage>7126</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.95.12.7121</pub-id><pub-id pub-id-type="pmid">9618549</pub-id></citation></ref>
<ref id="B160">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skaggs</surname> <given-names>W. E.</given-names></name> <name><surname>McNaughton</surname> <given-names>B. L.</given-names></name></person-group> (<year>1996</year>). <article-title>Replay of neuronal firing sequences in rat hippocampus during sleep following spatial experience</article-title>. <source>Science</source> <volume>271</volume>, <fpage>1870</fpage>&#x02013;<lpage>1873</lpage>. <pub-id pub-id-type="doi">10.1126/science.271.5257.1870</pub-id><pub-id pub-id-type="pmid">8596957</pub-id></citation></ref>
<ref id="B161">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soltani</surname> <given-names>A.</given-names></name> <name><surname>Khorsand</surname> <given-names>P.</given-names></name> <name><surname>Guo</surname> <given-names>C.</given-names></name> <name><surname>Farashahi</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Neural substrates of cognitive biases during probabilistic inference</article-title>. <source>Nat. Commun</source>. <volume>7</volume>:<fpage>11393</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms11393</pub-id><pub-id pub-id-type="pmid">27116102</pub-id></citation></ref>
<ref id="B162">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soltani</surname> <given-names>R.</given-names></name> <name><surname>Jiang</surname> <given-names>H.</given-names></name></person-group> (<year>2016</year>). <article-title>Higher order recurrent neural networks</article-title>. arXiv 1605.00064.</citation></ref>
<ref id="B163">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stachenfeld</surname> <given-names>K. L.</given-names></name> <name><surname>Botvinick</surname> <given-names>M. M.</given-names></name> <name><surname>Gershman</surname> <given-names>S. J.</given-names></name></person-group> (<year>2017</year>). <article-title>The hippocampus as a predictive map</article-title>. <source>Nat. Neurosci</source>. <volume>20</volume>:<fpage>1643</fpage>. <pub-id pub-id-type="doi">10.1038/nn.4650</pub-id></citation></ref>
<ref id="B164">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Strogatz</surname> <given-names>S. H.</given-names></name></person-group> (<year>2018</year>). <source>Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering</source>. <publisher-loc>Boca Raton, FL</publisher-loc>: <publisher-name>CRC Press</publisher-name>. <pub-id pub-id-type="doi">10.1201/9780429492563</pub-id></citation></ref>
<ref id="B165">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sussillo</surname> <given-names>D.</given-names></name> <name><surname>Abbott</surname> <given-names>L. F.</given-names></name></person-group> (<year>2009</year>). <article-title>Generating coherent patterns of activity from chaotic neural networks</article-title>. <source>Neuron</source> <volume>63</volume>, <fpage>544</fpage>&#x02013;<lpage>557</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2009.07.018</pub-id><pub-id pub-id-type="pmid">19709635</pub-id></citation></ref>
<ref id="B166">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taherkhani</surname> <given-names>A.</given-names></name> <name><surname>Belatreche</surname> <given-names>A.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Cosma</surname> <given-names>G.</given-names></name> <name><surname>Maguire</surname> <given-names>L. P.</given-names></name> <name><surname>McGinnity</surname> <given-names>T. M.</given-names></name></person-group> (<year>2020</year>). <article-title>A review of learning in biologically plausible spiking neural networks</article-title>. <source>Neural Netw</source>. <volume>122</volume>, <fpage>253</fpage>&#x02013;<lpage>272</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2019.09.036</pub-id><pub-id pub-id-type="pmid">31726331</pub-id></citation></ref>
<ref id="B167">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanaka</surname> <given-names>G.</given-names></name> <name><surname>Yamane</surname> <given-names>T.</given-names></name> <name><surname>H&#x000E9;roux</surname> <given-names>J. B.</given-names></name> <name><surname>Nakane</surname> <given-names>R.</given-names></name> <name><surname>Kanazawa</surname> <given-names>N.</given-names></name> <name><surname>Takeda</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Recent advances in physical reservoir computing: a review</article-title>. <source>Neural Netw</source>. <volume>115</volume>, <fpage>100</fpage>&#x02013;<lpage>123</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2019.03.005</pub-id><pub-id pub-id-type="pmid">30981085</pub-id></citation></ref>
<ref id="B168">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tokdar</surname> <given-names>S.</given-names></name> <name><surname>Xi</surname> <given-names>P.</given-names></name> <name><surname>Kelly</surname> <given-names>R. C.</given-names></name> <name><surname>Kass</surname> <given-names>R. E.</given-names></name></person-group> (<year>2010</year>). <article-title>Detection of bursts in extracellular spike trains using hidden semi-Markov point process models</article-title>. <source>J. Comput. Neurosci</source>. <volume>29</volume>, <fpage>203</fpage>&#x02013;<lpage>212</lpage>. <pub-id pub-id-type="doi">10.1007/s10827-009-0182-2</pub-id><pub-id pub-id-type="pmid">19697116</pub-id></citation></ref>
<ref id="B169">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Toutounji</surname> <given-names>H.</given-names></name> <name><surname>Pipa</surname> <given-names>G.</given-names></name></person-group> (<year>2014</year>). <article-title>Spatiotemporal computations of an excitable and plastic brain: neuronal plasticity leads to noise-robust and noise-constructive computations</article-title>. <source>PLoS Comput. Biol</source>. <volume>10</volume>:<fpage>e1003512</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1003512</pub-id><pub-id pub-id-type="pmid">24651447</pub-id></citation></ref>
<ref id="B170">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tully</surname> <given-names>P. J.</given-names></name> <name><surname>Lind&#x000E9;n</surname> <given-names>H.</given-names></name> <name><surname>Hennig</surname> <given-names>M. H.</given-names></name> <name><surname>Lansner</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>Spike-based Bayesian-Hebbian learning of temporal sequences</article-title>. <source>PLoS Comput. Biol</source>. <volume>12</volume>:<fpage>e1004954</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004954</pub-id><pub-id pub-id-type="pmid">27213810</pub-id></citation></ref>
<ref id="B171">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ulrych</surname> <given-names>T. J.</given-names></name> <name><surname>Sacchi</surname> <given-names>M. D.</given-names></name> <name><surname>Woodbury</surname> <given-names>A.</given-names></name></person-group> (<year>2001</year>). <article-title>A bayes tour of inversion: a tutorial</article-title>. <source>Geophysics</source> <volume>66</volume>, <fpage>55</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1190/1.1444923</pub-id></citation></ref>
<ref id="B172">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>VanRullen</surname> <given-names>R.</given-names></name> <name><surname>Koch</surname> <given-names>C.</given-names></name></person-group> (<year>2003</year>). <article-title>Is perception discrete or continuous?</article-title> <source>Trends Cogn. Sci</source>. <volume>7</volume>, <fpage>207</fpage>&#x02013;<lpage>213</lpage>. <pub-id pub-id-type="doi">10.1016/S1364-6613(03)00095-0</pub-id><pub-id pub-id-type="pmid">12757822</pub-id></citation></ref>
<ref id="B173">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varona</surname> <given-names>P.</given-names></name> <name><surname>Rabinovich</surname> <given-names>M. I.</given-names></name> <name><surname>Selverston</surname> <given-names>A. I.</given-names></name> <name><surname>Arshavsky</surname> <given-names>Y. I.</given-names></name></person-group> (<year>2002</year>). <article-title>Winnerless competition between sensory neurons generates chaos: a possible mechanism for molluscan hunting behavior</article-title>. <source>Chaos</source> <volume>12</volume>, <fpage>672</fpage>&#x02013;<lpage>677</lpage>. <pub-id pub-id-type="doi">10.1063/1.1498155</pub-id><pub-id pub-id-type="pmid">12779595</pub-id></citation></ref>
<ref id="B174">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Watzenig</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>Bayesian inference for inverse problems-statistical inversion</article-title>. <source>Elektrotech. Inform</source>. <volume>124</volume>, <fpage>240</fpage>&#x02013;<lpage>247</lpage>. <pub-id pub-id-type="doi">10.1007/s00502-007-0449-0</pub-id><pub-id pub-id-type="pmid">12812458</pub-id></citation></ref>
<ref id="B175">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weiss</surname> <given-names>Y.</given-names></name> <name><surname>Simoncelli</surname> <given-names>E. P.</given-names></name> <name><surname>Adelson</surname> <given-names>E. H.</given-names></name></person-group> (<year>2002</year>). <article-title>Motion illusions as optimal percepts</article-title>. <source>Nat. Neurosci</source>. <volume>5</volume>, <fpage>598</fpage>&#x02013;<lpage>604</lpage>. <pub-id pub-id-type="doi">10.1038/nn0602-858</pub-id><pub-id pub-id-type="pmid">12021763</pub-id></citation></ref>
<ref id="B176">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolpert</surname> <given-names>D. M.</given-names></name> <name><surname>Ghahramani</surname> <given-names>Z.</given-names></name> <name><surname>Jordan</surname> <given-names>M. I.</given-names></name></person-group> (<year>1995</year>). <article-title>An internal model for sensorimotor integration</article-title>. <source>Science</source> <volume>269</volume>, <fpage>1880</fpage>&#x02013;<lpage>1882</lpage>. <pub-id pub-id-type="doi">10.1126/science.7569931</pub-id></citation></ref>
<ref id="B177">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>W&#x000F6;rg&#x000F6;tter</surname> <given-names>F.</given-names></name> <name><surname>Porr</surname> <given-names>B.</given-names></name></person-group> (<year>2005</year>). <article-title>Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms</article-title>. <source>Neural Comput</source>. <volume>17</volume>, <fpage>245</fpage>&#x02013;<lpage>319</lpage>. <pub-id pub-id-type="doi">10.1162/0899766053011555</pub-id><pub-id pub-id-type="pmid">15720770</pub-id></citation></ref>
<ref id="B178">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>Z.</given-names></name> <name><surname>King</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Investigating gated recurrent networks for speech synthesis,</article-title> in <source>2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</source> (<publisher-loc>IEEE</publisher-loc>), <fpage>5140</fpage>&#x02013;<lpage>5144</lpage>. <pub-id pub-id-type="doi">10.1109/ICASSP.2016.7472657</pub-id></citation></ref>
<ref id="B179">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamashita</surname> <given-names>Y.</given-names></name> <name><surname>Tani</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>Emergence of functional hierarchy in a multiple timescale neural network model: a humanoid robot experiment</article-title>. <source>PLoS Comput. Biol</source>. <volume>4</volume>:<fpage>e1000220</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1000220</pub-id><pub-id pub-id-type="pmid">18989398</pub-id></citation></ref>
<ref id="B180">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yan</surname> <given-names>S.</given-names></name> <name><surname>Smith</surname> <given-names>J. S.</given-names></name> <name><surname>Lu</surname> <given-names>W.</given-names></name> <name><surname>Zhang</surname> <given-names>B.</given-names></name></person-group> (<year>2018</year>). <article-title>Hierarchical multi-scale attention networks for action recognition</article-title>. <source>Signal Process</source>. <volume>61</volume>, <fpage>73</fpage>&#x02013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1016/j.image.2017.11.005</pub-id></citation></ref>
<ref id="B181">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yildiz</surname> <given-names>I. B.</given-names></name> <name><surname>Jaeger</surname> <given-names>H.</given-names></name> <name><surname>Kiebel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Re-visiting the echo state property</article-title>. <source>Neural Netw</source>. <volume>35</volume>, <fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2012.07.005</pub-id><pub-id pub-id-type="pmid">22885243</pub-id></citation></ref>
<ref id="B182">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yildiz</surname> <given-names>I. B.</given-names></name> <name><surname>Kiebel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2011</year>). <article-title>A hierarchical neuronal model for generation and online recognition of birdsongs</article-title>. <source>PLoS Comput. Biol</source>. <volume>7</volume>:<fpage>e1002303</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002303</pub-id><pub-id pub-id-type="pmid">22194676</pub-id></citation></ref>
<ref id="B183">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yildiz</surname> <given-names>I. B.</given-names></name> <name><surname>von Kriegstein</surname> <given-names>K.</given-names></name> <name><surname>Kiebel</surname> <given-names>S. J.</given-names></name></person-group> (<year>2013</year>). <article-title>From birdsong to human speech recognition: Bayesian inference on a hierarchy of nonlinear dynamical systems</article-title>. <source>PLoS Comput. Biol</source>. <volume>9</volume>:<fpage>e1003219</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1003219</pub-id><pub-id pub-id-type="pmid">24068902</pub-id></citation></ref>
<ref id="B184">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>S. Z.</given-names></name></person-group> (<year>2015</year>). <source>Hidden Semi-Markov Models: Theory, Algorithms and Applications</source>. <publisher-loc>Burlingotn, MA</publisher-loc>: <publisher-name>Morgan Kaufmann</publisher-name>.</citation></ref>
<ref id="B185">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>Y.</given-names></name> <name><surname>Si</surname> <given-names>X.</given-names></name> <name><surname>Hu</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>A review of recurrent neural networks: LSTM cells and network architectures</article-title>. <source>Neural Comput</source>. <volume>31</volume>, <fpage>1235</fpage>&#x02013;<lpage>1270</lpage>. <pub-id pub-id-type="doi">10.1162/neco_a_01199</pub-id><pub-id pub-id-type="pmid">31113301</pub-id></citation></ref>
<ref id="B186">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeki</surname> <given-names>S.</given-names></name> <name><surname>Shipp</surname> <given-names>S.</given-names></name></person-group> (<year>1988</year>). <article-title>The functional logic of cortical connections</article-title>. <source>Nature</source> <volume>335</volume>:<fpage>311</fpage>. <pub-id pub-id-type="doi">10.1038/335311a0</pub-id></citation></ref>
<ref id="B187">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zemel</surname> <given-names>R. S.</given-names></name> <name><surname>Dayan</surname> <given-names>P.</given-names></name> <name><surname>Pouget</surname> <given-names>A.</given-names></name></person-group> (<year>1998</year>). <article-title>Probabilistic interpretation of population codes</article-title>. <source>Neural Comput</source>. <volume>10</volume>, <fpage>403</fpage>&#x02013;<lpage>430</lpage>. <pub-id pub-id-type="doi">10.1162/089976698300017818</pub-id></citation></ref>
<ref id="B188">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zen</surname> <given-names>H.</given-names></name> <name><surname>Tokuda</surname> <given-names>K.</given-names></name> <name><surname>Masuko</surname> <given-names>T.</given-names></name> <name><surname>Kobayashi</surname> <given-names>T.</given-names></name> <name><surname>Kitamura</surname> <given-names>T.</given-names></name></person-group> (<year>2004</year>). <article-title>Hidden semi-markov model based speech synthesis,</article-title> in <source>Eighth International Conference on Spoken Language Processing</source> (<publisher-loc>Jeju Island</publisher-loc>).</citation></ref>
<ref id="B189">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>C.</given-names></name> <name><surname>Woodland</surname> <given-names>P. C.</given-names></name></person-group> (<year>2018</year>). <article-title>High order recurrent neural networks for acoustic modelling,</article-title> in <source>2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</source> (<publisher-loc>Calgary, AB</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>5849</fpage>&#x02013;<lpage>5853</lpage>. <pub-id pub-id-type="doi">10.1109/ICASSP.2018.8461608</pub-id></citation></ref>
<ref id="B190">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>P.</given-names></name> <name><surname>Triesch</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Robust development of synfire chains from multiple plasticity mechanisms</article-title>. <source>Front. Comput. Neurosci</source>. <volume>8</volume>:<fpage>66</fpage>. <pub-id pub-id-type="doi">10.3389/fncom.2014.00066</pub-id><pub-id pub-id-type="pmid">25071537</pub-id></citation></ref>
<ref id="B191">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zutshi</surname> <given-names>I.</given-names></name> <name><surname>Leutgeb</surname> <given-names>J. K.</given-names></name> <name><surname>Leutgeb</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>Theta sequences of grid cell populations can provide a movement-direction signal</article-title>. <source>Curr. Opin. Behav. Sci</source>. <volume>17</volume>, <fpage>147</fpage>&#x02013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1016/j.cobeha.2017.08.012</pub-id><pub-id pub-id-type="pmid">29333481</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> This work was funded by the German Research Foundation (DFG, Deutsche Forschungsgemeinschaft), SFB 940/2 - Project ID 178833530 A9, TRR 265/1 - Project ID 402170461 B09, and as part of Germany&#x00027;s Excellence Strategy - EXC 2050/1 - Project ID 390696704 -Cluster of Excellence Centre for Tactile Internet with Human-in-the-Loop (CeTI) of Technische Universit&#x000E4;t Dresden.</p>
</fn>
</fn-group>
</back>
</article>