<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2020.584017</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Hypothesis and Theory</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Primary Cognitive Categories Are Determined by Their Invariances</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>G&#x000E4;rdenfors</surname> <given-names>Peter</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/304448/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Cognitive Science, Department of Philosophy, Lund University</institution>, <addr-line>Lund</addr-line>, <country>Sweden</country></aff>
<aff id="aff2"><sup>2</sup><institution>Faculty of Humanities, Palaeo-Research Institute, University of Johannesburg</institution>, <addr-line>Johannesburg</addr-line>, <country>South Africa</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Guy Dove, University of Louisville, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Ute Schmid, University of Bamberg, Germany; Daniel Weiskopf, Georgia State University, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Peter G&#x000E4;rdenfors <email>peter.gardenfors&#x00040;lucs.lu.se</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Cognitive Science, a section of the journal Frontiers in Psychology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>08</day>
<month>12</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>11</volume>
<elocation-id>584017</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>07</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>13</day>
<month>11</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2020 G&#x000E4;rdenfors.</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>G&#x000E4;rdenfors</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract><p>The world as we perceive it is structured into objects, actions and places that form parts of events. In this article, my aim is to explain why these categories are cognitively primary. From an empiricist and evolutionary standpoint, it is argued that the reduction of the complexity of sensory signals is based on the brain&#x00027;s capacity to identify various types of invariances that are evolutionarily relevant for the activities of the organism. The first aim of the article is to explain why places, objects and actions are primary cognitive categories in our constructions of the external world. It is shown that the invariances that determine these categories have their separate characteristics and that they are, by and large, independent of each other. This separation is supported by what is known about the neural mechanisms. The second aim is to show that the category of events can be analyzed as being constituted of the primary categories. The category of numbers is briefly discussed. Some implications for computational models of the categories are also presented.</p></abstract>
<kwd-group>
<kwd>category</kwd>
<kwd>invariance</kwd>
<kwd>space</kwd>
<kwd>object</kwd>
<kwd>place</kwd>
<kwd>event</kwd>
<kwd>number</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="98"/>
<page-count count="11"/>
<word-count count="10119"/>
</counts>
</article-meta>
</front>
<body>

<sec id="s1">
<title>What Determines the Categorical Structure of our Perceptions?</title>
<p>The world as we perceive it is structured into <italic>objects, places</italic>, and <italic>actions</italic> that form parts of <italic>events</italic>. We have a strong tendency to be realists, that is, to believe that these categories exist out there in the world. Kant taught us, however, to distinguish between &#x0201C;das Ding an sich&#x0201D; and &#x0201C;das Ding f&#x000FC;r uns.&#x0201D; According to him, and much of modern cognitive science (e.g., Marr, <xref ref-type="bibr" rid="B57">1982</xref>; Humphrey, <xref ref-type="bibr" rid="B44">1993</xref>; Anderson et al., <xref ref-type="bibr" rid="B3">1998</xref>; Von Glasersfeld, <xref ref-type="bibr" rid="B83">2005</xref>; Hoffman, <xref ref-type="bibr" rid="B43">2019</xref>), we cannot know external reality but only how our minds construct the world. For such a constructivist position, a fundamental question is why our mental constructs end up with categories of objects, places and actions. The answer, as always, should be grounded in the evolutionary mechanisms that have molded our perceptual systems and in how the brain handles the information presented by these systems.</p>
<p>The senses generate an extremely rich and unstructured mass of signals. When trying to understand what happens to the sensory information in our brains, it is standard to distinguish between <italic>sensations</italic> and <italic>perceptions</italic>. Our subjective world is full of colors and patterns that we see, things that we taste and smell, itches, pains, and sensations of cold that we feel. In philosophy such sensations are called <italic>qualia</italic>. The evolutionary value of sensations is that they inform us about what is happening right now to our bodies (Humphrey, <xref ref-type="bibr" rid="B44">1993</xref>).</p>
<p>An individual that also receives signals about what is going on in the world and not only what is happening to its body will be better prepared to foresee the future and thus to survive in a challenging environment. This is the purpose of perceptions. In order to make sense of the sensations, the perceptions result from processes in the brain that reduce their complexity by structuring them into kinds of entities. In this article, I argue that this complexity reduction is based on the brain&#x00027;s capacity to identify various types of <italic>invariances</italic> in the sensory signals&#x02014;invariances that are evolutionarily relevant for the activities of the organism. My aims are, firstly, to explain why places, objects and actions are <italic>primary cognitive categories</italic> in our constructions of the external world, and, secondly, to show how these components generate cognitive representations of <italic>events</italic>.</p>
<p>Traditionally, there are two approaches to the functioning of the mechanisms of our brain that generate the primary categories: (1) nativism: the categories are <italic>innate</italic>; and (2) empiricism: the categories are <italic>learned</italic>. Spelke and Carey (Spelke, <xref ref-type="bibr" rid="B73">2000</xref>, <xref ref-type="bibr" rid="B74">2004</xref>; Spelke and Kinzler, <xref ref-type="bibr" rid="B76">2007</xref>; Carey, <xref ref-type="bibr" rid="B9">2009</xref>) propose objects, actions, space and numbers as &#x0201C;core knowledge domains,&#x0201D; which form the framework of perceptual categories. They defend a nativist position in relation to child development. In contrast, my solution will be empiricist (G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B23">2018</xref>), although I will suggest that the structure of the brain imposes constraints on how the categories are learned. There is thus a nativist element in my analysis, albeit of a different kind than that advocated by Spelke and Carey. Following them, I will also briefly discuss to what extent numbers form another primary cognitive category.</p>
<p>A part of an evolutionarily grounded argument builds on the fact that human (and other mammals&#x00027;) infants are not born as blank slates (Pinker, <xref ref-type="bibr" rid="B65">2002</xref>). By evolutionary processes, the brain is prepared to pick the most relevant invariances (see e.g., Leibo et al., <xref ref-type="bibr" rid="B54">2015</xref>). As examples of how the brain organizes invariances, the dorsal stream of the cortex handles space representation (the where pathway), the ventral stream generates object representation (the what pathway) and the dorsal stream accounts for action representation (the how pathway). Even though these pathways are to some extent neurologically given, the infant must, however, <italic>learn</italic> to identify the invariances that create the most relevant cognitive categories. After the invariances have been learned, the plasticity of the cortex still supports considerable relearning: An amazing example is that a person who is given goggles turning the visual field upside-down will, after a few weeks, be able to relearn the projection from the visual cortex so that the perceived world is &#x0201C;normal&#x0201D; again (Kohler, <xref ref-type="bibr" rid="B49">1951</xref>).</p>
<p>The brain&#x00027;s strong capacity to detect invariances leads to the crucial question of which cognitive categories are the most fundamental. A central question for the analysis becomes: <italic>Why</italic> are the invariances that determine places, objects and actions cognitively primary?<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref> This is, in a sense, a neo-Kantian epistemological question, seeking the &#x0201C;forms of perception&#x0201D; (&#x0201C;Anschauungsformen&#x0201D;) that generate the framework for more specific categorizations.</p>
<p>By using an analysis in terms of invariances, I will show that each of the categories of places, objects and actions has its separate characteristics and that they are, by and large, independent of each other. A preliminary attempt to identify primary cognitive categories in terms of invariances for space, objects and actions was made in G&#x000E4;rdenfors (<xref ref-type="bibr" rid="B23">2018</xref>). That paper dealt with two learning processes: how the primary categories are learned and how concepts that are grounded in the categories are learned. This paper presents a more detailed analysis of the role of invariances and also analyses the categories of numbers and events.</p>
</sec>
<sec id="s2">
<title>Extracting Structure: Invariances in Perception</title>
<p>The primary categories build up our perceptual structures. My thesis is that the sensations, at an early stage of the process in the brain, become perceptions that are organized along primary ontological categories, in particular space, objects and actions. By saying that the categories are primary, I mean that they form the fundament from which specific concepts are constructed, for example, places as regions of the space, object categories as determined by specific properties or part-whole relations, etc. Since they are founded in the mechanisms of the human brain, they are also seen as common to all humans.</p>
<p>My approach to perception is in some respects similar to Gibson&#x00027;s (<xref ref-type="bibr" rid="B35">1966</xref>, <xref ref-type="bibr" rid="B36">1979</xref>) &#x0201C;ecological approach.&#x0201D; He writes: &#x0201C;The individual does not have to construct an awareness of the world from bare intensities and frequencies of energy; he has to detect the world from invariant properties in the flux of energy&#x0201D; (Gibson, <xref ref-type="bibr" rid="B35">1966</xref>, 319). A useful metaphor is that the brain <italic>resonates</italic> with the sensory information. Gibson (<xref ref-type="bibr" rid="B35">1966</xref>, 201) defines an invariant as a &#x0201C;non-change&#x0201D; that persists during change. This definition is not very useful for identifying invariances, so I will instead rely on well-known types of invariances, some taken from physics and some from analyses of children&#x00027;s cognitive processes. Following Breidbach and Jost (<xref ref-type="bibr" rid="B8">2006</xref>), I outline in this section how a theory of perceptual invariances can explain our primary categories. A central type of perceptual information is what remains invariant when an agent moves through the environment and interacts with objects in it (see also Cutting, <xref ref-type="bibr" rid="B10">1986</xref>).</p>
<p>Unlike Gibson, I take a constructivist position and do not claim that invariances are &#x0201C;out there,&#x0201D; ready to be &#x0201C;picked up&#x0201D; by the brain. In contrast, I view invariances as something that is constructed by various processes in the brain. Not all possible invariances are constructed&#x02014;only those that are relevant for survival. Over the millennia, evolution has selected the invariances that are most salient for the activities of the organism.</p>
<p>One central notion for the analysis of invariances is <italic>fungibility</italic><xref ref-type="fn" rid="fn0002"><sup>2</sup></xref>, that is, replacements of equivalents. For example, a place remains the same independently of which objects are located at the place. In other words, objects are fungible with respect to places. Similarly, an object remains the same independently of which place it is located at, so places are fungible with respect to objects. These two types of fungibility form the main reason why the place and the object categories are independent<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref>.</p>
</sec>
<sec id="s3">
<title>Space</title>
<p>According to Gibson&#x00027;s approach, the visual field is determined from invariances such as texture gradients, occlusions and visual flow. To a large extent the visual flow is determined by the movements of our bodies. Turning our heads and letting our eyes follow along, for example, leads to very rapid changes in the image that reaches the retina. However, our brain simultaneously produces a representation of the surrounding space that remains still relative to the direction of our body.</p>
<p>During the first months of life, an infant learns how to coordinate sensory input&#x02014;vision, hearing, and touch&#x02014;with motor activities (Thelen and Smith, <xref ref-type="bibr" rid="B78">1994</xref>). The infant engages in &#x0201C;motor babbling&#x0201D; that generates an egocentric representation of space, coordinating it with its actions. As Gibson (<xref ref-type="bibr" rid="B36">1979</xref>: 2) writes, &#x0201C;the environment to be perceived [&#x02026;] is not the world of physics but the world at the level of ecology.&#x0201D; The space we perceive can be divided into <italic>peripersonal</italic> space&#x02014;the region immediately surrounding our bodies (di Pellegrino and L&#x000E0;davas, <xref ref-type="bibr" rid="B12">2015</xref>)&#x02014;and <italic>extrapersonal</italic> space, which is the space beyond our reach.</p>
<p>The peripersonal space makes it possible for an individual to see its <italic>field of action</italic>. When only the head moves and not the rest of the body, an individual&#x00027;s potential to act does not change. Since the hand actions of the individual occur in front of the body, it is more efficient if the brain creates a space that is constant in relation to the body direction. The peripersonal representation of space is therefore invariant of the direction of the eyes and the head. The space that is constructed is a three-dimensional space where the body determines its origin and principal direction<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref>.</p>
<p>The representation of visual space then expands during the child&#x00027;s development. Firstly, when the auditory input is coordinated with the visual, the represented space extends beyond the child&#x00027;s current visual field to cover the entire surrounding space. The child is then able to direct its attention outside its peripersonal field, and the represented space becomes extrapersonal. Importantly, the egocentric representation of space that results from this extension is no longer just visual, but an <italic>amodal</italic> representation based on visual, auditory, tactile, and perhaps even olfactory sensations.</p>
<p>The adult visuo-spatial category should thus be seen as a combination of a peripersonal and an extrapersonal space. The two representations have different basic functions: The peripersonal is used for reaching and interacting with objects, and the extrapersonal for surveillance and navigation (Gallistel, <xref ref-type="bibr" rid="B19">1990</xref>).</p>
<p>Several experiments support the view that the space category is not an innate structure. It must be learned through <italic>interaction</italic> with the world, where a first step is eye-hand coordination (e.g., Held and Hein, <xref ref-type="bibr" rid="B39">1963</xref>; Agrawal et al., <xref ref-type="bibr" rid="B2">2015</xref>). In this process, the individual must learn how visual (and auditory) information can be used to create meaningful fields of action. For example, getting a new pair of glasses with stronger lenses changes the conditions for this process. Further experience is required before the brain has construed an adjusted space and can provide the perceptions needed for carrying out precise actions, such as walking down stairs without stumbling.</p>
<p>A second extension of the space representation involves the ability to represent an <italic>allocentric</italic> space. This is an <italic>imagined</italic> space where the location of the individual is no longer a fixed point. The allocentric representation makes it possible for an individual to abandon the egocentric perspective and instead imagine what the world looks like from another point of view<xref ref-type="fn" rid="fn0005"><sup>5</sup></xref>. The allocentric space representation is not just invariant of eye and head orientation but also of the <italic>orientation and location</italic> of the body. The primary role of the allocentric space is to allow planning for movements through space. Piaget and Inhelder&#x00027;s (<xref ref-type="bibr" rid="B64">1967</xref>) three mountain test was developed to determine when children master problem solving using representations of allocentric space. For a survey of how humans represent space, see Tversky (<xref ref-type="bibr" rid="B81">2003</xref>).</p>
<p>In the brain, a self-centered representation of location is transformed into an allocentric representation by a network involving the posterior parietal cortex, the medial retrosplenial complex and the hippocampal formation (hippocampus and entorhinal cortex) (Nau et al., <xref ref-type="bibr" rid="B61">2018</xref>). The allocentric representation in the hippocampal formation then projects allocentric coordinates back to guide navigation.</p>
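<p>The geometric core of such an egocentric-to-allocentric transformation can be sketched in a few lines of Python. This is only an illustration of the coordinate change, not a model of the neural network described above; the function names, the two-dimensional setting and the example poses are assumptions made for the sketch. It shows that the egocentric coordinates of a landmark vary with the position and heading of the observer, whereas the recovered allocentric coordinates stay the same.</p>
<preformat>
```python
import math

def to_egocentric(landmark, position, heading):
    """World coordinates -> body-centred coordinates (x ahead, y to the left)."""
    dx, dy = landmark[0] - position[0], landmark[1] - position[1]
    c, s = math.cos(heading), math.sin(heading)
    return (c * dx + s * dy, -s * dx + c * dy)

def to_allocentric(ego, position, heading):
    """Body-centred coordinates -> world coordinates (inverse of the above)."""
    c, s = math.cos(heading), math.sin(heading)
    return (position[0] + c * ego[0] - s * ego[1],
            position[1] + s * ego[0] + c * ego[1])

# Hypothetical landmark and two hypothetical observer poses (position, heading).
landmark = (4.0, 3.0)
pose_a = ((0.0, 0.0), 0.0)
pose_b = ((1.0, 1.0), math.pi / 2)

ego_a = to_egocentric(landmark, *pose_a)
ego_b = to_egocentric(landmark, *pose_b)
world_a = to_allocentric(ego_a, *pose_a)
world_b = to_allocentric(ego_b, *pose_b)

print("egocentric:", ego_a, ego_b)       # differ between the two poses
print("allocentric:", world_a, world_b)  # agree up to rounding error
```
</preformat>
<p>In the sketch, the heading is measured in radians from the world x-axis. The two egocentric descriptions differ because they depend on the pose of the body, while both map back to the same allocentric location of the landmark.</p>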
<p>Importantly, by extracting the various forms of invariances, the egocentric and allocentric spaces that are generated considerably <italic>reduce</italic> the complexity of the information that hits the retinas. If the constructed allocentric space were perfectly invariant under rotations and translations (so-called Galilean transformations, Levy-Leblond, <xref ref-type="bibr" rid="B56">1971</xref>), it would follow that the resulting visual space is three-dimensional Euclidean. However, since our movements mainly take place in the two horizontal dimensions, the vertical dimension is less important for our perception. Consequently, our perception of the vertical dimension is &#x0201C;flattened&#x0201D; (Kaufman and Kaufman, <xref ref-type="bibr" rid="B48">2000</xref>).</p>
<p>An important aspect of the representation of space is that it is invariant of <italic>time</italic>. When we move or turn around, we perform rotational and translational transformations of the perceptual input. If these transformations were not invariant over time, it would not be possible to use the represented space as a basis for actions. This point was made already by Gibson (<xref ref-type="bibr" rid="B35">1966</xref>, 264): &#x0201C;An individual who explores a strange place by locomotion produces transformations of the optic array for the very purpose of isolating what remains invariant during these transformations&#x0201D; (see also Agrawal et al., <xref ref-type="bibr" rid="B2">2015</xref>).</p>
<p>The domain of space can be divided into regions or <italic>places</italic>. The identity of a place is determined by its relation to a set of <italic>landmarks</italic> and not by its location in relation to some fixed coordinate system. For example, from my perspective your location may be in the passenger seat of my car that is moving through the landscape. The landmark is the car that determines the relative places inside it. For an extreme case, consider that the earth is rotating around the sun at a very high speed. Nevertheless, we take the earth to be the landmark and say that Sweden is located in northern Europe.</p>
<p>A place is also, to a large extent, invariant of the objects located there<xref ref-type="fn" rid="fn0006"><sup>6</sup></xref>. If somebody else sits in the passenger seat of my car, it will still be the same place. If Sweden, due to severe climate changes, turns into a desert, its identity as a place does not change. As mentioned earlier, we can say that objects are <italic>fungible</italic> with respect to places. Similarly, actions are fungible with respect to places&#x02014;the identity of a place does not depend on what is done there.</p>
<p>Sometimes, other properties than a set of landmarks are used to identify a place, for example its <italic>function</italic>. For example, in 1988 the Australian parliament moved from its old house to a new one in Canberra. Still one can refer to &#x0201C;the parliament&#x0201D; as a location. A more exotic example is that the entire town of Kiruna in northern Sweden will be moved two miles to the east because there is a risk that the extensive iron mining under the town will lead to a collapse of the ground. New streets will be laid out and many of the historic houses and official buildings will be moved to the new location, but the spatial relations between the buildings will not be preserved. Still the identity of the town will be preserved for most practical purposes.</p>
</sec>
<sec id="s4">
<title>Objects</title>
<p>There are many kinds of objects, but I will focus on physical objects, since they have been the most important in the evolution of our cognitive systems. A central property of physical objects is that they have a <italic>shape</italic> (although it may vary over time). This means that the relative locations of different parts of an object can be described in terms of different types of invariances. For a rigid object, the invariances are total. The directions of the parts may change as the object moves, but all the spatial relations between the parts are invariant. For an object with movable parts, such as an animal, the relations between the locations within each part are more or less invariant and so are the relative locations of the points where the different parts are connected<xref ref-type="fn" rid="fn0007"><sup>7</sup></xref>. For example, the parts of your upper leg don&#x00027;t change their relative distances and the connection point between your leg and your body remains invariant. Johansson (<xref ref-type="bibr" rid="B45">1964</xref>) calls this type of invariance the &#x0201C;rigidity principle&#x0201D; that functions as a constraint of the visual process: Whenever equal motions in a series of simultaneous proximal elements are detected, the result is a perception of rigidity. Marr (<xref ref-type="bibr" rid="B57">1982</xref>) uses this principle extensively in his representation of shapes (for a computationally implemented model see Zhu and Yuille, <xref ref-type="bibr" rid="B97">1996</xref>).</p>
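<p>The claim that the invariances are total for a rigid object can be made concrete with a small Python sketch. It is a geometric illustration only, not an implementation of Johansson&#x00027;s or Marr&#x00027;s models; the marker points, the two-dimensional setting and the function names are assumptions introduced for the example. The test checks whether all pairwise distances between points are preserved when the object is rotated and translated, and shows that the test fails once a single point is displaced relative to the others.</p>
<preformat>
```python
import math

def rigid_motion(points, angle, shift):
    """Rotate 2D points by `angle` (radians) about the origin, then translate."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y + shift[0], s * x + c * y + shift[1])
            for x, y in points]

def pairwise_distances(points):
    """Distances between all pairs of points, in a fixed order."""
    return [math.dist(p, q)
            for i, p in enumerate(points)
            for q in points[i + 1:]]

def is_rigid(before, after, tol=1e-9):
    """True if every pairwise distance is the same in both configurations."""
    d0, d1 = pairwise_distances(before), pairwise_distances(after)
    return all(tol >= abs(a - b) for a, b in zip(d0, d1))

# Hypothetical object: four marker points on a rigid plate.
plate = [(0.0, 0.0), (2.0, 0.0), (2.0, 1.0), (0.0, 1.0)]
moved = rigid_motion(plate, angle=0.7, shift=(3.0, -1.5))

# Displacing one marker relative to the others deforms the object.
deformed = moved[:-1] + [(moved[-1][0] + 0.5, moved[-1][1])]

print(is_rigid(plate, moved))
print(is_rigid(plate, deformed))
```
</preformat>
<p>The directions of the parts change under the motion, but the spatial relations between them do not, which is the sense of invariance at issue for rigid objects.</p>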
<p>In addition to rigidity or relative rigidity, there are many other types of invariants that apply to objects. The <italic>size</italic> of an object is, for example, typically invariant&#x02014;at least over short periods of time. This invariance makes it possible to accurately judge the distance to an object. Murray et al. (<xref ref-type="bibr" rid="B60">2005</xref>) show that size invariance has been picked up already in the dorsal retinotopic visual area V3. Another property exhibiting invariance is <italic>color</italic>. For many kinds of objects, for example, different species of birds, the patterns of colors are characteristic features. The absolute colors of objects are not invariant, however, since they vary with the illumination. However, the perceptual <italic>relations</italic> between the colors of an object are, in most cases, invariant (Land, <xref ref-type="bibr" rid="B52">1977</xref>).</p>
<p>Some objects are deformable, for example cushions, towels and doughs. Even though invariances of relative locations are less stable for such objects, the changes of relative locations are still continuous. This is what distinguishes objects from masses. Another general type of invariance is that objects are <italic>cohesive</italic>: if you pull at one end of an object, the other parts will follow. Clouds, flames and shadows are therefore marginal as objects. Leslie (<xref ref-type="bibr" rid="B55">1996</xref>) argues that infants just a few months old perceive the world as consisting of cohesive objects that keep much of the same form even when moving.</p>
<p>Clouds, flames and shadows indicate that there are grades of objecthood: they have properties that make them come close to being masses rather than objects. The characteristic distinction between masses and objects is that masses, such as water and sand, (i) do not have a constant shape, (ii) are variable in size and (iii) are homogenous in material. Linguistically, the distinction shows up in that mass nouns are not countable&#x02014;one does not say &#x0201C;two sands&#x0201D;&#x02014;but nouns for objects are<xref ref-type="fn" rid="fn0008"><sup>8</sup></xref>.</p>
<p>Neuroscientific support for the thesis about invariances determining the object category is becoming stronger. In particular, Leibo et al. (<xref ref-type="bibr" rid="B54">2015</xref>) and Anselmi et al. (<xref ref-type="bibr" rid="B4">2016</xref>) present a neural model of object and face recognition based on invariances that builds on the idea that the main task of the ventral stream of visual processing is to compute a &#x0201C;signature for recognition&#x0201D; that is invariant of translations and rotations. They also show that when the relevant transformations have been learned for some objects, they generalize to other objects. For example, if we see a new face in a frontal position, we can accurately predict what it will look like if turned to the side. The grouping of objects is done by their transformation compatibility, that is, the class of transformations that preserve their identity. Another type of support comes from Kriegeskorte et al. (<xref ref-type="bibr" rid="B50">2008</xref>) who show that the inferior temporal cortex of monkeys and humans share a common code for representing objects, in particular concerning major distinctions such as animate&#x02013;inanimate and face&#x02013;body. The response patterns in the cortex form category clusters that match between monkeys and humans.</p>
<p>The perception of objects also involves an extensive reduction of the dimensions of the sensory input. Several computational procedures for dimension reduction have been proposed, for example Principal Component Analysis (Abdi and Williams, <xref ref-type="bibr" rid="B1">2010</xref>) and Multidimensional Scaling (Kruskal and Wish, <xref ref-type="bibr" rid="B51">1978</xref>; Borg and Groenen, <xref ref-type="bibr" rid="B7">2005</xref>). It is not known, however, how similar these procedures are to real brain processes. Wiskott and Sejnowski (<xref ref-type="bibr" rid="B86">2002</xref>) have developed an artificial neural network based on &#x0201C;slow feature analysis&#x0201D; that is able to pick up translation, size, rotation, illumination and contrast invariances of objects. From a neuro-cognitive point of view, an interesting feature of the neural network is that the &#x0201C;what&#x0201D; and the &#x0201C;where&#x0201D; components become represented in separate components of the network. This provides indirect support for my hypothesis that the space and object invariances can be separated. The invariances that lead to the dimension reduction, both in Wiskott and Sejnowski&#x00027;s model and in that of Anselmi et al. (<xref ref-type="bibr" rid="B4">2016</xref>), show that the dimensional structure that is represented is closely related to a 3D Euclidean space. This is congenial with proposals that the hippocampal formation is not solely used to represent spatial information, but is also exploited to represent other types of conceptual spaces (Eichenbaum and Cohen, <xref ref-type="bibr" rid="B14">2014</xref>; Bellmund et al., <xref ref-type="bibr" rid="B6">2018</xref>).</p>
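<p>To indicate what dimension reduction of the kind performed by Principal Component Analysis amounts to, the following Python sketch extracts the first principal component of a small synthetic data set. It is a minimal illustration under stated assumptions: the three-dimensional samples are invented, the leading component is approximated by power iteration rather than a full eigendecomposition, and nothing in it is claimed to resemble real brain processes or the models cited above.</p>
<preformat>
```python
import math
import random

def mean_center(rows):
    """Subtract the per-dimension mean from every sample."""
    n, dims = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(dims)]
    return [[r[j] - means[j] for j in range(dims)] for r in rows]

def covariance(rows):
    """Sample covariance matrix of mean-centred rows."""
    n, dims = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(dims)]
            for i in range(dims)]

def first_component(cov, iterations=200):
    """Power iteration: approximates the eigenvector with the largest eigenvalue."""
    v = [1.0] * len(cov)
    for _ in range(iterations):
        w = [sum(c * x for c, x in zip(row, v)) for row in cov]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

# Invented 3D "sensory" samples that vary mostly along the direction (1, 2, 0).
random.seed(0)
samples = []
for _ in range(500):
    t = random.gauss(0.0, 3.0)
    samples.append([t + random.gauss(0.0, 0.1),
                    2.0 * t + random.gauss(0.0, 0.1),
                    random.gauss(0.0, 0.1)])

centered = mean_center(samples)
cov = covariance(centered)
pc1 = first_component(cov)

# One number per sample replaces three: the projection onto the component.
codes = [sum(a * b for a, b in zip(row, pc1)) for row in centered]
explained = (sum(c * c for c in codes) / (len(codes) - 1)) / sum(
    cov[i][i] for i in range(len(cov)))
print(pc1, explained)
```
</preformat>
<p>In this constructed case a single dimension captures almost all of the variance in the samples, which is the sense in which the input is reduced. Real sensory input would of course require many more components and, plausibly, non-linear methods.</p>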
<p>When describing how infants represent objects, Spelke et al. (<xref ref-type="bibr" rid="B75">1992</xref>, 606) suggest the following criteria: (i) <italic>continuity</italic> (objects move in continuous paths), (ii) <italic>solidity</italic> (objects move only on unobstructed paths and therefore different objects do not occupy the same place), (iii) <italic>gravity</italic> (objects fall downwards, if not supported), and (iv) <italic>inertia</italic> (objects do not change their motion abruptly).</p>
<p>Except for solidity, which I have discussed above, these constraints do not concern invariances of objects. The last two are not about objects <italic>per se</italic>, but rather describe the behavior of objects. Furthermore, objects that are <italic>agents</italic> violate the inertia constraint. Surprisingly, the list of criteria proposed by Spelke et al. (<xref ref-type="bibr" rid="B75">1992</xref>) does not contain shape, despite the fact that children&#x00027;s categorizations of objects have a clear shape bias (e.g., Landau et al., <xref ref-type="bibr" rid="B53">1998</xref>; Smith and Samuelson, <xref ref-type="bibr" rid="B72">2006</xref>).</p>
<p>A consequence of the representation of the continuity of objects is <italic>object permanence</italic> (Piaget, <xref ref-type="bibr" rid="B63">1952</xref>), which means that objects are represented as being located at the place where they were last perceived, even if they currently do not produce any sensations. This means that the object is represented (imagined) in the inner world as located at a particular place, even if it is not perceived. The ability to keep an object in mind is not innate; human infants acquire it around 5 months of age (which is later than among other animal species) (Baillargeon and DeVos, <xref ref-type="bibr" rid="B5">1991</xref>).</p>
</sec>
<sec id="s5">
<title>Actions</title>
<p>The third primary category of our perceptions involves actions. Humans are exceptionally efficient at categorizing actions. For example, it is easy to instantly judge whether somebody is walking or jogging, even if the movements of the body parts are rather similar. Furthermore, only a very limited amount of information is needed to make such a categorization. The efficiency of action perception was shown by Johansson in a series of classical perception studies in the 1950s (Johansson, <xref ref-type="bibr" rid="B46">1973</xref>). The patch-light technique that he invented for analysing biological motion contains no direct shape information. Light bulbs were attached to the joints of actors who were dressed in black and moved in a black room. The actors performed different actions such as walking, running, and dancing while being filmed. Subjects who then watched the films saw the movements of the light bulbs (but nothing else). They were able to correctly categorize the actions within a few hundred milliseconds.</p>
<p>Experiments of this kind indicate that seeing the surfaces of the agents performing actions is not necessary for categorizing the actions (Hemeren, <xref ref-type="bibr" rid="B40">2008</xref>). A movie that contains stick figures or only dots moving in the same way is sufficient. These observations give additional support to Johansson&#x00027;s rigidity principle. The question now is what kind of invariances are involved in action categorizations.</p>
<p>Working in the tradition of Gibson, Runesson (<xref ref-type="bibr" rid="B68">1994</xref>, pp. 386&#x02013;387; see also Wolff, <xref ref-type="bibr" rid="B88">2008</xref>) argues that people can directly perceive the forces that generate different types of motion:</p>
<disp-quote><p>&#x0201C;The fact is that we can <italic>see</italic> the weight of an object handled by a person. The fundamental reason we are able to do so is exactly the same as for seeing the size and shape of the person&#x00027;s nose or the color of his shirt in normal illumination, namely that <italic>information</italic> about all these properties is available in the optic array.&#x0201D;</p></disp-quote>
<p>Runesson formulates this as the principle that the kinematics of an action is sufficient to identify the underlying force patterns. For example, the pattern of forces involved in saluting is different from the pattern of forces involved in throwing even if the actions are perceptually rather similar. Johansson and Runesson mainly apply their principles to biological motion. I hypothesize, however, that they can be applied to other forms of action as well. I have argued that the brain extracts the invariances that represent the <italic>forces</italic> that generate different kinds of actions (G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B21">2007</xref>, <xref ref-type="bibr" rid="B22">2014</xref>). The process extracting the invariances is automatic: an individual cannot help perceiving the forces (Wolff, <xref ref-type="bibr" rid="B88">2008</xref>; Wolff and Shepard, <xref ref-type="bibr" rid="B90">2013</xref>; Wolff and Thorstad, <xref ref-type="bibr" rid="B91">2017</xref>). Just as for objects, the space of force patterns can therefore be seen as a perceptual category with a unique structure of similarities and defined by its own class of invariances. Of course, the perception of forces is not perfect; people are prone to illusions, just as in all types of perception (Johansson, <xref ref-type="bibr" rid="B45">1964</xref>, <xref ref-type="bibr" rid="B46">1973</xref>).</p>
<p>An example of an empirical study of force patterns is that of Wang et al. (<xref ref-type="bibr" rid="B84">2004</xref>). Based on data from the walking patterns of humans collected under different conditions and using the methods of Giese et al. (<xref ref-type="bibr" rid="B37">2008</xref>), the force patterns that were extracted were used to calculate the similarity of the different types of walking<xref ref-type="fn" rid="fn0009"><sup>9</sup></xref>.</p>
<p>A particular action is, of course, performed by a particular agent (a special kind of object) at a particular place. For the categorization of an action, however, a central invariance is that only the forces, but <italic>not</italic> the individuals or objects performing the action, are involved in the representation of the action. More generally, patterns of forces should be considered since several body parts are typically involved; and several force vectors are consequently interacting. This is analogous to Marr and Vaina&#x00027;s (<xref ref-type="bibr" rid="B58">1982</xref>) differential equations for actions. Such force patterns form the invariances that I submit generate the structure of action categories. However, the invariances that apply to actions are neither the same as those for objects, nor the same as those for space. To wit, the patterns for actions are neither dependent on the location of the acting object, nor on its object properties such as color or weight. This means that the objects and places are fungible with respect to actions and thus that the action category is independent of the object and space categories. In line with the situation for space and objects, the force patterns determined by the invariances involve a considerable reduction in dimensions. However, the empirical data concerning how actions are perceived are still limited, so the precise structure of action space should be further investigated.</p>
<p>Human understanding of actions, however, does not only involve physical movements and their underlying forces, but often also the <italic>intention</italic> behind the action. For example, &#x0201C;blink&#x0201D; and &#x0201C;wink&#x0201D; cover the same kinds of physical eye movements, but the second action is intentional. Accounting for the intentionality of actions also involves representations of a <italic>goal space</italic> in the agent that is attributing the intention (G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B22">2014</xref>, pp. 194&#x02013;197). It might be argued that such a goal space should also be included among the primary cognitive categories. The main reason for not counting the goal space among the primary categories is that representing intentional actions presumes the capacity to represent actions. This position is supported by recent experiments by Ganglmayer et al. (<xref ref-type="bibr" rid="B20">2019</xref>). In contrast to what has been claimed previously (Woodward, <xref ref-type="bibr" rid="B92">2009</xref>, <xref ref-type="bibr" rid="B93">2013</xref>), their results indicate that 11- to 12-month-old infants anticipate the movement path rather than the goal of an action.</p>
<p>To sum up: The three basic categories place, object and action are mutually fungible relative to each other. As a consequence these three categories are, to a large extent, cognitively independent: Space can be characterized independently of the objects and actions present; objects can be characterized independently of where they are located and which actions are performed on them; and actions can be characterized independently of where they are performed and who (what) performs them. These mutual invariances support my thesis that they form independent primary categories for our cognitive processes.</p>
<p>Following the strategy in Breidbach and Jost (<xref ref-type="bibr" rid="B8">2006</xref>), <italic>sub-categories</italic> can then be identified by adding the relevant invariances that characterize them. I have already mentioned the distinction between rigid and non-rigid objects, where the rigid objects are characterized by all distances between points on an object being invariant over time. Another example is the distinction between agents and non-agents, where agents are characterized as being objects that are capable of exerting forces. This distinction will be relevant for the model of events that will be presented below.</p>
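<p>The rigidity invariance can be illustrated with a minimal computational sketch. It assumes, purely for illustration, that an object is sampled as a set of marked points observed in successive frames; the function names <italic>pairwise_distances</italic> and <italic>is_rigid</italic> are hypothetical and do not come from the cited work. Under these assumptions, an object counts as rigid if all distances between its points stay the same across the frames:</p>
<preformat>
```python
import math
from itertools import combinations


def pairwise_distances(points):
    """All distances between pairs of marked points on an object at one moment."""
    return [math.dist(p, q) for p, q in combinations(points, 2)]


def is_rigid(frames, tolerance=1e-9):
    """True if every pairwise distance stays the same across all frames.

    `frames` is a list of point lists; the points must appear in the same
    order in every frame (an assumption of this sketch).
    """
    reference = pairwise_distances(frames[0])
    return all(
        math.isclose(d, r, abs_tol=tolerance)
        for frame in frames[1:]
        for d, r in zip(pairwise_distances(frame), reference)
    )


# A triangle that is translated and then rotated by 90 degrees.
triangle = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
translated = [(2.0, 3.0), (3.0, 3.0), (2.0, 4.0)]
rotated = [(0.0, 0.0), (0.0, 1.0), (-1.0, 0.0)]

# The same triangle with one corner pulled outwards.
stretched = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0)]
```
</preformat>
<p>In this sketch, the translated and rotated triangles pass the test, since location and orientation are fungible with respect to the distances, whereas the stretched triangle does not.</p>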
<p>The primary categories show up in the structure of language, in particular in how it divides words into classes. G&#x000E4;rdenfors (<xref ref-type="bibr" rid="B22">2014</xref>, <xref ref-type="bibr" rid="B23">2018</xref>) has argued in some detail that semantic representations of nouns build on the category of objects, and that verbs build on actions. Furthermore, many prepositions express spatial relations. Different languages have different word classes, but all of them have means to denote objects, actions, and spatial relations. This universality of linguistic structure is a further indication that these categories are indeed cognitively primary.</p>
</sec>
<sec id="s6">
<title>Events</title>
<p>Even though space, object and actions form categorical structures that are determined by separate sets of invariances, it is obvious that there are interactions between these categories. They are all parts of <italic>events</italic>. Therefore, I suggest events as an overarching category for combining different perceptual categories (see also Strickland, <xref ref-type="bibr" rid="B77">2017</xref>). Already Gibson (<xref ref-type="bibr" rid="B36">1979</xref>, 100) describes events as primary realities. There is an extensive amount of research on how children&#x00027;s event cognition develops (e.g., Radvansky and Zacks, <xref ref-type="bibr" rid="B67">2014</xref>, Ch. 10; Papafragou, <xref ref-type="bibr" rid="B62">2015</xref>).</p>
<p>The cognitive structure of events is relational, gluing together objects, actions and locations. In earlier work (G&#x000E4;rdenfors and Warglien, <xref ref-type="bibr" rid="B29">2012</xref>; Warglien et al., <xref ref-type="bibr" rid="B85">2012</xref>; G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B22">2014</xref>), I have suggested an approach to event categorization based on some geometric notions. The key idea is to represent event structures in terms of conceptual spaces&#x02014;one for actions and one for results&#x02014;and <italic>mappings</italic> between these spaces (see <xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>The main components of an event representation.</p></caption>
<graphic xlink:href="fpsyg-11-584017-g0001.tif"/>
</fig>
<p>Following the previous section, the action space is represented as a space of forces (or force patterns) acting upon some object. As mentioned above, I view non-intentional actions as primary. Modeling intentional actions would require adding a goal space to represent the aim of the action. The result space of the event represents changes in the properties of the target. This space can therefore be modeled as a vector space where the two ends of a result vector represent the properties of the object acted upon before and after the action<xref ref-type="fn" rid="fn0010"><sup>10</sup></xref>. The results of actions are typically changes of location (that is, the space category) or changes of object properties. For example, when Donald pushes the table, the agent Donald exerts a force vector (action) on the table that leads to a change of the position of the table (result). Or in the event of heavy rain undermining a road, the force of the rain (action) leads to a change of the shape property of the road (result). More complicated to represent mathematically are events of breaking or dividing when the object acted upon changes into two or more, and events of construction where different objects are combined into a new one<xref ref-type="fn" rid="fn0011"><sup>11</sup></xref>.</p>
<p>A consequence of characterizing an event as a combination of an action space and a result space is that the time domain is not defining for events, but it emerges from the relations between the components of an event. This position contrasts with, for example, Zacks and Tversky (<xref ref-type="bibr" rid="B95">2001</xref>) who focus on the temporal structure of events, in particular on how events are segmented. It is often suggested that cognitive representations of events presuppose representing time (Radvansky and Zacks, <xref ref-type="bibr" rid="B67">2014</xref>; Hoerl and McCormack, <xref ref-type="bibr" rid="B42">2019</xref>). For example, Zacks and Tversky (<xref ref-type="bibr" rid="B95">2001</xref>, p. 3) write that an archetypical event is &#x0201C;a segment of time at a given location that is conceived by an observer to have a beginning and an end.&#x0201D; If this were correct, time would also be a primary category. Of course, the circadian system of our bodies in a sense represents the day and night cycle<xref ref-type="fn" rid="fn0012"><sup>12</sup></xref>. However, this system is inflexible and not involved in our representations of events. In fact, there is linguistic evidence that indicates that time is not a primary cognitive category: The abstract time dimension is not used by all human societies but it is the product of cultural systems for measuring time intervals, and hence time is a socio-historical construction (Sinha and G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B71">2014</xref>). For example, the South American language Amondawa does not have an explicit representation of time. This language employs a time interval system that represents seasonal and diurnal events, but it has no calendric terms, including terms such as month and year. Furthermore, children understand events earlier in their development than they understand time as a separate dimension.
The model of events presented here does not explicitly represent the time dimension. However, temporality is implicit in the model since actions and events are dynamic entities&#x02014;they unfold over time.</p>
<p>The action space and the result space represent different categories: forces have a different nature than changes in object properties. In the limiting case when the result vector is the null vector, that is, when nothing changes, the event is a <italic>state</italic>. As can be seen from this two-vector model of events, it combines the three primary categories of objects, actions and physical space into a relational structure: An event can be characterized as a mapping from an action on an object to a result.</p>
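<p>As a minimal sketch of the two-vector structure, an event can be written as a small data type that pairs a force vector with the change it brings about in the patient. Everything in the sketch is an illustrative assumption: the class name <italic>Event</italic>, its field names, the choice of a two-dimensional position as the only property of the patient, and the numerical values.</p>
<preformat>
```python
from dataclasses import dataclass
from typing import Tuple

Vector = Tuple[float, ...]


@dataclass(frozen=True)
class Event:
    """Hypothetical two-vector event: a force on a patient and the resulting change."""

    patient: str
    action: Vector  # force vector exerted on the patient
    before: Vector  # patient properties (here: position) before the action
    after: Vector   # patient properties after the action

    @property
    def result(self) -> Vector:
        """Result vector: the change from the 'before' point to the 'after' point."""
        return tuple(a - b for a, b in zip(self.after, self.before))

    @property
    def is_state(self) -> bool:
        """A null result vector means that nothing changes, so the event is a state."""
        return all(component == 0 for component in self.result)


# Pushing a table that moves, and pushing a table that stays where it is.
push = Event("table", action=(5.0, 0.0), before=(0.0, 0.0), after=(1.5, 0.0))
rest = Event("table", action=(5.0, 0.0), before=(0.0, 0.0), after=(0.0, 0.0))
```
</preformat>
<p>Here <italic>push.result</italic> is the displacement of the table, while <italic>rest.is_state</italic> holds because the result vector is the null vector. Note that the sketch has no field for time; temporality enters only through the difference between the before and after points.</p>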
<p>In linguistics, the target entity of the event is called the <italic>patient</italic>. The object that creates the action vector is called the <italic>agent</italic>. The concept of an agent thus combines the object category with the action category. There exist, however, events without agents, for example events of falling, drowning, dying, growing and raining. An event may also include other &#x0201C;thematic roles&#x0201D; (Dowty, <xref ref-type="bibr" rid="B13">1991</xref>), such as recipient and instrument, but they are not components of all events.</p>
<p>As for actions, a particular event is, of course, caused by a particular agent (a special kind of object) at a particular place. An event category is, in general, invariant with respect to the location where the event takes place and the object (patient) on which the action is performed. G&#x000E4;rdenfors and Warglien (<xref ref-type="bibr" rid="B29">2012</xref>) define an event category as a structure (product space) that represents the mapping from the action space to the result space. An example is the event category of pushing a table, which is constituted by the force (exerted by some agent, human or non-human) on the table resulting in a movement (change in space) of the table.</p>
<p><italic>Causal relations</italic> can also be represented using the event structure (Wolff, <xref ref-type="bibr" rid="B87">2007</xref>, <xref ref-type="bibr" rid="B88">2008</xref>; G&#x000E4;rdenfors and Warglien, <xref ref-type="bibr" rid="B29">2012</xref>; G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B25">2020a</xref>; G&#x000E4;rdenfors and Lombard, <xref ref-type="bibr" rid="B28">2020</xref>): The action causes the result. Most accounts of causation analyse the relation between the action and the effect as a relation between two events (see e.g., Zacks and Tversky, <xref ref-type="bibr" rid="B95">2001</xref>; Radvansky and Zacks, <xref ref-type="bibr" rid="B67">2014</xref>). In contrast, the model presented here views causation as a relation <italic>within</italic> an event by introducing a distinction between forces and changes of states (cf. Wolff, <xref ref-type="bibr" rid="B87">2007</xref>, <xref ref-type="bibr" rid="B88">2008</xref>, <xref ref-type="bibr" rid="B89">2012</xref>; Wolff and Thorstad, <xref ref-type="bibr" rid="B91">2017</xref>). In contrast to many other theories, causes and effects are not treated as symmetrical entities: they belong to different categories&#x02014;causes to the forces that are applied on objects and results to change in location (in the case of movements) or in some property of objects (color, size, weight, temperature, etc.).</p>
<p>The characteristic part of an event is the mapping between the force space and the result space. For example, pushing a table sometimes results in the table moving, sometimes not; aiming a dart at the bull&#x00027;s eye sometimes hits it, sometimes not. In such cases the mapping between the force vector and the result vector represents two different events. G&#x000E4;rdenfors et al. (<xref ref-type="bibr" rid="B27">2018</xref>) analyse three general constraints on event mappings:</p>
<list list-type="roman-lower">
<list-item><p>Larger forces lead to larger results (monotonicity constraint).</p></list-item>
<list-item><p>Small changes in the force lead to small changes of the result (continuity constraint).</p></list-item>
<list-item><p>Intermediate results are caused by intermediate forces (convexity preserving constraint).</p></list-item>
</list>
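<p>The three constraints listed above can be made concrete with a small numerical sketch. It rests on assumptions that are not part of the cited analysis: the mapping <italic>push_result</italic> is a hypothetical linear mapping from a two-dimensional force to a two-dimensional displacement, the <italic>resistance</italic> parameter is invented for illustration, and each constraint is only checked on sample points rather than proved. The third check tests the stronger condition that holds for linear mappings, namely that a weighted mixture of two forces yields the same weighted mixture of their results.</p>
<preformat>
```python
import math


def push_result(force, resistance=2.0):
    """Hypothetical event mapping: a 2D force vector yields a 2D displacement."""
    return tuple(component / resistance for component in force)


def norm(vector):
    return math.hypot(*vector)


def distance(a, b):
    return norm(tuple(x - y for x, y in zip(a, b)))


def blend(a, b, weight):
    """Weighted mixture of two vectors, with `weight` on the first one."""
    return tuple(weight * x + (1 - weight) * y for x, y in zip(a, b))


def is_monotone(mapping, direction, magnitudes):
    """Larger forces along one direction should give results at least as large."""
    sizes = [
        norm(mapping(tuple(m * d for d in direction)))
        for m in sorted(magnitudes)
    ]
    return all(later >= earlier for earlier, later in zip(sizes, sizes[1:]))


def is_continuous_at(mapping, force, step=1e-6, tolerance=1e-3):
    """A tiny change in the force should give only a tiny change in the result."""
    nudged = tuple(component + step for component in force)
    return tolerance >= distance(mapping(force), mapping(nudged))


def preserves_betweenness(mapping, force_a, force_b, weight=0.5, tolerance=1e-9):
    """An intermediate force should give the matching intermediate result."""
    mixed_result = mapping(blend(force_a, force_b, weight))
    expected = blend(mapping(force_a), mapping(force_b), weight)
    return tolerance >= distance(mixed_result, expected)
```
</preformat>
<p>With these definitions, the linear push mapping passes all three checks, whereas a mapping whose result oscillates with the size of the force fails the monotonicity check. Sampling-based checks of this kind can only falsify a constraint for a given mapping; they cannot establish it in general.</p>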
<p>Even though it is not the aim of the article to propose computational models of how various forms of invariances can be used in cognitive systems, the event model lends itself to some recommendations for how such models can be constructed [for more details, see G&#x000E4;rdenfors (<xref ref-type="bibr" rid="B24">2019</xref>, <xref ref-type="bibr" rid="B26">2020b</xref>) and G&#x000E4;rdenfors et al. (<xref ref-type="bibr" rid="B30">2019</xref>)]. There exist several efficient methods for constructing a computational model of space from video, laser range and other forms of input (see e.g., Wyeth and Milford, <xref ref-type="bibr" rid="B94">2009</xref>). Recent advances in deep learning have also led to good methods for object categorization (see e.g., Zhao et al., <xref ref-type="bibr" rid="B96">2017</xref>). It should be noted that these methods depend on the <italic>appearance</italic> of the objects. For robotic interaction with objects, however, these aspects are not the most important. Gibson (<xref ref-type="bibr" rid="B36">1979</xref>, Ch. 8) writes that &#x0201C;what we perceive when we look at objects are their affordances, not their qualities.&#x0201D; In other words, it is what we can <italic>do</italic> with objects that matters, not how they look. Shanahan et al. (<xref ref-type="bibr" rid="B70">2020</xref>) discuss this problem. As an example, they take the concept of a &#x0201C;container&#x0201D; that is central to much human interaction with the world. The appearance of containers can vary widely, but it is their affordances that are crucial for how we interact with them. There seems to be no good model of how to capture the affordances of objects from, say, a video stream (Shanahan et al., <xref ref-type="bibr" rid="B70">2020</xref>). As regards actions, they are understudied in robotics. The attempts have focused on the results of actions. For example, the algorithms for learning verb meanings developed by Kalkan et al.
(<xref ref-type="bibr" rid="B47">2014</xref>) are based on &#x0201C;affordance relations&#x0201D; between entities, behaviors, and effects. Most attempts to computationally categorize actions in terms of manner have been based on stored data, but Gharaee et al. (<xref ref-type="bibr" rid="B34">2017b</xref>) present an online, real-time algorithm.</p>
<p>There thus exists partly successful work in computer science and robotics that generates models of each of the basic cognitive categories space, objects and action. However, there are very few models of how to combine these models to generate representations of events. The one that comes closest to the approach presented here is Hinaut and Dominey&#x00027;s (<xref ref-type="bibr" rid="B41">2013</xref>) model of &#x0201C;reservoir computing.&#x0201D; In G&#x000E4;rdenfors (<xref ref-type="bibr" rid="B26">2020b</xref>), I make a programmatic attempt to describe a computational approach to events and illustrate it with a partial implementation, based on reservoir computing, in an iCub robot (Mealier et al., <xref ref-type="bibr" rid="B59">2016</xref>).</p>
<p>Finally, a comment on the relation between event representations and language. G&#x000E4;rdenfors (<xref ref-type="bibr" rid="B22">2014</xref>) has argued that a declarative sentence typically describes the main components of an event. This thesis provides an explanation of why sentences are natural units in language. The event structure connects naturally to the core &#x0201C;thematic roles&#x0201D; (agent, patient, recipient, instrument, cause and effect) that help children understand how sentences are constructed and what their meanings are. For example, Papafragou (<xref ref-type="bibr" rid="B62">2015</xref>, 338) compares how speakers of Greek and English describe events and she concludes that basic patterns in event perception are independent of the language one speaks. Another example is Fernandes et al. (<xref ref-type="bibr" rid="B16">2006</xref>) who show that toddlers already in their third year have an understanding of the abstract categories &#x0201C;agent&#x0201D; and &#x0201C;patient.&#x0201D;</p>
</sec>
<sec id="s7">
<title>Number</title>
<p>Another cognitive category is <italic>number</italic>. Even though I do not view it as primary, I will discuss it briefly since it belongs to the core knowledge domains proposed by Spelke (<xref ref-type="bibr" rid="B73">2000</xref>, <xref ref-type="bibr" rid="B74">2004</xref>) and Carey (<xref ref-type="bibr" rid="B9">2009</xref>). Theories of number cognition distinguish between magnitude (&#x0201C;a large bag of beans&#x0201D;), numerosity (&#x0201C;many sheep&#x0201D;) and number (&#x0201C;five cows&#x0201D;) (Gemel and Quinon, <xref ref-type="bibr" rid="B32">2019</xref>). The underlying cognitive processes are divided into two subsystems that handle approximate magnitudes and discrete numbers respectively (Dehaene, <xref ref-type="bibr" rid="B11">1996</xref>). Non-human animals have an approximate number system that allows them to estimate the relative magnitude of two collections, sometimes with surprising precision (Gallistel, <xref ref-type="bibr" rid="B19">1990</xref>). The discrete number system is used only by humans and it must be learned. There exist human cultures, for example the Amazonian tribe of Pirah&#x000E3;, who do not have a discrete number system (Everett, <xref ref-type="bibr" rid="B15">2017</xref>). Thus, like time, number is a cultural construct and not as fundamental cognitively as the space, object and action categories are. This goes against Spelke&#x00027;s and Carey&#x00027;s position that number is a core knowledge domain.</p>
<p>Nevertheless, approximate as well as discrete numbers are governed by invariances (Harbour, <xref ref-type="bibr" rid="B38">2014</xref>). When judging the invariances that determine the categories of numbers, it should first be noted that number is a property of a <italic>collection</italic>. Collections form an abstract type of objects that can have different properties. Some such properties are shared by physical objects, for example weight and location: &#x0201C;These beans weigh 500 grams.&#x0201D; &#x0201C;The radishes are in the plastic bowl in the fridge.&#x0201D; Many properties are, however, unique to collections: For example, collections can be ordered or unordered, uniform (consisting of the same type of objects) or mixed, dense or spread out. In particular, collections have <italic>cardinality</italic>, that is, they contain a certain number of elements. The cardinality of a finite collection is expressed by a natural number.</p>
<p>Numerical invariances of collections have been studied extensively in developmental psychology (e.g., Gelman and Gallistel, <xref ref-type="bibr" rid="B31">1986</xref>; Fuson, <xref ref-type="bibr" rid="B18">1988</xref>; Sarnecka and Carey, <xref ref-type="bibr" rid="B69">2008</xref>). In a series of early experiments concerning &#x0201C;conservation tasks,&#x0201D; Piaget (<xref ref-type="bibr" rid="B63">1952</xref>) tested children in order to understand which properties of collections they perceive as being invariant. In one experiment, two equinumerous collections of objects, for example marbles, are placed into two parallel lines that are equally long. Then the objects in one line are spread out. A child that has not understood cardinality will say that there are more objects in the longer line. Failing the Piaget conservation tasks means that a child has not understood that a number is a property of a collection that is invariant under changes of its spatial layout (see Gelman and Gallistel, <xref ref-type="bibr" rid="B31">1986</xref>). In other words, number is fungible with respect to the location of the objects in a collection.</p>
<p>The characteristic invariance of the number category is, however, the <italic>fungibility of objects</italic>: If an object in a collection is exchanged for another object, the collection will still contain the same number of objects. Other properties of collections do not fulfill this criterion: If an object (an apple, say) replaces one of the objects in a uniform collection (of oranges, say), the resulting collection is not uniform any more.</p>
<p>The number of elements of a collection is also, to a large extent, invariant under actions, at least in the sense that independently of what kinds of actions the elements perform (for example, the movements of a football team), their number will still be the same. Similarly, number is typically invariant under actions performed on the objects (as long as the actions do not destroy the objects).</p>
</sec>
<sec sec-type="conclusions" id="s8">
<title>Conclusion</title>
<p>In this article I have used the notion of invariances to explain why the categories of space, object and action are fundamental cognitive structures. In philosophical terms this is a version of the neo-Kantian program of describing the &#x0201C;Anschauungsformen&#x0201D; of our perception. The analysis of the primary categories in terms of invariances can be seen as an explanation of such forms of perception. As a part of the explanation, an evolutionary perspective connects the categories to the success of the activities of an organism. I have also argued that all three categories are central elements in the more abstract category of events. The analysis of the category of numbers that I have presented indicates that, also for non-primary categories, different forms of invariances can be used to characterize a category.</p>
<p>Although the evidence for the invariances that I have presented in this article comes mainly from experiments with human subjects, the perceptual systems of, at least, mammals are sufficiently similar to warrant the conclusion that space and objects are also primary categories for them. As regards actions (and, consequently, events), the situation is less clear<xref ref-type="fn" rid="fn0013"><sup>13</sup></xref>.</p>
<p>The enterprise of identifying cognitively primary categories is not only of philosophical and psychological interest. It leads to new questions for cognitive neuroscience. The most pressing one concerns how the invariances are picked up by the brain (e.g., Nau et al., <xref ref-type="bibr" rid="B61">2018</xref>). Understanding these processes may help us understand the foundations of how we perceive the world. I have presented some results concerning how brain processes utilize invariances in creating cognitive representations, but this field has much more to analyse. New perspectives concerning invariances may be used to generate new hypotheses concerning how the brain handles primary categories and to generate new ideas for the architecture of computational and robotic systems that reason about the world and act in it.</p>
</sec>
<sec id="s9">
<title>Author Contributions</title>
<p>The author confirms being the sole contributor of this work and has approved it for publication.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abdi</surname> <given-names>H.</given-names></name> <name><surname>Williams</surname> <given-names>L. J.</given-names></name></person-group> (<year>2010</year>). <article-title>Principal component analysis</article-title>. <source>Wiley Interdiscipl. Rev. Comput. Stat.</source> <volume>2</volume>, <fpage>433</fpage>&#x02013;<lpage>459</lpage>. <pub-id pub-id-type="doi">10.1002/wics.101</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Agrawal</surname> <given-names>P.</given-names></name> <name><surname>Carreira</surname> <given-names>J.</given-names></name> <name><surname>Malik</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <article-title>Learning to see by moving</article-title>. <source>IEEE Int. Conf. Comput. Vis.</source> <volume>2015</volume>, <fpage>37</fpage>&#x02013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1109/ICCV.2015.13</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname> <given-names>J. R.</given-names></name> <name><surname>Reder</surname> <given-names>L. M.</given-names></name> <name><surname>Simon</surname> <given-names>H. A.</given-names></name> <name><surname>Ericsson</surname> <given-names>K. A.</given-names></name> <name><surname>Glaser</surname> <given-names>R.</given-names></name></person-group> (<year>1998</year>). <article-title>Radical constructivism and cognitive psychology</article-title>. <source>Brookings Papers Educ. Policy</source> <volume>1</volume>, <fpage>227</fpage>&#x02013;<lpage>278</lpage>.</citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anselmi</surname> <given-names>F.</given-names></name> <name><surname>Leibo</surname> <given-names>J. Z.</given-names></name> <name><surname>Rosasco</surname> <given-names>L.</given-names></name> <name><surname>Mutch</surname> <given-names>J.</given-names></name> <name><surname>Tacchetti</surname> <given-names>A.</given-names></name> <name><surname>Poggio</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <article-title>Unsupervised learning of invariant representations</article-title>. <source>Theor. Comput. Sci</source>. <volume>633</volume>, <fpage>112</fpage>&#x02013;<lpage>121</lpage>. <pub-id pub-id-type="doi">10.1016/j.tcs.2015.06.048</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baillargeon</surname> <given-names>R.</given-names></name> <name><surname>DeVos</surname> <given-names>J.</given-names></name></person-group> (<year>1991</year>). <article-title>Object permanence in young infants: further evidence</article-title>. <source>Child Dev</source>. <volume>62</volume>, <fpage>1227</fpage>&#x02013;<lpage>1246</lpage>. <pub-id pub-id-type="doi">10.2307/1130803</pub-id><pub-id pub-id-type="pmid">1786712</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bellmund</surname> <given-names>J.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Moser</surname> <given-names>E.</given-names></name> <name><surname>Doeller</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>Navigating cognition: spatial codes for human thinking</article-title>. <source>Science</source> <volume>362</volume>:<fpage>eaat6766</fpage>. <pub-id pub-id-type="doi">10.1126/science.aat6766</pub-id><pub-id pub-id-type="pmid">30409861</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Borg</surname> <given-names>I.</given-names></name> <name><surname>Groenen</surname> <given-names>P. J.</given-names></name></person-group> (<year>2005</year>). <source>Modern Multidimensional Scaling: Theory and Applications</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer Science and Business Media</publisher-name>.</citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breidbach</surname> <given-names>O.</given-names></name> <name><surname>Jost</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>On the gestalt concept</article-title>. <source>Theory Biosci.</source> <volume>125</volume>, <fpage>19</fpage>&#x02013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1016/j.thbio.2006.02.001</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Carey</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <source>The Origin of Concepts.</source> <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="B10">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Cutting</surname> <given-names>J. E.</given-names></name></person-group> (<year>1986</year>). <source>Perception With an Eye for Motion</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="B11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dehaene</surname> <given-names>S.</given-names></name></person-group> (<year>1996</year>). <source>The Number Sense. How the Mind Creates Mathematics</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>di Pellegrino</surname> <given-names>G.</given-names></name> <name><surname>L&#x000E0;davas</surname> <given-names>E.</given-names></name></person-group> (<year>2015</year>). <article-title>Peripersonal space in the brain</article-title>. <source>Neuropsychologia</source> <volume>66</volume>, <fpage>126</fpage>&#x02013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuropsychologia.2014.11.011</pub-id><pub-id pub-id-type="pmid">25448862</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dowty</surname> <given-names>D.</given-names></name></person-group> (<year>1991</year>). <article-title>Thematic proto-roles and argument selection</article-title>. <source>Language</source> <volume>67</volume>, <fpage>547</fpage>&#x02013;<lpage>619</lpage>. <pub-id pub-id-type="doi">10.1353/lan.1991.0021</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eichenbaum</surname> <given-names>H.</given-names></name> <name><surname>Cohen</surname> <given-names>N. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Can we reconcile the declarative memory and spatial navigation views on hippocampal function?</article-title> <source>Neuron</source> <volume>83</volume>, <fpage>764</fpage>&#x02013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2014.07.032</pub-id><pub-id pub-id-type="pmid">25144874</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Everett</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <source>Numbers and the Making of Us: Counting and the Course of Human Cultures</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>. <pub-id pub-id-type="doi">10.4159/9780674979185</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fernandes</surname> <given-names>K. J.</given-names></name> <name><surname>Marcus</surname> <given-names>G. F.</given-names></name> <name><surname>Di Nubila</surname> <given-names>J. A.</given-names></name> <name><surname>Vouloumanos</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>From semantics to syntax and back again: argument structure in the third year of life</article-title>. <source>Cognition</source> <volume>100</volume>, <fpage>B10</fpage>&#x02013;<lpage>B20</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2005.08.003</pub-id><pub-id pub-id-type="pmid">16289066</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fiorini</surname> <given-names>S. R.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Abel</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Representing part&#x02013;whole relations in conceptual spaces</article-title>. <source>Cogn. Process.</source> <volume>15</volume>, <fpage>127</fpage>&#x02013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.1007/s10339-013-0585-x</pub-id><pub-id pub-id-type="pmid">24146391</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Fuson</surname> <given-names>K. C.</given-names></name></person-group> (<year>1988</year>). <source>Children&#x00027;s Counting and Concepts of Number</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-1-4612-3754-9</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gallistel</surname> <given-names>C. R.</given-names></name></person-group> (<year>1990</year>). <source>The Organization of Learning</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ganglmayer</surname> <given-names>K.</given-names></name> <name><surname>Attig</surname> <given-names>M.</given-names></name> <name><surname>Daum</surname> <given-names>M. M.</given-names></name> <name><surname>Paulus</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>Infants&#x00027; perception of goal-directed actions: a multi-lab replication reveals that infants anticipate paths and not goals</article-title>. <source>Infant Behav. Dev.</source> <volume>57</volume>:<fpage>101340</fpage>. <pub-id pub-id-type="pmid">31387059</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). <article-title>Representing actions and functional properties in conceptual spaces</article-title>, in <source>Body, Language and Mind, Volume 1: Embodiment</source>, eds T. Ziemke, J. Zlatev, and R. M. Frank (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Mouton de Gruyter</publisher-name>), <fpage>167</fpage>&#x02013;<lpage>195</lpage>.</citation></ref>
<ref id="B22">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2014</year>). <source>Geometry of Meaning: Semantics Based on Conceptual Spaces</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>. <pub-id pub-id-type="doi">10.7551/mitpress/9629.001.0001</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>From sensations to concepts: a proposal for two learning processes</article-title>. <source>Rev. Philos. Psychol.</source> <volume>10</volume>, <fpage>441</fpage>&#x02013;<lpage>464</lpage>. <pub-id pub-id-type="doi">10.1007/s13164-017-0379-7</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Using event representations to generate robot semantics</article-title>. <source>ACM Trans. Hum. Robot Interact.</source> <volume>8</volume>, <fpage>1</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1145/3341167</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2020a</year>). <article-title>Events and causal mappings modeled in conceptual spaces</article-title>. <source>Front. Psychol.</source> <volume>11</volume>:<fpage>630</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2020.00630</pub-id><pub-id pub-id-type="pmid">32373016</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2020b</year>). <article-title>An epigenetic approach to semantic domains</article-title>. <source>IEEE Trans. Cogn. Dev. Syst.</source> <volume>12</volume>, <fpage>139</fpage>&#x02013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1109/TCDS.2018.2833387</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Jost</surname> <given-names>J.</given-names></name> <name><surname>Warglien</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>From actions to events: three constraints on event mappings</article-title>. <source>Front. Psychol.</source> <volume>9</volume>:<fpage>1391</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2018.01391</pub-id><pub-id pub-id-type="pmid">30154745</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Lombard</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Technology led to more abstract causal reasoning</article-title>. <source>Biol. Philos.</source> <volume>35</volume>, <fpage>1</fpage>&#x02013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1007/s10539-020-09757-z</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Warglien</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Using conceptual spaces to model actions and events</article-title>. <source>J. Semant.</source> <volume>29</volume>, <fpage>487</fpage>&#x02013;<lpage>519</lpage>. <pub-id pub-id-type="doi">10.1093/jos/ffs007</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x000E4;rdenfors</surname> <given-names>P. M.</given-names></name> <name><surname>Williams</surname> <given-names>M.-A.</given-names></name> <name><surname>Johnston</surname> <given-names>B.</given-names></name> <name><surname>Billingsley</surname> <given-names>R.</given-names></name> <name><surname>Vitale</surname> <given-names>J.</given-names></name> <name><surname>Peppas</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Event boards as tools for holistic AI</article-title>, in <source>Proceedings of the 6th International Workshop on Artificial Intelligence and Cognition, CEUR Workshop Proceedings, Vol. 2418</source>, eds A. Chella, I. Infantino, and A. Lieto (Aachen), <fpage>1</fpage>&#x02013;<lpage>10</lpage>.</citation></ref>
<ref id="B31">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>R.</given-names></name> <name><surname>Gallistel</surname> <given-names>C. R.</given-names></name></person-group> (<year>1986</year>). <source>The Child&#x00027;s Understanding of Number</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>.<pub-id pub-id-type="pmid">11699675</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gemel</surname> <given-names>A.</given-names></name> <name><surname>Quinon</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Magnitude and number sensitivity of the approximate number system in conceptual spaces</article-title>, in <source>Conceptual Spaces: Elaborations and Applications</source>, eds M. Kaipainen, F. Zenker, A. Hautam&#x000E4;ki, and P. G&#x000E4;rdenfors (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer Nature</publisher-name>), <fpage>183</fpage>&#x02013;<lpage>203</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-030-12800-5_10</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gharaee</surname> <given-names>Z.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Johnsson</surname> <given-names>M.</given-names></name></person-group> (<year>2017a</year>). <article-title>First and second order dynamics in a hierarchical SOM system for action recognition</article-title>. <source>Appl. Soft Comput.</source> <volume>59</volume>, <fpage>574</fpage>&#x02013;<lpage>585</lpage>. <pub-id pub-id-type="doi">10.1016/j.asoc.2017.06.007</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gharaee</surname> <given-names>Z.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Johnsson</surname> <given-names>M.</given-names></name></person-group> (<year>2017b</year>). <article-title>Online recognition of actions involving objects</article-title>. <source>Biol. Inspired Cogn. Architectures</source> <volume>22</volume>, <fpage>10</fpage>&#x02013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1016/j.bica.2017.09.007</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gibson</surname> <given-names>J. J.</given-names></name></person-group> (<year>1966</year>). <source>The Senses Considered as Perceptual Systems</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Houghton Mifflin</publisher-name>.</citation></ref>
<ref id="B36">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gibson</surname> <given-names>J. J.</given-names></name></person-group> (<year>1979</year>). <source>The Ecological Approach to Visual Perception</source>. <publisher-loc>Hillsdale, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum</publisher-name>.</citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giese</surname> <given-names>M.</given-names></name> <name><surname>Thornton</surname> <given-names>I.</given-names></name> <name><surname>Edelman</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Metrics of the perception of body movement</article-title>. <source>J. Vis</source>. <volume>8</volume>, <fpage>1</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1167/8.9.13</pub-id><pub-id pub-id-type="pmid">18831649</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harbour</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Paucity, abundance, and the theory of number</article-title>. <source>Language</source> <volume>90</volume>, <fpage>185</fpage>&#x02013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1353/lan.2014.0003</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Held</surname> <given-names>R.</given-names></name> <name><surname>Hein</surname> <given-names>A.</given-names></name></person-group> (<year>1963</year>). <article-title>Movement-produced stimulation in the development of visually guided behavior</article-title>. <source>J. Comp. Physiol. Psychol</source>. <volume>56</volume>, <fpage>872</fpage>&#x02013;<lpage>876</lpage>. <pub-id pub-id-type="doi">10.1037/h0040546</pub-id><pub-id pub-id-type="pmid">14050177</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hemeren</surname> <given-names>P. E.</given-names></name></person-group> (<year>2008</year>). <source>Mind in Action</source>. <publisher-loc>Lund: Lund University Cognitive Studies</publisher-loc>. p. <fpage>140</fpage>.</citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hinaut</surname> <given-names>X.</given-names></name> <name><surname>Dominey</surname> <given-names>P. F.</given-names></name></person-group> (<year>2013</year>). <article-title>Real-time parallel processing of grammatical structure in the fronto-striatal system: a recurrent network simulation study using reservoir computing</article-title>. <source>PLoS ONE</source> <volume>8</volume>:<fpage>e52946</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0052946</pub-id><pub-id pub-id-type="pmid">23383296</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoerl</surname> <given-names>C.</given-names></name> <name><surname>McCormack</surname> <given-names>T.</given-names></name></person-group> (<year>2019</year>). <article-title>Thinking in and about time: a dual systems perspective on temporal cognition</article-title>. <source>Behav. Brain Sci.</source> <volume>42</volume>, <fpage>1</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1017/S0140525X18002157</pub-id><pub-id pub-id-type="pmid">30251619</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hoffman</surname> <given-names>D. D.</given-names></name></person-group> (<year>2019</year>). <source>The Case Against Reality</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>W. W. Norton</publisher-name>.</citation></ref>
<ref id="B44">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Humphrey</surname> <given-names>N. K.</given-names></name></person-group> (<year>1993</year>). <source>A History of the Mind</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Vintage Books</publisher-name>.</citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johansson</surname> <given-names>G.</given-names></name></person-group> (<year>1964</year>). <article-title>Perception of motion and changing form: a study of visual perception from continuous transformations of a solid angle of light at the eye</article-title>. <source>Scand. J. Psychol</source>. <volume>5</volume>, <fpage>181</fpage>&#x02013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9450.1964.tb01425.x</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johansson</surname> <given-names>G.</given-names></name></person-group> (<year>1973</year>). <article-title>Visual perception of biological motion and a model for its analysis</article-title>. <source>Percept. Psychophys.</source> <volume>14</volume>, <fpage>201</fpage>&#x02013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.3758/BF03212378</pub-id><pub-id pub-id-type="pmid">15820512</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalkan</surname> <given-names>S.</given-names></name> <name><surname>Dag</surname> <given-names>N.</given-names></name> <name><surname>T&#x000FC;r&#x000FC;ten</surname> <given-names>O. A.</given-names></name> <name><surname>Borghi</surname> <given-names>M.</given-names></name> <name><surname>Sahin</surname> <given-names>E.</given-names></name></person-group> (<year>2014</year>). <article-title>Verb concepts from affordances</article-title>. <source>Interact. Stud.</source> <volume>15</volume>, <fpage>1</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1075/is.15.1.01kal</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaufman</surname> <given-names>L.</given-names></name> <name><surname>Kaufman</surname> <given-names>J.</given-names></name></person-group> (<year>2000</year>). <article-title>Explaining the moon illusion</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>97</volume>, <fpage>500</fpage>&#x02013;<lpage>504</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.97.1.500</pub-id><pub-id pub-id-type="pmid">10618447</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kohler</surname> <given-names>I.</given-names></name></person-group> (<year>1951</year>). <article-title>Formation and transformation of the perceptual world</article-title>. <source>Psychol. Issues</source> <volume>3</volume>, <fpage>1</fpage>&#x02013;<lpage>173</lpage>.</citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kriegeskorte</surname> <given-names>N.</given-names></name> <name><surname>Mur</surname> <given-names>M.</given-names></name> <name><surname>Ruff</surname> <given-names>D. R.</given-names></name> <name><surname>Kiani</surname> <given-names>R.</given-names></name> <name><surname>Bodurka</surname> <given-names>J.</given-names></name> <name><surname>Esteky</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Matching categorical object representations in inferior temporal cortex of man and monkey</article-title>. <source>Neuron</source> <volume>60</volume>, <fpage>1126</fpage>&#x02013;<lpage>1141</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2008.10.043</pub-id><pub-id pub-id-type="pmid">19109916</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kruskal</surname> <given-names>J. B.</given-names></name> <name><surname>Wish</surname> <given-names>M.</given-names></name></person-group> (<year>1978</year>). <source>Multidimensional Scaling</source>. <publisher-loc>Thousand Oaks, CA</publisher-loc>: <publisher-name>Sage Publishing</publisher-name>. <pub-id pub-id-type="doi">10.4135/9781412985130</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Land</surname> <given-names>E. H.</given-names></name></person-group> (<year>1977</year>). <article-title>The retinex theory of color vision</article-title>. <source>Sci. Am</source>. <volume>237</volume>, <fpage>108</fpage>&#x02013;<lpage>128</lpage>. <pub-id pub-id-type="doi">10.1038/scientificamerican1277-108</pub-id><pub-id pub-id-type="pmid">929159</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Landau</surname> <given-names>B.</given-names></name> <name><surname>Smith</surname> <given-names>L.</given-names></name> <name><surname>Jones</surname> <given-names>S.</given-names></name></person-group> (<year>1998</year>). <article-title>Object perception and object naming in early development</article-title>. <source>Trends Cogn. Sci.</source> <volume>2</volume>, <fpage>19</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1016/S1364-6613(97)01111-X</pub-id><pub-id pub-id-type="pmid">21244958</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leibo</surname> <given-names>J. Z.</given-names></name> <name><surname>Liao</surname> <given-names>Q.</given-names></name> <name><surname>Anselmi</surname> <given-names>F.</given-names></name> <name><surname>Poggio</surname> <given-names>T.</given-names></name></person-group> (<year>2015</year>). <article-title>The invariance hypothesis implies domain-specific regions in visual cortex</article-title>. <source>PLoS Comput. Biol</source>. <volume>11</volume>:<fpage>e1004390</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004390</pub-id><pub-id pub-id-type="pmid">26496457</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Leslie</surname> <given-names>A.</given-names></name></person-group> (<year>1996</year>). <source>A Theory of Agency</source>. <publisher-loc>New Brunswick, NJ</publisher-loc>: <publisher-name>Rutgers University Center for Cognitive Science</publisher-name>.</citation></ref>
<ref id="B56">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Levy-Leblond</surname> <given-names>J. M.</given-names></name></person-group> (<year>1971</year>). <article-title>Galilei group and galilean invariance</article-title>, in <source>Group Theory and its Applications</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>221</fpage>&#x02013;<lpage>299</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-455152-7.50011-2</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Marr</surname> <given-names>D.</given-names></name></person-group> (<year>1982</year>). <source>Vision: A Computational Approach</source>. <publisher-loc>San Francisco, CA</publisher-loc>: <publisher-name>Freeman</publisher-name>.</citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marr</surname> <given-names>D.</given-names></name> <name><surname>Vaina</surname> <given-names>L.</given-names></name></person-group> (<year>1982</year>). <article-title>Representation and recognition of the movements of shapes</article-title>. <source>Proc. Royal Soc. London</source> <volume>B214</volume>, <fpage>501</fpage>&#x02013;<lpage>524</lpage>. <pub-id pub-id-type="doi">10.1098/rspb.1982.0024</pub-id><pub-id pub-id-type="pmid">6127693</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mealier</surname> <given-names>A. L.</given-names></name> <name><surname>Pointeau</surname> <given-names>G.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Dominey</surname> <given-names>P. F.</given-names></name></person-group> (<year>2016</year>). <article-title>Construals of meaning: the role of attention in robotic language production</article-title>. <source>Interact. Stud.</source> <volume>17</volume>, <fpage>41</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1075/is.17.1.03mea</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murray</surname> <given-names>S. O.</given-names></name> <name><surname>Boyaci</surname> <given-names>H.</given-names></name> <name><surname>Kersten</surname> <given-names>D. J.</given-names></name></person-group> (<year>2005</year>). <article-title>The emergence of object size invariance in the human visual cortex</article-title>. <source>J. Vis</source>. <volume>5</volume>, <fpage>744</fpage>&#x02013;<lpage>744</lpage>. <pub-id pub-id-type="doi">10.1167/5.8.744</pub-id></citation></ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nau</surname> <given-names>M.</given-names></name> <name><surname>Julian</surname> <given-names>J. B.</given-names></name> <name><surname>Doeller</surname> <given-names>C. F.</given-names></name></person-group> (<year>2018</year>). <article-title>How the brain&#x00027;s navigation system shapes our visual experience</article-title>. <source>Trends Cogn. Sci.</source> <volume>22</volume>, <fpage>810</fpage>&#x02013;<lpage>825</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2018.06.008</pub-id><pub-id pub-id-type="pmid">30031670</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Papafragou</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>The representation of events in language and cognition</article-title>, in <source>The Conceptual Mind: New Directions in the Study of Concepts</source>, eds E. Margolis, and S. Laurence (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>327</fpage>&#x02013;<lpage>345</lpage>.<pub-id pub-id-type="pmid">12175572</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Piaget</surname> <given-names>J.</given-names></name></person-group> (<year>1952</year>). <source>The Origins of Intelligence in Children</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>International Universities Press</publisher-name>. <pub-id pub-id-type="doi">10.1037/11494-000</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Piaget</surname> <given-names>J.</given-names></name> <name><surname>Inhelder</surname> <given-names>B.</given-names></name></person-group> (<year>1967</year>). <source>The Child&#x00027;s Conception of Space</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Norton</publisher-name>.</citation></ref>
<ref id="B65">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Pinker</surname> <given-names>S.</given-names></name></person-group> (<year>2002</year>). <source>The Blank Slate: The Modern Denial of Human Nature</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Viking</publisher-name>.</citation></ref>
<ref id="B66">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Povinelli</surname> <given-names>D. J.</given-names></name></person-group> (<year>2000</year>). <source>Folk Physics for Apes: The Chimpanzee&#x00027;s Theory of How the World Works (Vol. 7).</source> <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="B67">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Radvansky</surname> <given-names>G. A.</given-names></name> <name><surname>Zacks</surname> <given-names>J. M.</given-names></name></person-group> (<year>2014</year>). <source>Event Cognition</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>. <pub-id pub-id-type="doi">10.1093/acprof:oso/9780199898138.001.0001</pub-id></citation></ref>
<ref id="B68">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Runesson</surname> <given-names>S.</given-names></name></person-group> (<year>1994</year>). <article-title>Perception of biological motion: the KSD-principle and the implications of a distal versus proximal approach</article-title>, in <source>Perceiving Events and Objects</source>, eds G. Jansson, S.-S. Bergstr&#x000F6;m, and W. Epstein (<publisher-loc>Hillsdale, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum</publisher-name>), <fpage>383</fpage>&#x02013;<lpage>405</lpage>.</citation></ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sarnecka</surname> <given-names>B. W.</given-names></name> <name><surname>Carey</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>How counting represents number: what children must learn and when they learn it</article-title>. <source>Cognition</source> <volume>108</volume>, <fpage>662</fpage>&#x02013;<lpage>674</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2008.05.007</pub-id><pub-id pub-id-type="pmid">18572155</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shanahan</surname> <given-names>M.</given-names></name> <name><surname>Crosby</surname> <given-names>M.</given-names></name> <name><surname>Beyret</surname> <given-names>B.</given-names></name> <name><surname>Cheke</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>Artificial intelligence and the common sense of animals</article-title>. <source>Trends Cogn. Sci.</source> <volume>24</volume>, <fpage>862</fpage>&#x02013;<lpage>872</lpage>. <pub-id pub-id-type="doi">10.1016/j.tics.2020.09.002</pub-id><pub-id pub-id-type="pmid">33041199</pub-id></citation></ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sinha</surname> <given-names>C.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2014</year>). <article-title>Time, space, and events in language and cognition: a comparative view</article-title>. <source>Ann. NY Acad. Sci</source>. <volume>1326</volume>, <fpage>72</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1111/nyas.12491</pub-id><pub-id pub-id-type="pmid">25098724</pub-id></citation></ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>L. B.</given-names></name> <name><surname>Samuelson</surname> <given-names>L.</given-names></name></person-group> (<year>2006</year>). <article-title>An attentional learning account of the shape bias: reply to Cimpian and Markman (2005) and Booth, Waxman, and Huang (2005)</article-title>. <source>Dev. Psychol</source>. <volume>42</volume>, <fpage>1339</fpage>&#x02013;<lpage>1343</lpage>. <pub-id pub-id-type="doi">10.1037/0012-1649.42.6.1339</pub-id><pub-id pub-id-type="pmid">17087565</pub-id></citation></ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spelke</surname> <given-names>E. S.</given-names></name></person-group> (<year>2000</year>). <article-title>Core knowledge</article-title>. <source>Am. Psychol.</source> <volume>55</volume>, <fpage>1233</fpage>&#x02013;<lpage>1243</lpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.55.11.1233</pub-id><pub-id pub-id-type="pmid">11280937</pub-id></citation></ref>
<ref id="B74">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spelke</surname> <given-names>E. S.</given-names></name></person-group> (<year>2004</year>). <article-title>Core knowledge</article-title>, in <source>Attention and Performance, Vol. 20: Functional Neuroimaging of Visual Cognition</source>, eds N. Kanwisher, and J. Duncan (<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>29</fpage>&#x02013;<lpage>56</lpage>.</citation></ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spelke</surname> <given-names>E. S.</given-names></name> <name><surname>Breinlinger</surname> <given-names>K.</given-names></name> <name><surname>Macomber</surname> <given-names>J.</given-names></name> <name><surname>Jacobson</surname> <given-names>K.</given-names></name></person-group> (<year>1992</year>). <article-title>Origins of knowledge</article-title>. <source>Psychol. Rev</source>. <volume>99</volume>, <fpage>605</fpage>&#x02013;<lpage>632</lpage>. <pub-id pub-id-type="doi">10.1037/0033-295X.99.4.605</pub-id><pub-id pub-id-type="pmid">1454901</pub-id></citation></ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spelke</surname> <given-names>E. S.</given-names></name> <name><surname>Kinzler</surname> <given-names>K. D.</given-names></name></person-group> (<year>2007</year>). <article-title>Core knowledge</article-title>. <source>Dev. Sci</source>. <volume>10</volume>, <fpage>89</fpage>&#x02013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7687.2007.00569.x</pub-id><pub-id pub-id-type="pmid">17181705</pub-id></citation></ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Strickland</surname> <given-names>B.</given-names></name></person-group> (<year>2017</year>). <article-title>Language reflects core cognition: a new theory about the origin of cross-linguistic regularities</article-title>. <source>Cogn. Sci.</source> <volume>41</volume>, <fpage>70</fpage>&#x02013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.1111/cogs.12332</pub-id><pub-id pub-id-type="pmid">26923431</pub-id></citation></ref>
<ref id="B78">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Thelen</surname> <given-names>E.</given-names></name> <name><surname>Smith</surname> <given-names>L. B.</given-names></name></person-group> (<year>1994</year>). <source>A Dynamic Systems Approach to the Development of Cognition and Action</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="B79">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Thom</surname> <given-names>R.</given-names></name></person-group> (<year>1972</year>). <source>Structural Stability and Morphogenesis</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>W. A. Benjamin Inc</publisher-name>.</citation></ref>
<ref id="B80">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Tomasello</surname> <given-names>M.</given-names></name> <name><surname>Call</surname> <given-names>J.</given-names></name></person-group> (<year>1997</year>). <source>Primate Cognition</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tversky</surname> <given-names>B.</given-names></name></person-group> (<year>2003</year>). <article-title>Structures of mental spaces: how people think about space</article-title>. <source>Environ. Behav</source>. <volume>35</volume>, <fpage>66</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1177/0013916502238865</pub-id></citation></ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Viera</surname> <given-names>G.</given-names></name></person-group> (<year>2020</year>). <article-title>The sense of time</article-title>. <source>Br. J. Philos. Sci</source>. <volume>71</volume>, <fpage>443</fpage>&#x02013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.1093/bjps/axy019</pub-id></citation></ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Von Glasersfeld</surname> <given-names>E.</given-names></name></person-group> (<year>2005</year>). <article-title>Thirty years constructivism</article-title>. <source>Constr. Found.</source> <volume>1</volume>, <fpage>9</fpage>&#x02013;<lpage>12</lpage>.</citation></ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>W.</given-names></name> <name><surname>Crompton</surname> <given-names>R. H.</given-names></name> <name><surname>Carey</surname> <given-names>T. S.</given-names></name> <name><surname>G&#x000FC;nther</surname> <given-names>M. M.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Savage</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Comparison of inverse-dynamics musculo-skeletal models of AL 288-1 Australopithecus afarensis and KNM-WT 15000 Homo ergaster to modern humans, with implications for the evolution of bipedalism</article-title>. <source>J. Hum. Evol</source>. <volume>47</volume>, <fpage>453</fpage>&#x02013;<lpage>478</lpage>. <pub-id pub-id-type="doi">10.1016/j.jhevol.2004.08.007</pub-id><pub-id pub-id-type="pmid">15566947</pub-id></citation></ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Warglien</surname> <given-names>M.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name> <name><surname>Westera</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Event structure, conceptual spaces and the semantics of verbs</article-title>. <source>Theor. Linguist.</source> <volume>38</volume>, <fpage>159</fpage>&#x02013;<lpage>193</lpage>.</citation></ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wiskott</surname> <given-names>L.</given-names></name> <name><surname>Sejnowski</surname> <given-names>T. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Slow feature analysis: unsupervised learning of invariances</article-title>. <source>Neural Comput.</source> <volume>14</volume>, <fpage>715</fpage>&#x02013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1162/089976602317318938</pub-id><pub-id pub-id-type="pmid">11936959</pub-id></citation></ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). <article-title>Representing causation</article-title>. <source>J. Exp. Psychol.</source> <volume>136</volume>, <fpage>82</fpage>&#x02013;<lpage>111</lpage>. <pub-id pub-id-type="doi">10.1037/0096-3445.136.1.82</pub-id></citation></ref>
<ref id="B88">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>Dynamics and the perception of causal events</article-title>, in <source>Understanding Events: How Humans See, Represent, and Act on Events</source>, eds T. Shipley, and J. Zacks (<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>555</fpage>&#x02013;<lpage>587</lpage>. <pub-id pub-id-type="doi">10.1093/acprof:oso/9780195188370.003.0023</pub-id></citation></ref>
<ref id="B89">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>Representing verbs with force vectors</article-title>. <source>Theor. Linguist.</source> <volume>38</volume>, <fpage>237</fpage>&#x02013;<lpage>248</lpage>. <pub-id pub-id-type="doi">10.1515/tl-2012-0015</pub-id></citation></ref>
<ref id="B90">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>P.</given-names></name> <name><surname>Shepard</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Causation, touch, and the perception of force</article-title>, in <source>The Psychology of Learning and Motivation, Vol. 58</source>, ed B. H. Ross (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>167</fpage>&#x02013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-407237-4.00005-0</pub-id></citation></ref>
<ref id="B91">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>P.</given-names></name> <name><surname>Thorstad</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Force dynamics</article-title>, in <source>The Oxford Handbook of Causal Reasoning</source>, ed M. R. Waldmann (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>147</fpage>&#x02013;<lpage>167</lpage>.</citation></ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woodward</surname> <given-names>A. L.</given-names></name></person-group> (<year>2009</year>). <article-title>Infants&#x00027; grasp of others&#x00027; intentions</article-title>. <source>Curr. Dir. Psychol. Sci.</source> <volume>18</volume>, <fpage>53</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8721.2009.01605.x</pub-id><pub-id pub-id-type="pmid">23645974</pub-id></citation></ref>
<ref id="B93">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Woodward</surname> <given-names>A. L.</given-names></name></person-group> (<year>2013</year>). <article-title>Infant foundations of intentional understanding</article-title>, in <source>Navigating the Social World: What Infants, Children, and Other Species Can Teach Us</source>, eds M. R. Banaji, and S. A. Gelman (<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>75</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1093/acprof:oso/9780199890712.003.0015</pub-id></citation></ref>
<ref id="B94">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wyeth</surname> <given-names>G.</given-names></name> <name><surname>Milford</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Spatial cognition for robots</article-title>. <source>IEEE Robot. Autom. Mag.</source> <volume>16</volume>, <fpage>24</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1109/MRA.2009.933620</pub-id></citation></ref>
<ref id="B95">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zacks</surname> <given-names>J. M.</given-names></name> <name><surname>Tversky</surname> <given-names>B.</given-names></name></person-group> (<year>2001</year>). <article-title>Event structures in perception and conception</article-title>. <source>Psychol. Bull.</source> <volume>127</volume>, <fpage>3</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.127.1.3</pub-id></citation>
</ref>
<ref id="B96">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>B.</given-names></name> <name><surname>Feng</surname> <given-names>J.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name> <name><surname>Yan</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>A survey on deep learning-based fine-grained object classification and semantic segmentation</article-title>. <source>Int. J. Autom. Comput.</source> <volume>14</volume>, <fpage>119</fpage>&#x02013;<lpage>135</lpage>. <pub-id pub-id-type="doi">10.1007/s11633-017-1053-3</pub-id></citation></ref>
<ref id="B97">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>S. C.</given-names></name> <name><surname>Yuille</surname> <given-names>A. L.</given-names></name></person-group> (<year>1996</year>). <article-title>FORMS: a flexible object recognition and modeling system</article-title>. <source>Int. J. Comput. Vis</source>. <volume>20</volume>, <fpage>187</fpage>&#x02013;<lpage>212</lpage>. <pub-id pub-id-type="doi">10.1007/BF00208719</pub-id><pub-id pub-id-type="pmid">10937963</pub-id></citation></ref>
<ref id="B98">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zwarts</surname> <given-names>J.</given-names></name> <name><surname>G&#x000E4;rdenfors</surname> <given-names>P.</given-names></name></person-group> (<year>2016</year>). <article-title>Locative and directional prepositions in conceptual spaces: the role of polar convexity</article-title>. <source>J. Logic Lang. Inform.</source> <volume>25</volume>, <fpage>109</fpage>&#x02013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1007/s10849-015-9224-5</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>I avoid Spelke&#x00027;s use of &#x0201C;core&#x0201D; knowledge structures [and Carey&#x00027;s (<xref ref-type="bibr" rid="B9">2009</xref>) &#x0201C;core&#x0201D; cognition] since it is connected with a nativist position, and instead speak of primary categories (see G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B23">2018</xref>).</p></fn>
<fn id="fn0002"><p><sup>2</sup>This term is borrowed from economics.</p></fn>
<fn id="fn0003"><p><sup>3</sup>Fungibility will also be central in my analysis of the number category.</p></fn>
<fn id="fn0004"><p><sup>4</sup>The space is generally thought to have a Euclidean geometry and be based on a Cartesian coordinate system, but there is some linguistic evidence that <italic>polar</italic> coordinates might give a better description of its perceptual geometry (e.g., G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B22">2014</xref>; Zwarts and G&#x000E4;rdenfors, <xref ref-type="bibr" rid="B98">2016</xref>).</p></fn>
<fn id="fn0005"><p><sup>5</sup>The distinction between egocentric and allocentric corresponds to Gibson&#x00027;s (<xref ref-type="bibr" rid="B35">1966</xref>) distinction between &#x0201C;perspective structure&#x0201D; and &#x0201C;invariant structure.&#x0201D;</p></fn>
<fn id="fn0006"><p><sup>6</sup>If a place is determined by landmarks, however, the place will change if the landmarks change (Gallistel, <xref ref-type="bibr" rid="B19">1990</xref>).</p></fn>
<fn id="fn0007"><p><sup>7</sup>When categorizing objects with parts, the relations of the parts can be modeled in a &#x0201C;structure space&#x0201D; (Fiorini et al., <xref ref-type="bibr" rid="B17">2014</xref>).</p></fn>
<fn id="fn0008"><p><sup>8</sup>What counts as a mass noun is to some extent language dependent. For example, &#x0201C;furniture&#x0201D; (mass noun in English) is a count noun in French (meubles) and German (M&#x000F6;bel).</p></fn>
<fn id="fn0009"><p><sup>9</sup>Gharaee et al. (<xref ref-type="bibr" rid="B33">2017a</xref>) have applied the force dynamic model in a robotic system that has been constructed for categorizing actions.</p></fn>
<fn id="fn0010"><p><sup>10</sup>Slightly more mathematically, an event can be represented as a product space of these two spaces.</p></fn>
<fn id="fn0011"><p><sup>11</sup>Thom&#x00027;s (<xref ref-type="bibr" rid="B79">1972</xref>) work on catastrophe theory presents a general way of characterizing such disruptive changes.</p></fn>
<fn id="fn0012"><p><sup>12</sup>Viera (<xref ref-type="bibr" rid="B82">2020</xref>) argues that the circadian system allows us to sense time, but he does not consider the role of the time dimension in cognitive event representation.</p></fn>
<fn id="fn0013"><p><sup>13</sup>One reason for this caveat is that animals have difficulties reasoning about causality that depends on external forces (Tomasello and Call, <xref ref-type="bibr" rid="B80">1997</xref>; Povinelli, <xref ref-type="bibr" rid="B66">2000</xref>; G&#x000E4;rdenfors and Lombard, <xref ref-type="bibr" rid="B28">2020</xref>).</p></fn>
</fn-group>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> The author acknowledges support from Lund University.</p>
</fn>
</fn-group>
</back>
</article> 