An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames

Rohlfing, Katharina J.; Wrede, Britta; Vollmer, Anna-Lisa; Oudeyer, Pierre-Yves

doi:10.3389/fpsyg.2016.00470

HYPOTHESIS AND THEORY article

Front. Psychol., 19 April 2016

Sec. Cognitive Science

Volume 7 - 2016 | https://doi.org/10.3389/fpsyg.2016.00470

An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames

1. Cognitive Interaction Technology, Bielefeld University Bielefeld, Germany
2. Faculty of Arts and Humanities, Paderborn University Paderborn, Germany
3. INRIA Bordeaux, France

Abstract

The classic mapping metaphor posits that children learn a word by mapping it onto a concept of an object or event. However, we believe that a mapping metaphor cannot account for word learning, because even though children focus attention on objects, they do not necessarily remember the connection between the word and the referent unless it is framed pragmatically, that is, within a task. Our theoretical paper proposes an alternative mechanism for word learning. Our main premise is that word learning occurs as children accomplish a goal in cooperation with a partner. We follow Bruner’s (1983) idea and further specify pragmatic frames as the learning units that drive language acquisition and cognitive development. These units consist of a sequence of actions and verbal behaviors that are co-constructed with a partner to achieve a joint goal. We elaborate on this alternative, offer some initial parametrizations of the concept, and embed it in current language learning approaches.

Introduction

The direct mapping of words onto concepts has often been considered to be at the core of the language acquisition mechanism. Indeed, it has been suggested that “in order to successfully acquire a new word, young children must learn the correct associations between labels and their referents” (Wojcik, 2013, p. 1). Children must first “attend to and encode information about the referent,” and they have to learn “how the sounds in their language map onto objects, actions, and other properties of the world” (Wojcik, 2013, p. 1). Studies investigating this mechanism assume that once children are able to perceive a referent and to hear the word given to it, they will (1) link the word with the referent and (2) remember it as a mapping. Then when they see the referent in another situation, they will be able to recall the word; vice versa, when they hear the word, they will be able to recall a memory of the object.

However, recent studies on language success in children from low-income families indicate that the ability to refer a word to a referent might be only the last link in a long chain that first starts with establishing more fundamental communicative skills that have been called “communication foundations” (Bruner, 1983; Stephens and Matthews, 2014; Hirsh-Pasek et al., 2015, p. 2). Nonetheless, many questions regarding the nature of these communication foundations remain open. We believe that improving our understanding of these foundations will advance an alternative explanation of how children acquire language in a social interaction.

Although there are certainly plenty of situations in children’s everyday lives in which a novel word is taught to them by depicting and labeling a novel object, we suggest that this is not how word learning starts. Children learn the word “binky” not because it is a novel word introduced to them before its use, but because it is a word for an object that is used in the context of such activities as soothing that involve not only the child but also other persons. Importantly, the main purpose of these activities is not to acquire a new word but to achieve a joint action goal (Lock, 1978; Bruner, 1983). Hence, children will pick up this word because it is being used for a particular purpose that is relevant to them, and because it is uttered hundreds of times to express activities in which it is involved. Consequently, action and language are interwoven right from the start (Lock, 1978). Moreover, this interaction is organized systematically (e.g., Nomikou and Rohlfing, 2011) and constrained by culture (Shotter and Newson, 1982; Bruner, 1983; Nelson, 2007).

We think that one way to grasp this organization of language and action is via the concept of pragmatic frames. A pragmatic frame is a negotiated interaction protocol targeted to achieve a joint goal that involves (1) a surface layer, namely, an observable coordinated sequence of pragmatic behaviors in the form of words and actions, (2) a deep structure underlying these behaviors that targets the achievement of one or several joint goals, and (3) a nested cognitive layer that specifies which cognitive operations the frame triggers as it unfolds. We propose that pragmatic frames serve as a communicative foundation or a learning “matrix” (Bruner, 1983, p. 38) that emerges between interactants, and that they are the key to understanding ecological learning processes (see Figure 1)

FIGURE 1

Our proposed alternative to mapping in the form of pragmatic frames is not new and conforms to existing ideas in language acquisition research. However, in this article, we bundle these ideas together, offer new concrete parameterizations that can be used in models of social learning (see Pragmatic Frames—an Introduction and History), and link these ideas more tightly to the current debate on key aspects of word learning (see How Current Approaches Interface with Pragmatic Frames). We hope that our view – though certainly not yet fully developed – can motivate research in language acquisition to study not only individual words but also the emerging action sequences within which these words are crucial for attaining joint goals. We also hope that this theoretical perspective can inspire development in human–robot/(–computer)–interaction, enabling machines to use pragmatic frames and thus to learn and negotiate new interaction structures by themselves.

Pragmatic Frames—an Introduction and History

What we call pragmatic frames are also known as “interactional formats” (Bruner, 1983, p. 120) or “frames” (Fillmore, 1982, p. 111; Tomasello, 2003, p. 25). Pragmatic frames can be understood as recurrent interactional structures (Bruner, 1983; Ninio and Snow, 1996; Fogel and Garvey, 2007) that emerge over time (see Pragmatic Frames Require a History of an Interaction). In infant development, these structures first occur in a very specific context (Bateson, 1955; Goffman, 1974; Kendon, 1985; Rohlfing et al., 2015). They involve a sequence of goal-oriented actions that is coordinated with the interaction partner (see Pragmatic Frames Involve Goal-Oriented Actions and Table 1 for more examples). Take, for example, a guessing game in which a child is asked where the lamp is, and she or he points to it. When performing this speech act, a competent speaker knows that the goal has to be framed by a sequence of actions on the surface layer (1) such as looking at the listener and asking a question with a specific prosody and syntax that contains a slot for the requested object. The deep structure (2) involves expecting the listener to manipulate attention (e.g., by a pointing gesture) to make it joint. The cognitive layer (3), in turn, entails the cognitive function for the listener of identifying the requested object.

Table 1

Guessing game (Steels, 2001):
Speaker	Listener

(1) By pointing, eye gazing or other means, the speaker draws the listener’s visual attention toward an object of interest.	(2) The listener attends to the object

(3) The speaker shares a single predicate that is true for the object of interest but not for the other objects in this context.	(4) The listener shares the predicate and looks up all associations with this predicate in her memory.

	(5) The listener applies the highest scoring association and points to the object of interest.

(6) The speaker gives feedback.



Greeting (in which a child learns the name of another person):

Caregiver	Child	Greeted person

(1) The person recognizes somebody familiar and shares this with the child “Look, there is Anna! Let’s say hello to her!

(2) Both are approaching the other person or making them visible.

(3) The person looks at the other person says “hello!” and/or waves.		(4) The person looks up and recognizes a familiar person.

		(5) The listener says “Hello!” to the caregiver and the child or caregiver waves.

(6) Both acknowledge it by smiling.


Story telling (Quasthoff, 1997):

Story teller	Listener

	[(1) Can ask about an event]

(1) Display of referential relevance	Display of formal relevance (Can ask questions)



(2) Topicalization

(3) Elaboration

(4) Closing

(5) Translation/Evaluation

	(6) Evaluation



Action demonstrations (HRI – Akgun et al., 2012):

Experimenter/Programmer	Teaching user	Robot learner

(1) Start	(2) “New demonstration.”

	(3) The user moves the arms of the robot and records poses as keyframe with the command “Record frame.”
	(4) “End of demonstration.”

	(5) The user can ask the robot to perform the learned movement with the command “Can you perform the skill?”	(6) The robot performs the movement.

(7) End

Examples of pragmatic frames.

Pragmatic frames are retrieved from memory to guide the interpretation of an ongoing situation (see Fillmore, 1982; Bruner, 1983; Tomasello, 2003). Wittgenstein (1953/1997) called such protocols or scripts in which action and language are interwoven “language games [Sprachspiele].” They result in a behavioral disposition within interactants that enables mutual understanding. Language games follow regularities that are constrained by different contexts. Without such regularities, it is not possible for a word to have a meaning.

Steels (2001) modeled language games formally in computational and robotic simulations of the formation of linguistic conventions in groups of agents (Steels and Belpaeme, 2005). In these models, and in particularly in the Talking Head experiment (Steels and Kaplan, 2002), language games allowed robotic agents to successfully negotiate new semantic representations in which words were used as cues to draw the attention of social peers to a shared referent. In such models, language acquisition goes far beyond the mapping mechanism and fits within the pragmatic frame approach we propose here. However, both Wittgenstein as well as Steels and colleagues barely considered the details of the cognitive and developmental dimensions involved in learning these frames.

Schank and Abelson (1977) have addressed cognitive dimensions and emphasized the fact that our interactional knowledge is organized into sequences of actions and linguistic acts. Importantly, this knowledge involves extractions of events “connected directly to the goals and plans to realize those goals made by the participants” (Schank and Abelson, 1977, p. 156). Fillmore (1982) has shown that these extractions comprise semantic roles to evoke the same aspects of a scene in different ways. This knowledge is then used to interpret situations (Fillmore, 1982; see Pragmatic Frames Evoke an Interpretation of a Situation).

A reflection of this frame structure can be found on the linguistic surface in the approach known as Construction Grammar (e.g., Goldberg, 2003). Constructions that relate syntactic structure to meaning are acquired through “repeated exposure to the usage of language in context” (Fischer, 2015, p. 563). Central to this view is, once again, the use of syntactical units that occur in concrete events and then become abstracted. Several computational models, such as Spranger (2011) and Spranger et al.’s (2012) Fluid Construction Grammar, have begun to shed light on how semantic roles evolve and can be negotiated; how particular syntactical units become attached to cognitive operations; or how these links becomes routinized and abstracted through repeated usage (see the usage-based models in, e.g., Langacker, 1987; Behrens, 2009). However, Construction Grammar applies mainly to understanding and acquisition in the context of linguistic interaction. As a result, such frames have hardly ever been included and developed within more general forms of social interaction such as joint actions (but see Dominey and Boucher, 2005).

What interests us is not just the fact that usage-based models allow Construction Grammar to accommodate dynamic aspects of language such as acquisition and change (Croft, 2007). For our approach to what we call the communicative foundation, it is the social-pragmatic aspects of language that are more crucial and the way they preserve a tight link to actions (Harris et al., 1988; Barrett et al., 1991; Tomasello, 2003; Nelson, 2007). By extending the units of syntax to gestures, the concept of pragmatic frames can also be used to refer to the infants’ multimodal ability to enter “into some type of joint attentional focus with a mature language user” (Tomasello and Akhtar, 1995, p. 201). Bruner (1983) describes the value of joint attention extensively, and he ascribes the origins of semantics to the deictic acts of showing and following (e.g., pointing). Tomasello et al. (2005, p. 682) capitalized on this potential of deictic acts revealing that starting from 9 months of age, the child’s ability “to actually share goals” becomes visible when she or he points to, for example, an object: an infant uses pointing gestures, because they are a means to engage the attention of others. Although we think that the ability to engage in joint attention is certainly an important milestone in communication and is achieved not only via gaze and pointing but also via eye–hand coordination (Yu and Smith, 2013) and non-visual modalities (Akhtar and Gernsbacher, 2007), we suggest that joint attention is often a means toward reaching the joint goal (as an element required within a sequence of actions, as noted by Shotter and Newson, 1982) rather than being the purpose of an interaction: Communication with children does not stop at the coordination of attention.

Most recent approaches taking the perspective of a social-pragmatic theory of language acquisition focus on the role of social cues. These may be, for example, an eye gaze, a pointing gesture, or a smile. All such cues are supposed to be especially meaningful to young infants (e.g., Szufnarowska et al., 2014) who exhibit a particular responsivity to them (Gergely and Watson, 1999; Senju and Csibra, 2008; Csibra and Gergely, 2009; Csibra, 2010). Whereas some researchers claim that this responsivity belongs to their innate disposition (Csibra, 2010), others provide examples of how this disposition might be educated and emerge over time (Nomikou et al., 2013; Rkaczaszek-Leonardi et al., 2013; Rohlfing and Nomikou, 2014). At this point, we wish to emphasize the difference between the concept of a “cue” and the interactive process of a “co-construction”: whereas a cue would trigger a desired behavior in merely one short moment, a co-construction takes time and requires many turns in a process of mutual adjustment (Fogel, 1993; De Jaegher et al., 2010; Rkaczaszek-Leonardi et al., 2014). It is only as a consequence of this mutual adjustment – within which interactants have to exchange behaviors in order to agree on the joint goal (see Dynamic Coupling) – that a behavior eventually becomes a cue. A cue is supposed to direct attention toward a referent (which, in turn, can initiate a mapping). We will argue below that the establishment of a cue is driven by repeatedly occurring joint action sequences.

Whereas the concept of a frame is reduced mostly to particular early games (Bruner, 1983) and social cues in developmental studies, some further specifications can be found in modeling work. Steels (2001) and Steels and Kaplan (2002) (see also Table 1) have experimented with various preprogrammed interaction protocols designed specifically to allow robots to learn speech elements (Oudeyer, 2006), lexicons (Steels and Kaplan, 2002), or grammatical structures (Steels and Spranger, 2008). Related work has used similar interactional frames to allow a structured interaction between humans and robots. This has enabled robot learners to identify and learn new elements of language (Roy and Pentland, 2002; Cangelosi and Riga, 2006; Lyon et al., 2012). Work on human–robot interaction has shown that structured interaction protocols based on, for example, mechanisms of imitation or conditioning (Billard et al., 2008; Cuayáhuitl, 2015) allowed users to teach novel sensorimotor skills to robots. However, as we discuss in Vollmer et al. (submitted), the flexibility and power of social learning mechanisms is severely limited. In these existing models (1) the interaction protocols were preprogrammed (i.e., the robot knew how to use and understand them); (2) only few interaction protocols were used at the same time; and (3) they did not include mechanisms to learn and negotiate new interaction protocols. Overcoming these limitations will certainly result in new possibilities of interacting with artificial systems.

Although by allowing rich and varied interaction, protocols may, at first sight, appear to complexify the learning process, the information contained in variable parts of these protocols may actually be the key to cueing the inference process and enabling goal-oriented learning of novel actions in much larger dimensional representation spaces. In the following, we specify the key characteristics of pragmatic frames.

Pragmatic Frames Require a History of an Interaction

Repetition is central to the power of pragmatic frames. First, the division of roles and tasks is important to co-construct a goal-oriented sequence of social actions. Through the ability to remember a sequence of events (Davachi and DuBrow, 2015), its repeated occurrence enables participants to develop expectations regarding how actions relate to each other (Marcos, 1991; Nomikou et al., 2016, accepted). Repeating a particular situation leads to a familiarization effect. A familiarization results in (1) easily interpretable settings, and (2) lower cognitive load (Bruner, 1983). In the following, we explain these two strongly related effects in more detail.

In their first and second years of life, children learn primarily on the basis of familiar elements in the context, and they need environmental or social support to produce the appropriate behavior. Some studies have shown that elements of an ongoing situation give rise to cognitive operations in infants. Recently, for example, Parise and Csibra (2013) showed that children exhibited an N-400 semantic priming effect only when they heard their mothers voicing words referring to a visual stimulus presentation. With respect to children’s non-verbal behavior, Beisert et al. (2012) found that facing an unusual situation hindered imitation ability in 14-month-olds.

Marcos (1991) has cast light on the process of social support in repeating contexts and its value for language acquisition. In this study, mothers described a poster repeatedly to their 12- to 13-month-old children. Results showed that in comparison to controls, infants in the repeating context condition showed an increase in (1) the time devoted to referential behavior, (2) the number of infant initiations of dialog, and (3) the use of pointing gestures.

Farrar et al. (1993) examined the value of the familiarity of situational aspects for processing capabilities by observing mothers and their 2-year-old children playing with toys in either a familiar setting (same toys in all sessions) or an unfamiliar setting (new toys in each session). Comparing children’s word productions across settings revealed that children used more different lexical types, used more verbs, and had a higher mean length of utterance in the familiar toy setting. Farrar et al. (1993, p. 603) suggested that the familiarity with the toys provided both “a conceptual framework for interpreting the event” and an increase in “processing space.” In this sense, a familiar situation cues the retrieval of the appropriate meaning, because highly frequent items form stronger associative networks than less familiar sets (cf. Bjorklund, 1987). In another study, Rohlfing (2006) investigated the acquisition of spatial prepositions in 2-year-olds by presenting them with sets of toys that were either familiar or unfamiliar. Results showed the best learning with familiar sets. Furthermore, familiarity not only seems to support the interpretation of an ongoing situation, but may also be a prerequisite for the extension process by which children generalize their knowledge to novel situations. Without the ability to apply what they have learned in a familiar situation, children would be unable to generalize a spatial preposition to novel objects. This seems to indicate a hierarchical structure of learning proceeding from a familiar context to the ability to generalize (Rohlfing, 2006).

Taken together, this evidence suggests that the familiarity of a situation influences the perception of the overall interaction setting and the required degree of effort. When interpreting a situation, children need to hook up with familiar elements.

Currently, we do not know on which basis children are able to recognize a repeated situation, thereby enabling them to anticipate actions (Schacter et al., 2007; Ramscar et al., 2010). The ability to recognize sequences of events certainly plays a key role. In fact, in recent neurophysiological approaches, Davachi and DuBrow (2015) showed how sequences elicited hippocampal patterns that stabilized through repetition. They suggested that this “may be a distinction between single-trial or episodic sequence encoding and the representation of a well-learned, repeated, predictable sequence because each re-exposure to a sequence may modify the learned representation” (p. 2). This distinction also needs to be studied in developmental research. Although the comparison of learning situations is already recognized as a powerful mechanism (Yu and Smith, 2007; Trueswell et al., 2013), its use has been limited to investigations across trials or to fixed delays (e.g., Vlach, 2014). We need to fill this research gap and explain the process of dynamic integration between learned regularities and the immediate contextual cues guiding attention at the specific moment (Smith et al., 2010) that yields “attentional hierarchies” (Bahrick and Newell, 2008, p. 993).

Pragmatic Frames Involve Goal-Oriented Actions

Bruner (1983, pp. 24–31) acknowledges that children are equipped with some predispositions that help them to make sense of recurrent situations. He calls this the “initial cognitive endowment.” One proposed prerequisite is that children transfer experiences into means–goal structures (p. 18). This endowment supplies some semantic targets (p. 34). Similarly, Nelson (1974) proposes that actions and their results are at the core of children’s concepts (see also Mandler, 2012). In fact, in their early infancy, humans are “obsessed” with goals. Csibra and Gergely (2007) propose two main epistemic functions that this obsession serves: online prediction (see below) and social learning.

Online prediction encompasses the fact that infants look at (Baldwin et al., 2001), imitate (Meltzoff, 1995), and anticipate (Woodward, 1999; Falck-Ytter et al., 2006) others’ action goals rather than regarding other aspects of observed behaviors. Several researchers assume that the basis for this is the way the infant’s own motor system develops while gathering own motor experiences (Woodward, 2009; Kanakogi and Itakura, 2011). Thus, when observing someone performing an action, children will simulate an action plan covertly (Rizzolatti et al., 2001) if this plan is already in their action repertoire (Woodward, 1999).

Csibra and Gergely (2007), in turn, offer an alternative approach to action understanding in which situational constraints are relevant. “Teleological reasoning […] requires the recruitment of the relevant background knowledge that the observer accumulated about the physical constraints of the situation and of the actor” (Csibra and Gergely, 2007, p. 70). This argument is supported by the fact that infants attribute goals not only to conspecifics but also to abstract and artificial agents.

Yet another source that might serve as a database for infants to recognize a structure in the relation between an involved agent and her or his goal is the interaction (Reddy and Uithol, 2015; Nomikou et al., 2016, accepted). Even though Gerson and Woodward (2014) have argued recently for a “unique effect of active over observational experience” on the recognition of goal structures in actions, this effect might appear only because of the dichotomy between active and observational experience. But what about an experience in which an infant is only a part of an action? Reddy et al. (2013; see also Lock, 1978) have shown impressively that 2-month-old infants already perform anticipatory adjustments of their own body when they see a caregiver approaching to pick them up. Clearly, infants learn very early on in their development that one of the most important means to achieve their own goals is the actions of their caregiver. Simultaneously, they experience their own actions in the light of goals as interpreted by their caregivers: Rkaczaszek-Leonardi et al. (2013) have shown that seemingly random movements of 3-month-olds will be interpreted as some form of collaboration and weaved into a joint activity. It should be noted that the notion of goals, and especially how goals are represented, is currently underspecified in the scientific debate (Wrede et al., 2012). More specifically, little is known about bottom–up ways for learners to infer or detect the (potential) goal of an action. Wrede et al. (2012) argue, however, that it is reasonable to assume bottom-up biases that exist and predispose the learner to identify goal-relevant features. The relevant definition of a goal should also encompass the social dimension. Little is known, for example, about the role of emotional attunement in goal recognition, although it has been recognized as being crucial in infant development (Legerstee, 2005; see also Stephens and Matthews, 2014, for a brief review). Combining emotions with action goals, Rossmanith et al. (2014, p. 8) have proposed that “action arcs” shape activities with children. These are built up within the flow of the interaction and thus consist of a beginning, a building up, a climax, and a resolution. The term “climax” is similar to our conception of the joint goal but pinpoints the emotional function that sequentially organized actions have to fulfill.

Social Learning

Children experience goals by being active, by being a part of a collaborative activity, and by being interpreted as active. In fact, the goal of a joint interaction is central to the idea of the pragmatic frame studied in this article. We conform with Tomasello et al. (2005, p. 676) in believing that “human beings […] are biologically adapted for participating in collaborative activities involving shared goals and socially coordinated action plans.” With growing interactional experience, children become able to elicit goals on their own. However, we hypothesize that they start with ideas of goals that differ significantly from the goals pursued by older children or adults. In this sense, inducing a significant change in the environment such as turning the light on and off might well be an attractive goal for a young child (Wrede et al., 2012). Such goals are commonly used in the first steps of language acquisition: Stern and Stern (1975) reported a first understanding in their child following an instruction to change the environment in a significant acoustic way by, for example, ringing a bell or clapping hands. Nelson (1974, p. 279) also remarks that early vocabularies include objects that “move or change in some way or that child can act upon” so that the goal of the activities is perceivable.

This vision of how the acquisition of new words can occur as a side effect of interaction games leading toward joint actions leads us to disagree with Tomasello’s (2001) suggestion that language acquisition is possible only when children develop the skill of joint attention. We think that joint attention is helpful—and, as a frame, it massively boosts word learning. However, we think that children gain a rich interactional experience in expecting and coordinating actions that is crucial for language learning from other frames consisting of goal-directed actions in a sequence in which roles have to be fulfilled and children’s attention is educated in the sense of fulfilling this role (not necessarily visually). The goals of these actions are crucial for their organization, and we locate these on the deep structure level (see below). Our view is supported by Mastin and Vogt’s (2015) comparison between urban and rural communities in Mozambique. They found that language is unlikely to occur during joint attention with objects in a non-industrial rural environment in which object stimulation is not common. Instead, the vocabulary scores of children raised in this environment correlated positively with dyadic activities engaged in through touch and ritualized play. Here, further cross-cultural studies are necessary to reveal alternative interaction protocols. Along these lines, Nelson (2007, p. 118) suggests that even in societies in which parents do not engage in naming things, “immersion in a language-using community” contributes to patterns of interaction. These patterns are pragmatic frames that enable children’s learning.

Pragmatic Frames Consist of a Meaning and a Syntax

Bruner (1983, p. 46) points out that pragmatic frames can be characterized by a surface and a deep structure: he uses deep structure to characterize the invariant basic form of a pragmatic frame and surface structure to characterize its appearance (i.e., the variable forms in which the deep structure is realized). For Bruner’s (1983) famous example of the peek-a-boo game, the deep structure is about hiding and reappearing, whereas the means for it can vary on the surface structure.

We prefer to account for pragmatic frames by using the terms meaning and syntax. This allows us to further differentiate the structure introduced by Bruner (1983) (see Table 2). Another crucial difference is our inclusion of cognitive operations among the ingredients of meaningful behavior that are needed for an appropriate coordination and disposition of the participants.

Table 2

	Syntax	Meaning
Surface	– External	– Hidden
	– Directly observable	– Cognitive operations recruited from memory
	– Sequence of behaviors
	– Execution of deep syntax	– Emerging cues of behavior
Deep	– Invariant properties inferred from, e.g., statistical learning	– Hidden
	– The basis for a sequence	– Constructed around joint goal(s)
	– Specifies slot and type of learning content	– Cognitive operations in a sequence for a (joint) goal
		– Long-term effects on memory
		– Specification of learning content possible because it is embedded in familiar goal-directed sequence of actions

Our concrete parameterizations of pragmatic frames.

Meaning: Connection to the Cognitive Processes

We call the set of effects that a frame has on memory processes (i.e., the cognitive functions involved in the frame) the meaning of a pragmatic frame. Whereas, of course, memory effects are present for both interaction partners (see Figure 1), we focus on the learner side. In line with Bruner’s (1983) surface and deep structure, we differentiate the surface from the deep meaning of a pragmatic frame. The surface meaning denotes the set of cognitive operations associated with the pragmatic frame. We presume that these emerge with a child’s growing interactional experience. For example, Nomikou et al. (2013) showed that mothers use sensitivities toward motion and intermodal synchrony to educate their infant’s gaze behavior, and that this develops into a convention. As an effect, gazing at the interaction partner communicates a referential expectation (Senju and Csibra, 2008; Szufnarowska et al., 2014). The surface meaning consists of such conventionalized signals; that is, individual elements of behavioral patterns (pragmatic acts) that are known to the learner from previous interactions, already acquired pragmatic frames, or constituents of such. They can be basic and automated in terms of being reactive behaviors, but bear some dispositions that are co-constructed with the partner: for example, when a tutor points to an object, the learner not only follows the gesture but also expects a referent (Gliga and Csibra, 2009). These operations do not necessarily appear constantly, but may vary to a similar degree as the constituents of the surface structure. Take, for example, the way blind people perceive a new object via touch or hearing (Bigelow, 2003). Although this way will differ in the type of cognitive operations needed to process the perception, it results in the same deep structure. The patterns of the surface meaning thus create anticipation of the learning content on the cognitive level.

We use the deep meaning of the pragmatic frame to refer to the goal-directed cognitive operations involved in the processing of, for example, its learning content. The deep structure differs from the surface structure in that learners will experience more interactions and more variability in the performance of the behavioral sequence. As a consequence, they will be able to extract the invariant regularities and generalize the interaction structure. This extraction might be a kind of embodied abstraction (Binder and Desai, 2011) from concrete modality-specific memory to amodal memory. Although the understanding of the generalization process is still nebulous, we believe that it produces a kind of schema/construction (Nelson, 1974; Behrens, 2009; Binder and Desai, 2011). Constructions make a slot available to users that can take different forms: a non-verbal behavior within joint engagement (Nelson, 2007) or creative verbal behavior (Lieven et al., 2003). That is, children can expect a particular form of interaction (see also the deep syntax for more details), because the deep syntax allows them to tune into the content. For example, in a labeling frame, a learner will expect that the utterance “this is a...” performed with a pointing gesture toward an object is about associating a word with this object. In this frame, the learner’s role is to follow the gesture, extract a single object from the complex scene, and memorize its label in order to recall it when the referent is present. Thus, the learning slot affects memory but is linked to the pragmatic role that the learner takes in this frame. In other words, gaining a grasp of the deep meaning of pragmatic frames is equivalent to knowing what the interaction is about (Lock, 1978).

Thus, cognitive skills are linked to pragmatic skills as an important and innovative aspect of the pragmatic frames concept proposed here (see Figure 1). With pragmatic skills, we refer to children’s ability to contextualize their (verbal and non-verbal) behavior for a specific purpose. We predict that training with pragmatic frames should lead to an improvement in related cognitive skills that can be assessed by testing slow mapping abilities (in an interactive setting with an experimenter) and language processing (in an eye-tracking setting). Thom and Sandhofer (2009) have already provided some support for this prediction by demonstrating that 20-month-old children learned color words more robustly when exposed to more instances during training. We assume that the training in this study advanced the children’s ability to prioritize the colors of the objects in their memories.

For the meaning of pragmatic frames in general, we hypothesize that not all aspects within a frame will be learned at once, but that learning will proceed from the surface level (because recurring, directly observable patterns are easier to identify) to the deep level and from the syntax level to the underlying cognitive level. This indicates the need for different attentional processes (i.e., from surface to cognitive level) (Krauzlis et al., 2014). For example, turn taking is learned very early in a child’s development (Kaye and Wells, 1980) and enables a child to develop such cognitive operations as eliciting, expecting, and waiting for somebody’s response. These operations can be used on a surface level as a first encounter with the meaning. Our hypothesis is that observations of productive behavior (elements of the surface level) might be acquired before an understanding of behavior (operations needed on the cognitive level) in situations in which communication is scripted and thus transparent. In fact, Salas Poblete (2011) showed that in highly structured triadic situations in which they could imitate a model, 2-year-old learners displayed a productive communicative behavior (an appropriate gesture for a referent) without grasping the underlying robust meaning of a pragmatic frame that would allow them to transfer it to a new exemplar of the referent.

Syntax of Pragmatic Frames

We will call the sequence of verbal and non-verbal actions that characterize the appearance of a pragmatic frame its syntax. Like Bruner (1983), we define the surface syntax as the observable sequence of behaviors constituting the pragmatic frame. The surface syntax comprises, amongst others, the adequate sensory means, the possible orders of behavioral units, and information about actors. Surface syntax also specifies the sequence of cognitive actions. We assume that such cognitive operation sequences need to be learned (and automatized). One example of how a sequence can change the subsequent operations is given in Moll et al. (2006). In this study, children at the age of 14, 18, and 24 months witnessed an adult expressing excitement while looking to the side on which an object was located. In a condition in which the object was new for the adult, children responded by attending to the whole object. In a condition in which the child and the adult had previously played with the object, children responded by attending to a specific part of the object or by expecting another object of interest to be present in the room. Clearly, this behavior is not a part of the operations and roles related to the common frame in which a learner encounters a new object.

For word learning, deep syntax can specify the slot for the (learning) content (i.e., where it is in the sequence) and the type of content (e.g., whether a noun or a color is learned). The function of deep syntax is twofold: first, it reflects the generalizability of the interaction structure (because the invariant parameters will be gathered across various experiences); second, it contains information about the (learning) content and thus what the sequence is about—this constitutes a link to the deep meaning (see above).

Take the action of pointing to an object and labeling it. Performing this act, a competent speaker knows that its goal has to be framed by (1) looking at the other person, (2) pointing in the direction of an object, and (3) uttering its label. The role of the learner is then to recall this label in the presence of the referent. When learning new words, this sequence will be repeated with varying content in only some slots. Thus, the learning dimensions within frames are limited to operations such as to identifying the slot within a known frame and to processing the new information in the slot leading to a knowledge update.

Crucially, the slot with the learning content (e.g., a new word) works successfully only when the learner is able to apply the content (e.g., a new word) appropriately (e.g., by choosing the correct object from an array). Once cognitive operations are set up in line with a collaborative behavior for a specific learning task, the changing elements likely become easy to pick up. Note, however, that the element will be picked up only because it is crucial for the joint goal (e.g., the tutor expects the learner to choose the correct object and to give it to the tutor). The (learning) content, which is put into a particular slot, links the deep syntax to the deep meaning.

For learning, the relevant content is given by the deep level of both syntax and meaning. Whereas a pragmatic frame that appears to be situated solely on the surface level does not seem to contain a slot for learning in the explicit learning sense (compared to, e.g., the frame of greeting someone), the learner can prescind from this frame to create a slot for learning—for example, whom to greet and whom not to greet. Clearly, we need to specify which circumstances can push the learner forward: Is it the tutor, the cross-situational comparisons, or the negative examples? We presume that not only the deep meaning but also the deep syntax are involved in the schematization process (Behrens, 2009; Fischer, 2015) that results in abstract representations. This idea is compatible with Nelson’s (1974) suggestion that the functional core of a concept for individual words becomes synthesized from concrete acts and their various relationships. It is also compatible with Barrett et al. (1991) multiroute model of early lexical development. They suggest that context-bound words (initially mapped onto an event representation) and social-pragmatic words might be learnt differently from referential words such as nominals or non-nominals (initially mapped onto a prototype). We think that a frame of use for these words may well influence the kind of representations that arise. A concrete examination of word learning frames might help to verify this theoretical assumption. Roy et al. (2015) have suggested that in order to acquire verbs (in contrast to nouns), children might need the support of the activities taking place in a particular space.

Different frames might result in different forms of representations. Input can certainly help children to organize their knowledge (Gelman, 2009) and to exercise cross-situational comparisons (Gelman et al., 2005). In addition, the syntax of a frame can vary to some extent from person to person. Thus, children organize their conceptual knowledge on the basis of private experience that is “different from the adult’s and from the conventional meaning” (Nelson, 2007, p. 124). The form of the parts that constitute the syntax varies according to the utterances and tokens used (e.g., when uttering the label in the pointing–labeling frame described above, possible tokens include X [ = label], it’s an X, that’s an X, there’s an X [cf. Bruner, 1983, p. 79], intonation, prosody, and pause lengths). Young infants, however, are presented with a stable caregiver’s behavior on which they can rely. Certainly, the syntax of a pragmatic frame is highly conventionalized and “cannot be specified independently of the perceptions of the participants” (Bruner, 1983, p. 133). We assume that children need to experience a lot of variations on the surface level to grasp the deep meaning. The role of variation is compatible with what Clark (1993) proposes as the principles of contrast and conventionality that guide children’s word learning. We view pragmatic frames as interaction protocols that eventually become conventionalized. And when words are applied in different frames, these contrasts might advance their schematization (see above).

Hierarchy of Pragmatic Frames

Currently, one of the major challenges in this alternative approach is to determine how far formats may be modular; that is, how far they may be composed to form bigger units or recomposed to form other sequences. Can a pointing gesture to an object already be a pragmatic frame? Or is it just a part of it? Humans have the capacity to assemble a new structure from previously experienced elements (Davachi and DuBrow, 2015). Hence, it is likely that pragmatic frames can be created ad hoc from known elements. Heller and Rohlfing (2015) found that children as young as 13 months create new gestural practices when narrating a picture. Yet, this novelty occurred in the familiar context of joint book readings. For verbal behavior, Lieven et al. (2003) showed how a 2-year-old child was able to perform some operations (such as substitute, add-on) on previously heard utterances.

Bruner (1983) recognizes that:

Formats are also modular in the sense of being accessible as subroutines for incorporation in larger scale, long-term routines. A greeting format, for example, can be incorporated in a larger scale routine involving other forms of joint action. In this sense, any given format may have a hierarchical structure, parts being interpretable in terms of their placement in a larger structure. The creation of higher-order formats by incorporation of subroutine formats is one of the principal sources of presupposition. What is incorporated becomes implicit or presupposed (p. 133).

Along these lines, we agree with Bruner (1983) that:

Formats “grow” and can become as varied and complex as necessary. Their growth is effected in several ways. They may in time incorporate new means or strategies for the attainment of goals, including symbolic or linguistic ones. They may move toward coordination of the goals of the two partners not only in the sense of “agreement,” but also with respect to a division of a labor and a division of initiative. And they may become conventionalized or canonical in fashion that permits others within a symbolic community (e.g., a “speech community”) to enter the format in a provisional way to learn its special rules (p. 132–133).

Nonetheless, the challenge is still to create a model that can grow by (1) accepting new means for the effects, (2) modulating goals, and (3) accepting new frames when they become conventionalized, that is, a part of an interactional routine.

Pragmatic Frames Evoke an Interpretation of a Situation

To date, investigations of pragmatic frames have concentrated on whether and at which age children master a particular routine. For example, Franco and Butterworth (1996) have shown that 16-month-olds have already learned to visually check whether their interlocutor is attentive before they point to something. van der Goot et al. (2014) conducted a series of experiments showing which conditions have to be fulfilled for infants to point and what children expect their partners to do in a particular context (Thorgrimsson et al., 2014, 2015). This research attests to conventionality and children’s growing expectations that communicative situations will conform to a particular structure. Children expect a structure not only on a syntax level but also on the level of meaning. Matthews et al. (2010) showed that children usually expect only one label to refer to an object. They broke these “referential pacts” (Matthews et al., 2010, p. 749) by having different experimenters assign different labels to the same objects. When responding to the new labeling, 3-year-olds were slower, suggesting a greater cognitive load. Metzing and Brennan (2003) found that children demonstrated social-cognitive abilities unlike those of adults because they expected referential pacts to persist across experimenters. Some children protested explicitly: “While they understood that the alternative terms were intended to refer to the same object, they were very keen to pass normative judgment on term use and did not appear to fully appreciate that different people might take different perspectives on an object” (Matthews et al., 2010, p. 756).

In our opinion, previous experimental research has focused on very specific pragmatic frames (mostly labeling) and described their functions for a specific kind of learning. What is still lacking, however, is a broader perspective that accounts for other learning situations (Nakao and Andrews, 2014) and thus other possible pragmatic frames. One interesting recent study was conducted by Moore et al. (2013). They presented 3-year-olds with a hiding game and showed that the children could comprehend a novel communicative act even without the means of communication on which they typically rely. However, the children seemed “to exploit a number of everyday bodily cues in interpreting communicative intent” (Moore et al., 2013, p. 75). Thus, apparently, children are on the lookout for familiar frames that may help them to interpret an ongoing situation.

One of our studies (Salas Poblete, 2011; Rohlfing et al., 2013) provides a first methodological approach for manipulating pragmatic frames actively in the context of word learning in order to explore their influence on learning success. Specifically, we tested children’s word learning in known labeling frames and in frames consisting of new elements. For the latter, the object was highlighted by a light being switched on under it instead of using the familiar pointing gesture. We found that children still learned new words in unfamiliar frames (Salas Poblete, 2011), suggesting that new elements might slow down but not eliminate learning. This implies that young learners might be tolerant toward changes in the elements of a frame. It remains unclear whether some elements (e.g., ostension) can be interpreted more strongly as a particular frame than others (e.g., pointing).

Many open questions arise from these considerations: If a child needs to experience a situation repeatedly to accumulate information about it, we need to know what aspects of a situation are crucial for learning and whether children differ in the way they construct a situation. Crucially, we should look at the entire situation as constructed by individuals and not at isolated aspects of it. The first step could be to identify whether and how significant changes to the learning situation impose a greater cognitive load on learners. Because the interactive roles in a particular situation seem to be crucial for establishing a pragmatic frame (Bruner, 1983), it is relevant to investigate whether children’s perception of a situation changes when they are able to reverse the roles (Carpenter et al., 2005). In a next step, future research needs to explain how a change of a frame can be detected. A solution is necessary to account for the question regarding which factors indicate that a new frame is initiated and, thus, to allow intelligent systems to act flexibly within an interaction (Vollmer et al., submitted).

Asymmetric Pragmatic Frames: Scaffolding

For language acquisition, it is important to consider the frames that caregivers establish intuitively when eliciting and training specific behavior in children. More specifically, a caregiver seems to reduce “degrees of freedom in the task to manageable limits” and to mark critical features (Wood et al., 1976, p. 99; see also Pitsch et al., 2014). Bruner (1983) recognizes that:

One special property of formats involving an infant and an adult […] is that they are asymmetrical with respect to the knowledge of the partners—one ‘knows what’s up,’ the other does not know or knows less. Insofar as the adult is willing to ‘hand over’ his knowledge, he can serve in the format as model, scaffold, and monitor until the child achieves requisite mastery. (p. 133)

Following the “interactive turn” in social cognition research (De Jaegher et al., 2010; Cowley, 2011), recent studies have revealed that an unfolding interaction provides the tutor with an opportunity to adapt to the learner’s needs over time (e.g., Fukuyama et al., 2014; Pitsch et al., 2014). Continuing this line of research, we propose that modifications by the more competent partner offer the learner the possibility to join the interaction, even though the learner might not understand every detail of it (Wrede et al., 2013). Yet, we know little about how children gain a grasp of the deep structure when acting on the surface. Future research needs to examine how far some pragmatic frames are more appropriate and efficient for training than others: In what way might their structure be more transparent than the structure of other frames? In a study with 14- to 18-month-olds, Rohlfing et al. (2015) found that specific types of pragmatic frame such as labeling and questioning routines occurred in the context of joint book reading. Importantly, both frames fostered the child’s participation in an interaction involving pointing and later answering questions and were related to children’s later vocabulary. Although also observable in other contexts such as free play, the context of joint book reading seems to richly elicit specific kinds of frames (Gelman et al., 2005; Rohlfing et al., 2015). We need to understand more about deep syntax in the sense of the cognitive operations that give rise to subsequent sequences of actions.

Individual Differences

Because pragmatic frames comprise a link between communicative and cognitive skills, individual differences might emerge in these skills (Akhtar and Gernsbacher, 2007) and in what individual exposure a child needs to learn a frame: some children may establish some kind of structure after only a few exposures, whereas others will need more repetitions to take advantage of the frame (Rohlfing et al., submitted). The literature reveals other examples of the bandwidth of important aspects when learning frames. One essential aspect has been reported by Bedford et al. (2013), who gave word-learning tasks to 2-year-old toddlers at high and low risk for autism spectrum disorders. One group of children received social feedback confirming their choice; the other group received no feedback. Results showed that children with a low risk for ASD benefited from feedback on their initial choice, and that their further performance was above chance level. In contrast, those with a high risk for ASD did not show this effect. Because their pragmatic competencies are commonly viewed as being impaired, it is possible that they have difficulties in (1) recognizing a sequence of actions as a whole/frame or (2) prioritizing information within the single elements of a frame (giving higher priority to the confirmation rather than to own choice). Future research should investigate how individuals may differ in picking up a structure and learning from it.

How Current Approaches Interface with Pragmatic Frames

Even though many current approaches interface with the idea of pragmatic frames, most of them consider the brief moment of learning but not the history of interaction. As a consequence, these approaches explain only some surface aspects of learning and do not account for the communicative foundation underlying language acquisition. In the following, we examine how some existing perspectives on language learning interface with our approach. This allows us to provide further details on pragmatic frames and contrast them with existing concepts.

Social Cues

Whereas many recent studies advertise the role of social cues in the process of mapping a word onto its referent (Horst and Samuelson, 2008; Axelsson et al., 2012), strong criticism of the mapping metaphor can be found at the core of social-pragmatic approaches (Nelson, 2007). Tomasello (2001) explicitly suggests dropping the mapping metaphor in favor of a referential area. He attributes the learning process not to two independent entities, namely, the word and its referent, but to a child who is analyzing the whole situation in terms of a joint action goal. This situation analysis focuses on a person using a symbol to manipulate the other’s attention.

Despite cogent reasons for regarding the continuation of actions rather than just a moment as contributing to language acquisition, we think that this latter focus has its own justification, because adults can shape their vocal and non-verbal actions into “categorical units of cultural communication” (Fogel, 1993, p. 29). For an adult, a particular element, such as an eye gaze, a word, or a vocalization with a particular prosody bears a potential meaning. Thus, it is likely that when learning language, children progress from utilizing larger behavioral units to short sequences/cues in order to ascribe meaning in communication. This idea is supported by studies showing that children can already recognize single elements (previously seen within a sequence of actions) and interpret them in accordance with their experience. One remarkable example is Senju and Csibra’s (2008) study revealing that 6-month-old infants can already interpret eye gaze as a cue signaling a reference to objects. Children probably become educated to such cues (Nomikou et al., 2013) and can take advantage of them in later learning processes. This advantage has been demonstrated convincingly in Horst and Samuelson’s (2008) study showing how social cues in the form of an ostensive labeling can influence children’s learning performance. Children at the age of 24 months were presented with novel objects in either an ostensive (addressed and being gazed directly when labeling an object) or a follow-in naming condition (the object was labeled while the child was manipulating it). After 5 min delay, they could remember new labels presented in the ostensive but not in the follow-in naming condition. We argue that this ostensive condition takes advantage of a labeling frame that seems to trigger some specific cognitive operations in children (long-term remembering). However, the authors themselves interpreted their findings in terms of situational cues guiding the child’s attention and memory.

The literature on cognitive and language development reveals a consensus that in natural settings, children aged about 18 months can robustly figure out an intended referent on the basis of joint attention (and some cues) even when some noise is interfering with the situation (Baldwin, 1993; Carpenter et al., 1998). What is controversial, however, is the role of social information before the age of 18 months, and at which age children become sensitive to the social information that is considered to be a means of attaining joint attention in the form of eye contact, gesture, and language.

Pruden et al. (2006) tested which information – social or perceptual – 10-month-old children would apply to associate a word with an object. Whereas perceptual information was operationalized as the salient appearance of the object, the direction of eye gaze stood for the social information. The authors found that at the age of 10 months, infants regard the salience of objects rather than the eye gaze, and they concluded that perceptual but not social cues are weighted more heavily at this age. Applying Pruden et al.’s (2006) findings in an emergentist coalition model (Hirsh-Pasek and Golinkoff, 1996; Hollich et al., 2000; Golinkoff and Hirsh-Pasek, 2006, 2008) encompassing different sources of knowledge in the process of reference resolution, indicates that when it comes to word learning, social perception as a skill is acquired later in development (see also Booth et al., 2008). For younger children, general learning factors such as the sensitivity to perceptual salience seem to be more important. We think that the critique of social cues is justified if individual cues are expected to control attention and memory. However, cues do not occur in isolation in natural interactions, but embedded in a sequence of actions and together with other cues. This provides a rich environment, and attention and memory processes take advantage of the unfolding situation—and do not just start at the moment when the word is uttered.

Flom et al. (2004) have shown cogently that when cues were presented to in 9-month-olds in combinations (which is the case in natural settings), the frequency of gaze following toward peripheral targets increased. Hence, it is likely that additional cue alongside or preceding the eye gaze presented by Pruden et al. (2006) would have guided the children’s attention to the referent more reliably than a single cue. Clearly, different timescales have to be taken into consideration (Rkaczaszek-Leonardi, 2015): the immediately preceding context and the interaction experience that the child brings into the situation. Concerning the immediate context, Liszkowski (2014) recently highlighted the importance of preceding action contexts as a source of symbolic development. More specifically, he reported findings revealing that in 12- to 14-month-old children, the outcome of the reference process differs depending on what the agent has done or seen before (see also Moll et al., 2006). Thus, joint actions are crucial and establish a context/history of interaction against which young children already interpret or use the communicative means. Whereas Liszkowski (2014) discerns the immediately preceding action contexts as an ingredient of meaningful behavior in young children starting to communicate, he barely considers the conventionalization process (Clark, 1993; see Pragmatic Frames—an Introduction and History). Waxman and Gelman (2009, p. 261) made the critical point that associationist approaches disregard “the fact that each word participates in an exquisitely detailed linguistic, social, and symbolic system.” It is not just the association that is formed in the learning process. Instead, children learn to apply particular cognitive operations (some cognitive and communicative jobs) in coordination with their partner. In other words, social cues are not only a part of the immediate context but also a part of the physical events that the child has already experienced with another person in the past. They are, therefore, parts of events with a specific interaction history (Rkaczaszek-Leonardi, 2015, p. 7).

Indeed, studies with young infants show convincingly that language as a signal possesses a unique power from early on: Words, and not just tones, induce categorization processes in infants as young as 3 months of age (Ferry et al., 2010). It is reasonable to think that the link between speech and the fundamental cognitive process of categorization might well be part of an innate endowment. However, recent research suggests that the link might be based rather on the communicative patterns – what we referred to above as communicative foundation – that the children have already experienced: In a recent study by Ferguson and Waxman (2016), 6-month-old infants were presented with videos containing interactions between persons who used ‘beeps’ to communicate with each other in a contingent way. After this exposure, infants were then tested on whether these beeps facilitated the categorization of objects. The authors found that 6-month-olds can apply an otherwise non-communicative signal to categorize objects if they experience this signal in a cooperative, turn-taking setting. We think that these findings taken together support our argument that a communicative foundation is necessary to then give rise to communicative means that are connected to cognitive operations.

Dynamic Coupling

With respect to the different timescales that give rise to meaningful behavior in communication, current pragmatic theories (Schumacher, 2014) suggest that a phase architecture drives a communicating system. Whereas in the first phase, as suggested above, the system is supplied with cues that help to generate expectations for upcoming words and actions, in a second phase, a representation can be updated by dynamic coupling known also as alignment. Beyond developmental research (see Stephens and Matthews, 2014, for a brief review), the process of alignment between interactants is known in studies with adults (Pickering and Garrod, 2004, 2013; De Jaegher et al., 2010). According to the interactive alignment account, the automatic process of alignment is observable in the verbal (Garrod and Anderson, 1987), gestural, and non-verbal (Kimbara, 2006; Bergmann and Kopp, 2012) behavior of interactants and results in aligned linguistic representations. For language acquisition, this perspective implies that interaction should not be investigated as a “mere context” in which learning takes place; instead, interaction should be seen as a part of the cognitive processes (Nomikou, 2014, pp. 43–44) allowing individuals to coordinate their cognitive operations to achieve a joint goal. Investigating learning within interactions means modifying the idea of how individual cognitive mechanisms work (Nomikou, 2014). Hsu and Fogel (2003) propose considering mother–infant dyads as a whole rather than separating them into individual participants. In fact, this unique coupling of the caregiver with the learner gives rise to observable phenomena in child-directed behavior: Visible as caregivers’ child-directed behavior, these modifications have been observed in speech (Fernald and Mazzie, 1991; Dominey and Dodane, 2004), gesture (Iverson et al., 1999; Grimminger et al., 2010), and motion (Brand et al., 2002; Rohlfing et al., 2006; Wrede et al., 2013). Their function has been appraised across disciplines as facilitating children’s recognition of the structure in language and action. The first approaches toward a social feedback loop are now being formulated (Fukuyama et al., 2014; Pitsch et al., 2014; Warlaumont et al., 2014). However, further theoretical developments need to acknowledge that caregivers’ responsiveness is not restricted to particular cultural patterns such as joint attention but rests upon joint action— as already anticipated by Shotter and Newson (1982).

The Importance of Statistical Learning and an Accumulating Linguistic Knowledge Base

In Section “Pragmatic Frames Require a History of an Interaction,” we emphasized that pragmatic frames allow us to capture different time scales. The value of information accumulating within and across situations has been recognized by some authors who propose that children do not need a conceptual representation to map a word onto its referent in specific situations. First, grounded in the sense of novelty, an association can be established by the child’s perception of a novel object that will stand out because it is unfamiliar (Mather and Plunkett, 2012). Second, children benefit from their memories across situations, and they notice which elements remain constant across multiple uses of a word and thereby ascribe a meaning to it (Akhtar and Montague, 1999; see Smith and Yu, 2008; Yu and Smith, 2012, for computational models on cross-situational learning). The ability to compare across situations is in line with Bruner (1983) who emphasizes that children’s sensitivity to invariant aspects of a situation might be a part of their cognitive endowment. Even though children’s ability to compare across situations is viewed as crucial, there are only very few models that try to explain how children integrate these experiences with online information. Although McMurray et al. (2012) provide a model that uses two timescales, this is restricted to the one frame of learning the meanings of novel words.

Certainly, more flexible models are needed that can account for the integration of timescales without being limited to one task. In addition, models need to account for the fact that children seem to differ with respect to their ability to recognize invariant aspects of a situation. One excellent example of this individual difference is the shape bias. Jones and Smith (1993, p. 132) argued that shape bias emerges as “the child learns to represent the regularities that exist between how words are used, the co-occurrence of properties in objects, and the act of attending to particular properties.” Interestingly, children who are at risk for delay in language acquisition, so-called late talkers, seem to gain a different operation from their word-learning experience than their age-mates and lack “a potentially helpful shape bias” (Jones, 2003, p. 482).

Studying disadvantaged populations provides an important way for further research to recognize which experience results in which cognitive dispositions while taking individual differences into account. In the case of late-talking children, we think that the cognitive operation of attending to a shape is a product of a labeling frame. Children who do not develop this shape bias might need more exposure to a particular frame in order to discern their “jobs,” and thus their role (Heller and Rohlfing, in preparation). In a recent study, we showed that 3-year-old children diagnosed with language impairment benefited from repetitions of the context (Rohlfing et al., submitted). Further research should focus on children’s individual differences in applying other biases or principles in language acquisition such as the whole-object assumption (Markman and Wachtel, 1988) or mutual exclusivity (Carey and Bartlett, 1978). These differences might well be explained by children’s individual experiences with particular pragmatic frames and their individual need for a more frequent or dense occurrence of such frames.

A further question is whether pragmatic frames are relevant only for young children. We think that this is not the case. For older children and adults, Quasthoff (1997, p. 59) has applied the term “discourse unit” instead of pragmatic frame to refer to verbal behaviors performed with a specific purpose such as instructing on how to play a game, discussing, narrating a story or telling a joke in a conversation. Those behaviors exhibit a global structure that establishes sequential conditions, “not only for a single next turn but for an ordered series of next turns” (p. 58). Children and adults need to learn the global structure if they are to act appropriately (see Table 1).

Even though we argue that pragmatic frames are operative in later language competence (i.e., in older children as well as adults), we agree that once children possess a communicative foundation in the form of a repertoire of pragmatic frames, they can use language within them in a more elaborated way. For example, Nelson (1974, p. 277) suggests a “functional synthesis” is achieved once the child experiences various relations (pragmatic frames) in which a specific word is involved. Clearly, it is necessary to further specify the way language skills unfold within pragmatic frames (see Lieven et al., 2003, for examples) and with growing knowledge about their variety. This is not the focus here. Nonetheless, children are helped by their accumulating linguistic knowledge base, and they can also establish the association through their linguistic experience (Gershkoff-Stowe, 2002): The more words children already know, the more they can benefit from phonological and semantic memories of non-targets that spread inhibitory activation. Gershkoff-Stowe (2002, p. 665) reported that as children practice producing individual words, “those words become stronger and more resistant to interference from lexical competitors.” Thus, knowledge that is distinct to the target word minimizes word retrieval error (Capone and McGregor, 2005, p. 1469; see also Goodman et al., 1998). In line with this claim, Gershkoff-Stowe and Hahn (2007) investigated the role of practice by training 16- to 18-month-old children to comprehend and produce nouns across several sessions. Some words were trained in 12 (high-practice sets), some in 3 (medium), and some in only 1 session (low-practice sets). Compared to a control group that learned to label familiar objects, children in the experimental group not only improved their knowledge about words from high-practice sets but also demonstrated better performance in identifying low-practice words from the previous session. The authors concluded:

The more an item is selected for comprehension or production, the stronger the level of activation will be and, hence, the greater the probability of access. This idea suggests that practice with individual words in a rapidly expanding lexicon changes the operation of the lexicon through the accumulated activation of many items (p. 691).

Similarly, Thom and Sandhofer (2009) investigated the ability of 20-month-old children to extend new color words to new instances when trained with two, four, or six different color words. They found that the more broadly children learned (and the more instances they were exposed to during training), the more they were able to extend the acquired words. Thus, the “vocabulary size within a domain was related to subsequent word learning within that same domain” (Thom and Sandhofer, 2009, p. 471).

Experience that lets children bind words with their referents can lay “the ground work in infancy for more rapid (and perhaps more hypothesis-testing-like) processes in later word learning” (Smith and Yu, 2008, p. 1566). The idea here is that concepts are not isolated entities but part of semantic networks (cf. Keil and Batterman, 1984) and interactive activities. Thus, a child with extensive knowledge accretion about adjectives who is learning a new adjective for some entity is in a very different position from a child learning the same new word who has little or no knowledge of the members of that category (Keil and Batterman, 1984, p. 232; Thom and Sandhofer, 2009).

From our perspective, these examples indicate an experience not only with a particular word class but – more importantly – with its learning frame.

Conclusion and Future Directions

We think that the concept of “pragmatic frame” helps us to understand the co-development of cognitive and communicative dispositions in children. It provides research on language acquisition with an alternative framework to the widely assumed mapping process by offering a complex context of actions and goals that enfolds at different time scales and needs to be negotiated between interactants. As outlined above, we think that our extended notion of pragmatic frames goes beyond the frameworks currently offered. Although our concept relates strongly to Bruner’s definition, we extend it by (1) clearly differentiating between the meaning and the syntax of a pragmatic frame and (2) identifying the cognitive layer. Central to our view is the assumption that pragmatic frames are not limited to the agent’s own actions, but involve pragmatic aspects of an ongoing situation organized around an interactional goal.

Can pragmatic frames be conceived as speech acts? As already noted above, pointing to an object and labeling it can be seen as a speech act, and Bruner (1983, p. 133) points to the possibility that “eventually, formats provide the basis for speech acts and their constraining felicity conditions. We learn how to invoke them by speech.” Speech act theories focus on particular linguistic constructions occurring in different pragmatic contexts. For example, the indication “it is sweet” when referring to a new candy bar can be interpreted as either a description or a warning. Which illocutionary force it has will be determined by the preceding context of the indication. Austin (1962) differentiates further between the illocutionary act (which is intended by the speaker) and the perlocutionary act (the effect of an act). In our approach, the illocutionary and perlocutionary force are present not only in the surface meaning – because this enables individuals to act and react – but also in the deep meaning – because this is formed by the underlying deep syntax. From the developmental perspective, we second Bruner (1983) and view speech acts as patterns of verbal behavior that can eventually be used explicitly. However, we are convinced that the pragmatic frame is a more basic structure underlying speech acts.

Communication practice is another concept that is less well-known in cognitive science but stems from research exploiting communication within Conversation Analysis. In linguistic research, communicative practices refer to co-constructed acts of “linguistic habitus” (Hanks, 1987, p. 668). Hanks (1987) emphasizes the cultural dependencies of such discourse practices/genres, characterizing them as “elements of linguistic habitus, consisting of stylistic, thematic, and indexical schemata on which actors improvise in the course of linguistic production” (Hanks, 1987, p. 668). This line of research focuses on the co-constructive effort involved in these practices, and as a method specifies the tasks as well as roles that individuals are fulfilling in a sequence (see, e.g., Forrester, 2013; Rossmanith et al., 2014).

Because pragmatic frames consist of a sequence of actions, they resemble the concept of scripts (Schank and Abelson, 1977). Two other related aspects are (1) the hierarchy, because scripts are described as being nested in each other (a feature we mentioned in Section “Hierarchy of Pragmatic Frames”) and (2) the presence of slots. However, to the best of our knowledge, the concept of scripts does not differentiate between the syntax and the meaning. Instead, the focus lies on the surface structure and captures the sequentiality of an event in a particular context.

To summarize the similarities and differences, we think that current approaches to semantics highlight the role of form-meaning pairings when attempting to specify which factors of the context shape the meaning. In, contrast, our approach takes a broader perspective (in time and locus) on the interaction as a source for semantics. We argue that the communicative foundation is necessary for children to learn language. We view it as emerging from (1) the routines that have become established (consisting of a particular syntax) in order to accomplish (2) a goal (i.e., the meaning) by means of (3) joint contributions split into the participants’ roles in these routines. This behavior evokes (4) cognitive dispositions. As a result, cognitive factors are embedded in the child’s social experience (Nelson, 2007, p. 45), and meaning is distributed among participants (Shotter and Newson, 1982) and among the different time scales (e.g., Rkaczaszek-Leonardi et al., 2014) encompassing memories of established routines.

For further research we propose the following hypotheses and claims:

–
Individual differences in experiencing pragmatic frames (their quantity and variety) will be reflected in children’s later language skills as interactions guide them toward creating new slots or recognizing elements of frames that enable them to differentiate between known and new sequences.
–
Without knowing the frame, a child’s understanding of language will be impaired.
–
Learning a new frame (and thus new verbal behavior) involves a close coupling between tutor and learner. This coupling might thus be crucial for the overall success of learning.
–
Children make sense of new frames by assembling known elements of the deep and surface structure.

Importantly, the concept of pragmatic frames has methodological implications. Pragmatic frames emphasize the fact that a word is not learned in a binary fashion but that the understanding of a concept and its linguistic representation emerges gradually and can be measured at different levels (Nomikou et al., 2016, accepted). As stated above, we define pragmatic frames as consisting of processes at the cognitive and pragmatic level (Figure 1). Accordingly, learning can be measured at those levels. More specifically,

–
The development of the understanding of a new word can be measured through related cognitive functions. Visual attention within a pragmatic frame can indicate whether, for example, the goal of an action has been understood. Demonstrations of familiarity with certain objects or words indicate that a first step is being taken toward understanding the meaning of an object within an action. Motor control, such as the way in which an object is grasped, can also be an indicator of a learning process.
–
The development of the understanding of communicative or pragmatic functions within a frame can be measured by expectations or behavior of the learner: for example, that feedback is expected by the tutor after demonstrating an action, and so forth.

By refocusing on the concept of pragmatic frames in this article, our main aim was to shift attention away from the learning outcome toward the learning process. Nonetheless, at the end of this article, we have to admit that some aspects remain uncovered: These pertain to the process of generalization and transfer. For example, children’s progressive use of verbal behavior needs a special focus to achieve a description of the “progressive liberation of utterances from physical contexts of co-action” (Rkaczaszek-Leonardi et al., 2015, p. 15). Empirical research has to reveal how words become powerful in enriching a situation. Whereas some work has been done on the progressively creative use of words (Barrett et al., 1991; Lieven et al., 2003), little is known about the liberation from physical contexts of co-action. Another important question is how verbal knowledge accumulated within frames becomes decontextualized and transferred from one situation to another. This focus on the development of cognitive processing can be found in some theoretical and empirical work (e.g., Nelson, 1974; Barrett et al., 1991; Rohlfing, 2006). A meta-analysis identifying word learning scenarios as particular pragmatic frames and comparing their effects might be a next helpful step toward uncovering the differences in the flexible use of verbal knowledge.

Statements

Author contributions

KR, BW, A-LV, and P-YO developed the framework and wrote the paper. The paper benefits from interdisciplinary collaboration and expertise in language acquistion (KR, P-YO), developmental robotics (BW, P-YO, A-LV, KR) and human–robot–interaction (A-LV, BW, P-YO).

Funding

. This work was funded as part of the Cluster of Excellence Cognitive Interaction Technology ‘CITEC’ (EXC 277), Bielefeld University. We acknowledge support for the article processing charge by the Deutsche Forschungsgemeinschaft and the Open Access Publication Fund of Bielefeld University. The final version of this paper benefited immensely from the constructive comments of the reviewers.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1
AkgunB.CakmakM.YooJ. W.ThomazA. L. (2012). “Trajectories and keyframes for kinesthetic teaching: a human-robot interaction perspective,” in Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction,New York, NY: ACM. 391–398.
- Google Scholar
2
AkhtarN.GernsbacherM. A. (2007). Joint attention and vocabulary development: A critical look.Lang. Linguist. Compass1195–207. 10.1111/j.1749-818X.2007.00014.x
- CrossRef
- Google Scholar
3
AkhtarN.MontagueL. (1999). Early lexical acquisition: the role of cross-situational learning.First Lang.19347–358. 10.1177/014272379901905703
- CrossRef
- Google Scholar
4
AustinJ. L. (1962). How to Do Things With Words.Cambridge, MA: Harvard University Press.
- Google Scholar
5
AxelssonE. L.ChurchleyK.HorstJ. S. (2012). The right thing at the right time: why ostensive naming facilitates word learning.Front. Psychol.2:88. 10.3389/fpsyg.2012.00088
- CrossRef
- Google Scholar
6
BahrickL. E.NewellL. C. (2008). Infant discrimination of faces in naturalistic events: actions are more salient than faces.Dev. Psychol.44983–996. 10.1037/0012-1649.44.4.983
- CrossRef
- Google Scholar
7
BaldwinD. A. (1993). Early referential understanding: infants’ ability to recognize referential acts for what they are.Dev. Psychol.29832–843. 10.1037/0012-1649.29.5.832
- CrossRef
- Google Scholar
8
BaldwinD. A.BairdJ. A.SaylorM. M.ClarkM. A. (2001). Infants parse dynamic action.Child Dev.72708–717. 10.1111/1467-8624.00310
- CrossRef
- Google Scholar
9
BarrettM.HarrisM.ChasinJ. (1991). Early lexical development and maternal speech: a comparison of children’s initial and subsequent uses of words.J. Child Lang.1821–40. 10.1017/S0305000900013271
- CrossRef
- Google Scholar
10
BatesonG. (1955). A theory of play and fantasy.Psychiatr. Res. Rep.239–51.
- Google Scholar
11
BedfordR.GligaT.FrameK.HudryK.ChandlerS.JohnsonM. H.et al (2013). Failure to learn from feedback underlies word learning difficulties in toddlers at risk for autism.J. Child Lang.4029–46. 10.1017/S0305000912000086
- CrossRef
- Google Scholar
12
BehrensH. (2009). Usage-based and emergentist approaches to language acquisition.Linguistics47383–411. 10.1515/LING.2009.014
- CrossRef
- Google Scholar
13
BeisertM.ZmyjN.LiepeltR.JungF.PrinzW.DaumM. M. (2012). Rethinking “rational imitation” in 14-month-old infants: a perceptual distraction approach.PLoS ONE7:e32563. 10.1371/journal.pone.0032563
- CrossRef
- Google Scholar
14
BergmannK.KoppS. (2012). “Gestural alignment in natural dialogue,” in Proceedings of the 34th Annual Conference of the Cognitive Science Society,edsPeeblesD.MiyakeN.CooperR. P. (Austin, TX: Cognitive Science Society), 1326–13331.
- Google Scholar
15
BigelowA. E. (2003). The development of joint attention in blind infants.Dev. Psychopathol.15259–275. 10.1017/S0954579403000142
- CrossRef
- Google Scholar
16
BillardA.CalinonS.DillmannR.SchaalS. (2008). “Robot programming by demonstration,” in Springer Handbook of Robotics,edsSicilianoB.OussamaK. (Berlin: Springer), 1371–1394.
- Google Scholar
17
BinderJ. R.DesaiR. H. (2011). The neurobiology of semantic memory.Trends Cogn. Sci.15527–536. 10.1016/j.tics.2011.10.001
- CrossRef
- Google Scholar
18
BjorklundD. F. (1987). How age changes in knowledge base contribute to the development of children’s memory: an interpretive review.Dev. Rev.793–130. 10.1016/0273-2297(87)90007-4
- CrossRef
- Google Scholar
19
BoothA. E.McGregorK. K.RohlfingK. J. (2008). Socio-pragmatics and attention: contributions to gesturally guided word learning in toddlers.Lang. Learn. Dev.4179–202. 10.1080/15475440802143091
- CrossRef
- Google Scholar
20
BrandR. J.BaldwinD. A.AshburnL. A. (2002). Evidence for ‘motionese’: modifications in mothers’ infant-directed action.Dev. Sci.572–83. 10.1016/j.jecp.2011.10.012
- CrossRef
- Google Scholar
21
BrunerJ. (1983). Child’s Talk: Learning to Use Language.New York, NY: Norton.
- Google Scholar
22
CangelosiA.RigaT. (2006). An embodied model for sensorimotor grounding and grounding transfer: experiments with epigenetic robots.Cogn. Sci.30673–689. 10.1207/s15516709cog0000_72
- CrossRef
- Google Scholar
23
CaponeN. C.McGregorK. K. (2005). The effect of semantic representation on toddlers’ word retrieval.J. Speech Lang. Hear Res.481468–1480.
- Google Scholar
24
CareyS.BartlettE. (1978). Acquiring a single new word.Proc. Stanford Child Lang. Conf.1517–29.
- Google Scholar
25
CarpenterM.NagellK.TomaselloM.ButterworthG.MooreC. (1998). Social cognition, joint attention, and communicative competence from 9 to 15 months of age.Monogr. Soc. Res. Child Dev.631–143. 10.2307/1166214
- CrossRef
- Google Scholar
26
CarpenterM.TomaselloM.StrianoT. (2005). Role reversal imitation and language in typically developing infants and children with autism.Infancy8253–278. 10.1207/s15327078in0803_4
- CrossRef
- Google Scholar
27
ClarkE. V. (1993). The Lexicon in Acquisition.Cambridge: Cambridge University Press.
- Google Scholar
28
CowleyS. J. (2011). Taking a language stance.Ecol. Psychol.23185–209. 10.1080/10407413.2011.591272
- CrossRef
- Google Scholar
29
CroftW. (2007). “Construction grammar,” in The Oxford Handbook of Cognitive Linguistics,edsGeeraertsD.CuyckensH. (New York, NY: Oxford University Press), 463–508.
- Google Scholar
30
CsibraG. (2010). Recognizing communicative intentions in infancy.Mind Lang.25141–168. 10.1111/j.1468-0017.2009.01384.x
- CrossRef
- Google Scholar
31
CsibraG.GergelyG. (2007). “Obsessed with goals”: functions and mechanisms of teleological interpretation of actions in humans.Acta Psychol.12460–78. 10.1016/j.actpsy.2006.09.007
- CrossRef
- Google Scholar
32
CsibraG.GergelyG. (2009). Natural pedagogy.Trends Cogn. Sci.13148–153. 10.1016/j.tics.2009.01.005
- CrossRef
- Google Scholar
33
CuayáhuitlH. (2015). “Robot learning from verbal interaction: a brief survey,” in Proceedings of the New Frontiers in Human-Robot Interaction,Canterbury, 62.
- Google Scholar
34
DavachiL.DuBrowS. (2015). How the hippocampus preserves order: the role of prediction and context.Trends Cogn. Sci.1992–99. 10.1016/j.tics.2014.12.004
- CrossRef
- Google Scholar
35
De JaegherH.Di PaoloE.GallagherS. (2010). Can social interaction constitute social cognition?Trends Cogn. Sci.14441–447. 10.1016/j.tics.2010.06.009
- CrossRef
- Google Scholar
36
DomineyP. F.BoucherJ. D. (2005). Developmental stages of perception and language acquisition in a perceptually grounded robot.Cogn. Syst. Res.6243–259. 10.1016/j.cogsys.2004.11.005
- CrossRef
- Google Scholar
37
DomineyP. F.DodaneC. (2004). Indeterminacy in language acquisition: the role of child directed speech and joint attention.J. Neurolinguist.17121–145. 10.1016/S0911-6044(03)00056-3
- CrossRef
- Google Scholar
38
Falck-YtterT.GredebäckG.von HofstenC. (2006). Infants predict other people’s action goals.Nat. Neurosci.9878–879. 10.1038/nn1729
- CrossRef
- Google Scholar
39
FarrarM. J.FriendM. J.ForbesJ. N. (1993). Event knowledge and early language acquisition.J. Child Lang.20591–606. 10.1017/S0305000900008497
- CrossRef
- Google Scholar
40
FergusonB.WaxmanS. R. (2016). What the [beep]? Six-month-olds link novel communicative signals to meaning.Cognition146185–189. 10.1016/j.cognition.2015.09.020
- CrossRef
- Google Scholar
41
FerryA. L.HesposS. J.WaxmanS. R. (2010). Categorization in 3- and 4-month-old infants: an advantage of words over tones.Child Dev.81472–479. 10.1111/j.1467-8624.2009.01408.x
- CrossRef
- Google Scholar
42
FillmoreC. J. (1982). “Frame semantics,” in Linguistics in the Morning Calm,ed.The Linguistic Society of Korea (Seoul: Hanshin), 111–137.
- Google Scholar
43
FischerK. (2015). Conversation, construction grammar, and cognition.Lang. Cogn.7563–588. 10.1017/langcog.2015.23
- CrossRef
- Google Scholar
44
FlomR.DeákG. O.PhillC. G.PickA. D. (2004). Nine-month-olds’ shared visual attention as a function of gesture and object location.Infant Behav. Dev.27181–194. 10.1016/j.infbeh.2003.09.007
- CrossRef
- Google Scholar
45
FogelA. (1993). “Two principles of communication: co-regulation and framing,” in New Perspectives in Early Communicative Development,edsNadelJ.CamaioniL. (London: Routledge), 9–22.
- Google Scholar
46
FogelA.GarveyA. (2007). Alive communication.Infant Behav. Dev.30251–257. 10.1016/j.infbeh.2007.02.007
- CrossRef
- Google Scholar
47
ForresterM. A. (2013). Mutual adaptation in parent-child interaction: learning how to produce questions and answers.Interact. Stud.14190–211. 10.1075/is.14.2.03for
- CrossRef
- Google Scholar
48
FrancoF.ButterworthG. (1996). Pointing and social awareness: declaring and requesting in the second year.J. Child Lang.23307–336. 10.1017/S0305000900008813
- CrossRef
- Google Scholar
49
FrnaldA.MazzieC. (1991). Prosody and focus in speech to infants and adults.Dev. Psychol.27209–221. 10.1037/0012-1649.27.2.209
- CrossRef
- Google Scholar
50
FukuyamaH.QinS.KanakogiY.NagaiY.AsadaM.Myowa-YamakoshiM. (2014). Infant’s action skill dynamically modulates parental action demonstration in the dyadic interaction.Dev. Sci.181006–1013. 10.1111/desc.12270
- CrossRef
- Google Scholar
51
GarrodS.AndersonA. (1987). Saying what you mean in dialogue: a study in conceptual and semantic co-ordination.Cognition27181–218. 10.1016/0010-0277(87)90018-7
- CrossRef
- Google Scholar
52
GelmanS. A. (2009). Learning from others: children’s construction of concepts.Annu. Rev. Psychol.60115–140. 10.1146/annurev.psych.59.103006.093659
- CrossRef
- Google Scholar
53
GelmanS. A.ChesnickR. J.WaxmanS. R. (2005). Mother–child conversations about pictures and objects: referring to categories and individuals.Child Dev.761129–1143. 10.1111/j.1467-8624.2005.00876.x-i1
- CrossRef
- Google Scholar
54
GergelyG.WatsonJ. S. (1999). Early socio-emotional development: contingency perception and the social-biofeedback model.Early Soc. Cogn.60101–136.
- Google Scholar
55
Gershkoff-StoweL. (2002). Object naming, vocabulary growth, and the development of word retrieval abilities.J. Mem. Lang.46665–687. 10.1006/jmla.2001.2830
- CrossRef
- Google Scholar
56
Gershkoff-StoweL.HahnE. R. (2007). Fast mapping skills in the developing lexicon.J. Speech Lang. Hear Res.50682–697. 10.1044/1092-4388(2007/048)
- CrossRef
- Google Scholar
57
GersonS. A.WoodwardA. L. (2014). Learning from their own actions: the unique effect of producing actions on infants’ action understanding.Child Dev.85264–277. 10.1111/cdev.12115
- CrossRef
- Google Scholar
58
GligaT.CsibraG. (2009). One-year-old infants appreciate the referential nature of deictic gestures and words.Psychol. Sci.20347–353. 10.1111/j.1467-9280.2009.02295.x
- CrossRef
- Google Scholar
59
GoffmanE. (1974). Frame Analysis: An Essay on the Organization of Experience.Cambridge, MA: Harvard University Press.
- Google Scholar
60
GoldbergA. E. (2003). Constructions: a new theoretical approach to language.Trends Cogn. Sci.7219–224. 10.1016/S1364-6613(03)00080-9
- CrossRef
- Google Scholar
61
GolinkoffR. M.Hirsh-PasekK. (2006). Baby wordsmith from associationist to social sophisticate.Curr. Dir. Psychol. Sci.1530–33. 10.1111/j.0963-7214.2006.00401.x
- CrossRef
- Google Scholar
62
GolinkoffR. M.Hirsh-PasekK. (2008). How toddlers begin to learn verbs.Trends Cogn. Sci.12397–403. 10.1016/j.tics.2008.07.003
- CrossRef
- Google Scholar
63
GoodmanJ. C.McDonoughL.BrownN. B. (1998). The role of semantic context and memory in the acquisition of novel nouns.Child Dev.691330–1344. 10.1111/j.1467-8624.1998.tb06215.x
- CrossRef
- Google Scholar
64
GrimmingerA.RohlfingK. J.StennekenP. (2010). Children’s lexical skills and task demands affect gestural behavior in mothers of late-talking children and children with typical language development.Gesture10251–278. 10.1075/gest.10.2-3.07gri
- CrossRef
- Google Scholar
65
HanksW. F. (1987). Discourse genres in a theory of practice.Am. Ethnol.14668–692. 10.1525/ae.1987.14.4.02a00050
- CrossRef
- Google Scholar
66
HarrisM.BarrettM.JonesD.BrookesS. (1988). Linguistic input and early word meaning.J. Child Lang.1577–94. 10.1017/S030500090001206X
- CrossRef
- Google Scholar
67
HellerV.RohlfingK. J. (2015). “From establishing reference to representing events independent from the here and now: a longitudinal study of depictive practices in early childhood,” in Gesture and Speech in Interaction Proceedings,4th Edn, edsFerréG.TuttonM. (Nantes: University of Nantes), 143–148.
- Google Scholar
68
Hirsh-PasekK.AdamsonL. B.BakemanR.OwenM. T.GolinkoffR. M.PaceA.et al (2015). The contribution of early communication quality to low-income children’s language success.Psychol. Sci.261071–1083. 10.1177/0956797615581493
- CrossRef
- Google Scholar
69
Hirsh-PasekK.GolinkoffR. M. (1996). The Origins of Grammar. Evidence from Early Language Comprehension.Cambridge, MA: MIT.
- Google Scholar
70
HollichG. J.Hirsh-PasekK.GolinkoffR. M.BrandR. J.BrownE.ChungH. L.et al (2000). Breaking the Language Barrier: An Emergentist Coalition Model for the Origins of Word Learning.New York, NY: Wiley.
- Google Scholar
71
HorstJ. S.SamuelsonL. K. (2008). Fast mapping but poor retention by 24-month-old infants.Infancy13128–157. 10.1080/15250000701795598
- CrossRef
- Google Scholar
72
HsuH. C.FogelA. (2003). Stability and transitions in mother-infant face-to-face communication during the first 6 months: a microhistorical approach.Dev. Psychol.391061–1082. 10.1037/0012-1649.39.6.1061
- CrossRef
- Google Scholar
73
IversonJ. M.CapirciO.LongobardiE.CaselliM. C. (1999). Gesturing in mother-child interactions.Cogn. Dev.1457–75. 10.1016/S0885-2014(99)80018-5
- CrossRef
- Google Scholar
74
JonesS. S. (2003). Late talkers show no shape bias in a novel name extension task.Dev. Sci.6477–483. 10.1111/1467-7687.00304
- CrossRef
- Google Scholar
75
JonesS. S.SmithL. B. (1993). The place of perception in children’s concepts.Cogn. Dev.8113–139. 10.1016/0885-2014(93)90008-S
- CrossRef
- Google Scholar
76
KanakogiY.ItakuraS. (2011). Developmental correspondence between action prediction and motor ability in early infancy.Nat. Commun.2:341. 10.1038/ncomms1342
- CrossRef
- Google Scholar
77
KayeK.WellsA. J. (1980). Mothers’ jiggling and the burst—pause pattern in neonatal feeding.Infant Behav. Dev.329–46. 10.1016/S0163-6383(80)80005-1
- CrossRef
- Google Scholar
78
KeilF. C.BattermanN. (1984). A characteristic-to-defining shift in the development of word meaning.J. Verb. Learn. Verb. Behav.23221–236. 10.1016/S0022-5371(84)90148-8
- CrossRef
- Google Scholar
79
KendonA. (1985). “Behavioral foundations for the process of frame attunement in face-to-face interaction,” in Discovery Strategies in the Psychology of Action,edsGinsburgG. P.BrennerM.von CranachM. (Orlando, FL: Academic Press), 229–253.
- Google Scholar
80
KimbaraI. (2006). On gestural mimicry.Gesture639–61. 10.1075/gest.6.1.03kim
- CrossRef
- Google Scholar
81
KrauzlisR. J.BollimuntaA.ArcizetF.WangL. (2014). Attention as an effect not a cause.Trends Cogn. Sci.18457–464. 10.1016/j.tics.2014.05.008
- CrossRef
- Google Scholar
82
LangackerR. (1987). Foundations of Cognitive Grammar: Theoretical Prerequisites,Vol. 1. Standford, CA: Stanford University Press
- Google Scholar
83
LegersteeM. (2005). Infants’ Sense of People. Precursors to a Theory of Mind.Cambridge: Cambridge University Press.
- Google Scholar
84
LievenE.BehrensH.SpearesJ.TomaselloM. (2003). Early syntactic creativity: a usage-based approach.J. Child Lang.30333–370. 10.1017/S0305000903005592
- CrossRef
- Google Scholar
85
LiszkowskiU. (2014). Two sources of meaning in infant communication: preceding action contexts and act-accompanying characteristics.Philos. Trans. R. Soc. B Biol. Sci.369:20130294. 10.1098/rstb.2013.0294
- CrossRef
- Google Scholar
86
LockA. E. (1978). Action, Gesture and Symbol: The Emergence of Language.London: Academic Press.
- Google Scholar
87
LyonC.NehanivC. L.SaundersJ. (2012). Interactive language learning by robots: the transition from babbling to word forms.PLoS ONE7:e38236. 10.1371/journal.pone.0038236
- CrossRef
- Google Scholar
88
MandlerJ. M. (2012). On the spatial foundations of the conceptual system and its enrichment.Cogn. Sci.36421–451. 10.1111/j.1551-6709.2012.01241.x
- CrossRef
- Google Scholar
89
MarcosH. (1991). How adults contribute to the development of early referential communication?Euro. J. Psychol. Educ.6271–282. 10.1007/BF03173150
- CrossRef
- Google Scholar
90
MarkmanE. M.WachtelG. F. (1988). Children’s use of mutual exclusivity to constrain the meanings of words.Cogn. Psychol.20121–157. 10.1016/0010-0285(88)90017-5
- CrossRef
- Google Scholar
91
MastinJ. D.VogtP. (2015). Infant engagement and early vocabulary development: a naturalistic observation study of Mozambican infants from 1;1 to 2;1.J. Child Lang.[Epub ahead of print].
- Google Scholar
92
MatherE.PlunkettK. (2012). The role of novelty in early word learning.Cogn. Sci.361157–1177. 10.1111/j.1551-6709.2012.01239.x
- CrossRef
- Google Scholar
93
MatthewsD.LievenE.TomaselloM. (2010). What’s in a manner of speaking? Children’s sensitivity to partner-specific referential precedents.Dev. Psychol.46749–760. 10.1037/a0019657
- CrossRef
- Google Scholar
94
McMurrayB.HorstJ. S.SamuelsonL. K. (2012). Word learning emerges from the interaction of online referent selection and slow associative learning.Psychol. Rev.119831–877. 10.1037/a0029872
- CrossRef
- Google Scholar
95
MeltzoffA. N. (1995). Understanding the intentions of others: re-enactment of intended acts by 18-month-old children.Dev. Psychol.31838–850. 10.1037/0012-1649.31.5.838
- CrossRef
- Google Scholar
96
MetzingC.BrennanS. E. (2003). When conceptual pacts are broken: partner-specific effects on the comprehension of referring expressions.J. Mem. Lang.49201–213. 10.1016/S0749-596X(03)00028-7
- CrossRef
- Google Scholar
97
MollH.KoringC.CarpenterM.TomaselloM. (2006). Infants determine others’ focus of attention by pragmatics and exclusion.J. Cogn. Dev.7411–430. 10.1207/s15327647jcd0703_9
- CrossRef
- Google Scholar
98
MooreR.LiebalK.TomaselloM. (2013). Three-year-olds understand communicative intentions without language, gestures, or gaze.Interact. Stud.1462–80. 10.1111/desc.12206
- CrossRef
- Google Scholar
99
NakaoH.AndrewsK. (2014). Ready to teach or ready to learn: a critique of the natural pedagogy theory.Rev. Philos. Psychol.5465–483. 10.1007/s13164-014-0187-2
- CrossRef
- Google Scholar
100
NelsonK. (1974). Concept, word, and sentence: interrelations in acquisition and development.Psychol. Rev.81267–285. 10.1037/h0036592
- CrossRef
- Google Scholar
101
NelsonK. (2007). Young Minds in Social Worlds: Experience, Meaning, and Memory.Cambridge MA: Harvard University Press.
- Google Scholar
102
NinioA.SnowC. E. (1996). Pragmatic Development.Boulder, CO: Westview Press.
- Google Scholar
103
NomikouI. (2014). The Collaborative Construction of Early Multimodal Input and Its Significance For Language Development.Doctoral thesis, Bielefeld University, Bielefeld.
- Google Scholar
104
NomikouI.RohlfingK. J. (2011). Language does something: body action and language in maternal input to three-month-olds.Auton. Ment. Dev. IEEE Trans.3113–128. 10.1109/TAMD.2011.2140113
- CrossRef
- Google Scholar
105
NomikouI.RohlfingK. J.SzufnarowskaJ. (2013). Educating attention.Interact. Stud.14240–267. 10.1075/is.14.2.05nom
- CrossRef
- Google Scholar
106
OudeyerP.-Y. (2006). Self-Organization in the Evolution of Speech.Oxford: Oxford University Press.
- Google Scholar
107
PariseE.CsibraG. (2013). Neural responses to multimodal ostensive signals in 5-month-old infants.PLoS ONE8:e72360. 10.1371/journal.pone.0072360
- CrossRef
- Google Scholar
108
PickeringM. J.GarrodS. (2004). Toward a mechanistic psychology of dialogue.Behav. Brain Sci.27169–190. 10.1017/S0140525X04000056
- CrossRef
- Google Scholar
109
PickeringM. J.GarrodS. (2013). An integrated theory of language production and comprehension.Behav. Brain Sci.36329–347. 10.1017/S0140525X12001495
- CrossRef
- Google Scholar
110
PitschK.VollmerA. L.RohlfingK. J.FritschJ.WredeB. (2014). Tutoring in adult-child interaction: on the loop of the tutor’s action modification and the recipient’s gaze.Interact. Stud.1555–98. 10.1075/is.15.1.03pit
- CrossRef
- Google Scholar
111
PrudenS. M.Hirsh-PasekK.GolinkoffR. M.HennonE. A. (2006). The birth of words: ten-month-olds learn words through perceptual salience.Child Dev.77266–280. 10.1111/j.1467-8624.2006.00869.x
- CrossRef
- Google Scholar
112
QuasthoffU. (1997). “An interactive approach to narrative development,” in Narrative Development: Six Approaches,ed.BambergM. (Mahwah, NJ: Erlbaum), 51–83.
- Google Scholar
113
Rkaczaszek-LeonardiJ. (2015). How does a word become a message? An illustration on a developmental time-scale.New Ideas Psychol.(in press). 10.1016/j.newideapsych.2015.08.001
- CrossRef
- Google Scholar
114
Rkaczaszek-LeonardiJ.DębskaA.SochanowiczA. (2014). Pooling the ground: understanding and coordination in collective sense making.Front. Psychol.5:1233. 10.3389/fpsyg.2014.01233
- CrossRef
- Google Scholar
115
Rkaczaszek-LeonardiJ.NomikouI.RohlfingK. J. (2013). Young children’s dialogical actions: the beginnings of purposeful intersubjectivity.IEEE Trans. Auton. Ment. Dev.5210–221. 10.1109/TAMD.2013.2273258
- CrossRef
- Google Scholar
116
Rkaczaszek-LeonardiJ.RohlfingK. J.TomalskiP. (2015). EASE: Early semantic development. Linking language development to emerging participation in social events.Project Proposal “Beethoven” Polish-German Funding Initiative in the Humanities and Social Sciences.Warsaw and Paderborn: Polish Academy of Sciences, University of Warsaw and Paderborn University.
- Google Scholar
117
RamscarM.YarlettD.DyeM.DennyK.ThorpeK. (2010). The effects of feature-label-order and their implications for symbolic learning.Cogn. Sci.34909–957. 10.1111/j.1551-6709.2009.01092.x
- CrossRef
- Google Scholar
118
ReddyV.MarkovaG.WallotS. (2013). Anticipatory adjustments to being picked up in infancy.PLoS ONE8:e65289. 10.1371/journal.pone.0065289
- CrossRef
- Google Scholar
119
ReddyV.UitholS. (2015). Engagement: looking beyond the mirror to understand action understanding.Br. J. Dev. Psychol.34101–114. 10.1111/bjdp.12106
- CrossRef
- Google Scholar
120
RizzolattiG.FogassiL.GalleseV. (2001). Neurophysiological mechanisms underlying the understanding and imitation of action.Nat. Rev. Neurosci.2661–670. 10.1038/35090060
- CrossRef
- Google Scholar
121
RohlfingK. J. (2006). Facilitating the acquisition of UNDER by means of IN and ON–a training study in Polish.J. Child Lang.3351–69. 10.1017/S0305000905007257
- CrossRef
- Google Scholar
122
RohlfingK. J.FritschJ.WredeB.JungmannT. (2006). How can multimodal cues from child-directed interaction reduce learning complexity in robots?Adv. Robot.201183–1199. 10.1163/156855306778522532
- CrossRef
- Google Scholar
123
RohlfingK. J.GrimmingerA.NachtigällerK. (2015). “Gesturing in joint book reading,” in Learning From Picturebooks. Perspectives from Child Development & Literacy Studies,edsKümmerling-MeibauerB.MeibauerJ.RohlfingK. J.NachtigällerK. (London: Routledge), 99–116.
- Google Scholar
124
RohlfingK. J.NomikouI. (2014). Intermodal synchrony as a form of maternal responsiveness: association with language development.Lang. Interact. Acquis.5117–136. 10.1075/lia.5.1.06roh
- CrossRef
- Google Scholar
125
RohlfingK. J.PobleteJ. S.JoublinF. (2013). “Learning new words in unfamiliar frames from direct and indirect teaching,” in Proceedings of the 17th Workshop on the Semantics and Pragmatics of Dialogue,edsFernándezR.IsardA. (Amsterdam: University of Amsterdam), 121–130.
- Google Scholar
126
RossmanithN.CostallA.ReicheltA. F.LópezB.ReddyV. (2014). Jointly structuring triadic spaces of meaning and action: book sharing from 3 months on.Front. Psychol.5:1390. 10.3389/fpsyg.2014.01390
- CrossRef
- Google Scholar
127
RoyB. C.FrankM. C.DeCampP.MillerM.RoyD. (2015). Predicting the birth of a spoken word.Proc. Natl. Acad. Sci. U.S.A.11212663–12668. 10.1073/pnas.1419773112
- CrossRef
- Google Scholar
128
RoyD. K.PentlandA. P. (2002). Learning words from sights and sounds: a computational model.Cogn. Sci.26113–146. 10.1207/s15516709cog2601_4
- CrossRef
- Google Scholar
129
Salas PobleteJ. (2011). Learning Words: Comparing Two-Year-Olds’ Learning Success in Dyadic and Triadic Teaching Situations Embedded in Familiar and Unfamiliar Contexts.Doctoral thesis, Bielefeld University, Bielefeld.
- Google Scholar
130
SchacterD. L.AddisD. R.BucknerR. L. (2007). Remembering the past to imagine the future: the prospective brain.Nat. Rev. Neurosci.8657–661. 10.1038/nrn2213
- CrossRef
- Google Scholar
131
SchankR. C.AbelsonR. P. (1977). “Scripts, plans, and knowledge,” in Thinking: Readings in Cognitive Science,edsJohnson-LairdP. N.WasonP. C. (Cambridge: Cambridge University Press), 151–157.
- Google Scholar
132
SchumacherP. B. (2014). Content and context in incremental processing: “the ham sandwich” revisited.Philos. Stud.1681–15. 10.1007/s11098-013-0179-6
- CrossRef
- Google Scholar
133
SenjuA.CsibraG. (2008). Gaze following in human infants depends on communicative signals.Curr. Biol.18668–671. 10.1016/j.cub.2008.03.059
- CrossRef
- Google Scholar
134
ShotterJ.NewsonJ. (1982). “An ecological approach to cognitive development: implicate orders, joint action and intentionality,” in Social Development: Studies of the Development of Understanding,edsButterworthG.LightP. (Brighton: Harvester Press), 32–52.
- Google Scholar
135
SmithL.YuC. (2008). Infants rapidly learn word-referent mappings via cross-situational statistics.Cognition1061558–1568. 10.1016/j.cognition.2007.06.010
- CrossRef
- Google Scholar
136
SmithL. B.ColungaE.YoshidaH. (2010). Knowledge as process: contextually cued attention and early word learning.Cogn. Sci.341287–1314. 10.1111/j.1551-6709.2010.01130.x
- CrossRef
- Google Scholar
137
SprangerM. (2011). “A basic emergent grammar for space,” in Experiments in Cultural Language Evolution,ed.SteelsL. (Amsterdam: John Benjamins), 207–232.
- Google Scholar
138
SprangerM.PauwS.LoetzschM.SteelsL. (2012). “Open-ended procedural semantics,” in Language Grounding in Robots,edsSteelsL.HildM. (New York, NY: Springer), 153–172.
- Google Scholar
139
SteelsL. (2001). Language games for autonomous robots.Intell. Syst. IEEE1616–22. 10.1109/5254.956077
- CrossRef
- Google Scholar
140
SteelsL.BelpaemeT. (2005). Coordinating perceptually grounded categories through language: a case study for colour.Behav. Brain Sci.28469–488. 10.1017/S0140525X05000087
- CrossRef
- Google Scholar
141
SteelsL.KaplanF. (2002). “Bootstrapping grounded word semantics,” in Linguistic Evolution Through Language Acquisition: Formal and Computational Models,ed.BriscoeT. (Cambridge: Cambridge University Press), 53–73.
- Google Scholar
142
SteelsL.SprangerM. (2008). The robot in the mirror.Connect. Sci.20337–358. 10.1080/09540090802413186
- CrossRef
- Google Scholar
143
StephensG.MatthewsD. (2014). “The communicative infant from 0–18 months. The social-cognitive foundation of pragmatic development,” in Pragmatic Development in First Language Acquisition,ed.MatthewsD. (Amsterdam: John Benjamins Publishing Company), 13–35.
- Google Scholar
144
SternC.SternW. (1975). Die Kindersprache [The language of the child].Darmstadt: Wissenschaftliche Buchgesellschaft.
- Google Scholar
145
SzufnarowskaJ.RohlfingK. J.FawcettC.GredebäckG. (2014). Is ostension any more than attention?Sci. Rep.4:5304. 10.1038/srep05304
- CrossRef
- Google Scholar
146
ThomE. E.SandhoferC. M. (2009). More is more: the relationship between vocabulary size and word extension.J. Exp. Child Psychol.104466–473. 10.1016/j.jecp.2009.07.004
- CrossRef
- Google Scholar
147
ThorgrimssonG. B.FawcettC.LiszkowskiU. (2014). Infants’ expectations about gestures and actions in third-party interactions.Front. Psychol.5:321. 10.3389/fpsyg.2014.00321
- CrossRef
- Google Scholar
148
ThorgrimssonG. B.FawcettC.LiszkowskiU. (2015). 1-and 2-year-olds’ expectations about third-party communicative actions.Infant Behav. Dev.3953–66. 10.1016/j.infbeh.2015.02.002
- CrossRef
- Google Scholar
149
TomaselloM. (2001). Could we please lose the mapping metaphor, please?Behav. Brain Sci.241119–1120. 10.1017/S0140525X01390131
- CrossRef
- Google Scholar
150
TomaselloM. (2003). On the different origins of symbols and grammar.Stud. Evol. Lang.394–110. 10.1093/acprof:oso/9780199244843.003.0006
- CrossRef
- Google Scholar
151
TomaselloM.AkhtarN. (1995). Two-year-olds use pragmatic cues to differentiate reference to objects and actions.Cogn. Dev.10201–224. 10.1016/0885-2014(95)90009-8
- CrossRef
- Google Scholar
152
TomaselloM.CarpenterM.CallJ.BehneT.MollH. (2005). Understanding and sharing intentions: the origins of cultural cognition.Behav. Brain Sci.28675–691. 10.1017/S0140525X05000129
- CrossRef
- Google Scholar
153
TrueswellJ. C.MedinaT. N.HafriA.GleitmanL. R. (2013). Propose but verify: fast mapping meets cross-situational word learning.Cogn. Psychol.66126–156. 10.1016/j.cogpsych.2012.10.001
- CrossRef
- Google Scholar
154
van der GootM.TomaselloM.LiszkowskiU. (2014). Differences in the nonverbal requests of great apes and human infants.Child Dev.85444–455. 10.1111/cdev.12141
- CrossRef
- Google Scholar
155
VlachH. A. (2014). The spacing effect in children’s generalization of knowledge: allowing children time to forget promotes their ability to learn.Child Dev. Perspect.8163–168. 10.1111/cdep.12079
- CrossRef
- Google Scholar
156
WarlaumontA. S.RichardsJ. A.GilkersonJ.OllerD. K. (2014). A social feedback loop for speech development and its reduction in autism.Psychol. Sci.251314–1324. 10.1177/0956797614531023
- CrossRef
- Google Scholar
157
WaxmanS. R.GelmanS. A. (2009). Early word-learning entails reference, not merely associations.Trends Cogn. Sci.13258–263. 10.1016/j.tics.2009.03.006
- CrossRef
- Google Scholar
158
WittgensteinL. (1953/1997). Philosophical Investigations.Oxford: Blackwell.
- Google Scholar
159
WojcikE. H. (2013). Remembering new words: integrating early memory development into word learning.Front. Psychol.4:151. 10.3389/fpsyg.2013.00151
- CrossRef
- Google Scholar
160
WoodD.BrunerJ. S.RossG. (1976). The role of tutoring in problem solving.J. Child Psychol. Psychiatry1789–100. 10.1111/j.1469-7610.1976.tb00381.x
- CrossRef
- Google Scholar
161
WoodwardA. L. (1999). Infants’ ability to distinguish between purposeful and non-purposeful behaviors.Infant Behav. Dev.22145–160. 10.1016/S0163-6383(99)00007-7
- CrossRef
- Google Scholar
162
WoodwardA. L. (2009). Infants’ grasp of others’ intentions.Curr. Dir. Psychol. Sci.1853–57. 10.1111/j.1467-8721.2009.01605.x
- CrossRef
- Google Scholar
163
WredeB.RohlfingK. J.SteilJ. J.WredeS.OudeyerP.-Y.TaniY. (2012). “Towards robots with teleological action and language understanding,” in Proceedings of the IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS),Osaka.
- Google Scholar
164
WredeB.SchillingmannL.RohlfingK. J. (2013). “Making use of multi-modal synchrony: a model of acoustic packaging,” in Theoretical and Computational Models of Word Learning: Trends in Psychology and Artificial Intelligence,edsGogateL. J.HollichG. (Hershey: Information Science Reference), 224–240.
- Google Scholar
165
YuC.SmithL. B. (2007). Rapid word learning under uncertainty via cross-situational statistics.Psychol. Sci.18414–420. 10.1111/j.1467-9280.2007.01915.x
- CrossRef
- Google Scholar
166
YuC.SmithL. B. (2012). Modeling cross-situational word–referent learning: prior questions.Psychol. Rev.11921–39. 10.1037/a0026182
- CrossRef
- Google Scholar
167
YuC.SmithL. B. (2013). Joint attention without gaze following: human infants and their parents coordinate visual attention to objects through eye-hand coordination.PLoS ONE8:e79659. 10.1371/journal.pone.0079659
- CrossRef
- Google Scholar

Summary

Keywords

language acquisition, pragmatics, infants’ social learning, frames, learning and memory, developmental robotics

Citation

Rohlfing KJ, Wrede B, Vollmer A-L and Oudeyer P-Y (2016) An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames. Front. Psychol. 7:470. doi: 10.3389/fpsyg.2016.00470

Received

26 November 2015

Accepted

17 March 2016

Published

19 April 2016

Volume

7 - 2016

Edited by

Hanne De Jaegher, University of the Basque Country, Spain

Reviewed by

Chris Sinha, Lund University, Sweden; Jordan Zlatev, Lund University, Sweden; Vasudevi Reddy, University of Portsmouth, UK

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Katharina J. Rohlfing, katharina.rohlfing@uni-paderborn.de

This article was submitted to Cognitive Science, a section of the journal Frontiers in Psychology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Cognitive Science

HYPOTHESIS AND THEORY article

An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames

Abstract

Introduction

Pragmatic Frames—an Introduction and History

Pragmatic Frames Require a History of an Interaction

Pragmatic Frames Involve Goal-Oriented Actions

Social Learning

Pragmatic Frames Consist of a Meaning and a Syntax

Meaning: Connection to the Cognitive Processes

Syntax of Pragmatic Frames

Hierarchy of Pragmatic Frames

Pragmatic Frames Evoke an Interpretation of a Situation

Asymmetric Pragmatic Frames: Scaffolding

Individual Differences

How Current Approaches Interface with Pragmatic Frames

Social Cues

Dynamic Coupling

The Importance of Statistical Learning and an Accumulating Linguistic Knowledge Base

Conclusion and Future Directions

Statements

Author contributions

Funding

Conflict of interest

References

Summary

Outline

Figures

Cite article

Article metrics

HYPOTHESIS AND THEORY article

An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames

Abstract

Introduction

Pragmatic Frames—an Introduction and History

Pragmatic Frames Require a History of an Interaction

Pragmatic Frames Involve Goal-Oriented Actions

Social Learning

Pragmatic Frames Consist of a Meaning and a Syntax

Meaning: Connection to the Cognitive Processes

Syntax of Pragmatic Frames

Hierarchy of Pragmatic Frames

Pragmatic Frames Evoke an Interpretation of a Situation

Asymmetric Pragmatic Frames: Scaffolding

Individual Differences

How Current Approaches Interface with Pragmatic Frames

Social Cues

Dynamic Coupling

The Importance of Statistical Learning and an Accumulating Linguistic Knowledge Base

Conclusion and Future Directions

Statements

Author contributions

Funding

Conflict of interest

References

Summary

Outline

Figures

Cite article

Share article

Article metrics