- Department of Speech, Language, and Hearing Sciences, University of Colorado, Boulder, CO, United States
Narrative production requires coordinating lexical, grammatical, and discourse skills. For bilingual children, these abilities develop across two languages, influencing how resources are used during storytelling. This study compared Japanese–English bilingual and English monolingual children’s narrative microstructure and macrostructure. Twenty-eight children in each group (ages 3–8, matched for age and sex) completed the English version of the Multilingual Assessment Instrument for Narratives (MAIN). Measures included mean length of utterance in morphemes (MLUm), number of different words (NDW), subordination index (SI), fragments, story structure (SS), structural complexity (SC), internal state terms (IST), and comprehension, as well as which MAIN episode was told with the most complexity. All micro- and macrostructure measures increased with age except SC. Group differences appeared only in NDW. Age × group interactions for MLUm, SI, and NDW showed bilingual preschoolers had the lowest scores, while bilingual school-age children produced the most fragments. Age and language groups did not differ in which episode they told most complexly: all participants produced the most complex retellings for episode one and the least for episode two. Overall, microstructure was more sensitive to language experience, whereas macrostructure remained relatively stable across groups. Older children outperformed younger children, and episodic complexity patterns were consistent across age and language.
1 Introduction
Narrative skills are essential for the development of reading (Lynch et al., 2008; Paris and Paris, 2003), and oral language skills, such as vocabulary, grammar knowledge, and discourse structure, are strong predictors of reading skills (Miller et al., 2006; Suggate et al., 2018). For bilingual children, adequate development of the first language (L1), or the heritage language (HL), is an essential prerequisite for the reading development of the second language (L2), or the majority language (ML) (Cummins, 1979a). Early identification of language difficulties is essential for providing necessary supports for academic success; however, many norm-referenced tests (e.g., Clinical Evaluation of Language Fundamentals-Fifth Edition, Test of Narrative Language-Second Edition, etc.) are developed for monolingual English speakers, making it challenging for educators and speech-language pathologists (SLPs) to identify bilingual children with a language disorder. Research has shown that narrative tasks can be an ecologically valid way of identifying language disorders in monolingual children (Auza et al., 2018; Botting, 2002; Liles et al., 1995; Mäkinen et al., 2014; To et al., 2010) and bilingual children (Altman et al., 2016; Boerma et al., 2016; Bohnacker et al., 2022; Gutierrez-Clellen, 2002; Iluz-Cohen and Walters, 2012; Gibson et al., 2018; Jacobson and Walden, 2013; Kunnari and Välimaa, 2020; Peña et al., 2014; Squires et al., 2014; Tsimpli et al., 2016). To further this promising line of research, more studies need to be done in more languages and with participants from diverse cultural backgrounds. Much of the research has focused on Spanish-English bilinguals, and few have examined Japanese-English bilinguals. This study aims to investigate the narrative development in typically developing preschool to early school-age simultaneous Japanese-English bilingual children and compare them to age-matched English monolingual children. This comparison is significant, particularly considering a Systems Framework of bilingualism (Titone and Tiv, 2023), which states that multilingualism requires moving beyond a purely individual-centric view, and embracing the profound influence of social and environmental factors on language development. The bilinguals in the current study are not only acquiring English in academic settings, but they also attend a Japanese language and culture school, which may impact their narrative skills in a unique way, given the distinct typological differences between Japanese and English.
1.1 The development of narrative language skills
Monolingual preschoolers (ages 3–5) typically produce simple event descriptions, but by school age (ages 6–9) their narratives become more coherent and include story grammar elements such as scene-setting details, connectives, internal states, causal links, and evaluative comments (Berman, 1988; Bishop and Donlan, 2005; Heilmann et al., 2010; Liles, 1993; Peterson and McCabe, 1983; Poulsen et al., 1979; Stein and Glenn, 1979; van Dijk, 1976a; Westby, 2012). Narrative comprehension also improves steadily, with children showing better understanding of aurally presented stories between ages 3–7 (Curenton, 2011; Lindgren et al., 2023). Narrative discourse is commonly analyzed at the microstructure and macrostructure levels (Justice et al., 2006). Microstructure focuses on language-specific features such as syntactic complexity and lexical diversity (Košutar et al., 2022), often measured through MLUm and NDW. Macrostructure, or story grammar, examines how utterances are organized into coherent narratives (e.g., Peterson and McCabe, 1983; Stein and Glenn, 1979). This study used the Multilingual Assessment Instrument of Narratives (MAIN; Gagarina et al., 2019) to assess macrostructure, including story structure, complexity, and internal state terms.
The central question of this study is how bilingual children’s development compares to monolingual patterns when two languages are present. Several factors may influence bilinguals’ narrative development. Cultural differences in storytelling shape narrative style, and bilingual children must navigate multiple sets of practices across home, school, and peer contexts. These styles can also interact across languages—for instance, a child may apply Japanese narrative conventions when telling a story in English, or use English conventions when producing Japanese narratives (Gorman et al., 2011).
Second, bilingual children divide their language experience across two systems, resulting in less input per language than monolinguals receive. Some researchers suggest that about 60% of monolingual-level input is needed for “typical” development (Cattani et al., 2014), and reduced input has implications for how bilinguals acquire and use their languages (Daskalaki et al., 2024; De Houwer, 2007; Gathercole and Thomas, 2009). Longitudinal evidence highlights the importance of input timing and amount: Spanish–English bilingual children’s English macrostructure grew from preschool into the school years, with earlier English exposure linked to stronger initial skills, while greater home Spanish exposure predicted faster later growth (Bitetti and Hammer, 2021).
Third, vocabulary knowledge represents another critical factor influencing bilingual children’s narrative development. Research showed that when assessed in only one of their languages, bilingual children tend to obtain lower vocabulary scores than their monolingual peers (Bialystok et al., 2010; Sheng et al., 2011). However, bilingual children’s conceptual vocabulary, which is the total number of independent concepts distributed across both languages (Kan and Kohnert, 2005), is equivalent in size to monolingual children of the same age. Additionally, the intralanguage vocabulary differences may impact narrative language development, as vocabulary has been shown to be strongly correlated with narrative competence (Altman et al., 2016; Chan et al., 2023; Hammer et al., 2012; Heilmann et al., 2010; Iluz-Cohen and Walters, 2012; Košutar et al., 2022).
Fourth, research on dual language development suggests bilingual children often outperform monolinguals in metalinguistic awareness, particularly on tasks requiring high cognitive control (Bialystok, 2001). This advantage may influence how bilinguals develop narrative discourse skills involving causality and theory of mind (Gagarina et al., 2019). Below, we review research comparing bilingual and monolingual children’s narrative microstructure and macrostructure.
1.2 Microstructure in bilingual and monolingual children
Microstructure (e.g., lexical diversity, morphosyntactic marking) is said to depend on language experience (e.g., Rodina, 2017) and varies across languages. For bilingual children, this often appears as differences in vocabulary surface forms, such as knowing a phonological form in only one language. As they age, bilingual children acquire more translation equivalents (Kan and Kohnert, 2005; Sheng et al., 2011), so vocabulary differences may appear in their narratives relative to monolinguals. Boerma et al. (2016) found that both typically developing and language-impaired bilingual children scored significantly lower on vocabulary tests than typically developing monolinguals. Hipfner-Boucher et al. (2015) similarly reported that only English learners who spoke mostly the home language (HL) scored lower in lexical diversity than monolinguals and English learners dominant in the majority language (ML). This could be explained differing amounts of input to the academic language, or the ML. From a broader theoretical perspective, such reduced exposure may delay the development of cognitive academic language proficiency (CALP), which depends on cumulative language experience, first-language proficiency, and age, and which emerges more slowly than basic interpersonal communicative skills (BICS; Cummins, 1979b). Additionally, Narrative tasks draw heavily on CALP, because they require decontextualized language, cohesion, causality, internal state descriptions, and abstract vocabulary. Kunnari et al. (2016) also observed differences between Finnish monolingual and Finnish–Swedish bilingual children: both groups produced similar word counts, but monolinguals produced more utterances. Kubota et al. (2022) found that token–type ratio in Japanese–English bilinguals depended on continued English exposure after returning to Japan. Finally, Dosi et al. (2024) observed early greater literacy exposure predicted greater noun diversity in Greek-Turkish bilingual children’s Greek. Collectively, these studies show that microstructure—especially vocabulary—is shaped by language experience, and increased exposure supports higher lexical diversity in bilinguals’ narratives.
In addition to vocabulary, small variations in morphosyntax can be seen in the development of narrative discourse in bilinguals and monolinguals of different language backgrounds. Slobin (1996) compared monolingual English and Spanish speakers and found differences in verb use even when macrostructure remained consistent – monolingual English children tended to use a more manner verbs, while monolingual Spanish children used more path verbs. Otwinowska et al. (2020) found that Polish-English bilinguals’ Polish narratives showed an overuse of function words, particularly pronouns. Rodina (2017) observed significantly more errors in morphosyntax in bilingual Norwegian-Russian preschoolers versus monolingual Norwegian and monolingual Russian preschoolers. Rezzonico et al. (2016) observed that Cantonese-English preschoolers tended to use simple syntax. Analysis of the English grammar errors showed that the source of agrammaticality was primarily verbal morphology omissions in obligatory contexts or incomplete BE plus -ing forms. Finally, Mishina-Mori et al. (2024) compared Japanese-English bilingual children with Japanese monolingual children, and observed cross-linguistic influence from English to Japanese in how the bilinguals reintroduced referents. They found that the bilingual children used more noun phrases, compared to the monolingual children who more often dropped the pronoun, to reintroduce characters in a story retell. These findings parallel similar findings from Chen and Lei (2013), who found that for referent re-introduction in Chinese, bilingual speakers produced more definite noun phrases and dropped fewer pronouns than their monolingual peers. On the other hand, Dosi et al. (2024) found that exposure to literacy in Greek, the L2, leads to greater syntactic complexity. While subtle, the differences in information structure from language to language appear to cross linguistic boundaries. In summary, acquisition of different languages, or multiple languages, can impact not only vocabulary usage, but also morphosyntactic patterns.
1.3 Macrostructure in bilingual and monolingual children
Early literacy practices are important for later language and reading outcomes, specifically, parent language during book reading contains greater lexical diversity and syntactic complexity, leading to improved language and literacy skills in their children (Ece Demir-Lira et al., 2019). Some studies have found subtle differences in narrative macrostructure depending on cultural storytelling norms. One study being Gorman et al. (2011), who compared 60 first- and second-grade African American, Latino American, and Caucasian children. They found that culture influenced the children’s narrative production on a wordless picture book task. They called for a better understanding of narrative structure, creativity, and style to provide ecologically valid narrative assessment and intervention for children from diverse cultural backgrounds. Similarly, Fiestas and Peña (2004) examined Spanish–English bilingual children and found that, although overall story organization remained coherent across languages, the distribution of narrative elements differed. In Spanish, children produced proportionally more references to initiating events and character intentions, whereas in English they provided more statements related to outcomes and consequences. These findings suggest that macrostructure involves both shared cognitive frameworks for organizing narrative events and culturally mediated discourse preferences that shape how narrative meaning is developed. Thus, examining English macrostructure in Japanese–English bilingual children relative to monolingual peers provides an important opportunity to determine whether narrative organization in English reflects a shared cross-linguistic framework or whether culturally and linguistically rooted patterns from Japanese storytelling continue to shape how children structure their narratives.
In contrast to microstructure, macrostructure is generally less affected by language-specific knowledge (e.g., Andreou and Lemoni, 2020; Boerma et al., 2016; Bohnacker et al., 2022). Macrostructure refers to the higher-order organization of discourse, or story grammar (Gagarina et al., 2019). These “narrative superstructures” (van Dijk, 1980)—such as setting, complication, resolution, evaluation, and moral—are predictable elements that shape a story’s global content. Van Dijk (1980) described macrostructures as the underlying semantic “gist” that enables summarization. Episodic Analysis (Stein and Glenn, 1979) is a macrostructural framework based on the idea that stories are organized around characters’ goal-directed actions. MAIN (Gagarina et al., 2019) adapts this framework into a clinical tool for assessing micro- and macrostructure in bilinguals, though it is also useful for monolinguals (Pesco and Kay-Raining Bird, 2016). This tool has supported extensive research on narrative development.
Kunnari et al. (2016) compared Finnish–Swedish bilingual and Finnish monolingual children using the MAIN and found group differences in SS only for storytelling, not retelling. They also reported that SC—specifically episodes containing Goal statements—increased with age, though complete and abbreviated episodes were rare. Tsimpli et al. (2016) compared monolingual and bilingual Greek-speaking children, including age-matched TD and SLI groups. While SLI groups scored lower than TD groups, both bilingual groups outperformed monolinguals on story production, and the bilingual SLI group outperformed the monolingual SLI group on comprehension, suggesting a potential bilingual advantage in story grammar. Otwinowska et al. (2020) found no differences between Polish–English bilinguals and Polish monolinguals in SS, ISTs, or comprehension on the MAIN. Lipner et al. (2024) showed that age predicted macrostructure in Hebrew–English bilinguals, with older children producing more macro elements. Macrostructure correlated across languages, indicating cross-linguistic transfer. They also found that episode three was told with the greatest complexity, likely because character relationships become explicit in the final episode. Overall, these studies show that macrostructure does not differ substantially between bilinguals and monolinguals and increases linearly with age in both groups.
1.4 The current study
The aim of the current study is to examine shared and divergent patterns in the English narratives of Japanese-English bilingual children ages three to eight and their age-matched English monolingual peers. We focused solely on the English narratives in this study for two reasons: English is the ML and the language of instruction, so all the children in the study have academic English skills; and focusing on English allowed a one-to-one comparison of the microstructure and macrostructure between the two groups. This study parallels many previous studies that used MAIN to examine narrative macrostructure in bilingual children, and most closely resembles Kunnari et al. (2016), in that it compares the narratives of typically developing bilingual children to typically developing monolingual children. Additionally, similar to Altman et al. (2016) and Hao et al. (2019), our study also examines the narratives of bilinguals who speak typologically distant languages.
The research questions guiding our study are:
1. How do Japanese–English bilingual children differ from English monolingual peers in English microstructure (i.e., MLUm, NDW, SI, and number of fragments)?
2. How do Japanese–English bilingual children compare with monolinguals on English macrostructure (i.e., story structure, structural complexity, internal state terms, and comprehension)?
3. To what extent does age group (preschool vs. school-age)—reflecting both developmental maturation and accumulated school experience—predict English microstructure and macrostructure?
4. Does age group (preschool vs. school-age) or bilingualism (monolingual vs. bilingual) affect which of the three episodes is told with more complexity in the English narratives of Japanese-English bilingual and English monolingual children?
For research question one, we hypothesize that there may be subtle differences in all microstructure measures, as evidenced by much of the research on bilingual vocabulary differences (e.g., Bialystok et al., 2010; Kan and Kohnert, 2005), as well as more subtle differences in morphosyntax due to cross-linguistic influence from Japanese (Chen and Lei, 2013; Mishina-Mori et al., 2024). For Research Question Two, we hypothesize that Japanese–English bilingual and English monolingual children will show similar overall macrostructure scores, reflecting the relatively language-general nature of story organization. However, drawing on research showing culturally mediated variation in how narrative elements are emphasized (Fiestas and Peña, 2004; Gorman et al., 2011), we anticipate that subtle differences may emerge in the distribution or elaboration of specific macrostructural components, even when global structure remains comparable. This pattern would be consistent with the view that narrative macrostructure is a shared cognitive framework shaped by culturally influenced discourse practices. Additionally, Hayes and Kan (2025) found evidence to support the transfer of underlying lexical semantic knowledge between languages in Japanese-English bilingual children, despite a discrepancy in language ability. For research question three, we hypothesize that the school-age children will have higher microstructure and macrostructure scores than the preschool children; much of the research has shown improvements in both with age (e.g., Brown, 1973; Durán et al., 2004; Bohnacker et al., 2022; Uccelli and Páez, 2007). Finally, for research question four, we hypothesize that, like Lipner et al. (2024), episode three of the MAIN will have the most SC components for both groups, since macrostructure skills are considered relatively stable across linguistic contexts; the preschool children may show more variety in which episode they tell with greater complexity, due to nascent macrostructure competence.
2 Methods
2.1 Participants
Participants were 56 children aged three to eight (M = 73.6 months, SD = 20.1 months) attending public school in Colorado. Half of the children were Japanese-English bilingual and half were age-matched monolingual English-speaking children. The bilingual group consisted of 28 simultaneous Japanese-English bilingual children (they began acquiring both languages younger than four; Genesee and Nicoladis, 2006) who also attend the Japanese Academy of the Rockies, a Saturday-only school that teaches Japanese language, literacy, and culture. These children have at least one parent who is native speaker of Japanese (12 girls; M = 73.53 months, SD = 20.25 months). Additionally, 21 of the 28 bilingual children reported being more comfortable speaking English, four more comfortable speaking Japanese, and three equally comfortable speaking Japanese and English. The age-matched monolingual group consisted of 28 English-speaking children (12 girls; M = 73.71, SD = 20.36). They were recruited from a local children’s summer camp, within roughly the same geographical area as the Japanese Academy of the Rockies. Both groups lived in a middle- to upper middle-class area of Colorado. There was no significant difference in chronological age among the groups, t(54) = 0.03, p = 0.97. We used a caregiver questionnaire that was adapted from Cheung et al. (2018) for Japanese-English bilingual and monolingual children to query demographic information, developmental history, and language environment (see Table 1). According to the questionnaire, all participants were considered typically developing based on not having an Individualized Education Plan (IEP) and not receiving special education services. Next, maternal education was measured using an ordinal scale: “lower than high school” being 1, “GED or high school” being 2, “associate’s degree/some college” being 3, “bachelor’s degree” being 4, and “Master’s/MD/JD/PhD” being 5. The difference between the monolingual’s maternal education (M = 4.43) and the bilingual’s maternal education (M = 3.89) was nonsignificant, z = 1.6, p = 0.10. Finally, there was no significant difference between the number of hours of total language input the children received at home, t(54) = 0.6, p = 0.55. However, there was a significant difference between the number of hours of English input the children received at home, t(54) = 5.33; p = 0.001. The bilinguals received an average of 8.6 h of English input and 7 h of Japanese input (15.6 h of total language input), while the monolinguals received an average of 16.5 h of input at home.
2.2 Materials and procedures
To elicit a language sample for micro and macrostructural ability, we used the Multilingual Assessment Instrument of Narratives (MAIN; Gagarina et al., 2019). The MAIN was developed to assess narrative production and comprehension skills of children from 3- to 10-years-old. It can be used as a story tell, retell, or model story. The dog and cat parallel stories were used as a story retell in this study.
The monolingual children only completed one testing session in English. Half of the children did the “Cat Story,” and the other half did the “Dog Story.” The bilingual children competed two stories, one in Japanese and one in English, at least one week apart, with sometimes longer between sessions depending on the family’s schedule. The bilingual group was tested for a larger study, thus, while the bilingual group was assessed in both languages, we focus on the English narratives in this study. The bilingual children were tested at the Japanese Academy of the Rockies, a heritage language school that is held on Saturdays. The examiners included three trained monolingual English-speaking research assistants (RAs), the PI – who is a Japanese-English bilingual speaker – and a trained native Japanese-English bilingual RA. To create an environment conducive to the monolingual mode (Grosjean, 1998), the children were tested by a different examiner for both tests, and all interaction prior to the testing, and instructions given during the testing, were done strictly in the testing language. The participants were given the option of three stories in envelopes to choose from; however, each envelope contained the same story. This was to control for the effect of shared knowledge during the presentation of the pictures (Gagarina et al., 2019). After the child chose the envelope, the examiner would display the pictures while reading the story, then prompt the child to tell the story back, “in their own words.” The order of administration and language for the bilingual children was counterbalanced.
All testing procedures were the same as the bilingual children, except all testing and interaction with the children and their families was done in English. Both groups’ narratives were recorded using OM System Olympus WS-882 Digital Voice Recorders.
2.3 Macrostructure measures
Four macrostructure measures were used as dependent variables: Story structure (SS), structural complexity (SC), internal state terms (IST), and story comprehension.
SS is scored on a 17-point scale, where two possible points are given for a setting statement (consisting of time and place), followed by three episodes comprised of five components (which equate to points) each: i) an internal state to initiate the goal, ii) a goal statement for a character iii) an attempt by a character, iv) an outcome of the attempt, and v) an internal state which expresses the character’s reaction.
SC was scored using a 3-point system outlined by Maviş et al. (2016). A sequence without Goal (i.e., Attempt-Outcome) would be given 1 point. An incomplete episode (single Goal, Goal-Attempt or Goal-Outcome) would be given 2 points. A complete episode (Goal-Attempt-Outcome) would be given 3 points. Each MAIN story consists of three stories, so a score of nine points (three GAO sequences) is the ceiling. The MAIN manual (Gagarina et al., 2019) scores by totaling the number of each type of sequence observed. We used a 3-point system to examine gradual developmental trends with age and compare the complexity of each episode within the story.
ISTs were totaled. Examples of ISTs from the MAIN manual include: Perceptual terms (e.g., see, hear, feel); physiological terms (e.g., thirsty, hungry, tired); consciousness terms (e.g., alive, awake, asleep); emotion terms (e.g., sad, happy, angry); mental verbs (e.g., want, think, know); linguistic verbs (e.g., say, call, shout).
Comprehension questions were scored on a scale of 1 to 10. Answers were scored based on the MAIN manual (Gagarina et al., 2019).
Finally, an aggregate score was derived by totaling all the macrostructure scores (i.e., SS, SC, IST, and comprehension). Within the current study, the aggregate score range was 13 to 47 (M = 25.91, SD = 6.94).
2.4 Microstructure measures
Four microstructure measures were used as dependent variables: Mean Length of Utterance in Morphemes (MLUm), Number of Different Words (NDW), Subordination Index (SI), and Fragments ([F]).
MLUm is a general index of syntactic development – as utterance length increases, syntactic complexity increases, e.g., number of semantic roles, negatives, auxiliaries, and clauses (Brown, 1973). MLUm is a reliable, global measure of syntactic complexity.
NDW is a commonly used measure of lexical diversity (Justice et al., 2006; Miller et al., 2006). Some studies have found that the lexical diversity measure “D” is a more reliable measure (Durán et al., 2004), however, Jacobson and Walden (2013) found the two to be highly correlated in both languages of Spanish-English bilinguals. Additionally, NDW is a sensitive measure of narrative productivity (Altman et al., 2016; Uccelli and Páez, 2007).
SI is a useful measure of syntactic complexity (Justice et al., 2006), measured by the ratio of the total number of clauses to the total number of C-units (Miller et al., 2019). Brown (1973) suggested that once MLUm is over an average of 4.0, it is no longer considered an accurate measure; therefore, since the age range in this study extended to eight years, SI was used as a more sophisticated measure of syntactic complexity to account for any insufficiencies in MLUm.
[F] utterances contain a succession of verbs without subjects. The subjects in these can be implied due to segmentation (Miller et al., 2019). In Japanese, personal pronouns are frequently dropped to avoid redundancy (Kaiser et al., 2013, p. 662), so the number of pro-drop utterances was noted to observe any potential crosslinguistic influence.
2.5 Transcription, scoring, and analysis
The children’s narratives were transcribed and analyzed using Systematic Analysis of Language Transcripts (SALT) software (Version 20; Miller et al., 2019). The audio recordings were transcribed and coded following SALT conventions by the PI and a trained RA. The transcripts were coded to mark individual words and bound morphemes to calculate MLUm and NDW. To calculate SI, the number of clauses was marked with the code “SI-[number]” to indicate the number of clauses per utterance (e.g., SI-1 for one clause, SI-2 for a main and subordinate clause, etc.). Finally, fragments were identified as well-formed utterances in an immediate sequence, where the first utterance contains the subject and the following utterances do not, but the subject can be implied (e.g., “C He got on the rock. C and fell off the rock [F].”) These utterances were marked with the code “[F]” to calculate the total number per transcript. Macrostructure was then scored following the scoring sheet provided by the MAIN, with the addition of the aggregate sum of each element from the MAIN rubric. To note which episode yielded the most GAO elements, a complexity score of 1, 2, or 3 was given to each of the three MAIN episodes based on the total number of GAO elements included in that episode. For example, if an episode included only a single G, A, or O, it was given a 1; if it included a GA, GO, or AO, it was given a 2; and if it included all three, GAO, then it was given a 3. For interrater agreement, a 10% overlap of the narratives were coded and scored for macrostructure by the PI and RA. Comparison of the SALT codes showed a 99% item agreement, and comparison of the MAIN macrostructure scoring sheets showed a 95% item agreement. Microstructure and macrostructure elements acted as dependent variables. Finally, each participant was indicated as monolingual/bilingual and preschool/school-age, which were independent variables.
3 Data analysis
Preliminary correlation analyses were conducted to explore trends in variables (see Table 2 for monolinguals and Table 3 for bilinguals). Age, MLUm, NDW, SI, SS, SC, IST, comprehension, and aggregate were compared to observe changes in development.
Separate 2×2 analyses of variance (ANOVAs) was used to answer research questions one through three, with Language group (monolingual or bilingual) and Age group (preschool or school-age) as independent variables, and microstructure (MLUm, NDW, SI, and [F]) and macrostructure (SS, SC, IST, comprehension, and aggregate) as dependent variables. Research question four used a linear mixed effects model with Language group, Age group, and Episode number (one, two, and three) as fixed effects, subject as a random effect, and Number of GAO Elements as the outcome.
4 Results
4.1 Descriptive statistics
Table 4 shows micro and macrostructure scores by group. Table 5 shows the frequency of each individual SS component. Table 6 shows accuracy on each type of comprehension question. The accuracy was equal for the theory of mind (ToM) 1 question, and the bilinguals’ accuracy was much higher for the ToM2 question, although not significant, X2 = (1, N = 56) = 1.82, p = 0.18. Both groups scored close to the ceiling on Goal and Internal State questions, with slightly lower accuracy on the ToM1 question, and the most significant drop in accuracy on the ToM2 question.
Table 6. Accuracy with different types of comprehension questions: goals (questions 1, 4, 7), internal states (questions 2, 3, 5, 6), ToM1 (questions 8, 9), ToM2 (question 10).
4.2 Research questions one through three: age and language group
Tables 7, 8 summarize the results examining the effects of Language and Age group on English micro and macrostructure. Figure 1 shows mean microstructure performance, and Figure 2 shows mean macrostructure performance for each group.
Figure 1. Narrative microstructure performance per group. MLUm, mean length of utterance in morphemes; SI, subordination index; NDW, number of different words; [F] = fragments (error bars show standard error).
Figure 2. Narrative macrostructure performance per group. SS, story structure; SC, structural complexity; IST, internal state terms (error bars show standard error).
For microstructure, MLUm was significant, F(3, 52) = 9.75, p < 0.001, R2 = 0.36, with Language group not being significant, p = 0.60, the interaction between Language and Age group nearing significance, p = 0.06, and Age group being significant, p < 0.001. To explore the near-significant interaction, Tukey HSD post-hoc analysis revealed significant differences between bilingual preschoolers (M = 5.82) and bilingual school-age (M = 8.43), p < 0.001, and bilingual preschoolers (M = 5.82) and monolingual school-age (M = 7.92), p < 0.01.
NDW was significant, F(3, 52) = 8.19, p < 0.001, R2 = 0.32, with Language group, p < 0.01, and Age group, p < 0.001, both being significant, and the interaction between Language and Age group nearing significance, p = 0.06. To further explore the near-significant interaction, Tukey HSD post-hoc analysis revealed significant differences between bilingual preschoolers (M = 31.07) and bilingual school-age (M = 47.5), p < 0.01, bilingual preschoolers (M = 31.07) and monolingual preschoolers (M = 45.07), p < 0.01, and bilingual preschoolers (M = 31.07) and monolingual school-age (M = 50), p < 0.001.
SI was significant, F(3, 52) = 6.37, p < 0.001, R2 = 0.27, with Language group not being significant, p = 0.36, and Age group, p < 0.01, and the interaction between Language and Age group being significant, p < 0.05. To further explore the interaction, Tukey HSD post-hoc analysis revealed significant differences between bilingual preschoolers (M = 0.99) and bilingual school-age (M = 1.19), p < 0.001, and bilingual preschoolers (M = 0.99) and monolinguals school-age (M = 1.14), p < 0.05.
Fragments were significant, F(3, 52) = 4.97, p < 0.01, R2 = 0.22, with Language group not being significant, p = 0.17, the interaction between Language and Age group nearing significance, p = 0.06 and age group being significant, p < 0.01. To further explore the near-significant interaction, Tukey HSD post-hoc analysis revealed significant differences between bilingual preschoolers (M = 0.21) and bilingual school-age (M = 1.43), p < 0.01, and monolingual preschoolers (M = 0.33) and bilingual school-age (M = 1.43), p < 0.05.
For macrostructure, SS was significant, F(3, 52) = 5.54, p < 0.01, R2 = 0.24, with Language group, p = 0.25, and the interaction between Language and Age group, p = 0.41, not being significant, and Age group being significant, p < 0.001.
SC was not significant, F(3, 52) = 1.62, p = 0.2, R2 = 0.09, and none of the variables reached significance.
ISTs were significant, F(3, 52) = 3.04, p < 0.05, R2 = 0.15, with Language group, p = 0.11, and the interaction between Language and Age group, p = 0.42, not being significant, and Age group being significant, p < 0.05.
Comprehension was significant, F(3, 52) = 3.15, p < 0.05, R2 = 0.15, with Language group, p = 0.88, and the interaction between Language and Age group, p = 0.17, not being significant, and Age group being significant, p < 0.01.
The Aggregate score was significant, F(3, 52) = 5.3, p < 0.01, R2 = 0.23, with Language group, p = 0.16, and the interaction between Language and Age group, p = 0.22, not being significant, and Age group being significant, p < 0.001.
4.3 Research question four: episode complexity
A linear mixed effects model was performed with Language group, Age group, Episode number (one, two, three), and the interactions between Language group with Episode number and Age group with Episode number as fixed effects, and Subject as a random effect (see Table 9). Language group was nonsignificant, F(1, 53) = 0.69, p = 0.41, Episode number was significant, F(2, 106) = 11.71, p < 0.001, the interaction between Language group and Episode number was nonsignificant, F(2, 106) = 1.73, p = 0.18, Age group was significant, F(1, 53) = 5.5, p < 0.05, and the interaction between Age group and Episode number was not significant, F(2, 106) = 0.39, p = 0.68. Tukey HSD post-hoc analysis revealed significant differences between episodes one and two, p < 0.001, and episodes one and three, p < 0.05, but a nonsignificant difference between episodes two and three, p = 0.08. Episode one had a mean complexity score of 2.04 (SD = = 0.54), episode two had a mean complexity score of 1.52 (SD = 0.63), and episode three had a mean complexity score of 1.75 (SD = 0.72). In sum, the school-age children included significantly more GAO elements than the preschoolers, but there was no difference between the Language and Age groups in which of the three episodes they tended to retell with more GAO elements – all groups retold episode one with the most complexity, followed by episode three, then episode two.
5 Discussion
The current study examined shared and divergent patterns in the English narrative micro and macrostructure of Japanese-English bilingual and English monolingual children, building on prior research examining bilingual narrative development (Bohnacker et al., 2022; Chan et al., 2023; Gagarina, 2016; Gutierrez-Clellen, 2002; Hao et al., 2019; Iluz-Cohen and Walters, 2012; Kubota et al., 2022; Kunnari et al., 2016; Lipner et al., 2024; Maviş et al., 2016; Otwinowska et al., 2020; Roch et al., 2016; Rodina, 2017). Specifically, microstructure elements – MLUm, NDW, SI, and number of fragments – macrostructure elements – SS, SC, IST – and story comprehension were compared to examine the effects of how age and learning a typologically distinct HL may impact narrative development in the ML. We recruited 56 children (24 girls) ages three to eight (M = 73.6 months, SD = 20.1 months), attending public school in English. Half of the group were Japanese-English bilingual children, the other half were age-matched English monolingual children. The MAIN was used to elicit narratives to analyze the two groups’ micro- and macrostructure, and a caregiver questionnaire was used to gather demographic and language input information. Key findings include: (1) All microstructure and macrostructure measures differed significantly between age groups, except SC; (2) the only microstructure measure that differed significantly between language groups was NDW, with the monolinguals significantly outperforming the bilinguals; (3) there was a significant and nearing significant interactions between Language and Age group for MLUm, NDW, SI and fragments, showing that the bilingual preschool group had the lowest MLUm, NDW, and SI scores, and the bilingual school-age group had the highest number of fragments; (4) no macrostructure measure was significantly different between Language groups; (5) Age and Language groups did not differ in which of the three episodes of the MAIN they tended to retell with greater complexity; specifically, both age and language groups retold episode one with the greatest complexity, followed by episode three, and lastly episode two with the least complexity.
5.1 Differences in microstructure
School-age children showed greater syntactic complexity than preschoolers, with higher MLUm and SI scores reflecting increased use of subordinate clauses, verbal morphology, and modifiers. This aligns with research on age-related narrative development (e.g., Justice et al., 2006; Košutar et al., 2022; Roch et al., 2016). School-age children also had higher NDW scores, while bilingual preschoolers showed significantly lower NDW than the other groups, consistent with findings that bilingual vocabulary growth is slower than monolinguals’ (e.g., Bialystok et al., 2010). Within narrative contexts, Tsimpli et al. (2016) reported greater lexical diversity in monolingual SLI children than bilingual SLI peers, and Otwinowska et al. (2020) found monolinguals outperformed bilinguals on type–token ratio. However, in the current study, NDW differences between bilingual and monolingual school-age children disappeared. This may reflect bilinguals’ increasing translation equivalents with age (Kan and Kohnert, 2005; Sheng et al., 2011) or easier acquisition of the more dominant ML compared to the HL (Gathercole and Thomas, 2009). Bilinguals may also take longer to develop English CALP because their input is divided across two languages (Cummins, 1979b). Thus, while bilingual preschoolers had the lowest NDW, SI, and MLUm, bilingual school-age children performed similarly to monolinguals, suggesting that microstructure—and CALP—converges with sufficient language-specific experience.
Input amount may explain this convergence. Cattani et al. (2014) suggest bilinguals need about 60% input in a language to perform comparably to monolinguals. In this study, bilinguals received roughly 55% of at-home input in English; when school input (≈35 h/week) is included, bilinguals received about 43.6 h of English per week versus monolinguals’ 51.5 h—approximately 84% of monolingual input. Thus, the bilinguals exceeded the 60% benchmark, which may have contributed to their comparable school-age microstructure outcomes.
An interesting finding in the current study was that the school-age bilinguals used the greatest number of fragments. This finding parallels the findings of Mishina-Mori et al. (2024). They found significantly higher usage of pronouns and fewer null forms among the bilingual children’s Japanese narratives compared with the Japanese monolingual group. They hypothesized that the excessive use of pronouns among the bilingual children could be interpreted as cross-linguistic influence from English. In the current study, the direction of influence was the opposite, showing evidence of the Japanese-English bilinguals dropping pronouns more often in successive clauses in the English narratives, a potential influence from Japanese. A similar findings was observed by Otwinowska et al. (2020), who found that the bilinguals overused pronouns in Polish, which may have been transfer from English. Overall, while the school-age Japanese-English bilinguals in our study did use more fragments in their narratives than the other groups, this difference was nearing significance and should be further investigated in future studies.
5.2 Differences in macrostructure
Much of the research on narrative development posits that microstructure is relatively language-experience dependent, which was reflected in the subtle microstructure differences between the language and age groups. In contrast, macrostructure is generally less tied to language-specific experience (Andreou and Lemoni, 2020; Boerma et al., 2016; Gagarina et al., 2016; Kubota et al., 2022; Rezzonico et al., 2016; Rodina, 2017), though cultural storytelling conventions can influence stylistic features (Fiestas and Peña, 2004; Gorman et al., 2011). In the present study, bilingual and monolingual children did not differ in SS. This contrasts with Kunnari et al. (2016), who found higher SS scores in Finnish monolinguals than Finnish–Swedish bilinguals; their bilinguals were more balanced, possibly receiving more equal input across both languages compared to the English-dominant bilinguals in our study. Both groups most frequently produced SS components such as IST, Initiating Event, Attempt, and Outcome, while Setting and Goal were least frequent—similar to the frequencies of Rodina (2017), and Bohnacker (2016), who also reported infrequent Goal statements. Setting information was rare despite its importance (van Dijk, 1976b); only one child provided both time and location, echoing Bohnacker’s (2016) finding that scene-setting occurs in only 25% of 5-year-olds’ narratives and 61% of 6–7-year-olds’. It may be the case that only some of the children took on a “narrative mode,” and used stereotypical phrases like “once upon a time,” or “the end,” which may have elicited more expositional or scene-setting information, whereas other children used more of a “picture-description mode” (Montanari, 2004) and focused on recitation of the events. Overall, our findings suggest that story structure is relatively language-general: SS knowledge does not appear affected by reduced input and may transfer across languages.
In contrast to the findings on SS, SC showed less of a consistent developmental trend with age for both groups, and was only predicted by grammaticality and lexical diversity in the bilingual group. SC has been operationalized in a variety of ways (Lindgren et al., 2023). In the current study, we employed a 3-point system (Chan et al., 2023; Maviş et al., 2016), which is different than the recommended scoring system of the MAIN manual, which employs Westby’s decision tree (Westby, 2012). Therefore, the inconsistent findings in SC may be explained by the SC scoring system used in the current study. It is also noted that the current study used MAIN only as a story retell, rather than as a story tell (e.g., Kunnari and Välimaa, 2020; Maviş et al., 2016), which may have impacted the outcome. However, our findings are similar to Chan et al., 2023, who employed the same SC scoring system. They found that SC was predicted by both NDW and grammatical skills. On the other hand, Maviş et al. (2016), who also employed the same SC scoring system, found that SC was correlated with age, whereas, in the current study, SC was not correlated with age for either group. In contrast to our study, Maviş et al. used the MAIN as both a story tell and story retell task. The difference in task demands may explain the differences in findings. However, a lot of the research using MAIN has found that, regardless of how SC was scored, children tend to rely on Attempts and Outcomes, often omitting Goals (e.g., Bohnacker, 2016; Bohnacker et al., 2022; Gagarina, 2016; Lipner et al., 2024; Rodina, 2017), which was also the case for the current study. It is possible that the SC scoring system of MAIN may not fully capture the process of summarizing the essential “point” of a narrative. In telling or retelling a story, the narrator summarizes the “gist” through operations like deletion (removing redundancy) or integration (combining components) (van Dijk, 1976a). The narrator chooses which information can be omitted because it can be inferred. Since goals can be inferred (Lindgren et al., 2023), it seems logical to omit them for the sake of economy. While MAIN offers a structured and widely used approach to assess narrative macrostructure, the standardized structure potentially might not capture differences in how a child summarizes a story, or culture-specific stylistic conventions in narrative discourse that could vary in more naturalistic language samples.
Notwithstanding the significant difference between the bilingual and monolinguals on NDW, there were no significant differences between the groups on ISTs. This finding is similar to Kunnari et al. (2016), who found no difference between Finnish-Swedish bilinguals and their monolingual peers in number of ISTs. Additionally, ISTs were strongly correlated with NDW in both groups, which is similar to the findings of Altman et al. (2016) and Chan et al. (2023). This finding is interesting because, while the microstructure measure of NDW was different between groups, there were no differences between either group on ISTs, suggesting that use of ISTs is not significantly impacted by differences in lexical diversity.
Story comprehension improved with age, especially on the more difficult ToM questions (Kunnari and Välimaa, 2020; Lynch et al., 2008; Paris and Paris, 2003). However, there were no language group differences in narrative comprehension. The bilingual group had a slight, yet nonsignificant, advantage on the final ToM question. Interestingly, Rodina (2017) observed a similar trend, with the bilinguals scoring higher than monolinguals on the ToM questions, but only in Norwegian, the ML. This finding is in line with research on bilinguals’ advantage in control of processing (Bialystok, 2001). Comprehension question 10 for the Dog and Cat stories asks if the boy and the dog/cat will be friends after the dog/cat has stolen something from the boy. Many of the monolinguals answered along the lines of, “Yes, because the boy might like animals.” The bilinguals tended to answer correctly, that being, “No, because the dog/cat stole from the boy.” The bilingual group may have been able to observe the situation with more control and answer the question with the most logical response, despite other plausible possibilities. Aside from this minor difference, both language groups scored close to the ceiling (see Table 4). It is to be expected that narrative comprehension scores are higher than narrative production scores, as comprehension is a fundamental prerequisite for production (Curenton, 2011); however, the distribution of answers for both groups is negatively skewed according to a Shapiro–Wilk test, W = 0.75, p < 0.001. In the current study, the mean comprehension score reached 8.6/10 by four years old, and 9.25/10 by five years old, indicating that the MAIN comprehension questions might be too easy for its age range.
Finally, the aggregate MAIN score was strongly correlated with age, vocabulary, and syntactic complexity in both groups. Squires et al. (2014) describe how children first learn a general framework for telling stories; then, as their ability develops, they begin providing more details in their stories through use of more complex and explicit microstructure elements. The children in the current study improved in all aspects of micro and macrostructure with age (except for SC), as they began integrating the two with greater intricacy. The aggregate score had the most robust correlations with age, vocabulary, and syntactic complexity, particularly for the bilingual group, suggesting that it is a strong measure of narrative development in the ML for both monolingual and bilingual children. Coupled with microstructure measures of lexical diversity and syntactic complexity, the aggregate score may provide a comprehension indicator of the child’s narrative skills.
5.3 Differences in episodic complexity
The present study found that both groups retell episode one of the MAIN with the most complexity, followed by episode three, and finally episode two with the least complexity; this finding was also consistent between the preschool and school-age children. This finding differs from Lipner et al. (2024), who found that in Hebrew-English bilingual children, episode three of the MAIN yielded the most complexity. This may be due to the different procedures. Lipner et al. (2024) had children retell the story to a puppet who “was sleeping” while the examiner was telling the story. This was to control for the effect of shared knowledge, which is controlled for in a different manner in the MAIN manual (see “methods”). This manner of story retell elicitation could have changed how the participants identified important or unimportant information. Additionally, Lipner et al. (2024) explained their finding saying that, in the third episode, all the characters meet, and relationships become explicit, making it easier to relate to the characters and “finalize” the story. In our study, it is possible that the children added more details to the first and third episodes, given their perceptual salience. The second episode is amid the end of the first and beginning of the third, which may have made it more difficult to remember and recall (see Appendix for examples).
5.4 Limitations
As with any study, there are many limitations. The absence of significant effects for the macrostructure measures may partly reflect limitations in sample size and composition. Although the number of participants was consistent with comparable research on bilingual and monolingual narratives (e.g., Kunnari et al., 2016; Rodina, 2017), the relatively small sample and broad age range may have introduced developmental variability, making it more difficult to group the bilinguals by both English and Japanese proficiency, and reducing the sensitivity to detect more specific group or age effects. Additionally, incorporating other measures, such as phonological, memory, or vocabulary measures would have provided a more detailed picture of both groups. While the MAIN does provide a thorough analysis of a child’s language, additional measures can help further explain the results found by the MAIN. We also would have liked to query more about SES and literacy skills within the caregiver questionnaire to fully round out the findings from the MAIN. Finally, the bilingual group used in the current study was English-dominant, and they were not grouped based on Japanese proficiency. Therefore, our findings on cross-linguistic transfer should be interpreted with caution, particularly for bilingual children with more homogenous profiles of language ability in both languages, or more Japanese-dominant profiles. More research with Japanese-English bilingual children who are more balanced or more Japanese-dominant would add to the body of research on bilingual narrative development, as well as using the MAIN as an effective clinical and research tool. Future directions would include examining Japanese-English bilinguals longitudinally or examine both languages of the bilingual children to further explain how multilinguals ad monolinguals differ and overlap in narrative development.
5.5 Clinical implications
This study has important implications for researchers and SLPs. The MAIN script used in this study was adapted into Japanese using the recommendations of Bohnacker and Gagarina (2020). This adapted script can further the theoretical development and clinical utility of MAIN by broadening the populations that researchers can use it with (Kan et al., 2020). For SLPs, language sample analysis is a comprehensive and unbiased way of evaluating the languages of bilingual children (Botting, 2002), and narratives are a good index of development over time (Liles, 1993). This study shows that macrostructure as measured by the MAIN – specifically the aggregate score – emerged as a potential index of language development. Additionally, this study elucidates what might be expected from the narrative of a typically developing Japanese-English bilingual child, that being, potentially lower lexical diversity and syntactic complexity, and more frequently omitted subject pronouns. Gutierrez-Clellen et al. (2000) point out that measures used to assess children’s language are typically only useful for monolingual English speakers (e.g., word order, omitted pronouns, etc.). Therefore, when assessing Japanese-English bilingual children, certain deviations in grammar and vocabulary ought to be expected. Ultimately, it is essential that the SLP be knowledgeable about the basic characteristics of the languages that their students speak to appropriately tease apart a difference from a disorder.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by University of Colorado Boulder Institutional Review Board. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
RH: Data curation, Writing – review & editing, Conceptualization, Writing – original draft, Investigation, Funding acquisition, Project administration, Formal analysis. PK: Conceptualization, Writing – review & editing, Supervision, Writing – original draft, Formal analysis, Methodology.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author PK declared that they were an editorial board member of Frontiers at the time of submission. This had no impact on the peer review process and the final decision.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Altman, C., Armon-Lotem, S., Fichman, S., and Walters, J. (2016). Macrostructure, microstructure, and mental state terms in the narratives of English–Hebrew bilingual preschool children with and without specific language impairment. Appl. Psycholinguist. 37, 165–193. doi: 10.1017/s0142716415000466
Andreou, G., and Lemoni, G. (2020). Narrative skills of monolingual and bilingual pre-school and primary school children with developmental language disorder (DLD): a systematic review. OJML 10, 429–458. doi: 10.4236/ojml.2020.105026
Auza, A., Harmon, M. T., and Murata, C. (2018). Retelling stories: grammatical and lexical measures for identifying monolingual Spanish speaking children with specific language impairment (SLI). J. Commun. Disord. 71, 52–60. doi: 10.1016/j.jcomdis.2017.12.001,
Berman, R. A. (1988). On the ability to relate events in narrative. Discourse Process. 11, 469–497. doi: 10.1080/01638538809544714
Bialystok, E. (2001). Bilingualism in development: Language, literacy, and cognition. 1st Edn. UK, New York: Cambridge University Press.
Bialystok, E., Luk, G., Peets, K. F., and Yang, S. (2010). Receptive vocabulary differences in monolingual and bilingual children. Biling 13, 525–531. doi: 10.1017/S1366728909990423,
Bishop, D. V. M., and Donlan, C. (2005). The role of syntax in encoding and recall of pictorial narratives: evidence from specific language impairment. Br. J. Dev. Psychol. 23, 25–46. doi: 10.1348/026151004x20685
Bitetti, D., and Hammer, C. S. (2021). English narrative macrostructure development of Spanish–English bilingual children from preschool to first grade. Am. J. Speech Lang. Pathol. 30, 1100–1115. doi: 10.1044/2021_AJSLP-20-00046,
Boerma, T., Leseman, P. P. M., Timmermeister, M., Wijnen, F., and Blom, E. (2016). Narrative abilities of monolingual and bilingual children with and without language impairment: implications for clinical practice. Int. J. Lang. Commun. Disord. 51, 626–638. doi: 10.1111/1460-6984.12234,
Bohnacker, U. (2016). Tell me a story in English or Swedish: narrative production and comprehension in bilingual preschoolers and first graders. Appl. Psycholinguist. 37, 19–48. doi: 10.1017/S0142716415000405
Bohnacker, U., and Gagarina, N. (2020). Introduction to MAIN – revised, how to use the instrument and adapt it to further languages. ZAS Papers in Linguistics 64, xiii–xxi. doi: 10.21248/zaspil.64.2020.549
Bohnacker, U., Lindgren, J., and Öztekin, B. (2022). Storytelling in bilingual Turkish-Swedish children: effects of language, age and exposure on narrative macrostructure. Linguist. Approach. Biling. 12, 413–445. doi: 10.1075/lab.20057.boh
Botting, N. (2002). Narrative as a tool for the assessment of linguistic and pragmatic impairments. Child Lang. Teach. Ther. 18, 1–21. doi: 10.1191/0265659002ct224oa
Brown, R. (1973). A first language: The early stages. Boston, MA: Harvard University Press. doi: 10.4159/harvard.9780674732469
Cattani, A., Abbot-Smith, K., Farag, R., Krott, A., Arreckx, F., Dennis, I., et al. (2014). How much exposure to English is necessary for a bilingual toddler to perform like a monolingual peer in language tests? Int. J. Lang. Commun. Disord. 49, 649–671. doi: 10.1111/1460-6984.12082,
Chan, A., Chen, S., Hamdani, S., Tse, B., and Cheng, K. (2023). Story telling in bilingual Urdu−Cantonese ethnic minority children: macrostructure and its relation to microstructural linguistic skills. Front. Psychol. 14:924056. doi: 10.3389/fpsyg.2023.924056,
Chen, L., and Lei, J. (2013). The production of referring expressions in oral narratives of Chinese–English bilingual speakers and monolingual peers. Child Lang. Teach. Ther. 29, 41–55. doi: 10.1177/0265659012459527
Cheung, S., Kan, P. F., Winicour, E., and Yang, J. (2018). Effects of home language input on the vocabulary knowledge of sequential bilingual children. Biling. Lang. Cogn. 22, 986–1004. doi: 10.1017/S1366728918000810
Cummins, J. (1979a). Cognitive/academic language proficiency, linguistic interdependence, the optimum age question and some other matters Working Papers on Bilingualism, No. 19. Working Papers on Bilingualism, vol. 19, 121–129.
Cummins, J. (1979b). Linguistic interdependence and the educational development of bilingual children. Rev. Educ. Res. 49, 222–251. doi: 10.2307/1169960
Curenton, S. M. (2011). Understanding the landscapes of stories: the association between preschoolers’ narrative comprehension and production skills and cognitive abilities. Early Child Dev. Care 181, 791–808. doi: 10.1080/03004430.2010.490946
Daskalaki, E., Soto-Corominas, A., Xia, V. Y., and Paradis, J. (2024). The role of parental characteristics, home language use, and schooling in children’s mandarin heritage language development in Canada. Front. Lang. Sci. 3:1435200. doi: 10.3389/flang.2024.1435200
De Houwer, A. (2007). Parental language input patterns and children’s bilingual use. Appl. Psycholinguist. 28, 411–424. doi: 10.1017/S0142716407070221
Dosi, I., Kouki, E., Lada, A., and Keulen, S. (2024). Beyond monolingual norms: analyzing the complexity of Greek-Turkish bilingual children’s written narratives. NPL. 1, 1–24.doi: 10.56395/npl.v1i1.12
Durán, P., Malvern, D., Richards, B., and Chipere, N. (2004). Developmental trends in lexical diversity. Appl. Linguist. 25, 220–242. doi: 10.1093/applin/25.2.220
Ece Demir-Lira, Ö., Applebaum, L. R., Goldin-Meadow, S., and Levine, S. C. (2019). Parents’ early book reading to children: relation to children’s later language and literacy outcomes controlling for other parent language input. Dev. Sci. 22:e12764. doi: 10.1111/desc.12764,
Fiestas, C. E., and Peña, E. D. (2004). Narrative discourse in bilingual children: language and task effects. Lang. Speech Hear. Serv. Sch. 35, 155–168. doi: 10.1044/0161-1461(2004/016),
Gagarina, N. (2016). Narratives of Russian–German preschool and primary school bilinguals: rasskaz and erzaehlung. Appl. Psycholinguist. 37, 91–122. doi: 10.1017/S0142716415000430
Gagarina, N. V., Klop, D., Kunnari, S., Tantele, K., Välimaa, T., Balčiūnienė, I., et al. (2019). MAIN: multilingual assessment instrument for narratives. Zaspil 56:155. doi: 10.21248/zaspil.56.2019.414
Gagarina, N., Klop, D., Tsimpli, I. M., and Walters, J. (2016). Narrative abilities in bilingual children. Appl. Psycholinguist. 37, 11–17. doi: 10.1017/S0142716415000399
Gathercole, V. C. M., and Thomas, E. M. (2009). Bilingual first-language development: dominant language takeover, threatened minority language take-up. Biling. Lang. Cogn. 12, 213–237. doi: 10.1017/S1366728909004015
Genesee, F., and Nicoladis, E. (2006). “Bilingual first language acquisition” in Handbook of language development. eds. E. Hoff and M. Shatz (Oxford: Blackwell), 45–67.
Gibson, T. A., Peña, E. D., and Bedore, L. M. (2018). The receptive–expressive gap in English narratives of Spanish–English bilingual children with and without language impairment. J. Speech Lang. Hear. Res. 61, 1381–1392. doi: 10.1044/2018_JSLHR-L-16-0432,
Gorman, B. K., Fiestas, C. E., Peña, E. D., and Clark, M. R. (2011). Creative and stylistic devices employed by children during a storybook narrative task: a cross-cultural study. Lang. Speech Hear. Serv. Sch. 42, 167–181. doi: 10.1044/0161-1461(2010/10-0052),
Grosjean, F. (1998). Studying bilinguals: Methodological and conceptual issues. Bilingualism: Language and Cognition. 1, 131–149. doi: 10.1017/S136672899800025X
Gutierrez-Clellen, V. F. (2002). Narratives in two languages: assessing performance of bilingual children. Linguist. Educ. 13, 175–197. doi: 10.1016/s0898-5898(01)00061-4
Gutierrez-Clellen, V. F., Restrepo, M. A., Bedore, L. M., Peña, E. D., and Anderson, R. T. (2000). Language sample analysis in Spanish-speaking children: methodological considerations. Language Speech Hearing Services Schools 31, 88–98. doi: 10.1044/0161-1461.3101.88,
Hammer, C. S., Komaroff, E., Rodriguez, B. L., Lopez, L. M., Scarpino, S. E., and Goldstein, B. (2012). Predicting Spanish–English bilingual children’s language abilities. JSLHR 55, 1251–1264. doi: 10.1044/1092-4388(2012/11-0016),
Hao, Y., Bedore, L. M., Sheng, L., and Peña, E. D. (2019). Narrative skills in two languages of mandarin–English bilingual children. Int. J. Speech-Lang. Pathol. 21, 325–335. doi: 10.1080/17549507.2018.1444092,
Hayes, R. L., and Kan, P. F. (2025). The semantic depth of Japanese-English bilingual children. Front. Lang. Sci. 4:1601352. doi: 10.3389/flang.2025.1601352
Heilmann, J., Miller, J. F., Nockerts, A., and Dunaway, C. (2010). Properties of the narrative scoring scheme using narrative retells in young school-age children. Am. J. Speech Lang. Pathol. 19, 154–166. doi: 10.1044/1058-0360(2009/08-0024),
Hipfner-Boucher, K., Milburn, T. F., Weitzman, E., Greenberg, J., Pelletier, J., and Girolametto, L. (2015). Narrative abilities in subgroups of English language learners and monolingual peers. Int. J. Bil. 19, 677–692. doi: 10.1177/1367006914534330
Iluz-Cohen, P., and Walters, J. (2012). Telling stories in two languages: narratives of bilingual preschool children with typical and impaired language. Biling. Lang. Cogn. 15, 58–74. doi: 10.1017/s1366728911000538
Jacobson, P. F., and Walden, P. R. (2013). Lexical diversity and omission errors as predictors of language ability in the narratives of sequential Spanish–English bilinguals: a cross-language comparison. Am. J. Speech-Lang. Pathol. 22, 554–565. doi: 10.1044/1058-0360(2013/11-0055),
Justice, L. M., Bowles, R. P., Kaderavek, J. N., Ukrainetz, T. A., Eisenberg, S. L., and Gillam, R. B. (2006). The index of narrative microstructure: a clinical tool for analyzing school-age children’s narrative performances. Am. J. Speech Lang. Pathol. 15, 177–191. doi: 10.1044/1058-0360(2006/017),
Kaiser, S., Ichikawa, Y., Kobayashi, N., and Yamamoto, H. (2013). Japanese: A comprehensive grammar. 2nd Edn. London: Routledge. doi: 10.4324/9780203085196
Kan, R. T. Y., Chan, A., and Gagarina, N. (2020). Investigating children’s narrative abilities in a Chinese and multilingual context: Cantonese, mandarin, Kam and Urdu adaptations of the multilingual assessment instrument for narratives (MAIN). Front. Psychol. 11:573780. doi: 10.3389/fpsyg.2020.573780,
Kan, P. F., and Kohnert, K. (2005). Preschoolers learning Hmong and English. J. Speech Lang. Hear. Res. 48, 372–383. doi: 10.1044/1092-4388(2005/026),
Košutar, S., Kramarić, M., and Hržica, G. (2022). The relationship between narrative microstructure and macrostructure: differences between six- and eight-year-olds. Psychol. Lang. Commun. 26, 126–153. doi: 10.2478/plc-2022-0007
Kubota, M., Chondrogianni, V., Clark, A. S., and Rothman, J. (2022). Linguistic consequences of toing and froing: factors that modulate narrative development in bilingual returnee children. Int. J. Bilingual Educ. Bilingual. 25, 2363–2381. doi: 10.1080/13670050.2021.1910621
Kunnari, S., and Välimaa, T. (2020). “Narrative comprehension in simultaneously bilingual Finnish-Swedish and monolingual Finnish children” in Developing narrative comprehension: Multilingual assessment instrument for narratives eds. U. Bohnacker and N. Gagarina 149–170. doi: 10.1075/sibil.61.05kun
Kunnari, S., Välimaa, T., and Laukkanen-Nevala, P. (2016). Macrostructure in the narratives of monolingual Finnish and bilingual Finnish–Swedish children. Appl. Psycholinguist. 37, 123–144. doi: 10.1017/S0142716415000442
Liles, B. Z. (1993). Narrative discourse in children with language disorders and children with normal language: a critical review of the literature. J. Speech Lang. Hear. Res. 36, 868–882. doi: 10.1044/jshr.3605.868,
Liles, B. Z., Duffy, R. J., Merritt, D. D., and Purcell, S. L. (1995). Measurement of narrative discourse ability in children with language disorders. J. Speech Lang. Hear. Res. 38, 415–425. doi: 10.1044/jshr.3802.415,
Lindgren, J., Tselekidou, F., and Gagarina, N. (2023). Acquisition of narrative macrostructure: a comprehensive overview of results from the multilingual assessment instrument for narratives. ZAS Pap. Linguist. 65, 111–132. doi: 10.21248/zaspil.65.2023.623
Lipner, M., Armon-Lotem, S., Fichman, S., Walters, J., and Altman, C. (2024). Impact of narrative task complexity and language on macrostructure in bilingual kindergarten children. Lang. Speech Hear. Serv. Sch. 55, 545–560. doi: 10.1044/2023_LSHSS-23-00152,
Lynch, J. S., van den Broek, P., Kremer, K. E., Kendeou, P., White, M. J., and Lorch, E. P. (2008). The development of narrative comprehension and its relation to other early reading skills. Read. Psychol. 29, 327–365. doi: 10.1080/02702710802165416
Mäkinen, L., Loukusa, S., Laukkanen, P., Leinonen, E., and Kunnari, S. (2014). Linguistic and pragmatic aspects of narration in Finnish typically developing children and children with specific language impairment. Clin. Linguist. Phon. 28, 413–427. doi: 10.3109/02699206.2013.875592,
Maviş, İ., Tunçer, M., and Gagarina, N. (2016). Macrostructure components in narrations of Turkish–German bilingual children. Appl. Psycholinguist. 37, 69–89. doi: 10.1017/S0142716415000429
Miller, J. F., Andriacchi, K., and Nockerts, A. (Eds.) (2019). Assessing language production using SALT. Third Edn. London: SALT Software LLC.
Miller, J. F., Heilmann, J., Nockerts, A., Iglesias, A., Fabiano, L., and Francis, D. J. (2006). Oral language and reading in bilingual children. Learn. Disabil. Res. Pract. 21, 30–43. doi: 10.1111/j.1540-5826.2006.00205.x
Mishina-Mori, S., Nakano, Y., Yujobo, Y. J., and Kawanishi, Y. (2024). Is referent reintroduction more vulnerable to crosslinguistic influence? An analysis of referential choice among Japanese–English bilingual children. Language 9:120. doi: 10.3390/languages9040120
Montanari, S. (2004). The development of narrative competence in the L1 and L2 of Spanish-English bilingual children. Int. J. Bilingual. 8, 449–497. doi: 10.1177/13670069040080040301
Otwinowska, A., Mieszkowska, K., Białecka-Pikul, M., Opacki, M., and Haman, E. (2020). Retelling a model story improves the narratives of polish-English bilingual children. Int. J. Bilingual Educ. Bilingual. 23, 1083–1107. doi: 10.1080/13670050.2018.1434124
Paris, A. H., and Paris, S. G. (2003). Assessing narrative comprehension in young children. Read. Res. Q. 38, 36–76. doi: 10.1598/RRQ.38.1.3
Peña, E. D., Gillam, R. B., and Bedore, L. M. (2014). Dynamic assessment of narrative ability in English accurately identifies language impairment in English language learners. JSLHR 57, 2208–2220. doi: 10.1044/2014_JSLHR-L-13-0151,
Pesco, D., and Kay-Raining Bird, E. (2016). Perspectives on bilingual children’s narratives elicited with the multilingual assessment instrument for narratives. Appl. Psycholinguist. 37, 1–9. doi: 10.1017/S0142716415000387
Peterson, C., and McCabe, A. (1983). Developmental psycholinguistics: three ways of looking at a child’s narrative. Softcover reprint of the original. 1st Edn. Boston, MA: Springer.
Poulsen, D., Kintsch, E., Kintsch, W., and Premack, D. (1979). Children’s comprehension and memory for stories. J. Exp. Child Psychol. 28, 379–403. doi: 10.1016/0022-0965(79)90070-5,
Rezzonico, S., Goldberg, A., Mak, K. K.-Y., Yap, S., Milburn, T. F., Belletti, A., et al. (2016). Narratives in two languages: storytelling of bilingual Cantonese–English preschoolers. J. Speech Lang. Hear. Res. 59, 521–532. doi: 10.1044/2015_jslhr-l-15-0052,
Roch, M., Florit, E., and Levorato, C. (2016). Narrative competence of Italian–English bilingual children between 5 and 7 years. Appl. Psycholinguist. 37, 49–67. doi: 10.1017/S0142716415000417
Rodina, Y. (2017). Narrative abilities of preschool bilingual Norwegian-Russian children. Int. J. Bilingual. 21, 617–635. doi: 10.1177/1367006916643528
Sheng, L., Lu, Y., and Kan, P. F. (2011). Lexical development in mandarin–English bilingual children. Biling. Lang. Cogn. 14, 579–587. doi: 10.1017/S1366728910000647
Slobin, D. I. (1996). “Two ways to travel: verbs of motion in English and Spanish” in Grammatical constructions: Their form and meaning. eds. M. Shibatani and S. A. Thompson (New York, NY: Clarendon Press), 195–219.
Squires, K. E., Lugo-Neris, M. J., Peña, E. D., Bedore, L. M., Bohman, T. M., and Gillam, R. B. (2014). Story retelling by bilingual children with language impairments and typically-developing controls. Int. J. Lang. Commun. Disord. 49:60. doi: 10.1111/1460-6984.12044,
Stein, N. L., and Glenn, C. G. (1979). “An analysis of story comprehension in elementary school children” in Discourse processing: Multidisciplinary perspectives. ed. R. Freedle (Norwood, NJ: Ablex).
Suggate, S., Schaughency, E., McAnally, H., and Reese, E. (2018). From infancy to adolescence: the longitudinal links between vocabulary, early literacy skills, oral narrative, and reading comprehension. Cogn. Dev. 47, 82–95. doi: 10.1016/j.cogdev.2018.04.005
Titone, D. A., and Tiv, M. (2023). Rethinking multilingual experience through a systems framework of bilingualism. Biling. Lang. Cogn. 26, 1–16. doi: 10.1017/S1366728921001127
To, C. K. S., Stokes, S. F., Cheung, H. T., and T’sou, B. (2010). Narrative assessment for Cantonese-speaking children. JSLHR 53, 648–669. doi: 10.1044/1092-4388(2009/08-0039),
Tsimpli, I. M., Peristeri, E., and Andreou, M. (2016). Narrative production in monolingual and bilingual children with specific language impairment. Appl. Psycholinguist. 37, 195–216. doi: 10.1017/S0142716415000478
Uccelli, P., and Páez, M. M. (2007). Narrative and vocabulary development of bilingual children from kindergarten to first grade: developmental changes and associations among English and Spanish skills. Lang. Speech Hearing Serv. Schools 38, 225–236. doi: 10.1044/0161-1461(2007/024),
van Dijk, T. A. (1976a). Narrative macro-structures: logical and cognitive foundations. J. Descriptive Poetics Theor. Lit. 1, 547–568.
van Dijk, T. A. (1976b). Philosophy of action and theory of narrative. Poetics 5, 287–338. doi: 10.1016/0304-422x(76)90014-0
van Dijk, T. A. (1980). Macrostructures: An interdisciplinary study of global structures in discourse, interaction, and cognition. 1st Edn. Milton: Routledge.
Westby, C. E. (2012). “Assessing and facilitating text comprehension problems” in Language and reading disabilities. eds. A. G. Kamhi and H. W. Catts (Boston, MA: Allyn & Bacon), 157–232.
Appendix
Structural complexity examples
Episode 1:
• GAO
So, one day a cat saw a butterfly on the bush.
And the cat wanted to catch the butterfly. (Goal)
The cat leaped over the bush to try to catch the butterfly. (Attempt)
But he fell into the bushes and hurt himself. (Outcome)
• AO
The mouse was sitting there.
He’s going to chase it. (Attempt)
He was running when he bumped into the tree. (Outcome)
• O
There was a playful dog.
The mouse ran away very quickly.
The dog bumped his head. (Outcome)
Episode 2:
• GAO
The boy was so surprised to see the cat in the bushes that he dropped the ball.
He was sad that the ball went into the water.
He wanted to get it back. (Goal)
The boy tried to get the ball back. (Attempt)
He got it and was very happy to have his ball back. (Outcome)
• GO
And while the boy was thinking how he wanted the ball, the cat saw the bucket. (Goal).
And the boy had the ball back. (Outcome)
• A
The boy is carrying a bag with sausages.
His balloon slipped out of his hands.
He’s getting the balloon. (Attempt)
Episode 3:
• GAO
The cat saw a bucket of fish.
And wanted to eat one. (Goal)
The cat went to get the fish from the bucket. (Attempt)
And the cat was happy to get fish. (Outcome)
• GA
And the dog noticed the sausages.
And he said he wanted to grab them. (Goal)
And the boy did not notice the dog was trying to grab them. (Attempt)
• O
And the dog saw the sausages.
And the man was so happy that he did not notice that the dog ate his sausage. (Outcome)
Keywords: bilinguals, Japanese, macrostructure, Multilingual Assessment Instrument of Narratives, story grammar
Citation: Hayes RL and Kan PF (2026) Shared and divergent patterns in narrative skills: comparing English monolingual and Japanese–English bilingual children. Front. Psychol. 17:1747702. doi: 10.3389/fpsyg.2026.1747702
Edited by:
Pia Knoeferle, Humboldt University of Berlin, GermanyReviewed by:
Ifigeneia Dosi, Democritus University of Thrace, GreeceHideyuki Taura, Ritsumeikan University, Japan
Copyright © 2026 Hayes and Kan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Richy Lewis Hayes, cmloYTg1MDhAY29sb3JhZG8uZWR1