Original Research ARTICLE
Fundamental Frequency Variation of Neonatal Spontaneous Crying Predicts Language Acquisition in Preterm and Term Infants
- 1Graduate School of Education, Kyoto University, Kyoto, Japan
- 2Japan Society for the Promotion of Science, Tokyo, Japan
- 3Department of Pediatrics, Graduate School of Medicine, Kyoto University, Kyoto, Japan
Spontaneous cries of infants exhibit rich melodic features (i.e., time variation of fundamental frequency [F0]) even during the neonatal period, and the development of these characteristics might provide an essential base for later expressive prosody in language. However, little is known about the melodic features of spontaneous cries in preterm infants, who have a higher risk of later language-related problems. Thus, the present study investigated how preterm birth influenced melodic features of spontaneous crying at term-equivalent age as well as how these melodic features related to language outcomes at 18 months of corrected age in preterm and term infants. At term, moderate-to-late preterm (MLP) infants showed spontaneous cries with significantly higher F0 variation and melody complexity than term infants, while there were no significant differences between very preterm (VP) and term infants. Furthermore, larger F0 variation within cry series at term was significantly related to better language and cognitive outcomes, particularly expressive language skills, at 18 months. On the other hand, no other melodic features at term predicted any developmental outcomes at 18 months. The present results suggest that the additional postnatal vocal experience of MLP preterm infants increased F0 variation and the complexity of spontaneous cries at term. Additionally, the increases in F0 variation may partly reflect the development of voluntary vocal control, which, in turn, contributes to expressive language in infancy.
Crying is one of the few means by which human infants can communicate a variety of emotions, including hunger, pain, and frustration. The majority of research investigating infant crying has evaluated acoustic features as a non-invasive index of neurophysiological states during the neonatal period, including pain stress (Stevens et al., 2007) and medical complications (Wasz-Höckert et al., 1985). In line with these findings, the most common acoustic feature of pain-induced cries is that of fundamental frequency (F0); for example, abnormally high F0 values are related to a variety of medical conditions including chromosomal, endocrine, metabolic, and neurological disturbances (Wasz-Höckert et al., 1985).
Previous studies investigating preterm infants (gestational age < 37 weeks) have focused on F0 in neonatal cries. Preterm infants exhibit higher F0 of pain-induced cries before term-equivalent age compared to term newborns but these differences disappear around term-equivalent age (Michelsson et al., 1983; Johnston et al., 1993; Cacace et al., 1995). In contrast, the present authors found that the spontaneous cries (i.e., cries unaffected by external stress) of preterm infants at term-equivalent age are higher in terms of F0 min, F0 mean, and F0 max (see section “Materials and Methods”) than those of term newborns (Shinya et al., 2014). Our research group also showed that shorter gestational age is significantly associated with higher F0, regardless of body size at cry recording and intrauterine growth retardation [IUGR]. Thus, rather than being a product of a smaller body size, the increased F0 of spontaneous cries in preterm infants may be the result of more complicated neurophysiological states, such as low vagal activity (Shinya et al., 2016), due to their different intra- and extrauterine experiences.
On the other hand, melody (i.e., time variation in F0) is considered to be one of the most crucial acoustic features of infant cries. In term infants, spontaneous cries exhibit rich melodic features even during the neonatal period (Mampe et al., 2009; Wermke et al., 2016) and their cry melodies are reported to become increasingly more variable (Prescott, 1975) and complex (Wermke et al., 2002, 2007) across the first months of life. For example, developmental changes have been observed in the spontaneous cries of term infants such that F0 variability (i.e., standard deviation [SD]) within a cry utterance increases from 28 Hz at 1–2 weeks of age to 53 Hz at 6–9 months of age (Prescott, 1975). Furthermore, the spontaneous cries of term infants show unidirectional development from mostly simple single-arc melody structures to an increasing number of multiple-arc structures during the first several months (Wermke et al., 2002, 2007; Figures 1A,B). These developmental changes in infant cry melody are assumed to reflect increases in laryngeal vocal coordination due to maturation in the developing central nervous system (CNS).
FIGURE 1. (A) An example of spectrogram and (B) corresponding melody (i.e., time series of fundamental frequency [F0]) contour diagrams for a complex cry with multiple-arcs. (C) Two examples of melody contours in a series of spontaneous cries in 60 s, indicating the F0 transposition range, which means the difference between minimum and maximum values among F0 max. The upper diagram indicated a smaller F0 transposition range (98 Hz [478–576], 3.24 semitones), while the bottom one indicated a greater F0 transposition range (197 Hz [405–602], 6.85 semitones). Not included in the diagrams were cry utterances excluded from the main analysis, respiratory phases, and no-phonatory periods.
Broadening the perspective, there is a potential relationship between the development of an infant’s cry melody and language acquisition (Darwin, 1872; Lieberman, 1985; Wermke and Mende, 2009). The evolutionary viewpoint emphasizes that expressive language may originate from the imitation and modification of humans’ instinctive cries by articulate sounds (Darwin, 1871). According to this view, an increased melodic variation in infant crying is assumed to provide an essential base for linguistic prosody expressive of various emotions (Darwin, 1872; Jespersen, 1922; Wermke and Mende, 2009). This assumption is consistent with observations of the high sensitivity to melodic features of adult speech during the fetal (Kisilevsky et al., 2009; Partanen et al., 2013; Webb et al., 2015) and neonatal (Kujala et al., 2004; Naoi et al., 2013) periods. Additionally, infant cry melody is shaped by ambient language during these periods (Mampe et al., 2009; Wermke et al., 2016).
The continuity between cry melody and language development is supported by phonetic evidence as well. Wermke et al. (2007) reported that a high percentage of term infants who exhibit multiple-arc melody in cries made during the second month have better language outcomes at 2.5 years (Wermke et al., 2007). Previous studies of mammals suggest that the production of infant cries in humans involves cortical regions, such as the anterior cingulate gyrus, as well as brainstem regions (Müller-Preuss and Jürgens, 1976; Jürgens, 2002; Newman, 2007). Considering the rapid maturation of the CNS during the first few months of life, the development of infant cry melody may reflect increasing voluntary vocal control by the cortex, which provides experience that facilitates control over later non-cry vocalizations related to language (Lieberman, 1985; Wermke et al., 2007).
However, the manner in which preterm birth might affect the melodic features of spontaneous cries and how these cries are related to later language acquisition in preterm peers remain unclear. Preterm infants exhibit slower acquisition of receptive and expressive language skills in their second year of life (Foster-Cohen et al., 2007; Sansavini et al., 2011) as well as during childhood (Luu et al., 2009; Barre et al., 2011; van Noort-van der Spek et al., 2012). Furthermore, during the earlier stages of language acquisition, preterm infants exhibit atypical developmental patterns of preverbal communication, such as an earlier onset of canonical babbling (Eilers et al., 1993) and reductions in joint attention (Garner et al., 1991; De Schuymer et al., 2011a; Imafuku et al., 2017). Considering their atypical developmental pattern of language acquisition, the identification of melodic features of spontaneous cries during early developmental periods and the assessment of the relationships of these features with early language acquisition are significant issues (Smith and Ulvund, 2003; Peña et al., 2010; De Schuymer et al., 2011b).
Thus, the present study assessed melodic features of spontaneous cries in preterm infants with no severe complications at a term-equivalent age and compared these features to those of term newborns (Shinya et al., 2014). In particular, the variability (i.e., height of F0 variation) and complexity (i.e., amount of melody arcs) of the melodic features were a specific focus. Furthermore, based on the assumption of a potential link between cry melody and language development (Darwin, 1872; Lieberman, 1985; Wermke and Mende, 2009), the manner in which the analyzed melodic features of spontaneous cries at term contribute to early communication and language development at 18 months of corrected age were examined in preterm and term infants. Especially, it was expected that infants who exhibit more variable and complex cry melodies at term would show better language outcomes at 18 months. We focused on their language outcomes at 18 months of age because it has been reported that preterm infants (gestational age ≤ 32 weeks) begin to show lower receptive and expressive language development by the age, compared to term infants (Sansavini et al., 2011).
Materials and Methods
The present study included 77 preterm infants (gestational age < 37 weeks) and 30 term infants (gestational age ≥ 37 weeks) who were recruited between 2011 and 2015 from Kyoto University Hospital. The preterm infants were assigned to two subgroups according to gestational age: very preterm (VP) infants (gestational age < 32 weeks; n = 36) and moderate-to-late preterm (MLP) infants (gestational age ≥ 32 weeks but < 37 weeks; n = 41); all participants were included in previous studies conducted by our research group (i.e., the preterm group in Shinya et al., 2014, 2016 and the term group in Shinya et al., 2016). The exclusion criteria consisted of severe neurological complications such as brain lesions (including periventricular leukomalacia, and Grade III or IV intraventricular hemorrhages), chromosomal abnormalities, and/or medical treatment required for respiratory disease at term-equivalent age. When all participants were in the hospital before cry recording, written informed consent was obtained from their parents. All participants came from Japanese families, and all families were considered middle class. On the basis of our interactions with the parents, all parents were regarded to be healthy with no cognitive and language deficits. The study was conducted with the approval of the ethics committee of Kyoto University Graduate School and Faculty of Medicine (No. E581) and was conducted according to standards specified in the Declaration of Helsinki from 1964. The demographic data of the participants at term-equivalent age are provided in Table 1.
Recording of Spontaneous Crying at Term-Equivalent Age
All infants were at term-equivalent age and studied between 5 and 9 p.m. while in a supine position in an open crib. Preterm infants were assessed in a growing care unit at Kyoto University Hospital where they stayed until leaving the hospital whereas term infants were assessed in a quiet examination room at the hospital. Environmental conditions, including the crib, noise level, and ambient temperatures, were controlled for all participants. Environmental noise levels in the rooms were perceptually judged to be low and acceptable for audio recording and analysis. Spontaneous cries of each infant less than 30 min before feeding were recorded for more than 60 s using a wave recorder at a 44.1 kHz sampling rate with 16-bit quantization. If an infant cried for longer than 60 s, the 60 s period of successive cries nearest to the feeding were selected. During recording, the distance between the microphone and the infant’s mouth was approximately 15 cm (EDIROL R-09; Roland, Corp., Los Angeles, CA, United States).
Assessment of Language Development at a Corrected Age of 18 months
The children were enrolled in a follow-up session to assess language development at 18 months of corrected age. Language development was evaluated using the MacArthur Communicative Development Inventory, adapted for Japanese (MCDI; Words and Gestures; Ogura and Watamaki, 1998) and the Kyoto Scale of Psychological Development (KSPD; Ikuzawa et al., 2002). The MCDI is a valid, reliable parent survey for assessing early communication and language development from 8 to 18 months. We used the questionnaire to evaluate infants’ gesture, receptive, and expressive language skills. Primary caregivers completed the MCDI after the administration of the KSPD. The KSPD is a Japanese standardized developmental scale commonly administered to typically developing infants and low-functioning children with disabilities. This scale is an individualized face-to-face test based on Gesell’s developmental diagnosis and the assessment items of the Binet test. The KSPD measures general developmental progress and delays in three domains: postural-motor (P-M), cognitive-adaptive (C-A), and language-social (L-S). The developmental quotients of these three areas are highly correlated with corresponding composite scores (i.e., motor, cognitive, and language) on the Bayley III (Kono et al., 2016). The KSPD was conducted in an examination room at Kyoto University Hospital for preterm infants and in an experimental room at Kyoto University for term infants. Visual distractions were removed when possible and the environmental noise level of the room was perceptually judged to be low.
For the follow-up assessment at 18 months, 83 infants from the initial cohort (n = 107) completed the MCDI (VP group: n = 29, MLP group: n = 27, and term group: n = 19) and/or KSPD (VP group: n = 30, MLP group: n = 32, and term group: n = 20). At the MCDI assessment, the mean corrected age of the VP group was 18.63 months (SD = 0.50, range = 17.71–19.81), that of the MLP group was 18.43 months (SD = 0.57, range = 17.45–19.91), and that of the term group was 18.45 months (SD = 0.44, range = 17.58–19.12). At the KSPD assessment, the mean corrected age of the VP group was 18.47 months (SD = 0.31, range = 17.71–19.19), that of the MLP group was 18.33 months (SD = 0.55, range = 17.45–19.68), and that of the term group was 18.17 months (SD = 0.42, range = 17.15–18.89). A number of participants did not complete the follow-up assessments for the following reasons: refusal to participate (e.g., moving far away; n = 9), incomplete questionnaire and developmental tests (n = 3), and no need to visit the hospital due to a lack of clinical conditions for the preterm group (n = 12). The included group (n = 83, female: n = 45) and the excluded group (n = 24, female: n = 14) did not significantly differ in terms of the acoustic features of the cries and the developmental outcomes. However, compared to the excluded group, the included group had a significantly earlier gestational age (included group: M = 33.5 weeks, SD = 4.8, range = 23.1–41.4; excluded group: M = 35.5 weeks, SD = 4.1, range = 26.6–40.7; t105 = -1.89, d = 0.44, p = 0.046). These group differences may have been due to the fact that the excluded group had a relatively high ratio of term infants and MLP preterm infants with fewer clinical problems.
Cry Acoustic Features
Acoustic features of infants’ cries were assessed using Praat (ver. 6.0.19) (Boersma and Weenink, 2016). A cry utterance was defined as a vocal output occurring on a single expiration and lasting for at least 0.3 s, to exclude non-cry sounds, such as coughs (Shinya et al., 2014, 2016). We manually segmented each infant’s 60 s cry series into single cry utterances based on amplitude-by-time waveforms, and 4,012 cries were extracted in total. These onset and offset points were also estimated to determine the duration of each cry utterance. Based on visual inspections of spectrograms, cry utterances containing broad regions of environmental noise were excluded from analysis to avoid artifacts when determining the F0. Additionally, phonatory noise phenomena and phenomena such as sudden F0 shifts or subharmonics were excluded from the analyses because the F0 contour cannot be reliably determined in those signals. Ultimately, 3,578 cries (89.2% of all cry utterances) were used in the acoustic analyses (number of cries per infant; VP group: M = 38.6 [SD = 16.7]; MLP group: M = 39.0 [SD = 17.3]; and term group: M = 34.1 [SD = 13.9]); a one-way analysis of variance (ANOVA) revealed no significant differences among the three groups (p = 0.401).
The cry utterances were down-sampled to 22.05 kHz and low-pass filtered at 10 kHz to eliminate outliers and artifacts. The following F0 estimation was calculated for each cry utterance using a Praat autocorrelation algorithm (a noise-resistant autocorrelation method at 150–900 Hz with a Hanning window length of 0.05 s) and then averaged for each infant. The F0 min (lowest F0 of a cry utterance), F0 mean (arithmetic average of F0), and F0 max (highest F0) were calculated to assess the height of F0. These F0 measures were previously reported by our research group (Shinya et al., 2014, 2016); the values and group differences of the F0 measures and cry duration are provided in Supplementary Table S1.
The height of the F0 variation was estimated to assess variability in the cry melody. As a measure of F0 variation within cry utterance, F0 range (difference between F0 min and F0 max) and F0 sigma (SD for the mean F0) were calculated. Furthermore, the F0 transposition range was determined to assess F0 variation within 60 s cry series. Beginning in the neonatal period, infants produce shape-similar cry melodies at different frequency levels for which there are large individual differences in the transposition range, even though the range gradually increases with postnatal age (Wermke and Mende, 2009). Based on these observations, F0 transposition range was calculated by subtracting the minimum value of F0 max from the maximum value of F0 max within cry series (Figure 1C).
To assess the complexity of cry melody, the melody complexity index (MCI) was calculated (Wermke et al., 2007, 2014). Neonatal cries with an identifiable melody commonly exhibit a single ascending–descending arc. These simple single-arc melodies are increasingly replaced by complex melodies (i.e., double- or multiple-arc melodies) over the first several months in term infants (Wermke et al., 2002, 2007). Thus, cry utterances were subdivided into those with only a simple (single-arc) melody and those with a complex (multiple-arc) melody (Figures 1A,B). Furthermore, arc-like melody substructures were identified, with arcs defined as longer than 150 ms and exhibiting an F0 range of at least two semitones (Wermke et al., 2014). MCI ratio was calculated for each infant by dividing the number of cries consisting of multiple-arc melodies by the total number of cry utterances (Wermke et al., 2007, 2014). In addition, we calculated an MCI number for each infant by averaging the number of arcs within cry utterance. Because some infants exhibit no-arc melodies (i.e., “flat plateau”; Wermke and Mende, 2009) or more than double-arc (Figures 1A,B) melodies during the neonatal period, the MCI number is assumed to be more sensitive for assessing the ability to produce complex cry melodies compared to the MCI ratio.
For the MCDI, the following three sub-components of early communication and language development were calculated at 18 months of corrected age: gesture, receptive language, and expressive language. Because the expressive language score was skewed positively, it was transformed using a natural log (ln) transform to normalize the distribution for the statistical analysis. For the KSPD, the developmental quotient was calculated by dividing developmental age by corrected age and then multiplying the resulting quotient by 100 for each of three areas (i.e., P-M, C-A, and L-S). It was expected that the melodic features of spontaneous cries would be selectively associated with the L-S score, which is a measure of early communication and language development on the KSPD.
Differences of Cry Melodic Features and Developmental Outcomes between Preterm and Term Infants
The acoustic features of spontaneous crying at term-equivalent age are detailed in Table 1 and language developmental scores at 18 months of corrected age are provided in Table 2. For the melodic features of crying, a one-way ANOVA assessing data from the gestational group revealed significant group differences in F0 range, F0 sigma, F0 transposition range, and MCI number (Table 1). Post hoc testing using the Bonferroni method revealed that the MLP group had higher values for F0 range (p < 0.001), F0 sigma (p < 0.001), F0 transposition range (p = 0.011), and MCI number (p = 0.041) than the term group (Figure 2).
FIGURE 2. Box plots for melodic features of spontaneous crying: (A) variability indexes (i.e., F0 range, F0 sigma, F0 Transposition range) and (B) melody complexity indexes (i.e., MCI ratio, MCI number) in very preterm (VP; n = 36, white box), moderate-to-late preterm (MLP; n = 41, gray box), and term infants (Term; n = 30, black box) at term-equivalent age. Boxes represent medians with upper and lower quartiles; whiskers represent maximum and minimum values, excluding outliers indicated by circles, at least 1.5 times the interquartile range (i.e., 1.5 box lengths from the upper or lower edge of the box). ∗p < 0.05; ∗∗∗p < 0.01.
For the developmental scores, there were significant group differences in the MCDI scores for gesture, receptive language, and expressive language and KSPD scores for the C-A and L-S (Table 2). Post hoc testing of the MCDI data showed that the VP group had lower gesture scores than the term group (p = 0.046) as well as lower receptive and expressive language scores than the MLP group (p < 0.001). Additionally, the receptive language score of the MLP group was higher than that of the term group (p = 0.023). For the KSPD, the VP group exhibited a lower C-A score than the MLP and term groups (p = 0.035; p = 0.003, respectively) and the L-S score of the VP group was lower than that of the MLP group (p = 0.041).
Relations between Cry Melodic Features and Language Development
We firstly performed Pearson’s correlation to assess relations between cry melodic features at term and developmental outcomes at 18 months for all participants (MCDI: n = 75; KSPD: n = 82). The F0 transposition range exhibited a significant positive correlation with the receptive language (r = 0.25, p = 0.033) and expressive language (r = 0.38, p < 0.001) scores on the MCDI and with the L-S score (r = 0.29, p = 0.008) on the KSPD; there were no significant correlations with the MCDI gesture score (r = 0.18, p = 0.119) or the KSPD P-M and C-A scores (r = 0.04, p = 0.729; r = 0.18, p = 0.114, respectively). Scatter plots were constructed to illustrate the correlations between F0 transposition range at term and language development at 18 months (Figure 3). The other acoustic variables did not significantly relate to any developmental outcomes (Supplementary Table S2).
FIGURE 3. Scatter plots and regression lines with 95% confidence intervals showing the relationships between F0 transposition range of spontaneous cries at term-equivalent age and language outcomes at 18 months of corrected age: (A) MCDI’s gesture (n = 75, r = 0.18, p = 0.119); (B) MCDI’s receptive language (n = 75, r = 0.25, p = 0.033); (C) MCDI’s expressive language (n = 75, r = 0.38, p < 0.001); (D) KSPD’s Language-Social (n = 82, r = 0.29, p = 0.008). The groups of infants were VP (white circles), MLP (gray circles), and Term (black circles).
Additionally, hierarchical multiple regression analyses were conducted to analyze further the independent contributions of cry melodic variables to developmental outcomes, especially language development. Gestational age (Aarnoudse-Moens et al., 2009) and IUGR (Bergvall et al., 2006) were entered as independent variables in the first two steps; the next two steps included weight at cry recording and corrected age of developmental assessment, and the final step included each of the melodic variables of crying. In this model, we excluded other demographic variables related to gestational age (i.e., birth weight, Apgar score at 5 min, and postnatal age at cry recording) to avoid any collinearity of the predictors.
In the first four steps, a shorter gestational age significantly predicted lower gesture scores on the MCDI (p < 0.001) and lower P-M (p = 0.033), C-A (p < 0.001), and L-S (p = 0.004) scores on the KSPD. IUGR was significantly related to lower gesture scores on the MCDI (p = 0.018) and C-A (p = 0.005) and L-S (p = 0.018) scores on the KSPD. The next two steps did not result in significant values for any developmental outcomes (Table 3). In the final step, a higher F0 transposition range predicted higher gesture (p = 0.022), receptive language (p = 0.008), and expressive language (p < 0.001) scores on the MCDI and C-A (p = 0.024) and L-S (p < 0.001) scores on the KSPD (Table 3). The other melodic features of crying and F0 measures did not significantly predict any developmental outcomes; the partial correlation coefficients are reported in Supplementary Table S2.
TABLE 3. Hierarchical multiple regression analyses predicting developmental outcomes in all participants.
To the best of our knowledge, the present study is the first to investigate the effects of preterm birth on melodic features (i.e., time variation of F0) of spontaneous cries at term-equivalent age. Additionally, the relationships of the analyzed melodic features of crying with language outcomes at 18 months of corrected age were examined in a sample including preterm and term infants, based on the assumption of cry melody as an ontogenetic origin of language (Darwin, 1872; Lieberman, 1985; Wermke and Mende, 2009). The present results revealed that the spontaneous cries of the MLP group had significantly higher F0 variation and melody complexity compared to term infants, and that increased F0 variation within cry series (i.e., F0 transposition range) at term was significantly associated with better language and cognitive outcomes at 18 months. Even after confounding factors such as gestational age and IUGR were controlled for, the F0 transposition range of spontaneous crying significantly predicted language and cognitive outcomes, particularly expressive language skill.
We demonstrated that the spontaneous cries of MLP infants at term showed higher F0 variation (i.e., F0 range, F0 sigma, and F0 transposition range) and melody complexity (i.e., MCI number) than those of term infants. In contrast, there were no significant differences in melodic features between VP and term infants. Previous studies of term infants have reported that the melody of spontaneous cries gradually exhibits increased variability (Prescott, 1975) and complexity (Wermke et al., 2002, 2007) across the first several months. These developmental changes are assumed to reflect the maturation of vocal control over melodic contours in spontaneous crying (Wermke and Mende, 2009). Additionally, because preterm infants begin vocalizing as early as 8 weeks before their projected due date (Caskey et al., 2011), the postnatal experiences of spontaneous crying at term in preterm infants may be greater than that of term infants. Therefore, the additional vocal experience of the MLP group in the extrauterine environment may facilitate vocal control over melodic contours during spontaneous crying. Actually, this observation may be in line with previous research reporting that healthy preterm infants, with age corrected for gestational age, began canonical babbling earlier than term infants (Eilers et al., 1993). On the other hand, it is possible that this additional experience might not contribute to the increased variability and complexity of cry melodies in the VP group due to their more complicated neurophysiological states (Peterson et al., 2000; Aarnoudse-Moens et al., 2009; Naoi et al., 2013; Shinya et al., 2016).
Moreover, the present study showed that a larger F0 variation within cry series (i.e., F0 transposition range) of spontaneous crying at term was significantly related to better language skills at 18 months. This finding is consistent with the notion of a potential continuity between the development of cry melody and language acquisition (Darwin, 1872; Lieberman, 1985; Wermke and Mende, 2009). Notably, the larger range was primarily associated with higher levels of expressive language skills at 18 months among developmental outcomes. F0 transposition range in infant crying reflects the ability to transpose melody across different frequency levels. Although the range is reported to increase gradually with postnatal age, there are large individual differences in F0 transposition range even during the neonatal period (Wermke and Mende, 2009). Thus, individual differences in F0 transposition range may reflect voluntary control over cry melody, which, in turn, contributes to later non-cry vocalizations such as babbling and expressive language (Lieberman, 1985). In practice, during language acquisition, the ability to transpose a melody across different frequency levels is assumed to be necessary for imitating simple melodies and speech sounds in one’s ambient language beginning in the neonatal period (Kuhl and Meltzoff, 1996; Mampe et al., 2009; Wermke and Mende, 2009; Gratier and Devouche, 2011; Wermke et al., 2016). Importantly, the effects of F0 transposition range at term on language development at 18 months are independent of confounding factors such as gestational age and IUGR (Hille et al., 2007; Aarnoudse-Moens et al., 2009). Considering that preterm infants, especially VP infants, have higher risks of later language-related problems (Foster-Cohen et al., 2007; Luu et al., 2009; Barre et al., 2011; Sansavini et al., 2011; van Noort-van der Spek et al., 2012), the F0 transposition range of spontaneous cries will be an additional measure that can predict the risks in preterm peers. Nevertheless, further studies are needed to replicate this preliminary observation of a relationship between the F0 transposition range of spontaneous cries and later language acquisition.
The present study did not reveal any significant associations between melody complexity at term and language outcomes at 18 months, which is inconsistent with previous reports showing that MCI rate during the second month is positively related to language outcomes at 2.5 years in term children (Wermke et al., 2007). The discrepancy might be due to the difference in the definitions of MCI. Wermke et al. (2007) used for a single melody arc definition an amplitude of at least three semitones, while the present study used only two semitones (Wermke et al., 2014). This different approach may increase the number of identifiable arcs and affect the results of the associations between MCI and language outcomes at 18 months. On the other hand, melody in spontaneous cries becomes increasingly more complex during the first several months of life (Wermke et al., 2002, 2007). Therefore, the melody complexity of spontaneous cries is assumed to be much lower at term age than at the second month. In fact, the mean MCI ratio at term was 0.29 in the present study while Wermke et al. (2007) found this ratio to be 0.52 during the second month. Thus, it is possible that melody complexity at term age has not sufficiently developed to reflect individual differences in voluntary vocal control that later contribute to language acquisition at 18 months of age. Future studies should follow-up the development of melody complexity of spontaneous cries in preterm peers after the term period and relate these findings to language development.
Another important finding is that the F0 transposition range of spontaneous crying at term was positively associated with cognitive development at 18 months, as indexed by the C-A score on the KSPD. One possibility is that individual differences in F0 transposition range at term may reflect the integrity of the developing CNS in a manner similar to that of spontaneous physical movements, such as general movements (GMs; Prechtl, 2001). Particularly in preterm infants, abnormal GMs at an early developmental stage (i.e., less variability and complexity) are a major predictor of cerebral palsy (Prechtl, 2001) and has been recently reported to relate to delayed cognitive development from infancy to school age (Butcher et al., 2009; Beccaria et al., 2012; Kanemaru et al., 2013). Less variable and frequent spontaneous movements are assumed to be due to reductions in the modulation of the generation of spontaneous movements from central pattern generators in the brainstem by the developing cerebral cortex; this can result in domain general developmental delays (Einspieler et al., 2016). Spontaneous crying is also generated from the reflexive central pattern generators in the periaqueductal gray and nucleus retroambiguus. These brainstem regions may be modulated by forebrain regions such as the amygdala and anterior cingulate cortex and lead to more variable patterns of melody in infant crying (Newman, 2007; Holstege and Subramanian, 2015). Thus, the limited F0 transposition range in spontaneous crying at term might reflect the reduced integrity of the developing CNS, which may be a partial risk factor of cognitive delay (Butcher et al., 2009; Einspieler et al., 2016).
The present study has several limitations that should be noted. First, the selection of participants for the present study might reflect a sample bias because the sample size was relatively small, especially for the term group. Future research is needed to replicate these findings using a larger sample that controls for confounding factors, such as socioeconomic status (Eilers et al., 1993; Fernald et al., 2012) and exposure to another language (Hoff et al., 2011), which may potentially affect developmental outcomes, including language development. Second, the present study did not investigate the melodic features of spontaneous cries in preterm infants from birth onward through development. To examine further the possibility that additional vocal experience may facilitate the development of cry melodies in healthy preterm peers, a longitudinal assessment of the developmental process from birth to term age will be required. Third, language development was evaluated only at 18 months of corrected age. In the present study, the difference in language development between the preterm and term groups at that timepoint was smaller than that observed in previous studies (Foster-Cohen et al., 2007; Sansavini et al., 2011). This may be because the present study included only preterm peers without severe complications and because the relationships between low birth weight and cognitive outcomes have been mitigated by recent medical advances (Goisis et al., 2017). However, researchers also need to follow-up with language outcomes after 18 months, considering that several recent studies have reported preterm children exhibit higher risks of language-related problems after infancy (Luu et al., 2009; Barre et al., 2011; van Noort-van der Spek et al., 2012).
The present study revealed that MLP infants exhibited greater F0 variation and complexity in neonatal spontaneous crying at term age compared to term infants, although the differences were not found between VP and term infants. Furthermore, F0 variation within cry series (i.e., F0 transposition range) was positively related to language development, particularly expressive language skills, at 18 months, which is consistent with previous observations of the potential link between cry melody and language (Darwin, 1872; Lieberman, 1985; Wermke and Mende, 2009). The present findings suggest that the additional postnatal vocal experience of MLP infants increased F0 variation and complexity of spontaneous cries at term. Additionally, the increased F0 variation within cry series may partially reflect developing voluntary vocal control, which contributes to expressive language in infancy. Future studies should examine whether assessments of the melodic features of crying represent a valuable addition to existing early indices and improve the ability to predict language outcomes in preterm peers.
YS, MK, FN, MI, and MM contributed to the conception and design of the work; contributed in revising the work for important intellectual content. YS, FN, and MI contributed in the data acquisition. YS, MK, and MM contributed in the analysis and interpretation of data for the work. YS and MI contributed in drafting the work. All authors approved the final version to be published, and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.
This research was supported by Grants-in-Aid for Scientific Research from the Japan Society for the Promotion of Science and the Ministry of Education, Culture, Sports, Science and Technology (24119005, 17H01016 to MM; 14J06302, 17H06595 to YS), Exploratory Research for Advanced Technology, Okanoya Emotional Information Project, and the Center of Innovation Program from the Japan Science and Technology Agency, JST to MM, and the Mayekawa Houonkai Foundation to MM (2016–2018).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We would like to thank all the children and parents for their participation in this study. We would also like to acknowledge M. Shibata, M. Inagawa, and the hospital staff for recruiting participants, N. Naoi, Y. Fuchino, S. Mizugaki, K. Hirai, and M. Yoshii for developmental assessment.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2017.02195/full#supplementary-material
Aarnoudse-Moens, C. S. H., Weisglas-Kuperus, N., van Goudoever, J. B., and Oosterlaan, J. (2009). Meta-analysis of neurobehavioral outcomes in very preterm and/or very low birth weight children. Pediatrics 124, 717–728. doi: 10.1542/peds.2008-2816
Barre, N., Morgan, A., Doyle, L. W., and Anderson, P. J. (2011). Language abilities in children who were very preterm and/or very low birth weight: a meta-analysis. J. Pediatr. 158, 766–774. doi: 10.1016/j.jpeds.2010.10.032
Beccaria, E., Martino, M., Briatore, E., Podestà, B., Pomero, G., Micciolo, R., et al. (2012). Poor repertoire general movements predict some aspects of development outcome at 2years in very preterm infants. Early Hum. Dev. 88, 393–396. doi: 10.1016/j.earlhumdev.2011.10.002
Bergvall, N., Iliadou, A., Johansson, S., Tuvemo, T., and Cnattingius, S. (2006). Risks for low intellectual performance related to being born small for gestational age are modified by gestational age. Pediatrics 117, e460–e467. doi: 10.1542/peds.2005-0737
Boersma, P., and Weenink, D. (2016). Praat: do0069ng Phonetics by Computer (v. 6.0.19). [Computer Program.]. Available at: http://www.praat.org
Butcher, P. R., van Braeckel, K., Bouma, A., Einspieler, C., Stremmelaar, E. F., and Bos, A. F. (2009). The quality of preterm infants’ spontaneous movements: an early indicator of intelligence and behaviour at school age. J. Child Psychol. Psychiatry 50, 920–930. doi: 10.1111/j.1469-7610.2009.02066.x
Cacace, A. T., Robb, M. P., Saxman, J. H., Risemberg, H., and Koltai, P. (1995). Acoustic features of normal-hearing pre-term infant cry. Int. J. Pediatr. Otorhinolaryngol. 33, 213–224. doi: 10.1016/0165-5876(95)01211-7
De Schuymer, L., De Groote, I., Beyers, W., Striano, T., and Roeyers, H. (2011a). Preverbal skills as mediators for language outcome in preterm and full term children. Early Hum. Dev. 87, 265–272. doi: 10.1016/j.earlhumdev.2011.01.029
De Schuymer, L., De Groote, I., Striano, T., Stahl, D., and Roeyers, H. (2011b). Dyadic and triadic skills in preterm and full term infants: a longitudinal study in the first year. Infant Behav. Dev. 34, 179–188. doi: 10.1016/j.infbeh.2010.12.007
Eilers, R. E., Oller, D. K., Levine, S., Basinger, D., Lynch, M. P., and Urbano, R. (1993). The role of prematurity and socioeconomic status in the onset of canonical babbling in infants. Infant Behav. Dev. 16, 297–315. doi: 10.1016/0163-6383(93)80037-9
Einspieler, C., Bos, A. F., Libertus, M. E., and Marschik, P. B. (2016). The general movement assessment helps us to identify preterm infants at risk for cognitive dysfunction. Front. Psychol. 7:406. doi: 10.3389/fpsyg.2016.00406
Foster-Cohen, S., Edgin, J. O., Champion, P. R., and Woodward, L. J. (2007). Early delayed language development in very preterm infants: evidence from the MacArthur-Bates CDI. J. Child Lang. 34, 655–675. doi: 10.1017/S0305000907008070
Garner, P. W., Landry, S. H., and Richardson, M. A. (1991). The development of joint attention skills in very-low-birth-weight infants across the first 2 years. Infant Behav. Dev. 14, 489–495. doi: 10.1016/0163-6383(91)90035-Q
Goisis, A., Özcan, B., and Myrskylä, M. (2017). Decline in the negative association between low birth weight and cognitive ability. Proc. Natl. Acad. Sci. U.S.A. 114, 84–88. doi: 10.1073/pnas.1605544114
Hille, E. T. M., Weisglas-Kuperus, N., van Goudoever, J. B., Jacobusse, G. W., Ens-Dokkum, M. H., de Groot, L., et al. (2007). Functional outcomes and participation in young adulthood for very preterm and very low birth weight infants: the Dutch project on preterm and small for gestational age infants at 19 years of age. Pediatrics 120:e587–95.
Imafuku, M., Kawai, M., Niwa, F., Shinya, Y., Inagawa, M., and Myowa-Yamakoshi, M. (2017). Preference for dynamic human images and gaze-following abilities in preterm infants at 6 and 12 months of age: an eye-tracking study. Infancy 22, 223–239. doi: 10.1111/infa.12144
Johnston, C. C., Stevens, B., Craig, K. D., and Grunau, R. V. (1993). Developmental changes in pain expression in premature, full-term, two-and four-month-old infants. Pain 52, 201–208. doi: 10.1016/0304-3959(93)90132-9
Kanemaru, N., Watanabe, H., Kihara, H., Nakano, H., Takaya, R., Nakamura, T., et al. (2013). Specific characteristics of spontaneous movements in preterm infants at term age are associated with developmental delays at age 3 years. Dev. Med. Child Neurol. 55, 713–721. doi: 10.1111/dmcn.12156
Kisilevsky, B. S., Hains, S. M. J., Brown, C. A., Lee, C. T., Cowperthwaite, B., Stutzman, S. S., et al. (2009). Fetal sensitivity to properties of maternal speech and language. Infant Behav. Dev. 32, 59–71. doi: 10.1016/j.infbeh.2008.10.002
Kono, Y., Yonemoto, N., Kusuda, S., Hirano, S., Iwata, O., Tanaka, K., et al. (2016). Developmental assessment of VLBW infants at 18 months of age: a comparison study between KSPD and Bayley III. Brain Dev. 38, 377–385. doi: 10.1016/j.braindev.2015.10.010
Kujala, A., Huotilainen, M., Hotakainen, M., Lennes, M., Parkkonen, L., Fellman, V., et al. (2004). Speech-sound discrimination in neonates as measured with MEG. Neuroreport 15, 2089–2092. doi: 10.1097/00001756-200409150-00018
Lieberman, P. (1985). “The physiology of cry and speech in relation to linguistic behavior,” in Infant Crying: Theoretical and Research Perspectives, eds B. M. Lester and C. F. Z. Boukydis (New York, NY: Plenum Press), 29–57.
Luu, T. M., Vohr, B. R., Schneider, K. C., Katz, K. H., Tucker, R., Allan, W. C., et al. (2009). Trajectories of receptive language development from 3 to 12 years of age for very preterm children. Pediatrics 124, 333–341. doi: 10.1542/peds.2008-2587
Naoi, N., Fuchino, Y., Shibata, M., Niwa, F., Kawai, M., Konishi, Y., et al. (2013). Decreased right temporal activation and increased interhemispheric connectivity in response to speech in preterm infants at term-equivalent age. Front. Psychol. 4:94. doi: 10.3389/fpsyg.2013.00094
Partanen, E., Kujala, T., Näätänen, R., Liitola, A., Sambeth, A., and Huotilainen, M. (2013). Learning-induced neural plasticity of speech processing before birth. Proc. Natl. Acad. Sci. U.S.A. 110, 15145–15150. doi: 10.1073/pnas.1302159110
Peterson, B. S., Vohr, B., Staib, L. H., Cannistraci, C. J., Dolberg, A., Schneider, K. C., et al. (2000). Regional brain volume abnormalities and long-term cognitive outcome in preterm infants. JAMA 284, 1939–1947. doi: 10.1001/jama.284.15.1939
Prechtl, H. F. (2001). General movement assessment as a method of developmental neurology: new paradigms and their consequences. The 1999 Ronnie MacKeith lecture. Dev. Med. Child Neurol. 43, 836–842. doi: 10.1017/S0012162201001529
Sansavini, A., Guarini, A., Savini, S., Broccoli, S., Justice, L., Alessandroni, R., et al. (2011). Longitudinal trajectories of gestural and linguistic abilities in very preterm infants in the second year of life. Neuropsychology 49, 3677–3688. doi: 10.1016/j.neuropsychologia.2011.09.023
Shinya, Y., Kawai, M., Niwa, F., and Myowa-Yamakoshi, M. (2014). Preterm birth is associated with an increased fundamental frequency of spontaneous crying in human infants at term-equivalent age. Biol. Lett. 10:20140350. doi: 10.1098/rsbl.2014.0350
Shinya, Y., Kawai, M., Niwa, F., and Myowa-Yamakoshi, M. (2016). Associations between respiratory arrhythmia and fundamental frequency of spontaneous crying in preterm and term infants at term-equivalent age. Dev. Psychobiol. 58, 724–733. doi: 10.1002/dev.21412
Smith, L., and Ulvund, S. E. (2003). The role of joint attention in later development among preterm children: linkages between early and middle childhood. Soc. Dev. 12, 222–234. doi: 10.1111/1467-9507.00230
Stevens, B., McGrath, P., Gibbins, S., Beyene, J., Breau, L., Camfield, C., et al. (2007). Determining behavioural and physiological responses to pain in infants at risk for neurological impairment. Pain 127, 94–102. doi: 10.1016/j.pain.2006.08.012
van Noort-van der Spek, I. L., Franken, M. C., and Weisglas-Kuperus, N. (2012). Language functions in preterm-born children: a systematic review and meta-analysis. Pediatrics 129, 745–754. doi: 10.1542/peds.2011-1728
Wasz-Höckert, O., Michelsson, K., and Lind, J. (1985). “Twenty-five years of Scandinavian cry research,” in Infant Crying: Theoretical and Research Perspectives, eds B. M. Lester and C. F. Z. Boukydis (New York, NY: Plenum Press), 83–104.
Webb, A. R., Heller, H. T., Benson, C. B., and Lahav, A. (2015). Mother’s voice and heartbeat sounds elicit auditory plasticity in the human brain before full gestation. Proc. Natl. Acad. Sci. U.S.A. 112, 3152–3157. doi: 10.1073/pnas.1414924112
Wermke, K., Hain, J., Oehler, K., Wermke, P., and Hesse, V. (2014). Sex hormone influence on human infants’ sound characteristics: melody in spontaneous crying. Biol. Lett. 10:20140095. doi: 10.1371/journal.pone.0046610
Wermke, K., Leising, D., and Stellzig Eisenhauer, A. (2007). Relation of melody complexity in infants’ cries to language outcome in the second year of life: a longitudinal study. Clin. Linguist. Phon. 21, 961–973. doi: 10.1080/02699200701659243
Keywords: crying, melody, preterm birth, low-birth-weight, language, fundamental frequency, vocalization
Citation: Shinya Y, Kawai M, Niwa F, Imafuku M and Myowa M (2017) Fundamental Frequency Variation of Neonatal Spontaneous Crying Predicts Language Acquisition in Preterm and Term Infants. Front. Psychol. 8:2195. doi: 10.3389/fpsyg.2017.02195
Received: 29 June 2017; Accepted: 04 December 2017;
Published: 22 December 2017.
Edited by:Erika Nurmsoo, University of Kent, United Kingdom
Reviewed by:Alessandra Sansavini, Università di Bologna, Italy
Kathleen Wermke, Universitätsklinikum Würzburg, Germany
Copyright © 2017 Shinya, Kawai, Niwa, Imafuku and Myowa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yuta Shinya, firstname.lastname@example.org