Skip to main content

ORIGINAL RESEARCH article

Front. Psychol., 09 March 2017
Sec. Psychology of Language
This article is part of the Research Topic Lexical Tone Perception in Infants and Young Children: Empirical studies and theoretical perspectives View all 23 articles

Pitch Perception in the First Year of Life, a Comparison of Lexical Tones and Musical Pitch

  • 1Utrecht Institute of Linguistics, Utrecht University, Utrecht, Netherlands
  • 2Communication Science School, Beijing Language and Culture University, Beijing, China
  • 3The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Sydney, NSW, Australia

Pitch variation is pervasive in speech, regardless of the language to which infants are exposed. Lexical tone is influenced by general sensitivity to pitch. We examined whether the development in lexical tone perception may develop in parallel with perception of pitch in other cognitive domains namely music. Using a visual fixation paradigm, 100 and one 4- and 12-month-old Dutch infants were tested on their discrimination of Chinese rising and dipping lexical tones as well as comparable three-note musical pitch contours. The 4-month-old infants failed to show a discrimination effect in either condition, whereas the 12-month-old infants succeeded in both conditions. These results suggest that lexical tone perception may reflect and relate to general pitch perception abilities, which may serve as a basis for developing more complex language and musical skills.

Introduction

The perceptual reorganization hypothesis assumes that acquiring native phonology involves learning the specific phonemic contrasts present in the to-be-learned language, whereas sensitivity to non-native contrasts gradually decreases. Such perceptual tuning occurs in the second half of the 1st year (Werker and Tees, 1984; Kuhl et al., 1992). Yet previous studies disagree on how the perception of lexical tones, or pitch contours realized on single syllables, changes in the 1st year of life. It is widely agreed that infants are highly sensitive to speech prosody (e.g., Mehler and Christophe, 1995; Nazzi et al., 1998; Soderstrom et al., 2011; Frota et al., 2014). With regard to lexical tones, several studies have found supportive evidence for such a decline in discrimination among non-tone language learning infants between 4 and 9 months (Harrison, 2000; Mattock and Burnham, 2006; Mattock et al., 2008; Yeung et al., 2013). Other studies, however, have found that sensitivity to lexical tones is maintained beyond the presumed perceptual reorganization window. Liu and Kager (2014) found that from 4 months onward, up until 17–18 months, Dutch infants were able to discriminate Chinese high-level and falling tone. When the acoustical distance between the two tones was reduced through manipulation, no discrimination was found between 9 and 15 months, yet the 5- and 17–18-month-olds succeeded at discrimination. English learning 14-month-old infants are able to learn words that are solely distinguished by lexical tones, and by 19 months, they are still able to discriminate Chinese rising and falling tones (Quam and Swingley, 2010; Hay et al., 2015). In addition, although it is a fact that non-tone language speakers find lexical tones notoriously difficult (Kiriloff, 1969; Bluhme and Burr, 1971; Shen, 1989), they can be fairly accurate at discriminating them (Burnham et al., 1996, 2015; So and Best, 2010; Chen et al., 2015). Non-tone language listeners’ acoustical sensitivity to lexical tones cannot simply reflect the effect of “nativeness,” but possibly sensitivity to pitch in language in general. Regardless of the salience of lexical tones, native tone language learning infants do not fully acquire lexical tones until childhood, and global intonation contours interfere with the recognition of lexical tones (Singh and Chee, 2016; Singh and Fu, 2016). In addition, although lexical tones are phonemic in Chinese, when learning novel words, 3-year-old Chinese children are more tolerant to lexical tone than to vowel mispronunciations (Ma et al., 2017). In sum, lexical tone perception seems flexible and exhibits a complex course of development.

It has been long debated whether language ability reflects domain specific mechanisms or whether it is the product of domain general development (e.g., Piaget, 1926; Fodor, 1983; Chomsky, 1986; Pinker, 1994; Tomasello, 2003). Language and music, two types of uniquely human sophisticated functions, are often compared to understand this question. Language and music are parallel in many aspects (Trehub, 2003). For both, pitch plays a fundamental role, and pitch contour (i.e., the shape of pitch patterns) forms a salient cue in perception (Yip, 2002; Trehub and Hannon, 2006). In the language domain, cross-linguistically, at phrase and sentence level intonation is largely encoded by pitch contour. Questions are commonly realized with a rising pitch contour whereas statements often carry a falling contour (e.g., Gussenhoven, 2004). Emphasizing certain aspects of information in many language or “focus” is often realized by raising pitch of the emphasized part and compressing pitch of the following part (Xu, 2011). In tone languages, lexical tones are used in a phonemic way to distinguish meaning at the lexical level (Yip, 2002). In music, pitch relations (rather than specific pitch levels where these relations are exhibited) are central for music perception and also play a role in memory. For example, for the vast majority of listeners, the same song played at a different pitch level is readily recognizable (e.g., Trehub and Hannon, 2006; Trainor and Hannon, 2013). In addition, adults are more sensitive to differences of “global contour” (i.e., the pattern of ups and downs) of melodies than to “intervals” (i.e., exact pitch distance between notes; e.g., Cuddy and Cohen, 1976; Dowling, 1978; Bartlett and Dowling, 1980; Schiavetto et al., 1999).

Although some pitch processing skills have been argued to be music specific (Hauser and McDermott, 2003; Peretz and Coltheart, 2003; Peretz et al., 2003), many studies have found positive correlations between pitch perception in both language and music domains, which suggests domain general cognitive mechanisms in pitch processing (e.g., Wong and Perrachione, 2007; Wong et al., 2012; Bidelman et al., 2013, among many others). Speaking a tone language natively modulates neural response to non-speech pitch (e.g., Chandrasekaran et al., 2007; Bidelman et al., 2011).

For music processing, the encoding of pitch contour is visible from very early on. Infants as young as 2 months are able to discriminate familiar and novel songs (Plantinga and Trainor, 2009), and by 6 months (and like adults), infants discriminate between songs by attending to the pitch contour rather than to specific pitch levels that they are played (Trainor et al., 2004; Plantinga and Trainor, 2005). Eight- to 11-month-old infants are sensitive to both contour-violating and contour-non-violating note changes, yet contour violation has been found to be perceptually more salient for infants than contour-sharing interval differences (Trehub et al., 1984, 1987). Moreover, infants are able to extract abstract pitch contour from the absolute pitch level at which it is played (Cohen et al., 1987; Trainor and Trehub, 1992). It should be noted that although infants discriminate songs from very early on (Trainor et al., 2004; Plantinga and Trainor, 2005, 2009), the songs not only differed in contour but also in rhythmic and temporal information. When using manipulated stimuli exhibiting contour differences alone, discrimination has only been attested on samples of infants older than 6 months (Trehub et al., 1984, 1987; Trainor and Trehub, 1992). It remains unknown whether younger infants are also sensitive to contour violation.

Although shared processing of lexical tone and music processing has been widely investigated among adults, not much is known regarding whether pitch perception development is related in these two domains in infancy. Mattock and Burnham (2006) tested both tone (Chinese and Cantonese) and non-tone (English) language learning infants on their discrimination of Thai tones as well as violin analogs of the tones. For the lexical tones, a decline of sensitivity was observed between 6 and 9 months among the English infants, but not among the Chinese infants. For the violin stimuli, however, both groups succeeded in the discrimination at both ages. By 10 months, native Japanese infants’ brain responses to pitch accents realized on words and to pure tones whose fundamental frequency was extracted from these words showed different lateralization patterns (Sato et al., 2010). These findings suggest that pitch perception develops in a domain specific manner. However, Mattock and Burnham (2006) and Sato et al. (2010) tested infants with non-speech rather than musical stimuli, as the analogs of lexical tones did not have a musical structure. The non-speech stimuli have no real life function, yet pitch contour is essential for perception and appreciation of music. In addition, these studies assume that lexical tones (or pitch accents) are phonological for infants, although non-tone language listeners may simply perceive them as musical (Chen et al., 2016).

In the current study, we investigate whether development observed in lexical tone perception may reflect general sensitivity to pitch, in the current study. We tested Dutch 4- and 12-month-old infants on their discrimination of lexical tones and comparable three-note musical melodies, both differing in pitch contour. A non-native pitch contrast was chosen so that the developmental change cannot be attributed to learning the specific tonal exemplars, and the music stimuli were manipulated so as to share similar properties to the lexical tones. We chose 4- and 12-month-olds since these age groups precede and follow perceptual reorganization, which allows us to observe whether development in lexical tone perception is language specific. As Dutch infants have shown high sensitivity to the contrast of Chinese high-level and high-falling tone (Liu and Kager, 2014) and to prevent a ceiling effect, we used two perceptually similar lexical tones (Hume and Johnson, 2001; Ma et al., 2017), namely the Chinese rising and dipping tones as the stimuli. Since, we focus on acoustic perception that underlies music and language processing, the infants were tested on their discrimination of single tokens of lexical tones and musical melodies, which prevented possible interference from normalization (Singh et al., 2004; Singh, 2008; Shi, 2010; Chen and Kager, 2015). If pitch contour perception develops in a domain general way, then we would expect a similar trajectory in both domains, possibly age-related enhancement. On the other hand, if development occurs in a domain specific manner, then based on the perceptual reorganization hypothesis (Mattock and Burnham, 2006; Mattock et al., 2008; Yeung et al., 2013) we would expect the 12-month-olds to be less sensitive than the 4-month-olds to the lexical tones, as these are linguistically irrelevant for the Dutch infants. For the musical stimuli, and given the high sensitivity to musical pitch contour among adults, a maintained or enhanced discrimination of the musical melodies should be observed.

Materials and Methods

Participants

One hundred and one infants were included in the analysis. All the infants were healthy full-term monolingual Dutch infants. There were 54 4-month-old infants (age range 4:01–4:29), 28 (18 boys, 10 girls) in the lexical tone condition and 26 (13 boys, 13 girls) in the music condition. There were 47 12-month-old (age range 12:01–12:29) infants, 23 in the lexical tone condition (10 boys, 13 girls), and 24 in the music condition (16 boys, 8 girls). Another 17 4-month-old infants were tested but excluded from analysis due to crying (N = 2), fussiness (N = 4), equipment failure (N = 1), experimenter error (N = 1), and failure to meet habituation criterion (N = 9, see below). Another 27 12-month-old infants were excluded from analysis due to crying (N = 7), fussiness (N = 4), equipment failure (N = 3), experimenter’s error (N = 2), parental interferences (N = 2), and failure to meet habituation criteria (N = 9).

As the experiment was not invasive and was conducted in a natural environment, Utrecht Institute of Linguistics did not require ethical approval at the time that the experiment was conducted. The experiments were conducted in accordance to guidelines of Utrecht Institute of Linguistics and Helsinki Declaration. Written consents from caregivers were obtained for all participating infants.

Stimuli

For the lexical tones, in order to prevent a ceiling effect (Liu and Kager, 2014), Mandarin Chinese rising tone (T2) and dipping tone (T3) were used as stimuli, as they have been found to be relatively difficult to discriminate (Hume and Johnson, 2001; Chen et al., 2015). We used /ma/ as tone-bearing syllable, as an initial nasal consonant ensured continuous pitch. A female Mandarin speaker recorded the two syllables. Then the pitch contours of naturally produced /ma2/ and /ma3/ were extracted by the software PRAAT (Boersma and Weenink, 2009). After normalizing the duration of these two contours (450 ms), the pitch contours of the T2 after time normalization were re-synthesized onto the original T3 syllable using the PSOLA method (Moulines and Laroche, 1995). Time-normalization ruled out the possibility of interference from duration as a potential confounding factor in the experiment. Five native Mandarin speakers listened to the stimuli and were all in agreement that all the stimuli sounded like natural, normal speech. As young infants have shown difficulties in normalizing variable tokens (Singh et al., 2004; Singh, 2008; Shi, 2010), we only used one single token of each tone to prevent improvement in normalization from being a confounding factor for any development observed. To ensure that the comparability between tasks, we did not transpose the melodies in the music condition.

For the musical melodies, 16th notes of D4, E4, F4, and C4 with a piano timbre were synthesized using a Nyquist script1,2. The notes were generated on the C4 (middle C) scale, along which the fundamental frequency of A4 equals 440 Hz, with the default duration (250 ms) of 16th notes in Nyquist. After synthesizing the four single notes separately, D, E, and F were concatenated to obtain a three-note rising melody— D-E-F, and D, C, and F were concatenated to obtain another three-note dipping melody— D-C-F. These two melodies were normalized to 450 ms and were then used as stimuli in this experiment. All the notes belonged to C major scale, which prevented possible discrimination based on key membership (Cohen et al., 1987). The two melodies had identical initial and final pitches, and the middle note determined global contour. This assured that the infants would not be able to discriminate the melodies by only attending to the onset or the offset. The difference between the two musical melodies was expected to be salient, as the middle note changed the pitch “direction” (e.g., up and down) rather than the “degree” of rising or falling (Trehub et al., 1984). The musical melodies and lexical tones had comparable contours, namely one rising and one dipping. Figure 1 plots the pitch contours of the speech stimuli.

FIGURE 1
www.frontiersin.org

FIGURE 1. Pitch contours of the rising and dipping tones used in the speech condition (A) and those of the musical melodies (B). Note that the first and last notes are the same in the two melodies.

Procedure

A visual habituation paradigm adapted from Liu and Kager (2014) was used, which has been found to be suitable for testing infants as young as 4 months. During the experiment, infants sat on their parent’s lap in the test cabin, and a 14-inch screen at the front displayed the visual stimuli, an infant-friendly colorful picture. The visual stimuli were contingent with the auditory stimuli, and the infants’ looking time to the visual stimuli was used as the indicator of their attention to the auditory stimuli. The auditory stimuli were presented at a comfortable volume through a frontal speaker. The parent listened to background music through headphones to prevent possible interaction with the infants. A hidden camera mounted above the screen recorded the infants’ looking behavior. The experimenter observed the video of the infants live and recorded whether the infant looked at the visual stimuli. For each trial, once the infant looked at the screen, the experimenter pressed a “looking” button on a button box to start the auditory stimuli. Whenever the infant looked away, the experimenter pressed another “non-looking” button on the same button box, and if the infant looked back to the screen, the experimenter pressed the “looking” button again. A trial ended if the infant looked away for more than 2 s, and an attention getter immediately appeared on the screen. Once the infant looked back at the screen, the experimenter started the next trial in the same way described above. The looking time of each trial as well as each look was automatically calculated on the experimenter’s computer.

The experiment consisted of a habituation and a test phase. Total looking time of the first three trials in the habituation phase was used as a baseline for measuring habituation. Starting from the fourth trial, the total looking time of each three consecutive habituation trials was calculated, and once this looking time was less than 65% of the total looking time of the first three habituation trials, the habituation criterion was met, and the test phase started automatically. The habituation phase had a minimum of six trials and a maximum of 12 trials. Those infants who failed to meet the habituation criterion within 12 trials were excluded from further analysis. The stimuli used for habituation were counter-balanced among the participants at each age for each condition. In the test phase, the infants were presented with one “old” trial, which was the same sound that they had heard in the habituation phase, followed by another “novel” trial, which was the new sound that they had not previously heard. In the test phase, if the infants were able to detect the difference between the two tones, then upon hearing the novel trial, their listening time should be recovered due to hearing something new. In both phases, a trial could have a maximum of 30 repetitions of the stimuli, with an inter-stimulus interval of 1 s. The same visual stimuli were used for the habituation and test. We did not counter-balance the order of test trials, and the current procedure was expected to highlight the discrimination response if there was any.

Results

Table 1 lists the raw looking time in the habituation phase and test phase in both conditions by both age groups. Before the analysis of test trials, infants’ response in the habituation phase was examined. A univariate ANOVA, taking condition and age as independent variables found a significant main effect of age, F(3,97) = 6.48, p < 0.05 (partial η2 = 0.063), where 4-month-olds needed more time to reach the habituation criterion. Condition, on the other hand, showed no significant effect, F(3,97) = 0.89, n.s.. No significant interaction between age and condition was found, F(3,97) = 0.002, n.s.. These findings suggest comparable habituation patterns for the music and the lexical tone condition. Next, the raw looking time of the infants was log transformed (base 10) to correct for skew (Gomez and Gerken, 1999; Gao et al., 2011). The log transformed looking times (logLT) of both age groups to both trial types fit a normal distribution. A repeated measures ANOVA was carried out with the logLT, where trial type (old/novel) was the within-subject factor, and condition (music/speech) and age (4/12-month-old) were between-subject factors. Trial type as well as condition showed a significant main effect Ftrialtype(1,97) = 5.20, p < 0.05 (partial η2 = 0.051); Fdomain(1,97) = 4.84, p < 0.05 (partial η2 = 0.047). A main effect of age was not significant, Fage(1,97) = 1.58, n.s.. A significant interaction was found between age and trial type F(1,97) = 4.50, p < 0.05 (partial η2 = 0.044). Post hoc analyses found that, after merging domains only the 12-month-old infants showed a significantly longer logLT to the novel trial, t(46) = -2.88, p < 0.05. No other interaction was found to be significant. Figure 2 depicts the logLT of the infants in each condition. As can be seen, for the 4-month-olds, no increase in listening time was observed for the novel trial in either condition. Such an increase, however, was found for the 12-month-old group in both conditions. The main effect of trial type was mainly driven by the 12-month-olds. In addition, both age groups had longer looking times in the lexical tone condition.

TABLE 1
www.frontiersin.org

TABLE 1. Mean habituation time (s) and mean number of trials needed for habituation; raw looking time (s) to old and novel trial, and mean number of tokens in old and novel trial, separated by age group and condition.

FIGURE 2
www.frontiersin.org

FIGURE 2. LogLT of the old and novel trial in the lexical tone and music condition as a function of infant age.

Discussion

In the current study, we investigated whether development in lexical tone perception may develop in parallel with perception of pitch in other cognitive domains namely music. The 4-month-olds did not show a discrimination effect in either the lexical tone or the music condition. For the lexical tones, at the age of 4 months, which has been assumed to precede the perceptual reorganization of lexical tones (Mattock and Burnham, 2006; Mattock et al., 2008; Yeung et al., 2013), Dutch infants failed to show a discrimination effect. Importantly, without inter-token variation, presumably the infants did not need to represent the lexical tones as phonological categories, but only needed to discriminate the lexical tones acoustically. The lack of a discrimination effect suggests that the 4-months-old infants did not perceive the acoustic difference between the two lexical tones. Similarly, without transpositions, the infants did not need to equalize the pitch contours played at different pitch levels before they could detect the contour violation, yet no discrimination was found. It is likely that the skills that adult listeners readily make use of when processing music are not fully mature at the beginning of life (Dowling, 1978; Schiavetto et al., 1999). The lack of discrimination effect in both conditions suggests that at 4 months, the infants are not proficient at processing the acoustic attributes that are exploited by linguistic and musical structures.

By 12 months, a parallel enhancement was observed in both the music and the language conditions. Importantly, what we show in the current study is that language input may not be the only factor driving perceptual development, and the perceptual behavior elicited by linguistic stimuli may reflect a general auditory rather than language specific development. As the infants were not exposed to lexical tones in their ambient input, the improvement cannot be explained by learning the lexical tones per se, but must reflect a general ability in dealing with pitch in speech. The similar developmental trajectory in both domains suggests that improved auditory pitch acuity may form a common basis for developing cognitively more advanced skills in language and music. The enhanced pitch perception may correlate with auditory maturation. Although frequency tuning is mature at birth at the cochlea level (Abdala et al., 1996), frequency resolution becomes adult-like between 3 and 6 months (Spetner and Olsho, 1990). Auditory brainstem also matures within the first 6 months after birth, and the maturation of auditory cortex continues to childhood (see Moore and Linthicum, 2007 for a review). At this moment, it is hard to infer whether the processing of musical and speech pitch recruited the same neural resources within the sample, yet basic auditory abilities seem to develop in a domain-general fashion. The physiological basis for successful discrimination of pitch realized on ecologically valid and spectrally complex sounds needs further investigation. It would be interesting for further study to investigate how such improved perception contributes to higher level processing such as phonological categorization or representation of musical pitch contours across pitch levels and musical instruments, and whether these abilities also show a comparable developmental trajectory in language and music.

So far, the perception of non-native lexical tones has been mostly studied in infants between 6 and 9 months (Harrison, 2000; Mattock and Burnham, 2006; Mattock et al., 2008; Yeung et al., 2013), and lexical tones are considered to be non-native phonological contrasts for infants learning a non-tone language. Pitch variation, however, is a language universal. The need to distinguish and understand intonation may help infants improve their sensitivity to pitch in general, which is reflected in their discrimination of lexical tones. It is possible that the 12-month-old Dutch infants assimilated T2 to a salient pitch contour in Dutch question rise. Non-tone language adults have been found to maintain a high psycho-acoustically based perceptual sensitivity to non-native lexical tones (Burnham et al., 1996, 2015; So and Best, 2010; Chen et al., 2015). Non-native infants’ sensitivity to lexical tones can remain after the assumed perceptual organization window (Liu and Kager, 2014; Chen and Kager, 2015; Hay et al., 2015). In the current study, we used a perceptually similar contrast than those used in Liu and Kager (2014; Hume and Johnson, 2001), and a progression from 4 to 12 months was observed. A growing body of evidence shows that the perception of speech sounds does not follow a single developmental trajectory (Narayan et al., 2010; Liu and Kager, 2014; Mazuka et al., 2014; Tsuji and Cristia, 2014; Tyler et al., 2014), and infants do not completely lose sensitivity to non-native contrasts. Our results, together with these other studies, lead to the question of what underlies perceptual attunement. It is possible that when infants grow older, they become less capable of perceiving non-native contrasts phonologically, but at the same time, psycho-acoustical perception may improve. Yet whether a better auditory perception can be found in general for speech sounds after 9 months, or whether such improvement is restricted to certain types of speech sounds, such as vowels (Mazuka et al., 2014) and pitch, needs further investigation. Perceptual narrowing is well motivated given the need to efficiently process environmentally relevant distinctions (Scott et al., 2007) and by observations that adults cannot learn a language as easily as infants. The inability to perceive non-native contrast has been claimed to be one of the hindrances to proficient learning in adults. Yet more efforts should be made to understand what exactly complicates non-native language perception and when exactly we lose the ease to perceive non-native contrasts.

In the music domain, sensitivity to contour differences has been claimed to be visible from very early on (Plantinga and Trainor, 2009; Stefanics et al., 2009). However, Plantinga and Trainor (2009) tested 2-month-old infants with songs, and such discrimination only called for coarse representation of the melodies, as the songs differed from one another on multiple dimensions, including rhythm and tempo. Our task, on the other hand, tested the detection of contour violation with manipulated stimuli, and the 4-month-olds failed. Hence, it is possible that young infants are able to coarsely represent pitch contours, yet their accurate perception of pitch details is still under-developed. In our task, the middle note violated the contour, and the edge notes were not informative. Several studies have proposed an “edge benefit” in rule learning, namely that the edge serves as the anchoring position, and items in a stream are memorized relative to the edge item (Hitch, 1996; Henson, 1998; Endress et al., 2005). It may be the case that young infants have difficulties perceiving pitch change at a medial position, which may hinder them in noticing the change of contour efficiently. It would be interesting for future studies to test whether young infants could more easily detect a contour violation occurring at an edge position.

Finally, it should be acknowledged that our musical stimuli were generated to match the lexical tones. The constituent notes had a slightly shorter duration compared to previous studies (e.g., Trainor and Trehub, 1992). It might be the case that for the younger group, the short duration hindered the infants from sufficient representation of each individual note, where the violation of contour was realized. When presented with the same stimuli, the 12-month-olds did show a clear discrimination effect. This suggests that the better contour violation perception at 12 months may be due to a higher temporal resolution in auditory perception (Morrongiello et al., 1984; Werner et al., 1992). Nevertheless, our musical stimuli were ecologically valid, as a 16th note has a duration of 125 ms when the tempo is 120 beats-per-minute. In addition, our stimuli were highly representative of pitch in speech and pitch in music: the musical ones were composed of discrete notes without segmental information, whereas the lexical tones had continuous pitch contours and were realized on syllables. Therefore, the distinction between music and speech stimuli was still maintained, and it is convincing that infants show a general enhancement in auditory pitch perception in the 1st year of life.

Conclusion

In the current study, we tested Dutch 4- and 12-month-old infants on their discrimination of pitch contours realized in speech, specifically, the Chinese rising and dipping tones, as well as musical stimuli exhibiting analogous pitch contours. We found that the 4-month-olds failed to show discrimination in either condition, whereas the older group succeeded in both conditions. These findings suggest that pitch perception develops in a domain-general fashion in early infancy, and development in speech perception may reside in more general auditory enhancement, and may not be a language specific development.

Author Contributions

AC contributed to the design of the work, acquisition and analysis of the data and drafting the work. CS contributed to the interpretation of the data, drafting and revising the work. RK contributed to the design of the work, interpretation of the data, drafting and revising the work.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

  1. ^ http://audacity.sourceforge.net/help/nyquist
  2. ^ http://www.cs.cmu.edu/~music/music.software.html

References

Abdala, C., Sininger, Y. S., Ekelid, M., and Zeng, F. (1996). Distortion product otoacoustic emission suppression tuning curves in human adults and neonates. Hear. Res. 98, 38–53. doi: 10.1016/0378-5955(96)00056-1

CrossRef Full Text | Google Scholar

Bartlett, J., and Dowling, J. (1980). Recognition of transposed melodies: a key-distance effect in developmental perspective. J. Exp. Psychol. Hum. Percept. Perform. 6, 501–515.

PubMed Abstract | Google Scholar

Bidelman, G., Gandour, J., and Krishnan, A. (2011). Cross-domain effects of music and language experience on the representation of pitch in the human auditory brainstem. J. Cogn. Neurosci. 23, 425–434. doi: 10.1162/jocn.2009.21362

PubMed Abstract | CrossRef Full Text | Google Scholar

Bidelman, G., Hutka, S., and Moreno, S. (2013). Tone language speakers and musicians share enhanced perceptual and cognitive abilities for musical pitch: evidence for bidirectionality between the domains of language and music. PLoS ONE 8:e60676. doi: 10.1371/journal.pone.0060676

PubMed Abstract | CrossRef Full Text | Google Scholar

Bluhme, H., and Burr, R. (1971). An audio-visual display of pitch for teaching Chinese tones. Stud. Linguist. 22, 51–57.

Google Scholar

Boersma, P., and Weenink, D. (2009). Praat: Doing Phonetics by Computer (Version 5.1.05) [computer Program].

Google Scholar

Burnham, D., Francis, E., Webster, D., Luksaneeyanawin, S., Attapaiboon, C., Lacerda, F., et al. (1996). “Perception of lexical tone across languages: evidence for a linguistic mode of processing,” in Proceedings of ICSLP of the Paper Presented at the Spoken Language, 1996, Vol. 96, 2514–2517.

Google Scholar

Burnham, D., Kasisopa, B., Reid, A., Lukasaneeyanawin, S., Lacerda, F., Attina, V., et al. (2015). Universality and language-specific experience in the perception of lexical tone and pitch. Appl. Psycholinguist. 36, 1–33. doi: 10.1017/S0142716414000496

CrossRef Full Text | Google Scholar

Chandrasekaran, B., Krishnan, A., and Gandour, J. T. (2007). Experience-dependent neural plasticity is sensitive to shape of pitch contours. Neuroreport 18, 1963–1967. doi: 10.1097/WNR.0b013e3282f213c5

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, A., and Kager, R. (2015). Discrimination of lexical tones in the first year of life. Infant Child Dev. 25, 426–439. doi: 10.1002/icd.1944

CrossRef Full Text | Google Scholar

Chen, A., Liu, L., and Kager, R. (2015). Cross-linguistic perception of mandarin tone sandhi. Lang. Sci. 48, 62–69. doi: 10.1016/j.langsci.2014.12.002

CrossRef Full Text | Google Scholar

Chen, A., Liu, L., and Kager, R. (2016). Role of native language in cross domain pitch perception. J. Lang. Cogn. Neurosci. 31, 751–760. doi: 10.1121/1.4874619

PubMed Abstract | CrossRef Full Text

Chomsky, N. (1986). Knowledge of Language: Its Nature, Origins and Use. New York, NY: Praeger.

Google Scholar

Cohen, A. J., Thorpe, L. A., and Trehub, S. E. (1987). Infants’ perception of musical relations in short transposed tone sequences. Can. J. Psychol. 41, 33–47. doi: 10.1037/h0084148

CrossRef Full Text | Google Scholar

Cuddy, L., and Cohen, A. (1976). Recognition of transposed melodic sequences. Q. J. Exp. Psychol. 28, 255–270.

Google Scholar

Dowling, J. (1978). Scale and contour: two components of a theory of memory for melodies. Psychol. Rev. 85, 341–354.

Google Scholar

Endress, A. D., Scholl, B. J., and Mehler, J. (2005). The role of salience in the extraction of algebraic rules. J. Exp. Psychol. Gen. 134, 406–419. doi: 10.1037/0096-3445.134.3.406

PubMed Abstract | CrossRef Full Text | Google Scholar

Fodor, J. (1983). The Modularity of Mind. Cambridge, MA: The MIT Press.

Google Scholar

Frota, S., Butler, J., and Vigário, M. (2014). Infants’ perception of intonation: Is it a statement or a question? Infancy 19, 194–213. doi: 10.1111/infa.12037

CrossRef Full Text | Google Scholar

Gao, J., Shi, R., and Li, A. (2011). “Categorization of lexical tones in Mandarin learning infants,” in Proceedings of the Fifth International Conference on Speech Prosody 2011, Chicago, IL.

Google Scholar

Gomez, R. L., and Gerken, L. (1999). Artificial grammar learning by 1-year-olds leads to specific and abstract knowledge. Cognition 70, 109–135.

PubMed Abstract | Google Scholar

Gussenhoven, C. (2004). The Phonology of Tone and Intonation. Cambridge: University Press.

Google Scholar

Harrison, P. (2000). Acquiring the phonology of lexical tone in infancy. Lingua 110, 581–616. doi: 10.1016/S0024-3841(00)00003-6

CrossRef Full Text | Google Scholar

Hauser, M. D., and McDermott, J. (2003). The evolution of the music faculty: a comparative perspective. Nat. Neurosci. 6, 663–668.

Google Scholar

Hay, J., Estes, K., Wang, T., and Saffran, J. (2015). From flexibility to constraint: the contrastive use of lexical tone in early word learning. Child Dev. 86, 10–22. doi: 10.1111/cdev.12269

PubMed Abstract | CrossRef Full Text | Google Scholar

Henson, R. N. A. (1998). Short-term memory for serial order: the start-end model. Cogn. Psychol. 36, 73–137. doi: 10.1006/cogp.1998.0685

PubMed Abstract | CrossRef Full Text | Google Scholar

Hitch, G. J. (1996). Temporal grouping effects in immediate recall: a working memory analysis. Q. J. Exp. Psychol. A 49, 116–139. doi: 10.1080/713755609

CrossRef Full Text | Google Scholar

Hume, E., and Johnson, K. (2001). “A model of the interplay of speech perception and phonology,” in The Role of Speech Perception in Phonology, eds E. Hume and K. Johnson (New York, NY: Academic press), 3–26.

Google Scholar

Kiriloff, C. (1969). On the auditory discrimination of tones in mandarin. Phonetica 20, 63–67.

Kuhl, P. K., Williams, K. A., Lacerda, F., Stevens, K. N., and Lindblom, B. (1992). Linguistic experience alters phonetic perception in infants by 6 months of age. Science 31, 606–608.

PubMed Abstract | Google Scholar

Liu, L., and Kager, R. (2014). Perception of tones by infants learning a non-tone language. Cognition 133, 385–394. doi: 10.1016/j.cognition.2014.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, W., Zhou, P., Singh, L., and Gao, L. (2017). Spoken word recognition in young tone language learners: age-dependent effects of segmental and suprasegmental variation. Cognition 159, 139–155. doi: 10.1016/j.cognition.2016.11.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Mattock, K., and Burnham, D. (2006). Chinese and English infants’ tone perception: evidence for perceptual reorganization. Infancy 10, 241–265. doi: 10.1207/s15327078in1003_3

CrossRef Full Text | Google Scholar

Mattock, K., Molnar, M., Polka, L., and Burnham, D. (2008). The developmental course of lexical tone perception in the first year of life. Cognition 106, 1367–1381. doi: 10.1016/j.cognition.2007.07.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Mazuka, R., Hasegawa, M., and Tsuji, S. (2014). Development of non-native vowel discrimination: improvement without exposure. Dev. Psychobiol. 56, 192–209. doi: 10.1002/dev.21193

PubMed Abstract | CrossRef Full Text | Google Scholar

Mehler, J., and Christophe, A. (1995). “Maturation and learning of language during the first year of life,” in The Cognitive Neurosciences, ed. M. S. Gazzaniga (Cambridge, MA: Bradford Books/MIT), 943–954.

Moore, J. K., and Linthicum, F. H. (2007). The human auditory system: a timeline of development. Int. J. Audiol. 46, 460–478. doi: 10.1080/14992020701383019

PubMed Abstract | CrossRef Full Text | Google Scholar

Morrongiello, B. A., Kulig, J. W., and Clifton, R. K. (1984). Developmental changes in auditory temporal perception. Child Dev. 55, 461–471.

Google Scholar

Moulines, E., and Laroche, J. (1995). Non-parametric techniques for pitch-scale and time-scale modification of speech. Speech Commun. 16, 175–205. doi: 10.1016/0167-6393(94)00054-E

CrossRef Full Text | Google Scholar

Narayan, C. R., Werker, J. F., and Beddor, P. S. (2010). The interaction between acoustic salience and language experience in developmental speech perception: evidence from nasal place discrimination. Dev. Sci. 13, 407–420. doi: 10.1111/j.1467-7687.2009.00898.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Nazzi, T., Bertoncini, J., and Mehler, J. (1998). Language discrimination by newborns: toward an understanding of the role of rhythm. J. Exp. Psychol. Hum. Percept. Perform. 24, 756–766. doi: 10.1037/0096-1523.24.3.756

PubMed Abstract | CrossRef Full Text | Google Scholar

Peretz, I., Champod, A., and Hyde, K. (2003). Varieties of musical disorders. Ann. N. Y. Acad. Sci. 999, 58–75.

PubMed Abstract | Google Scholar

Peretz, I., and Coltheart, M. (2003). Modularity of music processing. Nat. Neurosci. 6, 688–691.

Google Scholar

Piaget, J. (1926). The Language and Thought of the Child. New York, NY: Harcourt Brace & Company.

Google Scholar

Pinker, S. (ed.). (1994). The Language Instinct. New York, NY: Morrow.

Google Scholar

Plantinga, J., and Trainor, L. J. (2005). Memory for melody: infants use a relative pitch code. Cognition 98, 1–11. doi: 10.1016/j.cognition.2004.09.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Plantinga, J., and Trainor, L. J. (2009). Melody recognition by two-month-old infants. J. Acoust. Soc. Am. 125, EL58–EL62. doi: 10.1121/1.3049583

PubMed Abstract | CrossRef Full Text | Google Scholar

Quam, C., and Swingley, D. (2010). Phonological knowledge guides 2-year-olds’ and adults’ interpretation of salient pitch contours in word learning. J. Mem. Lang. 62, 135–150. doi: 10.1016/j.jml.2009.09.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Sato, Y., Sogabe, Y., and Mazuka, R. (2010). Development of hemispheric specialization for lexical pitch accent in Japanese infants. J. Cogn. Neurosci. 22, 2503–2513. doi: 10.1162/jocn.2009.21377

PubMed Abstract | CrossRef Full Text | Google Scholar

Schiavetto, A., Cortese, F., and Alain, C. (1999). Global and local processing of musical sequences: an event–related brain potential study. Neuroreport 10, 2467–2472.

PubMed Abstract | Google Scholar

Scott, L. S., Pascalis, O., and Nelson, C. A. (2007). A domain-general theory of the development of perceptual discrimination. Curr. Dir. Psychol. Sci. 16, 197–201. doi: 10.1111/j.1467-8721.2007.00503.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, X. S. (1989). Toward a register approach in teaching mandarin tones. J. Chin. Lang. Teach. Assoc. 24, 27–47.

Google Scholar

Shi, R. (2010). Contextual variability and infants’ perception of tonal categories. Chin. J. Phon. 2, 1–9.

Google Scholar

Singh, L. (2008). Influences of high and low variability on infant word recognition. Cognition 106, 833–870. doi: 10.1016/j.cognition.2007.05.00

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, L., and Chee, M. (2016). Rise and fall: effects of tone and intonation on spoken word recognition in early childhood. J. Phon. 55, 109–118. doi: 10.1016/j.wocn.2015.12.005

CrossRef Full Text | Google Scholar

Singh, L., and Fu, C. S. L. (2016). A new view of language development: the acquisition of lexical tone. Child Dev. 87, 834–854. doi: 10.1111/cdev.12512

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, L., Morgan, J. L., and White, K. S. (2004). Preference and processing: the role of speech affect in early spoken word recognition. J. Mem. Lang. 51, 173–189. doi: 10.1016/j.jml.2004.04.004

CrossRef Full Text | Google Scholar

So, C. K., and Best, C. T. (2010). Cross-language perception of non-native tonal contrasts: effects of native phonological and phonetic influences. Lang. Speech 53, 273–293. doi: 10.1177/0023830909357156

PubMed Abstract | CrossRef Full Text | Google Scholar

Soderstrom, M., Ko, E., and Nevzorova, U. (2011). It’s a question? infants attend differently to yes/no questions and declaratives. Infant Behav. Dev. 34, 107–110. doi: 10.1016/j.infbeh.2010.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Spetner, N. B., and Olsho, L. W. (1990). Auditory frequency resolution in human infancy. Child Dev. 61, 632–652. doi: 10.1111/j.1467-8624.1990.tb02808.x

CrossRef Full Text | Google Scholar

Stefanics, G., Háden, G. P., Sziller, I., Balázs, L., Beke, A., and Winkler, I. (2009). Newborn infants process pitch intervals. Clin. Neurophysiol. 120, 304–308. doi: 10.1016/j.clinph.2008.11.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomasello, M. (2003). Constructing a Language: A Usage Based Theory of Language Acquisition. Cambridge, MA: Harvard University Press.

Google Scholar

Trainor, L., and Hannon, E. E. (2013). “Musical development,” in The Psychology of Music, ed. D. Deutsch (San Diego, CA: Elsevier), 425–497.

Google Scholar

Trainor, L. J., and Trehub, S. (1992). A comparison of infants’ and adults’ sensitivity to western musical structure. J. Exp. Psychol. Hum. Percept. Perform. 18, 394–402.

Google Scholar

Trainor, L. J., Wu, L., and Tsang, C. D. (2004). Long-term memory for music: infants remember tempo and timbre. Dev. Sci. 7, 289–296. doi: 10.1111/j.1467-7687.2004.00348.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Trehub, S., Bull, D., and Thorpe, L. A. (1984). Infants’ perception of melodies: the role of melodic contour. Child Dev. 55, 821–830.

Google Scholar

Trehub, S., Thorpe, L., and Morrongiello, B. A. (1987). Organizational processes in infants’ perception of auditory patterns. Child Dev. 58, 741–749.

Google Scholar

Trehub, S. E. (2003). The developmental origins of musicality. Nat. Neurosci. 6, 669–673.

Google Scholar

Trehub, S. E., and Hannon, E. E. (2006). Infant music perception: domain-general or domain-specific mechanisms? Cognition 100, 73–99. doi: 10.1016/j.cognition.2005.11.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsuji, S., and Cristia, A. (2014). Perceptual attunement in vowels: a meta-analysis. Dev. Psychobiol. 56, 179–191. doi: 10.1002/dev.21179

PubMed Abstract | CrossRef Full Text | Google Scholar

Tyler, M. D., Best, C. T., Goldstein, L. M., and Antoniou, M. (2014). Investigating the role of articulatory organs and perceptual assimilation of native and non-native fricative place contrasts. Dev. Psychobiol. 56, 210–227. doi: 10.1002/dev.21195

PubMed Abstract | CrossRef Full Text | Google Scholar

Werker, J. F., and Tees, R. C. (1984). Cross-language speech perception: evidence for perceptual reorganization during the first year of life. Infant Behav. Dev. 7, 49–63. doi: 10.1016/S0163-6383(84)80022-3

CrossRef Full Text | Google Scholar

Werner, L. A., Marean, G. C., Halpin, C. F., Spetner, N. B., and Gillenwater, J. M. (1992). Infant auditory temporal acuity: gap detection. Child Dev 6.3, 260–272.

Google Scholar

Wong, P., Ciocca, V., Chan, A., Ha, L., Tan, L., and Peretz, I. (2012). Effects of culture on musical pitch perception. PLoS ONE 7:e33424. doi: 10.1371/journal.pone.0033424

PubMed Abstract | CrossRef Full Text | Google Scholar

Wong, P., and Perrachione, T. K. (2007). Learning pitch patterns in lexical identification by native English-speaking adults. Appl. Psycholinguist. 28, 565–585.

Google Scholar

Xu, Y. (2011). “Post-focus compression: cross-linguistic distribution and historical origin,” in Proceedings of the 17th International Congress of Phonetic Sciences, Hong Kong, 152–155.

Google Scholar

Yeung, H. H., Chen, K. H., and Werker, J. F. (2013). When does native language input affect phonetic perception? The precocious case of lexical tone. J. Mem. Lang. 68, 123–139. doi: 10.1016/j.jml.2012.09.004

CrossRef Full Text | Google Scholar

Yip, M. (2002). Tone. Cambridge: Cambridge University Press.

Google Scholar

Keywords: lexical tone, musical pitch, perception development, cross-domain cognition, infancy

Citation: Chen A, Stevens CJ and Kager R (2017) Pitch Perception in the First Year of Life, a Comparison of Lexical Tones and Musical Pitch. Front. Psychol. 8:297. doi: 10.3389/fpsyg.2017.00297

Received: 25 October 2016; Accepted: 16 February 2017;
Published: 09 March 2017.

Edited by:

Leher Singh, National University of Singapore, Singapore

Reviewed by:

Xiuli Tong, University of Hong Kong, Hong Kong
Weiyi Ma, Macquarie University, Australia

Copyright © 2017 Chen, Stevens and Kager. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ao Chen, aXJpc2NoZW43MUBob3RtYWlsLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.