Perceptual Reorganization of Lexical Tones: Effects of Age and Experimental Procedure

Götz, Antonia; Yeung, H. Henny; Krasotkina, Anna; Schwarzer, Gudrun; Höhle, Barbara

doi:10.3389/fpsyg.2018.00477

ORIGINAL RESEARCH article

Front. Psychol., 06 April 2018

Sec. Psychology of Language

Volume 9 - 2018 | https://doi.org/10.3389/fpsyg.2018.00477

This article is part of the Research TopicLexical Tone Perception in Infants and Young Children: Empirical studies and theoretical perspectivesView all 23 articles

Perceptual Reorganization of Lexical Tones: Effects of Age and Experimental Procedure

Antonia Götz¹^*

¹Linguistics Department, University of Potsdam, Potsdam, Germany
²Department of Linguistics, Simon Fraser University, Burnaby, BC, Canada
³Developmental Psychology, Justus-Liebig University Gießen, Giessen, Germany

Findings on the perceptual reorganization of lexical tones are mixed. Some studies report good tone discrimination abilities for all tested age groups, others report decreased or enhanced discrimination with increasing age, and still others report U-shaped developmental curves. Since prior studies have used a wide range of contrasts and experimental procedures, it is unclear how specific task requirements interact with discrimination abilities at different ages. In the present work, we tested German and Cantonese adults on their discrimination of Cantonese lexical tones, as well as German-learning infants between 6 and 18 months of age on their discrimination of two specific Cantonese tones using two different types of experimental procedures. The adult experiment showed that German native speakers can discriminate between lexical tones, but native Cantonese speakers show significantly better performance. The results from German-learning infants suggest that 6- and 18-month-olds discriminate tones, while 9-month-olds do not, supporting a U-shaped developmental curve. Furthermore, our results revealed an effect of methodology, with good discrimination performance at 6 months after habituation but not after familiarization. These results support three main conclusions. First, habituation can be a more sensitive procedure for measuring infants' discrimination than familiarization. Second, the previous finding of a U-shaped curve in the discrimination of lexical tones is further supported. Third, discrimination abilities at 18 months appear to reflect mature perceptual sensitivity to lexical tones, since German adults also discriminated the lexical tones with high accuracy.

Introduction

During the first year of life, infants' perception abilities may change for stimuli that are not present or not relevant in their environment. For example, in the linguistic domain, perceptual changes have been detected in infants' sensitivity to native and non-native speech sounds. With increased experience with their native language, infants show an enhanced ability to distinguish between native speech sounds, whereas the initial sensitivity to non-native speech sounds decreases. This pattern of perceptual reorganization has been shown for consonants (Werker and Tees, 1984; Rivera-Gaxiola et al., 2005), vowels (Polka and Bohn, 1996, 2011; Tsuji and Cristia, 2014), lexical tones (Mattock and Burnham, 2006; Mattock et al., 2008; Yeung et al., 2013; Liu and Kager, 2014); (Singh and Fu, 2016), and word stress (Höhle et al., 2009; Skoruppa et al., 2009; Bijeljac-Babic et al., 2012).

However, research in recent years has converged on the idea that this picture is too simplistic. On the one hand, not all linguistically relevant sound contrasts are easily discriminable by young infants (Narayan et al., 2010; for a review, see Maurer and Werker, 2014). On the other hand, there are non-native sound contrasts that are discriminable by children beyond the typical ages of perceptual reorganization, and even by adults (for consonantal contrasts, see Best et al., 2001; for vocalic contrasts, see Mazuka et al., 2014). The present paper investigates the potential perceptual reorganization of lexical tones by infants learning non-tone languages. Previous research on lexical tone discrimination in infants is characterized by a rather complex pattern of findings: prior studies have found evidence for an increase, a decrease, and no-change in infants' and toddlers' ability to discriminate non-native tone contrasts across ages (for an overview, see Table 1). These divergent findings may be related to a number of dimensions on which these studies varied, including the tone contrasts used, the native language of the participants, and the experimental procedures. Our study focuses on the latter factor and compares the effects of familiarization vs. habituation in the initial exposure phase on German-learning infants' discrimination of a Cantonese tone contrast. In familiarization experiments infants are exposed to certain stimuli for a fixed time, thus the exposure is experimenter-controlled. In contrast, exposure in habituation is infant-controlled as the infant needs to reach a specific criterion (decrease in looking time) to proceed to the test phase. Thus, the latter type of pre-exposure may be more sensitive to the performance of individual infants.

TABLE 1

Table 1. Summary of the previous results on infant lexical tone perception.

We will first review prior studies on infants' and adults' perception of lexical tones and then present three experimental studies. In the first study, Cantonese tone discrimination in adult native speakers of Cantonese was compared to that in adult native speakers of German. In the second study, the discrimination of the high-rising and the mid-level Cantonese tones was tested in German-learning infants between 6 and 18 months of age using a familiarization procedure. The third experiment investigated discrimination of the same tone contrast in 6- and 9-month-old German infants using a habituation procedure.

Previous Studies on Infants' Non-native Lexical Tone Perception

A detailed review of infant tone perception can be found elsewhere (Singh and Fu, 2016). Here, we focus on studies that have investigated how infants learning a non-tonal language as their native language perceive different tones from various tone systems and we incorporate some more recent studies on infant tone perception. Furthermore, our review will also highlight details of prior experimental methods.

The first studies that tested perceptual reorganization of lexical tones provided evidence for a decline in tone discrimination by infants learning a non-tone language. Mattock and Burnham (2006) compared English and Chinese (Mandarin- or Cantonese-learning) infants at 6 and 9 months on their discrimination of Thai rising vs. falling as well as rising vs. low tones using the Conditioned Head-Turn (CHT) paradigm. Infants were first trained to perform a head-turn whenever an auditory background stimulus (a syllable carrying one tone) was replaced by the target stimulus (the segmentally same syllable with another tone). In the test phase—which was started after three consecutively correct head-turns in the training—the number of correct head-turns to a stimulus change was the dependent variable. Both 6- and 9-month-old Chinese-learning infants discriminated both tone contrasts, but English-learning infants showed a decrease in their discrimination from 6 to 9 months of age, with an overall higher performance for the rising-falling than for the rising-low contrast.

Mattock et al. (2008) extended this study to 4-month-old infants learning English or French, while continuing to test 6- and 9-month-olds acquiring these languages. They used a visual fixation paradigm (i.e., they measured infants' looking time at a central visual display during auditory stimulus presentation), where infants were initially exposed to a syllable representing either a low or a rising Thai tone for 30 s in a familiarization phase. In the test phase, two trial types were presented: four alternating trials that contained both the familiarized and the non-familiarized tone, and four non-alternating trials that only contained tokens of the familiarized tone. In this Stimulus Alternation Preference Procedure (SAPP), the 4- and 6-month-olds but not the 9-month-olds showed significantly longer looking times for the alternating trials compared to the non-alternating trials with no difference across the language groups.

Yeung et al. (2013) tested 4- and 9-month-olds learning Cantonese, Mandarin, and English on Cantonese tones that were similar to the Thai contrast (high-rising vs. mid-level tones) investigated by Mattock and colleagues. Using a modification of the SAPP, infants heard three trial types in the test phase: four alternating trials (familiarized and non-familiarized tone intermixed), two non-alternating trials only containing the familiarized tone, and two non-alternating trials only containing the non-familiarized tone. With this modification, discrimination and preference could be measured in the looking times obtained within the same experiment: that is, differences between the alternating and non-alternating trials would indicate discrimination while the direction of differences between the non-alternating trials would indicate preference. The English-learning infants showed a decline in the ability to discriminate these contrasts while this was not the case for the Mandarin or Cantonese infants. Moreover, infants learning one of the tonal languages showed an asymmetrical performance pattern with better discrimination when they were familiarized with the high-rising tone than with the mid-level tone.

While these studies showed a decline in discrimination ability for non-tone language learners, others have found enhanced perceptual abilities with increasing age (Chen and Kager, 2016; Chen et al., 2017; Tsao, 2017). Chen and Kager (2016) as well as Chen et al. (2017) tested Dutch-learning infants' discrimination of the Mandarin low-rising and low-dipping tones. Different from Mattock et al. (2008) and Yeung et al. (2013), who used familiarization in the initial exposure phase, infants were first habituated by repeatedly being exposed to one of the tones until their looking time had decreased for a predefined percentage. Then in the test phase, one trial of the habituated tone and one trial of the non-habituated tone were presented. The results from both studies suggest successful discrimination in 6- and 12-month-olds but not in 4-month-olds. The authors concluded from their results that, with increasing age, infants develop more fine-grained acoustic discrimination abilities for pitch information. Increasing perceptual sensitivity was also observed by Tsao (2017), who tested 6–8 and 10–12-month-old Mandarin- and English-learning infants using the CHT paradigm on the Mandarin high-level vs. low-dipping tones. Both language groups showed discrimination at both ages and their discrimination ability was enhanced with increasing age.

A third pattern found in the literature is that infants show no changes in their discrimination ability with increasing age (Liu and Kager, 2014, 2017; Ramachers et al., 2017; Shi et al., 2017; Tsao, 2017). Ramachers et al. (2017) tested Dutch and Limburgian¹ 6-, 9-, and 12-month-old infants with Limburgian falling vs. falling-rising tones. After the infants were habituated with one tone, they were presented with trials that only contained the habituated tone (non-alternating) or with a mixture of the habituated and the non-habituated tones (alternating). Looking time to a central visual display was the dependent measure, and results showed that Dutch infants at all ages (with no previous exposure to this specific dialect) discriminated the Limburgian tone contrast. Ramachers et al. (2017) argue that Dutch intonation has pitch contours (H*L and H*LH%) that are acoustically comparable to the Limburgian tones (Gussenhoven, 2004), which may have led to a maintenance of discrimination. Shi et al. (2017) came to a similar result when testing French-learning 4-, 8-, and 11-month-old infants. They habituated the infants to one instance of two Mandarin tone contrasts: either one token from the perceptually close rising vs. low-dipping contrast or one from the perceptually more distinct high-level vs. falling contrast. Infants were then tested on their discrimination of the habituated and the non-habituated tones. The infants showed successful discrimination across all three age groups with slight indications of a decline only for the perceptually close contrast. They discuss their findings as an indication of the emerging impact of native phonology and of the acoustic salience of the tested contrast in the perception of the non-native tone patterns.

Finally, a fourth developmental pattern was observed by Liu and Kager (2014), who tested the discrimination of the Mandarin high-level vs. high-falling tonal contrast in Dutch infants between 5 and 18 months of age using the visual fixation paradigm implemented with a habituation procedure. Their study revealed perceptual sensitivity at all ages when using naturally recorded speech stimuli. However, they found a U-shaped developmental curve in a second experiment, in which synthesized stimuli with smaller acoustic differences of the same contrast were used. Specifically, Dutch-learning infants at 5–6 and 17–18 months of age discriminated the contrast in these materials, but not the intermediate age groups. This U-shaped development was also found in a group of bilingual infants learning Dutch and another non-tone language (Liu and Kager, 2017). In line with Shi et al. (2017), the authors interpreted the finding that Dutch-learning infants regain their ability to discriminate the tones as a result of their experience with the native (Dutch) intonation system and its modulation by the acoustic salience of the contrast. To our knowledge, the two studies by Liu and Kager (2014, 2017) are the only ones that have tested tone perception across a larger age range extending into the second year of life and that have found evidence for a U-shaped learning curve.

In sum, previous studies have shown that infants' non-native tone perception is probably influenced by a large number of factors, including age, task demands, the acoustic salience of the target tone contrast, and the prosodic systems of the native languages of the infant participants. Thus, developmental change in language acquisition and the experimental observation of this change seem to be dependent on a complex interaction of different factors. This links up with findings that show that older children and adult speakers of non-tone languages can also identify and discriminate lexical tones, even though their performance is typically below that of native speakers of the particular language (Burnham and Francis, 1997; Hallé et al., 2004; Francis et al., 2008; So and Best, 2010; Hay et al., 2015). The adult perception of L2 tones has been shown to be influenced by various factors, among others by the L1 lexical tone system (if the L1 is a tone language) or the use of pitch variation for post-lexical functions, (e.g., different intonation or phrasing patterns) in the native language (Wayland and Li, 2008; Caldwell-Harris et al., 2015), but also by specific task conditions (e.g., duration of the interstimulus interval, requirement to count backwards during the interstimulus interval) that can show differential effects on non-native and native speakers' performance (Lee et al., 1996). One explanation for good tone discrimination abilities in adult speakers of non-tonal languages is that hearers might adopt their knowledge about the native intonation system for identifying and discriminating lexical tones (Francis et al., 2008). For instance, Francis et al. (2008) found that English listeners were highly accurate in identifying the Cantonese high-rising tone, which the authors linked to the acoustic similarity of this Cantonese tone to the rising intonation pattern of questions in English. Another possibility derives from the acoustic salience of the tested contrast. Highly acoustically salient tone contrasts are easier to discriminate independent of the native language background (Hallé et al., 2004). Given these findings that tone discrimination in adult speakers of non-tonal languages is possible, but is modulated by several factors, adult speakers' performance also needs to be considered when studying perceptual reorganization of tone discrimination in early infancy.

The Current Study

The above-reviewed research on infants' non-native tone perception reflects the influence of several factors on experimental outcomes: acoustic properties of the tones used in the experiments, characteristics of the prosodic systems of the native languages of the participants, and also aspects of the experimental procedures. The studies that have found a perceptual decline with increasing age have mainly used familiarization procedures (Mattock et al., 2008; Yeung et al., 2013), whereas all studies that have found patterns of (re-)increased or maintained sensitivity across age have used infant-controlled habituation or conditioning procedures (Liu and Kager, 2014, 2017; Hay et al., 2015; Chen and Kager, 2016; Chen et al., 2017; Ramachers et al., 2017; Shi et al., 2017; Tsao, 2017). This suggests that habituation may be the more robust procedure to reveal discrimination abilities in infants. In line with this consideration, a recent test–retest reliability study suggests that habituation results are more consistent and reveal larger effects at the group level than familiarization (Cristia et al., 2016). One reason for this could be that infants in a habituation procedure enter the test phase of the experiment on an individually controlled encoding status of the stimulus. The duration of the exposure during the habituation procedure is dependent on infants' response to the stimulus. In contrast, familiarization has a fixed duration that does not take into account individual differences in the speed of encoding the stimuli. According to the model by Hunter and Ames (1988), the degree of familiarity with the exposed stimulus (which depends on an interaction of stimulus complexity and the infants' age as an indicator of developmental level) determines whether an infant prefers the familiar or the novel stimulus in the test phase. Therefore, group results may reflect heterogeneous individual patterns of novelty or familiarity preferences, which may lead to null effects. This inconsistency in the direction of preferences is actually predicted after familiarization in some cases but is never predicted after habituation. Thus, the conflicting results on infants' tone perception obtained across different studies may at least partly be related to the use of different pre-exposure techniques.

The present study had two main objectives. First, we further investigated the U-shaped development found by Liu and Kager (2014) using another tone contrast and testing a population with a different native language than Dutch. To this end, discrimination of a Cantonese tone contrast was tested with German-learning infants between 6 and 18 months of age, as well as with a group of German and Cantonese adults. Second, we wanted to pursue the question of methodological impacts on the results in infant discrimination studies. For that reason, the effect of using a familiarization or a habituation technique on the discrimination performance of 6- and 9-month-olds was investigated by testing these two age groups with two different experimental procedures.

Before testing infants, we first asked whether the target tone contrast would be discriminated by adult speakers of German. We tested a group of German adults on their ability to discriminate Cantonese tone contrasts and compared the results to the performance of a group of adult native speakers of Cantonese. Our prediction was that German adults may be able to discriminate these tones in an AXB task but that Cantonese speakers should outperform the German speakers. An AXB task was chosen to reduce the effects of memory load. Different tokens of syllables from the same tonal category were used to force listeners to discriminate categorically rather than acoustically.

Experiment 1: Adults' Discrimination of Cantonese Lexical Tones

Methods

Participants

Ten native Cantonese speakers (19–31 years, 5 female) and 14 native German speakers (22–31 years, 8 female) participated in this study. None of the native German speakers had any language competence in Cantonese or another tone language. Although all participants reported L2 proficiency in English, they considered themselves to be monolingual. All participants reported normal hearing abilities. The study was approved by the Ethics Committee of the University of Potsdam. Written informed consent in accordance with the Declaration of Helsinki was obtained from all participants.

Stimuli

The stimuli for the adult experiment comprised five different Cantonese lexical tones: high-rising (Tone 25), mid-level (Tone 33), low-falling (Tone 21), low-rising (Tone 23), and low-level (Tone 22). Although our experiments with the German infants (see below) were restricted to testing the discrimination of only Tone 33 and Tone 25², we examined more tone contrasts in the adult experiment. This was done in order to minimize any effects of only presenting two tones repeatedly, which may draw the participants' attention to their specific acoustic differences and thus foster enhancement of discrimination during the experiment. A second reason for including multiple tones was to generate a broader picture of German adults' processing of lexical tones.

A female native speaker of Cantonese produced 40 segmentally different CV and CVC syllables in each of these five tones leading to 200 different syllables overall (e.g., the syllables/jin/and/se/, each produced with five different tones). Half of the stimuli were CV and the other half CVC syllables. All syllables had a legal German phonotactic structure and were meaningful Cantonese words. To create acoustic variability the speaker produced each stimulus four times. An acoustic analysis of the pitch patterns of the stimuli was conducted using PRAAT (see Table 2; Boersma and Weenink, 2016). Pitch contours were measured by sampling at three different time points within the vowel: at initial, middle (at 50%), and final position. Figure 1 illustrates an example of the five different pitch contours of the syllable/jin/. The pitch contour of level tones showed no change across the syllable (Tone 22, Tone 33), whereas for contour tones a pitch rise (Tone 23, Tone 25) or fall (Tone 21) occurred at the end of the syllable. For the experiment, all stimuli were normalized in intensity.

TABLE 2

Table 2. Results from the acoustic analysis of the different Cantonese lexical tones.

FIGURE 1

Figure 1. An example of the F0 contours of the syllable /jin/ of the five different tested Cantonese tones.

Procedure

Both Cantonese and German adults performed an AXB discrimination task. In this task, participants needed to discriminate between ten different tone pairs. The five tone types were combined with each other, such that Stimulus A and B of a trial were always segmentally identical syllables but belonged to different tone categories; X also had the same segmental structure and belonged either to the same tone category as A or as B. An AXB task was chosen to reduce the effects of memory load compared to an ABX task. The X in an AXB task is equally distant from A to B, which prevents a mapping bias to the B stimulus (Best et al., 2001; Hallé et al., 2004; Strange and Shafer, 2008). Within a trial, different tokens of the syllables from the same tonal category were used to force listeners to discriminate categorically rather than acoustically (Best et al., 1988; Polka, 1991, 1992), thereby increasing the likelihood of finding language-specific effects.

Four different trial types with the four possible orders of the stimuli were presented: AAB, ABB, BAA, and BBA. Each participant heard each of the 40 types of syllables combined with only one tone contrast. The pairing was randomized and counterbalanced across the participants (e.g., one participant heard the contrast Tone 25–Tone 33 on the syllable/se/, while another participant heard the contrast Tone 22–Tone 33 on the same syllable). Therefore, every participant heard each of the 40 syllables during the experiment but the tone contrast that was instantiated on these syllables varied across the participants. Each tone contrast occurred with four different syllables for each participant. During the experiment, each syllable-tone pairing was presented four times, once in each trial type. This resulted in an overall number of 160 trials for each participant (4 syllables × 10 tone contrasts × 4 trial orders). These trials were divided into four blocks of 40 trials, in order to allow pauses in between. Each block only contained one of the trial types for a syllable-tone pair. The trials within a block were presented in a pseudo-randomized order with the same tone contrast never repeating twice in row. The stimuli within trials were separated by an interstimulus interval of 1,000 ms; the intertrial interval was 3,000 ms. An interstimulus interval of 1,000 ms was chosen because previous studies have shown that language-specific effects are more clearly revealed with long interstimulus intervals (Werker and Logan, 1985). The maximum response time for the participants was 2,500 ms, measured from the offset of the last syllable. The pause between blocks was controlled by the participant, and the experiment continued when the participant pressed a button. In total, the experiment lasted around 20 min.

Participants were instructed to decide whether the second syllable was more similar to the first or to the third syllable, otherwise they were not instructed to attend to any specific part of the syllables. The experiment and the participants' responses on a keyboard were controlled with OpenSesame (Mathôt et al., 2012) and run on a laptop. All trials were presented over headphones in a silent room.

Results

Figure 2 summarizes the percentages of correct responses given for all contrasts by both language groups. Statistical analyses were run on the number of correct responses as the dependent variable. The performance of both language groups was significantly higher than predicted by chance for all tone contrasts (one sample t-test against chance level, all p's < 0.001). This was also true for the relevant tone contrast for the infant study (Tone 33–Tone 25). Most importantly, a one sample t-test against chance revealed above chance performance in German adults (t = 18.55, p < 0.001) for this contrast.

FIGURE 2

Figure 2. Results from the AXB discrimination task, separated by group and tone contrast.

As a next step, we compared different models that were computed with the function glmer from the lme4 package (Bates et al., 2015) in R (R Core Team, 2017). Models and their results were obtained by the anova function. The best fitting model [lowest Akaike Information Criterion (AIC, Akaike, 1998) and significant difference in the Chi-square test] included item and subject as random factors and interaction of language group (Cantonese and German) and tone contrast (the 10 different tone contrasts) as fixed factors; see Table 3. Additionally, we asked for musical experience. Participants were asked whether they had learned to play an instrument and if yes, how long they do or did play it. Model comparison revealed that musical experience (years playing an instrument) did not modulate the outcome of our data. Compared to the model including the interaction of Tone Contrast and Language group, the model including musical experience has higher AIC (2183.4 compared to 2175.6) and no significantly better fit with Chi-square test results (p = 0.19).

TABLE 3

Table 3. Results from the model comparison of the adult perception experiment.

In general, our results reveal good performance in both groups, but show that German native listeners performed less accurately than the native Cantonese listeners (86.5 vs. 93.4%, respectively). The statistical analysis showed that the overall performance differed significantly between the two language groups (β = −2.253, SE = 0.758, z = −2.973, p < 0.01). However, this group difference was not significant across all contrasts as indicated by the interaction of tone contrast with group. Cantonese listeners best discriminated high-rising (25) vs. mid-level (33), high-rising (25) vs. low-level (22), and mid-level (33) vs. low-rising (23), each at a level of 98.7%. German adults performed best on the discrimination of mid-level (33) vs. low-falling (21). For both groups, the contrast high-rising (25) vs. low-rising (23) was the most difficult contrast.

With respect to the infant experiments, we were especially interested in how native and non-native adults perceive the difference between high-rising and mid-level tones. Our results revealed that the Cantonese adults discriminated Tone 25 vs. Tone 33 significantly better than the German listeners (β = −2.503, SE = 0.871, z = −2.874, p < 0.01). Furthermore, native listeners discriminated Tone 25 vs. Tone 22 (β = −2.567, SE = 0.786, z = −3.265, p < 0.01), Tone 33 vs. Tone 23 (β = −2.047, SE = 0.850, z = −2.409, p < 0.01), Tone 21 vs. Tone 23 (β = −1.818, SE = 0.713, z = −2.549, p < 0.05), and Tone 23 vs. Tone 22 (β = −1.127, SE = 0.336, z = −3.358, p < 0.001) significantly better than the non-native German listeners. The discrimination for the other tone contrasts was not significantly different between the Cantonese and the German listeners.

Discussion

The first experiment tested the discrimination of Cantonese lexical tones by adult German listeners without knowledge of Cantonese and by native speakers of Cantonese. Three main findings were obtained: First, German native speakers were able to distinguish between different lexical tones. Second, native Cantonese speakers outperformed German listeners in their overall discrimination abilities. Third, there was variation in German listeners' discrimination performance depending on the specific contrast: while the discrimination reached native-like levels for some contrasts, performance was below that of native speakers for other contrasts. This is in line with other discrimination studies that have shown good discrimination by non-native listeners, but an overall better performance by native listeners (Lee et al., 1996; Burnham and Francis, 1997; Cutler and Chen, 1997; Francis et al., 2008).

However, the picture becomes less clear when comparing performances of each tone contrast separately. Some lexical tones (high-rising vs. mid-level, high-rising vs. low-level, low-rising vs. mid-level, low-rising vs. low-level, and low-rising vs. falling) are harder to discriminate for German than for Cantonese native speakers. However, there are also contrasts for which both language groups show comparable levels of high performance (high-rising vs. low-falling, mid-level vs. low-falling, and low-level vs. falling). Further, there are two contrasts for which both language groups show comparably lower performance (high-rising vs. low-rising, mid-level vs. low-level). It is striking that the pairs that are highly discriminable by both groups contain one level and one contour tone or two contour tones with frequency changes in opposite directions, while the tone pairs that are harder to discriminate are both level tones or show the same direction of frequency change. This pattern suggests that for non-native as well as for native tone discrimination, acoustic properties and the acoustic distance of the specific tone contrast are relevant for their discriminability. In addition, it is possible that German listeners assimilate some of the tones to their native intonation system. This would then support a language-specific account of adult tone perception. It is noteworthy that all contrasts that are highly discriminable for the German listeners contain the falling Tone 21. The good discrimination seen here might stem from familiarity with the German intonation system, which uses falling contours for neutral statements (Grice and Baumann, 2002). That is, similar to what Francis et al. (2008) have proposed for English listeners, German native speakers might use their knowledge of the native intonation system to discriminate non-native lexical tones.

To summarize, our findings from the first experiment revealed that German native speakers discriminate Cantonese lexical tones highly accurately, but native listeners perform significantly better. The overall good discrimination performance for German listeners could be explained by acoustic salience and/or assimilation to the native prosody. Our results thus showed that native and nonnative adults' performance may differ depending on the specific contrast. Discrimination abilities in adults should therefore be considered before testing potential changes in infants' non-native sound discrimination. Overall, the most important finding from our first experiment is that German adults can discriminate the tone contrast that was used in our infant studies (Tone 33 vs. Tone 25), but that their performance was below that of native speakers of Cantonese. The finding that German adults can hear the difference between these tones increases the likelihood of observing a U-shaped developmental pattern, or perceptual enhancement with increasing age. But the finding that native Cantonese listeners show higher achievements in discriminating these two tones suggest that their discrimination is not only due to a large acoustic distance, but is also affected by the native language of the listener.

Experiment 2: Testing 6-, 9-, and 18-Month-Olds Using a Familiarization Procedure

Here we contribute new data to the infant tone perception literature by testing German infants' perception of the Cantonese Tone 33 vs. Tone 25 contrast that had previously been used in a study with English-learning infants by Yeung et al. (2013). Similar to Liu and Kager (2014), we included a wider age range than Yeung et al. had done in order to test for evidence of a U-shaped developmental curve in German 6-, 9-, and 18-month-olds. Following the Yeung et al. study, we used a procedure involving familiarization, but the discrimination abilities during the test phase were assessed with the head-turn preference procedure.