<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2014.01059</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Magnitude of phonetic distinction predicts success at early word learning in native and non-native accents</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Escudero</surname> <given-names>Paola</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/53507"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Best</surname> <given-names>Catherine T.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/103455"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Kitamura</surname> <given-names>Christine</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/168145"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Mulak</surname> <given-names>Karen E.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/168287"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>The MARCS Institute, University of Western Sydney</institution> <country>Sydney, NSW, Australia</country></aff>
<aff id="aff2"><sup>2</sup><institution>School of Social Sciences and Psychology, University of Western Sydney</institution> <country>Sydney, NSW, Australia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Janet F. Werker, The University of British Columbia, Canada</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Christopher Terrence Fennell, University of Ottawa, Canada; Suzanne V. H. Van Der Feest, The University of Texas at Austin, USA</italic></p></fn>
<fn fn-type="corresp" id="fn002"><p>&#x0002A;Correspondence: <italic>Paola Escudero, The MARCS Institute, University of Western Sydney, Locked Bag 1797, Penrith, Sydney, NSW 2751, Australia e-mail: <email>paola.escudero@uws.edu.au</email></italic></p></fn>
<fn fn-type="other" id="fn001"><p>This article was submitted to Language Sciences, a section of the journal Frontiers in Psychology.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>30</day>
<month>09</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date><volume>5</volume>
<elocation-id>1059</elocation-id>
<history>
<date date-type="received">
<day>06</day>
<month>06</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>09</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Escudero, Best, Kitamura and Mulak.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p> This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Although infants perceptually attune to native vowels and consonants well before 12 months, at 13&#x02013;15 months, they have difficulty learning to associate novel words that differ by their initial consonant (e.g., BIN and DIN) to their visual referents. However, this difficulty may not apply to all minimal pair novel words. While Canadian English (CE) 15-month-olds failed to respond to a switch from the newly learned word DEET to the novel non-word DOOT, they did notice a switch from DEET to DIT (<xref ref-type="bibr" rid="B14">Curtin et al., 2009</xref>). Those authors argued that early word learners capitalize on large phonetic differences, seen in CE DEET&#x02013;DIT, but not on smaller phonetic differences, as in CE DEET&#x02013;DOOT. To assess this hypothesis, we tested Australian English (AusE) 15-month-olds, as AusE has a smaller magnitude of phonetic difference in both novel word pairs. Two groups of infants were trained on the novel word DEET and tested on the vowel switches in DIT and DOOT, produced by an AusE female speaker or the same CE female speaker as in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>. If the size of the phonetic distinction plays a more central role than native accent experience in early word learning, AusE children should more easily recognize both of the unfamiliar but larger CE vowel switches than the more familiar but smaller AusE ones. The results support our phonetic-magnitude hypothesis: AusE children taught and tested with the CE-accented novel words looked longer to both of the switch test trials (DIT, DOOT) than same test trials (DEET), while those who heard the AusE-accented tokens did not notice either switch. Implications of our findings for models of early word learning are discussed.</p>
</abstract>
<kwd-group>
<kwd>early word learning</kwd>
<kwd>phonetic distinction</kwd>
<kwd>native accent</kwd>
<kwd>non-native accent</kwd>
<kwd>vowel perception</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="59"/>
<page-count count="11"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec>
<title>INTRODUCTION</title>
<p>The first year of life sees the emergence of native phonemic categories, demonstrated by children&#x02019;s persisting discrimination of native contrasts and diminishing discrimination of non-native contrasts (<xref ref-type="bibr" rid="B55">Werker and Tees, 1983</xref>, <xref ref-type="bibr" rid="B56">1984</xref>; <xref ref-type="bibr" rid="B43">Polka and Werker, 1994</xref>). Children are born able to discriminate nearly all consonant and vowel contrasts (e.g., <xref ref-type="bibr" rid="B1">Aslin and Pisoni, 1980</xref>; for reviews, see <xref ref-type="bibr" rid="B8">Burnham, 1986</xref>; <xref ref-type="bibr" rid="B2">Best, 1994</xref>; <xref ref-type="bibr" rid="B57">Werker and Tees, 1999</xref>), but by 6&#x02013;8 months this ability begins to decline for many vowel contrasts not present in the native language environment (<xref ref-type="bibr" rid="B43">Polka and Werker, 1994</xref>; cf. <xref ref-type="bibr" rid="B42">Polka and Bohn, 1996</xref>), and by 10&#x02013;12 months sensitivity to most non-native consonant contrasts similarly declines (<xref ref-type="bibr" rid="B55">Werker and Tees, 1983</xref>, <xref ref-type="bibr" rid="B56">1984</xref>; cf. <xref ref-type="bibr" rid="B5">Best et al., 1988</xref>, <xref ref-type="bibr" rid="B4">1995</xref>). For instance, infants aged 6&#x02013;8 months brought up in an English language environment discriminate the Hindi contrast [ta]&#x02013;[a] and Salish contrast [k&#x02019;i]-[q&#x02019;i], but by 10&#x02013;12 months this ability declines, and continues to do so until, like English-speaking adults, they are no longer able to reliably discriminate many contrasts that are not present in their native language environment. By the same token, children brought up in Hindi or Salish language environments continue to discriminate the contrasts present in their native languages, as do Hindi-speaking and Salish-speaking adults (<xref ref-type="bibr" rid="B55">Werker and Tees, 1983</xref>, <xref ref-type="bibr" rid="B56">1984</xref>).</p>
<p>Paradoxically, following this auspicious beginning, 14-month-old children have difficulty applying their phonetic and phonological knowledge to learning new words. That is, children younger than 17 months do not reliably discriminate newly learned words that differ by a single native consonant contrast (<xref ref-type="bibr" rid="B48">Stager and Werker, 1997</xref>; <xref ref-type="bibr" rid="B54">Werker et al., 2002</xref>; <xref ref-type="bibr" rid="B40">Pater et al., 2004</xref>), whereas older children succeed (<xref ref-type="bibr" rid="B54">Werker et al., 2002</xref>). For example, in a Switch task in which infants were habituated to novel word-object pairings, 14-month-olds failed to notice when the novel word associated with one object was switched to a new word that differed in only one consonant (e.g., BIH switched to DIH). Crucially, this was not due to a general problem with associating visual referents to spoken words, because 14-month-olds did learn word-referent pairs when the words differed in all of their consonants and vowels, such as LIF vs. NEEM. Nor was it due to an inability to discriminate the minimal pair contrasts, as 14-month-olds discriminated the same consonant minimal pair words when they were presented outside a word-learning context in a simple auditory discrimination task (<xref ref-type="bibr" rid="B48">Stager and Werker, 1997</xref>).</p>
<p>Researchers have suggested that the difficulty children younger than 17 months have in using phonetic detail for the purpose of word learning is due to the circumstances or demands of the experimental task (e.g., <xref ref-type="bibr" rid="B48">Stager and Werker, 1997</xref>; <xref ref-type="bibr" rid="B25">Fennell and Werker, 2003</xref>). Word learning is argued to be a difficult task, with increased difficulty for similar sounding words (<xref ref-type="bibr" rid="B53">Werker and Fennell, 2004</xref>). Indeed, success at associating novel words to visual referents depends on a variety of perceptual, attentional and memory factors (<xref ref-type="bibr" rid="B49">Thiessen, 2007</xref>; <xref ref-type="bibr" rid="B44">Rost and McMurray, 2009</xref>; <xref ref-type="bibr" rid="B59">Yoshida et al., 2009</xref>). For instance, although the 14-month-olds described above failed to notice when a newly learned word was switched to a word differing in one consonant in the Switch task (<xref ref-type="bibr" rid="B48">Stager and Werker, 1997</xref>), children&#x02019;s successful pairing of the novel words BIN and DIN with their corresponding novel objects was demonstrated when they instead performed a preferential looking task after exposure to the associations (<xref ref-type="bibr" rid="B59">Yoshida et al., 2009</xref>). Children&#x02019;s success in learning the novel words BIN and DIN in a preferential looking task but not in a Switch task suggests that the latter is a more demanding task than the former. That is, while children may be able to encode some phonetic detail in novel words, they are unable to do so to an extent that allows them to overcome the additional demands of the Switch task (<xref ref-type="bibr" rid="B59">Yoshida et al., 2009</xref>).</p>
<p>Furthermore, contextualization of novel words aids early word learning. Young children learn novel word-object mappings with words that differ in only one consonant when it is clear that the words and objects are to be associated. That is, when presented with sentences such as &#x0201C;Look. It&#x02019;s the BIN,&#x0201D; or &#x0201C;I like the BIN,&#x0201D; 14-month-olds learn that &#x0201C;BIN&#x0201D; and &#x0201C;DIN&#x0201D; refer to two different objects (<xref ref-type="bibr" rid="B24">Fennell and Waxman, 2010</xref>). Accessing phonetic detail in early word learning is also aided by prior exposure to familiar words that refer to familiar objects such as &#x0201C;car&#x0201D; and &#x0201C;kitty,&#x0201D; and prior exposure to the visual referents aids the association of those objects to similar sounding novel words (<xref ref-type="bibr" rid="B23">Fennell, 2012</xref>).</p>
<p>Another line of research has shown that not all novel minimal pair words are equally difficult for young children, and that difficulties with some pairs persist beyond the first 2 years of life. In an interactive object-reaching task where children learn to pair novel objects with their novel names, 16-, 20- and 30-month-olds learned and identified novel minimal pairs that differed in only one consonant, but intriguingly, failed with pairs that differed in only one vowel (<xref ref-type="bibr" rid="B36">Nazzi, 2005</xref>; <xref ref-type="bibr" rid="B38">Nazzi and New, 2007</xref>; <xref ref-type="bibr" rid="B28">Havy and Nazzi, 2009</xref>; <xref ref-type="bibr" rid="B37">Nazzi et al., 2009</xref>). This consonant-vowel disparity is found even when the cognitive demand is reduced by testing children on familiar words. In a preferential-looking task, 15-month-olds were sensitive to consonant mispronunciations of familiar words (e.g., BALL pronounced GALL), but were less sensitive to vowel mispronunciations (e.g., BALL pronounced BULE; <xref ref-type="bibr" rid="B33">Mani and Plunkett, 2007</xref>). In the same experiment, 18-month-olds (and 24-month-olds) were sensitive to both consonant and vowel mispronunciations of familiar words, converging with research demonstrating sensitivity at that age to lexically contrastive variation in vowels embedded in novel words (<xref ref-type="bibr" rid="B15">Dietrich et al., 2007</xref>).</p>
<p>Tasks that are more supportive and provide more context about words and their referents have been shown to decrease cognitive task demands, resulting in successful novel word learning by children younger than 17 months (<xref ref-type="bibr" rid="B24">Fennell and Waxman, 2010</xref>). The interactive object-reaching task (<xref ref-type="bibr" rid="B36">Nazzi, 2005</xref>; <xref ref-type="bibr" rid="B38">Nazzi and New, 2007</xref>; <xref ref-type="bibr" rid="B28">Havy and Nazzi, 2009</xref>; <xref ref-type="bibr" rid="B37">Nazzi et al., 2009</xref>), which presents words in a sentential context and allows pre-exposure to items before each trial, is thus reasoned to impose lower cognitive demands relative to the Switch task. <xref ref-type="bibr" rid="B28">Havy and Nazzi&#x02019;s (2009)</xref> finding that 16-month-olds were able to learn novel minimal pairs differing in only one consonant in an interactive object-reaching task further supports the notion that similarly aged infants&#x02019; failure to learn novel minimal pair words in the Switch task is due to its higher cognitive demands, which lead to an underrepresentation of infants&#x02019; abilities (<xref ref-type="bibr" rid="B59">Yoshida et al., 2009</xref>). But even when tested in procedures thought to impose relatively lower cognitive demands, such as the interactive object-reaching task and the preferential looking tasks used by <xref ref-type="bibr" rid="B33">Mani and Plunkett (2007)</xref>, children younger than 18 months do not reliably learn novel word-object associations involving vowel minimal pairs. This suggests that a greater difficulty with vowel minimal pairs relative to consonant minimal pairs for children younger than 17 months would persist if tested in the Switch task. Also, the fact that no single vowel minimal pair was correctly identified by the 16-month-olds in <xref ref-type="bibr" rid="B28">Havy and Nazzi (2009)</xref> suggests that this difficulty might extend to all vowel minimal pairs. These predictions are in line with <xref ref-type="bibr" rid="B39">Nespor et al.&#x02019;s (2003)</xref> hypothesis that infants should focus more on consonants than vowels in early word learning because vowels carry more between-speaker variation and are perceived less categorically (e.g., <xref ref-type="bibr" rid="B41">Pisoni, 1973</xref>).</p>
<p>However, infants younger than 17 months <italic>have</italic> learned some novel vowel minimal pairs in a Switch paradigm. <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref> found that Canadian English (CE) learning 15-month-olds associated two novel words that differed in only one CE vowel to their corresponding novel object referents in the Switch task. Using the same version of the Switch task as that used by <xref ref-type="bibr" rid="B54">Werker et al. (2002)</xref>, three groups of children were trained on two novel word-object associations for one of three vowel minimal pairs: DEET&#x02013;DIT, DEET&#x02013;DOOT, and DIT&#x02013;DOOT. At test, only the group presented with DEET&#x02013;DIT noticed a switch in the word-object pairing (Switch trials), as shown by their higher looking time relative to trials that presented the prior word-object associations (Same trials). Children in the DEET&#x02013;DOOT and DIT&#x02013;DOOT training conditions did not demonstrate a difference in looking time to Switch trials vs. Same trials in the test phase, suggesting that only some vowel minimal pairs can be learned under the high demands of the original Switch task.</p>
<p><xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref> suggested these findings indicate that infants&#x02019; phonological representations of vowels may not be adult-like and may instead be based on the most reliable phonetic dimensions for the specific contrast. Vowels are defined by their formant frequencies, which largely reflect the position of the tongue body when producing them. The first formant (F1) is primarily associated with vowel height (tongue height), and in CE, F1 was found to reliably distinguish /i/&#x02013;// (DEET&#x02013;DIT) but not the other two non-discriminated vowel contrasts, which were instead reliably differentiated only by F2 (vowel/tongue backness: /i/&#x02013;/u/ [DEET&#x02013;DOOT] and //&#x02013;/u/ [DIT&#x02013;DOOT]), and F3 (lip rounding: /i/&#x02013;/u/). That 15-month-olds discriminated only the contrast /i/&#x02013;// suggests that for young children, the F1 dimension (vowel/tongue height) may be a stronger phonetic cue for distinguishing vowels than F2 and F3. That is, they may take the simpler approach of attending to F1 over attending to a wider range of cues. The authors proposed several reasons for this bias toward F1, which may be more apparent in tasks with high demands. Firstly, F1 may draw more attention simply because it has the most energy in the speech signal. Alternatively, it may be that in the linguistic environment of CE, F1 is attended to most because of the wide range of vowel contrasts that are defined by F1 differences, and furthermore by the weakening of cues such as F2 and F3 due to increased fronting and decreased rounding of the cardinal vowel /u/ in North American English accents (<xref ref-type="bibr" rid="B50">Thomas, 2001</xref>; <xref ref-type="bibr" rid="B14">Curtin et al., 2009</xref>, p. 5). As the authors pointed out, these interpretations are consistent with the linguistic perception (LP) model (<xref ref-type="bibr" rid="B7">Boersma et al., 2003</xref>; <xref ref-type="bibr" rid="B18">Escudero and Boersma, 2004</xref>; <xref ref-type="bibr" rid="B16">Escudero, 2005</xref>, <xref ref-type="bibr" rid="B17">2009</xref>), which proposes that young children categorize segments according to large and consistent phonetic differences along individual continua, rather than multidimensional phonemic categories as seen in adults, and that only later in development do abstract phonological categories emerge. The findings are also compatible within the framework for processing rich information from multidimensional interactive representations (PRIMIR; <xref ref-type="bibr" rid="B52">Werker and Curtin, 2005</xref>), which posits that the reliance on individual phonetic dimensions decreases over time as phonemes emerge.</p>
<p><xref ref-type="bibr" rid="B14">Curtin et al.&#x02019;s (2009)</xref> findings demonstrate that the magnitude of the phonetic distinction between two vowel sounds is predictive of early word learning success. In the present study, we further examine the phonetic-magnitude hypothesis across two different English accents. We reasoned that children from an English regional accent background [Australian English (AusE)] that displays much smaller phonetic differences among the same three vowels than those presented in CE, and who are unfamiliar with CE, may use the same phonetic dimensions differently. The results of our study will demonstrate whether the F1 dimension is always the phonetic cue that receives most attention regardless of accent differences, or whether the magnitude of its importance is accent-dependent. The results will also shed light on whether success in early word learning is restricted to children&#x02019;s native accent. We examined AusE 15-month-olds&#x02019; ability to learn and discriminate the novel words DEET, DIT and DOOT, comparing performance between participants presented with the words produced in their native AusE accent, and participants presented with words produced in the unfamiliar CE accent. We used the simple version of the Switch task (<xref ref-type="bibr" rid="B48">Stager and Werker, 1997</xref>, experiments 2 and 3) in which children are familiarized with one novel word-object pairing (DEET). We modified the task to include two types of Switch trials, so that each participant was tested with two vowel contrasts (DIT and DOOT) rather than a single contrast relative to the familiarized word. Compared to <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, our version of the Switch task had a simpler familiarization phase, as they used two word-object pairings rather than one, and a more complex testing phase, with two Switch trials rather than a single Switch trial per participant. We chose a simpler familiarization phase in order to present two Switch trials during the test, which allowed us to compare the detection of a switch in two different vowels in the same infants. This was not possible in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>. We reasoned that this design will trigger word-object association performance, as <xref ref-type="bibr" rid="B48">Stager and Werker (1997</xref>, experiment 2) argued that 14-month-olds&#x02019; inability to notice the switch from BIH to DIH with this simplified procedure, despite their ability to perceptually discriminate the contrast /b/&#x02013;/d/, was due to their treatment of the procedure as a word-object association task.</p>
<p>Our interest in examining accent differences stems in part from recent findings that the accent of both speaker and listener markedly shapes native and non-native vowel perception in adults (<xref ref-type="bibr" rid="B18">Escudero and Boersma, 2004</xref>; <xref ref-type="bibr" rid="B19">Escudero and Chl&#x000E1;dkov&#x000E1;, 2010</xref>; <xref ref-type="bibr" rid="B10">Chl&#x000E1;dkov&#x000E1; and Podlipsk&#x000FD;, 2011</xref>; <xref ref-type="bibr" rid="B9">Chl&#x000E1;dkov&#x000E1; and Escudero, 2012</xref>; <xref ref-type="bibr" rid="B22">Escudero and Williams, 2012</xref>; <xref ref-type="bibr" rid="B21">Escudero et al., 2012</xref>), and recognition of words with accent-differing vowels in 15-month-olds (<xref ref-type="bibr" rid="B6">Best et al., 2009</xref>; <xref ref-type="bibr" rid="B35">Mulak et al., 2013</xref>). If these findings extend to 15-month-olds&#x02019; learning of novel vowel minimal pair words, it is expected that AusE children will behave differently than the CE children in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>. That is, since AusE and CE vowels have different phonetic realizations in F1/F2 space (<xref ref-type="bibr" rid="B12">Cox and Palethorpe, 2007</xref>, see Figure 1, below), AusE 15-month-olds trained on novel word-object pairings produced in the CE accent are likely to exhibit different patterns of early word learning than those shown by their CE-learning counterparts in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>. But will they show different levels of success across their native AusE vs. the unfamiliar CE accents?</p>
<p>Models of perceptual attunement to native categories such as Kuhl&#x02019;s Native Language Magnet model (NLM; <xref ref-type="bibr" rid="B29">Kuhl, 1991</xref>, <xref ref-type="bibr" rid="B30">1994</xref>) and Best&#x02019;s Perceptual Assimilation Model (PAM; <xref ref-type="bibr" rid="B2">Best, 1994</xref>, <xref ref-type="bibr" rid="B3">1995</xref>) predict ease in discrimination for native vowel contrasts, as infants become highly attuned to the specific properties of their native vowels by 6 months (<xref ref-type="bibr" rid="B56">Werker and Tees, 1984</xref>; <xref ref-type="bibr" rid="B32">Kuhl et al., 1992</xref>; <xref ref-type="bibr" rid="B43">Polka and Werker, 1994</xref>). While both models are well supported by perceptual data in children younger than 15 months, they do not specifically address word learning involving minimal pairs at this age (cf. <xref ref-type="bibr" rid="B51">Tsao et al., 2004</xref>; <xref ref-type="bibr" rid="B31">Kuhl et al., 2005</xref>). However, if their thesis that native language attunement streamlines perception is correct, it would seem likely that with regard to the present study, children&#x02019;s performance on the word learning task would be optimal in the native accent condition, where vowels would map precisely onto native categories based on familiar information that children hear on a regular basis.</p>
<p>Other studies also support better performance on early word recognition across accents for native/familiar accents (for a review, see <xref ref-type="bibr" rid="B13">Cristia et al., 2012</xref>). For instance, 20-month-olds looked longer to the picture of the target word CAR when it was produced with a final rhotic (/ka/), which is the most frequent production in the children&#x02019;s Bristol UK environment, than when it was produced without the rhotic (/ka/), a pronunciation that is less frequent in Bristol (<xref ref-type="bibr" rid="B26">Floccia et al., 2012</xref>). Similarly, <xref ref-type="bibr" rid="B35">Mulak et al. (2013)</xref> found that when 15-month-olds heard a familiar word produced in their native AusE, they looked at the target image longer than the distracter image, but looked at both images equally when the word was produced in an unfamiliar accent (Jamaican Mesolect English). However, exposure to unfamiliar pronunciations or accents may overrule this native accent advantage for recognition of both familiar and novel words. For instance, <xref ref-type="bibr" rid="B58">White and Aslin (2011)</xref> showed that 19-month-olds who were familiarized to word-object pairings in which the word was consistently produced with a different vowel (e.g., BLACK or BATTLE instead of BLOCK or BOTTLE), subsequently generalized this vowel change to other familiar word-object pairings (e.g., they looked longer at the picture of a SOCK than at a distractor picture when hearing the word SACK). Additionally, 24-month-olds were able to recognize novel words across native and unfamiliar non-native accents when word training was in the unfamiliar accent (<xref ref-type="bibr" rid="B47">Schmale et al., 2011</xref>), and recognized a novel word produced in their native and in an unfamiliar non-native accent after a 2-min exposure to stories produced in the unfamiliar accent (<xref ref-type="bibr" rid="B46">Schmale et al., 2012</xref>).</p>
<p>The purpose of our study is to examine word learning of minimally different novel words (e.g., DEET&#x02013;DIT) produced in different accents, rather than the recognition of familiar words produced in novel accents (e.g., <xref ref-type="bibr" rid="B6">Best et al., 2009</xref>; <xref ref-type="bibr" rid="B58">White and Aslin, 2011</xref>; <xref ref-type="bibr" rid="B26">Floccia et al., 2012</xref>; <xref ref-type="bibr" rid="B35">Mulak et al., 2013</xref>). Since we present each infant with a single accent, our study is also different from <xref ref-type="bibr" rid="B47">Schmale et al. (2011</xref>, <xref ref-type="bibr" rid="B46">2012</xref>), where novel word recognition was tested between accents (familiarizing infants with one accent and testing them with another). Instead, we aim to demonstrate that the specific acoustic-phonetic realizations of a particular accent determine early word learning success in the absence of word knowledge or accent familiarity. To that end, we compare the performance of two infant groups, each presented with a different accent.</p>
<p>We propose that infants&#x02019; ability to learn our novel word stimuli (produced in a single accent throughout familiarization and testing) will be explained by the magnitude of the phonetic distinction of minimally different words in the accent with which they are presented (CE or AusE), rather than by accent familiarity (AusE = familiar/native, CE = non-native/unfamiliar). Inspection of the specific phonetic properties of the vowels in DEET (/i/), DIT (//) and DOOT (/u/) produced by CE and AusE speakers leads us to predict that in a word-object associative task with high demands such as the Switch task, the former accent will lead to higher success than the latter in early word learners. This prediction is supported by the values shown in Figure <xref ref-type="fig" rid="F1">1</xref> where it can be observed that while /i/ and // are largely distinguished by F1 differences in CE, the same vowels produced in AusE have very similar F1 and F2 values<sup><xref ref-type="fn" rid="fn01">1</xref></sup>. If infants rely only on F1 and F2 for distinguishing these two vowels, as suggested by <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, AusE children would be expected to better distinguish /i/ and // in the unfamiliar CE accent than their native accent. Similarly, the magnitude of the phonetic distinction along the F1 and F2 dimensions for /i/&#x02013;/u/ appears larger for CE than AusE vowels, since /u/ is more fronted in AusE than in CE and is therefore even closer to /i/. In fact, AusE /u/<sup><xref ref-type="fn" rid="fn02">2</xref></sup> can be produced as far front as /&#x000E6;/ (though it is, of course, higher than /&#x000E6;/), which means that the only back vowel characteristic that it retains is its rounding feature (<xref ref-type="bibr" rid="B11">Cox, 2006</xref>). If the phonetic magnitude hypothesis predicts early word learning, AusE children presented with novel words containing CE vowels will notice a difference between a switch in the vowel of the familiarized word DEET better than those presented with the novel words containing AusE vowels.</p>
<p>This prediction of higher success for AusE children on CE novel words compared to AusE novel words that differ in the vowels /i/, // and /u/ is in line with the LP and PAM models which posit that listeners of any age classify vowel tokens based on their acoustic or articulatory properties, respectively. As shown in Figure <xref ref-type="fig" rid="F1">1</xref>, both CE // and /u/ have F1 and F2 values that are acoustically closer to other AusE vowels than to their phonemic counterparts. Specifically, CE // is a better acoustic match to AusE /&#x1D700;/, while CE /u/ matches AusE //. Considered in terms of their articulatory properties, which mirror those of the acoustic patterns just described, the same pattern of assimilation is predicted by PAM. For an AusE listener then, the CE vowel contrasts /i/&#x02013;// and /i/&#x02013;/u/ should be perceived as the AusE contrasts /i/&#x02013;/&#x1D700;/ and /i/&#x02013;//, which both display larger phonetic distinctions than the AusE phonemic counterparts /i/&#x02013;// and /i/&#x02013;/u/. Thus, AusE listeners should distinguish these two vowel contrasts. Given that the LP model proposes continuity between vowel perception at the end of the first year and word recognition early in the second year, AusE infants are likewise predicted to detect a switch from DEET to DIT and from DEET to DOOT in the unfamiliar CE accent. Such a finding would be in contradiction to the expectation and finding of the asymmetry in discrimination of these CE contrasts by CE children reported in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, in which children detected a switch from DEET to DIT, but not DEET to DOOT (or DIT to DOOT).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>Familiarization image <bold>(A)</bold> and pre- and post-test image <bold>(B)</bold>.</bold> Visual stimuli were the same as those used in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>.</p></caption>
<graphic xlink:href="fpsyg-05-01059-g001.tif"/>
</fig>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>MATERIALS AND METHODS</title>
<sec>
<title>PARTICIPANTS</title>
<p>Participants were forty-eight 15-month-olds, who were randomly assigned to two groups: Twenty-four were familiarized and tested on CE stimuli (mean age = 15.26 months, range = 14.79&#x02013;16.00 months; 12 girls) and 24 on AusE stimuli (mean age = 15.30 months, range = 14.79&#x02013;16.10 months; 12 girls). All parents provided informed consent in accordance with the University of Western Sydney Human Research Ethics Committee. The infants were primarily Caucasian and from middle- to upper-middle-class AusE-speaking households in Sydney, Australia. Their amount of exposure to non-native languages or non-AusE accents ranged from 0 to no more than 12 h per week, none of which included the CE accent, as indicated by parental report. They were recruited via advertisements at pregnancy and parenthood fairs and parents&#x02019; magazines. Another 30 infants were tested but excluded from the final sample because of fussiness (<italic>n</italic><sub>AusE</sub> = 16; <italic>n</italic><sub>CE</sub> = 3), parental interference (<italic>n</italic><sub>CE</sub> = 1), pre-existing hearing loss (<italic>n</italic><sub>AusE</sub> = 1), obstruction of gaze from experimenter (<italic>n</italic><sub>AusE</sub> = 1) or because they did not meet the habituation criterion (<italic>n</italic><sub>AusE</sub> = 6; <italic>n</italic><sub>CE</sub> = 2).</p>
</sec>
<sec>
<title>STIMULI AND APPARATUS</title>
<p>Participants were exposed to three CVC non-words during the task, namely DEET (/dit/), DIT (/dt/) and DOOT (/dut/). The CE stimuli were the same as those used in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, which were produced by a female native speaker of CE. For the present study, we recorded a female native speaker of AusE who produced the same three CVC non-words. Both sets of stimuli were recorded at a 44 kHz sample rate directly onto a computer.</p>
<p>It was discovered that in the set of tokens for DEET, DIT, and DOOT used in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, the first three and last three tokens were identical. This was mirrored when developing the AusE stimuli for the current study, such that both the CE and AusE speakers produced seven tokens of each CVC item, using the same range of infant-directed contours, with the first three tokens repeated at the end to create 10 tokens. The AusE speaker used the CE stimuli as models to match the F0 (fundamental frequency) contours as closely as possible. Following <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, infants were presented with a single sound file for each of the three words. The AusE sound files mirrored the CE sound files in token sequence (i.e., sequence of intonation contours), inter-stimulus interval and total duration of 20 s.</p>
<p>While the difference in the production of the consonants surrounding the vowels (/d/ and /t/) across the two accents was negligible, the vowels were judged by the first three authors (two trained phoneticians, one a non-native speaker of English, the other a native speaker of northern-cities American English, and the third a native speaker of AusE) to differ perceptibly and substantially between the two accents. These observations were confirmed by the F1, F2, and F3 values of the vowels in the two accents shown in Figure <xref ref-type="fig" rid="F1">1</xref> and Table <xref ref-type="table" rid="T1">1</xref><sup><xref ref-type="fn" rid="fn03">3</xref></sup>. The table also includes measures of vowel duration and F0. Formant measurements were taken from the midpoint of the vowel (50% of total vowel duration).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Average formant values, F0, and vowel duration for the vowels in the native accent (AusE) and unfamiliar accent (CE).</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"></th>
<th valign="top" align="center" colspan="3">Australian English (AusE)<hr/></th>
<th valign="top" align="center" colspan="3">Canadian English (CE)<hr/></th>
</tr>
<tr>
<th valign="top" align="left"></th>
<th valign="top" align="left">DEET<break/> /i/</th>
<th valign="top" align="left">DIT<break/> //</th>
<th valign="top" align="left">DOOT<break/> /u/</th>
<th valign="top" align="left">DEET<break/> /i/</th>
<th valign="top" align="left">DIT<break/> //</th>
<th valign="top" align="left">DOOT<break/> /u/</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">F1</td>
<td valign="top" align="left">498.7<break/> (72.4)</td>
<td valign="top" align="left">465.5<break/> (60.6)</td>
<td valign="top" align="left">461.0<break/> (80.6)</td>
<td valign="top" align="left">389.1<break/> (44.8)</td>
<td valign="top" align="left">620.2<break/> (73.4)</td>
<td valign="top" align="left">451.4<break/> (42.3)</td>
</tr>
<tr>
<td valign="top" align="left">F2</td>
<td valign="top" align="left">2581.9<break/> (226.9)</td>
<td valign="top" align="left">2677.5<break/> (66.1)</td>
<td valign="top" align="left">2156.2<break/> (178.3)</td>
<td valign="top" align="left">2622.2<break/> (121.5)</td>
<td valign="top" align="left">2276.8<break/> (111.1)</td>
<td valign="top" align="left">1496.2<break/> (114.7)</td>
</tr>
<tr>
<td valign="top" align="left">F3</td>
<td valign="top" align="left">3193.6<break/> (303.9)</td>
<td valign="top" align="left">3182.0<break/> (246.8)</td>
<td valign="top" align="left">2719.7<break/> (258.2)</td>
<td valign="top" align="left">3025.5<break/> (182.5)</td>
<td valign="top" align="left">2937.8<break/> (158.7)</td>
<td valign="top" align="left">2471.8<break/> (199.6)</td>
</tr>
<tr>
<td valign="top" align="left">F0</td>
<td valign="top" align="left">273.8<break/> (88.8)</td>
<td valign="top" align="left">311.8<break/> (78.7)</td>
<td valign="top" align="left">265.9<break/> (76.8)</td>
<td valign="top" align="left">312.9<break/> (106.1)</td>
<td valign="top" align="left">271.5<break/> (55.1)</td>
<td valign="top" align="left">272.4<break/> (76.5)</td>
</tr>
<tr>
<td valign="top" align="left">duration</td>
<td valign="top" align="left">253.5<break/> (51.7)</td>
<td valign="top" align="left">244.0<break/> (59.9)</td>
<td valign="top" align="left">298.9<break/> (99.2)</td>
<td valign="top" align="left">302.6<break/> (42.1)</td>
<td valign="top" align="left">245.7<break/> (28.7)</td>
<td valign="top" align="left">300.8<break/> (38.5)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>Formant measurements (in Hz) were taken from the midpoint of the vowel (50\% of total vowel length). Duration is in ms. Values in parentheses represent one standard deviation from the mean.</italic></attrib></table-wrap-foot>
</table-wrap>
<p>The values in Table <xref ref-type="table" rid="T1">1</xref> show that the CE stimuli indeed have larger intervocalic differentiation in F1 and F2 than the AusE stimuli, confirming our hypothesis that the acoustic features (or articulatory correlates) of CE vowels could be used as clearer cues to vowel discrimination than those of AusE vowels. Specifically, as shown in Figure <xref ref-type="fig" rid="F1">1</xref> and discussed in the introduction, the vowels in the CE stimuli show larger phonetic distinctions than the vowels in the AusE stimuli, as the former stimuli have acoustic properties that match (&#x0201C;&#x02192;&#x0201D;) those of highly distinct AusE vowels: CE DEET &#x02192; AusE /i/ or //, CE DIT &#x02192; AusE /e/, and CE DOOT &#x02192; AusE //. Thus the prediction set forth by LP and PAM that CE vowels would be better discriminated than AusE vowels apply to the specific stimuli used in the present study.</p>
<p>The visual stimuli used in the familiarization and test phases were two of the images used by <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>. One attractive novel object (see Figure <xref ref-type="fig" rid="F2">2A</xref>) was used for the familiarization phase (habituation) and test trials, and a toy waterwheel (Figure <xref ref-type="fig" rid="F2">2B</xref>) was used for both the pre- and post-tests. Similar to the presentation procedure in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, the novel object moved back and forth across the screen at a slow and constant speed, while the waterwheel was filmed with its arms moving in a rotating motion.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>Looking time to the Same (DEET) test trial, and two Switch trials (DIT, DOOT) for the AusE and CE stimuli groups.</bold> Error bars represent one standard error.</p></caption>
<graphic xlink:href="fpsyg-05-01059-g002.tif"/>
</fig>
</sec>
<sec>
<title>PROCEDURE</title>
<p>We used the simple version of the Switch design (<xref ref-type="bibr" rid="B48">Stager and Werker, 1997</xref>, experiments 2 and 3), which we modified to include two types of Switch trials rather than one so that each participant was presented with all three vowel contrasts. During familiarization to the novel word-object association, infants were presented with a single word-object pairing, which consisted of the crown object (Figure <xref ref-type="fig" rid="F2">2A</xref>) paired and ten tokens of the word DEET. As in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, each familiarization trial had a duration of 20 s, where the infants heard a sound file containing 10 tokens of the word DEET produced by either the CE speaker or the AusE speaker. Each trial started when the infant looked at a looming attention getter. Looking time to the screen for each trial was coded online, and familiarization trials repeated until participants reached a pre-set fixed habituation criterion (two consecutive trials with &#x0003C;65% of looking time from the average of the first two trials). Once the habituation criterion was met, three test trials were presented, each of them starting when the infant looked at a looming attention getter, as during familiarization. In the Same trial, the same 10 tokens of the word DEET and the crown object were presented. In the two types of Switch trials, the pairing was violated. That is, infants saw the same object moving but heard ten tokens of a different word in each Switch trial: DIT or DOOT.</p>
<p>As in previous early word learning studies that used the Switch design, if infants do not recognize the auditory word presented in a Switch trial to be different from the word presented to them during familiarization, the Same (DEET) and Switch trials (DIT or DOOT) would be equally familiar, resulting in equal looking times for both types of trials. This would be interpreted as infants&#x02019; failure to discriminate the vowel in familiarization trials (DEET) from the vowel in the Switch trial (DIT or DOOT). Conversely, if infants do recognize that the auditory word presented in the Switch trial is different than the word presented in the familiarization trials, they would look longer to Switch than Same trials, which would be interpreted as discrimination of the vowels presented in the Switch trials. In order to rule out the possible effect of order of Same and Switch trials, infants in both the CE and AusE stimulus condition were presented with three different orders for the test trials: (1) DEET&#x02013;DOOT&#x02013;DIT (Same&#x02013;Switch1&#x02013;Switch2), (2) DOOT&#x02013;DEET&#x02013;DIT (Switch1&#x02013;Same&#x02013;Switch2), and (3) DIT&#x02013;DOOT&#x02013;DEET (Switch2&#x02013;Switch1&#x02013;Same). Each accent &#x000D7; order group contained four infants (two females, two males).</p>
<p>The familiarization and test trials were preceded (pre-test trial) and followed (post-test trial) by a trial in which the waterwheel object (Figure <xref ref-type="fig" rid="F2">2B</xref>) was presented together with 10 tokens of the novel word LARD<sup><xref ref-type="fn" rid="fn04">4</xref></sup>, produced by a different female AusE speaker in infant-directed speech. This was to ensure that the infants recovered (i.e., showed an increase in looking time) when presented with a large acoustic-phonetic change in the auditory word and visual referent, indicating that they were not fatigued or generally disinterested in the task.</p>
</sec>
</sec>
<sec>
<title>RESULTS</title>
<p>We first analyzed levels of attention during the pre- and post-test trials as well as performance during familiarization to assure that group differences during testing were not attributable to differences in their overall attention or in their rate of habituation. With respect to overall attention to the task, a mixed 2 (trial: post-test vs. last familiarization trial) &#x000D7; 2 (stimulus: CE vs. AusE) analysis of variance (ANOVA) revealed a significant effect of trial [<italic>F</italic>(1,46) = 371.11, <italic>p</italic> &#x0003C; 0.001; <inline-formula><mml:math id="M1"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.89], with infants looking longer to the post-test trial (<italic>M</italic> = 18.31 s, <italic>SD</italic> = 1.71) than to the average of the last two familiarization trials (<italic>M</italic> = 7.89 s, <italic>SD</italic> = 3.18), and there was no interaction with accent. Thus, infants&#x02019; engagement in the task persisted until the end of the experiment in both accent conditions. Regarding their performance during familiarization, an independent-samples <italic>t</italic>-test revealed no difference in average looking time to the last two familiarization trials across accent conditions [<italic>t</italic>(46) = -0.92, <italic>p</italic> = 0.363, 95% CI (-2.12, 0.79)]. Furthermore, an independent-samples <italic>t</italic>-test on the number of familiarization trials, which were between 4 and 24 for all infants (<italic>M</italic> = 8.88, <italic>SD</italic> = 4.33), did not differ between CE and AusE stimulus conditions [<italic>t</italic>(46) = 0.20, <italic>p</italic> = 0.84, (-2.29, 2.79)]. Together, these results suggest that neither overall looking time nor degree of habituation were different across the accent groups and are therefore not predictive of differences during testing.</p>
<p>To test our prediction that detection of a switch in the test trials would differ between the two accent groups, we conducted a repeated measures ANOVA using looking time during test trials as the dependent variable, with test trial (Same = DEET vs. Switch = DIT vs. Switch = DOOT) as a within-subject factor, and accent of the stimuli (CE vs. AusE) and order of test trials (DEET&#x02013;DOOT&#x02013;DIT vs. DOOT&#x02013;DEET&#x02013;DIT vs. DIT&#x02013;DOOT&#x02013;DEET) as between-subjects factors. This revealed a main effect of test trial [<italic>F</italic>(2,84) = 4.55, <italic>p</italic> = 0.013, <inline-formula><mml:math id="M2"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.10], as well as a trend toward a main effect of order of test trials [<italic>F</italic>(2,42) = 2.98, <italic>p</italic> = 0.062, <inline-formula><mml:math id="M3"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.12]. Participants who received the test trials in the order DOOT&#x02013;DEET&#x02013;DIT looked longer during the test overall compared to participants who received trials in the order DEET&#x02013;DOOT&#x02013;DIT [<italic>t</italic>(31) = 2.08, <italic>p</italic> = 0.043, 95% CI (0.08, 5.35)] or DIT&#x02013;DOOT&#x02013;DEET [<italic>t</italic>(31) = 2.15, <italic>p</italic> = 0.038, (0.17, 5.43)]. There was also a trend toward an interaction between test trials &#x000D7; accent [<italic>F</italic>(2,84) = 2.82, <italic>p</italic> = 0.065, <inline-formula><mml:math id="M4"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.06]. Independent-samples <italic>t</italic>-tests comparing looking time to each test trial between accent conditions revealed no significant difference in looking time to test trials between accents, but a trend toward longer looking to DEET (Same) in AusE relative to the CE condition [<italic>t</italic>(46) = -1.83, <italic>p</italic> = 0.074, (-4.70, 0.22)].</p>
<p>To follow up the main effect of test trial, we conducted simple effects tests comparing participants&#x02019; looking time to each of the Switch trials (DIT, DOOT) with looking time to the Same trial (DEET). Looking time was greater for DIT (Switch; <italic>M</italic> = 10.56 s, <italic>SD</italic> = 4.60) than for DEET [Same; <italic>M</italic> = 9.32 s, <italic>SD</italic> = 4.34; <italic>F</italic>(1,42) = 4.84, <italic>p</italic> = 0.033, <inline-formula><mml:math id="M5"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.10], and was greater for DOOT (Switch; <italic>M</italic> = 10.45 s, <italic>SD</italic> = 4.71) than for DEET [Same; <italic>F</italic>(1,42) = 8.62, <italic>p</italic> = 0.005, <inline-formula><mml:math id="M6"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.17].</p>
<p>For our specific prediction that participants would show a greater magnitude of difference in looking time to Switch trials relative to the Same trial for the CE than for the AusE stimuli condition, we carried out simple effect tests on participants&#x02019; performance on each test trial for the CE and AusE conditions separately. As can be seen in Figure <xref ref-type="fig" rid="F3">3</xref>, participants in the CE condition had longer looking times for DIT (Switch; <italic>M</italic> = 10.84 s, <italic>SD</italic> = 4.49) than for DEET [Same; <italic>M</italic> = 8. 20 s, <italic>SD</italic> = 4.34; <italic>F</italic>(1,23) = 8.66, <italic>p</italic> = 0.007, <inline-formula><mml:math id="M7"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.27], and for DOOT (Switch; <italic>M</italic> = 10.45 s, <italic>SD</italic> = 4.71) than for DEET [Same; <italic>F</italic>(1,23) = 6.39, <italic>p</italic> = 0.019, <inline-formula><mml:math id="M8"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.22]. In contrast, for participants in the AusE condition, simple effects tests showed that there was no difference between looking times to DIT (<italic>M</italic> = 10.28 s, <italic>SD</italic> = 4.75) and DEET [<italic>M</italic> = 10.44 s, <italic>SD</italic> = 4.13; <italic>F</italic>(1,23) = 0.49, <italic>p</italic> = 0.827, <inline-formula><mml:math id="M9"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> &#x0003C; 0.01], or between DOOT (<italic>M</italic> = 11.69 s, <italic>SD</italic> = 4.38) and DEET [<italic>F</italic>(1,23) = 2.28, <italic>p</italic> = 0.145, <inline-formula><mml:math id="M10"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.09]. Thus, participants in the CE condition distinguished both DIT and DOOT from DEET, while those in the AusE condition did not make either of these two distinctions.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p><bold>Average spectral change for the vowel /i/ in the ten familiarization tokens of DEET for the two accents.</bold> The accent label and the end of each line are plotted at the average formant frequency (across tokens) at 75% of the vowel duration, and each line originates at the average formant frequency at 25% of the vowel duration. There was a larger movement of the formants across the 25 and 75% points of the vowels in the AusE than in CE.</p></caption>
<graphic xlink:href="fpsyg-05-01059-g003.tif"/>
</fig>
<p>To determine whether there were differences in spectral variation across the CE and AusE word DEET, which may have been responsible for the differential performance in the two accent conditions, measures of F1 and F2 were taken at 25 and 75% of the vowel for each of the 10 familiarization tokens. Using F1 and F2 measures as the dependent variables, we ran two (2) &#x000D7; 2 ANOVAs, with time (25, 75%) as a within-subjects factor, and accent (AusE, CE) as a between-subjects factor. For the F1 measure, there was a main effect of time [<italic>F</italic>(1,18) = 38.16, <italic>p</italic> &#x0003C; 0.001, <inline-formula><mml:math id="M11"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.68] and accent [<italic>F</italic>(1,18) = 15.19, <italic>p</italic> = 0.001, <inline-formula><mml:math id="M12"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.68], as well as a time &#x000D7; accent interaction [<italic>F</italic>(1,18) = 9.43, <italic>p</italic> = 0.007, <inline-formula><mml:math id="M13"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.34]. For the F2 measures, there was a main effect of time [<italic>F</italic>(1,18) = 83.39, <italic>p</italic> &#x0003C; 0.001, <inline-formula><mml:math id="M14"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.82] and a time &#x000D7; accent interaction [<italic>F</italic>(1,18) = 21.93, <italic>p</italic> &#x0003C; 0.001, <inline-formula><mml:math id="M15"><mml:msubsup><mml:mi mathvariant='normal' mathcolor='black'>&#x03b7;</mml:mi><mml:mi mathvariant='normal' mathcolor='black'>p</mml:mi><mml:mn mathvariant='normal' mathcolor='black'>2</mml:mn></mml:msubsup></mml:math></inline-formula> = 0.55]. As can be seen in Figure <xref ref-type="fig" rid="F4">4</xref>, spectral change is much larger for the DEET vowel in the AusE than in the CE stimuli. This larger variation within the 10 AusE tokens may explain the longer looking times to DEET Same trials during the test phase, as participants may have treated some AusE tokens as containing different vowels. In that respect, it is worth mentioning that five of the seven infants who did not meet criterion were in the AusE condition, which indicates that a larger number of infants in this condition relative to the CE condition failed to habituate to their DEET trial.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><bold>Average spectral change for the vowel /i/ in the ten familiarization tokens of DEET for the two accents.</bold> The accent label and the end of each line are plotted at the average formant frequency (across tokens) at 75% of the vowel duration, and each line originates at the average formant frequency at 25% of the vowel duration. There was a larger movement of the formants across the 25 and 75% points of the vowels in the AusE than in CE.</p></caption>
<graphic xlink:href="fpsyg-05-01059-g004.tif"/>
</fig>
</sec>
<sec>
<title>DISCUSSION</title>
<p>This study compared AusE-learning 15-month-olds&#x02019; ability to learn a novel word-object pairing (DEET) and subsequently distinguish it from pairings that included the same referent object, but switched the spoken word to two words that differed from the original word by their vowel (DOOT and DIT). The novel word and two foils were produced in either the participants&#x02019; native AusE accent, or an unfamiliar accent, CE. The young word learners distinguished the newly learned word from the two vowel-differing alternates when words were spoken in CE, but not when they were produced in their native AusE accent. That is, only children who heard the CE words showed a recovery in looking time from the Same trial to the Switch trials.</p>
<p>These results demonstrate for the first time that children younger than 17 months can distinguish minimal vowel pairs in which the vowels primarily differ along acoustic dimensions other than F1. <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref> found that CE-learning 15-month-olds discriminated only the contrast /i/&#x02013;//, which is primarily differentiated in F1. Based on this, the authors proposed that F1 has special status in vowel discrimination in early word learning, and speculated that this may be due either to F1 having more energy in the speech signal compared to F2 and F3, or to F1 differentiating a wide range of vowel contrasts in CE. Here, AusE-learning 15-month-olds noticed a change from the familiarized DEET stimulus regardless of whether the Switch-trial vowels differed mainly in the F1 dimension (DEET&#x02013;DIT) or F2 dimension (DEET&#x02013;DOOT). This contradicts the findings of <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref> and their proposal that F1 is more important than F2 in vowel discrimination by children of this age. It seems that the utilization of phonetic detail in early word learning is not universal, but rather is dependent on how phonetic dimensions are perceived by specific listener groups based on their native accent experience.</p>
<p>Alternatively, the different findings across studies could be explained by their different procedures. Specifically, despite the fact that <xref ref-type="bibr" rid="B48">Stager and Werker (1997)</xref> also found word learning difficult with the single word-object version of the Switch task used in the present study, this simpler familiarization phase may have triggered word discrimination rather than word-object association in our study. This possibility is unlikely, however, as it would suggest that two groups of infants of the same age used different processing strategies when presented with the same task, i.e., discrimination for the group presented with CE stimuli and word-object association for the group presented with AusE stimuli. Future studies should further explore this possibility by presenting CE infants with our single-word familiarization or AusE infants with <xref ref-type="bibr" rid="B14">Curtin et al.&#x02019;s (2009)</xref> two-word familiarization. Further research should also examine the possibility that infants might resort to different processing strategies for stimuli produced in their native vs. a non-native accent.</p>
<p>The present findings showing that 15-month-olds detect differences in vowel minimal pairs is in contrast with work showing that children under 17 months are unable to learn novel vowel minimal pairs in an interactive object reaching task (but do learn novel consonant minimal pairs; <xref ref-type="bibr" rid="B36">Nazzi, 2005</xref>; <xref ref-type="bibr" rid="B38">Nazzi and New, 2007</xref>; <xref ref-type="bibr" rid="B28">Havy and Nazzi, 2009</xref>; <xref ref-type="bibr" rid="B37">Nazzi et al., 2009</xref>). As discussed in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref>, this disparity may be due to differences between Nazzi and colleagues&#x02019; interactive object-reaching task, which used live pronunciations in a natural sentential context by speakers interacting with the participants, and the task used in the present study, which used previously recorded strings of single word utterances. It may be that when interacting with a real speaker, children younger than 17 months relax their tolerance for vowel variation in a way that they do not for consonants or for less interactive settings. Additionally, as the stimuli in the present study were comprised of strings of single words differing only in their vowel, this may have focused children&#x02019;s attention to the vowel differences in a way that would be less likely to occur in a more natural language setting.</p>
<p>The most striking finding is that AusE children&#x02019;s success with F1 and F2 minimal pair vowel distinctions was limited to words produced in the unfamiliar CE accent. The NLM model (<xref ref-type="bibr" rid="B29">Kuhl, 1991</xref>, <xref ref-type="bibr" rid="B30">1994</xref>) would predict that familiarity with words and vowels in the native accent should lead to better discrimination of minimal pairs in the native accent compared to an unfamiliar accent. Our findings also pose a substantial challenge to exemplar models and other models of early word learning that rely on tracking of statistical distributions in the input (e.g., <xref ref-type="bibr" rid="B45">Saffran et al., 1996</xref>). Such models cannot explain why young children fail to distinguish minimal pairs in the Switch task when the words are produced in their native accent, but succeed when they are produced in an unfamiliar accent. This is in part because neither approach explicitly considers how the cognitive demands of the experimental task may affect performance, specifically that some tasks may make it more difficult to pay attention to small phonetic differences in early word learning.</p>
<p>Our results support the phonetic magnitude hypothesis that we put forward in the introduction, which posits that in a demanding task, such as word learning for novice learners, the magnitude of the phonetic distinction between two vowel sounds predicts successful learning and discrimination of vowels in a word learning context (<xref ref-type="bibr" rid="B14">Curtin et al., 2009</xref>). This appears to occur irrespective of the regional accent spoken in the native environment. As can be seen in Table <xref ref-type="table" rid="T1">1</xref>, the F1 and F2 distinctions between the vowel contrasts are greater in CE than in AusE. The AusE-learning infants distinguished the CE vowel minimal pairs, but their performance was less reliable when listening to the same vowel contrasts in their native AusE accent. Our study thus shows that if an infant is presented with novel word-object pairings in only one accent, rather than novel words across accents (<xref ref-type="bibr" rid="B47">Schmale et al., 2011</xref>, <xref ref-type="bibr" rid="B46">2012</xref>), minimally different words that are distinguished by a large phonetic contrast are easier to learn than those with a smaller phonetic contrast, regardless of whether the accent in which the words are produced is familiar or novel.</p>
<p>Specifically, we believe that the small phonetic difference between AusE vowels, rather than a difference in performance by infants across accent groups, better explains our results given the much larger attrition rate for infants in the AusE vs. CE condition. As shown in the participants section, 22 infants in the AusE condition were excluded from analysis because of either fussiness during the experiment or because they did not meet the habituation criteria, while only 5 infants in the CE condition were excluded for the same two reasons. Thus, infants had more trouble performing the task when presented with the AusE than with the CE stimuli, suggesting difficulty processing the native AusE stimuli.</p>
<p>Furthermore, recent results from our lab (<xref ref-type="bibr" rid="B20">Escudero et al., accepted</xref>) demonstrate that AusE adult listeners also have difficulty learning the same AusE vowel minimal pairs of the present study. Adult AusE listeners were tested on their ability to identify the correct word-object associations after a short exposure to word-object referent pairs that could only be inferred across trials. They had fewer correct answers to minimal pairs involving the words DIT, DEET, DOOT, and DUT than when the minimal pairs involved the words BON, DON, PON, and TON (used to identify consonant minimal pairs). Since the vowel minimal pairs included the same vowels and consonants presented in the current study, it can be concluded that these AusE vowel minimal pairs are difficult to perceive even for native-accent adult listeners. Although the vowels in the CE stimuli do not have properties that are exactly the same as the acoustically closest AuE vowels (Figure <xref ref-type="fig" rid="F1">1</xref>), and would therefore be less frequent in the AusE infants&#x02019; linguistic environment, the magnitude of their phonetic contrast is much larger than that of their AusE counterparts, and according to our phonetic magnitude hypothesis and our results, easier to discriminate and use in early word learning. It remains to be tested whether AusE adults also have less trouble learning the same vowel minimal pairs when produced in another regional accent of English in which the magnitude of the same vowel contrasts is larger (e.g., CE or American English). That would mean that our phonetic magnitude hypothesis might apply across the lifespan when task demands are high, for instance, when having to demonstrate word learning after only a few minutes of exposure in an implicit learning task.</p>
<p>The findings are in line with the LP model, which can be considered a theoretical and computational implementation of the phonetic magnitude hypothesis (<xref ref-type="bibr" rid="B7">Boersma et al., 2003</xref>; <xref ref-type="bibr" rid="B18">Escudero and Boersma, 2004</xref>; <xref ref-type="bibr" rid="B16">Escudero, 2005</xref>, <xref ref-type="bibr" rid="B17">2009</xref>). The LP model asserts that infants&#x02019; vowel categories are emergent and based on the specific auditory dimensions that are most salient to infants depending on their native accent and their age. This means that adults, children, and infants exposed to different accents are likely to differ in the way they weight the auditory dimensions of any given vowel token (native or non-native). Within the model, the saliency or perceptual weight of a phonetic dimension, such as F1 or F2, depends on the magnitude of the phonetic difference it offers in a specific accent. It is proposed that young infants, who do not yet have a well-developed lexicon, may concentrate on the most salient phonetic cue, while ignoring other less salient ones. From an LP perspective, AusE children are exposed to very small differences in F1 and F2 in the production of their native vowels /i/, // and /u/, and therefore hear large enough differences between the CE productions of the same vowels along both dimensions, which explains why they more easily discriminate them. In contrast, CE infants are exposed to larger F1 than F2 distinctions for these three vowels, which is the explanation given in <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref> for their asymmetric findings. Thus, the reason why AusE children rely on both F1 and F2 for the CE stimuli is because both dimensions are as salient to them, while the same two dimensions are equally difficult to distinguish in the AusE stimuli. Following the LP model, we predict that CE infants would have the same failure to distinguish AusE vowels as AusE infants, due to the small, non-salient contrast for F1 and F2 in the AusE vowels.</p>
<p>The PAM model presumes that native categories are in place by 15 months, but that they have not yet necessarily become phonological contrasts used for differentiation of words. Instead, these more advanced lexical skills appear to emerge later on, and are associated with the expressive vocabulary expansion that occurs around 19 months (<xref ref-type="bibr" rid="B6">Best et al., 2009</xref>; <xref ref-type="bibr" rid="B34">Mulak and Best, 2013</xref>; <xref ref-type="bibr" rid="B35">Mulak et al., 2013</xref>). At 15 months, discrimination of native and non-native segments is dependent on mappings to L1 categories. While this could predict better performance in the native accent, it may be that the AusE-learning children perceived the CE /i/&#x02013;// vowel contrast as corresponding to the phonetically larger AusE /i/&#x02013;/&#x1D700;/ contrast, and the CE /i/&#x02013;/u/ contrast to the phonetically larger AusE /i/&#x02013;// contrast (see Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<p>Under high cognitive load, such as in the word learning task of the present study, reliable phonetic cues may play a larger role, in line with both LP and PAM. The results of this study are thus consistent with performance being linked to unidimensional distinctions between vowels, as proposed within the LP framework, rather than the multidimensional approach in adult listening. This holds regardless of whether each stimulus dimension is characterized in terms of acoustic dimensions (F1 and F2 values: LP) or articulatory distinctions (vowel height and jaw opening: PAM). Further research should show whether the use of reliable phonetic cues is a developmental stage in L1 phonological acquisition, as proposed by the LP model, a strategy used in highly demanding word-learning situations, or a combination of both.</p>
<p>In sum, these results show that success in early word learning depends on the magnitude of the phonetic (acoustic or articulatory) distance between novel vowel minimal pairs, and not on familiarity with the specific productions of the words (native vs. non-native accent), nor on the universal salience of a specific acoustic dimension (e.g., F1 vs. F2). Current models of early language development should consider the role of phonetic distance in perceptual and lexical development and how this may vary as a function of task demands.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>This research was supported by MARCS Institute start-up funds (Paola Escudero), and Australian Research Council grants DP130102181 (CI Paola Escudero) and DP130104237 (CIs Catherine T. Best &#x00026; Christine Kitamura). We would like to thank Anne Dwyer and Michelle Pal for assistance with participant recruitment and testing, Anne Dwyer for recording the AusE stimuli, and Suzanne Curtin and Christopher Fennell for sharing the CE stimuli. We also thank the families who participated in this research.</p>
</ack>
<ref-list>
<title>REFERENCES</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aslin</surname> <given-names>R. N.</given-names></name> <name><surname>Pisoni</surname> <given-names>D. B.</given-names></name></person-group> (<year>1980</year>). &#x0201C;<article-title>Some developmental processes in speech perception</article-title>,&#x0201D; in <source><italic>Child Phonology</italic></source> <volume>Vol. 2</volume> <source><italic>Perception</italic></source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Yeni-Komshian</surname> <given-names>G. H.</given-names></name> <name><surname>Kavanagh</surname> <given-names>J. F.</given-names></name> <name><surname>Ferguson</surname> <given-names>C. A.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Academic Press</publisher-name>).</citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Best</surname> <given-names>C. T.</given-names></name></person-group> (<year>1994</year>). <article-title>&#x0201C;The emergence of native-language phonological influences in infants: a perceptual assimilation model,&#x0201D; in</article-title> <source><italic>Development of Speech Perception: The Transition from Speech Sounds to Spoken Words,</italic></source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Goodman</surname> <given-names>J.</given-names></name> <name><surname>Nusbaum</surname> <given-names>H. C.</given-names></name></person-group> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>167</fpage>&#x02013;<lpage>224</lpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Best</surname> <given-names>C. T.</given-names></name></person-group> (<year>1995</year>). <article-title>&#x0201C;A direct realist perspective on cross-language speech perception,&#x0201D; in</article-title> <source><italic>Speech Perception and Linguistic Experience: Issues in Cross-Language Research</italic>,</source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Strange</surname> <given-names>W.</given-names></name> <name><surname>Jenkins</surname> <given-names>J. J.</given-names></name></person-group> (<publisher-loc>Timonium, MD</publisher-loc>: <publisher-name>York Press</publisher-name>), <fpage>171</fpage>&#x02013;<lpage>204</lpage>.</citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Best</surname> <given-names>C. T.</given-names></name> <name><surname>McRoberts</surname> <given-names>G. W.</given-names></name> <name><surname>LaFleur</surname> <given-names>R.</given-names></name> <name><surname>Silver-Isenstadt</surname> <given-names>J.</given-names></name></person-group> (<year>1995</year>). <article-title>Divergent developmental patterns for infants&#x02019; perception of two nonnative consonant contrasts.</article-title> <source><italic>Infant Behav. Dev.</italic></source> <volume>18</volume> <fpage>339</fpage>&#x02013;<lpage>350</lpage>. <pub-id pub-id-type="doi">10.1016/0163-6383(95)90022-5</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Best</surname> <given-names>C. T.</given-names></name> <name><surname>McRoberts</surname> <given-names>G. W.</given-names></name> <name><surname>Sithole</surname> <given-names>N. M.</given-names></name></person-group> (<year>1988</year>). <article-title>Examination of perceptual reorganization for nonnative speech contrasts: Zulu click discrimination by English-speaking adults and infants.</article-title> <source><italic>J. Exp. Psychol. Hum. Percept. Perform.</italic></source> <volume>14</volume> <fpage>345</fpage>&#x02013;<lpage>360</lpage>. <pub-id pub-id-type="doi">10.1037/0096-1523.14.3.345</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Best</surname> <given-names>C. T.</given-names></name> <name><surname>Tyler</surname> <given-names>M. D.</given-names></name> <name><surname>Gooding</surname> <given-names>T. N.</given-names></name> <name><surname>Orlando</surname> <given-names>C. B.</given-names></name> <name><surname>Quann</surname> <given-names>C. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Development of phonological constancy: toddlers&#x02019; perception of native- and Jamaican-accented words.</article-title> <source><italic>Psychol. Sci.</italic></source> <volume>20</volume> <fpage>539</fpage>&#x02013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9280.2009.02327.x</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boersma</surname> <given-names>P.</given-names></name> <name><surname>Escudero</surname> <given-names>P.</given-names></name> <name><surname>Hayes</surname> <given-names>R.</given-names></name></person-group> (<year>2003</year>). <article-title>&#x0201C;Learning abstract phonological from auditory phonetic categories: An integrated model for the acquisition of language-specific sound categories,&#x0201D; in</article-title> <source><italic>Proceedings of the 15th International Congress of Phonetic Sciences</italic></source> <volume>Vol. 1013</volume> <publisher-loc>Barcelona</publisher-loc>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burnham</surname> <given-names>D. K.</given-names></name></person-group> (<year>1986</year>). <article-title>Developmental loss of speech perception: exposure to and experience with a first language.</article-title> <source><italic>Appl. Psycholinguist.</italic></source> <volume>7</volume> <fpage>207</fpage>&#x02013;<lpage>239</lpage>. <pub-id pub-id-type="doi">10.1017/S0142716400007542</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chl&#x000E1;dkov&#x000E1;</surname> <given-names>K.</given-names></name> <name><surname>Escudero</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>Comparing vowel perception and production in Spanish and Portuguese: European versus Latin American dialects.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>131</volume> EL119&#x02013;EL125. <pub-id pub-id-type="doi">10.1121/1.3674991</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chl&#x000E1;dkov&#x000E1;</surname> <given-names>K.</given-names></name> <name><surname>Podlipsk&#x000FD;</surname> <given-names>V. J.</given-names></name></person-group> (<year>2011</year>). <article-title>Native dialect matters: perceptual assimilation of dutch vowels by Czech listeners.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>130</volume> EL186&#x02013;EL192. <pub-id pub-id-type="doi">10.1121/1.3629135</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cox</surname> <given-names>F.</given-names></name></person-group> (<year>2006</year>). <article-title>The acoustic characteristics of /hVd/ vowels in the speech of some Australian teenagers.</article-title> <source><italic>Aust. J. Linguist.</italic></source> <volume>26</volume> <fpage>147</fpage>&#x02013;<lpage>179</lpage>. <pub-id pub-id-type="doi">10.1080/07268600600885494</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cox</surname> <given-names>F.</given-names></name> <name><surname>Palethorpe</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>An illustration of the IPA: Australian English.</article-title> <source><italic>J. Int. Phon. Assoc.</italic></source> <volume>37</volume> <fpage>341</fpage>&#x02013;<lpage>350</lpage>. <pub-id pub-id-type="doi">10.1017/S0025100307003192</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cristia</surname> <given-names>A.</given-names></name> <name><surname>Seidl</surname> <given-names>A.</given-names></name> <name><surname>Vaughn</surname> <given-names>C.</given-names></name> <name><surname>Bradlow</surname> <given-names>A.</given-names></name> <name><surname>Schmale</surname> <given-names>R.</given-names></name> <name><surname>Floccia</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Linguistic processing of accented speech across the lifespan.</article-title> <source><italic>Front. Psychol.</italic></source> <volume>3</volume>:<issue>479</issue>. <pub-id pub-id-type="doi">10.3389/fpsyg.2012.00479</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Curtin</surname> <given-names>S. A.</given-names></name> <name><surname>Fennell</surname> <given-names>C.</given-names></name> <name><surname>Escudero</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>Weighting of vowel cues explains patterns of word-object associative learning.</article-title> <source><italic>Dev. Sci.</italic></source> <volume>12</volume> <fpage>725</fpage>&#x02013;<lpage>731</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7687.2009.00814.x</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dietrich</surname> <given-names>C.</given-names></name> <name><surname>Swingley</surname> <given-names>D.</given-names></name> <name><surname>Werker</surname> <given-names>J. F.</given-names></name></person-group> (<year>2007</year>). <article-title>Native language governs interpretation of salient speech sound differences at 18 months.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>104</volume> <fpage>16027</fpage>&#x02013;<lpage>16031</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0705270104</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name></person-group> (<year>2005</year>). <source><italic>Linguistic Perception and Second Language Acquisition: Explaining the Attainment of Optimal Phonological Categorization</italic>.</source> <publisher-name>Ph.D. thesis, LOT Dissertation Series 113. Utrecht University, Utrecht</publisher-name>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;The linguistic perception of similar L2 sounds,&#x0201D; in</article-title> <source><italic>Phonology in Perception</italic>,</source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Boersma</surname> <given-names>P.</given-names></name> <name><surname>Hamann</surname> <given-names>S.</given-names></name></person-group> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Mouton de Gruyter</publisher-name>), <fpage>152</fpage>&#x02013;<lpage>190</lpage>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name> <name><surname>Boersma</surname> <given-names>P.</given-names></name></person-group> (<year>2004</year>). <article-title>Bridging the gap between L2 speech perception research and phonological theory.</article-title> <source><italic>Stud. Second Lang. Acquis.</italic></source> <volume>26</volume> <fpage>551</fpage>&#x02013;<lpage>585</lpage>. <pub-id pub-id-type="doi">10.10170/S02722631040-40021</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name> <name><surname>Chl&#x000E1;dkov&#x000E1;</surname> <given-names>K.</given-names></name></person-group> (<year>2010</year>). <article-title>Spanish listeners&#x02019; perception of American and Southern British English vowels.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>128</volume> EL254&#x02013;EL260. <pub-id pub-id-type="doi">10.1121/1.3488794</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name> <name><surname>Mulak</surname> <given-names>K. E.</given-names></name> <name><surname>Vlach</surname> <given-names>H. A.</given-names></name></person-group> (<comment>accepted</comment>). <article-title>Cross-situational learning of minimal word pairs.</article-title> <source><italic>Cogn. Sci.</italic></source></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name> <name><surname>Simon</surname> <given-names>E.</given-names></name> <name><surname>Mitterer</surname> <given-names>H.</given-names></name></person-group> (<year>2012</year>). <article-title>The perception of English front vowels by North Holland and Flemish listeners: acoustic similarity predicts and explains cross-linguistic and L2 perception.</article-title> <source><italic>J. Phon.</italic></source> <volume>40</volume> <fpage>280</fpage>&#x02013;<lpage>288</lpage>. <pub-id pub-id-type="doi">10.1016/j.wocn.2011.11.004</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Escudero</surname> <given-names>P.</given-names></name> <name><surname>Williams</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Native dialect influences second-language vowel perception: peruvian versus Iberian Spanish learners of Dutch.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>131</volume> EL406&#x02013;EL412. <pub-id pub-id-type="doi">10.1121/1.3701708</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fennell</surname> <given-names>C. T.</given-names></name></person-group> (<year>2012</year>). <article-title>Object familiarity enhances infants&#x02019; use of phonetic detail in novel words.</article-title> <source><italic>Infancy</italic></source> <volume>17</volume> <fpage>339</fpage>&#x02013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1111/j.1532-7078.2011.00080.x</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fennell</surname> <given-names>C. T.</given-names></name> <name><surname>Waxman</surname> <given-names>S. R.</given-names></name></person-group> (<year>2010</year>). <article-title>What paradox? Referential cues allow for infant use of phonetic detail in word learning.</article-title> <source><italic>Child Dev.</italic></source> <volume>81</volume> <fpage>1376</fpage>&#x02013;<lpage>1383</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8624.2010.01479.x</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fennell</surname> <given-names>C. T.</given-names></name> <name><surname>Werker</surname> <given-names>J. F.</given-names></name></person-group> (<year>2003</year>). <article-title>Early word learners&#x02019; ability to access phonetic detail in well-known words.</article-title> <source><italic>Lang. Speech</italic></source> <volume>46</volume> <fpage>245</fpage>&#x02013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1177/00238309030460020901</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Floccia</surname> <given-names>C.</given-names></name> <name><surname>Delle Luche</surname> <given-names>C.</given-names></name> <name><surname>Durrant</surname> <given-names>S.</given-names></name> <name><surname>Butler</surname> <given-names>J.</given-names></name> <name><surname>Goslin</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Parent or community: where do 20-month-olds exposed to two accents acquire their representation of words?</article-title> <source><italic>Cognition</italic></source> <volume>124</volume> <fpage>95</fpage>&#x02013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2012.03.011</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harrington</surname> <given-names>J.</given-names></name> <name><surname>Cox</surname> <given-names>F.</given-names></name> <name><surname>Evans</surname> <given-names>Z.</given-names></name></person-group> (<year>1997</year>). <article-title>An acoustic phonetic study of broad, general, and cultivated Australian English vowels.</article-title> <source><italic>Aust. J. Linguist.</italic></source> <volume>17</volume> <fpage>155</fpage>&#x02013;<lpage>184</lpage>. <pub-id pub-id-type="doi">10.1080/07268609708599550 </pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Havy</surname> <given-names>M.</given-names></name> <name><surname>Nazzi</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>Better processing of consonantal over vocalic information in word learning at 16 months of age.</article-title> <source><italic>Infancy</italic></source> <volume>14</volume> <fpage>439</fpage>&#x02013;<lpage>456</lpage>. <pub-id pub-id-type="doi">10.1080/15250000902996532</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhl</surname> <given-names>P. K.</given-names></name></person-group> (<year>1991</year>). <article-title>Human adults and human infants show a &#x0201C;perceptual magnet effect&#x0201D; for the prototypes of speech categories, monkeys do not.</article-title> <source><italic>Percept. Psychophys.</italic></source> <volume>50</volume> <fpage>93</fpage>&#x02013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.3758/BF03212211</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhl</surname> <given-names>P. K.</given-names></name></person-group> (<year>1994</year>). <article-title>Learning and representation in speech and language.</article-title> <source><italic>Curr. Opin. Neurobiol.</italic></source> <volume>4</volume> <fpage>812</fpage>&#x02013;<lpage>822</lpage>. <pub-id pub-id-type="doi">10.1016/0959-4388(94)90128-7</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhl</surname> <given-names>P. K.</given-names></name> <name><surname>Conboy</surname> <given-names>B. T.</given-names></name> <name><surname>Padden</surname> <given-names>D.</given-names></name> <name><surname>Nelson</surname> <given-names>T.</given-names></name> <name><surname>Pruitt</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Early speech perception and later language development: implications for the &#x0201C;critical period.&#x0201D;</article-title> <source><italic>Lang. Learn. Dev.</italic></source> <volume>1</volume> <fpage>237</fpage>&#x02013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1080/15475441.2005.9671948</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhl</surname> <given-names>P. K.</given-names></name> <name><surname>Williams</surname> <given-names>K.</given-names></name> <name><surname>Lacerda</surname> <given-names>F.</given-names></name> <name><surname>Stevens</surname> <given-names>K.</given-names></name> <name><surname>Lindblom</surname> <given-names>B.</given-names></name></person-group> (<year>1992</year>). <article-title>Linguistic experience alters phonetic perception in infants by 6 months of age.</article-title> <source><italic>Science</italic></source> <volume>255</volume> <fpage>606</fpage>&#x02013;<lpage>608</lpage>. <pub-id pub-id-type="doi">10.1126/science.1736364</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mani</surname> <given-names>N.</given-names></name> <name><surname>Plunkett</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>Phonological specificity of vowels and consonants in early lexical representations.</article-title> <source><italic>J. Mem. Lang.</italic></source> <volume>57</volume> <fpage>252</fpage>&#x02013;<lpage>272</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2007.03.005</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mulak</surname> <given-names>K. E.</given-names></name> <name><surname>Best</surname> <given-names>C. T.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Development of word recognition across speakers and accents,&#x0201D; in</article-title> <source><italic>Theoretical and Computational Models of Word Learning: Trends in Psychology and Artificial Intelligence,</italic></source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Gogate</surname> <given-names>L. J.</given-names></name> <name><surname>Hollich</surname> <given-names>G.</given-names></name></person-group> (<publisher-loc>Hershey</publisher-loc>: <publisher-name>IGI Global: Robotics Division</publisher-name>), <fpage>242</fpage>&#x02013;<lpage>269</lpage>. <pub-id pub-id-type="doi">10.4018/978-1-4666-2973-8.ch011</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mulak</surname> <given-names>K. E.</given-names></name> <name><surname>Best</surname> <given-names>C. T.</given-names></name> <name><surname>Tyler</surname> <given-names>M. D.</given-names></name> <name><surname>Kitamura</surname> <given-names>C.</given-names></name> <name><surname>Irwin</surname> <given-names>J. R.</given-names></name></person-group> (<year>2013</year>). <article-title>Development of phonological constancy: 19-month-olds, but not 15-month-olds, identify words spoken in a non-native regional accent.</article-title> <source><italic>Child Dev.</italic></source> <volume>84</volume> <fpage>2064</fpage>&#x02013;<lpage>2078</lpage>. <pub-id pub-id-type="doi">10.1111/cdev.12087</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nazzi</surname> <given-names>T.</given-names></name></person-group> (<year>2005</year>). <article-title>Use of phonetic specificity during the acquisition of new words: differences between consonants and vowels.</article-title> <source><italic>Cognition</italic></source> <volume>98</volume> <fpage>13</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2004.10.005 </pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nazzi</surname> <given-names>T.</given-names></name> <name><surname>Floccia</surname> <given-names>C.</given-names></name> <name><surname>Moquet</surname> <given-names>B.</given-names></name> <name><surname>Butler</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Bias for consonantal information over vocalic information in 30-month-olds: cross-linguistic evidence from French and English.</article-title> <source><italic>J. Exp. Child Psychol.</italic></source> <volume>102</volume> <fpage>522</fpage>&#x02013;<lpage>537</lpage>. <pub-id pub-id-type="doi">10.1016/j.jecp.2008.05.003</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nazzi</surname> <given-names>T.</given-names></name> <name><surname>New</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>Beyond stop consonants: consonantal specificity in early lexical acquisition.</article-title> <source><italic>Cogn. Dev.</italic></source> <volume>22</volume> <fpage>271</fpage>&#x02013;<lpage>279</lpage>. <pub-id pub-id-type="doi">10.1016/j.cogdev.2006.10.007</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nespor</surname> <given-names>M.</given-names></name> <name><surname>Pe&#x000F1;a</surname> <given-names>M.</given-names></name> <name><surname>Mehler</surname> <given-names>J.</given-names></name></person-group> (<year>2003</year>). <article-title>On the different roles of vowels and consonants in speech processing and language acquisition.</article-title> <source><italic>Lingue e Linguaggio</italic></source> <volume>2</volume> <fpage>203</fpage>&#x02013;<lpage>229</lpage>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pater</surname> <given-names>J.</given-names></name> <name><surname>Stager</surname> <given-names>C.</given-names></name> <name><surname>Werker</surname> <given-names>J. F.</given-names></name></person-group> (<year>2004</year>). <article-title>The perceptual acquisition of phonological contrasts.</article-title> <source><italic>Language</italic></source> <volume>80</volume> <fpage>384</fpage>&#x02013;<lpage>402</lpage>. <pub-id pub-id-type="doi">10.1353/lan.2004.0141</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pisoni</surname> <given-names>D. B.</given-names></name></person-group> (<year>1973</year>). <article-title>Auditory and phonetic memory codes in the discrimination of consonants and vowels.</article-title> <source><italic>Percept. Psychophys.</italic></source> <volume>13</volume> <fpage>253</fpage>&#x02013;<lpage>260</lpage>. <pub-id pub-id-type="doi">10.3758/BF03214136</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polka</surname> <given-names>L.</given-names></name> <name><surname>Bohn</surname> <given-names>O. S.</given-names></name></person-group> (<year>1996</year>). <article-title>A cross-language comparison of vowel perception in English-learning and German-learning infants.</article-title> <source><italic>J. Acoust. Soc. Am.</italic></source> <volume>100</volume> <fpage>577</fpage>&#x02013;<lpage>592</lpage>. <pub-id pub-id-type="doi">10.1121/1.415884</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polka</surname> <given-names>L.</given-names></name> <name><surname>Werker</surname> <given-names>J. F.</given-names></name></person-group> (<year>1994</year>). <article-title>Developmental changes in perception of nonnative vowel contrasts.</article-title> <source><italic>J. Exp. Psychol. Hum. Percept. Perform.</italic></source> <volume>20</volume> <fpage>421</fpage>&#x02013;<lpage>435</lpage>. <pub-id pub-id-type="doi">10.1037/0096-1523.20.2.421</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rost</surname> <given-names>G. C.</given-names></name> <name><surname>McMurray</surname> <given-names>B.</given-names></name></person-group> (<year>2009</year>). <article-title>Speaker variability augments phonological processing in early word learning.</article-title> <source><italic>Dev. Sci.</italic></source> <volume>12</volume> <fpage>339</fpage>&#x02013;<lpage>349</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7687.2008.00786.x</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saffran</surname> <given-names>J. R.</given-names></name> <name><surname>Newport</surname> <given-names>E. L.</given-names></name> <name><surname>Aslin</surname> <given-names>R. N.</given-names></name></person-group> (<year>1996</year>). <article-title>Word segmentation: the role of distributional cues.</article-title> <source><italic>J. Mem. Lang.</italic></source> <volume>35</volume> <fpage>606</fpage>&#x02013;<lpage>621</lpage>. <pub-id pub-id-type="doi">10.1006/jmla.1996.0032</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmale</surname> <given-names>R.</given-names></name> <name><surname>Cristi&#x000E0;</surname> <given-names>A.</given-names></name> <name><surname>Seidl</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Toddlers recognize words in an unfamiliar accent after brief exposure.</article-title> <source><italic>Dev. Sci.</italic></source> <volume>15</volume> <fpage>732</fpage>&#x02013;<lpage>738</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7687.2012.01175.x </pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmale</surname> <given-names>R.</given-names></name> <name><surname>Hollich</surname> <given-names>G.</given-names></name> <name><surname>Seidl</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>Contending with foreign accent in early word learning.</article-title> <source><italic>J. Child Lang.</italic></source> <volume>38</volume> <fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000910000619</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stager</surname> <given-names>C. L.</given-names></name> <name><surname>Werker</surname> <given-names>J. F.</given-names></name></person-group> (<year>1997</year>). <article-title>Infants listen for more phonetic detail in speech perception than in word-learning tasks.</article-title> <source><italic>Nature</italic></source> <volume>388</volume> <fpage>381</fpage>&#x02013;<lpage>382</lpage>. <pub-id pub-id-type="doi">10.1038/41102</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thiessen</surname> <given-names>E. D.</given-names></name></person-group> (<year>2007</year>). <article-title>The effect of distributional information on children&#x02019;s use of phonemic contrasts.</article-title> <source><italic>J. Mem. Lang.</italic></source> <volume>56</volume> <fpage>16</fpage>&#x02013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1016/j.jml.2006.07.002</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>E. R.</given-names></name></person-group> (<year>2001</year>). <source><italic>An Acoustic Analysis of Vowel Variation in New World English.</italic></source> <publisher-loc>Durham</publisher-loc>: <publisher-name>Duke University Press</publisher-name>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tsao</surname> <given-names>F.-M.</given-names></name> <name><surname>Liu</surname> <given-names>H.-M.</given-names></name> <name><surname>Kuhl</surname> <given-names>P. K.</given-names></name></person-group> (<year>2004</year>). <article-title>Speech perception in infancy predicts language development in the second year of life: a longitudinal study.</article-title> <source><italic>Child Dev.</italic></source> <volume>75</volume> <fpage>1067</fpage>&#x02013;<lpage>1084</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8624.2004.00726.x</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werker</surname> <given-names>J. F.</given-names></name> <name><surname>Curtin</surname> <given-names>S. A.</given-names></name></person-group> (<year>2005</year>). <article-title>PRIMIR: a developmental framework of infant speech processing.</article-title> <source><italic>Lang. Learn. Dev.</italic></source> <volume>1</volume> <fpage>197</fpage>&#x02013;<lpage>234</lpage>. <pub-id pub-id-type="doi">10.1080/15475441.2005.9684216</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werker</surname> <given-names>J. F.</given-names></name> <name><surname>Fennell</surname> <given-names>C. T.</given-names></name></person-group> (<year>2004</year>). <article-title>&#x0201C;Listening to sounds versus listening to words: early steps in word learning,&#x0201D; in</article-title> <source><italic>Weaving A Lexicon</italic>,</source> <role>eds</role> <person-group person-group-type="editor"><name><surname>Hall</surname> <given-names>D. G.</given-names></name> <name><surname>Waxman</surname> <given-names>S. R.</given-names></name></person-group> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>79</fpage>&#x02013;<lpage>109</lpage>.</citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werker</surname> <given-names>J. F.</given-names></name> <name><surname>Fennell</surname> <given-names>C. T.</given-names></name> <name><surname>Corcoran</surname> <given-names>K. M.</given-names></name> <name><surname>Stager</surname> <given-names>C. L.</given-names></name></person-group> (<year>2002</year>). <article-title>Infants&#x02019; ability to learn phonetically similar words: effects of age and vocabulary size.</article-title> <source><italic>Infancy</italic></source> <volume>3</volume> <fpage>1</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1207/S15327078IN0301_1</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werker</surname> <given-names>J. F.</given-names></name> <name><surname>Tees</surname> <given-names>R. C.</given-names></name></person-group> (<year>1983</year>). <article-title>Developmental changes across childhood in the perception of non-native speech sounds.</article-title> <source><italic>Can. J. Psychol.</italic></source> <volume>37</volume> <fpage>278</fpage>&#x02013;<lpage>286</lpage>. <pub-id pub-id-type="doi">10.1037/h0080725</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werker</surname> <given-names>J. F.</given-names></name> <name><surname>Tees</surname> <given-names>R. C.</given-names></name></person-group> (<year>1984</year>). <article-title>Cross-language speech perception: evidence for perceptual reorganization during the first year of life.</article-title> <source><italic>Infant Behav. Dev.</italic></source> <volume>7</volume> <fpage>49</fpage>&#x02013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1016/S0163-6383(84)80022-3 </pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Werker</surname> <given-names>J. F.</given-names></name> <name><surname>Tees</surname> <given-names>R. C.</given-names></name></person-group> (<year>1999</year>). <article-title>Influences on infant speech perception: toward a new synthesis.</article-title> <source><italic>Annu. Rev. Psychol.</italic></source> <volume>50</volume> <fpage>509</fpage>&#x02013;<lpage>535</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.psych.50.1.509</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>White</surname> <given-names>K. S.</given-names></name> <name><surname>Aslin</surname> <given-names>R. N.</given-names></name></person-group> (<year>2011</year>). <article-title>Adaptation to novel accents by toddlers.</article-title> <source><italic>Dev. Sci.</italic></source> <volume>14</volume> <fpage>372</fpage>&#x02013;<lpage>384</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7687.2010.00986 </pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yoshida</surname> <given-names>K. A.</given-names></name> <name><surname>Fennell</surname> <given-names>C. T.</given-names></name> <name><surname>Swingley</surname> <given-names>D.</given-names></name> <name><surname>Werker</surname> <given-names>J. F.</given-names></name></person-group> (<year>2009</year>). <article-title>Fourteen-month-old infants learn similar-sounding words.</article-title> <source><italic>Dev. Sci.</italic></source> <volume>12</volume> <fpage>412</fpage>&#x02013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7687.2008.00789.x</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01"><label>1</label><p>In adult speech, the AusE vowels seem to be distinguished instead mostly by subtle diphthongization (/i/ can be produced with a small &#x0201C;onglide&#x0201D; or delayed target which gives it the quality of a diphthong) and duration (<xref ref-type="bibr" rid="B12">Cox and Palethorpe, 2007</xref>; see Figure 2 in <xref ref-type="bibr" rid="B11">Cox, 2006</xref>). <xref ref-type="bibr" rid="B14">Curtin et al. (2009)</xref> also showed that in their CE stimuli, which were produced in child-directed speech, /i/ and /&#x00131;/ had overlapping duration values since in this speech style all CE vowels are apparently lengthened to similar extents. The authors show that duration is therefore an unreliable cue for this contrast in CE. Duration differences among these vowels are likely to also be unreliable in AusE child-directed speech, as is evident in <bold>Table <xref ref-type="table" rid="T1">1</xref></bold>.</p></fn>
<fn id="fn02"><label>2</label><p>To more accurately reflect its phonetic characteristics, centralized and rounded [u-] is commonly used to represent AusE /u/ (<xref ref-type="bibr" rid="B27">Harrington et al., 1997</xref>; <xref ref-type="bibr" rid="B11">Cox, 2006</xref>).</p></fn>
<fn id="fn03"><label>3</label><p>Although the first three and last three tokens are identical in the set of 10 tokens for DEET, DIT, and DOOT in both AusE and CE, formant averages are based on all 10 tokens so that the averages reflect all the tokens that infants heard during familiarization.</p></fn>
<fn id="fn04"><label>4</label><p>As LARD occurs at a low frequency in adult vocabularies, it is not expected to be part of the 15-month-old lexicon and is thus regarded here as a novel word.</p></fn>
</fn-group>
</back>
</article>