Hypothesis and Theory ARTICLE
Auditory-motor entrainment and phonological skills: precise auditory timing hypothesis (PATH)
- 1Auditory Neuroscience Laboratory, Northwestern University, Evanston, IL, USA
- 2Institute for Neuroscience, Northwestern University, Evanston, IL, USA
- 3Department of Communication Sciences, Northwestern University, Evanston, IL, USA
- 4Department of Neurobiology and Physiology, Northwestern University, Evanston, IL, USA
- 5Department of Otolaryngology, Northwestern University, Evanston, IL, USA
Phonological skills are enhanced by music training, but the mechanisms enabling this cross-domain enhancement remain unknown. To explain this cross-domain transfer, we propose a precise auditory timing hypothesis (PATH) whereby entrainment practice is the core mechanism underlying enhanced phonological abilities in musicians. Both rhythmic synchronization and language skills such as consonant discrimination, detection of word and phrase boundaries, and conversational turn-taking rely on the perception of extremely fine-grained timing details in sound. Auditory-motor timing is an acoustic feature which meets all five of the pre-conditions necessary for cross-domain enhancement to occur (Patel, 2011, 2012, 2014). There is overlap between the neural networks that process timing in the context of both music and language. Entrainment to music demands more precise timing sensitivity than does language processing. Moreover, auditory-motor timing integration captures the emotion of the trainee, is repeatedly practiced, and demands focused attention. The PATH predicts that musical training emphasizing entrainment will be particularly effective in enhancing phonological skills.
Musical Training and Reading
There is growing interest in the idea that music training can enhance language skills. This music-to-language transfer has been observed for a number of communication skills, including speech-in-noise perception (Parbery-Clark et al., 2009; Strait et al., 2012; Zendel and Alain, 2012) and verbal memory (Chan et al., 1998; Tierney et al., 2008; Parbery-Clark et al., 2009, 2011; Strait et al., 2010), but the skill that has been most extensively investigated is the ability to read. The hypothesis that musical training can enhance reading ability was tested as early as 1975 (Hurwitz et al., 1975) and has since been investigated in at least twenty-two separate studies. This work has ranged from cross-sectional comparisons of musicians and nonmusicians to longitudinal studies with random assignment of children to music training or control groups (for recent examples see Cogo-Moreira et al., 2013 and Rautenberg, 2013). Although there is a moderate degree of variability in the results reported, likely due to the heterogeneous nature of musical training, eighteen of the twenty-two studies reported that musical training increased reading or pre-reading abilities (for a review see Tierney and Kraus, 2013a).
Thus there is considerable evidence that musical training enhances reading skills. Why it can do so remains unknown. At first glance, musical training and the skills necessary for the acquisition of reading would appear to have little in common, as one requires auditory-motor integration, while the other would appear to be a silent visual activity. Complicating the search for an explanation of how these two domains of expertise are related is that both music performance and reading are complex skills which rely on a host of perceptual and cognitive processes. To develop a hypothesis for the mechanisms underlying transfer of learning from music to reading, therefore, one must first identify these musical and linguistic processes. Second, one must demonstrate that certain shared neural resources provide the foundation for both music and reading processes. If successful, such a hypothesis could enable the development of musical training regimens targeted to maximize transfer of learning to language skills.
Auditory-Motor Entrainment and Reading
Here we demonstrate the feasibility of this approach by focusing on entrainment to auditory signals. In this paper we use the term “entrainment” to refer to the process of moving to a repeated auditory signal such that there is a consistent relationship between the timing of one’s movements and the timing of sound onsets. Deficits in auditory-motor entrainment have been found alongside preserved discrimination of complex rhythms (Fries and Swihart, 1990), suggesting that auditory-motor entrainment is a skill dissociable from other rhythmic abilities (see the later section entitled “Other rhythmic skills”). Entrainment is also an integral part of group musical performance, and is therefore a skill which could be improved by a wide variety of musical training approaches. We argue that learning to entrain to complex stimuli may enhance the development of phonological awareness, the explicit knowledge of the component sounds of speech. Phonological awareness is a vital component of normal reading development and is impaired in poor readers (Ramus, 2003; Ramus et al., 2003; Rvachew and Grawburg, 2006; Siegel, 2006).
Participants who have difficulty reading have difficulty entraining to a metronome (Thomson et al., 2006; Thomson and Goswami, 2008; Goswami, 2011). This relationship between entrainment and reading skill has also been demonstrated in typically-developing participants (Tierney and Kraus, 2013b; see Figure 1). This relationship, therefore, is not specific to learning impairment and instead applies to reading skills more generally. Moreover, even before learning to read, children who are able to entrain to a metronome also have better language skills such as phonological awareness and verbal memory (Woodruff Carr et al., 2014), suggesting that entrainment is linked to core abilities necessary for the acquisition of reading. Entrainment skill is connected more strongly to language skills than the ability to tap steadily in silence (Thomson et al., 2006; Thomson and Goswami, 2008; Corriveau and Goswami, 2009; Tierney and Kraus, 2013b), suggesting that it is the ability to use auditory input to maintain and correct one’s timing that is important for reading acquisition. Aside from its relationship to language, entrainment variability is lower in trained musicians (Repp and Doggett, 2007; Repp, 2010), suggesting that it is a trainable skill.
Figure 1. Participants who tapped less variably to a metronome also performed better on tests of nonword reading (left; r = −0.35, p = 0.0067) and word reading (right; r = −0.38, p = 0.0036) tasks (Tierney and Kraus, 2013b). Synchronized tapping performance was measured as the average standard deviation across two speeds, 1.5 and 2 Hz. Word and nonword reading abilities were assessed using the Test of Word Reading Efficiency (TOWRE; Torgeson et al., 1999).
The OPERA Hypothesis
Here we explore the question of why auditory-motor entrainment practice might lead to enhanced phonological skills and subsequent reading ability through the framework of the OPERA hypothesis (Patel, 2011, 2012, 2014). OPERA proposes that effects of musical training on language skills can be understood by examining shared acoustic features of music and speech and demonstrating that they satisfy five conditions: overlap, precision, emotion, repetition, and attention. First, the neural resources responsible for processing that acoustic feature in both music and speech must overlap, enabling transfer from learning in one domain to the other. Second, the precision with which the feature must be processed must be greater in music than in language, with the result that music training improves precision of processing in language. Third, the feature must elicit emotion when encountered in music, as emotion augments perceptual learning. Fourth, the feature must be repeatedly processed in the course of listening to or playing music, enabling the repeated practice that is crucial for learning. Finally, the feature must be the subject of directed attention.
We propose the Precise Auditory Timing Hypothesis (PATH): both auditory-motor entrainment and phonological awareness rely upon precise neural timing in the auditory system and integration of this auditory timing information with motor and cognitive networks. In the remainder of this paper we examine the five conditions of the OPERA hypothesis and demonstrate that precise auditory timing satisfies all five. We do not propose that precise auditory timing is the only shared feature driving the effects of musical training on reading ability. Given the heterogeneous natures of both music and reading, there are likely several mechanisms by which musical training can enhance reading. Nevertheless, our view is that PATH provides a framework to understand a principal mechanism of music-reading transfer.
Perception of the Timing of Sound Events, Which is Important for Both Acquiring Phonological Skills and Entraining to Music, Relies on the Precision of the Auditory System’s Processing of Sound
When a participant moves at a consistent tempo in the absence of any perceptual input, their performance can be modeled as a simple combination of variability contributed by the motor system and variability contributed by an internal timekeeper (Wing and Kristofferson, 1973). When a pacing stimulus is present, however, a third component is added to the model—that of auditory-motor feedback. Small variations in motor timing, if not corrected, will tend to cause movement to drift away from the stimulus. In order to entrain, therefore, participants must perceive the timing of each sound, compare this information with the timing of their movement, and adjust their next motor movement accordingly (Semjen et al., 1998). Less accurate temporal perception may lead to a greater build-up of timing discrepancies prior to successful error correction, resulting in more variable performance (Krause et al., 2010). Participants who move less variably when entraining to a metronome are better able to distinguish two nearly simultaneous events (Krause et al., 2010), confirming that expert entrainment relies upon precise auditory timing perception.
The development of phonological awareness relies upon the ability to align precise perception of timing with meaningful phonological categories. For example, in consonant-vowel syllables, the phonemes /d/ and /t/ can be distinguished by their voice onset time (VOT), or the difference in time between the release of the stop (the movement of the tongue away from the roof of the mouth) and the onset of vowel voicing. To learn to perceive these phonemes listeners must track sound timing and, through experience, learn to classify sounds with a certain range of timing as belonging to one or the other phonological category. This learned classification then enables children to map letter combinations onto speech sounds in the course of learning to read. The development of phonological awareness, therefore, requires children to integrate auditory timing information with the cognitive and motor networks responsible for developing the ability to represent and produce speech sounds. Children with reading impairment have difficulty with this timing integration, as they often exhibit abnormal categorical perception of stop consonants (Reed, 1989; Kraus et al., 1996; King et al., 2002; Richardson et al., 2003; Serniclaes et al., 2004; Tsao et al., 2004; Sharma et al., 2006; Boets et al., 2008, 2011; Vandermosten et al., 2011; Berent et al., 2012).
Moreover, the perception of certain temporal patterns, such as the slowing that tends to occur as speakers approach the ends of sentences (Klatt and Cooper, 1975; Fant et al., 1991; Vaissière, 1991; Venditti and van Santen, 1998), can facilitate the perception of word and phrase boundaries (Nakatani and Schaffer, 1978; Smith et al., 1989; Cutler and Butterfield, 1992). Further confirming the importance of precise time perception for reading, children with language impairments also have problems with backward masking, the ability to perceptually separate a sound and a subsequent noise burst (Wright et al., 1997; Marler et al., 2001, 2002; McArthur and Hogben, 2001; Griffiths et al., 2003; Montgomery et al., 2005; Gibson et al., 2006; Tierney and Kraus, 2013b).
The inferior colliculus represents auditory timing with high temporal precision (Warrier et al., 2011) and connects directly to the cerebellum (Mower et al., 1979; Hashikawa, 1983; Saint Marie, 1996), a structure involved in auditory-motor entrainment (Rao et al., 1997; Molinari et al., 2007; Bijsterbosch et al., 2011; Grahn et al., 2011) and tracking of temporal structure in speech (Kotz and Schwartze, 2010; Schwartze and Kotz, 2013). This colliculo-cerebellar connection may, therefore, be a primary pathway by which auditory timing information is used as feedback to correct errors in motor output during entrainment. Participants who move more variably to a metronome also show greater trial-by-trial timing variability (Tierney and Kraus, 2013c; see Figure 2) and delayed onset timing (Tierney et al., 2014) of the brainstem response to sound (primarily generated within the inferior colliculus), suggesting that the precision with which timing information is represented in the subcortical auditory system helps determine the precision with which listeners can perceive, react to, and predict the timing of auditory events (see also Woodruff Carr et al., 2014). If an individual’s auditory system cannot generate responses to sound that occur with consistent, reliable timing, estimations of the timing of sound events that rely on these responses may be blurred, and this lack of precise temporal perception may affect the stability of entrainment to auditory signals.
Figure 2. (Left) Participants who moved consistently to a metronome also tended to have consistent neural responses to speech. (Right) Less variable movement during entrainment is associated with neural responses to sound that are jittered less in time from trial to trial. Temporal jitter was measured by calculating frequency-by-frequency phase consistency across all trials (Tierney and Kraus, 2013c).
Poor readers (i.e., subjects with poor phonological skills) also exhibit brainstem responses that are delayed (King et al., 2002; Banai et al., 2005, 2009) and more variable on a trial-by-trial basis (Hornickel and Kraus, 2013; see Figure 3) compared to those of good readers. Delayed and variable neural responses are likewise present in a rat model of dyslexia (Centanni et al., 2013). Thus, the exact timing perception necessary for both auditory-motor entrainment and phonological processing may depend on the temporal precision of the auditory system (Figure 4). As a result, repeated practice entraining to music could engender benefits for perception of the timing of speech sounds, eventually leading to enhanced phonological awareness.
Figure 3. (Top) Response consistency is a neural measure that assesses the extent to which the neural response to sound is consistent on a trial-by-trial basis across a recording. (Bottom) Good readers have responses with greater consistency than average readers, who have responses with greater consistency than poor readers (Hornickel and Kraus, 2013).
Figure 4. Schematic displaying recently discovered relationships between reading, auditory-motor entrainment, and auditory neural consistency.
Although Both Phonological Skills and Auditory-Motor Entrainment Rely on Precise Timing, Entrainment Places Greater Demands on Timing Precision
The development of phonological awareness depends on precise timing perception. Differences in VOT or the duration of formant transitions that distinguish consonants are on the order of tens of milliseconds. During auditory-motor entrainment participants adjust their movement based on changes in timing of as little as 3 ms (Repp, 2000; Madison and Merker, 2004). Thus, expert entrainment performance relies upon greater precision of timing perception than is necessary for speech perception and, by extension, development of phonological awareness. As a result, entrainment practice could enhance the precision of timing perception to a greater degree than would everyday language experience, leading to benefits for language processing as well.
Entraining to Music is a Positive Emotional Experience, Thereby Facilitating Learning
Entrainment is a natural social behavior; in fact, toddlers are only able to entrain to a metronome if it is produced by another person, as opposed to a machine or a disembodied sound (Kirschner and Tomasello, 2009). Moreover, not only is entrainment facilitated when performed in a social setting, the act of entrainment strengthens social bonds. For example, a group of 4-year-old children who moved in synchrony with one another were shown to be more likely, compared to a control group, to cooperate in tasks that could be performed in either a separate or a cooperative manner (Kirschner and Tomasello, 2010). Similarly, young adults who move in synchrony with one another are led to cooperate more in subsequent economic games, compared to participants engaging in a control task involving group behavior that is not synchronized (Wiltermuth and Heath, 2009). Thus, entraining together solidifies social bonds, a process that is likely to be highly rewarding. Moreover, participants listening to music commonly make spontaneous movements even when listening by themselves, and will do so more when listening to highly rhythmic music (Janata et al., 2012), suggesting that the act of entraining to music is intrinsically rewarding even in the absence of social context. Given that emotional engagement facilitates auditory neural plasticity (David et al., 2012), the rewarding nature of entrainment during music making makes it a particularly powerful learning tool.
Entrainment is a Task Which is Constantly Repeated During Music Practice
The process of entrainment is one of the most highly repetitive aspects of music. First, entrainment involves constantly monitoring the relationship between one’s movements and the timing of the sounds to which one is moving. In this sense, entrainment is a skill that is constantly trained throughout group practice and performance. Second, musical rhythms are highly predictable: durational patterns are repeated and tempi are maintained for long periods of time, allowing the performer and audience to predict—and move along to—sound events. As a result, even complex forms of entrainment (for example, a group producing a multilayered temporal pattern as part of a piece of music) are quite repetitive. During this repeated entrainment practice musicians are constantly monitoring the timing of acoustic events. Over time, this repeated, effortful practice hones automatic mechanisms which track auditory timing and relay this information to the motor system.
Successful Entrainment Requires Focusing Attention on Acoustic Timing
Focusing attention enhances the brain’s ability to adapt to input (Fritz et al., 2005). As a result, a musical feature upon which attention is focused during musical performance is more likely to lead to cross-domain enhancements than features which are not attended. Although a seemingly simple task, entraining to a metronome calls upon attentional resources. Children with attention deficit and hyperactivity disorder, for example, are more variable in tapping to a metronome (Pitcher et al., 2002; Rubia et al., 2003; Toplak and Tannock, 2005; Ben-Pazi et al., 2006), suggesting that an inability to sustain attention hinders mechanisms that would normally reduce tapping variability. This relationship between sustained attention ability and variability during entrainment has also been demonstrated in a typically-developing population (Tierney and Kraus, 2013b), confirming that entrainment relies upon attentional resources.
Error corrections to supraliminal perturbations during entrainment are more accurate and faster than corrections in response to subliminal perturbations, which tend to be overly large, leading to overshoot and the necessity for further corrections (Repp, 2001, 2002, 2011). Thus, optimal entrainment performance requires the focusing of attention on the timing of acoustic events. Repeated focused attention to a particular feature may enhance the automatic precision of neural encoding of that feature within the auditory midbrain in part via efferent corticofugal connections (Kraus and Chandrasekaran, 2010). Over time, repeated attention to auditory timing in the past may shape the auditory system such that representation of timing becomes automatically more precise (Kraus and Nicol, 2014).
Given that entraining to a signal as simple as an isochronous pulse relies on attention, it is likely that entrainment during performance of real music is much more demanding of attentional resources, given the much more complex rhythmic structure of music. The perception of musical structure has been hypothesized to rely upon entrainment of ongoing neural oscillations (Large, 2008), a process which has been also suggested as a mechanism of selective attention (Calderone et al., 2014). Rhythm perception could, therefore, involve a structured waxing and waning of attention as important time points (“strong” and “weak” beats) come and go (Large and Jones, 1999). This dynamic attending may enable listeners to predict when sounds are more or less likely to occur, facilitating behavioral entrainment (Patel et al., 2005). Dynamic allocation of attentional resources is, therefore, a vital component of entrainment to music (Tierney and Kraus, 2014).
Predictions of PATH
One prediction of PATH is that musical training should lead to an enhanced ability to use durational cues during speech perception to discriminate speech sounds and segment words and phrases. Consistent with this prediction, professional musicians are better able to distinguish consonants based on durational cues such as VOT (Elmer et al., 2013; Zuk et al., 2013) and formant transition durations (Zuk et al., 2013) but are no better than nonmusicians at distinguishing consonants based on frequency information alone (Zuk et al., 2013). Musicians are also better able to detect anomalous lengthening of the final syllables of sentences (Marie et al., 2011), suggesting that they are better able to take advantage of durational cues to word and phrase segmentation. These enhancements in the perception of speech timing may be due to the entrainment practice that is commonly a part of musical training (e.g., playing with others, with recordings, or with a metronome). Given that these studies were not specifically designed to study the effects of entrainment in isolation, however, we do not know to what extent entrainment practice played a part in the training of these musicians.
A second, related prediction of PATH is that musicians whose training places greater emphasis on entrainment will show enhanced phonological skills relative to musicians whose training does not emphasize entrainment. One way to test this prediction would be to compare drummers vs. vocalists in terms of their auditory system timing and phonological abilities. The vocalists are likely to spend less time entraining with a metronome, a recording, or groups of musicians than the drummers. Intuitively, one might expect singers to have better phonological abilities than drummers (since their musical form involves words), yet PATH makes the opposite, counterintuitive prediction.
It may be possible, using the auditory brainstem response to complex sounds, to identify children with imprecise neural timing who would likely benefit from musical training that emphasizes precise auditory-motor entrainment. The complex auditory brainstem response demonstrates high test-retest reliability (Hornickel et al., 2012) and tracks with individual differences in speech perception and language skills (Kraus and Chandrasekaran, 2010), making it an ideal candidate for identifying children with auditory-based communication problems. The complex auditory brainstem response can serve as a useful metric of the impact of musical training on the auditory system (Kraus and Chandrasekaran, 2010; Barrett et al., 2013; Strait and Kraus, 2014). Moreover, entrainment is among the easiest of musical skills to isolate and assess in an automated fashion. As a result, it may be practical to boost entrainment ability with video games and training programs, an approach which could benefit children unable to afford traditional music lessons.
Other Rhythmic Skills
Performance is correlated across various rhythm tasks such as entrainment, rhythm discrimination, and amplitude rise time perception, (Thomson et al., 2006; Thomson and Goswami, 2008; Huss et al., 2011), suggesting that these tasks broadly assess a single rhythmic competence. Nevertheless, there is evidence that dramatic impairments in single rhythmic skills can coexist with normal performance in other rhythmic skills. Phillips-Silver et al. (2011), for example, reported a case study of a participant who was able to entrain to a metronome but not to a piece of music. Similarly, Launay et al. (2014) describe a group of participants with difficulties tapping to the beat of rhythmic patterns but with preserved ability to entrain to a metronome. On the other hand, Fries and Swihart (1990) report that a patient with right hemisphere damage was unable to entrain to an acoustic rhythm but was able to entrain to a visual stimulus and could both discriminate rhythmic patterns and produce rhythms from memory. Auditory-motor entrainment, therefore, appears to be dissociable from other rhythmic skills.
These findings suggest that there are several identifiable rhythmic skills that at least partially rely on distinct neural resources. As a result, each rhythmic skill may relate to language skills in different ways. Tracking the rhythmic structure of music, for example, may rely on phase-locking of ongoing slow oscillations (1.5–7 Hz) in auditory cortex to the rhythmic structure of music (Large, 2008); this same neural mechanism has also been proposed for the tracking of the amplitude envelope of speech (Goswami, 2011). Alternately, tracking the rhythmic structure of music may call upon motor planning regions, which could work in concert with the auditory system to predict when future beats will arrive (Patel and Iversen, 2014). This procedure could also underlie the tracking of rhythmic regularities in speech, such as the tendency for speakers to slow as they near the ends of phrases or sentences. Thus, although both entrainment and rhythmic discrimination (Strait et al., 2011) relate to language skills, the mechanisms underlying these two relationships may be different. Here we propose that auditory-motor entrainment and phonological skills relate due to a shared reliance on precise representation of neural timing in the auditory system.
In summary, we hypothesize that musical training incorporating entrainment practice requires musicians to perceive the timing of acoustic events with a high degree of precision. This constant attention to the timing of sounds eventually leads to increased timing precision in the auditory system’s automatic representation of sound, which in turn leads to enhanced perception of the timing of speech sounds. The perception of speech timing is vital for the acquisition of phonological skills, which facilitates reading development. PATH, therefore, explains the consistent finding that musical training can lead to enhancements in reading ability as a consequence of the central role of entrainment in musical practice and performance.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This research was funded by the National Science Foundation grant 1430400, the National Association of Music Merchants, and the Knowles Hearing Center. The authors thank Aniruddh Patel, Trent Nicol, Dana Strait, and Kali Woodruff Carr for comments on the manuscript.
Ben-Pazi, H., Shalev, R., Gross-Tsur, V., and Bergman, H. (2006). Age and medication effects on rhythmic responses in ADHD: possible oscillatory mechanisms? Neuropsychologia 44, 412–416. doi: 10.1016/j.neuropsychologia.2005.05.022
Berent, I., Vaknin-Nusbaum, V., Balaban, E., and Galaburda, A. M. (2012). Dyslexia impairs speech recognition but can spare phonological competence. PloS One 7:e44875. doi: 10.1371/journal.pone.0044875
Bijsterbosch, J., Lee, K., Hunter, M., Tsoi, D., Lankappa, S., Wilkinson, I., et al. (2011). The role of the cerebellum in sub- and supraliminal error correction during sensorimotor synchronization: evidence from fMRI and TMS. J. Cogn. Neurosci. 23, 1100–1112. doi: 10.1162/jocn.2010.21506
Boets, B., Vandermosten, M., Poelmans, H., Luts, H., Wouters, J., and Ghesquière, P. (2011). Preschool impairments in auditory processing and speech perception uniquely predict future reading problems. Res. Dev. Disabil. 32, 560–570. doi: 10.1016/j.ridd.2010.12.020
Boets, B., Wouters, J., van Wieringen, A., De Smedt, B., and Ghesquière, P. (2008). Modelling relations between sensory processing, speech perception, orthographic and phonological ability and literacy achievement. Brain Lang. 106, 29–40. doi: 10.1016/j.bandl.2007.12.004
Calderone, D., Lakatos, P., Butler, P., and Castellanos, X. (2014). Entrainment of neural oscillations as a modifiable substrate of attention. Trends Cogn. Sci. 18, 300–309. doi: 10.1016/j.tics.2014.02.005
Centanni, T. M., Booker, A. B., Sloan, A. M., Chen, F., Maher, B. J., Carraway, R. S., et al. (2013). Knockdown of the dyslexia-associated gene Kiaa0319 Impairs temporal responses to speech stimuli in rat primary auditory cortex. Cereb. Cortex 24, 1753–1766. doi: 10.1093/cercor/bht028
Cogo-Moreira, H., Brandão de Ávila, C. R., Ploubidis, G. B., and Mari, J. (2013). Effectiveness of music education for the improvement of reading skills and academic achievement in young poor readers: a pragmatic cluster-randomized, controlled clinical trial. PloS One 8:e59984. doi: 10.1371/journal.pone.0059984
David, S., Fritz, J., and Shamma, S. (2012). Task reward structure shapes rapid receptive field plasticity in auditory cortex. Proc. Natl. Acad. Sci. U S A 109, 2144–2149. doi: 10.1073/pnas.1117717109
Elmer, S., Hänggi, J., Meyer, M., and Jäncke, L. (2013). Increased cortical surface area of the left planum temporale in musicians facilitates the categorization of phonetic and temporal speech sounds. Cortex 49, 2812–2821. doi: 10.1016/j.cortex.2013.03.007
Grahn, J., Henry, M., and McAuley, J. (2011). FMRI investigation of cross-modal interactions in beat perception: audition primes vision, but not vice versa. Neuroimage 54, 1231–1243. doi: 10.1016/j.neuroimage.2010.09.033
Griffiths, Y., Hill, N., Bailey, P., and Snowling, M. (2003). Auditory temporal order discrimination and backward recognition masking in adults with dyslexia. J. Speech Lang. Hear. Res. 46, 1352–1366. doi: 10.1044/1092-4388(2003/105)
Hornickel, J., Zecker, S., Bradlow, A., and Kraus, N. (2012). Assistive listening devices drive neuroplasticity in children with dyslexia. Proc. Natl. Acad. Sci. U S A 109, 16731–16736. doi: 10.1073/pnas.1206628109
Hurwitz, I., Wolff, P. H., Bortnick, B. D., and Kokas, K. (1975). Nonmusicol effects of the kodaly music curriculum in primary grade children. J. Learn. Disabil. 8, 167–174. doi: 10.1177/002221947500800310
Huss, M., Verney, J. P., Fosker, T., Mead, N., and Goswami, U. (2011). Music, rhythm, rise time perception and developmental dyslexia: perception of musical meter predicts reading and phonology. Cortex 47, 674–689. doi: 10.1016/j.cortex.2010.07.010
King, C., Warrier, C. M., Hayes, E., and Kraus, N. (2002). Deficits in auditory brainstem pathway encoding of speech sounds in children with learning problems. Neurosci. Lett. 319, 111–115. doi: 10.1016/s0304-3940(01)02556-3
Kraus, N., McGee, T. J., Carrell, T. D., Zecker, S. G., Nicol, T. G., and Koch, D. B. (1996). Auditory neurophysiologic responses and discrimination deficits in children with learning problems. Science 273, 971–973. doi: 10.1126/science.273.5277.971
Kraus, N., and Nicol, T. (2014). “The cognitive auditory system: the role of learning in shaping the biology of the auditory system,” in Perspectives on Auditory Research, eds R. Fay and A. Popper (Springer Handbook of Auditory Research, Springer-Verlag: Heiderlberg), 299–319.
Krause, V., Pollok, B., and Schnitzler, A. (2010). Perception in action: the impact of sensory information on sensorimotor synchronization in musicians and non-musicians. Acta Psychol. (Amst) 133, 28–37. doi: 10.1016/j.actpsy.2009.08.003
Marler, J., Champlin, C., and Gillam, R. (2001). Backward and simultaneous masking measured in children with language-learning impairments who received intervention with fast forword or laureate learning systems software. Am. Ann. Deaf 10, 258–268. doi: 10.1044/1058-0360(2001/023)
McArthur, G., and Hogben, J. (2001). Auditory backward recognition masking in children with a specific language impairment and children with a specific reading disability. J. Acoust. Soc. Am. 109, 1092–1100. doi: 10.1121/1.1338559
Mower, G., Gibson, A., and Glickstein, M. (1979). Tectopontine pathway in the cat: laminar distribution of cells of origin and visual properties of target cells in dorsolateral pontine nucleus. J. Neurophysiol. 42, 1–15.
Parbery-Clark, A., Strait, D., Anderson, S., Hittner, E., and Kraus, N. (2011). Musical experience and the aging auditory system: implications for cognitive abilities and hearing speech in noise. PLoS One 6:e18082. doi: 10.1371/journal.pone.0018082
Patel, A., and Iversen, J. (2014). The evolutionary neuroscience of musical beat perception: the Action Simulation for Auditory Prediction (ASAP) hypothesis. Front. Syst. Neurosci. 8:57. doi: 10.3389/fnsys.2014.00057
Phillips-Silver, J., Toiviainen, P., Gosselin, N., Piché, O., Nozaradan, S., Palmer, C., et al. (2011). Born to dance but beat deaf: a new form of congenital amusia. Neuropsychologia 49, 961–969. doi: 10.1016/j.neuropsychologia.2011.02.002
Pitcher, T., Piek, J., and Barrett, N. (2002). Timing and force control in boys with attention deficit hyperactivity disorder: subtype differences and the effect of comorbid developmental coordination disorder. Hum. Mov. Sci. 21, 919–945. doi: 10.1016/s0167-9457(02)00167-7
Ramus, F., Rosen, S., Dakin, S., Day, B., Castellote, J., White, S., et al. (2003). Theories of developmental dyslexia: insights from a multiple case study of dyslexic adults. Brain 126, 841–865. doi: 10.1093/brain/awg076
Repp, B. (2002). Automaticity and voluntary control of phase correction following event onset shifts in sensorimotor synchronization. J. Exp. Psychol. Hum. Percept. Perform. 28, 410–430. doi: 10.1037//0096-15188.8.131.520
Repp, B. H. (2011). Tapping in synchrony with a perturbed metronome: the phase correction response to small and large phase shifts as a function of tempo. J. Mot. Behav. 43, 213–227. doi: 10.1080/00222895.2011.561377
Richardson, U., Leppänen, P. H. T., Leiwo, M., and Lyytinen, H. (2003). Speech perception of infants with high familial risk for dyslexia differ at the age of 6 months. Dev. Neuropsychol. 23, 385–397. doi: 10.1207/s15326942dn2303_5
Rubia, K., Noorloos, J., Smith, A., Gunning, B., and Sergeant, J. (2003). Motor timing deficits in community and clinical boys with hyperactive behavior: the effect of methylphenidate on motor timing. J. Abnorm. Child Psychol. 31, 301–313. doi: 10.1023/A:1023233630774
Saint Marie, R. L. (1996). Glutamatergic connections of the auditory midbrain: selective uptake and axonal transport of D-[3H]Aspartate. J. Comp. Neurol. 373, 255–270. doi: 10.1002/(sici)1096-9861(19960916)373:2<255::aid-cne8>3.0.co;2-2
Serniclaes, W., Van Heghe, S., Mousty, P., Carré, R., and Sprenger-Charolles, L. (2004). Allophonic mode of speech perception in dyslexia. J. Exp. Child Psychol. 87, 336–361. doi: 10.1016/j.jecp.2004.02.001
Sharma, M., Purdy, S. C., Newall, P., Wheldall, K., Beaman, R., and Dillon, H. (2006). Electrophysiological and behavioral evidence of auditory processing deficits in children with reading disorder. Clin. Neurophysiol. 117, 1130–1144. doi: 10.1016/j.clinph.2006.02.001
Smith, M. R., Cutler, A., Butterfield, S., and Nimmo-Smith, I. (1989). The perception of rhythm and word boundaries in noise-masked speech. J. Speech Hear. Res. 32, 912–920. doi: 10.1044/jshr.3204.912
Strait, D. L., Hornickel, J., and Kraus, N. (2011). Subcortical processing of speech regularities underlies reading and music aptitude in children. Behav. Brain Funct. 7:44. doi: 10.1186/1744-9081-7-44
Strait, D. L., and Kraus, N. (2014). Biological impact of auditory expertise across the life span: musicians as a model of auditory learning. Hear. Res. 308, 109–121. doi: 10.1016/j.heares.2013.08.004
Strait, D. L., Kraus, N., Parbery-Clark, A., and Ashley, R. (2010). Musical experience shapes top-down auditory mechanisms: evidence from masking and auditory attention performance. Hear. Res. 261, 22–29. doi: 10.1016/j.heares.2009.12.021
Strait, D. L., Parbery-Clark, A., Hittner, E., and Kraus, N. (2012). Musical training during early childhood enhances the neural encoding of speech in noise. Brain Lang. 123, 191–201. doi: 10.1016/j.bandl.2012.09.001
Thomson, J. M., and Goswami, U. (2008). Rhythmic processing in children with developmental dyslexia: auditory and motor rhythms link to reading and spelling. J. Physiol. Paris 102, 120–129. doi: 10.1016/j.jphysparis.2008.03.007
Tierney, A., and Kraus, N. (2013a). “Musical training for the development of language skills,” in Changing Brains - Applying Brain Plasticity to Advance and Recover Human Ability (Vol. 207), eds M. M. Merzenich, M. Nahum and T. Vleet (Amsterdam: Elsevier), 209–241.
Tsao, F.-M., Liu, H.-M., and Kuhl, P. K. (2004). Speech perception in infancy predicts language development in the second year of life: a longitudinal study. Child Dev. 75, 1067–1084. doi: 10.1111/j.1467-8624.2004.00726.x
Vandermosten, M., Boets, B., Luts, H., Poelmans, H., Wouters, J., and Ghesquière, P. (2011). Impairments in speech and nonspeech sound categorization in children with dyslexia are driven by temporal processing difficulties. Res. Dev. Disabil. 32, 593–603. doi: 10.1016/j.ridd.2010.12.015
Warrier, C. M., Nicol, T. G., Abrams, D. A., and Kraus, N. (2011). Inferior colliculus contributions to phase encoding of stop consonants in an animal model. Hear. Res. 282, 108–118. doi: 10.1016/j.heares.2011.09.001
Woodruff Carr, K., White-Schwoch, T., Tierney, A. T., Strait, D. L., and Kraus, N. (2014). Beat synchronization predicts neural speech encoding and reading readiness in preschoolers. Proc. Natl. Acad. Sci. U S A 111, 14559–14564. doi: 10.1073/pnas.1406219111
Wright, B. A., Lombardino, L. J., King, W. M., Puranik, C. S., Leonard, C. M., and Merzenich, M. M. (1997). Deficits in auditory temporal and spectral resolution in language-impaired children. Nature 387, 176–178. doi: 10.1038/387176a0
Keywords: synchronization, auditory timing, phonological skills, musical training, reading
Citation: Tierney A and Kraus N (2014) Auditory-motor entrainment and phonological skills: precise auditory timing hypothesis (PATH). Front. Hum. Neurosci. 8:949. doi: 10.3389/fnhum.2014.00949
Received: 01 July 2014; Paper pending published: 10 October 2014;
Accepted: 07 November 2014; Published online: 27 November 2014.
Edited by:Petri Toiviainen, University of Jyväskylä, Finland
Reviewed by:Jessica Phillips-Silver, Georgetown University Medical Center, USA
María Teresa Daza González, University of Almería, Spain
Copyright © 2014 Tierney and Kraus. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution and reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Nina Kraus, Auditory Neuroscience Laboratory, Northwestern University, 2240 Campus Drive, Evanston, IL 60208, USA e-mail: firstname.lastname@example.org;