Original Research ARTICLE
Physiologic discrimination of stop consonants relates to phonological skills in pre-readers: a biomarker for subsequent reading ability?
- 1Auditory Neuroscience Laboratory, Northwestern University, Evanston, IL, USA
- 2Department of Communication Sciences, Northwestern University, Evanston, IL, USA
- 3Institute for Neuroscience, Northwestern University, Evanston, IL, USA
- 4Department of Neurobiology and Physiology, Northwestern University, Evanston, IL, USA
- 5Department of Otolaryngology, Northwestern University, Chicago, IL, USA
Reading development builds upon the accurate representation of the phonological structure of spoken language. This representation and its neural foundations have been studied extensively with respect to reading due to pervasive performance deficits on basic phonological tasks observed in children with dyslexia. The subcortical auditory system – a site of intersection for sensory and cognitive input – is exquisitely tuned to code fine timing differences between phonemes, and so likely plays a foundational role in the development of phonological processing and, eventually, reading. This temporal coding of speech varies systematically with reading ability in school age children. Little is known, however, about subcortical speech representation in pre-school age children. We measured auditory brainstem responses to the stop consonants [ba] and [ga] in a cohort of 4-year-old children and assessed their phonological skills. In a typical auditory system, brainstem responses to [ba] and [ga] are out of phase (i.e., differ in time) due to formant frequency differences in the consonant-vowel transitions of the stimuli. We found that children who performed worst on the phonological awareness task insufficiently code this difference, revealing a physiologic link between early phonological skills and the neural representation of speech. We discuss this finding in light of existing theories of the role of the auditory system in developmental dyslexia, and argue for a systems-level perspective for understanding the importance of precise temporal coding for learning to read.
Learning to read scaffolds on the development of more basic language skills. One such primitive is phonological awareness, the knowledge that spoken language is made up of smaller units such as syllables and phonemes (Sandak et al., 2004; Kovelman et al., 2012; Pugh et al., 2013). Phonological processing has been an area of keen interest in the study of reading for years due to the observation of pervasive performance deficits in dyslexics on basic phonological tasks (Swan and Goswami, 1997; Pugh et al., 2013; Ramus et al., 2013). Theories of developmental dyslexia, and theories of reading more generally, must therefore account for the biological mechanisms supporting phonological processing and related language skills.
Developmental dyslexia affects approximately 5–10% of children and is characterized by a failure to develop effective reading skills despite typical intelligence and adequate support from parents, teachers, and caregivers (Démonet et al., 2004). As a group, children (and adults) with dyslexia have a constellation of deficits in auditory processing. There are, for example, extensive performance gaps between dyslexic and typically developing children on a variety of basic auditory tasks (Wright et al., 1997; Goswami et al., 2002; Ahissar et al., 2006). Children with dyslexia have difficulty coding rapidly changing frequency content in speech such as formant transitions in consonant–vowel syllables (Tallal and Piercy, 1975; Tallal, 1980). Dyslexics also have difficulty tracking amplitude envelope modulations in speech, such as in syllable onsets (Goswami et al., 2002, 2011). However, it remains unknown whether these deficits are each observed within an individual or if there are variable manifestations of developmental dyslexia.
Neurophysiologic deficits associated with dyslexia include increased variability in neural firing as observed in auditory midbrain in humans (Hornickel and Kraus, 2013) and cortex in a rat model of dyslexia (Centanni et al., 2013), in addition to decreased auditory cortical phase-locking to the acoustic envelope (Abrams et al., 2009; Lehongre et al., 2011). Our own group has identified a number of deficits in speech coding throughout the central auditory system that are linked to poor reading (Kraus et al., 1996; Wible et al., 2005; Abrams et al., 2009; Banai et al., 2009; Chandrasekaran et al., 2009; Hornickel and Kraus, 2013).
In light of the wide variety of auditory deficits identified in dyslexics, a plethora of theories as to the disorder’s biological origin have emerged, each of which has tried to identify a “core deficit.” Although these theories are not necessarily mutually exclusive, there is little accord in the literature (cf. Livingstone et al., 1991; Wright et al., 2000; Stein, 2001; Ahissar, 2007; Vidyasagar and Pammer, 2010; Goswami, 2011; Lallier et al., 2013). Many theories have centered on a core deficit in phonological processing and, consequently, a number of neurophysiologic investigations have characterized the biology underlying this skill and deficits thereof. Neuroimaging studies have identified diminished activity in left-lateralized language networks in dyslexic children performing phonological tasks (Kovelman et al., 2012; Pugh et al., 2013). Lehongre et al. (2011) used magnetoencephalography to measure neural entrainment to amplitude modulated noise bursts, and found that dyslexics had poorer phase-locking in the “low gamma” range (~30 Hz), correlating with poor performance on phonological tasks. Finally, the discrimination of stop consonants in auditory midbrain is linked to reading ability in school age children (Hornickel et al., 2009).
While these studies (in addition to many others) have offered insight into the pathophysiology underlying phonological and/or reading deficits, they are complicated by the reciprocal relationship between phonological processing and reading. For although phonological awareness likely bootstraps reading development, the first years of reading themselves influence phonological awareness (Castles and Coltheart, 2004). Therefore, here, we assessed the relationship between phonological awareness and neurophysiologic discrimination of stop consonants in a group of typically developing 4-year-old children. We hypothesized that early phonological awareness is linked to the precision of physiologic speech sound discrimination. To test this hypothesis, we measured neural responses to a pair of speech stimuli previously shown to vary systematically with phonological processing in school-age children (Hornickel et al., 2009). By assessing physiologic processing of speech in pre-school age children we hope to gain insight into the trajectory of reading development. Moreover, we may identify a potential biomarker to predict subsequent reading ability.
Materials and Methods
Four-year-old children (N = 26, 14 female) were recruited from the Chicago area to participate in a developmental study at Northwestern University. No child had a history of a neurologic or otologic condition, second language experience, or a diagnosis of autism spectrum disorder. Four children had immediate family histories of dyslexia (parent or sibling). All children passed a brief screening of peripheral auditory function (normal tympanometry and distortion product otoacoustic emissions at least 6 dB above the noise floor for octaves from 1–8 kHz). Additionally, all children had normal click-evoked auditory brainstem responses (Wave V latency < 6.0 ms, measured by a 100 μs click presented at 80 dB SPL to the right ear at 31.25 Hz).
Although we consider these children too young to have attained fully developed reading skills, and so refer to them as “pre-readers,” we note that many of them may have begun some explicit instruction. We did not formally evaluate their reading skills and acknowledge this as a limitation. Nevertheless, we suggest that our cohort represents children who have either not yet begun to learn to read, or are only in the first stages, and so offers novel insight into the relationship between phonological processing and auditory-neurophysiologic responses to speech early in life.
Parents provided informed consent for their children to participate in the study, and the subjects provided verbal assent. The Institutional Review Board of Northwestern University approved all procedures and children were paid $10/hr for their participation.
Behavioral Measure – Phonological Awareness
Phonological awareness was measured with the Clinical Evaluation of Language Fundamentals Preschool, 2nd edn., phonological awareness subtest (CELF 2; Wiig et al., 2004). The test evaluates a child’s knowledge of the sound structure of the English language and measures a child’s ability to manipulate sound through: compound word and syllable blending, sentence and syllable segmentation, and rhyme awareness and production. Raw scores were computed and used for analysis. The maximum score is 24, and higher scores correspond to better performance. All children met the age-appropriate “criterion” cutoff, indicating that they are within the range of typically developing children. Therefore, our data represent a cohort of children with developmentally appropriate performance on the phonological awareness test but with a large range of variability.
Auditory brainstem responses were elicited in response to the stop consonants [ba] and [ga]. Both consonant–vowel (CV) syllables were 170 ms stimuli that have been described previously (Hornickel et al., 2009). Briefly, both begin with a 5 ms stop burst and have a 50 ms transition from the consonant to the vowel. The vowel is sustained for 120 ms. Both stimuli have a flat fundamental frequency (F0 = 100 Hz) and during the 50 ms transition the first three formant frequencies shift. The [ba] and [ga] differ only in the F2 onset frequency (F2OF[ba] = 900 Hz; F2OF[ga] = 2480 Hz) but are identical in F2 frequency for the vowel portion (F2VOWEL = 1240 Hz; see Figure 1). The remaining formants are identical (F1 = 400–720 Hz; F3 = 2580–2500 Hz) with F4-6 steady through the 170 ms stimuli (F4 = 3300 Hz, F5 = 3750 Hz, F6 = 4900 Hz). Stimuli were presented monaurally to the right ear at 80.4 dB SPL through electromagnetically shielded insert earphones (ER-3, Etymotic Research, Elk Grove Village, IL, USA). Stimulus presentation was controlled by E-Prime 2.0 (Psychology Software Tools, Inc., Sharpsburg, PA, USA) and stimuli were presented in alternating polarity with an 81 ms interstimulus interval. 4200 sweeps of each stimulus were presented, and the presentation order was randomized for each subject.
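The acoustic contrast between the two syllables can be sketched numerically. The following is a minimal illustration only: it assumes that F2 glides linearly from its onset value to the vowel value and that the 50 ms transition starts at stimulus onset, neither of which is stated above. The function and constant names are hypothetical.

```python
# Sketch of the F2 contrast between [ba] and [ga].
# ASSUMPTIONS (not stated in the text): linear glide, transition spans 0-50 ms.

F2_ONSET_HZ = {"ba": 900.0, "ga": 2480.0}  # onset values given in the text
F2_VOWEL_HZ = 1240.0                       # shared vowel value given in the text
TRANSITION_MS = 50.0


def f2_hz(t_ms, onset_hz, vowel_hz=F2_VOWEL_HZ, transition_ms=TRANSITION_MS):
    """Return F2 at time t_ms under the assumed linear glide."""
    if t_ms <= 0.0:
        return onset_hz
    if t_ms >= transition_ms:
        return vowel_hz
    frac = t_ms / transition_ms
    return onset_hz + frac * (vowel_hz - onset_hz)


def f2_gap_hz(t_ms):
    """Return the F2 difference, [ga] minus [ba], at time t_ms."""
    return f2_hz(t_ms, F2_ONSET_HZ["ga"]) - f2_hz(t_ms, F2_ONSET_HZ["ba"])
```

Under these assumptions the F2 gap shrinks steadily across the transition and is zero throughout the vowel, which is why phase differences between the two responses are expected only in the transition region.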
FIGURE 1. (A) Schematic illustrating formant information for the [ba] and [ga] stimuli. The F0, F1, and F3 are identical. The stimuli differ in F2 onset frequencies with [ba] (blue) ascending and [ga] (green) descending to stabilize for the vowel portion. (B) Schematic illustrating the expected phase relationship between brainstem responses to the [ba] and [ga]. Since the [ga] (green) has a higher F2 onset frequency it is expected that brainstem responses to [ga] phase lead those to [ba]. (C,D) Grand average waveforms are displayed for the [ba] (top, blue) and [ga] (bottom, green).
Neurophysiology: Recording and Data Processing
Brainstem responses were collected using a BioSEMI Active2 recording system with ABR module. Active electrodes were placed at Cz and each ear with CMS/DRL placed on the forehead, one-half centimeter on either side of Fpz. Only ipsilateral (Cz-A2) responses were used in the analysis. Responses were digitized at 16.384 kHz and collected with online filters from 100–3000 Hz (20 dB/decade roll-off) in the BioSEMI ActiABR and recorded into LabView 2.0 (National Instruments, Austin, TX, USA). Since speech-evoked brainstem responses are ideally filtered with a highpass of 70 Hz (Skoe and Kraus, 2010), in MATLAB responses were offline amplified in the frequency domain with an inverse power ramp, 20 dB per decade for 3 decades below 100 Hz (i.e., from 0.1 to 100 Hz, then flat from 0.1 Hz down to DC). Next, a bandpass filter (70–2000 Hz, Butterworth filter, 12 dB/octave roll-off) was applied to frequency-amplified responses. Responses were epoched from -40 to 213 ms (stimulus onset at 0 ms) and baseline corrected. Artifact rejection was set at ±35 μV. Final responses comprised 2000 artifact-free sweeps of each polarity, and responses from alternating polarities were added to emphasize the envelope-following brainstem response while minimizing the influence of stimulus artifact and cochlear microphonic (Campbell et al., 2012).
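Three of the processing steps above can be sketched in a few lines: the low-frequency gain ramp, artifact rejection with averaging, and the addition of the two polarities. This is an illustrative sketch in plain Python, not the MATLAB routines used in the study. The function names are hypothetical, and it assumes that a sweep is rejected when any sample exceeds the threshold.

```python
import math


def ramp_gain_db(freq_hz, corner_hz=100.0, decades=3, slope_db=20.0):
    """Gain in dB: rises slope_db per decade below corner_hz, flat below the floor."""
    if freq_hz >= corner_hz:
        return 0.0
    floor_hz = corner_hz / (10 ** decades)  # 0.1 Hz for the values in the text
    f = max(freq_hz, floor_hz)              # hold the gain constant down to DC
    return slope_db * math.log10(corner_hz / f)


def average_sweeps(sweeps, limit_uv=35.0):
    """Average the sweeps whose samples all stay within +/- limit_uv.

    ASSUMPTION: a sweep is rejected if any single sample exceeds the limit.
    Returns (average_waveform, number_of_sweeps_kept).
    """
    kept = [s for s in sweeps if max(abs(v) for v in s) <= limit_uv]
    if not kept:
        raise ValueError("no artifact-free sweeps")
    n = len(kept)
    return [sum(col) / n for col in zip(*kept)], n


def add_polarities(avg_pos, avg_neg):
    """Sum the two polarity averages sample by sample."""
    return [p + q for p, q in zip(avg_pos, avg_neg)]
```

Summing the two polarities keeps components that have the same sign in both averages, such as the envelope-following response, and cancels components that invert with stimulus polarity, such as stimulus artifact.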
Neurophysiology Data Analysis: Phase Distinction between Responses
Due to the tonotopicity of the ascending auditory system, stimuli that differ in frequency elicit brainstem responses which are out of phase (Gorga et al., 1988). Therefore, it is expected that responses to [ba] and [ga] begin out of phase from each other (during the transition portion) and are phase-aligned during the vowel, when the stimuli are acoustically identical. In this regard, the relative phases of the two responses are used as proxies for the relative timing of the responses at each frequency. A schematic illustrating this expected relationship is presented in Figure 1.
The phase relationship between responses to [ba] and [ga] was measured using custom routines in MATLAB (Skoe et al., in press). Responses were divided into overlapping 20 ms windows from -40 to 170 ms (1 ms separating each adjacent window) and ramped with a 20 ms Hanning window. The cross-power spectral density function (cpsd) was applied between brainstem responses, and power estimates were converted to phase angles to index alignment of the two signals. A larger phase angle (in radians) indicates that the responses are farther out of phase and, therefore, that there is a larger timing lag between responses at a given frequency. A three-dimensional “cross-phaseogram” figure was constructed illustrating time (ms, x-axis), frequency (Hz, y-axis), and phase angle (radians, colorbar). During the transition region, positive phase angles indicate better neural consonant distinction, as this indicates that [ga] phase-leads [ba], the expected relationship since [ga] has a higher F2OF.
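The windowed cross-phase computation can be sketched as follows. This is a simplified stand-in for the MATLAB routines, not a reimplementation of them: it uses a single Hann-windowed discrete Fourier coefficient per window and frequency rather than a Welch-style spectral density estimate. All function names are hypothetical. The sign convention is chosen so that a positive angle means the first signal leads the second.

```python
import cmath
import math


def hann(n):
    """Symmetric Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def dft_at(segment, window, freq_hz, fs_hz):
    """Windowed Fourier coefficient of `segment` at one frequency."""
    step = -2j * math.pi * freq_hz / fs_hz
    return sum(x * w * cmath.exp(step * i)
               for i, (x, w) in enumerate(zip(segment, window)))


def cross_phase(resp_ga, resp_ba, fs_hz, freqs_hz, win_ms=20.0, hop_ms=1.0):
    """Return grid[i][j]: phase (rad) at freqs_hz[i] for the j-th window.

    Positive values mean resp_ga phase-leads resp_ba at that frequency.
    """
    n_win = int(round(win_ms * fs_hz / 1000.0))
    hop = max(1, int(round(hop_ms * fs_hz / 1000.0)))
    window = hann(n_win)
    starts = range(0, len(resp_ga) - n_win + 1, hop)
    grid = []
    for f in freqs_hz:
        row = []
        for s in starts:
            ga = dft_at(resp_ga[s:s + n_win], window, f, fs_hz)
            ba = dft_at(resp_ba[s:s + n_win], window, f, fs_hz)
            # Angle of the cross-spectrum: phase of ga minus phase of ba.
            row.append(cmath.phase(ga * ba.conjugate()))
        grid.append(row)
    return grid
```

For a component at frequency f, a lead of τ seconds corresponds to a phase angle of 2πfτ radians, so a 0.2 ms lead at 500 Hz appears as roughly 0.63 rad. This is the sense in which phase serves as a proxy for relative timing.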
Scores on the CELF formed a normal distribution with a mean score of 18.96 (SD, 3.80; Kolmogorov–Smirnov D(26) = 0.146, p = 0.160). Children were grouped based on their performance on the CELF with a median split defining the top phonological awareness “Top PA” (CELF > 19, N = 14, 6 female) and bottom phonological awareness “Bottom PA” (CELF < 19, N = 12, 6 female) groups. Five subjects in the Top PA performed at ceiling on the test (scores of 24). Each group included two children with a family history of dyslexia. Groups did not differ in distribution of males and females (χ2 = 0.154, p = 0.70) nor on non-verbal intelligence (matrix reasoning subtest, Wechsler Preschool and Primary Scale of Intelligence, Revised; Wechsler, 1989; p = 0.35). As expected, the groups did statistically differ in performance on the CELF, t(24) = 9.11, p < .001, Cohen’s d = 3.56. Summary statistics for the two groups are presented in Table 1.
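The grouping and the group comparison can be sketched with the standard library. This is an illustration under stated assumptions rather than the analysis code used here. Scores equal to the median are sent to the lower group, which is an assumption because the text does not say how ties at the median were handled. The t statistic uses the pooled-variance (equal variance) form, and Cohen's d is computed from the pooled standard deviation. Function names are hypothetical.

```python
import math
import statistics


def median_split(scores):
    """Return (top, bottom) groups around the median.

    ASSUMPTION: scores equal to the median are placed in the bottom group.
    """
    med = statistics.median(scores)
    top = [s for s in scores if s > med]
    bottom = [s for s in scores if s <= med]
    return top, bottom


def pooled_t_and_d(a, b):
    """Return (t, df, d): pooled-variance t statistic, its df, and Cohen's d."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / df
    sd = math.sqrt(pooled_var)
    diff = statistics.mean(a) - statistics.mean(b)
    d = diff / sd
    t = diff / (sd * math.sqrt(1.0 / na + 1.0 / nb))
    return t, df, d
```

With this formulation the two statistics are tied together by the group sizes, since t equals d divided by the square root of (1/n1 + 1/n2).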
Summary of Results
Group average cross-phaseograms are presented in Figure 2. The Top phonological awareness group (Top PA) evinces a large phase distinction corresponding in time to the transitions in the stimuli, which occurs in the responses from approximately 300–700 Hz (indicated by a large orange–red swatch) and a more moderate phase shift from approximately 750–1000 Hz. Conversely, relatively small phase distinctions were observed in the bottom group (Bottom PA) suggesting that the frequency difference between the stimuli was not strongly represented in these children. Phase distinctions for individual subjects are presented in Figure 3, along with group means. No phase distinctions were observed in the response region corresponding to the steady state vowel in either group, as is expected since the stimuli are acoustically identical in the vowel portions.
FIGURE 2. Group average cross-phaseograms are presented for the Top phonological awareness (PA) and Bottom PA groups. The Top PA more strongly codes the difference between the [ba] and [ga] stimuli, as indicated by the large red–orange swatches in response to the CV transition.
Phase Distinctions in the Consonant-Vowel Transition, 300-700 Hz
Mean phase angle distinctions were calculated for the lower frequency region (15-55 ms × 300-700 Hz). This was the primary region of interest, since it corresponds best to previous reports (Skoe et al., in press; Parbery-Clark et al., 2012). In the Top PA there was a larger mean phase distinction than in the Bottom PA group, t(24) = 2.61, p = .015, Cohen’s d = 1.07. See Table 2 for descriptive statistics. Since there was a slightly skewed distribution in the Top PA group, this comparison was repeated, and confirmed, with the non-parametric Mann–Whitney U test (p = 0.015).
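A region-of-interest mean of this kind reduces to averaging the cells of the time-by-frequency phase grid that fall inside the chosen bounds. The sketch below assumes a plain arithmetic mean of the phase angles (not a circular mean) and inclusive bounds, and it assumes a grid indexed as frequency rows by time columns. These choices and the function name are illustrative assumptions.

```python
def roi_mean(grid, times_ms, freqs_hz, t_range, f_range):
    """Mean of grid[i][j] over cells inside the time and frequency bounds.

    grid[i][j] holds the phase angle (rad) at freqs_hz[i] and times_ms[j].
    Bounds are inclusive: t_range = (t_min, t_max), f_range = (f_min, f_max).
    """
    values = [grid[i][j]
              for i, f in enumerate(freqs_hz) if f_range[0] <= f <= f_range[1]
              for j, t in enumerate(times_ms) if t_range[0] <= t <= t_range[1]]
    if not values:
        raise ValueError("region contains no samples")
    return sum(values) / len(values)
```

For the primary region described above, the call would take the form `roi_mean(grid, times_ms, freqs_hz, (15, 55), (300, 700))`, giving one value per subject that can then be compared between groups.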
TABLE 2. Mean phase distinctions for the two frequency ranges are reported for each group (in rad, with SDs).
Individual phase distinctions for each group are presented in Figure 3. The majority of subjects in the Top PA group had positive phase distinctions whereas most subjects in the Bottom PA group had either very small phase distinctions or distinctions in the opposite of the expected direction (i.e., responses to [ba] phase lead those to [ga]). It is also noteworthy that the magnitude of the largest phase distinctions in the Top PA group exceeds those observed in the Bottom PA group.
FIGURE 3. (A,B) Phase distinctions for individual subjects (15-55 ms × 300-700 Hz) are presented for the Top PA (Panel A, red) and Bottom PA (Panel B, black) groups. (C) Group means are presented. Error bars, ± 1 SEM; *p < 0.05.
Finally, a trending correlation between CELF score and phase distinction was observed (Spearman’s ρ(26) = 0.38, p = .056) with higher scores on the CELF corresponding to larger phase distinction. We suspect that with a larger subject group, and relatively fewer subjects at ceiling on the CELF, this relationship would be stronger.
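A rank correlation of this kind can be sketched without external libraries. Tied scores, such as several subjects at the test ceiling, receive the average of the ranks they span, and Spearman's ρ is then the Pearson correlation of the two rank vectors. This is a generic textbook formulation offered as an illustration, with hypothetical function names, and it does not compute a p value.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0  # positions i..j share one rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation of tie-averaged ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

Because only ranks enter the calculation, a cluster of identical ceiling scores contributes no ordering information among those subjects, which is one way ceiling effects can weaken an observed correlation.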
Phase Distinctions in the Consonant-Vowel Transition, 750-1000 Hz
To further explore group differences, and to ensure that there were no phase distinctions in response to the vowel, additional analyses were pursued. The first analysis focused on the higher frequency phase distinction (25-55 ms × 750-1000 Hz). As indicated in Figure 2, the Top PA group had a larger mean phase distinction than the Bottom PA group, t(24) = 2.00, p = 0.06, Cohen’s d = 0.82.
Phase Distinctions in the Vowel Region
Since the [ba] and [ga] stimuli are identical in the steady state vowel portions no phase distinction is expected in this region. This is reflected in Figure 2 where green indicates a phase difference of about 0 rad. To confirm this statistically, mean phase angles were calculated for the same frequency regions as in the transition from 60 to 155 ms. There were no group differences in phase distinction for the lower frequency region (300–700 Hz; t(24) = 0.81, p = 0.427) or higher frequency region (750–1000 Hz, t(24) = 0.55, p = 0.589).
We assessed the physiologic discrimination of stop consonants in a group of 4-year-old children and reveal a link between this discrimination and phonological awareness. Children with higher phonological awareness had superior neural discrimination of the stop consonants [ba] and [ga], as inferred by far-field electrophysiology. Conversely, children who performed worse on the phonological awareness test, on average, did not robustly distinguish these speech sounds. This relationship has previously been observed in school age children, with neural speech discrimination varying in concert with phonological awareness (Hornickel et al., 2009). By demonstrating this relationship in pre-school children too young to have attained full reading competence we can begin to trace the developmental trajectory of the primitives necessary for complex language-based tasks such as reading. However, we do not know if these children with weak consonant differentiation will soon develop a strong neural differentiation and end up as normal readers, or if they will face challenges as they learn to read. The latter possibility would suggest that these children are at risk for a reading disorder. Regardless, this relationship highlights the role of central auditory processing in developing language skills, and complements phonological deficit theories of reading.
One interpretation of the current results is that they reflect different levels of maturation. The auditory system undergoes rapid developmental plasticity through the first several years of life, and this is reflected in subcortical (Johnson et al., 2008; Skoe et al., in press) and cortical evoked potentials (Choudhury and Benasich, 2011). Individual differences in this rate of maturation may explain the variability in the [ba]-[ga] phase distinction. We do not think this vitiates the link between subcortical auditory function and phonological processing, however. After all, slower maturation of the neural processes important for phonological development may set certain children at a disadvantage when they begin learning to read. Nevertheless, the functional developmental consequences of this maturation for reading (dis)ability remain to be seen.
Temporal Sampling: A System-Wide Perspective
Goswami (2011) has proposed a theoretical framework to understand developmental dyslexia, the “temporal sampling framework” (TSF). Under TSF, the core deficit in dyslexia is phonological and is due to impaired oscillatory phase locking for low frequency temporal coding in auditory cortex. An attractive feature of TSF is that it resolves many apparent discrepancies between competing theories of developmental dyslexia. Support for TSF may be found in a large series of psychophysical and neurophysiologic investigations (Witton et al., 1998; Goswami et al., 2002, 2011; Serniclaes et al., 2004; Noordenbos et al., 2012, 2013; Leong and Goswami, 2013; Power et al., 2013).
Although TSF predicts deficient slow cortical phase locking in dyslexia (at rates < 30 Hz; Goswami, 2011), our demonstration of a link between high frequency phase locking in the subcortical auditory system and phonological processing may also be consistent with TSF. While TSF predicts superior cortical phase locking at fast rates for dyslexics, and here we report a deficit for high frequency temporal coding in auditory midbrain, we advocate for a systems-level perspective with different “optimal” rates of phase locking as a function of the physiology of the site of interest along the auditory pathway. We see this as compatible with TSF because our view is that the auditory system is best thought of as an integrated circuit that interacts dynamically with cognitive, reward, and other sensory systems (Kraus and Nicol, in press; Kraus and Chandrasekaran, 2010; Bajo and King, 2012; Anderson et al., 2013). The subcortical evoked response we analyzed in the current paper (and others from our group) is generated predominantly by inferior colliculus (IC; for review see Chandrasekaran and Kraus, 2010). Most IC neurons phase lock in the range of 100–1000 Hz (Liu et al., 2006) which is 10-fold the range of the impaired theta and delta oscillatory phase locking in auditory cortex observed in dyslexia (Goswami, 2011). Optimal phonological coding may rely on the interaction of rapid temporal sampling in IC with relatively slow sampling in auditory cortex. Indeed, Abrams et al. (2006) reported that subcortical timing was linked to the temporal integrity of auditory cortical speech coding. Wible et al. (2005) reported correlated subcortical and cortical neural synchrony in representing speech, both of which were diminished in children with language-based learning problems.
That said, relatively little is known about the temporal coding of low frequency information in IC (i.e., < 30 Hz), which may in fact be deficient in dyslexics. Recordings from cat IC do demonstrate phase locking as low as 10 Hz (Langner and Schreiner, 1988); however, the lower limits of phase locking in human IC, and more broadly the oscillatory dynamics of IC, remain an avenue for future research. Temporal coding at multiple rates may occur in parallel through the auditory pathway; evidence from a guinea pig model suggests that a paralemniscal thalamocortical pathway relays slow temporal information to auditory cortex in parallel with fast temporal information relayed through a lemniscal pathway (Abrams et al., 2011). Therefore, a full elucidation of the relationship between auditory phase locking and reading ability on a system-wide level will likely have to accommodate simultaneous temporal coding at multiple rates.
Further support for this integrated view of system-wide temporal coding comes from the rhythm perception literature, which has connected poor reading with an impaired ability to entrain to an external beat and impoverished perception of musical meter (Thomson et al., 2006; Huss et al., 2011; Tierney and Kraus, 2013b). This rhythmic entrainment seems to rely on auditory cortical phase locking (Power et al., 2012). However, the ability to entrain to an external beat is also linked to rapid subcortical phase locking and neural synchrony (Tierney and Kraus, 2013a), again suggesting that phase locking across multiple temporal rates may support perceptual skills linked to reading, if not phonological processing itself. An overarching theoretical framework for reading, then, may have to include relatively rapid subcortical phase locking as a key component that interacts with slower cortical oscillatory sampling. Both rapid and slow sampling likely rely on the synchronous firing of neurons in the auditory system, which supports precise representation of transient sounds (McGinley et al., 2012), 0.1 ms precision timing (Anderson et al., 2012), and speech discrimination (Engineer et al., 2008). And once again, dyslexia has been linked to deficits in neural synchrony as observed in humans (Hornickel and Kraus, 2013) and a rat model (Centanni et al., 2013).
This view would also be consistent with the Rapid Auditory Processing theory of developmental dyslexia (RAP; Tallal, 1980; Benasich and Tallal, 2002). Decreased sensitivity to rapidly changing phonological features could drive the impoverished distinction between speech sounds. Previous work has demonstrated that lengthening formant transitions in speech can improve the cortical discrimination of speech sounds (Bradlow et al., 1999; Steinschneider and Fishman, 2011), but it is unknown what effect this has on subcortical discrimination. Finally, we note that our findings would be broadly consistent with the view that there are general, non-linguistic sensory deficits in dyslexia (Wright et al., 1997; Stein, 2001; Ahissar et al., 2006). Future work, therefore, should consider the interactions of acoustics, phonemics, and behavioral relevance in subcortical temporal processing.
A Biomarker for Subsequent Reading Ability?
Although it is important to develop and refine empirically based theories of reading, it is also important to develop methods to identify children at risk for reading disorders. Previous neurophysiologic studies have identified cortical predictors of dyslexia, such as slower right hemisphere polarity shifts in evoked responses to speech (Guttorm et al., 2005; for review, see Leppänen et al., 2012). The structural integrity and volume of left arcuate fasciculus is diminished in young children with poor phonological awareness (Saygin et al., 2013). Performance on speech perception tasks is also predictive (Benasich and Tallal, 2002), in addition to oscillatory dynamics in the infant brain (Gou et al., 2011). While the current analysis is not longitudinal, the techniques employed here may one day be useful for predicting future reading ability, either independently or as a complement to existing techniques. In fact, the children in the current study will be tracked over the next several years in hopes of identifying early predictors of subsequent reading ability. We note that the CELF phonological awareness test combines many subskills under the phonological awareness construct (Wiig et al., 2004); it is unknown if group differences are driven primarily by one or two of these subskills, and future investigation is warranted to look specifically at which aspects of phonological awareness are linked to auditory system development.
There are a number of attractive features of the “cross-phaseogram” as a potential biomarker. For one, it is a fast and objective automated procedure. Moreover, as we illustrate here, this measure relates to individual differences in language-based skills. Subcortical evoked responses to speech are relatively easy to obtain and meaningful in individuals (Skoe and Kraus, 2010). From a practical standpoint, responses may be elicited when a child is sleeping or watching a video, thereby eliminating the need for subject compliance in task-related physiologic measurements. And by following the children in this study longitudinally, we may be able to explore individual differences in neurophysiology that distinguish between individual presentations of dyslexia.
Dyslexia, Treatment, and the Search for a Core Deficit
A number of short-term interventions have been employed to improve phonological abilities and reading skills, and these offer further insight into the biological foundations of reading. Some of these studies have focused on perceptual deficits related to poor phonological processing (Tallal et al., 1996; Temple et al., 2003). Other interventions have been broader, such as assistive listening devices that improve classroom signal-to-noise ratios by directing attention to meaningful sound – and in fact also improve neural synchrony in response to speech (Hornickel et al., 2012). Non-speech training such as playing action video games, which improve attentional abilities (Green and Bavelier, 2012), can also improve reading skills (Franceschini et al., 2013), suggesting a role for non-auditory mechanisms in reading development and/or remediation.
Music training, which engenders a host of auditory perceptual and cognitive benefits, may also hold promise. Since precise temporal coding of sound supports fundamental reading skills, and this coding is strengthened by musical experience, it stands to reason that music training may promote the development of reading-related skills (Tierney and Kraus, 2013c). In fact, music experience has been directly linked to improved phonological skills and reading (Moreno et al., 2009; Besson et al., 2011), in addition to physiologic discrimination of speech sounds, as presented in the current study (Parbery-Clark et al., 2012; Strait et al., 2013). Given the established link between rhythm skills and phonological abilities, the rhythmic components of music training may be especially important for developing language-based skills. In fact, Bhide et al. (2013) reported that a comprehensive rhythm training regimen improves phonological skills.
To understand the biological bases of reading, and develop strategies that engender reading skills and remediate dyslexia, it is important to identify which skills to target. In this regard, the quest for the core deficit is important. That said, this search may at times cloud the principal problem, namely, that certain children have tremendous difficulty learning to read. Moreover, the possibility remains that no single deficit accounts for every child who has difficulty reading. Our view is that even without a full understanding of the pathophysiology of dyslexia it is important to identify children at risk as early as possible. Here we have identified a neural correlate of early phonological awareness in pre-school age children. Due to the importance of precise phonological representations for reading this correlate may indicate a biological bottleneck certain children face when they begin to learn to read.
Author Contributions
Travis White-Schwoch and Nina Kraus designed the study; Travis White-Schwoch oversaw data collection and analyzed the data; Travis White-Schwoch and Nina Kraus wrote the paper.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank members of the Auditory Neuroscience Laboratory for their assistance with data collection, especially Dana L. Strait for contributions to study development. We also thank Trent Nicol, Jane Hornickel, Adam Tierney, Karen Chan Barrett, and Samira Anderson for their input on earlier drafts of the manuscript. Supported by NIH R01HD069414 and the Knowles Hearing Center.
References
Abrams, D. A., Nicol, T., Zecker, S., and Kraus, N. (2009). Abnormal cortical processing of the syllable rate of speech in poor readers. J. Neurosci. 29, 7686–7693. doi: 10.1523/JNEUROSCI.5242-08.2009
Abrams, D. A., Nicol, T., Zecker, S., and Kraus, N. (2011). A possible role for a paralemniscal auditory pathway in the coding of slow temporal information. Hear. Res. 272, 125–134. doi: 10.1016/j.heares.2010.10.009
Anderson, S., White-Schwoch, T., Parbery-Clark, A., and Kraus, N. (2013). A dynamic auditory-cognitive system supports speech-in-noise perception in older adults. Hear. Res. 300, 18–32. doi: 10.1016/j.heares.2013.03.006
Bhide, A., Power, A., and Goswami, U. (2013). A rhythmic musical intervention for poor readers: a comparison of efficacy with a letter-based intervention. Mind Brain Educ. 7, 113–123. doi: 10.1111/mbe.12016
Bradlow, A. R., Kraus, N., Nicol, T. G., McGee, T. J., Cunningham, J., Zecker, S. G., et al. (1999). Effects of lengthened formant transition duration on discrimination and neural representation of synthetic CV syllables by normal and learning-disabled children. J. Acoust. Soc. Am. 106, 2086. doi: 10.1121/1.427953
Campbell, T., Kerlin, J. R., Bishop, C. W., and Miller, L. M. (2012). Methods to eliminate stimulus transduction artifact from insert earphones during electroencephalography. Ear Hear. 33, 144. doi: 10.1097/AUD.0b013e3182280353
Centanni, T., Booker, A., Sloan, A., Chen, F., Maher, B., Carraway, R., et al. (2013). Knockdown of the dyslexia-associated gene kiaa0319 impairs temporal responses to speech stimuli in rat primary auditory cortex. Cereb. Cortex doi: 10.1093/cercor/bht028 [Epub ahead of print].
Chandrasekaran, B., Hornickel, J., Skoe, E., Nicol, T., and Kraus, N. (2009). Context-dependent encoding in the human auditory brainstem relates to hearing speech in noise: implications for developmental dyslexia. Neuron 64, 311–319. doi: 10.1016/j.neuron.2009.10.006
Choudhury, N., and Benasich, A. A. (2011). Maturation of auditory evoked potentials from 6 to 48 months: prediction to 3 and 4 year language and cognitive abilities. Clin. Neurophysiol. 122, 320–338. doi: 10.1016/j.clinph.2010.05.035
Engineer, C. T., Perez, C. A., Chen, Y. T. H., Carraway, R. S., Reed, A. C., Shetake, J. A., et al. (2008). Cortical activity patterns predict speech discrimination ability. Nat. Neurosci. 11, 603–608. doi: 10.1038/nn.2109
Franceschini, S., Gori, S., Ruffino, M., Viola, S., Molteni, M., and Facoetti, A. (2013). Action video games make dyslexic children read better. Curr. Biol. 23, 462–466. doi: 10.1016/j.cub.2013.01.044
Goswami, U., Fosker, T., Huss, M., Mead, N., and Szűcs, D. (2011). Rise time and formant transition duration in the discrimination of speech sounds: the Ba–Wa distinction in developmental dyslexia. Dev. Sci. 14, 34–43. doi: 10.1111/j.1467-7687.2010.00955.x
Goswami, U., Thomson, J., Richardson, U., Stainthorp, R., Hughes, D., Rosen, S., et al. (2002). Amplitude envelope onsets and developmental dyslexia: a new hypothesis. Proc. Natl. Acad. Sci. U.S.A. 99, 10911–10916. doi: 10.1073/pnas.122368599
Gou, Z., Choudhury, N., and Benasich, A. A. (2011). Resting frontal gamma power at 16, 24 and 36 months predicts individual differences in language and cognition at 4 and 5 years. Behav. Brain Res. 220, 263–270. doi: 10.1016/j.bbr.2011.01.048
Guttorm, T. K., Leppänen, P. H., Poikkeus, A.-M., Eklund, K. M., Lyytinen, P., and Lyytinen, H. (2005). Brain event-related potentials (ERPs) measured at birth predict later language development in children with and without familial risk for dyslexia. Cortex 41, 291–303. doi: 10.1016/S0010-9452(08)70267-3
Hornickel, J., Skoe, E., Nicol, T., Zecker, S., and Kraus, N. (2009). Subcortical differentiation of stop consonants relates to reading and speech-in-noise perception. Proc. Natl. Acad. Sci. U.S.A. 106, 13022–13027. doi: 10.1073/pnas.0901123106
Hornickel, J., Zecker, S. G., Bradlow, A. R., and Kraus, N. (2012). Assistive listening devices drive neuroplasticity in children with dyslexia. Proc. Natl. Acad. Sci. U.S.A. 109, 16731–16736. doi: 10.1073/pnas.1206628109
Huss, M., Verney, J. P., Fosker, T., Mead, N., and Goswami, U. (2011). Music, rhythm, rise time perception and developmental dyslexia: perception of musical meter predicts reading and phonology. Cortex 47, 674–689. doi: 10.1016/j.cortex.2010.07.010
Kovelman, I., Norton, E. S., Christodoulou, J. A., Gaab, N., Lieberman, D. A., Triantafyllou, C., et al. (2012). Brain basis of phonological awareness for spoken language in children and its disruption in dyslexia. Cereb. Cortex 22, 754–764. doi: 10.1093/cercor/bhr094
Kraus, N., McGee, T. J., Carrell, T. D., Zecker, S. G., Nicol, T. G., and Koch, D. B. (1996). Auditory neurophysiologic responses and discrimination deficits in children with learning problems. Science 273, 971–973. doi: 10.1126/science.273.5277.971
Lallier, M., Donnadieu, S., and Valdois, S. (2013). Developmental dyslexia: exploring how much phonological and visual attention span disorders are linked to simultaneous auditory processing deficits. Ann. Dyslexia 63, 97–116. doi: 10.1007/s11881-012-0074-4
Lehongre, K., Ramus, F., Villiermet, N., Schwartz, D., and Giraud, A.-L. (2011). Altered low-gamma sampling in auditory cortex accounts for the three main facets of dyslexia. Neuron 72, 1080–1090. doi: 10.1016/j.neuron.2011.11.002
Leong, V., and Goswami, U. (2013). Assessment of rhythmic entrainment at multiple timescales in dyslexia: evidence for disruption to syllable timing. Hear. Res. doi: 10.1016/j.heares.2013.07.015 [Epub ahead of print].
Leppänen, P., Hämäläinen, J., Guttorm, T., Eklund, K., Salminen, H., Tanskanen, A., et al. (2012). Infant brain responses associated with reading-related skills before school and at school age. Neurophysiol. Clin. Neurophysiol. 42, 35–41. doi: 10.1016/j.neucli.2011.08.005
Livingstone, M. S., Rosen, G. D., Drislane, F. W., and Galaburda, A. M. (1991). Physiological and anatomical evidence for a magnocellular defect in developmental dyslexia. Proc. Natl. Acad. Sci. U.S.A. 88, 7943–7947. doi: 10.1073/pnas.88.18.7943
McGinley, M. J., Liberman, M. C., Bal, R., and Oertel, D. (2012). Generating synchrony from the asynchronous: compensation for cochlear traveling wave delays by the dendrites of individual brainstem neurons. J. Neurosci. 32, 9301–9311. doi: 10.1523/JNEUROSCI.0272-12.2012
Moreno, S., Marques, C., Santos, A., Santos, M., Besson, M., and Castro, S. L. (2009). Musical training influences linguistic abilities in 8-year-old children: more evidence for brain plasticity. Cereb. Cortex 19, 712–723. doi: 10.1093/cercor/bhn120
Noordenbos, M. W., Segers, E., Serniclaes, W., Mitterer, H., and Verhoeven, L. (2012). Neural evidence of allophonic perception in children at risk for dyslexia. Neuropsychologia 50, 2010–2017. doi: 10.1016/j.neuropsychologia.2012.04.026
Noordenbos, M. W., Segers, E., Serniclaes, W., and Verhoeven, L. (2013). Neural evidence of the allophonic mode of speech perception in adults with dyslexia. Clin. Neurophysiol. 124, 1151–1162. doi: 10.1016/j.clinph.2012.12.044
Parbery-Clark, A., Tierney, A., Strait, D. L., and Kraus, N. (2012). Musicians have fine-tuned neural distinction of speech syllables. Neuroscience 219, 111–119. doi: 10.1016/j.neuroscience.2012.05.042
Power, A. J., Mead, N., Barnes, L., and Goswami, U. (2012). Neural entrainment to rhythmically presented auditory, visual, and audio-visual speech in children. Front. Psychol. 3:216. doi: 10.3389/fpsyg.2012.00216
Pugh, K. R., Landi, N., Preston, J. L., Mencl, W. E., Austin, A. C., Sibley, D., et al. (2013). The relationship between phonological and auditory processing and brain organization in beginning readers. Brain Lang. 125, 173–183. doi: 10.1016/j.bandl.2012.04.004
Ramus, F., Marshall, C. R., Rosen, S., and van der Lely, H. K. (2013). Phonological deficits in specific language impairment and developmental dyslexia: towards a multidimensional model. Brain 136, 630–645. doi: 10.1093/brain/aws356
Sandak, R., Mencl, W. E., Frost, S. J., and Pugh, K. R. (2004). The neurobiological basis of skilled and impaired reading: recent findings and new directions. Sci. Stud. Read. 8, 273–292. doi: 10.1207/s1532799xssr0803_6
Saygin, Z. M., Norton, E. S., Osher, D. E., Beach, S. D., Cyr, A. B., Ozernov-Palchik, O., et al. (2013). Tracking the roots of reading ability: white matter volume and integrity correlate with phonological awareness in prereading and early-reading kindergarten children. J. Neurosci. 33, 13251–13258. doi: 10.1523/JNEUROSCI.4383-12.2013
Serniclaes, W., Heghe, S. V., Mousty, P., Carré, R., and Sprenger-Charolles, L. (2004). Allophonic mode of speech perception in dyslexia. J. Exp. Child Psychol. 87, 336–361. doi: 10.1016/j.jecp.2004.02.001
Steinschneider, M., and Fishman, Y. I. (2011). Enhanced physiologic discriminability of stop consonants with prolonged formant transitions in awake monkeys based on the tonotopic organization of primary auditory cortex. Hear. Res. 271, 103–114. doi: 10.1016/j.heares.2010.04.008
Strait, D. L., O’Connell, S., Parbery-Clark, A., and Kraus, N. (2013). Musicians’ enhanced neural differentiation of speech sounds arises early in life: developmental evidence from ages three to thirty. Cereb. Cortex doi: 10.1093/cercor/bht103 [Epub ahead of print].
Swan, D., and Goswami, U. (1997). Phonological awareness deficits in developmental dyslexia and the phonological representations hypothesis. J. Exp. Child Psychol. 66, 18–41. doi: 10.1006/jecp.1997.2375
Tallal, P., Miller, S. L., Bedi, G., Byma, G., Wang, X., Nagarajan, S. S., et al. (1996). Language comprehension in language-learning impaired children improved with acoustically modified speech. Science 271, 81–84. doi: 10.1126/science.271.5245.81
Temple, E., Deutsch, G. K., Poldrack, R. A., Miller, S. L., Tallal, P., Merzenich, M. M., et al. (2003). Neural deficits in children with dyslexia ameliorated by behavioral remediation: evidence from functional MRI. Proc. Natl. Acad. Sci. U.S.A. 100, 2860–2865. doi: 10.1073/pnas.0030098100
Tierney, A. T., and Kraus, N. (2013c). “Music training for the development of reading skills,” in Applying Brain Plasticity to Advance and Recover Human Ability, Progress in Brain Research, eds M. M. Merzenich, M. Nahum, and T. van Vleet (London: Elsevier).
Witton, C., Talcott, J., Hansen, P., Richardson, A., Griffiths, T., Rees, A., et al. (1998). Sensitivity to dynamic auditory and visual stimuli predicts nonword reading ability in both dyslexic and normal readers. Curr. Biol. 8, 791–797. doi: 10.1016/S0960-9822(98)70320-3
Wright, B. A., Bowen, R. W., and Zecker, S. G. (2000). Nonlinguistic perceptual deficits associated with reading and language disorders. Curr. Opin. Neurobiol. 10, 482–486. doi: 10.1016/S0959-4388(00)00119-7
Keywords: reading, dyslexia, temporal sampling, phase locking, brainstem, phonological awareness, temporal coding
Citation: White-Schwoch T and Kraus N (2013) Physiologic discrimination of stop consonants relates to phonological skills in pre-readers: a biomarker for subsequent reading ability? Front. Hum. Neurosci. 7:899. doi: 10.3389/fnhum.2013.00899
Received: 14 August 2013; Accepted: 10 December 2013;
Published online: 24 December 2013.
Edited by: Marie Lallier, Basque Center on Cognition Brain and Language, Spain
Reviewed by: Willy Serniclaes, Université Libre de Bruxelles, Belgium
Jenny Thomson, University of Sheffield, UK
Copyright © 2013 White-Schwoch and Kraus. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Nina Kraus, Auditory Neuroscience Laboratory, Northwestern University, 2240 Campus Drive, Evanston 60208, IL, USA e-mail: firstname.lastname@example.org
† Submitted for: “Oscillatory ‘Temporal Sampling’ and Developmental Dyslexia: Towards an Overarching Theoretical Framework” (Eds. Usha Goswami, Andrea Facoetti, Marie Lallier & Alan Power).