Original Research ARTICLE
Phonemic restoration in developmental dyslexia
- 1Department of Psychology, University of Connecticut, Storrs, CT, USA
- 2Haskins Laboratories, New Haven, CT, USA
- 3Department of Speech, Language, and Hearing Sciences, University of Connecticut, Storrs, CT, USA
The comprehension of fluent speech in one's native language requires that listeners integrate the detailed acoustic-phonetic information available in the sound signal with linguistic knowledge. This interplay is especially apparent in the phoneme restoration effect, a phenomenon in which a missing phoneme is “restored” via the influence of top-down information from the lexicon and through bottom-up acoustic processing. Developmental dyslexia is a disorder characterized by an inability to read at the level of one's peers without any clear failure due to environmental influences. In the current study we utilized the phonemic restoration illusion paradigm to examine individual differences in phonemic restoration across a range of reading ability, from very good to dyslexic readers. Results demonstrate that restoration occurs less in those who have high scores on measures of phonological processing. Based on these results, we suggest that the processing or representation of acoustic detail may not be as reliable in poor and dyslexic readers, with the result that lexical information is more likely to override acoustic properties of the stimuli. This pattern of increased restoration could result from a failure of perceptual tuning, in which unstable representations of speech sounds result in the acceptance of non-speech sounds as speech. An additional or alternative theory is that degraded or impaired phonological processing at the speech sound level may reflect architecture that is overly plastic and consequently fails to stabilize appropriately for speech sound representations. Therefore, the inability to separate speech and noise may result as a deficit in separating noise from the acoustic signal.
Developmental dyslexia refers to an inability to read at grade level despite adequate instruction, intellectual ability, motivation and regardless of socioeconomic status (Berninger, 2001). Developmental dyslexia, which will be referred to as dyslexia for the remainder of this paper, is estimated to effect 4–12% of the population and is resilient into adulthood (see Gabrieli, 2009 for review). Deficits in phonological awareness have long been the primary hallmark of dyslexia (Bradley and Bryant, 1978; Liberman et al., 1989). An effort to discover a more general deficit, that culminates in spectrum of reading difficulty, has led to this population being described as having a specific reading disability (RD) (see Shaywitz and Shaywitz, 2005 for review). It is hotly debated whether this reading impairment arises from deficits in phonological awareness, rapid auditory processing, visual motion, or noise exclusion. It has been suggested that the phonological deficit in those with dyslexia may originate from an auditory-perceptual deficit (Richardson et al., 2004). This hypothesis stems from evidence of impaired talker identification (Perrachione et al., 2011), frequency discrimination (Banai and Ahissar, 2004; Ahissar et al., 2006; Ahissar, 2007) and difficulty with amplitude modulation (Goswami et al., 2002; Richardson et al., 2004). There has also been some evidence suggesting that individuals with dyslexia exhibit a more general speech perception deficit (Manis et al., 1997; Hazan, 1998).
In studies of speech perception in quiet, only a small portion of dyslexic individuals struggled with speech perception (Manis et al., 1997; Adlard and Hazan, 1998). Evidence from Ziegler et al. (2009) suggests that dyslexic children struggle with speech perception in noise more profoundly than their age-matched or reading level-matched peers. This suggests that speech perception in dyslexia may be more susceptible to noise interference than in age-matched or reading-matched peers (Ziegler et al., 2009). This result is very similar to the speech-in-noise deficit seen in children with a specific language impairment (SLI) (Ziegler et al., 2005). Furthermore, in speech-in-noise paradigms, children with language learning disabilities, have shown difficulty with speech-in-noise paradigms behaviorally (Bradlow et al., 2003), electrophysiologically at the level of brainstem (Cunningham et al., 2001; Russo et al., 2005) and in the cortex (Warrier et al., 2004; Wible et al., 2005). Phonological processing deficits have been shown in both populations with SLI (Fidler et al., 2011) and in those with language learning disabilities (Richman, 1983). Taken together, this suggests that speech-in-noise perceptual differences do exist between individuals with different reading ability.
Speech-in-noise perceptual differences in dyslexia could be occurring due to a failure in noise exclusion or a more general lack of speech restoration strength. “Failure in auditory noise exclusion” refers to the hypothesis that speech and noise are weighted the same to the dyslexic auditory system and that speech has no special value (Sperling et al., 2005). An alternative hypothesis is that individuals lack sufficient “phonemic restoration strength.” By this we mean that individuals with dyslexia may process speech normally, but a lack of robustness in the dyslexic auditory system may lead to perceptual degradation in the speech signal. This would lead to difficulties in filling in gaps in the acoustic signal when portions of that signal are occluded by noise. In the current investigation, we ask whether individual differences in reading skill are related to differences in phoneme restoration.
Phonemic restoration is an auditory illusion that requires the integration of bottom-up information from the acoustic signal, seamlessly coordinated with top-down lexical status expectations generated by the listener's prior knowledge. Warren (1970) first demonstrated this phenomenon when he showed that if a sound, such as a cough or tone, replaces a speech sound, listeners believe they hear the missing sound or phoneme. This illusory percept of the missing phoneme is referred to as phoneme restoration. However, if silence replaces a speech sound, the interval is detected, the listener notes the interruption of the word and phonemic restoration does not occur; this is known as a failure to restore (Warren, 1970; Warren and Obusek, 1971; Repp, 1991). The phonemic restoration experimental paradigm allows unique insight into the perceptual mechanism of verbal word recognition.
Evidence that phonemic restoration relies on bottom-up information from the acoustic signal comes from the initial phonemic restoration experiment where Warren (1970) demonstrated that a failure to restore occurred if silence replaced a speech sound, suggesting that some acoustic signal must fill the gap in order for an illusory phoneme to be restored. The strength of restoration effects are conditioned by the nature of the occluding stimulus. In particular, evidence from Bashford et al. (1996) has shown if the replacement non-speech sound has a matching amplitude envelope to the sound replaced, restoration increases. Increased restoration through psychoacoustic correspondence of the replacement sound to the sound replaced suggests that perhaps bottom up evidence is strengthened due to mechanisms in the speech perception system normally employed to correct for errors in speech (Frisch and Wright, 2002) or possibly utilized in acoustic variability across speakers (Warren and Obusek, 1971; Layton, 1975; Bashford and Warren, 1979, 1987; Samuel, 1981b; Verschuure and Brocaar, 1983; Bashford et al., 1992). Taken together, these studies provide clear evidence that bottom-up acoustics of the speech signal play a significant role in phonemic restoration.
Evidence for the effects of top-down information on phoneme restoration comes primarily from paradigms investigating words and pseudowords. Restoration is thought to be guided by top-down influences from the listener's lexical knowledge (Samuel, 1987). Restoration occurs more quickly and frequently in tokens with a lexical status (e.g., words) (Samuel, 1981a, 1996). This suggests that lexical knowledge increases the strength of the illusory phoneme. Lexical effects were further tested through word length, with longer words increasing restoration (Samuel, 1981a, 1996; Bashford et al., 1988). Additional evidence suggests that restoration is stronger for phonemes that occur later in words (Marslen-Wilson and Welsh, 1978). This has been posited to be due to the strong expectations regarding the identity of the missing phoneme facilitate restoration.
The phoneme restoration paradigm allows for the examination of both top-down and bottom-up aspects of speech-in-noise perception. Utilizing this paradigm in dyslexia will provide unique insight into a possible speech-in-noise deficit in dyslexia. Prior evidence suggests that phonemic restoration requires that the listener have both intact bottom-up acoustic processing as well as top-down information from the lexicon to perform restoration for real words. At a pseudoword condition level, lexical information is available from the lexical neighborhood that is, activation of the lexical neighborhood may provide enough information to allow the listener to restore the missing phoneme (Samuel, 1987); however, pseudowords do not have a lexical entry. The lexical information available for real words will be stronger and more specific than for pseudowords (Samuel, 1996). Pseudoword phoneme restoration requires intact (bottom-up) processing of the acoustic details of the speech signal as well as access to a more abstract top-down representation of the lexical neighborhood. In the context of isolated speech sounds, that is, speech sounds excised from words, listeners are required to have an intact acoustic representation of a speech sound, in this case the /s/ fricative. Prior results from Samuel (1981a) with an ecologically valid sample demonstrate that phonemic restoration effects are strongest in words compared to pseudowords, and weakest with individual speech sound segments that have been excised from words. This gradation in the strength of the restoration effect presumably results from the variable degree of top-down information that is available in words vs. pseudowords vs. speech sound segments.
Traditional speech paradigms have reported mixed results in individuals with dyslexia. Children with dyslexia show a deficit in speech perception, specifically in noise (Ziegler et al., 2009). Prior work has shown that only a small portion of dyslexic individuals show degraded speech perception (Manis et al., 1997; Adlard and Hazan, 1998). One advantage of the phonemic restoration paradigm is that it does not rely on a failure to perform the task as evidence to support a hypothesis. In essence, the typical pattern of restoration does reflect a “failure” to detect the absence of the speech sound. As such, we aim to determine whether performance on restoration tasks is similar across participants on a spectrum of good-to-dyslexic readers.
Here we aim to address the relationship between reading ability and phonemic restoration. One possibility is that better readers will be better at detecting the interruption in speech (restoration will not occur). This outcome would suggest that good readers are not fooled by the illusion of restoration and instead have very clear representations of individual speech sounds, that is, their bottom-up processing is high fidelity. An alternative possibility is that better readers will be worse at detecting the interruption in speech (restoration will occur). This would suggest that good readers are better able to adapt to deviations in the pronunciation of speech sounds, possibly in order to handle individual variability. It is furthermore possible that a specific aspect of reading ability, such as phonological processing or comprehension, is more closely tied to phonemic restoration than measures of overall reading ability.
Materials and Methods
College students (n = 53; male = 18) were recruited from the University of Connecticut and screened with approval from the Office of Research Compliance. Participants received either extra course credits or payment for participation. Participants were of typical college age (Mean Age = 19.74, Standard Error (SE) = 0.31), were right-handed based on questionnaire responses adapted from the Edinburgh Handedness Inventory; (Oldfield, 1971), and were monolingual native speakers of American English. According to self-report, these participants had no neuropsychological conditions, less than 1 year of musical instruction, normal hearing, and full term births. Participants had no immediate family members with diagnosed developmental disorders and were taking no prescribed medication other than birth control at the time of study participation.
Standardized Behavioral Testing
Given that previous studies have suggested that dyslexic populations vary on degree of impairment (Torgesen, 2005), we attempted to collect phonemic restoration data across a range of reading ability and disability (see Table 1). Standardized measures of cognitive, reading and reading-associated abilities were administered. Participants were assessed for cognitive ability based on performance intelligence quotient from the Wechsler Abbreviated Scale of Intelligence, 3rd Ed., WASI-3; (Wechsler, 1999) as well as working memory from the Wechsler Adult Intelligence Scale, WAIS-IV; (Wechsler, 2008). Participants were also administered a reading battery including reading comprehension (“Passage Comprehension”; Woodcock Reading Mastery Tests-Third Edition, WRMT-III; (Woodcock, 2011) and phonological processing “Elision,” “Blending Words,” and “Non-word Repetition”; Comprehensive Test of Phonological Processing, CTOPP; (Wagner et al., 1999). Participants were administered standardized assessments of timed “Sight Word Efficiency” and “Phonemic Decoding Efficiency”; Test of Word Reading Efficiency, TOWRE; (Torgesen et al., 1999) and untimed measures “Word Identification” and “Word Attack”; (Woodcock Reading Mastery Tests-Third Edition, WRMT-III) of single word as well as pseudoword reading. A standardized assessment of timed sentence reading, in which a semantic judgment was made, were used to assess reading fluency “Reading Fluency”; Woodcock Johnson III, WJIII; (Woodcock et al., 2007).
Inclusion criteria were based on dyslexia literature (Katzir et al., 2008; Katzir, 2009) (see Table 2). Good readers (n = 23; male = 8) all scored above the bottom 25th percentile on all measures of timed and untimed word and pseudoword reading (Woodcock, 2011) in addition to having no self-reported history of reading difficulty. Poor readers (n = 14; male = 5) did not report any history of difficulty of reading but nonetheless scored below the 25th percentile on at least one measure of timed word or pseudoword reading. Dyslexic readers (n = 16; male = 5) self-reported either a diagnosis of dyslexia or a continuous history of reading difficulty and remediation; dyslexic readers scored below the 25th percentile on two or more measures of timed or untimed word or pseudoword reading. Importantly, relatively poor scores on measures of phonological processing may be seen even among individuals with no previous history of reading difficulty. In order to examine potential differences between individuals with a reported reading disability compared to those with no documented reading disability, participants were grouped based on standardized testing scores and self-report via questionnaire into three groups: good readers, poor readers and dyslexic readers. However, preliminary analysis (see Table 3) suggested that poor readers and dyslexic readers were performing very similarly across restoration conditions. Therefore, all group analysis shown in the result section of this paper will reflect two groups, good readers (n = 23; male = 8) and poor-to-dyslexic readers (n = 30; male = 10).
Fifty three-syllable nouns with a medial /s/ fricative as the target phoneme were recorded by a female native English speaker for the full paradigm. Ten three-syllable nouns with a medial /s/ fricative as the target phoneme were recorded by a male native English speaker to be used in the practice version of the paradigm. Stimuli were normed for age of acquisition, written and oral frequency, concreteness and imageability using the MRC Psycholinguistic Database (http://websites.psychology.uwa.edu.au/school/MRCDatabase/uwa_mrc.htm). Three-syllable pseudowords with a medial /s/ fricative were created and recorded. Pseudowords were created by replacing two phonemes in each of the real words that were used for recording. Substituted phonemes were from a similar phonological class (e.g., vowels were replaced with vowels).
Recordings were altered using Pratt (http://www.fon.hum.uva.nl/praat/). Two versions of each stimulus were created: “Replaced” and “Added” stimuli. For both types of stimuli, the boundaries of the /s/ was located within the waveforms. These /s/ segments varied in duration between 100–140 ms long. Next, white noise was created in Pratt using the Random Gauss function. The formula used was randomGauss (0, 0.25) with the amplitude and duration of white noise matched individually to each stimulus's /s/ fricative. “Replaced” stimuli were created by entirely replacing the /s/ with amplitude- and duration-matched white noise. Thus, the amplitude and duration of the medial segment of the “Replaced” stimuli (e.g., white noise) was matched to the amplitude and duration of the previous /s/ fricative in each individual stimuli. “Replaced” stimuli are stimuli in which the word is missing the medial /s/ and hearing white noise (e.g., the stimulus is interrupted). “Added” stimuli were created by blending (averaging together) the /s/ segment with its amplitude and duration matched white noise, using waveform averaging in Pratt, then inserting this blend back into the word or pseudoword in the medial /s/'s previous location. Thus, the amplitude and duration of the medial segment of the “Added” stimuli was a function of the amplitude and duration of the previous /s/ fricative in each individual stimuli. “Added” stimuli are stimuli in which the word is intact but the medial /s/ includes white noise (e.g., the stimulus is intact, with added white noise). The speech sound segment condition consisted of either one of the white noise segments created for insertion into the “Replaced” condition; or one of the white noise + /s/ segments created for insertion into the “Added” condition, as described above. The speech sound mixed with noise constituted the “Added” stimuli. While white noise alone made up the “Replaced” stimuli.
A two-alternative forced choice task was used (refer to Table 3 for d′, Beta and Miss Rate). In the word and pseudoword conditions, subjects heard a single stimulus at a time. Subjects were instructed for each token to determine if noise was replacing part of the word, “Replaced condition,” or if the noise coincided with part of word, “Added” condition (Samuel, 1981a). The word and pseudoword conditions occurred during the same block, and were pseudorandomized to prevent the added and replaced condition of individual stimuli from occurring sequentially. Subjects were not explicitly told that the paradigm consisted or words and pseudowords, nor that medial /s/ was the target phoneme. The speech sound segment condition always followed the word/pseudoword block. In the speech sound segment condition, subjects were asked to determine if they were hearing noise by itself or noise mixed with a speech sound (Samuel, 1981a). Subjects were not told that the speech sound was an /s/. A single block of speech sound segments followed the word and pseudoword block for every subject. Participants practiced the task with feedback in a randomized block consisting of 10 words and 10 pseudowords. Prior to the speech sound segment test block, participants practiced 10 speech sound segment trials. In all trials subjects had 4 s to answer per item, after which the trial timed out. These time-outs constituted no more than five trials within each block. No subjects were excluded based on missed trials.
Following other studies of phonemic restoration (Sherman, 1971; Warren and Obusek, 1971; Warren and Sherman, 1974; Samuel, 1981a,b; Samuel and Ressler, 1986; Samuel, 1987, 1991), d′ was used as sensitivity measure of restoration (Macmillan and Creelman, 2004). In this experiment, d′ was calculated using the following formula: d′ = z(H)−z(F). Where “H” describes the hit rate (that is, proportion of “Noise Coincided” responses for the “Added” condition) and “F” describes the number of false alarms (that is, proportion of erroneous “Noise Coincided” responses for the “Replaced” condition) (refer to Table 3 for d′, Beta and Miss Rate).
First, paired t-test were used to confirm that our experiment showed the same pattern of restoration that has been previously reported (Samuel, 1981a); subjects were found to have stronger phonemic restoration for words than pseudowords, and greater restoration for both words and pseudowords compared to speech sound segments. Second, a correlation analysis was performed to determine if a relationship existed between reading ability and phonemic restoration and the magnitude of that relationship. Last, a multivariate analysis of variance (MANOVA) statistic was chosen to allow for the examination of the interaction between reading groups (good and poor-to-dyslexic readers) and all levels of restoration (word, pseudoword and speech sound segment conditions) while avoiding increasing the risk of an inflated Type I error. Preliminary statistical analysis confirmed a relationship between restoration conditions, but also demonstrated that the magnitude of these relationships was not uniform thus making the results ideally suited for MANOVA analysis (see Keppel and Wickens, 2004; Huberty and Olejnik, 2006).
It is worth reiterating that a low d′ score indicates that the two versions of stimuli, added and replaced, are not discriminable; the stimuli are perceived as alike because the missing phoneme signal is being restored in the replaced version. A low d′ score indicates high restoration. A high d′ score indicates that the critical segments do not sound alike and restoration is not occurring. Three paired samples t-tests were used to confirm the previously found pattern of restoration (Samuel, 1981a). A significant difference was found between word [condition mean (M) = 0.613, SE = 0.037] and pseudoword (M = 1.025, SE = 0.049) restoration t(52) = −7.46, p = 9.0555−10. This indicates stronger restoration for words then pseudowords. A significant difference was found between word (M = 0.613, SE = 0.037) and speech sound segment (M = 2.160, SE = 0.108) restoration t(52) = −14.322, p = 4.189−13. A significant difference was also found between pseudoword (M = 1.025, SE = 0.049) and speech sound segment (M = 2.160, SE = 0.108) restoration t(52) = −9.599, p = 1.101−19. This indicates stronger restoration for both words and pseudowords compared to speech sound segments (see Table 3, bottom row). All comparisons were significant according to Bonferroni correction.
Relationship of Phonemic Restoration to Reading Ability
In Samuel (1981a), a single ecologically valid sample was used in which no reading assessments were used to distinguish between subjects. In the current study, participants were intentionally recruited across a range of reading ability based on standardized reading assessment scores. These scores indicate that our subject sample ranged from good-to-dyslexic readers (see Table 1). While there are many papers positing subtypes of dyslexia, here we aimed only to use dyslexic participants with a phonological awareness deficit. Individuals with additional/exclusive rapid naming deficits were excluded based on assessment scores (Wolf and Denckla, 2005). We ran a Pearson correlation in order to determine if a relationship existed between phonemic restoration and skills commonly associated with reading ability (see Table 1 for all nine subtests that were run for possible correlations). False discovery rate (FDR) was used to correct for multiple comparisons. Among these subtests, there was a positive correlation between d′ scores in the words condition and the standardized assessment scores on subtests of the CTOPP Blending Words r = 0.3, p = 0.03, and Non-word Repetition r = 0.35, p = 0.009 (see Figures 1A,B). Each of these subtests is used to assess phonological processing based on two components: phonological awareness and phonological memory. Given that high d′ scores indicate less restoration, this result shows that better performance on standardized assessments of phonological processing is correlated with less phoneme restoration in the word condition. There were no correlations between the restoration effect for pseudowords and any of the behavioral standardized reading assessments. There was a positive correlation between d′ for speech sound segments and the standardized assessment score for CTOPP Blending Words r = 0.3, p = 0.03, (see Figure 1C) indicating that better performance on this phonological processing subtest correlated with less restoration within the speech sound segment condition. Taken together this correlation seems to suggest that better readers, that is, those with high phonological processing skills, are less likely to show the restoration effects in words and speech sound segments.
Figure 1. Blue indicates good readers. Red indicates poor-to-dyslexic readers. Correlations show the relationship between phonemic restoration and subtests of the Comprehensive Test of Phonological Processing (CTOPP). A high d′ score (y-axis) indicates that the critical segments are not perceived as alike and restoration does not occur. As such, the positive correlation reflects that better performance on the CTOPP subtest is associated with less restoration. Significant positive correlations were found between (A) Non-word Repetition subtest and word restoration (B) Blending Words subtest and word restoration (C) Blending Words subtest and speech sound segment restoration.
Difference between Good Readers and Poor-to-Dyslexic Readers
Only one particular aspect of reading, phonological processing, appears to be correlated with phonemic restoration (at the word and speech sound segment condition levels). Given our preliminary findings (see Table 3) suggesting no differences between poor readers and dyslexic readers on measures on phonemic restoration, for the purposes of group analysis we will only compare good readers (n = 23; male = 8) and poor-to-dyslexic readers (n = 30; male = 10). While lexical representations enable native listeners to restore phonemes within real words, the pseudoword and speech sound segment conditions must rely more heavily on bottom-up acoustic information. We predict those with lower phonological awareness may present with additional difficulties in restoring pseudowords and speech sound segments.
A MANOVA was conducted, with the three d′ measures of restoration (words, pseudowords and speech sound segments) as dependent variables, and reading group membership (good readers and poor-to-dyslexic readers) as the independent variable. Assumptions of homogeneity of the variance-covariance matrices as well as the assumption of equality of variance were met. A statistically significant difference found between good readers and poor-to-dyslexic readers on the combined measures of d′ restoration (2 Reading Groups*3 Condition Levels of Restoration) F(3, 49) = 3.392, p = 0.025; Pillai's Trace = 0.172; partial eta squared = 0.172. Within groups, differences were again found between individual measures of restoration. Within groups (good readers and poor-to-dyslexic readers) differences on individual measures of restoration (words, pseudowords and speech sound segments) were investigated using a Bonferroni adjusted alpha level of 0.017. A significant difference was found for restoration of words, pseudowords and speech sound segments F(3, 49) = 290.873 p = 3.315−31; Pillai's Trace = 0.947; partial eta squared = 0.947 (see Figure 2A). This again replicates previous results from Samuel (1981a) on an ecological sample. Here we again show that greater restoration is found for words (M = 0.613, SE = 0.037) than pseudowords (M = 1.025, SE = 0.049), and greater restoration for both words and pseudowords compared to speech sound segments (M = 2.160, SE = 0.108).
Figure 2. Mean phonemic restoration performance by reading group: good readers and poor-to-dyslexic readers. Low d′ scores indicate more susceptibility to the restoration illusion. (A) Amount of restoration is shown as a measure of d′ for good readers and poor-to-dyslexic readers across the word (W), pseudoword (P) and speech sound segment (S) conditions. Stronger restoration was found for words compared to pseudowords, words compared to speech sound segments and pseudowords compared to speech sound segments. (B) A main effect of group within the segments condition was found. Using a post-hoc Bonferroni adjusted alpha, a significant difference was found between good and poor-to-dyslexic readers ability to restore speech sound segments at p = 0.01. Error bars indicate standard error of the mean (s.e.m.). Asterisks indicate *p < 0.05, **p < 0.01, ***p < 0.001.
Between groups (good readers and poor-to-dyslexic readers) differences on individual measures of restoration (words, pseudowords and speech sound segments) were investigated using a Bonferroni adjusted alpha level of 0.017. No significant effect of group was found within the words and pseudowords conditions, suggesting that the group by condition interaction is largely driven by the performance of the good and dyslexic readers on the speech sound segment condition. The only measure of restoration to reach significance was the restoration of speech sound segments F(1, 52) = 7.074, p = 0.010, partial eta squared = 0.122. Given that the assumption of equality of variance was met, a Bonferroni adjusted alpha of 0.025 was used for pairwise group mean comparison. A significant difference in speech sound segment restoration p = 0.010 was found, with a mean increase (M = 0.551, SE = 0.207, 95% CI: 0.135 and 0.967) in speech sound segment restoration for good readers (M = 2.472, SE = 0.159) compared with poor-to-dyslexic readers (M = 1.921, SE = 0.134) (see Figure 2B). The increase in good reader d′ for speech sound segment restoration indicates that good readers are less susceptible to the speech sound segment restoration illusion.
Across a diverse population that included good readers, poor readers, and individuals with dyslexia, restoration was found to occur less in those who show high measures of phonological processing on standardized assessments. That is to say, individuals with better phonological abilities may have greater reliability in their low-level acoustic-phonetic representations for these sounds, and are less “fooled” by the substitution of white noise for speech sounds. Across a continuum of abilities, individuals with less intact phonological abilities may rely more on lexical-semantic information, as suggested by prior evidence from studies of individuals with phonological deficits in dyslexia (Frith and Snowling, 1983). This pattern suggests that the acoustic-phonetic processing or retention of acoustic-phonetic detail is not as reliable in both poor and dyslexic readers.
Although the current findings suggest that there may be subtle deficits at the single segmental level in poor readers and individuals with dyslexia, it is noteworthy the types of tasks that are used to characterize the poor reading and dyslexic population are tasks that require subjects to combine sounds in childhood at the level of phonological processing and as adults at the whole word or pseudoword level. In a longitudinal study that began with training of individual speech sounds in kindergarteners at risk for reading failure, trained students outperformed their at-risk untrained peers by seventh grade; however, trained at-risk children were still grades below an untrained control group of typical readers (Elbro and Petersen, 2004). This begs the question: are phonological awareness impairments a deficit in phonological processing, which includes manipulating, subtracting and combining speech sounds or are phonological awareness impairments a symptom of a lower level phonetic failure? We offer two explanations of why poor phonological awareness could lead to differences in phonemic restoration.
Perceptual Stability Deficit: Account of Dyslexic Impairment
A first possibility is that those with more difficulty in phonological awareness show a failure of perceptual tuning stability. The ability to resolve fine-grained acoustic details relies upon having intact, stable representations of speech sounds. In particular, it may be the case that the “tuning curves” for phonetic categories for those with poor phonological awareness are shallower and wider than in the typical population. Unstable representations of speech at the individual sound level would result in poor speech sound boundaries, which may result in non-speech sounds (noise) being mistaken for speech sounds. Without a stable representation of speech at the individual speech sound level, those with phonological awareness deficits or lower phonological awareness ability may already be taxing their perceptual system to map incoming speech sounds into distinct categories, even without the interference of noise.
Phonemic restoration is thought to tap into important systems for typical adult speech perception. Specifically, phonemic restoration has been suggested to be a byproduct of the intact adult perceptual systems ability to handle mispronunciation, talker variability and accented speech (Frisch and Wright, 2002). Phonemic restoration has been shown to be persistent enough in the intact adult perception system that it can produce perceptual effects generally thought to be uniquely associated with “real” phonemes. In a selective adaptation design, along a voice-onset-time (VOT) continuum, subjects showed a continuum shift for restored phonemes similar to that seen for real phonemes (Samuel, 1997). Restoration has also been shown to be robust enough to compensate for co-articulation (Elman and McClelland, 1988) and to shift perception of vowel quality (Ohala and Feder, 1994). Evidence from imaging studies further suggests that restoration results in “real” sounds. A functional Magnetic Resonance Imaging (fMRI) study of tone restoration showed activation in Heschl's gyrus for both real tones as well as perceived continuity of imagined tones in noise (Riecke et al., 2007). Furthermore, extracellular recordings from macaque monkeys found that illusory tones in noise, like real tones in noise, elicit A1 single-neuron response in the primary auditory cortex (Petkov et al., 2003). Given previous evidence one could suggest that restoration is persistent enough to act as a replacement for auditory stimuli.
Shahin et al. (2009) suggested that anatomical areas activated for the restoration (illusion) which included Heschl's gyrus, the left posterior angular gyrus (AG), bilateral superior temporal sulcus (STS), and superior frontal sulcus (SFS) are distinct from regions specific to repair in restoration (illusion > illusion failure), which included Broca's area, the pars opercularis, and bilateral anterior insula. Only unconscious restoration illusion is robust enough to elicit the same neuronal pathway that is activated for natural speech perception, while noted degradation (e.g., repair in restoration: illusion > illusion failure) to the stimuli being restored elicits a different neuronal network. As in the Shahin study, participants in the present study showed a mix of responses such that sometimes they did not detect the absence of the phoneme (illusion) and sometimes did detect this absence (illusion failure). The repair network that is utilized in the restoration of speech, the network recognized by the listener as degraded (speech mixed with noise), is made up of a network utilized in the acquisition of unfamiliar auditory inputs (Myers and Swan, 2012) and in auditory expertise (Zatorre et al., 1992). Taken together this further suggests that that phonemic restoration is tapping into the ability of the perceptual system to handle variability (Frisch and Wright, 2002). As such, we speculate that over-restoration of individual speech-segments could potentially impact speech perception in the real world, not simply the lab.
Current results are consistent with the suggestion that deficits in phonological awareness seen primarily in individuals with dyslexia are at least in part a consequence of a failure at the phonetic level (Morais et al., 1987). Additionally (or alternatively), deficits at the phonetic level may be due to a weaker top-down compensatory mechanisms (Boets et al., 2013). Speech perception requires that listeners map the complex acoustic (bottom-up) signal of speech onto speech sound (i.e., phonetic) categories. To accomplish this, the listener must perceive fine-grained details of the acoustic (bottom-up) signal in order to extract the temporal and spectral information and connect that information to a known phonetic category using top-down information. The categorical perception paradigm allows for the examination of phonetic perception stability. The acoustic differences within a speech category (e.g., two different examples of /d/) are difficult to distinguish, whereas acoustic differences that result in a change in category (e.g., a /d/ and a /b/) are very easy to distinguish (Liberman et al., 1957). Categorical perception has been explored in children with reading difficulty in French (Joanisse et al., 2000), Chinese (Liu et al., 2009), Dutch (Maassen et al., 2001), and English (Werker and Tees, 1987). Results from studies of categorical perception in children report mixed findings. Although some studies report less-categorical perception in dyslexic children (Werker and Tees, 1987; De Weirdt, 1988; Maassen et al., 2001; Bogliotti et al., 2008; Liu et al., 2009), studies have also shown dyslexic children with normal performance on categorical perception (Brandt and Rosen, 1980; Manis et al., 1997; Joanisse et al., 2000). These divergent findings may result from differences in criteria for dyslexia used in the study. Noordenbos et al. (2012) provided initial evidence suggesting that differences in categorical perception in dyslexic children may increase with age, such that as children mature, they become even more different from their peers. Such a maturational effect may partially explain the contradictory results reported above, due to age variance across studies. It is worth reiterating that, as a group, individuals with dyslexia performed similar to controls on restoration in the context of words and pseudowords, although individual differences in restoration across the population were correlated with standardized measures of phonological processing. This finding may reflect preserved access to top-down information from lexical status. In an adult study of categorical perception, Ruff et al. (2003) showed that French dyslexic individuals did not show typical patterns of brain activation for phonetic changes in stimuli (between category > within category) in the left angular gyrus, right inferior frontal gyrus, and the right cingulum. This is consistent with evidence that dyslexic children are able to capitalize on top-down lexical status to perform categorical perception tasks (Reed, 1989; Serniclaes et al., 2001, 2004; Bogliotti et al., 2008).
Inappropriate Perceptual Auditory Plasticity: A Second Account of Dyslexic Impairment
A second or additional possibility is that those with phonological difficulty show inappropriate auditory plasticity. Degraded or impaired phonological processing may reflect a neural architecture that is overly plastic and, as a consequence shows so much accommodation of new sounds that non-speech sounds are inappropriately assimilated into the phonetic category. Prior evidence also suggests that when phonetic units of speech are acoustically exaggerated, infants (Liu et al., 2003) and children with dyslexia (Tallal et al., 1996) are better able to demonstrate phonetic perception. Evidence from studies of perceptual learning paradigms in both animals and humans seems to suggest that perceptual learning is associated with a high degree of experience-dependent plasticity (Sisneros et al., 2004; Golestani et al., 2011; Stein et al., 2012). We speculate that individuals with dyslexia may be showing delayed maturation effects; dyslexic individuals may be unable to exit a critical period in which no “neuronal commitment” has been made to their native language (see Kuhl and Rivera-Gaxiola, 2008). Therefore, an inability to separate speech and noise in a phonetically impaired system may be the result of a deficit in separating noise from a phonetically plausible acoustic signal. A speech system that has remained plastic for prolonged period of time, without a commitment to distinctive speech sound categories, may be willing to accept white noise as a new speech sound.
In this paper we provide evidence that a phonetic impairment does exist in dyslexia, in particular that single speech sounds embedded in noise are likely to be confused with the background noise. At the same time, we show that individuals with dyslexia are able to capitalize on lexical knowledge and phonotactic information to overcome hypothesized perceptual difficulties at the speech sound segment level. Recent evidence suggests that phonetic information is processed in a typical way in the STG but less accessible in individuals with dyslexia due to a degraded connection between the STG and the IFG (Boets et al., 2013). In contrast, our results indicate that, at least for phoneme restoration, the use of top-down information from the lexicon is indistinguishable from that seen in typical adults. Future remediation studies in dyslexia should focus on strengthening acoustic representations at the phonetic level, while also strengthening top-down connection through increasing the amount of perceptual noise while training children on basic reading skills.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This work was supported by NIH R03 DC009495 to Brown University (Emily B. Myers, PI) and by NIH P30 DC010751 (D. Lillo-Martin, PI). The content is the responsibility of the authors and does not necessarily represent official views of the NIH or NIDCD. The authors thank Kenneth R. Pugh, Stephen J. Frost, Alex P. Demos, F. Sayako Earle, and Alexis R. Johns for valuable feedback on previous drafts of this manuscript.
Bashford, J. A., Meyers, M. D., Brubaker, B. S., and Warren, R. M. (1988). Illusory continuity of interrupted speech: speech rate determines durational limits. J. Acoust. Soc. Am. 84, 1635. doi: 10.1121/1.397178
Boets, B., De Beeck, H. P. O., Vandermosten, M., Scott, S. K., Gillebert, C. R., Mantini, D., et al. (2013). Intact but less accessible phonetic representations in adults with dyslexia. Science 342, 1251–1254. doi: 10.1126/science.1244333
Bogliotti, C., Serniclaes, W., Messaoud-Galusi, S., and Sprenger-Charolles, L. (2008). Discrimination of speech sounds by children with dyslexia: comparisons with chronological age and reading level controls. J. Exp. Child Psychol. 101, 137–155. doi: 10.1016/j.jecp.2008.03.006
Bradlow, A. R., Kraus, N., and Hayes, E. (2003). Speaking clearly for children with learning disabilities: sentence perception in noise. J. Speech Lang. Hear. Res. 46, 80. doi: 10.1044/1092-4388(2003/007)
Brandt, J., and Rosen, J. J. (1980). Auditory phonemic perception in dyslexia: categorical identification and discrimination of stop consonants. Brain Lang. 9, 324–337. doi: 10.1016/0093-934X(80)90152-2
Cunningham, J., Nicol, T., Zecker, S. G., Bradlow, A., and Kraus, N. (2001). Neurobiologic responses to speech in noise in children with learning problems: deficits and strategies for improvement. Clin. Neurophysiol. 112, 758–767. doi: 10.1016/S1388-2457(01)00465-5
Elbro, C., and Petersen, D. K. (2004). Long-term effects of phoneme awareness and letter sound training: an intervention study with children at risk for dyslexia. J. Educ. Psychol. 96:660. doi: 10.1037/0022-06184.108.40.2060
Elman, J. L., and McClelland, J. L. (1988). Cognitive penetration of the mechanisms of perception: compensation for coarticulation of lexically restored phonemes. J. Mem. Lang. 27, 143–165. doi: 10.1016/0749-596X(88)90071-X
Golestani, N., Price, C. J., and Scott, S. K. (2011). Born with an ear for dialects? Structural plasticity in the expert phonetician brain. J. Neurosci. 31, 4213–4220. doi: 10.1523/JNEUROSCI.3891-10.2011
Goswami, U., Thomson, J., Richardson, U., Stainthorp, R., Hughes, D., Rosen, S., et al. (2002). Amplitude envelope onsets and developmental dyslexia: a new hypothesis. Proc. Natl. Acad. Sci. U.S.A. 99, 10911–10916. doi: 10.1073/pnas.122368599
Joanisse, M. F., Manis, F. R., Keating, P., and Seidenberg, M. S. (2000). Language deficits in dyslexic children: speech perception, phonology, and morphology. J. Exp. Child Psychol. 77, 30–60. doi: 10.1006/jecp.1999.2553
Katzir, T., Kim, Y.-S., Wolf, M., Morris, R., and Lovett, M. W. (2008). The varieties of pathways to dysfluent reading comparing subtypes of children with dyslexia at letter, word, and connected text levels of reading. J. Learn. Disabil. 41, 47–66. doi: 10.1177/0022219407311325
Liberman, I. Y., Shankweiler, D. P., and Liberman, A. M. (1989). “The alphabetic principle and learning to read,” in Phonology and Reading Disability: Solving the Reading Puzzle, eds D. P. Shankweiler and A. M. Liberman (Ann Arbor, MI: University of Michigan Press), 1–33.
Maassen, B., Groenen, P., Crul, T., Assman-Hulsmans, C., and Gabreëls, F. (2001). Identification and discrimination of voicing and place-of-articulation in developmental dyslexia. Clin. Linguist. Phon. 15, 319–339. doi: 10.1080/02699200010026102
Manis, F. R., McBride-Chang, C., Seidenberg, M. S., Keating, P., Doi, L. M., Munson, B., et al. (1997). Are speech perception deficits associated with developmental dyslexia? J. Exp. Child Psychol. 66, 211–235. doi: 10.1006/jecp.1997.2383
Noordenbos, M., Segers, E., Serniclaes, W., Mitterer, H., and Verhoeven, L. (2012). Allophonic mode of speech perception in Dutch children at risk for dyslexia: a longitudinal study. Res. Dev. Disabil. 33, 1469–1483. doi: 10.1016/j.ridd.2012.03.021
Petkov, C. I., O'Connor, K. N., and Sutter, M. L. (2003). Illusory sound perception in macaque monkeys. J. Neurosci. 23, 9155–9161. Available online at: http://www.jneurosci.org/content/23/27/9155.short
Riecke, L., Van Opstal, A. J., Goebel, R., and Formisano, E. (2007). Hearing illusory sounds in noise: sensory-perceptual transformations in primary auditory cortex. J. Neurosci. 27, 12684–12689. doi: 10.1523/JNEUROSCI.2713-07.2007
Ruff, S., Marie, N., Celsis, P., Cardebat, D., and Démonet, J.-F. (2003). Neural substrates of impaired categorical perception of phonemes in adult dyslexics: an fMRI study. Brain Cogn. 53, 331–334. doi: 10.1016/S0278-2626(03)00137-4
Russo, N. M., Nicol, T. G., Zecker, S. G., Hayes, E. A., and Kraus, N. (2005). Auditory training improves neural timing in the human brainstem. Behav. Brain Res. 156, 95–103. doi: 10.1016/j.bbr.2004.05.012
Samuel, A. G., and Ressler, W. H. (1986). Attention within auditory word perception: insights from the phonemic restoration illusion. J. Exp. Psychol. Hum. Percept. Perform. 12:70. doi: 10.1037/0096-15220.127.116.11
Serniclaes, W., Heghe, S. V., Mousty, P., Carré, R., and Sprenger-Charolles, L. (2004). Allophonic mode of speech perception in dyslexia. J. Exp. Child Psychol. 87, 336–361. doi: 10.1016/j.jecp.2004.02.001
Serniclaes, W., Sprenger-Charolles, L., Carre, R., and Demonet, J.-F. (2001). Perceptual discrimination of speech sounds in developmental dyslexia. J. Speech Lang. Hear. Res. 44, 384. doi: 10.1044/1092-4388(2001/032)
Sisneros, J. A., Forlano, P. M., Deitcher, D. L., and Bass, A. H. (2004). Steroid-dependent auditory plasticity leads to adaptive coupling of sender and receiver. Science 305, 404–407. doi: 10.1126/science.1097218
Stein, M., Federspiel, A., Koenig, T., Wirth, M., Strik, W., Wiest, R., et al. (2012). Structural plasticity in the language system related to increased second language proficiency. Cortex 48, 458–465. doi: 10.1016/j.cortex.2010.10.007
Tallal, P., Miller, S. L., Bedi, G., Byma, G., Wang, X., Nagarajan, S. S., et al. (1996). Language comprehension in language-learning impaired children improved with acoustically modified speech. Science 271, 81–84. doi: 10.1126/science.271.5245.81
Warrier, C. M., Johnson, K. L., Hayes, E. A., Nicol, T., and Kraus, N. (2004). Learning impaired children exhibit timing deficits and training-related improvements in auditory cortical responses to speech in noise. Exp. Brain Res. 157, 431–441. doi: 10.1007/s00221-004-1857-6
Ziegler, J. C., Pech-Georgel, C., George, F., Alario, F.-X., and Lorenzi, C. (2005). Deficits in speech perception predict language learning impairment. Proc. Natl. Acad. Sci. U.S.A. 102, 14110–14115. doi: 10.1073/pnas.0504446102
Keywords: dyslexia, phonemic restoration, specific reading disability, speech perception, phonological awareness, phonological processing, categorical perception, phonetics
Citation: Del Tufo SN and Myers EB (2014) Phonemic restoration in developmental dyslexia. Front. Neurosci. 8:134. doi: 10.3389/fnins.2014.00134
Received: 04 March 2014; Accepted: 14 May 2014;
Published online: 04 June 2014.
Edited by:Marc Schönwiesner, University of Montreal, Canada
Reviewed by:Michel Hoen, Université Claude Bernard Lyon 1, France
Nicolas Langer, Child Mind Institute, USA
Copyright © 2014 Del Tufo and Myers. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Stephanie N. Del Tufo, Department of Psychology, University of Connecticut, 406 Babbidge Road, Unit 1020, Storrs, CT 06269, USA e-mail: firstname.lastname@example.org