Original Research ARTICLE
Influence of Visual Deprivation on Auditory Spectral Resolution, Temporal Resolution, and Speech Perception
- 1Department of Otorhinolaryngology-Head and Neck Surgery, Eulji Medical Center, Eulji University School of Medicine, Seoul, South Korea
- 2Department of Psychology, Duksung Women’s University, Seoul, South Korea
- 3Division of ENT, Sleep Disordered Breathing, Respiratory, and Anesthesia, Office of Product Evaluation and Quality, Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, MD, United States
We evaluated whether blind subjects have advantages in auditory spectral resolution, temporal resolution, and speech perception in noise compared with sighted subjects. We also compared psychoacoustic performance between early blind (EB) subjects and late blind (LB) subjects. Nineteen EB subjects, 16 LB subjects, and 20 sighted individuals were enrolled. All subjects were right-handed with normal and symmetric hearing thresholds and without cognitive impairments. Three psychoacoustic measurements of the subjects’ right ears were performed via an inserted earphone to determine spectral-ripple discrimination (SRD), temporal modulation detection (TMD), and speech recognition threshold (SRT) in noisy conditions. Acoustic change complex (ACC) responses were recorded during passive listening to standard ripple-inverted ripple stimuli. EB subjects exhibited better SRD than did LB (p = 0.020) and sighted (p = 0.003) subjects. TMD was better in EB (p < 0.001) and LB (p = 0.007) subjects compared with sighted subjects. SRD was positively correlated with the duration of blindness (r = 0.386, p = 0.024). Acoustic change complex data for ripple noise change at the Cz and Fz electrodes showed trends toward significant correlations with the behavioral results. In conclusion, compared with sighted subjects, EB subjects showed advantages in terms of auditory spectral and temporal resolution, while LB subjects showed an advantage in temporal resolution exclusively. These findings suggest that it might take longer for auditory spectral resolution to functionally enhance following visual deprivation compared to temporal resolution. Alternatively, a critical period of very young age may be required for auditory spectral resolution to improve following visual deprivation.
Humans commonly perceive surrounding information via sensory convergence. The integration of multiple senses can reduce perceptual ambiguity, speed up reactions, and increase the accuracy of stimulus detection. Because blind subjects only use auditory signals to communicate, without lip reading, investigating their speech perception ability compared with sighted subjects is an important aspect of developing rehabilitation programs. However, the influence of visual deprivation on speech recognition in real life remains unclear. Few studies have compared speech perception without visual cues between blind and sighted subjects, and the results varied considerably (Muchnik et al., 1991; Hugdahl et al., 2004; Gougoux et al., 2009; Arnaud et al., 2018). This variability may be due to the small sample sizes, inclusion of mixtures of subjects with congenital and acquired blindness, and the inclusion of some subjects with residual vision. Therefore, in the current study, we enrolled a relatively large number of young adults with EB and LB as separate groups if they met World Health Organization blindness categories 4 and 5 (Vashist et al., 2017) to compare speech perception abilities among early blind (EB) subjects, late blind (LB) subjects, and sighted controls. Auditory spectral resolution and temporal resolution are fundamental aspects of speech perception. Therefore, investigating the influence of visual deprivation on those factors could help us to understanding the potential changes in speech processing induced by visual deprivation. Here, we evaluated three psychoacoustic performance measures (i.e., spectral resolution, temporal resolution, and speech perception in noise) in each group. Although it has been reported that compared with sighted subjects, blind subjects show better spectral and temporal resolutions, as measured by pitch discrimination or pure-tone discrimination (Gougoux et al., 2004; Rokem and Ahissar, 2009; Wan et al., 2010; Voss and Zatorre, 2012) and an auditory backward temporal masking task (Stevens and Weaver, 2005), the acoustic stimuli used in these previous studies were often band-limited and, therefore, may not reflect the temporal and spectral resolution of hearing for broad-band stimuli such as speech signals. Thus, in the current study, we used a broad-band noise carrier that is either spectrally modulated (i.e., rippled spectra noise to evaluate spectral envelope sensitivity) or temporally modulated (i.e., temporally modulated noise to evaluate amplitude envelope sensitivity).
Based on behavioral and neurophysiological findings, two different views have been proposed in the literature that attempted to explain the auditory performance of blind subjects (Pascual-Leone et al., 2005). The first view, called compensation theory, proposes that visual cortices of blind subjects may be recruited to perform auditory functions in a compensatory cross-modal manner, so blind subjects may show better performance compared to sighted subjects. Indeed, several studies have demonstrated advantages of blind subjects over sighted subjects in auditory memory (Amedi et al., 2003), pitch discrimination (Gougoux et al., 2004; Wan et al., 2010), sound localization (Röder et al., 1999; Fieger et al., 2006), dichotic listening (Hugdahl et al., 2004), and ultra-fast speech comprehension (Moos and Trouvain, 2007), supporting the compensation theory. The second view, called general-loss theory, proposes that vision is inevitable to develop certain auditory performances in particular with regards to auditory sense of space, and as a result, blind subjects are impaired in spatial tasks (Zwiers et al., 2001; Gori et al., 2014; Finocchietti et al., 2015; Cappagli et al., 2017b). The current study was designed based upon the compensation theory because the acoustic stimuli used in the current study did not present any binaural auditory cues. Therefore, our first hypothesis in the present study was that blind subjects may show better psychoacoustic performance than sighted subjects, because of the recruitment effect of visual cortices in blind subjects.
Some studies showed that EB subjects had advantages over LB subjects in terms of pitch discrimination (Gougoux et al., 2004; Wan et al., 2010), melody discrimination (Voss and Zatorre, 2012), and sound source discrimination (Voss et al., 2008), and two of those studies reported that their pitch discrimination (Gougoux et al., 2004) and sound source discrimination (Voss et al., 2008) were better in people who developed blindness earlier in life. In line with these data, other studies reported positive correlations of the duration of blindness with short-range functional connectivity density in the primary visual cortex (Qin et al., 2015) and blood oxygenation level dependent responses in the left middle occipital gyrus (Tao et al., 2015). These findings support the compensation theory and suggest that there is a critical period for cross-modal reorganization of brain and functional enhancement, or enough duration of blindness may be required for compensatory change of auditory cortex. However, other studies reported that LB subjects had better haptic orientation perception (Postma et al., 2008), auditory distance discrimination and proprioceptive reproduction (Cappagli et al., 2017a), and dynamic audio localization (Finocchietti et al., 2015) compared with EB subjects. The duration of blindness in LB subjects was negatively correlated with performance on a spatial bisection task (Amadeo et al., 2019), sound-related activity in the occipital cortex (Collignon et al., 2013; Amadeo et al., 2019), and the white matter fractional anisotropy in the corpus callosum (Wang et al., 2013). The studies referenced above support the second view (general-loss theory) that vision is the only sense to encode spatial information and it cannot be replaced by other sensory modalities. We speculated spectral resolution, temporal resolution, and speech perception without any binaural cues may support the compensation theory which has been demonstrated by numerous previous studies for sound perception such as pitch discrimination rather than the general-loss theory for spatial task. Therefore, our second hypothesis was that EB subjects may show better psychoacoustic performances compared with LB subjects.
To test both of our hypotheses, we compared the auditory spectral resolution, temporal resolution, and speech perception among EB subjects, LB subjects, and sighted controls, and determined correlations between these psychoacoustic performance measures and the duration of blindness and age at blindness onset.
The acoustic change complex (ACC) is an auditory evoked potential (P1-N1-P2) that can be elicited by the listener’s ability to detect a change in an ongoing sound in passive listening conditions (Martin and Boothroyd, 1999; Tremblay et al., 2003). ACC can be measured as a spectral change within an ongoing ripple noise stimulus (Won et al., 2011a). Therefore, in addition to the behavioral psychoacoustic experiments, we also recorded the ACC in response to standard ripple-inverted ripple stimuli to characterize the central processing of spectral information of sound in each group.
Materials and Methods
We included 19 EB subjects (28.8 ± 8.0 years, male:female [M:F] = 11:8), 16 LB subjects (33.4 ± 5.9 years, M:F = 7:9), and 20 sighted subjects (25.4 ± 3.9 years, M:F = 10:10). There were significant differences in age among the three groups [H(2) = 10.589; p = 0.005] and the Mann–Whitney post hoc analysis revealed that the sighted group was younger than the LB group (U = 48.000; z = –3.577; p < 0.001; Table 1). All of the subjects were right-handed, aged <40 years, had normal and symmetric hearing thresholds (≤20 dB hearing level at 0.25, 0.5, 1, 2, 3, 4, and 8 kHz), and had no neurological deficits or cognitive impairments. For the EB group, we recruited blind subjects who were born blind or became blind at birth or within a few days of birth. The mean duration of blindness in EB subjects was 28.8 ± 8.0 years. For the LB group, we recruited blind subjects who lost their sight at ≥9 years old and their mean age of onset was 20.8 ± 7.4 years (range 9–32 years) with a variance of 55.4 (Table 2). The mean duration of blindness in the LB subjects was 12.6 ± 6.3 years (range: 5–28 years). The duration of blindness was significantly shorter in LB subjects than in EB subjects (U = 14.500; z = –4.561; p < 0.001). The pure tone averages were not significantly different among three groups. Table 1 presents the demographic and audiologic characteristics of the three groups, and Table 2 presents the characteristics of blind subjects. This study was carried out in accordance with the Declaration of Helsinki and the recommendations of the Institutional Review Board of Eulji Medical Center with written informed consent from all subjects. The informed consent form was verbally presented to the blind subjects in the presence of impartial witnesses, and they then signed the form and a copy was given to them.
Table 1. Comparison of age, sex, and pure-tone average among early blindness group, late blindness group and sighted control group.
Behavioral Psychoacoustic Tests
Spectral-ripple discrimination (SRD), temporal modulation detection (TMD), and speech recognition threshold (SRT) in noise were assessed. The SRD test evaluated spectral resolution by measuring the ability of the participants to discriminate a reversal in the phase of a ripple shape. TMD was used to evaluate the listener’s sensitivity to the temporal envelope by discriminating modulated noise from steady noise. All tests were conducted in a sound-attenuating booth. The stimuli were presented monaurally to the right ear via an inserted earphone (Etymotic, ER-3A) at a sampling frequency of 44,100 Hz using a customized MATLAB program.
Spectral-Ripple Discrimination (SRD) Test
The SRD test was performed as described by Won et al. (2007). To create spectral-ripple stimuli, the following equation was used:
in which D is the ripple depth in dB, R is ripples/octave, Fi is the number of octaves at the ith component frequency (i.e.,[(i−1)log10(50)]/[200log10(2)]), φis the spectral modulation starting phase in radians, t is time in seconds, φ1 is the randomized phase in radians (ranging between 0 to 2π) for each of the 2,555 pure-tone components. A ripple depth (D) of 30 dB was used. For the reference stimulus, the spectral modulation starting phase of the full-wave-rectified sinusoidal spectral envelope was set to zero radians, and for the “oddball” stimulus, the phase was set to π/2 radians. The pure tones were spaced equally on a logarithmic frequency scale with a bandwidth of 100–4991 Hz, ensuring a clear representation of the spectral peaks and valleys for stimuli with higher ripple densities. The ripple peaks were spaced equally on a logarithmic frequency scale. The stimuli had a total duration of 500 ms and were ramped with 150 ms linear rise/fall times. The stimuli were filtered with a long-term, speech-shaped filter that was created in CoolEdit 2000, with parameters specified in accordance with the findings of Byrne et al. (1994). The stimuli were presented at 65 dBA. To measure SRD thresholds, a three-interval, three-alternative forced-choice (3-AFC) paradigm with an adaptive two-up and one-down procedure was used. The order of presentation of the three tokens was randomized, and the subject’s task was to select the “oddball” stimulus. A custom-designed clicker was used for both sighted and blind subjects. The subjects were instructed to press the button associated with the order of the oddball stimulus that was presented. For example, if subjects thought that the second sound was the oddball, they were instructed to press the second button of the clicker. The ripple density was varied between 0.125 and 11.314 ripples per octave in equal-ratio steps of 1.414 in an adaptive manner with 13 reversals that converges to the 70.7% correct point (Levitt, 1971). A level attenuation of 1–8 dB (in 1-dB increments) was randomly selected for each interval in the three-interval task. The SRD threshold for each adaptive run was calculated as the geometric mean of the last eight reversals of 13 reversals. The SRD threshold was determined by averaging the threshold from three testing runs.
Temporal Modulation Detection (TMD) Test
The TMD test was performed using the method described by Won et al. (2011b). For the modulated stimuli, sinusoidal amplitude modulation was applied to a fresh wideband white noise carrier for each presentation. A modulation frequency of 100 Hz was used. The stimuli were presented at 65 dBA. The stimulus duration for both modulated and steady signals was 1 s. The modulated and steady signals were gated on and off with 10-ms linear ramps and they were concatenated with no gap between the two signals. The TMD threshold was measured using a 2-interval, 2-AFC paradigm. One of the intervals contained modulated noise, and the other interval consisted of steady noise. The subjects were asked to press either the first or second button that was associated with the order of the presentation of modulated noise. For example, if the subjects thought that the second sound was modulated noise, they were instructed to press the second button of the clicker. A two-down, one-up adaptive procedure was used to measure TMD threshold. The TMD thresholds (in dB) relative to 100% modulation [i.e., 20log10 (mi)] were obtained, where mi indicates the modulation index. The adaptive tracking procedure began with a modulation depth of 100% and changed in steps of 4 dB from the first to the fourth reversal, and 2 dB for the next 10 reversals. The TMD threshold for each adaptive test run was calculated as the mean of the final 10 reversals. The TMD threshold was determined by averaging the thresholds from three separate test runs.
Speech Recognition Threshold (SRT) Test
To measure SRTs, equally difficult spondee words in Korean (two syllable words that are both pronounced with equal stress) spoken by a male speaker were presented in noise, which was spectrally shaped to have the same long-term power spectrum as the spondees. In all trials, the masker was gated on and off with 50-ms linear ramps 500 ms before and 50 ms after the target spondees. The mixture of the target spondee and masker stimuli was presented monaurally to the test ear. SRTs corresponding to 50% intelligibility were measured using a one-up, one-down adaptive procedure. Each run started with a signal-to-noise ratio (SNR) of 6 dB, for which subjects were easily able to identify the spondee correctly. The subjects were instructed to repeat the spondee word that they heard. The tester then determined if the subjects repeated the spondee word correctly. If a subject correctly repeated the spondee, the SNR for the next spondee was decreased; otherwise, the SNR was increased. The level of the target spondee was fixed at 65 dBA and the level of the noise was varied in an adaptive manner. An initial step size of 4 dB was used for the first two reversals in the adaptive track, after which the step size was fixed at 2 dB for the next six reversals. When a subject showed a total 8 reversals, the adaptive run ended. The SRT for a given run was based on the average of the SNRs at each of the last six reversals of 8 reversals in the adaptive track. No spondee was repeated for any subject. Three adaptive runs were completed. The final SRT for each subject was taken as the mean of three separate adaptive runs.
Acoustic Change Complex (ACC) Stimuli
The first 1 s of the stimulus consisted of a standard ripple and the last 1 s contained an inverted ripple. Therefore, there was a spectral change at the 1-s point. Each half of each stimulus was individually created and then concatenated (Figure 1). Because the concatenation could produce frequency components beyond 5 kHz at the 1-s point, a 5 kHz low-pass filter was applied to the entire 2-s stimuli. To prevent subjects from perceiving a cue due to the temporal pattern of transition from the standard to inverted ripple at the 1-s point, the phase of the first and last 1 s was randomized for each presentation using random phases for the 2555 individual frequency components. Four ripple density conditions were tested (0.25, 1, 4, and 8 ripples/octave). The 2-s duration stimuli were presented with an inter-stimulus interval of 2 s. Each ripple density condition involved 100 random presentations of standard ripple-inverted ripple stimuli.
Figure 1. Stimuli waveforms (upper panel), spectrograms (middle panel), and zoom-in of time-domain waveforms from 0.98 to 1.02 s (lower panel) for a standard-inverted stimulus. The ripple density was 1 ripple/octave for both stimuli.
Acoustic change complex responses were recorded across 32 channels using the actiCHamp Brain Products recording system (Brain Products GmbH, Inc., Munich, Germany) during passive listening to standard ripple-inverted ripple stimuli. The blind subjects sat in a comfortable chair reading a braille book. During recording, we encouraged the participants to stay still during the test with their heads and the elbows within a fixed range. We also instructed them not to move wrist and finger. The sighted subjects sat watching a muted, close-captioned movie. A notch filter at 60 Hz was set to prevent powerline noise and the impedance of all scalp electrodes was kept <5 kΩ.
The collected data were analyzed using Brain vision analyzer version 2.0 (Brain Products GmbH, Inc). The band-pass filter was set at 0.1∼60 Hz after removing eye blinks and body movement artifacts. In addition, independent component analysis was used to adjust for eye blinks. Ripple stimuli data were separated from 200 ms before stimulus presentation to 200 ms after stimulation (−200 to 2200 ms) based on the standard ripple stimulus presentation time. Baseline correction was performed using the interval before stimulation presentation, and potential averaging was performed. Using a semi-automatic peak detection algorithm in the Brain vision analyzer software, the largest negative deflection that occurred between 100 and 200 ms after stimulus onset was defined as the peak amplitude of N1. The peaks were visually inspected and manually adjusted if necessary.
The younger age of sighted subjects compared with the LB subjects might have affected the results because sighted subjects had less experience compared to LB subjects. To control for differences in age among the groups, analysis of covariance (ANCOVA) with the Bonferroni post hoc test was used to compare the SRD thresholds, TMD thresholds, and peak amplitude of N1 among the three groups. The Kruskal–Wallis test with Bonferroni’s correction (α = 0.05/3 = 0.017) was used to compare age and SRT. Correlation analyses were determined using Pearson’s correlation coefficient.
One-way ANCOVA controlling for age revealed a significant difference in the SRD thresholds among the EB subjects, LB subjects, and sighted subjects (11.6 ± 3.6, 8.6 ± 2.4, and 8.0 ± 1.7 ripples/octave, respectively, F(2:49) = 9.116; p < 0.001). The Bonferroni post hoc test revealed that the SRD threshold was significantly better in EB subjects than in LB subjects and sighted subjects (p = 0.010 and p = 0.001, respectively; Figure 2A). One-way ANCOVA also revealed a significant difference in the TMD thresholds among the three groups (−22.6 ± 3.1, −22.0 ± 2.4, and −18.8 ± 2.2 dB, respectively, F(2:51) = 8.980; p < 0.001). The Bonferroni post hoc test revealed that TMD was better in EB and LB subjects than in sighted subjects (p < 0.001 and p = 0.035, respectively; Figure 2B). There was no significant difference in the SRT in noise among three groups [H(2) = 1.478; p = 0.478; Figure 2C]. In the 35 EB and LB subjects, SRD thresholds were significantly correlated with the duration of blindness (r = 0.386, p = 0.024; Figure 3A), whereas TMD thresholds and SRTs were not correlated with duration of blindness (r = −0.298, p = 0.082 and r = −0.104, p = 0.552). In 16 LB subjects, SRD thresholds showed a trend toward significance in the correlation with the age at blindness onset (r = −0.487, p = 0.056; Figure 3B), whereas TMD thresholds and SRTs were not correlated with the age at blindness onset (r = 0.297, p = 0.264 and r = 0.186, and p = 0.490). Figure 4 shows the grand mean ACC potentials in response to the ripple noise change at the Cz and Fz electrodes. Although the N1 peak amplitude originating from the inverted ripple sound was numerically greater in EB subjects than in LB subjects or sighted subjects, there were no statistically significant differences [F(2:43) = 2.735; p = 0.076 and F(2:43) = 3.027; p = 0.059].
Figure 2. Comparison of three measures of psychoacoustic performance and acoustic change complex (ACC) among early blind (EB), late blind (LB), and sighted subjects. (A) The spectral-ripple discrimination (SRD) threshold was greater in EB subjects (i.e., better SRD) than in LB and sighted subjects. (B) The temporal modulation detection (TMD) threshold was lower (i.e., better TMD) in EB subjects and in LB subjects compared with sighted subjects. (C) The speech reception thresholds (SRTs) in noise were not significantly different among the three groups of subjects. ∗Significantly different between the indicated groups (p < 0.05).
Figure 3. (A) The SRD threshold was significantly correlated with the duration blindness duration among 35 individuals from the EB and LB groups. (B) For 16 LB subjects, SRD thresholds showed a trend toward significance in the correlation with the ages of onset of blindness.
Figure 4. Comparisons of acoustic change complex (ACC) at the Cz (A) and Fz (B) electrodes among early blind (EB), late blind (LB), and sighted subjects. Although the N1 peak amplitude originating from the inverted ripple sound was numerically greater in EB subjects than in LB subjects or sighted subjects, there were no statistically significant differences.
Psychoacoustic data showed that EB subjects showed better performance in auditory spectral and temporal resolution compared with sighted subjects. Our findings are largely consistent with those of previous studies for spectral resolution (Gougoux et al., 2004; Hamilton et al., 2004; Wan et al., 2010), and for temporal resolution (Stevens and Weaver, 2005; Hertrich et al., 2013). Voss and Zatorre (2012) and Voss et al. (2014) demonstrated that heightened pitch discrimination in blind subjects was directly related to the degree of structural neuroplasticity in the cortex. Our behavioral results also support the theory of compensatory plasticity of cerebral cortex following visual deprivation. There are several hypotheses regarding this plasticity. First, the visual cortex is recruited to perform auditory functions (Leclerc et al., 2000; Gougoux et al., 2009; Collignon et al., 2011; Voss and Zatorre, 2012). Second, preexisting audiovisual connections that are masked by a dominant visual input can be unmasked and strengthened (Beer et al., 2011; Collignon et al., 2013; Pelland et al., 2017). Third, new neural connections may develop between each brain region (Karlen et al., 2006; Chabot et al., 2008). On the other hand, another view of no experience of sight in EB subjects proposed that vision is inevitable to develop certain auditory performances in particular with regards to auditory sense of space or auditory map. According to the cross-sensory calibration hypothesis proposed by Gori et al. (2008, 2010, 2014), during early development vision calibrates other senses to process spatial information for which it is the most robust sense. The sound localization task in vertical plane (Zwiers et al., 2001; Lewald, 2002) or the performance of more complex tasks requiring a metric representation of the auditory space (Gori et al., 2010, 2013, 2014; Finocchietti et al., 2015; Vercillo et al., 2016) tend to be worse in EB subjects than in sighted controls. Moreover, a recent study revealed that early occipital activation during spatial bisection tasks was reduced in EB subjects compared with sighted subjects (Campus et al., 2019). We speculate that unlike the sound localization, auditory spectral and temporal resolutions may not require vision for their maturation.
LB subjects showed better temporal resolution, but not spectral resolution, compared with sighted subjects. Two hypotheses are available to explain these results. First, there may be a critical period for plastic changes of spectral resolution at a young age, so LB subjects have limited chance for functional enhancement. In contrast, the better temporal resolution compared with sighted subjects implies that the functional enhancement following visual deprivation might not be affected by the age at onset of blindness. We speculate that this hypothesis is supported by the trends toward significant negative correlations between SRD thresholds and age at blindness onset in LB subjects, whereas TMD thresholds were not correlated with age at blindness onset. We also speculate that the existence of a critical period for plastic changes of spectral resolution is plausible considering the previous reports. Gougoux et al. (2004) reported very similar results to ours, showing that people experiencing blindness at a younger age show better pitch discrimination. Similarly, Voss et al. (2008) found a significant negative correlation between the age at blindness onset and sound-source discrimination. In an fMRI study, Qin et al. (2015) found that visual deprivation before the developmental sensitive period can induce more extensive brain functional reorganization than does visual deprivation after the sensitive period. Second, auditory spectral resolution depends primarily on active movement of outer hair cells and cochlear tuning (Dubno and Schaefer, 1995; Oxenham and Bacon, 2003). However, it has been hypothesized temporal modulations are represented by the neural firing in auditory cortex, and the auditory cortex plays a key role in temporal processing of sound (Schulze and Langner, 1997; Eggermont, 2001; Bao et al., 2004). In support of this temporal coding mechanism, Schnupp et al. (2006) and Bao et al. (2004) reported the plasticity of temporal coding in auditory cortex by sensory training in animals and Asal et al. (2018) reported a similar result for humans. The relatively large contribution of the central auditory system to temporal resolution, and a high degree of plastic property of auditory cortex may induce relatively fast plastic changes in temporal resolution following visual deprivation, and this phenomenon can provide LB subjects with an advantage in temporal resolution compared with sighted subjects, unlike spectral resolution.
In the current study, the EB subjects showed significantly better SRD performance than the LB subjects, but there was no difference in TMD between the two groups. These results may suggest a different pattern or a different speed of functional enhancement following visual deprivation between spectral resolution and temporal resolution. Several hypotheses are available to explain these results. First, the positive significant correlation between SRD and the duration of blindness supports the hypothesis of the gradual improvement of SRD over time following visual deprivation. In contrast, the relationship between TMD and the duration of blindness did not reach statistical significance. Taken together, we speculate that it may take a longer time for auditory spectral resolution to undergo functional enhancement, whereas temporal resolution may improve much more quickly following visual deprivation. Different correlations between the duration of blindness and sound-induced behavioral performances or the sound-related activity of the brain were reported in previous studies. Some authors reported positive correlations (Qin et al., 2015; Tao et al., 2015) whereas others reported negative correlations (Collignon et al., 2013; Wang et al., 2013; Amadeo et al., 2019). Second, several studies reported a natural development of SRD in sighted children with normal hearing threshold showing gradual improvement by 7–11 years of age and adult-like performance was seen at 9–12 years old (Blagosklonova et al., 1989; Allen and Wightman, 1994, Peter et al., 2014). EB subjects go through this developing period without visual stimulation, and therefore this time less than 12 years of age could be a critical period for plastic changes of spectral resolution. In contrast, all LB subjects in the current study have the age of visual loss greater than 12 years of age except for two subjects (9 and 10 years of age). Taken together, it is plausible that if blindness occurs before around the age of 12, such individuals may be able to experience the plastic change of spectral resolution, which may not be manifested in blind subjects whose blindness occur after this critical period. Third, previous studies reported that occipital recruitment for auditory processing appears to be reduced in LB compared with EB (Voss et al., 2008; Bedny et al., 2010, 2012; Collignon et al., 2013). From this perspective, we speculate that the reduced levels of plasticity in LB might have been insufficient to gain an advantage in spectral resolution in the current study, but it could be still sufficient to gain an advantage in temporal resolution.
Although EB and LB subjects showed advantages in auditory spectral and temporal resolution compared with sighted controls, we found no significant differences in speech perception in noise among the three groups. Only a few studies have examined the speech perception ability in a similar setting, and their results were inconsistent with each other (Muchnik et al., 1991; Gougoux et al., 2009; Arnaud et al., 2018). Muchnik et al. (1991) found no difference in speech perception in silent conditions between a mixed group of EB and LB subjects and a group of sighted subjects, but the blind subjects outperformed the sighted subjects when tested in noisy conditions. Gougoux et al. (2009) found no behavioral differences in voice recognition among EB, LB, and sighted control subjects. Arnaud et al. (2018) demonstrated better discrimination for native vowel in the blind subjects compared with sighted subjects, but no difference for non-native vowel. In this study, two-syllable words were presented in the presence of background noise. As such, the listener’s ability to understand semantic cues might play an important role in addition to the ability to discriminate subtle speech cues. It is possible that speech discrimination measures that isolate semantic cues and force listeners to focus on acoustic cues for discrimination may better characterize the difference in speech understanding between EB subjects and sighted subjects. Further studies are therefore needed to assess speech perception among blind subjects.
There are two limitations in the present study. Although we analyzed the data using ANCOVA to control for differences in age, the fact that younger age of sighted subjects compared with the LB subjects might have affected the result of temporal resolution because sighted subjects had a less amount of experience compared with LB subjects. The second one is that age at blindness onset for the LB group was quite variable. However, this is unlikely to affect the results for the following reasons. Earliest onset of blindness was at 9 years of age, and therefore, a significant amount of time has passed since their birth. Furthermore, in 16 LB subjects, the three psychoacoustic performance measures (i.e., SRD thresholds, TMD thresholds, and SRTs) were not correlated with the age at onset of blindness.
In conclusion, compared with sighted subjects, EB subjects showed advantages in auditory spectral and temporal resolution, while LB subjects showed an advantage in temporal resolution exclusively. These findings suggest that it might take longer for auditory spectral resolution to functionally enhance following visual deprivation compared with temporal resolution or a critical period of very young age is required to improve auditory spectral resolution following visual deprivation.
Data Availability Statement
All datasets generated for this study are included in the article/supplementary material.
The studies involving human participants were reviewed and approved by the Institutional Review Board of Eulji Medical Center, Seoul, South Korea. The patients/participants provided their written informed consent to participate in this study.
HS: conceptualization. GG, HL, SC, and JW: methodology. GG: investigation. GG and HL: data analysis. HS: writing original draft. JW: writing review and editing.
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean Government (MSIT) (NRF-2019R1H1A2039693 and NRF-2017R1A2B1007431).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The views expressed in this paper are those of the authors and do not necessarily reflect the official policy or position of the US Department of Health and Human Services and the U.S. Food and Drug Administration. No official endorsement is intended or should be inferred.
Ji-Hye Han and Jeong-Sug Kyong helped to set up the EEG and analyze the EEG data. Hojun Yeom provided the button box to record the responses of blind subjects. Ko Eun Kim provided guidance on evaluation of visual acuity among blind subjects.
Amadeo, M. B., Campus, C., and Gori, M. (2019). Impact of years of blindness on neural circuits underlying auditory spatial representation. Neuroimage 191, 140–149. doi: 10.1016/j.neuroimage.2019.01.073
Amedi, A., Raz, N., Pianka, P., Malach, R., and Zohary, E. (2003). Early ‘visual’ cortex activation correlates with superior verbal memory performance in the blind. Nat. Neurosci. 6, 758–766. doi: 10.1038/nn1072
Arnaud, L., Gracco, V., and Ménard, L. (2018). Enhanced perception of pitch changes in speech and music in early blind adults. Neuropsychologia 117, 261–270. doi: 10.1016/j.neuropsychologia.2018.06.009
Asal, S. I., Sobhy, O. A., and Morsy, H. M. (2018). Effect of auditory temporal processing training on behavioral and electrophysiological functions in central presbycusis. Egypt. J. Otolaryngol. 34, 68–75.
Bao, S., Chang, E. F., Woods, J., and Merzenich, M. M. (2004). Temporal plasticity in the primary auditory cortex induced by operant perceptual learning. Nat. Neurosci. 7, 974–981. doi: 10.1038/nn1293
Bedny, M., Konkle, T., Pelphrey, K., Saxe, R., and Pascual-Leone, A. (2010). Sensitive period for a multimodal response in human visual motion area MT/MST. Curr. Biol. 20, 1900–1906. doi: 10.1016/j.cub.2010.09.044
Bedny, M., Pascual-Leone, A., Dravida, S., and Saxe, R. (2012). A sensitive period for language in the visual cortex: distinct patterns of plasticity in congenitally versus late blind adults. Brain Lang. 122, 162–170. doi: 10.1016/j.bandl.2011.10.005
Beer, A. L., Plank, T., and Greenlee, M. W. (2011). Diffusion tensor imaging shows white matter tracts between human auditory and visual cortex. Exp. Brain Res. 213, 299–308. doi: 10.1007/s00221-011-2715-y
Blagosklonova, N., Markova, E., Novikova, L., Rybalko, N., and Supin, A. (1989). Auditory frequency resolving power in normal children and ones with hearing loss. Int. J. Psychophysiol. 7, 148–149. doi: 10.1016/0167-8760(89)90094-9
Campus, C., Sandini, G., Amadeo, M. B., and Gori, M. (2019). Stronger responses in the visual cortex of sighted compared to blind individuals during auditory space representation. Sci. Rep. 9:1935. doi: 10.1038/s41598-018-37821-y
Cappagli, G., Finocchietti, S., Cocchi, E., and Gori, M. (2017b). The impact of early visual deprivation on spatial hearing: a comparison between totally and partially visually deprived children. Front. Psychol. 8:467. doi: 10.3389/fpsyg.2017.00467
Chabot, N., Charbonneau, V., Laramee, M. E., Tremblay, R., Boire, D., and Bronchti, G. (2008). Subcortical auditory input to the primary visual cortex in anophthalmic mice. Neurosci. Lett. 433, 129–134. doi: 10.1016/j.neulet.2008.01.003
Collignon, O., Dormal, G., Albouy, G., Vandewalle, G., Voss, P., Phillips, C., et al. (2013). Impact of blindness onset on the functional organization and the connectivity of the occipital cortex. Brain 136, 2769–2783. doi: 10.1093/brain/awt176
Collignon, O., Vandewalle, G., Voss, P., Albouy, G., Charbonneau, G., Lassonde, M., et al. (2011). Functional specialization for auditory-spatial processing in the occipital cortex of congenitally blind humans. Proc. Natl. Acad. Sci. U.S.A. 108, 4435–4440. doi: 10.1073/pnas.1013928108
Dubno, J. R., and Schaefer, A. B. (1995). Frequency selectivity and consonant recognition for hearing-impaired and normal-hearing listeners with equivalent masked thresholds. J. Acoust. Soc. Am. 97, 1165–1174. doi: 10.1121/1.413057
Eggermont, J. J. (2001). Temporal modulation transfer functions in cat primary auditory cortex: separating stimulus effects from neural mechanisms. J. Neurophysiol. 87, 305–321. doi: 10.1152/jn.00490.2001
Fieger, A., Röder, B., Teder-Sälejärvi, W., Hillyard, S. A., and Neville, H. J. (2006). Auditory spatial tuning in late-onset blindness in humans. J. Cogn. Neurosci. 18, 149–157. doi: 10.1162/jocn.2006.18.2.149
Gori, M., Sandini, G., Martinoli, C., and Burr, D. (2010). Poor haptic orientation discrimination in nonsighted children may reflect disruption of cross-sensory calibration. Curr. Biol. 20, 223–225. doi: 10.1016/j.cub.2009.11.069
Gougoux, F., Belin, P., Voss, P., Lepore, F., Lassonde, M., and Zatorre, R. J. (2009). Voice perception in blind persons: a functional magnetic resonance imaging study. Neuropsychologia 47, 2967–2974. doi: 10.1016/j.neuropsychologia.2009.06.027
Hertrich, I., Dietrich, S., and Ackermann, H. (2013). How can audiovisual pathways enhance the temporal resolution of time-compressed speech in blind subjects? Front. Psychol. 4:530. doi: 10.3389/fpsyg.2013.00530
Hugdahl, K., Ek, M., Takio, F., Rintee, T., Tuomainen, J., Haarala, C., et al. (2004). Blind individuals show enhanced perceptual and attentional sensitivity for identification of speech sounds. Cogn. Brain Res. 19, 28–32. doi: 10.1016/j.cogbrainres.2003.10.015
Karlen, S. J., Kahn, D. M., and Krubitzer, L. (2006). Early blindness results in abnormal corticocortical and thalamocortical connections. Neuroscience 142, 843–858. doi: 10.1016/j.neuroscience.2006.06.055
Leclerc, C., Saint-Amour, D., Lavoie, M. E., Lassonde, M., and Lepore, F. (2000). Brain functional reorganization in early blind humans revealed by auditory event-related potentials. Neuroreport 11, 545–550. doi: 10.1097/00001756-200002280-00024
Martin, B. A., and Boothroyd, A. (1999). Cortical, auditory, event-related potentials in response to periodic and aperiodic stimuli with the same spectral envelope. Ear Hear. 20, 33–44. doi: 10.1097/00003446-199902000-00004
Moos, A., and Trouvain, J. (2007). “Comprehension of Ultra-Fast Speech–Blind vs.‘Normally Hearing’Persons,” in Proceedings of the 16th International Congress of Phonetic Sciences, (Saarbrücken: Saarland University), 677–680.
Pelland, M., Orban, P., Dansereau, C., Lepore, F., Bellec, P., and Collignon, O. (2017). State-dependent modulation of functional connectivity in early blind individuals. Neuroimage 147, 532–541. doi: 10.1016/j.neuroimage.2016.12.053
Peter, V., Wong, K., Narne, V. K., Sharma, M., Purdy, S. C., and McMahon, C. (2014). Assessing spectral and temporal processing in children and adults using temporal modulation transfer function (TMTF), iterated ripple noise (IRN) perception, and spectral ripple discrimination (SRD). J. Am. Acad. Audiol. 25, 210–218. doi: 10.3766/jaaa.25.2.9
Postma, A., Zuidhoek, S., Noordzij, M. L., and Kappers, A. M. (2008). Haptic orientation perception benefits from visual experience: evidence from early-blind, late-blind, and sighted people. Percept. Psychophys. 70, 1197–1206. doi: 10.3758/PP.70.7.1197
Schnupp, J. W., Hall, T. M., Kokelaar, R. F., and Ahmed, B. (2006). Plasticity of temporal pattern codes for vocalization stimuli in primary auditory cortex. J. Neurosci. 26, 4785–4795. doi: 10.1523/JNEUROSCI.4330-05.2006
Schulze, H., and Langner, G. (1997). Periodicity coding in the primary auditory cortex of the Mongolian gerbil (Meriones unguiculatus): two different coding strategies for pitch and rhythm? J. Comp. Physiol. A 181, 651–663. doi: 10.1007/s003590050147
Tao, Q., Chan, C. C., Luo, Y.-J., Li, J.-J., Ting, K.-H., Wang, J., et al. (2015). How does experience modulate auditory spatial processing in individuals with blindness? Brain Topogr. 28, 506–519. doi: 10.1007/s10548-013-0339-1
Tremblay, K., Friesen, L., Martin, B., and Wright, R. (2003). Test-retest reliability of cortical evoked potentials using naturally produced speech sounds. Ear Hear. 24, 225–232. doi: 10.1097/01.aud.0000069229.84883.03
Vashist, P., Senjam, S. S., Gupta, V., Gupta, N., and Kumar, A. (2017). Definition of blindness under national programme for control of blindness: do we need to revise it? Indian J. Ophthalmol. 65, 92–96. doi: 10.4103/ijo.IJO_869_16
Vercillo, T., Burr, D., and Gori, M. (2016). Early visual deprivation severely compromises the auditory sense of space in congenitally blind children. Dev. Psychol. 52, 847–853. doi: 10.1037/dev0000103
Voss, P., Gougoux, F., Zatorre, R. J., Lassonde, M., and Lepore, F. (2008). Differential occipital responses in early- and late-blind individuals during a sound-source discrimination task. Neuroimage 40, 746–758. doi: 10.1016/j.neuroimage.2007.12.020
Voss, P., Pike, B. G., and Zatorre, R. J. (2014). Evidence for both compensatory plastic and disuse atrophy-related neuroanatomical changes in the blind. Brain 137, 1224–1240. doi: 10.1093/brain/awu030
Wan, C. Y., Wood, A. G., Reutens, D. C., and Wilson, S. J. (2010). Early but not late-blindness leads to enhanced auditory perception. Neuropsychologia 48, 344–348. doi: 10.1016/j.neuropsychologia.2009.08.016
Won, J. H., Clinard, C. G., Kwon, S., Dasika, V. K., Nie, K., Drennan, W. R., et al. (2011a). Relationship between behavioral and physiological spectral-ripple discrimination. J. Assoc. Res. Otolaryngol. 12, 375–393. doi: 10.1007/s10162-011-0257-4
Won, J. H., Drennan, W. R., Nie, K., Jameyson, E. M., and Rubinstein, J. T. (2011b). Acoustic temporal modulation detection and speech perception in cochlear implant listeners. J. Acoust. Soc. Am. 130, 376–388. doi: 10.1121/1.3592521
Won, J. H., Drennan, W. R., and Rubinstein, J. T. (2007). Spectral-ripple resolution correlates with speech reception in noise in cochlear implant users. J. Assoc. Res. Otolaryngol. 8, 384–392. doi: 10.1007/s10162-007-0085-8
Keywords: early blindness, late blindness, visual deprivation, spectral resolution, temporal resolution, speech perception
Citation: Shim HJ, Go G, Lee H, Choi SW and Won JH (2019) Influence of Visual Deprivation on Auditory Spectral Resolution, Temporal Resolution, and Speech Perception. Front. Neurosci. 13:1200. doi: 10.3389/fnins.2019.01200
Received: 11 July 2019; Accepted: 23 October 2019;
Published: 06 November 2019.
Edited by:Joachim Lange, Heinrich Heine University Düsseldorf, Germany
Reviewed by:Giulia Cappagli, Neurological Institute Foundation Casimiro Mondino (IRCCS), Italy
David Amooti Magezi, Universität Hamburg, Germany
Copyright © 2019 Shim, Go, Lee, Choi and Won. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hyun Joon Shim, email@example.com