Effects of Phase-Locking Deficits on Speech Recognition in Older Adults With Presbycusis

Hao, Wenyang; Wang, Qian; Li, Liang; Qiao, Yufei; Gao, Zhiqiang; Ni, Daofeng; Shang, Yingying

doi:10.3389/fnagi.2018.00397

ORIGINAL RESEARCH article

Front. Aging Neurosci., 06 December 2018

Sec. Neurocognitive Aging and Behavior

Volume 10 - 2018 | https://doi.org/10.3389/fnagi.2018.00397

Effects of Phase-Locking Deficits on Speech Recognition in Older Adults With Presbycusis

Wenyang Hao^1†

Qian Wang^2†

Liang Li³

Yufei Qiao¹

Zhiqiang Gao¹

Daofeng Ni¹

Yingying Shang^1*

¹Department of Otorhinolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
²Epilepsy Center, Department of Clinical Psychology, Sanbo Brain Hospital, Capital Medical University, Beijing, China
³School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Speech and Hearing Research Center, Key Laboratory on Machine Perception (Ministry of Education), Peking University, Beijing, China

Objective: People with presbycusis (PC) often report difficulties in speech recognition, especially under noisy listening conditions. Investigating the PC-related changes in central representations of envelope signals and temporal fine structure (TFS) signals of speech sounds is critical for understanding the mechanism underlying the PC-related deficit in speech recognition. Frequency-following responses (FFRs) to speech stimulation can be used to examine the subcortical encoding of both envelope and TFS speech signals. This study compared FFRs to speech signals between listeners with PC and those with clinically normal hearing (NH) under either quiet or noise-masking conditions.

Methods: FFRs to a 170-ms speech syllable /da/ were recorded under either a quiet or noise-masking (with a signal-to-noise ratio (SNR) of 8 dB) condition in 14 older adults with PC and 13 age-matched adults with NH. The envelope (FFR_ENV) and TFS (FFR_TFS) components of FFRs were analyzed separately by adding and subtracting the alternative polarity responses, respectively. Speech recognition in noise was evaluated in each participant.

Results: In the quiet condition, compared with the NH group, the PC group exhibited smaller F0 and H3 amplitudes and decreased stimulus-response (S-R) correlation for FFR_ENV but not for FFR_TFS. Both the H2 and H3 amplitudes and the S-R correlation of FFR_ENV significantly decreased in the noise condition compared with the quiet condition in the NH group but not in the PC group. Moreover, the degree of hearing loss was correlated with noise-induced changes in FFR_TFS morphology. Furthermore, the speech-in-noise (SIN) threshold was negatively correlated with the noise-induced change in H2 (for FFR_ENV) and the S-R correlation for FFR_ENV in the quiet condition.

Conclusion: Audibility affects the subcortical encoding of both envelope and TFS in PC patients. The impaired ability to adjust the balance between the envelope and TFS in the noise condition may be part of the mechanism underlying PC-related deficits in speech recognition in noise. FFRs can predict SIN perception performance.

Introduction

Presbycusis (PC) is the third most common chronic disorder in elderly people, reflecting the degradation of auditory-processing functions in both peripheral and central systems (Yueh et al., 2003). Listeners with PC often manifest both symmetrical sensorineural hearing loss (SNHL) and impaired speech recognition (Deng et al., 2014), especially in noisy environments (Li et al., 2004; Divenyi et al., 2005; Gifford et al., 2007; Huang et al., 2008; Salonen et al., 2013). However, the link between PC and augmented vulnerability of speech recognition to noise masking is still not completely understood.

After a soundwave (such as speech sound) reaches the ears, the peripheral auditory system filters the sound wave into bands of narrowband waves through a series of band-pass filters, and the output signals from each of the narrowband channels are further decomposed into fast fluctuating temporal fine structures (TFSs) and slowly varying envelopes (ENVs; Moore, 2008). Considerable evidence has suggested that auditory aging markedly affects the detection of both the TFS and envelope components (Füllgrabe et al., 2003, 2015; Buss et al., 2004; Lorenzi et al., 2006, 2012; Souza and Boike, 2006; Hopkins and Moore, 2007; Fogerty and Humes, 2012; Moore et al., 2012; Füllgrabe, 2013; Rufener et al., 2016). It is of interest and importance to determine whether the age-related deficits in processing TFS and envelope signals start to occur at the level of the auditory brainstem.

Scalp-recorded frequency-following responses (FFRs) are sustained neuro-electrical potentials representing the periodicity of acoustic stimuli (Worden and Marsh, 1968; Moushegian et al., 1973) with the origin site in the auditory midbrain, including the inferior colliculus (Weinberger et al., 1970; Marsh et al., 1974; Smith et al., 1975; Sohmer et al., 1977; Ping et al., 2008; Du et al., 2009; Chandrasekaran and Kraus, 2010; Bidelman, 2015; Wang and Li, 2015, 2018; Luo et al., 2017). Both the sound TFS component (e.g., Galbraith, 1994; Krishnan, 2002; Krishnan and Gandour, 2009; Chandrasekaran and Kraus, 2010; Du et al., 2011) and the envelope component (also called the envelope-following response or the steady-state evoked response; e.g., Hall, 1979; Dolphin and Mountain, 1992, 1993; Supin and Popov, 1995; Russo et al., 2004; Aiken and Picton, 2006, 2008; Shinn-Cunningham et al., 2013; Zhu et al., 2013) are represented in the FFRs and are, therefore, useful for studying the mechanisms underlying speech recognition in noisy environments (Du et al., 2011). In both humans and rats, auditory brainstem responses (ABRs) induced by complex sound signals (e.g., speech syllables composed of consonants and vowels) or noises contain both transient responses and sustained FFRs (Skoe and Kraus, 2010; Wang and Li, 2018), and both the TFS component (FFR_TFS) and the envelope component (FFR_ENV) of FFRs can be assessed independently (Aiken and Picton, 2008).

To date, several studies have used FFR recordings to investigate how aging affects auditory processing. For instance, compared with younger adults with normal hearing (NH), older adults with clinically NH exhibit weakened phase locking and reduced FFR magnitudes (Anderson et al., 2012; Clinard and Tremblay, 2013; Bidelman et al., 2014). Additionally, among older adults with NH, the fundamental frequency (F0) magnitude of FFRs is larger and less affected by noise in those who perform better on the speech-in-noise (SIN) test (Anderson et al., 2011). Thus, older adults lose temporal precision in the subcortical encoding of sounds, leading to difficulty with speech perception against masking. It is important to determine whether the aging effects are enhanced by PC and particularly whether the extent of age-related brain atrophy is associated with central hearing loss in older adults (Giroud et al., 2018).

Recently, Ananthakrishnan et al. (2016) examined FFRs in adult listeners with or without hearing loss and revealed that the neural representations of the envelopes and TFSs are weaker in listeners with SNHL. However, Ananthakrishnan et al.’s included young and middle-aged NH listeners and SNHL patients with a wide range of ages rather than only listeners with PC. Thus, further studies are needed to clarify how the aging effects are modulated by audibility.

In older adults with or without hearing loss, Anderson et al. (2013a) reported greater auditory-nerve coding of sound envelopes and no significant difference in the TFS in the SNHL groups. The stimulus used in their study was a short stimulus (e.g., 40 ms /da/), which did not have a real steady-state vowel of the syllable, although it could be perceived as a consonant-vowel syllable. Thus, their results may not be able to accurately represent the subcortical encoding ability of the envelope and the TFS. Therefore, in listeners with PC, the exact effect of audibility deficits on the subcortical encoding of the envelope and the TFS remains unclear. Furthermore, although the FFR in noise conditions is more likely to reflect speech recognition in noise than the FFR in quiet conditions, Ananthakrishnan et al. (2016) did not test the FFR in noise conditions. The study by Anderson et al. (2013a) was the only to examine FFR in noise, and data on the FFR in listeners with PC are still very scarce. In addition, it has also been demonstrated that experience with tonal languages can affect neural plasticity at the brainstem level (Krishnan and Gandour, 2009; Krishnan et al., 2010a,b). However, most previous FFR studies included participants who were not native speakers of a tonal language. Thus, systematic studies using FFR to examine how envelope and TFS detection is affected by audibility deficits and how noise can affect envelope and TFS subcortical encoding in listeners with PC, especially native speakers of a tonal language, could be informative.

In the present study, we examined FFR under both quiet and noise conditions in older adults with and without PC who were native speakers of a tonal language (Mandarin) and analyzed the FFR results together with their pure tone audiometry (PTA) thresholds and SIN performance. We hypothesized that the loss of hearing sensitivity and the reduction in SIN perception of listeners with PC, particularly when listening under noise conditions, may be associated with the decline in the subcortical representations of the envelope and TFS signals, which can be measured in both humans (Wang et al., 2018) and laboratory animals (Wang and Li, 2015, 2017, 2018; Luo et al., 2017). The primary aim of the present study was to obtain a better understanding of the PC-related alterations in the neural representation of envelope and TFS cues.

Materials and Methods

Participants

Fourteen older adults (≥60 years old) with PC and 13 age-matched older adults with clinically NH participated in the present study. NH was defined as the following: (1) PTA air conduction thresholds no higher than 25 dB HL from 500 Hz to 3,000 Hz bilaterally (Pross et al., 2015); (2) air-bone gaps no larger than 10 dB HL; and (3) no interaural asymmetry (a difference no larger than 15 dB HL at two or more frequencies). The aged adults with PC included only those with mild to moderate symmetric SNHL, defined as follows: (1) air-bone gaps ≤10 dB HL; (2) air conduction thresholds higher than 25 dB HL for the frequencies from 500 Hz to 3,000 Hz; and (3) no interaural asymmetry (≤15 dB HL difference at two or more frequencies). All participants underwent tympanometry, and those with abnormal results were excluded. Additionally, all participants had normal cognitive abilities, as measured with the Mini-Mental State Examination (MMSE, ≥27). All participants were right handed. No participants reported any history of hearing aid usage. No participants reported any history of neurological conditions. Those who reported a history of musical training (>3 years) were also excluded.

PTA was performed using a Conera audiometer (Madsen, GN, Denmark; ISO 389). Frequencies from 0.25 kHz to 8 kHz were tested using headphones TDH 39 with a step size of 5 dB HL (ISO 8253-1:1989). The SIN test was assessed using the Hearing-in-Noise Test (HINT; Bio-logic Systems Corp., Mundelein, IL, USA). Ten-word sentences from 12 different Mandarin lists consisting of 20 sentences each were presented randomly during the SIN threshold tests. Mandarin HINT sentences were presented through the headphones at different intensities with an ipsilateral fixed speech-shaped noise masker (65 dB sound pressure level, dB SPL). The sentence was presented beginning at a −10 dB signal-to-noise ratio (SNR) and adapted to be easier or more difficult based on each participant’s responses. The step size was 4 dB for the first four sentences and 2 dB for the remaining 16 sentences. The SIN threshold was the average of the presenting SNR from sentence No. 5 through 20 (Nilsson et al., 1994).

This study was carried out in accordance with the recommendations of Declaration of Helsinki-Ethical Principles for Medical Research Involving Human Subjects, World Medical Association. The procedures used in this study were approved by the Ethics Committee of Peking Union Medical College Hospital, and all participants provided their written informed consent.

FFR Recordings

A 170-ms speech syllable, /da/ (Anderson et al., 2011, 2012; Mamo et al., 2016), which is an important elemental speech cue in Mandarin, was used as the stimulus (Figure 1). This syllable consisted of a 50-ms transition (from the stop burst of [d] to [a]) followed by a 120-ms steady-state region corresponding to the vowel [a]. During the steady-state region, the fundamental frequency (F0) remained constant at 100 Hz, and the first and second formant-related harmonics remained at 720 Hz (F1) and 1240 Hz (F2), respectively (Anderson et al., 2011, 2012).

FIGURE 1

Figure 1. Waveform and spectrogram of the stimulus /da/. (A) Waveform of the 170-ms /da/ stimulus. (B) Spectrogram of the /da/ stimulus. n.u., no unit. *Represent the peak of F0 and its harmonics.

An auditory evoked-potential-recording system (SmartEP, Intelligent Hearing Systems (IHS), Miami, FL, USA) was used to record ABRs to the speech stimulus. The /da/ stimulus was presented monaurally through electromagnetically shielded insert earphones (ER-3A) at an intensity of 85 dB SPL and a rate of 3.89 Hz (Anderson et al., 2013a; Ananthakrishnan et al., 2016). ABRs were recorded under both the quiet and noise conditions. For the noise condition, continuous white noise was presented ipsilaterally at an SNR of 8 dB. A vertical montage of four silver disc electrodes (Cz active, Fpz ground, mastoid references) was used with interelectrode impedances maintained below 5 kΩ for all recordings. The sampling rate was 2.5 kHz, and the band-pass filter was from 30 Hz to 3,000 Hz. Under either the quiet or noise condition for each ear, a block of 2,048 sweeps was collected separately for the condensation and rarefaction polarities and averaged using a 240-ms window (−40 to 200 ms). To ensure that the participants remained awake and relaxed, they were instructed to watch a muted, subtitled movie of their choice while sitting on a couch. ABR recordings were made in an electrically shielded, sound-proof booth.

Data Analyses

To extract the noninverting FFR_ENV, responses to the two different polarities were added, while responses to the two different polarities were subtracted to extract the inverted FFR_TFS (Aiken and Picton, 2008). Spectral amplitudes were computed using fast Fourier transformations for both FFR_ENV and FFR_TFS to decompose their component frequencies on a time window of 60–170 ms, which corresponded to the steady-state region of the responses. The bin size was 4.88 Hz.

For the spectral analysis of FFR_ENV, the amplitude of F0 (100 Hz) and the second and third harmonics (H2 and H3, 200 Hz and 300 Hz, respectively) were analyzed. For FFR_TFS, the amplitude of F1 (720 Hz) was analyzed. Stimulation-response (S-R) correlations were assessed for both FFR_ENV and FFR_TFS of each ear and each condition by calculating Pearson’s r value between the response and the /da/ stimulus from 0 ms to 170 ms. The FFR_TFS and FFR_ENV components were also separated by adding and subtracting responses to the two different polarities described above. The value with the optimal delay (which was associated with the maximum S-R correlation) was used to assess the S-R correlation coefficient. Cross-correlations were also used to evaluate the similarity between the responses in the quiet and noise conditions (Anderson et al., 2011). To evaluate the changes in waveform morphology induced by noise, correlation coefficients were calculated by shifting the response waveform obtained in the noise condition relative to the response waveform obtained in the quiet condition (±2 ms). The maximum correlation achieved (in terms of Pearson’s r value) was defined as the quiet-to-noise response correlation value. Fisher’s transformation was used to convert the r values to z scores for statistical analyses.

Statistical analyses were performed using SPSS 16.0 (SPSS, Inc., Chicago, IL, USA). Analysis of variance (ANOVA) was used for group (NH, PC) comparisons of the F0 amplitude and its multiple harmonic peaks. Independent sample t-tests were used to determine the differences in age, pure tone thresholds, SIN thresholds, S-R correlations and quiet-to-noise responses between the groups. Paired t-tests were used to compare the amplitude alterations of F0 and its multiple harmonic peaks between quiet and noise conditions. Paired t-tests were also used to compare S-R correlations between quiet and noise conditions. To explore the continuous relationships among age, PTA threshold, SIN threshold and FFR variables, Pearson’s correlation was used. The Bonferroni correction was used for multiple comparisons.

Results

Demographic and Audiology Results

The present study enrolled 13 older adults with NH (M/F = 6/7, age 60–74 years, average 63.1) and 14 older adults with PC (M/F = 10/4, age 60–82 years, average 65.9; Table 1). No significant age difference was found between the two groups (t = −1.339, p = 0.192). The PTA thresholds for both groups are shown in Figure 2. The SIN threshold results are also presented in Table 1, and a t-test revealed significantly higher SIN thresholds in the PC group than in the NH group (t = 7.274, p < 0.001).

TABLE 1

Table 1. Demographic and audiology results.

FIGURE 2

Figure 2. Audiology data of the participants. (A) PTA thresholds of each subject for both NH and PC groups. (B) Average PTA thresholds for the NH group and the PC group. Error bars represent SEM. NH, normal hearing; PC, presbycusis; PTA, pure tone audiometry.

Amplitude of F0 and Its Harmonics

The responses of two different polarities were added for each condition of each participant to extract the FFR_ENV. The average response waveforms of FFR_ENV for each group and their spectrograms are presented in Figure 3. According to the average waveforms, the PC group had lower FFR responses than the NH group in the quiet condition; however, in the noise condition, there was no significant amplitude difference between the NH and PC groups. According to the spectrogram of the grand average response in the quiet condition, the amplitudes of F0 and its harmonics for the PC group were lower than those for the NH group. However, in the noise condition, all the harmonics of the PC group showed higher amplitudes than those of the NH group except for F0, which was much lower in the PC group.

FIGURE 3

Figure 3. Comparison of FFR_ENV between the NH and PC groups. (A) Grand average waveforms and spectrograms of FFR_ENV for the NH and PC groups under the quiet condition. (B) Grand average waveforms and spectrograms of FFR_ENV for the NH and PC groups under the noise condition. (C) Amplitude comparison of F0, H1 and H2 (FFR_ENV) between the NH and PC groups under both quiet and noise conditions. NH, normal hearing; PC, presbycusis; FFR, frequency-following response; ENV, envelope. *p < 0.05, **p < 0.01; ***p < 0.05/8 = 0.006, Bonferroni corrected.

The responses of two different polarities were also subtracted for each condition and each participant to extract the FFR_TFS. The average response waveforms of FFR_TFS for each group and their spectrograms are shown in Figure 4. According to the average response waveforms, the FFR_TFS of the PC group showed a lower amplitude than that of the NH group under both the quiet and noise conditions, but there were no obvious amplitude differences between the two conditions for either group. According to the spectrograms, the amplitude of F1 for the NH group was higher in the noise condition than in the quiet condition, while no similar phenomenon was observed for the PC group.

FIGURE 4

Figure 4. Comparison of FFR_TFS between the NH and PC groups. (A) Grand average waveforms and spectrograms of FFR_TFS for the NH and PC groups under the quiet condition. (B) Grand average waveforms and spectrograms of FFR_TFS for the NH and PC groups under the noise condition. (C) Amplitude comparison of F1 (FFR_TFS) between the NH and PC groups under both quiet and noise conditions. NH, normal hearing; PC, presbycusis; FFR, frequency-following response; TFS, temporal fine structure.

We also quantitatively compared the amplitudes of F0 and its harmonics between the two participant groups using multivariate ANOVA (Table 2, Figures 3C, 4C), and the trends were similar to those of the average waveform and spectrogram. The overall intergroup effect was significant (F_(1,52) = 4.124, p = 0.001). The amplitudes of F0 and H3 in the quiet condition and F0 in the noise condition were significantly lower in the PC group than in the NH group, and no significant difference was detected for FFR_TFS between groups. We also investigated the effect of noise on the amplitudes of F0 and its harmonics by comparing the amplitudes between quiet and noise conditions using paired t-tests (Table 2, Figures 3C, 4C). Compared with the quiet condition, H2 and H3 showed significantly decreased amplitudes (p < 0.001; p < 0.05/8 = 0.006, Bonferroni corrected), while no significant difference was observed for the PC group or for TFS. Furthermore, group differences in the amplitude change caused by noise were also compared, and the PC group showed significantly smaller changes for H2 and H3.

TABLE 2

Table 2. FFR comparison between aged people with and without hearing loss.

In addition, in order to evaluate the group and condition effects, 2 (group: NH, PC) × 2 (condition: quiet, noise) two-way mixed-measured ANOVAs were conducted to examine the effects on the F0 amplitude and its multiple harmonic peaks, respectively. ANOVAs showed that the main effects of condition were significant for H2 (F_(1,52) = 20.067, p < 0.001) and H3 (F_(1,52) = 17.388, p < 0.001), but not for either F0 (F_(1,52) = 3.891, p = 0.054) or F1 (F_(1,52) = 1.242, p = 0.270). The main effects of group were significant for F0 (F_(1,52) = 6.801, p = 0.012) and H3 (F_(1,52) = 6.465, p = 0.014), but not for either H2 (F_(1,52) = 0.201, p = 0.656) or F1 (F_(1,52) = 0.373, p = 0.544). The interaction effect was significant for H2 (F_(1,52) = 5.971, p = 0.018) and H3 (F_(1,52) = 6.126, p = 0.017), but not for either F0 (F_(1,52) = 0.086, p = 0.770) or F1 (F_(1,52) = 1.178, p = 0.283).

Stimulus-Response Correlation

S-R correlations were analyzed for both FFR_ENV and FFR_TFS to reflect the accuracy of subcortical phase-locking encoding. Pearson’s correlation tests showed that the S-R correlation between the FFRENV and the acoustic /da/ was significant (all p < 0.05); the S-R correlation between the FFRTFS and the acoustic /da/ was also significant (p < 0.05) except for one tested ear in the quiet condition. For the NH group, the S-R correlation of FFR_ENV in the quiet condition was significantly higher than that in the noise condition (t = 3.735, p = 0.001; p < 0.05/4 = 0.01, Bonferroni corrected), while no such difference was observed for the PC group (Figure 5). In the quiet condition, the NH showed a significantly higher S-R correlation of FFR_ENV than the PC group (t = 3.487, p = 0.001; p < 0.05/4 = 0.01, Bonferroni corrected), while no such difference was observed in the noise condition (Figure 5). No significant differences were observed for FFR_TFS between groups or conditions.

FIGURE 5

Figure 5. Comparison of the S-R correlation between the NH and PC groups. The S-R correlation of FFR_ENV in the NH group in the quiet condition was significantly higher than that of the PC group in the quiet condition, and was also higher than that of the NH group in the noise condition. No significant differences were observed in FFR_TFS between groups or conditions. NH, normal hearing; PC, presbycusis; FFR, frequency-following response; ENV, envelope; TFS, temporal fine structure; S-R, stimulus-response. *p < 0.05/4 = 0.01, Bonferroni corrected.

FFR Morphology Affected by Noise

To evaluate the influence of noise on FFR morphology, Pearson’s correlation between the responses of the two conditions (quiet and noise) was calculated, and a significant correlation was found between all tested ears (all p < 0.05). Correlations between age, PTA thresholds, SIN thresholds and the r values of quiet-to-noise response correlations were also evaluated using Pearson’s correlation analysis. The results indicated a negative relationship between high-frequency PTA thresholds (2 and 4 kHz) and r values for FFR_TFS (r = −0.296, p = 0.030, Figure 6), in which higher high-frequency PTA thresholds were associated with an increased impact of noise on response morphology. No significant relationship was revealed for age. No r value differences between the NH and PC groups were detected for either FFR_ENV or FFR_TFS.

FIGURE 6

Figure 6. Correlations between high-frequency PTA thresholds and quiet-to-noise correlations. Panel (A) shows that the quiet-to-noise correlation of FFR_ENV was not significantly correlated with the high-frequency PTA threshold. Panel (B) shows that the quiet-to-noise correlation of FFR_TFS was negatively correlated with the high-frequency PTA threshold. The dash lines indicate the r values corresponding to the p value of 0.05. The r values higher than the dash lines indicate significant correlations between responses in quiet and those under the noise conditions, i.e., p < 0.05. Except for FFR_TFS of one tested ear in quiet condition, all response-stimulus correlations were significant. NH, normal hearing; PC, presbycusis; FFR, frequency-following response; TFS, temporal fine structure. *p < 0.05.

Correlation Between FFR and SIN Recognition

To clarify whether FFRs can predict SIN perception performance, the correlations between SIN thresholds and FFR variables were also assessed. The results showed that higher SIN thresholds (worse SIN performance) were significantly related to lower H2 (FFR_ENV) amplitude alteration induced by noise (r = −0.390, p = 0.004; p < 0.05/11 = 0.005, Bonferroni corrected) and lower FFR_ENV S-R correlation under quiet conditions (r = −0.395, p = 0.004; p < 0.05/11 = 0.005, Bonferroni corrected; Figure 7). No significant correlation was found for the quiet-to-noise correlation, amplitude of F0 or its harmonics, or FFR_TFS S-R correlation.

FIGURE 7

Figure 7. Correlations between SIN recognition performances and FFR variables. (A) The SIN recognition thresholds were not significantly correlated with H2 (FFR_ENV) amplitude in the quiet condition. (B) Higher SIN recognition thresholds (worse performance) were significantly correlated with lower H2 (FFR_ENV) amplitude alterations in the noise condition. (C) Higher SIN recognition thresholds (worse performance) were correlated with lower FFR_ENV S-R correlations in the quiet condition. NH, normal hearing; PC, presbycusis; FFR, frequency-following response; ENV, envelope; SIN, speech-in-noise; S-R, stimulus-response. *p < 0.05/11 = 0.005, Bonferroni corrected.

Discussion

This study investigated the FFR in elderly adults with or without PC under quiet and noise conditions. The results showed that the elevated hearing sensitivity in listeners with PC affected subcortical encoding of both the envelope and TFS. The main findings are as follows: (1) under quiet conditions, the F0 and H3 amplitudes and the S-R correlation of FFR_ENV in the PC group were significantly lower than those in the NH group, but the F1 amplitudes and the S-R correlation of the FFR_TFS exhibited no significant differences; (2) under noise conditions, the H2 and H3 amplitudes and S-R correlation of FFR_ENV in the NH group significantly decreased compared with those under quiet conditions, but no similar alteration was observed in the PC group or for FFR_TFS; (3) the higher degree of hearing loss was correlated with greater changes in TFS morphology caused by noise; and (4) better SIN performance was closely related to higher FFR_ENV S-R correlation in the quiet condition and higher H2 (FFR_ENV) amplitude alteration in the noise condition.

Influence of Audibility on the Neural Representation of the Envelope and TFS in the Quiet Condition

In this study, the results of FFR testing under quiet conditions showed that for FFR_ENV, the F0 and H3 amplitudes of the PC group were significantly lower than those of the NH group, suggesting that patients with reduced audibility had a decreased subcortical ability to encode the envelope. The PC group also exhibited lower response-stimulus correlations of FFR_ENV than the NH group, which suggested that hearing loss led to less accuracy of subcortical encoding for envelope cues. These findings suggested that the ability of subcortical encoding of the envelope decreased with hearing loss. Ananthakrishnan et al. (2016) also found that SNHL patients had a lower F0 amplitude of FFR_ENV than individuals with NH in response to stimuli of the same intensity, a finding that is consistent with the results of this study. However, Anderson et al. (2013a) showed that the F0, H1 and H2 amplitudes of the FFR_ENV of patients with hearing loss were higher than those of individuals with NH. This discrepancy is likely derived from the different hearing sensitivities of the subjects included in the studies. In the study by Anderson et al. (2013a), the average hearing threshold of the subjects with hearing loss was significantly lower, indicating better hearing, than that of the hearing-impaired subjects in this study and the study by Ananthakrishnan et al. (2016).

For TFS, the sum waveforms and their spectra showed that the F1 amplitudes of the PC group were lower than those of the NH group. These findings still suggest that under quiet conditions, hearing sensitivity exerts a significant influence on subcortical TFS encoding ability, although the multivariate ANOVA did not show significant amplitude or S-R correlation differences between the two groups. Anderson et al. (2013a,c) and Ananthakrishnan et al. (2016) also investigated TFS in SNHL patients. In one of their previous studies, Anderson et al. (2013a) did not find significant differences in subcortical encodings of TFS between the NH group and the PC group. However, another study implementing a larger group of participants revealed TFS deficits in the PC group (Anderson et al., 2013c), which is consistent with Ananthakrishnan et al.’s (2016) reports. Thus, the lack of statistically significant differences in our study was probably due to the relatively small sample size.

Influence of audibility on the Neural Representation of the Envelope and TFS in the Noise Condition

In this study, we also investigated the FFR under noise conditions. The results showed that in the NH group, the amplitudes of H2 and H3 (FFR_ENV) in the noise condition were significantly decreased compared with in the quiet condition, and the S-R correlation also decreased. However, no significant decrease was observed for TFS. These findings suggest that in noise conditions, the proportion of information extracted by the subcortical nuclei had changed, and the TFS proportion appeared to have increased compared with quiet conditions; in contrast, the envelope proportion decreased. Therefore, during the speech recognition process, subjects with NH were more likely to depend on TFS-related information under noise conditions than under quiet conditions. Bidelman (2016) investigated FFR using processed speech stimuli containing only ENV or TFS cues, and the neuro-acoustic and response-to-response correlations revealed that speech-FFRs were dominated by the stimulus ENV for clean speech, with TFS making a stronger contribution at moderate noise levels. Our results further supported their findings.

However, in the PC group, no significant amplitude or S-R correlation difference for FFR_ENV was found between the two conditions. Moreover, the scale of the amplitude change of H2 and H3 under the two conditions in the PC group was lower than that in the NH group, suggesting that the energy change of FFR_ENV in the PC group was significantly lower than that in the NH group. These findings suggest that the PC subjects were not able to downgrade the envelope information extraction in the noise condition like the NH subjects. The results of this study are consistent with a previous study investigating the neural encoding of the ENV in SNHL animals (Zhong et al., 2014), which showed that hearing loss is not associated with a stronger adverse effect of increasing masker intensity on ENV coding. In the PC group, no significant amplitude difference in FFR_TFS was observed between the quiet and noise-masking conditions, suggesting that since FFR_TFS was already degraded in quiet, no further degradation could be observed when the masking noise was introduced. Since the PTA thresholds were negatively correlated with the correlation values for the quiet-to-noise response analysis, the ability of encoding TFS signals in the PC group under the noise condition was weaker than that of the NH group. The results are also consistent with previous reports in SNHL patients. For example, both Buss et al. (2004) and Lorenzi et al. (2006) revealed a decreased ability to use TFS among SNHL patients. Taken together, the results regarding both FFR_ENV and FFR_TFS in the hearing loss group suggest that patients with impaired audibility cannot adjust the corresponding proportion of the envelope and TFS under noise conditions the way that individuals with NH are able to. The results of this study showed that in listeners with PC, reduced hearing sensitivity could lead to an imbalance of envelope-to-TFS coding under noise conditions, which may be one of the mechanisms underlying speech recognition disorder among listeners with PC. Anderson et al. (2013a) examined the FFR of listeners with PC under noise conditions, and their results were very similar to ours. They compared the amplitude differences between the envelope and TFS representations and found that the differences of the hearing-impaired group were significantly higher than those of the NH group in the noise condition but not significantly different from those of the NH group in the quiet condition, suggesting the presence of an imbalanced envelope-to-TFS representation, especially in noise (Anderson et al., 2013a). The imbalance between the envelope and TFS was also demonstrated in a perceptual study (Fogerty and Humes, 2012).

Note that for native speakers of Mandarin, the envelope information is important for representing lexical tone signals not only in NH listeners under noise conditions (Qi et al., 2017) but also in hearing-impaired listeners (Wang et al., 2011). In this study, the S-R correlation was used to analyze the accuracy of representing the envelope and TFS signals of the sound stimulus. The results showed that the S-R correlations in the PC group were significantly lower than those in the NH group for FFR_ENV but not for FFR_TFS. It is of interest to determine whether elevation of the PTA threshold would also lead to a decrease in the S-R correlation for FFR_ENV in speakers of English or other western languages.

FFR and SIN Performance

In this study, we investigated the correlation between SIN perception thresholds and variables of both FFR_ENV and FFR_TFS. The results showed that lower SIN perception thresholds (better performance) were significantly correlated with higher FFR_ENV S-R correlation in the quiet condition, indicating that deficits of subcortical coding of the envelope can affect SIN perception. Lower SIN perception thresholds were also found to correlate with higher H2 (FFR_ENV) amplitude alterations by noise, i.e., the better the SIN performance is, the greater the H2 amplitude difference between the quiet and noise conditions, suggesting that FFR_ENV plays a smaller relative role under noise conditions. This finding further confirms that the change in the envelope-to-TFS encoding ratio under noise conditions is a likely mechanism underlying the speech recognition disorder in noise conditions among listeners with PC. These findings also indicate that FFR may be a useful objective tool to predict SIN perception.

Anderson et al. (2013b) also investigated the correlation between SIN perceptions and speech ABR variables elicited by /da/. Both the self-reported SIN perception and the results of SIN tests were correlated with the speech ABR variables. In their study, a 40-ms /da/ stimulus, which did not include the steady-state part, was used, and variables were mainly from the temporal domain, which were not able to be regarded as a real FFR response. In the present study, we used a much longer stimulus, a 170-ms /da/ stimulus, which had a steady-state part of more than 100 ms, and we primarily analyzed frequency domain variables of the FFR, in addition to response-stimulus correlations. Our results further demonstrated that FFR may be a useful objective tool for predicting SIN perception.

Influence of Age on the Neural Representation of the Envelope and TFS

Although we did not focus on an analysis of the effect of age on FFR because the participants’ age distribution was continuous, we did analyze the correlations between age and the FFR variables. The results showed that the variability in age across participants was not significantly correlated with the variability in FFRs across participants, even though some previous studies have shown that the age factor affects FFRs (Anderson et al., 2012; Clinard and Tremblay, 2013; Bidelman et al., 2014). It is known that auditory aging is rooted in degenerative alterations in both the peripheral hearing organs (e.g., loss of hair cells) and the central auditory system (e.g., atrophy of the gray and white matter; for a recent review see Ouda et al., 2015). In the present study, older adult participants with PC exhibited a higher PTA threshold, which is related to hair cell dysfunction, than their age-control participants without PC. The results of the present study also demonstrated that the PTA threshold was associated with the noise-induced changes in the TFS component of the FFRs, which reflect the brainstem representations of sound TFS signals in both humans (Wang et al., 2018) and rats (Wang and Li, 2015, 2017, 2018; Luo et al., 2017).

Limitations

In this study, the FFR tests used the same test signals for the patients with reduced hearing sensitivity and the subjects with NH, i.e., signals with an intensity of 85 dB SPL were used for both groups. However, higher-intensity stimulus signals were used in previous studies of FFRs in SNHL patients. For example, Ananthakrishnan et al. (2016) simultaneously used stimuli with the same sound pressure level and the same sensation level that were used for subjects with NH, while Anderson et al. (2013a) used a stimulus that was modified using the National Acoustics Laboratory-Revised (NAL-R) algorithm according to each individual’s PTA threshold. However, increasing the intensity of the stimulus according to the subject’s hearing threshold does not completely eliminate the impact of hearing sensitivity itself on FFR because even if SNHL patients and individuals with NH are presented signals with the same sensation level, the loudness perceived by the two groups is most likely different due to the presence of recruitment in patients with SNHL. Furthermore, for subjects with a PTA threshold higher than a certain level, the equipment cannot produce stimuli of the same sensation level as that of the other subjects because of its maximum output limitations. In addition, one of the objectives of this study was to examine the effect of hearing sensitivity changes on the FFR, which required simultaneous tests in both noise and quiet conditions. Thus, due to time constraints, we chose to use stimuli with the same intensity and noise to monitor the FFR. In fact, this test condition is truer to the hearing environment in the daily lives of patients with hearing loss, i.e., hearing speech at the same intensity in the same noise background as individuals with NH.

In the present study, we mainly investigated the effect of audibility on subcortical encoding of noise signals in people with PC. Studies have shown that deficits in suprathreshold auditory processing are related to reductions in audibility in people with PC and may occur even without an aging-related elevation in the PTA threshold (Humes et al., 2012; Peelle and Wingfield, 2016). Clearly, further studies evaluating higher-order auditory-processing abilities are needed in the future to clarify the relationship between subcortical encoding and central hearing loss. Furthermore, phonological awareness is an individual’s awareness of the sound structure of words and studies in deaf children have demonstrated alterations of their phonological awareness (Johnson and Goswami, 2010). Thus, future studies investigating the relationship with phonological awareness may also be very informative.

Summary

In this study, the FFR of subjects with NH and PC was investigated under quiet and noise conditions. In the quiet condition, the amplitudes and S-R correlations of FFR_ENV were significantly higher in the NH group than in the PC group. The NH group showed a significantly lower amplitude and S-R correlation of FFR_ENV in the noise condition than in the quiet condition, but no similar alterations were observed in the PC group, suggesting that listeners with PC cannot adjust the envelope-to-TFS ratio in noise conditions the same way that individuals with NH are able to. This discrepancy is likely one of the reasons why listeners with PC experience decreased speech recognition under noise conditions. Furthermore, worse SIN performance was observed to have a close relationship with lower S-R correlation in the quiet condition and lower FFR_ENV amplitude alteration in the noise condition. These findings further supported that the change in the envelope-to-TFS encoding ratio under noise conditions is a likely mechanism underlying the speech recognition disorder in noise conditions among listeners with PC. In another aspect, these results also indicate that FFR may be a useful objective tool for predicting SIN perception.

Author Contributions

YS and LL conceived the research. YS, QW, ZG and DN planned the study design. WH and YQ performed the FFR recording. WH and QW analyzed the data. YS, QW and LL wrote the manuscript.

Funding

This research was supported by the Chinese National Natural Science Foundation (No. 81300830).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The reviewer MM and handling Editor declared their shared affiliation.

Acknowledgments

We would like to thank all the participants for giving generously of their time, patience and support.

Abbreviations

ABR, auditory brainstem response; ENV, envelope; FFR, frequency following response; NH, normal hearing; PC, presbycusis; PTA, pure tone audiometry; SIN, speech-in-noise; SNHL, sensorineural hearing loss; SNR, signal-to-noise ratio; TFS, temporal fine structure.

References

Aiken, S. J., and Picton, T. W. (2006). Envelope following responses to natural vowels. Audiol. Neurootol. 11, 213–232. doi: 10.1159/000092589

PubMed Abstract | CrossRef Full Text | Google Scholar

Aiken, S. J., and Picton, T. W. (2008). Envelope and spectral frequency-following responses to vowel sounds. Hear. Res. 245, 35–47. doi: 10.1016/j.heares.2008.08.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Ananthakrishnan, S., Krishnan, A., and Bartlett, E. (2016). Human frequency following response: neural representation of envelope and temporal fine structure in listeners with normal hearing and sensorineural hearing loss. Ear Hear. 37, e91–e103. doi: 10.1097/aud.0000000000000247

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, S., Parbery-Clark, A., White-Schwoch, T., and Kraus, N. (2012). Aging affects neural precision of speech encoding. J. Neurosci. 32, 14156–14164. doi: 10.1523/jneurosci.2176-12.2012

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, S., Parbery-Clark, A., White-Schwoch, T., Drehobl, S., and Kraus, N. (2013a). Effects of hearing loss on the subcortical representation of speech cues. J. Acoust. Soc. Am. 133, 3030–3038. doi: 10.1121/1.4799804

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, S., Parbery-Clark, A., White-Schwoch, T., and Kraus, N. (2013b). Auditory brainstem response to complex sounds predicts self-reported speech-in-noise performance. J. Speech Lang. Hear. Res. 56, 31–43. doi: 10.1044/1092-4388(2012/12-0043)

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, S., White-Schwoch, T., Choi, H. J., and Kraus, N. (2013c). Training changes processing of speech cues in older adults with hearing loss. Front. Syst. Neurosci. 7:97. doi: 10.3389/fnsys.2013.00097

PubMed Abstract | CrossRef Full Text | Google Scholar

Anderson, S., Parbery-Clark, A., Yi, H. G., and Kraus, N. (2011). A neural basis of speech-in-noise perception in older adults. Ear Hear. 32, 750–757. doi: 10.1097/aud.0b013e31822229d3

PubMed Abstract | CrossRef Full Text | Google Scholar

Bidelman, G. M. (2015). Multichannel recordings of the human brainstem frequency-following response: scalp topography, source generators and distinctions from the transient ABR. Hear. Res. 323, 68–80. doi: 10.1016/j.heares.2015.01.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Bidelman, G. M. (2016). Relative contribution of envelope and fine structure to the subcortical encoding of noise-degraded speech. J. Acoust. Soc. Am. 140:EL358. doi: 10.1121/1.4965248

PubMed Abstract | CrossRef Full Text | Google Scholar

Bidelman, G. M., Villafuerte, J. W., Moreno, S., and Alain, C. (2014). Age-related changes in the subcortical-cortical encoding and categorical perception of speech. Neurobiol. Aging 35, 2526–2540. doi: 10.1016/j.neurobiolaging.2014.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Buss, E., Hall, J. W., and Grose, J. H. (2004). Temporal fine-structure cues to speech and pure tone modulation in observers with sensorineural hearing loss. Ear Hear. 25, 242–250. doi: 10.1097/01.aud.0000130796.73809.09

PubMed Abstract | CrossRef Full Text | Google Scholar

Chandrasekaran, B., and Kraus, N. (2010). The scalp-recorded brainstem response to speech: neural origins and plasticity. Psychophysiology 47, 236–246. doi: 10.1111/j.1469-8986.2009.00928.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Clinard, C. G., and Tremblay, K. L. (2013). Aging degrades the neural encoding of simple and complex sounds in the human brainstem. J. Am. Acad. Audiol. 24, 590–599. doi: 10.3766/jaaa.24.7.7

PubMed Abstract | CrossRef Full Text | Google Scholar

Deng, X. S., Ji, F., and Yang, S. M. (2014). Correlation between maximum phonetically balanced word recognition score and pure-tone auditory threshold in elder presbycusis patients over 80 years old. Acta Otolaryngol. 134, 168–172. doi: 10.3109/00016489.2013.844855

PubMed Abstract | CrossRef Full Text | Google Scholar

Divenyi, P. L., Stark, P. B., and Haupt, K. M. (2005). Decline of speech understanding and auditory thresholds in the elderly. J. Acoust. Soc. Am. 118, 1089–1100. doi: 10.1121/1.1953207

PubMed Abstract | CrossRef Full Text | Google Scholar

Dolphin, W. F., and Mountain, D. C. (1992). The envelope following response: scalp potentials elicited in the Mongolian gerbil using sinusoidally AM acoustic signals. Hear. Res. 58, 70–78. doi: 10.1016/0378-5955(92)90010-k

PubMed Abstract | CrossRef Full Text | Google Scholar

Dolphin, W. F., and Mountain, D. C. (1993). The envelope following response (EFR) in the Mongolian gerbil to sinusoidally amplitude-modulated signals in the presence of simultaneously gated pure tones. J. Acoust. Soc. Am. 94, 3215–3226. doi: 10.1121/1.407227

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y., Kong, L., Wang, Q., Wu, X., and Li, L. (2011). Auditory frequency-following response: a neurophysiological measure for studying the “cocktail-party problem”. Neurosci. Biobehav. Rev. 35, 2046–2057. doi: 10.1016/j.neubiorev.2011.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Y., Ma, T., Wang, Q., Wu, X., and Li, L. (2009). Two crossed axonal projections contribute to binaural unmasking of frequency-following responses in rat inferior colliculus. Eur. J. Neurosci. 30, 1779–1789. doi: 10.1111/j.1460-9568.2009.06947.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Fogerty, D., and Humes, L. E. (2012). A correlational method to concurrently measure envelope and temporal fine structure weights: effects of age, cochlear pathology, and spectral shaping. J. Acoust. Soc. Am. 132, 1679–1689. doi: 10.1121/1.4742716

PubMed Abstract | CrossRef Full Text | Google Scholar

Füllgrabe, C. (2013). Age-dependent changes in temporal-fine-structure processing in the absence of peripheral hearing loss. Am. J. Audiol. 22, 313–315. doi: 10.1044/1059-0889(2013/12-0070)

PubMed Abstract | CrossRef Full Text | Google Scholar

Füllgrabe, C., Meyer, B., and Lorenzi, C. (2003). Effect of cochlear damage on the detection of complex temporal envelopes. Hear. Res. 178, 35–43. doi: 10.1016/s0378-5955(03)00027-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Füllgrabe, C., Moore, B. C., and Stone, M. A. (2015). Age-group differences in speech identification despite matched audiometrically normal hearing: contributions from auditory temporal processing and cognition. Front. Aging Neurosci. 6:347. doi: 10.3389/fnagi.2014.00347

PubMed Abstract | CrossRef Full Text | Google Scholar

Galbraith, G. C. (1994). Two-channel brain-stem frequency-following responses to pure tone and missing fundamental stimuli. Electroencephalogr. Clin. Neurophysiol. 92, 321–330. doi: 10.1016/0013-4694(94)00072-s

PubMed Abstract | CrossRef Full Text | Google Scholar

Gifford, R. H., Bacon, S. P., and Williams, E. J. (2007). An examination of speech recognition in a modulated background and of forward masking in younger and older listeners. J. Speech Lang. Hear. Res. 50, 857–864. doi: 10.1044/1092-4388(2007/060)

PubMed Abstract | CrossRef Full Text | Google Scholar

Giroud, N., Hirsiger, S., Muri, R., Kegel, A., Dillier, N., and Meyer, M. (2018). Neuroanatomical and resting state EEG power correlates of central hearing loss in older adults. Brain Struct. Funct. 223, 145–163. doi: 10.1007/s00429-017-1477-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Hall, J. W. III. (1979). Auditory brainstem frequency following responses to waveform envelope periodicity. Science 205, 1297–1299. doi: 10.1126/science.472748

PubMed Abstract | CrossRef Full Text | Google Scholar

Hopkins, K., and Moore, B. C. (2007). Moderate cochlear hearing loss leads to a reduced ability to use temporal fine structure information. J. Acoust. Soc. Am. 122, 1055–1068. doi: 10.1121/1.2749457

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, Y., Huang, Q., Chen, X., Qu, T., Wu, X., and Li, L. (2008). Perceptual integration between target speech and target-speech reflection reduces masking for target-speech recognition in younger adults and older adults. Hear. Res. 244, 51–65. doi: 10.1016/j.heares.2008.07.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Humes, L. E., Dubno, J. R., Gordon-Salant, S., Lister, J. J., Cacace, A. T., Cruickshanks, K. J., et al. (2012). Central presbycusis: a review and evaluation of the evidence. J. Am. Acad. Audiol. 23, 635–666. doi: 10.3766/jaaa.23.8.5

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, C., and Goswami, U. (2010). Phonological awareness, vocabulary and reading in deaf children with cochlear implants. J. Speech Lang. Hear. Res. 53, 237–261. doi: 10.1044/1092-4388(2009/08-0139)

PubMed Abstract | CrossRef Full Text | Google Scholar

Krishnan, A. (2002). Human frequency-following responses: representation of steady-state synthetic vowels. Hear. Res. 166, 192–201. doi: 10.1016/s0378-5955(02)00327-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Krishnan, A., and Gandour, J. T. (2009). The role of the auditory brainstem in processing linguistically-relevant pitch patterns. Brain Lang. 110, 135–148. doi: 10.1016/j.bandl.2009.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Krishnan, A., Gandour, J. T., and Bidelman, G. M. (2010a). The effects of tone language experience on pitch processing in the brainstem. J. Neurolinguistics 23, 81–95. doi: 10.1016/j.jneuroling.2009.09.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Krishnan, A., Gandour, J. T., Smalt, C. J., and Bidelman, G. M. (2010b). Language-dependent pitch encoding advantage in the brainstem is not limited to acceleration rates that occur in natural speech. Brain Lang. 114, 193–198. doi: 10.1016/j.bandl.2010.05.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, L., Daneman, M., Qi, J. G., and Schneider, B. A. (2004). Does the information content of an irrelevant source differentially affect spoken word recognition in younger and older adults? J. Exp. Psychol. Hum. Percept. Perform. 30, 1077–1091. doi: 10.1037/0096-1523.30.6.1077

PubMed Abstract | CrossRef Full Text | Google Scholar

Lorenzi, C., Gilbert, G., Carn, H., Garnier, S., and Moore, B. C. (2006). Speech perception problems of the hearing impaired reflect inability to use temporal fine structure. Proc. Natl. Acad. Sci. U S A 103, 18866–18869. doi: 10.1073/pnas.0607364103

PubMed Abstract | CrossRef Full Text | Google Scholar

Lorenzi, C., Wallaert, N., Gnansia, D., Leger, A. C., Ives, D. T., Chays, A., et al. (2012). Temporal-envelope reconstruction for hearing-impaired listeners. J. Assoc. Res. Otolaryngol. 13, 853–865. doi: 10.1007/s10162-012-0350-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Luo, L., Wang, Q., and Li, L. (2017). Neural representations of concurrent sounds with overlapping spectra in rat inferior colliculus: comparisons between temporal-fine structure and envelope. Hear. Res. 353, 87–96. doi: 10.1016/j.heares.2017.06.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Mamo, S. K., Grose, J. H., and Buss, E. (2016). Speech-evoked ABR: effects of age and simulated neural temporal jitter. Hear. Res. 333, 201–209. doi: 10.1016/j.heares.2015.09.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Marsh, J. T., Brown, W. S., and Smith, J. C. (1974). Differential brainstem pathways for the conduction of auditory frequency-following responses. Electroencephalogr. Clin. Neurophysiol. 36, 415–424. doi: 10.1016/0013-4694(74)90192-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Moore, B. C. (2008). The role of temporal fine structure processing in pitch perception, masking and speech perception for normal-hearing and hearing-impaired people. J. Assoc. Res. Otolaryngol. 9, 399–406. doi: 10.1007/s10162-008-0143-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Moore, B. C., Vickers, D. A., and Mehta, A. (2012). The effects of age on temporal fine structure sensitivity in monaural and binaural conditions. Int. J. Audiol. 51, 715–721. doi: 10.3109/14992027.2012.690079

PubMed Abstract | CrossRef Full Text | Google Scholar

Moushegian, G., Rupert, A. L., and Stillman, R. D. (1973). Laboratory note. Scalp-recorded early responses in man to frequencies in the speech range. Electroencephalogr. Clin. Neurophysiol. 35, 665–667. doi: 10.1016/0013-4694(73)90223-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Nilsson, M., Soli, S. D., and Sullivan, J. A. (1994). Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise. J. Acoust. Soc. Am. 95, 1085–1099. doi: 10.1121/1.408469

PubMed Abstract | CrossRef Full Text | Google Scholar

Ouda, L., Profant, O., Syka, J. J. C., and Research, T. (2015). Age-related changes in the central auditory system. Cell Tissue Res. 361, 337–358. doi: 10.1007/s00441-014-2107-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Peelle, J. E., and Wingfield, A. (2016). The neural consequences of age-related hearing loss. Trends Neurosci. 39, 486–497. doi: 10.1016/j.tins.2016.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Ping, J., Li, N., Galbraith, G. C., Wu, X., and Li, L. (2008). Auditory frequency-following responses in rat ipsilateral inferior colliculus. Neuroreport 19, 1377–1380. doi: 10.1097/WNR.0b013e32830c1cfa

PubMed Abstract | CrossRef Full Text | Google Scholar

Pross, S. E., Chang, J. L., Mizuiri, D., Findlay, A. M., Nagarajan, S. S., Cheung, S. W. J. O., et al. (2015). Temporal cortical plasticity in single-sided deafness: a functional imaging study. Otol. Neurotol. 36, 1443–1449. doi: 10.1097/MAO.0000000000000821

PubMed Abstract | CrossRef Full Text | Google Scholar

Qi, B., Mao, Y., Liu, J., Liu, B., and Xu, L. (2017). Relative contributions of acoustic temporal fine structure and envelope cues for lexical tone perception in noise. J. Acoust. Soc. Am. 141:3022. doi: 10.1121/1.4982247

PubMed Abstract | CrossRef Full Text | Google Scholar

Rufener, K. S., Oechslin, M. S., Wöstmann, M., Dellwo, V., and Meyer, M. (2016). Age-related neural oscillation patterns during the processing of temporally manipulated speech. Brain Topogr. 29, 440–458. doi: 10.1007/s10548-015-0464-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Russo, N., Nicol, T., Musacchia, G., and Kraus, N. (2004). Brainstem responses to speech syllables. Clin. Neurophysiol. 115, 2021–2030. doi: 10.1016/j.clinph.2004.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Salonen, J., Johansson, R., Karjalainen, S., Vahlberg, T., Jero, J. P., and Isoaho, R. (2013). Hearing aid compliance in the elderly. B-ENT 9, 23–28.

PubMed Abstract | Google Scholar

Shinn-Cunningham, B., Ruggles, D. R., and Bharadwaj, H. (2013). How early aging and environment interact in everyday listening: from brainstem to behavior through modeling. Adv. Exp. Med. Biol. 787, 501–510. doi: 10.1007/978-1-4614-1590-9_55

PubMed Abstract | CrossRef Full Text | Google Scholar

Skoe, E., and Kraus, N. (2010). Auditory brain stem response to complex sounds: a tutorial. Ear. Hear. 31, 302–324. doi: 10.1097/AUD.0b013e3181cdb272

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, J. C., Marsh, J. T., and Brown, W. S. (1975). Far-field recorded frequency-following responses: evidence for the locus of brainstem sources. Electroencephalogr. Clin. Neurophysiol. 39, 465–472. doi: 10.1016/0013-4694(75)90047-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Sohmer, H., Pratt, H., and Kinarti, R. (1977). Sources of frequency following responses (FFR) in man. Electroencephalogr. Clin. Neurophysiol. 42, 656–664. doi: 10.1016/0013-4694(77)90282-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Souza, P. E., and Boike, K. T. (2006). Combining temporal-envelope cues across channels: effects of age and hearing loss. J. Speech Lang. Hear. Res. 49, 138–149. doi: 10.1044/1092-4388(2006/011)

PubMed Abstract | CrossRef Full Text | Google Scholar

Supin, A. Y., and Popov, V. V. (1995). Envelope-following response and modulation transfer function in the dolphin’s auditory system. Hear. Res. 92, 38–46. doi: 10.1016/0378-5955(95)00194-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Q., and Li, L. (2015). Auditory midbrain representation of a break in interaural correlation. J. Neurophysiol. 114, 2258–2264. doi: 10.1152/jn.00645.2015

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Q., and Li, L. (2017). Modelling envelope and temporal fine structure components of frequency-following responses in rat inferior colliculus. Sci. China Techn. Sci. 60, 966–973. doi: 10.1007/s11431-016-9044-5

CrossRef Full Text | Google Scholar

Wang, Q., and Li, L. (2018). Differences between auditory frequency-following responses and onset responses: intracranial evidence from rat inferior colliculus. Hear. Res. 357, 25–32. doi: 10.1016/j.heares.2017.10.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Q., Lu, H., Wu, Z., and Li, L. (2018). Neural representation of interaural correlation in human auditory brainstem: comparisons between temporal-fine structure and envelope. Hear. Res. 365, 165–173. doi: 10.1016/j.heares.2018.05.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, S., Xu, L., and Mannell, R. (2011). Relative contributions of temporal envelope and fine structure cues to lexical tone recognition in hearing-impaired listeners. J. Assoc. Res. Otolaryngol. 12, 783–794. doi: 10.1007/s10162-011-0285-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Weinberger, N. M., Kitzes, L. M., and Goodman, D. A. (1970). Some characteristics of the “auditory neurophonic”. Experientia 26, 46–48. doi: 10.1007/bf01900383

PubMed Abstract | CrossRef Full Text | Google Scholar

Worden, F. G., and Marsh, J. T. (1968). Frequency-following (microphonic-like) neural responses evoked by sound. Electroencephalogr. Clin. Neurophysiol. 25, 42–52. doi: 10.1016/0013-4694(68)90085-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Yueh, B., Shapiro, N., MacLean, C. H., and Shekelle, P. G. (2003). Screening and management of adult hearing loss in primary care: scientific review. JAMA 289, 1976–1985. doi: 10.1001/jama.289.15.1976

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhong, Z., Henry, K. S., and Heinz, M. G. (2014). Sensorineural hearing loss amplifies neural coding of envelope information in the central auditory system of chinchillas. Hear. Res. 309, 55–62. doi: 10.1016/j.heares.2013.11.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, L., Bharadwaj, H., Xia, J., and Shinn-Cunningham, B. (2013). A comparison of spectral magnitude and phase-locking value analyses of the frequency-following response to complex tones. J. Acoust. Soc. Am. 134, 384–395. doi: 10.1121/1.4807498

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: frequency following response, presbycusis, auditory aging, auditory brainstem response, speech recognition

Citation: Hao W, Wang Q, Li L, Qiao Y, Gao Z, Ni D and Shang Y (2018) Effects of Phase-Locking Deficits on Speech Recognition in Older Adults With Presbycusis. Front. Aging Neurosci. 10:397. doi: 10.3389/fnagi.2018.00397

Received: 04 June 2018; Accepted: 19 November 2018;
Published: 06 December 2018.

Edited by:

Tobias Kleinjung, University of Zurich, Switzerland

Reviewed by:

Martin Meyer, University of Zurich, Switzerland
Samira Anderson, University of Maryland, College Park, United States

Copyright © 2018 Hao, Wang, Li, Qiao, Gao, Ni and Shang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yingying Shang, eXlpbmdzaGFuZ0BhbGl5dW4uY29t

^† These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.