Original Research ARTICLE
The Effects of Fluency Enhancing Conditions on Sensorimotor Control of Speech in Typically Fluent Speakers: An EEG Mu Rhythm Study
- 1Department of Audiology and Speech Pathology, The University of Tennessee Health Science Center, Knoxville, TN, United States
- 2Department of Communication Disorders, The University of Arkansas, Fayetteville, AR, United States
Objective: To determine whether changes in sensorimotor control resulting from speaking conditions that induce fluency in people who stutter (PWS) can be measured using electroencephalographic (EEG) mu rhythms in neurotypical speakers.
Methods: Non-stuttering (NS) adults spoke in one control condition (solo speaking) and four experimental conditions (choral speech, delayed auditory feedback (DAF), prolonged speech and pseudostuttering). Independent component analysis (ICA) was used to identify sensorimotor μ components from EEG recordings. Time-frequency analyses measured μ-alpha (8–13 Hz) and μ-beta (15–25 Hz) event-related synchronization (ERS) and desynchronization (ERD) during each speech condition.
Results: 19/24 participants contributed μ components. Relative to the control condition, the choral and DAF conditions elicited increases in μ-alpha ERD in the right hemisphere. In the pseudostuttering condition, increases in μ-beta ERD were observed in the left hemisphere. No differences were present between the prolonged speech and control conditions.
Conclusions: Differences observed in the experimental conditions are thought to reflect sensorimotor control changes. Increases in right hemisphere μ-alpha ERD likely reflect increased reliance on auditory information, including auditory feedback, during the choral and DAF conditions. In the left hemisphere, increases in μ-beta ERD during pseudostuttering may have resulted from the different movement characteristics of this task compared with the solo speaking task. Relationships to findings in stuttering are discussed.
Significance: Changes in sensorimotor control related feedforward and feedback control in fluency-enhancing speech manipulations can be measured using time-frequency decompositions of EEG μ rhythms in neurotypical speakers. This quiet, non-invasive, and temporally sensitive technique may be applied to learn more about normal sensorimotor control and fluency enhancement in PWS.
Sensorimotor control for speech production is achieved via the integration of feedback and feedforward control mechanisms (Houde and Jordan, 1998; Jones and Munhall, 2005; Purcell and Munhall, 2006; Bauer et al., 2007). Feedforward control is associated with the activation of speech motor programs in ventral premotor areas in the left frontal lobe. Projections from the premotor area to bilateral primary motor cortex encode motor commands for producing the sound or syllable of the active motor program. Activation of a motor program also engages feedback controllers for speech via projections to auditory and somatosensory association areas. These projections encode forward models that transform the current motor commands into the desired or expected sensory outcomes. Comparisons between desired and actual feedback allow for monitoring the accuracy of speech output. Discrepancies between these signals generate an error signal, the magnitude and direction of which are mapped onto corrective motor commands via projections from auditory and somatosensory areas to frontal speech motor areas. The projections encode a kind of inverse model that transforms the sensory error into a corrective motor response. While feedforward control for speech is primarily a left hemisphere dominant function (Ghosh et al., 2008; Peeva et al., 2010), feedback control for speech exhibits a right hemisphere dominance (Riecker et al., 2000; Toyomura et al., 2007; Tourville et al., 2008; Golfinopoulos et al., 2011; Tourville and Guenther, 2011; Niziolek and Guenther, 2013). A similar hemispheric differentiation between feedforward and feedback control has been identified for limb movements (Grafton et al., 2008).
These accounts of feedforward and feedback control in healthy speakers also have provided insight into sensorimotor speech disorders. For example, stuttering is associated with deficits in both inverse (Civier et al., 2010; Cai et al., 2012) and forward models (Daliri et al., 2014; Daliri and Max, 2015). For example, the findings of several studies indicate that stuttering is related to impaired feedforward control, related to either readout of feedforward motor programs (Civier et al., 2013) or inverse mapping of auditory states onto motor commands (Cai et al., 2012). Deficits in feedforward control in stuttering are consistent with the finding of speech-related hypoactivation in left premotor regions (Watkins et al., 2008; Chang et al., 2009; Kell et al., 2009; Loucks et al., 2011; Toyomura et al., 2011) and, as a consequence, give rise to an over-reliance on auditory feedback as indicated by the increased right hemisphere activity during speech in persons who stutter (PWS; Fox et al., 1996, 2000; Braun et al., 1997; De Nil et al., 2000; Ingham, 2001; Brown et al., 2005; Sakai et al., 2009). In addition, other studies have reported evidence of deficits in forward, motor-to-sensory transformations (Daliri et al., 2014; Daliri and Max, 2015) that are critical to error detection and correction in feedback control of speech (Tourville et al., 2008; Tourville and Guenther, 2011).
Models of intact sensorimotor control provide important tools for advancing understanding of speech disorders. Similarly, the patterns of neural activity associated with these disorders have the potential to inform models of speech sensorimotor control (Kent et al., 2000). An aspect of stuttering with implications for the neural control of fluent speech involves techniques that enhance speech fluency in PWS by altering sensorimotor control. Fluency enhancing techniques, or conditions, may be classified as exogenous or endogenous (Kalinowski and Saltuklaroglu, 2003). Exogenous conditions enhance fluency by means of an external (typically auditory) sensory signal, while endogenous conditions operate by exerting volitional changes in speech motor control. Choral speech (speaking in unison) and delayed auditory feedback (DAF) are two exogenous conditions that have been shown to induce fluency in PWS. In choral speech, fluency enhancement is achieved relatively effortlessly and requires no training. Stuttering is immediately reduced by 90%–100% (Cherry and Sayers, 1956; Andrews et al., 1982). DAF manipulates auditory feedback by playing the speaker’s voice back to them with an exogenously derived temporal delay. It has been shown to reduce the frequency of stuttering behaviors in PWS by 50%–80% (Goldiamond, 1965; Andrews et al., 1983; Kalinowski et al., 1993). Though highly effective, one difficulty that typically cannot be overcome by exogenous fluency enhancing signals is the preponderance of stuttering during speech initiation (Brown, 1938; Saltuklaroglu et al., 2009). Initiation is thought to be particularly difficult in PWS because weak forward models instantiated prior to receiving auditory feedback are more susceptible to error detection (Guenther et al., 2006). Saltuklaroglu et al. (2009) attempted to compensate for the initiation difficulties by introducing an exogenous carrier phrase prior to choral speech. They reported an overall 98% stuttering reduction compared to a control condition. Importantly, almost no stuttering was observed during initiation, suggesting that the additional exogenous feedback helped overcome initiation difficulties, possibly because the carrier phrase primed the sensorimotor system to initiate speech (Saltuklaroglu et al., 2009).
Endogenous fluency enhancing conditions include prolonged speech and pseudostuttering. Prolonged speech is a commonly used and effective therapeutic technique (O’Brian et al., 2003) that requires “stretching out” voiced phonemes, which consequently can reduce speech rates to as low as 30–60 syllables per minute (Ingham and Andrews, 1973). Pseudostuttering is an adaptation of an early technique called “bouncing” (Johnson, 1959). It is an endogenous therapeutic technique requiring multiple volitional repetitions of the first syllable in an utterance. For example, the phrase “my name is….” would begin with “Muh muh muh my…” It has been found to increase fluency in the ensuing utterance (Saltuklaroglu et al., 2004). Though it is not clear how this technique enhances fluency, it may be likened to an endogenous form of “shadow speech”. Shadow speech is an exogenous condition requiring a second speaker that possesses powerful fluency enhancing effects similar to those of choral speech (Cherry, 1953; Andrews et al., 1982). However, rather than speaking in unison, shadowing requires a PWS to immediately imitate what the speaker said.
Changes in frontal sensorimotor activity resulting from the use of both exogenous and endogenous techniques have been measured in both PWS and non-stuttering (NS) cohorts. In PWS, choral speech has been found in some studies to “normalize” neural function by reducing right hemisphere frontal activity (Fox et al., 1996, 2000). In contrast, choral speech in NS has not been found to produce remarkable changes in frontal activity (Toyomura et al., 2011). DAF has been found to induce increases in right frontal activity in both PWS and NS (Watkins et al., 2008; Sakai et al., 2009; Toyomura et al., 2011), suggesting an increased contribution from feedback control circuits. The similar effect of DAF on neural activity in PWS and NS contrasts somewhat with the differential effects of choral speech on PWS and NS and with earlier studies of DAF reporting paradoxical effects in PWS and NS. Specifically, although DAF is fluency enhancing in PWS, it has been found to induce disfluency in NS speakers, especially when longer delays are introduced (i.e., 200 ms; Stuart et al., 2002; Corey and Cuddapah, 2008). Thus, further investigation of changes in sensorimotor control via exogenous signals is warranted.
The effects of endogenous fluency enhancement on neural activity also have been investigated. De Nil et al. (2008) demonstrated that both prolonged speech and pseudostuttering increase right hemisphere activity in PWS, but produce non-significant changes in neural activity in NS. However, following completion of therapy using prolonged speech, a reduction in right hemisphere activity also has been observed in PWS (De Nil et al., 2003; Neumann et al., 2005). Thus, the immediate effects of prolonged speech may be explained by increased time allowed for speech processing (van Lieshout et al., 1996; Max et al., 2004). Moreover, long-term fluency-enhancing effects might reflect more normalized feedforward control. Considering the differences in the short vs. long-term effects of endogenous techniques in PWS and the differences between the PWS and NS, changes in sensorimotor control related to endogenous speech manipulations also merit further investigation.
Previous findings indicate that fluency-enhancing conditions alter aspects of sensorimotor control in PWS related to the control deficits during stuttering. Moreover, these conditions also elicit neural changes in NS that are often distinct from those observed in PWS. Many fluency-enhancing effects on sensorimotor processing in NS are difficult to explain with existing models of speech motor control and, as such, their causal mechanism and incorporation into existing models of speech production are important areas of investigation.
To summarize, few studies have investigated how these fluency-enhancing conditions alter sensorimotor control in NS populations. Understanding the neural mechanisms underlying these conditions from the perspective of normal motor control will inform current theoretical models and allow for critical comparisons to data from PWS. As such, the present study evaluated neural sensorimotor processing during endogenous and exogenous fluency-enhancing condition in NS. One way to measure differences in sensorimotor control in fluency enhancing conditions is to use temporally sensitive measures of changes in the electroencephalographic (EEG) mu (μ) rhythm power. EEG measures of μ rhythm activity have traditionally been obtained from alpha band (8–13 Hz) activity over the sensorimotor cortex (for a review see Fox et al., 2016). However, since 1989 (Tiihonen et al., 1989) magnetoencephalographic (MEG) studies have revealed unified mu rhythms, with both alpha and beta (15–25 Hz) spectral peaks, that can be localized to a single dipole emanating from PMC/primary motor regions (Hari and Salmelin, 1997; Pfurtscheller et al., 1997; Crone et al., 1998; Szurhaj et al., 2003; Jones et al., 2009).
Similar mu rhythms are now identifiable using independent component analyses (ICA) of raw EEG data (Wang and Jung, 2012; Bowers et al., 2013; Jenson et al., 2014; Cuellar et al., 2016). ICA is particularly useful when applied to EEG data from movement studies. Its ability to blindly separate sources of oscillatory activity allows sensorimotor or other neural activity to be temporally referenced to muscle movements using time-frequency decompositions (Jenson et al., 2014). While oscillatory activity from alpha and beta bands of the mu rhythm (henceforth μ-alpha and μ-beta) are often strongly correlated (Carlqvist et al., 2005; de Lange et al., 2008), they have also been known to dissociate (Brinkman et al., 2014; Jenson et al., 2014) indicating that they reflect distinct yet related sensorimotor functions. In movement tasks, μ-beta suppression is typically linked to motor activity and μ-alpha suppression to a somatosensory response (Hari, 2006).
A characteristic pattern of power suppression is observed in both frequency bands as a muscle begins to contract. Suppression in both alpha and beta bands grows stronger during movement (Pfurtscheller and Neuper, 1992). μ-beta suppression is relatively stable during movement, and is independent of muscle force (Kilavik et al., 2013). In addition, beta suppression is observed prior to movement and beta enhancement (rebound) is observed after a movement (Kilavik et al., 2013). Therefore, changes in μ-beta power are thought to be related to contributions from sensorimotor forward models (Tan et al., 2014; Moisello et al., 2015; Mersov et al., 2016). Similarly, in addition to providing a primary somatosensory response, μ-alpha activity is sensitive to changes in somatosensory (Cheyne et al., 2003), auditory (Tamura et al., 2012), and visual (de Graaf et al., 2013) feedback, perhaps encoding inverse models (Sebastiani et al., 2014). Exploiting the temporal resolution of EEG, time-frequency changes are measured via event-related spectral perturbations (ERSP), which reveal patterns of event-related synchronization (ERS) or deysnchronization (ERD; Makeig, 1993; Pfurtscheller and Lopes da Silva, 1999). ERS and ERD are suggestive of neural inhibition and excitation, respectively. Together, the response properties of μ-alpha and μ-beta strongly suggest that the μ rhythm oscillations can index sensorimotor control across movement tasks, including speech production (Jenson et al., 2014, 2015).
Considering that all of the fluency enhancing conditions discussed alter the way speech is perceived and/or produced, alterations in sensorimotor dynamics are likely to be revealed in μ-alpha and μ-beta oscillations across speaking events. Thus, the goal of the current study is to better understand how exogenous and endogenous speech manipulations alter sensorimotor control in NS. Providing temporally sensitive measures will enhance understanding of how sensorimotor control can be manipulated and data may serve as a basis for future comparisons to PWS under similar conditions. To accomplish the goal, ICA and ERSP are used to identify and temporally decompose μ rhythms from raw EEG data and isolate electromyography (EMG) activity to be used for temporal alignment of neural and muscular data. It is hypothesized that, compared to a control condition, experimental conditions will produce altered μ-alpha and μ-beta desynchronization throughout the time course of speaking events, reflecting changes to sensorimotor control as a result of different speaking conditions. Since speech behaviors during exogenous conditions are cued by external auditory signals, it is predicted that these conditions will most strongly increase reliance on auditory feedback control and consequently, right hemisphere activity (Tourville and Guenther, 2011). Endogenous conditions might be expected to strengthen left hemisphere activity especially in the beta band due to their purported influences on forward modeling. Additionally, they also might increase right hemisphere activity because they involve speaking in a less natural and somewhat novel manner that requires auditory monitoring of the output. However, it is not clear if endogenous manipulations are sufficient to elicit significant sensorimotor control changes in NS speakers.
Materials and Methods
Twenty-six adult native English speakers (23 right-handed) were recruited from the University of Tennessee and surrounding Knoxville area. Participants had a mean age of 26 (range 17–46) and reported no history of cognitive, communicative, or attentional disorders. 18/26 participants were males (8 females) and 23/26 identified as being of Caucasian descent. Table 1 shows a breakdown of all subjects and their contributions to μ clusters. Handedness dominance was assessed with the Edinburg Handedness Inventory (Oldfield, 1971). This study was carried out in accordance with the recommendations of University of Tennessee Health Science Center Institutional Review Board with written informed consent from all subjects. All subjects gave written informed consent in accordance with the Declaration of Helsinki. The protocol was approved by the University of Tennessee Health Sciences Center Institutional Review Board.
Visual stimulus tokens consisting of 9–12 syllable sentence fragments (mean = 10 syllables) were derived from 6th grade reading passages. Tokens were unique across conditions, with each token only being used once per participant. Example tokens include “When you look at mountains they seem permanent”, and “Friends on a cotton plantation near”, and “Had attended thirty seven different”. In the choral condition, synthetic speech recordings using analogs of a human male speaker were made using the Neospeech Text to Speech application. Audacity software version 2.1.0 (Team Audacity, 2015) was used to segment the generated sentences into individual tokens of the synthetic speech analogs for each trial. Using the same process, a carrier phrase, “Hello”, which was 1270 ms in duration, was inserted prior to each individual token.
A 5-condition within-subject design was employed. The first condition (solo) was used as a control task that provided a baseline of unassisted speaker fluency. The four experimental conditions are known to enhance fluency exogenously (conditions 2 and 3) and endogenously (conditions 4 and 5). Participants were required to:
1. Speak without exogenous or endogenous alteration (solo—control condition).
2. Speak in unison with a recording (choral).
3. Speak under DAF.
4. Speak while prolonging the vowels in each word (prolonged).
5. Speak while pseudostuttering (pseudo).
The timelines for stimulus presentation are shown in Figure 1. In all conditions, the entire epoch length was −4000 ms to 3000 ms. Baselines for ERSP analyses were taken from the 1000 ms period beginning at −4000 ms. The stimuli played for 2000 ms, with the cue to speak being 0 ms. At 2000 ms, an “X” appeared on the screen, which was the cue to stop speaking.
Figure 1. Timelines. Seven-thousand milliseconds epoch timelines for single trials in (A) choral and (B) all other experimental conditions.
There were slight differences in the timelines between the choral and the other speaking conditions. In all conditions except choral, the cue to speak was a visual stimulus presentation of the tokens. In the choral condition, the carrier phrase and the visual presentation of the tokens began at −1270 ms on the timeline. At 0 ms, the choral accompaniment was presented for 1000 ms. The choral accompaniment was matched to the visual stimuli. Participants then spoke without the choral accompaniment for 1000 ms. Therefore, in this condition, participants were able to read what they were about to say while they heard the carrier phrase and then say it with choral accompaniment.
Prior to data collection, the researcher trained each participant on what would be required in the speaking conditions. This included how to respond to the choral signal and to listen to the DAF. It also included training on the endogenous techniques. Participant training for endogenous conditions used reading passages not used as experiment tokens. The researchers required the participants to demonstrate effective use of slow prolongations, producing 30–60 syllables per minute (SPM) by prolonging the vowels of every word. They also demonstrated pseudostuttering by producing three syllable iterations of the initial syllable of a sentence in a controlled manner. Data collection began after participants were clear on the instructions and had demonstrated mastery of the endogenous conditions.
The experiment was conducted in an electronically and magnetically shielded, double-walled, sound-treated booth. Participants were seated in a comfortable reclining armchair with their heads and necks well supported. Compumedics Neuroscan Stim 2 version 4.3.3 software was used to present stimuli to participants via a PC. Visual stimuli appeared on a 69.5 × 39.0 cm display placed 132 cm in front of the reclining chair. All conditions were presented in two blocks of 40 trials each. The order of the 10 blocks (5 conditions × 2 blocks) was randomized for each participant.
In the solo (control) condition, participants were instructed to speak naturally using a normal rate. In the choral condition, participants were instructed to begin speaking as they heard the choral accompaniment immediately after the carrier phrase and continue to speak until the cue to stop. In the DAF conditions, they were instructed to speak while listening to their speech fed back to them with a 100 ms delay. 100 ms was chosen as it is in a range chosen by stuttering clients for promoting fluency (van Borsel et al., 2003), has been used in studies using short utterances (Saltuklaroglu et al., 2009), and was considered short enough not to disrupt speech in NS populations (Stuart et al., 2002). Auditory stimuli were delivered through insert headphones. DAF was generated using Audacity software on a windows laptop. For this condition only, participants were required to speak into a Rode NT1-A condenser microphone held at chest level. In both DAF and choral conditions, insert earphones were used to deliver the auditory signals. In the prolonged speech condition, participants spoke using vowel prolongations that induced a slow speech rate (30–60 SPM). In the pseudostuttering condition, participants produced three repetitions of the initial syllable prior to speaking the remainder of the token.
Whole-head EEG data were recorded from a 68 channel NeuroScan Quikcap based on the 10–20 extension (Chatrian et al., 1988) of the international standard system (Jasper, 1958). Neural channels were accompanied by two electrocardiogram (EKG) and two EMG channels. Recording channels were re-referenced to the linked mastoid channels (M1, M2) for common mode noise reduction. The electro-oculogram was captured by two pairs of recording electrodes placed above and below the left eye (VEOU, VEOL) as well as on the medial and lateral canthi of the left eye (HEOL, HEOR), which recorded vertical and horizontal eye movements, respectively. The two EMG electrodes were placed above and below the lips in order to capture speech related peri-labial muscle activity (Gracco, 1988).
EEG data were acquired using Compumedics NeuroScan 4.3.3 software coupled with the Synamps 2 system. Data were band pass filtered (0.15–100 Hz) prior to digitization by a 24 bit analog to digital converter with a sampling rate of 500 Hz. Data collection was referenced to the cue to initiate production thus, time zero represents the initiation cue.
All EEG data processing was performed in EEGLAB 13.5.4b (Brunner et al., 2013), an open source MATLAB toolbox. Data for each participant were processed at the individual level, then analyzed at both the individual and group levels. Steps performed at each stage are outlined below:
1. Preprocessing of 10 raw files for each subject (2 blocks × 5 conditions).
2. ICA of concatenated data files for each participant.
3. Localization estimation via fitting of equivalent current dipole (ECD) models for all neural and non-neural components.
1. Similar components across subjects clustered by Principal Component Analysis (PCA) on the basis of commonalities in spectra, scalp maps, and dipole location.
2. Visual inspection of clusters resulting from PCA to identify left and right μ clusters and validate cluster membership. Neighboring clusters were also examined to identify misallocated components.
3. Time-frequency decomposition of left and right μ clusters performed by ERSP analysis.
4. Mean ECD source computed for μ clusters by averaged localization sources of all components within the clusters.
Both raw data files for each condition (one per block) were appended to create a single file for each condition per subject. The data were then down sampled from 500 Hz to 256 Hz to reduce the computational demands of further processing steps. All EEG data were referenced to the mastoid channels (M1, M2) for common mode noise reduction. The data were band pass filtered twice (once at the beginning of the preprocessing and once as the final step) between 3 Hz and 34 Hz in order to capture frequency bands of interest and remove non-stereotypical noise. Six second trial epochs (ranging from −4000 ms to 2000 ms around time zero) were then extracted from the continuous data. The resulting data files for each subject, in all conditions, were visually inspected, and all epochs containing gross artifact (>200 μV) were removed from the data. A minimum of 40 usable trials per condition per participant was required for an effective ICA decomposition.
Independent Component Analysis (ICA)
Prior to ICA training, pre-processed EEG data for each participant were concatenated across all five conditions so that a single set of ICA weights could be obtained. This allowed for a comparison of activity to be made across conditions within spatially fixed ICs. An extended Infomax algorithm (Lee et al., 1999) was used to decorrelate the data matrix prior to ICA rotation. ICA training was provided using the “extended runica” algorithm in EEGLAB13.5.4b with an initial learning rate set to 0.001 and a stopping weight of 10–7. Following decomposition, 66 ICs were yielded for each participant reflecting the total number of recording electrodes (68–2 reference electrodes, M1 and M2). Scalp maps for each IC were obtained by projecting the inverse weight matrix (W−1) back onto the spatial EEG channel configuration.
Following ICA decomposition, ECD models for each IC were computed using the DIPFIT2 toolbox, freely available at https://sccn.ucsd.edu/wiki/A08:_DIPFIT (Oostenveld and Oostendorp, 2002). Standard 10–20 electrode coordinates were warped to a boundary element head model (BEM) based on the MNI152 template (Mazziotta et al., 2001), followed by automated coarse-fitting to yield a single dipole model for each of 1716 ICs (66 ICs × 26 participants). Dipole localization entails back-projecting the signal to a source that may have generated the scalp potential distribution for a given IC, and then computing the best forward model to explain the highest percentage of scalp map variance (Delorme et al., 2012). The discrepancy between this forward model and the original scalp-recorded signal constitutes residual variance (RV), a putative measure of “goodness of fit” where lower values indicate higher confidence in the dipole model.
Group data analyses were conducted via the EEGLAB STUDY module. The STUDY module allows ICA data from multiple participants across conditions to be analyzed using specified designs. In the current study, there were five paired contrasts of interest (solo vs. each experimental condition). The STUDY module allows further filtering to be applied with respect to the RV in dipole localization and inclusion vs. exclusion of “out-of-head” dipoles. Thus, ICA files with dipole information from each individual (see above) were applied to the two separate STUDY modules. For the purposes of measuring neural activity, only “in-head” dipoles with RV <20% were analyzed.
For the purposes of identifying peri-labial EMG activity, a second STUDY was conducted that included “all” dipoles from in-head and out-of-head. In this second STUDY, the RV criterion was raised to 50% (Gramann et al., 2010), because EMG activity emanates from outside the head and by nature, muscular movement incurs higher unexplained RV.
Principal Component Clustering of ICs
The EEGLAB STUDY module was used to perform all group level analyses, as it enables comparison of ICA data across conditions and between groups. ICs were initially preclustered based on similarities in scalp map, spectra, and dipole location. PCA was implemented via the K-means statistical toolbox, allocating the ICs to 40 component clusters from which left and right μ clusters were identified. Final component designation to left and right μ clusters was based largely on the initial results of PCA, though neighboring clusters were visually inspected for misallocated components. Inclusion criteria for the μ clusters were a characteristic μ spectrum, RV < 20%, and ECD localization across the sensorimotor cortex (BA 1, 2, 3, 4, 6). Any components not meeting these criteria were removed from the μ clusters and excluded from further analysis. An additional component cluster, representing peri-labial EMG activity, was identified from the results of PCA in the second study. As myogenic activity is associated with higher RV, membership to the peri-labial EMG cluster was based primarily on dipole location and the presence of ERSP activity.
ECD dipole locations (generated via DIPFIT2) were submitted to PCA in the STUDY module. Talairach cortical atlas was used to confirm the dipole location, with electrode locations cross-registered between realistic cortical anatomy and standard spherical (BESA) head models (Towle et al., 1993). Source localization for the μ clusters of interest was based on the mean of the Talairach coordinates for each contributing dipole. As localization was based on standard rather than subject-specific head models and channel locations, ECD location best served as confirmation of component origin within sensorimotor regions that are known to generate μ rhythms rather than an exact cortical location (i.e., ventral vs. dorsal premotor cortex). Even without exact locations, the necessary inclusion step of considering the spectral characteristic of each component included in the cluster helped to ensure the sensorimotor nature of the components.
ERSP analyses were conducted to quantify fluctuations in spectral power (in normalized dB units) across the trial epochs in the frequency bands of interest. Time-frequency transformations were generated with a Morlet wavelet rising linearly from three cycles at 3 Hz to 25.6 cycles at 34 Hz. Spectral perturbations were referenced to a silent 1000 ms baseline taken from the inter-trial interval. A surrogate distribution was constructed from 200 randomly sampled time points within this baseline period, against which individual ERSP changes were generated with a bootstrap resampling method with a threshold of p < 0.05 (with false discovery rate, FDR). Data from all experimental conditions from 5 Hz to 30 Hz and between −500 ms and 2500 ms were included in the ERSP analysis. The random distribution represents the null hypothesis that no conditional differences exist. Type 1 error was controlled by correcting for FDRs (Benjamini and Hochberg, 2000). Statistical analyses used a series of paired comparisons between the solo (control) and each of the four experimental conditions with a 95% confidence interval (p < 0.05).
For each condition, participants reported that they completed each task accurately, using the previously trained techniques as instructed. Participants also reported no stuttering in the DAF condition. For solo, choral and DAF conditions, participants were easily able to speak all of the 9–12 syllable stimuli. In the prolonged condition, participants were only able to produce the first 1–3 syllables in the allotted 2000 ms speaking window. In the pseudostuttering condition, participants indicated that they could produce the three repetitions and 1–3 additional syllables in the allotted time. Following completion, all participants reported that they produced slow prolongations and pseudostuttering in the same manner that they were trained. Data from EMG recordings also support the claim that participants performed endogenous tasks accurately, as EMG activity began shortly following the cue to speak.
μ and EMG Cluster Characteristics
19/24 participants were right-handed (two contributing participants were left-handed and were excluded to control for potential differences in cerebral dominance for language) and produced components with <20% unexplained RV that contributed to at least one μ cluster. Table 1 describes participants and their μ cluster contributions. For the left μ cluster, the Talaraich ECD location was [−35, −10, 46] which contained 21 ICs and localized to BA 6. The Talaraich ECD location for the right was [32, −11, 45] which contained 12 ICs and localized to BA 6. The average RV in the μ clusters was 6.05% and 4.31% for the left and right clusters, respectively. Of the 19 participants who contributed to the μ clusters, 14/19 contributed to the EMG cluster. This cluster had an average unexplained RV of 8.06%. While all participants had an EMG component, some did not meet inclusion criteria, primarily due to location. Figure 2 displays the spectra and dipole clusters for the left and right μ clusters, respectively.
Figure 2. Sources and spectra. Equivalent current dipole (ECD) sources and average spectra for each condition in (A) left and (B) right clusters of mu components.
Time Frequency Analyses
Time frequency analyses (ERSPs) considered activity within the 4–33 Hz bandwidth for the left and right μ clusters. The ERSP analyses show significant ERS/ERD changes from baseline across conditions in the left and right μ clusters. Figures 3–6 illustrate the results of time-frequency contrasts between the solo and each of the experimental speaking conditions. In addition to neural activity, activity from the EMG component is temporally decomposed so that neural activity can be temporally referenced to muscle activity.
Figure 3. Solo vs. choral contrast. Event-related spectral perturbations (ERSP) analyses showing time-frequency differences between the solo and choral condition. The first two rows show left and right mu rhythm scalp maps, followed by time-frequency patterns of event-related synchronization (ERS) and event-related desynchronization (ERD), followed by statistically different (pFDR < 0.05) time frequency voxels between the two conditions. The last row shows the differences in EMG activity.
Figure 4. Solo vs. delayed auditory feedback (DAF) contrast. ERSP analyses showing time-frequency differences between the solo and DAF condition. The first two rows show left and right mu rhythm scalp maps, followed by time-frequency patterns of ERS and ERD, followed by statistically different (pFDR < 0.05) time frequency voxels between the two conditions. The last row shows the differences in EMG activity.
Figure 5. Solo vs. prolonged speech contrast. ERSP analyses showing time-frequency differences between the solo and prolonged speech condition. The first two rows show left and right mu rhythm scalp maps, followed by time-frequency patterns of ERS and ERD, followed by statistically different (pFDR < 0.05) time frequency voxels between the two conditions. The last row shows the differences in EMG activity.
Figure 6. Solo vs. pseudostuttering contrast. ERSP analyses showing time-frequency differences between the solo and pseudostuttering condition. The first two rows show left and right mu rhythm scalp maps, followed by time-frequency patterns of ERS and ERD, followed by statistically different (pFDR < 0.05) time frequency voxels between the two conditions. The last row shows the differences in EMG activity.
Solo vs. Choral Speech
Figure 3 displays the solo vs. choral contrast. Significant differences found in choral speech are characterized by earlier and stronger μ-alpha and μ-beta ERD bilaterally. In the left hemisphere, choral speech produced stronger μ-alpha and μ-beta ERD before, during, and immediately after speech initiation. In the right hemisphere, choral speech produced stronger μ-alpha ERD across entire utterances. Additionally, differences in EMG activity were observed, with EMG activity being stronger in the choral condition prior to speech onset, but then becoming significantly weaker than the solo condition later in the utterance.
Solo vs. DAF
Figure 4 displays the solo vs. DAF contrast. Relative to the solo condition, there was significantly increased right hemisphere μ-alpha ERD in the DAF condition. No significant differences were observed in the left hemisphere. Additionally, EMG activity was weaker in the DAF condition.
Solo vs. Prolonged Speech
Figure 5 shows the solo vs. prolonged speech contrast. A few significant differences were observed between conditions in the time-frequency voxels, including about 700 ms of stronger right hemisphere μ-alpha ERD when prolonging syllables. However no clear pattern emerged that quantitatively differentiated between the conditions. Qualitatively, prolonged speech appears to produce more focal patterns of μ-alpha and μ-beta ERD, bilaterally.
Solo vs. Pseudostuttering
Figure 6 shows the solo vs. pseudostuttering contrast. Pseudostuttering produced significantly increased left hemisphere μ-beta ERD across utterances. While right hemisphere μ-alpha ERD also appeared stronger in pseudostuttering, no clear pattern of significant differences was observed in alpha frequencies relative to the control condition. EMG activity was relatively weaker in the pseudostuttering condition vs. the solo condition throughout speech production.
Left and right μ components were identified successfully from band-pass filtered concatenated EEG data during speech production conditions. 19/24 participants contributed components with ~5% unexplained RV. This proportion (79%) of useable components is similar to those found in previous studies (Nyström, 2008; Bowers et al., 2013; Jenson et al., 2014). It is important to note that in the current study, these proportions of contributors were achieved during complex, multi-syllable utterances using ICA to successfully separate neural from myogenic activity. Bilateral localization of components to the precentral gyrus across the premotor and sensorimotor cortices is consistent with accepted μ rhythm locations (Pineda, 2005; Hari, 2006) and sensorimotor function in speech production (Houde and Nagarajan, 2011; Tourville and Guenther, 2011). Time-frequency analyses then provided tests of the experimental hypothesis.
Time Frequency Analyses
In the solo (control) condition, participants spoke in their normal unaltered manner. Oscillatory μ activity was characterized by bilateral μ-alpha/beta ERD. This pattern of desynchronization is consistent with previous findings in speech production (Jenson et al., 2014) and other movements including walking (Seeber et al., 2014; Wagner et al., 2016), reaching (Tan et al., 2014, 2016), swallowing (Dziewas et al., 2003; Cuellar et al., 2016), and tongue tapping (Cuellar et al., 2016). In the current study, ERSP power appears stronger in the left hemisphere and more diffuse in the right. This finding is important because it is consistent with left hemisphere dominance for sensorimotor control in speech production (Specht, 2014). By observing hemispheric differences in oscillatory activity, the data indicate that μ rhythm activity clearly is sensitive to sensorimotor contributions in speech production (Jenson et al., 2014, 2015). If hemispheric differences were not observed, it would suggest that μ-alpha and μ-beta were simply encoding primary somatosensory and motor responses, respectively.
Exogenous Experimental Conditions
Compared to the solo condition, the exogenous experimental conditions (e.g., choral speech and DAF) were characterized by increases in μ-alpha ERD in the right hemisphere throughout the utterance along with simultaneous decreases in EMG activity. It is worth pointing out that neither exogenous condition induced disfluency, which otherwise may have influenced neural activity. Instead, data strongly suggest that the change in μ-alpha activity is related to a change in sensorimotor control.
The finding that both conditions elicit increases in right hemisphere activity is interesting given the differences between the exogenous signals in each condition. One condition, DAF, is a straightforward manipulation of the speaker’s auditory feedback: a temporal delay in an auditory feedback signal elicits auditory error cell activity as well as responses to counteract the disrupting effects of DAF (e.g., decreases in speech rate; Katz and Lackner, 1977). These responses, in turn, are incorporated into the DAF and illustrate how an individual speaker can control aspects of the perturbation. The other condition, choral speech, does not alter the speaker’s auditory feedback but instead requires the speaker to co-produce speech utterances, more or less simultaneously, with an external auditory signal. Since the external signal is independent of the speaker, auditory feedback is evaluated with respect to an external signal or target on a moment-to-moment basis to ensure that they are broadly comparable. A similar mechanism has been proposed to account for the rapid and robust responses speakers exhibit when instructed to alter their pitch to follow the direction of an auditory perturbation (Hain et al., 2000; Patel et al., 2014).
Despite these differences, both exogenous conditions increase the engagement, or gain, of the auditory feedback controller compared to the solo speech condition. The finding of increased μ-alpha ERD in the right, but not left, hemisphere likely reflects changes in the contribution of feedforward and feedback controllers to speech output. Increases in right hemisphere activation have been associated with increases in feedback control in both speech and limb motor studies (Toyomura et al., 2007; Grafton et al., 2008; Tourville et al., 2008; Golfinopoulos et al., 2011). Future studies may examine the effect of longer delays more likely to induce disfluencies in neurotypical speakers (Stuart et al., 2002; Corey and Cuddapah, 2008). Data from the current study suggest that the presence of either exogenous signal was sufficient to increase sensorimotor processing in the right hemisphere consistent with a shift towards greater feedback control.
The right hemisphere changes in the alpha channel are also of methodological importance. The ubiquitous nature of alpha rhythms imbues them with sensitivity to a wide range of cognitive and motor functions across the sensorimotor cortex (Pfurtscheller and Berghold, 1989). However, changes in μ-alpha have been implicated in sensory feedback processing (Pfurtscheller and Berghold, 1989; Jenson et al., 2014, 2015). Increased μ-alpha ERD under exogenous conditions with an increased reliance on auditory feedback continue to support these notions.
Compared to the solo condition, the choral speech condition was characterized by bilateral increases in μ-alpha ERD and μ-beta ERD during speech initiation, consistent with stronger bilateral sensorimotor control during speech initiation in choral speech. These increases were observed early in the trial, prior to the cue to speak. This time period also was characterized by low level EMG activity. Though participants did not begin speaking until the cue, the early EMG activity may have been due to weak muscle contractions while listening to the carrier phrase, which may also have elicited μ ERD. Alternatively, early μ ERD may have been related to covert rehearsal of the carrier phrase or the text that was about to be read. Because overt and covert production both been shown to elicit μ-alpha ERD and μ-beta ERD (Jenson et al., 2014), it is currently not possible to separate these influences. However, the cumulative effect is the carrier phrase provided a “go signal” (Kilavik et al., 2013) that activated the sensorimotor system earlier in choral speech than in the solo condition. In other words, participants appeared to receive sensorimotor “priming” for speaking prior to receiving the choral signal. Advanced priming prior to speaking may have applications for treating initiation difficulties related to stuttering (Saltuklaroglu et al., 2009).
Endogenous Experimental Conditions
Compared to solo, in the left hemisphere the prolonged speech condition appeared to produce more focal patterns of μ-alpha ERD and μ-beta ERD in both hemispheres. However, consistent with findings from De Nil et al. (2008), no robust differences in μ rhythm oscillations were observed. In the right hemisphere, μ-alpha ERD appeared stronger in prolonged than in solo, suggesting that prolonged speech was characterized by increased feedback due to the novelty of the task and/or increased monitoring. However, this difference only achieved significance for about 700 ms of the 2000 ms analysis window. Decreased EMG activity suggested reduced articulatory movements, providing additional evidence that participants were elongating vowels as instructed.
In contrast to prolonged speech, the pseudostuttering condition was characterized by increased μ-beta ERD observed throughout utterances. This increase in sensorimotor activity suggests strong activation of forward models during prolonged speech and contrasts with findings of De Nil et al. (2008) who did not report changes in neural activity under a similar speaking condition. In the current study, increased μ-beta ERD was observed alongside decreased EMG activity. Reduced EMG activity may be due to the longer time necessary to produce pseudostuttered syllables which results in reduced speech output. However, the finding indicates that stronger μ-beta ERD is not directly related to movement and, therefore, reflects a change in sensorimotor control. Increased activation of forward modeling in the pseudostuttering condition may be a consequence of the same syllable being repeated, strengthening the sensory prediction. It may be similar to short-term motor learning, much like adaptation effects in stuttering (Max et al., 2004). This effect may also have been heightened by increased attention due to the novelty of the task.
Similar to the prolonged speech condition, pseudostuttering also produced increases in right hemisphere μ-alpha ERD, though they failed to reach significance. Taken together, the findings raise the possibility that all fluency enhancing conditions were characterized by increased auditory feedback control relative to the solo condition. However, the differences only achieved significance across speech events in the exogenous conditions, likely due to their additional auditory processing requirements. It is possible that feedback-related changes from endogenous conditions were not sufficient to elicit robust changes in sensorimotor control in NS populations or that the current study was underpowered for detecting those changes.
Further Implications for Stuttering
Compared to the solo condition, exogenous and endogenous manipulations to speech motor control conditions produced differences in sensorimotor μ-alpha and μ-beta activity across the time course of utterances. In general, exogenous conditions in the current cohort of NS were associated with increased right hemisphere activity, likely reflecting increased feedback control (Grafton et al., 2008; Tourville et al., 2008). This may be important when applied to stuttering, as right hemisphere sensorimotor activity is thought to play a compensatory role for left hemisphere sensorimotor deficits (Sommer et al., 2002; Preibisch et al., 2003). However, more variable effects in PWS have been reported for exogenous signals, with choral speech and DAF producing decreases and increases in right hemisphere activity, respectively. Findings in PWS may be related to inherent variability noted with this disorder, levels of fluency achieved, or other influences such as scanner noise. As such, further testing using these quiet, temporally sensitive measures in PWS will better explain how fluency enhancing conditions work in PWS and whether they differentially alter sensorimotor control in PWS vs. NS.
Endogenous conditions were associated with relatively fewer changes in sensorimotor control, though pseudostuttering produced stronger left hemisphere sensorimotor activations, suggesting it may produce increases in left-hemisphere based feedforward control. Data may be compared to those from PWS who exhibit both long- (De Nil et al., 2003; Neumann et al., 2005) and short- (De Nil et al., 2008) term differences in sensorimotor control as a result of using endogenous techniques. The findings in PWS suggest that endogenous techniques may alter the relative contributions of both feedforward and feedback control, though further investigation is necessary. It also is interesting to consider how other endogenous fluency enhancing motor manipulations, such as whispering (Bloodstein, 1950) and accent modification (Bloodstein, 1950) might alter sensorimotor control in both PWS and neurotypical populations.
Though all participants produced “μ-like” components, some did not meet inclusion criteria. This is common in oscillatory studies due to inherent inter-individual variability when mapping neural function to cortical topography (Biermann-Ruben et al., 2005; Basile, 2007). The main reason some components did not meet inclusion criteria was due to noisy spectra. Despite filtering and ICA, which removed stereotypical noise, there was still a considerable amount of non-stereotypical noise present in the data, possibly due to the length and complexity of the spoken utterances. Additional causes for not being included in μ clusters are: (1) ICA failed to assign what appeared to be a μ component to one of the cortical regions known to generate sensorimotor μ rhythms; and (2) ICA failed to fit a μ component dipole in the sensorimotor region with less than 20% RV. These problems likely stem from the use of standard head models and channel locations, which reduce source localization accuracy. Such limitations continue to highlight the general need for improved spatial resolution in EEG techniques. Also, while the results indicate that participants complied with instructions in all speaking conditions, this assertion would be bolstered by additional acoustic and/or kinematic data.
Conclusions and Future Directions
ICA successfully unmixed raw EEG data collected during speech production to identify sensorimotor μ components. Subsequent time-frequency analyses of μ component clusters revealed oscillatory changes in μ-alpha and μ-beta bands which were indicative of real-time adjustments in sensorimotor control that were differentiated by speaking condition. The data provide further evidence that the temporal resolution of EEG can be exploited for understanding sensorimotor control in NS populations with an eye towards examining differences in sensorimotor control related to stuttering and other communication disorders. A number of recent studies continues to demonstrate the utility of EEG for examining sensorimotor differences related to stuttering. For example, Daliri and Max (2018) used measures of N1 amplitude during speech motor planning to demonstrate that DAF reduced auditory modulations in NS but increased them in PWS. Sengupta et al. (2016) demonstrated that PWS showed reduced sensorimotor adaptation to DAF that was accompanied by regional differences in EEG phase coherence. Using a similar paradigm to this one, Saltuklaroglu et al. (2017) demonstrated reduced μ-beta spectral amplitudes and differences in PWS, accompanied by stuttering-related μ-alpha and μ-beta oscillatory differences in speech and tone perception tasks. To better inform regarding the neurophysiology and amelioration of stuttering, future studies using these techniques should extend to examining measures of μ and auditory connectivity differences between PWS and NS, characterizing the timing and the directionality of auditory-motor interactions before, during, and after the utterance. In addition to helping elucidate sensorimotor differences related to stuttering, such data will help inform current theories and models of sensorimotor control in normal speech production.
TK and TS conceived and designed the study, performed the statistical analyses, wrote the first version of the manuscript, and edited all subsequent versions. DT and DEJ assisted TK with data collection and analysis. TK, KJR, TS, AWH, DC, TH and ALB wrote and/or edited sections of the manuscript. All authors contributed to manuscript revision, read and approved the submitted version.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Andrews, G., Craig, A., Feyer, A. M., Hoddinott, S., Howie, P., and Neilson, M. (1983). Stuttering: a review of research findings and theories circa 1982. J. Speech Hear. Disord. 48, 226–246. doi: 10.1044/jshd.4803.246a
Andrews, G., Howie, P. M., Dozsa, M., and Guitar, B. E. (1982). Stuttering: speech pattern characteristics under fluency-inducing conditions. J. Speech Hear. Res. 25, 208–216. doi: 10.1044/jshr.2502.208
Bauer, J., Hubbard, J., Seery, C., LaBonte, R., and Ruhnke, L. (2007). Voice F0 responses elicited by perturbations in pitch of auditory feedback in individuals that stutter and controls. J. Acoust. Soc. Am. 121, 3201–3201. doi: 10.1121/1.4782465
Benjamini, Y., and Hochberg, Y. (2000). On the adaptive control of the false discovery rate in multiple testing with independent statistics. J. Educ. Behav. Stat. 25, 60–83. doi: 10.3102/10769986025001060
Biermann-Ruben, K., Salmelin, R., and Schnitzler, A. (2005). Right rolandic activation during speech perception in stutterers: a MEG study. Neuroimage 25, 793–801. doi: 10.1016/j.neuroimage.2004.11.024
Bowers, A., Saltuklaroglu, T., Harkrider, A., and Cuellar, M. (2013). Suppression of the mu rhythm during speech and non-speech discrimination revealed by independent component analysis: implications for sensorimotor integration in speech processing. PLoS One 8:e72024. doi: 10.1371/journal.pone.0072024
Braun, A. R., Varga, M., Stager, S., Schulz, G., Selbie, S., Maisog, J. M., et al. (1997). Altered patterns of cerebral activity during speech and language production in developmental stuttering. An H2(15)O positron emission tomography study. Brain 120, 761–784. doi: 10.1093/brain/120.5.761
Brinkman, L., Stolk, A., Dijkerman, H. C., de Lange, F. P., and Toni, I. (2014). Distinct roles for α- and β-band oscillations during mental simulation of goal-directed actions. J. Neurosci. 34, 14783–14792. doi: 10.1523/JNEUROSCI.2039-14.2014
Brown, S., Ingham, R. J., Ingham, J. C., Laird, A. R., and Fox, P. T. (2005). Stuttered and fluent speech production: an ALE meta-analysis of functional neuroimaging studies. Hum. Brain Mapp. 25, 105–117. doi: 10.1002/hbm.20140
Cai, S., Beal, D. S., Ghosh, S. S., Tiede, M. K., Guenther, F. H., and Perkell, J. S. (2012). Weak responses to auditory feedback perturbation during articulation in persons who stutter: evidence for abnormal auditory-motor transformation. PLoS One 7:e41830. doi: 10.1371/journal.pone.0041830
Carlqvist, H., Nikulin, V. V., Strömberg, J. O., and Brismar, T. (2005). Amplitude and phase relationship between α and β oscillations in the human electroencephalogram. Med. Biol. Eng. Comput. 43, 599–607. doi: 10.1007/bf02351033
Chang, S. E., Kenney, M. K., Loucks, T. M., and Ludlow, C. L. (2009). Brain activation abnormalities during speech and non-speech in stuttering speakers. Neuroimage 46, 201–212. doi: 10.1016/j.neuroimage.2009.01.066
Cherry, E. C., and Sayers, B. M. (1956). Experiments upon the total inhibition of stammering by external control and some clinical results. J. Psychosom. Res. 1, 233–246. doi: 10.1016/0022-3999(56)90001-0
Cheyne, D., Gaetz, W., Garnero, L., Lachaux, J. P., Ducorps, A., Schwartz, D., et al. (2003). Neuromagnetic imaging of cortical oscillations accompanying tactile stimulation. Cogn. Brain Res. 17, 599–611. doi: 10.1016/s0926-6410(03)00173-3
Civier, O., Bullock, D., Max, L., and Guenther, F. H. (2013). Computational modeling of stuttering caused by impairments in a basal ganglia thalamo-cortical circuit involved in syllable selection and initiation. Brain Lang. 126, 263–278. doi: 10.1016/j.bandl.2013.05.016
Civier, O., Tasko, S. M., and Guenther, F. H. (2010). Overreliance on auditory feedback may lead to sound/syllable repetitions: simulations of stuttering and fluency-inducing conditions with a neural model of speech production. J. Fluency Disord. 35, 246–279. doi: 10.1016/j.jfludis.2010.05.002
Corey, D. M., and Cuddapah, V. A. (2008). Delayed auditory feedback effects during reading and conversation tasks: gender differences in fluent adults. J. Fluency Disord. 33, 291–305. doi: 10.1016/j.jfludis.2008.12.001
Crone, N. E., Miglioretti, D. L., Gordon, B., Sieracki, J. M., Wilson, M. T., Uematsu, S., et al. (1998). Functional mapping of human sensorimotor cortex with electrocorticographic spectral analysis. I. α and β event-related desynchronization. Brain 121, 2271–2299. doi: 10.1093/brain/121.12.2271
Cuellar, M., Harkrider, A. W., Jenson, D., Thornton, D., Bowers, A., and Saltuklaroglu, T. (2016). Time-frequency analysis of the EEG mu rhythm as a measure of sensorimotor integration in the later stages of swallowing. Clin. Neurophysiol. 127, 2625–2635. doi: 10.1016/j.clinph.2016.04.027
Daliri, A., Prokopenko, R. A., Flanagan, J. R., and Max, L. (2014). Control and prediction components of movement planning in stuttering versus nonstuttering adults. J. Speech Lang. Hear. Res. 57, 2131–2141. doi: 10.1044/2014_JSLHR-s-13-0333
de Graaf, T. A., Gross, J., Paterson, G., Rusch, T., Sack, A. T., and Thut, G. (2013). α-band rhythms in visual task performance: phase-locking by rhythmic sensory stimulation. PLoS One 8:e60035. doi: 10.1371/journal.pone.0060035
de Lange, F. P., Jensen, O., Bauer, M., and Toni, I. (2008). Interactions between posterior γ and frontal α/β oscillations during imagined actions. Front. Hum. Neurosci. 2:7. doi: 10.3389/neuro.09.007.2008
De Nil, L. F., Beal, D. S., Lafaille, S. J., Kroll, R. M., Crawley, A. P., and Gracco, V. L. (2008). The effects of simulated stuttering and prolonged speech on the neural activation patterns of stuttering and nonstuttering adults. Brain Lang. 107, 114–123. doi: 10.1016/j.bandl.2008.07.003
De Nil, L. F., Kroll, R. M., Kapur, S., and Houle, S. (2000). A positron emission tomography study of silent and oral single word reading in stuttering and nonstuttering adults. J. Speech Lang. Hear. Res. 43, 1038–1053. doi: 10.1044/jslhr.4304.1038
De Nil, L. F., Kroll, R. M., Lafaille, S. J., and Houle, S. (2003). A positron emission tomography study of short- and long-term treatment effects on functional brain activation in adults who stutter. J. Fluency Disord. 28, 357–379; quiz 379–380. doi: 10.1016/j.jfludis.2003.07.002
Dziewas, R., Sörös, P., Ishii, R., Chau, W., Henningsen, H., Ringelstein, E. B., et al. (2003). Neuroimaging evidence for cortical involvement in the preparation and in the act of swallowing. Neuroimage 20, 135–144. doi: 10.1016/s1053-8119(03)00285-4
Fox, N. A., Bakermans-Kranenburg, M. J., Yoo, K. H., Bowman, L. C., Cannon, E. N., Vanderwert, R. E., et al. (2016). Assessing human mirror activity with EEG mu rhythm: a meta-analysis. Psychol. Bull. 142, 291–313. doi: 10.1037/bul0000031
Fox, P. T., Ingham, R. J., Ingham, J. C., Zamarripa, F., Xiong, J. H., and Lancaster, J. L. (2000). Brain correlates of stuttering and syllable production: a PET performance-correlation analysis. Brain 123, 1985–2004. doi: 10.1093/brain/123.10.1985
Ghosh, S. S., Tourville, J. A., and Guenther, F. H. (2008). A neuroimaging study of premotor lateralization and cerebellar involvement in the production of phonemes and syllables. J. Speech Lang. Hear. Res. 51, 1183–1202. doi: 10.1044/1092-4388(2008/07-0119)
Goldiamond, I. (1965). “Stuttering and fluency as manipulatable operant response classes,” in Research in Behavior Modification, eds L. Krasner and L. P. Ullmann (New York, NY: Holt, Rinehart & Winston), 106–156.
Golfinopoulos, E., Tourville, J. A., Bohland, J. W., Ghosh, S. S., Nieto-Castanon, A., and Guenther, F. H. (2011). fMRI investigation of unexpected somatosensory feedback perturbation during speech. Neuroimage 55, 1324–1338. doi: 10.1016/j.neuroimage.2010.12.065
Grafton, S. T., Schmitt, P., Van Horn, J., and Diedrichsen, J. (2008). Neural substrates of visuomotor learning based on improved feedback control and prediction. Neuroimage 39, 1383–1395. doi: 10.1016/j.neuroimage.2007.09.062
Guenther, F. H., Ghosh, S. S., and Tourville, J. A. (2006). Neural modeling and imaging of the cortical interactions underlying syllable production. Brain Lang. 96, 280–301. doi: 10.1016/j.bandl.2005.06.001
Hain, T. C., Burnett, T. A., Kiran, S., Larson, C. R., Singh, S., and Kenney, M. K. (2000). Instructing subjects to make a voluntary response reveals the presence of two components to the audio-vocal reflex. Exp. Brain Res. 130, 133–141. doi: 10.1007/s002219900237
Jenson, D., Bowers, A. L., Harkrider, A. W., Thornton, D., Cuellar, M., and Saltuklaroglu, T. (2014). Temporal dynamics of sensorimotor integration in speech perception and production: independent component analysis of EEG data. Front. Psychol. 5:656. doi: 10.3389/fpsyg.2014.00656
Jenson, D., Harkrider, A. W., Thornton, D., Bowers, A. L., and Saltuklaroglu, T. (2015). Auditory cortical deactivation during speech production and following speech perception: an EEG investigation of the temporal dynamics of the auditory α rhythm. Front. Hum. Neurosci. 9:534. doi: 10.3389/fnhum.2015.00534
Jones, S. R., Pritchett, D. L., Sikora, M. A., Stufflebeam, S. M., Hämäläinen, M., and Moore, C. I. (2009). Quantitative analysis and biophysically realistic neural modeling of the MEG mu rhythm: rhythmogenesis and modulation of sensory-evoked responses. J. Neurophysiol. 102, 3554–3572. doi: 10.1152/jn.00535.2009
Kalinowski, J., Armson, J., Roland-Mieszkowski, M., Stuart, A., and Gracco, V. L. (1993). Effects of alterations in auditory feedback and speech rate on stuttering frequency. Lang. Speech 36, 1–16. doi: 10.1177/002383099303600101
Kalinowski, J., and Saltuklaroglu, T. (2003). Choral speech: the amelioration of stuttering via imitation and the mirror neuronal system. Neurosci. Biobehav. Rev. 27, 339–347. doi: 10.1016/s0149-7634(03)00063-0
Kilavik, B. E., Zaepffel, M., Brovelli, A., MacKay, W. A., and Riehle, A. (2013). The ups and downs of β oscillations in sensorimotor cortex. Exp. Neurol. 245, 15–26. doi: 10.1016/j.expneurol.2012.09.014
Lee, T. W., Girolami, M., and Sejnowski, T. J. (1999). Independent component analysis using an extended infomax algorithm for mixed subgaussian and supergaussian sources. Neural Comput. 11, 417–441. doi: 10.1162/089976699300016719
Loucks, T., Kraft, S. J., Choo, A. L., Sharma, H., and Ambrose, N. G. (2011). Functional brain activation differences in stuttering identified with a rapid fMRI sequence. J. Fluency Disord. 36, 302–307. doi: 10.1016/j.jfludis.2011.04.004
Max, L., Guenther, F. H., Gracco, V. L., Ghosh, S. S., and Wallace, M. E. (2004). Unstable or insufficiently activated internal models and feedback-biased motor control as sources of dysfluency: a theoretical model of stuttering. Contemp. Issues Commun. Sci. Disord. 31, 105–122.
Mazziotta, J., Toga, A., Evans, A., Fox, P., Lancaster, J., Zilles, K., et al. (2001). A probabilistic atlas and reference system for the human brain: international consortium for brain mapping (ICBM). Philos. Trans. R. Soc. Lond. B Biol. Sci. 356, 1293–1322. doi: 10.1098/rstb.2001.0915
Mersov, A. M., Jobst, C., Cheyne, D. O., and De Nil, L. (2016). Sensorimotor oscillations prior to speech onset reflect altered motor networks in adults who stutter. Front. Hum. Neurosci. 10:443. doi: 10.3389/fnhum.2016.00443
Moisello, C., Blanco, D., Lin, J., Panday, P., Kelly, S. P., Quartarone, A., et al. (2015). Practice changes β power at rest and its modulation during movement in healthy subjects but not in patients with Parkinson’s disease. Brain Behav. 5:e00374. doi: 10.1002/brb3.374
Neumann, K., Preibisch, C., Euler, H. A., von Gudenberg, A. W., Lanfermann, H., Gall, V., et al. (2005). Cortical plasticity associated with stuttering therapy. J. Fluency Disord. 30, 23–39. doi: 10.1016/j.jfludis.2004.12.002
Niziolek, C. A., and Guenther, F. H. (2013). Vowel category boundaries enhance cortical and behavioral responses to speech feedback alterations. J. Neurosci. 29, 12090–12098. doi: 10.1523/JNEUROSCI.1008-13.2013
O’Brian, S., Onslow, M., Cream, A., and Packman, A. (2003). The Camperdown Program: outcomes of a new prolonged-speech treatment model. J. Speech Lang. Hear. Res. 46, 933–946. doi: 10.1044/1092-4388(2003/073)
Oostenveld, R., and Oostendorp, T. F. (2002). Validating the boundary element method for forward and inverse EEG computations in the presence of a hole in the skull. Hum. Brain Mapp. 17, 179–192. doi: 10.1002/hbm.10061
Patel, S., Nishimura, C., Lodhavia, A., Korzyukov, O., Parkinson, A., Robin, D. A., et al. (2014). Understanding the mechanisms underlying voluntary responses to pitch-shifted auditory feedback. J. Acoust. Soc. Am. 135, 3036–3044. doi: 10.1121/1.4870490
Peeva, M. G., Guenther, F. H., Tourville, J. A., Nieto-Castanon, A., Anton, J.-L., Nazarian, B., et al. (2010). Distinct representations of phonemes, syllables, and supra-syllabic sequences in the speech production network. Neuroimage 50, 626–638. doi: 10.1016/j.neuroimage.2009.12.065
Pfurtscheller, G., and Berghold, A. (1989). Patterns of cortical activation during planning of voluntary movement. Electroencephalogr. Clin. Neurophysiol. 72, 250–258. doi: 10.1016/0013-4694(89)90250-2
Pfurtscheller, G., and Lopes da Silva, F. H. (1999). Event-related EEG/MEG synchronization and desynchronization: basic principles. Clin. Neurophysiol. 110, 1842–1857. doi: 10.1016/s1388-2457(99)00141-8
Preibisch, C., Neumann, K., Raab, P., Euler, H. A., von Gudenberg, A. W., Lanfermann, H., et al. (2003). Evidence for compensation for stuttering by the right frontal operculum. Neuroimage 20, 1356–1364. doi: 10.1016/s1053-8119(03)00376-8
Riecker, A., Ackermann, H., Wildgruber, D., Dogil, G., and Grodd, W. (2000). Opposite hemispheric lateralization effects during speaking and singing at motor cortex, insula and cerebellum. Neuroreport 11, 1997–2000. doi: 10.1097/00001756-200006260-00038
Sakai, N., Masuda, S., Shimotomai, T., and Mori, K. (2009). Brain activation in adults who stutter under delayed auditory feedback: an fMRI study. Int. J. Speech Lang. Pathol. 11, 2–11. doi: 10.1080/17549500802588161
Saltuklaroglu, T., Harkrider, A. W., Thornton, D., Jenson, D., and Kittilstved, T. (2017). EEG Mu micro rhythm spectra and oscillatory activity differentiate stuttering from non-stuttering adults. Neuroimage 153, 232–245. doi: 10.1016/j.neuroimage.2017.04.022
Saltuklaroglu, T., Kalinowski, J., Dayalu, V. N., Stuart, A., and Rastatter, M. P. (2004). Voluntary stuttering suppresses true stuttering: a window on the speech perception-production link. Percept. Psychophys. 66, 249–254. doi: 10.3758/bf03194876
Saltuklaroglu, T., Kalinowski, J., Robbins, M., Crawcour, S., and Bowers, A. (2009). Comparisons of stuttering frequency during and after speech initiation in unaltered feedback, altered auditory feedback and choral speech conditions. Int. J. Lang. Commun. Disord. 44, 1000–1017. doi: 10.1080/13682820802546951
Sebastiani, V., de Pasquale, F., Costantini, M., Mantini, D., Pizzella, V., Romani, G. L., et al. (2014). Being an agent or an observer: different spectral dynamics revealed by MEG. Neuroimage 2, 717–728. doi: 10.1016/j.neuroimage.2014.08.031
Seeber, M., Scherer, R., Wagner, J., Solis-Escalante, T., and Müller-Putz, G. R. (2014). EEG β suppression and low γ modulation are different elements of human upright walking. Front. Hum. Neurosci. 8:485. doi: 10.3389/fnhum.2014.00485
Sengupta, R., Shah, S., Gore, K., Loucks, T., and Nasir, S. M. (2016). Anomaly in neural phase coherence accompanies reduced sensorimotor integration in adults who stutter. Neuropsychologia 93, 242–250. doi: 10.1016/j.neuropsychologia.2016.11.004
Sommer, M., Koch, M. A., Paulus, W., Weiller, C., and Büchel, C. (2002). Disconnection of speech-relevant brain areas in persistent developmental stuttering. Lancet 360, 380–383. doi: 10.1016/s0140-6736(02)09610-1
Stuart, A., Kalinowski, J., Rastatter, M. P., and Lynch, K. (2002). Effect of delayed auditory feedback on normal speakers at two speech rates. J. Acoust. Soc. Am. 111, 2237–2241. doi: 10.1121/1.1466868
Szurhaj, W., Derambure, P., Labyt, E., Cassim, F., Bourriez, J. L., Isnard, J., et al. (2003). Basic mechanisms of central rhythms reactivity to preparation and execution of a voluntary movement: a stereoelectroencephalographic study. Clin. Neurophysiol. 114, 107–119. doi: 10.1016/s1388-2457(02)00333-4
Tamura, T., Gunji, A., Takeichi, H., Shigemasu, H., Inagaki, M., Kaga, M., et al. (2012). Audio-vocal monitoring system revealed by mu-rhythm activity. Front. Psychol. 3:225. doi: 10.3389/fpsyg.2012.00225
Tan, H., Jenkinson, N., and Brown, P. (2014). Dynamic neural correlates of motor error monitoring and adaptation during trial-to-trial learning. J. Neurosci. 34, 5678–5688. doi: 10.1523/JNEUROSCI.4739-13.2014
Tan, H., Wade, C., and Brown, P. (2016). Post-movement β activity in sensorimotor cortex indexes confidence in the estimations from internal models. J. Neurosci. 36, 1516–1528. doi: 10.1523/JNEUROSCI.3204-15.2016
Team Audacity. (2015). Audacity (Version 2.1.0). Audio Editor and Recorder. Avaliable online at: https://audacityteam.org/
Towle, V. L., Bolaños, J., Suarez, D., Tan, K., Grzeszczuk, R., Levin, D. N., et al. (1993). The spatial location of EEG electrodes: locating the best-fitting sphere relative to cortical anatomy. Electroencephalogr. Clin. Neurophysiol. 86, 1–6. doi: 10.1016/0013-4694(93)90061-y
Toyomura, A., Koyama, S., Miyamaoto, T., Terao, A., Omori, T., Murohashi, H., et al. (2007). Neural correlates of auditory feedback control in human. Neuroscience 146, 499–503. doi: 10.1016/j.neuroscience.2007.02.023
van Borsel, J., Reunes, G., and Van den Bergh, N. (2003). Delayed auditory feedback in the treatment of stuttering: clients as consumers. Int. J. Lang. Commun. Disord. 38, 119–129. doi: 10.1080/1368282021000042902
van Lieshout, P. H., Hulstijn, W., and Peters, H. F. (1996). Speech production in people who stutter: testing the motor plan assembly hypothesis. J. Speech Hear. Res. 39, 76–92. doi: 10.1044/jshr.3901.76
Wagner, J., Makeig, S., Gola, M., Neuper, C., and Muller-Putz, G. (2016). Distinct β band oscillatory networks subserving motor and cognitive control during gait adaptation. J. Neurosci. 36, 2212–2226. doi: 10.1523/JNEUROSCI.3543-15.2016
Wang, Y., and Jung, T. P. (2012). “Improving brain-computer interfaces using independent component analysis,” in Towards Practical Brain-Computer Interfaces, eds B. Z. Allison, S. Dunne, R. Leeb, J. D. R. Millán and A. Nijholt (Berlin, Heidelberg: Springer), 67–83.
Keywords: speech production, fluency enhancing conditions, EEG, mu rhythm, independent component analysis
Citation: Kittilstved T, Reilly KJ, Harkrider AW, Casenhiser D, Thornton D, Jenson DE, Hedinger T, Bowers AL and Saltuklaroglu T (2018) The Effects of Fluency Enhancing Conditions on Sensorimotor Control of Speech in Typically Fluent Speakers: An EEG Mu Rhythm Study. Front. Hum. Neurosci. 12:126. doi: 10.3389/fnhum.2018.00126
Received: 07 November 2017; Accepted: 16 March 2018;
Published: 04 April 2018.
Edited by:Hans-Leo Teulings, Neuroscript, United States
Reviewed by:Douglas Owen Cheyne, Hospital for Sick Children, Canada
Soo-Eun Chang, University of Michigan Health System, United States
Copyright © 2018 Kittilstved, Reilly, Harkrider, Casenhiser, Thornton, Jenson, Hedinger, Bowers and Saltuklaroglu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tim Saltuklaroglu, firstname.lastname@example.org