Original Research ARTICLE
Menstrual Cycle Phase Modulates Auditory-Motor Integration for Vocal Pitch Regulation
- 1Department of Rehabilitation Medicine, The Sixth Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
- 2Department of Rehabilitation Medicine, Anhui No. 2 Province People's Hospital, Hefei, China
- 3Department of Rehabilitation Medicine, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
- 4Guangdong Provincial Key Laboratory of Brain Function and Disease, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
In adult females, previous work has demonstrated that changes in auditory function and vocal motor behaviors may accompany changes in gonadal steroids. Less is known, however, about the influence of gonadal steroids on auditory-motor integration for voice control in humans. The present event-related potential (ERP) study sought to examine the interaction between gonadal steroids and auditory feedback-based vocal pitch regulation across the menstrual cycle. Participants produced sustained vowels while hearing their voice unexpectedly pitch-shifted during the menstrual, follicular, and luteal phases of the menstrual cycle. Measurement of vocal and cortical responses to pitch feedback perturbations and assessment of estradiol and progesterone levels were performed in all three phases. The behavioral results showed that the menstrual phase (when estradiol levels are low) as associated with larger magnitudes of vocal responses than the follicular and luteal phases (when estradiol levels are high). Furthermore, there was a significant negative correlation between the magnitudes of vocal responses and estradiol levels. At the cortical level, ERP P2 responses were smaller during the luteal phase (when progesterone levels were high) than the menstrual and follicular phases (when progesterone levels were low). These findings show neurobehavioral evidence for the modulation of auditory-motor integration for vocal pitch regulation across the menstrual cycle, and provide important insights into the neural mechanisms and functional outcomes of gonadal steroids' influence on speech motor control in adult women.
Gonadal steroids, such as estradiol and progesterone, have demonstrable effects on behavior and neuronal activity involved in cognitive functions, emotional control, and sensory processing (Fernandez et al., 2003; Eisner et al., 2004; Jacobs and D'Esposito, 2011). In addition to their known involvement in those brain functions, gonadal steroids are believed to influence the control of vocal communication. Evidence from birdsong research shows that estrogen helps maintain the plasticity of the song system to acquire new sensory models of song, and the lack of estrogen during the normal critical period of song learning impairs the development of adult song (Schlinger, 1997). In humans, fluctuations of gonadal steroids drive changes in voice quality such as roughness, breathiness, and asthenia (Meurer et al., 2009; Raj et al., 2010; Çelik et al., 2013) and may influence the timing of voice onset and offset (Wadnerkar et al., 2006). As compared to premenopausal women, postmenopausal women suffer from vocal deficits including lower voice fundamental frequency (F0), lower vocal intensity, and more voice instability, and their voice quality was improved with hormone replacement therapy (D'Haeseleer et al., 2009). Evidence suggests that the vocal folds contain specific estrogen and progesterone receptors in the vocalis muscle and lamina propria (Ferguson et al., 1987; Newman et al., 2000). Therefore, changes in these gonadal steroids may influence vocal motor behavior through the receptor-coupled effector mechanisms.
Both estradiol and progesterone can also influence auditory function. For example, exposing female midshipman fish to estradiol during the non-breeding season makes their auditory nerves more sensitive to the frequency of the male mating call (Sisneros et al., 2004). Acute inhibition of estradiol production in songbirds suppresses burst firing of auditory neurons and disrupts their ability to process and respond to song stimuli (Remage-Healey et al., 2010). In humans, auditory acuity, spontaneous otoacoustic emissions, and auditory processing assessed by auditory brainstem response (ABR) and auditory event-related potentials (ERPs) vary significantly as gonadal steroids change across the menstrual cycle (Serra et al., 2003; Walpurger et al., 2004; Al-Mana et al., 2010). For example, N1-P2 peak responses to pure tones were significantly reduced during the luteal phase as compared to the menstrual and follicular phase of the menstrual cycle in women, and they were negatively correlated with estradiol/progesterone levels (Walpurger et al., 2004). At the brainstem level, there was a significant increase in the wave V latency of ABR in the follicular phase and a decrease in the luteal phase (Al-Mana et al., 2010).
The control of speech motor behavior involves the integration of sensory information, particularly auditory information, into the vocal motor systems (Smotherman, 2007). Although gonadal steroids can influence both vocal motor behavior and central auditory processing, less is known about the interaction between gonadal steroids and auditory-motor integration for voice control in adult women. Auditory feedback provides information necessary to monitor and correct for errors for the guidance of speech learning during the critical phases of development, and for the maintenance of online speech production (Houde and Jordan, 1998; Jones and Munhall, 2005). When perceiving alterations of F0, intensity, or formant frequency (F1) in auditory feedback, people produce rapid compensatory vocal adjustment to stabilize their production of vocal sounds around the desired level (Burnett et al., 1998; Bauer et al., 2006; Purcell and Munhall, 2006). At the cortical level, N1 and P2 responses elicited by pitch-shifted voice auditory feedback can be modified by stimulus features (Liu et al., 2011a; Scheerer et al., 2013), attentional demands (Hu et al., 2015; Liu et al., 2015), and the nature of the vocalization task (Behroozmand et al., 2009, 2011). These two components are thought to reflect the detection and correction of feedback errors during the online monitoring of self-produced speech (Behroozmand et al., 2011; Guo et al., 2016). Also, a number of neuroimaging studies have revealed the brain regions involved in auditory-motor integration for voice control such as the superior temporal gyrus (STG), dorsal lateral prefrontal cortex (DLPFC), anterior cingulate cortex (ACC), premotor cortex (PMC), and inferior frontal gyrus (IFG) (Zarate and Zatorre, 2008; Parkinson et al., 2012; Chang et al., 2013; Behroozmand et al., 2015; Belyk et al., 2016). Despite the progress made in understanding the feedback-based processing of speech motor control, the biological influence of gonadal steroids on the auditory-vocal integration is largely unknown.
The lack of knowledge is striking, considering that gonadal steroids influence functional connectivity in the DLPFC and the ACC (Dreher et al., 2007); both brain regions are involved in auditory-vocal integration (Zarate and Zatorre, 2008). In addition, estradiol and progesterone act synergistically on the vocal musculo-mucosal complex (Abitbol et al., 1999). Previous studies have shown an important association between gonadal steroids and voice control. For example, female singers experience difficulties singing high notes, producing accurate tone, and with intricate phonation control just prior to menstruation (Lacina, 1968). Hormonal shifts in women experiencing menopause or Turner's syndrome have also been associated with significant problems with voice, speech, and hearing (Caras, 2013). Thus, increasing our understanding of the interplay between gonadal steroids and the auditory-vocal system has important implications for the gonadal steroid effects on prevalence and treatment of voice/speech disorders, in particular for women.
The menstrual cycle offers a unique opportunity to investigate how fluctuations of gonadal steroids modulate sensory processing to shape behavior in women. By using the frequency altered feedback (FAF) paradigm (Burnett et al., 1998), therefore, the present study sought to investigate the effects of gonadal steroids on auditory-motor integration for voice control in a counterbalanced, repeated-measures design during the menstrual, follicular, and luteal phases of the menstrual cycle. Participants were instructed to sustain a vowel phonation while exposed to unexpected pitch perturbations in their voice auditory feedback. We measured the magnitudes and latencies of vocal and ERP responses (N1 and P2) to pitch feedback perturbations and assessed plasma estradiol and progesterone levels across the menstrual cycle. We expected that fluctuations in gonadal steroids across the menstrual cycle would influence the auditory-motor processing of pitch feedback errors at the levels of behavior and cortex.
Materials and Methods
Nineteen female right-handed adults aged 19–31 years of old, who were students at Sun Yat-sen University of China, participated in the experiment. All participants were right-handed, native speakers of Mandarin Chinese. The following criteria led to inclusion in the study: nonuse of oral contraceptives, no hormone-replacement therapy, no history of lactation and pregnancy, regular monthly menstrual cycle, no diagnosed premenstrual syndrome, no intake of neuroactive substances (e.g., alcohol, caffeine, drugs, etc.), no prior history of neurological, psychiatric, or endocrinological illnesses, nonsmokers, no speech, language, or hearing disorders, and normal body weight (body mass index between 18.5 and 23.9 kg/m2). In all participants, pure-tone thresholds were ≤25 dB hearing level (HL) for octaves from 500 to 4000 Hz bilaterally.
Of 19 participants who eventually entered the study, 8 had to be excluded because they had anovulatory cycles during the sampling period (N = 6) or their electrophysiological data failed to reach the criteria of inclusion (N = 2) (see below). Therefore, the final study set comprised 11 participants with a mean age of 23 years [19–29 years; standard deviation (SD) = 3.9], reportedly regular menstrual cycle length (28–32 days), and a mean body mass index of 20.3 (18.5–22.4; SD = 2.2).
All participants provided written informed consent in compliance with a protocol approved by the Institution Review Board of The First Affiliated Hospital at Sun Yat-sen University of China in accordance with the Code of Ethics of the World Medical Association (Declaration of Helsinki).
Three different phases of the menstrual cycle were investigated: menstrual phase (second to fourth day of bleeding), follicular phase (15–22 days before the onset of the new menstrual cycle), and luteal phase (3–9 days before the onset of the new menstrual cycle). During the menstrual phase, both the estradiol and progesterone levels were near baseline. The follicular phase is characterized by high estradiol and low progesterone levels, while both estradiol and progesterone levels are high during the luteal phase. Venous blood samples were collected 4 h prior to the experiment for determination of estradiol and progesterone levels in all three phases, leading to a total of 33 blood samples (3 phases × 11 subjects). Serum estradiol and progesterone levels were measured by chemiluminesecence immunoassay in the Immunology Laboratory of The First Affiliated Hospital at Sun Yat-sen University of China. For the estradiol, the assay sensitivity was 5.00 pg/ml, and the inter- and intra-assay coefficients of variation (CV) were 3.6 and 2.2%, respectively. For the progesterone, the assay sensitivity was 0.03 ng/ml, and the inter- and intra-assay CVs were 4.6 and 2.3%, respectively. A menstrual cycle was classified as anovulatory if progesterone levels did not rise above 5 ng/ml (Herzog et al., 1997).
Throughout the experiment, all participants sat in a sound-treated booth. Prior to the data recording, the experimental system was acoustically calibrated to ensure that participants heard the voice feedback with a gain of 10 dB sound pressure level (SPL) relative to the intensity of their vocal output. This gain was used to partially mask air-born and bone-conducted voice feedback (Behroozmand et al., 2009). Voice signals were recorded through a dynamic microphone (model DM2200, Takstar Inc.) and amplified with a MOTU Ultralite Mk3 Firewire audio interface. The amplified signals were then pitch-shifted by an Eventide Eclipse Harmonizer controlled by a custom-developed program (Max/MSP, v.5.0, Cycling 74). Direction, magnitude, and duration of the pitch perturbation and the inter-stimulus interval (ISI) were controlled by this program. Transistor-transistor logic (TTL) control pulses were generated to mark the onset and offset of the pitch perturbations. Finally, the pitch-shifted voices were amplified with an ICON NeoAmp headphone amplifier and fed back to participants through insert earphones (ER1-14A, Etymotic Research Inc.). The original voice, feedback and TTL control pulses were sampled at 10 kHz by a PowerLab A/D converter (model ML880, AD Instruments), and recorded using LabChart software (v.7.0 by AD Instruments).
To signal the onset of the pitch perturbations, we used a DIN synch cable to send the TTL control pulses to the electroencephalograph (EEG) recording system (Electrical Geodesics Inc., Eugene, OR). The EEG signals were recorded using a 64-electrode Geodesic Sensor Net, amplified by a Net Amps 300 amplifier, and saved onto a Mac Pro computer using NetStation software (v. 4.5, Electrical Geodesics Inc., Eugene, OR) with a sampling frequency of 1 kHz. During the online recording, the EEG signals across all channels were referenced to the vertex (Cz). Individual sensors were carefully adjusted to ensure that their impedance levels were <50kΩ throughout the EEG recording (Ferree et al., 2001).
During each phase, participants completed the FAF-based vocal production experiment. We counterbalanced the menstrual cycle phases and randomly assigned subjects to perform the vocal tasks in a different combination of the menstrual cycle phase orders. Specifically, three of 11 participants included were tested first in their menstrual phase; 4 were tested first in their follicular phase; 4 were tested in their luteal phase. In the FAF-based vocal production experiment, participants were instructed to vocalize the vowel sound /u/ for approximately 5–6 s at their conversational pitch and loudness level. During each vocalization, participants heard their voice pitch-shifted downwards five times. The first stimulus occurred 500–1000 ms after the vocal onset, and the succeeding stimuli were presented with an ISI of 700–900 ms. Participants were asked to take a break of 2–3 s prior to initiating the next vocalization. Production of 20 consecutive vocalizations constituted one experimental block. A total of 100 trials were thus generated per block, within which the magnitude was held constant at −50 or −200 cents (100 cents = one semitone) and the duration of each perturbation was fixed at 200 ms. The magnitude of pitch perturbation was manipulated because previous research has shown its effect on the resultant behavioral and cortical responses (Behroozmand et al., 2009; Scheerer et al., 2013). The order of the two blocks was counterbalanced across all participants.
Vocal Responses Measurement
Digitized voice and feedback signals were analyzed offline using event-related averaging techniques (Liu et al., 2011b). First, pulses for each glottal cycle in the voice signals were generated using Praat (Boersma, 2001), and were converted to voice F0 contours in Hz in IGOR PRO (v.6.0, Wavemetrics Inc.). Secondly, the F0 values in Hz were converted to the cent scale using the formula: cents = 100 × (12 × log2(F0/reference)), where 195.997 Hz (G4) served as the arbitrary reference. Voice F0 contours were then segmented starting 200 ms before and ending 700 ms after the onset of the pitch perturbation. A visual inspection was performed on all individual trials to ensure that trials that were the result of vocal interruption or signal processing errors were excluded from further analyses. Finally, artifact-free trials were normalized by subtracting the mean F0 values in the baseline period (−200 to 0 ms) from the F0 values after the onset of the pitch perturbation and averaged to generate an overall response for each condition. An acceptable response was defined as one in which the contours had to exceed a value of two SDs of the pre-stimulus mean beginning at least 60 ms after the stimulus onset and lasting at least 50 ms. Response latency in milliseconds was determined as the time at which the response exceeded 2 SDs above or below the pre-stimulus mean following the perturbation onset. Response magnitude in cents was measured as the difference between the pre-stimulus mean and the peak value of the voice F0 contour following the response onset.
EEG Data Analyses
The EEG signals were sent to NetStation software for offline analyses. All channels were re-referenced to the average of electrodes on each mastoid and band-passed filtered at 1–20 Hz. The continuous EEG data was segmented into epochs with a window of 200 ms before and 500 ms after the onset of the pitch perturbation. Segmented trials were scanned for artifact contamination such as excessive muscular activity, eye blinks, and eye movement using the Artifact Detection toolbox in NetStation. Additional visual inspection was performed to ensure that artifacts were appropriately rejected. Individual electrodes were rejected if they contained artifacts more than 20% of the segments, and the file was excluded for further analyses if it contained more than 10 bad channels. Two participants mentioned above were excluded from further analyses due to high rejection rates (over 50% of trials). Overall, 82% of trials were retained across all participants. Finally, artifact-free trials were averaged and baseline-corrected for each condition to generate an overall response. The amplitudes and latencies of N1 and P2 components were measured from 10 electrodes (FC1, FC2, FCz, FC3, FC4, C1, C2, Cz, C3, and C4) as the negative and positive peaks in the time windows of 80–180 m and 160–280 ms after the onset of pitch perturbation and submitted to statistical analyses. These electrodes were chosen because cortical responses to pitch perturbations in voice auditory feedback are primarily pronounced in the N1 and P2 components recorded from the frontal-central electrodes (Hawco et al., 2009; Chen et al., 2012).
Values of vocal and ERP responses to pitch perturbations across conditions were subjected to repeated-measures analysis of variance (RM-ANOVAs) in SPSS (v.16.0). The magnitudes and latencies of vocal responses were subjected to two-way RM-ANOVAs, in which pitch shift (−50 and −200 cents) and menstrual cycle phase (menstrual, follicular, and luteal phase) were regarded as within-subject factors. Three-way RM-ANOVAs were used to analyze the amplitudes and the latencies of the N1 and P2 responses, including within-subjects factors of pitch shift, menstrual cycle phase, and electrode site. Subsidiary RM-ANOVAs were conducted if higher-order interactions reached significance. Probability values were corrected for multiple degrees of freedom using Greenhouse-Geisser if the assumption of sphericity was violated. As well, estradiol and progesterone data were subjected to one-way RM-ANOVAs with the within-subject factor of menstrual cycle phase. In addition, linear correlations using Pearson correlation coefficients were performed to examine the relationship between estradiol/progesterone levels and vocal/ERP responses to pitch perturbations.
As expected, there was a significant main effect of menstrual cycle phase for estradiol [F(2, 20) = 19.054, p < 0.001], in which there was a significant decrease in estradiol levels during the menstrual phase (22.6 ± 2.1 pg/ml) (mean ± standard error, same as below) as compared to the follicular phase (109.9 ± 18.2 pg/ml) (p = 0.001) and the luteal phase (98.7 ± 9.8 pg/ml) (p < 0.001). Similarly, the main effect of menstrual cycle phase for progesterone also reached significance [F(2, 20) = 42.674, p < 0.001], in which the luteal phase (9.0 ± 1.4 ng/ml) was associated with higher progesterone levels than both the menstrual (0.2 ± 0.02 ng/ml) (p < 0.001) and follicular phases (0.3 ± 0.06 ng/ml) (p < 0.001).
Figure 1 shows the grand-averaged voice F0 contours (A) and the T-bar plots (B) of the magnitudes of vocal responses to pitch-shifted auditory feedback. One two-way RM-ANOVA conducted on the magnitudes of vocal responses revealed a significant main effect of menstrual cycle phase [F(2, 20) = 5.888, p = 0.023], indicating a significant increase of response magnitude during the menstrual phase when compared with the follicular phase (p = 0.040) and luteal phase (p = 0.025) (See Figure 1B). The main effect of pitch shift [F(1, 10) = 0.408, p = 0.537] and the interaction between menstrual cycle phase and pitch shift [F(2, 20) = 0.531, p = 0.549] did not reach significance. For the latencies of vocal responses, there were no significant main effects of menstrual cycle phase [F(2, 20) = 0.420, p = 0.581], pitch shift [F(1, 10) = 2.873, p = 0.121] and their interactions [F(2, 20) = 1.397, p = 0.271].
Figure 1. (A) Grand-averaged voice F0 contours and (B) T-bar graphs (means and standard errors) of the magnitude of vocal responses to pitch perturbations across three menstrual cycle phases. The solid lines and bars in black, red, and blue represent the vocal responses during the menstrual, follicular, and luteal phases. The asterisks indicate significantly larger response magnitudes during the menstrual phase when compared with the follicular phase and luteal phase.
Figures 2, 3 show the grand-averaged ERP waveforms (left column) and topographical distributions of P2 responses (right column) as a function of menstrual cycle phase for the −50 and −200 cents conditions, respectively. As can be seen, change in cortical responses to pitch perturbations across the three menstrual cycle phases was primarily reflected in the P2 component. The luteal phase (blue lines) was associated with smaller P2 amplitudes than the menstrual (black lines) and follicular phases (red lines). This change can also be seen in the topographical distributions of P2 responses in Figures 2, 3. In contrast, N1 responses appeared to remain constant across the menstrual cycle phase.
Figure 2. Grand-averaged ERP waveforms (left) in responses to pitch-shift stimuli of −50 cents during the menstrual (black lines), follicular (red lines), and luteal phases (blue lines), and topographic distributions of P2 amplitudes (right) during the menstrual (top: latency = 233 ms), follicular (middle: latency = 227 ms), and luteal phases (bottom: latency = 242 ms).
Figure 3. Grand-averaged ERP waveforms (left) in response to pitch-shift stimuli of −200 cents during the menstrual (black lines), follicular (red lines), and luteal phases (blue lines), and topographic distributions of P2 amplitudes (right) during the menstrual (top: latency = 218 ms), follicular (middle: latency = 223 ms), and luteal phases (bottom: latency = 221 ms).
Figure 4 shows the T-bar graphs of the N1 and P2 responses as a function of menstrual cycle phase for the −50 (black bars) and −200 cents (blank bars) conditions. A three-way RM-ANOVA conducted on the N1 amplitudes revealed a significant main effect of electrode site [F(9, 90) = 3.363, p = 0.033], but N1 amplitudes did not differ as a function of menstrual cycle phase [F(2, 20) = 1.718, p = 0.205] or pitch shift [F(1, 10) = 1.049, p = 0.330] (see Figure 4A). As for the N1 latencies, the −50 cents condition elicited significantly longer N1 latencies than the −200 cents condition [F(1, 10) = 21.561, p = 0.001] (see Figure 4B). The main effects of menstrual cycle phase [F(2, 20) = 0.564, p = 0.578] and electrode site [F(9, 90) = 2.034, p = 0.137], however, did not reach significance. Neither did the interactions between any of these variables (p > 0.05).
Figure 4. Bar graphs (means and standard errors) of the magnitudes and latencies of N1 (A,B) and P2 (C,D) components as a function of stimulus and phase. The black and the blank bars denote the responses to −50 and −200 cents, respectively. The asterisks indicate significant differences between conditions.
A three-way RM-ANOVA conducted on the P2 amplitudes revealed significant main effects of menstrual cycle phase [F(2, 20) = 12.479, p < 0.001], pitch shift [F(1, 10) = 7.916, p = 0.018], and electrode site [F(9, 90) = 20.648, p < 0.001]. Post-hoc Bonferroni comparison revealed that P2 amplitudes were significantly smaller during the luteal phase than that during the menstrual phase (p = 0.007) and follicular phase (p < 0.001), while the difference between the menstrual and follicular phases did not reach significance (p = 0.850) (see Figure 4C). In addition, the −50 cents condition elicited significantly smaller P2 amplitudes than −200 cents condition (p = 0.018). There were no significant interactions between any of these variables (p > 0.05).
Regarding the P2 latencies, main effects of menstrual cycle phase [F(2, 20) = 0.652, p = 0.476], pitch shift [F(1, 10) = 3.209, p = 0.103], and electrode site [F(9, 90) = 1.080, p = 0.366] did not reach significance. A significant interaction, however, was found between menstrual cycle phase and pitch shift [F(2, 20) = 4.912, p = 0.018]. Follow-up two-way RM-ANOVAs revealed that the −50 cents condition elicited significantly longer P2 latencies than the −200 cents condition [F(1, 10) = 6.549, p = 0.028] during the luteal phase (see Figure 4D). The main effect of pitch shift on the P2 latencies, however, did not reach significance during the menstrual phase [F(1, 10) = 0.177, p = 0.683] or follicular phase [F(1, 10) = 4.771, p = 0.054].
In addition, linear correlation analyses were performed to examine the relationship between the magnitudes of vocal and ERP (P2) responses and estradiol/progesterone concentrations across the menstrual cycle phase. The results revealed a significant negative correlation between the mean magnitudes of vocal responses and estradiol levels (R2 = 0.143, p = 0.036), indicating that higher estradiol levels are associated smaller magnitudes of vocal responses (see Figure 5). The mean magnitudes of vocal responses, however, were not significantly correlated with progesterone levels (R2 = 0.016, p = 0.482). In addition, the mean amplitudes of P2 responses were not correlated with estradiol (R2 = 0.021, p = 0.425) and progesterone levels (R2 = 0.006, p = 0.659).
Figure 5. Scatter plot of the magnitudes of vocal responses to pitch perturbations as a function of estradiol level. There was a negative correlation between estradiol level and vocal response magnitude (R2 = 0.143, p = 0.036).
The present study investigated the influence of gonadal steroids on auditory-motor integration for voice control across the menstrual cycle in young adult women. The behavioral findings showed a significant increase in the magnitudes of vocal responses during the menstrual phase when estradiol levels were low, and that there was a significant negative correlation between the magnitudes of vocal responses and estradiol levels. At the cortical level, P2 amplitudes were significantly smaller during the luteal phase when progesterone levels were high when compared to the menstrual and follicular phases. These findings provide evidence for the modulation of neurobehavioral responses to pitch-shifted voice auditory feedback by menstrual cycle phase, suggesting that gonadal steroids, particularly estradiol, may have a modulatory effect on the auditory-motor processing of feedback errors during vocal pitch regulation.
Previous behavioral studies have demonstrated that changes in gonadal steroids can influence the perceptual features (e.g., roughness, breathiness, asthenia, etc.) of the voice across the menstrual cycle (Meurer et al., 2009; Raj et al., 2010; Çelik et al., 2013). For example, using GRBAS scale of perceptual evaluation, Çelik et al. (2013) reported that voice quality was at the highest level during the mid-menstrual period when estrogen and progesterone levels were high and showed a significant decrease during the premenstrual period when those hormones levels were low. Findings from the present study further show that auditory-motor integration for voice control can be modulated as a function of menstrual cycle phase. Our behavioral findings revealed larger vocal responses to pitch-shifted auditory feedback during the menstrual phase when estradiol levels were low and a significant negative correlation between them. This finding is in line with another study showing shorter VOT values for the voiced plosives and longer VOT values for the voiceless plosives in the phase with high estradiol and progesterone levels (Wadnerkar et al., 2006). On the other hand, the present study showed a significant decrease of P2 amplitudes during the luteal phase when progesterone levels were high. This finding is complimentary to previous studies that showed decreased cortical responses (N1-P2 peak amplitude) or shorter subcortical responses (ABR) to pure tones during the luteal phase when the progesterone levels were high in women (Serra et al., 2003; Walpurger et al., 2004; Al-Mana et al., 2010).
Our behavioral results revealed larger vocal responses to pitch perturbations during the menstrual phase relative to the follicular and luteal phases, while higher estradiol levels were associated with the menstrual phase than the follicular and luteal phase. Furthermore, Pearson correlation analyses revealed a significant negative correlation between the magnitudes of vocal responses and estradiol levels, suggesting a possible interaction between vocal compensation for pitch feedback errors and estradiol concentration. The effects of estradiol on the vocal motor behavior have been documented in animal studies. For example, the duration of fictive vocalization produced by the midshipman fish is rapidly responsive to steroid hormones including androgens and estrogens (Remage-Healey and Bass, 2004), and behavioral selectivity of conspecific vocalizations can be enhanced through modulating the synthesis of estradiol in seasonally-breeding songbirds (Caras, 2013). Although direct evidence of estradiol effect on vocal motor output in humans is scarce, it has been reported that specific estradiol receptors have been identified in the normal human larynx (Ferguson et al., 1987; Newman et al., 2000). By activating receptor-coupled effector mechanisms through binding to those specific receptors in the larynx, estradiol may be capable of directly influencing the female laryngeal muscles such as the cricohthyroid and thyroarytenoid muscles (Liu et al., 2011c) to modulate the vocal compensation for pitch-shifted auditory feedback. On the other hand, the neural substrates involved in the auditory-vocal integration include the STG, PMC, IFG, DLPFC, and ACC (Zarate and Zatorre, 2008; Behroozmand et al., 2015). Estradiol passes the blood-brain-barrier and binds to receptors in various parts of the brain, including the frontal and middle temporal lobes (Goldstein et al., 2001; Ostlund et al., 2003). Thus, estradiol may also influence vocal motor function through the modulation of these receptors in the neural substrates involved in the feedback-based voice control. Taken together, changes in vocal responses to pitch-shifted auditory feedback may be caused by estradiol through the specific receptors that exist in the larynx and the neural substrates involved in the auditory-vocal integration.
The present study also revealed a significant decrease in the amplitudes of P2 responses during the luteal phase when progesterone levels were high. This is consistent with previous findings that showed the inhibitory effect of progesterone on the central auditory processing. For example, the N1-P2 peak amplitude to neutral auditory stimuli was significantly decreased during the luteal phase and negatively correlated to the progesterone concentration (Walpurger et al., 2004). Postmenopausal females treated with estradiol alone had better performance of speech perception in noise than did those treated with both estradiol and progesterone (Guimaraes et al., 2006). Accumulating evidence has shown that P2 component reflects not only the central auditory processing (e.g., error detection) but also the feedback-based motor processing (e.g., error correction) in the online monitoring of vocal production. For example, when compensating for pitch perturbations in voice auditory feedback, individuals with Parkinson's disease produced significantly larger P2 responses than healthy controls due to enhanced activity in the brain regions including the STG, PMC, and IFG (Huang et al., 2016). Also, P2 responses were found to be significantly correlated with regional homogeneity of those brain regions in the resting-state (Guo et al., 2016). Thus, decreased P2 responses during the luteal phase rise observed in the present study are indicative of the influence of menstrual cycle phase on the cortical processing of vocal pitch regulation.
Although P2 responses were not significantly correlated with progesterone levels, we speculate that the observable decreased P2 responses with the luteal phase rise in progesterone might be partly related to the inhibitory effect of progesterone and its metabolites on the central auditory system through their interaction with the steroid binding sites on γ-aminobutyric acid-A (GABA-A) receptors (Follesa et al., 2001). Allopregnanolone, a progesterone metabolite, has been observed to positively modulate GABA-A receptor-evoked responses in cortical neurons (Stell et al., 2003), and its brain level mirrors changes in progesterone (Wang et al., 1996). 5-hydroxytryptamine (5-HT) is thought to be involved in the central auditory processing (Juckel et al., 1999), and high levels of allopregnanolone can enhance GABA-A receptor-mediated responses and inhibit 5-HT neuronal activity, leading to a decrease in neuronal excitability. The greatest GABA-A receptor inhibition of 5-HT release was found at times when progesterone and hence allopregnanolone, levels were highest (Felton and Auerbach, 2004), and allopregenanolone significantly potentiated the inhibitory responses of 5-HT neurons to GABA-A receptor activation (Kaura et al., 2007). Accordingly, a potential mechanism underlying the association between the decreased cortical responses to feedback perturbations and the luteal phase rise in progesterone in women may include an allopregnanolone-mediated potentiation of GABAergic inhibition of serotonergic transmission.
In addition to the individual actions of estradiol and progesterone, we cannot rule out the possibility that fluctuations of both estradiol and progesterone may make contributions to the modulation of neurobehavioral responses to pitch-shifted voice auditory feedback across the menstrual cycle. For example, participants with high estrogen levels produced smaller mismatch negativity (MMN) to neutral auditory stimuli than did participants with low estrogen levels (Schirmer et al., 2008), and women with Turner's syndrome who are estrogen deficient have higher rates of hearing decline and abnormal ABR relative to general population (Caras, 2013), suggesting the influence of estrogen on the central auditory processing (Pinaud and Tremere, 2012). On the other hand, the motor evoked potential, or the amplitude of the muscle response, has been shown to be significantly reduced during the luteal phase relative to the follicular phase in women, indicating an inhibitory effect of progesterone on the motor cortex (Smith et al., 2002, 2003). Moreover, some studies have reported a coadjuvant effect of estrogen and progesterone. Estradiol has been shown to induce nuclear progesterone receptors in some brain areas, and estrogen treatment is required for progesterone-evoked dopamine release or progesterone-related sexual behaviors (Dluzen and Ramirez, 1990; Moffatt et al., 1998). Given the complex interplay of the central nervous system with the reproductive system and immune system, further studies with a larger sample size should be conducted to explore this sensory-neuroendocrine interaction in speech motor control.
Overall, the present study probed the interaction between gonadal steroids and auditory-motor integration for voice control across the menstrual cycle. The results showed the modulation of vocal and cortical responses in compensating for pitch errors in voice auditory feedback across the menstrual cycle phase and revealed a significant negative correlation between the magnitudes of vocal responses and estradiol levels. Despite the poor understanding of the biological mechanisms underlying the regulation of auditory-vocal integration, we believe that these findings provide neurobehavioral evidence for the impact of menstrual cycle on the auditory-motor integration in humans. Our results suggest that caution must be exercised in interpreting the impact of menstrual variation on behavioral performance in women of reproductive age.
XZ, YN, XC, and HL designed the experiment; XZ, YN, WL, and ZZ performed the experiment and analyzed the data; XZ, YN, WL, PL, XC, and HL interpreted the results and wrote the manuscript; all authors read and approved the final manuscript.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This study was funded by grants from National Natural Science Foundation of China (Nos. 31371135, 81301675, and 81472154), Guangdong Natural Science Funds for Distinguished Young Scholar (No. S2013050014470), Guangdong Province Science and Technology Plan Project (No. 2013B022000058), Guangzhou Science and Technology Programme (No. 201604020115), and the Fundamental Research Funds for the Central Universities (Nos. 15ykjc13b). XZ, YN, and WL contributed equally to this work.
Bauer, J. J., Mittal, J., Larson, C. R., and Hain, T. C. (2006). Vocal responses to unanticipated perturbations in voice loudness feedback: an automatic mechanism for stabilizing voice amplitude. J. Acoust. Soc. Am. 119, 2363–2371. doi: 10.1121/1.2173513
Behroozmand, R., Karvelis, L., Liu, H., and Larson, C. R. (2009). Vocalization-induced enhancement of the auditory cortex responsiveness during voice F0 feedback perturbation. Clin. Neurophysiol. 120, 1303–1312. doi: 10.1016/j.clinph.2009.04.022
Behroozmand, R., Liu, H., and Larson, C. R. (2011). Time-dependent neural processing of auditory feedback during voice pitch error detection. J. Cogn. Neurosci. 23, 1205–1217. doi: 10.1162/jocn.2010.21447
Behroozmand, R., Shebek, R., Hansen, D. R., Oya, H., Robin, D. A., Howard, M. A. III, et al. (2015). Sensory-motor networks involved in speech production and motor control: an fMRI study. Neuroimage 109, 418–428. doi: 10.1016/j.neuroimage.2015.01.040
Çelik, Ö., Çelik, A., Atespare, A., Boyaci, Z., Çelebi, S., Gündûz, T., et al. (2013). Voice and speech changes in various phases of menstrual cycle. J. Voice 27, 622–626. doi: 10.1016/j.jvoice.2013.02.006
Chang, E. F., Niziolek, C. A., Knight, R. T., Nagarajan, S. S., and Houde, J. F. (2013). Human cortical sensorimotor network underlying feedback control of vocal pitch. Proc. Natl. Acad. Sci. U.S.A. 110, 2653–2658. doi: 10.1073/pnas.1216827110
Chen, Z., Liu, P., Wang, E. Q., Larson, C. R., Huang, D., and Liu, H. (2012). ERP correlates of language-specific processing of auditory pitch feedback during self-vocalization. Brain Lang. 121, 25–34. doi: 10.1016/j.bandl.2012.02.004
D'Haeseleer, E., Depypere, H., Claeys, S., Van Borsel, J., and Van Lierde, K. (2009). The menopause and the female larynx, clinical aspects and therapeutic options: a literature review. Maturitas 64, 27–32. doi: 10.1016/j.maturitas.2009.06.009
Dluzen, D. E., and Ramirez, V. D. (1990). In vitro progesterone modulates amphetamine-stimulated dopamine release from the corpus striatum of castrated male rats treated with estrogen. Neuroendocrinology 52, 517–520. doi: 10.1159/000125637
Dreher, J. C., Schmidt, P. J., Kohn, P., Furman, D., Rubinow, D., and Berman, K. F. (2007). Menstrual cycle phase modulates reward-related neural function in women. Proc. Natl. Acad. Sci. U.S.A. 104, 2465–2470. doi: 10.1073/pnas.0605569104
Felton, T. M., and Auerbach, S. B. (2004). Changes in gamma-aminobutyric acid tone and extracellular serotonin in the dorsal raphe nucleus over the rat estrous cycle. Neuroendocrinology 80, 152–157. doi: 10.1159/000082356
Ferguson, B. J., Hudson, W. R., and McCarty, K. S. Jr. (1987). Sex steroid receptor distribution in the human larynx and laryngeal carcinoma. Arch. Otolaryngol. Head Neck Surg. 113, 1311–1315. doi: 10.1001/archotol.1987.01860120057008
Fernandez, G., Weis, S., Stoffel-Wagner, B., Tendolkar, I., Reuber, M., Beyenburg, S., et al. (2003). Menstrual cycle-dependent neural plasticity in the adult human brain is hormone, task, and region specific. J. Neurosci. 23, 3790–3795.
Follesa, P., Concas, A., Porcu, P., Sanna, E., Serra, M., Mostallino, M. C., et al. (2001). Role of allopregnanolone in regulation of GABA(A) receptor plasticity during long-term exposure to and withdrawal from progesterone. Brain Res. Rev. 37, 81–90. doi: 10.1016/S0165-0173(01)00125-4
Goldstein, J. M., Seidman, L. J., Horton, N. J., Makris, N., Kennedy, D. N., Caviness, V. S. Jr., et al. (2001). Normal sexual dimorphism of the adult human brain assessed by in vivo magnetic resonance imaging. Cereb. Cortex 11, 490–497. doi: 10.1093/cercor/11.6.490
Guimaraes, P., Frisina, S. T., Mapes, F., Tadros, S. F., Frisina, D. R., and Frisina, R. D. (2006). Progestin negatively affects hearing in aged women. Proc. Natl. Acad. Sci. U.S.A. 103, 14246–14249. doi: 10.1073/pnas.0606891103
Guo, Z., Huang, X., Wang, M., Jones, J. A., Dai, Z., Li, W., et al. (2016). Regional homogeneity of intrinsic brain activity correlates with auditory-motor processing of vocal pitch errors. NeuroImage 142, 565–575. doi: 10.1016/j.neuroimage.2016.08.005
Hawco, C. S., Jones, J. A., Ferretti, T. R., and Keough, D. (2009). ERP correlates of online monitoring of auditory feedback during vocalization. Psychophysiology 46, 1216–1225. doi: 10.1111/j.1469-8986.2009.00875.x
Huang, X., Chen, X., Yan, N., Jones, J. A., Wang, E. Q., Chen, L., et al. (2016). The impact of Parkinson's disease on the cortical mechanisms that support auditory-motor integration for voice control. Hum. Brain Mapp. 37, 4248–4261. doi: 10.1002/hbm.23306
Juckel, G., Hegerl, U., Molnar, M., Csepe, V., and Karmos, G. (1999). Auditory evoked potentials reflect serotonergic neuronal activity–a study in behaving cats administered drugs acting on 5-HT1A autoreceptors in the dorsal raphe nucleus. Neuropsychopharmacology 21, 710–716. doi: 10.1016/S0893-133X(99)00074-3
Kaura, V., Ingram, C. D., Gartside, S. E., Young, A. H., and Judge, S. J. (2007). The progesterone metabolite allopregnanolone potentiates GABA(A) receptor-mediated inhibition of 5-HT neuronal activity. Eur. Neuropsychopharm. 17, 108–115. doi: 10.1016/j.euroneuro.2006.02.006
Liu, H., Behroozmand, R., Bove, M., and Larson, C. R. (2011c). Laryngeal electromyographic responses to perturbations in voice pitch auditory feedback. J. Acoust. Soc. Am. 129, 3946–3954. doi: 10.1121/1.3575593
Liu, H., Meshman, M., Behroozmand, R., and Larson, C. R. (2011a). Differential effects of perturbation direction and magnitude on the neural processing of voice pitch feedback. Clin. Neurophysiol. 122, 951–957. doi: 10.1016/j.clinph.2010.08.010
Liu, P., Chen, Z., Jones, J. A., Huang, D., and Liu, H. (2011b). Auditory feedback control of vocal pitch during sustained vocalization: a cross-sectional study of adult aging. PLoS ONE 6:e22791. doi: 10.1371/journal.pone.0022791
Liu, Y., Hu, H., Jones, J. A., Guo, Z., Li, W., Chen, X., et al. (2015). Selective and divided attention modulates auditory-vocal integration in the processing of pitch feedback errors. Eur. J. Neurosci. 42, 1895–1904. doi: 10.1111/ejn.12949
Moffatt, C. A., Rissman, E. F., Shupnik, M. A., and Blaustein, J. D. (1998). Induction of progestin receptors by estradiol in the forebrain of estrogen receptor-alpha gene-disrupted mice. J. Neurosci. 18, 9556–9563.
Parkinson, A. L., Flagmeier, S. G., Manes, J. L., Larson, C. R., Rogers, B., and Robin, D. A. (2012). Understanding the neural mechanisms involved in sensory control of voice production. NeuroImage 61, 314–322. doi: 10.1016/j.neuroimage.2012.02.068
Raj, A., Gupta, B., Chowdhury, A., and Chadha, S. (2010). A study of voice changes in various phases of menstrual cycle and in postmenopausal women. J. Voice 24, 363–368. doi: 10.1016/j.jvoice.2008.10.005
Remage-Healey, L., Coleman, M. J., Oyama, R. K., and Schlinger, B. A. (2010). Brain estrogens rapidly strengthen auditory encoding and guide song preference in a songbird. Proc. Natl. Acad. Sci. U.S.A. 107, 3852–3857. doi: 10.1073/pnas.0906572107
Scheerer, N. E., Behich, J., Liu, H., and Jones, J. A. (2013). ERP correlates of the magnitude of pitch errors detected in the human voice. Neuroscience 240, 176–185. doi: 10.1016/j.neuroscience.2013.02.054
Schirmer, A., Escoffier, N., Li, Q. Y., Li, H., Strafford-Wilson, J., and Li, W. I. (2008). What grabs his attention but not hers? Estrogen correlates with neurophysiological measures of vocal change detection. Psychoneuroendocrinology 33, 718–727. doi: 10.1016/j.psyneuen.2008.02.010
Serra, A., Maiolino, L., Agnello, C., Messina, A., and Caruso, S. (2003). Auditory brain stem response throughout the menstrual cycle. Ann. Oto. Rhinol. Laryngol. 112, 549–553. doi: 10.1177/000348940311200612
Sisneros, J. A., Forlano, P. M., Deitcher, D. L., and Bass, A. H. (2004). Steroid-dependent auditory plasticity leads to adaptive coupling of sender and receiver. Science 305, 404–407. doi: 10.1126/science.1097218
Smith, M. J., Adams, L. F., Schmidt, P. J., Rubinow, D. R., and Wassermann, E. M. (2003). Abnormal luteal phase excitability of the motor cortex in women with premenstrual syndrome. Biol. Psychiat. 54, 757–762. doi: 10.1016/S0006-3223(02)01924-8
Stell, B. M., Brickley, S. G., Tang, C. Y., Farrant, M., and Mody, I. (2003). Neuroactive steroids reduce neuronal excitability by selectively enhancing tonic inhibition mediated by delta subunit-containing GABAA receptors. Proc. Natl. Acad. Sci. U.S.A. 100, 14439–14444. doi: 10.1073/pnas.2435457100
Wang, M., Seippel, L., Purdy, R. H., and Bãckström, T. (1996). Relationship between symptom severity and steroid variation in women with premenstrual syndrome: study on serum pregnenolone, pregnenolone sulfate, 5 alpha-pregnane-3,20-dione and 3 alpha-hydroxy-5 alpha-pregnan-20-one. J. Clin. Endocrinol. Metab. 81, 1076–1082.
Keywords: auditory feedback, auditory-vocal integration, menstrual cycle, estradiol, progesterone
Citation: Zhu X, Niu Y, Li W, Zhang Z, Liu P, Chen X and Liu H (2016) Menstrual Cycle Phase Modulates Auditory-Motor Integration for Vocal Pitch Regulation. Front. Neurosci. 10:600. doi: 10.3389/fnins.2016.00600
Received: 26 July 2016; Accepted: 15 December 2016;
Published: 27 December 2016.
Edited by:Guy Madison, Umeå University, Sweden
Reviewed by:Benjamin Rich Zendel, Memorial University of Newfoundland, Canada
Michel Belyk, Maastricht University, Netherlands
Copyright © 2016 Zhu, Niu, Li, Zhang, Liu, Chen and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hanjun Liu, firstname.lastname@example.org