Skip to main content

MINI REVIEW article

Front. Hum. Neurosci., 19 March 2024
Sec. Speech and Language
Volume 18 - 2024 | https://doi.org/10.3389/fnhum.2024.1364803

Using altered auditory feedback to study pitch compensation and adaptation in tonal language speakers

  • Human Communication, Development, and Information Sciences, Faculty of Education, The University of Hong Kong, Pokfulam, Hong Kong SAR, China

Human speech production is strongly influenced by the auditory feedback it generates. Auditory feedback-what we hear when we speak-enables us to learn and maintain speaking skills and to rapidly correct errors in our speech. Over the last three decades, the real-time altered auditory feedback (AAF) paradigm has gained popularity as a tool to study auditory feedback control during speech production. This method involves changing a speaker’s speech and feeding it back to them in near real time. More than 50% of the world’s population speak tonal languages, in which the pitch or tone used to pronounce a word can change its meaning. This review article aims to offer an overview of the progression of AAF paradigm as a method to study pitch motor control among speakers of tonal languages. Eighteen studies were included in the current mini review and were compared based on their methodologies and results. Overall, findings from these studies provide evidence that tonal language speakers can compensate and adapt when receiving inconsistent and consistent pitch perturbations. Response magnitude and latency are influenced by a range of factors. Moreover, by combining AAF with brain stimulation and neuroimaging techniques, the neural basis of pitch motor control in tonal language speakers has been investigated. To sum up, AAF has been demonstrated to be an emerging tool for studying pitch motor control in speakers of tonal languages.

1 Introduction

Speaking is a highly complicated motor behavior. Fluent speech requires not only precise coordination of many muscles, but also integration of auditory and somatosensory feedback. Auditory feedback (i.e., the sounds of our own voice), in particular, is widely acknowledged to be essential for speech production and is a central part of models of speech motor control (Hickok et al., 2011; Guenther, 2016; Parrell et al., 2019). In recent three decades, the real-time altered auditory feedback (AAF) experimental paradigm has provided plentiful evidence that auditory feedback plays a critical role in both online feedback control (e.g., Burnett et al., 1998; Hain et al., 2000) and in updating feedforward control (e.g., Houde and Jordan, 2002; Purcell and Munhall, 2006). Perturbations of auditory feedback introduce prediction errors (i.e., discrepancies) between the predicted and actual auditory feedback. When auditory feedback is perturbed inconsistently, an opposing response (compensation) that changes in the opposite direction to the perturbation direction will be elicited to reduce the perceived error within an ongoing production. When auditory feedback is perturbed consistently, speakers can gradually modify their motor plans over multiple successive utterances (adaptation) to integrate information from those errors.

By using the AAF paradigm, previous studies have demonstrated pitch compensation and adaptation across various contexts, including singing (Natke et al., 2003), nonspeech vocalization (Larson et al., 2001), and speech production (Jones and Munhall, 2002). Those studies, however, have primarily focused on non-tonal languages (predominantly English), where pitch has a less prominent suprasegmental role. In contrast, around 70% of the world’s languages use changes in pitch (i.e., tones) to alter word meaning, which serve an important linguistic function (Yip, 2002). Such a major difference could potentially affect speakers’ pitch control and determine the extent to which they adjust their pitch in response to pitch perturbations. In recent decades, to investigate the possible influence of a speaker’s entire language system on pitch control, researchers started to examine pitch compensation and adaptation in tonal language speakers. The goal of this mini review is to provide a summary of the progression of altered auditory feedback (AAF) as a method to understand pitch motor control in tonal language speakers. In total, 18 studies (see Table 1) using the AAF paradigm to investigate pitch compensation and adaptation in tonal language speakers were reviewed.

Table 1
www.frontiersin.org

Table 1. Summary of the studies included in the review, sorted by study types: pitch compensation (dark yellow), pitch adaptation (dark blue) and neurobiology findings (orange).

2 Pitch compensation

Previous studies in non-tonal language speakers have shown that participants lowered their pitch when F0 was shifted up and raised their pitch when it was shifted down (e.g., Hain et al., 2000; Jones and Munhall, 2000). Jones and Munhall (2002) first investigated pitch compensation in tonal language speakers. They found that similar to non-tonal language speakers, Mandarin speakers changed their pitch productions to the opposite direction of the manipulation when inconsistent pitch perturbations (+ 100 cents or no perturbation) were introduced to their auditory feedback during production of Mandarin word /ma/ with a high level tone (tone 1, ‘mother’). Mean response latency was 211 ms, which was longer than non-tonal language speakers producing sustained vowels (114 ms, Hain et al., 2000; 130 ms, Larson et al., 2001). The response magnitudes, however, were not reported in Jones and Munhall (2002). In a later study, compensatory responses to pitch perturbation (±100 cents or no perturbation) were examined in Mandarin speakers when they produced disyllabic Mandarin phrases (/ma ma/) with different tonal contours, High-High (H-H), High-Rising (H-R), and High-Falling (H-F) (Xu et al., 2004). They found that Mandarin speakers had larger compensatory responses (49–84 cents) when experiencing pitch perturbation to mandarin bitonal sequences, than English speakers’ compensatory responses to pitch perturbation when producing the sustained vowel /a/ (30 ~ 40 cents, Hain et al., 2000; Larson et al., 2001). The results seem to suggest that pitch feedback control in tonal languages is more sensitive than that in non-tonal languages. The same research group further investigated the compensatory responses to pitch perturbation in Mandarin speakers when producing Mandarin sentence (/mao1 mi1 mo1 ma1/, ‘kitty touches mom’) with different intonation patterns (statement v.s question) (Liu et al., 2009). Compared to meaningless disyllabic Mandarin phrases, generally smaller compensatory responses (20 ~ 30 cents) were elicited during Mandarin sentence productions.

Ning et al. (2014) conducted an initial study where they directly compared pitch compensation between speakers with and without tonal language experience. Three groups of participants (native English speakers who were never exposed to tonal languages, L2 learners of Mandarin whose native language was English, native Mandarin speakers) vocalizing the same vowel /a/ while receiving F0 perturbation with different magnitudes (±50 and ± 100 cents). No significant difference between groups were found in response amplitude or latency. In addition, the authors modeled the entire F0 contours using generalized additive models and found that F0 contours of the native English speakers and L2 learners were affected by perturbation magnitude, but Mandarin speakers were not. The authors thus suggested that instead of increasing pitch-control sensitivity, tonal language experience may contribute to the development of a more stable vocal control. In a later study, the same group of researchers further investigated whether compensatory responses induced by pitch perturbation were affected by language or vocal training experiences (Ning et al., 2015). The compensatory responses of four groups of participants (the same three language groups as in their previous study and trained vocalists with English as their native language) during production of Mandarin bitonal sequences /ma1 ma1/, /ma1 ma2/, /ma1 ma4/, and the vowel /a/ were examined. In contrast to previous findings, both native Mandarin speakers and trained vocalists showed significantly reduced response amplitudes when producing sustained vowel compared to native English speakers. During productions of Mandarin bitonal sequences, only native Mandarin speakers, but not trained vocalists, exhibited significantly smaller magnitude deviation in comparison to native English speakers. The authors concluded that tonal language speakers have more robust or stable pitch control during both linguistic (Mandarin bitonal sequences) and nonlinguistic (sustained vowel) productions and are thus less affected by auditory feedback perturbations. Liu et al. (2010) showed that pitch motor control even differed across different tonal languages. Cantonese speakers exhibited significantly smaller responses than Mandarin speakers when receiving relatively large pitch perturbations (±200, or ± 500 cents).

More recently, Ning (2019, 2022b) conducted two studies in speakers of Taiwanese Southern Min to investigate the impact of pitch contour on pitch compensation. The author proposed that larger and faster compensatory responses would occur when the shifted pitch overlaps with another level tone. The results showed that when the H level tone of the HM word (/un55-ding33/) was downshifted by 250 cents to the M level, and when the L level tone of the LM word (/un11-tong33/) was upshifted by 150 cents to the M level, larger compensatory responses were elicited. However, no such effect was found for the MM word (/un33-too33/) (Ning, 2019). Specifically, upshifting the M level tone of the MM word to the H level or downshifting the M level tone of the MM word to the L level did not result in larger pitch compensation. In the later study, the author further examined how different pitch contours would affect pitch compensation and proposed that pitch perturbation to the flat contour (e.g., MM), where speakers could maintain stability more easily, would result in smaller pitch compensation compared to the rising (e.g., LM) and falling (e.g., HM) contours. The results support the author’s hypothesis, indicating that flat pitch contour exhibited a greater resistance to pitch perturbation compared to the rising or falling pitch contours. However, unlike the previous study, downshifting the rising contour or upshifting the falling contour (i.e., when the shifted pitch overlapped with the M tone) did not result in larger pitch compensation (Ning, 2022b).

2.1 Interim summary

In sum, by using the AAF paradigm, existing work has shown that tonal language speakers are able to compensate for pitch perturbation during production of sustained vowel, nonsense and meaningful speech. Response magnitudes were relatively smaller during meaningful speech in comparison to nonsense speech. In terms of the influence of the two different language systems (tonal vs. non-tonal languages) on pitch compensation, we still cannot reach a consensus. Early study found larger compensatory responses in Mandarin speakers than that in English speakers, suggesting tonal language speakers are more sensitive to pitch perturbation (Xu et al., 2004). However, different stimuli were used in previous tonal and non-tonal language studies. Ning and colleagues directly compared pitch compensation between tonal and non-tonal language speakers using the same stimuli and perturbations. The results showed that tonal language speakers have attenuated rather than enhanced compensatory responses when experiencing pitch perturbation, suggesting that tonal language speakers may have a more entrenched pitch control system, making them less susceptible to auditory perturbations (Ning et al., 2015). However, it should be noted that in their earlier study, no significant difference was found in response amplitude between tonal and non-tonal language speakers (Ning et al., 2014). The mixed results may stem from large variability among participants and relatively small sample sizes in previous studies. A fairly large portion of people in auditory perturbation studies demonstrated minimal compensation (10–40%, Lametti et al., 2012) or even exhibited following responses (i.e., production altered in the same direction as the perturbation) (23%, MacDonald et al., 2011). A recent study showed that following responses can reach 35–55% in tonal language speakers when experiencing pitch perturbation (Ning, 2022a). Even though such large variability is found among participants, a sample size of 10 (or less) per group is typical in the reviewed studies (see Table 1). To gain more clarity on this significant question, future research with a larger sample size is necessary.

3 Pitch adaptation and aftereffect

3.1 Sensorimotor adaptation

With consistent auditory feedback perturbation, speakers can exhibit sensorimotor adaptation, in which they gradually modify and update their feedforward commands of the motoric plan over multiple successive utterances. Pitch adaptation in tonal language speakers was first examined in the study of Jones and Munhall (2002), in which they found Mandarin speakers adapted for pitch perturbations during Mandarin word production (/ma1/). Specifically, participants increased their F0 in both shift-down and shift-up conditions (±100 cents) but the increase was significantly larger in the shift-down condition than in the shift-up condition. In a later study, the same researchers showed that Mandarin speakers lowered their F0 in response to consistent F0 shift-up. Moreover, they found that adaptation for one tone category (tone 1) can be generalized to the production of another tone category (tone 2) (Jones and Munhall, 2005). A more recent study investigated simultaneous adaptation to pitch perturbation in Mandarin speakers, in which single and simultaneous perturbations to different tone categories were applied during the production of Mandarin words (Feng et al., 2018). When receiving single perturbation (−100 cents on tone 1), Mandarin speakers exhibited significant adaptation on the shifted tone. Such adaptation generalized to production of tone 4 (high falling tone), but not tone 2 (mid rising tone) nor tone 3 (low dipping tone). When receiving simultaneous tone 1 (shift-down) and tone 3 (shift-up) perturbations, no significant tone 1 adaptation was observed, and tone 3 adaptation occurred only if tone 2 was also produced.

Ning (2018, 2020) examined the influences of tonal language and vocal training experiences on pitch adaptation. In the study of Ning (2018), pitch adaptation during productions of sustained /a/ and /ma1/ in L2 Mandarin beginners, L2 Mandarin advanced learners, and native Mandarin speakers were examined. L2 beginners showed greater adaption (70 cents) than L2 advanced learners (61 cents) and native Mandarin speakers (56 cents). This is consistent with the author’s previous findings in pitch compensation, suggesting greater exposure to tonal languages would lead to a more stable pitch control system which is less affected by auditory feedback perturbations. In the author’s later study, pitch adaptation in trained vocalists and non-vocalists whose native language is Mandarin were compared (Ning, 2020). Overall, when it comes to pitch shift during the production of /a/, /ma1/, or /ma2/, vocalists exhibited less adaptation than non-vocalists, indicating that similar to tonal language experience, vocal training also contributes to a more entrenched pitch control system.

3.2 Aftereffects

Adaptive changes persist even after feedback is returned to normal (aftereffects), which is considered as a clear sign of changes in feedforward control (Jones and Munhall, 2000; Lametti et al., 2018). However, mixed results have been reported regarding the aftereffects during pitch adaptation.

An early study examining aftereffects in English speakers found that English speakers decreased their F0 in the shift-down condition, while they kept increasing their F0 in the shift-up condition, when the feedback returned to normal (Jones and Munhall, 2000). Similar pattern of aftereffects has been observed in Mandarin speakers during Mandarin word production (ma1) (Jones and Munhall, 2002). Moreover, the aftereffects can be generalized from one tone category to another tone category (Jones and Munhall, 2005). Feng et al. (2018), however, reported a different pattern of aftereffects in their native Mandarin speakers’ tone production. Specifically, in the shift-down condition on tone 1, after feedback was restored to normal, speakers did not decrease their F0. Instead, they held their F0 at approximately the same level as observed during the baseline phase and throughout the perturbation phases.

In more recent studies of Ning (2018, 2020), aftereffects were found to vary across stimuli and shift direction. In Ning (2018), all three groups with different tonal language experience (see 3.1) exhibited significant aftereffects in the downward shift of /ma1/, whereas no significant aftereffects were observed in the upward shift. The author suggested that reducing F0 for the upward shift in the high flat tone (ma1) might require less correction effort compared to increasing it for the downward shift. Consequently, in the case of the upward shift, participants’ F0 quickly returned to baseline levels once the auditory perturbation was removed. No significant aftereffect during production of /a/ was found neither (except for L2 advanced learners’ upward shift /a/), which contradicts previous findings (Jones and Keough, 2008; Keough and Jones, 2009). In the later study, although statistical evidence was not provided, aftereffects appeared to be present in the responses of all speakers to the downward shift of /ma1/ and /ma2/, but not the upward shift (Ning, 2020).

3.3 Interim summary

The reviewed studies have shown that tonal language speakers are able to adapt when consistent perturbations in pitch feedback are introduced. However, there is mixed evidence regarding the aftereffects. The discrepancies may be attributed to large participants variability and methodological differences, including the number of produced tone categories (a single tone vs. all four tone categories) and production stimuli (sustained vowel vs. word production). Further research is needed to clarify these points.

4 Neurobiology findings

By combining AAF paradigm with brain stimulation and neuroimaging techniques, a series of studies have been conducted by Liu’s group to investigate the neural basis of pitch control in tonal language speakers (e.g., Liu et al., 2020, 2023; Li et al., 2022). They have demonstrated the causal role of left dorsolateral prefrontal cortex (DLPFC) in pitch compensation by showing that inhibitory continuous theta burst stimulation (cTBS) over DLPFC elicited larger compensatory responses and smaller event-related potential (ERP) P2 responses in Mandarin speakers experiencing pitch perturbation (+200 or +500 cents), compared to sham stimulation (Liu et al., 2020). In contrast, transcranial direct current stimulation (tDCS) over DLPFC led to significantly faster but smaller compensatory responses than sham stimulation (Chang et al., 2023). The authors suggested that DLPFC is involved in top-down inhibitory control over vocal motor behavior. Altering the function of left DLPFC, either by disrupting or enhancing it, can result in impairment or enhancement of this top–down inhibitory control, which regulates the extent to which feedback perturbations influence speech production. The involvement of left inferior frontal gyrus (IFG) and supplementary motor cortex (SMA) in pitch compensation in Mandarin speakers have also been demonstrated in studies of Dai et al. (2022) and Li et al. (2023), in which enhanced or reduced compensatory responses to pitch perturbation were observed after transcranial alternating current stimulation (tACS) over left IFG or cTBS over left SMA, respectively.

Previous research has indicated that the posterior superior temporal gyrus (pSTG) plays a critical role in both speech perception and production (Okada and Hickok, 2006). The dual-stream model suggests that the pSTG, predominantly in the left hemisphere (Hickok and Poeppel, 2007), is responsible for encoding auditory-phonological representations, while the Directions Into Velocities of Articulators (DIVA) model proposes that both the left and right pSTG are involved in auditory feedback control (Guenther et al., 2006). However, it is important to note that these models are primarily based on findings from English speakers. In a recent study, a right-lateralized contribution of the pSTG to pitch feedback control has been found in tonal language speakers (Liu et al., 2023). More specifically, cTBS over right, but not left pSTG resulted in reduced compensations for pitch perturbations accompanied by smaller N1 and larger P2 responses in Mandarin speakers. In contrast, both the left and right supramarginal gyrus (SMG) has been found to causally contribute to pitch compensation in Mandarin speakers (Li et al., 2022).

4.1 Interim summary

Overall, by combining AAF paradigm with brain stimulation and neuroimaging techniques, several brain areas, including PFC, pSTG, SMG have been demonstrated to play an important role in pitch compensation in tonal language speakers. It is worth noting that in those studies, the response magnitudes were generally smaller (~15 cents) compared to the magnitudes observed in behavioral studies (~30 cents). It remains unclear whether the differences can be attributed to the presence of neuroimaging/brain stimulation techniques or the use of different stimuli (/u/ v.s /a/ or /ma/). In addition, we did not find any studies that investigate the neural basis of pitch adaptation in tonal language speakers.

5 Conclusion

By using AAF paradigm, pitch compensation and adaptation has been demonstrated in tonal language speakers. The compensation and adaptation are under the influence of various factors, such as the type of perturbation stimuli, the speaker’s language background, and their vocal training. However, the exact influence of tonal language experience on pitch control still remains unclear.

Author contributions

D-LT: Writing – original draft, Writing – review & editing, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by The University of Hong Kong (Seed Fund for Basic Research for New Staff and Project-Based Research Funding).

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Burnett, T. A., Freedland, M. B., Larson, C. R., and Hain, T. C. (1998). Voice F0 responses to manipulations in pitch feedback. J. Acoust. Soc. Am. 103, 3153–3161. doi: 10.1121/1.423073

PubMed Abstract | Crossref Full Text | Google Scholar

Chang, Y., Peng, D., Zhao, Y., Chen, X., Li, J., Wu, X., et al. (2023). Transcranial direct current stimulation over left dorsolateral prefrontal cortex facilitates auditory-motor integration for vocal pitch regulation. Front. Neurosci. 17:1208581. doi: 10.3389/fnins.2023.1208581

PubMed Abstract | Crossref Full Text | Google Scholar

Dai, G., Chen, M., Chen, X., Guo, Z., Li, T., Jones, J. A., et al. (2022). A causal link between left supplementary motor area and auditory-motor control of vocal production: evidence by continuous theta burst stimulation. NeuroImage 264:119767. doi: 10.1016/j.neuroimage.2022.119767

PubMed Abstract | Crossref Full Text | Google Scholar

Feng, Y., Xiao, Y., Yan, Y., and Max, L. (2018). Adaptation in mandarin tone production with pitch-shifted auditory feedback: influence of tonal contrast requirements. Lang. Cognit. Neurosci. 33, 734–749. doi: 10.1080/23273798.2017.1421317

PubMed Abstract | Crossref Full Text | Google Scholar

Guenther, F. H. (2016). Neural control of speech. Cambridge, MA: The MIT Press.

Google Scholar

Guenther, F. H., Ghosh, S. S., and Tourville, J. A. (2006). Neural modeling and imaging of the cortical interactions underlying syllable production. Brain Lang. 96, 280–301. doi: 10.1016/j.bandl.2005.06.001

PubMed Abstract | Crossref Full Text | Google Scholar

Hain, T. C., Burnett, T. A., Kiran, S., Larson, C. R., Singh, S., and Kenney, M. K. (2000). Instructing subjects to make a voluntary response reveals the presence of two components to the audio-vocal reflex. Exp. Brain Res. 130, 133–141. doi: 10.1007/s002219900237

PubMed Abstract | Crossref Full Text | Google Scholar

Hickok, G., Houde, J., and Rong, F. (2011). Sensorimotor integration in speech processing: computational basis and neural organization. Neuron 69, 407–422. doi: 10.1016/j.neuron.2011.01.019

PubMed Abstract | Crossref Full Text | Google Scholar

Hickok, G., and Poeppel, D. (2007). The cortical organization of speech processing. Nat. Rev. Neurosci. 8, 393–402. doi: 10.1038/nrn2113

Crossref Full Text | Google Scholar

Houde, J. F., and Jordan, M. I. (2002). Sensorimotor adaptation of speech I: compensation and adaptation. J. Speech Lang. Hear. Res. 45, 295–310. doi: 10.1044/1092-4388(2002/023)

Crossref Full Text | Google Scholar

Jones, J. A., and Keough, D. (2008). Auditory-motor mapping for pitch control in singers and nonsingers. Exp. Brain Res. 190, 279–287. doi: 10.1007/s00221-008-1473-y

PubMed Abstract | Crossref Full Text | Google Scholar

Jones, J. A., and Munhall, G. K. (2000). Perceptual calibration of F0 production: evidence from feedback pertubation. J. Acoust. Soc. Am. 108, 1246–1251. doi: 10.1121/1.1288414

Crossref Full Text | Google Scholar

Jones, J. A., and Munhall, K. G. (2002). The role of auditory feedback during phonation: studies of mandarin tone production. J. Phon. 30, 303–320. doi: 10.1006/jpho.2001.0160

Crossref Full Text | Google Scholar

Jones, J. A., and Munhall, K. G. (2005). Remapping auditory-motor representations in voice production. Curr. Biol. 15, 1768–1772. doi: 10.1016/j.cub.2005.08.063

Crossref Full Text | Google Scholar

Keough, D., and Jones, J. A. (2009). The sensitivity of auditory-motor representations to subtle changes in auditory feedback while singing. J. Acoust. Soc. Am. 126, 837–846. doi: 10.1121/1.3158600

PubMed Abstract | Crossref Full Text | Google Scholar

Lametti, D. R., Nasir, S. M., and Ostry, D. J. (2012). Sensory preference in speech production revealed by simultaneous alteration of auditory and somatosensory feedback. J. Neurosci. 32, 9351–9358. doi: 10.1523/JNEUROSCI.0404-12.2012

PubMed Abstract | Crossref Full Text | Google Scholar

Lametti, D. R., Smith, H. J., Freidin, P. F., and Watkins, K. E. (2018). Cortico-cerebellar networks drive sensorimotor learning in speech. J. Cogn. Neurosci. 30, 540–551. doi: 10.1162/jocn_a_01216

PubMed Abstract | Crossref Full Text | Google Scholar

Larson, C. R., Burnett, T. A., Bauer, J. J., Kiran, S., and Hain, T. C. (2001). Comparison of voice F0 responses to pitch-shift onset and offset conditions. J. Acoust. Soc. Am. 110, 2845–2848. doi: 10.1121/1.1417527

PubMed Abstract | Crossref Full Text | Google Scholar

Li, T., Chang, Y., Zhao, S., Jones, J. A., Chen, X., Gan, C., et al. (2023). The left inferior frontal gyrus is causally linked to vocal feedback control: evidence from high-definition transcranial alternating current stimulation. Cereb. Cortex 33, 5625–5635. doi: 10.1093/cercor/bhac447

PubMed Abstract | Crossref Full Text | Google Scholar

Li, T., Zhu, X., Wu, X., Gong, Y., Jones, J. A., Liu, P., et al. (2022). Continuous theta burst stimulation over left and right supramarginal gyri demonstrates their involvement in auditory feedback control of vocal production. Cereb. Cortex 33, 11–22. doi: 10.1093/cercor/bhac049

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, D., Chang, Y., Dai, G., Guo, Z., Jones, J. A., Li, T., et al. (2023). Right, but not left, posterior superior temporal gyrus is causally involved in vocal feedback control. NeuroImage 278:120282. doi: 10.1016/j.neuroimage.2023.120282

Crossref Full Text | Google Scholar

Liu, D., Dai, G., Liu, C., Guo, Z., Xu, Z., Jones, J. A., et al. (2020). Top–down inhibitory mechanisms underlying auditory–motor integration for voice control: evidence by TMS. Cereb. Cortex 30, 4515–4527. doi: 10.1093/cercor/bhaa054

Crossref Full Text | Google Scholar

Liu, H., Wang, E. Q., Chen, Z., Liu, P., Larson, C. R., and Huang, D. (2010). Effect of tonal native language on voice fundamental frequency responses to pitch feedback perturbations during sustained vocalizations. J. Acoust. Soc. Am. 128, 3739–3746. doi: 10.1121/1.3500675

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, H., Xu, Y., and Larson, C. R. (2009). Attenuation of vocal responses to pitch perturbations during mandarin speech. J. Acoust. Soc. Am. 125, 2299–2306. doi: 10.1121/1.3081523

PubMed Abstract | Crossref Full Text | Google Scholar

MacDonald, E. N., Purcell, D. W., and Munhall, K. G. (2011). Probing the independence of formant control using altered auditory feedback. J. Acoust. Soc. Am. 129, 955–965. doi: 10.1121/1.3531932

PubMed Abstract | Crossref Full Text | Google Scholar

Natke, U., Donath, T. M., and Kalveram, K. T. (2003). Control of voice fundamental frequency in speaking versus singing. J. Acoust. Soc. Am. 113, 1587–1593. doi: 10.1121/1.1543928

PubMed Abstract | Crossref Full Text | Google Scholar

Ning, L. H. (2018). Vocal adaptation in perturbed auditory feedback for L2 learners of Mandarin. J. Chin. Lang. Teach. 15, 1–20.

Google Scholar

Ning, L.-H. (2019). Pitch-shift responses as an online monitoring mechanism during level tone production. J. Acoust. Soc. Am. 145, 2192–2197. doi: 10.1121/1.5096977

PubMed Abstract | Crossref Full Text | Google Scholar

Ning, L.-H. (2020). Sensorimotor adaptation and aftereffect to frequency-altered feedback in mandarin-speaking vocalists and non-vocalists. Concentric. Stud. Linguist. 46, 125–147. doi: 10.1075/consl.00015.nin

Crossref Full Text | Google Scholar

Ning, L.-H. (2022a). Comparison of involuntary and volitional responses to pitch-shifted auditory feedback: evidence for tone speakers’ flexibility to switch between opposing and following responses. J. Speech Lang. Hear. Res. 65, 2160–2186. doi: 10.1044/2022_JSLHR-21-00597

PubMed Abstract | Crossref Full Text | Google Scholar

Ning, L.-H. (2022b). The effect of stimulus timing in compensating for pitch perturbation on flat, rising, and falling contours. J. Acoust. Soc. Am. 151, 2530–2544. doi: 10.1121/10.0010237

PubMed Abstract | Crossref Full Text | Google Scholar

Ning, L.-H., Loucks, T. M., and Shih, C. (2015). The effects of language learning and vocal training on sensorimotor control of lexical tone. J. Phon. 51, 50–69. doi: 10.1016/j.wocn.2014.12.003

Crossref Full Text | Google Scholar

Ning, L.-H., Shih, C., and Loucks, T. M. (2014). Mandarin tone learning in L2 adults: a test of perceptual and sensorimotor contributions. Speech Comm. 63-64, 55–69. doi: 10.1016/j.specom.2014.05.001

Crossref Full Text | Google Scholar

Okada, K., and Hickok, G. (2006). Left posterior auditory-related cortices participate both in speech perception and speech production: neural overlap revealed by fMRI. Brain Lang. 98, 112–117. doi: 10.1016/j.bandl.2006.04.006

Crossref Full Text | Google Scholar

Parrell, B., Ramanarayanan, V., Nagarajan, S., and Houde, J. (2019). The FACTS model of speech motor control: fusing state estimation and task-based control. PLoS Comput. Biol. 15:e1007321. doi: 10.1371/journal.pcbi.1007321

PubMed Abstract | Crossref Full Text | Google Scholar

Purcell, D. W., and Munhall, K. G. (2006). Adaptive control of vowel formant frequency: evidence from real-time formant manipulation. J. Acoust. Soc. Am. 120, 966–977. doi: 10.1121/1.2217714

PubMed Abstract | Crossref Full Text | Google Scholar

Xu, Y., Larson, C. R., Bauer, J. J., and Hain, T. C. (2004). Compensation for pitch-shifted auditory feedback during the production of mandarin tone sequences. J. Acoust. Soc. Am. 116, 1168–1178. doi: 10.1121/1.1763952

PubMed Abstract | Crossref Full Text | Google Scholar

Yip, M. (2002). Introduction. In: Tone, pp 1–16. Cambridge: Cambridge University Press.

Google Scholar

Keywords: altered auditory feedback, pitch perturbation, tonal languages, speech motor control, sensorimotor adaptation

Citation: Tang D-l (2024) Using altered auditory feedback to study pitch compensation and adaptation in tonal language speakers. Front. Hum. Neurosci. 18:1364803. doi: 10.3389/fnhum.2024.1364803

Received: 03 January 2024; Accepted: 26 February 2024;
Published: 19 March 2024.

Edited by:

Anastasios M. Georgiou, Cyprus University of Technology, Cyprus

Reviewed by:

Jason W. Bohland, University of Pittsburgh, United States
Alessandro Tavano, Max Planck Society, Germany

Copyright © 2024 Tang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ding-lan Tang, dltang@hku.hk

Download