MINI REVIEW article
Sec. Auditory Cognitive Neuroscience
Volume 10 - 2016 | https://doi.org/10.3389/fnins.2016.00361
Sensory Entrainment Mechanisms in Auditory Perception: Neural Synchronization Cortico-Striatal Activation
- 1Service de Neuropsychologie et de Neuroréhabilitation, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- 2The Laboratory for Investigative Neurophysiology, Department of Radiology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- 3Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
The auditory system displays modulations in sensitivity that can align with the temporal structure of the acoustic environment. This sensory entrainment can facilitate sensory perception and is particularly relevant for audition. Systems neuroscience is slowly uncovering the neural mechanisms underlying the behaviorally observed sensory entrainment effects in the human sensory system. The present article summarizes the prominent behavioral effects of sensory entrainment and reviews our current understanding of the neural basis of sensory entrainment, such as synchronized neural oscillations, and potentially, neural activation in the cortico-striatal system.
Two pendulum clocks positioned on the same table synchronize over time; this is a process called “entrainment” (Huygens, 1893). Many scientific fields have adopted this terminology for conditions in which two dynamic systems align. This review focuses on sensory entrainment, that is, the behaviorally observed temporal alignment of the sensory system with its environment. In everyday situations, motor actions, such as clapping in synchrony with music or alignment of walking pace in a group of people, are the result of sensory entrainment (for a review, see Ross and Balasubramaniam, 2014; see Merchant et al., 2015). However, sensory entrainment is relevant beyond motor behavior. Our sensory environment is unimaginable without its temporal structure. Tuning in to this temporal structure is thought to be a fundamental mechanism required for efficient auditory and speech perception (for a review see Giraud and Poeppel, 2012; Golumbic et al., 2013; Zoefel and VanRullen, 2015). Such sensory entrainment is, for example, evidenced through facilitated sensory perception in the context of temporal regularity (Jones et al., 2002; Geiser et al., 2012). We review neural correlates that potentially underlie the behaviorally observed alignment of the sensory system to a temporally regular or quasi regular environment.
Behavioral Evidence of Sensory Entrainment
The behavioral effects of sensory entrainment are typically shown in the context of temporally regular, ideally isochronous, environmental stimulation in which the occurrence of the next sensory input can be temporally predicted. For example, to measure sensory-motor synchronization, listeners tap to temporally regular auditory stimulation (Nozaradan et al., 2015). Synchronization to auditory cues is more precise than to visual cues (Hove et al., 2013), although synchronization to visual and even tactile cues is also used to measure entrainment (Lange and Roeder, 2006; Fernandez Del Olmo et al., 2007; Elliott et al., 2010, 2011; Ruspantini et al., 2011). Sensory-motor synchronization tasks include not only sensory but also motor entrainment.
Pure sensory entrainment is measured in perceptual tasks. These tasks typically show facilitated perception of stimuli when they are presented in a temporal context that allows entrainment compared to a context that does not allow entrainment. In the auditory domain, auditory temporal regularity, compared to temporal irregularity, results in faster reaction times to tones in various tasks (Lange, 2009; Rimmele et al., 2011), as well as better discrimination of differences in pitch (Jones et al., 2002), intensity (Geiser et al., 2012), and duration (Barnes and Jones, 2000; McAuley and Jones, 2003). Similar effects are observed in the visual domain (Rohenkohl et al., 2012; Marchant and Driver, 2013) and cross-modally, as in cases of auditory regular temporal grids facilitating saccadic eye movement (Bolger et al., 2013; Miller et al., 2013) and improving visual word recognition and discrimination (Bolger et al., 2013; Brochard et al., 2013) and of rhythmic movement facilitating sound perception (Morillon et al., 2014). Sensory facilitation is even observed against competing task demands (Cutanda et al., 2015). Most importantly, sensory entrainment effects are observed not only when the target stimulus is presented in the context of temporal regularity but also when temporal regularity precedes the target stimulus and the target appears at a predictable point in time as defined by the preceding sequence (Ellis and Jones, 2010; Sanabria et al., 2011; Cason and Schön, 2012; Sanabria and Correa, 2013; Cason et al., 2015). For example, sound signal detection is modulated at the rate of a previously presented amplitude modulated signal (Hickok et al., 2015). Thus, a variety of experimental tasks show the temporal context sensitivity of the sensory system, indicating facilitated perception through temporal regularity. Critically, sensory entrainment is behaviorally evidenced by the internal perpetuation of previously entrained excitability of the sensory system.
Outside of the research context, strictly regular, isochronous stimulation is the exception; it is found in music, in which temporal regularity is a defining feature (Geiser et al., 2014). However, there is emerging evidence that auditory sensory entrainment is present even in the absence of strict temporal regularity. Although behavioral effects are greatest in the context of temporal isochrony, sound perception is facilitated by varying degrees of temporal expectation (Herrmann et al., 2016). The capacity of the sensory system to detect and to synchronize to the average frequency of a stream of sounds and to perpetuate this synchronization, resulting in temporal predictions, is one of the preconditions allowing the use of entrainment for processing natural stimuli such as speech.
Neural Correlates of Sensory Entrainment
The temporal context in which sounds are perceived influences neural activity. Although attention might have a modulatory effect (Hsu et al., 2014), event-related potentials (ERPs) are typically attenuated in the context of temporal regularity (Lange, 2009; Schmidt-Kassow et al., 2009; Lecaignard et al., 2015). Effects of temporal regularity are observed in the auditory N1 (Lange, 2009, 2010; Costa-Faidella et al., 2011; Rimmele et al., 2011; Sanabria and Correa, 2013) and its electromagnetic correlate N1m (Okamoto et al., 2013). Moreover, the reduction in N1 amplitude to isochronously presented tones shows the suppression of early signals, indicating a modulation of activation in secondary auditory cortices, namely the planum temporale (PT), through temporal regularity (Costa-Faidella et al., 2011). The sensitivity of sensory responses in the PT to temporal regularity is paralleled in an fMRI study on speech regularity, in which activation in the PT was modulated by temporal regularity (Geiser et al., 2008). Such modulation of neural activation by temporal regularity in primary and secondary cortices could be the result of sensory entrainment. Two mechanisms underlying sensory entrainment have been suggested, both of which may or may not be independent from each other: (1) synchronized neural oscillations in sensory and motor cortices and, potentially, (2) cortico-striatal brain activation (Figure 1). The neural correlates supporting these suggestions are reviewed in the following sections.
Figure 1. Schematic illustration of the neural correlates of sensory entrainment. Temporally structured auditory signals reach the sensory system (e.g., in the forms of speech and music). Neural correlates of sensory entrainment include synchronization of neural oscillations in the sensory cortices (Gross et al., 2013; Lakatos et al., 2013) and activation in the putamen (Geiser et al., 2012) (Figures adapted from Calderone et al., 2014 and Geiser et al., 2012).
The first neural correlate of sensory entrainment is synchronized neural oscillation. Neuronal populations in the living brain show intrinsic fluctuations of excitability at the level of the cell membrane (Fiser et al., 2004; Lakatos et al., 2005). These fluctuations can be measured as periodic waves intracranially or on the scalp, via local field potentials or electroencephalograms, respectively. They can be characterized by their frequency, amplitude, and phase and are defined as delta (2–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), beta (12–30 Hz), and gamma (30–100 Hz) bands. Neural oscillations typically synchronize across frequency bands, as has been shown in the auditory (Lakatos et al., 2005, 2013) and visual cortices (Lakatos et al., 2008). This hierarchical cross-frequency coupling (Schroeder and Lakatos, 2009) is suggested to influence neuronal interactions (Womelsdorf et al., 2007, for a review, see Fries, 2015). Importantly, intrinsic neural oscillations display the ability to phase-lock and thus entrain to external stimulation. This neuronal entrainment through phase-locking is observed in the visual (Montemurro et al., 2008), auditory (Luo and Poeppel, 2007; Besle et al., 2011), and somatosensory (Langdon et al., 2011; Ross et al., 2013) cortices, as well as cross-modally (Luo et al., 2010; Power et al., 2012). Thus, periodic neural oscillations synchronize to external stimulation within and across modalities.
The intrinsic oscillatory state of neuronal activity can affect whether a sensory cue is detected. Both a change in amplitude (power modulation) and the point in the cycle of a neural oscillation (phase) can influence target detection in the visual (Busch et al., 2009; Mathewson et al., 2009) and the auditory domains (Ng et al., 2012). Because the intrinsic oscillatory state can influence perception, entrained oscillations should likewise facilitate perception. Indeed, the phase of entrained neural delta oscillation predicts sound gap detection (Henry and Obleser, 2012; Henry et al., 2014). Thus, there is a strong link between the intrinsic or entrained oscillatory state of neural activity and behavioral performance.
Some components of neural oscillations, namely aspects of beta-band oscillations, seem to underlie the predictive or sustentative aspect of sensory entrainment. Synchronization of neural activity to auditory cues has been observed most strongly in the low frequencies, particularly the delta and theta frequency bands (Kayser et al., 2009; Howard and Poeppel, 2012; Ding et al., 2014), but also in higher frequencies, including the beta, and gamma frequency bands (Snyder and Large, 2005; Fujioka et al., 2012). Beta power decreases rapidly after each tone and increases before the next tone in the context of temporal regularity. Importantly, the increase depends on the tempo of the presented stimuli, with a rapid increase for fast tempi, and a slower increase for slower tempi (Fujioka et al., 2012). Moreover, when an expected stimulus is omitted, the decrease in beta power is absent, but the increase before the next tone is nevertheless present (Fujioka et al., 2009). Both findings indicate that the increase in beta power is not simply following amplitude modulations in the entraining stimulus but might represent the endogenous encoding of the predicted time interval. This modulation of the beta band by passive listening to isochronous sounds has been replicated in adults (Fujioka et al., 2015) and in children (Cirelli et al., 2014; Etchell et al., 2016). Thus, although evidence linking the predictive nature of beta-band modulations to behavior is still missing, existing electrophysiological evidence supports the idea that beta-band activity carries predictive value in the context of sensory entrainment.
Not only do neural oscillations in the sensory cortex entrain to auditory stimuli, such entrainment is also observed in other areas of the brain (i.e., motor-related brain regions). Sensorimotor cortices (the precentral and postcentral gyri), anterior cingulate cortex, cerebellum, inferior-frontal gyrus, supplementary motor area (Fujioka et al., 2012), and medial and lateral premotor cortex displayed modulation of beta oscillation in response to an external stimulation (Fujioka et al., 2015). While beta modulation in motor regions is frequently observed during movement (for review, see Khanna and Carmena, 2015), the beta activity reported here is observed in the absence of movement and must therefore relate to the temporal processing of sensory stimuli, potentially involving predictive mechanisms. It is, however, an open question whether beta oscillation in motor-related brain regions can have a predictive value, thus underlying sensory entrainment, as is assumed for the beta oscillation in the sensory cortex.
In response to more ecological stimuli, such as speech, neural oscillations can synchronize in time ranges from the level of phonemes to the level of the syllables (for a review, see Ahissar et al., 2001; Giraud and Poeppel, 2012; Saoud et al., 2012; Power et al., 2013), with differential synchronization abilities of hemispheres potentially underlying the hemispheric specialization for speech (Giraud et al., 2007). Although such neural entrainment occurs across various oscillatory frequencies (Gross et al., 2013; Peelle et al., 2013), it is most frequently observed for low frequencies (Luo and Poeppel, 2007; for review see, Peelle and Davis, 2012). Moreover, synchronization seems to depend on previous exposure to a speech cue. The degree of familiarity with speech can facilitate entrainment (Lidji et al., 2011) and modulate oscillatory responses. Power synchronization in the theta band was observed when listening to the native language only (Pérez et al., 2015) and increased gamma-band power was observed when listening to the native language compared to a foreign language (Peña and Melloni, 2011). This indicates that neural oscillations might help to assess the meaning of speech.
Another potential neural correlate of sensory entrainment is neural activation in the dorsal striatum. Several studies manipulating the temporal context of auditory sequences have reported activation in the putamen. Typically, this activation was observed when experimental subjects listened to sound sequences comprising temporal regularity. These studies examined explicit processing of timing by applying perceptual tasks, such as regularity detection (Grahn and Rowe, 2009) and duration discrimination in the context of a temporally regular sequence (Teki et al., 2011a), motor tasks such as the reproduction of a rhythm comprising temporal regularity or motor synchronization with the beat (Riecker et al., 2003; Chen et al., 2008), or simply listening to a rhythmic beat (Grahn and Brett, 2007). Hence, models of auditory perception have attributed a central role to the basal ganglia, for example, as a brain region tracking temporal modulations in acoustic signals including speech (Kotz et al., 2009; Teki et al., 2011b; Schwartze et al., 2012) or integrating predictive coding in speech perception (Lim et al., 2014).
Although the above evidence indicates that activation in the putamen plays a role in temporal regularity perception, it does not reveal whether the putamen plays a role in sensory entrainment. We measured activation in the putamen in a typical sensory entrainment task (Geiser et al., 2012). Participants had to detect an intensity change in a sequence of tones that were either temporally regular (isochronous) or temporally irregular. As expected, temporal regularity enhanced auditory perception for tone intensity, and there were two associated patterns of brain activation. First, there was decreased activation in bilateral regions of the temporal lobe in response to temporally regular sequences compared to irregular sequences. Second, there was increased activation in the putamen in response to temporally regular sequences relative to irregular sequences. Thus, striatal activation is not only involved when participants encounter temporal regularity but is observed in a typical sensory entrainment task. Importantly, across individuals, the reduced activation in primary, and secondary auditory cortices in response to temporal regularity perception, which yielded better behavioral performance, was linearly correlated with increased activation in the putamen. This correlation could indicate that the striatum dynamically interacts with the sensory cortex either directly or through a mediating brain area to facilitate perception in the context of sensory entrainment.
The functional role that the striatum could play in sensory entrainment remains elusive. One could imagine that the putamen simply detects temporal regularity or the average tempo of a sequence. Alternatively, the putamen may crucially underlie sensory entrainment by internally perpetuating temporal regularity and predicting future acoustic events. Evidence demonstrating the latter is still lacking. However, when participants explicitly tracked temporal regularity in the second of two sequences in which the tempo either changed or did not change between the two sequences, greater activation in the putamen was found when a sequence repeated the tempo of a previously heard sequence than when the tempo changed (Grahn and Rowe, 2013). This indicates that the striatum responds when a tempo prediction is confirmed by the external stimulus. Authors suggest that this indicates the encoding of predictive aspects of temporal regularity perception. This is in line with an earlier study suggesting that the putamen encodes prediction, at least in motor learning (Haruno and Kawato, 2006). Further studies will need to test whether putamen activation in the context of sensory entrainment is related more to the confirmation of a prediction or to the generation of a prediction.
Whether the two neural correlates of sensory entrainment, neural oscillations and striatal activation, are functionally linked remains to be investigated. However, evidence from motor studies suggests a potential link. At least in some putaminal recording sites, the spectral power of beta oscillations increases when monkeys perform self-generated tapping in a previously learned tempo compared to when they tapped in response to an irregularly appearing cue production (Bartolo et al., 2014; Bartolo and Merchant, 2015). This indicates that some striatal circuits might play a role in the internal generation of temporal regularity, at least in the context of motor processing. Thus, it is possible that increased putamen activation as measured in the BOLD response is driven by enhanced putaminal beta activity.
Is Attention Necessary for Sensory Entrainment?
It has long been known that “dynamic attending” induced by temporally regular stimuli can lead to faster reaction times to temporally expected points in time (Jones and Boltz, 1989; Barnes and Jones, 2000; London, 2004). Most recent experimental paradigms measuring sensory entrainment comprise active tasks in which participants focus their attention on the entraining stimulus, allowing stimulus-driven attending that involves temporal expectancy (Jones et al., 2002; Sanabria and Correa, 2013). Sensory attenuation and putaminal activation in the context of sensory entrainment is observed in the presence of endogenous attention (Lange, 2010; Costa-Faidella et al., 2011; Geiser et al., 2012), and synchronization of neural oscillations to sensory stimuli is particularly strong when attention is directed toward the entraining sound (Besle et al., 2011; Horton et al., 2013).
While the sensory effect of temporal context in the presence of endogenous attention is well investigated, less is known about temporal expectancy in the absence of endogenous attention. Evidence from visual studies suggests that temporal expectation and attention might influence neural activation in opposite ways (Summerfield and Egner, 2009; Kok et al., 2012; see also Arnal and Giraud, 2012). In the auditory domain, orthogonal manipulation of expectation and attention showed an attenuation effect on the N1 in the attended condition only (Hsu et al., 2014). Based on this finding, one could hypothesize that the attenuating effect of a regular temporal context might depend on the presence of endogenous attention.
However, neural effects of entrainment are also observed in the absence of endogenous attention. In passive oddball paradigms, temporal predictability influences auditory ERPs to acoustic (Geiser et al., 2010) or higher-level deviants (Tavano et al., 2014). Moreover, neural oscillations entrain to auditory stimuli when participants' endogenous attention is directed to a concurrent visual (Fujioka et al., 2009, 2012) or auditory stimulus (Golumbic et al., 2013; Horton et al., 2013; Rimmele et al., 2015). Moreover, in an unattended condition, expectation modulates auditory beta-band synchronization to tones (Todorovic et al., 2015). Thus, attention networks use oscillatory phase entrainment for both enhancement and suppression of auditory signals (for a review, see Calderone et al., 2014).
The above evidence indicates that sensory entrainment is influenced by attention but that neural effects of entrainment are present in both attended and unattended processing conditions. Further studies will need to investigate the behavioral effects and the cortico-striatal mechanisms related to sensory entrainment as a function of attention.
In summary, sensory entrainment is essential for auditory perception. It drives perception to be best at temporally expected moments in time. Neural oscillations and, potentially, striatal brain activation underlie sensory entrainment. Whether these two correlates are part of the same mechanism and the way in which attention interacts with mechanisms of sensory entrainment remain to be investigated.
Conceptualization, EG. Writing-Original Draft, EG, CS. Writing, Review, and Editing, EG, CS. Visualization, CS.
Swiss National Science Foundation: PZ00P1_148184/1 awarded to EG and FN320030-159708 awarded to Stephanie Clarke.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We would like to thank the two reviewers for their helpful comments on our manuscript.
Ahissar, E., Nagarajan, S., Ahissar, M., Protopapas, A., Mahncke, H., and Merzenich, M. M. (2001). Speech comprehension is correlated with temporal response patterns recorded from auditory cortex. Proc. Natl. Acad. Sci. U.S.A. 98, 13367–13372. doi: 10.1073/pnas.201400998
Arnal, L. H., and Giraud, A.-L. (2012). Cortical oscillations and sensory predictions. Trends Cogn. Sci. 16, 390–398. doi: 10.1016/j.tics.2012.05.003
Barnes, R., and Jones, M. R. (2000). Expectancy, attention, and time. Cogn. Psychol. 41, 254–311. doi: 10.1006/cogp.2000.0738
Bartolo, R., and Merchant, H. (2015). β Oscillations are linked to the initiation of sensory-cued movement sequences and the internal guidance of regular tapping in the monkey. J. Neurosci. 35, 4635–4640. doi: 10.1523/JNEUROSCI.4570-14.2015
Bartolo, R., Prado, L., and Merchant, H. (2014). Information processing in the primate basal ganglia during sensory-guided and internally driven rhythmic tapping. J. Neurosci. 34, 3910–3923. doi: 10.1523/JNEUROSCI.2679-13.2014
Besle, J., Schevon, C. A., Mehta, A. D., Lakatos, P., Goodman, R. R., McKhann, G. M., et al. (2011). Tuning of the human neocortex to the temporal dynamics of attended events. J. Neurosci. 31, 3176–3185. doi: 10.1523/jneurosci.4518-10.2011
Bolger, D., Trost, W., and Schoen, D. (2013). Rhythm implicitly affects temporal orienting of attention across modalities. Acta Psychol. 142, 238–244. doi: 10.1016/j.actpsy.2012.11.012
Brochard, R., Tassin, M., and Zagar, D. (2013). Got rhythm for better and for worse. Cross-modal effects of auditory rhythm on visual word recognition. Cognition 127, 214–219. doi: 10.1016/j.cognition.2013.01.007
Busch, N. A., Dubois, J., and VanRullen, R. (2009). The phase of ongoing EEG oscillations predicts visual perception. J. Neurosci. 29, 7869–7876. doi: 10.1523/JNEUROSCI.0113-09.2009
Calderone, D. J., Lakatos, P., Butler, P. D., and Castellanos, F. X. (2014). Entrainment of neural oscillations as a modifiable substrate of attention. Trends Cogn. Sci. 18, 300–309. doi: 10.1016/j.tics.2014.02.005
Cason, N., Hidalgo, C., Isoard, F., Roman, S., and Schön, D. (2015). Rhythmic priming enhances speech production abilities: evidence from prelingually deaf children. Neuropsychology 29, 102–107. doi: 10.1037/neu0000115
Cason, N., and Schön, D. (2012). Rhythmic priming enhances the phonological processing of speech. Neuropsychologia 50, 2652–2658. doi: 10.1016/j.neuropsychologia.2012.07.018
Chen, J. L., Penhune, V. B., and Zatorre, R. J. (2008). Moving on time: brain network for auditory-motor synchronization is modulated by rhythm complexity and musical training. J. Cogn. Neurosci. 20, 226–239. doi: 10.1162/jocn.2008.20018
Cirelli, L. K., Bosnyak, D., Manning, F. C., Spinelli, C., Marie, C., Fujioka, T., et al. (2014). Beat-induced fluctuations in auditory cortical beta-band activity: using EEG to measure age-related changes. Front. Psychol. 5:742. doi: 10.3389/fpsyg.2014.00742
Costa-Faidella, J., Baldeweg, T., Grimm, S., and Escera, C. (2011). interactions between “what” and “when” in the auditory system: temporal predictability enhances repetition suppression. J. Neurosci. 31, 18590–18597. doi: 10.1523/JNEUROSCI.2599-11.2011
Cutanda, D., Correa, A., and Sanabria, D. (2015). Auditory temporal preparation induced by rhythmic cues during concurrent auditory working memory tasks. J. Exp. Psychol. Hum. Percept. Perform. 41, 790–797. doi: 10.1037/a0039167
Ding, N., Chatterjee, M., and Simon, J. Z. (2014). Robust cortical entrainment to the speech envelope relies on the spectro-temporal fine structure. Neuroimage 88, 41–46. doi: 10.1016/j.neuroimage.2013.10.054
Elliott, M. T., Wing, A. M., and Welchman, A. E. (2010). Multisensory cues improve sensorimotor synchronisation. Eur. J. Neurosci. 31, 1828–1835. doi: 10.1111/j.1460-9568.2010.07205.x
Elliott, M. T., Wing, A. M., and Welchman, A. E. (2011). The effect of ageing on multisensory integration for the control of movement timing. Exp. Brain Res. 213, 291–298. doi: 10.1007/s00221-011-2740-x
Ellis, R. J., and Jones, M. R. (2010). Rhythmic context modulates foreperiod effects. Atten. Percept. Psychophys. 72, 2274–2288. doi: 10.3758/BF03196701
Etchell, A. C., Ryan, M., Martin, E., Johnson, B. W., and Sowman, P. F. (2016). Abnormal time course of low beta modulation in non-fluent preschool children: a magnetoencephalographic study of rhythm tracking. Neuroimage 125, 953–963. doi: 10.1016/j.neuroimage.2015.10.086
Fernandez Del Olmo, M., Cheeran, B., Koch, G., and Rothwell, J. C. (2007). Role of the cerebellum in externally paced rhythmic finger movements. J. Neurophysiol. 98, 145–152. doi: 10.1152/jn.01088.2006
Fiser, J., Chiu, C. Y., and Weliky, M. (2004). Small modulation of ongoing cortical dynamics by sensory input during natural vision. Nature 431, 573–578. doi: 10.1038/nature02907
Fries, P. (2015). Rhythms for cognition: communication through coherence. Neuron 88, 220–235. doi: 10.1016/j.neuron.2015.09.034
Fujioka, T., Ross, B., and Trainor, L. J. (2015). Beta-band oscillations represent auditory beat and its metrical hierarchy in perception and imagery. J. Neurosci. 35, 15187–15198. doi: 10.1523/JNEUROSCI.2397-15.2015
Fujioka, T., Trainor, L. J., Large, E. W., and Ross, B. (2009). Beta and gamma rhythms in human auditory cortex during musical beat processing. Ann. N.Y. Acad. Sci. 1169, 89–92. doi: 10.1111/j.1749-6632.2009.04779.x
Fujioka, T., Trainor, L. J., Large, E. W., and Ross, B. (2012). Internalized timing of isochronous sounds is represented in neuromagnetic beta oscillations. J. Neurosci. 32, 1791–1802. doi: 10.1523/JNEUROSCI.4107-11.2012
Geiser, E., Notter, M., and Gabrieli, J. D. E. (2012). A corticostriatal neural system enhances auditory perception through temporal context processing. J. Neurosci. 32, 6177–6182. doi: 10.1523/jneurosci.5153-11.2012
Geiser, E., Sandmann, P., Jancke, L., and Meyer, M. (2010). Refinement of metre perception–training increases hierarchical metre processing. Eur. J. Neurosci. 32, 1979–1985. doi: 10.1111/j.1460-9568.2010.07462.x
Geiser, E., Walker, K. M. M., and Bendor, D. (2014). Global timing: a conceptual framework to investigate the neural basis of rhythm perception in humans and non-human species. Front. Psychol. 5:159. doi: 10.3389/fpsyg.2014.00159
Geiser, E., Zaehle, T., Jancke, L., and Meyer, M. (2008). The neural correlate of speech rhythm as evidenced by metrical speech processing. J. Cogn. Neurosci. 20, 541–552. doi: 10.1162/jocn.2008.20029
Giraud, A.-L., Kleinschmidt, A., Poeppel, D., Lund, T. E., Frackowiak, R. S. J., and Laufs, H. (2007). Endogenous cortical rhythms determine cerebral specialization for speech perception and production. Neuron 56, 1127–1134. doi: 10.1016/j.neuron.2007.09.038
Giraud, A.-L., and Poeppel, D. (2012). Cortical oscillations and speech processing: emerging computational principles and operations. Nat. Neurosci. 15, 511–517. doi: 10.1038/nn.3063
Golumbic, E. M. Z., Ding, N., Bickel, S., Lakatos, P., Schevon, C. A., McKhann, G. M., et al. (2013). Mechanisms underlying selective neuronal tracking of attended speech at a “Cocktail Party.” Neuron 77, 980–991. doi: 10.1016/j.neuron.2012.12.037
Grahn, J. A., and Brett, M. (2007). Rhythm and beat perception in motor areas of the brain. J. Cogn. Neurosci. 19, 893–906. doi: 10.1162/jocn.2007.19.5.893
Grahn, J. A., and Rowe, J. B. (2009). Feeling the beat: premotor and striatal interactions in musicians and nonmusicians during beat perception. J. Neurosci. 29, 7540–7548. doi: 10.1523/JNEUROSCI.2018-08.2009
Grahn, J. A., and Rowe, J. B. (2013). Finding and feeling the musical beat: striatal dissociations between detection and prediction of regularity. Cereb. Cortex 23, 913–921. doi: 10.1093/cercor/bhs083
Gross, J., Hoogenboom, N., Thut, G., Schyns, P., Panzeri, S., Belin, P., et al. (2013). Speech rhythms and multiplexed oscillatory sensory coding in the human brain. PLoS Biol. 11:e1001752. doi: 10.1371/journal.pbio.1001752
Haruno, M., and Kawato, M. (2006). Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning. Neural Netw. 19, 1242–1254. doi: 10.1016/j.neunet.2006.06.007
Henry, M. J., Herrmann, B., and Obleser, J. (2014). Entrained neural oscillations in multiple frequency bands comodulate behavior. Proc. Natl. Acad. Sci. U.S.A. 111, 14935–14940. doi: 10.1073/pnas.1408741111
Henry, M. J., and Obleser, J. (2012). Frequency modulation entrains slow neural oscillations and optimizes human listening behavior. Proc. Natl. Acad. Sci. U.S.A. 109, 20095–20100. doi: 10.1073/pnas.1213390109
Herrmann, B., Henry, M. J., Haegens, S., and Obleser, J. (2016). Temporal expectations and neural amplitude fluctuations in auditory cortex interactively influence perception. Neuroimage 124, 487–497. doi: 10.1016/j.neuroimage.2015.09.019
Hickok, G., Farahbod, H., and Saberi, K. (2015). The rhythm of perception entrainment to acoustic rhythms induces subsequent perceptual oscillation. Psychol. Sci. 26, 1006–1013. doi: 10.1177/0956797615576533
Horton, C., D'Zmura, M., and Srinivasan, R. (2013). Suppression of competing speech through entrainment of cortical oscillations. J. Neurophysiol. 109, 3082–3093. doi: 10.1152/jn.01026.2012
Hove, M. J., Fairhurst, M. T., Kotz, S. A., and Keller, P. E. (2013). Synchronizing with auditory and visual rhythms: an fMRI assessment of modality differences and modality appropriateness. Neuroimage 67, 313–321. doi: 10.1016/j.neuroimage.2012.11.032
Howard, M. F., and Poeppel, D. (2012). The neuromagnetic response to spoken sentences: co-modulation of theta band amplitude and phase. Neuroimage 60, 2118–2127. doi: 10.1016/j.neuroimage.2012.02.028
Hsu, Y.-F., Hamalainen, J. A., and Waszak, F. (2014). Both attention and prediction are necessary for adaptive neuronal tuning in sensory processing. Front. Hum. Neurosci. 8:152. doi: 10.3389/fnhum.2014.00152
Huygens, C. (1893). No 1338. Christiaan Huygens à R. Moray. 27 février 1665. In Oeuvres Completes de christian Huygens. Tome V, correspondance 1664-1665 (Société Hollandaise des Sciences). Martinus Nijhoff, Den Haag: David Bierens de Haan.
Jones, M. R., and Boltz, M. (1989). Dynamic attending and response to time. Psychol. Rev. 96, 459–491. doi: 10.1037/0033-295X.96.3.459
Jones, M. R., Moynihan, H., MacKenzie, N., and Puente, J. (2002). Temporal aspects of stimulus-driven attending in dynamic arrays. Psychol. Sci. 13, 313–319. doi: 10.1111/1467-9280.00458
Kayser, C., Montemurro, M. A., Logothetis, N. K., and Panzeri, S. (2009). Spike-phase coding boosts and stabilizes information carried by spatial and temporal spike patterns. Neuron 61, 597–608. doi: 10.1016/j.neuron.2009.01.008
Khanna, P., and Carmena, J. M. (2015). Neural oscillations: beta band activity across motor networks. Curr. Opin. Neurobiol. 32, 60–67. doi: 10.1016/j.conb.2014.11.010
Kok, P., Rahnev, D., Jehee, J. F. M., Lau, H. C., and de Lange, F. P. (2012). Attention reverses the effect of prediction in silencing sensory signals. Cereb. Cortex 22, 2197–2206. doi: 10.1093/cercor/bhr310
Kotz, S. A., Schwartze, M., and Schmidt-Kassow, M. (2009). Non-motor basal ganglia functions: a review and proposal for a model of sensory predictability in auditory language perception. Cortex 45, 982–990. doi: 10.1016/j.cortex.2009.02.010
Lakatos, P., Karmos, G., Mehta, A. D., Ulbert, I., and Schroeder, C. E. (2008). Entrainment of neuronal oscillations as a mechanism of attentional selection. Science 320, 110–113. doi: 10.1126/science.1154735
Lakatos, P., Musacchia, G., O'Connel, M. N., Falchier, A. Y., Javitt, D. C., and Schroeder, C. E. (2013). The spectrotemporal filter mechanism of auditory selective attention. Neuron 77, 750–761. doi: 10.1016/j.neuron.2012.11.034
Lakatos, P., Shah, A. S., Knuth, K. H., Ulbert, I., Karmos, G., and Schroeder, C. E. (2005). An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. J. Neurophysiol. 94, 1904–1911. doi: 10.1152/jn.00263.2005
Langdon, A. J., Boonstra, T. W., and Breakspear, M. (2011). Multi-frequency phase locking in human somatosensory cortex. Prog. Biophys. Mol. Biol. 105, 58–66. doi: 10.1016/j.pbiomolbio.2010.09.015
Lange, K. (2009). Brain correlates of early auditory processing are attenuated by expectations for time and pitch. Brain Cogn. 69, 127–137. doi: 10.1016/j.bandc.2008.06.004
Lange, K. (2010). Can a regular context induce temporal orienting to a target sound? Int. J. Psychophysiol. 78, 231–238. doi: 10.1016/j.ijpsycho.2010.08.003
Lange, K., and Roeder, B. (2006). Orienting attention to points in time improves stimulus processing both within and across modalities. J. Cogn. Neurosci. 18, 715–729. doi: 10.1162/jocn.2006.18.5.715
Lecaignard, F., Bertrand, O., Gimenez, G., Mattout, J., and Caclin, A. (2015). Implicit learning of predictable sound sequences modulates human brain responses at different levels of the auditory hierarchy. Front. Hum. Neurosci. 9:505. doi: 10.3389/fnhum.2015.00505
Lidji, P., Palmer, C., Peretz, I., and Morningstar, M. (2011). Listeners feel the beat: entrainment to English and French speech rhythms. Psychon. Bull. Rev. 18, 1035–1041. doi: 10.3758/s13423-011-0163-0
Lim, S.-J., Fiez, J. A., and Holt, L. L. (2014). How may the basal ganglia contribute to auditory categorization and speech perception? Front. Neurosci. 8:230. doi: 10.3389/fnins.2014.00230
London, J. (2004). Hearing in Time: Psychological Aspects of Musical Meter. New York, NY: Oxford University press.
Luo, H., Liu, Z., and Poeppel, D. (2010). Auditory cortex tracks both auditory and visual stimulus dynamics using low-frequency neuronal phase modulation. PLoS Biol. 8:445. doi: 10.1371/journal.pbio.1000445
Luo, H., and Poeppel, D. (2007). Phase patterns of neuronal responses reliably discriminate speech in human auditory cortex. Neuron 54, 1001–1010. doi: 10.1016/j.neuron.2007.06.004
Marchant, J. L., and Driver, J. (2013). Visual and audiovisual effects of isochronous timing on visual perception and brain activity. Cereb. Cortex 23, 1290–1298. doi: 10.1093/cercor/bhs095
Mathewson, K. E., Gratton, G., Fabiani, M., Beck, D. M., and Ro, T. (2009). To see or not to see: pre-stimulus alpha phase predicts visual awareness. J. Neurosci. 29, 2725–2732. doi: 10.1523/JNEUROSCI.3963-08.2009
McAuley, J. D., and Jones, M. R. (2003). Modeling effects of rhythmic context on perceived duration: a comparison of interval and entrainment approaches to short-interval timing. J. Exp. Psychol. Hum. Percept. Perform. 29, 1102–1125. doi: 10.1037/0096-15220.127.116.112
Merchant, H., Grahn, J., Trainor, L., Rohrmeier, M., and Fitch, W. T. (2015). Finding the beat: a neural perspective across humans and non-human primates. Philos. Trans. R. Soc. Lond. B Biol. Sci. 370:20140093. doi: 10.1098/rstb.2014.0093
Miller, J. E., Carlson, L. A., and McAuley, J. D. (2013). When what you hear influences when you see: listening to an auditory rhythm influences the temporal allocation of visual attention. Psychol. Sci. 24, 11–18. doi: 10.1177/0956797612446707
Montemurro, M. A., Rasch, M. J., Murayama, Y., Logothetis, N. K., and Panzeri, S. (2008). Phase-of-firing visual stimuli in coding of natural primary visual cortex. Curr. Biol. 18, 375–380. doi: 10.1016/j.cub.2008.02.023
Morillon, B., Schroeder, C. E., and Wyart, V. (2014). Motor contributions to the temporal precision of auditory attention. Nat. Commun. 5, 5255. doi: 10.1038/ncomms6255
Ng, B. S. W., Schroeder, T., and Kayser, C. (2012). A precluding but not ensuring role of entrained low-frequency oscillations for auditory perception. J. Neurosci. 32, 12268–12276. doi: 10.1523/JNEUROSCI.1877-12.2012
Nozaradan, S., Zerouali, Y., Peretz, I., and Mouraux, A. (2015). Capturing with EEG the neural entrainment and coupling underlying sensorimotor synchronization to the beat. Cereb. Cortex 25, 736–747. doi: 10.1093/cercor/bht261
Okamoto, H., Teismann, H., Keceli, S., Pantev, C., and Kakigi, R. (2013). Differential effects of temporal regularity on auditory-evoked response amplitude: a decrease in silence and increase in noise. Behav. Brain Funct. 9:44. doi: 10.1186/1744-9081-9-44
Peelle, J. E., and Davis, M. H. (2012). Neural oscillations carry speech rhythm through to comprehension. Front. Psychol. 3:320. doi: 10.3389/fpsyg.2012.00320
Peelle, J. E., Gross, J., and Davis, M. H. (2013). Phase-locked responses to speech in human auditory cortex are enhanced during comprehension. Cereb. Cortex 23, 1378–1387. doi: 10.1093/cercor/bhs118
Peña, M., and Melloni, L. (2011). Brain oscillations during spoken sentence processing. J. Cogn. Neurosci. 24, 1149–1164. doi: 10.1162/jocn_a_00144
Pérez, A., Carreiras, M., Gillon Dowens, M., and Duñabeitia, J. A. (2015). Differential oscillatory encoding of foreign speech. Brain Lang. 147, 51–57. doi: 10.1016/j.bandl.2015.05.008
Power, A. J., Mead, N., Barnes, L., and Goswami, U. (2012). Neural entrainment to rhythmically presented auditory, visual, and audio-visual speech in children. Front. Psychol. 3:216. doi: 10.3389/fpsyg.2012.00216
Power, A. J., Mead, N., Barnes, L., and Goswami, U. (2013). Neural entrainment to rhythmic speech in children with developmental dyslexia. Front. Hum. Neurosci. 7:777. doi: 10.3389/fnhum.2013.00777
Riecker, A., Wildgruber, D., Mathiak, K., Grodd, W., and Ackermann, H. (2003). Parametric analysis of rate-dependent hemodynamic response functions of cortical and subcortical brain structures during auditorily cued finger tapping: a fMRI study. Neuroimage 18, 731–739. doi: 10.1016/S1053-8119(03)00003-X
Rimmele, J., Jolsvai, H., and Sussman, E. (2011). Auditory target detection is affected by implicit temporal and spatial expectations. J. Cogn. Neurosci. 23, 1136–1147. doi: 10.1162/jocn.2010.21437
Rimmele, J. M., Golumbic, E. Z., Schroeger, E., and Poeppel, D. (2015). The effects of selective attention and speech acoustics on neural speech-tracking in a multi-talker scene. Cortex 68, 144–154. doi: 10.1016/j.cortex.2014.12.014
Rohenkohl, G., Cravo, A. M., Wyart, V., and Nobre, A. C. (2012). Temporal expectation improves the quality of sensory information. J. Neurosci. 32, 8424–8428. doi: 10.1523/JNEUROSCI.0804-12.2012
Ross, B., Jamali, S., Miyazaki, T., and Fujioka, T. (2013). Synchronization of beta and gamma oscillations in the somatosensory evoked neuromagnetic steady-state response. Exp. Neurol. 245, 40–51. doi: 10.1016/j.expneurol.2012.08.019
Ross, J. M., and Balasubramaniam, R. (2014). Physical and neural entrainment to rhythm: human sensorimotor coordination across tasks and effector systems. Front. Hum. Neurosci. 8:576. doi: 10.3389/fnhum.2014.00576
Ruspantini, I., Maki, H., Korhonen, R., D'Ausilio, A., and Ilmoniemi, R. J. (2011). The functional role of the ventral premotor cortex in a visually paced finger tapping task: A TMS study. Behav. Brain Res. 220, 325–330. doi: 10.1016/j.bbr.2011.02.017
Sanabria, D., Capizzi, M., and Correa, A. (2011). Rhythms that speed you up. J. Exp. Psychol. Hum. Percept. Perform. 37, 236–244. doi: 10.1037/a0019956
Sanabria, D., and Correa, A. (2013). Electrophysiological evidence of temporal preparation driven by rhythms in audition. Biol. Psychol. 92, 98–105. doi: 10.1016/j.biopsycho.2012.11.012
Saoud, H., Josse, G., Bertasi, E., Truy, E., Chait, M., and Giraud, A.-L. (2012). Brain–speech alignment enhances auditory cortical responses and speech perception. J. Neurosci. 32, 275–281. doi: 10.1523/JNEUROSCI.3970-11.2012
Schmidt-Kassow, M., Schubotz, R. I., and Kotz, S. A. (2009). Attention and entrainment: P3b varies as a function of temporal predictability. Neuroreport 20, 31–36. doi: 10.1097/WNR.0b013e32831b4287
Schroeder, C. E., and Lakatos, P. (2009). The Gamma oscillation: master or slave? Brain Topogr. 22, 24–26. doi: 10.1007/s10548-009-0080-y
Schwartze, M., Tavano, A., Schroger, E., and Kotz, S. A. (2012). Temporal aspects of prediction in audition: Cortical and subcortical neural mechanisms. Int. J. Psychophysiol. 83, 200–207. doi: 10.1016/j.ijpsycho.2011.11.003
Snyder, J. S., and Large, E. W. (2005). Gamma-band activity reflects the metric structure of rhythmic tone sequences. Cogn. Brain Res. 24, 117–126. doi: 10.1016/j.cogbrainres.2004.12.014
Summerfield, C., and Egner, T. (2009). Expectation (and attention) in visual cognition. Trends Cogn. Sci. 13, 403–409. doi: 10.1016/j.tics.2009.06.003
Tavano, A., Widmann, A., Bendixen, A., Trujillo-Barreto, N., and Schroeger, E. (2014). Temporal regularity facilitates higher-order sensory predictions in fast auditory sequences. Eur. J. Neurosci. 39, 308–318. doi: 10.1111/ejn.12404
Teki, S., Grube, M., and Griffiths, T. D. (2011a). A unified model of time perception accounts for duration-based and beat-based timing mechanisms. Front. Integr. Neurosci. 5:90. doi: 10.3389/fnint.2011.00090
Teki, S., Grube, M., Kumar, S., and Griffiths, T. D. (2011b). Distinct neural substrates of duration-based and beat-based auditory timing. J. Neurosci. 31, 3805–3812. doi: 10.1523/jneurosci.5561-10.2011
Todorovic, A., Schoffelen, J.-M., van Ede, F., Maris, E., and de Lange, F. P. (2015). Temporal expectation and attention jointly modulate auditory oscillatory activity in the beta band. PLoS ONE 10:e0120288. doi: 10.1371/journal.pone.0120288
Womelsdorf, T., Schoffelen, J.-M., Oostenveld, R., Singer, W., Desimone, R., Engel, A. K., et al. (2007). Modulation of neuronal interactions through neuronal synchronization. Science 316, 1609–1612. doi: 10.1126/science.1139597
Zoefel, B., and VanRullen, R. (2015). Selective perceptual phase entrainment to speech rhythm in the absence of spectral energy fluctuations. J. Neurosci. 35, 1954–1964. doi: 10.1523/JNEUROSCI.3484-14.2015
Keywords: entrainment, neural oscillations, striatum, auditory, regularity, beat, phase-locking, predictive coding
Citation: Sameiro-Barbosa CM and Geiser E (2016) Sensory Entrainment Mechanisms in Auditory Perception: Neural Synchronization Cortico-Striatal Activation. Front. Neurosci. 10:361. doi: 10.3389/fnins.2016.00361
Received: 12 April 2016; Accepted: 20 July 2016;
Published: 10 August 2016.
Edited by:Sonja A. Kotz, Maastricht University, Netherlands; Max-Planck Institute for Human Cognitive and Brain Sciences, Germany
Reviewed by:Jessica A. Grahn, University of Western Ontario, Canada
Johanna Maria Rimmele, Max-Planck-Institute for Empirical Aesthetics, Germany
Copyright © 2016 Sameiro-Barbosa and Geiser. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Eveline Geiser, firstname.lastname@example.org