Prior Precision Modulates the Minimization of Auditory Prediction Error

Hsu, Yi-Fang; Waszak, Florian; Hämäläinen, Jarmo A.

doi:10.3389/fnhum.2019.00030

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 15 February 2019

Sec. Sensory Neuroscience

Volume 13 - 2019 | https://doi.org/10.3389/fnhum.2019.00030

Prior Precision Modulates the Minimization of Auditory Prediction Error

1. Department of Educational Psychology and Counselling, National Taiwan Normal University, Taipei, Taiwan
2. Institute for Research Excellence in Learning Sciences, National Taiwan Normal University, Taipei, Taiwan
3. Université Paris Descartes, Sorbonne Paris Cité, Paris, France
4. CNRS, Laboratoire Psychologie de la Perception, UMR 8242, Paris, France
5. Jyväskylä Centre for Interdisciplinary Brain Research, Department of Psychology, University of Jyväskylä, Jyväskylä, Finland

Article metrics

View details

Citations

3,9k

Views

1,2k

Downloads

Abstract

The predictive coding model of perception proposes that successful representation of the perceptual world depends upon canceling out the discrepancy between prediction and sensory input (i.e., prediction error). Recent studies further suggest a distinction to be made between prediction error triggered by non-predicted stimuli of different prior precision (i.e., inverse variance). However, it is not fully understood how prediction error with different precision levels is minimized in the predictive process. Here, we conducted a magnetoencephalography (MEG) experiment which orthogonally manipulated prime-probe relation (for contextual precision) and stimulus repetition (for perceptual learning which decreases prediction error). We presented participants with cycles of tone quartets which consisted of three prime tones and one probe tone of randomly selected frequencies. Within each cycle, the three prime tones remained identical while the probe tones changed once at some point (e.g., from repetition of 123X to repetition of 123Y). Therefore, the repetition of probe tones can reveal the development of perceptual inferences in low and high precision contexts depending on their position within the cycle. We found that the two conditions resemble each other in terms of N1m modulation (as both were associated with N1m suppression) but differ in terms of N2m modulation. While repeated probe tones in low precision context did not exhibit any modulatory effect, repeated probe tones in high precision context elicited a suppression and rebound of the N2m source power. The differentiation suggested that the minimization of prediction error in low and high precision contexts likely involves distinct mechanisms.

Introduction

Our brain constantly predicts forthcoming sensory inputs. The predictive coding model of perception postulates that perception entails two distinct neurocomputational components, the top-down propagation of prediction and the bottom-up propagation of prediction error (Rao and Ballard, 1999; Friston, 2005, 2009; Summerfield et al., 2008; for a review see Clark, 2013). The flow of information takes place between multiple hierarchical levels harboring both representational units and error units (Egner et al., 2010). While the representational units encode prediction about the causal structure of the environment and feed it backward to the next lower level, the error units encode the discrepancy between prediction and sensory input as prediction error and communicate it forward to the next higher level. The message-passing between hierarchical cortical levels iterates to match prediction and sensory inputs as much as possible to minimize prediction error in the system.

Recent research further suggested the necessity to distinguish between two conditions inducing prediction error: the unpredicted condition (where there is no precise prediction) and the mispredicted condition (where there is a precise prediction being violated). Conceptually, the unpredicted condition is mainly associated with prediction error generated by sensory input that is not anticipated, whereas the mispredicted condition triggers not only prediction error generated by sensory input that is not anticipated but also prediction error generated by prediction that is not fulfilled (Arnal and Giraud, 2012). The dissociation was supported by electroencephalography (EEG) evidence demonstrating that unpredicted and mispredicted stimuli are associated with different amounts of cortical activity (Hsu et al., 2015, 2018). Relative to predicted stimuli, unpredicted stimuli are associated with smaller neuronal responses whereas mispredicted stimuli are associated with larger neuronal responses on the N1 event-related potential (ERP) component, which is typically considered an electrophysiological indicator for automatic predictive processing (for a review see Bendixen et al., 2012).

The result pattern can be interpreted in terms of how the prediction error is adjusted depending on the precision of the sensory input (Friston, 2005, 2009), which is suggested to be encoded by gain in superficial pyramidal cells in sensory cortices (Feldman and Friston, 2010; Auksztulewicz et al., 2017; Fardo et al., 2017). Precision refers to the inverse of a signal’s variance, which quantifies the degree of certainty about the signals in general statistical usage (Feldman and Friston, 2010; Ransom et al., 2017). Unpredicted stimuli, relative to mispredicted stimuli, are embedded in contexts of larger variance (i.e., lower precision); therefore, prediction error is weighted less in the former than the latter (Schröger et al., 2015). The idea conforms to previous research on the mismatch negativity (MMN) which reported a significant difference when contrasting between a deviant sound embedded in an equiprobable sequence (i.e., a low precision context) and a deviant sound embedded in a standard sequence (i.e., a high precision context; e.g., Jacobsen and Schröger, 2001; for a review see Näätänen et al., 2005; but see Ahmed et al., 2011; Astikainen et al., 2011; Nakamura et al., 2011 vs. Farley et al., 2010; Fishman and Steinschneider, 2012; Kaliukhovich and Vogels, 2014; for an ongoing debate in animal research). The effect of contextual expectancies was also demonstrated by comparing mismatch responses to sounds with frequencies sampled from broad and narrow Gaussian distributions (Garrido et al., 2013). It was reported that sounds in the tail of the distribution evoked larger mismatch responses than those that fell at the center. Moreover, responses to physically identical outliers were greater when the distribution was narrower, indicating that the brain can implicitly track the certainty in distributions of events. It was suggested that manipulating contextual expectancies is equivalent to manipulating the precision of prediction errors higher in the processing hierarchy, which in turn has a modulatory effect on neuronal responses similar to that of attention (Auksztulewicz and Friston, 2015).

The differentiation raised the question whether contextual precision also affects how prediction error is minimized in perceptual learning, a major function of the predictive brain. Here, we conducted a magnetoencephalography (MEG) experiment with a novel design which orthogonally manipulated prime-probe relation (for contextual precision) and stimulus repetition (for perceptual learning which decreases prediction error). Specifically, we presented participants with cycles of tone quartet which consisted of three prime tones and one probe tone of randomly selected frequencies. Within each cycle, the three prime tones remained identical while the probe tones changed once at some point (e.g., from repetition of 123X to repetition of 123Y). Therefore, the repetition of probe tones can reveal the development of perceptual inferences in low and high precision contexts depending on their position within the cycle (Figure 1A). In the beginning of a cycle where the three prime tones lack indicative value, probe tone X triggers prediction error in a low precision context (because listeners would predict a probe tone to be presented but cannot be quite sure of its frequency, resembling the aforementioned unpredicted condition). Its repetition thus reveals how a prediction is established from scratch, or how prediction error is minimized in low precision context for perceptual learning. In the middle of a cycle where the three prime tones are already associated with probe tone X, probe tone Y triggers prediction error in a high precision context (because listeners would tend to predict that after this particular tone trio a probe tone X would follow but such expectation is violated, resembling the aforementioned mispredicted condition). Its repetition thus reveals how a violated prediction is re-established, or how prediction error is minimized in high precision context for perceptual learning. Notably, previous neurocomputational modeling already proposed that stimulus repetition results in perceptual learning which might increase the precision weighting of prediction error (Garrido et al., 2009; for a review see Auksztulewicz and Friston, 2016). However, it is undetermined whether the putative increase in precision weighting of prediction error depends on its initial precision status. If the minimization of prediction error in low and high precision contexts involves distinct mechanisms, the repetition of tone quartets 123X and 123Y should modulate MEG responses in different manners. Specifically, the current research focused on examining the N1m and N2m modulations, given that responses in the N1m time window are sensitive to context precision (Näätänen and Picton, 1987; Hsu et al., 2015) whereas responses in the N2m time window show responsiveness to stimulus probability (represented in the MMN; Näätänen et al., 2005) and representation build-up (Karhu et al., 1997). These long-latency components are also reported to be mediated by top-down effects in cortical networks and therefore rest on backward connections (Garrido et al., 2007).

Figure 1

Materials and Methods

Participants

Eighteen healthy adults aged between 20 and 26 years (average age: 24; six males; 14 right-handed) with no history of neurological, psychiatric, or visual/hearing impairments as indicated by self-report participated in the experiment. This study was carried out in accordance with the recommendations of the ethics committee of National Taiwan Normal University (Taiwan) and the University of Jyväskylä (Finland). All subjects gave written informed consent in accordance with the Declaration of Helsinki. Four participants were excluded from data analysis for excessive measurement noise, leaving 14 participants in the final sample (average age 24; three males; 12 right-handed).

Stimuli

Sinusoidal tones with a loudness of 80 phons (i.e., 80 dB for tones of 1,000 Hz) were generated using Matlab. The duration of each tone was 50 ms (including 5 ms rise/fall times). The frequency of each tone was within the range of 261.626–493.883 Hz, matching the absolute frequency of a series of seven natural keys on a modern piano (i.e., C4 D4 E4 F4 G4 A4 B4).

A total of 90 pairs of tone quartets (consisting of three prime tones and one probe tone) were created. Each pair of tone quartets was identical in the prime tones but different in the probe tone in terms of frequency (e.g., F4-E4-G4-A4 and F4-E4-G4-D4). The frequency of the prime tones was determined by a random sampling without replacement, with the exception of any continuously rising or falling sequence to avoid the step inertia expectation (i.e., the expectation that the frequencies of upcoming tones continue in the same direction when the frequencies of previous tones are presented as a scale; Lange, 2009). The frequency of the probe tone can be anything except that of the prime tones.

Procedures

A total of 10 blocks of nine cycles were presented. Each cycle consisted of the repetition of a pair of tone quartets, where the first tone quartet was repeated 4–6 times before the second tone quartet was repeated 4–6 times. The reason we presented each tone quartet 4–6 times was to prevent participants from learning high-order regularities (e.g., correctly anticipating a change in probe tone). Therefore, a cycle could contain 8–12 tone quartets. While the repetition of the first tone quartet turned the initially non-predicted probe tone into a predicted tone in a low precision context (resembling the minimization of unpredicted error), the repetition of the second tone quartet turned the initially non-predicted probe tone into a predicted tone in a high precision context (resembling the minimization of mispredicted error; Figure 1A).

Figure 1B illustrates a tone quartet, which started with a silent interval of 500 ms. Each tone was separated by a 500 ms stimulus onset asynchrony (SOA). 10% of the probe tones were of attenuated loudness of 20 dB (which were excluded from data analysis). Participants were required to press a key as soon as they detected a softer probe tone to maintain their attention. The offset of the probe tone was followed by a jittered inter-trial interval (ITI) of 700–800 ms. There was no separation between cycles distinct from the ITI. A fixation cross remained on the screen for the duration of the block. The whole experiment took around 42 min (i.e., 900 trials × 2,800 ms). Presentation (Neurobehavioral Systems, Inc., Berkeley, CA, USA) was used for stimulus presentation. Stimulation was randomized individually for each participant and delivered through two panel speakers situated to the left and right of the participant.

Data Recording and Analysis

MEG data was collected using a 306 channel whole-head device (Elekta Neuromag, Finland) in a two-layered magnetically shielded room at the University of Jyväskylä (Finland). The sampling rate was 1,000 Hz. A high-pass filter of 0.03 Hz and a low-pass filter of 200 Hz were used. Continuous head position monitoring was used based on five Head-Position Indicator (HPI) coils, with three at the forehead and two behind the ears. Electrooculography (EOG) was recorded using electrodes lateral to each eye and above and below the left eye.

Offline, head movements were corrected and external noise sources were attenuated using the temporal extension of the source subspace separation algorithm (Taulu et al., 2005) in the MaxFilter program (Elekta Neuromag, Finland).

After the initial head movement correction, the data was analyzed using BrainStorm 3.2 (Tadel et al., 2011). Signal subspace projection was used to correct for eye blinks. The MEG signal was filtered at 1–40 Hz and segmented from −100 to 500 ms relative to the onset of the stimulus using a 100 ms pre-stimulus baseline. Segments with over 5,000 fT/cm peak-to-peak values in gradiometers or 7,000 fT peak-to-peak values in magnetometers were rejected. As all tone quartets were repeated at least four times, segments to the 5th and 6th presentations of tone quartets were also rejected to ensure our analysis is based on equal numbers of trials. The trial numbers after artifact rejection in each condition are listed in Table 1.

Table 1

Presentation		Low precision context				High precision context
	Tone	Min	Max	Mean	SD	Min	Max	Mean	SD
1st	Prime1	67	90	84.43	6.89	67	90	84.71	6.65
	Prime2	66	90	83.71	7.60	64	90	84.14	7.47
	Prime3	59	90	83.79	8.74	64	90	82.93	8.32
	Probe	55	85	75.57	8.28	56	83	75.50	7.29
2nd	Prime1	67	90	83.79	7.28	62	90	82.86	8.14
	Prime2	68	90	84.50	6.93	63	90	84.50	7.43
	Prime3	64	90	83.79	8.14	64	90	84.43	7.58
	Probe	65	83	76.71	5.46	59	84	76.50	7.10
3rd	Prime1	67	90	83.50	6.79	67	90	84.36	5.85
	Prime2	71	90	83.86	6.19	61	90	83.21	8.14
	Prime3	69	90	83.50	7.01	66	90	84.29	6.84
	Probe	58	83	74.29	6.62	59	82	75.93	6.22
4th	Prime1	69	90	84.36	6.52	66	90	84.36	6.28
	Prime2	71	90	84.50	6.27	70	90	84.64	6.18
	Prime3	72	90	84.36	6.21	62	90	82.86	8.39
	Probe	64	84	76.00	6.61	58	81	76.57	6.43

Range, mean, and standard deviation (SD) of trial numbers after artifact rejection in each condition.

The experimental effects were examined in source space. As individual magnetic resonance images (MRIs) of the participants were not available, the ICBM152 MRI template was used. The weighted minimum norm estimates (wMNE) were calculated using the unconstrained option to allow free orientation of the dipoles in relation to the cortical surface. A three shell spherical head model was used. The wMNE solution was restricted to the cortex. Noise covariance matrix was calculated from the baseline of the averaged responses.

To extract the N1m and N2m measures, we first identified the N1m and N2m from the grand average global field power (GFP) of the gradiometers (across 14 participants and 32 conditions; Figure 2A). The topographies for the magnetometers at N1m and N2m peaks are plotted in Figure 2B showing the magnetic field distribution. Then, we identified brain regions from the Dessikan-Killiany parcellation, which showed the largest source activity around the auditory cortices at the N1m and N2m, including transverse temporal, superior temporal, middle temporal, supramarginal, and postcentral regions (Figure 2C).

Figure 2

The grand average source solution (across two hemispheres, five brain regions, 14 participants, and 32 conditions) was used to identify the N1m and N2m time windows for statistical analysis. N1m peak was at ca. 110 ms (time window 85–135 ms) and N2m peak was at ca. 220 ms (time window 170–270 ms; Figure 3).

Figure 3

The source powers in the N1m and N2m time windows of the probe tones were submitted to the 2 (precision: low/high precision context) × 4 (repetition: 1st/2nd/3rd/4th presentation) repeated measures analysis of variance (ANOVA). Greenhouse-Geisser correction was applied when appropriate (and will be indicated in the following section with epsilon values).

Results

The ANOVA on the N1m source power showed only a main effect of repetition (F_(3,39) = 6.33, p < 0.01, partial eta squared = 0.33; Figure 4, left). The effect was due to the larger response to the 1st presentation compared to all the other presentations (Table 2). No significant differences were found between the response strength for the other presentations.

Figure 4

Table 2

			95% Confidence interval of the difference
Pair	Mean	SD		Upper	t	df	Sig. (2-tailed)
1–2	0.13	0.18	0.03	0.24	2.68	13	0.019
1–3	0.14	0.16	0.04	0.23	3.19	13	0.007
1–4	0.14	0.11	0.08	0.21	4.88	13	<0.001
2–3	0.00	0.15	−0.08	0.09	0.10	13	0.921
2–4	0.01	0.12	−0.06	0.08	0.38	13	0.712
3–4	0.01	0.13	−0.07	0.08	0.24	13	0.813

Results of post hoc paired samples t-tests looking into the main effect of repetition on N1m source power.

The ANOVA on the N2m source power revealed a precision × repetition interaction (F_(3,39) = 8.12, p < 0.01, partial eta squared = 0.38; Figure 4, right) as well as main effects of precision (F_(1,13) = 15.06, p < 0.01, partial eta squared = 0.54) and repetition (F_(3,39) = 9.87, p < 0.01, partial eta squared = 0.43, epsilon = 0.48). Post hoc paired samples t-tests looking into the precision × repetition interaction showed that tones in the low precision context had smaller source power than tones in the high precision context upon the 1st presentation (t₍₁₃₎ = −4.24, p < 0.01) and the 4th presentation (t₍₁₃₎ = −3.06, p < 0.01). Moreover, repetition did not modulate the source power in the low precision context but did so in the high precision context (Table 3). In the high precision context, the source power decreased from the 1st presentation to the 2nd, 3rd, and 4th presentations (t₍₁₃₎ = 3.75, p < 0.01; t₍₁₃₎ = 4.36, p < 0.01; t₍₁₃₎ = 3.94, p < 0.01). Most interestingly, the source power started to increase again in the 4th presentation as indicated by the difference in source power between the 3rd and 4th presentations (t₍₁₃₎ = −2.26, p < 0.05).

Table 3

			95% Confidence interval of the difference
Pair	Mean	SD	Lower	Upper	t	df	Sig. (2-tailed)
Low precision context
1–2	0.16	0.29	−0.01	0.33	2.09	13	0.057
1–3	0.12	0.38	−0.10	0.34	1.20	13	0.253
1–4	0.18	0.36	−0.03	0.39	1.87	13	0.084
2–3	−0.04	0.18	−0.15	0.06	−0.91	13	0.379
2–4	0.01	0.19	−0.09	0.12	0.29	13	0.774
3–4	0.06	0.26	−0.09	0.21	0.86	13	0.404
High precision context
1–2	0.55	0.55	0.23	0.87	3.75	13	0.002
1–3	0.55	0.47	0.28	0.83	4.36	13	0.001
1–4	0.42	0.40	0.19	0.65	3.94	13	0.002
2–3	0.00	0.23	−0.13	0.14	0.03	13	0.976
2–4	−0.13	0.26	−0.28	0.02	−1.86	13	0.086
3–4	−0.13	0.21	−0.25	−0.01	−2.26	13	0.042

Results of post hoc paired samples t-tests looking into the precision × repetition interaction on N2m source power.

Discussion

The current research used MEG to examine whether prior precisions modulate the cortical dynamics of the making of perceptual inferences. We presented participants with cycles of tone quartets which consisted of three prime tones and one probe tone, where the repetition of the probe tone can reveal the development of perceptual inferences in low and high precision contexts depending on their position within the cycle. We found that the two conditions modulate the N1m source power in a similar manner. However, there was a significant precision × repetition interaction on the N2m source power. While repeated probe tones in low precision context did not exhibit any modulatory effect, repeated probe tones in high precision context were associated with a suppression and a rebound of the N2m source power. The results confirm the necessity to dissociate the processing of non-predicted stimuli of different prior precision (Friston, 2005, 2009; Feldman and Friston, 2010; Arnal and Giraud, 2012; Hsu et al., 2015, 2018; Schröger et al., 2015). Moreover, it is likely that the minimization of prediction error in low and high precision contexts involves distinct mechanisms.

In electrophysiology literature, N1/N1m is known to reflect multiple processes of signaling unspecific changes in the auditory environment (Näätänen and Picton, 1987; Crowley and Colrain, 2004). Mounting evidence of the N1/N1m predictability effect further supports that it indicates the operation of an internal predictive mechanism, as predicted stimuli were associated with robust N1/N1m suppression (Schafer and Marcus, 1973; Schafer et al., 1981; Lange, 2009; Todorovic et al., 2011; Todorovic and de Lange, 2012; SanMiguel et al., 2013; Timm et al., 2013; Hsu et al., 2014a,b, 2016). Our findings confirm previous research by showing that such predictability effects resemble each other in low and high precision contexts, where prediction error is weighted differently. This suggests that changes in N1m source power cannot distinguish between the minimization of prediction error due to prediction formation (as in low precision context) and prediction alteration (as in high precision context). Instead, N1m seems to reflect the overall reduction in prediction error.

According to the predictive coding model of perception, prediction error can be adjusted depending on the precision of the sensory input (Friston, 2005, 2009; Feldman and Friston, 2010). Prediction error is weighted less in low than high precision contexts (Schröger et al., 2015), leading to smaller N1 responses to target tones following random then regular tone sets in EEG (Hsu et al., 2015). We speculate that the difference between low and high precision contexts might be less conspicuous here so that we did not obtain a main effect of precision on the N1m source power in MEG. In particular, in our previous experiment (Hsu et al., 2015), the difference between the two conditions depended on the regularity of their prime tones. That is, stimuli were preceded by either random or rising tones. However, in the current research, the difference between the two conditions depended on their position within the cycle of stimulus presentation. That is, stimuli were presented in either the beginning or the middle of each cycle. Such procedural differences between investigations might influence how much the two conditions differ in prior precision. This possibility should be systematically tested in future research.

Nevertheless, the differentiation between prediction error processing in low and high precision contexts was evident on the N2m source power. Specifically, tones in low precision context triggered smaller source power than tones in high precision context upon the 1st presentation and the 4th presentation. More importantly, stimulus repetition triggered different response patterns in low and high precision contexts. While repeated tones in low precision context did not exhibit any modulatory effect, repeated tones in high precision context were associated with a suppression and a rebound of the N2m source power. Previous neurocomputational modeling already proposed that stimulus repetition results in perceptual learning which might increase the precision weighting of prediction error (Garrido et al., 2009; for a review see Auksztulewicz and Friston, 2016). Our result pattern further suggests that such increase in precision weighting of prediction error can manifest differently at the cortical level depending on its initial precision status.

Specifically, novel probe tones presented in the beginning of each cycle are associated with lower prior precision, as listeners had a general expectation that a probe tone would appear but had little if any idea concerning its frequency. It is possible that, when the initial precision status is low, the modulatory effect of stimulus repetition is limited to the earlier processing stage. On the other hand, novel probe tones presented in the middle of each cycle are associated with higher prior precision, as listeners already formed predictions on its frequency during the previous stimulation within the cycle, but the prediction was not fulfilled. Cortical responses to these novel probe tones resemble more the MMN, which is interpreted as a failure to inhibit prediction error due to deviation from a learned regularity (Friston, 2005; Garrido et al., 2007). Thus, the U-shaped profile on the N2m due to stimulus repetition can be understood as follows. The N2m suppression (from the 1st presentation) is in line with the finding that the MMN vanishes with few stimulus repetitions (Garrido et al., 2009), indicating that the brain can efficiently adjust a perceptual model. Meanwhile, the N2m rebound (toward the 4th presentation) resonates with the finding of enhanced sustained field at around 200 ms to stimulus repetition (Näätänen and Rinne, 2002; Bendixen et al., 2007; Ylinen and Huotilainen, 2007; Recasens et al., 2015). It supports the notion that factors other than mere probability should be considered in order to account for the way perceptual model modification is implemented in the brain (Hsu et al., 2016). For example, there might be a gradual decrease in the bandwidth of the prediction tuning curve in the high precision context (cf. sharpening model for repetition effect; for a review see Grill-Spector et al., 2006). The sparser representation of prediction can paradoxically elicit an increase in prediction error. Alternatively, there might be a build-up of representations in prediction alteration. This can introduce a learning function of escalating sensitization which upweights neuronal responses (Karhu et al., 1997; Barascud et al., 2016; Southwell et al., 2017). Finally, there might be heightened expectations for the onset of a novel pair of tone quartets after the repetition increases in the current pair of tone quartets. Such preparation for change can also account for the rebound in the high precision context. Nevertheless, one should notice that the rebound effect achieved statistical significance at a rather liberal level. Its replicability should be established in future research.

Another possible explanation for the U-shaped profile on the N2m is that, among these brain regions around the auditory cortices, some showed repetition suppression and others showed repetition enhancement. However, due to the distributed nature of the MNE source estimate and the lack of individual MRIs (which increased the coregistration error between MEG and the template MRI), in the current research it is difficult to separate the activity from adjacent brain regions. This possibility would be better addressed with neuroimaging techniques with higher spatial resolution such as functional MRI (fMRI) combined with EEG or MEG in future research.

It is unlikely that the dissociation of probe tones in low and high precision context was due to confounding factors. Admittedly, although measures were taken to prevent participants from learning high-order regularities in the current research, it cannot be excluded that participants might become aware of the stimulus structure (i.e., the probe tones would change after 4–6 repetitions). However, if this happened, participants would expect changes of probe tones in both the low and high precision contexts. Therefore, it would only downsize the difference between low and high precision contexts and cannot account for the difference between conditions reported here. It is also improbable that the dissociation of probe tones in low and high precision contexts resulted from how much the probe tones differ from their preceding tones (i.e., the three prime tones) in terms of frequency. It is because the frequency of these tones was determined by random sampling. The allocation of these tones to low/high precision context was dependent on their position within a cycle (i.e., whether they were presented in the beginning/middle of a cycle) rather than their frequency. Meanwhile, it is noteworthy that in the current research participants were required to maintain their attention on the stimuli. It remains an interesting question whether the result pattern holds without participants’ overt attention.

The dissociation of probe tones in low and high precision contexts is closely related to the mixed results in the literature of repetition-related effects as previously suggested in Hsu et al. (2015). Although repetition-related effects are commonly explained in the language of the Bayesian models of predictive coding (Summerfield et al., 2008), there is a puzzling juxtaposition of repetition enhancement and repetition suppression across fMRI research. While the repetition of unfamiliar stimuli was associated with enhanced neuronal responses, the repetition of familiar stimuli was associated with suppressed neuronal responses (Henson et al., 2000; Fiebach et al., 2005; Gruber and Müller, 2005; Gagnepain et al., 2008; Soldan et al., 2008; Subramaniam et al., 2012; Müller et al., 2013). It was proposed that stimuli of different familiarity differ in whether there is a pre-existing representation (Turk-Browne et al., 2008), which can be understood as the top-down activation of predictions (Cheung and Bar, 2012; Grotheer and Kovács, 2014). The repetition of unfamiliar stimuli (initially associated with no pre-existing representation) would resemble the development of perceptual inferences in low precision context. The repetition of familiar stimuli (initially associated with certain pre-existing representation) would resemble the development of perceptual inferences in high precision context. Interestingly, the dissociation of repetition-related effects in the current research did not manifest as repetition enhancement and repetition suppression. Rather, it was expressed as repetition suppression on N1m followed by a lack of modulatory effect on N2m in low precision context vs. repetition suppression on N1m followed by a U-shaped profile on N2m in high precision context. Future research is needed to develop theories that relate hemeodynamic responses to electrophysiological data.

Statements

Author contributions

Y-FH, FW, and JH designed the research and wrote the article. Y-FH and JH performed the research and analyzed the data.

Funding

This work was supported by Taiwan Ministry of Science and Technology (grant number 105-2410-H-003-145-MY3 and 107-2636-H-003-001) to Y-FH and Department of Psychology at the University of Jyväskylä and Academy of Finland (profiling action “Multilete” #292466) to JH.

Acknowledgments

We thank Mr. Lauri Kantola for assistance with MEG data collection.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1
AhmedM.MälloT.LeppänenP. H.HämäläinenJ.AyräväinenL.RuusuvirtaT.et al. (2011). Mismatch brain response to speech sound changes in rats. Front. Psychol.2:283. 10.3389/fpsyg.2011.00283
2
ArnalL. H.GiraudA. L. (2012). Cortical oscillations and sensory predictions. Trends Cogn. Sci.16, 390–398. 10.1016/j.tics.2012.05.003
3
AstikainenP.StefanicsG.NokiaM.LipponenA.CongF.PenttonenM.et al. (2011). Memory-based mismatch response to frequency changes in rats. PLoS One6:e24208. 10.1371/journal.pone.0024208
4
AuksztulewiczR.BarascudN.CoorayG.NobreA. C.ChaitM.FristonK. J. (2017). The cumulative effects of predictability on synaptic gain in the auditory processing stream. J. Neurosci.37, 6751–6760. 10.1523/JNEUROSCI.0291-17.2017
5
AuksztulewiczR.FristonK. J. (2015). Attentional enhancement of auditory mismatch responses: a DCM/MEG study. Cereb. Cortex25, 4273–4283. 10.1093/cercor/bhu323
6
AuksztulewiczR.FristonK. J. (2016). Repetition suppression and its contextual determinants in predictive coding. Cortex80, 125–140. 10.1016/j.cortex.2015.11.024
7
BarascudN.PearceM. T.GriffithsT. D.FristonK.ChaitM. (2016). Brain responses in humans reveal ideal observer-like sensitivity to complex acoustic patterns. Proc. Natl. Acad. Sci. U S A113, E616–E625. 10.1073/pnas.1508523113
8
BendixenA.RoeberU.SchrögerE. (2007). Regularity extraction and application in dynamic auditory stimulus sequences. J. Cogn. Neurosci.19, 1664–1677. 10.1162/jocn.2007.19.10.1664
9
BendixenA.SanMiguelI.SchrögerE. (2012). Early electrophysiological indicators for predictive processing in audition: a review. Int. J. Psychophysiol.83, 120–131. 10.1016/j.ijpsycho.2011.08.003
10
CheungO. S.BarM. (2012). Visual prediction and perceptual expertise. Int. J. Psychophysiol.83, 156–163. 10.1016/j.ijpsycho.2011.11.002
11
ClarkA. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav. Brain Sci.36, 181–204. 10.1017/s0140525x12000477
12
CrowleyK. E.ColrainI. M. (2004). A review of the evidence for P2 being an independent component process: age, sleep and modality. Clin. Neurophysiol.115, 732–744. 10.1016/j.clinph.2003.11.021
13
EgnerT.MontiJ. M.SummerfieldC. (2010). Expectation and surprise determine neural population responses in the ventral visual stream. J. Neurosci.30, 16601–16608. 10.1523/JNEUROSCI.2770-10.2010
14
FardoF.AuksztulewiczR.AllenM.DietzM. J.RoepstorffA.FristonK. J. (2017). Expectation violation and attention to pain jointly modulate neural gain in somatosensory cortex. Neuroimage153, 109–121. 10.1016/j.neuroimage.2017.03.041
15
FarleyB. J.QuirkM. C.DohertyJ. J.ChristianE. P. (2010). Stimulus-specific adaptation in auditory cortex is an NMDA-independent process distinct from the sensory novelty encoded by the mismatch negativity. J. Neurosci.30, 16475–16484. 10.1523/JNEUROSCI.2793-10.2010
16
FeldmanH.FristonK. J. (2010). Attention, uncertainty, and free-energy. Front. Hum. Neurosci.4:215. 10.3389/fnhum.2010.00215
17
FiebachC. J.GruberT.SuppG. G. (2005). Neuronal mechanisms of repetition priming in occipitotemporal cortex: spatiotemporal evidence from functional magnetic resonance imaging and electroencephalography. J. Neurosci.25, 3141–3422. 10.1523/JNEUROSCI.4107-04.2005
18
FishmanY. I.SteinschneiderM. (2012). Searching for the mismatch negativity in primary auditory cortex of the awake monkey: deviance detection or stimulus specific adaptation?Front. Genet.32, 15747–15758. 10.1523/JNEUROSCI.2835-12.2012
19
FristonK. (2005). A theory of cortical responses. Philos. Trans. R. Soc. Lond. B Biol. Sci.360, 815–836. 10.1098/rstb.2005.1622
20
FristonK. (2009). The free-energy principle: a rough guide to the brain?Trends Cogn. Sci.13, 293–301. 10.1016/j.tics.2009.04.005
21
GagnepainP.ChételatG.LandeauB.DayanJ.EustacheF.LebretonK. (2008). Spoken word memory traces within the human auditory cortex revealed by repetition priming and functional magnetic resonance imaging. J. Neurosci.28, 5281–5289. 10.1523/JNEUROSCI.0565-08.2008
22
GarridoM. I.KilnerJ. M.KiebelS. J.FristonK. J. (2007). Evoked brain responses are generated by feedback loops. Proc. Natl. Acad. Sci. U S A104, 20961–20966. 10.1073/pnas.0706274105
23
GarridoM. I.KilnerJ. M.KiebelS. J.StephanK. E.BaldewegT.FristonK. J. (2009). Repetition suppression and plasticity in the human brain. Neuroimage48, 269–279. 10.1016/j.neuroimage.2009.06.034
24
GarridoM. I.SahaniM.DolanR. J. (2013). Outlier responses reflect sensitivity to statistical structure in the human brain. PLoS Comput. Biol.9:e1002999. 10.1371/journal.pcbi.1002999
25
Grill-SpectorK.HensonR. N. A.MartinA. (2006). Repetition and the brain: neural models of stimulus-specific effects. Trends Cogn. Sci.10, 14–23. 10.1016/j.tics.2005.11.006
26
GrotheerM.KovácsG. (2014). Repetition probability effects depend on prior experiences. J. Neurosci.34, 6640–6646. 10.1523/JNEUROSCI.5326-13.2014
27
GruberT.MüllerM. M. (2005). Oscillatory brain activity dissociates between associative stimulus content in a repetition priming task in the human EEG. Cereb. Cortex15, 109–116. 10.1093/cercor/bhh113
28
HensonR.ShalliceT.DolanR. (2000). Neuroimaging evidence for dissociable forms of repetition priming. Science287, 1269–1272. 10.1126/science.287.5456.1269
29
HsuY. F.HämäläinenJ. A.WaszakF. (2014a). Repetition suppression comprises both attention-independent and attention-dependent processes. Neuroimage98, 168–175. 10.1016/j.neuroimage.2014.04.084
30
HsuY. F.HämäläinenJ. A.WaszakF. (2014b). Both attention and prediction are necessary for adaptive neuronal tuning in sensory processing. Front. Hum. Neurosci.8:152. 10.3389/fnhum.2017.00255
31
HsuY. F.HämäläinenJ. A.WaszakF. (2016). The auditory N1 suppression rebounds as prediction persists over time. Neuropsychologia84, 198–204. 10.1016/j.neuropsychologia.2016.02.019
32
HsuY.-F.HämäläinenJ. A.WaszakF. (2018). The processing of mispredicted and unpredicted sensory inputs interact differently with attention. Neuropsychologia111, 85–91. 10.1016/j.neuropsychologia.2018.01.034
33
HsuY. F.Le BarsS.HämäläinenJ. A.WaszakF. (2015). Distinctive representation of mispredicted and unpredicted prediction errors in human electroencephalography. J. Neurosci.35, 14653–14660. 10.1523/JNEUROSCI.2204-15.2015
34
JacobsenT.SchrögerE. (2001). Is there pre-attentive memory-based comparison of pitch?Psychophysiology38, 723–727. 10.1111/1469-8986.3840723
35
KaliukhovichD. A.VogelsR. (2014). Neurons in macaque inferior temporal cortex show no surprise response to deviants in visual oddball sequences. J. Neurosci.34, 12801–12815. 10.1523/JNEUROSCI.2154-14.2014
36
KarhuJ.HerrgårdE.PääkkönenA.LuomaL.AiraksinenE.PartanenJ. (1997). Dual cerebral processing of elementary auditory input in children. Neuroreport8, 1327–1330. 10.1097/00001756-199704140-00002
37
LangeK. (2009). Brain correlates of early auditory processing are attenuated by expectations for time and pitch. Brain Cogn.69, 127–137. 10.1016/j.bandc.2008.06.004
38
MüllerN. G.StrumpfH.ScholzM.BaierB.MelloniL. (2013). Repetition suppression versus enhancement: it’s quantity that matters. Cereb. Cortex23, 315–322. 10.1093/cercor/bhs009
39
NakamuraT.MichieP. T.FulhamW. R.ToddJ.BuddT. W.SchallU.et al. (2011). Epidural auditory event-related potentials in rat to frequency and duration deviants: evidence of mismatch negativity?Front. Psychol.2:367. 10.3389/fpsyg.2011.00367
40
NäätänenR.JacobsenT.WinklerI. (2005). Memory-based or afferent processes in mismatch negativity (MMN): a review of the evidence. Psychophysiology42, 25–32. 10.1111/j.1469-8986.2005.00256.x
41
NäätänenR.PictonT. (1987). The N1 wave of the human electric and magnetic response to sound: a review and an analysis of the component structure. Psychophysiology24, 375–425. 10.1111/j.1469-8986.1987.tb00311.x
42
NäätänenR.RinneT. (2002). Electric brain response to sound repetition in humans: an index of long-term-memory-trace formation?Neurosci. Lett.318, 49–51. 10.1016/s0304-3940(01)02438-7
43
RansomM.FazelpourS.MoleC. (2017). Attention in the predictive mind. Conscious. Cogn.47, 99–112. 10.1016/j.concog.2016.06.011
44
RaoR. P.BallardD. H. (1999). Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat. Neurosci.2, 79–87. 10.1038/4580
45
RecasensM.LeungS.GrimmS.NowakR.EsceraC. (2015). Repetition suppression and repetition enhancement underlie auditory memory-trace formation in the human brain: an MEG study. Neuroimage108, 75–86. 10.1016/j.neuroimage.2014.12.031
46
SanMiguelI.ToddJ.SchrögerE. (2013). Sensory suppression effects to self-initiated sounds reflect the attenuation of the unspecific N1 component of the auditory ERP. Psychophysiology50, 334–343. 10.1111/psyp.12024
47
SchaferE. W.AmochaevA.RussellM. J. (1981). Knowledge of stimulus timing attenuates human evoked cortical potentials. Electroencephalogr. Clin. Neurophysiol.52, 9–17. 10.1016/0013-4694(81)90183-8
48
SchaferE. W.MarcusM. M. (1973). Self-stimulation alters human sensory brain responses. Science181, 175–177. 10.1126/science.181.4095.175
49
SchrögerE.MarzecováA.SanMiguelI. (2015). Attention and prediction in human audition: a lesson from cognitive psychophysiology. Eur. J. Neurosci.41, 641–664. 10.1111/ejn.12816
50
SoldanA.ZarahnE.HiltonH. J.SternY. (2008). Global familiarity of visual stimuli affects repetition-related neural plasticity but not repetition priming. Neuroimage39, 515–526. 10.1016/j.neuroimage.2007.08.011
51
SouthwellR.BaumannA.GalC.BarascudN.FristonK.ChaitM. (2017). Is predictability salient? A study of attentional capture by auditory patterns. Philos. Trans. R. Soc. Lond. B Biol. Sci.372:20160105. 10.1098/rstb.2016.0105
52
SubramaniamK.FaustM.BeemanM.MashalN. (2012). The repetition paradigm: enhancement of novel metaphors and suppression of conventional metaphors in the left inferior parietal lobe. Neuropsychologia50, 2705–2719. 10.1016/j.neuropsychologia.2012.07.020
53
SummerfieldC.TrittschuhE. H.MontiJ. M.MesulamM. M.EgnerT. (2008). Neural repetition suppression reflects fulfilled perceptual expectations. Nat. Neurosci.11, 1004–1006. 10.1038/nn.2163
54
TadelF.BailletS.MosherJ. C.PantazisD.LeahyR. M. (2011). Brainstorm: a user friendly application for MEG/EEG analysis. Comput. Intell. Neurosci.2011:879716. 10.1155/2011/879716
55
TauluS.SimolaJ.KajolaM. (2005). Applications of the signal space separation method. IEEE Trans. Signal Process.53, 3359–3372. 10.1109/tsp.2005.853302
- CrossRef
- Google Scholar
56
TimmJ.SanMiguelI.SaupeK.SchrögerE. (2013). The N1-suppression effect for self-initiated sounds is independent of attention. BMC Neurosci.14:2. 10.1186/1471-2202-14-2
57
TodorovicA.de LangeF. P. (2012). Repetition suppression and expectation suppression are dissociable in time in early auditory evoked fields. J. Neurosci.32, 13389–13395. 10.1523/JNEUROSCI.2227-12.2012
58
TodorovicA.van EdeF.MarisE.de LangeF. P. (2011). Prior expectation mediates neural adaptation to repeated sounds in the auditory cortex: an MEG study. J. Neurosci.31, 9118–9123. 10.1523/JNEUROSCI.1425-11.2011
59
Turk-BrowneN. B.SchollB. J.ChunM. M. (2008). Babies and brains: habituation in infant cognition and functional neuroimaging. Front. Hum. Neurosci.2:16. 10.3389/neuro.09.016.2008
60
YlinenS.HuotilainenM. (2007). Is there a direct neural correlate for memory-trace formation in audition?Neuroreport18, 1281–1284. 10.1097/wnr.0b013e32826fb38a

Summary

Keywords

predictive coding, prediction error, auditory perception, repetition, magnetoencephalography (MEG)

Citation

Hsu Y-F, Waszak F and Hämäläinen JA (2019) Prior Precision Modulates the Minimization of Auditory Prediction Error. Front. Hum. Neurosci. 13:30. doi: 10.3389/fnhum.2019.00030

Received

03 August 2018

Accepted

21 January 2019

Published

15 February 2019

Volume

13 - 2019

Edited by

Felix Blankenburg, Freie Universität Berlin, Germany

Reviewed by

Cheryl Olman, University of Minnesota Twin Cities, United States; Ryszard Auksztulewicz, City University of Hong Kong, Hong Kong

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yi-Fang Hsu yi-fang.hsu@cantab.net

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Sensory Neuroscience

ORIGINAL RESEARCH article

Prior Precision Modulates the Minimization of Auditory Prediction Error

Abstract

Introduction