Impact Factor 3.169 | CiteScore 5.1
More on impact ›


Front. Hum. Neurosci., 11 December 2017 |

Commentary: Musicians' Online Performance during Auditory and Visual Statistical Learning Tasks

Federica Menchinelli*, Petra M. J. Pollux and Simon J. Durrant
  • Psychology, University of Lincoln, Lincoln, United Kingdom

A commentary on
Musicians' Online Performance during Auditory and Visual Statistical Learning Tasks

by Mandikal Vasuki, P. R., Sharma, M., Ibrahim, R. K., and Arciuli, J. (2017). Front. Hum. Neurosci. 11:114. doi: 10.3389/fnhum.2017.00114

Statistical learning (SL) is the extraction of the underlying statistical structure from sensory input (Frost et al., 2015). The extent to which this ability is domain-general (with a single central mechanism underpinning SL in any modality) or domain-specific (where the SL mechanism differs by modality) remains a central question in statistical learning (Frost et al., 2015), and two approaches have been adopted to tackle this. First is to examine the extent to which predominantly domain-specific skills such as language proficiency (Arciuli and von Koss Torkildsen, 2012) and musical expertise (Schön and François, 2011), and domain-general skills such as working memory and general IQ (Siegelman and Frost, 2015), correlate with SL ability. Second is to compare SL performance across modalities, or even examine cross-modal transfer (Durrant et al., 2016).

Mandikal Vasuki et al. (2017) (and the sister paper: Mandikal Vasuki et al., 2016) make an important contribution by adopting both of these approaches. They compare auditory and visual SL using the Saffran triplet learning paradigm (Saffran et al., 1999) in musicians and non-musicians. The three key findings are that musicians are better than non-musicians at segmentation of auditory stimuli only, there is no correlation between auditory and visual performance, and that auditory performance is better overall. This last result could be due to privileged auditory processing of sequential stimuli (Conway et al., 2009), or it could just reflect differences in perceptual or memory capabilities across modalities. However, the fact that SL performance in one modality does not predict performance in another is hard to explain if a single mechanism underlying both is posited. Combined with the fact that overall better performance was found in musicians only in the auditory modality, a domain-specific SL mechanism seems to offer the most parsimonious explanation of this data.

One of the key strengths of this study is the unusual choice to record ERPs. Behavioral measures of learning during passive exposure are problematic—especially if the nature of the stimuli is to remain hidden from participants—so ERP recording allows online measurement of learning performance during exposure, and provides insight into the underlying mechanism. In keeping with the behavioral results, differences in the N1 and N400 triplet onset effects between musicians and non-musicians were seen only for the auditory stimuli, while the N400 was not seen at all for visual stimuli. These could suggests a neural mechanism for auditory statistical learning different to that of visual statistical learning, but without source localization based on more electrodes, this remains speculative.

ERP data also provides insight into the time course of learning. Thanks to this method we know that an advantage of musicians in auditory SL is that they are “fast learners”; they begin segmentation of the stimulus stream from earlier in the exposure. This difference could not have been detected behaviorally. It would have been interesting to also see the difference in ERP responses to correct and incorrect triplets in the behavioral task and this is certainly worth including in future reports. In addition, there are large individual differences in SL (Siegelman and Frost, 2015), hence ERPs of participants with widely varying performance is therefore potentially of great interest and exclusion based on behavior should be limited. In the present study, only a small number of participants were excluded so this was not a major problem, but in future ERP studies we would caution against the use of the relatively narrow outlier exclusion criteria (±2 SD) seen here.

The present study offers into statistical learning across modalities, but key questions remain, including the fidelity of SL (how accurately are specific transition probabilities learned) and the order of SL (can higher-order transitions be effectively learned in a short exposure). The triplet learning paradigm is unable to provide insight into either of those questions because it mixes first- and second-order transitions and does not sample a range of probabilities. Other approaches such as the transition matrix paradigm (Durrant et al., 2011), by allowing precise control of the transition order and the transition probabilities, may be more suitable to answer these questions, especially if combined with ERP measurements.

Another important limitation of the triplet paradigm, which is particularly relevant for this study, is the role of prior preferences for particular triplets. Probably all participants will have had extensive exposure to Western tonal music, which results in the development of cognitive schemata (Krumhansl, 1990) reflecting tone distribution statistics in Western tonal music (Knophoff and Hutchinson, 1983). These are acquired in early childhood through passive exposure (Speer and Meeks, 1985), and generate expectations of tones in a sequence (Bharucha, 1994). Saffran et al. (1999) attempted to counteract this by using a two-language crossover design and avoiding stereotypical patterns within the triplets. Their results showed a preference for particular triplets within both languages which may reflect prior exposure to Western tonal music and which is much stronger than the effect of short-term exposure within the experiment (Hazan et al., 2008). The present study used only Saffran's Language 1, and these triplet preferences based on prior musical exposure might explain the difference between musicians and non-musicians in the auditory domain. Future studies should ideally measure prior preference of triplets and potentially try to control them through the use of non-Western scales such as the Bohlen-Pierce scale (Durrant et al., 2011).

Combining auditory and visual SL with a comparison of musicians and non-musicians is the main contribution of this paper. The results of this study may be interpreted as evidence of a domain-specific component to SL in keeping with other findings (Conway and Christiansen, 2006) but alternative accounts suggest that a domain-general component is equally possible (Thiessen, 2011). Future investigations could use more sophisticated instruments such as the Gold MSI (Müllensiefen et al., 2014), to look for effects on specific subscales of musical experience, to better understand why musicians have an advantage in the auditory modality in particular. The present study is an important first step toward this.

Author Contributions

FM, PP, and SD conceived the ideas in the article, discussed the specific arguments to be presented, and wrote the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


We are grateful for the detailed, carefully considered and stimulating comments from the reviewer.


Arciuli, J., and von Koss Torkildsen, J. (2012). Advancing our understanding of the link between statistical learning and language acquisition: the need for longitudinal data. Front. Psychol. 3:324. doi: 10.3389/fpsyg.2012.00324

PubMed Abstract | CrossRef Full Text | Google Scholar

Bharucha, J. J. (1994). “Tonality and expectation,” in Musical Perceptions, eds R. Aiello and J. A. Sloboda (New York, NY: Oxford University Press), 213–239.

Google Scholar

Conway, C. M., and Christiansen, M. H. (2006). Statistical learning within and between modalities: pitting abstract against stimulus-specific representations. Psychol. Sci. 17, 905–912. doi: 10.1111/j.1467-9280.2006.01801.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Conway, C. M., Pisoni, D. B., and Kronenberger, W. G. (2009). The importance of sound for cognitive sequencing abilities: the auditory scaffolding hypothesis. Curr. Dir. Psychol. Sci. 18, 275–279. doi: 10.1111/j.1467-8721.2009.01651.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Durrant, S. J., Cairney, S. A., and Lewis, P. A. (2016). Cross-modal transfer of statistical information benefits from sleep. Cortex 78, 85–98. doi: 10.1016/j.cortex.2016.02.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Durrant, S. J., Taylor, C., Cairney, S., and Lewis, P. A. (2011). Sleep-dependent consolidation of statistical learning. Neuropsychologia 49, 1322–1331. doi: 10.1016/j.neuropsychologia.2011.02.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Frost, R., Armstrong, B. C., Siegelman, N., and Christiansen, M. H. (2015). Domain generality versus modality specificity: the paradox of statistical learning. Trends Cogn. Sci. 19, 117–125. doi: 10.1016/j.tics.2014.12.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Hazan, A., Holonowicz, P., Salselas, I., Knast, A., Durrant, S. J., Herrera, P., et al. (2008). “Modeling the acquisition of statistical regularities in tone sequences,” in 30th Annual Meeting of the Cognitive Science Society. Available online at:

Google Scholar

Knophoff, L., and Hutchinson, W. (1983). Entropy as a measure of style: the influence of sample length. J. Music Theor. 27, 75–97.

Google Scholar

Krumhansl, C. L. (1990). Cognitive Foundations of Musical Pitch (Oxford Psychology Series). New York, NY: Oxford University Press.

Google Scholar

Mandikal Vasuki, P. R., Sharma, M., Demuth, K., and Arciuli, J. (2016). Musicians' edge: a comparison of auditory processing, cognitive abilities and statistical learning. Hear. Res. 342, 112–123. doi: 10.1016/j.heares.2016.10.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Mandikal Vasuki, P. R., Sharma, M., Ibrahim, R. K., and Arciuli, J. (2017). Musicians' online performance during auditory and visual statistical learning tasks. Front. Hum. Neurosci. 11:114. doi: 10.3389/fnhum.2017.00114

PubMed Abstract | CrossRef Full Text | Google Scholar

Müllensiefen, D., Gingras, B., Musil, J., and Stewart, L. (2014). The musicality of non-musicians: an index for assessing musical sophistication in the general population. PLoS ONE 9:e89642. doi: 10.1371/journal.pone.0089642

PubMed Abstract | CrossRef Full Text | Google Scholar

Saffran, J. R., Johnson, E. K., Aslin, R. N., and Newport, E. L. (1999). Statistical learning of tone sequences by human infants and adults. Cognition 70, 27–52. doi: 10.1016/S0010-0277(98)00075-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Schön, D., and François, C. (2011). Musical expertise and statistical learning of musical and linguistic structures. Front. Psychol. 2:167. doi: 10.3389/fpsyg.2011.00167

PubMed Abstract | CrossRef Full Text | Google Scholar

Siegelman, N., and Frost, R. (2015). Statistical learning as an individual ability: theoretical perspectives and empirical evidence. J. Mem. Lang. 81, 105–120. doi: 10.1016/j.jml.2015.02.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Speer, J. R., and Meeks, P. U. (1985). School children's perception of pitch in music. Psychomusicology 5, 49–56.

Google Scholar

Thiessen, E. D. (2011). Domain general constraints on statistical learning. Child Dev. 82, 462–470. doi: 10.1111/j.1467-8624.2010.01522.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: auditory statistical learning, visual statistical learning, ERPs (Event-Related Potentials), triplet learning, musicians and non-musicians

Citation: Menchinelli F, Pollux PMJ and Durrant SJ (2017) Commentary: Musicians' Online Performance during Auditory and Visual Statistical Learning Tasks. Front. Hum. Neurosci. 11:603. doi: 10.3389/fnhum.2017.00603

Received: 11 August 2017; Accepted: 27 November 2017;
Published: 11 December 2017.

Edited by:

Carol Seger, Colorado State University, United States

Reviewed by:

Lauren K. Slone, Indiana University Bloomington, United States

Copyright © 2017 Menchinelli, Pollux and Durrant. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Federica Menchinelli,