Is the auditory system cognitively penetrable?

Brogaard, Berit; Gatzia, Dimitria  Electra

doi:10.3389/fpsyg.2015.01166

OPINION article

Front. Psychol., 11 August 2015

Sec. Cognition

Volume 6 - 2015 | https://doi.org/10.3389/fpsyg.2015.01166

Is the auditory system cognitively penetrable?

1. The Brogaard Lab for Multisensory Research, University of Miami Miami, FL, USA
2. Department of Philosophy, University of Oslo Oslo, Norway
3. Department of Philosophy, The University of Akron Wayne College Akron, OH, USA

According to the hierarchical model of sensory information processing, sensory inputs are transmitted to cortical areas, which are crucial for complex auditory and speech processing, only after being processed in subcortical areas (Hickok and Poeppel, 2007; Rauschecker and Scott, 2009). However, studies using electroencephalography (EEG) indicate that distinguishing simultaneous auditory inputs involves a widely distributed neural network, including the medial temporal lobe, which is essential for declarative memory, and posterior association cortices (Alain et al., 2001; Squire et al., 2004). More recent studies have even demonstrated plasticity of auditory signals as low as the brainstem (Suga, 2008). Collectively, studies suggest that the functional architecture of perceptual processing involves primarily top-down modulation (Suga et al., 2002; Gilbert and Li, 2013; Chandrasekaran et al., 2014). Top-down influences exerted throughout the auditory systems (Lotto and Holt, 2011) include: memory (Goldinger, 1998)¹, attention (Choi et al., 2014), which has been found to modulate auditory encoding in the cochlea, a subcortical area (Maison et al., 2001), (prior) knowledge of syntax or words (Ganong, 1980; Warren, 1984)², and experience-based expectations pertaining to the speaker's accent (Deutsch, 1996; Deutsch et al., 2004; Irino and Patterson, 2006), gender (Johnson et al., 1999), and vocal folds or tract (Irino and Patterson, 2002; Patterson and Johnsrude, 2008).

While a great deal has been written about the issue of cognitive penetrability in the case of vision, audition has received almost no attention. For example, a corresponding body of evidence for top-down modulation in vision has been used to undermine the Cognitive Impenetrability Thesis (CIT) (see Macpherson, 2012; Siegel, 2012; Wu, 2013; Cecchi, 2014). Brogaard and Gatzia (in press) have argued that top-down modulation on visual processes involving prior-knowledge, experience based expectation, or memory do not threaten the CIT, even after acknowledging that such influences are cognitive in nature (see also Pylyshyn, 1999; Raftopoulos, 2001). The reason is that such top-down influences, although cognitive in nature, are distinct from discursive thoughts that stand in a semantically-coherent relation to the phenomenology or content of experience, for instance, thoughts proceeding by argumentation or reasoning rather than by intuition or implicit hypothesis internal to the visual system³. If we insisted that instances of top-down modulation be counted as instances of cognitive penetration, the debate about cognitive penetrability would be trivial and, hence, unmotivated since studies clearly indicate that such top-down modulation in visual (or auditory) perception is extensive. A similar argument can be made in the case of audition.

The CIT has traditionally been understood as a semantic thesis. Accordingly, the information a system computes is not sensitive (in a semantically-coherent way) to one's cognitive states and cannot be altered in a way that bears a logical relation to one's knowledge or reasons (Pylyshyn, 1984, 1999; Raftopoulos, 2009). For example, suppose that you experience a sound as /da-da/ and that causes you to form the belief that the sound is /da-da/. In this case, your belief and your auditory experience are semantically coherent: they have roughly the same content. Suppose now that you acquire the belief that the sound is in fact /ba-ba/ (say, because you have now come to believe that the Cartesian evil genius has made you hear it as /da-da/ when it is in fact a /ba-ba/ sound). According to the semantic thesis, your newly acquired belief, for which you may have ample justification, cannot alter the content computed by your auditory system; you will continue to experience the sound as /da-da/ despite that you have come to believe that it is /ba-ba/. Some proponents of the semantic thesis have argued that changes to the information a system computes are attributed to intra-perceptual principles that do not conform to standard tenets of rationality, such as standard rules of logic, probability theory and statistics, or rational choice theory (Brogaard and Gatzia, in press).

Undermining the CIT requires demonstrating that changes in the phenomenology of one's auditory perception are due to the listener's discursive or rational thoughts that stand in the right sort of semantic relation to her experience. So it is not enough that discursive thoughts influence experience; they must do so in a semantically-coherent way. Consider ventriloquism, for example. Suppose that I believe that the puppet is not actually producing the sounds (the person holding the puppet is) but I nevertheless hear the speech as coming from the puppet's mouth. In this case, the content of my belief differs from the content as my auditory experience. Now suppose that my discursive thoughts about what really goes on in the case of ventriloquism gives rise to a stress reaction in me (for some reason) and that this mood (the stress) changes the content of my experience: I no longer hear the speech as coming from the puppet. In this case, it may appear that my discursive thoughts have changed my auditory experience in a semantically-coherent way: my belief and my experience now have the same content. However, by hypothesis, it is the mood, not my beliefs, that changed my auditory experience. Since moods, unlike beliefs, have no contents, the stress (a mood) cannot have the same content as either my belief or my auditory experience. The content of my experience has thus changed but not in a semantically-coherent way. This semantic-coherence has to be involved in every step of the process for changes in phenomenology to threaten the CIT. For example, if my belief that the puppet is not actually producing the sounds were to cause me to no longer experience the speech as coming from the puppet via a chain of logically related processes, then the content of my belief would have changed the content of my experience in a semantically coherent-way. Such a case would indeed threaten the CIT.

Additionally, cases that involve the indirect influencing of auditory experience by beliefs (or discursive thoughts) need not threaten the CIT. For example, Fodor (1988) jokingly said that his heart is cognitively “penetrated” by his intention to do calisthenics since it results in doing calisthenics, resulting in his heart rate increasing. What this joke illustrates is that the locution “receives input from” is not transitive, meaning that it is not the case that if a process B receives input from A, and C received input from B that C receives input from A since it is possible that none of B's outputs that were responses to inputs from A affected C (Lyons, 2015).

Cases of perceptual learning involve such indirect influencing of auditory perception. Typically, perceptual learning refers to the brain's plasticity, i.e., the gradual structural or functional changes in the connectivity of sensory systems resulting from training consisting of repeated exposure to particular stimuli (Roelfsema et al., 2010). However, the competition between verbal and implicit systems (COVIS) model suggests a dual-system framework, according to which learners, in information-integration tasks, initially use the reflective (rule-based) system, but switch to the reflexive (information-integration) system with practice (Maddox et al., 2013; Valentin et al., 2014)⁴. The fact that the reflective system is mediated by the prefrontal cortex and involves hypothesis testing by the learner seems to suggest that at least some cases of perceptual learning may constitute cases of cognitive penetration. This conclusion, however, is too hasty. The reflexive system is viewed as indirect and procedural: trial feedbacks reinforce associations of stimuli located in different regions of perceptual space with specific motor outputs (Maddox et al., 2013). It follows that the changes in auditory phenomenology associated with the reflective system result indirectly from the brain's plasticity, not directly from the listener's discursive thoughts (in a semantically-coherent way). Perceptual learning, therefore, need not threaten the CIT, provided that the changes in phenomenology result indirectly from changes in the brain's plasticity, which cannot be attributed to the listener's discursive thoughts.

Auditory illusions are useful tools to illustrate the inability of our discursive thoughts to alter the phenomenology of our auditory experience in a semantically-coherent way. One example is the tritone illusion. Deutsch (2007) presented listeners with two tones in succession that are opposite in the positions along the pitch class space such as G# followed by D or C followed by F#, which comprised an interval of six semitones (known as tritone). When one of the pairs was played (say, G# followed by D) some of the listeners heard a descending pattern while others heard an ascending pattern. However, when another pair was played (say, C followed by F#) listeners who had previously heard a descending pattern now heard an ascending one and vice versa. The tritone illusion varies in correlation with the accent of the speaker. For example, while Californians tended to hear the pattern as ascending, Britons tended to hear it as descending (Deutsch, 1991). A considerable difference was also observed between mothers who had grown up in widely different geographical regions. Perhaps not surprisingly, significant similarities were observed among these mothers and their children, even though the children had not grown up in the same geographic regions as their mothers (Deutsch, 1996).

The tritone illusion persists even after listeners are informed that the two tones in succession are opposite in the positions along the pitch class space, indicating that their discursive thoughts cannot alter the phenomenology of their auditory experiences. What one hears depends on the configuration of one's auditory system, which is, among other things, subject to developmental influences (Deutsch et al., 2004). However, top-down modulation caused by adaptation- or development-based knowledge, experience-based expectation, memory, or attention are consistent with the claim that auditory perception is not cognitively penetrable, at least not in any interesting sense, as the changes in phenomenology cannot plausibly be attributed to the listener's discursive thoughts.

Another example is the McGurk illusion, which arises when auditory speech cues are presented in synchrony with incongruent visual speech cues (McGurk and MacDonald, 1976). For example, when the auditory syllable “ba” is presented in synchrony with a speaker mouthing “ga,” subjects typically report hearing “da.” However, when the auditory syllable “ga” is presented in synchrony with a speaker mouthing “ba,” subjects typically report hearing “bga”⁵. As with the tritone illusion, the McGurk illusion persists even after subjects are informed that the auditory syllable is “ba” in the first case and “ga” in the second. Windmann (2004) found that the clarity and, to some extent, the probability of the illusion was significantly influenced by the listener's experience-based expectations, which do not threaten the CIT for the same reason: the information the system computes is not altered by the listener's discursive thoughts.

It may nevertheless be objected that other cases such as sine wave speech appear to threaten the CIT since they seem to involve changes in phenomenology which can be attributed to subject's discursive thoughts⁶. For example, naive listeners tend to hear sine wave speech as tones or whistles, rather than speech. After being familiarized with the linguistic message, however, many listeners readily hear sine wave as speech (Sheffert et al., 2002). However, it is not clear, in this case, whether it is the listener's beliefs that cause a change in her experience. For example, it could be that such cases involve cognitive penetration if the listener's belief about the content of the linguistic message were to alter (in a semantically-coherent way) the phenomenology of the listener's experience. Or, it could be that the listener is still hearing the same tones or whistles but interprets them on the basis of the newly acquired knowledge of the linguistic message. The more likely explanation is that it is a case of normalization based on experience-based expectation given that the listener comes to understand sine wave speech only after learning its linguistic message. So it seems that the expectation that the sound has the linguistic message the listener expects it to have is what is doing all the work. Indeed, studies suggest that listeners use a range of information regarding the speaker, including the speaker's supposed nationality (Niedzielski, 1999), to create a frame of reference to be used during perception in order to normalize what is heard. In other words, listeners utilize adaptation- or development-based knowledge, experience-based expectation, memory, or attention to make sense of speech. However, as we have argued, such changes in phenomenology cannot plausibly be attributed to the listeners' discursive thoughts (at least not in a semantically-coherent way) and, thus, do not threaten the CIT.

Statements

Acknowledgments

We would like to thank an anonymous referee for invaluable comments.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

1.^It has been suggested that the mechanism underlying auditory restoration (the auditory system's ability to compensate for expected missing sounds, see Warren, 1984) involves episodic memory, which involves memory traces left by an experience that are activated, according to the similarity with the stimulus, when a new stimulus such as a word is heard (see Goldinger, 1998).

2.^As the Ganong effect illustrates, phonemes such as /t/ or /d/ tend to be heard as /t/ when followed by “ask” to form “task” but as /d/ when followed by “usk” to form “dusk.”

3.^Constancy computations, for example, are not obligatorily linked to experiencing sensibles and may precede it (Kentridge et al., 2014).

4.^We thank an anonymous reviewer for helpful comments on the issue of perceptual learning.

5.^Here too it is due to the non-transitivity of the locution “receives input from” that we cannot say that auditory processing is cognitively penetrated by visual processing (see Lyons, 2015).

6.^We thank an anonymous reviewer for posing this question.

References

1
AlainC.ArnottS. R.PictonT. W. (2001). Bottom-up and top-down influences on auditory scene analysis: evidence from event-related brain potentials. J. Exp. Psychol. Hum. Percept. Perform.27, 1072–1089. 10.1037/0096-1523.27.5.1072
2
BrogaardB.GatziaD. E. (in press). Color cognitive penetrability. Top. Cogn. Sci.
- Google Scholar
3
CecchiA. S. (2014). Cognitive penetration, perceptual learning, and neural plasticity. Dialectica68, 63–95. 10.1111/1746-8361.12051
- CrossRef
- Google Scholar
4
ChandrasekaranB.SkoeE.KrausN. (2014). An integrative model of subcortical auditory plasticity. Brain Topogr.27, 539–552. 10.1007/s10548-013-0323-9
5
ChoiI.WangL.BharadwajH.Shinn-CunninghamB. (2014). Individual differences in attentional modulation of cortical responses correlate with selective attention performance. Hear. Res.314, 10–19. 10.1016/j.heares.2014.04.008
6
DeutschD. (1991). The tritone paradox: an influence of language on music perception. Music Percept.8, 335–347. 10.2307/40285517
- CrossRef
- Google Scholar
7
DeutschD. (1996). Mothers and their children hear a musical illusion in strikingly similar ways. J. Acoust. Soc. Am.99, 2482. 10.1121/1.415579
- CrossRef
- Google Scholar
8
DeutschD. (2007). Music perception. Front. Biosci.12, 4473–4482. 10.2741/2402
9
DeutschD.HenthornT.DolsonM. (2004). Speech patterns heard early in life influence later perception of the tritone paradox. Music Percept.21, 357–372. 10.1525/mp.2004.21.3.357
- CrossRef
- Google Scholar
10
FodorJ. A. (1988). A reply to Churchland's “Perceptual plasticity and theory neutrality.”Philos. Sci.55, 188–194. 10.1086/289426
- CrossRef
- Google Scholar
11
GanongW. F.III. (1980). Phonetic categorization in auditory word perception. J. Exp. Psychol. Hum. Percept. Perform.6, 110–125. 10.1037/0096-1523.6.1.110
12
GilbertC. D.LiW. (2013). Top-down influences on visual processing. Nat. Rev. Neurosci.14, 351. 10.1038/nrn3476
- CrossRef
- Google Scholar
13
GoldingerS. D. (1998). Echoes of echoes? An episodic theory of lexical access. Psychol. Rev.105, 251–279. 10.1037/0033-295X.105.2.251
14
HickokG.PoeppelD. (2007). The cortical organization of speech processing. Nat. Rev. Neurosci.8, 393–402. 10.1038/nrn2113
15
IrinoT.PattersonD. (2002). Segregation information about the size and shape of the vocal tract using a time-domain auditory model: the stabilised wavelet-Mellin transform. Speech Commun.36, 181–203. 10.1016/S0167-6393(00)00085-6
- CrossRef
- Google Scholar
16
IrinoT.PattersonR. D. (2006). A dynamic, compressive gammachirp auditory filterbank. IEEE Audio Speech Lang. Process14, 2222–2232. 10.1109/TASL.2006.874669
- CrossRef
- Google Scholar
17
JohnsonK.StrandE. A.D'ImperioM. (1999). Auditory-visual integration of talker gender in vowel perception. J. Phon.27, 359–384. 10.1006/jpho.1999.0100
- CrossRef
- Google Scholar
18
KentridgeR.NormanL.AkinsK.HeywoodC. (2014). Colour constancy without consciousness, in Towards a Science of Consciousness Conference (Tucson).
- Google Scholar
19
LottoA.HoltL. L. (2011). Psychology of auditory perception. Wiley Interdiscipl. Rev. Cogn. Sci.2, 479–489. 10.1002/wcs.123
- CrossRef
- Google Scholar
20
LyonsJ. C. (2015). Unencapsulated modules and perceptual judgment, in The Cognitive Penetrability of Perception: New Philosophical Perspectives, eds ZeimekisJ.RaftopoulosA. (Oxford: Oxford University Press), 103–122.
- Google Scholar
21
MacphersonF. (2012). Cognitive penetration of colour experience: rethinking the issue in light of an indirect mechanism. Philos. Phenomenol. Res.74, 24–62. 10.1111/j.1933-1592.2010.00481.x
- CrossRef
- Google Scholar
22
MaddoxW. T.ChandrasekaranB.SmaydaK.YiH. G. (2013). Dual systems of speech category learning across the lifespan. Psychol. Aging28, 1042–1056. 10.1037/a0034969
23
MaisonS.MicheylC.ColletL. (2001). Influence of focused auditory attention on cochlear activity in humans. Psychophysiology38, 35–40. 10.1111/1469-8986.3810035
24
McGurkH.MacDonaldJ. (1976). Hearing lips and seeing voices. Nature264, 746–748. 10.1038/264746a0
25
NiedzielskiN. A. (1999). The effect of social information on the perception of sociolinguistic variables. J. Lang. Soc. Psychol.18, 62–85. 10.1177/0261927X99018001005
- CrossRef
- Google Scholar
26
PattersonR. D.JohnsrudeI. S. (2008). Functional imaging of the auditory processing applied to speech sounds.Philos. Trans. R. Soc. B Biol. Sci.363, 1023–1035. 10.1098/rstb/2007.2157
27
PylyshynZ. (1984). Computation and Cognition. Cambridge, MA: MITPress.
- Google Scholar
28
PylyshynZ. (1999). Is vision continuous with cognition? The case for cognitive impenetrability of visual perception.Behav. Brain Sci.22, 341–423. 10.1017/S0140525X99002022
29
RaftopoulosA. (2001). Is perception informationally encapsulated? The issue of the theory-ladenness of perception. Cogn. Sci.25, 423–451. 10.1207/s15516709cog2503_4
- CrossRef
- Google Scholar
30
RaftopoulosA. (2009). Perception and Cognition: How do Psychology and the Cognitive Sciences inform Philosophy. Cambridge, MA: MITPress.
- Google Scholar
31
RauscheckerJ. P.ScottS. K. (2009). Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci.12, 718–724. 10.1038/nn.2331
32
RoelfsemaP. R.van OoyenA.WatanabeT. (2010). Perceptual learning rules based on reinforcers and attention. Trends Cogn. Sci.14, 64–71. 10.1016/j.tics.2009.11.005
33
SheffertS. M.PisoniD. B.FellowesJ. M.RemezR. E. (2002). Learning to recognize talkers from natural, sinewave, and reversed speech samples. J. Exp. Psychol. Hum. Percept. Perform.28, 1447–1469.
- Pubmed Abstract
- Google Scholar
34
SiegelS. (2012). Cognitive penetrability and perceptual justification. Noûs46, 201–222. 10.1111/j.1468-0068.2010.00786.x
- CrossRef
- Google Scholar
35
SquireL. R.StarkC. E.ClarkR. E. (2004). The medial temporal lobe. Annu. Rev. Neurosci.27, 279–306. 10.1146/annurev.neuro.27.070203.144130
36
SugaN. (2008). Role of corticofugal feedback in hearing. J. Comp. Physiol. A194, 169–183. 10.1007/s00359-007-0274-2
37
SugaN.XiaoZ.MaX.JiW. (2002). Plasticity and corticofugal modulation for hearing in adult animals. Neuron36, 9–18. 10.1016/S0896-6273(02)00933-9
38
ValentinV. V.MaddoxW. T.AshbyF. (2014). A computational model of the temporal dynamics of plasticity in procedural learning: sensitivity to feedback timing. Front. Psychol.5:643. 10.3389/fpsyg.2014.00643
39
WarrenR. M. (1984). Perceptual restoration of obliterated sounds. Psychol. Bull.96, 371–383. 10.1037/0033-2909.96.2.371
40
WindmannS. (2004). Effects of sentence context and expectation on the McGurk illusion. J. Mem. Lang.50, 212–230. 10.1016/j.jml.2003.10.001
- CrossRef
- Google Scholar
41
WuW. (2013). Visual spatial constancy and modularity: does intention penetrate vision?Philos. Stud.165, 647–669. 10.1007/s11098-012-9971-y
- CrossRef
- Google Scholar

Summary

Keywords

auditory perception, cognitive penetration, McGurk illusion, semantic-coherence, top-down influences, top-down modulation, tritone illusion, perceptual learning

Citation

Brogaard B and Gatzia DE (2015) Is the auditory system cognitively penetrable?. Front. Psychol. 6:1166. doi: 10.3389/fpsyg.2015.01166

Received

10 April 2015

Accepted

24 July 2015

Published

11 August 2015

Volume

6 - 2015

Edited by

Andriy Myachykov, Northumbria University, UK

Reviewed by

Andrew J. Lotto, University of Arizona, USA

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Berit Brogaard, brit@miami.edu

This article was submitted to Cognition, a section of the journal Frontiers in Psychology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Cognition

OPINION article

Is the auditory system cognitively penetrable?

Statements

Acknowledgments

Conflict of interest

Footnotes

References

Summary

Outline

Cite article

Article metrics

OPINION article

Is the auditory system cognitively penetrable?

Statements

Acknowledgments

Conflict of interest

Footnotes

References

Summary

Outline

Cite article

Share article

Article metrics