A Pragmatic Oscillome: Aligning Visual Attentional Mechanisms with Language Comprehension

Murphy, Elliot

doi:10.3389/fnsys.2016.00072

OPINION article

Front. Syst. Neurosci., 26 August 2016

Volume 10 - 2016 | https://doi.org/10.3389/fnsys.2016.00072

Elliot Murphy^*

Division of Psychology and Language Sciences, University College London, London, UK

A growing body of work over the last decade has investigated the potential functional role of neural oscillations in language comprehension (Giraud and Poeppel, 2012; Doelling and Poeppel, 2015; Lewis and Bastiaansen, 2015; Ding et al., 2016). I will explore how a number of recent developments in the field, and related domains of systems neuroscience, can generate much-needed linking hypotheses between the language sciences and neuroscience. To this end, I will focus on an area of linguistics whose existence has barely been acknowledged by the oscillation literature—pragmatics—and argue that elementary principles of discourse interpretation (though not more complex, peripheral aspects of pragmatic knowledge) can be implemented via generic, domain-general mechanisms elsewhere argued to be responsible for particular aspects of visual cognition. It will be suggested that these two systems share a number of striking computational/representational properties, and hence may share homologous dynamic and connectomic substrates.

Beginning with the visual system, Jensen et al.'s (2012) approach to the prioritization of salient unattended stimuli claims that neocortical γ rhythms phase-lock to posterior α- and β-oscillating regions to form a clocking mechanism which activates sequences of visual representations. The striate cortex consequently extracts different features from distant regions, aiding the construction of a coherent visual scene. This process ensures that object X in a given scene is interpreted before object Y, imposing general and efficient set-constructing rules. These proposals are in accordance with the broader consensus that α phases modulate neuronal excitability and γ activity. This is a particular manifestation of what we could call “Wallace's Problem,” after David Foster Wallace: How does the brain deal with the sheer mass of sensory overload it receives constantly? [“What always amazed Wallace about real life was the overload of information,” writes his biographer Max (2013, p. 244)].

I will argue that if similar “oscillomic” (referring to a specific feature of brain dynamics, namely neural oscillations) mechanisms are responsible for the construction of linguistic feature-sets, then the principles of a particular theory of pragmatics, Relevance Theory, could be neurobiologically grounded. Relevance Theory claims that during discourse comprehension particular representations are triggered before others due to their “cognitive relevance” (Sperber and Wilson, 1995). The Communicative Principle of Relevance claims that “Every utterance (and ostensive stimulus more generally) conveys a presumption of its own optimal relevance.” Relatedly, the Relevance-Theoretic Comprehension Procedure states that language comprehenders follow a path of least effort in computing cognitive effects (Wilson and Sperber, 2004). Processes involving lexical pragmatics (Wilson and Carston, 2007) adjust or modulate existing elements of linguistic meaning, as in the case where “David is a man” is interpreted as meaning David is an IDEAL MAN, with the lexical item man being underspecified for its ultimate meaning due to its ambiguity (Murphy, 2016a). From these basic processes we can already see a certain degree of similarity with visual attentional processes, which construct representations based on factors such as salience, prominence and accessibility.

It was suggested in Murphy (2016b) that the neural ensembles responsible for storing representations used to construct linguistic phrases require certain phase-amplitude-locking levels in order for the regions coupled with them to “read off” their content. This would permit only certain features to be interpreted at the conceptual interfaces. Lisman and Jensen (2013) claim that coupled γ and θ oscillations form a code for representing multiple, sequenced items. These rhythms are generated in the cortex (in particular, occipital regions) and hippocampus. It may be, then, that the construction of linguistic feature-sets proceeds via the deployment of a similar oscillatory mechanism.
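To make the notion of a phase-amplitude-locking level more concrete, the following minimal Python sketch computes one common index of such coupling, the normalized mean vector length, on a synthetic signal. It is an illustration under stated assumptions and not the measure used in Murphy (2016b) or Lisman and Jensen (2013): the function names, the 6 Hz θ rhythm, and the modulation depth are all hypothetical choices. With recorded data, the θ phase and γ amplitude would first have to be extracted (for example by band-pass filtering and a Hilbert transform), a step omitted here.

```python
import cmath
import math


def mean_vector_length(phases, amplitudes):
    """Normalized mean vector length: |sum(A * exp(i*phi))| / sum(A).

    Returns a value near 0 when the fast-rhythm amplitude is unrelated to
    the slow-rhythm phase, and larger values when the amplitude is
    concentrated at a preferred phase.
    """
    if not phases or len(phases) != len(amplitudes):
        raise ValueError("phases and amplitudes must be non-empty and equal in length")
    vector_sum = sum(a * cmath.exp(1j * p) for p, a in zip(phases, amplitudes))
    return abs(vector_sum) / sum(amplitudes)


def synthetic_coupling(depth, theta_hz=6.0, duration_s=2.0, sample_hz=1000):
    """Build a toy theta phase series and a gamma amplitude envelope.

    ASSUMPTION (illustrative only): the gamma envelope is 1 + depth * cos(phase),
    so `depth` in [0, 1) controls how strongly amplitude follows theta phase.
    """
    n_samples = int(duration_s * sample_hz)
    phases = [2 * math.pi * theta_hz * i / sample_hz for i in range(n_samples)]
    amplitudes = [1.0 + depth * math.cos(p) for p in phases]
    return phases, amplitudes


# Coupled case: amplitude peaks at theta phase 0.
coupled = mean_vector_length(*synthetic_coupling(depth=0.8))
# Uncoupled case: flat envelope, no phase preference.
uncoupled = mean_vector_length(*synthetic_coupling(depth=0.0))

print(f"coupled index:   {coupled:.3f}")    # approximately depth / 2 = 0.400
print(f"uncoupled index: {uncoupled:.3f}")  # approximately 0.000
```

On this toy signal the index scales with the modulation depth, which is one way of reading the claim that a downstream region could only “read off” content when coupling exceeds some level; where such a threshold would lie is left open here.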

In the model of feature-set retrieval outlined in Figure 1, as inhibition decreases over the θ cycle, the most excitable clusters would be itemized through a series of γ cycles. Less excitable representations would then follow, determining the make-up of a given feature-set. The group of feedforward γ rhythms required would be mostly generated in supragranular cortical layers (L2/3) (Maier et al., 2010), and hippocampal θ would be generated via slow pulses of GABAergic inhibition as a consequence of medial septum input, being part of a brainstem-diencephalo-septohippocampal θ-generating system (Vertes and Kocsis, 1997). The model in Figure 1 therefore permits the feeding of distributed conceptual and visual representations into hippocampal and posterior systems, binding the most excitable, cognitively relevant features (with this process doubtless involving a number of subcortical structures like the basal ganglia and thalamus). As a mechanistic response to a major component of Wallace's Problem, γ-θ coupling would be required for pragmatic interpretation, whereas γ-α/β coupling would be responsible for vision. Jensen et al.'s (2012) visual attention mechanism may also interface with a similar mechanism responsible for conceptual/pragmatic interpretation, such that in some cases the visually most prominent feature is also the most pragmatically relevant feature. Indeed, there may be some causal connection between the two: the more ancient visual attentional mechanism may determine pragmatic relevance in some cases, such that the prominence of certain visual features influences the interpretation of communicative referential intent. Consider the following two sentences in a context in which the objects under discussion are nearby (where “*” denotes an unacceptable interpretation and “?” denotes a less likely one):

(1) a. Your car is dirty [the frame/the passenger cabin/*the engine].

b. My computer is broken [the processor/?the screen].

Figure 1. A Relevance Theory-inspired oscillomic model of language comprehension. “CF” denotes conceptual feature, “VF” denotes visual feature, “LIFG” denotes left inferior frontal gyrus, “MTC” denotes middle temporal cortex. The top image represents the proposed pragmatic oscillomic mechanism, and the bottom image refers to Jensen et al.'s (2012) model. See Murphy (2015, 2016b) for related discussion, and also Voloh and Womelsdorf (2016) for evidence that phase resetting to endogenous or exogenous cues facilitates information transfer between distributed brain areas, supporting its presently proposed role in feature-set composition (with such feature-sets being interpretable by conceptual systems typically seen as being widely distributed across the neocortex).

In (1a), visual attention mechanisms would phase-lock with pragmatic mechanisms via coupling and connectivity across a frontal-occipital-hippocampal network known to be responsible for visual object processing, and where transient β coupling between these three regions has been detected (Sehatpour et al., 2008), but in (1b) they would not be in phase. Cases like (1b) produce a more distant relation between visual and semantic representations. In (1a) two cognitive systems, visual attention and pragmatic interpretation, interface in some way to achieve the desired interpretation. This alignment is not found in (1b). Of course not all cases of salient stimuli would involve a direct alignment between visual and semantic representations, but the present claim is that those that do would be implemented via the above oscillomic processes of feature-set composition. The conceptual features required to construct the representation of a particular object or event would be combined via the above algorithm of fast γ rhythms being embedded within slower rhythms originating in language- and memory-related neural circuits.
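The algorithm of fast γ cycles nested within a slower cycle can be sketched as a toy procedure. The Python example below is a deliberately simplified illustration, not an implementation of the model in Figure 1: the linear decay of inhibition, the number of γ slots, and the excitability values assigned to the readings of “car” in (1a) are all hypothetical and chosen only to show the ordering logic.

```python
def itemize_theta_cycle(excitability, gamma_slots=5, start_inhibition=1.0):
    """Return (gamma_slot, item) pairs in order of activation within one slow cycle.

    ASSUMPTIONS (illustrative only):
    - inhibition falls linearly across the gamma slots of the slow cycle and
      does not reach zero before the cycle ends;
    - in each gamma slot, at most one item is activated: the most excitable
      item that has not yet fired and whose excitability exceeds the
      current inhibition level.
    """
    remaining = dict(excitability)
    activated = []
    for slot in range(gamma_slots):
        inhibition = start_inhibition * (gamma_slots - slot) / gamma_slots
        candidates = [item for item, level in remaining.items() if level > inhibition]
        if candidates:
            winner = max(candidates, key=lambda item: remaining[item])
            activated.append((slot, winner))
            del remaining[winner]
    return activated


# Hypothetical excitability values for candidate readings of "car" in (1a).
car_features = {"frame": 0.9, "passenger cabin": 0.7, "engine": 0.1}

feature_set = itemize_theta_cycle(car_features)
print(feature_set)  # [(1, 'frame'), (2, 'passenger cabin')]
```

With these invented values the most excitable reading is activated first, the next follows one γ slot later, and the least excitable reading never exceeds the inhibition level before the cycle ends, so it is excluded from the feature-set. This mirrors, in caricature, the pattern of acceptable and unacceptable interpretations marked in (1a).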

The urge to maximize cognitive relevance may stem, then, from oscillomic processes homologous to those responsible for the visual system's urge to interpret particular features of a scene in a given order, with different representations becoming active at different stages of the slow θ/α cycle. What is deemed linguistically relevant would therefore be a matter not simply for external stimuli, but would rather be dependent on and constrained by internal brain events like the phase of particular oscillations. This therefore extends the schematic proposal in Murphy (2016b) to a more specific domain of linguistic interpretation. Pragmatic processes of optimizing relevance seem computationally suited to the ensemble activation operations produced via brain rhythm couplings. There is doubtless much more to pragmatics than simply activating the most cognitively relevant representations after processing a given utterance (for instance, I have not discussed the importance of ostensive-inferential communication), but the present proposal is meant only to explore the most essential, elementary features of pragmatic competence.

While phase-locked visual representations are generated by being presented to the eyes at the same time, linguistic information is necessarily processed in sequence, not during any given instant. But this still leaves considerable room for oscillatory dynamics to track previously processed visual information and attempt to match it with the input from a given word. The present claim is not that an entire sentence triggers the accessing of stored representations which are ultimately accessed as a function of their excitability; rather, it is that on the occasion of processing a given word (e.g., chair) these phase-locking operations would occur. Two distinct neural systems (one centered in occipital regions, the other in more left inferior frontal and hippocampal regions) would then implement symmetrical oscillatory processes to achieve similar goals, producing the most visually and linguistically relevant representations.

Finally, there are a number of experimental and theoretical possibilities which open up at this point. Future work could enrich the design of Jensen et al. (2012) to expose participants to a number of scenarios which modulate the alignment of pragmatic and visual relevance, while related electrocorticographic research could begin to track the dynamics of relevance-theoretic principles. Empirically testing the present model could involve the use of EEG and MEG, with scenarios of varying degrees of alignment between visual and linguistic relevance being presented to subjects, tracking the dynamics in brain regions hypothesized to be their neurocomputational loci.
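As one hedged illustration of what such a test could quantify, the Python sketch below computes a phase-locking value between two phase series, standing in for a visual and a pragmatic region. The signals are synthetic and the region labels, the 20 Hz and 21 Hz frequencies, and the function names are assumptions made for illustration; with EEG or MEG recordings, instantaneous phase would first have to be estimated from band-limited data.

```python
import cmath
import math


def phase_locking_value(phases_a, phases_b):
    """Length of the mean unit vector of the phase differences.

    1.0 means a constant phase relation across samples; values near 0.0
    mean the phase relation drifts through all angles.
    """
    if not phases_a or len(phases_a) != len(phases_b):
        raise ValueError("phase series must be non-empty and equal in length")
    total = sum(cmath.exp(1j * (a - b)) for a, b in zip(phases_a, phases_b))
    return abs(total) / len(phases_a)


def phase_series(freq_hz, duration_s=2.0, sample_hz=500, offset=0.0):
    """Phase (in radians) of an idealized oscillation sampled at sample_hz."""
    n_samples = int(duration_s * sample_hz)
    return [2 * math.pi * freq_hz * i / sample_hz + offset for i in range(n_samples)]


# Hypothetical "visual" region oscillating in the beta range.
visual = phase_series(20.0)
# Aligned condition, as proposed for (1a): same frequency, fixed lag.
pragmatic_aligned = phase_series(20.0, offset=0.5)
# Misaligned condition, as proposed for (1b): slightly different frequency.
pragmatic_misaligned = phase_series(21.0)

print(f"aligned:    {phase_locking_value(visual, pragmatic_aligned):.3f}")     # 1.000
print(f"misaligned: {phase_locking_value(visual, pragmatic_misaligned):.3f}")  # 0.000
```

Under the present proposal, scenarios in which visual and pragmatic relevance align would be expected to yield higher values of an index of this kind than scenarios in which they diverge; whether this holds is an empirical question.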

Author Contributions

The author confirms being the sole contributor of this work and approved it for publication.

Conflict of Interest Statement

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was supported by an Economic and Social Research Council scholarship (1474910).

References

Ding, N., Melloni, L., Zhang, H., Tian, X., and Poeppel, D. (2016). Cortical tracking of hierarchical linguistic structures in connected speech. Nat. Neurosci. 19, 158–164. doi: 10.1038/nn.4186

Doelling, K. B., and Poeppel, D. (2015). Cortical entrainment to music and its modulation by expertise. Proc. Natl. Acad. Sci. U.S.A. 112, E6233–E6242. doi: 10.1073/pnas.1508431112

Giraud, A. L., and Poeppel, D. (2012). Cortical oscillations and speech processing: emerging computational principles and operations. Nat. Neurosci. 15, 511–517. doi: 10.1038/nn.3063

Jensen, O., Bonnefond, M., and VanRullen, R. (2012). An oscillatory mechanism for prioritizing salient unattended stimuli. Trends Cogn. Sci. 16, 200–206. doi: 10.1016/j.tics.2012.03.002

Lewis, A. G., and Bastiaansen, M. (2015). A predictive coding framework for rapid neural dynamics during sentence-level language comprehension. Cortex 68, 155–168. doi: 10.1016/j.cortex.2015.02.014

Lisman, J. E., and Jensen, O. (2013). The θ-γ neural code. Neuron 77, 1002–1016. doi: 10.1016/j.neuron.2013.03.007

Maier, A., Adams, G. K., Aura, C., and Leopold, D. A. (2010). Distinct superficial and deep laminar domains of activity in the visual cortex during rest and stimulation. Front. Syst. Neurosci. 4:31. doi: 10.3389/fnsys.2010.00031

Max, D. T. (2013). Every Love Story is a Ghost Story: A Life of David Foster Wallace. London: Granta.

Murphy, E. (2015). The brain dynamics of linguistic computation. Front. Psychol. 6:1515. doi: 10.3389/fpsyg.2015.01515

Murphy, E. (2016a). Phasal eliminativism, anti-lexicalism, and the status of the unarticulated. Biolinguistics 10, 21–50.

Murphy, E. (2016b). The human oscillome and its explanatory potential. Biolinguistics 10, 6–20.

Sehatpour, P., Molholm, S., Schwartz, T. H., Mahoney, J. R., Mehta, A. D., Javitt, D. C., et al. (2008). A human intracranial study of long-range oscillatory coherence across a frontal-occipital-hippocampal brain network during visual object processing. Proc. Natl. Acad. Sci. U.S.A. 105, 4399–4404. doi: 10.1073/pnas.0708418105

Sperber, D., and Wilson, D. (1995). Relevance: Communication and Cognition. Oxford: Blackwell.

Vertes, R. P., and Kocsis, B. (1997). Brainstem-diencephalo-septohippocampal systems controlling the theta rhythm of the hippocampus. Neuroscience 81, 893–926.

Voloh, B., and Womelsdorf, T. (2016). A role of phase-resetting in coordinating large scale neural networks during attention and goal-directed behavior. Front. Syst. Neurosci. 10:18. doi: 10.3389/fnsys.2016.00018

Wilson, D., and Carston, R. (2007). “A unitary approach to lexical pragmatics: relevance, inference and ad hoc concepts,” in Pragmatics, ed N. Burton-Roberts (London: Palgrave), 230–259.

Wilson, D., and Sperber, D. (2004). “Relevance theory,” in The Handbook of Pragmatics, eds L. R. Horn and G. Ward (Oxford: Blackwell), 607–632.

Keywords: neural oscillations, cross-frequency coupling, pragmatics, oscillome, relevance theory

Citation: Murphy E (2016) A Pragmatic Oscillome: Aligning Visual Attentional Mechanisms with Language Comprehension. Front. Syst. Neurosci. 10:72. doi: 10.3389/fnsys.2016.00072

Received: 06 May 2016; Accepted: 15 August 2016;
Published: 26 August 2016.

Edited by:

Mikhail Lebedev, Duke University, USA

Reviewed by:

Frank Van Der Velde, University of Twente, Netherlands
Christelle Declercq, University of Reims Champagne-Ardenne, France

Copyright © 2016 Murphy. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Elliot Murphy